comparative performance assessment: Topics by Science.gov

Sample records for comparative performance assessment

Performance Indicators and Rational Management Tools: A Comparative Assessment of Projects in North America and Europe. AIR 1993 Annual Forum Paper.

ERIC Educational Resources Information Center

Nedwek, Brian P.; Neal, John E.

This study developed a classification scheme to critically compare performance assessment projects at higher education universities in North America and Europe. Performance indicators and assessment initiatives were compared using nine basic dimensions: (1) locus of control, (2) degree of governmental involvement, (3) focus of performance…
Evaluating Comparability in the Scoring of Performance Assessments for Accountability Purposes

ERIC Educational Resources Information Center

Lyons, Susan; Evans, Carla

2017-01-01

This brief summarizes "Comparability in Balanced Assessment Systems for State Accountability," published in "Educational Measurement: Issues and Practice" (Evans & Lyons 2017). The study evaluated comparability claims in local scoring of performance assessments across districts participating in New Hampshire's Performance…
Assessing does not mean threatening: the purpose of assessment as a key determinant of girls' and boys' performance in a science class.

PubMed

Souchal, Carine; Toczek, Marie-Christine; Darnon, Céline; Smeding, Annique; Butera, Fabrizio; Martinot, Delphine

2014-03-01

Is it possible to reach performance equality between boys and girls in a science class? Given the stereotypes targeting their groups in scientific domains, diagnostic contexts generally lower girls' performance and non-diagnostic contexts may harm boys' performance. The present study tested the effectiveness of a mastery-oriented assessment, allowing both boys and girls to perform at an optimal level in a science class. Participants were 120 boys and 72 girls (all high-school students). Participants attended a science lesson while expecting a performance-oriented assessment (i.e., an assessment designed to compare and select students), a mastery-oriented assessment (i.e., an assessment designed to help students in their learning), or no assessment of this lesson. In the mastery-oriented assessment condition, both boys and girls performed at a similarly high level, whereas the performance-oriented assessment condition reduced girls' performance and the no-assessment condition reduced boys' performance. One way to increase girls' performance on a science test without harming boys' performance is to present assessment as a tool for improving mastery rather than as a tool for comparing performances. © 2013 The British Psychological Society.
Do repeated assessments of performance status improve predictions for risk of death among patients with cancer? A population-based cohort study.

PubMed

Su, Jiandong; Barbera, Lisa; Sutradhar, Rinku

2015-06-01

Prior work has utilized longitudinal information on performance status to demonstrate its association with risk of death among cancer patients; however, no study has assessed whether such longitudinal information improves the predictions for risk of death. The objective was to examine whether the use of repeated performance status assessments improves predictions for risk of death compared to using only the performance status assessment at the time of cancer diagnosis. This was a population-based longitudinal study of adult outpatients who had a cancer diagnosis and had at least one assessment of performance status. To account for each patient's changing performance status over time, we implemented a Cox model with a time-varying covariate for performance status. This model was compared to a Cox model using only a time-fixed (baseline) covariate for performance status. The regression coefficients of each model were derived based on a randomly selected 60% of patients, and then, the predictive ability of each model was assessed via concordance probabilities when applied to the remaining 40% of patients. Our study consisted of 15,487 cancer patients with over 53,000 performance status assessments. The utilization of repeated performance status assessments improved predictions for risk of death compared to using only the performance status assessment taken at diagnosis. When studying the hazard of death among patients with cancer, if available, researchers should incorporate changing information on performance status scores, instead of simply baseline information on performance status. © The Author(s) 2015.
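The abstract above compares two models by their concordance probabilities on held-out patients. As a rough illustration only, the following Python sketch computes a Harrell-style concordance index for a single risk score per patient. The function name, the toy follow-up times, and both sets of risk scores are hypothetical, and the sketch does not reproduce the study's time-varying Cox model or its 60/40 split.

```python
from itertools import combinations


def concordance_index(times, events, risk_scores):
    """Fraction of usable patient pairs in which the patient who died
    earlier also had the higher predicted risk (ties in risk count 0.5)."""
    concordant = 0.0
    usable = 0
    for i, j in combinations(range(len(times)), 2):
        # Order the pair so that `a` is the patient with the shorter time.
        a, b = (i, j) if times[i] < times[j] else (j, i)
        # A pair is usable only when the shorter time is an observed death
        # (not a censored observation) and the two times differ.
        if times[a] == times[b] or not events[a]:
            continue
        usable += 1
        if risk_scores[a] > risk_scores[b]:
            concordant += 1.0
        elif risk_scores[a] == risk_scores[b]:
            concordant += 0.5
    return concordant / usable if usable else float("nan")


# Hypothetical held-out data: follow-up in months, 1 = death observed.
times = [2, 5, 7, 9, 12]
events = [1, 1, 0, 1, 0]
# Hypothetical risk scores from a baseline-only model and an updated model.
baseline_risk = [0.9, 0.3, 0.5, 0.4, 0.1]
updated_risk = [0.9, 0.7, 0.5, 0.4, 0.1]

c_baseline = concordance_index(times, events, baseline_risk)
c_updated = concordance_index(times, events, updated_risk)
print(c_baseline, c_updated)
```

In this toy data the updated scores order every usable pair correctly, so the second index is higher. That mirrors the direction of the reported finding but says nothing about its size.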
Comparing health system performance assessment and management approaches in the Netherlands and Ontario, Canada

PubMed Central

Tawfik-Shukor, Ali R; Klazinga, Niek S; Arah, Onyebuchi A

2007-01-01

Background: Given the proliferation and the growing complexity of performance measurement initiatives in many health systems, the Netherlands and Ontario, Canada expressed interest in cross-national comparisons in an effort to promote knowledge transfer and best practice. To support this cross-national learning, a study was undertaken to compare health system performance approaches in The Netherlands with those in Ontario, Canada. Methods: We explored the performance assessment framework and system of each constituency, the embeddedness of performance data in management and policy processes, and the interrelationships between the frameworks. Methods used included analysing governmental strategic planning and policy documents, literature and internet searches, comparative descriptive tables, and schematics. Data collection and analysis took place in Ontario and The Netherlands. A workshop to validate and discuss the findings was conducted in Toronto, adding important insights to the study. Results: Both Ontario and The Netherlands conceive health system performance within supportive frameworks. However, they differ in their assessment approaches. Ontario's Scorecard links performance measurement with strategy, aimed at health system integration. The Dutch Health Care Performance Report (Zorgbalans) does not explicitly link performance with strategy, and focuses on the technical quality of healthcare by measuring dimensions of quality, access, and cost against healthcare needs. A backbone 'five diamond' framework maps both frameworks and articulates the interrelations and overlap between their goals, themes, dimensions and indicators. The workshop yielded more contextual insights and further validated the comparative values of each constituency's performance assessment system.
Conclusion: To compare the health system performance approaches between The Netherlands and Ontario, Canada, several important conceptual and contextual issues must be addressed before attempting any future content comparisons and benchmarking. Such issues would lend relevant interpretational credibility to international comparative assessments of the two health systems. PMID:17319947
Peer video review and feedback improve performance in basic surgical skills.

PubMed

Vaughn, Carolyn J; Kim, Edward; O'Sullivan, Patricia; Huang, Emily; Lin, Matthew Y C; Wyles, Susannah; Palmer, Barnard J A; Pierce, Jonathan L; Chern, Hueylan

2016-02-01

Incorporation of home-video assessments allows flexibility in feedback but requires faculty time. Peer feedback (PF) may provide additional benefits while avoiding these constraints. Twenty-four surgical interns completed a 12-week skills curriculum with home-video assignments focused on knot tying and suturing. Interns were randomized into 2 groups: PF or faculty feedback (FF). Peers and faculty provided feedback on home videos with checklists, global rating, and comments. Learners' skills were assessed at baseline, during, and at the conclusion of the curriculum. Performance of the 2 groups as rated by experts was compared. FF and PF were compared. Both groups improved from baseline, and the highest rated scores were seen on their home-video assessments. The PF group performed better at the final assessment than the FF group (effect size, .84). When using a checklist, there was no significant difference between scores given by peers and faculty. The PF group performed better at the final assessment, suggesting reviewing and analyzing another's performance may improve one's own performance. With checklists as guidance, peers can serve as raters comparable to faculty. Copyright © 2016 Elsevier Inc. All rights reserved.
Towards an Operational Definition of Clinical Competency in Pharmacy

PubMed Central

2015-01-01

Objective. To estimate the inter-rater reliability and accuracy of ratings of competence in student pharmacist/patient clinical interactions as depicted in videotaped simulations and to compare expert panelist and typical preceptor ratings of those interactions. Methods. This study used a multifactorial experimental design to estimate inter-rater reliability and accuracy of preceptors’ assessment of student performance in clinical simulations. The study protocol used nine 5-10 minute video vignettes portraying different levels of competency in student performance in simulated clinical interactions. Intra-Class Correlation (ICC) was used to calculate inter-rater reliability and Fisher exact test was used to compare differences in distribution of scores between expert and nonexpert assessments. Results. Preceptors (n=42) across 5 states assessed the simulated performances. Intra-Class Correlation estimates were higher for 3 nonrandomized video simulations compared to the 6 randomized simulations. Preceptors more readily identified high and low student performances compared to satisfactory performances. In nearly two-thirds of the rating opportunities, a higher proportion of expert panelists than preceptors rated the student performance correctly (18 of 27 scenarios). Conclusion. Valid and reliable assessments are critically important because they affect student grades and formative student feedback. Study results indicate the need for pharmacy preceptor training in performance assessment. The process demonstrated in this study can be used to establish minimum preceptor benchmarks for future national training programs. PMID:26089563
Improving Student Performance through Computer-Based Assessment: Insights from Recent Research.

ERIC Educational Resources Information Center

Ricketts, C.; Wilks, S. J.

2002-01-01

Compared student performance on computer-based assessment to machine-graded multiple choice tests. Found that performance improved dramatically on the computer-based assessment when students were not required to scroll through the question paper. Concluded that students may be disadvantaged by the introduction of online assessment unless care is…
Behind the Final Grade in Hybrid v. Traditional Courses: Comparing Student Performance by Assessment Type, Core Competency, and Course Objective

ERIC Educational Resources Information Center

Bain, Lisa Z.

2012-01-01

There are many different delivery methods used by institutions of higher education. These include traditional, hybrid, and online course offerings. The comparisons of these typically use final grade as the measure of student performance. This research study looks behind the final grade and compares student performance by assessment type, core…
University Students' Attainment and Perceptions of Computer Delivered Assessment; A Comparison between Computer-Based and Traditional Tests in a "High-Stakes" Examination

ERIC Educational Resources Information Center

Escudier, M. P.; Newton, T. J.; Cox, M. J.; Reynolds, P. A.; Odell, E. W.

2011-01-01

This study compared higher education dental undergraduate student performance in online assessments with performance in traditional paper-based tests and investigated students' perceptions of the fairness and acceptability of online tests, and showed performance to be comparable. The project design involved two parallel cross-over trials, one in…
Objective structured assessment of technical skills evaluation of theoretical compared with hands-on training of shoulder dystocia management: a randomized controlled trial.

PubMed

Buerkle, Bernd; Pueth, Julia; Hefler, Lukas A; Tempfer-Bentz, Eva-Katrin; Tempfer, Clemens B

2012-10-01

To compare the skills of performing a shoulder dystocia management algorithm after hands-on training compared with demonstration. We randomized medical students to a 30-minute hands-on (group 1) and a 30-minute demonstration (group 2) training session teaching a standardized shoulder dystocia management scheme on a pelvic training model. Participants were tested with a 22-item Objective Structured Assessment of Technical Skills scoring system after training and 72 hours thereafter. Objective Structured Assessment of Technical Skills scores were the primary outcome. Performance time, self-assessment, confidence, and global rating scale were the secondary outcomes. Statistics were performed using Mann-Whitney U test, χ² test, and multiple linear regression analysis. Two hundred three participants were randomized. Objective Structured Assessment of Technical Skills scores were significantly higher in group 1 (n=103) compared with group 2 (n=100) (17.95±3.14 compared with 15.67±3.18, respectively; P<.001). The secondary outcomes global rating scale (GRS; 10.94±2.71 compared with 8.57±2.61, respectively; P<.001), self-assessment (3.15±0.94 compared with 2.72±1.01; P=.002), and confidence (3.72±0.98 compared with 3.34±0.90, respectively; P=.005), but not performance time (3:19±0:48 minutes compared with 3:31±1:05 minutes; P=.1), were also significantly different, favoring group 1. After 72 hours, Objective Structured Assessment of Technical Skills scores were still significantly higher in group 1 (n=67) compared with group 2 (n=60) (18.17±2.76 compared with 14.98±3.03, respectively; P<.001) as were GRS (10.80±2.62 compared with 8.15±2.59; P<.001) and self assessment (SA; 3.44±0.87 compared with 2.95±0.94; P=.003). In a multiple linear regression analysis, group assignment (group 1 compared with 2; P<.001) and sex (P=.002) independently influenced Objective Structured Assessment of Technical Skills scores.
Hands-on training helps to achieve a significant improvement of shoulder dystocia management on a pelvic training model. www.ClinicalTrials.gov, NCT01618565. I.
Comparative values of medical school assessments in the prediction of internship performance.

PubMed

Lee, Ming; Vermillion, Michelle

2018-02-01

Multiple undergraduate achievements have been used for graduate admission consideration. Their relative values in the prediction of residency performance are not clear. This study compared the contributions of major undergraduate assessments to the prediction of internship performance. Internship performance ratings of the graduates of a medical school were collected from 2012 to 2015. Hierarchical multiple regression analyses were used to examine the predictive values of undergraduate measures assessing basic and clinical sciences knowledge and clinical performances, after controlling for differences in the Medical College Admission Test (MCAT). Four hundred eighty (75%) graduates' archived data were used in the study. Analyses revealed that clinical competencies, assessed by the USMLE Step 2 CK, NBME medicine exam, and an eight-station objective structured clinical examination (OSCE), were strong predictors of internship performance. Neither the USMLE Step 1 nor the inpatient internal medicine clerkship evaluation predicted internship performance. The undergraduate assessments as a whole showed a significant collective relationship with internship performance (ΔR² = 0.12, p < 0.001). The study supports the use of clinical competency assessments, instead of pre-clinical measures, in graduate admission consideration. It also provides validity evidence for OSCE scores in the prediction of workplace performance.
Resident self-other assessor agreement: influence of assessor, competency, and performance level.

PubMed

Lipsett, Pamela A; Harris, Ilene; Downing, Steven

2011-08-01

To review the literature on self-assessment in the context of resident performance and to determine the correlation between self-assessment across competencies in high- and low-performing residents and assessments performed by raters from a variety of professional roles (peers, nurses, and faculty). Retrospective analysis of prospectively collected anonymous self-assessment and multiprofessional (360) performance assessments by competency and overall. University-based academic general surgical program. Sixty-two residents rotating in general surgery. Mean difference for each self-assessment dyad (self-peer, self-nurse, and self-attending physician) by resident performance quartile, adjusted for measurement error, correlation coefficients, and summed differences across all competencies. Irrespective of self-other dyad, residents asked to rate their global performance overestimated their skills. Residents in the upper quartile underestimated their specific skills while those in the lowest-performing quartile overestimated their abilities when compared with faculty, peers, and especially nurse raters. Moreover, overestimation was greatest in competencies related to interpersonal skills, communication, teamwork, and professionalism. Rater, level of performance, and the competency being assessed all influence the comparison of the resident's self-assessment and those of other raters. Self-assessment of competencies related to behavior may be inaccurate when compared with raters from various professions. Residents in the lowest-performing quartile are least able to identify their weakness. These data have important implications for residents, program directors, and the public and suggest that strategies that help the lowest-performing residents recognize areas in need of improvement are needed.
Assessing the Performance of Educational Research in Australian Universities: An Alternative Perspective

ERIC Educational Resources Information Center

Perry, Laura B.

2018-01-01

This study uses bibliometric data to assess the performance of educational research in Australian universities. It provides an alternative perspective to the Australian government's Excellence in Research for Australia (ERA) assessment. ERA results suggest that the performance of educational research is substantially less compared to other…
The Comparative Performance of Conditional Independence Indices

ERIC Educational Resources Information Center

Kim, Doyoung; De Ayala, R. J.; Ferdous, Abdullah A.; Nering, Michael L.

2011-01-01

To realize the benefits of item response theory (IRT), one must have model-data fit. One facet of a model-data fit investigation involves assessing the tenability of the conditional item independence (CII) assumption. In this Monte Carlo study, the comparative performance of 10 indices for identifying conditional item dependence is assessed. The…
The National Curriculum: A Study to Compare Levels of Attainment with Data from APU Science Surveys (1980-4).

ERIC Educational Resources Information Center

Taylor, R. M.

1990-01-01

Compared are the levels of attainment for the Science in the National Curriculum assessment in Great Britain in 1989 and the performance of students on the application of science concepts part of the Assessment of Performance Unit-Science carried out in 1980-84. (KR)
The effects of performance-based assessment criteria on student performance and self-assessment skills

PubMed Central

van der Klink, Marcel R.; van Merriënboer, Jeroen J. G.

2010-01-01

This study investigated the effect of performance-based versus competence-based assessment criteria on task performance and self-assessment skills among 39 novice secondary vocational education students in the domain of nursing and care. In a performance-based assessment group students are provided with a preset list of performance-based assessment criteria, describing what students should do, for the task at hand. The performance-based group is compared to a competence-based assessment group in which students receive a preset list of competence-based assessment criteria, describing what students should be able to do. The test phase revealed that the performance-based group outperformed the competence-based group on test task performance. In addition, higher performance of the performance-based group was reached with lower reported mental effort during training, indicating a higher instructional efficiency for novice students. PMID:20054648
Teacher Quality and Quality Teaching: Examining the Relationship of a Teacher Assessment to Practice

ERIC Educational Resources Information Center

Hill, Heather C.; Umland, Kristin; Litke, Erica; Kapitula, Laura R.

2012-01-01

Multiple-choice assessments are frequently used for gauging teacher quality. However, research seldom examines whether results from such assessments generalize to practice. To illuminate this issue, we compare teacher performance on a mathematics assessment, during mathematics instruction, and by student performance on a state assessment. Poor…
A Comparison of High and Low Performing Secondary Physical Education Programs in South Carolina.

ERIC Educational Resources Information Center

Castelli, Darla M.

This study compared high and low performing schools in a state secondary physical education high stakes assessment and accountability program. The South Carolina Physical Education Assessment Program (SCPEAP) required teachers to assess samples of students on competency across four state mandated performance indicators. This study examined the…
Comparative Life Cycle Assessment between Warm SMA and Conventional SMA

DOT National Transportation Integrated Search

2011-09-01

This report presents the comparative life cycle assessment (LCA) between warm stone mastic asphalt (SMA) and conventional SMA. Specifically, the study evaluated and compared the life cycle environmental and economic performances of two mixtures: a ...

Can medical students accurately predict their learning? A study comparing perceived and actual performance in neuroanatomy.

PubMed

Hall, Samuel R; Stephens, Jonny R; Seaby, Eleanor G; Andrade, Matheus Gesteira; Lowry, Andrew F; Parton, Will J C; Smith, Claire F; Border, Scott

2016-10-01

It is important that clinicians are able to adequately assess their level of knowledge and competence in order to be safe practitioners of medicine. The medical literature contains numerous examples of poor self-assessment accuracy amongst medical students over a range of subjects; however, this ability in neuroanatomy has yet to be observed. Second year medical students attending neuroanatomy revision sessions at the University of Southampton and the competitors of the National Undergraduate Neuroanatomy Competition (NUNC) were asked to rate their level of knowledge in neuroanatomy. The responses from the former group were compared to performance on a ten item multiple choice question examination and those from the latter group were compared to their performance within the competition. In both cohorts, self-assessments of perceived level of knowledge correlated weakly with performance in the respective objective knowledge assessments (r = 0.30 and r = 0.44). Within the NUNC, this correlation improved when students were instead asked to rate their performance on a specific examination within the competition (spotter, r_s = 0.68; MCQ, r_s = 0.58). Despite its inherent difficulty, medical student self-assessment accuracy in neuroanatomy is comparable to other subjects within the medical curriculum. Anat Sci Educ 9: 488-495. © 2016 American Association of Anatomists.
Co-Teaching in Middle School Classrooms: Quantitative Comparative Study of Special Education Student Assessment Performance

ERIC Educational Resources Information Center

Reese, De'borah Reese

2017-01-01

The purpose of this quantitative comparative study was to determine the existence or nonexistence of performance pass rate differences of special education middle school students on standardized assessments between pre and post co-teaching eras disaggregated by subject area and school. Co-teaching has altered classroom environments in many ways.…
DOE Office of Scientific and Technical Information (OSTI.GOV)

Seitz, R.R.; Rittmann, P.D.; Wood, M.I.

The US Department of Energy Headquarters established a performance assessment task team (PATT) to integrate the activities of DOE sites that are preparing performance assessments for the disposal of newly generated low-level waste. The PATT chartered a subteam with the task of comparing computer codes and exposure scenarios used for dose calculations in performance assessments. This report documents the efforts of the subteam. Computer codes considered in the comparison include GENII, PATHRAE-EPA, MICROSHIELD, and ISOSHLD. Calculations were also conducted using spreadsheets to provide a comparison at the most fundamental level. Calculations and modeling approaches are compared for unit radionuclide concentrations in water and soil for the ingestion, inhalation, and external dose pathways. Over 30 tables comparing inputs and results are provided.
Impact of the site specialty of a continuity practice on students' clinical skills: performance with standardized patients.

PubMed

Pfeiffer, Carol A; Palley, Jane E; Harrington, Karen L

2010-07-01

The assessment of clinical competence and the impact of training in ambulatory settings are two issues of importance in the evaluation of medical student performance. This study compares the clinical skills performance of students placed in three types of community preceptors' offices (pediatrics, medicine, family medicine) on yearly clinical skills assessments with standardized patients. Our goal was to see if the site specialty had an impact on clinical performance. The students in the study were completing a 3-year continuity preceptorship at a site representing one of the disciplines. Their performance on the four clinical skills assessments was compared. There was no significant difference in history taking, physical exam, communication, or clinical reasoning in any year (ANOVA, p ≤ .05). There was a small but significant difference in performance on a measure of interpersonal and interviewing skills during Years 1 and 2. The site specialty of an early clinical experience does not have a significant impact on performance of most of the skills measured by the assessments.
Quantifying biological integrity by taxonomic completeness: its utility in regional and global assessments.

PubMed

Hawkins, Charles P

2006-08-01

Water resources managers and conservation biologists need reliable, quantitative, and directly comparable methods for assessing the biological integrity of the world's aquatic ecosystems. Large-scale assessments are constrained by the lack of consistency in the indicators used to assess biological integrity and our current inability to translate between indicators. In theory, assessments based on estimates of taxonomic completeness, i.e., the proportion of expected taxa that were observed (observed/expected, O/E) are directly comparable to one another and should therefore allow regionally and globally consistent summaries of the biological integrity of freshwater ecosystems. However, we know little about the true comparability of O/E assessments derived from different data sets or how well O/E assessments perform relative to other indicators in use. I compared the performance (precision, bias, and sensitivity to stressors) of O/E assessments based on five different data sets with the performance of the indicators previously applied to these data (three multimetric indices, a biotic index, and a hybrid method used by the state of Maine). Analyses were based on data collected from U.S. stream ecosystems in North Carolina, the Mid-Atlantic Highlands, Maine, and Ohio. O/E assessments resulted in very similar estimates of mean regional conditions compared with most other indicators once these indicators' values were standardized relative to reference-site means. However, other indicators tended to be biased estimators of O/E, a consequence of differences in their response to natural environmental gradients and sensitivity to stressors. These results imply that, in some cases, it may be possible to compare assessments derived from different indicators by standardizing their values (a statistical approach to data harmonization). 
In situations where it is difficult to standardize or otherwise harmonize two or more indicators, O/E values can easily be derived from existing raw sample data. With some caveats, O/E should provide more directly comparable assessments of biological integrity across regions than is possible by harmonizing values of a mix of indicators.
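The abstract above defines taxonomic completeness as the proportion of expected taxa that were observed (O/E) and mentions standardizing indicator values relative to reference-site means. A minimal Python sketch of both ideas follows. The function names and the taxa lists are hypothetical, the expected taxa are treated as a fixed list (a simplifying assumption), and dividing by the reference-site mean is only one plausible reading of the standardization the abstract describes.

```python
from statistics import mean


def taxonomic_completeness(observed_taxa, expected_taxa):
    """O/E: share of the expected taxa that appear in the observed sample."""
    expected = set(expected_taxa)
    if not expected:
        raise ValueError("expected_taxa must not be empty")
    return len(expected & set(observed_taxa)) / len(expected)


def standardize_to_reference(value, reference_values):
    """Express an indicator value relative to the reference-site mean
    (assumed form of standardization: simple division by that mean)."""
    reference_mean = mean(reference_values)
    if reference_mean == 0:
        raise ValueError("reference-site mean must be non-zero")
    return value / reference_mean


# Hypothetical stream sample: two of the four expected taxa were found.
expected = ["Baetis", "Hydropsyche", "Perla", "Ephemerella"]
observed = ["Baetis", "Hydropsyche", "Chironomus"]
oe_score = taxonomic_completeness(observed, expected)

# Hypothetical multimetric index value and reference-site values.
standardized_index = standardize_to_reference(36.0, [40.0, 50.0, 60.0])
print(oe_score, standardized_index)
```

Taxa that were observed but not expected (here the hypothetical "Chironomus") do not raise the score, which is why O/E cannot exceed 1 in this simplified form.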
Teaching Performance Assessment: A Comparative Study of Implementation and Impact amongst California State University Campuses

ERIC Educational Resources Information Center

Guaglianone, Curtis L.; Payne, Maggie; Kinsey, Gary W.; Chiero, Robin

2009-01-01

This article is based on the perceptions of California State University administrators and provides a comparative study of the challenges and benefits resulting from the implementation of the teaching performance assessment requirement of SB 2042 standards 19-21 on the California State University (CSU) campuses. With 23 campuses and almost 450,000…
Quantitative assessments of municipal waste management systems: using different indicators to compare and rank programs in New York State.

PubMed

Greene, Krista L; Tonjes, David J

2014-04-01

The primary objective of waste management technologies and policies in the United States is to reduce the harmful environmental impacts of waste, particularly those relating to energy consumption and climate change. Performance indicators are frequently used to evaluate the environmental quality of municipal waste systems, as well as to compare and rank programs relative to each other in terms of environmental performance. However, there currently is no consensus on the best indicator for performing these environmental evaluations. The purpose of this study is to examine the common performance indicators used to assess the environmental benefits of municipal waste systems to determine if there is agreement between them regarding which system performs best environmentally. Focus is placed on how indicator selection influences comparisons between municipal waste management programs and subsequent system rankings. The waste systems of ten municipalities in the state of New York, USA, were evaluated using each common performance indicator and Spearman correlations were calculated to see if there was a significant association between system rank orderings. Analyses showed that rank orders of waste systems differ substantially when different indicators are used. Therefore, comparative system assessments based on indicators should be considered carefully, especially those intended to gauge environmental quality. Insight was also gained into specific factors which may lead to one system achieving higher rankings than another. However, despite the insufficiencies of indicators for comparative quality assessments, they do provide important information for waste managers and they can assist in evaluating internal programmatic performance and progress. To enhance these types of assessments, a framework for scoring indicators based on criteria that evaluate their utility and value for system evaluations was developed. 
This framework was used to construct an improved model for waste system performance assessments. Copyright © 2014 Elsevier Ltd. All rights reserved.
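The abstract above uses Spearman correlations to check whether different indicators rank the same ten waste systems in a similar order. As an illustrative sketch, the Python function below applies the textbook formula for rank orders without ties. The two rank lists are hypothetical and are not the study's data or its indicators.

```python
def spearman_rho(ranks_a, ranks_b):
    """Spearman rank correlation for two rank orderings without ties,
    using rho = 1 - 6 * sum(d^2) / (n * (n^2 - 1))."""
    n = len(ranks_a)
    if n != len(ranks_b) or n < 2:
        raise ValueError("need two rank lists of equal length >= 2")
    squared_diffs = sum((a - b) ** 2 for a, b in zip(ranks_a, ranks_b))
    return 1 - 6 * squared_diffs / (n * (n * n - 1))


# Hypothetical rank orders of ten municipal systems under two indicators.
ranks_indicator_a = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10]
ranks_indicator_b = [2, 1, 4, 3, 6, 5, 8, 7, 10, 9]
# A third hypothetical indicator that reverses the first ordering entirely.
ranks_indicator_c = [10, 9, 8, 7, 6, 5, 4, 3, 2, 1]

rho_ab = spearman_rho(ranks_indicator_a, ranks_indicator_b)
rho_ac = spearman_rho(ranks_indicator_a, ranks_indicator_c)
print(rho_ab, rho_ac)
```

A value near 1 means the two indicators rank the systems almost identically, while a value near -1 means they rank them in opposite order. The simplified formula is valid only when neither list contains tied ranks.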
Endobronchial ultrasound-guided transbronchial needle aspiration: performance of biomedical scientists on rapid on-site evaluation and preliminary diagnosis.

PubMed

Schacht, M J; Toustrup, C B; Madsen, L B; Martiny, M S; Larsen, B B; Simonsen, J T

2016-10-01

Rapid on-site evaluation (ROSE) of endobronchial ultrasound-guided transbronchial needle aspiration (EBUS-TBNA) followed by a subsequent preliminary adequacy assessment and a preliminary diagnosis, was performed at Aarhus University Hospital by biomedical scientists (BMS). The aim of this study was to evaluate the BMS accuracy of ROSE adequacy assessment, the preliminary adequacy assessment and the preliminary diagnosis as compared with the cytopathologist-rendered final adequacy assessment and final diagnosis. The BMS-rendered assessments for 717 sites from 319 consecutive patients over a 4-month period were compared with the cytopathologist-rendered assessments. Comparisons of adequacy and preliminary diagnoses were based on inter-observer Cohen's Kappa coefficient with a 95% confidence interval (CI). Strong correlations between ROSE and final adequacy assessments [Kappa coefficient of 0.90 (CI: 0.85-0.96)] and between the preliminary and final adequacy assessments [Kappa coefficient of 0.93 (CI: 0.87-0.99)] were found. As for the correlation between the preliminary and final diagnoses, the Kappa coefficient was 0.99 (CI: 0.98-1). Both ROSE and preliminary adequacy assessments as well as preliminary diagnoses, all performed by BMS, were highly accurate when compared with the final assessment by the cytopathologist. © 2016 John Wiley & Sons Ltd.
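The agreement statistic used in this abstract, Cohen's Kappa, can be sketched as follows. This is a minimal illustration that assumes two raters labelling the same sites; the labels are hypothetical and the function name `cohens_kappa` is not from the study.

```python
from collections import Counter


def cohens_kappa(rater_a, rater_b):
    """Cohen's kappa for two raters labelling the same items.

    kappa = (observed agreement - chance agreement) / (1 - chance agreement)
    Undefined (division by zero) if both raters use one identical label throughout.
    """
    n = len(rater_a)
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    count_a, count_b = Counter(rater_a), Counter(rater_b)
    categories = set(count_a) | set(count_b)
    # Chance agreement from each rater's marginal label frequencies.
    expected = sum(count_a[c] * count_b[c] for c in categories) / n ** 2
    return (observed - expected) / (1 - expected)


# Hypothetical adequacy calls for six sampled sites.
bms = ["adequate", "adequate", "inadequate", "adequate", "inadequate", "adequate"]
cytopathologist = ["adequate", "adequate", "inadequate", "adequate", "adequate", "adequate"]
print(cohens_kappa(bms, cytopathologist))
```

Kappa corrects raw percentage agreement for the agreement expected by chance, which is why it is preferred when one category (here, "adequate") dominates.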
Comparison of differences in performance evaluation of faculty by students with faculty's self-assessment.

PubMed

Azizi, Kourosh; Aghamolaei, Teamur; Parsa, Nader; Dabbaghmanesh, Tahereh

2014-07-01

The present study aimed to compare self-assessment forms of coursework taught in the school of public health at undergraduate, graduate, and postgraduate levels and students' evaluation of the performance of the faculty members at these levels. The subjects in this cross-sectional study were the faculty members and students of the School of Public Health and Nutrition, Shiraz University of Medical Sciences, Shiraz, Iran. The data were collected using a socio-demographic information form and evaluation forms of professors prepared by the Educational Development Center (EDC). The faculty members were assessed by the students in undergraduate and graduate classes. Among the study subjects, 23 faculty members filled out the self-assessment forms which were then evaluated by 23 students. Then, the data were analyzed using SPSS statistical software, version 14. A paired t-test was used to compare the students' evaluation of the faculty members' performance and the professors' self-assessment. The mean score of self-assessment of the faculty members who taught undergraduate courses was 289.7±8.3, while that of the students' evaluation was 281.3±16.1; the difference was statistically significant (t=3.56, p=0.001). Besides, the mean score of the self-assessment of the faculty members who taught graduate courses was 269.0±9.7, while that of the students' evaluation was 265.7±14.6 but the difference was not statistically significant (t=1.09, p=0.28). The faculty's perceptions of their teaching performance were closer to those of the graduate students than to those of the undergraduate students. This may reflect better understanding of coursework at this level compared to the undergraduate students. Faculty members may need to adjust teaching methods to improve students' performance and understanding, especially at the undergraduate level.
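For readers unfamiliar with the paired t-test named in this abstract, a minimal sketch of the test statistic follows. The scores are hypothetical and are not the study's data; `paired_t` is an illustrative name, and the p-value lookup is omitted to keep the sketch dependency-free.

```python
import math


def paired_t(x, y):
    """Return (t statistic, degrees of freedom) for paired samples x and y."""
    diffs = [a - b for a, b in zip(x, y)]
    n = len(diffs)
    mean_diff = sum(diffs) / n
    # Sample variance of the paired differences (n - 1 in the denominator).
    var_diff = sum((d - mean_diff) ** 2 for d in diffs) / (n - 1)
    standard_error = math.sqrt(var_diff / n)
    return mean_diff / standard_error, n - 1


# Hypothetical self-assessment and student-evaluation scores for five instructors.
self_scores = [290, 285, 295, 288, 292]
student_scores = [280, 284, 279, 290, 275]
t_stat, df = paired_t(self_scores, student_scores)
print(t_stat, df)
```

The test works on the per-instructor differences, so it asks whether the mean difference between the two score types is distinguishable from zero.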
Assessing technical performance in differential gene expression experiments with external spike-in RNA control ratio mixtures.

PubMed

Munro, Sarah A; Lund, Steven P; Pine, P Scott; Binder, Hans; Clevert, Djork-Arné; Conesa, Ana; Dopazo, Joaquin; Fasold, Mario; Hochreiter, Sepp; Hong, Huixiao; Jafari, Nadereh; Kreil, David P; Łabaj, Paweł P; Li, Sheng; Liao, Yang; Lin, Simon M; Meehan, Joseph; Mason, Christopher E; Santoyo-Lopez, Javier; Setterquist, Robert A; Shi, Leming; Shi, Wei; Smyth, Gordon K; Stralis-Pavese, Nancy; Su, Zhenqiang; Tong, Weida; Wang, Charles; Wang, Jian; Xu, Joshua; Ye, Zhan; Yang, Yong; Yu, Ying; Salit, Marc

2014-09-25

There is a critical need for standard approaches to assess, report and compare the technical performance of genome-scale differential gene expression experiments. Here we assess technical performance with a proposed standard 'dashboard' of metrics derived from analysis of external spike-in RNA control ratio mixtures. These control ratio mixtures with defined abundance ratios enable assessment of diagnostic performance of differentially expressed transcript lists, limit of detection of ratio (LODR) estimates and expression ratio variability and measurement bias. The performance metrics suite is applicable to analysis of a typical experiment, and here we also apply these metrics to evaluate technical performance among laboratories. An interlaboratory study using identical samples shared among 12 laboratories with three different measurement processes demonstrates generally consistent diagnostic power across 11 laboratories. Ratio measurement variability and bias are also comparable among laboratories for the same measurement process. We observe different biases for measurement processes using different mRNA-enrichment protocols.
Education Reforms and Innovations to Improve Student Assessment Performance

ERIC Educational Resources Information Center

McAfee, Wade J.

2014-01-01

International assessments such as the Trends in International Mathematics and Science Study (TIMSS) and Program for International Student Assessment (PISA) have shown that United States students, specifically in the fourth and eighth grades, are not performing well when compared to their international peers. Educational stakeholders including…
Comparative Performance Assessment of 5kW-Class Solid Oxide Fuel Cell Engines Integrated With Single/Dual-Spool Turbochargers

DTIC Science & Technology

2011-01-01

So-Ryeok Oh, Jing Sun... fundamental operating regime to the part load performance. Two different mechanical designs are assumed: dual shaft and single shaft as the compressor
Comparing the Performance and Preference of Students Experiencing a Reading Aloud Accommodation to Those Who Do Not on a Virtual Science Assessment

ERIC Educational Resources Information Center

Shelton, Angela

2012-01-01

Many United States secondary students perform poorly on standardized summative science assessments. Situated Assessments using Virtual Environments (SAVE) Science is an innovative assessment project that seeks to capture students' science knowledge and understanding by contextualizing problems in a game-based virtual environment called…
Evaluating the Effect of Learning Style and Student Background on Self-Assessment Accuracy

ERIC Educational Resources Information Center

Alaoutinen, Satu

2012-01-01

This study evaluates a new taxonomy-based self-assessment scale and examines factors that affect assessment accuracy and course performance. The scale is based on Bloom's Revised Taxonomy and is evaluated by comparing students' self-assessment results with course performance in a programming course. Correlation has been used to reveal possible…
Comparing Assessment Methods in Undergraduate Statistics Courses

ERIC Educational Resources Information Center

Baxter, Sarah E.

2017-01-01

The purpose of this study was to compare undergraduate students' academic performance and attitudes about statistics in the context of two different types of assessment structures for an introductory statistics course. One assessment structure used in-class quizzes that emphasized computation and procedural fluency as well as vocabulary…
COMPARATIVE ANALYSIS OF HEALTH RISK ASSESSMENTS FOR MUNICIPAL WASTE COMBUSTORS

EPA Science Inventory

Quantitative health risk assessments have been performed for a number of proposed municipal waste combustor (MWC) facilities over the past several years. This article presents the results of a comparative analysis of a total of 21 risk assessments, focusing on seven of the most co...
The Impact on Student Achievement Following Professional Development on the Principles of Formative Assessment

ERIC Educational Resources Information Center

DeNome, Evonne C.

2015-01-01

This quantitative study reviews the impact on student achievement following professional development on the principles of formative assessment. The study compared mathematics and reading performance data from student populations with teachers who received training in formative assessment to performance data from student populations with teachers…
A comparative analysis of multiple-choice and student performance-task assessment in the high school biology classroom

NASA Astrophysics Data System (ADS)

Cushing, Patrick Ryan

This study compared the performance of high school students on laboratory assessments. Thirty-four high school students who were enrolled in the second semester of a regular biology class or had completed the biology course the previous semester participated in this study. They were randomly assigned to examinations of two formats, performance-task and traditional multiple-choice, from two content areas, using a compound light microscope and diffusion. Students were directed to think-aloud as they performed the assessments. Additional verbal data were obtained during interviews following the assessment. The tape-recorded narrative data were analyzed for type and diversity of knowledge and skill categories, and percentage of in-depth processing demonstrated. While overall mean scores on the assessments were low, elicited statements provided additional insight into student cognition. Results indicated that a greater diversity of knowledge and skill categories was elicited by the two microscope assessments and by the two performance-task assessments. In addition, statements demonstrating in-depth processing were coded most frequently in narratives elicited during clinical interviews following the diffusion performance-task assessment. This study calls for individual teachers to design authentic assessment practices and apply them to daily classroom routines. Authentic assessment should be an integral part of the learning process and not merely an end result. In addition, teachers are encouraged to explicitly identify and model, through think-aloud methods, desired cognitive behaviors in the classroom.
Self-assessment differences between genders in a low-stakes objective structured clinical examination (OSCE).

PubMed

Madrazo, Lorenzo; Lee, Claire B; McConnell, Meghan; Khamisa, Karima

2018-06-15

Physicians and medical students are generally poor self-assessors. Research suggests that this inaccuracy in self-assessment differs by gender among medical students whereby females underestimate their performance compared to their male counterparts. However, whether this gender difference in self-assessment is observable in low-stakes scenarios remains unclear. Our study's objective was to determine whether self-assessment differed between male and female medical students when compared to peer-assessment in a low-stakes objective structured clinical examination. Thirty-three (15 males, 18 females) third-year students participated in a 5-station mock objective structured clinical examination. Trained fourth-year student examiners scored their performance on a 6-point Likert-type global rating scale. Examinees also scored themselves using the same scale. To examine gender differences in medical students' self-assessment abilities, mean self-assessment global rating scores were compared with peer-assessment global rating scores using an independent samples t test. Overall, female students' self-assessment scores were significantly lower compared to peer-assessment (p < 0.001), whereas no significant difference was found between self- and peer-assessment scores for male examinees (p = 0.228). This study provides further evidence that underestimation in self-assessment among females is observable even in a low-stakes formative objective structured clinical examination facilitated by fellow medical students.
Educational consequences of developmental speech disorder: Key Stage 1 National Curriculum assessment results in English and mathematics.

PubMed

Nathan, Liz; Stackhouse, Joy; Goulandris, Nata; Snowling, Margaret J

2004-06-01

Children with speech difficulties may have associated educational problems. This paper reports a study examining the educational attainment of children at Key Stage 1 of the National Curriculum who had previously been identified with a speech difficulty. (1) To examine the educational attainment at Key Stage 1 of children diagnosed with speech difficulties two/three years prior to the present study. (2) To compare the Key Stage 1 assessment results of children whose speech problems had resolved at the time of assessment with those whose problems persisted. Data were available from 39 children who had an earlier diagnosis of speech difficulties at age 4/5 (from an original cohort of 47) at the age of 7. A control group of 35 children identified and matched at preschool on age, nonverbal ability and gender provided comparative data. Results of Statutory Assessment Tests (SATs) in reading, reading comprehension, spelling, writing and maths, administered to children at the end of Year 2 of school were analysed. Performance across the two groups was compared. Performance was also compared to published statistics on national levels of attainment. Children with a history of speech difficulties performed less well than controls on reading, spelling and maths. However, children whose speech problems had resolved by the time of assessment performed no differently to controls. Children with persisting speech problems performed less well than controls on tests of literacy and maths. Spelling performance was a particular area of difficulty for children with persisting speech problems. Children with speech difficulties are likely to perform less well than expected on literacy and maths SATs at age 7. Performance is related to whether the speech problem resolves early on and whether associated language problems exist.
Whilst it is unclear whether poorer performance on maths is because of the language components of this task, the results indicate that speech problems, especially persisting ones, can affect the ability to access the National Curriculum to expected levels.

Web-based application on employee performance assessment using exponential comparison method

NASA Astrophysics Data System (ADS)

Maryana, S.; Kurnia, E.; Ruyani, A.

2017-02-01

Employee performance assessment, also called a performance review or performance evaluation, is an effort to assess staff achievements with the aim of increasing the productivity of employees and companies. This application helps in the assessment of employee performance using five criteria: Presence, Quality of Work, Quantity of Work, Discipline, and Teamwork. The system uses the Exponential Comparison Method and Eckenrode weighting. Calculation results are presented as graphs showing the assessment of each employee. The system was written using the Notepad++ editor with a MySQL database. Testing indicates that the application corresponds to the design and runs properly. The tests conducted were a structural test, a functional test, validation, sensitivity analysis, and SUMI testing.
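The Exponential Comparison Method is commonly described as scoring each alternative by summing its criterion ratings raised to the power of the criterion weights. The sketch below follows that description; the weights, ratings, and function names are hypothetical, the abstract does not give the application's actual values, and the Eckenrode step that would derive the weights is not shown.

```python
# Hypothetical integer importance weights for the five criteria in the abstract.
CRITERIA_WEIGHTS = {
    "presence": 2,
    "quality_of_work": 4,
    "quantity_of_work": 3,
    "discipline": 2,
    "teamwork": 3,
}


def exponential_score(ratings, weights):
    """Total score = sum over criteria of rating ** weight."""
    return sum(ratings[criterion] ** weight for criterion, weight in weights.items())


def rank_employees(all_ratings, weights):
    """Return employee names ordered from highest to lowest total score."""
    return sorted(
        all_ratings,
        key=lambda name: exponential_score(all_ratings[name], weights),
        reverse=True,
    )


# Hypothetical ratings on a 1-5 scale.
employees = {
    "employee_a": {"presence": 5, "quality_of_work": 3, "quantity_of_work": 4,
                   "discipline": 4, "teamwork": 3},
    "employee_b": {"presence": 3, "quality_of_work": 5, "quantity_of_work": 3,
                   "discipline": 3, "teamwork": 4},
}
print(rank_employees(employees, CRITERIA_WEIGHTS))
```

Because ratings are exponentiated, a high rating on a heavily weighted criterion dominates the total, which spreads the scores of alternatives further apart than a weighted sum would.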
Can Online Course-Based Assessment Methods Be Fair and Equitable? Relationships between Students' Preferences and Performance within Online and Offline Assessments

ERIC Educational Resources Information Center

Hewson, C.

2012-01-01

To address concerns raised regarding the use of online course-based summative assessment methods, a quasi-experimental design was implemented in which students who completed a summative assessment either online or offline were compared on performance scores when using their self-reported "preferred" or "non-preferred" modes.…
The comparison of performances of preschool children on two motor assessments.

PubMed

Logan, S Wood; Robinson, Leah E; Getchell, Nancy

2011-12-01

Understanding children's motor performance on different assessments is important for researchers. The Test of Gross Motor Development-2 (TGMD-2) and the Movement Assessment Battery for Children-2 (MABC-2) are motor assessments that use either a process- or product-oriented scoring approach. However, no studies have examined how performances are related to these two types of assessment. This study compared the performance of preschool children on the TGMD-2 and the MABC-2. 32 children (M age = 4.2 yr., SD = 9) completed each test to assess whether each described motor performance similarly. Significant low to moderate Spearman's rank correlations (r2 range = .13-.40) were found between the subscales of the assessments. A related-samples Wilcoxon signed rank test was not significant between total performances on the TGMD-2 and MABC-2. From a practical standpoint, each assessment provides a similar overall description of motor competence in preschool children. However, each assessment results in scores that present different information about motor performance.
Formative Assessment of Procedural Skills: Students' Responses to the Objective Structured Clinical Examination and the Integrated Performance Procedural Instrument

ERIC Educational Resources Information Center

Nestel, Debra; Kneebone, Roger; Nolan, Carmel; Akhtar, Kash; Darzi, Ara

2011-01-01

Assessment of clinical skills is a critical element of undergraduate medical education. We compare a traditional approach to procedural skills assessment--the Objective Structured Clinical Examination (OSCE) with the Integrated Performance Procedural Instrument (IPPI). In both approaches, students work through "stations" or…
Criterion Validity and Practical Utility of the Minnesota Multiphasic Personality Inventory-2-Restructured Form (MMPI-2-RF) in Assessments of Police Officer Candidates.

PubMed

Tarescavage, Anthony M; Corey, David M; Gupton, Herbert M; Ben-Porath, Yossef S

2015-01-01

Minnesota Multiphasic Personality Inventory-2-Restructured Form scores for 145 male police officer candidates were compared with supervisor ratings of field performance and problem behaviors during their initial probationary period. Results indicated that the officers produced meaningfully lower and less variant substantive scale scores compared to the general population. After applying a statistical correction for range restriction, substantive scale scores from all domains assessed by the inventory demonstrated moderate to large correlations with performance criteria. The practical significance of these results was assessed with relative risk ratio analyses that examined the utility of specific cutoffs on scales demonstrating associations with performance criteria.
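The relative risk ratio analysis mentioned in this abstract compares the rate of a negative outcome among candidates above a scale cutoff with the rate among those below it. A minimal sketch, assuming a simple 2x2 table with hypothetical counts (the study's cutoffs and counts are not given here), might look like this:

```python
def relative_risk(above_with, above_without, below_with, below_without):
    """Relative risk of an outcome for candidates above vs. below a cutoff.

    above_with / above_without: counts above the cutoff with / without the outcome.
    below_with / below_without: counts below the cutoff with / without the outcome.
    """
    risk_above = above_with / (above_with + above_without)
    risk_below = below_with / (below_with + below_without)
    return risk_above / risk_below


# Hypothetical counts: 6 of 20 candidates above the cutoff had a problem
# behavior, compared with 10 of 125 below it.
print(relative_risk(6, 14, 10, 115))
```

A ratio well above 1 would indicate that scoring above the cutoff is associated with a higher likelihood of the outcome, which is the sense of "practical utility" examined in the abstract.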
The NASA Performance Assessment Workstation: cognitive performance during head-down bed rest.

PubMed

Shehab, R L; Schlegel, R E; Schiflett, S G; Eddy, D R

1998-01-01

The NASA Performance Assessment Workstation was used to assess cognitive performance changes in eight males subjected to seventeen days of 6 degrees head-down bed rest. PAWS uses six performance tasks to assess directed and divided attention, spatial, mathematical, and memory skills, and tracking ability. Subjective scales assess overall fatigue and mood state. Subjects completed training trials, practice trials, bed rest trials, and recovery trials. The last eight practice trials and all bed rest trials were performed with subjects lying face-down on a gurney. In general, there was no apparent cumulative effect of bed rest. Following a short period of performance stabilization, a slight but steady trend of performance improvement was observed across all trials. For most tasks, this trend of performance improvement was enhanced during recovery. No statistically significant differences in performance were observed when comparing bed rest with the control period. Additionally, fatigue scores showed little change across all periods.
The NASA performance assessment workstation: Cognitive performance during head-down bed rest

NASA Astrophysics Data System (ADS)

Shehab, Randa L.; Schlegel, Robert E.; Schiflett, Samuel G.; Eddy, Douglas R.

The NASA Performance Assessment Workstation was used to assess cognitive performance changes in eight males subjected to seventeen days of 6° head-down bed rest. PAWS uses six performance tasks to assess directed and divided attention, spatial, mathematical, and memory skills, and tracking ability. Subjective scales assess overall fatigue and mood state. Subjects completed training trials, practice trials, bed rest trials, and recovery trials. The last eight practice trials and all bed rest trials were performed with subjects lying face-down on a gurney. In general, there was no apparent cumulative effect of bed rest. Following a short period of performance stabilization, a slight but steady trend of performance improvement was observed across all trials. For most tasks, this trend of performance improvement was enhanced during recovery. No statistically significant differences in performance were observed when comparing bed rest with the control period. Additionally, fatigue scores showed little change across all periods.
Using a virtual reality game to assess goal-directed hand movements in children: A pilot feasibility study.

PubMed

Gabyzon, M Elboim; Engel-Yeger, B; Tresser, S; Springer, S

2016-01-01

Virtual reality gaming environments may be used as a supplement to the motor performance assessment tool box by providing clinicians with quantitative information regarding motor performance in terms of movement accuracy and speed, as well as sensory motor integration under different levels of dual tasking. To examine the feasibility of using the virtual reality game 'Timocco' as an assessment tool for evaluating goal-directed hand movements among typically developing children. In this pilot study, 47 typically-developing children were divided into two age groups, 4-6 years old and 6-8 years old. Performance was measured using two different virtual environment games (Bubble Bath and Falling Fruit), each with two levels of difficulty. Discriminative validity (age effect) was examined by comparing the performance of the two groups, and by comparing the performance between levels of the games for each group (level effect). Test-retest reliability was examined by reassessing the older children 3-7 days after the first session. The older children performed significantly better in terms of response time, action time, game duration, and efficiency in both games compared to the younger children. Both age groups demonstrated poorer performance at the higher game level in the Bubble Bath game compared to the lower level. A similar level effect was found in the Falling Fruit game for both age groups in response time and efficiency, but not in action time. The performance of the older children was not significantly different between the two sessions at both game levels. The discriminative validity and test-retest reliability indicate the feasibility of using the Timocco virtual reality game as a tool for assessing goal-directed hand movements in children. Further studies should examine its feasibility for use in children with disabilities.
The method of educational assessment affects children's neural processing and performance: behavioural and fMRI Evidence

NASA Astrophysics Data System (ADS)

Howard, Steven J.; Burianová, Hana; Calleia, Alysha; Fynes-Clinton, Samuel; Kervin, Lisa; Bokosmaty, Sahar

2017-08-01

Standardised educational assessments are now widespread, yet their development has given comparatively more consideration to what to assess than how to optimally assess students' competencies. Existing evidence from behavioural studies with children and neuroscience studies with adults suggests that the method of assessment may affect neural processing and performance, but current evidence remains limited. To investigate the impact of assessment methods on neural processing and performance in young children, we used functional magnetic resonance imaging to identify and quantify the neural correlates during performance across a range of current approaches to standardised spelling assessment. Results indicated that children's test performance declined as the cognitive load of assessment method increased. Activation of neural nodes associated with working memory further suggests that this performance decline may be a consequence of a higher cognitive load, rather than the complexity of the content. These findings provide insights into principles of assessment (re)design, to ensure assessment results are an accurate reflection of students' true levels of competency.
Video self-assessment of basic suturing and knot tying skills by novice trainees.

PubMed

Hu, Yinin; Tiemann, Debbie; Michael Brunt, L

2013-01-01

Self-assessment is important to learning but few studies have utilized video self-assessment of basic surgical skills. We compared a video self-assessment of suturing and knot tying skills by novice trainees to the assessment by a senior attending surgeon. Sixteen senior medical students and 7 beginner surgical interns were video-recorded while performing five suturing and knot tying tasks. All videos were analyzed using objective structured assessment of technical skills (OSATS) metrics (1-5 scale; 1 = novice, 5 = expert). Video self-assessment was carried out within 4 weeks of an instructional session and subsequently by one senior surgery instructor (blinded to the individual). Both a Global score and total combined OSATS scores were analyzed. Total possible OSATS scores were: interrupted suture, 30; subcuticular closure, 30; one- and two-handed knot tying, 25 each; tying in a restricted space, 20 (maximum combined score, 130 points). Confidence levels in performing the tasks pre-test and the value of video self-assessment were rated on a 1-5 Likert scale (1 = low and 5 = high). Data are mean±SD and statistical significance was evaluated using Friedman's test. Self-assessment scoring was significantly higher than the assessment by a senior instructor for three tasks by global score and all five tasks by combined OSATS score (self-assessment 71.8±16.7 vs attending assessment 56.7±11.0, p = 0.007). Mean self-assessment Global scores ranged from 2.5 to 2.8 for all tasks performed compared to 1.8-2.3 for attending surgeon assessment (p<0.05). Confidence levels demonstrated no correlation to performance speed or proficiency. The video self-assessment was rated as a highly valuable (mean 4.3±0.8) component to skills training. Novice trainees over-estimate their basic technical skills performance compared to the assessment by a senior surgeon.
Video self-assessment may be a valuable addition to a pre-residency and surgical internship preparatory curriculum in basic suturing and knot tying. Copyright © 2013 Association of Program Directors in Surgery. Published by Elsevier Inc. All rights reserved.
[Teaching performance assessment in Public Health employing three different strategies].

PubMed

Martínez-González, Adrián; Moreno-Altamirano, Laura; Ponce-Rosas, Efrén Raúl; Martínez-Franco, Adrián Israel; Urrutia-Aguilar, María Esther

2011-01-01

The educational system depends upon the quality and performance of its faculty and should therefore be subject to a process of continuous improvement. To assess the teaching performance of the Public Health professors at the Faculty of Medicine, UNAM, through three strategies. Justification study. The evaluation was conducted under a mediational model through three strategies: students' opinion assessment, self-assessment and students' academic achievement. We applied descriptive statistics, Student t test, ANOVA and Pearson correlation. Twenty professors from the Public Health department were evaluated, representing 57% of all those who teach the subject. The professors' performance was rated higher in self-assessment than in the assessment of student opinion; statistical analysis confirmed that the difference was significant. The difference among the three evaluation strategies became more evident between self-assessment and the scores obtained by students in their academic achievement. The integration of these three strategies offers a more complete view of the teacher's performance quality. Academic achievement appears to be a more objective strategy for teaching performance assessment than students' opinion and self-assessment.
Factors That May Explain Differences between Home and Clinic Meal Preparation Task Assessments in Frail Older Adults

ERIC Educational Resources Information Center

Provencher, Veronique; Demers, Louise; Gelinas, Isabelle

2012-01-01

Meal preparation assessments conducted in clinical environments (such as rehabilitation settings) might not reflect frail patients' performance at home. In addition, factors that may explain differences in performance between settings remain unknown. The aim of this study was to compare home and clinic performance on meal preparation tasks in…
How Well Do U.S. High School Students Achieve in Spanish When Compared to Native Spanish Speakers?

ERIC Educational Resources Information Center

Sparks, Richard L.; Luebbers, Julie; Castañeda, Martha E.

2017-01-01

Foreign language educators have developed measures to assess the proficiency of U.S. high school learners. Most have compared language learners to clearly defined criteria for proficiency in the language (criterion-referenced assessment) or to the performance of other monolingual English speakers (norm-referenced assessment). In this study, the…
High-performance thin layer chromatography to assess pharmaceutical product quality.

PubMed

Kaale, Eliangiringa; Manyanga, Vicky; Makori, Narsis; Jenkins, David; Michael Hope, Samuel; Layloff, Thomas

2014-06-01

To assess the sustainability, robustness and economic advantages of high-performance thin layer chromatography (HPTLC) for quality control of pharmaceutical products. We compared three laboratories where three lots of cotrimoxazole tablets were assessed using different techniques for quantifying the active ingredient. The average assay relative standard deviation for the three lots was 1.2 with a range of 0.65-2.0. High-performance thin layer chromatography assessments are yielding valid results suitable for assessing product quality. The local pharmaceutical manufacturer had evolved the capacity to produce very high quality products. © 2014 John Wiley & Sons Ltd.
Use of diagnostic accuracy as a metric for evaluating laboratory proficiency with microarray assays using mixed-tissue RNA reference samples.

PubMed

Pine, P S; Boedigheimer, M; Rosenzweig, B A; Turpaz, Y; He, Y D; Delenstarr, G; Ganter, B; Jarnagin, K; Jones, W D; Reid, L H; Thompson, K L

2008-11-01

Effective use of microarray technology in clinical and regulatory settings is contingent on the adoption of standard methods for assessing performance. The MicroArray Quality Control project evaluated the repeatability and comparability of microarray data on the major commercial platforms and laid the groundwork for the application of microarray technology to regulatory assessments. However, methods for assessing performance that are commonly applied to diagnostic assays used in laboratory medicine remain to be developed for microarray assays. A reference system for microarray performance evaluation and process improvement was developed that includes reference samples, metrics and reference datasets. The reference material is composed of two mixes of four different rat tissue RNAs that allow defined target ratios to be assayed using a set of tissue-selective analytes that are distributed along the dynamic range of measurement. The diagnostic accuracy of detected changes in expression ratios, measured as the area under the curve from receiver operating characteristic plots, provides a single commutable value for comparing assay specificity and sensitivity. The utility of this system for assessing overall performance was evaluated for relevant applications like multi-laboratory proficiency testing programs and single-laboratory process drift monitoring. The diagnostic accuracy of detection of a 1.5-fold change in signal level was found to be a sensitive metric for comparing overall performance. This test approaches the technical limit for reliable discrimination of differences between two samples using this technology. We describe a reference system that provides a mechanism for internal and external assessment of laboratory proficiency with microarray technology and is translatable to performance assessments on other whole-genome expression arrays used for basic and clinical research.
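The diagnostic-accuracy metric in this abstract is the area under the ROC curve (AUC). One standard way to compute it is as the fraction of positive/negative pairs in which the positive item receives the higher score. The sketch below uses that pairwise formulation with hypothetical scores; it is not the authors' pipeline, and `roc_auc` is an illustrative name.

```python
def roc_auc(positive_scores, negative_scores):
    """AUC as the probability that a positive outscores a negative.

    Ties between a positive and a negative score count as half a win.
    """
    wins = 0.0
    for p in positive_scores:
        for n in negative_scores:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(positive_scores) * len(negative_scores))


# Hypothetical detection scores: analytes with a true 1.5-fold change
# (positives) versus analytes with no change (negatives).
changed = [2.1, 1.8, 1.4, 2.5]
unchanged = [0.3, 1.5, 0.9, 0.2]
print(roc_auc(changed, unchanged))
```

An AUC of 1.0 means every truly changed analyte scores above every unchanged one, while 0.5 corresponds to chance-level discrimination; this single value is what makes the metric convenient for comparing laboratories.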
COMPARATIVE ASSESSMENT OF BASELINE GASOLINE AND OXYFUELS

EPA Science Inventory

Despite the ubiquity of gasoline for several decades and more recent modifications in fuel formulations to achieve “cleaner” gasoline, a quantitative comparative assessment of the health risks related to these fuels remains to be performed. Under authority of Clean Air Act secti...
Distributed Low Temperature Combustion: Fundamental Understanding of Combustion Regime Transitions

DTIC Science & Technology

2016-09-07

behaviour as compared to ethanol. The latter fuel has also been considered along with methane. Work has also been performed on the further assessment of...identification of various combustion gas states. A range of Damköhler numbers (Da) from the conventional propagating flamelet regime well into the distributed
Game-Based Assessment: Investigating the Impact on Test Anxiety and Exam Performance

ERIC Educational Resources Information Center

Mavridis, A.; Tsiatsos, T.

2017-01-01

The aim of this study is to assess the impact of a 3D educational computer game on students' test anxiety and exam performance when used in evaluative situations as compared to the traditional method of examination. The participants of the study were students in tertiary education who were examined using game-based assessment and traditional…
Brief International Cognitive Assessment for Multiple Sclerosis (BICAMS) and performance of everyday life tasks: Actual Reality.

PubMed

Goverover, Yael; Chiaravalloti, Nancy; DeLuca, John

2016-04-01

Recently, a brief cognitive assessment (Brief International Cognitive Assessment for Multiple Sclerosis: BICAMS) has been recommended for use with patients diagnosed with multiple sclerosis (MS) to screen for cognitive impairments. However, the relationship between the BICAMS and everyday life activity has not been examined. The aim of this study was to examine whether the BICAMS can predict performance of activities of daily living using Actual Reality(TM) (AR) in persons with MS. A between-subjects design was utilized to compare 41 individuals with MS and 32 healthy controls (HC) performing BICAMS and an AR task. Participants were asked to access the internet to purchase a flight ticket or cookies, and were administered the BICAMS and questionnaires to assess quality of life (QOL), affect symptomatology, and prior internet experience. Participants with MS performed significantly worse than HC on the BICAMS and the AR. Additionally, better BICAMS performance was associated with more independent AR performance. Self-reports of QOL were not correlated with AR or BICAMS performance. Individuals with MS have greater problems with actual everyday life tasks as compared to HC. The BICAMS is a promising cognitive screening tool to predict actual functional performance in participants with MS. © The Author(s), 2015.
Comparison of performance-based assessment and real world skill in people with serious mental illness: Ecological validity of the Test of Grocery Shopping Skills.

PubMed

Faith, Laura A; Rempfer, Melisa V

2018-05-07

Valid functional measures are essential for clinical and research efforts that address recovery and community functioning in people with serious mental illness. Although there is a great deal of interest in functional assessment, there is limited research supporting how well current evaluation methods provide a true assessment of real world functioning or naturalistic behavior. To address this gap in the literature, the present study examined the performance of individuals with serious mental illness (i.e., diagnosis of schizophrenia-spectrum, bipolar disorder, or other depression/anxiety diagnoses and accompanying functional disability) on the Test of Grocery Shopping Skills (TOGSS), a performance-based naturalistic task. We compared TOGSS performance to two dimensions of real world functioning: directly observed real world grocery shopping and ratings of community functioning. Results indicated that the TOGSS was significantly associated with real life grocery shopping, in terms of both shopping accuracy (r = 0.424) and time (r = 0.491). Further, self-report and observer-rated methods of assessing real world shopping behaviors were significantly correlated (r = 0.455). To our knowledge, this is one of the first studies to directly compare a performance-based naturalistic skill assessment with carefully observed real world performance of that skill in people with serious mental illness. These findings support the feasibility and ecological validity of performance-based naturalistic assessment with the TOGSS. Copyright © 2018 Elsevier B.V. All rights reserved.

Dual-Task Performance: Influence of Frailty, Level of Physical Activity, and Cognition.

PubMed

Giusti Rossi, Paulo; Pires de Andrade, Larissa; Hotta Ansai, Juliana; Silva Farche, Ana Claudia; Carnaz, Leticia; Dalpubel, Daniela; Ferriolli, Eduardo; Assis Carvalho Vale, Francisco; de Medeiros Takahashi, Anielle Cristhine

2018-03-08

Cognition and level of physical activity have been associated with frailty syndrome. The development of tools that assess deficits related to physical and cognitive frailties simultaneously is of common interest. However, little is known about how much these aspects influence the performance of dual-task tests. Our aims were (a) to verify the influence of frailty syndrome and objectively measured physical activity and cognition on the Timed Up and Go (TUG) test and Timed Up and Go associated with dual-task (TUG-DT) performances; and (b) to compare TUG and TUG-DT performances between older adults who develop frailty syndrome. Sixty-four community-dwelling older adults were divided into frail, prefrail, and nonfrail groups, according to frailty phenotype. Assessments included anamnesis, screening of frailty syndrome, cognitive assessment (Addenbrooke's cognitive examination), placement of a triaxial accelerometer to assess level of physical activity, and TUG and TUG-DT (TUG associated with a motor-cognitive task of calling a phone number) performances. After 7 days, the accelerometer was removed. A multiple linear regression was applied to identify which independent variables could explain performances in the TUG and TUG-DT. Subsequently, the analysis of covariance test, adjusted for age, cognition, and level of physical activity covariates, was used to compare test performances. There were no differences in cognition between groups. Significant differences in the level of physical activity were found in the frail group. Compared with the frail group, the nonfrail group required less time and fewer steps to complete the TUG. Regarding the TUG-DT, cognition and age influenced the time spent and number of steps, respectively; however, no differences were found between groups. Frail older adults presented worse performance in the TUG when compared with nonfrail older adults. The dual-task test does not differentiate older adults with frailty syndrome, regardless of cognitive performance.
[Aerosol deposition and clinical performance verified with a spacer device made in Brazil].

PubMed

Camargos, P A; Rubim, J A; Simal, C J; Lasmar, L M

2000-01-01

OBJECTIVE: To assess the lung deposition pattern of radioaerosol and the clinical performance of a spacer developed and made in Brazil. METHODS: Qualitative (in a patient with cystic fibrosis) and semi-quantitative (in two healthy volunteers) assessment of pulmonary deposition of (99m)technetium was done using the Aerogama Medical oxygen-driven nebulizer system attached to the spacer and a gamma camera (Siemens, model Orbiter) connected to a microcomputer. In the next step, clinical assessment was carried out in 50 asthmatic children, aged from four months to 13 years, with an acute attack, using conventional doses of albuterol through a metered dose inhaler attached to the spacer device. RESULTS: Qualitative assessment revealed a lung silhouette comparable with those obtained in inhalation scintigraphy, and semi-quantitative assessment revealed that 7.5% to 8.0% of the inhaled (99m)technetium reached the volunteers' lungs. Statistically significant differences (p < 0.001) were observed comparing clinical scores at admission with those verified 20 and 40 minutes after albuterol inhalation; conversely, no significance was obtained for scores taken at 60 and 80 minutes. CONCLUSIONS: Although we used an alternative method, the scintigraphic assessment revealed an expected pattern of pulmonary deposition. Similarly, clinical performance in the treatment of an acute attack showed results comparable with those obtained with other spacer devices.
Assessing students' conceptual knowledge of electricity and magnetism

NASA Astrophysics Data System (ADS)

McColgan, Michele W.; Finn, Rose A.; Broder, Darren L.; Hassel, George E.

2017-12-01

We present the Electricity and Magnetism Conceptual Assessment (EMCA), a new assessment aligned with second-semester introductory physics courses. Topics covered include electrostatics, electric fields, circuits, magnetism, and induction. We have two motives for writing a new assessment. First, we find other assessments such as the Brief Electricity and Magnetism Assessment and the Conceptual Survey on Electricity and Magnetism not well aligned with the topics and content depth of our courses. We want to test introductory physics content at a level appropriate for our students. Second, we want the assessment to yield scores and gains comparable to the widely used Force Concept Inventory (FCI). After five testing and revision cycles, the assessment was finalized in early 2015 and is available online. We present performance results for a cohort of 225 students at Siena College who were enrolled in our algebra- and calculus-based physics courses during the spring 2015 and 2016 semesters. We provide pretest, post-test, and gain analyses, as well as individual question and whole test statistics to quantify difficulty and reliability. In addition, we compare EMCA and FCI scores and gains, and we find that students' FCI scores are strongly correlated with their performance on the EMCA. Finally, the assessment was piloted in an algebra-based physics course at George Washington University (GWU). We present performance results for a cohort of 130 GWU students and we find that their EMCA scores are comparable to the scores of students in our calculus-based physics course.
Performance evaluation of Space Shuttle SRB parachutes from air drop and scaled model wind tunnel tests. [Solid Rocket Booster recovery system]

NASA Technical Reports Server (NTRS)

Moog, R. D.; Bacchus, D. L.; Utreja, L. R.

1979-01-01

The aerodynamic performance characteristics have been determined for the Space Shuttle Solid Rocket Booster drogue, main, and pilot parachutes. The performance evaluation on the 20-degree conical ribbon parachutes is based primarily on air drop tests of full scale prototype parachutes. In addition, parametric wind tunnel tests were performed and used in parachute configuration development and preliminary performance assessments. The wind tunnel test data are compared to the drop test results and both sets of data are used to determine the predicted performance of the Solid Rocket Booster flight parachutes. Data from other drop tests of large ribbon parachutes are also compared with the Solid Rocket Booster parachute performance characteristics. Parameters assessed include full open terminal drag coefficients, reefed drag area, opening characteristics, clustering effects, and forebody interference.
The Validation of a Case-Based, Cumulative Assessment and Progressions Examination

PubMed Central

Coker, Adeola O.; Copeland, Jeffrey T.; Gottlieb, Helmut B.; Horlen, Cheryl; Smith, Helen E.; Urteaga, Elizabeth M.; Ramsinghani, Sushma; Zertuche, Alejandra; Maize, David

2016-01-01

Objective. To assess content and criterion validity, as well as reliability of an internally developed, case-based, cumulative, high-stakes third-year Annual Student Assessment and Progression Examination (P3 ASAP Exam). Methods. Content validity was assessed through the writing-reviewing process. Criterion validity was assessed by comparing student scores on the P3 ASAP Exam with the nationally validated Pharmacy Curriculum Outcomes Assessment (PCOA). Reliability was assessed with psychometric analysis comparing student performance over four years. Results. The P3 ASAP Exam showed content validity through representation of didactic courses and professional outcomes. Similar scores on the P3 ASAP Exam and PCOA with Pearson correlation coefficient established criterion validity. Consistent student performance using Kuder-Richardson coefficient (KR-20) since 2012 reflected reliability of the examination. Conclusion. Pharmacy schools can implement internally developed, high-stakes, cumulative progression examinations that are valid and reliable using a robust writing-reviewing process and psychometric analyses. PMID:26941435
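The Kuder-Richardson coefficient (KR-20) mentioned in this abstract can be sketched as follows. This is a minimal illustration only: the function name `kr20` and the response matrix are hypothetical, and the use of the population variance of total scores is an assumption, since the abstract does not report how the coefficient was computed.

```python
def kr20(responses):
    """KR-20 reliability for dichotomous (0/1) item scores.

    responses: one list of 0/1 item scores per examinee.
    Assumes at least two items and non-zero variance in total scores.
    """
    n = len(responses)       # number of examinees
    k = len(responses[0])    # number of items
    # Proportion of examinees answering each item correctly.
    p = [sum(r[i] for r in responses) / n for i in range(k)]
    sum_pq = sum(pi * (1.0 - pi) for pi in p)
    # Population variance of total scores (an assumption of this sketch).
    totals = [sum(r) for r in responses]
    mean_total = sum(totals) / n
    var_total = sum((t - mean_total) ** 2 for t in totals) / n
    return (k / (k - 1)) * (1.0 - sum_pq / var_total)


# Hypothetical scores: four examinees, three items.
scores = [
    [1, 1, 1],
    [1, 1, 0],
    [1, 0, 0],
    [0, 0, 0],
]
reliability = kr20(scores)
print(reliability)  # 0.75
```

Values closer to 1 indicate that items rank examinees more consistently; conventions differ on whether the sample or population variance is used, which shifts the result slightly for small cohorts.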
A comparative study of students' performance in preclinical physiology assessed by multiple choice and short essay questions.

PubMed

Oyebola, D D; Adewoye, O E; Iyaniwura, J O; Alada, A R; Fasanmade, A A; Raji, Y

2000-01-01

This study was designed to compare the performance of medical students in physiology when assessed by multiple choice questions (MCQs) and short essay questions (SEQs). The study also examined the influence of factors such as age, sex, O level grades and JAMB scores on performance in the MCQs and SEQs. A structured questionnaire was administered to 264 medical students four months before the Part I MBBS examination. Apart from personal data of each student, the questionnaire sought information on the JAMB scores and GCE O level grades of each student in English Language, Biology, Chemistry, Physics and Mathematics. The physiology syllabus was divided into five parts and the students were administered separate examinations (tests) on each part. Each test consisted of MCQs and SEQs. Performance in the MCQs and SEQs was compared. Also, the effects of JAMB scores and GCE O level grades on the performance in both the MCQs and SEQs were assessed. The results showed that the students performed better in all MCQ tests than in the SEQs. JAMB scores and O level English Language grade had no significant effect on students' performance in MCQs and SEQs. However, O level grades in Biology, Chemistry, Physics and Mathematics had significant effects on performance in MCQs and SEQs. Inadequate knowledge of physiology and inability to present information in a logical sequence are believed to be major factors contributing to the poorer performance in the SEQs compared with MCQs. In view of the finding of significant association between performance in MCQs and SEQs and GCE O level grades in science subjects and mathematics, it was recommended that both JAMB results and the GCE results in the four O level subjects above be considered when selecting candidates for admission into the medical schools.
Comparative cath-lab assessment of coronary stenosis by radiology technician, junior and senior interventional cardiologist in patients treated with coronary angioplasty.

PubMed

Brunetti, Natale Daniele; Delli Carri, Felice; Ruggiero, Maria Assunta; Cuculo, Andrea; Ruggiero, Antonio; Ziccardi, Luigi; De Gennaro, Luisa; Di Biase, Matteo

2014-03-01

Exact quantification of plaque extension during coronary angioplasty (PCI) usually falls to the interventional cardiologist (IC). Quantitative coronary stenosis assessment (QCA) could possibly be delegated to the radiology technician (RT), who usually supports the cath-lab nurse and IC during PCI. We therefore sought to investigate the reliability of QCA performed by RT in comparison with IC. Forty-four consecutive patients with acute coronary syndrome underwent PCI; target coronary vessel size beneath target coronary lesion (S) and target coronary lesion length (L) were assessed by the RT, junior IC (JIC), and senior IC (SIC) and then compared. SIC evaluation, which determined the final stent selection for coronary stenting, was considered as a reference benchmark. RT performance with QCA support in assessing target vessel size and target lesion length was not significantly different from SIC (r = 0.46, p < 0.01; r = 0.64, p < 0.001, respectively) as well as JIC (r = 0.79, r = 0.75, p < 0.001, respectively). JIC performance was significantly better than RT in assessing target vessel size (p < 0.05), while the difference was not significant when assessing target lesion length. RT may reliably assess the target lesion by using adequate QCA software in the cath-lab during PCI; RT performance does not differ from SIC.
Holistic rubric vs. analytic rubric for measuring clinical performance levels in medical students.

PubMed

Yune, So Jung; Lee, Sang Yeoup; Im, Sun Ju; Kam, Bee Sung; Baek, Sun Yong

2018-06-05

Task-specific checklists, holistic rubrics, and analytic rubrics are often used for performance assessments. We examined what factors evaluators consider important in holistic scoring of clinical performance assessment, and compared the usefulness of applying holistic and analytic rubrics respectively, and analytic rubrics in addition to task-specific checklists based on traditional standards. We compared the usefulness of a holistic rubric versus an analytic rubric in effectively measuring the clinical skill performances of 126 third-year medical students who participated in a clinical performance assessment conducted by Pusan National University School of Medicine. We conducted a questionnaire survey of 37 evaluators who used all three evaluation methods (holistic rubric, analytic rubric, and task-specific checklist) for each student. The relationship between the scores on the three evaluation methods was analyzed using Pearson's correlation. Inter-rater agreement was analyzed by Kappa index. The effect of holistic and analytic rubric scores on the task-specific checklist score was analyzed using multiple regression analysis. Evaluators perceived accuracy and proficiency to be major factors in objective structured clinical examinations evaluation, and history taking and physical examination to be major factors in clinical performance examinations evaluation. Holistic rubric scores were highly related to the scores of the task-specific checklist and analytic rubric. Relatively low agreement was found in clinical performance examinations compared to objective structured clinical examinations. Meanwhile, the holistic and analytic rubric scores explained 59.1% of the task-specific checklist score in objective structured clinical examinations and 51.6% in clinical performance examinations. The results show the usefulness of holistic and analytic rubrics in clinical performance assessment, which can be used in conjunction with task-specific checklists for more efficient evaluation.
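For readers unfamiliar with the Kappa index used for inter-rater agreement in this abstract, the sketch below shows Cohen's kappa for two raters. This is an assumption about the general family of statistic only: the abstract does not say which kappa variant was applied, and the function name `cohen_kappa` and the ratings are invented for illustration.

```python
from collections import Counter


def cohen_kappa(rater_a, rater_b):
    """Cohen's kappa for two raters labelling the same cases.

    kappa = (observed agreement - chance agreement) / (1 - chance agreement)
    Undefined (division by zero) when chance agreement equals 1.
    """
    n = len(rater_a)
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    counts_a = Counter(rater_a)
    counts_b = Counter(rater_b)
    # Agreement expected by chance from each rater's marginal frequencies.
    expected = sum(counts_a[c] * counts_b[c] for c in counts_a) / (n * n)
    return (observed - expected) / (1.0 - expected)


# Hypothetical pass/fail judgements from two evaluators on eight students.
evaluator_1 = ["pass", "pass", "fail", "fail", "pass", "fail", "pass", "pass"]
evaluator_2 = ["pass", "pass", "fail", "pass", "pass", "fail", "fail", "pass"]

kappa = cohen_kappa(evaluator_1, evaluator_2)
print(round(kappa, 3))  # 0.467
```

Here the raters agree on 6 of 8 cases (0.75), but about 0.53 agreement would be expected by chance, so kappa is considerably lower than the raw agreement rate.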
Forces Changing Our Nation's Future: The Comparative Performance of U.S. Adults and Youth on International Literacy Assessments, the Importance of Literacy/Numeracy Proficiencies for Labor Market Success, and the Projected Outlook for Literacy Proficiencies of U.S. Adults

ERIC Educational Resources Information Center

Sum, Andrew

2007-01-01

This presentation is devoted to four main topics: (1) the comparative performance of U.S. adults and high school students on international literacy assessments; (2) the literacy/numeracy proficiencies of the nation's adults in different educational groups and among those who recently participated in federally-funded adult education programs; (3)…
DOT/NASA comparative assessment of Brayton engines for guideway vehicles and busses. Volume 2: Analysis and results

NASA Technical Reports Server (NTRS)

1975-01-01

Gas turbine engines were assessed for application to heavy duty transportation. A summary of the assumptions, applications, and methods of analysis is included along with a discussion of the approach taken, the technical program flow chart, and weighting criteria used for performance evaluation. The various engines are compared on the bases of weight, performance, emissions and noise, technology status, and growth potential. The results of the engine screening phase and the conceptual design phase are presented.
Comparing the Performance of Older Low-Progress Readers on the York Assessment of Reading for Comprehension with Performance on the Neale Analysis of Reading Ability and Other Measures of Reading and Related Skills

ERIC Educational Resources Information Center

Wheldall, Kevin; Arakelian, Sarah

2016-01-01

The aim of this study was to compare the York Assessment of Reading for Comprehension (YARC) with the Neale Analysis of Reading Ability (NARA) and other measures of reading and related skills with a sample of older low-progress readers and to provide additional information regarding the validity of the YARC in Australia. The data from an…
ANALYSING PERFORMANCE ASSESSMENT IN PUBLIC SERVICES: HOW USEFUL IS THE CONCEPT OF A PERFORMANCE REGIME?

PubMed

Martin, Steve; Nutley, Sandra; Downe, James; Grace, Clive

2016-03-01

Approaches to performance assessment have been described as 'performance regimes', but there has been little analysis of what is meant by this concept and whether it has any real value. We draw on four perspectives on regimes - 'institutions and instruments', 'risk regulation regimes', 'internal logics and effects' and 'analytics of government' - to explore how the concept of a multi-dimensional regime can be applied to performance assessment in public services. We conclude that the concept is valuable. It helps to frame comparative and longitudinal analyses of approaches to performance assessment and draws attention to the ways in which public service performance regimes operate at different levels, how they change over time and what drives their development. Areas for future research include analysis of the impacts of performance regimes and interactions between their visible features (such as inspections, performance indicators and star ratings) and the veiled rationalities which underpin them.
Evaluation of Course-Specific Self-Efficacy Assessment Methods.

ERIC Educational Resources Information Center

Bong, Mimi

A study was conducted to compare three methods of assessing course-level self-efficacy beliefs within a multitrait multimethod (MTMM) framework. The methods involved: (1) successfully performing a number of domain-related tasks; (2) obtaining specific letter grades in the course; and (3) successfully performing generic academic tasks in the…
NAEP, Race, Sex and Political Attitudes.

ERIC Educational Resources Information Center

Loney, Brian D.

This study was designed to examine the effects of race and sex on performance on selected affective exercises from the first social studies assessment conducted in 1971-72 by the National Assessment of Educational Progress (NAEP). Compared were the performances of black males versus other males, black females versus other females, black males…
Differences in Neuropsychological Functioning Between Homicidal and Nonviolent Schizophrenia Samples.

PubMed

Stratton, John; Cobia, Derin J; Reilly, James; Brook, Michael; Hanlon, Robert E

2018-02-07

Few studies have compared performance on neurocognitive measures between violent and nonviolent schizophrenia samples. A better understanding of neurocognitive dysfunction in violent individuals with schizophrenia could increase the efficacy of violence reduction strategies and aid in risk assessment and adjudication processes. This study aimed to compare neuropsychological performance between 25 homicide offenders with schizophrenia and 25 nonviolent schizophrenia controls. The groups were matched for age, race, sex, and handedness. Independent t-tests and Mann-Whitney U-tests were used to compare the schizophrenia groups' performance on measures of cognition, including composite scores assessing domain level functioning and individual neuropsychological tests. Results indicated the violent schizophrenia group performed worse on measures of memory and executive functioning, and the Intellectual Functioning composite score, when compared to the nonviolent schizophrenia sample. These findings replicate previous research documenting neuropsychological deficits specific to violent individuals with schizophrenia and support research implicating fronto-limbic dysfunction among violent offenders with schizophrenia. © 2018 American Academy of Forensic Sciences.
Angiographic core laboratory reproducibility analyses: implications for planning clinical trials using coronary angiography and left ventriculography end-points.

PubMed

Steigen, Terje K; Claudio, Cheryl; Abbott, David; Schulzer, Michael; Burton, Jeff; Tymchak, Wayne; Buller, Christopher E; John Mancini, G B

2008-06-01

To assess reproducibility of core laboratory performance and impact on sample size calculations. Little information exists about overall reproducibility of core laboratories in contradistinction to performance of individual technicians. Also, qualitative parameters are being adjudicated increasingly as either primary or secondary end-points. The comparative impact of using diverse indexes on sample sizes has not been previously reported. We compared initial and repeat assessments of five quantitative parameters [e.g., minimum lumen diameter (MLD), ejection fraction (EF), etc.] and six qualitative parameters [e.g., TIMI myocardial perfusion grade (TMPG) or thrombus grade (TTG), etc.], as performed by differing technicians and separated by a year or more. Sample sizes were calculated from these results. TMPG and TTG were also adjudicated by a second core laboratory. MLD and EF were the most reproducible, yielding the smallest sample size calculations, whereas percent diameter stenosis and centerline wall motion require substantially larger trials. Of the qualitative parameters, all except TIMI flow grade gave reproducibility characteristics yielding sample sizes of many 100's of patients. Reproducibility of TMPG and TTG was only moderately good both within and between core laboratories, underscoring an intrinsic difficulty in assessing these. Core laboratories can be shown to provide reproducibility performance that is comparable to performance commonly ascribed to individual technicians. The differences in reproducibility yield huge differences in sample size when comparing quantitative and qualitative parameters. TMPG and TTG are intrinsically difficult to assess and conclusions based on these parameters should arise only from very large trials.
Assessment of the WRF-ARW model during fog conditions in a coastal arid region using different PBL schemes

NASA Astrophysics Data System (ADS)

Temimi, Marouane; Chaouch, Naira; Weston, Michael; Ghedira, Hosni

2017-04-01

This study covers five fog events reported in 2014 at Abu Dhabi International Airport in the United Arab Emirates (UAE). We assess the performance of the WRF-ARW model during fog conditions and we intercompare seven different PBL schemes and assess their impact on the performance of the simulations. Seven PBL schemes, namely, Yonsei University (YSU), Mellor-Yamada-Janjic (MYJ), Mellor-Yamada Nakanishi and Niino (MYNN) level 2.5, Quasi-Normal Scale Elimination (QNSE-EDMF), Asymmetric Convective Model (ACM2), Grenier-Bretherton-McCaa (GBM) and MYNN level 3 were tested. Radiosonde data from the Abu Dhabi International Airport and surface measurements of relative humidity (RH), dew point temperature, wind speed, and temperature profiles were used to assess the performance of the model. All PBL schemes showed comparable skills with relatively higher performance with the QNSE scheme. The average RH Root Mean Square Error (RMSE) and BIAS for all PBLs were 15.75 % and -9.07 %, respectively, whereas the obtained RMSE and BIAS when QNSE was used were 14.65 % and -6.3 % respectively. Comparable skills were obtained for the rest of the variables. Local PBL schemes showed better performance than non-local schemes. Discrepancies between simulated and observed values were higher at the surface level compared to high altitude values. The sensitivity to lead time showed that best simulation performances were obtained when the lead time varies between 12 and 18 hours. In addition, the results of the simulations show that better performance is obtained when the starting condition is dry.
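The two skill scores quoted in this abstract, RMSE and BIAS, can be sketched as below. The helper names and the relative humidity values are hypothetical, and the sign convention (simulated minus observed, so a negative bias means the model is too dry) is an assumption that the abstract does not state explicitly.

```python
import math


def rmse(simulated, observed):
    """Root mean square error between paired simulated and observed values."""
    squared = [(s - o) ** 2 for s, o in zip(simulated, observed)]
    return math.sqrt(sum(squared) / len(squared))


def mean_bias(simulated, observed):
    """Mean of (simulated - observed); negative means underestimation."""
    diffs = [s - o for s, o in zip(simulated, observed)]
    return sum(diffs) / len(diffs)


# Hypothetical relative humidity values in percent (invented for illustration).
rh_simulated = [70.0, 80.0, 90.0, 60.0]
rh_observed = [80.0, 85.0, 95.0, 80.0]

rh_rmse = rmse(rh_simulated, rh_observed)
rh_bias = mean_bias(rh_simulated, rh_observed)
print(round(rh_rmse, 2), rh_bias)  # 11.73 -10.0
```

Because RMSE squares each error, the single 20-point miss dominates it, whereas the bias only reports the average direction of the error.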
Functional impairment and cognitive performance in mood disorders: A community sample of young adults.

PubMed

Reyes, Amanda N; Cardoso, Taiane A; Jansen, Karen; Mondin, Thaíse C; Souza, Luciano D M; Magalhães, Pedro V S; Kapczinski, Flavio; Silva, Ricardo A

2017-05-01

The aim of this study was to compare the global functioning and cognitive performance in a community sample of young adults with mood disorders versus community controls. This was a cross-sectional study nested in a cohort study with a community sample. Data was collected from February 2012 to June 2014; specifically, at a mean of five years after the first phase, all young adults were invited to participate in a re-evaluation. Mini International Neuropsychiatric Interview - PLUS (MINI-PLUS) was used for the diagnosis of mood disorders. The Functional Assessment Short Test (FAST) and the Montreal Cognitive Assessment (MoCA) were used to assess the global functioning and cognitive performance, respectively. A total of 1258 subjects were included. Functional impairment was greater in subjects with bipolar disorder when compared to community controls, and there were no differences between major depressive disorder and community controls. There were no significant differences in cognitive performance between young adults with mood disorders when compared to community controls. Functional impairment is a marker for bipolar disorder in young adults; however, gross cognitive impairment assessed by a screening test is not, possibly because cognition is impaired in more advanced stages of the disorder. Copyright © 2017 Elsevier Ireland Ltd. All rights reserved.
Post-duty psychomotor performance in young and senior anaesthetists.

PubMed

Lederer, W; Kopp, M; Hahn, O; Kurzthaler, I; Traweger, C; Kinzl, J; Benzer, A

2006-03-01

The level of performance in junior and senior anaesthetists was investigated after 24-h shift working and on-call duties. Pre- and post-duty psychomotor function, influence on response time, cognitive function and well-being in 23 individuals (13 junior and 12 senior anaesthetists) were assessed before and after 24-h in-house on-call duty. Subjective perception of tiredness and concentration abilities was estimated by applying a visual analogue scale. The self-assessed tiredness prior to duty was high in both age groups and significantly increased in senior anaesthetists after night duty (P = 0.01). Post-duty impairment of concentration abilities was reported in both groups. Comparing results from pre- and post-duty psychometric testing showed a comparable decline in junior and senior anaesthetists as well. Assessment of burnout showed a significant lack of personal accomplishment in junior anaesthetists as compared to their older colleagues (P = 0.038). Senior anaesthetists judged their contribution to patient well-being significantly higher than did their younger colleagues (P = 0.035). Although tiredness and subjective impairment of concentration abilities were high in senior anaesthetists after 24-h in-house on-call duty, performance assessed by psychometric testing does not support the hypothesis that senior colleagues' performance cannot keep up with routine hospital shift work.
Comparing the Use of Global Rating Scale with Checklists for the Assessment of Central Venous Catheterization Skills Using Simulation

ERIC Educational Resources Information Center

Ma, Irene W. Y.; Zalunardo, Nadia; Pachev, George; Beran, Tanya; Brown, Melanie; Hatala, Rose; McLaughlin, Kevin

2012-01-01

The use of checklists is recommended for the assessment of competency in central venous catheterization (CVC) insertion. To explore the use of a global rating scale in the assessment of CVC skills, this study seeks to compare its use with two checklists, within the context of a formative examination using simulation. Video-recorded performances of…

Transient Elastography is Superior to FIB-4 in Assessing the Risk of Hepatocellular Carcinoma in Patients With Chronic Hepatitis B

PubMed Central

Kim, Seung Up; Kim, Beom Kyung; Park, Jun Yong; Kim, Do Young; Ahn, Sang Hoon; Song, Kijun; Han, Kwang-Hyub

2016-01-01

Liver stiffness (LS), assessed using transient elastography (TE), and FIB-4 can both estimate the risk of developing hepatocellular carcinoma (HCC). We compared prognostic performances of LS and FIB-4 to predict HCC development in patients with chronic hepatitis B (CHB). Data from 1308 patients with CHB, who underwent TE, were retrospectively analyzed. FIB-4 was calculated for all patients. The cumulative rate of HCC development was assessed using Kaplan–Meier curves. The predictive performances of LS and FIB-4 were evaluated using time-dependent receiver-operating characteristic (ROC) curves. The mean age (883 men) was 50 years. During follow-up (median 6.1 years), 119 patients developed HCC. The areas under the ROC curves (AUROCs) predicting HCC risk at 3, 5, and 7 years were consistently greater for LS than for FIB-4 (0.791–0.807 vs 0.691–0.725; all P < 0.05). Similarly, when the respective AUROCs for LS and FIB-4 at every time point during the 7-year follow-up were plotted, LS also showed consistently better performance than FIB-4 after 1 year of enrollment. The combined use of LS and FIB-4 significantly enhanced the prognostic performance compared with the use of FIB-4 alone (P < 0.05), but the performance of the combined scores was statistically similar to that of LS alone (P > 0.05). LS showed significantly better performance than FIB-4 in assessing the risk of HCC development, and the combined use of LS and FIB-4 did not provide additional benefit compared with the use of LS alone. Hence, LS assessed using TE might be helpful for optimizing HCC surveillance strategies. PMID:27196449
Transient Elastography is Superior to FIB-4 in Assessing the Risk of Hepatocellular Carcinoma in Patients With Chronic Hepatitis B.

PubMed

Kim, Seung Up; Kim, Beom Kyung; Park, Jun Yong; Kim, Do Young; Ahn, Sang Hoon; Song, Kijun; Han, Kwang-Hyub

2016-05-01

Liver stiffness (LS), assessed using transient elastography (TE), and FIB-4 can both estimate the risk of developing hepatocellular carcinoma (HCC). We compared prognostic performances of LS and FIB-4 to predict HCC development in patients with chronic hepatitis B (CHB). Data from 1308 patients with CHB, who underwent TE, were retrospectively analyzed. FIB-4 was calculated for all patients. The cumulative rate of HCC development was assessed using Kaplan-Meier curves. The predictive performances of LS and FIB-4 were evaluated using time-dependent receiver-operating characteristic (ROC) curves. The mean age (883 men) was 50 years. During follow-up (median 6.1 years), 119 patients developed HCC. The areas under the ROC curves (AUROCs) predicting HCC risk at 3, 5, and 7 years were consistently greater for LS than for FIB-4 (0.791-0.807 vs 0.691-0.725; all P < 0.05). Similarly, when the respective AUROCs for LS and FIB-4 at every time point during the 7-year follow-up were plotted, LS also showed consistently better performance than FIB-4 after 1 year of enrollment. The combined use of LS and FIB-4 significantly enhanced the prognostic performance compared with the use of FIB-4 alone (P < 0.05), but the performance of the combined scores was statistically similar to that of LS alone (P > 0.05). LS showed significantly better performance than FIB-4 in assessing the risk of HCC development, and the combined use of LS and FIB-4 did not provide additional benefit compared with the use of LS alone. Hence, LS assessed using TE might be helpful for optimizing HCC surveillance strategies.
The relationship between faculty performance assessment and results on the in-training examination for residents in an emergency medicine training program.

PubMed

Ryan, James G; Barlas, David; Pollack, Simcha

2013-12-01

Medical knowledge (MK) in residents is commonly assessed by the in-training examination (ITE) and faculty evaluations of resident performance. We assessed the reliability of clinical evaluations of residents by faculty and the relationship between faculty assessments of resident performance and ITE scores. We conducted a cross-sectional, observational study at an academic emergency department with a postgraduate year (PGY)-1 to PGY-3 emergency medicine residency program, comparing summative, quarterly, faculty evaluation data for MK and overall clinical competency (OC) with annual ITE scores, accounting for PGY level. We also assessed the reliability of faculty evaluations using a random effects, intraclass correlation analysis. We analyzed data for 59 emergency medicine residents during a 6-year period. Faculty evaluations of MK and OC were highly reliable (κ = 0.99) and remained reliable after stratification by year of training (mean κ = 0.68-0.84). Assessments of resident performance (MK and OC) and the ITE increased with PGY level. The MK and OC results had high correlations with PGY level, and ITE scores correlated moderately with PGY. The OC and MK results had a moderate correlation with ITE score. When residents were grouped by PGY level, there was no significant correlation between MK as assessed by the faculty and the ITE score. Resident clinical performance and ITE scores both increase with resident PGY level, but ITE scores do not predict resident clinical performance compared with peers at their PGY level.
The Relationship Between Faculty Performance Assessment and Results on the In-Training Examination for Residents in an Emergency Medicine Training Program

PubMed Central

Ryan, James G.; Barlas, David; Pollack, Simcha

2013-01-01

Background Medical knowledge (MK) in residents is commonly assessed by the in-training examination (ITE) and faculty evaluations of resident performance. Objective We assessed the reliability of clinical evaluations of residents by faculty and the relationship between faculty assessments of resident performance and ITE scores. Methods We conducted a cross-sectional, observational study at an academic emergency department with a postgraduate year (PGY)-1 to PGY-3 emergency medicine residency program, comparing summative, quarterly, faculty evaluation data for MK and overall clinical competency (OC) with annual ITE scores, accounting for PGY level. We also assessed the reliability of faculty evaluations using a random effects, intraclass correlation analysis. Results We analyzed data for 59 emergency medicine residents during a 6-year period. Faculty evaluations of MK and OC were highly reliable (κ = 0.99) and remained reliable after stratification by year of training (mean κ = 0.68–0.84). Assessments of resident performance (MK and OC) and the ITE increased with PGY level. The MK and OC results had high correlations with PGY level, and ITE scores correlated moderately with PGY. The OC and MK results had a moderate correlation with ITE score. When residents were grouped by PGY level, there was no significant correlation between MK as assessed by the faculty and the ITE score. Conclusions Resident clinical performance and ITE scores both increase with resident PGY level, but ITE scores do not predict resident clinical performance compared with peers at their PGY level. PMID:24455005
Source term model evaluations for the low-level waste facility performance assessment

DOE Office of Scientific and Technical Information (OSTI.GOV)

Yim, M.S.; Su, S.I.

1995-12-31

The estimation of release of radionuclides from various waste forms to the bottom boundary of the waste disposal facility (source term) is one of the most important aspects of LLW facility performance assessment. In this work, several currently used source term models are comparatively evaluated for the release of carbon-14 based on a test case problem. The models compared include PRESTO-EPA-CPG, IMPACTS, DUST and NEFTRAN-II. Major differences in assumptions and approaches between the models are described and key parameters are identified through sensitivity analysis. The source term results from different models are compared and other concerns or suggestions are discussed.
Validating the Assessment for Measuring Indonesian Secondary School Students Performance in Ecology

NASA Astrophysics Data System (ADS)

Rachmatullah, A.; Roshayanti, F.; Ha, M.

2017-09-01

The aims of the current study are to validate the American Association for the Advancement of Science (AAAS) Ecology assessment and to examine the performance of Indonesian secondary school students on the assessment. A total of 611 Indonesian secondary school students (218 middle school students and 393 high school students) participated in the study. Forty-five items from the AAAS assessment on the topic of Interdependence in Ecosystems were divided into two versions, each containing 21 similar items. A linking-item method was used to combine the two versions of the assessment, and Rasch analyses were then used to validate the instrument. An independent-samples t-test was also run to compare the performance of Indonesian students and American students based on the mean item difficulty. We found that, of the 45 items, three were identified as misfitting. We also found that both Indonesian middle and high school students performed significantly lower than American students, with very large and medium effect sizes. We will discuss our findings with regard to validation issues and the connection to Indonesian students’ science literacy.
Crowdsourcing Assessment of Surgeon Dissection of Renal Artery and Vein During Robotic Partial Nephrectomy: A Novel Approach for Quantitative Assessment of Surgical Performance.

PubMed

Powers, Mary K; Boonjindasup, Aaron; Pinsky, Michael; Dorsey, Philip; Maddox, Michael; Su, Li-Ming; Gettman, Matthew; Sundaram, Chandru P; Castle, Erik P; Lee, Jason Y; Lee, Benjamin R

2016-04-01

We sought to describe a methodology of crowdsourcing for obtaining quantitative performance ratings of surgeons performing renal artery and vein dissection of robotic partial nephrectomy (RPN). We sought to compare assessment of technical performance obtained from the crowdsourcers with that of surgical content experts (CE). Our hypothesis is that the crowd can score performances of renal hilar dissection comparably to surgical CE using the Global Evaluative Assessment of Robotic Skills (GEARS). A group of resident and attending robotic surgeons submitted a total of 14 video clips of RPN during hilar dissection. These videos were rated by both crowd and CE for technical skills performance using GEARS. A minimum of 3 CE and 30 Amazon Mechanical Turk crowdworkers evaluated each video with the GEARS scale. Within 13 days, we received ratings of all videos from all CE, and within 11.5 hours, we received 548 GEARS ratings from crowdworkers. Even though CE were exposed to a training module, internal consistency across videos of CE GEARS ratings remained low (ICC = 0.38). Despite this, we found that crowdworker GEARS ratings of videos were highly correlated with CE ratings at both the video level (R = 0.82, p < 0.001) and surgeon level (R = 0.84, p < 0.001). Similarly, crowdworker ratings of the renal artery dissection were highly correlated with expert assessments (R = 0.83, p < 0.001) for the unique surgery-specific assessment question. We conclude that crowdsourced assessment of qualitative performance ratings may be an alternative and/or adjunct to surgical experts' ratings and would provide a rapid scalable solution to triage technical skills.
Performance of U.S. 15-Year-Old Students in Mathematics, Science, and Reading Literacy in an International Context. First Look at PISA 2012. NCES 2014-024

ERIC Educational Resources Information Center

Kelly, Dana; Nord, Christine Winquist; Jenkins, Frank; Chan, Jessica Ying; Kastberg, David

2013-01-01

The Program for International Student Assessment (PISA) is a system of international assessments that allows countries to compare outcomes of learning as students near the end of compulsory schooling. PISA core assessments measure the performance of 15-year-old students in mathematics, science, and reading literacy every 3 years. Coordinated by…
Follow-up after early medical abortion: Comparing clinical assessment with self-assessment in a rural hospital in northern Norway.

PubMed

Mählck, Carl-Gustav; Bäckström, Torbjörn

2017-06-01

A follow-up study was performed on women who had requested medical abortions in a rural hospital in northern Norway to compare clinical assessment with self-assessment of early medical abortion in terms of safety. During the three-year study period, 392 women requested termination of pregnancy. After excluding those who changed their mind, those who had a spontaneous miscarriage, those who were referred to a central hospital for a two-stage abortion, and those who had the abortion performed surgically, 242 cases remained, and all the medical files were reviewed. Five cases (2%) were lost to follow-up, so the study group consists of 237 cases. Out of the 237 cases, in which a medical abortion was performed, 106 were performed at home with a self-assessment (44.7%), and 131 (55.3%) were performed at the department of Gynecology. The percentage of cases with self-assessment did not noticeably change during the three-year study period. The registered complications were infection, incomplete abortion requiring a surgical procedure and hospitalization due to severe pain. No significant difference in registered complications was found between medical abortions with self-assessment (n=9, 8.5% out of 106 cases) and medical abortions at the gynecological out-patient department (n=6, 4.6% out of 131 cases). According to this investigation, it is equally safe to perform a medical abortion at home with a self-assessment as it is to have a medical abortion at an outpatient clinic. These results could be useful for health care provision in rural areas where access to hospitals is impeded by logistical difficulties. Copyright © 2017 Elsevier B.V. All rights reserved.
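The percentages reported in this abstract can be re-derived from its counts, and the "no significant difference" statement can be illustrated with a test on the 2x2 table. The sketch below is only an illustration: the abstract does not say which test the authors used, so the two-sided Fisher exact test here is an assumption, and the function names `pct` and `fisher_exact_two_sided` are hypothetical.

```python
from math import comb


def pct(part, whole):
    """Percentage of `part` in `whole`, rounded to one decimal place."""
    return round(100 * part / whole, 1)


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher exact p-value for the 2x2 table [[a, b], [c, d]].

    Rows are the two groups; columns are complication yes / no.
    Sums the hypergeometric probabilities of all tables that are
    no more likely than the observed one.
    """
    row1 = a + b
    col1 = a + c
    total = a + b + c + d
    denom = comb(total, row1)

    def prob(k):
        # Probability that k of the col1 complications fall in row 1.
        return comb(col1, k) * comb(total - col1, row1 - k) / denom

    lo = max(0, row1 - (total - col1))
    hi = min(col1, row1)
    p_obs = prob(a)
    p = sum(prob(k) for k in range(lo, hi + 1) if prob(k) <= p_obs * (1 + 1e-9))
    return min(1.0, p)


# Counts taken from the abstract: 9 of 106 (home, self-assessment)
# and 6 of 131 (outpatient department) had registered complications.
print(pct(106, 237), pct(131, 237))   # share of cases per setting
print(pct(9, 106), pct(6, 131))       # complication rate per setting
print(fisher_exact_two_sided(9, 106 - 9, 6, 131 - 6))
```

With only 15 complications in 237 cases, a comparison like this has limited power, which is worth keeping in mind when reading "equally safe".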
Use of cognitive task analysis to guide the development of performance-based assessments for intraoperative decision making.

PubMed

Pugh, Carla M; DaRosa, Debra A

2013-10-01

There is a paucity of performance-based assessments that focus on intraoperative decision making. The purpose of this article is to review the performance outcomes and usefulness of two performance-based assessments that were developed using cognitive task analysis (CTA) frameworks. Assessment-A used CTA to create a "think aloud" oral examination that was administered while junior residents (PGY 1-2's, N = 69) performed a porcine-based laparoscopic cholecystectomy. Assessment-B used CTA to create a simulation-based, formative assessment of senior residents' (PGY 4-5's, N = 29) decision making during a laparoscopic ventral hernia repair. In addition to survey-based assessments of usefulness, a multiconstruct evaluation was performed using eight variables. When comparing performance outcomes, both approaches revealed major deficiencies in residents' intraoperative decision-making skills. Multiconstruct evaluation of the two CTA approaches revealed assessment method advantages for five of the eight evaluation areas: (1) Cognitive Complexity, (2) Content Quality, (3) Content Coverage, (4) Meaningfulness, and (5) Transfer and Generalizability. The two CTA performance assessments were useful in identifying significant training needs. While there are pros and cons to each approach, the results serve as a useful blueprint for program directors seeking to develop performance-based assessments for intraoperative decision making. Reprint & Copyright © 2013 Association of Military Surgeons of the U.S.
A Descriptive-Comparative Study of Teacher Performance Evaluation on Student Achievement in a Public School District

ERIC Educational Resources Information Center

Christensen, William Howard

2013-01-01

In 2010, the federal government increased accountability expectations by placing more emphasis on monitoring teacher performance. Using a model that focuses on the New York State teacher evaluation system, that is comprised of a rubric for observation, local student assessment scores, and student state assessment scores, this…
Improving Performance: Leading from the Bottom. PISA in Focus. No. 2

ERIC Educational Resources Information Center

OECD Publishing (NJ1), 2011

2011-01-01

Since the PISA (Programme for International Student Assessment) 2000 and 2009 surveys both focused on reading, one can track in detail how student reading performance has changed over that period. Among the 26 OECD (Organisation for Economic Cooperation and Development) countries with comparable results in both assessments, Chile, Germany,…
PLAB and UK graduates' performance on MRCP(UK) and MRCGP examinations: data linkage study.

PubMed

McManus, I C; Wakeford, Richard

2014-04-17

To assess whether international medical graduates passing the two examinations set by the Professional and Linguistic Assessments Board (PLAB1 and PLAB2) of the General Medical Council (GMC) are equivalent to UK graduates at the end of the first foundation year of medical training (F1), as the GMC requires, and if not, to assess what changes in the PLAB pass marks might produce equivalence. Data linkage of GMC PLAB performance data with data from the Royal Colleges of Physicians and the Royal College of General Practitioners on performance of PLAB graduates and UK graduates at the MRCP(UK) and MRCGP examinations. Doctors in training for internal medicine or general practice in the United Kingdom. 7829, 5135, and 4387 PLAB graduates on their first attempt at MRCP(UK) Part 1, Part 2, and PACES assessments from 2001 to 2012 compared with 18,532, 14,094, and 14,376 UK graduates taking the same assessments; 3160 PLAB1 graduates making their first attempt at the MRCGP AKT during 2007-12 compared with 14,235 UK graduates; and 1411 PLAB2 graduates making their first attempt at the MRCGP CSA during 2010-12 compared with 6935 UK graduates. Performance at MRCP(UK) Part 1, Part 2, and PACES assessments, and MRCGP AKT and CSA assessments in relation to performance on PLAB1 and PLAB2 assessments, as well as to International English Language Testing System (IELTS) scores. MRCP(UK), MRCGP, and PLAB results were analysed as marks relative to the pass mark at the first attempt. PLAB1 marks were a valid predictor of MRCP(UK) Part 1, MRCP(UK) Part 2, and MRCGP AKT (r=0.521, 0.390, and 0.490; all P<0.001). PLAB2 marks correlated with MRCP(UK) PACES and MRCGP CSA (r=0.274, 0.321; both P<0.001). PLAB graduates had significantly lower MRCP(UK) and MRCGP assessments (Glass's Δ=0.94, 0.91, 1.40, 1.01, and 1.82 for MRCP(UK) Part 1, Part 2, and PACES and MRCGP AKT and CSA), and were more likely to fail assessments and to progress more slowly than UK medical graduates. 
IELTS scores correlated significantly with later performance, multiple regression showing that the effect of PLAB1 (β=0.496) was much stronger than the effect of IELTS (β=0.086). Changes to PLAB pass marks that would result in international medical graduate and UK medical graduate equivalence were assessed in two ways. Method 1 adjusted PLAB pass marks to equate median performance of PLAB and UK graduates. Method 2 divided PLAB graduates into 12 equally spaced groups according to PLAB performance, and compared these with mean performance of graduates from individual UK medical schools, assessing which PLAB groups were equivalent in MRCP(UK) and MRCGP performance to UK graduates. The two methods produced similar results. To produce equivalent performance on the MRCP(UK) and MRCGP examinations, the pass mark for PLAB1 would require raising by about 27 marks (13%) and for PLAB2 by about 15-16 marks (20%) above the present standard. PLAB is a valid assessment of medical knowledge and clinical skills, correlating well with performance at MRCP(UK) and MRCGP. PLAB graduates' knowledge and skills at MRCP(UK) and MRCGP are over one standard deviation below those of UK graduates, although differences in training quality cannot be taken into account. Equivalent performance in MRCP(UK) and MRCGP would occur if the pass marks of PLAB1 and PLAB2 were raised considerably, but that would also reduce the pass rate, with implications for medical workforce planning. Increasing IELTS requirements would have less impact on equivalence than raising PLAB pass marks.
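The effect sizes in this abstract are reported as Glass's Δ, which divides the difference between two group means by the standard deviation of one reference group only. The sketch below shows the calculation on made-up marks. Treating UK graduates as the reference group and computing the difference as reference minus comparison are assumptions chosen to match the positive values in the abstract; the function name `glass_delta` and the data are hypothetical.

```python
from statistics import mean, stdev


def glass_delta(reference, comparison):
    """Glass's delta: mean difference scaled by the reference group's SD.

    Positive values mean the comparison group scores below the
    reference group (an assumed sign convention).
    """
    return (mean(reference) - mean(comparison)) / stdev(reference)


# Hypothetical marks relative to the pass mark, NOT data from the study.
reference_marks = [1, 2, 3, 4, 5]
comparison_marks = [0, 1, 2, 3, 4]

print(round(glass_delta(reference_marks, comparison_marks), 3))
```

Read this way, a Δ of about 1 corresponds to the abstract's statement that PLAB graduates score over one standard deviation below UK graduates.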
PLAB and UK graduates’ performance on MRCP(UK) and MRCGP examinations: data linkage study

PubMed Central

Wakeford, Richard

2014-01-01

Objectives To assess whether international medical graduates passing the two examinations set by the Professional and Linguistic Assessments Board (PLAB1 and PLAB2) of the General Medical Council (GMC) are equivalent to UK graduates at the end of the first foundation year of medical training (F1), as the GMC requires, and if not, to assess what changes in the PLAB pass marks might produce equivalence. Design Data linkage of GMC PLAB performance data with data from the Royal Colleges of Physicians and the Royal College of General Practitioners on performance of PLAB graduates and UK graduates at the MRCP(UK) and MRCGP examinations. Setting Doctors in training for internal medicine or general practice in the United Kingdom. Participants 7829, 5135, and 4387 PLAB graduates on their first attempt at MRCP(UK) Part 1, Part 2, and PACES assessments from 2001 to 2012 compared with 18 532, 14 094, and 14 376 UK graduates taking the same assessments; 3160 PLAB1 graduates making their first attempt at the MRCGP AKT during 2007-12 compared with 14 235 UK graduates; and 1411 PLAB2 graduates making their first attempt at the MRCGP CSA during 2010-12 compared with 6935 UK graduates. Main outcome measures Performance at MRCP(UK) Part 1, Part 2, and PACES assessments, and MRCGP AKT and CSA assessments in relation to performance on PLAB1 and PLAB2 assessments, as well as to International English Language Testing System (IELTS) scores. MRCP(UK), MRCGP, and PLAB results were analysed as marks relative to the pass mark at the first attempt. Results PLAB1 marks were a valid predictor of MRCP(UK) Part 1, MRCP(UK) Part 2, and MRCGP AKT (r=0.521, 0.390, and 0.490; all P<0.001). PLAB2 marks correlated with MRCP(UK) PACES and MRCGP CSA (r=0.274, 0.321; both P<0.001). 
PLAB graduates had significantly lower MRCP(UK) and MRCGP assessments (Glass’s Δ=0.94, 0.91, 1.40, 1.01, and 1.82 for MRCP(UK) Part 1, Part 2, and PACES and MRCGP AKT and CSA), and were more likely to fail assessments and to progress more slowly than UK medical graduates. IELTS scores correlated significantly with later performance, multiple regression showing that the effect of PLAB1 (β=0.496) was much stronger than the effect of IELTS (β=0.086). Changes to PLAB pass marks that would result in international medical graduate and UK medical graduate equivalence were assessed in two ways. Method 1 adjusted PLAB pass marks to equate median performance of PLAB and UK graduates. Method 2 divided PLAB graduates into 12 equally spaced groups according to PLAB performance, and compared these with mean performance of graduates from individual UK medical schools, assessing which PLAB groups were equivalent in MRCP(UK) and MRCGP performance to UK graduates. The two methods produced similar results. To produce equivalent performance on the MRCP(UK) and MRCGP examinations, the pass mark for PLAB1 would require raising by about 27 marks (13%) and for PLAB2 by about 15-16 marks (20%) above the present standard. Conclusions PLAB is a valid assessment of medical knowledge and clinical skills, correlating well with performance at MRCP(UK) and MRCGP. PLAB graduates’ knowledge and skills at MRCP(UK) and MRCGP are over one standard deviation below those of UK graduates, although differences in training quality cannot be taken into account. Equivalent performance in MRCP(UK) and MRCGP would occur if the pass marks of PLAB1 and PLAB2 were raised considerably, but that would also reduce the pass rate, with implications for medical workforce planning. Increasing IELTS requirements would have less impact on equivalence than raising PLAB pass marks. PMID:24742473
Differential Cognitive and Perceptual Correlates of Print Reading versus Braille Reading

ERIC Educational Resources Information Center

Veispak, Anneli; Boets, Bart; Ghesquiere, Pol

2013-01-01

The relations between reading, auditory, speech, phonological and tactile spatial processing are investigated in a Dutch speaking sample of blind braille readers as compared to sighted print readers. Performance is assessed in blind and sighted children and adults. Regarding phonological ability, braille readers perform equally well compared to…
Interactive Technologies for Teacher Training: Comparing Performance and Assessment in Second Life and simSchool

ERIC Educational Resources Information Center

Meritt, Julia; Gibson, David; Christensen, Rhonda; Knezek, Gerald

2013-01-01

Two alternative technologies forming the basis of computer-mediated teacher preparation systems are compared and contrasted regarding implementation, operation, and assessment considerations. The role-playing system in Second Life is shown to have the unique characteristic of developing a co-constructed pedagogical identity, while the flight…
The Assessment of Cognitive Development in Blind Infants and Preschoolers.

ERIC Educational Resources Information Center

Brambring, M.; Troster, H.

1994-01-01

This study evaluated the Bielefeld Developmental Test for Blind Infants and Preschoolers by comparing cognitive performance of blind and sighted children (ages three and four). Results indicated that even this test (with "blind-neutral" items) did not permit a fair comparative assessment, though it did prove suitable for within-group…
Comparability in Balanced Assessment Systems for State Accountability

ERIC Educational Resources Information Center

Evans, Carla M.; Lyons, Susan

2017-01-01

The purpose of this study was to test methods that strengthen the comparability claims about annual determinations of student proficiency in English language arts, math, and science (Grades 3-12) in the New Hampshire Performance Assessment of Competency Education (NH PACE) pilot project. First, we examined the literature in order to define…
What We've Learned about Assessing Hands-On Science.

ERIC Educational Resources Information Center

Shavelson, Richard J.; Baxter, Gail P.

1992-01-01

A recent study compared hands-on scientific inquiry assessment to assessments involving lab notebooks, computer simulations, short-answer paper-and-pencil problems, and multiple-choice questions. Creating high quality performance assessments is a costly, time-consuming process requiring considerable scientific and technological know-how. Improved…
Comparative cath-lab assessment of coronary stenosis by radiology technician, junior and senior interventional cardiologist in patients treated with coronary angioplasty

PubMed Central

Delli Carri, Felice; Ruggiero, Maria Assunta; Cuculo, Andrea; Ruggiero, Antonio; Ziccardi, Luigi; De Gennaro, Luisa; Di Biase, Matteo

2014-01-01

Background Exact quantification of plaque extension during coronary angioplasty (PCI) usually falls to the interventional cardiologist (IC). Quantitative coronary stenosis assessment (QCA) could possibly be delegated to the radiology technician (RT), who usually supports the cath-lab nurse and IC during PCI. We therefore sought to investigate the reliability of QCA performed by RT in comparison with IC. Methods Forty-four consecutive patients with acute coronary syndrome underwent PCI; target coronary vessel size beneath target coronary lesion (S) and target coronary lesion length (L) were assessed by the RT, junior IC (JIC), and senior IC (SIC) and then compared. SIC evaluation, which determined the final stent selection for coronary stenting, was considered as a reference benchmark. Results RT performance with QCA support in assessing target vessel size and target lesion length was not significantly different from SIC (r = 0.46, p < 0.01; r = 0.64, p < 0.001, respectively) as well as JIC (r = 0.79, r = 0.75, p < 0.001, respectively). JIC performance was significantly better than RT in assessing target vessel size (p < 0.05), while the difference was not significant when assessing target lesion length. Conclusions RT may reliably assess target lesion by using adequate QCA software in the cath-lab in case of PCI; RT performance does not differ from SIC. PMID:24672672

Psychometric Properties of Performance-based Measurements of Functional Capacity: Test-Retest Reliability, Practice Effects, and Potential Sensitivity to Change

PubMed Central

Leifker, Feea R.; Patterson, Thomas L.; Bowie, Christopher R.; Mausbach, Brent T.; Harvey, Philip D.

2010-01-01

Performance-based measures of the ability to perform social and everyday living skills are being more widely used to assess functional capacity in people with serious mental illnesses such as schizophrenia and bipolar disorder. Since they are also being used as outcome measures in pharmacological and cognitive remediation studies aimed at cognitive impairments in schizophrenia, understanding their measurement properties and potential sensitivity to change is important. In this study, the test-retest reliability, practice effects, and reliable change indices of two different performance-based functional capacity measures, the UCSD Performance-based skills assessment (UPSA) and Social skills performance assessment (SSPA) were examined over several different retest intervals in two different samples of people with schizophrenia (n’s=238 and 116) and a healthy comparison sample (n=109). These psychometric properties were compared to those of a neuropsychological assessment battery. Test-retest reliabilities of the long form of the UPSA ranged from r=.63 to r=.80 over follow-up periods up to 36 months in people with schizophrenia, while brief UPSA reliabilities ranged from r=.66 to r=.81. Test-retest reliability of the NP performance scores ranged from r=.77 to r=.79. Test-retest reliabilities of the UPSA were lower in healthy controls, while NP performance was slightly more reliable. SSPA test-retest reliability was lower. Practice effect sizes ranged from .05 to .16 for the UPSA and .07 to .19 for the NP assessment in patients, with HC having more practice effects. Reliable change intervals were consistent across NP and both FC measures, indicating equal potential for detection of change. These performance-based measures of functional capacity appear to have similar potential to be sensitive to change compared to NP performance in people with schizophrenia. PMID:20399613
Linking Workplace Health Promotion Best Practices and Organizational Financial Performance: Tracking Market Performance of Companies With Highest Scores on the HERO Scorecard.

PubMed

Grossmeier, Jessica; Fabius, Ray; Flynn, Jennifer P; Noeldner, Steven P; Fabius, Dan; Goetzel, Ron Z; Anderson, David R

2016-01-01

The aim of the study was to evaluate the stock performance of publicly traded companies that received high scores on the HERO Employee Health Management Best Practices Scorecard in Collaboration with Mercer© based on their implementation of evidence-based workplace health promotion practices. A portfolio of companies that received high scores in a corporate health and wellness self-assessment was simulated based on past market performance and compared with past performance of companies represented on the Standard and Poor's (S&P) 500 Index. Stock values for a portfolio of companies that received high scores in a corporate health and wellness self-assessment appreciated by 235% compared with the S&P 500 Index appreciation of 159% over a 6-year simulation period. Robust investment in workforce health and well-being appears to be one of multiple practices pursued by high-performing, well-managed companies.
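To compare the two cumulative figures in this abstract on a per-year basis, they can be converted to annualized growth rates. The sketch below assumes that "appreciated by 235%" means the portfolio value grew to 3.35 times its starting value over the 6-year period, and that growth compounds annually; neither assumption is stated in the abstract, and the function name `annualized_rate` is hypothetical.

```python
def annualized_rate(total_appreciation, years):
    """Constant yearly growth rate that compounds to the given total.

    `total_appreciation` is a fraction, e.g. 2.35 for a 235% gain.
    """
    return (1 + total_appreciation) ** (1 / years) - 1


# Figures from the abstract: 235% (scorecard portfolio) and
# 159% (S&P 500 Index) over a 6-year simulation period.
portfolio = annualized_rate(2.35, 6)
index = annualized_rate(1.59, 6)

print(f"portfolio: {portfolio:.1%} per year")
print(f"S&P 500:   {index:.1%} per year")
```

Under these assumptions the gap is roughly five percentage points per year, which is easier to interpret than the difference between the two cumulative totals.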
Scenarios for the Hanford immobilized Low-Activity waste (ILAW) performance assessment

DOE Office of Scientific and Technical Information (OSTI.GOV)

MANN, F.M.

The purpose of the next version of the Hanford Immobilized Low-Activity Tank Waste (ILAW) Performance Assessment (ILAW PA) is to provide an updated estimate of the long-term human health and environmental impact of the disposal of ILAW and to compare these estimates against performance objectives displayed in Tables 1, 2, and 3 (Mann 1999a). Such a radiological performance assessment is required by U.S. Department of Energy (DOE) Orders on radioactive waste management (DOE 1988a and DOE 1999a). This document defines the scenarios that will be used for the next update of the PA that is scheduled to be issued in 2001. Since the previous performance assessment (Mann 1998) was issued, considerable additional data on waste form behavior and site-specific soil geotechnical properties have been collected. In addition, the 2001 ILAW PA will benefit from improved computer models and the experience gained from the previous performance assessment. However, the scenarios (that is, the features, events, and processes analyzed in the performance assessment) for the next PA are very similar to the ones in the 1998 PA.
Preliminary assessment of the impact of incorporating a detailed algorithm for the effects of nuclear irradiation on combat crew performance into the Janus combat simulation

DOE Office of Scientific and Technical Information (OSTI.GOV)

Warshawsky, A.S.; Uzelac, M.J.; Pimper, J.E.

The Crew III algorithm for assessing time- and dose-dependent combat crew performance subsequent to nuclear irradiation was incorporated into the Janus combat simulation system. Battle outcomes using this algorithm were compared to outcomes based on the currently used time-independent "cookie-cutter" assessment methodology. The results illustrate quantifiable differences in battle outcome between the two assessment techniques. Results suggest that tactical nuclear weapons are more effective than currently assumed if performance degradation attributed to radiation doses between 150 and 3000 rad is taken into account. 6 refs., 9 figs.
Executive dysfunctions in migraine with and without aura: what is the role of white matter lesions?

PubMed

Le Pira, Francesco; Reggio, Ester; Quattrocchi, Graziella; Sanfilippo, Cristina; Maci, Tiziana; Cavallaro, Tiziana; Zappia, Mario

2014-01-01

Executive dysfunctions and white matter lesions on magnetic resonance imaging have been reported in migraine. The aim of this study was to determine whether any correlation between these 2 variables exists. Forty-four subjects affected by migraine with or without aura were compared with 16 healthy subjects. A battery of neuropsychological tests assessing executive functions was administered to all subjects. Number and total volume of white matter lesions were assessed in the whole brain and in the frontal lobe. The performances of both groups of migraineurs, with and without aura, were significantly worse than those of controls on the Boston Scanning Test. Moreover, compared with controls, patients with migraine with aura performed worse on the Frontal Assessment Battery, and patients with migraine without aura performed worse on the Controlled Oral Word Association Test. Nineteen patients (43.2%) and one control subject (6.2%) had white matter lesions. We did not find any significant correlation between white matter lesion load and neuropsychological performance. On the basis of our results, white matter lesion load on magnetic resonance imaging does not seem to contribute to the neuropsychological performance deficit in migraineurs. © 2013 American Headache Society.
Compared to controls, patients with ruptured aneurysm and surgical intervention show increase in symptoms of depression and lower cognitive performance, but their objective sleep is not affected.

PubMed

Brand, Serge; Zimmerer, Stefan; Kalak, Nadeem; Planta, Sandra Von; Schwenzer-Zimmerer, Katja; Müller, Andreas Albert; Zeilhofer, Hans-Florian; Holsboer-Trachsler, Edith

2015-02-01

Patients with aneurysmal subarachnoid haemorrhage (aSAH) have impaired sleep and cognitive performance together with more difficulties in social and everyday life. Hypocortisolism has also been reported. However, a study assessing all dimensions between aSAH severity, objective and subjective sleep, cortisol secretion, cognitive performance and social and everyday life has not so far been performed. The aim of the present study was therefore two-fold: (1) to assess, in a sample of patients with aSAH, objective and subjective sleep, cognitive functioning, social skills and cortisol secretion concurrently, and (2) to compare patients on these variables with a control group. Twenty-one patients (17 females; mean age: 58.80 years) with ruptured aneurysm and surgical intervention and 21 (14 females; mean age: 58.90 years) age- and gender-matched controls took part in the study. Assessments covered objective sleep-EEG recordings, subjective sleep, salivary cortisol analysis, and psychological functioning including memory performance, mood, and emotion recognition. Compared to healthy controls, patients had lower scores for verbal memory performance and emotion recognition; they also reported more marked depressive symptoms and complained of poor sleep. However, no differences were found for objective sleep or cortisol secretion. Subjective and objective sleep, cortisol secretion and psychological functioning were unrelated. Findings indicate that patients with aSAH face psychological rather than physiological issues.
Statistical Issues in the Comparison of Quantitative Imaging Biomarker Algorithms using Pulmonary Nodule Volume as an Example

PubMed Central

2014-01-01

Quantitative imaging biomarkers (QIBs) are being used increasingly in medicine to diagnose and monitor patients’ disease. The computer algorithms that measure QIBs have different technical performance characteristics. In this paper we illustrate the appropriate statistical methods for assessing and comparing the bias, precision, and agreement of computer algorithms. We use data from three studies of pulmonary nodules. The first study is a small phantom study used to illustrate metrics for assessing repeatability. The second study is a large phantom study allowing assessment of four algorithms’ bias and reproducibility for measuring tumor volume and the change in tumor volume. The third study is a small clinical study of patients whose tumors were measured on two occasions. This study allows a direct assessment of six algorithms’ performance for measuring tumor change. With these three examples we compare and contrast study designs and performance metrics, and we illustrate the advantages and limitations of various common statistical methods for QIB studies. PMID:24919828
Factors Associated with the Performance and Cost-Effectiveness of Using Lymphatic Filariasis Transmission Assessment Surveys for Monitoring Soil-Transmitted Helminths: A Case Study in Kenya

PubMed Central

Smith, Jennifer L.; Sturrock, Hugh J. W.; Assefa, Liya; Nikolay, Birgit; Njenga, Sammy M.; Kihara, Jimmy; Mwandawiro, Charles S.; Brooker, Simon J.

2015-01-01

Transmission assessment surveys (TAS) for lymphatic filariasis have been proposed as a platform to assess the impact of mass drug administration (MDA) on soil-transmitted helminths (STHs). This study used computer simulation and field data from pre- and post-MDA settings across Kenya to evaluate the performance and cost-effectiveness of the TAS design for STH assessment compared with alternative survey designs. Variations in the TAS design and different sample sizes and diagnostic methods were also evaluated. The district-level TAS design correctly classified more districts compared with standard STH designs in pre-MDA settings. Aggregating districts into larger evaluation units in a TAS design decreased performance, whereas age group sampled and sample size had minimal impact. The low diagnostic sensitivity of Kato-Katz and mini-FLOTAC methods was found to increase misclassification. We recommend using a district-level TAS among children 8–10 years of age to assess STH but suggest that key consideration is given to evaluation unit size. PMID:25487730
Self-Paced Reaching after Stroke: A Quantitative Assessment of Longitudinal and Directional Sensitivity Using the H-Man Planar Robot for Upper Limb Neurorehabilitation.

PubMed

Hussain, Asif; Budhota, Aamani; Hughes, Charmayne Mary Lee; Dailey, Wayne D; Vishwanath, Deshmukh A; Kuah, Christopher W K; Yam, Lester H L; Loh, Yong J; Xiang, Liming; Chua, Karen S G; Burdet, Etienne; Campolo, Domenico

2016-01-01

Technology-aided measures offer a sensitive, accurate and time-efficient approach for the assessment of sensorimotor function after neurological insult compared to standard clinical assessments. This study investigated the sensitivity of robotic measures to capture differences in planar reaching movements as a function of neurological status (stroke, healthy), direction (front, ipsilateral, contralateral), movement segment (outbound, inbound), and time (baseline, post-training, 2-week follow-up) using a planar, two-degrees-of-freedom robotic manipulator (H-Man). Twelve chronic stroke participants (age: 55 ± 10.0 years, 5 female, 7 male, time since stroke: 11.2 ± 6.0 months) and nine age-matched healthy participants (age: 53 ± 4.3 years, 5 female, 4 male) took part in this study. Both healthy and stroke participants performed planar reaching movements in contralateral, ipsilateral and front directions with the H-Man, and the robotic measures spectral arc length (SAL), normalized time to peak velocity (T_peakN), and root-mean-square error (RMSE) were evaluated. Healthy participants went through a one-off assessment session to establish the baseline. Stroke participants completed a 2-week intensive robotic training plus standard arm therapy (8 × 90 min sessions). Motor function for stroke participants was evaluated prior to training (baseline, week 0), immediately following training (post-training, week 2), and 2 weeks after training (follow-up, week 4) using robotic assessment and the clinical measures Fugl-Meyer Assessment (FMA), Action Research Arm Test (ARAT), and grip strength. Robotic assessments were able to capture differences due to neurological status, movement direction, and movement segment. Movements performed by stroke participants were less smooth and featured longer T_peakN and larger RMSE values compared to healthy controls. Significant movement direction differences were observed, with improved reaching performance for the front, compared to ipsilateral and contralateral movement directions. There were group differences depending on movement segment. Outbound reaching movements were smoother and featured longer T_peakN values than inbound movements for control participants, whereas SAL, T_peakN, and RMSE values were similar regardless of movement segment for stroke patients. Significant change in performance was observed between initial and post-assessments using H-Man in stroke participants, whereas conventional scales showed no significant difference. Results of the study indicate the potential of H-Man as a sensitive tool for tracking changes in performance compared to ordinal scales (i.e., FMA, ARAT).
Assessment of a 40-kilowatt Stirling engine for underground mining applications

NASA Technical Reports Server (NTRS)

Cairelli, J. E.; Kelm, G. G.; Slaby, J. G.

1982-01-01

An assessment of alternative power sources for underground mining applications was performed. A 40-kW Stirling research engine was tested to evaluate its performance and emission characteristics when operated with helium working gas and diesel fuel. The engine, the test facility, and the test procedures are described. Performance and emission data for the engine operating with helium working gas and diesel fuel are reported and compared with data obtained with hydrogen working gas and unleaded gasoline fuel. Helium-diesel test results are compared with the characteristics of current diesel engines and other Stirling engines. External surface temperature data are also presented. Emission and temperature results are compared with the Federal requirements for diesel underground mine engines. The durability potential of Stirling engines is discussed on the basis of the experience gained during the engine tests.
The Impact of Assessment Tasks on Subsequent Examination Performance

ERIC Educational Resources Information Center

Van Gaal, Frank; De Ridder, Annemieke

2013-01-01

In this article, the impact of assessment tasks on examination result (measured by examination grades) is investigated. Although many describe the advantages of electronic assessment tasks, few studies have been undertaken which compare a traditional approach using a classical examination with a new approach using assessment tasks. The main…
Participation in International Large-Scale Assessments from a US Perspective

ERIC Educational Resources Information Center

Plisko, Valena White

2013-01-01

International large-scale assessments (ILSAs) play a distinct role in the United States' decentralized federal education system. Separate from national and state assessments, they offer an external, objective measure for the United States to assess student performance comparatively with other countries and over time. The US engagement in ILSAs…
Assessment of sodium conductor distribution cable

DOE Office of Scientific and Technical Information (OSTI.GOV)

None

1979-06-01

The study assesses the barriers and incentives for using sodium conductor distribution cable. The assessment considers environmental, safety, energy conservation, electrical performance and economic factors. Along with all of these factors considered in the assessment, the sodium distribution cable system is compared to the present day alternative - an aluminum conductor system. (TFD)
The effect of providing feedback on the characteristics of student responses to a videotaped format for high school physics assessment

NASA Astrophysics Data System (ADS)

Lawrence, Michael John

1997-12-01

The problem of science illiteracy has been well documented. The development of critical thinking skills in science education is often sacrificed in favor of content coverage. Opportunities for critical thinking within a context of science have been recommended to promote science literacy (AAAS, 1993). One means of doing this is to have students make and explain predictions involving physical phenomena, observe feedback, and then revise the prediction. A videotaped assessment using this process served as the focus for this study. High school physics students were asked to predict and explain what would happen in situations involving optics. They were then given different feedback treatments. The purpose of this study was to: (a) examine the effect of providing feedback on the quality of responses in making both revisions and subsequent predictions, and (b) examine the relationship between content knowledge and qualitative performance. Sixty-four high-ability students were separated into three treatment groups: no feedback (NF), visual feedback (F), and teacher-explained feedback (TE). These students responded to six items on the Optics Videotape Assessment and ten optics multiple-choice items from the National Physics Exam (NPE). Their teachers had previously attended a professional development institute which emphasized the practice and philosophy of assessments like the Optics Assessment. The assessment responses were categorized by two raters who used a taxonomy that ranged from simple descriptions to complete explanations. NPE performance was compared using one-way ANOVA, Optics Assessment performance was compared using a chi-square test of homogeneity, and a point-biserial correlation was done to compare qualitative and quantitative performance. The study found that students were unable to use feedback to make a significant change in the quality of their responses, whether revision or subsequent prediction. There was no correlation between content knowledge and qualitative performance. It was concluded that for students to succeed on an assessment of this type, their classroom teachers must be given the time to implement the appropriate instruction. Instruction and assessment of this nature are crucial to the development of science literacy.
No Country Left Behind: Rhetoric and Reality of International Large-Scale Assessment. William H. Angoff Memorial Lecture Series

ERIC Educational Resources Information Center

Feuer, Michael J.

2011-01-01

Few arguments about education are as effective at galvanizing public attention and motivating political action as those that compare the performance of students with their counterparts in other countries and that connect academic achievement to economic performance. Because data from international large-scale assessments (ILSA) have a powerful…
Performance-Based Assessment in an Online Course: Comparing Different Types of Information Literacy Instruction

ERIC Educational Resources Information Center

Mery, Yvonne; Newby, Jill; Peng, Ke

2012-01-01

This study investigates whether the type of instruction (a single face-to-face librarian-led instruction, instructor-led instruction, or an online IL course--the Online Research Lab) has an impact on student information literacy gains in a Freshman English Composition program. A performance-based assessment was carried out by analyzing…
Exploring the Potential for and Promise of Incorporating Distributive and Procedural Justices into Post-Secondary Assessment of Student Learning

ERIC Educational Resources Information Center

Grace, Christine Cooper

2017-01-01

This paper explores the potential of incorporating constructs of distributive justice and procedural justice into summative assessment of student learning in higher education. I systematically compare the process used by managers to evaluate employee performance in organizations--performance appraisal (PA)--with processes used by professors to…
Analyzing Performance by Grade 10 Hispanic High School Students on the Massachusetts State Assessment. Summary. Issues & Answers. REL 2009-No. 071

ERIC Educational Resources Information Center

Sanchez, Maria Teresa; Ehrlich, Stacy; Midouhas, Emily; O'Dwyer, Laura

2009-01-01

Massachusetts policymakers have expressed concern about the consistently lower scores of Hispanic students, compared to other subgroups, on the Massachusetts Comprehensive Assessment System (MCAS). This summary describes a larger report that examines Hispanic high school students' performance on the MCAS tests in English language arts and…
Biological and functional relevance of CASP predictions

PubMed Central

Liu, Tianyun; Ish‐Shalom, Shirbi; Torng, Wen; Lafita, Aleix; Bock, Christian; Mort, Matthew; Cooper, David N; Bliven, Spencer; Capitani, Guido; Mooney, Sean D.

2017-01-01

Our goal is to answer the question: compared with experimental structures, how useful are predicted models for functional annotation? We assessed the functional utility of predicted models by comparing the performances of a suite of methods for functional characterization on the predictions and the experimental structures. We identified 28 sites in 25 protein targets to perform functional assessment. These 28 sites included nine sites with known ligand binding (holo‐sites), nine sites that are expected or suggested by experimental authors for small molecule binding (apo‐sites), and ten sites containing important motifs, loops, or key residues with important disease‐associated mutations. We evaluated the utility of the predictions by comparing their microenvironments to the experimental structures. Overall structural quality correlates with functional utility. However, the best‐ranked predictions (global) may not have the best functional quality (local). Our assessment provides an ability to discriminate between predictions with high structural quality. When assessing ligand‐binding sites, most prediction methods have higher performance on apo‐sites than holo‐sites. Some servers show consistently high performance for certain types of functional sites. Finally, many functional sites are associated with protein‐protein interaction. We also analyzed biologically relevant features from the protein assemblies of two targets where the active site spanned the protein‐protein interface. For the assembly targets, we find that the features in the models are mainly determined by the choice of template. PMID:28975675
Performance needs assessment of maternal and newborn health service delivery in urban and rural areas of Osun State, South-West, Nigeria.

PubMed

Esan, Oluwaseun T; Fatusi, Adesegun O

2014-06-01

The study aimed to determine performance and compare gaps in maternal and newborn health (MNH) services in urban and rural areas of Osun State, Nigeria, to inform decisions for improved services. This study involved 14 urban and 10 rural-based randomly selected PHC facilities. Using a Performance Needs Assessment framework, desired performances were determined by key stakeholders and actual performances were measured by conducting a facility survey. Questionnaire interviews of 143 health workers and 153 antenatal clients were conducted. Performance gaps were determined for the urban and rural areas and compared using Chi-square tests with SPSS version 17. PHC facilities and health workers in Osun State, Nigeria, were found to have significant gaps in MNH service performance, and these gaps were worse in the rural areas. The root cause of most of the performance gaps was poor political will on the part of local government authorities. Improved government commitment to MNH is needed to address most of the gaps.

Cluster Detection Tests in Spatial Epidemiology: A Global Indicator for Performance Assessment

PubMed Central

Guttmann, Aline; Li, Xinran; Feschet, Fabien; Gaudart, Jean; Demongeot, Jacques; Boire, Jean-Yves; Ouchchane, Lemlih

2015-01-01

In cluster detection of disease, the use of local cluster detection tests (CDTs) is current. These methods aim both at locating likely clusters and testing for their statistical significance. New or improved CDTs are regularly proposed to epidemiologists and must be subjected to performance assessment. Because location accuracy has to be considered, performance assessment goes beyond the raw estimation of type I or II errors. As no consensus exists for performance evaluations, heterogeneous methods are used, and therefore studies are rarely comparable. A global indicator of performance, which assesses both spatial accuracy and usual power, would facilitate the exploration of CDTs behaviour and help between-studies comparisons. The Tanimoto coefficient (TC) is a well-known measure of similarity that can assess location accuracy but only for one detected cluster. In a simulation study, performance is measured for many tests. From the TC, we here propose two statistics, the averaged TC and the cumulated TC, as indicators able to provide a global overview of CDTs performance for both usual power and location accuracy. We evidence the properties of these two indicators and the superiority of the cumulated TC to assess performance. We tested these indicators to conduct a systematic spatial assessment displayed through performance maps. PMID:26086911
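To make the location-accuracy measure above concrete, here is a minimal sketch of the Tanimoto coefficient for a single detected cluster, treating each cluster as a set of spatial unit identifiers. The function names and the example district IDs are hypothetical. The plain mean over simulated datasets shown here is only an illustrative summary; it should not be read as the paper's definition of the averaged or cumulated TC, which the abstract does not give.

```python
def tanimoto(detected, true_cluster):
    """Tanimoto coefficient: |A intersect B| / |A union B| for two sets of units."""
    detected, true_cluster = set(detected), set(true_cluster)
    union = detected | true_cluster
    if not union:
        # Both sets empty: return 0.0 by convention (an assumption made here).
        return 0.0
    return len(detected & true_cluster) / len(union)


def mean_tanimoto(detections, true_cluster):
    """Plain mean of TC over simulated datasets; a missed detection is an empty set."""
    scores = [tanimoto(d, true_cluster) for d in detections]
    return sum(scores) / len(scores)


if __name__ == "__main__":
    true_cluster = {"d2", "d3", "d4"}            # hypothetical district IDs
    runs = [{"d1", "d2", "d3"}, set(), {"d2", "d3", "d4"}]
    print(tanimoto(runs[0], true_cluster))       # overlap 2, union 4
    print(mean_tanimoto(runs, true_cluster))
```

Counting a non-detection as an empty set gives it a score of zero, so a summary of this kind penalises both a missed cluster and a badly located one.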
Evaluating the Performance of Online K-12 Schools

ERIC Educational Resources Information Center

Carpenter, Dick; Kafer, Krista; Reeser, Kelly; Shafer, Sheryl

2015-01-01

This article examines K-12 online student and school performance across an entire state (Colorado) in the United States through two comparisons. First, state assessment scores of students in online schools are compared to those in traditional brick and mortar schools. Second, the accountability scores of online schools are compared to those of…
Topical Knowledge in L2 Speaking Assessment: Comparing Independent and Integrated Speaking Test Tasks

ERIC Educational Resources Information Center

Huang, Heng-Tsung Danny; Hung, Shao-Ting Alan; Plakans, Lia

2018-01-01

Integrated speaking test tasks (integrated tasks) provide reading and/or listening input to serve as the basis for test-takers to formulate their oral responses. This study examined the influence of topical knowledge on integrated speaking test performance and compared independent speaking test performance and integrated speaking test performance…
Comparability of Computer-Based and Paper-Based Science Assessments

ERIC Educational Resources Information Center

Herrmann-Abell, Cari F.; Hardcastle, Joseph; DeBoer, George E.

2018-01-01

We compared students' performance on a paper-based test (PBT) and three computer-based tests (CBTs). The three computer-based tests used different test navigation and answer selection features, allowing us to examine how these features affect student performance. The study sample consisted of 9,698 fourth through twelfth grade students from across…
Multilevel Structural Equation Models for the Analysis of Comparative Data on Educational Performance

ERIC Educational Resources Information Center

Goldstein, Harvey; Bonnet, Gerard; Rocher, Thierry

2007-01-01

The Programme for International Student Assessment comparative study of reading performance among 15-year-olds is reanalyzed using statistical procedures that allow the full complexity of the data structures to be explored. The article extends existing multilevel factor analysis and structural equation models and shows how this can extract richer…
48 CFR 873.116 - Source selection decision.

Code of Federal Regulations, 2014 CFR

2014-10-01

... Source selection decision. (a) An integrated comparative assessment of proposals should be performed... source selection team, or advisory boards or panels, may conduct comparative analysis(es) of proposals...
48 CFR 873.116 - Source selection decision.

Code of Federal Regulations, 2013 CFR

2013-10-01

... Source selection decision. (a) An integrated comparative assessment of proposals should be performed... source selection team, or advisory boards or panels, may conduct comparative analysis(es) of proposals...
48 CFR 873.116 - Source selection decision.

Code of Federal Regulations, 2011 CFR

2011-10-01

... Source selection decision. (a) An integrated comparative assessment of proposals should be performed... source selection team, or advisory boards or panels, may conduct comparative analysis(es) of proposals...
48 CFR 873.116 - Source selection decision.

Code of Federal Regulations, 2012 CFR

2012-10-01

... Source selection decision. (a) An integrated comparative assessment of proposals should be performed... source selection team, or advisory boards or panels, may conduct comparative analysis(es) of proposals...
48 CFR 873.116 - Source selection decision.

Code of Federal Regulations, 2010 CFR

2010-10-01

... Source selection decision. (a) An integrated comparative assessment of proposals should be performed... source selection team, or advisory boards or panels, may conduct comparative analysis(es) of proposals...
Breast volume assessment: comparing five different techniques.

PubMed

Bulstrode, N; Bellamy, E; Shrotria, S

2001-04-01

Breast volume assessment is not routinely performed pre-operatively because as yet there is no accepted technique. There have been a variety of methods published, but this is the first study to compare these techniques. We compared volume measurements obtained from mammograms (previously compared to mastectomy specimens) with estimates of volume obtained from four other techniques: thermoplastic moulding, magnetic resonance imaging, Archimedes principle and anatomical measurements. We also assessed the acceptability of each method to the patient. Measurements were performed on 10 women, which produced results for 20 breasts. We were able to calculate regression lines between volume measurements obtained from mammography and the other four methods: (1) magnetic resonance imaging (MRI), 379 + (0.75 × MRI) [r=0.48]; (2) thermoplastic moulding, 132 + (1.46 × thermoplastic moulding) [r=0.82]; (3) anatomical measurements, 168 + (1.55 × anatomical measurements) [r=0.83]; (4) Archimedes principle, 359 + (0.6 × Archimedes principle) [r=0.61]; all units in cc. The regression curves for the different techniques are variable and it is difficult to reliably compare results. A standard method of volume measurement should be used when comparing volumes before and after intervention or between individual patients, and it is unreliable to compare volume measurements using different methods. Calculating the breast volume from mammography has previously been compared to mastectomy samples and shown to be reasonably accurate. However, we feel thermoplastic moulding shows promise and should be further investigated as it gives not only a volume assessment but a three-dimensional impression of the breast shape, which may be valuable in assessing cosmesis following breast-conserving surgery.
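The regression lines quoted in this abstract can be applied directly. The sketch below assumes each line predicts the mammography-derived volume from the other method's reading (the abstract does not state the direction explicitly), and the dictionary keys and function name are hypothetical labels chosen for illustration.

```python
# (intercept in cc, slope, correlation r) as quoted in the abstract.
REGRESSION_LINES = {
    "mri": (379.0, 0.75, 0.48),
    "thermoplastic_moulding": (132.0, 1.46, 0.82),
    "anatomical_measurements": (168.0, 1.55, 0.83),
    "archimedes_principle": (359.0, 0.60, 0.61),
}


def mammography_equivalent(method, volume_cc):
    """Predicted mammography volume (cc), assuming mammography is the outcome."""
    intercept, slope, _r = REGRESSION_LINES[method]
    return intercept + slope * volume_cc


if __name__ == "__main__":
    # The same nominal 400 cc reading maps to a different value for each method.
    for method in REGRESSION_LINES:
        print(method, round(mammography_equivalent(method, 400.0), 1))
```

Running the loop shows why the authors advise against mixing methods: an identical reading yields mammography-equivalent volumes that differ by well over 100 cc depending on the technique.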
Word Recall: Cognitive Performance Within Internet Surveys

PubMed Central

Craig, Benjamin M; Jim, Heather S

2015-01-01

Background: The use of online surveys for data collection has increased exponentially, yet it is often unclear whether interview-based cognitive assessments (such as face-to-face or telephonic word recall tasks) can be adapted for use in application-based research settings. Objective: The objective of the current study was to compare and characterize the results of online word recall tasks to those of the Health and Retirement Study (HRS) and determine the feasibility and reliability of incorporating word recall tasks into application-based cognitive assessments. Methods: The results of the online immediate and delayed word recall assessment, included within the Women’s Health and Valuation (WHV) study, were compared to the results of the immediate and delayed recall tasks of Waves 5-11 (2000-2012) of the HRS. Results: Performance on the WHV immediate and delayed tasks demonstrated strong concordance with performance on the HRS tasks (ρc=.79, 95% CI 0.67-0.91), despite significant differences between study populations (P<.001) and study design. Sociodemographic characteristics and self-reported memory demonstrated similar relationships with performance on both the HRS and WHV tasks. Conclusions: The key finding of this study is that the HRS word recall tasks performed similarly when used as an online cognitive assessment in the WHV. Online administration of cognitive tests, which has the potential to significantly reduce participant and administrative burden, should be considered in future research studies and health assessments. PMID:26543924
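The ρc notation in this abstract conventionally denotes Lin's concordance correlation coefficient. Assuming that is the statistic reported (the abstract does not describe the computation), a minimal sketch of the coefficient for two paired series follows. The function name and the example values are hypothetical and are not data from either study.

```python
def concordance_correlation(x, y):
    """Lin's concordance correlation coefficient for paired observations.

    rho_c = 2 * cov(x, y) / (var(x) + var(y) + (mean(x) - mean(y))**2),
    using population (divide-by-n) variances and covariance.
    """
    if len(x) != len(y) or not x:
        raise ValueError("x and y must be non-empty and of equal length")
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    var_x = sum((a - mean_x) ** 2 for a in x) / n
    var_y = sum((b - mean_y) ** 2 for b in y) / n
    cov_xy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y)) / n
    return 2.0 * cov_xy / (var_x + var_y + (mean_x - mean_y) ** 2)


if __name__ == "__main__":
    online = [1.0, 2.0, 3.0]       # hypothetical mean recall scores
    interview = [2.0, 3.0, 4.0]    # same pattern, shifted up by one word
    print(concordance_correlation(online, interview))
```

Unlike Pearson's r, which would be 1 for the shifted series above, the concordance coefficient drops below 1 because it also penalises a systematic offset between the two modes of administration.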
Effects of Learning on Performance When Computerized Dynamic Posturography Assessments Are Repeated

NASA Technical Reports Server (NTRS)

Taylor, Laura C.; Paloski, William H.; Wood, Scott J.

2008-01-01

Background: Computerized dynamic posturography is widely used to measure balance control performance. Clinically, performance is assessed by comparing individual data against standards obtained from a normative population. When performing repeated assessments to track performance changes, one must be concerned with the influence of learning effects. Subjects do not have the opportunity to practice before the first session, and often a second session is not performed prior to an experiment. Thus, the objective of this activity was to examine learning effects on balance control performance. We hypothesize that subjects will perform better on the second session when compared to the first, and that the difference will be greater for more difficult conditions. Methods: Data were collected from 204 subjects using the NeuroCom Equitest system during quiet stance with arms crossed at the chest on up to two sessions. All subjects performed standard sensory organization tests (SOTs) including 1) normal vision, fixed support; 2) absent vision, fixed support; 3) sway-referenced vision, fixed support; 4) normal vision, swayreferenced support; 5) absent vision, sway-referenced support; and 6) sway-referenced vision, sway-referenced support. 120 of these subjects performed modified sensory organization tests (mSOTs 2 and 5) which included static (20 back) and dynamic (20, 0.33Hz) head tilts. Median equilibrium scores (mEQ) were calculated from peak-to-peak anterior-posterior sway across trials. Data collected on the first session were then compared with the second to examine learning effect. Results: There were no differences in mEQ scores between the first and second sessions for SOTs 1, 2, and 4, while mEQ scores were higher for the second session when compared to the first for SOTs 3, 5, and 6 and for all mSOTs. 
Discussion: An additional familiarization session or practice trials prior to the first session may be necessary for more challenging SOT and mSOT conditions to minimize learning effect.
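As a rough illustration of how a median equilibrium score might be derived from peak-to-peak anterior-posterior sway, the sketch below uses the commonly cited posturography convention of scoring sway against a 12.5 degree theoretical limit. That limit, the function names, and the clamping at zero are assumptions made here for illustration; the abstract does not state the exact formula the authors used.

```python
from statistics import median

# Assumed theoretical limit of anterior-posterior sway, in degrees.
# This value is not given in the abstract; it is an illustrative assumption.
MAX_SWAY_DEG = 12.5

def equilibrium_score(p2p_sway_deg):
    """Score one trial: 100 means no sway, 0 means sway at or beyond the limit."""
    score = 100.0 * (1.0 - p2p_sway_deg / MAX_SWAY_DEG)
    return max(0.0, score)  # clamp so very large sway does not go negative

def median_eq(p2p_sways_deg):
    """Median equilibrium score across the trials of one condition."""
    return median(equilibrium_score(s) for s in p2p_sways_deg)
```

Comparing `median_eq` for the same condition across two sessions would then give the kind of first-versus-second-session contrast the abstract describes.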
Proposed evaluation framework for assessing operator performance with multisensor displays

NASA Technical Reports Server (NTRS)

Foyle, David C.

1992-01-01

Despite aggressive work on the development of sensor fusion algorithms and techniques, no formal evaluation procedures have been proposed. Based on existing integration models in the literature, an evaluation framework is developed to assess an operator's ability to use multisensor, or sensor fusion, displays. The proposed evaluation framework for evaluating the operator's ability to use such systems is a normative approach: The operator's performance with the sensor fusion display can be compared to the models' predictions based on the operator's performance when viewing the original sensor displays prior to fusion. This allows for the determination as to when a sensor fusion system leads to: 1) poorer performance than one of the original sensor displays (clearly an undesirable system in which the fused sensor system causes some distortion or interference); 2) better performance than with either single sensor system alone, but at a sub-optimal (compared to the model predictions) level; 3) optimal performance (compared to model predictions); or, 4) super-optimal performance, which may occur if the operator were able to use some highly diagnostic 'emergent features' in the sensor fusion display, which were unavailable in the original sensor displays. An experiment demonstrating the usefulness of the proposed evaluation framework is discussed.
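The four outcome categories listed in this abstract can be sketched as a simple decision rule. This is only an illustration under stated assumptions, not the author's procedure: the function name, the use of scalar scores where higher means better, and the tolerance `tol` used to decide when performance matches the model prediction are all hypothetical.

```python
def classify_fusion(fused, sensor_a, sensor_b, predicted, tol=0.01):
    """Place fused-display performance into one of the four categories.

    fused      -- operator performance with the fused display
    sensor_a/b -- performance with each original sensor display
    predicted  -- performance the integration model predicts for fusion
    tol        -- hypothetical tolerance for "matches the prediction"
    """
    best_single = max(sensor_a, sensor_b)
    if fused < best_single:
        # Category 1: worse than at least one original display.
        return "poorer than an original display"
    if abs(fused - predicted) <= tol:
        # Category 3: matches the model prediction.
        return "optimal"
    if fused > predicted:
        # Category 4: exceeds the model prediction.
        return "super-optimal"
    # Category 2: no worse than either single display, below the prediction.
    # A tie with the best single display also lands here, by assumption.
    return "sub-optimal"
```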
Perception versus reality: a comparative study of the clinical judgment skills of nurses during a simulated activity.

PubMed

Fenske, Cynthia L; Harris, Margaret A; Aebersold, Michelle L; Hartman, Laurie S

2013-09-01

This study was conducted to determine how closely nurses' perceptions of their clinical judgment abilities matched their demonstrated clinical judgment skills during a simulation. Seventy-four registered nurses participated in a simulation using a video format. After the simulation, the nurses self-assessed their performance using the Lasater Clinical Judgment Rubric. This rubric was then used to rate the nurses' actual performance in the simulation activity. The study results showed a significant discrepancy between nurses' perceptions of their own clinical judgment abilities and their demonstrated clinical judgment skills. Age and length of nursing experience enhanced the difference between the findings of self-assessment and actual performance. Younger nurses and those with 1 year or less of nursing experience were significantly more likely to have self-assessed their abilities at a much higher level compared with their actual skills. Copyright 2013, SLACK Incorporated.
A Comparative Analysis of the Consistency and Difference among Teacher-Assessment, Student Self-Assessment and Peer-Assessment in a Web-Based Portfolio Assessment Environment for High School Students

ERIC Educational Resources Information Center

Chang, Chi-Cheng; Tseng, Kuo-Hung; Lou, Shi-Jer

2012-01-01

This study explored the consistency and difference of teacher-, student self- and peer-assessment in the context of Web-based portfolio assessment. Participants were 72 senior high school students enrolled in a computer application course. Through the assessment system, the students performed portfolio creation, inspection, self- and…
The predictive power of physical function assessed by questionnaire and physical performance measures for subsequent disability.

PubMed

Hoshi, Masayuki; Hozawa, Atsushi; Kuriyama, Shinichi; Nakaya, Naoki; Ohmori-Matsuda, Kaori; Sone, Toshimasa; Kakizaki, Masako; Niu, Kaijun; Fujita, Kazuki; Ueki, Shouzoh; Haga, Hiroshi; Nagatomi, Ryoichi; Tsuji, Ichiro

2012-08-01

To compare the predictive power of physical function assessed by questionnaire and physical performance measures for subsequent disability in community-dwelling elderly persons. Prospective cohort study. Participants were 813 community-dwelling elderly Japanese aged 70 years and older, included in the Tsurugaya Project, who were not disabled at the baseline in 2003. Physical function was assessed by the questionnaire of "Motor Fitness Scale". Physical performance measures consisted of maximum walking velocity, timed up and go test (TUG), leg extension power, and functional reach test. The area under the curve (AUC) of the receiver operating characteristic curve for disability was used to compare screening accuracy between Motor Fitness Scale and physical performance measures. Incident disability, defined as certification for long-term care insurance, was used as the endpoint. We observed 135 cases of incident disability during follow-up. The third or fourth quartile for each measure was associated with a significantly increased risk of disability in comparison with the highest quartile. The AUC was 0.70, 0.72, 0.70, 0.68, 0.69 and 0.74, for Motor Fitness Scale, maximum walking velocity, TUG, leg extension power, functional reach test, and total performance score, respectively. The predictive power of physical function assessed by the Motor Fitness Scale was equivalent to that assessed by physical performance measures. Since Motor Fitness Scale can evaluate physical function safely and simply in comparison with physical performance tests, it would be a practical tool for screening persons at high risk of disability.
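The AUC comparison reported here rests on a simple idea: the AUC equals the probability that a randomly chosen person who became disabled has a higher risk score than a randomly chosen person who did not. A minimal sketch of that pairwise formulation follows; the function name and the orientation of the score (higher means greater risk) are assumptions, and the study's own software and data are not shown in the abstract.

```python
def auc(scores, labels):
    """AUC via the pairwise (Mann-Whitney) formulation.

    scores -- risk scores, higher meaning greater predicted risk (assumed)
    labels -- 1 for incident disability, 0 otherwise
    Ties between a case and a non-case count as half a win.
    """
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one case and one non-case")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))
```

Computing this once per measure on the same cohort gives directly comparable values, which is how a questionnaire score and a physical performance score can be ranked on screening accuracy.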
Segmentized Clear Channel Assessment for IEEE 802.15.4 Networks.

PubMed

Son, Kyou Jung; Hong, Sung Hyeuck; Moon, Seong-Pil; Chang, Tae Gyu; Cho, Hanjin

2016-06-03

This paper proposed segmentized clear channel assessment (CCA) which increases the performance of IEEE 802.15.4 networks by improving carrier sense multiple access with collision avoidance (CSMA/CA). Improving CSMA/CA is important because the low-power consumption feature and throughput performance of IEEE 802.15.4 are greatly affected by CSMA/CA behavior. To improve the performance of CSMA/CA, this paper focused on increasing the chance to transmit a packet by assessing precise channel status. The previous method used in CCA, which is employed by CSMA/CA, assesses the channel by measuring the energy level of the channel. However, this method shows limited channel assessing behavior, which comes from simple threshold dependent channel busy evaluation. The proposed method solves this limited channel decision problem by dividing CCA into two groups. Two groups of CCA compare their energy levels to get precise channel status. To evaluate the performance of the segmentized CCA method, a Markov chain model has been developed. The validation of analytic results is confirmed by comparing them with simulation results. Additionally, simulation results show that the proposed method improves throughput by up to 8.76% and decreases the average number of CCAs per packet transmission by up to 3.9% compared with the IEEE 802.15.4 CCA method.
Segmentized Clear Channel Assessment for IEEE 802.15.4 Networks

PubMed Central

Son, Kyou Jung; Hong, Sung Hyeuck; Moon, Seong-Pil; Chang, Tae Gyu; Cho, Hanjin

2016-01-01

This paper proposed segmentized clear channel assessment (CCA) which increases the performance of IEEE 802.15.4 networks by improving carrier sense multiple access with collision avoidance (CSMA/CA). Improving CSMA/CA is important because the low-power consumption feature and throughput performance of IEEE 802.15.4 are greatly affected by CSMA/CA behavior. To improve the performance of CSMA/CA, this paper focused on increasing the chance to transmit a packet by assessing precise channel status. The previous method used in CCA, which is employed by CSMA/CA, assesses the channel by measuring the energy level of the channel. However, this method shows limited channel assessing behavior, which comes from simple threshold dependent channel busy evaluation. The proposed method solves this limited channel decision problem by dividing CCA into two groups. Two groups of CCA compare their energy levels to get precise channel status. To evaluate the performance of the segmentized CCA method, a Markov chain model has been developed. The validation of analytic results is confirmed by comparing them with simulation results. Additionally, simulation results show that the proposed method improves throughput by up to 8.76% and decreases the average number of CCAs per packet transmission by up to 3.9% compared with the IEEE 802.15.4 CCA method. PMID:27271626
Implementation of Performance Assessment in STEM (Science, Technology, Engineering, Mathematics) Education to Detect Science Process Skill

NASA Astrophysics Data System (ADS)

Septiani, A.; Rustaman, N. Y.

2017-02-01

A descriptive study of the implementation of performance assessment in STEM-based instruction was carried out to investigate the science process skills of tenth-grade vocational school students during the teaching and learning process. Tenth-grade agriculture students were involved as research subjects, selected through a cluster random sampling technique (n=35). Performance assessment was planned for skills shown during the teaching and learning process, through observation, and for the product resulting from their engineering practice design. The procedure conducted in this study included a thinking phase (identifying the problem and sharing ideas), a designing phase, a construction phase, and an evaluation phase. Data were collected using a science process skills (SPS) test, an observation sheet on student activity, and tasks and rubrics for performance assessment during the instruction. Research findings show that the implementation of performance assessment in STEM education on planting media could detect students' science process skills better through individual observation than through the SPS test. It was also found that the result of performance assessment was diverse when it was correlated to each indicator of SPS (strong and positive; weak and positive).

Performance of two different digital evaluation systems used for assessing pre-clinical dental students' prosthodontic technical skills.

PubMed

Gratton, D G; Kwon, S R; Blanchette, D R; Aquilino, S A

2017-11-01

Proper integration of newly emerging digital assessment tools is a central issue in dental education in an effort to provide more accurate and objective feedback to students. The study examined how the outcomes of students' tooth preparation were correlated when evaluated using traditional faculty assessment and two types of digital assessment approaches. Specifically, incorporation of the Romexis Compare 2.0 (Compare) and Sirona prepCheck 1.1 (prepCheck) systems was evaluated. Additionally, satisfaction of students based on the type of software was evaluated through a survey. Students in a second-year pre-clinical prosthodontics course were allocated to either Compare (n = 42) or prepCheck (n = 37) systems. All students received conventional instruction and used their assigned digital system as an additional evaluation tool to aid in assessing their work. Examinations assessed crown preparations of the maxillary right central incisor (#8) and the mandibular left first molar (#19). All submissions were graded by faculty, Compare and prepCheck. Technical scores did not differ between student groups for any of the assessment approaches. Compare and prepCheck had modest, statistically significant correlations with faculty scores with a minimum correlation of 0.3944 (P = 0.0011) and strong, statistically significant correlations with each other with a minimum correlation of 0.8203 (P < 0.0001). A post-course student survey found that 55.26% of the students felt unfavourably about learning the digital evaluation protocols. A total of 62.31% felt favourably about the integration of these digital tools into the curriculum. Comparison of Compare and prepCheck showed no evidence of significant difference in students' prosthodontics technical performance and perception. © 2016 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Comparison of answer-until-correct and full-credit assessments in a team-based learning course.

PubMed

Farland, Michelle Z; Barlow, Patrick B; Levi Lancaster, T; Franks, Andrea S

2015-03-25

To assess the impact of awarding partial credit to team assessments on team performance and on quality of team interactions using an answer-until-correct method compared to traditional methods of grading (multiple-choice, full-credit). Subjects were students from 3 different offerings of an ambulatory care elective course, taught using team-based learning. The control group (full-credit) consisted of those enrolled in the course when traditional methods of assessment were used (2 course offerings). The intervention group consisted of those enrolled in the course when answer-until-correct method was used for team assessments (1 course offering). Study outcomes included student performance on individual and team readiness assurance tests (iRATs and tRATs), individual and team final examinations, and student assessment of quality of team interactions using the Team Performance Scale. Eighty-four students enrolled in the courses were included in the analysis (full-credit, n=54; answer-until-correct, n=30). Students who used traditional methods of assessment performed better on iRATs (full-credit mean 88.7 (5.9), answer-until-correct mean 82.8 (10.7), p<0.001). Students who used answer-until-correct method of assessment performed better on the team final examination (full-credit mean 45.8 (1.5), answer-until-correct 47.8 (1.4), p<0.001). There was no significant difference in performance on tRATs and the individual final examination. Students who used the answer-until-correct method had higher quality of team interaction ratings (full-credit 97.1 (9.1), answer-until-correct 103.0 (7.8), p=0.004). Answer-until-correct assessment method compared to traditional, full-credit methods resulted in significantly lower scores for iRATs, similar scores on tRATs and individual final examinations, improved scores on team final examinations, and improved perceptions of the quality of team interactions.
Engineered Barrier System performance requirements systems study report. Revision 02

DOE Office of Scientific and Technical Information (OSTI.GOV)

Balady, M.A.

This study evaluates the current design concept for the Engineered Barrier System (EBS), in concert with the current understanding of the geologic setting to assess whether enhancements to the required performance of the EBS are necessary. The performance assessment calculations are performed by coupling the EBS with the geologic setting based on the models (some of which were updated for this study) and assumptions used for the 1995 Total System Performance Assessment (TSPA). The need for enhancements is determined by comparing the performance assessment results against the EBS related performance requirements. Subsystem quantitative performance requirements related to the EBS include the requirement to allow no more than 1% of the waste packages (WPs) to fail before 1,000 years after permanent closure of the repository, as well as a requirement to control the release rate of radionuclides from the EBS. The EBS performance enhancements considered included additional engineered components as well as evaluating additional performance available from existing design features but for which no performance credit is currently being taken.
Haemophilia & Exercise Project (HEP): subjective and objective physical performance in adult haemophilia patients--results of a cross-sectional study.

PubMed

Czepa, D; Von Mackensen, S; Hilberg, T

2012-01-01

Recurrent musculoskeletal haemorrhages in people with haemophilia (PWH) lead to restrictions in the locomotor system and consequently in physical performance. Patients' perceptions of their health status have gained an important role in the last few years. The assessment of subjective physical performance in PWH is a new approach. This study aimed to compare the subjective physical performance of PWH with healthy controls and to correlate the results with objective data. Subjective physical performance was assessed via the new questionnaire HEP-Test-Q, which consists of 25 items pertaining to four subscales 'mobility', 'strength & coordination', 'endurance' and 'body perception'. HEP-Test-Q subscales were compared with objective data in terms of range of motion, one-leg-stand and 12-minute walk test. Forty-eight patients (44 ± 11 years) with haemophilia A (43 severe, three moderate) or B (two severe) and 43 controls without haemophilia (42 ± 11 years) were enrolled. PWH showed an impaired subjective physical performance in all HEP-Test-Q subscales and in the total score (52 ± 20) compared with controls (77 ± 10; P ≤ 0.001). Correlation analyses for the total score of the HEP-Test-Q and objective data revealed values ranging from r = 0.403 (one-leg-stand) to r = 0.757 (12-minute walk test) (P ≤ 0.001). PWH evaluated their physical performance poorer in comparison with healthy people. As self-assessment did not always correlate highly with objective data, objective examinations of physical performance in PWH should be complemented with subjective perceptions. © 2011 Blackwell Publishing Ltd.
Cost, Time, and Risk Assessment of Different Wave Energy Converter Technology Development Trajectories: Preprint

DOE Office of Scientific and Technical Information (OSTI.GOV)

Weber, Jochem W; Laird, Daniel; Costello, Ronan

This paper presents a comparative assessment of three fundamentally different wave energy converter technology development trajectories. The three technology development trajectories are expressed and visualised as a function of technology readiness levels and technology performance levels. The assessment shows that development trajectories that initially prioritize technology readiness over technology performance are likely to require twice the development time, consume a threefold of the development cost, and are prone to a risk of technical or commercial failure of one order of magnitude higher than those development trajectories that initially prioritize technology performance over technology readiness.
Assessing Student Reasoning in Upper-Division Electricity and Magnetism at Oregon State University

ERIC Educational Resources Information Center

Zwolak, Justyna P.; Manogue, Corinne A.

2015-01-01

Standardized assessment tests that allow researchers to compare the performance of students under various curricula are highly desirable. There are several research-based conceptual tests that serve as instruments to assess and identify students' difficulties in lower-division courses. At the upper-division level assessing students' difficulties…
Promoting Learning and Achievement through Self-Assessment

ERIC Educational Resources Information Center

Andrade, Heidi; Valtcheva, Anna

2009-01-01

Criteria-referenced self-assessment is a process during which students collect information about their own performance or progress; compare it to explicitly stated criteria, goals, or standards; and revise accordingly. The authors argue that self-assessment must be a formative type of assessment, done on drafts of works in progress: It should not…
Impact of hybrid delivery of education on student academic performance and the student experience.

PubMed

Congdon, Heather Brennan; Nutter, Douglas A; Charneski, Lisa; Butko, Peter

2009-11-12

To compare student academic performance and the student experience in the first-year doctor of pharmacy (PharmD) program between the main and newly opened satellite campuses of the University of Maryland. Student performance indicators including graded assessments, course averages, cumulative first-year grade point average (GPA), and introductory pharmacy practice experience (IPPE) evaluations were analyzed retrospectively. Student experience indicators were obtained via an online survey instrument and included involvement in student organizations; time-budgeting practices; and stress levels and their perceived effect on performance. Graded assessments, course averages, GPA, and IPPE evaluations were indistinguishable between campuses. Students' time allocation was not different between campuses, except for time spent attending class and watching lecture videos. There was no difference between students' stress levels at each campus. The implementation of a satellite campus to expand pharmacy education yielded academic performance and student engagement comparable to those from traditional delivery methods.
Agreement between Computerized and Human Assessment of Performance on the Ruff Figural Fluency Test

PubMed Central

Elderson, Martin F.; Pham, Sander; van Eersel, Marlise E. A.; Wolffenbuttel, Bruce H. R.; Kok, Johan; Gansevoort, Ron T.; Tucha, Oliver; van der Klauw, Melanie M.; Slaets, Joris P. J.

2016-01-01

The Ruff Figural Fluency Test (RFFT) is a sensitive test for nonverbal fluency suitable for all age groups. However, assessment of performance on the RFFT is time-consuming and may be affected by interrater differences. Therefore, we developed computer software specifically designed to analyze performance on the RFFT by automated pattern recognition. The aim of this study was to compare assessment by the new software with conventional assessment by human raters. The software was developed using data from the Lifelines Cohort Study and validated in an independent cohort of the Prevention of Renal and Vascular End Stage Disease (PREVEND) study. The total study population included 1,761 persons: 54% men; mean age (SD), 58 (10) years. All RFFT protocols were assessed by the new software and two independent human raters (criterion standard). The mean number of unique designs (SD) was 81 (29) and the median number of perseverative errors (interquartile range) was 9 (4 to 16). The intraclass correlation coefficient (ICC) between the computerized and human assessment was 0.994 (95%CI, 0.988 to 0.996; p<0.001) and 0.991 (95%CI, 0.990 to 0.991; p<0.001) for the number of unique designs and perseverative errors, respectively. The mean difference (SD) between the computerized and human assessment was -1.42 (2.78) and +0.02 (1.94) points for the number of unique designs and perseverative errors, respectively. This was comparable to the agreement between two independent human assessments: ICC, 0.995 (0.994 to 0.995; p<0.001) and 0.985 (0.982 to 0.988; p<0.001), and mean difference (SD), -0.44 (2.98) and +0.56 (2.36) points for the number of unique designs and perseverative errors, respectively. We conclude that the agreement between the computerized and human assessment was very high and comparable to the agreement between two independent human assessments. Therefore, the software is an accurate tool for the assessment of performance on the RFFT. PMID:27661083
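The mean difference (SD) figures in this abstract summarize paired disagreements between two scorings of the same protocols. A minimal sketch of that summary is shown below; the function name and the sign convention (computerized minus human) are assumptions, and the intraclass correlation coefficient reported by the authors is not reproduced here.

```python
from statistics import mean, stdev

def paired_difference(computerized, human):
    """Return (mean, sample SD) of per-protocol score differences.

    Differences are taken as computerized minus human, which is an
    assumed sign convention for this sketch.
    """
    if len(computerized) != len(human):
        raise ValueError("both raters must score the same protocols")
    if len(computerized) < 2:
        raise ValueError("need at least two protocols to compute an SD")
    diffs = [c - h for c, h in zip(computerized, human)]
    return mean(diffs), stdev(diffs)
```

Running the same function on two human raters' scores gives the human-versus-human baseline against which the software's agreement can be judged.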
Anxiety and performance of nursing students in regard to assessment via clinical simulations in the classroom versus filmed assessments.

PubMed

de Souza Teixeira, Carla Regina; Kusumota, Luciana; Alves Pereira, Marta Cristiane; Merizio Martins Braga, Fernanda Titareli; Pirani Gaioso, Vanessa; Mara Zamarioli, Cristina; Campos de Carvalho, Emilia

2014-01-01

To compare the level of anxiety and performance of nursing students when performing a clinical simulation through the traditional method of assessment with the presence of an evaluator and through a filmed assessment without the presence of an evaluator. Controlled trial with the participation of 20 students from a Brazilian public university who were randomly assigned to one of two groups: a) assessment through the traditional method with the presence of an evaluator; or b) filmed assessment. The level of anxiety was assessed using the Zung test and performance was measured based on the number of correct answers. Averages of 32 and 27 were obtained on the anxiety scale by the group assessed through the traditional method before and after the simulation, respectively, while the filmed group obtained averages of 33 and 26; the final scores correspond to mild anxiety. Even though there was a statistically significant reduction in the intra-group scores before and after the simulation, there was no difference between the groups. As for the performance assessments in the clinical simulation, the groups obtained similar percentages of correct answers (83% in the traditional assessment and 84% in the filmed assessment) without statistically significant differences. Filming can be used and encouraged as a strategy to assess nursing undergraduate students.
Education Watch: New Mexico. Key Education Facts and Figures. Achievement, Attainment and Opportunity. From Elementary School through College.

ERIC Educational Resources Information Center

Education Trust, Washington, DC.

This report compares New Mexico's reading and mathematics performance on the most recent administrations of the state assessment with performance on the National Assessment of Educational Progress (NAEP). To indicate how New Mexico is doing in narrowing the academic achievement gap between African American and Latino students and their white,…
GFO and JASON Altimeter Engineering Assessment Report. Update: GFO--Acceptance to December 27, 2007, JASON--Acceptance to December 26, 2007. Version 1: June 2008

NASA Technical Reports Server (NTRS)

Conger, A. M.; Hancock, D. W.; Hayne, G. S.; Brooks, R. L.

2008-01-01

The purpose of this document is to present and document GEOSAT Follow-On (GFO) performance analyses and results. This is the eighth Assessment Report since the initial report. This report extends the performance assessment since acceptance to 27 December 2007. Since launch, a variety of GFO performance studies have been performed: Appendix A provides an accumulative index of those studies. We began the inclusion of analyses of the JASON altimeter after the end of the Topographic Experiment (TOPEX) mission. Prior to this, JASON and TOPEX were compared during our assessment of the TOPEX altimeter. With the end of the TOPEX mission, we developed methods to report on JASON as it relates to GFO.
Performance of high intensity fed-batch mammalian cell cultures in disposable bioreactor systems.

PubMed

Smelko, John Paul; Wiltberger, Kelly Rae; Hickman, Eric Francis; Morris, Beverly Janey; Blackburn, Tobias James; Ryll, Thomas

2011-01-01

The adoption of disposable bioreactor technology as an alternative to traditional nondisposable technology is gaining momentum in the biotechnology industry. Evaluation of current disposable bioreactor systems to sustain high intensity fed-batch mammalian cell culture processes needs to be explored. In this study, an assessment was performed comparing single-use bioreactor (SUB) systems of 50-, 250-, and 1,000-L operating scales with traditional stainless steel (SS) and glass vessels using four distinct mammalian cell culture processes. This comparison focuses on expansion and production stage performance. The SUB performance was evaluated based on three main areas: operability, process scalability, and process performance. The process performance and operability aspects were assessed over time and product quality performance was compared at the day of harvest. Expansion stage results showed disposable bioreactors mirror traditional bioreactors in terms of cellular growth and metabolism. Set-up and disposal times were dramatically reduced using the SUB systems when compared with traditional systems. Production stage runs for both Chinese hamster ovary and NS0 cell lines in the SUB system were able to model SS bioreactor runs at 100-, 200-, 2,000-, and 15,000-L scales. A single 1,000-L SUB run applying a high intensity fed-batch process was able to generate 7.5 kg of antibody with comparable product quality. Copyright © 2011 American Institute of Chemical Engineers (AIChE).
Recovery in Level 7-10 Women's USA Artistic Gymnastics.

PubMed

Buckner, Stephen B; Bacon, Nicholas T; Bishop, Phillip A

2017-01-01

This study assessed physical performance in women's artistic gymnastics following three variable recovery periods. Participants included fifteen female gymnasts (mean age = 13.5 ± 1.1) who had competed at USA Gymnastics (USAG) levels 7 - 10 within at least one year prior to the study. Each testing session consisted of a warm-up followed by four muscular endurance tests and one explosive maximal test. Assessments included pull-ups, leg lifts, handstand push-ups, vertical jump, and push-ups. After the performance assessments, the participants completed a typical practice session. The performance measures were reassessed at the beginning of each of the recovery periods of 24, 48, and 72 hours in a counterbalanced design. Performance assessments were converted into Z-scores and then averaged for a composite session Z-score. The composite session Z-scores were compared to evaluate the recovery duration. Composite Z-scores were significantly lower (p=0.000) after the 24 (z=-1.10) and the 48 hour (z=-0.71) recovery periods compared to baseline (z=0.00). However, there was no difference in scores (p=1.00) between the baseline and 72 hours (z=0.004) recovery. Full recovery required 72 hours under the conditions of this study.
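The composite Z-score described in this abstract can be sketched as follows. The abstract does not say which reference distribution was used for standardization; this sketch assumes each test is standardized against the baseline session's mean and sample SD, which would make the baseline composite zero by construction. The function name and data layout are hypothetical.

```python
from statistics import mean, stdev

def composite_z(session, baseline):
    """Average Z-score of one session across all tests.

    session, baseline -- dicts mapping a test name to a list of scores,
    one score per athlete. Each test is standardized against the
    baseline mean and sample SD (an assumption of this sketch).
    """
    per_test = []
    for test, base_scores in baseline.items():
        mu = mean(base_scores)
        sd = stdev(base_scores)
        z_values = [(x - mu) / sd for x in session[test]]
        per_test.append(mean(z_values))
    # Averaging across tests puts different units on one common scale.
    return mean(per_test)
```

Standardizing first is what allows repetition counts such as pull-ups and a distance measure such as vertical jump to be combined into a single session score.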
Continuous Performance Tasks: Not Just about Sustaining Attention

ERIC Educational Resources Information Center

Roebuck, Hettie; Freigang, Claudia; Barry, Johanna G.

2016-01-01

Purpose: Continuous performance tasks (CPTs) are used to measure individual differences in sustained attention. Many different stimuli have been used as response targets without consideration of their impact on task performance. Here, we compared CPT performance in typically developing adults and children to assess the role of stimulus processing…
Video prompting versus other instruction strategies for persons with Alzheimer's disease.

PubMed

Perilli, Viviana; Lancioni, Giulio E; Hoogeveen, Frans; Caffó, Alessandro; Singh, Nirbhay; O'Reilly, Mark; Sigafoos, Jeff; Cassano, Germana; Oliva, Doretta

2013-06-01

Two studies assessed the effectiveness of video prompting as a strategy to support persons with mild and moderate Alzheimer's disease in performing daily activities. In study I, video prompting was compared to an existing strategy relying on verbal instructions. In study II, video prompting was compared to another existing strategy relying on static pictorial cues. Video prompting and the other strategies were counterbalanced across tasks and participants and compared within alternating treatments designs. Video prompting was effective in all participants. Similarly effective were the other 2 strategies, and only occasional differences between the strategies were reported. Two social validation assessments showed that university psychology students and graduates rated the patients' performance with video prompting more favorably than their performance with the other strategies. Video prompting may be considered a valuable alternative to the other strategies to support daily activities in persons with Alzheimer's disease.
Risk Assessment Methodology for Hazardous Waste Management (1998)

EPA Pesticide Factsheets

A methodology is described for systematically assessing and comparing the risks to human health and the environment of hazardous waste management alternatives. The methodology selects and links appropriate models and techniques for performing the process.
Assessing the Idaho Transportation Department's customer service performance.

DOT National Transportation Integrated Search

2011-10-23

This report assesses customer satisfaction with the Idaho Transportation Department. It also compares and contrasts the results of customer satisfaction surveys conducted for the Idaho Transportation Department with the results from other state trans...
Know thyself: misperceptions of actual performance undermine achievement motivation, future performance, and subjective well-being.

PubMed

Kim, Young-Hoon; Chiu, Chi-Yue; Zou, Zhimin

2010-09-01

Contrary to the popular assumption that self-enhancement improves task motivation and future performance, the authors propose that both inflated and deflated self-assessments of performance are linked to an increased likelihood of practicing self-handicapping and having relatively poor performance in future tasks. Consistent with this proposal, we found that irrespective of the level of actual performance, compared with accurate self-assessment, both inflated and deflated self-assessments of task performance are associated with a greater tendency to (a) practice self-handicapping (Study 1: prefer to work under distraction; Study 2: withhold preparatory effort), (b) perform relatively poorly in a subsequent task (Study 3), (c) have relatively low academic achievement (Study 4), and (d) report a relatively low level of subjective well-being (Study 5). The authors discuss these results in terms of their educational implications. (PsycINFO Database Record (c) 2010 APA, all rights reserved).
Helpers' Self-Assessment Biases Before and after Helping Skills Training.

PubMed

Jaeken, Marine; Zech, Emmanuelle; Brison, Céline; Verhofstadt, Lesley L; Van Broeck, Nady; Mikolajczak, Moïra

2017-01-01

Several studies have shown that therapists are generally biased concerning their performed helping skills, as compared to judges' ratings. As clients' ratings of therapists' performance are better predictors of psychotherapy effectiveness than judges' ratings, this study examined the validity and effectiveness of a helping skills training program at reducing novice helpers' self-enhancement biases concerning their helping skills, in comparison to their clients' ratings. Helping skills were assessed by three objective measures (a knowledge multiple choice test, a video test and a role play), as well as by a self- and peer-reported questionnaire. In addition, some performed helping skills' correlates (relationship quality, session quality, and helpers' therapeutic attitudes) were assessed both by helpers and their simulated helpees. Seventy-two sophomores in psychology participated in this study, 37 being assigned to a 12-h helping skills training program, and 35 to a control group. Helpers were expected to assess the aforementioned performed helping skills and correlates as being better than their helpees' assessments at pretest, thus revealing a self-enhancement bias. At posttest, we expected that trained helpers would objectively exhibit better helping skills than untrained helpers while beginning to underestimate their performance, thus indexing a self-diminishment bias. In contrast, we hypothesized that untrained helpers would continue to overestimate their performance. Our hypotheses were only partly confirmed but results reflected a skilled-unaware pattern among trainees. Trained helpers went either from a pretest overestimation to a posttest equivalence (performed helping skills and performed therapeutic attitudes), or from a pretest equivalence to a posttest underestimation (performed session quality and performed therapeutic relationship), as compared to helpees' ratings.
Results showed that trained helpers improved on all helping skills objective measures and that helpees' perceptions of their performance had increased at posttest. In conclusion, helping skills training leads helpers not only to improve their helping skills but also to have more doubts about their skills, two variables associated with psychotherapy outcome.

Helpers' Self-Assessment Biases Before and after Helping Skills Training

PubMed Central

Jaeken, Marine; Zech, Emmanuelle; Brison, Céline; Verhofstadt, Lesley L.; Van Broeck, Nady; Mikolajczak, Moïra

2017-01-01

Several studies have shown that therapists are generally biased concerning their performed helping skills, as compared to judges' ratings. As clients' ratings of therapists' performance are better predictors of psychotherapy effectiveness than judges' ratings, this study examined the validity and effectiveness of a helping skills training program at reducing novice helpers' self-enhancement biases concerning their helping skills, in comparison to their clients' ratings. Helping skills were assessed by three objective measures (a knowledge multiple choice test, a video test and a role play), as well as by a self- and peer-reported questionnaire. In addition, some performed helping skills' correlates (relationship quality, session quality, and helpers' therapeutic attitudes) were assessed both by helpers and their simulated helpees. Seventy-two sophomores in psychology participated in this study, 37 being assigned to a 12-h helping skills training program, and 35 to a control group. Helpers were expected to assess the aforementioned performed helping skills and correlates as being better than their helpees' assessments at pretest, thus revealing a self-enhancement bias. At posttest, we expected that trained helpers would objectively exhibit better helping skills than untrained helpers while beginning to underestimate their performance, thus indexing a self-diminishment bias. In contrast, we hypothesized that untrained helpers would continue to overestimate their performance. Our hypotheses were only partly confirmed but results reflected a skilled-unaware pattern among trainees. Trained helpers went either from a pretest overestimation to a posttest equivalence (performed helping skills and performed therapeutic attitudes), or from a pretest equivalence to a posttest underestimation (performed session quality and performed therapeutic relationship), as compared to helpees' ratings.
Results showed that trained helpers improved on all helping skills objective measures and that helpees' perceptions of their performance had increased at posttest. In conclusion, helping skills training leads helpers not only to improve their helping skills but also to have more doubts about their skills, two variables associated with psychotherapy outcome. PMID:28861015
Ethnicity and academic performance in UK trained doctors and medical students: systematic review and meta-analysis

PubMed Central

Woolf, Katherine; Potts, Henry W W; McManus, I C

2011-01-01

Objective: To determine whether the ethnicity of UK trained doctors and medical students is related to their academic performance. Design: Systematic review and meta-analysis. Data sources: Online databases PubMed, Scopus, and ERIC; Google and Google Scholar; personal knowledge; backwards and forwards citations; specific searches of medical education journals and medical education conference abstracts. Study selection: The included quantitative reports measured the performance of medical students or UK trained doctors from different ethnic groups in undergraduate or postgraduate assessments. Exclusions were non-UK assessments, only non-UK trained candidates, only self reported assessment data, only dropouts or another non-academic variable, obvious sampling bias, or insufficient details of ethnicity or outcomes. Results: 23 reports comparing the academic performance of medical students and doctors from different ethnic groups were included. Meta-analyses of effects from 22 reports (n=23 742) indicated candidates of “non-white” ethnicity underperformed compared with white candidates (Cohen’s d=−0.42, 95% confidence interval −0.50 to −0.34; P<0.001). Effects in the same direction and of similar magnitude were found in meta-analyses of undergraduate assessments only, postgraduate assessments only, machine marked written assessments only, practical clinical assessments only, assessments with pass/fail outcomes only, assessments with continuous outcomes only, and in a meta-analysis of white v Asian candidates only. Heterogeneity was present in all meta-analyses. Conclusion: Ethnic differences in academic performance are widespread across different medical schools, different types of exam, and in undergraduates and postgraduates. They have persisted for many years and cannot be dismissed as atypical or local problems. We need to recognise this as an issue that probably affects all of UK medical and higher education.
More detailed information to track the problem as well as further research into its causes is required. Such actions are necessary to ensure a fair and just method of training and of assessing current and future doctors. PMID:21385802
Ethnicity and academic performance in UK trained doctors and medical students: systematic review and meta-analysis.

PubMed

Woolf, Katherine; Potts, Henry W W; McManus, I C

2011-03-08

To determine whether the ethnicity of UK trained doctors and medical students is related to their academic performance. Systematic review and meta-analysis. Online databases PubMed, Scopus, and ERIC; Google and Google Scholar; personal knowledge; backwards and forwards citations; specific searches of medical education journals and medical education conference abstracts. The included quantitative reports measured the performance of medical students or UK trained doctors from different ethnic groups in undergraduate or postgraduate assessments. Exclusions were non-UK assessments, only non-UK trained candidates, only self reported assessment data, only dropouts or another non-academic variable, obvious sampling bias, or insufficient details of ethnicity or outcomes. 23 reports comparing the academic performance of medical students and doctors from different ethnic groups were included. Meta-analyses of effects from 22 reports (n = 23,742) indicated candidates of "non-white" ethnicity underperformed compared with white candidates (Cohen's d = -0.42, 95% confidence interval -0.50 to -0.34; P<0.001). Effects in the same direction and of similar magnitude were found in meta-analyses of undergraduate assessments only, postgraduate assessments only, machine marked written assessments only, practical clinical assessments only, assessments with pass/fail outcomes only, assessments with continuous outcomes only, and in a meta-analysis of white v Asian candidates only. Heterogeneity was present in all meta-analyses. Ethnic differences in academic performance are widespread across different medical schools, different types of exam, and in undergraduates and postgraduates. They have persisted for many years and cannot be dismissed as atypical or local problems. We need to recognise this as an issue that probably affects all of UK medical and higher education. More detailed information to track the problem as well as further research into its causes is required.
Such actions are necessary to ensure a fair and just method of training and of assessing current and future doctors.
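As a rough illustration of the effect size quoted in the record above: Cohen's d is the difference between two group means divided by their pooled standard deviation. The sketch below is only an assumption-laden example; the helper name `cohens_d` and the scores are made up for illustration and are not data or code from the review.

```python
import math
import statistics


def cohens_d(group_a, group_b):
    """Standardized mean difference (group_a minus group_b) using the pooled SD."""
    n_a, n_b = len(group_a), len(group_b)
    var_a = statistics.variance(group_a)  # sample variance (n - 1 denominator)
    var_b = statistics.variance(group_b)
    pooled_sd = math.sqrt(((n_a - 1) * var_a + (n_b - 1) * var_b) / (n_a + n_b - 2))
    return (statistics.mean(group_a) - statistics.mean(group_b)) / pooled_sd


# Illustrative exam scores only; these are NOT values from the meta-analysis.
scores_a = [58.0, 62.0, 60.0, 64.0, 56.0]
scores_b = [62.0, 66.0, 64.0, 68.0, 60.0]
d = cohens_d(scores_a, scores_b)
```

A negative d here means the first group scored lower on average than the second, which is the sign convention the abstract uses when it reports d = -0.42.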
Comparing Real-time Versus Delayed Video Assessments for Evaluating ACGME Sub-competency Milestones in Simulated Patient Care Environments

PubMed Central

Stiegler, Marjorie; Hobbs, Gene; Martinelli, Susan M; Zvara, David; Arora, Harendra; Chen, Fei

2018-01-01

Background: Simulation is an effective method for creating objective summative assessments of resident trainees. Real-time assessment (RTA) in simulated patient care environments is logistically challenging, especially when evaluating a large group of residents in multiple simulation scenarios. To date, there is very little data comparing RTA with delayed (hours, days, or weeks later) video-based assessment (DA) for simulation-based assessments of Accreditation Council for Graduate Medical Education (ACGME) sub-competency milestones. We hypothesized that sub-competency milestone evaluation scores obtained from DA, via audio-video recordings, are equivalent to the scores obtained from RTA. Methods: Forty-one anesthesiology residents were evaluated in three separate simulated scenarios, representing different ACGME sub-competency milestones. All scenarios had one faculty member perform RTA and two additional faculty members perform DA. Subsequently, the scores generated by RTA were compared with the average scores generated by DA. Variance component analysis was conducted to assess the amount of variation in scores attributable to residents and raters. Results: Paired t-tests showed no significant difference in scores between RTA and averaged DA for all cases. Cases 1, 2, and 3 showed an intraclass correlation coefficient (ICC) of 0.67, 0.85, and 0.50 for agreement between RTA scores and averaged DA scores, respectively. Analysis of variance of the scores assigned by the three raters showed a small proportion of variance attributable to raters (4% to 15%). Conclusions: The results demonstrate that video-based delayed assessment is as reliable as real-time assessment, as both assessment methods yielded comparable scores. Based on a department’s needs or logistical constraints, our findings support the use of either real-time or delayed video evaluation for assessing milestones in a simulated patient care environment. PMID:29736352
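The comparison described in the record above (one real-time score against the average of two delayed scores per resident, tested with a paired t-test) can be sketched as follows. This is a minimal sketch under stated assumptions: the function name `paired_t_statistic` and all scores are hypothetical, and the study's actual software and data are not given in the abstract.

```python
import math
import statistics


def paired_t_statistic(x, y):
    """t statistic for the mean of the paired differences x[i] - y[i]."""
    diffs = [a - b for a, b in zip(x, y)]
    n = len(diffs)
    mean_diff = statistics.mean(diffs)
    sd_diff = statistics.stdev(diffs)  # sample SD of the differences
    return mean_diff / (sd_diff / math.sqrt(n))


# Made-up milestone scores for five residents (NOT study data).
rta = [3.0, 2.5, 4.0, 3.5, 3.0]            # one real-time rater
da_rater_1 = [3.0, 3.0, 3.5, 3.5, 2.5]     # first delayed video rater
da_rater_2 = [2.5, 3.0, 4.0, 3.0, 3.0]     # second delayed video rater

# Average the two delayed ratings per resident, as the abstract describes.
da_mean = [(a + b) / 2 for a, b in zip(da_rater_1, da_rater_2)]
t = paired_t_statistic(rta, da_mean)
```

The statistic would then be compared against a Student's t distribution with n - 1 degrees of freedom; a small absolute value is what "no significant difference" corresponds to.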
Sensors vs. experts - a performance comparison of sensor-based fall risk assessment vs. conventional assessment in a sample of geriatric patients.

PubMed

Marschollek, Michael; Rehwald, Anja; Wolf, Klaus-Hendrik; Gietzelt, Matthias; Nemitz, Gerhard; zu Schwabedissen, Hubertus Meyer; Schulze, Mareike

2011-06-28

Fall events contribute significantly to mortality, morbidity and costs in our ageing population. In order to identify persons at risk and to target preventive measures, many scores and assessment tools have been developed. These often require expertise and are costly to implement. Recent research investigates the use of wearable inertial sensors to provide objective data on motion features which can be used to assess individual fall risk automatically. So far it is unknown how well this new method performs in comparison with conventional fall risk assessment tools. The aim of our research is to compare the predictive performance of our new sensor-based method with conventional and established methods, based on prospective data. In a first study phase, 119 inpatients of a geriatric clinic took part in motion measurements using a wireless triaxial accelerometer during a Timed Up&Go (TUG) test and a 20 m walk. Furthermore, the St. Thomas Risk Assessment Tool in Falling Elderly Inpatients (STRATIFY) was performed, and the multidisciplinary geriatric care team estimated the patients' fall risk. In a second follow-up phase of the study, 46 of the participants were interviewed after one year, including a fall and activity assessment. The predictive performances of the TUG, the STRATIFY and team scores are compared. Furthermore, two automatically induced logistic regression models based on conventional clinical and assessment data (CONV) as well as sensor data (SENSOR) are matched. Among the risk assessment scores, the geriatric team score (sensitivity 56%, specificity 80%) outperforms STRATIFY and TUG. The induced logistic regression models CONV and SENSOR achieve similar performance values (sensitivity 68%/58%, specificity 74%/78%, AUC 0.74/0.72, +LR 2.64/2.61). Both models are able to identify more persons at risk than the simple scores. 
Sensor-based objective measurements of motion parameters in geriatric patients can be used to assess individual fall risk, and our prediction model's performance matches that of a model based on conventional clinical and assessment data. Sensor-based measurements using a small wearable device may contribute significant information to conventional methods and are feasible in an unsupervised setting. More prospective research is needed to assess the cost-benefit relation of our approach.
Sensors vs. experts - A performance comparison of sensor-based fall risk assessment vs. conventional assessment in a sample of geriatric patients

PubMed Central

2011-01-01

Background: Fall events contribute significantly to mortality, morbidity and costs in our ageing population. In order to identify persons at risk and to target preventive measures, many scores and assessment tools have been developed. These often require expertise and are costly to implement. Recent research investigates the use of wearable inertial sensors to provide objective data on motion features which can be used to assess individual fall risk automatically. So far it is unknown how well this new method performs in comparison with conventional fall risk assessment tools. The aim of our research is to compare the predictive performance of our new sensor-based method with conventional and established methods, based on prospective data. Methods: In a first study phase, 119 inpatients of a geriatric clinic took part in motion measurements using a wireless triaxial accelerometer during a Timed Up&Go (TUG) test and a 20 m walk. Furthermore, the St. Thomas Risk Assessment Tool in Falling Elderly Inpatients (STRATIFY) was performed, and the multidisciplinary geriatric care team estimated the patients' fall risk. In a second follow-up phase of the study, 46 of the participants were interviewed after one year, including a fall and activity assessment. The predictive performances of the TUG, the STRATIFY and team scores are compared. Furthermore, two automatically induced logistic regression models based on conventional clinical and assessment data (CONV) as well as sensor data (SENSOR) are matched. Results: Among the risk assessment scores, the geriatric team score (sensitivity 56%, specificity 80%) outperforms STRATIFY and TUG. The induced logistic regression models CONV and SENSOR achieve similar performance values (sensitivity 68%/58%, specificity 74%/78%, AUC 0.74/0.72, +LR 2.64/2.61). Both models are able to identify more persons at risk than the simple scores.
Conclusions: Sensor-based objective measurements of motion parameters in geriatric patients can be used to assess individual fall risk, and our prediction model's performance matches that of a model based on conventional clinical and assessment data. Sensor-based measurements using a small wearable device may contribute significant information to conventional methods and are feasible in an unsupervised setting. More prospective research is needed to assess the cost-benefit relation of our approach. PMID:21711504
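The positive likelihood ratio (+LR) quoted in the record above follows from sensitivity and specificity as +LR = sensitivity / (1 - specificity). The short sketch below recomputes it from the rounded percentages in the abstract; the function name is hypothetical, and the result is only approximate because the published figures were presumably derived from unrounded counts.

```python
def positive_likelihood_ratio(sensitivity, specificity):
    """+LR = sensitivity / (1 - specificity), with both given as fractions in (0, 1)."""
    if not 0.0 <= sensitivity <= 1.0 or not 0.0 <= specificity < 1.0:
        raise ValueError("sensitivity must be in [0, 1] and specificity in [0, 1)")
    return sensitivity / (1.0 - specificity)


# Rounded sensitivity/specificity pairs quoted in the abstract.
lr_conv = positive_likelihood_ratio(0.68, 0.74)    # CONV model
lr_sensor = positive_likelihood_ratio(0.58, 0.78)  # SENSOR model
```

Both values come out near 2.6, in line with the reported 2.64 and 2.61; small discrepancies in the second decimal are expected when starting from rounded percentages.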
The Doors and People Test: The Effect of Frontal Lobe Lesions on Recall and Recognition Memory Performance

PubMed Central

2016-01-01

Objective: Memory deficits in patients with frontal lobe lesions are most apparent on free recall tasks that require the selection, initiation, and implementation of retrieval strategies. The effect of frontal lesions on recognition memory performance is less clear with some studies reporting recognition memory impairments but others not. The majority of these studies do not directly compare recall and recognition within the same group of frontal patients, assessing only recall or recognition memory performance. Other studies that do compare recall and recognition in the same frontal group do not consider recall or recognition tests that are comparable for difficulty. Recognition memory impairments may not be reported because recognition memory tasks are less demanding. Method: This study aimed to investigate recall and recognition impairments in the same group of 47 frontal patients and 78 healthy controls. The Doors and People Test was administered as a neuropsychological test of memory as it assesses both verbal and visual recall and recognition using subtests that are matched for difficulty. Results: Significant verbal and visual recall and recognition impairments were found in the frontal patients. Conclusion: These results demonstrate that when frontal patients are assessed on recall and recognition memory tests of comparable difficulty, memory impairments are found on both types of episodic memory test. PMID:26752123
The Doors and People Test: The effect of frontal lobe lesions on recall and recognition memory performance.

PubMed

MacPherson, Sarah E; Turner, Martha S; Bozzali, Marco; Cipolotti, Lisa; Shallice, Tim

2016-03-01

Memory deficits in patients with frontal lobe lesions are most apparent on free recall tasks that require the selection, initiation, and implementation of retrieval strategies. The effect of frontal lesions on recognition memory performance is less clear with some studies reporting recognition memory impairments but others not. The majority of these studies do not directly compare recall and recognition within the same group of frontal patients, assessing only recall or recognition memory performance. Other studies that do compare recall and recognition in the same frontal group do not consider recall or recognition tests that are comparable for difficulty. Recognition memory impairments may not be reported because recognition memory tasks are less demanding. This study aimed to investigate recall and recognition impairments in the same group of 47 frontal patients and 78 healthy controls. The Doors and People Test was administered as a neuropsychological test of memory as it assesses both verbal and visual recall and recognition using subtests that are matched for difficulty. Significant verbal and visual recall and recognition impairments were found in the frontal patients. These results demonstrate that when frontal patients are assessed on recall and recognition memory tests of comparable difficulty, memory impairments are found on both types of episodic memory test. (c) 2016 APA, all rights reserved.
Assessment of the dose reduction potential of a model-based iterative reconstruction algorithm using a task-based performance metrology

DOE Office of Scientific and Technical Information (OSTI.GOV)

Samei, Ehsan; Richard, Samuel

2015-01-15

Purpose: Different computed tomography (CT) reconstruction techniques offer different image quality attributes of resolution and noise, challenging the ability to compare their dose reduction potential against each other. The purpose of this study was to evaluate and compare the task-based imaging performance of CT systems to enable the assessment of the dose performance of a model-based iterative reconstruction (MBIR) to that of an adaptive statistical iterative reconstruction (ASIR) and a filtered back projection (FBP) technique. Methods: The ACR CT phantom (model 464) was imaged across a wide range of mA settings on a 64-slice CT scanner (GE Discovery CT750 HD, Waukesha, WI). Based on previous work, the resolution was evaluated in terms of a task-based modulation transfer function (MTF) using a circular-edge technique and images from the contrast inserts located in the ACR phantom. Noise performance was assessed in terms of the noise-power spectrum (NPS) measured from the uniform section of the phantom. The task-based MTF and NPS were combined with a task function to yield a task-based estimate of imaging performance, the detectability index (d′). The detectability index was computed as a function of dose for two imaging tasks corresponding to the detection of a relatively small and a relatively large feature (1.5 and 25 mm, respectively). The performance of MBIR in terms of the d′ was compared with that of ASIR and FBP to assess its dose reduction potential. Results: Results indicated that MBIR exhibits a variable spatial resolution with respect to object contrast and noise while significantly reducing image noise. The NPS measurements for MBIR indicated a noise texture with a low-pass quality compared to the typical midpass noise found in FBP-based CT images. At comparable dose, the d′ for MBIR was higher than those of FBP and ASIR by at least 61% and 19% for the small feature and the large feature tasks, respectively.
Compared to FBP and ASIR, MBIR indicated a 46%–84% dose reduction potential, depending on task, without compromising the modeled detection performance. Conclusions: The presented methodology based on ACR phantom measurements extends current possibilities for the assessment of CT image quality under the complex resolution and noise characteristics exhibited with statistical and iterative reconstruction algorithms. The findings further suggest that MBIR can potentially make better use of the projection data to reduce CT dose by approximately a factor of 2. Alternatively, if the dose is held unchanged, it can improve image quality by different levels for different tasks.
Biological and functional relevance of CASP predictions.

PubMed

Liu, Tianyun; Ish-Shalom, Shirbi; Torng, Wen; Lafita, Aleix; Bock, Christian; Mort, Matthew; Cooper, David N; Bliven, Spencer; Capitani, Guido; Mooney, Sean D; Altman, Russ B

2018-03-01

Our goal is to answer the question: compared with experimental structures, how useful are predicted models for functional annotation? We assessed the functional utility of predicted models by comparing the performances of a suite of methods for functional characterization on the predictions and the experimental structures. We identified 28 sites in 25 protein targets to perform functional assessment. These 28 sites included nine sites with known ligand binding (holo-sites), nine sites that are expected or suggested by experimental authors for small molecule binding (apo-sites), and ten sites containing important motifs, loops, or key residues with important disease-associated mutations. We evaluated the utility of the predictions by comparing their microenvironments to the experimental structures. Overall structural quality correlates with functional utility. However, the best-ranked predictions (global) may not have the best functional quality (local). Our assessment provides an ability to discriminate between predictions with high structural quality. When assessing ligand-binding sites, most prediction methods have higher performance on apo-sites than holo-sites. Some servers show consistently high performance for certain types of functional sites. Finally, many functional sites are associated with protein-protein interaction. We also analyzed biologically relevant features from the protein assemblies of two targets where the active site spanned the protein-protein interface. For the assembly targets, we find that the features in the models are mainly determined by the choice of template. © 2017 The Authors Proteins: Structure, Function and Bioinformatics Published by Wiley Periodicals, Inc.
Percutaneous transhepatic cholangiographic endobiliary forceps biopsy versus endoscopic ultrasound fine needle aspiration for proximal biliary strictures: a single-centre experience.

PubMed

Mohkam, Kayvan; Malik, Yaseen; Derosas, Carlos; Isaac, John; Marudanayagam, Ravi; Mehrzad, Homoyoon; Mirza, Darius F; Muiesan, Paolo; Roberts, Keith J; Sutcliffe, Robert P

2017-06-01

Endoscopic ultrasound fine needle aspiration (EUS-FNA) and percutaneous transhepatic cholangiographic endobiliary forceps biopsy (PTC-EFB) are valid procedures for histological assessment of proximal biliary strictures (PBS), but their performances have never been compared. This study aimed to compare the diagnostic performance of these two techniques. The diagnostic performances of EUS-FNA and PTC-EFB were compared in a retrospective cohort of patients assessed for PBS from 2011 to 2015 at a single tertiary centre. An inverse probability of treatment weighting (IPTW) was performed to adjust for covariate imbalance. A total of 102 EUS-FNAs and 75 PTC-EFBs (performed in 137 patients) were compared. Patients in the PTC-EFB group had higher preoperative bilirubin (243 versus 169 μmol/l, p = 0.005) and a higher incidence of malignancy (87% versus 67%, p = 0.008). Both techniques showed specificity and positive predictive value of 100%, and similar sensitivity (69% versus 75%, p = 0.45), negative predictive value (58% versus 38%, p = 0.15) and accuracy (78% versus 79%, p = 1.00). After IPTW, the diagnostic performance of the two techniques remained similar. Compared to EUS-FNA, PTC-EFB provides similar sensitivity, negative predictive value and accuracy. It should therefore be considered as the preferred tissue-sampling procedure, if biliary drainage is indicated. Crown Copyright © 2017. Published by Elsevier Ltd. All rights reserved.
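The record above mentions inverse probability of treatment weighting (IPTW) to adjust for covariate imbalance between the two procedure groups. In its most common form, each patient is weighted by the inverse of the estimated probability of having received the procedure they actually received. The sketch below assumes that form; the abstract does not say how the propensity scores were estimated or which weight variant was used, and the function name and numbers are hypothetical.

```python
def iptw_weights(treated, propensity):
    """Inverse probability of treatment weights (unstabilized form).

    treated:    1 if the patient received the procedure of interest, else 0.
    propensity: estimated probability of receiving that procedure.
    """
    weights = []
    for t, p in zip(treated, propensity):
        if not 0.0 < p < 1.0:
            raise ValueError("propensity scores must lie strictly between 0 and 1")
        # Treated patients get 1/p; untreated patients get 1/(1 - p).
        weights.append(1.0 / p if t == 1 else 1.0 / (1.0 - p))
    return weights


# Hypothetical propensity scores for four patients (NOT study data).
treated = [1, 1, 0, 0]
propensity = [0.8, 0.5, 0.5, 0.2]
w = iptw_weights(treated, propensity)
```

Patients whose received procedure was unlikely given their covariates get larger weights, which is how the weighted sample is rebalanced before the diagnostic measures are compared.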
The Future Value of Serious Games for Assessment: Where Do We Go Now?

ERIC Educational Resources Information Center

de Klerk, Sebastiaan; Kato, Pamela M.

2017-01-01

Game-based assessments will most likely be an increasing part of testing programs in future generations because they provide promising possibilities for more valid and reliable measurement of students' skills as compared to the traditional methods of assessment like paper-and-pencil tests or performance-based assessments. The current status of…
Government Performance and Results Act: Annual Report to the President and Congress. Fiscal Year 2008

ERIC Educational Resources Information Center

National Council on Disability, 2009

2009-01-01

This report compares actual performance with the projected levels of performance set out in the National Council on Disability's annual performance plan. The findings of this report show a positive link between the allocated resources and NCD's performance. NCD's assessment review showed that it was successful in meeting its goals and achieving…
Assessing Bilingual Dominance.

ERIC Educational Resources Information Center

Flege, James Emil; Mackay, Ian R. A.; Piske, Thorsten

2002-01-01

Used two methods to assess bilingual dominance in four groups of Italian-English bilinguals. Ratios were derived from bilinguals' self-rating of ability to speak and understand Italian compared to English. Dominance in Italian was associated with a relatively high level of performance in Italian (assessed in a translation task) and relatively poor…
Mobility performance in glaucoma.

PubMed

Turano, K A; Rubin, G S; Quigley, H A

1999-11-01

To determine whether glaucoma affects mobility performance and whether there is a relationship between mobility performance and stage of disease as estimated from vision-function measures. The mobility performance of 47 glaucoma subjects was compared with that of 47 normal-vision subjects who were of similar age. Mobility performance was assessed by the time required to complete an established travel path and the number of mobility incidents. The subjective assessment of falling and fear of falling were also compared. Vision function was assessed by measures of visual acuity, contrast sensitivity, monocular automated threshold perimetry, and suprathreshold; binocular visual fields were assessed with the Esterman test. The glaucoma subjects walked on average 10% more slowly than did the normal-vision subjects. The number of people who experienced bumps, stumbles, or orientation problems was almost twice as high in the glaucoma group as in the normal-vision group, but the difference did not reach statistical significance. The difference between groups also was not significant with respect to the number of people who reported falling in the past year (38% for the glaucoma group and 30% for the normal-vision group) or a fear of falling (28% for the glaucoma group and 23% for the normal-vision group). The visual fields assessed with a Humphrey 24-2 test were more highly correlated with walking speed in glaucoma than the visual fields scored by the Esterman scale or than visual acuity or contrast sensitivity. Glaucoma is associated with a modest decrease in mobility performance. Walking speed decreases with severity of the disease as estimated by threshold perimetry.
Statistical issues in the comparison of quantitative imaging biomarker algorithms using pulmonary nodule volume as an example.

PubMed

Obuchowski, Nancy A; Barnhart, Huiman X; Buckler, Andrew J; Pennello, Gene; Wang, Xiao-Feng; Kalpathy-Cramer, Jayashree; Kim, Hyun J Grace; Reeves, Anthony P

2015-02-01

Quantitative imaging biomarkers are being used increasingly in medicine to diagnose and monitor patients' disease. The computer algorithms that measure quantitative imaging biomarkers have different technical performance characteristics. In this paper we illustrate the appropriate statistical methods for assessing and comparing the bias, precision, and agreement of computer algorithms. We use data from three studies of pulmonary nodules. The first study is a small phantom study used to illustrate metrics for assessing repeatability. The second study is a large phantom study allowing assessment of four algorithms' bias and reproducibility for measuring tumor volume and the change in tumor volume. The third study is a small clinical study of patients whose tumors were measured on two occasions. This study allows a direct assessment of six algorithms' performance for measuring tumor change. With these three examples we compare and contrast study designs and performance metrics, and we illustrate the advantages and limitations of various common statistical methods for quantitative imaging biomarker studies. © The Author(s) 2014 Reprints and permissions: sagepub.co.uk/journalsPermissions.nav.
Performance Assessment of Kernel Density Clustering for Gene Expression Profile Data

PubMed Central

Zeng, Beiyan; Chen, Yiping P.; Smith, Oscar H.

2003-01-01

Kernel density smoothing techniques have been used in classification or supervised learning of gene expression profile (GEP) data, but their applications to clustering or unsupervised learning of those data have not been explored and assessed. Here we report a kernel density clustering method for analysing GEP data and compare its performance with the three most widely-used clustering methods: hierarchical clustering, K-means clustering, and multivariate mixture model-based clustering. Using several methods to measure agreement, between-cluster isolation, and within-cluster coherence, such as the Adjusted Rand Index, the Pseudo F test, the r2 test, and the profile plot, we have assessed the effectiveness of kernel density clustering for recovering clusters, and its robustness against noise on clustering both simulated and real GEP data. Our results show that the kernel density clustering method has excellent performance in recovering clusters from simulated data and in grouping large real expression profile data sets into compact and well-isolated clusters, and that it is the most robust clustering method for analysing noisy expression profile data compared to the other three methods assessed. PMID:18629292
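As a rough illustration of the agreement measure named in the abstract above, the following sketch computes the Adjusted Rand Index from two label assignments using the standard pair-counting formula. The function name and the example labels are hypothetical; they are not taken from the study, which does not describe its implementation.

```python
from collections import Counter
from math import comb


def adjusted_rand_index(labels_true, labels_pred):
    """Adjusted Rand Index between two partitions of the same items."""
    if len(labels_true) != len(labels_pred):
        raise ValueError("label sequences must have equal length")
    n = len(labels_true)
    total_pairs = comb(n, 2)
    if total_pairs == 0:
        return 1.0  # zero or one item: the partitions trivially agree

    # Contingency counts: cells, row margins, column margins.
    cell_counts = Counter(zip(labels_true, labels_pred))
    row_counts = Counter(labels_true)
    col_counts = Counter(labels_pred)

    sum_cells = sum(comb(c, 2) for c in cell_counts.values())
    sum_rows = sum(comb(c, 2) for c in row_counts.values())
    sum_cols = sum(comb(c, 2) for c in col_counts.values())

    expected = sum_rows * sum_cols / total_pairs
    max_index = (sum_rows + sum_cols) / 2
    if max_index == expected:
        return 1.0  # degenerate case: both partitions are trivial
    return (sum_cells - expected) / (max_index - expected)


# Hypothetical labels for six items (not data from the study).
reference = [0, 0, 0, 1, 1, 1]
recovered = [0, 0, 1, 1, 2, 2]
print(adjusted_rand_index(reference, recovered))
```

A value of 1 means the two partitions agree exactly (up to relabelling), while values near 0 are what chance assignment would give.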
Assessment of fatty degeneration of the gluteal muscles in patients with THA using MRI: reliability and accuracy of the Goutallier and quartile classification systems.

PubMed

Engelken, Florian; Wassilew, Georgi I; Köhlitz, Torsten; Brockhaus, Sebastian; Hamm, Bernd; Perka, Carsten; Diederichs, Gerd

2014-01-01

The purpose of this study was to quantify the performance of the Goutallier classification for assessing fatty degeneration of the gluteus muscles from magnetic resonance (MR) images and to compare its performance to a newly proposed system. Eighty-four hips with clinical signs of gluteal insufficiency and 50 hips from asymptomatic controls were analyzed using a standard classification system (Goutallier) and a new scoring system (Quartile). Interobserver reliability and intraobserver repeatability were determined, and accuracy was assessed by comparing readers' scores with quantitative estimates of the proportion of intramuscular fat based on MR signal intensities (gold standard). The existing Goutallier classification system and the new Quartile system performed equally well in assessing fatty degeneration of the gluteus muscles, both showing excellent levels of interrater and intrarater agreement. While the Goutallier classification system has the advantage of being widely known, the benefit of the Quartile system is that it is based on more clearly defined grades of fatty degeneration. Copyright © 2014 Elsevier Inc. All rights reserved.
Evaluation of background parenchymal enhancement on breast MRI: a systematic review

PubMed Central

Signori, Alessio; Valdora, Francesca; Rossi, Federica; Calabrese, Massimo; Durando, Manuela; Mariscotto, Giovanna; Tagliafico, Alberto

2017-01-01

Objective: To perform a systematic review of the methods used for background parenchymal enhancement (BPE) evaluation on breast MRI. Methods: Studies dealing with BPE assessment on breast MRI were retrieved from major medical libraries independently by four reviewers up to 6 October 2015. The keywords used for database searching are “background parenchymal enhancement”, “parenchymal enhancement”, “MRI” and “breast”. The studies were included if qualitative and/or quantitative methods for BPE assessment were described. Results: Of the 420 studies identified, a total of 52 articles were included in the systematic review. 28 studies performed only a qualitative assessment of BPE, 13 studies performed only a quantitative assessment and 11 studies performed both qualitative and quantitative assessments. A wide heterogeneity was found in the MRI sequences and in the quantitative methods used for BPE assessment. Conclusion: A wide variability exists in the quantitative evaluation of BPE on breast MRI. More studies focused on a reliable and comparable method for quantitative BPE assessment are needed. Advances in knowledge: More studies focused on a quantitative BPE assessment are needed. PMID:27925480
Method of assessing heterogeneity in images

DOEpatents

Jacob, Richard E.; Carson, James P.

2016-08-23

A method of assessing heterogeneity in images is disclosed. 3D images of an object are acquired. The acquired images may be filtered and masked. Iterative decomposition is performed on the masked images to obtain image subdivisions that are relatively homogeneous. Comparative analysis, such as variogram analysis or correlogram analysis, is performed of the decomposed images to determine spatial relationships between regions of the images that are relatively homogeneous.

Quality of Care for PTSD and Depression in the Military Health System

DTIC Science & Technology

evaluate the receipt of recommended assessments and treatments. These measures draw on multiple data sources including administrative encounter data...services are effective in reducing symptoms. When comparing performance between 2012-2013 and 2013-2014, most measures demonstrated slight improvement...in 2013-2014 for over 38,000 active-component service members with PTSD or depression. The assessment includes performance on 30 quality measures to
Assessing understanding of relative clauses: a comparison of multiple-choice comprehension versus sentence repetition.

PubMed

Frizelle, Pauline; O'Neill, Clodagh; Bishop, Dorothy V M

2017-11-01

Although sentence repetition is considered a reliable measure of children's grammatical knowledge, few studies have directly compared children's sentence repetition performance with their understanding of grammatical structures. The current study aimed to compare children's performance on these two assessment measures, using a multiple-choice picture-matching sentence comprehension task and a sentence repetition task. Thirty-three typically developing children completed both assessments, which included relative clauses representing a range of syntactic roles. Results revealed a similar order of difficulty of constructions on both measures but little agreement between them when evaluating individual differences. Interestingly, repetition was the easier of the two measures, with children showing the ability to repeat sentences they did not understand. This discrepancy is primarily attributed to the additional processing load resulting from the design of multiple-choice comprehension tasks, and highlights the fact that these assessments are invoking skills beyond those of linguistic competence.
Medical students' clinical performance in general practice - Triangulating assessments from patients, teachers and students.

PubMed

Braend, Anja Maria; Gran, Sarah Frandsen; Frich, Jan C; Lindbaek, Morten

2010-01-01

Formative assessment of medical students' clinical performance during general practice clerkship is necessary to learn consultation skills. Our aim was to triangulate feedback using patient questionnaires, written self-assessment and teachers' observation-based assessment, and to describe the content of this feedback. We developed StudentPEP, a 15-item version of EUROPEP, a tool for measuring patients' evaluation of quality in general practice. The teacher and student forms consisted of five StudentPEP-items and open-ended questions asking for approval and improvement needed on four aspects. Quantitative scores were analyzed statistically. Free-text comments were analyzed and categorized into 'specific and concrete' versus 'general and unspecific'. One hundred seventy-three students returned data from 2643 consultations. Mean patients' scores for 15 items were 4.3-4.8 on a five-point Likert scale. Mean teacher scores were 4.4 on five items, while students' mean self-assessments were 3.6-3.8. In an analysis of 380 consultations, students were more specific and concrete in their self-evaluation compared with teachers (p < 0.01). Patients scored students' performance high compared with students' self-assessments. Teachers' scores were in accordance with patients' scores. Teachers' written evaluations of students were often general. There is a potential for improving teachers' feedback in terms of more specific and concrete comments.
Quality assurance of lower limb venous duplex scans performed by vascular surgeons.

PubMed

Kordowicz, A; Ferguson, G; Salaman, R; Onwudike, M

2015-02-01

Duplex scanning is the gold standard for investigating venous reflux; increasingly surgeons perform these scans themselves. There has been no data published analysing the accuracy of Duplex scans performed by vascular surgeons. We aimed to evaluate an objective method of comparing the results of lower limb Duplex scans performed by one consultant vascular surgeon with those performed by a vascular technologist. We assessed 100 legs with symptomatic varicose veins. Each patient underwent two lower limb venous Duplex scans; one performed by a consultant vascular surgeon and one by a vascular technologist. Scan results were randomised and sent to two consultant vascular surgeons blinded to the identity and experience of the sonographer. They were asked to recommend treatment. A kappa score was calculated in each case to assess the level of agreement between the scans performed by the consultant and the technologist. Eighty-one patients were studied (53 females). The kappa score for assessor 1 was 0.60 (95%CI:0.44-0.75) and for assessor 2 was 0.62 (95%CI:0.48-0.75). Kappa scores >0.60 represent a substantial strength of agreement. Duplex scans performed by this surgeon were comparable to those performed by a vascular technologist. It is possible to quality-assure duplex performed by vascular surgeons without directly observing the scanning process or reviewing digitally recorded images. We propose standardisation of training, assessment and quality assurance for vascular surgeons wishing to perform ultrasound scans.
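The agreement statistic reported above can be sketched as follows, assuming the unweighted form of Cohen's kappa for two sets of categorical recommendations; the abstract does not say which variant was used. The function name and the treatment labels are hypothetical, not data from the study.

```python
from collections import Counter


def cohen_kappa(ratings_a, ratings_b):
    """Unweighted Cohen's kappa for two equal-length label sequences."""
    if len(ratings_a) != len(ratings_b) or not ratings_a:
        raise ValueError("need two non-empty sequences of equal length")
    n = len(ratings_a)

    # Proportion of cases where the two ratings agree.
    observed = sum(a == b for a, b in zip(ratings_a, ratings_b)) / n

    # Agreement expected by chance, from each rater's marginal frequencies.
    count_a = Counter(ratings_a)
    count_b = Counter(ratings_b)
    expected = sum(count_a[c] * count_b[c] for c in count_a) / (n * n)

    if expected == 1.0:
        return 1.0  # both raters used one identical category throughout
    return (observed - expected) / (1.0 - expected)


# Hypothetical recommendations based on two scans of the same legs.
from_surgeon_scan = ["treat", "treat", "observe", "treat", "observe", "observe"]
from_technologist_scan = ["treat", "observe", "observe", "treat", "observe", "treat"]
print(cohen_kappa(from_surgeon_scan, from_technologist_scan))
```

Kappa corrects raw agreement for the agreement expected by chance, which is why it is preferred over simple percent agreement when one category dominates.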
Comparative assessment of three standardized robotic surgery training methods.

PubMed

Hung, Andrew J; Jayaratna, Isuru S; Teruya, Kara; Desai, Mihir M; Gill, Inderbir S; Goh, Alvin C

2013-10-01

To evaluate three standardized robotic surgery training methods, inanimate, virtual reality and in vivo, for their construct validity. To explore the concept of cross-method validity, where the relative performance of each method is compared. Robotic surgical skills were prospectively assessed in 49 participating surgeons who were classified as follows: 'novice/trainee': urology residents, previous experience <30 cases (n = 38) and 'experts': faculty surgeons, previous experience ≥30 cases (n = 11). Three standardized, validated training methods were used: (i) structured inanimate tasks; (ii) virtual reality exercises on the da Vinci Skills Simulator (Intuitive Surgical, Sunnyvale, CA, USA); and (iii) a standardized robotic surgical task in a live porcine model with performance graded by the Global Evaluative Assessment of Robotic Skills (GEARS) tool. A Kruskal-Wallis test was used to evaluate performance differences between novices and experts (construct validity). Spearman's correlation coefficient (ρ) was used to measure the association of performance across inanimate, simulation and in vivo methods (cross-method validity). Novice and expert surgeons had previously performed a median (range) of 0 (0-20) and 300 (30-2000) robotic cases, respectively (P < 0.001). Construct validity: experts consistently outperformed residents with all three methods (P < 0.001). Cross-method validity: overall performance of inanimate tasks significantly correlated with virtual reality robotic performance (ρ = -0.7, P < 0.001) and in vivo robotic performance based on GEARS (ρ = -0.8, P < 0.0001). Virtual reality performance and in vivo tissue performance were also found to be strongly correlated (ρ = 0.6, P < 0.001). We propose the novel concept of cross-method validity, which may provide a method of evaluating the relative value of various forms of skills education and assessment. We externally confirmed the construct validity of each featured training tool. © 2013 BJU International.
Comparing Cognitive Models of Domain Mastery and Task Performance in Algebra: Validity Evidence for a State Assessment

ERIC Educational Resources Information Center

Warner, Zachary B.

2013-01-01

This study compared an expert-based cognitive model of domain mastery with student-based cognitive models of task performance for Integrated Algebra. Interpretations of student test results are limited by experts' hypotheses of how students interact with the items. In reality, the cognitive processes that students use to solve each item may be…
Augmented Affordances Support Learning: Comparing the Instructional Effects of the Augmented Reality Sandbox and Conventional Maps to Teach Topographic Map Skills

ERIC Educational Resources Information Center

Richardson, R. Thomas; Sammons, Dotty; Del-Parte, Donna

2018-01-01

This study compared learning performance during and following AR and non-AR topographic map instruction and practice. Two-way ANOVA testing indicated no significant differences on a posttest assessment between map type and spatial ability. Prior learning activity results revealed a significant performance difference between AR and non-AR treatment…
Personality and Occupational Stress in Elite Performers.

ERIC Educational Resources Information Center

Hamilton, Linda H.; Kella, John J.

Performing Arts Psychology has recently emerged as a unique subspecialty comparable to that of Sports Psychology. Attention has been focused on problems common to all performers (e.g., performance anxiety); however, the various stresses within each art form often remain hidden from view. To assess the psychological aspects of different art forms,…
A Performance Assessment of Eight Low-Boom High-Speed Civil Transport Concepts

NASA Technical Reports Server (NTRS)

Baize, Daniel G.; McElroy, Marcus O.; Fenbert, James A.; Coen, Peter G.; Ozoroski, Lori P.; Domack, Chris S.; Needleman, Kathy E.; Geiselhart, Karl A.

1999-01-01

A performance assessment of eight low-boom high speed civil transport (HSCT) configurations and a reference HSCT configuration has been performed. Although each of the configurations was designed with different engine concepts, for consistency, a year 2005 technology, 0.4 bypass ratio mixed-flow turbofan (MFTF) engine was used for all of the performance assessments. Therefore, all original configuration nacelles were replaced by a year 2005 MFTF nacelle design which corresponds to the engine deck utilized. The engine thrust level was optimized to minimize vehicle takeoff gross weight. To preserve the configuration's sonic-boom shaping, wing area was not optimized or altered from its original design value. Performance sizings were completed when possible for takeoff balanced field lengths of 11,000 ft and 12,000 ft, not considering FAR Part 36 Stage III noise compliance. Additionally, an arbitrary sizing with thrust-to-weight ratio equal to 0.25 was performed, enabling performance levels to be compared independent of takeoff characteristics. The low-boom configurations analyzed included designs from the Boeing Commercial Airplane Group, Douglas Aircraft Company, Ames Research Center, and Langley Research Center. This paper discusses the technology level assumptions, mission profile, analysis methodologies, and the results of the assessment. The results include maximum lift-to-drag ratios, total fuel consumption, number of passengers, optimum engine sizing plots, takeoff performance, mission block time, and takeoff gross weight for all configurations. Results from the low-boom configurations are also compared with a non-low-boom reference configuration. Configuration dependent advantages or deficiencies are discussed as warranted.
The effects of cognitive rehabilitation on Alzheimer's dementia patients' cognitive assessment reference diagnosis system performance based on level of cognitive functioning.

PubMed

Hwang, Jung-Ha; Cha, Hyun-Gyu; Cho, Hyuk-Shin

2015-09-01

[Purpose] The purpose of this study is to apply cognitive rehabilitation according to Alzheimer's disease (AD) patients' level of cognitive functioning to compare changes in Cognitive Assessment Reference Diagnosis System performance and present standards for effective intervention. [Subjects] Subjects were 30 inpatients diagnosed with AD. Subjects were grouped by Clinical Dementia Rating (CDR) class (CDR-0.5, CDR-1, or CDR-2, n = 10 per group), which is based on level of cognitive functioning, and cognitive rehabilitation was applied for 50 minutes per day, five days per week, for four weeks. [Methods] After cognitive rehabilitation intervention, CARDS tests were conducted to evaluate memory. [Results] Bonferroni tests comparing the three groups revealed that the CDR-0.5 and CDR-1 groups showed significant increases in Delayed 10 word-list, Delayed 10 object-list, Recognition 10 object, and Recent memory performance compared to the CDR-2 group. In addition, the CDR-0.5 group showed significant decreases in Recognition 10 word performance compared to the CDR-1 group. [Conclusion] After cognitive rehabilitation, CDR-0.5 and CDR-1 subjects showed significantly greater memory improvements than CDR-2 subjects. Moreover, cognitive rehabilitation was not effective for CDR-2 subjects.
Contrast-Enhanced Spectral Mammography is Comparable to MRI in the Assessment of Residual Breast Cancer Following Neoadjuvant Systemic Therapy.

PubMed

Patel, Bhavika K; Hilal, Talal; Covington, Matthew; Zhang, Nan; Kosiorek, Heidi E; Lobbes, Marc; Northfelt, Donald W; Pockaj, Barbara A

2018-05-01

To evaluate the performance of contrast-enhanced spectral mammography (CESM) compared to MRI in the assessment of tumor response in breast cancer patients undergoing neoadjuvant systemic therapy (NST). The institutional review board approved this study. From September 2014 to June 2017, we identified patients with pathologically confirmed invasive breast cancer who underwent NST. All patients had both CESM and MRI performed pre- and post-NST with pathological assessment after surgical management. Size of residual malignancy on post-NST CESM and MRI was compared with surgical pathology. Lin concordance and Pearson correlation coefficient were used to assess agreement. Bland-Altman plots were used to visualize the differences between tumor size on imaging and pathology. Sixty-five patients were identified. Mean age was 52.7 (range 30-76) years. Type of NST included chemotherapy in 53 (82%) and endocrine therapy in 12 (18%). Mean tumor size after NST was 14.6 (range 0-105) mm for CESM and 14.2 (range 0-75) mm for MRI compared with 19.6 (range 0-100) mm on final surgical pathology. Equivalence tests demonstrated that mean tumor size measured by CESM (p = 0.009) or by MRI (p = 0.01) was equivalent to the mean tumor size measured by pathology within a -1 to 1 cm range. Comparing CESM versus MRI for assessment of complete response, the sensitivity was 95% versus 95%, specificity 66.7% versus 68.9%, positive predictive value 55.9% versus 57.6%, and negative predictive value 96.7% versus 96.9% respectively. CESM was comparable to MRI in assessing residual malignancy after completion of NST.
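For readers less familiar with the four diagnostic measures quoted above, this minimal sketch shows how each follows from a 2x2 table of test result against reference standard. The function name and the counts are hypothetical; the abstract reports only percentages, so the underlying counts are not reproduced here.

```python
def diagnostic_metrics(tp, fp, fn, tn):
    """Sensitivity, specificity, PPV and NPV from 2x2 confusion counts."""

    def ratio(numerator, denominator):
        # Undefined when a margin is empty; report NaN rather than raise.
        return numerator / denominator if denominator else float("nan")

    return {
        "sensitivity": ratio(tp, tp + fn),  # true positives among reference positives
        "specificity": ratio(tn, tn + fp),  # true negatives among reference negatives
        "ppv": ratio(tp, tp + fp),          # reference positives among test positives
        "npv": ratio(tn, tn + fn),          # reference negatives among test negatives
    }


# Hypothetical counts, chosen only to illustrate the arithmetic.
metrics = diagnostic_metrics(tp=40, fp=5, fn=10, tn=45)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

Sensitivity and specificity describe the test itself, whereas the predictive values also depend on how common the condition is in the sample.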
Validity and test-retest reliability of an at-work production loss instrument.

PubMed

Aboagye, E; Jensen, I; Bergström, G; Hagberg, J; Axén, I; Lohela-Karlsson, M

2016-07-01

Besides causing ill health, a poor work environment may contribute to production loss. Production loss assessment instruments emphasize health-related consequences but there is no instrument to measure reduced work performance related to the work environment. To examine convergent validity and test-retest reliability of health-related production loss (HRPL) and work environment-related production loss (WRPL) against a valid comparable instrument, the Health and Work Performance Questionnaire (HPQ). Cross-sectional study of employees, not on sick leave, who were asked to self-rate their work performance and production losses. Using the Pearson correlation and Bland and Altman's Test of Agreement, convergent validity was examined. Subgroup analyses were performed for employees recording problem-specific reduced work performance. Consistency of pairs of HRPL and WRPL for samples responding to both assessments was expressed using Intraclass Correlation Coefficient (ICC) and tests of repeatability. A total of 88 employees participated and 44 responded to both assessments. Test of agreement between measurements estimates a mean difference of 0.34 for HRPL and -0.03 for WRPL compared with work performance. This indicates that the production loss questions are valid and moderately associated with work performance for the total sample and subgroups. ICC for paired HRPL assessments was 0.90 and 0.91 for WRPL, i.e. the test-retest reliability was good and suggests stability in the instrument. HRPL and WRPL can be used to measure production loss due to health-related and work environment-related problems. These results may have implications for advancing methods of assessing production loss, which represents an important cost to employers. © The Author 2016. Published by Oxford University Press on behalf of the Society of Occupational Medicine. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
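The Bland-Altman style agreement check mentioned above can be sketched as a mean difference (bias) with 95% limits of agreement, assuming the conventional bias ± 1.96 × SD of the paired differences. The function name and the paired scores are hypothetical, not values from the study.

```python
from statistics import mean, stdev


def bland_altman(method_a, method_b):
    """Return (bias, lower limit, upper limit) for paired measurements."""
    if len(method_a) != len(method_b) or len(method_a) < 2:
        raise ValueError("need at least two paired measurements")
    differences = [a - b for a, b in zip(method_a, method_b)]
    bias = mean(differences)             # systematic difference between methods
    spread = 1.96 * stdev(differences)   # half-width of the 95% limits of agreement
    return bias, bias - spread, bias + spread


# Hypothetical paired self-ratings on a 0-10 scale.
instrument_scores = [7.0, 6.5, 8.0, 5.5, 9.0]
reference_scores = [6.5, 6.0, 8.5, 5.0, 8.5]
bias, lower, upper = bland_altman(instrument_scores, reference_scores)
print(f"bias={bias:.2f}, limits of agreement=({lower:.2f}, {upper:.2f})")
```

A bias near zero with narrow limits suggests the two instruments can be used interchangeably; how narrow is narrow enough is a judgment about the application, not a statistical result.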
Life-Cycle Assessment of Cookstove Fuels in India and China ...

EPA Pesticide Factsheets

A life cycle assessment (LCA) was conducted to compare the environmental footprint of current and possible fuels used for cooking within China and India. Current fuel mix profiles are compared to scenarios of projected differences in and/or cleaner cooking fuels. Results are reported for a suite of relevant life cycle impact assessment indicators: global climate change, energy demand, fossil depletion, water consumption, particulate matter formation, acidification, eutrophication and photochemical smog formation. Traditional fuels demonstrate notably poor relative performance in particulate matter formation, photochemical oxidant formation, freshwater eutrophication, and black carbon emissions. Most fuels demonstrate trade-offs between impact categories. Stove efficiency is found to be a crucial variable determining environmental performance across all impact categories. The study shows that electricity and many of the processed fuels, while yielding emission reductions in homes at the point of use, transfer many of those emissions upstream into the processing and distribution life cycle stage. The purpose was to conduct an LCA study of the cookstove fuels being used in India and China to determine how fuels and stoves compare, based on a holistic assessment that considers the LCA environmental tradeoffs.
Computed Tomography Window Blending: Feasibility in Thoracic Trauma.

PubMed

Mandell, Jacob C; Wortman, Jeremy R; Rocha, Tatiana C; Folio, Les R; Andriole, Katherine P; Khurana, Bharti

2018-02-07

This study aims to demonstrate the feasibility of processing computed tomography (CT) images with a custom window blending algorithm that combines soft-tissue, bone, and lung window settings into a single image; to compare the time for interpretation of chest CT for thoracic trauma with window blending and conventional window settings; and to assess diagnostic performance of both techniques. Adobe Photoshop was scripted to process axial DICOM images from retrospective contrast-enhanced chest CTs performed for trauma with a window-blending algorithm. Two emergency radiologists independently interpreted the axial images from 103 chest CTs with both blended and conventional windows. Interpretation time and diagnostic performance were compared with Wilcoxon signed-rank test and McNemar test, respectively. Agreement with Nexus CT Chest injury severity was assessed with the weighted kappa statistic. A total of 13,295 images were processed without error. Interpretation was faster with window blending, resulting in a 20.3% time saving (P < .001), with no difference in diagnostic performance, within the power of the study to detect a difference in sensitivity of 5% as determined by post hoc power analysis. The sensitivity of the window-blended cases was 82.7%, compared to 81.6% for conventional windows. The specificity of the window-blended cases was 93.1%, compared to 90.5% for conventional windows. All injuries of major clinical significance (per Nexus CT Chest criteria) were correctly identified in all reading sessions, and all negative cases were correctly classified. All readers demonstrated near-perfect agreement with injury severity classification with both window settings. In this pilot study utilizing retrospective data, window blending allows faster preliminary interpretation of axial chest CT performed for trauma, with no significant difference in diagnostic performance compared to conventional window settings. Future studies would be required to assess the utility of window blending in clinical practice. Copyright © 2018 The Association of University Radiologists. All rights reserved.
The effect of badminton-specific exercise on badminton short-serve performance in competition and practice climates.

PubMed

Duncan, Michael J; Chan, Cheryl K Y; Clarke, Neil D; Cox, Martin; Smith, Mike

2017-03-01

This study examined the effects of changes in physiological and psychological arousal on badminton short-serve performance in competitive and practice climates. Twenty competitive badminton players (10 males and 10 females) volunteered to participate in the study following ethics approval. After familiarisation, badminton short-serve performance was measured at rest, mid-way through and at the end of a badminton-specific exercise protocol in two conditions; competition vs. practice. Ratings of cognitive and somatic anxiety were assessed at three time points prior to badminton short-serve performance using the Mental Readiness Form 3. Heart rate and rating of perceived exertion (RPE) were assessed during the exercise protocol. Results indicated that better short-serve performance was evident in practice compared to competition (P = .034). RPE values were significantly higher in the competition condition compared to practice (P = .007). Cognitive anxiety intensity was significantly lower post-exercise in the practice condition compared to competition (P = .001). Cognitive anxiety direction showed greater debilitation post-exercise in the competition condition compared to practice (P = .01). Somatic anxiety intensity increased from pre-, to mid- to post-exercise (P = .001) irrespective of condition. This study suggests that badminton serve performance is negatively affected when physiological arousal, via badminton-specific exercise, and cognitive anxiety, via perceived competition, are high.
Catching-up: Children with developmental coordination disorder compared to healthy children before and after sensorimotor therapy

PubMed Central

2017-01-01

The aims of the present study were to (a) compare healthy children in terms of sensorimotor maturity to untreated children diagnosed with developmental coordination disorder (DCD) and (b) compare healthy children to diagnosed children following completed treatment with sensorimotor therapy. Participants were 298 children, 196 boys and 102 girls, distributed into a Norm group of healthy children (n = 99) and a group of children diagnosed with DCD (n = 199) with a total mean age of 8.77 years (SD = 2.88). Participants in both groups were assessed on instruments aimed to detect sensorimotor deviations. The children in the DCD group completed, during on average 36 months, sensorimotor therapy which comprised stereotypical fetal- and infant movements, vestibular stimulation, tactile stimulation, auditory stimulation, complementary play exercises, gross motor milestones, and sports-related gross motor skills. At the final visit a full assessment was once more performed. Results showed that the Norm group performed better on all sensorimotor tests as compared to the untreated children from the DCD group, with the exception of an audiometric test where both groups performed at the same level. Girls performed better on tests assessing proprioceptive and balance abilities. Results also showed, after controls for natural maturing effects, that the children from the DCD group after sensorimotor therapy did catch up with the healthy children. The concept of “catching-up” is used within developmental medicine but has not earlier been documented with regard to children and youth in connection with DCD. PMID:29020061
Effect of Industry Sponsorship on Dental Restorative Trials.

PubMed

Schwendicke, F; Tu, Y-K; Blunck, U; Paris, S; Göstemeyer, G

2016-01-01

Industry sponsorship was found to potentially introduce bias into clinical trials. We assessed the effects of industry sponsorship on the design, comparator choice, and findings of randomized controlled trials on dental restorative materials. A systematic review was performed via MEDLINE, CENTRAL, and EMBASE. Randomized trials on dental restorative and adhesive materials published 2005 to 2015 were included. The design of sponsored and nonsponsored trials was compared statistically (risk of bias, treatment indication, setting, transferability, sample size). Comparator choice and network geometry of sponsored and nonsponsored trials were assessed via network analysis. Material performance rankings in different trial types were estimated via Bayesian network meta-analysis. Overall, 114 studies were included (15,321 restorations in 5,232 patients). We found 21 and 41 (18% and 36%) trials being clearly or possibly industry sponsored, respectively. Trial design of sponsored and nonsponsored trials did not significantly differ for most assessed items. Sponsored trials evaluated restorations of load-bearing cavities significantly more often than nonsponsored trials, had longer follow-up periods, and showed significantly increased risk of detection bias. Regardless of sponsorship status, comparisons were mainly performed within material classes. The proportion of trials comparing against gold standard restorative or adhesive materials did not differ between trial types. If ranked for performance according to the need to re-treat (best: least re-treatments), most material combinations were ranked similarly in sponsored and nonsponsored trials. The effect of industry sponsorship on dental restorative trials seems limited. © International & American Associations for Dental Research 2015.
A comparison of online versus face-to-face teaching delivery in statistics instruction for undergraduate health science students.

PubMed

Lu, Fletcher; Lemonde, Manon

2013-12-01

The objective of this study was to assess if online teaching delivery produces comparable student test performance as the traditional face-to-face approach irrespective of academic aptitude. This study involves a quasi-experimental comparison of student performance in an undergraduate health science statistics course partitioned in two ways. The first partition involves one group of students taught with a traditional face-to-face classroom approach and the other through a completely online instructional approach. The second partition of the subjects categorized the academic aptitude of the students into groups of higher and lower academically performing based on their assignment grades during the course. Controls that were placed on the study to reduce the possibility of confounding variables were: the same instructor taught both groups covering the same subject information, using the same assessment methods and delivered over the same period of time. The results of this study indicate that online teaching delivery is as effective as a traditional face-to-face approach in terms of producing comparable student test performance but only if the student is academically higher performing. For academically lower performing students, the online delivery method produced significantly poorer student test results compared to those lower performing students taught in a traditional face-to-face environment.
Performance assessment of static lead-lag feedforward controllers for disturbance rejection in PID control loops.

PubMed

Yu, Zhenpeng; Wang, Jiandong

2016-09-01

This paper assesses the performance of feedforward controllers for disturbance rejection in univariate feedback plus feedforward control loops. The structures of feedback and feedforward controllers are confined to proportional-integral-derivative and static-lead-lag forms, respectively, and the effects of feedback controllers are not considered. The integral squared error (ISE) and total squared variation (TSV) are used as performance metrics. A performance index is formulated by comparing the current ISE and TSV metrics to their own lower bounds as performance benchmarks. A controller performance assessment (CPA) method is proposed to calculate the performance index from measurements. The proposed CPA method resolves two critical limitations in the existing CPA methods, in order to be consistent with industrial scenarios. Numerical and experimental examples illustrate the effectiveness of the obtained results. Copyright © 2016 ISA. Published by Elsevier Ltd. All rights reserved.
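
As a rough illustration of the two metrics named in this abstract, the sketch below computes an ISE from a sampled control error and a TSV from a sampled controller output, then forms a ratio-style index against a benchmark. The discretizations (rectangle-rule integration, sum of squared sample-to-sample changes), the function names, the sample data, and the form of the index are all assumptions made for illustration; the paper's exact definitions may differ.

```python
def integral_squared_error(error, dt):
    """Rectangle-rule approximation of the integral of e(t)^2 over the record."""
    return sum(e * e for e in error) * dt


def total_squared_variation(u):
    """Sum of squared sample-to-sample changes in the controller output."""
    return sum((b - a) ** 2 for a, b in zip(u, u[1:]))


def performance_index(current, lower_bound):
    """Hypothetical index: benchmark divided by the current metric value.

    Values near 1 mean the loop is close to the benchmark; smaller values
    mean there is room for improvement.
    """
    if current <= 0:
        raise ValueError("current metric must be positive")
    return lower_bound / current


# Made-up sampled signals, for illustration only.
error = [0.0, 0.5, 0.3, 0.1, 0.0]   # control error samples
u = [0.0, 1.0, 0.6, 0.5, 0.5]       # controller output samples

ise = integral_squared_error(error, dt=0.1)
tsv = total_squared_variation(u)
```

In practice the lower bounds would come from the assessment method itself; here they are simply passed in as numbers.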
The Impact of Captivity and Posttraumatic Stress Disorder on Cognitive Performance Among Former Prisoners of War: A Longitudinal Study.

PubMed

Aloni, Roy; Crompton, Laura; Levin, Yafit; Solomon, Zahava

2018-04-24

War captivity is a potent pathogen for various aspects of mental health, including cognitive impairments. However, little is known about the long-term impact of war captivity and posttraumatic stress disorder (PTSD) on cognitive functioning among former prisoners of war (ex-POWs). This study assesses the effect of captivity, PTSD trajectories, and the accumulating differential effect in the prediction of cognitive performance. This longitudinal research includes 4 assessments (1991 [T1], 2003 [T2], 2008 [T3], 2015 [T4]) of Israeli ex-POWs and comparable combatants from the 1973 Yom Kippur War. Accordingly, 95 ex-POWs and 26 comparable combatants were included in this study. PTSD was assessed according to the DSM-IV, and cognitive performance was assessed using the Montreal Cognitive Assessment (MoCA). Ex-POWs reported higher levels of PTSD symptoms compared to controls (P = 0.007). No difference was found between the groups regarding MoCA total score. Ex-POWs with chronic PTSD were found to have more difficulty in overall cognitive functioning, compared to ex-POWs with delayed, recovery, and resilient trajectories (P = 0.03). Finally, physical and psychological suffering in captivity and intrusion symptoms predicted cognitive performance (P < .001, R² = 37.9%). These findings support the potent pathogenic effects of war captivity on cognitive abilities, more than 4 decades after the end of the traumatic event. Our results showed captivity to be a unique and powerful traumatic experience, leading to PTSD and long-lasting and enduring neuropsychological implications. These findings highlight the importance of viewing ex-POWs, in particular those suffering from chronic PTSD, especially as they age, as a high-risk population for cognitive disorders. This requires the appropriate diagnosis and cognitive therapy as a way to preserve cognitive abilities among this population. © Copyright 2018 Physicians Postgraduate Press, Inc.

External validation of Global Evaluative Assessment of Robotic Skills (GEARS).

PubMed

Aghazadeh, Monty A; Jayaratna, Isuru S; Hung, Andrew J; Pan, Michael M; Desai, Mihir M; Gill, Inderbir S; Goh, Alvin C

2015-11-01

We demonstrate the construct validity, reliability, and utility of Global Evaluative Assessment of Robotic Skills (GEARS), a clinical assessment tool designed to measure robotic technical skills, in an independent cohort using an in vivo animal training model. Using a cross-sectional observational study design, 47 voluntary participants were categorized as experts (>30 robotic cases completed as primary surgeon) or trainees. The trainee group was further divided into intermediates (≥5 but ≤30 cases) or novices (<5 cases). All participants completed a standardized in vivo robotic task in a porcine model. Task performance was evaluated by two expert robotic surgeons and self-assessed by the participants using the GEARS assessment tool. The Kruskal-Wallis test was used to compare the GEARS performance scores to determine construct validity; Spearman's rank correlation measured interobserver reliability; and Cronbach's alpha was used to assess internal consistency. Performance evaluations were completed on nine experts and 38 trainees (14 intermediate, 24 novice). Experts demonstrated superior performance compared to intermediates and novices overall and in all individual domains (p < 0.0001). In comparing intermediates and novices, the overall performance difference trended toward significance (p = 0.0505), while the individual domains of efficiency and autonomy were significantly different between groups (p = 0.0280 and 0.0425, respectively). Interobserver reliability between expert ratings was confirmed with a strong correlation observed (r = 0.857, 95% CI [0.691, 0.941]). Experts and participant scoring showed less agreement (r = 0.435, 95% CI [0.121, 0.689] and r = 0.422, 95% CI [0.081, 0.672]). Internal consistency was excellent for experts and participants (α = 0.96, 0.98, 0.93). In an independent cohort, GEARS was able to differentiate between different robotic skill levels, demonstrating excellent construct validity. As a standardized assessment tool, GEARS maintained consistency and reliability for an in vivo robotic surgical task and may be applied for skills evaluation in a broad range of robotic procedures.
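
The internal-consistency statistic mentioned in this abstract, Cronbach's alpha, can be sketched in a few lines. The function name, the data layout (one list of scores per rated domain), and the scores are assumptions for illustration and are not taken from the study.

```python
from statistics import variance


def cronbach_alpha(items):
    """Cronbach's alpha for k items, each a list with one score per subject.

    alpha = k / (k - 1) * (1 - sum(item variances) / variance(total scores))
    """
    k = len(items)
    if k < 2:
        raise ValueError("need at least two items")
    totals = [sum(scores) for scores in zip(*items)]
    item_variance_sum = sum(variance(column) for column in items)
    return k / (k - 1) * (1 - item_variance_sum / variance(totals))


# Made-up scores: three domains rated for four participants.
# The domains move in lockstep, so alpha reaches its maximum of 1.
domains = [
    [1, 2, 3, 4],
    [2, 3, 4, 5],
    [1, 2, 3, 4],
]
alpha = cronbach_alpha(domains)
```

Real rating data will not be perfectly consistent across domains, so alpha would fall below 1.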
Abu Sayyaf Group (ASG): An Al-Qaeda Associate Case Study

DTIC Science & Technology

2017-10-01

completed in August 2017. In order to conduct this assessment, CNA used a comparative methodology that included eight case studies on groups affiliated or associated with Al-Qaeda. These case studies ...Case Study P. Kathleen Hammerberg and Pamela G. Faber With contributions from Alexander Powell October 2017 This work was performed
A Limited Rotary-Wing Flight Investigation of Hyperstereo in Helmet-Mounted Display Designs

DTIC Science & Technology

2009-07-01

when compared to current and near-term I2 systems with a direct optical linkage. In summary, the current binocular I2 HMD design of ANVIS, which...terms of visual and optical performance. This assessment was performed by measuring a number of system parameters and by comparing the obtained...to subject #2 who had 800 NVG flight hours. Interestingly, across all maneuvers for which the hyperstereo HMD was asked to be compared to ANVIS
Impact of a Paper vs Virtual Simulated Patient Case on Student-Perceived Confidence and Engagement

PubMed Central

Gallimore, Casey E.; Pitterle, Michael; Morrill, Josh

2016-01-01

Objective. To evaluate online case simulation vs a paper case on student confidence and engagement. Design. Students enrolled in a pharmacotherapy laboratory course completed a patient case scenario as a component of an osteoarthritis laboratory module. Two laboratory sections used a paper case (n=53); three sections used an online virtual case simulation (n=81). Student module performance was assessed through a submitted subjective objective assessment plan (SOAP) note. Students completed pre/post surveys to measure self-perceived confidence in providing medication management. The simulation group completed postmodule questions related to realism and engagement of the online virtual case simulation. Group assessments were performed using chi-square and Mann-Whitney tests. Assessment. A significant increase in all 13 confidence items was seen in both student groups following completion of the laboratory module. The simulation group had an increased change of confidence compared to the paper group in assessing medication efficacy and documenting a thorough assessment. Comparing the online virtual simulation to a paper case, students agreed the learning experience increased interest, enjoyment, relevance, and realism. The simulation group performed better on the subjective SOAP note domain, though no difference in total SOAP note scores was found between the two groups. Conclusion. Virtual case simulations result in increased student engagement and may lead to improved documentation performance in the subjective domain of SOAP notes. However, virtual patient cases may offer limited benefit over paper cases in improving overall student self-confidence to provide medication management. PMID:26941442
Assessing Cognitive Ability and Simulator-Based Driving Performance in Poststroke Adults

PubMed Central

Falkmer, Torbjörn; Willstrand, Tania Dukic

2017-01-01

Driving is an important activity of daily living, which is increasingly relied upon as the population ages. It has been well-established that cognitive processes decline following a stroke and these processes may influence driving performance. There is much debate on the use of off-road neurological assessments and driving simulators as tools to predict driving performance; however, the majority of research uses unlicensed poststroke drivers, making it difficult to compare poststroke adults with a control group. It stands to reason that in order to determine whether simulators and cognitive assessments can accurately assess driving performance, the baseline should be set by licensed drivers. Therefore, the aim of this study was to assess differences in cognitive ability and driving simulator performance in licensed community-dwelling poststroke drivers and controls. Two groups of licensed drivers (37 poststroke and 43 controls) were assessed using several cognitive tasks and using a driving simulator. The poststroke adults exhibited poorer cognitive ability; however, there were no differences in simulator performance between groups except that the poststroke drivers demonstrated less variability in driver headway. The application of these results as a prescreening toolbox for poststroke drivers is discussed. PMID:28559646
First-year medical students' use of ultrasound or physical examination to diagnose hepatomegaly and ascites: a randomized controlled trial.

PubMed

Arora, Samantha; Cheung, Angela C; Tarique, Usman; Agarwal, Arnav; Firdouse, Mohammed; Ailon, Jonathan

2017-09-01

To compare point-of-care ultrasound and physical examination (PEx), each performed by first-year medical students after brief teaching, for assessing ascites and hepatomegaly. Ultrasound and PEx were compared on: (1) reliability, validity and performance, (2) diagnostic confidence, ease of use, utility, and applicability. A single-center, randomized controlled trial was performed at a tertiary centre. First-year medical students were randomized to use ultrasound or PEx to assess for ascites and hepatomegaly. Cohen's kappa and the intraclass correlation coefficient (ICC) were used to measure interrater reliability between trainee assessments and the reference standard (a same-day ultrasound by a radiologist). Sensitivity, specificity, accuracy, positive predictive value (PPV), and negative predictive value (NPV) were compared. A ten-point Likert scale was used to assess trainee diagnostic confidence and perceptions of utility. There were no significant differences in interobserver reliability, sensitivity, specificity, accuracy, PPV, or NPV between the ultrasound and PEx groups. However, students in the ultrasound group provided higher scores for perceived utility (ascites 8.38 ± 1.35 vs 7.08 ± 1.86, p = 0.008; hepatomegaly 7.68 ± 1.52 vs 5.36 ± 2.48, p < 0.001) and likelihood of adoption (ascites 8.67 ± 1.61 vs 7.46 ± 1.79, p = 0.02; hepatomegaly 8.12 ± 1.90 vs 5.92 ± 2.32, p = 0.001). When performed by first-year medical students, the validity and reliability of ultrasound are comparable to PEx, but with greater perceived utility and likelihood of adoption. With similarly brief instruction, point-of-care ultrasonography can be as effectively learned and performed as PEx, with a high degree of interest from trainees.
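
The five diagnostic measures compared in this abstract all derive from the same 2x2 table of test results against a reference standard. A minimal sketch follows; the function name and the counts are hypothetical and are not data from the trial.

```python
def diagnostic_metrics(tp, fp, fn, tn):
    """Standard accuracy measures from a 2x2 table against a reference standard.

    tp/fn: reference-positive cases the test called positive/negative.
    fp/tn: reference-negative cases the test called positive/negative.
    """
    return {
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
        "accuracy": (tp + tn) / (tp + fp + fn + tn),
        "ppv": tp / (tp + fp),
        "npv": tn / (tn + fn),
    }


# Made-up counts for 30 examinations.
metrics = diagnostic_metrics(tp=8, fp=2, fn=2, tn=18)
```

Note that PPV and NPV depend on how common the condition is in the sample, whereas sensitivity and specificity do not.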
Is functional MR imaging assessment of hemispheric language dominance as good as the Wada test?: a meta-analysis.

PubMed

Dym, R Joshua; Burns, Judah; Freeman, Katherine; Lipton, Michael L

2011-11-01

To perform a systematic review and meta-analysis to quantitatively assess functional magnetic resonance (MR) imaging lateralization of language function in comparison with the Wada test. This study was determined to be exempt from review by the institutional review board. A systematic review and meta-analysis were performed in accordance with the Preferred Reporting Items for Systematic Reviews and Meta-Analyses guidelines. A structured Medline search was conducted to identify all studies that compared functional MR imaging with the Wada test for determining hemispheric language dominance prior to brain surgery. Studies meeting predetermined inclusion criteria were selected independently by two radiologists who also assessed their quality using the Quality Assessment of Diagnostic Accuracy Studies tool. Language dominance was classified as typical (left hemispheric language dominance) or atypical (right hemispheric language dominance or bilateral language representation) for each patient. A meta-analysis was then performed by using a bivariate random-effects model to derive estimates of sensitivity and specificity, with Wada as the standard of reference. Subgroup analyses were also performed to compare the different functional MR imaging techniques utilized by the studies. Twenty-three studies, comprising 442 patients, met inclusion criteria. The sensitivity and specificity of functional MR imaging for atypical language dominance (compared with the Wada test) were 83.5% (95% confidence interval: 80.2%, 86.7%) and 88.1% (95% confidence interval: 87.0%, 89.2%), respectively. Functional MR imaging provides an excellent, noninvasive alternative for language lateralization and should be considered for the initial preoperative assessment of hemispheric language dominance. Further research may help determine which functional MR methods are most accurate for specific patient populations. RSNA, 2011
Histological evaluation and optimization of surgical vessel sealing systems

NASA Astrophysics Data System (ADS)

Lathrop, Robert; Ryan, Thomas; Gaspredes, Jonathan; Woloszko, Jean; Coad, James E.

2017-02-01

Surgical vessel sealing systems are widely used to achieve hemostasis and dissection in open surgery and minimally invasive, laparoscopic surgery. This enabling technology was developed about 17 years ago and continues to evolve with new devices and systems achieving improved outcomes. Histopathological assessment of thermally sealed tissues is a valuable tool for refining and comparing performance among surgical vessel sealing systems. Early work in this field typically assessed seal time, burst rate, and failure rate (in-situ). Later work compared histological staining methods with birefringence to assess the extent of thermal damage to tissues adjacent to the device. Understanding the microscopic architecture of a sealed vessel is crucial to optimizing the performance of power delivery algorithms and device design parameters. Manufacturers rely on these techniques to develop new products. A system for histopathological evaluation of vessels and sealing performance was established, to enable the direct assessment of a treatment's tissue effects. The parameters included the commonly used seal time, pressure burst rate and failure rate, as well as extensions of the assessment to include its likelihood to form steam vacuoles, adjacent thermal effect near the device, and extent of thermally affected tissue extruded back into the vessel lumen. This comprehensive assessment method provides an improved means of assessing the quality of a sealed vessel and understanding the exact mechanisms which create an optimally sealed vessel.
Glycemic penalty index for adequately assessing and comparing different blood glucose control algorithms

PubMed Central

Van Herpe, Tom; De Brabanter, Jos; Beullens, Martine; De Moor, Bart; Van den Berghe, Greet

2008-01-01

Introduction: Blood glucose (BG) control performed by intensive care unit (ICU) nurses is becoming standard practice for critically ill patients. New (semi-automated) 'BG control' algorithms (or 'insulin titration' algorithms) are under development, but these require stringent validation before they can replace the currently used algorithms. Existing methods for objectively comparing different insulin titration algorithms show weaknesses. In the current study, a new approach for appropriately assessing the adequacy of different algorithms is proposed. Methods: Two ICU patient populations (with different baseline characteristics) were studied, both treated with a similar 'nurse-driven' insulin titration algorithm targeting BG levels of 80 to 110 mg/dl. A new method for objectively evaluating BG deviations from normoglycemia was founded on a smooth penalty function. Next, the performance of this new evaluation tool was compared with the current standard assessment methods, on an individual as well as a population basis. Finally, the impact of four selected parameters (the average BG sampling frequency, the duration of algorithm application, the severity of disease, and the type of illness) on the performance of an insulin titration algorithm was determined by multiple regression analysis. Results: The glycemic penalty index (GPI) was proposed as a tool for assessing the overall glycemic control behavior in ICU patients. The GPI of a patient is the average of all penalties that are individually assigned to each measured BG value based on the optimized smooth penalty function. The computation of this index returns a number between 0 (no penalty) and 100 (the highest penalty). For some patients, the assessment of the BG control behavior using the traditional standard evaluation methods was different from the evaluation with GPI. Two parameters were found to have a significant impact on GPI: the BG sampling frequency and the duration of algorithm application. A higher BG sampling frequency and a longer algorithm application duration resulted in an apparently better performance, as indicated by a lower GPI. Conclusion: The GPI is an alternative method for evaluating the performance of BG control algorithms. The blood glucose sampling frequency and the duration of algorithm application should be similar when comparing algorithms. PMID:18302732
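
The structure of the index described in this abstract (a per-reading penalty averaged over all readings, bounded between 0 and 100) can be sketched as follows. The abstract does not give the optimized penalty function, so the quadratic placeholder below, its `scale` parameter, and the sample readings are assumptions for illustration only; only the 80 to 110 mg/dl target band and the 0 to 100 range come from the text.

```python
def penalty(bg_mg_dl, low=80.0, high=110.0, scale=4.0):
    """Placeholder penalty, NOT the published function.

    Zero inside the target band, growing with distance outside it,
    and capped at 100.
    """
    if bg_mg_dl < low:
        distance = low - bg_mg_dl
    elif bg_mg_dl > high:
        distance = bg_mg_dl - high
    else:
        return 0.0
    return min(100.0, (distance / scale) ** 2)


def glycemic_penalty_index(readings):
    """Average of the penalties assigned to each BG reading (0 to 100)."""
    if not readings:
        raise ValueError("at least one BG reading is required")
    return sum(penalty(bg) for bg in readings) / len(readings)


# Made-up BG readings in mg/dl.
gpi = glycemic_penalty_index([100, 130, 70, 90])
```

Because the index is an average over readings, two patients sampled at different frequencies or over different durations are not directly comparable, which matches the abstract's caution.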
How Should Blood Glucose Meter System Analytical Performance Be Assessed?

PubMed

Simmons, David A

2015-08-31

Blood glucose meter system analytical performance is assessed by comparing pairs of meter system and reference instrument blood glucose measurements measured over time and across a broad array of glucose values. Consequently, no single, complete, and ideal parameter can fully describe the difference between meter system and reference results. Instead, a number of assessment tools, both graphical (eg, regression plots, modified Bland-Altman plots, and error grid analysis) and tabular (eg, International Organization for Standardization guidelines, mean absolute difference, and mean absolute relative difference) have been developed to evaluate meter system performance. The strengths and weaknesses of these methods of presenting meter system performance data, including a new method known as Radar Plots, are described here. © 2015 Diabetes Technology Society.
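
Two of the tabular summaries named in this abstract, mean absolute difference and mean absolute relative difference, reduce to simple averages over paired meter and reference readings. The sketch below assumes the relative difference is expressed as a percentage of the reference value; the function names and readings are hypothetical.

```python
def _check_pairs(meter, reference):
    if len(meter) != len(reference) or not meter:
        raise ValueError("need equal-length, non-empty paired readings")


def mean_absolute_difference(meter, reference):
    """Average |meter - reference| in the measurement units (e.g. mg/dl)."""
    _check_pairs(meter, reference)
    return sum(abs(m - r) for m, r in zip(meter, reference)) / len(meter)


def mean_absolute_relative_difference(meter, reference):
    """Average |meter - reference| / reference, as a percentage."""
    _check_pairs(meter, reference)
    total = sum(abs(m - r) / r for m, r in zip(meter, reference))
    return 100.0 * total / len(meter)


# Made-up paired readings in mg/dl.
meter = [90, 110, 210]
reference = [100, 100, 200]

mad = mean_absolute_difference(meter, reference)
mard = mean_absolute_relative_difference(meter, reference)
```

The same 10 mg/dl error counts for less in relative terms at higher glucose values, which is one reason no single parameter fully describes meter performance.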
The Score-Boosting Game.

ERIC Educational Resources Information Center

Popham, W. James

2000-01-01

Teachers everywhere are playing the score-boosting game to raise scores on mandated standardized achievement tests, although five nationally recognized assessments compare student performance instead of measuring classroom learning. Since curriculum standards are often vague and misaligned with assessments, teachers sprinkle instruction with…
Substantia nigra hyperechogenicity is related to decline in verbal memory in healthy elderly adults.

PubMed

Yilmaz, R; Behnke, S; Liepelt-Scarfone, I; Roeben, B; Pausch, C; Runkel, A; Heinzel, S; Niebler, R; Suenkel, U; Eschweiler, G W; Maetzler, W; Berg, D

2016-05-01

Deficits in cognition have been reported in Parkinson's disease (PD) already in the early and even in the pre-motor stages. Whilst substantia nigra hyperechogenicity measured by transcranial B-mode sonography (TCS) represents a strong PD marker and is associated with an increased risk for PD in still healthy individuals, its association with cognitive performance in prodromal PD stages is not well established. Two different cohorts of healthy elderly individuals were assessed by TCS and two different neuropsychological test batteries covering executive functions, verbal memory, language, visuo-constructional function and attention. Cognitive performance was compared between individuals with hyperechogenicity (SN+) and without hyperechogenicity (SN-). In both cohorts, SN+ individuals performed significantly worse than the SN- group in tests assessing verbal memory (word list delayed recall P = 0.05, logical memory II P < 0.017). Significant differences in Mini-Mental State Examination score (cohort 1, P = 0.02) and executive function tests (cohort 2, Stroop Color-Word Reading, P = 0.004) could only be shown in one of the two cohorts. No between-group effects were found in other cognitive tests and domains. These results indicate that individuals with the PD risk marker SN+ perform worse in verbal memory compared to SN- independent of the assessment battery. Memory performance should be assessed in detail in individuals at risk for PD. © 2016 EAN.
Advanced vehicle systems assessment. Volume 3: Systems assessment

NASA Technical Reports Server (NTRS)

Hardy, K.

1985-01-01

The systems analyses integrate the advanced component and vehicle characteristics into conceptual vehicles with identical performance (for a given application) and evaluate the vehicles in typical use patterns. Initial and life-cycle costs are estimated and compared to conventional reference vehicles with comparable technological advances, assuming the vehicles will be in competition in the early 1990s. Electric vans, commuter vehicles, and full-size vehicles, in addition to electric/heat-engine hybrid and fuel-cell powered vehicles, are addressed in terms of performance and economics. System and subsystem recommendations for vans and two-passenger commuter vehicles are based on the economic analyses in this volume.
Computer System Performance Measurement Techniques for ARTS III Computer Systems

DOT National Transportation Integrated Search

1973-12-01

The potential contribution of direct system measurement in the evolving ARTS III Program is discussed and software performance measurement techniques are comparatively assessed in terms of credibility of results, ease of implementation, volume of data,...
42 CFR 482.21 - Condition of participation: Quality assessment and performance improvement program.

Code of Federal Regulations, 2011 CFR

2011-10-01

... learning throughout the hospital. (3) The hospital must take actions aimed at performance improvement and... QIO cooperative project, but its own projects are required to be of comparable effort. (e) Standard...
An assessment technique for computer-socket manufacturing

PubMed Central

Sanders, Joan; Severance, Michael

2015-01-01

An assessment strategy is presented for testing the quality of carving and forming of individual computer aided manufacturing facilities. The strategy is potentially useful to facilities making sockets and companies marketing manufacturing equipment. To execute the strategy, an evaluator fabricates a collection of test models and sockets using the manufacturing suite under evaluation, and then measures their shapes using scanning equipment. Overall socket quality is assessed by comparing socket shapes with electronic file shapes. Then model shapes are compared with electronic file shapes to characterize carving performance. Socket shapes are compared with model shapes to characterize forming performance. The mean radial error (MRE), which is the average difference in radii between the two shapes being compared, provides insight into sizing quality. Inter-quartile range (IQR), the range of radial error for the best matched half of the points on the surfaces being compared, provides insight into shape quality. By determining MRE and IQR for carving and forming separately, the source(s) of socket shape error may be pinpointed. The developed strategy may provide a useful tool to the prosthetics community and industry to help identify problems and limitations in computer aided manufacturing and insight into appropriate modifications to overcome them. PMID:21938663
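
The two summary statistics defined in this abstract can be sketched for a pair of shapes sampled at matching surface points. The sketch assumes the shapes are already aligned and that both lists hold radii at the same points; the function names, the quartile method, and the radii are assumptions for illustration.

```python
from statistics import mean, quantiles


def radial_errors(shape_a, shape_b):
    """Pointwise difference in radius between two aligned shapes."""
    if len(shape_a) != len(shape_b):
        raise ValueError("shapes must be sampled at the same points")
    return [a - b for a, b in zip(shape_a, shape_b)]


def mean_radial_error(errors):
    """MRE: average radial difference, an indicator of sizing quality."""
    return mean(errors)


def interquartile_range(errors):
    """IQR: spread of the middle half of the errors, an indicator of shape quality."""
    q1, _median, q3 = quantiles(errors, n=4, method="inclusive")
    return q3 - q1


# Made-up radii in millimetres at five matched surface points.
socket_radii = [50.5, 50.0, 49.5, 50.2, 50.3]
file_radii = [50.0, 50.0, 50.0, 50.0, 50.0]

errors = radial_errors(socket_radii, file_radii)
mre = mean_radial_error(errors)
iqr = interquartile_range(errors)
```

Running the same two functions on model-versus-file and socket-versus-model comparisons would separate carving error from forming error, as the abstract describes.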
A clinically guided approach for improving performance measurement for hypertension.

PubMed

Steinman, Michael A; Lee, Sei J; Peterson, Carolyn A; Fung, Kathy Z; Goldstein, Mary K

2012-05-01

Performance measures often fail to account for legitimate reasons why patients do not achieve recommended treatment targets. We tested a novel performance measurement system for blood pressure (BP) control that was designed to mimic clinical reasoning. This clinically guided approach focuses on (1) exempting patients for whom tight BP control may not be appropriate or feasible and (2) assessing BP over time. Trained abstractors conducted structured chart reviews of 201 adults with hypertension in 2 VA health care systems. Results were compared with traditional methods of performance measurement. Among 201 veterans, 183 (91%) were male, and the mean age was 71±11 years. Using the clinically guided approach, 61 patients (30%) were exempted from performance measurement. The most common reasons for exemption were inadequate opportunity to manage BP (35 patients, 17%) and the use of 4 or more antihypertensive medications (19 patients, 9%). Among patients eligible for performance measurement, there was little agreement on the presence of controlled versus uncontrolled BP when comparing the most recent BP (the traditional approach) with an integrated assessment of BP control (κ 0.14). After accounting for clinically guided exemptions and methods of BP assessment, only 15 of 72 patients (21%) whose last BP was ≥140/90 mm Hg were classified as problematic by the clinically guided approach. Many patients have legitimate reasons for not achieving tight BP control, and the methods used for BP assessment have marked effects on whether a patient is classified as having adequate or inadequate BP control.
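
The agreement statistic quoted in this abstract (κ) compares observed agreement between two classification methods with the agreement expected by chance. A minimal sketch for a 2x2 table follows; the function name, argument names, and counts are hypothetical and are not the study's data.

```python
def cohens_kappa(both_yes, a_only, b_only, both_no):
    """Cohen's kappa for two binary classifications of the same patients.

    both_yes: both methods say "controlled"
    a_only:   only method A says "controlled"
    b_only:   only method B says "controlled"
    both_no:  both methods say "uncontrolled"
    """
    n = both_yes + a_only + b_only + both_no
    observed = (both_yes + both_no) / n
    a_yes = (both_yes + a_only) / n
    b_yes = (both_yes + b_only) / n
    expected = a_yes * b_yes + (1 - a_yes) * (1 - b_yes)
    return (observed - expected) / (1 - expected)


# Made-up counts for 50 patients.
kappa = cohens_kappa(both_yes=20, a_only=5, b_only=10, both_no=15)
```

A kappa near 0 means the two methods agree little more often than chance would predict, which is how the reported value of 0.14 should be read.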
Human behavior and human performance: Psychomotor demands

NASA Technical Reports Server (NTRS)

1992-01-01

The results of several experiments are presented in abstract form. These studies are critical for the interpretation and acceptance of flight based science to be conducted by the Behavior and Performance project. Some representative titles are as follows: External audio for IBM/PC compatible computers; A comparative assessment of psychomotor performance (target prediction by humans and macaques); Response path (a dependent measure for computer maze solving and other tasks); Behavioral asymmetries of psychomotor performance in Rhesus monkey (a dissociation between hand preference and skill); Testing primates with joystick based automated apparatus; and Environmental enrichment and performance assessment for ground or flight based research with primates.
An Experimental Study of the Effects of Monetary Incentives on Performance on the 12th-Grade NAEP Reading Assessment

ERIC Educational Resources Information Center

Braun, Henry; Kirsch, Irwin; Yamamoto, Kentaro

2011-01-01

Background/context: The National Assessment of Educational Progress (NAEP) is the only comparative assessment of academic competencies regularly administered to nationally representative samples of students enrolled in Grades 4, 8, and 12. Because NAEP is a low-stakes assessment, there are long-standing questions about the level of engagement and…
Field Assessment of Energy Audit Tools for Retrofit Programs

DOE Office of Scientific and Technical Information (OSTI.GOV)

Edwards, J.; Bohac, D.; Nelson, C.

2013-07-01

This project focused on the use of home energy ratings as a tool to promote energy retrofits in existing homes. A home energy rating provides a quantitative appraisal of a home’s energy performance, usually compared to a benchmark such as the average energy use of similar homes in the same region. Rating systems based on energy performance models, the focus of this report, can establish a home’s achievable energy efficiency potential and provide a quantitative assessment of energy savings after retrofits are completed, although their accuracy needs to be verified by actual measurement or billing data. Ratings can also show homeowners where they stand compared to their neighbors, thus creating social pressure to conform to or surpass others. This project field-tested three different building performance models of varying complexity, in order to assess their value as rating systems in the context of a residential retrofit program: Home Energy Score, SIMPLE, and REM/Rate.

Use of an Analytical Grading Rubric for Self-Assessment: A Pilot Study for a Periodontal Oral Competency Examination in Predoctoral Dental Education.

PubMed

Satheesh, Keerthana M; Brockmann, Lorraine B; Liu, Ying; Gadbury-Amyot, Cynthia C

2015-12-01

While educators agree that using self-assessment in education is valuable, a major challenge is the poor agreement often found between faculty assessment and student self-assessment. The aim of this study was to determine if use of a predefined grading rubric would improve reliability between faculty and dental student assessment on a periodontal oral competency examination. Faculty members used the grading rubric to assess students' performance on the exam. Immediately after taking the exam, students used the same rubric to self-assess their performance on it. Data were collected from all third- and/or fourth-year students in four classes at one U.S. dental school from 2011 to 2014. Since two of the four classes took the exam in both the third and fourth years, those data were compared to determine if those students' self-assessment skills improved over time. Statistical analyses were performed to determine agreement between the two faculty graders and between the students' and faculty assessments on each criterion in the rubric and the overall grade. Data from the upper and lower performing quartiles of students were sub-analyzed. The results showed that faculty reliability for the overall grades was high (K=0.829) and less so for individual criteria, while student-faculty reliability was weak to moderate for both overall grades (Spearman's rho=0.312) and individual criteria. Students in the upper quartile evaluated themselves more harshly than the faculty did (p<0.0001), while the lower quartile students overestimated their performance (p=0.0445) compared to faculty evaluation. No significant improvement was found in assessment over time in the students who took the exam in the third and fourth years. This study found only limited support for the hypothesis that a grading rubric used by both faculty and students would increase correspondence between faculty and student assessment and points to a need to reexamine the rubric and instructional strategies to help students improve their ability to self-assess their work.
Performance Status and Change--Measuring Education System Effectiveness with Data from PISA 2000-2009

ERIC Educational Resources Information Center

Lenkeit, Jenny; Caro, Daniel H.

2014-01-01

Reports of international large-scale assessments tend to evaluate and compare education system performance based on absolute scores. And policymakers refer to high-performing and economically prosperous education systems to enhance their own systemic features. But socioeconomic differences between systems compromise the plausibility of those…
ASSESSMENT PROTOCOLS - DURABILITY OF PERFORMANCE OF A HOME RADON REDUCTION SYSTEM FOR SUB-SLAB DEPRESSURIZATION SYSTEMS

EPA Science Inventory

This handbook contains protocols that compare the immediate performance of subslab depressurization (SSD) mitigation system with performance months or years later. These protocols provide a methodology to test SSD radon mitigation systems in situ to determine long-term performanc...
The Effect of Initial Knee Angle on Concentric-Only Squat Jump Performance

ERIC Educational Resources Information Center

Mitchell, Lachlan J.; Argus, Christos K.; Taylor, Kristie-Lee; Sheppard, Jeremy M.; Chapman, Dale W.

2017-01-01

Purpose: There is uncertainty as to which knee angle during a squat jump (SJ) produces maximal jump performance. Importantly, understanding this information will aid in determining appropriate ratios for assessment and monitoring of the explosive characteristics of athletes. Method: This study compared SJ performance across different knee…
The Influence of Test Conditions on the Performance of Chironomus dilutus and Hyalella azteca in Sediment Toxicity Tests

EPA Science Inventory

In almost all sediment toxicity assessments, the performance of organisms in control sediments is a key parameter in defining sediment toxicity, whether through direct statistical comparison to control or by normalizing to control performance to compare results across sites or batc...
The Role of Informal Parent and Teacher Assessment in Diagnosing Learning Disabilities.

ERIC Educational Resources Information Center

Sikora, Darryn M.; Plapinger, Donald S.

1997-01-01

This study compared parent and teacher perceptions of academic performance and cognitive deficits with the standardized test performance of 19 students (ages 7-13) with hearing impairments. Results indicate that parents and educators were equally skilled in predicting academic performance, but had greater difficulty predicting specific cognitive…
Some Architecture for Embedded-Assessment Systems

ERIC Educational Resources Information Center

Kane, Michael T.; Tannenbaum, Richard J.

2016-01-01

It is one thing to produce an innovative, construct-based assessment task; it's another to produce 10 a year that are comparable in difficulty, measure the same competencies, are free of differential item functioning, and can be scaled and equated. These challenges contributed to the failure of the performance (or authentic) assessment movement of…
Dealing with Flexibility in Assessments for Students with Significant Cognitive Disabilities. Synthesis Report 60

ERIC Educational Resources Information Center

Gong, Brian; Marion, Scott

2006-01-01

Dealing with flexibility--or its converse, the extent of standardization--is fundamental to alignment, assessment design, and interpretation of results in fully inclusive assessment systems. Highly standardized tests make it easier to compare (performances, students, and schools) across time and to common standards because certain conditions are…
Virtual reality measures in neuropsychological assessment: a meta-analytic review.

PubMed

Neguț, Alexandra; Matu, Silviu-Andrei; Sava, Florin Alin; David, Daniel

2016-02-01

Virtual reality-based assessment is a new paradigm for neuropsychological evaluation that might provide an ecological assessment compared to paper-and-pencil or computerized neuropsychological assessment. Previous research has focused on the use of virtual reality in neuropsychological assessment, but no meta-analysis has focused on the sensitivity of virtual reality-based measures of cognitive processes in various populations. We found eighteen studies that compared the cognitive performance of clinical groups and healthy controls on virtual reality measures. Based on a random effects model, the results indicated a large effect size in favor of healthy controls (g = .95). For executive functions, memory and visuospatial analysis, subgroup analysis revealed moderate to large effect sizes, with superior performance in the case of healthy controls. Participants' mean age, type of clinical condition, type of exploration within virtual reality environments, and the presence of distractors were significant moderators. Our findings support the sensitivity of virtual reality-based measures in detecting cognitive impairment. They highlight the possibility of using virtual reality measures for neuropsychological assessment in research applications, as well as in clinical practice.
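For readers unfamiliar with the effect size reported above, the following is a minimal sketch of how a single-study Hedges' g can be computed from two group summaries. It assumes the standard small-sample correction J = 1 - 3/(4*df - 1) and does not reproduce the random effects pooling used in the meta-analysis. The function name and the numbers are hypothetical and are not taken from any of the eighteen studies.

```python
import math


def hedges_g(mean_1, sd_1, n_1, mean_2, sd_2, n_2):
    """Standardized mean difference with Hedges' small-sample correction."""
    df = n_1 + n_2 - 2
    # Pooled standard deviation across the two groups.
    pooled_sd = math.sqrt(((n_1 - 1) * sd_1 ** 2 + (n_2 - 1) * sd_2 ** 2) / df)
    cohens_d = (mean_1 - mean_2) / pooled_sd
    # Approximate correction factor that shrinks d slightly for small samples.
    correction = 1.0 - 3.0 / (4.0 * df - 1.0)
    return correction * cohens_d


# Hypothetical scores: healthy controls vs. a clinical group (illustration only).
g = hedges_g(50.0, 10.0, 20, 40.0, 10.0, 20)
```

With equal standard deviations of 10 and a 10-point mean difference, the uncorrected difference is one pooled standard deviation, and the correction brings g just below 1.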
Puzzle-based versus traditional lecture: comparing the effects of pedagogy on academic performance in an undergraduate human anatomy and physiology II lab.

PubMed

Stetzik, Lucas; Deeter, Anthony; Parker, Jamie; Yukech, Christine

2015-06-23

A traditional lecture-based pedagogy conveys information and content while lacking sufficient development of critical thinking skills and problem solving. A puzzle-based pedagogy creates a broader contextual framework, and fosters critical thinking as well as logical reasoning skills that can then be used to improve a student's performance on content specific assessments. This paper describes a pedagogical comparison of traditional lecture-based teaching and puzzle-based teaching in a Human Anatomy and Physiology II Lab. Using a single subject/cross-over design, half of the students from seven sections of the course were taught using one type of pedagogy for the first half of the semester, and then taught with a different pedagogy for the second half of the semester. The other half of the students were taught the same material but with the order of the pedagogies reversed. Students' performance on quizzes and exams specific to the course, and in-class assignments specific to this study were assessed for: learning outcomes (the ability to form the correct conclusion or recall specific information), and authentic academic performance as described by (Am J Educ 104:280-312, 1996). Our findings suggest a significant improvement in students' performance on standard course specific assessments using a puzzle-based pedagogy versus a traditional lecture-based teaching style. Quiz and test scores for students improved by 2.1% and 0.4%, respectively, in the puzzle-based pedagogy, versus the traditional lecture-based teaching. Additionally, the assessments of authentic academic performance may only effectively measure a broader conceptual understanding in a limited set of contexts, and not in the context of a Human Anatomy and Physiology II Lab. In conclusion, a puzzle-based pedagogy, when compared to traditional lecture-based teaching, can effectively enhance the performance of students on standard course specific assessments, even when the assessments only test a limited conceptual understanding of the material.
Comparative performances of eggs and embryos of sea urchin (Paracentrotus lividus) in toxicity bioassays used for assessment of marine sediment quality.

PubMed

Khosrovyan, A; Rodríguez-Romero, A; Salamanca, M J; Del Valls, T A; Riba, I; Serrano, F

2013-05-15

The potential toxicity of sediments from various ports was assessed by means of two different liquid-phase toxicity bioassays (acute and chronic) with embryos and eggs of sea urchin Paracentrotus lividus. Performances of embryos and eggs of P. lividus in these bioassays were compared for their interchangeable applicability in integrated sediment quality assessment. The obtained endpoints (percentages of normally developed plutei and fertilized eggs) were linked to physical and chemical properties of sediments and demonstrated dependence on sediment contamination. The endpoints in the two bioassays were strongly correlated and generally exhibited similar tendency throughout the samples. Therein, embryos demonstrated higher sensitivity to elutriate exposure, compared to eggs. It was concluded that these tests could be used interchangeably for testing toxicity of marine sediments. Preferential use of any of the bioassays can be determined by the discriminatory capacity of the test or vulnerability consideration of the test subject to the surrounding conditions. Copyright © 2013 Elsevier Ltd. All rights reserved.
Space shuttle orbiter leading-edge flight performance compared to design goals

NASA Technical Reports Server (NTRS)

Curry, D. M.; Johnson, D. W.; Kelly, R. E.

1983-01-01

Thermo-structural performance of the Space Shuttle orbiter Columbia's leading-edge structural subsystem for the first five (5) flights is compared with the design goals. Lessons learned from these initial flights of the first reusable manned spacecraft are discussed in order to assess design maturity, deficiencies, and modifications required to rectify the design deficiencies. Flight data and post-flight inspections support the conclusion that the leading-edge structural subsystem hardware performance was outstanding for the initial five (5) flights.
Awareness of Memory Ability and Change: (In)Accuracy of Memory Self-Assessments in Relation to Performance.

PubMed

Rickenbach, Elizabeth Hahn; Agrigoroaei, Stefan; Lachman, Margie E

2015-03-01

Little is known about subjective assessments of memory abilities and decline among middle-aged adults or their association with objective memory performance in the general population. In this study we examined self-ratings of memory ability and change in relation to episodic memory performance in two national samples of middle-aged and older adults from the Midlife in the United States study (MIDUS II in 2005-06) and the Health and Retirement Study (HRS; every two years from 2002 to 2012). MIDUS (Study 1) participants (N=3,581) rated their memory compared to others their age and to themselves five years ago; HRS (Study 2) participants (N=14,821) rated their current memory and their memory compared to two years ago, with up to six occasions of longitudinal data over ten years. In both studies, episodic memory performance was the total number of words recalled in immediate and delayed conditions. When controlling for demographic and health correlates, self-ratings of memory abilities, but not subjective change, were related to performance. We examined accuracy by comparing subjective and objective memory ability and change. More than one third of the participants across the studies had self-assessments that were inaccurate relative to their actual level of performance and change, and accuracy differed as a function of demographic and health factors. Further understanding of self-awareness of memory abilities and change beginning in midlife may be useful for identifying early warning signs of decline, with implications regarding policies and practice for early detection and treatment of cognitive impairment.
RELIABILITY AND VALIDITY OF A MODIFIED ISOMETRIC DYNAMOMETER IN THE ASSESSMENT OF MUSCULAR PERFORMANCE IN INDIVIDUALS WITH ANTERIOR CRUCIATE LIGAMENT RECONSTRUCTION

PubMed Central

de Vasconcelos, Rodrigo Antunes; Bevilaqua-Grossi, Débora; Shimano, Antonio Carlos; Paccola, Cleber Jansen; Salvini, Tânia Fátima; Prado, Christiane Lanatovits; Junior, Wilson A. Mello

2015-01-01

Objectives: The aim of this study was to evaluate the reliability and validity of a modified isometric dynamometer (MID) in assessing performance deficits of the knee extensor and flexor muscles in normal individuals and in those with ACL reconstructions. Methods: Sixty male subjects were invited to participate in the study and were divided into three groups of 20 subjects each: control group (GC), group of individuals with ACL reconstruction with patellar tendon graft (GTP), and group of individuals with ACL reconstruction with hamstrings graft (GTF). All individuals performed isometric tests on the MID; the muscular strength deficits collected were subsequently compared to the tests performed on the Biodex System 3 operating in the isometric and isokinetic modes at speeds of 60°/s and 180°/s. Intraclass correlation coefficient (ICC) calculations were done to assess MID reliability; specificity, sensitivity and Kappa consistency coefficient calculations were done to assess the MID's validity in detecting muscular deficits; and intra- and intergroup comparisons across the four strength tests were made using the ANOVA method. Results: The modified isometric dynamometer (MID) showed excellent reliability and good validity in the assessment of the performance of the knee extensor and flexor muscle groups. In the comparison between groups, the GTP showed significantly greater deficits as compared to the GTF and GC groups. Conclusion: Isometric dynamometers connected to mechanotherapy equipment could be an alternative option to collect data concerning performance deficits of the extensor and flexor muscle groups of the knee in subjects with ACL reconstruction. PMID:27004175
Basic life support skills of high school students before and after cardiopulmonary resuscitation training: a longitudinal investigation.

PubMed

Meissner, Theresa M; Kloppe, Cordula; Hanefeld, Christoph

2012-04-14

Immediate bystander cardiopulmonary resuscitation (CPR) significantly improves survival after a sudden cardiopulmonary collapse. This study assessed the basic life support (BLS) knowledge and performance of high school students before and after CPR training. This study included 132 teenagers (mean age 14.6 ± 1.4 years). Students completed a two-hour training course that provided theoretical background on sudden cardiac death (SCD) and a hands-on CPR tutorial. They were asked to perform BLS on a manikin to simulate an SCD scenario before the training. Afterwards, participants encountered the same scenario and completed a questionnaire for self-assessment of their pre- and post-training confidence. Four months later, we assessed the knowledge retention rate of the participants with a BLS performance score. Before the training, 29.5% of students performed chest compressions as compared to 99.2% post-training (P < 0.05). At the four-month follow-up, 99% of students still performed correct chest compressions. The overall improvement, assessed by the BLS performance score, was also statistically significant (median of 4 and 10 pre- and post-training, respectively, P < 0.05). After the training, 99.2% stated that they felt confident about performing CPR, as compared to 26.9% (P < 0.05) before the training. BLS training in high school seems highly effective considering the minimal amount of previous knowledge the students possess. We observed significant improvement and a good retention rate four months after training. Increasing the number of trained students may minimize the reluctance to conduct bystander CPR and increase the number of positive outcomes after sudden cardiopulmonary collapse.
Evaluation of virtual monoenergetic imaging algorithms for dual-energy carotid and intracerebral CT angiography: Effects on image quality, artefacts and diagnostic performance for the detection of stenosis.

PubMed

Leithner, Doris; Mahmoudi, Scherwin; Wichmann, Julian L; Martin, Simon S; Lenga, Lukas; Albrecht, Moritz H; Booz, Christian; Arendt, Christophe T; Beeres, Martin; D'Angelo, Tommaso; Bodelle, Boris; Vogl, Thomas J; Scholtz, Jan-Erik

2018-02-01

To investigate the impact of traditional (VMI) and noise-optimized virtual monoenergetic imaging (VMI+) algorithms on quantitative and qualitative image quality, and the assessment of stenosis in carotid and intracranial dual-energy CTA (DE-CTA). DE-CTA studies of 40 patients performed on a third-generation 192-slice dual-source CT scanner were included in this retrospective study. 120-kVp image-equivalent linearly-blended, VMI and VMI+ series were reconstructed. Quantitative analysis included evaluation of contrast-to-noise ratios (CNR) of the aorta, common carotid artery, internal carotid artery, middle cerebral artery, and basilar artery. VMI and VMI+ with highest CNR, and linearly-blended series were rated qualitatively. Three radiologists assessed artefacts and suitability for evaluation at shoulder height, carotid bifurcation, siphon, and intracranial using 5-point Likert scales. Detection and grading of stenosis were performed at carotid bifurcation and siphon. Highest CNR values were observed for 40-keV VMI+ compared to 65-keV VMI and linearly-blended images (P < 0.001). Artefacts were low in all qualitatively assessed series with excellent suitability for supraaortic artery evaluation at shoulder and bifurcation height. Suitability was significantly higher in VMI+ and VMI compared to linearly-blended images for intracranial and ICA assessment (P < 0.002). VMI and VMI+ showed excellent accordance for detection and grading of stenosis at carotid bifurcation and siphon with no differences in diagnostic performance. 40-keV VMI+ showed improved quantitative image quality compared to 65-keV VMI and linearly-blended series in supraaortic DE-CTA. VMI and VMI+ provided increased suitability for carotid and intracranial artery evaluation with excellent assessment of stenosis, but did not translate into increased diagnostic performance. Copyright © 2017 Elsevier B.V. All rights reserved.
Caffeine administration at night during extended wakefulness effectively mitigates performance impairment but not subjective assessments of fatigue and sleepiness.

PubMed

Paech, Gemma M; Banks, Siobhan; Pajcin, Maja; Grant, Crystal; Johnson, Kayla; Kamimori, Gary H; Vedova, Chris B Della

2016-06-01

The current study investigated the effects of repeated caffeine administration on performance and subjective reports of sleepiness and fatigue during 50h extended wakefulness. Twenty-four non-smokers aged 22.5±2.9y (mean±SD) remained awake for two nights (50h) in a controlled laboratory environment. During this period, 200mg of caffeine or placebo gum was administered at 01:00, 03:00, 05:00 and 07:00 on both nights (total of 800mg/night). Neurobehavioral performance and subjective reports were assessed throughout the wake period. Caffeine improved performance compared to placebo, but did not affect overall ratings of subjective sleepiness and fatigue. Performance and sleepiness worsened with increasing time awake for both conditions. However, caffeine slowed performance impairments such that after 50h of wakefulness performance was better following caffeine administration compared to placebo. Caffeine also slowed the increase in subjective sleepiness and performance ratings, but only during the first night of wakefulness. After two nights of sleep deprivation, there was no difference in sleepiness ratings between the two conditions. These results demonstrate that strategic administration of caffeine effectively mitigates performance impairments associated with 50h wakefulness but does not improve overall subjective assessments of sleepiness, fatigue and performance. Results indicate that while performance impairment is alleviated, individuals may continue to report feelings of sleepiness. Individuals who use caffeine as a countermeasure in sustained operations may feel as though caffeine is not effective despite impairments in objective performance being largely mitigated. Copyright © 2016 Elsevier Inc. All rights reserved.
Peer-assessment of medical communication skills: the impact of students' personality, academic and social reputation on behavioural assessment.

PubMed

Hulsman, Robert L; Peters, Joline F; Fabriek, Marcel

2013-09-01

Peer-assessment of communication skills may contribute to mastery of assessment criteria. When students develop the capacity to judge their peers' performance, they might improve their capacity to examine their own clinical performance. In this study peer-assessment ratings are compared to teacher-assessment ratings. The aim of this paper is to explore the impact of personality and social reputation as sources of bias in assessment of communication skills. Second year students were trained and assessed in history-taking communication skills. Peers rated the students' personality and academic and social reputation. Peer-assessment ratings were significantly correlated with teacher-ratings in a summative assessment of medical communication. Peers did not provide negative ratings on final scales but did provide negative ratings on subcategories. Peer- and teacher-assessments were both related to the students' personality and academic reputation. Peer-assessment cannot replace teacher-assessment if the assessment should result in high-stakes decisions about students. Our data do not confirm the hypothesis that peers are overly biased by personality and reputation characteristics in peer-assessment of performance. Early introduction of peer-assessment in medical education would facilitate early acceptance of this mode of evaluation and would promote early on the habit of critical evaluation of professional clinical performance and acceptance of being evaluated critically by peers. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.
Impact of Hybrid Delivery of Education on Student Academic Performance and the Student Experience

PubMed Central

Nutter, Douglas A.; Charneski, Lisa; Butko, Peter

2009-01-01

Objectives: To compare student academic performance and the student experience in the first-year doctor of pharmacy (PharmD) program between the main and newly opened satellite campuses of the University of Maryland. Methods: Student performance indicators including graded assessments, course averages, cumulative first-year grade point average (GPA), and introductory pharmacy practice experience (IPPE) evaluations were analyzed retrospectively. Student experience indicators were obtained via an online survey instrument and included involvement in student organizations; time-budgeting practices; and stress levels and their perceived effect on performance. Results: Graded assessments, course averages, GPA, and IPPE evaluations were indistinguishable between campuses. Students' time allocation was not different between campuses, except for time spent attending class and watching lecture videos. There was no difference between students' stress levels at each campus. Conclusions: The implementation of a satellite campus to expand pharmacy education yielded academic performance and student engagement comparable to those from traditional delivery methods. PMID:19960080
Serum antioxidant levels and nutritional status in early and advanced stage lung cancer patients.

PubMed

Klarod, Kultida; Hongsprabhas, Pranithi; Khampitak, Tueanjit; Wirasorn, Kosin; Kiertiburanakul, Sasisopin; Tangrassameeprasert, Roongpet; Daduang, Jureerut; Yongvanit, Puangrat; Boonsiri, Patcharee

2011-01-01

Malnutrition frequently occurs in lung cancer patients. We aimed to determine nutritional status and antioxidant and mineral levels in Thai patients with lung cancer. A prospective study with matched case-control was conducted. Nutritional status was assessed by body mass index (BMI) and subjective global assessment (SGA). Eastern Cooperative Oncology Group (ECOG) performance status was used to assess the performance. The serum antioxidant and mineral levels were determined. Forty-nine patients with a mean age of 58.8 (range, 35-82) who were first diagnosed with lung cancer were enrolled. They were compared with 60 healthy controls, and levels of retinol, α-tocopherol, β-carotene, lycopene, β-cryptoxanthin, selenium, and zinc were lower (P < 0.05). However, peroxidase activity was higher (P = 0.002) in patients. Selenium levels were higher in early stage compared to advanced stage patients (P = 0.041). Overweight patients had higher selenium levels (0.04 mg/L) than normal BMI patients (β = 0.04, P = 0.035). Patients with SGA class C had lower selenium levels (0.03 mg/L) than those with class A (β = -0.03, P = 0.035). The poorer ECOG performance patients had significantly lower β-carotene (β = -0.192, P = 0.003) and selenium (β = -0.031, P = 0.011) levels compared with those with good ECOG performance status. Significantly lower levels of antioxidants and selenium were found in lung cancer patients compared to healthy controls. Levels of some antioxidants and minerals differed among categories of BMI, SGA categories, or ECOG performance status. These findings may be helpful for further studies, such as the effect of nutritional supplementation on clinical outcomes. Copyright © 2011 Elsevier Inc. All rights reserved.

Corrosion performance tests for reinforcing steel in concrete : test procedures.

DOT National Transportation Integrated Search

2009-09-01

The existing test method to assess the corrosion performance of reinforcing steel embedded in concrete, mainly ASTM G109, is labor intensive, time consuming, slow to provide comparative results, and often expensive. However, corrosion of reinforc...
Corrosion performance tests for reinforcing steel in concrete : technical report.

DOT National Transportation Integrated Search

2009-10-01

The existing test method used to assess the corrosion performance of reinforcing steel embedded in concrete, mainly ASTM G 109, is labor intensive, time consuming, slow to provide comparative results, and can be expensive. However, with corrosion...
Assessment of construct validity of a virtual reality laparoscopy simulator.

PubMed

Rosenthal, Rachel; Gantert, Walter A; Hamel, Christian; Hahnloser, Dieter; Metzger, Juerg; Kocher, Thomas; Vogelbach, Peter; Scheidegger, Daniel; Oertli, Daniel; Clavien, Pierre-Alain

2007-08-01

The aim of this study was to assess whether virtual reality (VR) can discriminate between the skills of novices and intermediate-level laparoscopic surgical trainees (construct validity), and whether the simulator assessment correlates with an expert's evaluation of performance. Three hundred and seven (307) participants of the 19th-22nd Davos International Gastrointestinal Surgery Workshops performed the clip-and-cut task on the Xitact LS 500 VR simulator (Xitact S.A., Morges, Switzerland). According to their previous experience in laparoscopic surgery, participants were assigned to the basic course (BC) or the intermediate course (IC). Objective performance parameters recorded by the simulator were compared to the standardized assessment by the course instructors during laparoscopic pelvitrainer and conventional surgery exercises. IC participants performed significantly better on the VR simulator than BC participants for the task completion time as well as the economy of movement of the right instrument, but not the left instrument. Participants with maximum scores in the pelvitrainer cholecystectomy task performed the VR trial significantly faster, compared to those who scored less. In the conventional surgery task, a significant difference between those who scored the maximum and those who scored less was found not only for task completion time, but also for economy of movement of the right instrument. VR simulation provides a valid assessment of psychomotor skills and some basic aspects of spatial skills in laparoscopic surgery. Furthermore, VR allows discrimination between trainees with different levels of experience in laparoscopic surgery, establishing construct validity for the Xitact LS 500 clip-and-cut task. Virtual reality may become the gold standard to assess and monitor surgical skills in laparoscopic surgery.
Performance on International Assessments and Learning Time: A Snapshot of How the U.S. Compares to Other Education Systems on an International Scale. Informing Policy & Improving Practice. Policy Brief

ERIC Educational Resources Information Center

Saxena, Pooja; Sell, LeeAnn

2016-01-01

Drawing from two international measures, Trends in International Mathematics and Science Studies (TIMSS) and Program for International Student Assessment (PISA), this brief provides a snapshot comparison of the United States to other education systems. Specifically, this brief addresses how the U.S. compares to other countries in overall…
Functional performance of school children diagnosed with developmental delay up to two years of age

PubMed Central

Dornelas, Lílian de Fátima; Magalhães, Lívia de Castro

2016-01-01

Objective: To compare the functional performance of students diagnosed with developmental delay (DD) up to two years of age with peers exhibiting typical development. Methods: Cross-sectional study with functional performance assessment of children diagnosed with DD up to two years of age compared to those with typical development at seven to eight years of age. Each group consisted of 45 children, selected by non-random sampling, evaluated for motor skills, quality of home environment, school participation and performance. ANOVA and the Binomial test for two proportions were used to assess differences between groups. Results: The group with DD had lower motor skills when compared to the typical group. While 66.7% of children in the typical group showed adequate school participation, receiving aid in cognitive and behavioral tasks similar to that offered to other children at the same level, only 22.2% of children with DD showed the same performance. Although 53.3% of the children with DD achieved an academic performance expected for the school level, there were limitations in some activities. Only two indicators of family environment, diversity and activities with parents at home, showed statistically significant difference between the groups, with advantage being shown for the typical group. Conclusions: Children with DD have persistent difficulties at school age, with motor deficit, restrictions in school activity performance and low participation in the school context, as well as significantly lower functional performance when compared to children without DD. A systematic monitoring of this population is recommended to identify needs and minimize future problems. PMID:26553573
A theoretical-experimental methodology for assessing the sensitivity of biomedical spectral imaging platforms, assays, and analysis methods.

PubMed

Leavesley, Silas J; Sweat, Brenner; Abbott, Caitlyn; Favreau, Peter; Rich, Thomas C

2018-01-01

Spectral imaging technologies have been used for many years by the remote sensing community. More recently, these approaches have been applied to biomedical problems, where they have shown great promise. However, biomedical spectral imaging has been complicated by the high variance of biological data and the reduced ability to construct test scenarios with fixed ground truths. Hence, it has been difficult to objectively assess and compare biomedical spectral imaging assays and technologies. Here, we present a standardized methodology that allows assessment of the performance of biomedical spectral imaging equipment, assays, and analysis algorithms. This methodology incorporates real experimental data and a theoretical sensitivity analysis, preserving the variability present in biomedical image data. We demonstrate that this approach can be applied in several ways: to compare the effectiveness of spectral analysis algorithms, to compare the response of different imaging platforms, and to assess the level of target signature required to achieve a desired performance. Results indicate that it is possible to compare even very different hardware platforms using this methodology. Future applications could include a range of optimization tasks, such as maximizing detection sensitivity or acquisition speed, providing high utility for investigators ranging from design engineers to biomedical scientists. © 2017 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
A sampling strategy for promoting and assessing medical student retention of physical examination skills.

PubMed

Williams, Reed G; Klamen, Debra L; Mayer, David; Valaski, Maureen; Roberts, Nicole K

2007-10-01

Skill acquisition and maintenance require spaced deliberate practice. Assessing medical students' physical examination performance ability is resource intensive. The authors assessed the nature and size of physical examination performance samples necessary to accurately estimate total physical examination skill. Physical examination assessment data were analyzed from second year students at the University of Illinois College of Medicine at Chicago in 2002, 2003, and 2004 (N = 548). Scores on subgroups of physical exam maneuvers were compared with scores on the total physical exam, to identify sound predictors of total test performance. Five exam subcomponents were sufficiently correlated to overall test performance and provided adequate sensitivity and specificity to serve as a means to prompt continued student review and rehearsal of physical examination technical skills. Selection and administration of samples of the total physical exam provide a resource-saving approach for promoting and estimating overall physical examination skills retention.
Structural assessment of a Space Station solar dynamic heat receiver thermal energy storage canister

NASA Technical Reports Server (NTRS)

Tong, M. T.; Kerslake, T. W.; Thompson, R. L.

1988-01-01

This paper assesses the structural performance of a Space Station thermal energy storage (TES) canister subject to orbital solar flux variation and engine cold start-up operating conditions. The impact of working fluid temperature and salt-void distribution on the canister structure is assessed. Both analytical and experimental studies were conducted to determine the temperature distribution of the canister. Subsequent finite-element structural analyses of the canister were performed using both analytically and experimentally obtained temperatures. The Arrhenius creep law was incorporated into the procedure, using secondary creep data for the canister material, Haynes-188 alloy. The predicted cyclic creep strain accumulations at the hot spot were used to assess the structural performance of the canister. In addition, the structural performance of the canister based on the analytically-determined temperature was compared with that based on the experimentally-measured temperature data.
Structural assessment of a space station solar dynamic heat receiver thermal energy storage canister

NASA Technical Reports Server (NTRS)

Thompson, R. L.; Kerslake, T. W.; Tong, M. T.

1988-01-01

The structural performance of a space station thermal energy storage (TES) canister subject to orbital solar flux variation and engine cold start-up operating conditions was assessed. The impact of working fluid temperature and salt-void distribution on the canister structure was assessed. Both analytical and experimental studies were conducted to determine the temperature distribution of the canister. Subsequent finite element structural analyses of the canister were performed using both analytically and experimentally obtained temperatures. The Arrhenius creep law was incorporated into the procedure, using secondary creep data for the canister material, Haynes 188 alloy. The predicted cyclic creep strain accumulations at the hot spot were used to assess the structural performance of the canister. In addition, the structural performance of the canister based on the analytically determined temperature was compared with that based on the experimentally measured temperature data.
Digital Analysis of Sit-to-Stand in Masters Athletes, Healthy Old People, and Young Adults Using a Depth Sensor.

PubMed

Leightley, Daniel; Yap, Moi Hoon

2018-03-02

The aim of this study was to compare the performance between young adults (n = 15), healthy old people (n = 10), and masters athletes (n = 15) using a depth sensor and automated digital assessment framework. Participants were asked to complete a clinically validated assessment of the sit-to-stand technique (five repetitions), which was recorded using a depth sensor. A feature encoding and evaluation framework to assess balance, core, and limb performance using time- and speed-related measurements was applied to markerless motion capture data. The associations between the measurements and participant groups were examined and used to evaluate the assessment framework suitability. The proposed framework could identify phases of sit-to-stand, stability, transition style, and performance between participant groups with a high degree of accuracy. In summary, we found that a depth sensor coupled with the proposed framework could identify performance subtleties between groups.
Digital Analysis of Sit-to-Stand in Masters Athletes, Healthy Old People, and Young Adults Using a Depth Sensor

PubMed Central

2018-01-01

The aim of this study was to compare the performance between young adults (n = 15), healthy old people (n = 10), and masters athletes (n = 15) using a depth sensor and automated digital assessment framework. Participants were asked to complete a clinically validated assessment of the sit-to-stand technique (five repetitions), which was recorded using a depth sensor. A feature encoding and evaluation framework to assess balance, core, and limb performance using time- and speed-related measurements was applied to markerless motion capture data. The associations between the measurements and participant groups were examined and used to evaluate the assessment framework suitability. The proposed framework could identify phases of sit-to-stand, stability, transition style, and performance between participant groups with a high degree of accuracy. In summary, we found that a depth sensor coupled with the proposed framework could identify performance subtleties between groups. PMID:29498644
Parasitology: United Kingdom National Quality Assessment Scheme.

PubMed Central

Hawthorne, M.; Chiodini, P. L.; Snell, J. J.; Moody, A. H.; Ramsay, A.

1992-01-01

AIMS: To assess the results from parasitology laboratories taking part in a quality assessment scheme between 1986 and 1991; and to compare performance with repeat specimens. METHODS: Quality assessment of blood parasitology, including tissue parasites (n = 444; 358 UK, 86 overseas), and faecal parasitology, including extra-intestinal parasites (n = 205; 141 UK, 64 overseas), was performed. RESULTS: Overall, the standard of performance was poor. A questionnaire distributed to participants showed that a wide range of methods was used, some of which were considered inadequate to achieve reliable results. Teaching material was distributed to participants from time to time in an attempt to improve standards. CONCLUSIONS: Since the closure of the IMLS fellowship course in 1972, fewer opportunities for specialised training in parasitology are available: more training is needed. Poor performance in the detection of malarial parasites is mainly attributable to incorrect speciation, misidentification, and lack of equipment such as an eyepiece graticule. PMID:1452791
Sleep and Sleepiness among First-Time Postpartum Parents: A Field- and Laboratory-Based Multimethod Assessment

PubMed Central

Insana, Salvatore P.; Montgomery-Downs, Hawley E.

2012-01-01

The study aim was to compare sleep, sleepiness, fatigue, and neurobehavioral performance among first-time mothers and fathers during their early postpartum period. Participants were 21 first-time postpartum mother-father dyads (N=42) and seven childless control dyads (N=14). Within their natural environment, participants completed one week of wrist actigraphy monitoring, along with multi-day self-administered sleepiness, fatigue, and neurobehavioral performance measures. The assessment week was followed by an objective laboratory based test of sleepiness. Mothers obtained more sleep compared to fathers, but mothers’ sleep was more disturbed by awakenings. Fathers had greater objectively measured sleepiness than mothers. Mothers and fathers did not differ on subjectively measured sleep quality, sleepiness, or fatigue; however, mothers had worse neurobehavioral performance than fathers. Compared to control dyads, postpartum parents experienced greater sleep disturbance, sleepiness, and sleepiness associated impairments. Study results inform social policy, postpartum sleep interventions, and research on postpartum family systems and mechanisms that propagate sleepiness. PMID:22553114
The Influence of Methylphenidate on Hyperactivity and Attention Deficits in Children With ADHD: A Virtual Classroom Test.

PubMed

Mühlberger, A; Jekel, K; Probst, T; Schecklmann, M; Conzelmann, A; Andreatta, M; Rizzo, A A; Pauli, P; Romanos, M

2016-05-13

This study compares the performance in a continuous performance test within a virtual reality classroom (CPT-VRC) between medicated children with ADHD, unmedicated children with ADHD, and healthy children. N = 94 children with ADHD (n = 26 of them received methylphenidate and n = 68 were unmedicated) and n = 34 healthy children performed the CPT-VRC. Omission errors, reaction time/variability, commission errors, and body movements were assessed. Furthermore, ADHD questionnaires were administered and compared with the CPT-VRC measures. The unmedicated ADHD group exhibited more omission errors and showed slower reaction times than the healthy group. Reaction time variability was higher in the unmedicated ADHD group compared with both the healthy and the medicated ADHD group. Omission errors and reaction time variability were associated with inattentiveness ratings of experimenters. Head movements were correlated with hyperactivity ratings of parents and experimenters. Virtual reality is a promising technology to assess ADHD symptoms in an ecologically valid environment. © The Author(s) 2016.
Comparative effectiveness of instructional methods: oral and pharyngeal cancer examination.

PubMed

Clark, Nereyda P; Marks, John G; Sandow, Pamela R; Seleski, Christine E; Logan, Henrietta L

2014-04-01

This study compared the effectiveness of different methods of instruction for the oral and pharyngeal cancer examination. A group of thirty sophomore students at the University of Florida College of Dentistry were randomly assigned to three training groups: video instruction, a faculty-led hands-on instruction, or both video and hands-on instruction. The training intervention involved attending two sessions spaced two weeks apart. The first session used a pretest to assess students' baseline didactic knowledge and clinical examination technique. The second session utilized two posttests to assess the comparative effectiveness of the training methods on didactic knowledge and clinical technique. The key findings were that students performed the clinical examination significantly better with the combination of video and faculty-led hands-on instruction (p<0.01). All students improved their clinical exam skills, knowledge, and confidence in performing the oral and pharyngeal cancer examination independent of which training group they were assigned. Utilizing both video and interactive practice promoted greater performance of the clinical technique on the oral and pharyngeal cancer examination.
Self-assessment of social cognitive ability in schizophrenia: Association with social cognitive test performance, informant assessments of social cognitive ability, and everyday outcomes.

PubMed

Silberstein, Juliet M; Pinkham, Amy E; Penn, David L; Harvey, Philip D

2018-04-17

Impairments in self-assessment are common in people with schizophrenia and impairments in self-assessment of cognitive ability have been found to predict impaired functional outcome. In this study, we examined self-assessment of social cognitive ability and related them to assessments of social cognition provided by informants, to performance on tests of social cognition, and to everyday outcomes. The difference between self-reported social cognition and informant ratings was used to predict everyday functioning. People with schizophrenia (n=135) performed 8 different tests of social cognition. They were asked to rate their social cognitive abilities on the Observable Social Cognition Rating Scale (OSCARs). High contact informants also rated social cognitive ability and everyday outcomes, while unaware of the patients' social cognitive performance and self-assessments. Social competence was measured with a performance-based assessment and clinical ratings of negative symptoms were also performed. Patient reports of their social cognitive abilities were uncorrelated with performance on social cognitive tests and with three of the four domains of functional outcomes. Differences between self-reported and informant rated social cognitive ability predicted impaired everyday functioning across all four functional domains. This difference score predicted disability even when the influences of social cognitive performance, social competence, and negative symptoms were considered. Mis-estimation of social cognitive ability was an important predictor of social and nonsocial outcomes in schizophrenia compared to performance on social cognitive tests. These results suggest that consideration of self-assessment is critical when attempting to evaluate the causes of disability and when trying to implement interventions targeting disability reduction. Copyright © 2018 Elsevier B.V. All rights reserved.
Strategies for increasing the feasibility of performance assessments during competency-based education: Subjective and objective evaluations correlate in the operating room.

PubMed

Szasz, Peter; Louridas, Marisa; Harris, Kenneth A; Grantcharov, Teodor P

2017-08-01

Competency-based education necessitates assessments that determine whether trainees have acquired specific competencies. The evidence on the ability of internal raters (staff surgeons) to provide accurate assessments is mixed; however, this has not yet been directly explored in the operating room. This study's objective is to compare the ratings given by internal raters vs an expert external rater (independent to the training process) in the operating room. Raters assessed general surgery residents during a laparoscopic cholecystectomy for their technical and nontechnical performance. Fifteen cases were observed. There was a moderately positive correlation (r_s = .618, P = .014) for technical performance and a strong positive correlation (r_s = .731, P = .002) for nontechnical performance. The internal raters were less stringent for technical (mean rank 3.33 vs 8.64, P = .007) and nontechnical (mean rank 3.83 vs 8.50, P = .01) performances. This study provides evidence to help operationalize competency-based assessments. Copyright © 2016 Elsevier Inc. All rights reserved.
Concurrently examining unrealistic absolute and comparative optimism: Temporal shifts, individual-difference and event-specific correlates, and behavioural outcomes.

PubMed

Ruthig, Joelle C; Gamblin, Bradlee W; Jones, Kelly; Vanderzanden, Karen; Kehn, Andre

2017-02-01

Researchers have spent considerable effort examining unrealistic absolute optimism and unrealistic comparative optimism, yet there is a lack of research exploring them concurrently. This longitudinal study repeatedly assessed unrealistic absolute and comparative optimism within a performance context over several months to identify the degree to which they shift as a function of proximity to performance and performance feedback, their associations with global individual difference and event-specific factors, and their link to subsequent behavioural outcomes. Results showed similar shifts in unrealistic absolute and comparative optimism based on proximity to performance and performance feedback. Moreover, increases in both types of unrealistic optimism were associated with better subsequent performance beyond the effect of prior performance. However, several differences were found between the two forms of unrealistic optimism in their associations with global individual difference factors and event-specific factors, highlighting the distinctiveness of the two constructs. © 2016 The British Psychological Society.
Student science achievement and the integration of Indigenous knowledge on standardized tests

NASA Astrophysics Data System (ADS)

Dupuis, Juliann; Abrams, Eleanor

2017-09-01

In this article, we examine how American Indian students in Montana performed on standardized state science assessments when a small number of test items based upon traditional science knowledge from a cultural curriculum, "Indian Education for All", were included. Montana is the first state in the US to mandate the use of a culturally relevant curriculum in all schools and to incorporate this curriculum into a portion of the standardized assessment items. This study compares White and American Indian student test scores on these particular test items to determine how White and American Indian students perform on culturally relevant test items compared to traditional standard science test items. The connections between student achievement on adapted culturally relevant science test items versus traditional items brings valuable insights to the fields of science education, research on student assessments, and Indigenous studies.
Improving educational objectives of the Industrial and Management Systems Engineering programme at Kuwait University

NASA Astrophysics Data System (ADS)

Aldowaisan, Tariq; Allahverdi, Ali

2016-05-01

This paper describes the process of developing programme educational objectives (PEOs) for the Industrial and Management Systems Engineering programme at Kuwait University, and the process of deployment of these PEOs. Input of the four constituents of the programme, faculty, students, alumni, and employers, is incorporated in the development and update of the PEOs. For each PEO an assessment process is employed where performance measures are defined along with target attainment levels. Results from assessment tools are compared with the target attainment levels to measure performance with regard to the PEOs. The assessment indicates that the results meet or exceed the target attainment levels of the PEOs' performance measures.

To Rubric or Not to Rubric? The Effects of Self-Assessment on Self-Regulation, Performance and Self-Efficacy

ERIC Educational Resources Information Center

Panadero, Ernesto; Romero, Margarida

2014-01-01

The objective of this study was to compare the effects of situations in which self-assessment was conducted using rubrics and situations in which no specific self-assessment tool was used. Two hundred and eighteen third-year pre-service teachers were assigned to either non-rubric or rubric self-assessment for designing a conceptual map. They then…
Differential Item Functioning by Gender on a Large-Scale Science Performance Assessment: A Comparison across Grade Levels.

ERIC Educational Resources Information Center

Holweger, Nancy; Taylor, Grace

The fifth-grade and eighth-grade science items on a state performance assessment were compared for differential item functioning (DIF) due to gender. The grade 5 sample consisted of 8,539 females and 8,029 males and the grade 8 sample consisted of 7,477 females and 7,891 males. A total of 30 fifth grade items and 26 eighth grade items were…
Investigating the roles of touchscreen and physical control interface characteristics on driver distraction and multitasking performance.

DOT National Transportation Integrated Search

2016-01-01

This study aimed to assess the potential of driver distraction, task performance, orientation of attention, and perceived workload in a multitasking situation involving interaction with touchscreen interface, compared to physical interface. Autho...
Scientific Inquiry Self-Efficacy and Computer Game Self-Efficacy as Predictors and Outcomes of Middle School Boys' and Girls' Performance in a Science Assessment in a Virtual Environment

NASA Astrophysics Data System (ADS)

Bergey, Bradley W.; Ketelhut, Diane Jass; Liang, Senfeng; Natarajan, Uma; Karakus, Melissa

2015-10-01

The primary aim of the study was to examine whether performance on a science assessment in an immersive virtual environment was associated with changes in scientific inquiry self-efficacy. A secondary aim of the study was to examine whether performance on the science assessment was equitable for students with different levels of computer game self-efficacy, including whether gender differences were observed. We examined 407 middle school students' scientific inquiry self-efficacy and computer game self-efficacy before and after completing a computer game-like assessment about a science mystery. Results from path analyses indicated that prior scientific inquiry self-efficacy predicted achievement on end-of-module questions, which in turn predicted change in scientific inquiry self-efficacy. By contrast, computer game self-efficacy was neither predictive of nor predicted by performance on the science assessment. While boys had higher computer game self-efficacy compared to girls, multi-group analyses suggested only minor gender differences in how efficacy beliefs related to performance. Implications for assessments with virtual environments and future design and research are discussed.
Spectral performance of a whole-body research photon counting detector CT: quantitative accuracy in derived image sets

NASA Astrophysics Data System (ADS)

Leng, Shuai; Zhou, Wei; Yu, Zhicong; Halaweish, Ahmed; Krauss, Bernhard; Schmidt, Bernhard; Yu, Lifeng; Kappler, Steffen; McCollough, Cynthia

2017-09-01

Photon-counting computed tomography (PCCT) uses a photon counting detector to count individual photons and allocate them to specific energy bins by comparing photon energy to preset thresholds. This enables simultaneous multi-energy CT with a single source and detector. Phantom studies were performed to assess the spectral performance of a research PCCT scanner by assessing the accuracy of derived image sets. Specifically, we assessed the accuracy of iodine quantification in iodine map images and of CT number accuracy in virtual monoenergetic images (VMI). Vials containing iodine with five known concentrations were scanned on the PCCT scanner after being placed in phantoms representing the attenuation of different size patients. For comparison, the same vials and phantoms were also scanned on 2nd and 3rd generation dual-source, dual-energy scanners. After material decomposition, iodine maps were generated, from which iodine concentration was measured for each vial and phantom size and compared with the known concentration. Additionally, VMIs were generated and CT number accuracy was compared to the reference standard, which was calculated based on known iodine concentration and attenuation coefficients at each keV obtained from the U.S. National Institute of Standards and Technology (NIST). Results showed accurate iodine quantification (root mean square error of 0.5 mgI/cc) and accurate CT number of VMIs (percentage error of 8.9%) using the PCCT scanner. The overall performance of the PCCT scanner, in terms of iodine quantification and VMI CT number accuracy, was comparable to that of EID-based dual-source, dual-energy scanners.
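The threshold-based energy binning and the root mean square error metric mentioned in this abstract can be illustrated with a minimal Python sketch. The function names, threshold values, and photon energies are hypothetical and chosen only for illustration; they are not taken from the study, and a real detector performs this comparison in hardware.

```python
import bisect
import math


def bin_photons(energies_kev, thresholds_kev):
    """Count photons per energy bin by comparing each energy to sorted thresholds.

    Bin i covers [thresholds_kev[i], thresholds_kev[i + 1]); the last bin is
    open-ended. Photons below the lowest threshold are discarded.
    (Illustrative assumption, not the scanner's actual logic.)
    """
    counts = [0] * len(thresholds_kev)
    for energy in energies_kev:
        # Index of the highest threshold that is <= energy, or -1 if none.
        idx = bisect.bisect_right(thresholds_kev, energy) - 1
        if idx >= 0:
            counts[idx] += 1
    return counts


def rmse(measured, known):
    """Root mean square error between measured and known values."""
    squared = [(m - k) ** 2 for m, k in zip(measured, known)]
    return math.sqrt(sum(squared) / len(squared))


if __name__ == "__main__":
    # Hypothetical thresholds (keV) and photon energies (keV).
    print(bin_photons([10, 30, 55, 80, 120], [25, 50, 75]))
    # Hypothetical measured vs. known iodine concentrations.
    print(rmse([2.1, 4.8, 10.3], [2.0, 5.0, 10.0]))
```

In this sketch each photon lands in exactly one bin, which is what allows a single acquisition to yield several energy-resolved data sets.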
Comparison between the Movement ABC-2 and the Zurich Neuromotor Assessment in Preschool Children.

PubMed

Kakebeeke, Tanja H; Knaier, Elisa; Köchli, Sabrina; Chaouch, Aziz; Rousson, Valentin; Kriemler, Susi; Jenni, Oskar G

2016-12-01

An established test instrument for the assessment of motor performance in children between 3 and 16 years is the Movement Assessment Battery for Children - Second Edition (M-ABC-2). The Zurich Neuromotor Assessment (ZNA) is also widely used for the evaluation of children's motor performance but has not been compared with the M-ABC-2 for children below five years for the purpose of convergent validity. Forty-seven children (26 boys, 21 girls) between three and five years of age were assessed using the M-ABC-2 and the ZNA3-5. Rank correlations between scores of different test components were calculated. Only low-to-moderate correlations were observed when separate components of these tests were compared (.31 to .68, p < .05), especially when involving the associated movements from the ZNA3-5 (-.05 to -.13, p > .05). However, the correlation between summary scores of the two tests was .77 (p < .001), and it increased to .84 when associated movements were excluded, which was comparable in magnitude to the test-retest reliability of the M-ABC-2, supporting convergent validity between the two tests. Although the ZNA3-5 and M-ABC-2 measure different aspects of motor behavior, the two instruments may thus measure essentially the same construct. © The Author(s) 2016.
Comparison of Self-Report Versus Sensor-Based Methods for Measuring the Amount of Upper Limb Activity Outside the Clinic.

PubMed

Waddell, Kimberly J; Lang, Catherine E

2018-03-10

To compare self-reported with sensor-measured upper limb (UL) performance in daily life for individuals with chronic (≥6mo) UL paresis poststroke. Secondary analysis of participants enrolled in a phase II randomized, parallel, dose-response UL movement trial. This analysis compared the accuracy and consistency between self-reported UL performance and sensor-measured UL performance at baseline and immediately post an 8-week intensive UL task-specific intervention. Outpatient rehabilitation. Community-dwelling individuals with chronic (≥6mo) UL paresis poststroke (N=64). Not applicable. Motor Activity Log amount of use scale and the sensor-derived use ratio from wrist-worn accelerometers. There was a high degree of variability between self-reported UL performance and the sensor-derived use ratio. Using sensor-based values as a reference, 3 distinct categories were identified: accurate reporters (reporting difference ±0.1), overreporters (difference >0.1), and underreporters (difference <-0.1). Five of 64 participants accurately self-reported UL performance at baseline and postintervention. Over half of participants (52%) switched categories from pre- to postintervention (eg, moved from underreporting preintervention to overreporting postintervention). For the consistent reporters, no participant characteristics were found to influence whether someone over- or underreported performance compared with sensor-based assessment. Participants did not consistently or accurately self-report UL performance when compared with the sensor-derived use ratio. Although self-report and sensor-based assessments are moderately associated and appear similar conceptually, these results suggest self-reported UL performance is often not consistent with sensor-measured performance and the measures cannot be used interchangeably. Copyright © 2018 American Congress of Rehabilitation Medicine. Published by Elsevier Inc. All rights reserved.
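The three reporting categories defined in this abstract can be expressed as a short Python sketch. It assumes that the self-reported and sensor-derived values have already been placed on a common scale and that the difference is self-report minus sensor value; the abstract does not describe that scaling, and the function name and example values are hypothetical.

```python
def classify_reporter(self_report, sensor_value, tolerance=0.1):
    """Label a participant using the abstract's cut-offs.

    Assumes both inputs share one scale (an assumption, not stated in the
    abstract). Difference = self_report - sensor_value:
      within +/- tolerance -> "accurate"
      above  +tolerance    -> "overreporter"
      below  -tolerance    -> "underreporter"
    """
    difference = self_report - sensor_value
    if difference > tolerance:
        return "overreporter"
    if difference < -tolerance:
        return "underreporter"
    return "accurate"


if __name__ == "__main__":
    # Hypothetical (self-report, sensor) pairs before and after intervention.
    before = classify_reporter(0.40, 0.75)
    after = classify_reporter(0.95, 0.70)
    print(before, after, "switched" if before != after else "consistent")
```

Comparing the label at baseline with the label postintervention, as in the usage lines, is one simple way to flag participants who switched categories.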
Assessing Probabilistic Reasoning in Verbal-Numerical and Graphical-Pictorial Formats: An Evaluation of the Psychometric Properties of an Instrument

ERIC Educational Resources Information Center

Agus, Mirian; Penna, Maria Pietronilla; Peró-Cebollero, Maribel; Guàrdia-Olmos, Joan

2016-01-01

Research on the graphical facilitation of probabilistic reasoning has been characterised by the effort expended to identify valid assessment tools. The authors developed an assessment instrument to compare reasoning performances when problems were presented in verbal-numerical and graphical-pictorial formats. A sample of undergraduate psychology…
Assessing Preservice Teachers' Presentation Capabilities: Contrasting the Modes of Communication with the Constructed Impression

ERIC Educational Resources Information Center

Bower, Matt G.; Moloney, Robyn A.; Cavanagh, Michael S.; Sweller, Naomi

2013-01-01

A research-based understanding of how to develop and assess classroom presentation skills is vital for the effective development of pre-service teacher communication capabilities. This paper identifies and compares two different models of assessing pre-service teachers' presentation performance--one based on the Modes of Communication (voice,…
Assessing the Performance of Classical Test Theory Item Discrimination Estimators in Monte Carlo Simulations

ERIC Educational Resources Information Center

Bazaldua, Diego A. Luna; Lee, Young-Sun; Keller, Bryan; Fellers, Lauren

2017-01-01

The performance of various classical test theory (CTT) item discrimination estimators has been compared in the literature using both empirical and simulated data, resulting in mixed results regarding the preference of some discrimination estimators over others. This study analyzes the performance of various item discrimination estimators in CTT:…
Assessment of recovery in older patients hospitalized with different diagnoses and functional levels, evaluated with and without geriatric assessment.

PubMed

Abrahamsen, Jenny Foss; Haugland, Cathrine; Ranhoff, Anette Hylen

2016-01-01

The objective of the present study was to investigate 1) the role of different admission diagnoses and 2) the degree of functional loss, on the rate of recovery of older patients after acute hospitalization. A further objective was to compare the predictive value of simple assessments that can be carried out in a hospital lacking geriatric service, with assessments including geriatric screening tests. Prospective, observational cohort study, including 961 community-dwelling patients aged ≥ 70 years, transferred from medical, cardiac, pulmonary and orthopedic acute hospital departments to intermediate care in nursing home. Functional assessment with Barthel index (BI) was performed at admission to the nursing home and further geriatric assessment tests were performed during the first week. Logistic regression models with and without geriatric assessment were compared concerning the patients having 1) slow recovery (nursing home stay up to 2 months before return home) or, 2) poor recovery (dead or still in nursing home at 2 months). Slow recovery was independently associated with a diagnosis of non-vertebral fracture, BI subgroups 50-79 and <50, and, in the model including geriatric assessment, also with cognitive impairment. Poor recovery was more complex, and independently associated both with BI < 50, receiving home care before admission, higher age, admission with a non-vertebral fracture, and in the geriatric assessment model, cognitive impairment. Geriatric assessment is optimal for determining the recovery potential of older patients after acute hospitalization. As some hospitals lack geriatric services and ability to perform geriatric screening tests, a simpler assessment based on admission diagnoses and ADL function (BI), gives good information regarding the possible rehabilitation time and possibility to return home.
Identify and Quantify the Mechanistic Sources of Sensor Performance Variation Between Individual Sensors SN1 and SN2

DOE Office of Scientific and Technical Information (OSTI.GOV)

Diaz, Aaron A.; Baldwin, David L.; Cinson, Anthony D.

2014-08-06

This Technical Letter Report satisfies the M3AR-14PN2301022 milestone, and is focused on identifying and quantifying the mechanistic sources of sensor performance variation between individual 22-element, linear phased-array sensor prototypes, SN1 and SN2. This effort constitutes an iterative evolution that supports the longer term goal of producing and demonstrating a pre-manufacturing prototype ultrasonic probe that possesses the fundamental performance characteristics necessary to enable the development of a high-temperature sodium-cooled fast reactor inspection system. The scope of the work for this portion of the PNNL effort conducted in FY14 includes performing a comparative evaluation and assessment of the performance characteristics of the SN1 and SN2 22-element PA-UT probes manufactured at PNNL. Key transducer performance parameters, such as sound field dimensions, resolution capabilities, frequency response, and bandwidth are used as a metric for the comparative evaluation and assessment of the SN1 and SN2 engineering test units.
Smoking abstinence and reinstatement effects in adolescent cigarette smokers.

PubMed

Colby, Suzanne M; Leventhal, Adam M; Brazil, Linda; Lewis-Esquerre, Johanna; Stein, L A R; Rohsenow, Damaris J; Monti, Peter M; Niaura, Raymond S

2010-01-01

The study objectives were to examine smoking abstinence and reinstatement effects on subjective experience and cognitive performance among adolescent smokers. Adolescents (aged 14-17 years, 60 daily smokers and 32 nonsmokers) participated. Participants completed baseline assessments (Session 1) and returned to the laboratory 1-3 days later to repeat assessments (Session 2); half of the smokers were randomly assigned to 15-17 hr tobacco abstinence preceding Session 2. During Session 2, abstaining smokers reported significantly greater increases in withdrawal symptoms, smoking urges, and negative affect compared with smokers who did not abstain and compared with nonsmokers. Smoking reinstatement reversed abstinence effects, returning to baseline levels for smoking urges and negative affect. Abstaining smokers showed significantly enhanced cognitive performance on two of six tasks (two-letter search compared with nonabstaining smokers; serial reaction time compared with nonsmokers); smoking reinstatement resulted in significant decrements on these two tasks relative to nonabstaining smokers. Effects of smoking abstinence and reinstatement on self-report measures are consistent with earlier research with adolescent as well as adult smokers and may help to elucidate the motivational underpinnings of smoking maintenance among adolescent smokers. Effects found on cognitive performance were contrary to hypotheses; further research is needed to understand better the role of cognitive performance effects in smoking maintenance among adolescents.
Smoking abstinence and reinstatement effects in adolescent cigarette smokers

PubMed Central

Leventhal, Adam M.; Brazil, Linda; Lewis-Esquerre, Johanna; Stein, L. A. R.; Rohsenow, Damaris J.; Monti, Peter M.; Niaura, Raymond S.

2010-01-01

Introduction: The study objectives were to examine smoking abstinence and reinstatement effects on subjective experience and cognitive performance among adolescent smokers. Methods: Adolescents (aged 14–17 years, 60 daily smokers and 32 nonsmokers) participated. Participants completed baseline assessments (Session 1) and returned to the laboratory 1–3 days later to repeat assessments (Session 2); half of the smokers were randomly assigned to 15–17 hr tobacco abstinence preceding Session 2. Results: During Session 2, abstaining smokers reported significantly greater increases in withdrawal symptoms, smoking urges, and negative affect compared with smokers who did not abstain and compared with nonsmokers. Smoking reinstatement reversed abstinence effects, returning to baseline levels for smoking urges and negative affect. Abstaining smokers showed significantly enhanced cognitive performance on two of six tasks (two-letter search compared with nonabstaining smokers; serial reaction time compared with nonsmokers); smoking reinstatement resulted in significant decrements on these two tasks relative to nonabstaining smokers. Discussion: Effects of smoking abstinence and reinstatement on self-report measures are consistent with earlier research with adolescent as well as adult smokers and may help to elucidate the motivational underpinnings of smoking maintenance among adolescent smokers. Effects found on cognitive performance were contrary to hypotheses; further research is needed to understand better the role of cognitive performance effects in smoking maintenance among adolescents. PMID:19933776
Measuring Medical Housestaff Teamwork Performance Using Multiple Direct Observation Instruments: Comparing Apples and Apples.

PubMed

Weingart, Saul N; Yaghi, Omar; Wetherell, Matthew; Sweeney, Megan

2018-04-10

To examine the composition and concordance of existing instruments used to assess medical teams' performance. A trained observer joined 20 internal medicine housestaff teams for morning work rounds at Tufts Medical Center, a 415-bed Boston teaching hospital, from October through December 2015. The observer rated each team's performance using 9 teamwork observation instruments that examined domains including team structure, leadership, situation monitoring, mutual support, and communication. Observations recorded on paper forms were stored electronically. Scores were normalized from 1 (low) to 5 (high) to account for different rating scales. Overall mean scores were calculated and graphed; weighted scores adjusted for the number of items in each teamwork domain. Teamwork scores were analyzed using t-tests, pair-wise correlations, and the Kruskal-Wallis statistic, and team performance was compared across instruments by domain. The 9 tools incorporated 5 major domains, with 5-35 items per instrument for a total of 161 items per observation session. In weighted and unweighted analyses, the overall teamwork performance score for a given team on a given day varied by instrument. While all of the tools identified the same low outlier, high performers on some instruments were low performers on others. Inconsistent scores for a given team across instruments persisted in domain-level analyses. There was substantial variation in the rating of individual teams assessed concurrently by a single observer using multiple instruments. Since existing teamwork observation tools do not yield concordant assessments, researchers should create better tools for measuring teamwork performance.
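The abstract says scores were normalized from 1 (low) to 5 (high) but does not state how. A minimal sketch, assuming a simple linear rescaling of each instrument's native range; the function name `rescale_to_1_5` and the example ratings are hypothetical and not taken from the study:

```python
def rescale_to_1_5(score, scale_min, scale_max):
    """Linearly map a rating from [scale_min, scale_max] onto [1, 5].

    Assumption: linear rescaling; the study does not describe its method.
    """
    if scale_max <= scale_min:
        raise ValueError("scale_max must be greater than scale_min")
    if not scale_min <= score <= scale_max:
        raise ValueError("score lies outside the instrument's scale")
    return 1 + 4 * (score - scale_min) / (scale_max - scale_min)


# Hypothetical ratings of one team on three instruments with different scales,
# given as (score, scale minimum, scale maximum).
ratings = [(3, 1, 4), (6, 0, 10), (5, 1, 7)]
normalized = [rescale_to_1_5(s, lo, hi) for s, lo, hi in ratings]

# Unweighted overall mean across instruments on the common 1-5 scale.
overall_mean = sum(normalized) / len(normalized)
```

Once every instrument is on the same scale, a team's scores can be compared across instruments, which is the comparison in which the study found disagreement.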
Launch commit criteria performance trending analysis, phase 1, revision A. SRM and QA mission services

NASA Technical Reports Server (NTRS)

1989-01-01

Quantitative methods and measures for trending launch commit criteria (LCC) performance are assessed. A statistical performance trending analysis pilot study was conducted and compared to STS-26 mission data. This study used four selected shuttle measurement types (solid rocket booster, external tank, space shuttle main engine, and range safety switch safe and arm device) from the five missions prior to mission 51-L. After obtaining raw data coordinates, each set of measurements was processed to obtain statistical confidence bounds and mean data profiles for each of the selected measurement types. STS-26 measurements were compared to the statistical data base profiles to verify the statistical capability of assessing occurrences of data trend anomalies and abnormal time-varying operational conditions associated with data amplitude and phase shifts.
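The idea of building a mean profile with confidence bounds from earlier missions and then checking a new mission against it can be sketched as follows. This is an illustration only: the report does not specify how its bounds were computed, so the sketch assumes bounds of the mean plus or minus k sample standard deviations at each time step, and all function names and numbers are hypothetical.

```python
from statistics import mean, stdev


def profile_bounds(runs, k=3.0):
    """Return (mean, lower, upper) per time step from equally sampled runs.

    Assumption: bounds are mean +/- k sample standard deviations.
    """
    profile = []
    for samples in zip(*runs):
        m = mean(samples)
        s = stdev(samples)
        profile.append((m, m - k * s, m + k * s))
    return profile


def flag_anomalies(new_run, profile):
    """Return indices of time steps where new_run falls outside the bounds."""
    return [
        i
        for i, (x, (_, lo, hi)) in enumerate(zip(new_run, profile))
        if not lo <= x <= hi
    ]


# Hypothetical data: one measurement type, five prior missions, four time steps.
prior = [
    [10.0, 12.0, 15.0, 11.0],
    [10.2, 11.8, 15.1, 11.1],
    [9.9, 12.1, 14.9, 10.9],
    [10.1, 12.0, 15.2, 11.0],
    [9.8, 12.1, 14.8, 11.0],
]
new_mission = [10.0, 12.0, 17.0, 11.0]
anomalies = flag_anomalies(new_mission, profile_bounds(prior))
```

With only five prior missions the sample standard deviation is a rough estimate, so the choice of k strongly affects how many points are flagged.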
T59. VIRTUAL REALITY ASSESSMENT OF FUNCTIONAL CAPACITY IN EARLY SCHIZOPHRENIA: ASSOCIATIONS WITH NEUROCOGNITION, FUNCTIONAL CAPACITY PERFORMANCE, AND DAILY FUNCTIONING

PubMed Central

Ventura, Joseph; Welikson, Tamara; Subotnik, Kenneth L; Ered, Arielle; Keefe, Richard; Hellemann, Gerhard H; Nuechterlein, Keith H

2018-01-01

Background: Research using virtual reality assessment of functional capacity has shown promise as a reliable and valid way to assess treatment response in patients with established schizophrenia. There has been little work on virtual reality based assessments of functional capacity for patients in the early phase of schizophrenia. We examined whether virtual reality based assessment methods reveal functional capacity deficits in young patients and relevant relationships with established measures of neurocognition, functional capacity performance, and daily functioning. Methods: The sample consisted of UCLA Aftercare Research Program patients (n=42) who were diagnosed by trained raters administering the SCID and who met criteria for schizophrenia, schizoaffective disorder, or schizophreniform disorder, and screened normal control subjects (n=13). Patients were within 2 years of their first psychotic episode upon clinic entry, were an average of 23.2 years old, and had an average of 12.9 years of education. The Virtual Reality Functional Capacity Assessment Tool (VRFCAT) was the computer-based measure of functional capacity. We used the MATRICS Consensus Cognitive Battery (MCCB) as an objective measure of neurocognition and the UCSD Performance-Based Skills Assessment (UPSA) to assess functional capacity performance. The Global Functioning Scale: Role and Social, and the Role Functioning Scale were used to assess work and school performance, familial interactions, and social functioning. Results: We were able to confirm that the deficit in functional capacity performance measured using VRFCAT is present in the early course of schizophrenia in that the patients were slower and committed more errors (M=830.41) as compared with normal controls (M=716.84; t=3.0, p<.01). Virtual reality based assessment of functional capacity was correlated with objective measures of neurocognition (MCCB Overall Composite; r=-.71, p<.01), standard approaches to functional capacity assessment (UPSA; r=-.66, p<.01), work and school functioning (r=-.52, p<.01), and level of social relationships (r=-.43, p<.03), but not familial relationships (r=-.03, p=.87). Interestingly, neither neurocognition (MCCB) nor functional capacity performance (UPSA) was correlated with the level of familial relationships. Discussion: We extend previous findings in that even patients in the early course of schizophrenia showed virtual reality based functional capacity performance deficits when compared with normal control subjects. Virtual reality based performance was correlated with neurocognition, suggesting that it may be sensitive to changes in cognition. Furthermore, correlations with everyday work/school and social functioning indicate promise as a co-primary measure to index change in functioning in response to treatment. Interestingly, none of our measures of functional capacity or neurocognition were correlated with familial relationships, indicating that the determinants of family interactions might be driven by factors other than cognitive capacities.
TAMDAR Sensor Validation in 2003 AIRS II

NASA Technical Reports Server (NTRS)

Daniels, Taumi S.; Murray, John J.; Anderson, Mark V.; Mulally, Daniel J.; Jensen, Kristopher R.; Grainger, Cedric A.; Delene, David J.

2005-01-01

This study entails an assessment of TAMDAR in situ temperature, relative humidity and winds sensor data from seven flights of the UND Citation II. These data are undergoing rigorous assessment to determine their viability to significantly augment domestic Meteorological Data Communications Reporting System (MDCRS) and the international Aircraft Meteorological Data Reporting (AMDAR) system observational databases to improve the performance of regional and global numerical weather prediction models. NASA Langley Research Center participated in the Second Alliance Icing Research Study from November 17 to December 17, 2003. TAMDAR data taken during this period is compared with validation data from the UND Citation. The data indicate acceptable performance of the TAMDAR sensor when compared to measurements from the UND Citation research instruments.
Balanced scorecards for specialists: a tool for quality improvement.

PubMed

Marr, Thomas J; Mullen, Deborah M

2004-04-01

This article describes a program that HealthPartners uses to assess and compare the performance of specialists that serve its members. HealthPartners' Balanced Scorecards program focuses on cardiologist and orthopedist practices in the Minneapolis/St. Paul metro area and St. Cloud, Minnesota. The scorecards assess the clinical and business processes of specialist practices, their use of resources, the degree to which patients and referring physicians are satisfied with their performance, and their patient outcomes. Unblinded comparative data is made available to referring physicians, employers, and consumers only after each individual specialist group has had the opportunity to review its own data against blinded results, discuss the methodology, and comment on the results.
[Impairment of executive function in elderly patients with major unipolar depression: influence of psychomotor retardation].

PubMed

Baudic, Sophie; Benisty, Sarah; Dalla Barba, Gianfrano; Traykov, Latchezar

2007-03-01

The results from several studies assessing executive function in depressed patients compared to control subjects varied from significant impairment to normal performance. To assess executive impairment in elderly patients with major unipolar depression and to evaluate the influence of psychomotor retardation and severity of depression on the executive deficits, the performance of 15 elderly patients with unipolar depression was compared to that of 15 elderly control subjects on executive tasks. The severity of depression was evaluated by the Montgomery and Asberg depression scale and that of psychomotor retardation by Widlöcher's scale. In depressed patients, deficits were found on tasks assessing cognitive flexibility (Modified card sorting test (MCST) and Trail making test B), planning and elaboration of strategies (cognitive estimates), motor initiation (graphic sequences), categorisation and hypothesis making (MCST) and interference resistance (Stroop test). However, depressed patients performed normally on the Hayling test assessing the inhibition processes. Intensity of psychomotor retardation was not correlated with performance on executive tasks. Conversely, severity of depression was related to the scores of MCST (number of errors and perseverations), Stroop and Hayling tests (time taken to complete the end of the sentence). Unipolar depressed patients showed deficits in most tasks assessing executive function. However, inhibition processes appeared to be intact in depressed patients although their implementation was difficult. The severity of depression but not that of psychomotor retardation was associated with executive deficits.

Can the epirubicin cardiotoxicity in cancer patients be prevented by angiotensin converting enzyme inhibitors?

PubMed

Radulescu, D; Buzdugan, E; Ciuleanu, T E; Todor, N; Stoicescu, L

2013-01-01

The aim of this study was to assess whether treatment with angiotensin converting enzyme inhibitors (ACEI) can prevent the alteration of left ventricular systolic and diastolic performance in cancer patients treated with different chemotherapy regimens containing epirubicin. In this prospective study, 68 patients with different malignant tumors treated with epirubicin and perindopril in different chemotherapy protocols (study group), and a gender- and age-matched group of 68 patients with different malignant tumors treated with epirubicin without perindopril in different chemotherapy protocols (control group), were assessed by Doppler echocardiography. Left ventricular systolic function was assessed by measuring left ventricular ejection fraction (EF). Left ventricular diastolic function was assessed by Doppler ultrasound by evaluating the transmitral flow. We also assessed the QTc on 12-lead electrocardiograms. At the end of chemotherapy, left ventricular systolic function was less altered in the study group (epirubicin+ACEI) than in the control group (epirubicin alone). We documented a significantly deteriorated left ventricular diastolic function in both groups at the completion of chemotherapy. QTc time in both arms was also significantly prolonged. In the present echo-Doppler study we documented a preserved left ventricular systolic performance in patients with various malignancies treated with epirubicin plus perindopril. Although co-treatment with ACEI prevented the alteration of systolic performance, it failed to prevent the deterioration of left ventricular diastolic performance due to poor left ventricular compliance.
Impact of a Paper vs Virtual Simulated Patient Case on Student-Perceived Confidence and Engagement.

PubMed

Barnett, Susanne G; Gallimore, Casey E; Pitterle, Michael; Morrill, Josh

2016-02-25

To evaluate the effect of an online case simulation vs a paper case on student confidence and engagement. Students enrolled in a pharmacotherapy laboratory course completed a patient case scenario as a component of an osteoarthritis laboratory module. Two laboratory sections used a paper case (n=53); three sections used an online virtual case simulation (n=81). Student module performance was assessed through a submitted subjective objective assessment plan (SOAP) note. Students completed pre/post surveys to measure self-perceived confidence in providing medication management. The simulation group completed postmodule questions related to realism and engagement of the online virtual case simulation. Group assessments were performed using chi-square and Mann-Whitney tests. A significant increase in all 13 confidence items was seen in both student groups following completion of the laboratory module. The simulation group had a greater increase in confidence than the paper group in assessing medication efficacy and documenting a thorough assessment. Comparing the online virtual simulation to a paper case, students agreed the learning experience increased interest, enjoyment, relevance, and realism. The simulation group performed better on the subjective SOAP note domain, though no differences in total SOAP note scores were found between the two groups. Virtual case simulations result in increased student engagement and may lead to improved documentation performance in the subjective domain of SOAP notes. However, virtual patient cases may offer limited benefit over paper cases in improving overall student self-confidence to provide medication management.
Who to Interview? Low Adherence by U.S. Medical Schools to Medical Student Performance Evaluation Format Makes Resident Selection Difficult.

PubMed

Boysen-Osborn, Megan; Yanuck, Justin; Mattson, James; Toohey, Shannon; Wray, Alisa; Wiechmann, Warren; Lahham, Shadi; Langdorf, Mark I

2017-01-01

The Medical Student Performance Evaluation (MSPE) appendices provide a program director with comparative performance for a student's academic and professional attributes, but they are frequently absent or incomplete. We reviewed MSPEs from applicants to our emergency medicine residency program from 134 of 136 (99%) U.S. allopathic medical schools, over two application cycles (2012-13, 2014-15). We determined the degree of compliance with each of the five recommended MSPE appendices. Only three (2%) medical schools were compliant with all five appendices. The medical school information page (MSIP, appendix E) was present most commonly (85%), followed by comparative clerkship performance (appendix B, 82%), overall performance (appendix D, 59%), preclinical performance (appendix A, 57%), and professional attributes (appendix C, 18%). Few schools (7%) provided student-specific, comparative professionalism assessments. Medical schools inconsistently provide graphic, comparative data for their students in the MSPE. Although program directors (PD) value evidence of an applicant's professionalism when selecting residents, medical schools rarely provide such useful, comparative professionalism data in their MSPEs. As PDs seek to evaluate applicants based on academic performance and professionalism, rather than standardized testing alone, medical schools must make MSPEs more consistent, objective, and comparative.
EVALUATION OF THE HTA CORE MODEL FOR NATIONAL HEALTH TECHNOLOGY ASSESSMENT REPORTS: COMPARATIVE STUDY AND EXPERIENCES FROM EUROPEAN COUNTRIES.

PubMed

Kõrge, Kristina; Berndt, Nadine; Hohmann, Juergen; Romano, Florence; Hiligsmann, Mickael

2017-01-01

The health technology assessment (HTA) Core Model® is a tool for defining and standardizing the elements of HTA analyses within several domains for producing structured reports. This study explored the parallels between the Core Model and a national HTA report. Experiences from various European HTA agencies were also investigated to determine the Core Model's adaptability to national reports. A comparison between a national report on Genetic Counseling, produced by the Cellule d'expertise médicale Luxembourg, and the Core Model was performed to identify parallels in terms of relevant and comparable assessment elements (AEs). Semi-structured interviews with five representatives from European HTA agencies were performed to assess their user experiences with the Core Model. The comparative study revealed that 50 percent of the total number (n = 144) of AEs in the Core Model were relevant for the national report. Of these 144 AEs from the Core Model, 34 (24 percent) were covered in the national report. Some AEs were covered only partly. The interviewees emphasized flexibility in using the Core Model and stated that the most important aspects to be evaluated include characteristics of the disease and technology, clinical effectiveness, economic aspects, and safety. In the present study, the national report covered an acceptable number of AEs of the Core Model. These results need to be interpreted with caution because only one comparison was performed. The Core Model can be used in a flexible manner, applying only those elements that are relevant from the perspective of the technology assessment and specific country context.
Characterization of controlled bone defects using 2D and 3D ultrasound imaging techniques.

PubMed

Parmar, Biren J; Longsine, Whitney; Sabonghy, Eric P; Han, Arum; Tasciotti, Ennio; Weiner, Bradley K; Ferrari, Mauro; Righetti, Raffaella

2010-08-21

Ultrasound is emerging as an attractive alternative modality to standard x-ray and CT methods for bone assessment applications. As of today, however, there is a lack of systematic studies that investigate the performance of diagnostic ultrasound techniques in bone imaging applications. This study aims at understanding the performance limitations of new ultrasound techniques for imaging bones in controlled experiments in vitro. Experiments are performed on samples of mammalian and non-mammalian bones with controlled defects with size ranging from 400 µm to 5 mm. Ultrasound findings are statistically compared with those obtained from the same samples using standard x-ray imaging modalities and optical microscopy. The results of this study demonstrate that it is feasible to use diagnostic ultrasound imaging techniques to assess sub-millimeter bone defects in real time and with high accuracy and precision. These results also demonstrate that ultrasound imaging techniques perform comparably better than x-ray imaging and optical imaging methods, in the assessment of a wide range of controlled defects both in mammalian and non-mammalian bones. In the future, ultrasound imaging techniques might provide a cost-effective, real-time, safe and portable diagnostic tool for bone imaging applications.
Second- to third-trimester longitudinal growth assessment for prediction of small-for-gestational age and late fetal growth restriction.

PubMed

Caradeux, J; Eixarch, E; Mazarico, E; Basuki, T R; Gratacós, E; Figueras, F

2018-02-01

Detection of fetal growth restriction (FGR) remains poor and most screening strategies rely on cross-sectional evaluation of fetal size during the third trimester. A longitudinal and individualized approach has been proposed as an alternative method of evaluation. The aim of this study was to compare second- to third-trimester longitudinal growth assessment to cross-sectional evaluation in the third trimester for the prediction of small-for-gestational age (SGA) and late FGR in low-risk singleton pregnancy. This was a prospective cohort study of 2696 unselected consecutive low-risk singleton pregnancies scanned at 21 ± 2 and 32 ± 2 weeks. For cross-sectional growth assessment, abdominal circumference (AC) measurements were transformed to z-values according to the INTERGROWTH-21st standards. Longitudinal growth assessment was performed by calculating the AC z-velocity and the second- to third-trimester AC conditional growth centile. Longitudinal assessment was compared with cross-sectional assessment at 32 weeks. Association of cross-sectional and longitudinal evaluations with SGA and late FGR was assessed by logistic regression analysis. Predictive performance was determined by receiver-operating characteristics curve analysis. In total, 210 (7.8%) newborns were classified as SGA and 103 (3.8%) as late FGR. Neither longitudinal measurement improved the association with SGA or late FGR provided by cross-sectional evaluation of AC z-score at 32 weeks. Areas under the curves of AC z-velocity and conditional AC growth were significantly smaller than those of cross-sectional AC z-scores (P < 0.001), although AC z-velocity performed significantly better than did conditional AC growth (P < 0.001). Longitudinal assessment of fetal growth from the second to third trimester has a low predictive capacity for SGA and late FGR in low-risk singleton pregnancy compared with cross-sectional growth evaluation. Copyright © 2017 ISUOG. Published by John Wiley & Sons Ltd.
[Comparison between administrative and clinical databases in the evaluation of cardiac surgery performance].

PubMed

Rosato, Stefano; D'Errigo, Paola; Badoni, Gabriella; Fusco, Danilo; Perucci, Carlo A; Seccareccia, Fulvia

2008-08-01

The availability of two contemporary sources of information about coronary artery bypass graft (CABG) interventions made it possible (1) to verify the feasibility of performing outcome evaluation studies using administrative data sources, and (2) to compare hospital performance obtained using the CABG Project clinical database with hospital performance derived from current administrative data. Interventions recorded in the CABG Project were linked to the hospital discharge record (HDR) administrative database. Only the linked records were considered for subsequent analyses (46% of the total CABG Project). A new selected population "clinical card-HDR" was then defined. Two independent risk-adjustment models were applied, each of them using information derived from one of the two different sources. Then, HDR information was supplemented with some patient preoperative conditions from the CABG clinical database. The two models were compared in terms of their adaptability to data. Hospital performances identified by the two models as significantly different from the mean were compared. In only 4 of the 13 hospitals considered for analysis, the results obtained using the HDR model did not completely overlap with those obtained by the CABG model. When comparing statistical parameters of the HDR model and the HDR model + patient preoperative conditions, the latter showed the best adaptability to data. In this "clinical card-HDR" population, hospital performance assessment obtained using information from the clinical database is similar to that derived from the use of current administrative data. However, when risk-adjustment models built on administrative databases are supplemented with a few clinical variables, their statistical parameters improve and hospital performance assessment becomes more accurate.
Graders' Mathematics Achievement

ERIC Educational Resources Information Center

Bond, John B.; Ellis, Arthur K.

2013-01-01

The purpose of this experimental study was to investigate the effects of metacognitive reflective assessment instruction on student achievement in mathematics. The study compared the performance of 141 students who practiced reflective assessment strategies with students who did not. A posttest-only control group design was employed, and results…
Usefulness of the Montreal Cognitive Assessment (MoCA) in Huntington's disease.

PubMed

Gluhm, Shea; Goldstein, Jody; Brown, Daniel; Van Liew, Charles; Gilbert, Paul E; Corey-Bloom, Jody

2013-10-01

The Montreal Cognitive Assessment (MoCA) is a brief screening instrument for dementia that is sensitive to executive dysfunction. This study examined its usefulness for assessing cognitive performance in mild, moderate, and severe Huntington's disease (HD), compared with the use of the Mini-Mental State Examination (MMSE). We compared MoCA and MMSE total scores and the number of correct answers in 5 cognitive-specific domains in 104 manifest HD patients and 100 matched controls. For the total HD sample, and for the moderate and severe patients, significant differences between both MoCA and MMSE total scores and almost all cognitive-specific domains emerged. Even mild HD subjects showed significant differences with regard to total score and several cognitive domains on both instruments. We conclude that the MoCA, although not necessarily superior to the MMSE, is a useful instrument for assessing cognitive performance over a broad level of functioning in HD. © 2013 Movement Disorder Society.
Assessment of eye lens doses for workers during interventional radiology procedures.

PubMed

Urboniene, A; Sadzeviciene, E; Ziliukas, J

2015-07-01

The assessment of eye lens doses for workers during interventional radiology (IR) procedures was performed using a new eye lens dosemeter. In parallel, the results of routine individual monitoring were analysed and compared with the results obtained from measurements with a new eye lens dosemeter. The eye lens doses were assessed using Hp(3) measured at the level of the eyes and were compared with Hp(10) measured with the whole-body dosemeter above the lead collar. The information about use of protective measures, the number of performed interventional procedures per month and their fluoroscopy time was also collected. The assessment of doses to the lens of the eye was done for 50 IR workers at 9 Lithuanian hospitals for the period of 2012-2013. If the use of lead glasses is not taken into account, the estimated maximum annual dose equivalent to the lens of the eye was 82 mSv. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Developing Novel Machine Learning Algorithms to Improve Sedentary Assessment for Youth Health Enhancement.

PubMed

Golla, Gowtham Kumar; Carlson, Jordan A; Huan, Jun; Kerr, Jacqueline; Mitchell, Tarrah; Borner, Kelsey

2016-10-01

Sedentary behavior of youth is an important determinant of health. However, better measures are needed to improve understanding of this relationship and the mechanisms at play, as well as to evaluate health promotion interventions. Wearable accelerometers are considered the standard for assessing physical activity in research, but do not perform well for assessing posture (i.e., sitting vs. standing), a critical component of sedentary behavior. The machine learning algorithms that we propose for assessing sedentary behavior will allow us to re-examine existing accelerometer data to better understand the association between sedentary time and health in various populations. We collected two datasets, a laboratory-controlled dataset and a free-living dataset. We trained machine learning classifiers separately on each dataset and compared performance across datasets. The classifiers predict five postures: sit, stand, sit-stand, stand-sit, and stand/walk. We compared a manually constructed Hidden Markov model (HMM) with an automated HMM from existing software. The manually constructed HMM achieved a higher macro-F1 score on both datasets.
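For readers unfamiliar with the metric named in this abstract, the macro-averaged F1 score is the unweighted mean of the per-class F1 scores, so rare postures count as much as common ones. A minimal sketch of the standard definition follows; the label sequences are hypothetical and are not data from the study, and the handling of a class with no true or predicted samples (scored as 0) is one common convention assumed here.

```python
def macro_f1(y_true, y_pred, labels):
    """Unweighted mean of per-class F1 scores over the given labels."""
    scores = []
    for label in labels:
        pairs = list(zip(y_true, y_pred))
        tp = sum(t == label and p == label for t, p in pairs)
        fp = sum(t != label and p == label for t, p in pairs)
        fn = sum(t == label and p != label for t, p in pairs)
        denom = 2 * tp + fp + fn
        # Assumed convention: a class with no true or predicted samples scores 0.
        scores.append(2 * tp / denom if denom else 0.0)
    return sum(scores) / len(scores)


# The five posture classes named in the abstract.
POSTURES = ["sit", "stand", "sit-stand", "stand-sit", "stand/walk"]

# Hypothetical ground-truth and predicted labels for six time windows.
y_true = ["sit", "sit", "stand", "sit-stand", "stand-sit", "stand/walk"]
y_pred = ["sit", "stand", "stand", "sit-stand", "stand-sit", "stand/walk"]

score = macro_f1(y_true, y_pred, POSTURES)
```

In this toy example one "sit" window is misclassified as "stand", which lowers the F1 of both classes to 2/3 while the other three classes stay at 1.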
Validating the psycholinguistic aspects of LITMUS-CLT: Evidence from Polish and Norwegian.

PubMed

Hansen, Pernille; Simonsen, Hanne Gram; Łuniewska, Magdalena; Haman, Ewa

2017-01-01

The novel assessment tool Cross-Linguistic Lexical Tasks (LITMUS-CLT) aims for comparable cross-linguistic assessment of multilingual children's lexical skills by basing each language version on two language-specific variables: age of acquisition (AoA) and complexity index (CI), a novel measure related to phonology, morphology, exposure and etymology. This article investigates the validity of this methodology, asking whether the underlying properties are robust predictors of children's performance. The Polish and Norwegian CLTs were used to assess 32 bilingual Polish-Norwegian, 34 monolingual Norwegian and 36 monolingual Polish children. The effects of AoA and CI were contrasted with frequency in child directed speech (CDS) and imageability, two known predictors of lexical development. AoA was a reliable predictor of performance within all parts of CLT, in contrast to CI. Apart from AoA, only exposure and CDS frequency had a significant effect within both monolinguals and bilinguals. These results indicate that CLT assesses lexical skills in a cross-linguistically comparable manner, but suggest a revision of the CI measure.
Does practice make perfect? Prospectively comparing effects of 2 amounts of practice on tourniquet use performance.

PubMed

Baruch, Erez N; Benov, Avi; Shina, Avi; Berg, Amy L; Shlaifer, Amir; Glassberg, Elon; Aden, James K; Bader, Tarif; Kragh, John F; Yitzhak, Avraham

2016-12-01

Although a lifesaving skill, currently, there is no consensus for the required amount of practice in tourniquet use. We compared the effect of 2 amounts of practice on performance of tourniquet use by nonmedical personnel. Israeli military recruits without previous medical training underwent their standard tactical first aid course, and their initial performance in use of the Combat Application Tourniquet (CAT; Composite Resources, Rock Hill, SC) was assessed. The educational intervention was to allocate the participants into a monthly tourniquet practice program: either a single-application practice (SAP) group or a triple-application practice (TAP) group. Each group practiced according to its program. After 3 months, the participants' tourniquet use performance was reassessed. Assessments were conducted using the HapMed Leg Tourniquet Trainer (CHI Systems, Fort Washington, PA), a mannequin which measures time and pressure. A total of 151 participants dropped out, leaving 87 in the TAP group and 69 in the SAP group. On initial assessment, the TAP group and the SAP group performed similarly. Both groups improved their performance from the initial to the final assessment. The TAP group improved more than the SAP group in mean application time (faster by 18 vs 8 seconds, respectively; P = .023) and in reducing the proportion of participants who were unable to apply any pressure to the mannequin (less by 18% vs 8%, respectively; P = .009). Three applications per monthly practice session were superior to one. This is the first prospective validation of a tourniquet practice program based on objective measurements. Copyright © 2016 Elsevier Inc. All rights reserved.
What does the multiple mini interview have to offer over the panel interview?

PubMed

Pau, Allan; Chen, Yu Sui; Lee, Verna Kar Mun; Sow, Chew Fei; De Alwis, Ranjit

2016-01-01

This paper compares the panel interview (PI) performance with the multiple mini interview (MMI) performance and indication of behavioural concerns of a sample of medical school applicants. The acceptability of the MMI was also assessed. All applicants shortlisted for a PI were invited to an MMI. Applicants attended a 30-min PI with two faculty interviewers followed by an MMI consisting of ten 8-min stations. Applicants were assessed on their performance at each MMI station by one faculty. The interviewer also indicated if they perceived the applicant to be a concern. Finally, applicants completed an acceptability questionnaire. From the analysis of 133 (75.1%) completed MMI scoresheets, the MMI scores correlated statistically significantly with the PI scores (r=0.438, p=0.001). Both were not statistically associated with sex, age, race, or pre-university academic ability to any significance. Applicants assessed as a concern at two or more stations performed statistically significantly less well at the MMI when compared with those who were assessed as a concern at one station or none at all. However, there was no association with PI performance. Acceptability scores were generally high, and comparison of mean scores for each of the acceptability questionnaire items did not show statistically significant differences between sex and race categories. Although PI and MMI performances are correlated, the MMI may have the added advantage of more objectively generating multiple impressions of the applicant's interpersonal skill, thoughtfulness, and general demeanour. Results of the present study indicated that the MMI is acceptable in a multicultural context.
What does the multiple mini interview have to offer over the panel interview?

PubMed Central

Pau, Allan; Chen, Yu Sui; Lee, Verna Kar Mun; Sow, Chew Fei; Alwis, Ranjit De

2016-01-01

Introduction This paper compares the panel interview (PI) performance with the multiple mini interview (MMI) performance and indication of behavioural concerns of a sample of medical school applicants. The acceptability of the MMI was also assessed. Materials and methods All applicants shortlisted for a PI were invited to an MMI. Applicants attended a 30-min PI with two faculty interviewers followed by an MMI consisting of ten 8-min stations. Applicants were assessed on their performance at each MMI station by one faculty. The interviewer also indicated if they perceived the applicant to be a concern. Finally, applicants completed an acceptability questionnaire. Results From the analysis of 133 (75.1%) completed MMI scoresheets, the MMI scores correlated statistically significantly with the PI scores (r=0.438, p=0.001). Both were not statistically associated with sex, age, race, or pre-university academic ability to any significance. Applicants assessed as a concern at two or more stations performed statistically significantly less well at the MMI when compared with those who were assessed as a concern at one station or none at all. However, there was no association with PI performance. Acceptability scores were generally high, and comparison of mean scores for each of the acceptability questionnaire items did not show statistically significant differences between sex and race categories. Conclusions Although PI and MMI performances are correlated, the MMI may have the added advantage of more objectively generating multiple impressions of the applicant's interpersonal skill, thoughtfulness, and general demeanour. Results of the present study indicated that the MMI is acceptable in a multicultural context. PMID:26873337
Investigating Conversational Dynamics: Interactive Alignment, Interpersonal Synergy, and Collective Task Performance

ERIC Educational Resources Information Center

Fusaroli, Riccardo; Tylén, Kristian

2016-01-01

This study investigates interpersonal processes underlying dialog by comparing two approaches, "interactive alignment" and "interpersonal synergy", and assesses how they predict collective performance in a joint task. While the interactive alignment approach highlights imitative patterns between interlocutors, the synergy…
A computational approach to compare regression modelling strategies in prediction research.

PubMed

Pajouheshnia, Romin; Pestman, Wiebe R; Teerenstra, Steven; Groenwold, Rolf H H

2016-08-25

It is often unclear which approach to fit, assess and adjust a model will yield the most accurate prediction model. We present an extension of an approach for comparing modelling strategies in linear regression to the setting of logistic regression and demonstrate its application in clinical prediction research. A framework for comparing logistic regression modelling strategies by their likelihoods was formulated using a wrapper approach. Five different strategies for modelling, including simple shrinkage methods, were compared in four empirical data sets to illustrate the concept of a priori strategy comparison. Simulations were performed in both randomly generated data and empirical data to investigate the influence of data characteristics on strategy performance. We applied the comparison framework in a case study setting. Optimal strategies were selected based on the results of a priori comparisons in a clinical data set and the performance of models built according to each strategy was assessed using the Brier score and calibration plots. The performance of modelling strategies was highly dependent on the characteristics of the development data in both linear and logistic regression settings. A priori comparisons in four empirical data sets found that no strategy consistently outperformed the others. The percentage of times that a model adjustment strategy outperformed a logistic model ranged from 3.9 to 94.9 %, depending on the strategy and data set. However, in our case study setting the a priori selection of optimal methods did not result in detectable improvement in model performance when assessed in an external data set. The performance of prediction modelling strategies is a data-dependent process and can be highly variable between data sets within the same clinical domain. A priori strategy comparison can be used to determine an optimal logistic regression modelling strategy for a given data set before selecting a final modelling approach.
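The abstract above reports that model performance was assessed using the Brier score. As a minimal sketch, assuming binary outcomes coded 0/1 and predicted probabilities from two hypothetical modelling strategies (the data and names below are illustrative and not taken from the study), the Brier score is the mean squared difference between predicted probability and observed outcome; lower is better.

```python
def brier_score(probs, outcomes):
    """Mean squared difference between predicted probabilities and 0/1 outcomes."""
    if len(probs) != len(outcomes) or not probs:
        raise ValueError("probs and outcomes must be non-empty and of equal length")
    return sum((p - y) ** 2 for p, y in zip(probs, outcomes)) / len(probs)


# Hypothetical predictions from two strategies on the same four patients.
outcomes = [1, 0, 1, 0]
unshrunk = [0.9, 0.2, 0.6, 0.4]   # illustrative values only
shrunk = [0.8, 0.3, 0.6, 0.4]     # illustrative values only

scores = {
    "unshrunk": brier_score(unshrunk, outcomes),
    "shrunk": brier_score(shrunk, outcomes),
}
best = min(scores, key=scores.get)  # strategy with the lowest Brier score
```

A comparison of this kind only ranks strategies on the data supplied; the abstract itself cautions that rankings varied considerably between data sets.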
Effort in Low-Stakes Assessments: What Does It Take to Perform as Well as in a High-Stakes Setting?

ERIC Educational Resources Information Center

Attali, Yigal

2016-01-01

Performance of students in low-stakes testing situations has been a concern and focus of recent research. However, researchers who have examined the effect of stakes on performance have not been able to compare low-stakes performance to truly high-stakes performance of the same students. Results of such a comparison are reported in this article.…
Assessment of individual hand performance in box trainers compared to virtual reality trainers.

PubMed

Madan, Atul K; Frantzides, Constantine T; Shervin, Nina; Tebbit, Christopher L

2003-12-01

Training residents in laparoscopic skills is ideally initiated in an inanimate laboratory with both box trainers and virtual reality trainers. Virtual reality trainers have the ability to score individual hand performance although they are expensive. Here we compared the ability to assess dominant and nondominant hand performance in box trainers with virtual reality trainers. Medical students without laparoscopic experience were utilized in this study (n = 16). Each student performed tasks on the LTS 2000, an inanimate box trainer (placing pegs with both hands and transferring pegs from one hand to another), as well as a task on the MIST-VR, a virtual reality trainer (grasping a virtual object and placing it in a virtual receptacle with alternating hands). A surgeon scored students for the inanimate box trainer exercises (time and errors) while the MIST-VR scored students (time, economy of movements, and errors for each hand). Statistical analysis included Pearson correlations. Errors and time for the one-handed tasks on the box trainer did not correlate with errors, time, or economy measured for each hand by the MIST-VR (r = 0.01 to 0.30; P = NS). Total errors on the virtual reality trainer did correlate with errors on transferring pegs (r = 0.61; P < 0.05). Economy and time of both dominant and nondominant hand from the MIST-VR correlated with time of transferring pegs in the box trainer (r = 0.53 to 0.77; P < 0.05). While individual hand assessment by the box trainer during 2-handed tasks was related to assessment by the virtual reality trainer, individual hand assessment during 1-handed tasks did not correlate with the virtual reality trainer. Virtual reality trainers, such as the MIST-VR, allow assessment of individual hand skills which may lead to improved laparoscopic skill acquisition. It is difficult to assess individual hand performance with box trainers alone.
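The study above relies on Pearson correlations between box-trainer and virtual-reality scores. The following is a minimal sketch of how such a coefficient is computed, assuming paired per-student measurements; the function name and the sample values are hypothetical and do not reproduce the study's data.

```python
from math import sqrt


def pearson_r(x, y):
    """Pearson product-moment correlation between two equal-length sequences."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Sums of cross-products and squared deviations about the means.
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return sxy / sqrt(sxx * syy)


# Hypothetical paired times (seconds) for five students on the two trainers.
box_time = [62.0, 75.0, 58.0, 90.0, 70.0]
vr_time = [40.0, 52.0, 39.0, 61.0, 45.0]
r = pearson_r(box_time, vr_time)
```

With only 16 participants, as in the study, the significance of a given r depends strongly on sample size, so the coefficient alone should not be over-interpreted.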

[Guidelines for the sociomedical assessment of performance in patients suffering from chronic non-malignant diseases of the liver and the bile ducts--for the Medical Assessment Services of the German Pension Fund].

PubMed

Horn, S; Irle, H; Knorr, I; Pottins, I; Rohwetter, M; Schuhknecht, P; Timner, K; Becker, E

2009-06-01

The following guidelines were developed for the medical assessment services of the German pension fund. Starting from day-to-day practice, criteria and attributes to guide decisions for a systematisation of the sociomedical assessment of performance in diseases of the liver and the bile ducts were compiled. The guidelines aim at standardising the sociomedical assessment of performance and help to make the decision-making process more transparent, e. g., for the assessment of applications for decreased earning capacity benefits. The guidelines summarise the typical manifestations of diseases of the liver and the bile ducts and describe the necessary medical information for the sociomedical assessment of performance. Relevant assessment criteria for the medical history, clinical examination, and for diagnostic tests are illustrated. The assessment of the individual's capacity is outlined, taking occupational factors into account. Following the determination of dysfunctions, the remaining abilities and disabilities, respectively, are deduced and compared with occupational demands. Finally, inferences are drawn regarding the occupational capacity of the individual. The guidelines followed from an extended procedure to attain a wide consensus in the setting of the German Pension Fund and an upgraded evidence base.
Static and dynamic single leg postural control performance during dual-task paradigms.

PubMed

Talarico, Maria K; Lynall, Robert C; Mauntel, Timothy C; Weinhold, Paul S; Padua, Darin A; Mihalik, Jason P

2017-06-01

Combining dynamic postural control assessments and cognitive tasks may give clinicians a more accurate indication of postural control under sport-like conditions compared to single-task assessments. We examined postural control, cognitive and squatting performance of healthy individuals during static and dynamic postural control assessments in single- and dual-task paradigms. Thirty participants (female = 22, male = 8; age = 20.8 ± 1.6 years, height = 157.9 ± 13.0 cm, mass = 67.8 ± 20.6 kg) completed single-leg stance and single-leg squat assessments on a force plate individually (single-task) and concurrently (dual-task) with two cognitive assessments, a modified Stroop test and the Brooks Spatial Memory Test. Outcomes included centre of pressure speed, 95% confidence ellipse, squat depth and speed and cognitive test measures (percentage of correct answers and reaction time). Postural control performance varied between postural control assessments and testing paradigms. Participants did not squat as deep and squatted slower (P < 0.001) during dual-task paradigms (≤12.69 ± 3.4 cm squat depth, ≤16.20 ± 4.6 cm·s⁻¹ squat speed) compared to single-task paradigms (14.57 ± 3.6 cm squat depth, 19.65 ± 5.5 cm·s⁻¹ squat speed). The percentage of correct answers did not change across testing conditions, but Stroop reaction time (725.81 ± 59.2 ms; F(2,58) = 7.725, P = 0.001) was slowest during single-leg squats compared to baseline (691.64 ± 80.1 ms; P = 0.038) and single-task paradigms (681.33 ± 51.5 ms; P < 0.001). Dynamic dual-task assessments may be more challenging to the postural control system and may better represent postural control performance during dynamic activities.
[Functional performance of school children diagnosed with developmental delay up to two years of age].

PubMed

Dornelas, Lílian de Fátima; Magalhães, Lívia de Castro

2016-01-01

To compare the functional performance of students diagnosed with developmental delay (DD) up to two years of age with peers exhibiting typical development. Cross-sectional study with functional performance assessment of children diagnosed with DD up to two years of age compared to those with typical development at seven to eight years of age. Each group consisted of 45 children, selected by non-random sampling, evaluated for motor skills, quality of home environment, school participation and performance. ANOVA and the Binomial test for two proportions were used to assess differences between groups. The group with DD had lower motor skills when compared to the typical group. While 66.7% of children in the typical group showed adequate school participation, receiving aid in cognitive and behavioral tasks similar to that offered to other children at the same level, only 22.2% of children with DD showed the same performance. Although 53.3% of the children with DD achieved an academic performance expected for the school level, there were limitations in some activities. Only two indicators of family environment, diversity and activities with parents at home, showed statistically significant difference between the groups, with advantage being shown for the typical group. Children with DD have persistent difficulties at school age, with motor deficit, restrictions in school activity performance and low participation in the school context, as well as significantly lower functional performance when compared to children without DD. A systematic monitoring of this population is recommended to identify needs and minimize future problems. Copyright © 2015 Sociedade de Pediatria de São Paulo. Publicado por Elsevier Editora Ltda. All rights reserved.
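The abstract above compares two proportions (66.7% vs 22.2% adequate school participation, 45 children per group). As a rough illustration only, the sketch below applies a pooled two-proportion z-test with a normal approximation; the counts 30 and 10 are back-calculated from the reported percentages and are therefore an assumption, and the authors' exact test procedure may differ.

```python
from math import erfc, sqrt


def two_proportion_z_test(successes_a, n_a, successes_b, n_b):
    """Pooled two-proportion z-test; returns (z, two-sided p-value)."""
    p_a = successes_a / n_a
    p_b = successes_b / n_b
    pooled = (successes_a + successes_b) / (n_a + n_b)
    se = sqrt(pooled * (1 - pooled) * (1 / n_a + 1 / n_b))
    z = (p_a - p_b) / se
    # Two-sided tail probability of the standard normal distribution.
    p_value = erfc(abs(z) / sqrt(2))
    return z, p_value


# Counts inferred from 66.7% and 22.2% of 45 children (assumed, not reported).
z, p_value = two_proportion_z_test(30, 45, 10, 45)
```

Under these assumed counts the difference is large relative to its standard error, which is consistent with the direction of the finding reported in the abstract.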
The effect of postnatal depression on mother-infant interaction, infant response to the Still-face perturbation, and performance on an Instrumental Learning task.

PubMed

Stanley, Charles; Murray, Lynne; Stein, Alan

2004-01-01

A representative community sample of primiparous depressed women and a nondepressed control group were assessed while in interaction with their infants at 2 months postpartum. At 3 months, infants were assessed on the Still-face perturbation of face to face interaction, and a subsample completed an Instrumental Learning paradigm. Compared to nondepressed women, depressed mothers' interactions were both less contingent and less affectively attuned to infant behavior. Postnatal depression did not adversely affect the infant's performance in either the Still-face perturbation or the Instrumental Learning assessment. Maternal responsiveness in interactions at 2 months predicted the infant's performance in the Instrumental Learning assessment but not in the Still-face perturbation. The implications of these findings for theories of infant cognitive and emotional development are discussed.
A national framework for flood forecasting model assessment for use in operations and investment planning over England and Wales

NASA Astrophysics Data System (ADS)

Moore, Robert J.; Wells, Steven C.; Cole, Steven J.

2016-04-01

It has been common for flood forecasting systems to be commissioned at a catchment or regional level in response to local priorities and hydrological conditions, leading to variety in system design and model choice. As systems mature and efficiencies of national management are sought, there can be a drive towards system rationalisation, gaining an overview of model performance and consideration of simplification through model-type convergence. Flood forecasting model assessments, whilst overseen at a national level, may be commissioned and managed at a catchment and regional level, take a variety of forms and be large in number. This presents a challenge when an integrated national assessment is required to guide operational use of flood forecasts and plan future investment in flood forecasting models and supporting hydrometric monitoring. This contribution reports on how a nationally consistent framework for flood forecasting model performance has been developed to embrace many past, ongoing and future assessments for local river systems by engineering consultants across England & Wales. The outcome is a Performance Summary for every site model assessed which, on a single page, contains relevant catchment information for context, a selection of overlain forecast and observed hydrographs and a set of performance statistics with associated displays of novel condensed form. One display provides performance comparison with other models that may exist for the site. The performance statistics include skill scores for forecasting events (flow/level threshold crossings) of differing severity/rarity, indicating their probability and likely timing, which have real value in an operational setting. The local models assessed can be of any type and span rainfall-runoff (conceptual and transfer function) and flow routing (hydrological and hydrodynamic) forms. 
Also accommodated by the framework is the national G2G (Grid-to-Grid) distributed hydrological model, providing area-wide coverage across the fluvial rivers of England and Wales, which can be assessed at gauged sites. Thus the performance of the national G2G model forecasts can be directly compared with that from the local models. The Performance Summary for each site model is complemented by a national spatial analysis of model performance stratified by model-type, geographical region and forecast lead-time. The map displays provide an extensive evidence-base that can be interrogated, through a Flood Forecasting Model Performance web portal, to reveal fresh insights into comparative performance across locations, lead-times and models. This work was commissioned by the Environment Agency in partnership with Natural Resources Wales and the Flood Forecasting Centre for England and Wales.
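The abstract above mentions skill scores for forecasting threshold-crossing events but does not say which scores were used. As one hedged illustration, the sketch below computes three commonly used categorical scores (probability of detection, false alarm ratio and critical success index) from per-time-step exceedances of a flow threshold. The function name, the simplified per-time-step matching and the data are assumptions for illustration, not the framework's actual method.

```python
def threshold_skill_scores(observed, forecast, threshold):
    """Categorical skill scores for exceedance of a flow/level threshold.

    Each time step is classed as a hit, miss or false alarm by comparing
    whether the observed and forecast values exceed the threshold.
    """
    if len(observed) != len(forecast):
        raise ValueError("observed and forecast must have equal length")
    hits = misses = false_alarms = 0
    for obs, fc in zip(observed, forecast):
        obs_event = obs > threshold
        fc_event = fc > threshold
        if obs_event and fc_event:
            hits += 1
        elif obs_event:
            misses += 1
        elif fc_event:
            false_alarms += 1

    def ratio(num, den):
        # Return None when a score is undefined (no relevant events).
        return num / den if den else None

    return {
        "pod": ratio(hits, hits + misses),                  # probability of detection
        "far": ratio(false_alarms, hits + false_alarms),    # false alarm ratio
        "csi": ratio(hits, hits + misses + false_alarms),   # critical success index
    }


# Hypothetical flows (m3/s) at one site and a hypothetical threshold of 4.
result = threshold_skill_scores([1, 5, 6, 2], [1, 6, 2, 7], threshold=4)
```

An operational assessment would typically match forecast and observed crossings as events with timing tolerances and by lead time, rather than step by step as in this simplified sketch.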
Ergonomics assessment of composite ballistic inserts for bullet- and fragment-proof vests.

PubMed

Majchrzycka, Katarzyna; Brochocka, Agnieszka; Luczak, Anna; Lężak, Krzysztof

2013-01-01

Personal protective equipment worn by uniformed services (e.g., the police and the military) must ensure protection against bodily injuries. However, a high degree of protection is always associated with significant discomfort. This article presents the results of an assessment of the ergonomics parameters of new special purpose products, ballistic inserts with improved ballistic resistance, and an assessment of the impact of the burden related to their use on the psychomotor performance of the subjects. An obstacle course and subjective ergonomics assessment questionnaires were used in tests. Thermal discomfort was also assessed. Psychological testing included tests enabling an assessment of the subjects' cognitive and psychomotor performance, and a subjective assessment of mental load. The tests did not show any decrease in the comfort of use of the new inserts with improved ballistic resistance compared to the inserts currently used.
Comparison of the Predictive Performance and Interpretability of Random Forest and Linear Models on Benchmark Data Sets.

PubMed

Marchese Robinson, Richard L; Palczewska, Anna; Palczewski, Jan; Kidley, Nathan

2017-08-28

The ability to interpret the predictions made by quantitative structure-activity relationships (QSARs) offers a number of advantages. While QSARs built using nonlinear modeling approaches, such as the popular Random Forest algorithm, might sometimes be more predictive than those built using linear modeling approaches, their predictions have been perceived as difficult to interpret. However, a growing number of approaches have been proposed for interpreting nonlinear QSAR models in general and Random Forest in particular. In the current work, we compare the performance of Random Forest to those of two widely used linear modeling approaches: linear Support Vector Machines (SVMs) (or Support Vector Regression (SVR)) and partial least-squares (PLS). We compare their performance in terms of their predictivity as well as the chemical interpretability of the predictions using novel scoring schemes for assessing heat map images of substructural contributions. We critically assess different approaches for interpreting Random Forest models as well as for obtaining predictions from the forest. We assess the models on a large number of widely employed public-domain benchmark data sets corresponding to regression and binary classification problems of relevance to hit identification and toxicology. We conclude that Random Forest typically yields comparable or possibly better predictive performance than the linear modeling approaches and that its predictions may also be interpreted in a chemically and biologically meaningful way. In contrast to earlier work looking at interpretation of nonlinear QSAR models, we directly compare two methodologically distinct approaches for interpreting Random Forest models. The approaches for interpreting Random Forest assessed in our article were implemented using open-source programs that we have made available to the community. 
These programs are the rfFC package (https://r-forge.r-project.org/R/?group_id=1725) for the R statistical programming language and the Python program HeatMapWrapper [https://doi.org/10.5281/zenodo.495163] for heat map generation.
Reading, writing, and phonological processing skills of adolescents with 10 or more years of cochlear implant experience.

PubMed

Geers, Ann E; Hayes, Heather

2011-02-01

This study had three goals: (1) to document the literacy skills of deaf adolescents who received cochlear implants (CIs) as preschoolers; (2) to examine reading growth from elementary grades to high school; (3) to assess the contribution of early literacy levels and phonological processing skills, among other factors, to literacy levels in high school. A battery of reading, spelling, expository writing, and phonological processing assessments were administered to 112 high school (CI-HS) students, ages 15.5 to 18.5 yrs, who had participated in a reading assessment battery in early elementary grades (CI-E), ages 8.0 to 9.9 yrs. The CI-HS students' performance was compared with either a control group of hearing peers (N = 46) or hearing norms provided by the assessment developer. Many of the CI-HS students (47 to 66%) performed within or above the average range for hearing peers on reading tests. When compared with their CI-E performance, good early readers were also good readers in high school. Importantly, the majority of CI-HS students maintained their reading levels over time compared with hearing peers, indicating that the gap in performance was, at the very least, not widening for most students. Written expression and phonological processing tasks posed a great deal of difficulty for the CI-HS students. They were poorer spellers, poorer expository writers, and displayed poorer phonological knowledge than hearing age-mates. Phonological processing skills were a critical predictor of high school literacy skills (reading, spelling, and expository writing), accounting for 39% of variance remaining after controlling for child, family, and implant characteristics. Many children who receive CIs as preschoolers achieve age-appropriate literacy levels as adolescents. However, significant delays in spelling and written expression are evident compared with hearing peers. 
For children with CIs, the development of phonological processing skills is not just important for early reading skills, such as decoding, but is critical for later literacy success as well.
Quality Assurance in American and British Higher Education: A Comparison.

ERIC Educational Resources Information Center

Stanley, Elizabeth C.; Patrick, William J.

1998-01-01

Compares quality improvement and accountability processes in the United States and United Kingdom. For the United Kingdom, looks at quality audits, institutional assessment, standards-based quality assurance, and research assessment; in the United States, looks at regional and specialized accreditation, performance indicator systems, academic…
Identifying the Comparative Academic Performance of Secondary Schools

ERIC Educational Resources Information Center

Bendikson, Linda; Hattie, John; Robinson, Viviane

2011-01-01

Purpose: One of the features of the New Zealand secondary schools system is that achievement closely reflects the taught curriculum. The National Certificate of Educational Achievement (NCEA) directly assesses student achievement on the secondary school curriculum through a combination of criterion-based internal and external assessments. The…
The feasibility and concurrent validity of performing the Movement Assessment Battery for Children - 2nd Edition via telerehabilitation technology.

PubMed

Nicola, Kristy; Waugh, Jemimah; Charles, Emily; Russell, Trevor

2018-06-01

In rural and remote communities children with motor difficulties have less access to rehabilitation services. Telerehabilitation technology is a potential method to overcome barriers restricting access to healthcare in these areas. Assessment is necessary to guide clinical reasoning; however it is unclear which paediatric assessments can be administered remotely. The Movement Assessment Battery for Children - 2nd Edition is commonly used by various health professionals to assess motor performance of children. The aim of this study was to investigate the feasibility and concurrent validity of performing the Movement Assessment Battery for Children - 2nd Edition remotely via telerehabilitation technology compared to the conventional in-person method. Fifty-nine children enrolled in a state school (5-11 years old) volunteered to perform one in-person and one telerehabilitation mediated assessment. The order of the method of delivery and the therapist performing the assessment were randomized. After both assessments were complete, a participant satisfaction questionnaire was completed by each child. The Bland-Altman limits of agreement for the total test standard score were -3.15 to 3.22 which is smaller than a pre-determined clinically acceptable margin based on the smallest detectable change. This study establishes the feasibility and concurrent validity of the administration of the Movement Assessment Battery for Children - 2nd Edition via telerehabilitation technology. Overall, participants perceived their experience with telerehabilitation positively. Copyright © 2018 Elsevier Ltd. All rights reserved.
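The study above reports Bland-Altman limits of agreement (-3.15 to 3.22) between in-person and telerehabilitation scores. The following is a minimal sketch of the standard calculation, assuming paired scores per child and the conventional 1.96 multiplier; the sample scores are hypothetical and the study's exact computation is not described in the abstract.

```python
from statistics import mean, stdev


def bland_altman_limits(method_a, method_b, multiplier=1.96):
    """Return (lower limit, bias, upper limit) for paired measurements."""
    if len(method_a) != len(method_b) or len(method_a) < 2:
        raise ValueError("need paired measurements with at least two pairs")
    diffs = [a - b for a, b in zip(method_a, method_b)]
    bias = mean(diffs)            # mean difference between the two methods
    spread = stdev(diffs)         # sample standard deviation of differences
    return bias - multiplier * spread, bias, bias + multiplier * spread


# Hypothetical standard scores for four children under each delivery method.
in_person = [10, 12, 9, 11]
telerehab = [9, 12, 10, 10]
lower, bias, upper = bland_altman_limits(in_person, telerehab)
```

Agreement is then judged by whether the interval from `lower` to `upper` falls inside a clinically acceptable margin chosen in advance, as the authors did using the smallest detectable change.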
Do cats with a cranial cruciate ligament injury and osteoarthritis demonstrate a different gait pattern and behaviour compared to sound cats?

PubMed

Stadig, Sarah; Lascelles, B Duncan X; Bergh, Anna

2016-10-20

Osteoarthritis (OA) is a common cause of chronic pain and dysfunction in older cats. The majority of cats with OA do not show signs of overt lameness, yet cats with orthopaedic disease are known to redistribute their body weight from the affected limb. OA can cause changes in the cat's behaviour, which are often misinterpreted as signs of aging. The aim of the present study was to investigate if cats with a previous cranial cruciate ligament (CCL) injury perform differently on the pressure mat and exhibit different behaviour compared to sound cats according to the owner's subjective assessment. Ten cats with a previous CCL injury were assessed with a pressure mat system and their owners were asked to complete an assessment questionnaire. The results were compared to those of 15 sound cats, matched to have the same weight and body condition score. The front/hind limb index for peak vertical force (PVF) was significantly higher for CCL cats, and there was a decreased PVF and vertical impulse (VI) on the affected hindlimb compared to the unaffected one. The results indicate that cats with a previous CCL injury put less weight on the affected hindlimb but for a longer time. There was a significantly higher owner assessment questionnaire score for the group of cats with CCL injury compared to sound cats. Cats with a previous CCL injury have a different gait pattern compared to sound cats and a different behaviour according to owner subjective assessment. It is of great importance that further studies are performed to investigate the long term effects of CCL injury as a cause of pain and physical dysfunction, and its role in the development of OA in cats. Improved assessment tools for chronic pain caused by OA in cats are needed, both to facilitate diagnosis and to evaluate pain-relieving treatment.
The quality of systematic reviews of health-related outcome measurement instruments.

PubMed

Terwee, C B; Prinsen, C A C; Ricci Garotti, M G; Suman, A; de Vet, H C W; Mokkink, L B

2016-04-01

Systematic reviews of outcome measurement instruments are important tools for the selection of instruments for research and clinical practice. Our aim was to assess the quality of systematic reviews of health-related outcome measurement instruments and to determine whether the quality has improved since our previous study in 2007. A systematic literature search was performed in MEDLINE and EMBASE between July 1, 2013, and June 19, 2014. The quality of the reviews was rated using a study-specific checklist. A total of 102 reviews were included. In many reviews the search strategy was considered not comprehensive; in only 59 % of the reviews a search was performed in EMBASE and in about half of the reviews there was doubt about the comprehensiveness of the search terms used for type of measurement instruments and measurement properties. In 41 % of the reviews, compared to 30 % in our previous study, the methodological quality of the included studies was assessed. In 58 %, compared to 55 %, the quality of the included instruments was assessed. In 42 %, compared to 7 %, a data synthesis was performed in which the results from multiple studies on the same instrument were somehow combined. Despite a clear improvement in the quality of systematic reviews of outcome measurement instruments in comparison with our previous study in 2007, there is still room for improvement with regard to the search strategy, and especially the quality assessment of the included studies and the included instruments, and the data synthesis.
Statistical/Documentary Report, 1974 and 1975 Assessments of 17-Year-Old Students, Summary Volume; Functional Literacy Basic Reading Performance.

ERIC Educational Resources Information Center

Gadway, Charles J.; Wilson, H.A.

This document provides statistical data on the 1974 and 1975 Mini-Assessment of Functional Literacy, which was designed to determine the extent of functional literacy among seventeen year olds in America. Also presented are data from comparable test items from the 1971 assessment. Three standards are presented, to allow different methods of…
Achievement Levels of Middle School Students in the Standardized Science and Technology Exam and Formative Assessment Probes: A Comparative Study

ERIC Educational Resources Information Center

Bulunuz, Nermin; Bulunuz, Mizrap; Karagoz, Funda; Tavsanli, Omer Faruk

2016-01-01

The present study has two aims. Firstly, it aims to determine eighth grade students' conceptual understanding of floating and sinking through formative assessment probes. Secondly, it aims to determine whether or not there is a significant difference between students' performance in formative assessment probes and their achievement in the…
Using the SSIS Assessments with Australian Students: A Comparative Analysis of Test Psychometrics to the US Normative Sample

ERIC Educational Resources Information Center

Sherbow, Amanda; Kettler, Ryan J.; Elliott, Stephen N.; Davies, Michael; Dembitzer, Leah

2015-01-01

The Social Skills Improvement System (SSIS; Gresham & Elliott, 2008) is a multiple stage, broadband system for assessing and intervening with children in preschool through 12th grade originally normed in the USA. Two of the assessment components of this system were analysed: (a) the Performance Screening Guides (PSGs); and (b) the Rating…
The sensitivity of laboratory tests assessing driving related skills to dose-related impairment of alcohol: A literature review.

PubMed

Jongen, S; Vuurman, E F P M; Ramaekers, J G; Vermeeren, A

2016-04-01

Laboratory tests assessing driving related skills can be useful as initial screening tools to assess potential drug induced impairment as part of a standardized behavioural assessment. Unfortunately, consensus about which laboratory tests should be included to reliably assess drug induced impairment has not yet been reached. The aim of the present review was to evaluate the sensitivity of laboratory tests to the dose dependent effects of alcohol, as a benchmark, on performance parameters. In total, 179 experimental studies were included. Results show that a cued go/no-go task and a divided attention test with primary tracking and secondary visual search were consistently sensitive to the impairing effects at medium and high blood alcohol concentrations. Driving performance assessed in a simulator was less sensitive to the effects of alcohol as compared to naturalistic, on-the-road driving. In conclusion, replication of results for several potentially useful tests, and their predictive validity for actual driving impairment, deserve further research. In addition, driving simulators should be validated and compared head to head to naturalistic driving in order to increase construct validity. Copyright © 2016 The Authors. Published by Elsevier Ltd. All rights reserved.
Relevance similarity: an alternative means to monitor information retrieval systems

PubMed Central

Dong, Peng; Loh, Marie; Mondry, Adrian

2005-01-01

Background: Relevance assessment is a major problem in the evaluation of information retrieval systems. The work presented here introduces a new parameter, "Relevance Similarity", for the measurement of the variation of relevance assessment. In a situation where individual assessment can be compared with a gold standard, this parameter is used to study the effect of such variation on the performance of a medical information retrieval system. In such a setting, Relevance Similarity is the ratio of assessors who rank a given document the same as the gold standard over the total number of assessors in the group. Methods: The study was carried out on a collection of Critically Appraised Topics (CATs). Twelve volunteers were divided into two groups according to their domain knowledge. They assessed the relevance of retrieved topics obtained by querying a meta-search engine with ten keywords related to medical science. Their assessments were compared to the gold standard assessment, and Relevance Similarities were calculated as the ratio of positive concordance with the gold standard for each topic. Results: The similarity comparison among groups showed that a higher degree of agreement exists among evaluators with more subject knowledge. The performance of the retrieval system was not significantly different as a result of the variations in relevance assessment in this particular query set. Conclusion: In assessment situations where evaluators can be compared to a gold standard, Relevance Similarity provides an alternative evaluation technique to the commonly used kappa scores, which may give paradoxically low scores in highly biased situations such as document repositories containing large quantities of relevant data. PMID:16029513
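The Relevance Similarity ratio defined in the abstract above can be illustrated with a minimal sketch. The function name, the label strings, and the sample data are hypothetical and are not taken from the paper; the code only shows the stated ratio (assessors agreeing with the gold standard over all assessors) for a single document.

```python
from typing import Sequence


def relevance_similarity(assessor_labels: Sequence[str], gold_label: str) -> float:
    """Fraction of assessors whose label for one document equals the gold standard."""
    if not assessor_labels:
        raise ValueError("need at least one assessor label")
    # Count assessors whose judgement matches the gold standard for this document.
    matches = sum(1 for label in assessor_labels if label == gold_label)
    return matches / len(assessor_labels)


# Hypothetical example: six assessors judge one retrieved topic.
labels = ["relevant", "relevant", "not relevant", "relevant", "relevant", "not relevant"]
similarity = relevance_similarity(labels, "relevant")
print(f"Relevance Similarity: {similarity:.3f}")
```

Computing the value per topic and then comparing the distributions between assessor groups would mirror the group comparison described in the abstract, though the paper's exact procedure may differ.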
The flipped classroom allows for more class time devoted to critical thinking.

PubMed

DeRuisseau, Lara R

2016-12-01

The flipped classroom was utilized in a two-semester, high-content science course that enrolled between 50 and 80 students at a small liberal arts college. With the flipped model, students watched ~20-min lectures 2 days/wk outside of class. These videos were recorded via screen capture and included a detailed note outline, PowerPoint slides, and review questions. The traditional format included the same materials, except that lectures were delivered in class each week and spanned the entire period. During the flipped course, the instructor reviewed common misconceptions and asked questions requiring higher-order thinking, and five graded case studies were performed each semester. To determine whether assessments included additional higher-order thinking skills in the flipped vs. traditional model, questions across course formats were compared via Bloom's Taxonomy. Application-level questions that required prediction of an outcome in a new scenario comprised 38 ± 3 vs. 12 ± 1% of summative assessment questions (P < 0.01): flipped vs. traditional. Final letter grades in both formats of the course were compared with major GPA. Students in the flipped model performed better than their GPA predicted, as 85.5% earned a higher grade (vs. 42.2% in the traditional classroom) compared with their major GPA. These data demonstrate that assessments transitioned to more application-level compared with factual knowledge-based questions with this particular flipped model, and students performed better in their final letter grade compared with the traditional lecture format. Although the benefits to a flipped classroom are highlighted, student evaluations did suffer. More detailed studies comparing the traditional and flipped formats are warranted. Copyright © 2016 the American Physiological Society.
Directly Comparing Computer and Human Performance in Language Understanding and Visual Reasoning.

ERIC Educational Resources Information Center

Baker, Eva L.; And Others

Evaluation models are being developed for assessing artificial intelligence (AI) systems in terms of similar performance by groups of people. Natural language understanding and vision systems are the areas of concentration. In simplest terms, the goal is to norm a given natural language system's performance on a sample of people. The specific…

A STUDY OF COGNITIVE DEVELOPMENT AND PERFORMANCE IN CHILDREN WITH NORMAL AND DEFECTIVE HEARING.

ERIC Educational Resources Information Center

TEMPLIN, MILDRED C.

A COMPARATIVE, LONGITUDINAL STUDY WAS CONDUCTED TO EXAMINE SPECIFIC PERFORMANCE CHARACTERISTICS OF DEAF AND NORMAL CHILDREN ON SELECTED COGNITIVE TASKS. THE SAMPLE, DISTRIBUTED INTO 3 AGE CATEGORIES, CONSISTED OF 72 NORMAL AND 60 DEAF CHILDREN. MEASURES WERE SELECTED TO ASSESS THE PERFORMANCE OF SUBJECTS (1) IN DIFFERENT AREAS OF COGNITION, (2) BY…
A Comparative Assessment of the Performance of Select Higher Education Institutes in India

ERIC Educational Resources Information Center

Sahney, Sangeeta; Thakkar, Jitesh

2016-01-01

Purpose: The purpose of this paper is to evaluate the performance of select technical higher education institutes of national importance in India. This helps to judge the efficiency and effectiveness of an institute to provide valuable insights on performance measurement and effectiveness not only to the respective institute but also to…
Impact of familiar and unfamiliar settings on cooking task assessments in frail older adults with poor and preserved executive functions.

PubMed

Provencher, Véronique; Demers, Louise; Gagnon, Lise; Gélinas, Isabelle

2012-05-01

Hospitalized frail older patients are usually assessed for their ability to perform some daily living activities in a clinical setting prior to discharge. However, assessments that take place in this unfamiliar environment might not be as representative of their functional performance as assessments at home. This may be related to a decline in some cognitive components, such as executive functions (EF), which enable one to cope with new environments. This study thus aims to compare cooking task performance in familiar and unfamiliar settings in a population of frail older adults with poor and preserved EF. Thirty-seven frail older adults were assigned to one of two groups: poor EF or preserved EF. Participants performed two cooking tasks in familiar and unfamiliar settings, using a counterbalanced design. Their performance was assessed with a reliable tool based on observation of motor and process skills (Assessment of Motor and Process Skills). Thirty-three participants were retained for analysis. They demonstrated significantly better motor skills (F = 5.536; p = 0.025) and process skills (F = 8.149; p = 0.008) in the familiar setting. The difference between settings was particularly marked for process skills in participants with poor EF (F = 16.920; p < 0.001). This study suggests that a home setting may be preferable for a more accurate assessment of cooking task performance in frail older adults, especially those with poor EF. These findings highlight the risk of underestimating frail older adults' performance when assessed in an unfamiliar setting (e.g. hospital), which could lead to inefficient allocation of home care services.
Critical assessment of I-85 CRCP crack spacing patterns and their implications for long-term performance.

DOT National Transportation Integrated Search

2014-01-01

Transverse crack patterns in Continuously Reinforced Concrete Pavement (CRCP) are important indicators of pavement performance. This study is a) to study the Mean Crack Spacing (MCS) and localized cracks of a newly constructed CRCP and compare th...
ECG-gated imaging of the left atrium and pulmonary veins: Intra-individual comparison of CTA and MRA.

PubMed

Fahlenkamp, U L; Lembcke, A; Roesler, R; Schwenke, C; Huppertz, A; Streitparth, F; Taupitz, M; Hamm, B; Wagner, M

2013-10-01

To compare electrocardiography (ECG)-gated computed tomography angiography (CTA) with ECG-gated magnetic resonance angiography (MRA) for assessment of the left atrium (LA) and pulmonary veins (PVs). Twenty-nine consecutive patients who underwent both cardiac CTA and MRA were evaluated. Contrast-enhanced CTA was performed with prospective ECG-gating using a 320 detector row CT system. Contrast-enhanced MRA was performed with prospective ECG-gating using a 1.5 T MRI system equipped with a 32 channel cardiac coil. MRA was acquired during free-breathing with a navigator-gated inversion-recovery prepared steady-state free precession sequence. Two readers independently assessed the CTA and MRA images for vascular definition of the PVs (from 0, not visualized, to 4, excellent definition) and ostial PV diameters. Variants of LA anatomy were assessed in consensus. CTA was successfully performed in all patients with a mean radiation exposure of 5.1 ± 2.2 mSv. MRA was successfully performed in 27 of 29 patients (93%). Visual definition of PVs was rated significantly higher on CTA compared to MRA (p < 0.0001; reader 1: excellent/good ratings of CTA versus MRA: 100% versus 86%; reader 2: excellent/good ratings of CTA versus MRA: 99% versus 89%). Assessment of ostial PV diameters showed good correlation between CTA and MRA (reader 1: Pearson r = 0.91; reader 2: Pearson r = 0.82). Moreover, agreement between both imaging methods for evaluation of variants of LA anatomy was high (agreement rate of 95%; 95% CI: 92-99%). ECG-gated CTA provides higher image quality compared to ECG-gated MRA. Nevertheless, both CTA and MRA provided similar information on LA anatomy and ostial PV diameters. Copyright © 2013 The Royal College of Radiologists. Published by Elsevier Ltd. All rights reserved.
Loose and Tight GNSS/INS Integrations: Comparison of Performance Assessed in Real Urban Scenarios.

PubMed

Falco, Gianluca; Pini, Marco; Marucco, Gianluca

2017-01-29

Global Navigation Satellite Systems (GNSSs) remain the principal means of positioning in many applications and systems, but in several types of environment, the performance of standalone receivers is degraded. Although many works show the benefits of the integration between GNSS and Inertial Navigation Systems (INSs), tightly-coupled architectures are mainly implemented in professional devices and are based on high-grade Inertial Measurement Units (IMUs). This paper investigates the performance improvements enabled by the tight integration, using low-cost sensors and a mass-market GNSS receiver. Performance is assessed through a series of tests carried out in real urban scenarios and is compared against commercial modules, operating in standalone mode or featuring loosely-coupled integrations. The paper describes the developed tight-integration algorithms with a terse mathematical model and assesses their efficacy from a practical perspective.
Neuropsychological assessment of driving ability and self-evaluation: a comparison between driving offenders and a control group.

PubMed

Zingg, Christina; Puelschen, Dietrich; Soyka, Michael

2009-12-01

The relationship between performance in neuropsychological tests and actual driving performance is unclear and results of studies on this topic differ. This makes it difficult to use neuropsychological tests to assess driving ability. The ability to compensate cognitive deficits plays a crucial role in this context. We compared neuropsychological test results and self-evaluation ratings between three groups: driving offenders with a psychiatric diagnosis relevant for driving ability (mainly alcohol dependence), driving offenders without such a diagnosis and a control group of non-offending drivers. Subjects were divided into two age categories (19-39 and 40-66 years). It was assumed that drivers with a psychiatric diagnosis relevant for driving ability and younger driving offenders without a psychiatric diagnosis would be less able to adequately assess their own capabilities than the control group. The driving offenders with a psychiatric diagnosis showed poorer concentration, reactivity, cognitive flexibility and problem solving, and tended to overassess their abilities in intelligence and attentional functions, compared to the other two groups. Conversely, younger drivers rather underassessed their performance.
Estimating learning outcomes from pre- and posttest student self-assessments: a longitudinal study.

PubMed

Schiekirka, Sarah; Reinhardt, Deborah; Beißbarth, Tim; Anders, Sven; Pukrop, Tobias; Raupach, Tobias

2013-03-01

Learning outcome is an important measure for overall teaching quality and should be addressed by comprehensive evaluation tools. The authors evaluated the validity of a novel evaluation tool based on student self-assessments, which may help identify specific strengths and weaknesses of a particular course. In 2011, the authors asked 145 fourth-year students at Göttingen Medical School to self-assess their knowledge on 33 specific learning objectives in a pretest and posttest as part of a cardiorespiratory module. The authors compared performance gain calculated from self-assessments with performance gain derived from formative examinations that were closely matched to these 33 learning objectives. Eighty-three students (57.2%) completed the assessment. There was good agreement between performance gain derived from subjective data and performance gain derived from objective examinations (Pearson r=0.78; P<.0001) on the group level. The association between the two measures was much weaker when data were analyzed on the individual level. Further analysis determined a quality cutoff for performance gain derived from aggregated student self-assessments. When using this cutoff, the evaluation tool was highly sensitive in identifying specific learning objectives with favorable or suboptimal objective performance gains. The tool is easy to implement, takes initial performance levels into account, and does not require extensive pre-post testing. By providing valid estimates of actual performance gain obtained during a teaching module, it may assist medical teachers in identifying strengths and weaknesses of a particular course on the level of specific learning objectives.
Performance assessment in a flight simulator test—Validation of a space psychology methodology

NASA Astrophysics Data System (ADS)

Johannes, B.; Salnitski, Vyacheslav; Soll, Henning; Rauch, Melina; Goeters, Klaus-Martin; Maschke, Peter; Stelling, Dirk; Eißfeldt, Hinnerk

2007-02-01

The objective assessment of operator performance in hand controlled docking of a spacecraft on a space station has 30 years of tradition and is well established. In the last years the performance assessment was successfully combined with a psycho-physiological approach for the objective assessment of the levels of physiological arousal and psychological load. These methods are based on statistical reference data. For the enhancement of the statistical power of the evaluation methods, both were actually implemented into a comparable terrestrial task: the flight simulator test of DLR in the selection procedure for ab initio pilot applicants for civil airlines. In the first evaluation study 134 male subjects were analysed. Subjects underwent a flight simulator test including three tasks, which were evaluated by instructors applying well-established and standardised rating scales. The principles of the performance algorithms of the docking training were adapted for the automated flight performance assessment. They are presented here. The increased human errors under instrument flight conditions without visual feedback required a manoeuvre recognition algorithm before calculating the deviation of the flown track from the given task elements. Each manoeuvre had to be evaluated independently of former failures. The expert rated performance showed a highly significant correlation with the automatically calculated performance for each of the three tasks: r=.883, r=.874, r=.872, respectively. An automated algorithm successfully assessed the flight performance. This new method will possibly provide a wide range of other future applications in aviation and space psychology.
Behavioral, cognitive, and motor performance and physical development of five-year-old children who were born after intracytoplasmic sperm injection with the use of testicular sperm.

PubMed

Meijerink, Aukje M; Ramos, Liliana; Janssen, Anjo J W M; Maas-van Schaaijk, Nienke M; Meissner, Andreas; Repping, Sjoerd; Mochtar, Monique H; Braat, Didi D M; Fleischer, Kathrin

2016-12-01

To evaluate at the age of 5 years the behavioral, cognitive, and motor performance and physical development of children born after testicular sperm extraction (TESE) and intracytoplasmic sperm injection (ICSI). A prospective longitudinal cohort study. Two university medical centers. A total of 103 5-year-olds who were born after TESE-ICSI. The follow-up of the children was performed by questionnaires at birth and again at 1 year and at 4 years of age. Five-year-old children were invited for individual assessment. Behavioral performance was assessed with the use of the Child Behavior Checklist for parents and teachers. Cognitive performance was assessed with the use of the Dutch Wechsler Preschool and Primary Scale of Intelligence test, 3rd version. Motor performance was assessed with the use of the Dutch Movement Assessment Battery for Children, 2nd version. Physical development was assessed by means of physical examination and medical history. Behavioral, cognitive, and motor performance and physical development. Eighty-nine children were completely assessed, and 14 were partially assessed at the age of 5 years. The 5-year-old cohort assessed significantly better on behavioral and cognitive performance and significantly worse on motor performance (but still in the normal range) compared with the theoretic distribution in the general population. Four children (3.8%) of the 5-year-old cohort had developmental problems/delays. Two of them were previously diagnosed with a form of autism (pervasive developmental disorder-not otherwise specified). Two children had developmental problems based on our behavioral, cognitive, and/or motor assessments. The long-term effects on development and health in children born after TESE-ICSI procedures seem to be reassuring. Copyright © 2016 American Society for Reproductive Medicine. Published by Elsevier Inc. All rights reserved.
Development and validation of trauma surgical skills metrics: Preliminary assessment of performance after training.

PubMed

Shackelford, Stacy; Garofalo, Evan; Shalin, Valerie; Pugh, Kristy; Chen, Hegang; Pasley, Jason; Sarani, Babak; Henry, Sharon; Bowyer, Mark; Mackenzie, Colin F

2015-07-01

Maintaining trauma-specific surgical skills is an ongoing challenge for surgical training programs. An objective assessment of surgical skills is needed. We hypothesized that a validated surgical performance assessment tool could detect differences following a training intervention. We developed surgical performance assessment metrics based on discussion with expert trauma surgeons, video review of 10 experts and 10 novice surgeons performing three vascular exposure procedures and lower extremity fasciotomy on cadavers, and validated the metrics with interrater reliability testing by five reviewers blinded to level of expertise and a consensus conference. We tested these performance metrics in 12 surgical residents (Year 3-7) before and 2 weeks after vascular exposure skills training in the Advanced Surgical Skills for Exposure in Trauma (ASSET) course. Performance was assessed in three areas as follows: knowledge (anatomic, management), procedure steps, and technical skills. Time to completion of procedures was recorded, and these metrics were combined into a single performance score, the Trauma Readiness Index (TRI). Wilcoxon matched-pairs signed-ranks test compared pretraining/posttraining effects. Mean time to complete procedures decreased by 4.3 minutes (from 13.4 minutes to 9.1 minutes). The performance component most improved by the 1-day skills training was procedure steps, completion of which increased by 21%. Technical skill scores improved by 12%. Overall knowledge improved by 3%, with 18% improvement in anatomic knowledge. TRI increased significantly from 50% to 64% with ASSET training. Interrater reliability of the surgical performance assessment metrics was validated with single intraclass correlation coefficient of 0.7 to 0.98. A trauma-relevant surgical performance assessment detected improvements in specific procedure steps and anatomic knowledge taught during a 1-day course, quantified by the TRI. 
ASSET training reduced time to complete vascular control by one third. Future applications include assessing specific skills in a larger surgeon cohort, assessing military surgical readiness, and quantifying skill degradation with time since training.
Evaluation of a Mobile Phone Image-Based Dietary Assessment Method in Adults with Type 2 Diabetes.

PubMed

Rollo, Megan E; Ash, Susan; Lyons-Wall, Philippa; Russell, Anthony W

2015-06-17

Image-based dietary records have limited evidence evaluating their performance and use among adults with a chronic disease. This study evaluated the performance of a 3-day mobile phone image-based dietary record, the Nutricam Dietary Assessment Method (NuDAM), in adults with type 2 diabetes mellitus (T2DM). Criterion validity was determined by comparing energy intake (EI) with total energy expenditure (TEE) measured by the doubly-labelled water technique. Relative validity was established by comparison to a weighed food record (WFR). Inter-rater reliability was assessed by comparing estimates of intake from three dietitians. Ten adults (6 males, age: 61.2 ± 6.9 years old, BMI: 31.0 ± 4.5 kg/m²) participated. Compared to TEE, mean EI (MJ/day) was significantly under-reported using both methods, with a mean ratio of EI:TEE 0.76 ± 0.20 for the NuDAM and 0.76 ± 0.17 for the WFR. Correlations between the NuDAM and WFR were mostly moderate for energy (r = 0.57), carbohydrate (g/day) (r = 0.63, p < 0.05), protein (g/day) (r = 0.78, p < 0.01) and alcohol (g/day) (rs = 0.85, p < 0.01), with a weaker relationship for fat (g/day) (r = 0.24). Agreement between dietitians for nutrient intake for the 3-day NuDAM (Intra-class Correlation Coefficient (ICC) = 0.77-0.99) was lower when compared with the 3-day WFR (ICC = 0.82-0.99). These findings demonstrate the performance and feasibility of the NuDAM to assess energy and macronutrient intake in a small sample. Some modifications to the NuDAM could improve efficiency and an evaluation in a larger group of adults with T2DM is required.
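The EI:TEE ratio used for criterion validity in the abstract above can be sketched as follows. The function name and the intake and expenditure values are hypothetical illustrations, not study data; a ratio below 1 indicates that reported intake is lower than measured expenditure.

```python
from statistics import mean, stdev


def ei_tee_ratios(energy_intake_mj, energy_expenditure_mj):
    """Per-participant ratio of reported energy intake to measured energy expenditure."""
    if len(energy_intake_mj) != len(energy_expenditure_mj):
        raise ValueError("intake and expenditure lists must have the same length")
    return [ei / tee for ei, tee in zip(energy_intake_mj, energy_expenditure_mj)]


# Hypothetical values in MJ/day for three participants (not from the study).
reported_ei = [8.0, 9.0, 6.0]
measured_tee = [10.0, 12.0, 8.0]

ratios = ei_tee_ratios(reported_ei, measured_tee)
print(f"mean EI:TEE = {mean(ratios):.2f} (SD {stdev(ratios):.2f})")
```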
Assessment of driving-related performance in chronic whiplash using an advanced driving simulator.

PubMed

Takasaki, Hiroshi; Treleaven, Julia; Johnston, Venerina; Rakotonirainy, Andry; Haines, Andrew; Jull, Gwendolen

2013-11-01

Driving is often nominated as problematic by individuals with chronic whiplash associated disorders (WAD), yet driving-related performance has not been evaluated objectively. The purpose of this study was to test driving-related performance in persons with chronic WAD against healthy controls of similar age, gender and driving experience to determine if driving-related performance in the WAD group was sufficiently impaired to recommend fitness to drive assessment. Driving-related performance was assessed using an advanced driving simulator during three driving scenarios: freeway, residential and a central business district (CBD). Total driving duration was approximately 15 min. Five driving tasks which could cause a collision (critical events) were included in the scenarios. In addition, the effect of divided attention (identify red dots projected onto side or rear view mirrors) was assessed three times in each scenario. Driving performance was measured using the simulator performance index (SPI) which is calculated from 12 measures. z-Scores for all SPI measures were calculated for each WAD subject based on mean values of the control subjects. The z-scores were then averaged for the WAD group. A z-score of ≤-2 indicated a failing driving grade in the simulator. The number of collisions over the five critical events was compared between the WAD and control groups as was reaction time and missed response ratio in identifying the red dots. Seventeen WAD and 26 control subjects commenced the driving assessment. Demographic data were comparable between the groups. All subjects completed the freeway scenario but four withdrew during the residential and eight during the CBD scenario because of motion sickness. All scenarios were completed by 14 WAD and 17 control subjects. The mean z-score for the SPI over the three scenarios was statistically lower in the WAD group (-0.3±0.3; P<0.05) but the score was not below the cut-off point for safe driving. There were no differences in the reaction time and missed response ratio in divided attention tasks between the groups (all P>0.05). Assessment of driving in an advanced driving simulator for approximately 15 min revealed that driving-related performance in chronic WAD was not sufficiently impaired to recommend the need for fitness to drive assessment. Copyright © 2013 Elsevier Ltd. All rights reserved.
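The z-score procedure described in the abstract above can be sketched in a few lines. This is an illustration under stated assumptions: the abstract says only that z-scores were based on mean values of the control subjects, so the use of the control group's sample standard deviation as the scaling term is an assumption, and the function name and all numbers are hypothetical.

```python
from statistics import mean, stdev

FAIL_CUTOFF = -2.0  # cut-off stated in the abstract: z <= -2 is a failing grade


def z_scores_against_controls(patient_values, control_values):
    """Standardize each patient value against the control group's mean and SD.

    Assumption: the control sample SD is the scaling term; the abstract
    mentions only the control means.
    """
    mu = mean(control_values)
    sigma = stdev(control_values)
    if sigma == 0:
        raise ValueError("control values have zero variance")
    return [(value - mu) / sigma for value in patient_values]


# Hypothetical scores on one performance measure (not study data).
control_scores = [10.0, 12.0, 14.0]   # mean 12, sample SD 2
patient_scores = [11.0, 9.0, 8.0]

z_values = z_scores_against_controls(patient_scores, control_scores)
group_mean_z = mean(z_values)
group_fails = group_mean_z <= FAIL_CUTOFF
print(f"group mean z = {group_mean_z:.2f}, below cut-off: {group_fails}")
```

In this hypothetical case one individual sits exactly at the cut-off while the group mean does not, which parallels the abstract's finding of a statistically lower but still passing group score.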
Comparison of Static and Dynamic Balance in Female Collegiate Soccer, Basketball, and Gymnastics Athletes

PubMed Central

Bressel, Eadric; Yonker, Joshua C; Kras, John; Heath, Edward M

2007-01-01

Context: How athletes from different sports perform on balance tests is not well understood. When prescribing balance exercises to athletes in different sports, it may be important to recognize performance variations. Objective: To compare static and dynamic balance among collegiate athletes competing or training in soccer, basketball, and gymnastics. Design: A quasi-experimental, between-groups design. Independent variables included limb (dominant and nondominant) and sport played. Setting: A university athletic training facility. Patients or Other Participants: Thirty-four female volunteers who competed in National Collegiate Athletic Association Division I soccer (n = 11), basketball (n = 11), or gymnastics (n = 12). Intervention(s): To assess static balance, participants performed 3 stance variations (double leg, single leg, and tandem leg) on 2 surfaces (stiff and compliant). For assessment of dynamic balance, participants performed multidirectional maximal single-leg reaches from a unilateral base of support. Main Outcome Measure(s): Errors from the Balance Error Scoring System and normalized leg reach distances from the Star Excursion Balance Test were used to assess static and dynamic balance, respectively. Results: Balance Error Scoring System error scores for the gymnastics group were 55% lower than for the basketball group (P = .01), and Star Excursion Balance Test scores were 7% higher in the soccer group than the basketball group (P = .04). Conclusions: Gymnasts and soccer players did not differ in terms of static and dynamic balance. In contrast, basketball players displayed inferior static balance compared with gymnasts and inferior dynamic balance compared with soccer players. PMID:17597942
Lower-limb kinematics of single-leg squat performance in young adults.

PubMed

Horan, Sean A; Watson, Steven L; Carty, Christopher P; Sartori, Massimo; Weeks, Benjamin K

2014-01-01

To determine the kinematic parameters that characterize good and poor single-leg squat (SLS) performance. A total of 22 healthy young adults free from musculoskeletal impairment were recruited for testing. For each SLS, both two-dimensional video and three-dimensional motion analysis data were collected. Pelvis, hip, and knee angles were calculated using a reliable and validated lower-limb (LL) biomechanical model. Two-dimensional video clips of SLSs were blindly assessed in random order by eight musculoskeletal physiotherapists using a 10-point ordinal scale. To facilitate between-group comparisons, SLS performances were stratified by tertiles corresponding to poor, intermediate, and good SLS performance. Mean ratings of SLS performance assessed by physiotherapists were 8.3 (SD 0.5), 6.8 (SD 0.7), and 4.0 (SD 0.8) for good, intermediate, and poor squats, respectively. Three-dimensional analysis revealed that people whose SLS performance was assessed as poor exhibited increased hip adduction, reduced knee flexion, and increased medio-lateral displacement of the knee joint centre compared to those whose SLS performance was assessed as good (p≤0.05). Overall, poor SLS performance is characterized by inadequate knee flexion and excessive frontal plane motion of the knee and hip. Future investigations of SLS performance should consider standardizing knee flexion angle to illuminate other influential kinematic parameters.
Towards an operational definition of pharmacy clinical competency

NASA Astrophysics Data System (ADS)

Douglas, Charles Allen

The scope of pharmacy practice and the training of future pharmacists have undergone a strategic shift over the last few decades. The pharmacy profession recognizes greater pharmacist involvement in patient care activities. Towards this strategic objective, pharmacy schools are training future pharmacists to meet these new clinical demands. Pharmacy students have clerkships called Advanced Pharmacy Practice Experiences (APPEs), and these clerkships account for 30% of the professional curriculum. APPEs provide the only opportunity for students to refine clinical skills under the guidance of an experienced pharmacist. Nationwide, schools of pharmacy need to evaluate whether students have successfully completed APPEs and are ready to treat patients. Schools are left to their own devices to develop assessment programs that demonstrate to the public and regulatory agencies that students are clinically competent prior to graduation. There is no widely accepted method to evaluate whether these assessment programs actually discriminate between the competent and non-competent students. The central purpose of this study is to demonstrate a rigorous method to evaluate the validity and reliability of APPE assessment programs. The method introduced in this study is applicable to a wide variety of assessment programs. To illustrate this method, the study evaluated new performance criteria with a novel rating scale. The study had two main phases. In the first phase, a Delphi panel was created to bring together expert opinions. Pharmacy schools nominated exceptional preceptors to join a Delphi panel. Delphi is a method to achieve agreement on complex issues among experts. The principal researcher recruited preceptors representing a variety of practice settings and geographical regions. The Delphi panel evaluated and refined the new performance criteria. 
In the second phase, the study produced a novel set of video vignettes that portrayed student performances based on recommendations of an expert panel. Pharmacy preceptors assessed the performances with the new performance criteria. Estimates of reliability and accuracy from preceptors' assessments can be used to establish benchmarks for future comparisons. Findings from the first phase suggested preceptors held a unique perspective, where APPE assessments are based in relevance to clinical activities. The second phase analyzed assessment results from pharmacy preceptors who watched the video simulations. Reliability results were higher for non-randomized compared to randomized video simulations. Accuracy results showed preceptors more readily identified high and low student performances compared to average students. These results indicated the need for pharmacy preceptor training in performance assessment. The study illustrated a rigorous method to evaluate the validity and reliability of APPE assessment instruments.
Video-task assessment of learning and memory in Macaques (Macaca mulatta) - Effects of stimulus movement on performance

NASA Technical Reports Server (NTRS)

Washburn, David A.; Hopkins, William D.; Rumbaugh, Duane M.

1989-01-01

Effects of stimulus movement on learning, transfer, matching, and short-term memory performance were assessed with 2 monkeys using a video-task paradigm in which the animals responded to computer-generated images by manipulating a joystick. Performance on tests of learning set, transfer index, matching to sample, and delayed matching to sample in the video-task paradigm was comparable to that obtained in previous investigations using the Wisconsin General Testing Apparatus. Additionally, learning, transfer, and matching were reliably and significantly better when the stimuli or discriminanda moved than when the stimuli were stationary. External manipulations such as stimulus movement may increase attention to the demands of a task, which in turn should increase the efficiency of learning. These findings have implications for the investigation of learning in other populations, as well as for the application of the video-task paradigm to comparative study.
Simplified Analysis of Pulse Detonation Rocket Engine Blowdown Gasdynamics and Performance

NASA Technical Reports Server (NTRS)

Morris, C. I.; Rodgers, Stephen L. (Technical Monitor)

2002-01-01

Pulse detonation rocket engines (PDREs) offer potential performance improvements over conventional designs, but represent a challenging modeling task. A simplified model for an idealized, straight-tube, single-shot PDRE blowdown process and thrust determination is described and implemented. In order to form an assessment of the accuracy of the model, the flowfield time history is compared to experimental data from Stanford University. Parametric studies of the effect of mixture stoichiometry, initial fill temperature, and blowdown pressure ratio on the performance of a PDRE are performed using the model. PDRE performance is also compared with a conventional steady-state rocket engine over a range of pressure ratios using similar gasdynamic assumptions.
The influence of test mode and visuospatial ability on mathematics assessment performance

NASA Astrophysics Data System (ADS)

Logan, Tracy

2015-12-01

Mathematics assessment and testing are increasingly situated within digital environments with international tests moving to computer-based testing in the near future. This paper reports on a secondary data analysis which explored the influence that the mode of assessment, computer-based (CBT) or pencil-and-paper based (PPT), and visuospatial ability had on students' mathematics test performance. Data from 804 grade 6 Singaporean students were analysed using the knowledge discovery in data design. The results revealed statistically significant differences between performance on CBT and PPT test modes across content areas concerning whole number, algebraic patterns, and data and chance. However, there were no performance differences for content areas related to spatial arrangements, geometric measurement, or other number. There were also statistically significant differences in performance between those students who possess higher levels of visuospatial ability compared to those with lower levels across all six content areas. Implications include careful consideration for the comparability of CBT and PPT testing and the need for increased attention to the role of visuospatial reasoning in students' mathematics reasoning.
A computerized assessment to compare the impact of standard, stereoscopic, and high-definition laparoscopic monitor displays on surgical technique.

PubMed

Feng, Chuan; Rozenblit, Jerzy W; Hamilton, Allan J

2010-11-01

Surgeons performing laparoscopic surgery have strong biases regarding the quality and nature of the laparoscopic video monitor display. In a comparative study, we used a unique computerized sensing and analysis system to evaluate the various types of monitors employed in laparoscopic surgery. We compared the impact of different types of monitor displays on an individual's performance of a laparoscopic training task which required the subject to move the instrument to a set of targets. Participants (varying from no laparoscopic experience to board-certified surgeons) were asked to perform the assigned task while using all three display systems, which were randomly assigned: a conventional laparoscopic monitor system (2D), a high-definition monitor system (HD), and a stereoscopic display (3D). The effects of monitor system on various performance parameters (total time consumed to finish the task, average speed, and movement economy) were analyzed by computer. Each of the subjects filled out a subjective questionnaire at the end of their training session. A total of 27 participants completed our study. Performance with the HD monitor was significantly slower than with either the 3D or 2D monitor (p < 0.0001). Movement economy with the HD monitor was significantly reduced compared with the 3D (p < 0.0004) or 2D (p < 0.0001) monitor. In terms of average time required to complete the task, performance with the 3D monitor was significantly faster than with the HD (p < 0.0001) or 2D (p < 0.0086) monitor. However, the HD system was the overwhelming favorite according to subjective evaluation. Computerized sensing and analysis is capable of quantitatively assessing the seemingly minor effect of monitor display on surgical training performance. 
The study demonstrates that, while users expressed a decided preference for HD systems, actual quantitative analysis indicates that HD monitors offer no statistically significant advantage and may even worsen performance compared with standard 2D or 3D laparoscopic monitors.

The International Performance of the South African Academic Institutions: A Citation Assessment

ERIC Educational Resources Information Center

Pouris, Anastassios

2007-01-01

This article reports the results of an investigation to identify the disciplinary strengths and the international standing of the higher education institutions in South Africa. Even though comparative assessments provide valuable information for research administrations, researchers and students such information is not available in South Africa…
Cautions about Inferences from International Assessments: The Case of PISA 2009

ERIC Educational Resources Information Center

Ercikan, Kadriye; Roth, Wolff-Michael; Asil, Mustafa

2015-01-01

Background/Context: Two key uses of international assessments of achievement have been (a) comparing country performances for identifying the countries with the best education systems and (b) generating insights about effective policy and practice strategies that are associated with higher learning outcomes. Do country rankings really reflect the…
PISA 2009 Assessment Framework: Key Competencies in Reading, Mathematics and Science

ERIC Educational Resources Information Center

Schleicher, Andreas; Zimmer, Karin; Evans, Juliet; Clements, Niccolina

2009-01-01

In response to the need for cross-nationally comparable evidence on student performance, the Organisation for Economic Co-operation and Development (OECD) launched the OECD Programme for International Student Assessment (PISA) in 1997. PISA represents a commitment by governments to monitor the outcomes of education systems through measuring…
NATIONAL CROP LOSS ASSESSMENT NETWORK (NCLAN) 1984 ANNUAL REPORT

EPA Science Inventory

Research for 1984 involved performance of a preliminary economic assessment of simulated changes in ambient O3 on U.S. agriculture using recent NCLAN response data for six major crops. Four hypothetical ambient O3 levels are measured and compared with a 1980 base situation. The r...
Self-Assessment and Continuing Professional Development: The Canadian Perspective

ERIC Educational Resources Information Center

Silver, Ivan; Campbell, Craig; Marlow, Bernard; Sargeant, Joan

2008-01-01

Introduction: Several recent studies highlight that physicians are not very accurate at assessing their competence in clinical domains when compared to objective measures of knowledge and performance. Instead of continuing to try to train physicians to be more accurate self-assessors, the research suggests that physicians will benefit from…
Assessment of the Phototoxicity of Weathered Alaska North Slope Crude Oil to Juvenile Pink Salmon

EPA Science Inventory

Petroleum products are known to have greater toxicity to the translucent embryos and larvae of aquatic organisms in the presence of ultraviolet radiation (UV) compared to toxicity determined in tests performed under standard laboratory lighting with minimal UV. This study assesse...
Ecosystem Model Skill Assessment. Yes We Can!

PubMed

Olsen, Erik; Fay, Gavin; Gaichas, Sarah; Gamble, Robert; Lucey, Sean; Link, Jason S

2016-01-01

Accelerated changes to global ecosystems call for holistic and integrated analyses of past, present and future states under various pressures to adequately understand current and projected future system states. Ecosystem models can inform management of human activities in a complex and changing environment, but are these models reliable? Ensuring that models are reliable for addressing management questions requires evaluating their skill in representing real-world processes and dynamics. Skill has been evaluated for just a limited set of some biophysical models. A range of skill assessment methods have been reviewed but skill assessment of full marine ecosystem models has not yet been attempted. We assessed the skill of the Northeast U.S. (NEUS) Atlantis marine ecosystem model by comparing 10-year model forecasts with observed data. Model forecast performance was compared to that obtained from a 40-year hindcast. Multiple metrics (average absolute error, root mean squared error, modeling efficiency, and Spearman rank correlation), and a suite of time-series (species biomass, fisheries landings, and ecosystem indicators) were used to adequately measure model skill. Overall, the NEUS model performed above average and thus better than expected for the key species that had been the focus of the model tuning. Model forecast skill was comparable to the hindcast skill, showing that model performance does not degenerate in a 10-year forecast mode, an important characteristic for an end-to-end ecosystem model to be useful for strategic management purposes. We identify best-practice approaches for end-to-end ecosystem model skill assessment that would improve both operational use of other ecosystem models and future model development. We show that it is possible to not only assess the skill of a complicated marine ecosystem model, but that it is necessary to do so to instill confidence in model results and encourage their use for strategic management. 
Our methods are applicable to any type of predictive model, and should be considered for use in fields outside ecology (e.g. economics, climate change, and risk assessment).
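The four skill metrics named in this abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names and the toy observed/forecast series are hypothetical, and modeling efficiency is assumed here to take the common form 1 - SS_res / SS_tot.

```python
import math

def mean_absolute_error(obs, pred):
    """Average absolute error between observations and model output."""
    return sum(abs(o - p) for o, p in zip(obs, pred)) / len(obs)

def root_mean_squared_error(obs, pred):
    """Root mean squared error between observations and model output."""
    return math.sqrt(sum((o - p) ** 2 for o, p in zip(obs, pred)) / len(obs))

def modeling_efficiency(obs, pred):
    """Assumed form: 1 - SS_res / SS_tot (1 is perfect, below 0 is worse than the observed mean)."""
    mean_obs = sum(obs) / len(obs)
    ss_res = sum((o - p) ** 2 for o, p in zip(obs, pred))
    ss_tot = sum((o - mean_obs) ** 2 for o in obs)
    return 1.0 - ss_res / ss_tot

def _average_ranks(values):
    """Rank values from 1..n, giving tied values the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks

def spearman_rank_correlation(obs, pred):
    """Pearson correlation computed on the ranks of the two series."""
    ro, rp = _average_ranks(obs), _average_ranks(pred)
    mo, mp = sum(ro) / len(ro), sum(rp) / len(rp)
    cov = sum((a - mo) * (b - mp) for a, b in zip(ro, rp))
    so = math.sqrt(sum((a - mo) ** 2 for a in ro))
    sp = math.sqrt(sum((b - mp) ** 2 for b in rp))
    return cov / (so * sp)

# Hypothetical observed and forecast biomass values, for illustration only.
observed = [1.0, 2.0, 3.0, 4.0]
forecast = [1.5, 2.5, 2.5, 4.5]
print(mean_absolute_error(observed, forecast))
print(root_mean_squared_error(observed, forecast))
print(modeling_efficiency(observed, forecast))
print(spearman_rank_correlation(observed, forecast))
```

Reporting several metrics side by side, as the abstract describes, helps because error magnitude (the first two) and agreement in ordering (the rank correlation) can disagree for the same forecast.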
Assessment of prognostic performance of Albumin-Bilirubin, Child-Pugh, and Model for End-stage Liver Disease scores in patients with liver cirrhosis complicated with acute upper gastrointestinal bleeding.

PubMed

Xavier, Sofia A; Vilas-Boas, Ricardo; Boal Carvalho, Pedro; Magalhães, Joana T; Marinho, Carla M; Cotter, José B

2018-06-01

The Albumin-Bilirubin (ALBI) score was developed recently to assess the severity of liver dysfunction. We aimed to assess its prognostic performance in patients with liver cirrhosis complicated with upper gastrointestinal bleeding (UGIB) while comparing it with Child-Pugh (CP) and Model for End-stage Liver Disease (MELD) scores. This was a retrospective unicentric study, including consecutive adult patients with cirrhosis admitted for UGIB between January 2011 and November 2015. Clinical, analytical, and endoscopic variables were assessed and ALBI, CP, and MELD scores at admission were calculated. This study included 111 patients. During the first 30 days of follow-up, 12 (10.8%) patients died, and during the first year of follow-up, another 10 patients died (first-year mortality of 19.8%). On comparing the three scores, for in-stay and 30-day mortality, only the ALBI score showed statistically significant results, with an area under the curve (AUC) of 0.80 (P<0.01) for both outcomes. For first-year mortality, AUC for ALBI, CP, and MELD scores were 0.71 (P<0.01), 0.64 (P<0.05), and 0.66 (P=0.02), respectively, whereas for global mortality, AUC were 0.75 (P<0.01), 0.72 (P<0.01), and 0.72 (P<0.01), respectively. On comparing the AUC of the three scores, no significant differences were found in first-year mortality and global mortality. In our series, the ALBI score accurately predicted both in-stay and 30-day mortality, whereas CP and MELD scores could not predict these outcomes. All scores showed a fair prognostic prediction performance for first-year and global mortality. These results suggest that the ALBI score is particularly useful in the assessment of short-term outcomes, with a better performance than the most commonly used scores.
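As a rough illustration of what the reported AUC values measure, the sketch below computes an AUC as the probability that a randomly chosen patient who died has a higher score than a randomly chosen survivor (ties count as one half). The function name and the score lists are hypothetical; the study's patient-level data are not available in this record.

```python
def area_under_curve(scores_died, scores_survived):
    """AUC as the fraction of (died, survived) pairs ranked correctly; ties count 0.5."""
    wins = 0.0
    for d in scores_died:
        for s in scores_survived:
            if d > s:
                wins += 1.0
            elif d == s:
                wins += 0.5
    return wins / (len(scores_died) * len(scores_survived))

# Hypothetical prognostic scores (higher = worse predicted outcome), for illustration only.
died = [3.1, 2.4, 2.9]
survived = [1.8, 2.4, 2.0, 1.5]
print(area_under_curve(died, survived))
```

An AUC of 0.5 corresponds to a score that ranks patients no better than chance, which is why values such as 0.64 and 0.80 are read as weaker and stronger discrimination respectively.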
A comparative assessment of statistical methods for extreme weather analysis

NASA Astrophysics Data System (ADS)

Schlögl, Matthias; Laaha, Gregor

2017-04-01

Extreme weather exposure assessment is of major importance for scientists and practitioners alike. We compare different extreme value approaches and fitting methods with respect to their value for assessing extreme precipitation and temperature impacts. Based on an Austrian data set from 25 meteorological stations representing diverse meteorological conditions, we assess the added value of partial duration series over the standardly used annual maxima series in order to give recommendations for performing extreme value statistics of meteorological hazards. Results show the merits of the robust L-moment estimation, which yielded better results than maximum likelihood estimation in 62 % of all cases. At the same time, results question the general assumption of the threshold excess approach (employing partial duration series, PDS) being superior to the block maxima approach (employing annual maxima series, AMS) due to information gain. For low return periods (non-extreme events) the PDS approach tends to overestimate return levels as compared to the AMS approach, whereas an opposite behavior was found for high return levels (extreme events). In extreme cases, an inappropriate threshold was shown to lead to considerable biases that may outperform the possible gain of information from including additional extreme events by far. This effect was neither visible from the square-root criterion, nor from standardly used graphical diagnosis (mean residual life plot), but from a direct comparison of AMS and PDS in synoptic quantile plots. We therefore recommend performing AMS and PDS approaches simultaneously in order to select the best suited approach. This will make the analyses more robust, in cases where threshold selection and dependency introduces biases to the PDS approach, but also in cases where the AMS contains non-extreme events that may introduce similar biases. 
For assessing the performance of extreme events we recommend conditional performance measures that focus on rare events only in addition to standardly used unconditional indicators. The findings of this study are of relevance for a broad range of environmental variables, including meteorological and hydrological quantities.
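The difference between the two sampling schemes compared in this abstract can be sketched in a few lines of Python. This is a simplified illustration under stated assumptions: the data layout (a dict mapping year to a list of observations), the function names, and the threshold are all hypothetical, and the declustering of dependent exceedances that a real PDS analysis needs is deliberately omitted.

```python
def annual_maxima_series(obs_by_year):
    """Block maxima (AMS): keep exactly one value, the maximum, per year."""
    return [max(values) for _, values in sorted(obs_by_year.items())]

def partial_duration_series(obs_by_year, threshold):
    """Threshold excesses (PDS): keep every observation above the threshold.

    No declustering is applied here, so consecutive exceedances from the
    same event would all be retained.
    """
    return [v for _, values in sorted(obs_by_year.items()) for v in values if v > threshold]

# Hypothetical daily precipitation totals (mm), for illustration only.
precip = {2001: [12.0, 48.0, 30.0], 2002: [71.0, 20.0, 55.0]}
print(annual_maxima_series(precip))           # one value per year
print(partial_duration_series(precip, 40.0))  # all values above 40 mm
```

In this toy data the PDS retains a second large event from 2002 that the AMS discards, which is the information gain the abstract refers to; the choice of threshold controls how many such events are kept.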
Waste gasification vs. conventional Waste-to-Energy: a comparative evaluation of two commercial technologies.

PubMed

Consonni, Stefano; Viganò, Federico

2012-04-01

A number of waste gasification technologies are currently proposed as an alternative to conventional Waste-to-Energy (WtE) plants. Assessing their potential is made difficult by the scarce operating experience and the fragmentary data available. After defining a conceptual framework to classify and assess waste gasification technologies, this paper compares two of the proposed technologies with conventional WtE plants. Performances are evaluated by proprietary software developed at Politecnico di Milano and compared on the basis of a coherent set of assumptions. Since the two gasification technologies are configured as "two-step oxidation" processes, their energy performances are very similar to those of conventional plants. The potential benefits that may justify their adoption relate to material recovery and operation/emission control: recovery of metals in non-oxidized form; collection of ashes in inert, vitrified form; combustion control; lower generation of some pollutants. Copyright © 2012 Elsevier Ltd. All rights reserved.
Comparison of the Utility of Two Assessments for Explaining and Predicting Productivity Change: Well-Being Versus an HRA.

PubMed

Gandy, William M; Coberley, Carter; Pope, James E; Rula, Elizabeth Y

2016-01-01

To compare utility of employee well-being to health risk assessment (HRA) as predictors of productivity change. Panel data from 2189 employees who completed surveys 2 years apart were used in hierarchical models comparing the influence of well-being and health risk on longitudinal changes in presenteeism and job performance. Absenteeism change was evaluated in a nonexempt subsample. Change in well-being was the most significant independent predictor of productivity change across all three measures. Comparing hierarchical models, well-being models performed significantly better than HRA models. The HRA added no incremental explanatory power over well-being in combined models. Alone, nonphysical health well-being components outperformed the HRA for all productivity measures. Well-being offers a more comprehensive measure of factors that influence productivity and can be considered preferential to HRA in understanding and addressing suboptimal productivity.
Introducing a laparoscopic simulation training and credentialing program in gynaecology: an observational study.

PubMed

Janssens, Sarah; Beckmann, Michael; Bonney, Donna

2015-08-01

Simulation training in laparoscopic surgery has been shown to improve surgical performance. To describe the implementation of a laparoscopic simulation training and credentialing program for gynaecology registrars. A pilot program consisting of protected, supervised laparoscopic simulation time, a tailored curriculum and a credentialing process, was developed and implemented. Quantitative measures assessing simulated surgical performance were measured over the simulation training period. Laparoscopic procedures requiring credentialing were assessed for both the frequency of a registrar being the primary operator and the duration of surgery and compared to a presimulation cohort. Qualitative measures regarding quality of surgical training were assessed pre- and postsimulation. Improvements were seen in simulated surgical performance in efficiency domains. Operative time for procedures requiring credentialing was reduced by 12%. Primary operator status in the operating theatre for registrars was unchanged. Registrar assessment of training quality improved. The introduction of a laparoscopic simulation training and credentialing program resulted in improvements in simulated performance, reduced operative time and improved registrar assessment of the quality of training. © 2015 The Royal Australian and New Zealand College of Obstetricians and Gynaecologists.
Development of Internet-Based Tasks for the Executive Function Performance Test.

PubMed

Rand, Debbie; Lee Ben-Haim, Keren; Malka, Rachel; Portnoy, Sigal

The Executive Function Performance Test (EFPT) is a reliable and valid performance-based tool to assess executive functions (EFs). This study's objective was to develop and verify two Internet-based tasks for the EFPT. A cross-sectional study assessed the alternate-form reliability of the Internet-based bill-paying and telephone-use tasks in healthy adults and people with subacute stroke (Study 1). It also sought to establish the tasks' criterion reliability for assessing EF deficits by correlating performance with that on the Trail Making Test in five groups: healthy young adults, healthy older adults, people with subacute stroke, people with chronic stroke, and young adults with attention deficit hyperactivity disorder (Study 2). The alternative-form reliability and initial construct validity for the Internet-based bill-paying task were verified. Criterion validity was established for both tasks. The Internet-based tasks are comparable to the original EFPT tasks and can be used for assessment of EF deficits. Copyright © 2018 by the American Occupational Therapy Association, Inc.
Validation of an Alzheimer’s disease assessment battery in Asian participants with mild to moderate Alzheimer’s disease

PubMed Central

Shen, Joan HQ; Shen, Qi; Yu, Holly; Lai, Jin-Shei; Beaumont, Jennifer L; Zhang, Zhenxin; Wang, Huali; Kim, Seong Yoon; Chen, Christopher; Kwok, Timothy; Wang, Shuu-Jiun; Lee, Dong Young; Harrison, John; Cummings, Jeffrey

2014-01-01

There is a lack of validated tools for assessing Alzheimer’s disease (AD) across Asia. This study evaluates the psychometric properties of the Alzheimer’s Disease Assessment Scale-Cognitive Subscale (ADAS-Cog), Disability Assessment for Dementia (DAD), and Neuropsychological Test Battery (NTB) in Asian participants. Participants with mild to moderate AD (n=251) and healthy controls (n=51) from Mainland China, Taiwan, Singapore, Hong Kong, and South Korea completed selected instruments at several time points. Test-retest reliability was better than 0.70 for all tests. AD participants performed significantly more poorly than controls on every score. Within the AD group, greater disease severity corresponded to significantly poorer performance. The AD group test performance worsened over time and there was a trend for worse performance in AD compared to healthy controls over time. The ADAS-Cog, DAD, and NTB are reliable, valid, and responsive measures in this population and could be used for clinical trials across Asian countries/regions. PMID:25628967
Frame-of-Reference Training: Establishing Reliable Assessment of Teaching Effectiveness.

PubMed

Newman, Lori R; Brodsky, Dara; Jones, Richard N; Schwartzstein, Richard M; Atkins, Katharyn Meredith; Roberts, David H

2016-01-01

Frame-of-reference (FOR) training has been used successfully to teach faculty how to produce accurate and reliable workplace-based ratings when assessing a performance. We engaged 21 Harvard Medical School faculty members in our pilot and implementation studies to determine the effectiveness of using FOR training to assess health professionals' teaching performances. All faculty were novices at rating their peers' teaching effectiveness. Before FOR training, we asked participants to evaluate a recorded lecture using a criterion-based peer assessment of medical lecturing instrument. At the start of training, we discussed the instrument and emphasized its precise behavioral standards. During training, participants practiced rating lectures and received immediate feedback on how well they categorized and scored performances as compared with expert-derived scores of the same lectures. At the conclusion of the training, we asked participants to rate a post-training recorded lecture to determine agreement with the experts' scores. Participants and experts had greater rating agreement for the post-training lecture compared with the pretraining lecture. Through this investigation, we determined that FOR training is a feasible method to teach faculty how to accurately and reliably assess medical lectures. Medical school instructors and continuing education presenters should have the opportunity to be observed and receive feedback from trained peer observers. Our results show that it is possible to use FOR rater training to teach peer observers how to accurately rate medical lectures. The process is time efficient and offers the prospect for assessment and feedback beyond traditional learner evaluation of instruction.
Comparison of Effectiveness in Differentiating Benign from Malignant Ovarian Masses between IOTA Simple Rules and Subjective Sonographic Assessment.

PubMed

Tongsong, Theera; Tinnangwattana, Dangcheewan; Vichak-Ururote, Linlada; Tontivuthikul, Paponrad; Charoenratana, Cholaros; Lerthiranwong, Thitikarn

2016-01-01

To compare diagnostic performance in differentiating benign from malignant ovarian masses between IOTA (the International Ovarian Tumor Analysis) simple rules and subjective sonographic assessment. Women scheduled for elective surgery because of ovarian masses were recruited into the study and underwent ultrasound examination within 24 hours of surgery to apply the IOTA simple rules by general gynecologists and to record video clips for subjective assessment by an experienced sonographer. The diagnostic performance of the IOTA rules and subjective assessment for differentiation between benign and malignant masses was compared. The gold standard diagnosis was pathological or operative findings. A total of 150 ovarian masses were covered, comprising 105 (70%) benign and 45 (30%) malignant. Of them, the IOTA simple rules could be applied in 119 (79.3%) and were inconclusive in 31 (20.7%) whereas subjective assessment could be applied in all cases (100%). The sensitivity and the specificity of the IOTA simple rules and subjective assessment were not significantly different, 82.9% vs 86.7% and 94.0% vs 94.3% respectively. The agreement of the two methods in prediction was high with a Kappa index of 0.835. Both techniques had a high diagnostic performance in differentiation between benign and malignant ovarian masses but the IOTA rules had a relatively high rate of inconclusive results. The IOTA rules can be used as an effective screening technique by general gynecologists but when the results are inconclusive they should consult experienced sonographers.
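The diagnostic measures quoted in this abstract follow from simple 2x2 counts, as the Python sketch below shows. The counts for subjective assessment (39 of 45 malignant and 99 of 105 benign masses correctly classified) are back-calculated from the reported percentages and are an assumption, not figures stated in the abstract. The agreement table passed to the kappa function is purely hypothetical.

```python
def sensitivity(true_pos, false_neg):
    """Proportion of malignant masses correctly called malignant."""
    return true_pos / (true_pos + false_neg)

def specificity(true_neg, false_pos):
    """Proportion of benign masses correctly called benign."""
    return true_neg / (true_neg + false_pos)

def cohen_kappa(both_pos, a_pos_b_neg, a_neg_b_pos, both_neg):
    """Cohen's kappa for two raters (A and B) making a binary call on the same cases."""
    n = both_pos + a_pos_b_neg + a_neg_b_pos + both_neg
    observed = (both_pos + both_neg) / n
    a_pos = both_pos + a_pos_b_neg
    b_pos = both_pos + a_neg_b_pos
    expected = (a_pos * b_pos + (n - a_pos) * (n - b_pos)) / (n * n)
    return (observed - expected) / (1.0 - expected)

# Assumed counts, back-calculated from the reported 86.7% and 94.3% (45 malignant, 105 benign).
print(round(100 * sensitivity(39, 6), 1))
print(round(100 * specificity(99, 6), 1))

# Hypothetical agreement table between two methods, for illustration only.
print(cohen_kappa(20, 5, 10, 15))
```

Kappa corrects raw agreement for the agreement expected by chance, so it is a stricter summary than the percentage of cases on which the two methods coincide.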
Stakeholder perspectives on workplace-based performance assessment: towards a better understanding of assessor behaviour.

PubMed

de Jonge, Laury P J W M; Timmerman, Angelique A; Govaerts, Marjan J B; Muris, Jean W M; Muijtjens, Arno M M; Kramer, Anneke W M; van der Vleuten, Cees P M

2017-12-01

Workplace-Based Assessment (WBA) plays a pivotal role in present-day competency-based medical curricula. Validity in WBA mainly depends on how stakeholders (e.g. clinical supervisors and learners) use the assessments rather than on the intrinsic qualities of instruments and methods. Current research on assessment in clinical contexts seems to imply that variable behaviours during performance assessment of both assessors and learners may well reflect their respective beliefs and perspectives towards WBA. We therefore performed a Q methodological study to explore perspectives underlying stakeholders' behaviours in WBA in a postgraduate medical training program. Five different perspectives on performance assessment were extracted: Agency, Mutuality, Objectivity, Adaptivity and Accountability. These perspectives reflect both differences and similarities in stakeholder perceptions and preferences regarding the utility of WBA. In comparing and contrasting the various perspectives, we identified two key areas of disagreement, specifically 'the locus of regulation of learning' (i.e., self-regulated versus externally regulated learning) and 'the extent to which assessment should be standardised' (i.e., tailored versus standardised assessment). Differing perspectives may variously affect stakeholders' acceptance and use, and consequently the effectiveness, of assessment programmes. Continuous interaction between all stakeholders is essential to monitor, adapt and improve assessment practices and to stimulate the development of a shared mental model. Better understanding of underlying stakeholder perspectives could be an important step in bridging the gap between psychometric and socio-constructivist approaches in WBA.
48 CFR 15.305 - Proposal evaluation.

Code of Federal Regulations, 2011 CFR

2011-10-01

... comparative assessment of past performance information is separate from the responsibility determination... requirements; and (ii) A summary, matrix, or quantitative ranking, along with appropriate supporting narrative...
48 CFR 15.305 - Proposal evaluation.

Code of Federal Regulations, 2010 CFR

2010-10-01

... comparative assessment of past performance information is separate from the responsibility determination... requirements; and (ii) A summary, matrix, or quantitative ranking, along with appropriate supporting narrative...
48 CFR 15.305 - Proposal evaluation.

Code of Federal Regulations, 2014 CFR

2014-10-01

... comparative assessment of past performance information is separate from the responsibility determination... requirements; and (ii) A summary, matrix, or quantitative ranking, along with appropriate supporting narrative...

48 CFR 15.305 - Proposal evaluation.

Code of Federal Regulations, 2012 CFR

2012-10-01

... comparative assessment of past performance information is separate from the responsibility determination... requirements; and (ii) A summary, matrix, or quantitative ranking, along with appropriate supporting narrative...
48 CFR 15.305 - Proposal evaluation.

Code of Federal Regulations, 2013 CFR

2013-10-01

... comparative assessment of past performance information is separate from the responsibility determination... requirements; and (ii) A summary, matrix, or quantitative ranking, along with appropriate supporting narrative...
Regional Educational Performance Patterns in Europe

ERIC Educational Resources Information Center

Radó, Péter

2011-01-01

The paper aims to contribute to the assessment of the contextual relevance of various educational policies through an analysis of three aspects of the performance profiles of European countries: participation, the quality of learning outcomes and the equity of learning outcomes. Comparative analysis of international student achievement assessment…
Cognitive compensatory processes of older, clinically fit patients with hematologic malignancies undergoing chemotherapy: A longitudinal cohort study.

PubMed

Libert, Yves; Borghgraef, Cindy; Beguin, Yves; Delvaux, Nicole; Devos, Martine; Doyen, Chantal; Dubruille, Stéphanie; Etienne, Anne-Marie; Liénard, Aurore; Merckaert, Isabelle; Reynaert, Christine; Slachmuylder, Jean-Louis; Straetmans, Nicole; Van Den Neste, Eric; Bron, Dominique; Razavi, Darius

2017-12-01

Despite the well-known negative impacts of cancer and anticancer therapies on cognitive performance, little is known about the cognitive compensatory processes of older patients with cancer. This study was designed to investigate the cognitive compensatory processes of older, clinically fit patients with hematologic malignancies undergoing chemotherapy. We assessed 89 consecutive patients (age ≥ 65 y) without severe cognitive impairment and 89 age-, sex-, and education level-matched healthy controls. Cognitive compensatory processes were investigated by (1) comparing cognitive performance of patients and healthy controls in novel (first exposure to cognitive tasks) and non-novel (second exposure to the same cognitive tasks) contexts, and (2) assessing psychological factors that may facilitate or inhibit cognitive performance, such as motivation, psychological distress, and perceived cognitive performance. We assessed cognitive performance with the Trail-Making, Digit Span and FCSR-IR tests, psychological distress with the Hospital Anxiety and Depression Scale, and perceived cognitive performance with the FACT-Cog questionnaire. In novel and non-novel contexts, average cognitive performances of healthy controls were higher than those of patients and were associated with motivation. Cognitive performance of patients was not associated with investigated psychological factors in the novel context but was associated with motivation and psychological distress in the non-novel context. Older, clinically fit patients with hematologic malignancies undergoing chemotherapy demonstrated lower cognitive compensatory processes compared to healthy controls. Reducing distress and increasing motivation may improve cognitive compensatory processes of patients in non-novel contexts. Copyright © 2017 John Wiley & Sons, Ltd.
Assessment of Stage 35 With APNASA

NASA Technical Reports Server (NTRS)

Celestina, Mark L.; Mulac, Richard

2009-01-01

An assessment of APNASA was conducted at NASA Glenn Research Center under the Fundamental Aeronautics Program to determine its predictive capabilities. The geometry selected for this study was Stage 35, which is a single-stage transonic compressor. A speedline at 100% speed was generated and compared to experimental data at 100% speed for two turbulence models. Performance of the stage at 100% speed and profiles of several key aerodynamic parameters are compared to the survey data downstream of the stator in this report. In addition, hub leakage was modeled and compared to solutions without leakage and the available experimental data.
Good, better, best? A comprehensive comparison of healthcare providers' performance: An application to physiotherapy practices in primary care.

PubMed

Steenhuis, Sander; Groeneweg, Niels; Koolman, Xander; Portrait, France

2017-12-01

Most payment methods in healthcare stimulate volume-driven care, rather than value-driven care. Value-based payment methods such as Pay-For-Performance have the potential to reduce costs and improve quality of care. Ideally, outcome indicators are used in the assessment of providers' performance. The aim of this paper is to describe the feasibility of assessing and comparing the performances of providers using a comprehensive set of quality and cost data. We had access to unique and extensive datasets containing individual data on PROMs, PREMs and costs of physiotherapy practices in Dutch primary care. We merged these datasets at the patient-level and compared the performances of these practices using case-mix corrected linear regression models. Several significant differences in performance were detected between practices. These results can be used by both physiotherapists, to improve treatment given, and insurers to support their purchasing decisions. The study demonstrates that it is feasible to compare the performance of providers using PROMs and PREMs. However, it would take an extra effort to increase usefulness and it remains unclear under which conditions this effort is cost-effective. Healthcare providers need to be aware of the added value of registering outcomes to improve their quality. Insurers need to facilitate this by designing value-based contracts with the right incentives. Only then can payment methods contribute to value-based healthcare and increase value for patients. Copyright © 2017 Elsevier B.V. All rights reserved.
Evaluation of staff performance and interpretation of the screening program for prevention of thalassemia.

PubMed

Prommetta, Simaporn; Sanchaisuriya, Kanokwan; Fucharoen, Goonnapa; Yamsri, Supawadee; Chaiboonroeng, Attawut; Fucharoen, Supan

2017-06-15

A thalassemia screening program has been implemented for years in Southeast Asia, but no external quality assessment program has been established. We have developed and initiated the proficiency testing (PT) program for the first time in Thailand with the aim of assessing the screening performance of laboratory staff and their competency in interpretation of the screening results. Three PT cycles per year were organized. From the first to the third cycle of the PT scheme, the total number of participating laboratories increased from 59 to 67. In each cycle, 2 PT items (assigned as blood samples of the couple) were provided. Performance evaluation was based on the accuracy of screening results, i.e., mean corpuscular volume (MCV), mean corpuscular haemoglobin (MCH) and the dichlorophenolindophenol (DCIP) test for haemoglobin E, including the competency in interpretation of screening results and assessment of foetal risk. Performance was assessed by comparing the participants' results against the assigned value. Across all 3 cycles, most laboratories reported acceptable MCV and MCH values. From the first to the third cycle, incorrect DCIP test and misinterpretation rates decreased, while incorrect risk assessment varied from cycle to cycle. Combining the accuracy of thalassemia screening and the competency in interpretation and risk assessment, approximately half of participants showed excellent performance. Improved performance observed in many laboratories reflects the achievement and benefit of the PT program, which should be regularly provided.
Abstraction of information in repository performance assessments. Examples from the SKI project Site-94

DOE Office of Scientific and Technical Information (OSTI.GOV)

Dverstorp, B.; Andersson, J.

1995-12-01

Performance Assessment of a nuclear waste repository implies an analysis of a complex system with many interacting processes. Even if some of these processes may be known in great detail, problems arise when combining all information, and means of abstracting information from complex detailed models into models that couple different processes are needed. Clearly, one of the major objectives of performance assessment, to calculate doses or other performance indicators, implies an enormous abstraction of information compared to all information that is used as input. Other problems are that the knowledge of different parts or processes is strongly variable and adjustments and interpretations are needed when combining models from different disciplines. In addition, people as well as computers, even today, always have a limited capacity to process information and choices have to be made. However, because abstraction of information clearly is unavoidable in performance assessment, the validity of choices made always needs to be scrutinized and judgements made need to be updated in an iterative process.
Application of Athletic Movement Tests that Predict Injury Risk in a Military Population: Development of Normative Data.

PubMed

Teyhen, Deydre S; Shaffer, Scott W; Butler, Robert J; Goffar, Stephen L; Kiesel, Kyle B; Rhon, Daniel I; Boyles, Robert E; McMillian, Daniel J; Williamson, Jared N; Plisky, Phillip J

2016-10-01

Performance on movement tests helps to predict injury risk in a variety of physically active populations. Understanding baseline measures for normal is an important first step. The objectives were to determine differences in physical performance assessments and to describe normative values for these tests based on military unit type. Power, balance, mobility, motor control, and performance on the Army Physical Fitness Test were assessed in a cohort of 1,466 soldiers. Analysis of variance was performed to compare the results based on military unit type (Rangers, Combat, Combat Service, and Combat Service Support) and analysis of covariance was performed to determine the influence of age and gender. Rangers performed the best on all performance and fitness measures (p < 0.05). Combat soldiers performed better than Combat Service and Service Support soldiers on several physical performance tests and the Army Physical Fitness Test (p < 0.05). Performance in Combat Service and Service Support soldiers was equivalent on most measures (p < 0.05). Functional performance and level of fitness varied significantly by military unit type. Understanding these differences will provide a foundation for future injury prediction and prevention strategies. Reprint & Copyright © 2016 Association of Military Surgeons of the U.S.
Top-mounted inlet system feasibility for transonic-supersonic fighter aircraft. [V/STOL aircraft

NASA Technical Reports Server (NTRS)

Williams, T. L.; Hunt, B. L.; Smeltzer, D. B.; Nelms, W. P.

1981-01-01

The more salient findings of recent top inlet performance evaluations, aimed at assessing the feasibility of top-mounted inlet systems for transonic-supersonic fighter aircraft applications, are presented. Top inlet flow field and engine-inlet performance test data show the influence of key aircraft configuration variables (inlet longitudinal position, wing leading-edge extension planform area, canopy-dorsal integration, and variable incidence canards) on top inlet performance over the Mach range of 0.6 to 2.0. Top inlet performance data are compared with those of more conventional inlet/airframe integrations in an effort to assess the viability of top-mounted inlet systems relative to conventional inlet installations.
Massive Transfusion: The Revised Assessment of Bleeding and Transfusion (RABT) Score.

PubMed

Joseph, Bellal; Khan, Muhammad; Truitt, Michael; Jehan, Faisal; Kulvatunyou, Narong; Azim, Asad; Jain, Arpana; Zeeshan, Muhammad; Tang, Andrew; O'Keeffe, Terence

2018-05-21

Massive transfusion (MT) is a lifesaving treatment for trauma patients with hemorrhagic shock, assessed by the Assessment of Blood Consumption (ABC) Score based on mechanism of injury, systolic blood pressure (SBP), tachycardia, and FAST exam. The aim of this study was to assess the performance of the ABC score after replacing hypotension and tachycardia with Shock Index (SI) > 1.0 and including pelvic fractures. We performed a 2-year (2014-2015) analysis of all high-level trauma activations and excluded patients dead on arrival. The ABC score was calculated using the 4-point score [blunt (0)/penetrating trauma (1), HR ≥ 120 (1), SBP ≤ 90 mmHg (1), and FAST positive (1)]. The Revised Assessment of Bleeding and Transfusion (RABT) score also included 4 points, calculated by replacing HR and SBP with SI > 1.0 and including pelvic fracture. AUROC compared performances of the two scores. A total of 380 patients were included. The overall MT was 27%. Patients receiving MT had higher median ABC scores [1.1 (0-2) vs. 1 (0-2), p = 0.15] and RABT scores [2 (1-3) vs. 1 (0-2), p < 0.001]. The RABT score had better discriminative power (AUROC = 0.828) compared to the ABC score (AUROC = 0.617) for predicting the need for MT. A cutoff of RABT score ≥ 2 had a sensitivity of 84% and specificity of 77% for predicting need for MT, compared to the ABC score with 39% sensitivity and 72% specificity. Replacement of hypotension and tachycardia with an SI > 1.0 and inclusion of pelvic fracture enhanced discrimination of the ABC score for predicting the need for MT. The current ABC score would benefit from revision to more appropriately identify patients requiring MT.
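The two scores described in this record can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the `TraumaPatient` record and its field names are hypothetical, Shock Index is taken as heart rate divided by systolic blood pressure, and each RABT component is assumed to contribute one point, which is how the abstract's "4 points" is read here. It is not clinical software.

```python
from dataclasses import dataclass


@dataclass
class TraumaPatient:
    """Hypothetical record; field names are assumptions for illustration."""
    penetrating: bool      # mechanism of injury: penetrating (True) or blunt (False)
    heart_rate: float      # beats per minute
    systolic_bp: float     # mmHg
    fast_positive: bool    # FAST exam result
    pelvic_fracture: bool


def abc_score(p: TraumaPatient) -> int:
    """ABC score as listed in the abstract: one point per criterion met."""
    return (int(p.penetrating)
            + int(p.heart_rate >= 120)
            + int(p.systolic_bp <= 90)
            + int(p.fast_positive))


def shock_index(p: TraumaPatient) -> float:
    """Shock Index: heart rate divided by systolic blood pressure."""
    return p.heart_rate / p.systolic_bp


def rabt_score(p: TraumaPatient) -> int:
    """RABT score: HR and SBP criteria replaced by SI > 1.0, plus pelvic fracture.

    One point per component is an assumption based on the abstract's wording.
    """
    return (int(p.penetrating)
            + int(shock_index(p) > 1.0)
            + int(p.fast_positive)
            + int(p.pelvic_fracture))


# A made-up patient: blunt trauma, HR 110, SBP 100, FAST positive, pelvic fracture.
example = TraumaPatient(False, 110.0, 100.0, True, True)
print(abc_score(example), rabt_score(example))
```

In this made-up case neither vital-sign threshold of the ABC score is met, while the Shock Index of 1.1 and the pelvic fracture both count toward the RABT score, which is the kind of patient the revision is meant to capture.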
Superior staging of liver tumors with laparoscopy and laparoscopic ultrasound.

PubMed Central

John, T G; Greig, J D; Crosbie, J L; Miles, W F; Garden, O J

1994-01-01

OBJECTIVE. The authors describe the technique of staging laparoscopy with laparoscopic contact ultrasonography in the preoperative assessment of patients with liver tumors, and assess its impact on the selection of patients for hepatic resection with curative intent. SUMMARY BACKGROUND DATA. Laparoscopy may be useful in the selection of patients with a variety of intra-abdominal malignancies for operative intervention. Laparoscopic ultrasonography is a new technique that combines the principles of high resolution intraoperative contact ultrasound with those of the laparoscopic examination, and thus, allows the laparoscopist to perform detailed assessment of the liver. METHODS. This study analyzes a cohort of 50 consecutive patients who were diagnosed as having potentially resectable liver tumors, and in whom staging laparoscopy was successfully undertaken. Laparoscopic ultrasonography was performed in 43 patients, and the impact of the ensuing findings on the decision to proceed to operative assessment of resectability is examined. The resectability rate in those patients assessed laparoscopically and subsequently submitted to laparotomy is compared with a preceding group of patients in whom no laparoscopic assessment was performed. RESULTS. Laparoscopy demonstrated factors precluding curative resection in 23 patients (46%). Laparoscopic ultrasonography identified liver tumors not visible during laparoscopy in 14 patients (33%), and provided staging information in addition to that derived from laparoscopy alone in 18/43 patients (42%). The resectability rate was significantly higher among those patients undergoing laparoscopic staging (93%) compared with those in whom operative assessment was undertaken without laparoscopy (58%). CONCLUSIONS. Staging laparoscopy with laparoscopic ultrasonography optimizes patient selection for liver resection with curative intent. PMID:7986136
Prediction of Esophageal Varices in Patients with Cirrhosis: Usefulness of Three-dimensional MR Elastography with Echo-planar Imaging Technique

PubMed Central

Shin, Sung Ui; Yu, Mi Hye; Yoon, Jeong Hee; Han, Joon Koo; Choi, Byung-Ihn; Glaser, Kevin J.; Ehman, Richard L.

2014-01-01

Purpose To determine the diagnostic performance of magnetic resonance (MR) elastography in comparison to spleen length and dynamic contrast material–enhanced (DCE) MR imaging in association with esophageal varices in patients with liver cirrhosis by using endoscopy as the reference standard. Materials and Methods This retrospective study received institutional review board approval, and informed consent was waived. One hundred thirty-nine patients with liver cirrhosis who underwent liver DCE MR imaging, including MR elastography, were included. Hepatic stiffness (HS) and spleen stiffness (SS) values assessed with MR elastography, as well as spleen length, were correlated with the presence of esophageal varices and high-risk varices by using Spearman correlation analysis. The diagnostic performance of MR elastography was compared with that of DCE MR imaging and combined assessment of MR elastography and DCE MR imaging by using receiver operating characteristic analysis. MR elastography reproducibility was assessed prospectively, with informed consent, in another 15 patients by using intraclass correlation coefficients. Results There were significant positive linear correlations between HS, SS, and spleen length and the grade of esophageal varices (r = 0.46, r = 0.48, and r = 0.36, respectively; all P < .0001). HS and SS values (>4.81 kPa and >7.60 kPa, respectively) showed better performance than did spleen length in the association with esophageal varices (P = .0306 and P = .0064, respectively). Diagnostic performance of HS and SS in predicting high-risk varices was comparable to that of DCE MR imaging (P = .1282 and P = .1371, respectively). When MR elastography and DCE MR imaging were combined, sensitivity improved significantly (P = .0004). MR elastography was highly reproducible (intraclass correlation coefficient > 0.9). 
Conclusion HS and SS are associated with esophageal varices and showed better performance than did spleen length in assessing the presence of esophageal varices. MR elastography is comparable to DCE MR imaging in predicting the presence of esophageal varices and high-risk varices, but, when assessed in combination, sensitivity is higher. © RSNA, 2014 Online supplemental material is available for this article. PMID:24620910
Constructivism and the use of performance assessment in science: A comparative study of beliefs among preservice and inservice teachers

NASA Astrophysics Data System (ADS)

Bednarski, Marsha H.

Reform efforts in science education stress the importance of preservice and inservice teacher education in curriculum, instruction, and assessment. A change in current student assessment practices is seen as the catalyst in the reform of curriculum and instruction. Recommended for assessment of the proposed inquiry-based science programs are performance-based assessments (National Research Council, 1996). The constructivist philosophy, the foundation for these reform efforts, proposes that knowledge acquisition by the learner is a result of the interaction between what is brought to the learning situation and what is experienced while in it. Literature supports the use of constructivist-based instructional strategies for preservice and inservice teacher education (American Federation of Teachers, National Council on Measurement in Education, and National Education Association, 1990). Literature also provides support for the importance of teacher beliefs in relation to the successful transfer of these instructional strategies (Keegan, 1992; Nespor, 1987). There is not supporting evidence related to constructivist instructional strategies and teacher beliefs transferring to the use of performance assessment. This study identified whether preservice and inservice teachers differed with respect to their beliefs about constructivist-based learning strategies and performance assessment. It also identified whether teacher beliefs held about constructivist-based learning strategies were related to the construction of assessments they developed for use in their classrooms. Education majors enrolled in a Northeastern university's assessment course and inservice teachers from three Northeast public school districts participated in this study. 
Results of a 36-item belief survey, administered to preservice and inservice teachers, and a 10-item checklist, used to score assessment examples provided by the teachers, indicated that attitudes toward constructivist-based learning strategies are a predictor of membership in the inservice teacher group. There is a correlation between attitudes toward constructivism and attitudes related to the benefits of using performance assessments for both the preservice and inservice groups. There is not a significant correlation between constructivist attitudes and using performance assessment. Although teachers in this study hold constructivist attitudes and acknowledge the benefits of using performance assessment, they do not use performance assessments.
78 FR 59866 - New Car Assessment Program (NCAP)

Federal Register 2010, 2011, 2012, 2013, 2014

2013-09-30

... because ESC is now required for all light vehicles. For many years, NCAP has provided comparative... site, www.safercar.gov . NCAP provides comparative information on the safety performance and features... Features on www.safercar.gov are designed to assist drivers in avoiding backover crashes. After considering...
Validating the Japanese translation of the Force and Motion Conceptual Evaluation and comparing performance levels of American and Japanese students

NASA Astrophysics Data System (ADS)

Ishimoto, Michi; Thornton, Ronald K.; Sokoloff, David R.

2014-12-01

This study assesses the Japanese translation of the Force and Motion Conceptual Evaluation (FMCE). Researchers are often interested in comparing the conceptual ideas of students with different cultural backgrounds. The FMCE has been useful in identifying the concepts of English-speaking students from different backgrounds. To identify effectively the conceptual ideas of Japanese students and to compare them to those of their English-speaking counterparts, more work is required. Because of differences between the Japanese and English languages, and between the Japanese and American educational systems, it is important to assess the Japanese translation of the FMCE, a conceptual evaluation originally developed in English for American students. To assess its appropriateness, we examined the performance of a large sample of students on the translated version of the FMCE and then compared the results to those of English-speaking students. The data comprise the pretest results of 1095 students, most of whom were first-year students at a midlevel engineering school between 2003 and 2012. Basic statistics and the classical test theory indices of the translated FMCE indicate that its reliability and discrimination are appropriate to assess Japanese students' concepts about force and motion. In general, the preconcepts of Japanese students assessed with the Japanese translation of the FMCE are quite similar to those of American students assessed with the FMCE, thereby supporting the validity of the translated version. However, our findings do show (1) that only a small percentage of Japanese students grasped Newtonian concepts and (2) that the percentage of Japanese students who used two different concept models together to answer some questions seems to be higher than that of American students.
Economics within Social Studies: A Comparative Analysis of Student Performance on the 2012 Kansas History-Government Assessment

ERIC Educational Resources Information Center

Deplazes, Svetlana P.

2014-01-01

The purpose of this study was to examine the overall level of student achievement on the 2012 Kansas History-Government Assessment in Grades 6, 8, and high school, with major emphasis on the subject area of economics. It explored four specific research questions in order to: (1) determine the level of student knowledge of assessed economic…
Assessing Conceptual Understanding via Literacy-Infused, Inquiry-Based Science among Middle School English Learners and Economically-Challenged Students

ERIC Educational Resources Information Center

Lara-Alecio, Rafael; Irby, Beverly J.; Tong, Fuhui; Guerrero, Cindy; Koch, Janice; Sutton-Jones, Kara L.

2018-01-01

The overarching purpose of our study was to compare performances of treatment and control condition students who completed a literacy-infused, inquiry-based science intervention through sixth grade as measured by a big idea assessment tool which we refer to as the Big Ideas in Science Assessment (BISA). First, we determine the concurrent validity…
Poor performances of EuroSCORE and CARE score for prediction of perioperative mortality in octogenarians undergoing aortic valve replacement for aortic stenosis.

PubMed

Chhor, Vibol; Merceron, Sybille; Ricome, Sylvie; Baron, Gabriel; Daoud, Omar; Dilly, Marie-Pierre; Aubier, Benjamin; Provenchere, Sophie; Philip, Ivan

2010-08-01

Although results of cardiac surgery are improving, octogenarians have a higher procedure-related mortality and more complications with increased length of stay in ICU. Consequently, careful evaluation of perioperative risk seems necessary. The aims of our study were to assess and compare the performances of EuroSCORE and CARE score in the prediction of perioperative mortality among octogenarians undergoing aortic valve replacement for aortic stenosis and to compare these predictive performances with those obtained in younger patients. This retrospective study included all consecutive patients undergoing cardiac surgery in our institution between November 2005 and December 2007. For each patient, risk assessment for mortality was performed using logistic EuroSCORE, additive EuroSCORE and CARE score. The main outcome measure was early postoperative mortality. Predictive performances of these scores were assessed by calibration and discrimination using goodness-of-fit test and area under the receiver operating characteristic curve, respectively. During this 2-year period, we studied 2117 patients, among whom 134/211 octogenarians and 335/1906 nonoctogenarians underwent an aortic valve replacement for aortic stenosis. When considering patients with aortic stenosis, discrimination was poor in octogenarians and the difference from nonoctogenarians was significant for each score (0.58, 0.59 and 0.56 vs. 0.82, 0.81 and 0.77 for additive EuroSCORE, logistic EuroSCORE and CARE score in octogenarians and nonoctogenarians, respectively, P < 0.05). Moreover, in the whole cohort, logistic EuroSCORE significantly overestimated mortality among octogenarians. Predictive performances of these scores are poor in octogenarians undergoing cardiac surgery, especially aortic valve replacement. Risk assessment and therapeutic decisions in octogenarians should not be made with these scoring systems alone.
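Discrimination in this record is summarized by the area under the receiver operating characteristic curve. As a minimal sketch of what that number measures, the function below computes it through its pairwise interpretation: the fraction of (death, survivor) pairs in which the patient who died received the higher risk score, with ties counted as half. The function name and the toy data are assumptions for illustration and are not taken from the study.

```python
def auroc(scores, outcomes):
    """Area under the ROC curve via pairwise comparison.

    scores   -- predicted risk per patient (higher means higher predicted risk)
    outcomes -- 1 if the event (e.g. early mortality) occurred, else 0
    """
    positives = [s for s, y in zip(scores, outcomes) if y == 1]
    negatives = [s for s, y in zip(scores, outcomes) if y == 0]
    if not positives or not negatives:
        raise ValueError("need at least one event and one non-event")

    concordant = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                concordant += 1.0   # event case ranked above non-event case
            elif p == n:
                concordant += 0.5   # ties count as half
    return concordant / (len(positives) * len(negatives))


# Toy data (invented): four patients, two of whom died.
risk = [1.0, 2.0, 3.0, 4.0]
died = [0, 1, 0, 1]
print(auroc(risk, died))
```

A value of 0.5 corresponds to chance-level ranking and 1.0 to perfect separation, which is why values near 0.58 in octogenarians read as poor discrimination compared with values near 0.8 in younger patients.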
Ecosystem Model Skill Assessment. Yes We Can!

PubMed Central

Olsen, Erik; Fay, Gavin; Gaichas, Sarah; Gamble, Robert; Lucey, Sean; Link, Jason S.

2016-01-01

Need to Assess the Skill of Ecosystem Models Accelerated changes to global ecosystems call for holistic and integrated analyses of past, present and future states under various pressures to adequately understand current and projected future system states. Ecosystem models can inform management of human activities in a complex and changing environment, but are these models reliable? Ensuring that models are reliable for addressing management questions requires evaluating their skill in representing real-world processes and dynamics. Skill has been evaluated for just a limited set of some biophysical models. A range of skill assessment methods have been reviewed but skill assessment of full marine ecosystem models has not yet been attempted. Northeast US Atlantis Marine Ecosystem Model We assessed the skill of the Northeast U.S. (NEUS) Atlantis marine ecosystem model by comparing 10-year model forecasts with observed data. Model forecast performance was compared to that obtained from a 40-year hindcast. Multiple metrics (average absolute error, root mean squared error, modeling efficiency, and Spearman rank correlation), and a suite of time-series (species biomass, fisheries landings, and ecosystem indicators) were used to adequately measure model skill. Overall, the NEUS model performed above average and thus better than expected for the key species that had been the focus of the model tuning. Model forecast skill was comparable to the hindcast skill, showing that model performance does not degenerate in a 10-year forecast mode, an important characteristic for an end-to-end ecosystem model to be useful for strategic management purposes. Skill Assessment Is Both Possible and Advisable We identify best-practice approaches for end-to-end ecosystem model skill assessment that would improve both operational use of other ecosystem models and future model development. 
We show not only that it is possible to assess the skill of a complicated marine ecosystem model, but also that it is necessary to do so to instill confidence in model results and encourage their use for strategic management. Our methods are applicable to any type of predictive model, and should be considered for use in fields outside ecology (e.g. economics, climate change, and risk assessment). PMID:26731540

Judicial Assessment of the Credibility of Child Witnesses

PubMed Central

Bala, Nicholas; Ramakrishnan, Karuna; Lindsay, Roderick; Lee, Kang

2010-01-01

This article reports on the results of two research studies carried out by the authors that address the questions of how and how well judges assess the honesty and reliability of children’s testimony. One study tested the accuracy of judges and other professionals in assessing the honesty of children giving mock testimony. Judges performed at only slightly above chance levels, though the performance of judges was comparable to other justice system professionals, and significantly better than the performance of law students. The second study, a survey of Canadian judges about their perceptions of child witnesses, reveals that judges believe that compared to adults, children are generally more likely when testifying to make errors due to limitations of their memory or communication skills and due to the effects of suggestive questions. However, children are perceived to generally be more honest than adult witnesses. The survey also revealed that judges believe that children are often asked developmentally inappropriate questions in court, especially by defence counsel. There were no gender differences among the judges in either study. To put this research in context, the article first discusses the inherent challenges in assessing the credibility of witnesses and provides a review of the psychological literature and leading Canadian jurisprudence on the credibility and evidence of children. PMID:26566290
Assessment of Alternative RF Linac Structures for APT

DOE Office of Scientific and Technical Information (OSTI.GOV)

None

The APT program has been examining both normal and superconducting variants of the APT linac for the past two years. A decision on which of the two will be the selected technology will depend upon several considerations including the results of ongoing feasibility experiments, the performance and overall attractiveness of each of the design concepts, and an assessment of the system-level features of both alternatives. The primary objective of the Assessment of Alternative RF Linac Structures for APT study reported herein was to assess and compare, at the system-level, the performance, capital and life cycle costs, reliability/availability/maintainability (RAM) and manufacturing schedules of APT RF linear accelerators based upon both superconducting and normal conducting technologies. A secondary objective was to perform trade studies to explore opportunities for system optimization, technology substitution and alternative growth pathways and to identify sensitivities to design uncertainties.
Why did the bear cross the road? Comparing the performance of multiple resistance surfaces and connectivity modeling methods

Treesearch

Samuel A. Cushman; Jesse S. Lewis; Erin L. Landguth

2014-01-01

There have been few assessments of the performance of alternative resistance surfaces, and little is known about how connectivity modeling approaches differ in their ability to predict organism movements. In this paper, we evaluate the performance of four connectivity modeling approaches applied to two resistance surfaces in predicting the locations of highway...
What Does a Student Know Who Earns a Top Score on the Advanced Placement Chemistry Exam?

ERIC Educational Resources Information Center

Claesgens, Jennifer; Daubenmire, Paul L.; Scalise, Kathleen M.; Balicki, Scott; Gochyyev, Perman; Stacy, Angelica M.

2014-01-01

This paper compares the performance of students at a high-performing U.S. public school (n = 64) on the advanced placement (AP) chemistry exam to their performance on the ChemQuery assessment system. The AP chemistry exam was chosen because, as the National Research Council acknowledges, it is the "perceived standard of excellence and school…
Memory and metacognition in dangerous situations: investigating cognitive impairment from gas narcosis in undersea divers.

PubMed

Hobbs, Malcolm; Higham, Philip A; Kneller, Wendy

2014-06-01

The current study tested whether undersea divers are able to accurately judge their level of memory impairment from inert gas narcosis. Inert gas narcosis causes a number of cognitive impairments, including a decrement in memory ability. Undersea divers may be unable to accurately judge their level of impairment, affecting safety and work performance. In two underwater field experiments, performance decrements on tests of memory at 33 to 42 m were compared with self-ratings of impairment and resolution. The effect of depth (shallow [1-11 m] vs. deep [33-42 m]) was measured on free-recall (Experiment 1; n = 41) and cued-recall (Experiment 2; n = 39) performance, a visual-analogue self-assessment rating of narcotic impairment, and the accuracy of judgements-of-learning (JOLs). Both free- and cued-recall were significantly reduced in deep, compared to shallow, conditions. This decrement was accompanied by an increase in self-assessed impairment. In contrast, resolution (based on JOLs) remained unaffected by depth. The dissociation of memory accuracy and resolution, coupled with a shift in a self-assessment of impairment, indicated that divers were able to accurately judge their decrease in memory performance at depth. These findings suggest that impaired self-assessment and resolution may not actually be a symptom of narcosis in the depth range of 33 to 42 m underwater and that the divers in this study were better equipped to manage narcosis than prior literature suggested. The results are discussed in relation to implications for diver safety and work performance.
Transfer of skills on LapSim virtual reality laparoscopic simulator into the operating room in urology.

PubMed

Alwaal, Amjad; Al-Qaoud, Talal M; Haddad, Richard L; Alzahrani, Tarek M; Delisle, Josee; Anidjar, Maurice

2015-01-01

Assessing the predictive validity of the LapSim simulator within a urology residency program. Twelve urology residents at McGill University were enrolled in the study between June 2008 and December 2011. The residents had weekly training on the LapSim that consisted of 3 tasks (cutting, clip-applying, and lifting and grasping). They underwent monthly assessment of their LapSim performance using total time, tissue damage and path length among other parameters as surrogates for their economy of movement and respect for tissue. The residents' last LapSim performance was compared with their first performance of radical nephrectomy on anesthetized porcine models in their 4th year of training. Two independent urologic surgeons rated the resident performance on the porcine models, and a kappa test with standardized weight function was used to assess for inter-observer bias. A nonparametric Spearman correlation test was used to compare each rater's cumulative score with the cumulative score obtained on the porcine models in order to test the predictive validity of the LapSim simulator. The kappa results demonstrated acceptable agreement between the two observers among all domains of the rating scale of performance except for confidence of movement and efficiency. In addition, poor predictive validity of the LapSim simulator was demonstrated. Predictive validity was not demonstrated for the LapSim simulator in the context of a urology residency training program.
Residents' response to bleeding during a simulated robotic surgery task.

PubMed

Walker, Jessica L; Nathwani, Jay N; Mohamadipanah, Hossein; Laufer, Shlomi; Jocewicz, Frank F; Gwillim, Eran; Pugh, Carla M

2017-12-01

The aim of this study was to assess performance measurement validity of our newly developed robotic surgery task trainer. We hypothesized that residents would exhibit wide variations in their intercohort performance as well as a measurable difference compared to surgeons in fellowship training. Our laboratory synthesized a model of a pelvic tumor that simulates unexpected bleeding. Surgical residents and fellows of varying specialties completed a demographic survey and were allowed 20 minutes to resect the tumor using the da Vinci robot and achieve hemostasis. At a standardized event in the simulation, venous bleeding began, and participants attempted hemostasis using suture ligation. A motion tracking system, using electromagnetic sensors, recorded participants' hand movements. A postparticipation Likert scale survey evaluated participants' assessment of the model's realism and usefulness. Three of the seven residents (postgraduate year 2-5), and the fellow successfully resected the tumor in the allotted time. Residents showed high variability in performance and blood loss (125-700 mL) both within their cohort and compared to the fellow (150 mL blood). All participants rated the model as having high realism and utility for trainees. The results support that our bleeding pelvic tumor simulator has the ability to discriminate resident performance in robotic surgery. The combination of motion, decision-making, and blood loss metrics offers a multilevel performance assessment, analyzing both technical and decision-making abilities. Copyright © 2017 Elsevier Inc. All rights reserved.
ASSESSMENT OF TWO PHYSICALLY BASED WATERSHED MODELS BASED ON THEIR PERFORMANCES OF SIMULATING SEDIMENT MOVEMENT OVER SMALL WATERSHEDS

EPA Science Inventory

Abstract: Two physically based and deterministic models, CASC2-D and KINEROS are evaluated and compared for their performances on modeling sediment movement on a small agricultural watershed over several events. Each model has different conceptualization of a watershed. CASC...
Qualified to Lead? A Comparative, Contextual and Cultural View of Educational Policy Borrowing

ERIC Educational Resources Information Center

Harris, Alma; Jones, Michelle; Adams, Donnie

2016-01-01

Background: Around the globe, education policy borrowing remains pervasive and prevalent. The strategies, interventions and innovations of education systems that perform well, in international assessments, are enthusiastically borrowed and copied in the anticipation of similar educational performance and outcomes. Purpose: This purpose of the…
Facilities Performance Indicators Report, 2006-07

ERIC Educational Resources Information Center

Glazner, Steve, Ed.

2008-01-01

The "Facilities Performance Indicators Survey" ("FPI") supersedes and builds upon the two major surveys APPA conducted in the past: the Comparative Costs and Staffing (CCAS) survey and the Strategic Assessment Model (SAM). The "FPI" covers all the materials collected in CCAS and SAM, along with some select new data points and improved survey…
EFFECTS OF VERTICAL-LAYER STRUCTURE AND BOUNDARY CONDITIONS ON CMAQ-V4.5 AND V4.6 MODELS

EPA Science Inventory

This work is aimed at determining whether the increased vertical layers in CMAQ provide substantially improved model performance and assessing whether using the spatially and temporally varying boundary conditions from GEOS-CHEM offers improved model performance as compared to the d...
Development of Malayalam Handwriting Scale for School Students in Kerala

ERIC Educational Resources Information Center

Gafoor, K. Abdul; Naseer, A. R.

2015-01-01

With a view to support instruction, formative and summative assessment and to provide model handwriting performance for students to compare their own performance, a Malayalam handwriting scale is developed. Data from 2640 school students belonging to Malappuram, Palakkad and Kozhikode districts, sampled by taking 240 students per each grade…
ASSESSMENT OF TWO PHYSICALLY-BASED WATERSHED MODELS BASED ON THEIR PERFORMANCES OF SIMULATING WATER AND SEDIMENT MOVEMENT

EPA Science Inventory

Two physically based watershed models, GSSHA and KINEROS-2 are evaluated and compared for their performances on modeling flow and sediment movement. Each model has a different watershed conceptualization. GSSHA divides the watershed into cells, and flow and sediments are routed t...
Assessment of sand quality on concrete performance : examination of acidic and sulfate/sulfide-bearing sands.

DOT National Transportation Integrated Search

2014-12-01

The purpose of this research is to examine how the presence of sulfide- and sulfate-containing minerals in acidic aggregates may affect the properties of mortar and concrete. Analyses were performed to compare two sands from a deposit in the Geor...
Ten Years of Experience with a Performance-Based Promotional Selection and Career Development System within State Government.

ERIC Educational Resources Information Center

Baugher, Dan; And Others

1994-01-01

The New York State Division of Budget uses a decentralized system to assess promotion candidates by comparing their training, experience, and recent performance to the proposed position. Managers and candidates find the system more effective than traditional written/oral exams. (SK)
Assessing Interval Estimation Methods for Hill Model ...

EPA Pesticide Factsheets

The Hill model of concentration-response is ubiquitous in toxicology, perhaps because its parameters directly relate to biologically significant metrics of toxicity such as efficacy and potency. Point estimates of these parameters obtained through least squares regression or maximum likelihood are commonly used in high-throughput risk assessment, but such estimates typically fail to include reliable information concerning confidence in (or precision of) the estimates. To address this issue, we examined methods for assessing uncertainty in Hill model parameter estimates derived from concentration-response data. In particular, using a sample of ToxCast concentration-response data sets, we applied four methods for obtaining interval estimates that are based on asymptotic theory, bootstrapping (two varieties), and Bayesian parameter estimation, and then compared the results. These interval estimation methods generally did not agree, so we devised a simulation study to assess their relative performance. We generated simulated data by constructing four statistical error models capable of producing concentration-response data sets comparable to those observed in ToxCast. We then applied the four interval estimation methods to the simulated data and compared the actual coverage of the interval estimates to the nominal coverage (e.g., 95%) in order to quantify performance of each of the methods in a variety of cases (i.e., different values of the true Hill model paramet
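As a rough illustration of the coverage comparison described in this record, the sketch below defines a Hill curve and computes the fraction of interval estimates that contain a known true parameter value. The three-parameter form, the function names (`hill`, `empirical_coverage`) and the example intervals are assumptions made for illustration; they are not the parameterization, code or data used in the study.

```python
def hill(conc, top, ac50, slope):
    """Three-parameter Hill curve (assumed form): response rises from 0 toward `top`."""
    return top * conc**slope / (ac50**slope + conc**slope)


def empirical_coverage(intervals, true_value):
    """Fraction of (lower, upper) intervals that contain the true parameter value."""
    hits = sum(1 for lower, upper in intervals if lower <= true_value <= upper)
    return hits / len(intervals)


# Hypothetical interval estimates of AC50 from four simulated data sets.
intervals = [(0.8, 1.4), (1.1, 1.9), (0.5, 0.9), (0.9, 1.2)]
coverage = empirical_coverage(intervals, true_value=1.0)
print(f"empirical coverage = {coverage:.2f}")
```

Comparing such an empirical coverage with the nominal level (e.g., 95%) is one simple way to quantify how well an interval method performs on simulated data where the true parameter is known.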
The fractured landscape of RNA-seq alignment: the default in our STARs.

PubMed

Ballouz, Sara; Dobin, Alexander; Gingeras, Thomas R; Gillis, Jesse

2018-06-01

Many tools are available for RNA-seq alignment and expression quantification, with comparative value being hard to establish. Benchmarking assessments often highlight methods' good performance, but are focused on either model data or fail to explain variation in performance. This leaves us to ask, what is the most meaningful way to assess different alignment choices? And importantly, where is there room for progress? In this work, we explore the answers to these two questions by performing an exhaustive assessment of the STAR aligner. We assess STAR's performance across a range of alignment parameters using common metrics, and then on biologically focused tasks. We find technical metrics such as fraction mapping or expression profile correlation to be uninformative, capturing properties unlikely to have any role in biological discovery. Surprisingly, we find that changes in alignment parameters within a wide range have little impact on both technical and biological performance. Yet, when performance finally does break, it happens in difficult regions, such as X-Y paralogs and MHC genes. We believe improved reporting by developers will help establish where results are likely to be robust or fragile, providing a better baseline to establish where methodological progress can still occur.
Information theoretic analysis of edge detection in visual communication

NASA Astrophysics Data System (ADS)

Jiang, Bo; Rahman, Zia-ur

2010-08-01

Generally, the designs of digital image processing algorithms and image gathering devices remain separate. Consequently, the performance of digital image processing algorithms is evaluated without taking into account the artifacts introduced into the process by the image gathering process. However, experiments show that the image gathering process profoundly impacts the performance of digital image processing and the quality of the resulting images. Huck et al. proposed one definitive theoretic analysis of visual communication channels, where the different parts, such as image gathering, processing, and display, are assessed in an integrated manner using Shannon's information theory. In this paper, we perform an end-to-end information theory based system analysis to assess edge detection methods. We evaluate the performance of the different algorithms as a function of the characteristics of the scene, and the parameters, such as sampling, additive noise etc., that define the image gathering system. The edge detection algorithm is regarded to have high performance only if the information rate from the scene to the edge approaches the maximum possible. This goal can be achieved only by jointly optimizing all processes. People generally use subjective judgment to compare different edge detection methods. There is not a common tool that can be used to evaluate the performance of the different algorithms, and to give people a guide for selecting the best algorithm for a given system or scene. Our information-theoretic assessment becomes this new tool, which allows us to compare the different edge detection operators in a common environment.
Performance assessment of a closed-loop system for diabetes management.

PubMed

Martinez-Millana, A; Fico, G; Fernández-Llatas, C; Traver, V

2015-12-01

Telemedicine systems can play an important role in the management of diabetes, a chronic condition that is increasing worldwide. Evaluations on the consistency of information across these systems and on their performance in a real situation are still missing. This paper presents a remote monitoring system for diabetes management based on physiological sensors, mobile technologies and patient/doctor applications over a service-oriented architecture that has been evaluated in an international trial (83,905 operation records). The proposed system integrates three types of running environments and data engines in a single service-oriented architecture. This feature is used to assess key performance indicators comparing them with other type of architectures. Data sustainability across the applications has been evaluated showing better outcomes for full integrated sensors. At the same time, runtime performance of clients has been assessed spotting no differences regarding the operative environment.
An exploration of the ecological validity of the Virtual Action Planning-Supermarket (VAP-S) with people with schizophrenia.

PubMed

Aubin, Ginette; Béliveau, Marie-France; Klinger, Evelyne

2018-07-01

People with schizophrenia often have functional limitations that affect their daily activities due to executive function deficits. One way to assess these deficits is through the use of virtual reality programmes that reproduce real-life instrumental activities of daily living (IADLs). One such programme is the Virtual Action Planning-Supermarket (VAP-S). This exploratory study aimed to examine the ecological validity of this programme, specifically, how task performance in both virtual and natural environments compares. Case studies were used and involved five participants with schizophrenia, who were familiar with grocery shopping. They were assessed during both the VAP-S shopping task and a real-life grocery shopping task using an observational assessment tool, the Perceive, Recall, Plan and Perform (PRPP) System of Task Analysis. The results show that when difficulties were present in the virtual task, difficulties were also observed in the real-life task. For some participants, greater difficulties were observed in the virtual task. These difficulties could be explained by the presence of perceptual deficits and problems remembering the required sequenced actions in the virtual task. In conclusion, performance on the VAP-S by these five participants was generally comparable to the performance in a natural environment.

A Qualitative Analysis of Narrative Preclerkship Assessment Data to Evaluate Teamwork Skills.

PubMed

Dolan, Brigid M; O'Brien, Celia Laird; Cameron, Kenzie A; Green, Marianne M

2018-04-16

Construct: Students entering the health professions require competency in teamwork. Although many teamwork curricula and assessments exist, studies have not demonstrated robust longitudinal assessment of preclerkship students' teamwork skills and attitudes. Assessment portfolios may serve to fill this gap, but it is unknown how narrative comments within portfolios describe student teamwork behaviors. We performed a qualitative analysis of narrative data in 15 assessment portfolios. Student portfolios were randomly selected from 3 groups stratified by quantitative ratings of teamwork performance gathered from small-group and clinical preceptor assessment forms. Narrative data included peer and faculty feedback from these same forms. Data were coded for teamwork-related behaviors using a constant comparative approach combined with an identification of the valence of the coded statements as either "positive observation" or "suggestion for improvement." Eight codes related to teamwork emerged: attitude and demeanor, information facilitation, leadership, preparation and dependability, professionalism, team orientation, values team member contributions, and nonspecific teamwork comments. The frequency of codes and valence varied across the 3 performance groups, with students in the low-performing group receiving more suggestions for improvement across all teamwork codes. Narrative data from assessment portfolios included specific descriptions of teamwork behavior, with important contributions provided by both faculty and peers. A variety of teamwork domains were represented. Such feedback as collected in an assessment portfolio can be used for longitudinal assessment of preclerkship student teamwork skills and attitudes.
Printed Wiring Board Cleaner Technologies Substitutes Assessment: Making Holes Conductive

EPA Pesticide Factsheets

This document presents comparative risk, competitiveness, and resource requirements on technologies for performing the “making holes conductive” function during printed wiring board manufacturing.
How do gender and anxiety affect students' self-assessment and actual performance on a high-stakes clinical skills examination?

PubMed

Colbert-Getz, Jorie M; Fleishman, Carol; Jung, Julianna; Shilkofski, Nicole

2013-01-01

Research suggests that medical students are not accurate in self-assessment, but it is not clear whether students over- or underestimate their skills or how certain characteristics correlate with accuracy in self-assessment. The goal of this study was to determine the effect of gender and anxiety on accuracy of students' self-assessment and on actual performance in the context of a high-stakes assessment. Prior to their fourth year of medical school, two classes of medical students at Johns Hopkins University School of Medicine completed a required clinical skills exam in fall 2010 and 2011, respectively. Two hundred two students rated their anxiety in anticipation of the exam and predicted their overall scores in the history taking and physical examination performance domains. A self-assessment deviation score was calculated by subtracting each student's predicted score from his or her score as rated by standardized patients. When students self-assessed their data gathering performance, there was a weak negative correlation between their predicted scores and their actual scores on the examination. Additionally, there was an interaction effect of anxiety and gender on both self-assessment deviation scores and actual performance. Specifically, females with high anxiety were more accurate in self-assessment and achieved higher actual scores compared with males with high anxiety. No differences by gender emerged for students with moderate or low anxiety. Educators should take into account not only gender but also the role of emotion, in this case anxiety, when planning interventions to help improve accuracy of students' self-assessment.
Measurement properties of the English and Chinese versions of the Functional Assessment of Cancer Therapy-Breast (FACT-B) in Asian breast cancer patients.

PubMed

Ng, Raymond; Lee, Chun Fan; Wong, Nan Soon; Luo, Nan; Yap, Yoon Sim; Lo, Soo Kien; Chia, Whay Kuang; Yee, Alethea; Krishna, Lalit; Goh, Cynthia; Cheung, Yin Bun

2012-01-01

The objective of the study was to examine the measurement properties of and comparability between the English and Chinese versions of the Functional Assessment of Cancer Therapy-Breast (FACT-B) in breast cancer patients in Singapore. This is an observational study of 271 Singaporean breast cancer patients. The known-group validity of FACT-B total score and Trial Outcome Index (TOI) were assessed in relation to performance status, evidence of disease, and treatment status cross-sectionally; responsiveness to change was assessed in relation to change in performance status longitudinally. Internal consistency and test-retest reliability were evaluated by the Cronbach's alpha and intraclass correlation coefficient (ICC), respectively. Multiple regression analyses were performed to compare the scores on the two language versions, adjusting for covariates. The FACT-B total score and TOI demonstrated known-group validity in differentiating patients with different clinical status. They showed high internal consistency and test-retest reliability, with Cronbach's alpha ranging from 0.87 to 0.91 and ICC ranging from 0.82 to 0.89. The English version was responsive to the change in performance status. The Chinese version was shown to be responsive to decline in performance status but the sample size of Chinese-speaking patients who improved in performance status was too small (N = 6) for conclusive analysis about responsiveness to improvement. Two items concerning sexuality had a high item non-response rate (50.2 and 14.4%). No practically significant difference was found in the total score and TOI between the two language versions despite minor differences in two of the 37 items. The English and Chinese versions of the FACT-B are valid, responsive, and reliable instruments in assessing health-related quality of life in breast cancer patients in Singapore. Data collected from the English and Chinese versions can be pooled and either version could be used for bilingual patients.
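For readers unfamiliar with the internal-consistency statistic reported in this record, the following is a minimal sketch of Cronbach's alpha using the standard formula alpha = k/(k-1) × (1 − sum of item variances / variance of total scores). The function name and the small response matrix are hypothetical and are not taken from the FACT-B data.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondents, each a list of k item scores."""
    k = len(scores[0])
    item_columns = list(zip(*scores))                     # one tuple per item
    item_variance_sum = sum(variance(col) for col in item_columns)
    total_variance = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - item_variance_sum / total_variance)


# Hypothetical responses: 5 respondents x 4 items on a 0-4 scale.
responses = [
    [4, 3, 4, 4],
    [2, 2, 3, 2],
    [1, 0, 1, 1],
    [3, 3, 2, 3],
    [0, 1, 0, 1],
]
alpha = cronbach_alpha(responses)
print(f"Cronbach's alpha = {alpha:.2f}")
```

Values close to 1 indicate that the items vary together across respondents, which is what the 0.87 to 0.91 range in the abstract reflects.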
Using screen-based simulation to improve performance during pediatric resuscitation.

PubMed

Biese, Kevin J; Moro-Sutherland, Donna; Furberg, Robert D; Downing, Brian; Glickman, Larry; Murphy, Alison; Jackson, Cheryl L; Snyder, Graham; Hobgood, Cherri

2009-12-01

To assess the ability of a screen-based simulation-training program to improve emergency medicine and pediatric resident performance in critical pediatric resuscitation knowledge, confidence, and skills. A pre-post, interventional design was used. Three measures of performance were created and assessed before and after intervention: a written pre-course knowledge examination, a self-efficacy confidence score, and a skills-based high-fidelity simulation code scenario. For the high-fidelity skills assessment, independent physician raters recorded and reviewed subject performance. The intervention consisted of eight screen-based pediatric resuscitation scenarios that subjects had 4 weeks to complete. Upon completion of the scenarios, all three measures were repeated. For the confidence assessment, summary pre- and post-test confidence scores were compared using a t-test, and for the skills assessment, pre-scores were compared with post-test measures for each individual using McNemar's chi-square test for paired samples. Twenty-six of 35 (71.3%) enrolled subjects completed the institutional review board-approved study. Increases were observed in written test scores, confidence, and some critical interventions in high-fidelity simulation. The mean improvement in cumulative confidence scores for all residents was 10.1 (SD +/-4.9; range 0-19; p < 0.001), with no resident feeling less confident after the intervention. Although overall performance in simulated codes did not change significantly, with average scores of 6.65 (+/-1.76) to 7.04 (+/-1.37) out of 9 possible points (p = 0.58), improvement was seen in the administering of appropriate amounts of IV fluids (59-89%, p = 0.03). In this study, improvements in resident knowledge, confidence, and performance of certain skills in simulated pediatric cardiac arrest scenarios suggest that screen-based simulations may be an effective way to enhance resuscitation skills of pediatric providers. These results should be confirmed using a randomized design with an appropriate control group. (c) 2009 by the Society for Academic Emergency Medicine.
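The paired pre/post comparison named in this record can be sketched as follows. This is a minimal McNemar chi-square test without continuity correction, assuming the only inputs are the two discordant-pair counts; the function name and the counts in the example are hypothetical and do not come from the study.

```python
import math


def mcnemar_chi_square(b, c):
    """McNemar's test from discordant pair counts.

    b: subjects who failed before and passed after
    c: subjects who passed before and failed after
    Returns the chi-square statistic (1 degree of freedom) and its p-value.
    """
    if b + c == 0:
        raise ValueError("no discordant pairs; the test is undefined")
    statistic = (b - c) ** 2 / (b + c)
    # For 1 degree of freedom, P(chi2 > x) = erfc(sqrt(x / 2)).
    p_value = math.erfc(math.sqrt(statistic / 2))
    return statistic, p_value


# Hypothetical counts: 10 subjects improved on a skill, 2 got worse.
statistic, p_value = mcnemar_chi_square(b=10, c=2)
print(f"chi-square = {statistic:.2f}, p = {p_value:.3f}")
```

Only subjects whose outcome changed between the two assessments contribute to the statistic; with small discordant counts an exact binomial version of the test is often preferred.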
Employee Performance in the Context of the Problems of Measurement and Evaluation in Practice

NASA Astrophysics Data System (ADS)

Szabó, Peter; Mĺkva, Miroslava; Vaňová, Jaromíra; Marková, Petra

2017-09-01

Employee performance is a condition and an assumption for the performance and success of a company on the market. In order to ensure competitive ability, the quality of human resources, their management, and related measurement and performance assessment are at the forefront of company interest. Employee assessment affects the performance, development and motivation of people and also provides the necessary information about the employees. It allows the organization to monitor employee performance and compare their work with other collaborators. Many companies have difficulty setting up an evaluation system that incorporates elements of responsibility and objectivity. The result of conceptual work in this area is the ultimate use of tools whose deployment, if possible, motivates employees to perform better. The aim of the paper is to refer to problems that arise in companies in evaluating the performance of employees.
Effect of vinpocetine (cognitol™) on cognitive performances of a Nigerian population.

PubMed

Ogunrin, Ao

2014-07-01

Chronic medical disorders are often complicated by cognitive impairments, making medical intervention that can alleviate cognitive disturbances desirable. Vinpocetine enhances cerebral utilization of oxygen and glucose and consequently improves cerebral functions including memory. This study assessed the efficacy of vinpocetine (Cognitol™) in improving memory and concentration in cognitively impaired patients. A prospective analytical study of 56 cognitively impaired patients compared with 56 controls matched for age, sex and level of education. Cognitive performance was assessed with the Short Blessed Test, which was pilot-tested. Baseline cognitive performances of the patients and controls were obtained and thereafter cognitive performances of the patients were assessed at 6 and 12 weeks after administration of vinpocetine at a dose of 5 mg twice a day. Comparative analysis of their performances at baseline was done using the Student t-test, while the improvement in patients' performances and effect of disease variables on cognitive performances were analyzed with one-way analysis of variance and likelihood ratio analysis respectively. The mean (standard deviation) [SD] ages of the cognitively impaired patients (56/112) and controls (56/112) were 49.5 (18.9) and 53.8 (15.8) years respectively (P = 0.19; 95% confidence interval [CI]: 2.2-10.8). The pilot study yielded an optimal cut-off error score of 6 with a sensitivity of 71.4%, specificity of 96.4% and accuracy of 83.9%. Patients performed significantly worse than the controls (P < 0.001; 95% CI 6.7-11.4). There were significant improvements in memory and concentration with vinpocetine therapy (P < 0.05). The clinical variables of the patients had no effect on the trend of cognitive performances. Vinpocetine was effective in improving memory and concentration of patients with epilepsy and dementia although the efficacy was minimal in demented patients.
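The sensitivity, specificity and accuracy figures quoted for the cut-off score follow from a 2×2 classification table. The sketch below shows the arithmetic; the abstract does not report the underlying counts, so the counts used here are hypothetical and were chosen only because, with two equal groups of 56, they reproduce the reported percentages.

```python
def diagnostic_metrics(tp, fn, tn, fp):
    """Sensitivity, specificity and accuracy from a 2x2 classification table."""
    sensitivity = tp / (tp + fn)            # impaired subjects correctly flagged
    specificity = tn / (tn + fp)            # unimpaired subjects correctly cleared
    accuracy = (tp + tn) / (tp + fn + tn + fp)
    return sensitivity, specificity, accuracy


# Hypothetical counts (not reported in the abstract): 40 of 56 impaired
# subjects flagged by the cut-off, 54 of 56 controls correctly cleared.
sensitivity, specificity, accuracy = diagnostic_metrics(tp=40, fn=16, tn=54, fp=2)
print(f"sensitivity = {sensitivity:.1%}")
print(f"specificity = {specificity:.1%}")
print(f"accuracy    = {accuracy:.1%}")
```

With equal group sizes the accuracy is simply the mean of sensitivity and specificity, which is consistent with the three values reported.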
Synthesized view comparison method for no-reference 3D image quality assessment

NASA Astrophysics Data System (ADS)

Luo, Fangzhou; Lin, Chaoyi; Gu, Xiaodong; Ma, Xiaojun

2018-04-01

We develop a no-reference image quality assessment metric to evaluate the quality of synthesized view rendered from the Multi-view Video plus Depth (MVD) format. Our metric is named Synthesized View Comparison (SVC), which is designed for real-time quality monitoring at the receiver side in a 3D-TV system. The metric utilizes the virtual views in the middle which are warped from left and right views by Depth-image-based rendering algorithm (DIBR), and compares the difference between the virtual views rendered from different cameras by Structural SIMilarity (SSIM), a popular 2D full-reference image quality assessment metric. The experimental results indicate that our no-reference quality assessment metric for the synthesized images has competitive prediction performance compared with some classic full-reference image quality assessment metrics.
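To make the comparison step concrete, here is a simplified sketch of scoring two virtual views of the same viewpoint against each other with SSIM. It assumes the two views have already been rendered (for example, one warped from the left camera and one from the right) and flattened to lists of grayscale values, and it computes a single global SSIM value, whereas standard SSIM averages the index over local windows. The function name and pixel values are hypothetical; this is not the authors' SVC implementation.

```python
def global_ssim(x, y, dynamic_range=255.0):
    """Single-window SSIM between two equal-length lists of pixel intensities."""
    n = len(x)
    if n == 0 or n != len(y):
        raise ValueError("inputs must be non-empty and of equal length")
    mu_x = sum(x) / n
    mu_y = sum(y) / n
    var_x = sum((a - mu_x) ** 2 for a in x) / n
    var_y = sum((b - mu_y) ** 2 for b in y) / n
    cov_xy = sum((a - mu_x) * (b - mu_y) for a, b in zip(x, y)) / n
    # Stabilizing constants with the commonly used K1 = 0.01 and K2 = 0.03.
    c1 = (0.01 * dynamic_range) ** 2
    c2 = (0.03 * dynamic_range) ** 2
    numerator = (2 * mu_x * mu_y + c1) * (2 * cov_xy + c2)
    denominator = (mu_x ** 2 + mu_y ** 2 + c1) * (var_x + var_y + c2)
    return numerator / denominator


# Hypothetical pixel rows from the two virtual views.
view_from_left = [52, 55, 61, 59, 79, 61, 76, 61]
view_from_right = [52, 56, 60, 59, 80, 60, 75, 62]
score = global_ssim(view_from_left, view_from_right)
print(f"SSIM between virtual views = {score:.4f}")
```

A score near 1 means the two renderings agree closely; larger disagreement between them would lower the score, which is the signal a no-reference metric of this kind can exploit.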
Performance assessment and microbial diversity of two pilot scale multi-stage sub-surface flow constructed wetland systems.

PubMed

Babatunde, A O; Miranda-CasoLuengo, Raul; Imtiaz, Mehreen; Zhao, Y Q; Meijer, Wim G

2016-08-01

This study assessed the performance and diversity of microbial communities in multi-stage sub-surface flow constructed wetland systems (CWs). Our aim was to assess the impact of configuration on treatment performance and microbial diversity in the systems. Results indicate that at loading rates up to 100 g BOD5/(m²·day), similar treatment performances can be achieved using either a 3 or 4 stage configuration. In the case of phosphorus (P), the impact of configuration was less obvious and a minimum of 80% P removal can be expected for loadings up to 10 g P/(m²·day) based on the performance results obtained within the first 16 months of operation. Microbial analysis showed an increased bacterial diversity in stage four compared to the first stage. These results indicate that the design and configuration of multi-stage constructed wetland systems may have an impact on the treatment performance and the composition of the microbial community in the systems, and such knowledge can be used to improve their design and performance. Copyright © 2016. Published by Elsevier B.V.
A Team, Case-based Examination and Its Impact on Student Performance in a Patient Safety and Informatics Course

PubMed Central

Etheridge, Kierstan; DeLellis, Teresa

2017-01-01

Objective. To describe the redesigned assessment plan for a patient safety and informatics course and assess student pharmacist performance and perceptions. Methods. The final examination of a patient safety course was redesigned from traditional multiple choice and short answer to team-based, open-ended, and case-based. Faculty for each class session developed higher level activities, focused on developing key skills or attitudes deemed essential for practice, for a progressive patient case consisting of nine activities. Student performance and perceptions were analyzed with pre- and post-surveys using 5-point scales. Results. Mean performance on the examination was 93.6%; median scores for each assessed course outcome ranged from 90% to 100%. Eighty-five percent of students completed both surveys. Confidence performing skills and demonstrating attitudes improved for each item on post-survey compared with pre-survey. Eighty-one percent of students indicated the experience of taking the examination was beneficial for their professional development. Conclusion. A team, case-based examination was associated with high student performance and improved self-confidence in performing medication safety-related skills. PMID:28970618
Implications of a Comparative Study for Mathematics Education in the English Education System

ERIC Educational Resources Information Center

Delice, Ali; Roper, Tom

2006-01-01

This paper reports upon particular aspects of a study carried out by Delice in 2003, the main aim of which was to compare the performance of students in the 16-19 age group from Turkey and England on trigonometry of "A-level standard" and then to compare the curriculum and assessment provision in each country to seek possible…
Evaluation of machine learning algorithms for improved risk assessment for Down's syndrome.

PubMed

Koivu, Aki; Korpimäki, Teemu; Kivelä, Petri; Pahikkala, Tapio; Sairanen, Mikko

2018-05-04

Prenatal screening generates a great amount of data that is used for predicting risk of various disorders. Prenatal risk assessment is based on multiple clinical variables and overall performance is defined by how well the risk algorithm is optimized for the population in question. This article evaluates machine learning algorithms to improve performance of first trimester screening of Down syndrome. Machine learning algorithms pose an adaptive alternative to develop better risk assessment models using the existing clinical variables. Two real-world data sets were used to experiment with multiple classification algorithms. Implemented models were tested with a third, real-world, data set and performance was compared to a predicate method, a commercial risk assessment software. Best performing deep neural network model gave an area under the curve of 0.96 and detection rate of 78% with 1% false positive rate with the test data. Support vector machine model gave area under the curve of 0.95 and detection rate of 61% with 1% false positive rate with the same test data. When compared with the predicate method, the best support vector machine model was slightly inferior, but an optimized deep neural network model was able to give higher detection rates with same false positive rate or similar detection rate but with markedly lower false positive rate. This finding could further improve the first trimester screening for Down syndrome, by using existing clinical variables and a large training data derived from a specific population. Copyright © 2018 Elsevier Ltd. All rights reserved.
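The abstract above reports detection rates at a fixed 1% false positive rate. As a minimal sketch of how such a figure can be read off a set of risk scores, the hypothetical function below scans candidate thresholds and returns the best detection rate whose false positive rate stays within the limit. The function name, the score convention (higher means higher risk), and the toy data are assumptions for illustration and are not taken from the study.

```python
def detection_rate_at_fpr(scores, labels, max_fpr=0.01):
    """Highest detection rate (sensitivity) achievable while keeping the
    false positive rate at or below max_fpr.

    scores: predicted risk, higher means higher risk (assumption).
    labels: 1 for affected, 0 for unaffected pregnancies.
    """
    n_pos = sum(labels)
    n_neg = len(labels) - n_pos
    if n_pos == 0 or n_neg == 0:
        raise ValueError("both classes must be present")
    best = 0.0
    # Try every distinct score as a "screen positive if score >= threshold" cut-off.
    for threshold in sorted(set(scores), reverse=True):
        tp = sum(1 for s, y in zip(scores, labels) if s >= threshold and y == 1)
        fp = sum(1 for s, y in zip(scores, labels) if s >= threshold and y == 0)
        if fp / n_neg <= max_fpr:
            best = max(best, tp / n_pos)
    return best


# Toy data, purely illustrative.
toy_scores = [0.9, 0.8, 0.7, 0.6, 0.4, 0.3, 0.2, 0.1]
toy_labels = [1, 1, 0, 1, 0, 0, 0, 0]
strict = detection_rate_at_fpr(toy_scores, toy_labels, max_fpr=0.0)
relaxed = detection_rate_at_fpr(toy_scores, toy_labels, max_fpr=0.2)
```

Comparing two models at the same `max_fpr` mirrors the abstract's comparison of detection rates at an identical false positive rate.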
Life cycle assessment of a packaging waste recycling system in Portugal

DOE Office of Scientific and Technical Information (OSTI.GOV)

Ferreira, S.; Cabral, M.; Cruz, N.F. da, E-mail: nunocruz@tecnico.ulisboa.pt

Highlights: • We modeled a real packaging waste recycling system. • The analysis was performed using the life cycle assessment methodology. • The 2010 situation was compared with scenarios where the materials were not recycled. • The “Baseline” scenario seems to be more beneficial to the environment. Abstract: Life Cycle Assessment (LCA) has been used to assess the environmental impacts associated with an activity or product life cycle. It has also been applied to assess the environmental performance related to waste management activities. This study analyses the packaging waste management system of a local public authority in Portugal. The operations of selective and refuse collection, sorting, recycling, landfilling and incineration of packaging waste were considered. The packaging waste management system in operation in 2010, which we called “Baseline” scenario, was compared with two hypothetical scenarios where all the packaging waste that was selectively collected in 2010 would undergo the refuse collection system and would be sent directly to incineration (called “Incineration” scenario) or to landfill (“Landfill” scenario). Overall, the results show that the “Baseline” scenario is more environmentally sound than the hypothetical scenarios.
Semiautomated external defibrillators for in-hospital early defibrillation: a comparative study.

PubMed

Nocchi, Federico; Derrico, Pietro; Masucci, Gerardina; Capussotto, Carlo; Cecchetti, Corrado; Ritrovato, Matteo

2014-01-01

Semiautomated external defibrillators (AEDs) should be considered as a means to facilitate in-hospital early defibrillation (IHED) in areas where advanced life support rescuers are not readily available. In this study, we aimed to develop a checklist and a measurement protocol to evaluate and compare AEDs by assessing factors that may affect IHED. A clinical and technical comparison of six AEDs was performed. Technical specifications were analyzed, while an emergency team evaluated ergonomics and appropriateness for IHED at Bambino Gesù Children's Hospital. A measurement protocol was implemented, which aimed to assess the ability of defibrillators to recognize shockable and nonshockable rhythms, accuracy of delivered energy, and charging time. Designs of AEDs differed in several features which influence their appropriateness for IHED. Some units showed poor ergonomics and instructions/feedback for cardiopulmonary resuscitation. Differences between defibrillators in recognizing shockable and nonshockable rhythms emerged for polymorphic ventricular tachycardia waveforms and when the frequency and amplitude of input signals varied. Tests for accuracy revealed poor performances at low and high impedance levels for most AEDs. Notably, differences greater than 20 seconds were found in the time from power-on to "ready for discharge." The approach we used to assess AEDs allowed us to evaluate their appropriateness with respect to the organizational context, to measure their parameters, and to compare models. Results showed that ergonomics and/or performances (timing and accuracy) could be improved in each device.
Evaluating Curriculum-Based Measurement from a Behavioral Assessment Perspective

ERIC Educational Resources Information Center

Ardoin, Scott P.; Roof, Claire M.; Klubnick, Cynthia; Carfolite, Jessica

2008-01-01

Curriculum-based measurement Reading (CBM-R) is an assessment procedure used to evaluate students' relative performance compared to peers and to evaluate their growth in reading. Within the response to intervention (RtI) model, CBM-R data are plotted in time series fashion as a means of modeling individual students' response to varying levels of…
Faculty Mentors', Graduate Students', and Performance-Based Assessments of Students' Research Skill Development

ERIC Educational Resources Information Center

Feldon, David F.; Maher, Michelle A.; Hurst, Melissa; Timmerman, Briana

2015-01-01

Faculty mentorship is thought to be a linchpin of graduate education in STEM disciplines. This mixed-method study investigates agreement between student mentees' and their faculty mentors' perceptions of the students' developing research knowledge and skills in STEM. We also compare both assessments against independent ratings of the students'…
An Evaluation of the Clinical Performance of Newly Qualified Nurses: A Competency Based Assessment.

ERIC Educational Resources Information Center

O'Connor, S. E.; Pearce, J.; Smith, R. L.; Voegeli, D.; Walton, P.

2001-01-01

Senior nurses' (n=139) expectations of 36 beginning nurses were compared with the beginners' competence ratings by their clinical preceptors. Senior nurses' expectations were lower than the actual competence demonstrated by the graduates, suggesting that assessment instruments should not be derived solely from supervisor expectations. (SK)
Towards a Model of School-Based Curriculum Development and Assessment Using the SOLO Taxonomy.

ERIC Educational Resources Information Center

Biggs, John

1989-01-01

One factor preventing the wider acceptance of school-based curriculum development and assessment is the problem of comparing performances of different students, in different schools. The SOLO taxonomy is used to describe the complexity of learning outcomes in a language that is generally applicable across the curriculum. (Author/MLW)
The Impact of a Flexible Assessment System on Students' Motivation, Performance and Attitude

ERIC Educational Resources Information Center

Pacharn, Parunchana; Bay, Darlene; Felton, Sandra

2013-01-01

We examine a flexible assessment system that allows students to determine the weights allocated to each course component and to re-allocate the weights in response to achieved scores. The flexibility is intended to encourage students' participation in the learning process, thereby promoting self-regulated learning skills. We compare this…
The IGAP and the ITBS: A Comparative Study.

ERIC Educational Resources Information Center

Perlman, Carole L.; And Others

This study was designed to examine the extent to which Illinois Goal Assessment Program (IGAP) constructing meaning scores correlate with Iowa Tests of Basic Skills (ITBS) reading scores and with performance on ITBS items dealing with literal meaning, inferences, and generalizations. In addition, this study assessed the ability of the IGAP reading…

The Use of Animated Videos to Illustrate Oral Solid Dosage Form Manufacturing in a Pharmaceutics Course.

PubMed

Yellepeddi, Venkata Kashyap; Roberson, Charles

2016-10-25

Objective. To evaluate the impact of animated videos of oral solid dosage form manufacturing as visual instructional aids on pharmacy students' perception and learning. Design. Data were obtained using a validated, paper-based survey instrument designed to evaluate the effectiveness, appeal, and efficiency of the animated videos in a pharmaceutics course offered in spring 2014 and 2015. Basic demographic data were also collected and analyzed. Assessment data at the end of pharmaceutics course was collected for 2013 and compared with assessment data from 2014, and 2015. Assessment. Seventy-six percent of the respondents supported the idea of incorporating animated videos as instructional aids for teaching pharmaceutics. Students' performance on the formative assessment in 2014 and 2015 improved significantly compared to the performance of students in 2013 whose lectures did not include animated videos as instructional aids. Conclusions. Implementing animated videos of oral solid dosage form manufacturing as instructional aids resulted in improved student learning and favorable student perceptions about the instructional approach. Therefore, use of animated videos can be incorporated in pharmaceutics teaching to enhance visual learning.
The Use of Animated Videos to Illustrate Oral Solid Dosage Form Manufacturing in a Pharmaceutics Course

PubMed Central

Roberson, Charles

2016-01-01

Objective. To evaluate the impact of animated videos of oral solid dosage form manufacturing as visual instructional aids on pharmacy students’ perception and learning. Design. Data were obtained using a validated, paper-based survey instrument designed to evaluate the effectiveness, appeal, and efficiency of the animated videos in a pharmaceutics course offered in spring 2014 and 2015. Basic demographic data were also collected and analyzed. Assessment data at the end of pharmaceutics course was collected for 2013 and compared with assessment data from 2014, and 2015. Assessment. Seventy-six percent of the respondents supported the idea of incorporating animated videos as instructional aids for teaching pharmaceutics. Students’ performance on the formative assessment in 2014 and 2015 improved significantly compared to the performance of students in 2013 whose lectures did not include animated videos as instructional aids. Conclusions. Implementing animated videos of oral solid dosage form manufacturing as instructional aids resulted in improved student learning and favorable student perceptions about the instructional approach. Therefore, use of animated videos can be incorporated in pharmaceutics teaching to enhance visual learning. PMID:27899837
Lenke and King classification systems for adolescent idiopathic scoliosis: interobserver agreement and postoperative results

PubMed Central

Hosseinpour-Feizi, Hojjat; Soleimanpour, Jafar; Sales, Jafar Ganjpour; Arzroumchilar, Ali

2011-01-01

Purpose: The aim of this study was to investigate the interobserver agreement of the Lenke and King classifications for adolescent idiopathic scoliosis, and to compare the results of surgery performed based on classification of the scoliosis according to each of these classification systems. Methods: The study was conducted in Shohada Hospital in Tabriz, Iran, between 2009 and 2010. First, a reliability assessment was undertaken to assess interobserver agreement of the Lenke and King classifications for adolescent idiopathic scoliosis. Second, postoperative efficacy and safety of surgery performed based on the Lenke and King classifications were compared. Kappa coefficients of agreement were calculated to assess the agreement. Outcomes were compared using bivariate tests and repeated measures analysis of variance. Results: A low to moderate interobserver agreement was observed for the King classification; the Lenke classification yielded mostly high agreement coefficients. The outcome of surgery was not found to be substantially different between the two systems. Conclusion: Based on the results, the Lenke classification method seems advantageous. This takes into consideration the Lenke classification’s priority in providing details of curvatures in different anatomical surfaces to explain precise intensity of scoliosis, that it has higher interobserver agreement scores, and also that it leads to noninferior postoperative results compared with the King classification method. PMID:22267934
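The abstract above relies on kappa coefficients to quantify interobserver agreement. As a minimal sketch of that calculation, the function below computes Cohen's kappa for two observers who each assign one category per case. The abstract does not say which kappa variant or how many observers were used, so the two-rater form, the function name, and the placeholder category labels are all assumptions for illustration.

```python
from collections import Counter


def cohen_kappa(rater_a, rater_b):
    """Cohen's kappa for two raters assigning one category per case.

    kappa = (observed agreement - chance agreement) / (1 - chance agreement)
    """
    n = len(rater_a)
    if n == 0 or n != len(rater_b):
        raise ValueError("both raters must classify the same non-empty set of cases")
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    counts_a = Counter(rater_a)
    counts_b = Counter(rater_b)
    # Chance agreement: product of each rater's marginal proportions per category.
    expected = sum(counts_a[c] * counts_b[c] for c in counts_a) / (n * n)
    if expected == 1.0:
        raise ValueError("kappa is undefined when chance agreement is 1")
    return (observed - expected) / (1 - expected)


# Placeholder curve-type labels for four radiographs (hypothetical data).
observer_1 = ["type_1", "type_1", "type_2", "type_2"]
observer_2 = ["type_1", "type_1", "type_2", "type_1"]
kappa = cohen_kappa(observer_1, observer_2)
```

Here the observers agree on three of four cases (0.75) against a chance agreement of 0.5, giving a kappa of 0.5, which is why kappa is a stricter measure than raw percent agreement.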
Lenke and King classification systems for adolescent idiopathic scoliosis: interobserver agreement and postoperative results.

PubMed

Hosseinpour-Feizi, Hojjat; Soleimanpour, Jafar; Sales, Jafar Ganjpour; Arzroumchilar, Ali

2011-01-01

The aim of this study was to investigate the interobserver agreement of the Lenke and King classifications for adolescent idiopathic scoliosis, and to compare the results of surgery performed based on classification of the scoliosis according to each of these classification systems. The study was conducted in Shohada Hospital in Tabriz, Iran, between 2009 and 2010. First, a reliability assessment was undertaken to assess interobserver agreement of the Lenke and King classifications for adolescent idiopathic scoliosis. Second, postoperative efficacy and safety of surgery performed based on the Lenke and King classifications were compared. Kappa coefficients of agreement were calculated to assess the agreement. Outcomes were compared using bivariate tests and repeated measures analysis of variance. A low to moderate interobserver agreement was observed for the King classification; the Lenke classification yielded mostly high agreement coefficients. The outcome of surgery was not found to be substantially different between the two systems. Based on the results, the Lenke classification method seems advantageous. This takes into consideration the Lenke classification's priority in providing details of curvatures in different anatomical surfaces to explain precise intensity of scoliosis, that it has higher interobserver agreement scores, and also that it leads to noninferior postoperative results compared with the King classification method.
Piloting an outcome-based programme evaluation tool in undergraduate medical education.

PubMed

Raupach, Tobias; Schiekirka, Sarah; Münscher, Christian; Beißbarth, Tim; Himmel, Wolfgang; Burckhardt, Gerhard; Pukrop, Tobias

2012-01-01

Different approaches to performance-oriented allocation of resources according to teaching quality are currently being discussed within German medical schools. The implementation of these programmes is impeded by a lack of valid criteria to measure teaching quality. An assessment of teaching quality should include structural and procedural aspects but focus on learning outcome itself. The aim of this study was to implement a novel, outcome-based evaluation tool within the clinical phase of a medical curriculum and address differences between the novel tool and traditional evaluation methods. Student self-assessments before and after completion of a teaching module were used to compute performance gains for specific learning objectives. Mean performance gains in each module were compared to student expectations before the module and data derived from a traditional evaluation tool using overall course ratings at the end of the module. A ranking of the 21 modules according to computed performance gains yielded entirely different results than module rankings based on overall course ratings. There was no significant correlation between performance gain and overall ratings. However, the latter were significantly correlated to student expectations before entering the module as well as structural and procedural parameters (Pearson's r 0.7-0.9). Performance gain computed from comparative self-assessments adds an important new dimension to course evaluation in medical education. In contrast to overall course ratings, the novel tool is less heavily confounded by construct-irrelevant factors. Thus, it appears to be more appropriate than overall course ratings in determining teaching quality and developing algorithms to guide performance-oriented resource allocation in medical education.
The Role of Digital 3D Scanned Models in Dental Students' Self-Assessments in Preclinical Operative Dentistry.

PubMed

Lee, Cliff; Kobayashi, Hiro; Lee, Samuel R; Ohyama, Hiroe

2018-04-01

The aim of this study was to determine how dental student self-assessment and faculty assessment of operative preparations compared for conventional visual assessment versus assessment of scanned digital 3D models. In 2016, all third-year students in the Class of 2018 (N=35) at Harvard School of Dental Medicine performed preclinical exams of Class II amalgam preparations (C2AP) and Class III composite preparations (C3CP) and completed self-assessment forms; in 2017, all third-year students in the Class of 2019 (N=34) performed the same exams. Afterwards, the prepared typodont teeth were digitally scanned. Students self-assessed their preparations digitally, and four faculty members graded the preparations conventionally and digitally. The results showed that, overall, the students assessed their preparations higher than the faculty assessments. The mean student-faculty gaps for C2AP and C3CP in the conventional assessments were 11% and 5%, respectively. The mean digital student-faculty gap for C2AP and C3CP were 8% and 2%, respectively. In the conventional assessments, preclinical performance was negatively correlated with the student-faculty gap (r=-0.47, p<0.001). The correlations were not statistically significant with the digital assessments (p=0.39, p=0.26). Students in the bottom quartile significantly improved their self-assessment accuracy using digital self-assessments over conventional assessments (C2AP 10% vs. 17% and C3CP 3% vs. 10%, respectively). These results suggest that digital assessments offered a significant learning opportunity for students to critically self-assess themselves in operative preclinical dentistry. The lower performing students benefitted the most, improving their assessment ability to the level of the rest of the class.
Assessing the Amazon Cloud Suitability for CLARREO's Computational Needs

NASA Technical Reports Server (NTRS)

Goldin, Daniel; Vakhnin, Andrei A.; Currey, Jon C.

2015-01-01

In this document we compare the performance of Amazon Web Services (AWS), also known as Amazon Cloud, with the CLARREO (Climate Absolute Radiance and Refractivity Observatory) cluster and assess its suitability for computational needs of the CLARREO mission. A benchmark executable to process one month and one year of PARASOL (Polarization and Anisotropy of Reflectances for Atmospheric Sciences coupled with Observations from a Lidar) data was used. With the optimal AWS configuration, adequate data-processing times, comparable to the CLARREO cluster, were found. The assessment of alternatives to the CLARREO cluster continues and several options, such as a NASA-based cluster, are being considered.
Techno-economic assessment of a hybrid solar receiver and combustor

NASA Astrophysics Data System (ADS)

Lim, Jin Han; Nathan, Graham; Dally, Bassam; Chinnici, Alfonso

2016-05-01

A techno-economic analysis is performed to compare two different configurations of hybrid solar thermal systems with fossil fuel backup to provide continuous electricity output. The assessment compares a Hybrid Solar Receiver Combustor (HSRC), in which the functions of a solar cavity receiver and a combustor are integrated into a single device, with a reference conventional solar thermal system using a regular solar cavity receiver with a backup boiler, termed the Solar Gas Hybrid (SGH). The benefits of the integration are assessed by varying the size of the storage capacity and heliostat field while maintaining the same overall thermal input to the power block.
Thermal Ablation of T1c Renal Cell Carcinoma: A Comparative Assessment of Technical Performance, Procedural Outcome, and Safety of Microwave Ablation, Radiofrequency Ablation, and Cryoablation.

PubMed

Zhou, Wenhui; Arellano, Ronald S

2018-04-06

To evaluate perioperative outcomes of thermal ablation with microwave (MW), radiofrequency (RF), and cryoablation for stage T1c renal cell carcinoma (RCC). A retrospective analysis of 384 patients (mean age, 71 y; range, 22-88 y) was performed between October 2006 and October 2016. Mean radius, exophytic/endophytic, nearness to collecting system or sinus, anterior/posterior, and location relative to polar lines; preoperative aspects and dimensions used for anatomic classification; and centrality index scores were 6.3, 7.9, and 2.7, respectively. Assessment of pre- and postablation serum blood urea nitrogen, creatinine, and estimated glomerular filtration rate was performed to assess functional outcomes. Linear regression analyses were performed to compare sedation medication dosages among the three treatment cohorts. Univariable and multivariable logistic regression analyses were performed to compare rates of residual disease and complications among treatment modalities. A total of 437 clinical stage T1N0M0 biopsy-proven RCCs measuring 1.2-6.9 cm were treated with computed tomography (CT)-guided MW ablation (n = 44; 10%), RF ablation (n = 347; 79%), or cryoablation (n = 46; 11%). There were no significant differences in patient demographic or tumor characteristics among cohorts. Complication rates and immediate renal function changes were similar among the three ablation modalities (P = .46 and P = .08, respectively). MW ablation was associated with significantly decreased ablation time (P < .05), procedural time (P < .05), and dosage of sedative medication (P < .05) compared with RF ablation and cryoablation. CT-guided percutaneous MW ablation is comparable to RF ablation or cryoablation for the treatment of stage T1N0M0 RCC with regard to treatment response and is associated with shorter treatment times and less sedation than RF ablation or cryoablation. In addition, the safety profile of CT-guided MW ablation is noninferior to those of RF ablation or cryoablation. Copyright © 2017 SIR. Published by Elsevier Inc. All rights reserved.
Effect of online formative assessment on summative performance in integrated musculoskeletal system module.

PubMed

Mitra, Nilesh Kumar; Barua, Ankur

2015-03-03

The impact of web-based formative assessment practices on performance of undergraduate medical students in summative assessments is not widely studied. This study was conducted among third-year undergraduate medical students of a designated university in Malaysia to compare the effect, on performance in summative assessment, of repeated computer-based formative assessment with automated feedback with that of single paper-based formative assessment with face-to-face feedback. This quasi-randomized trial was conducted among two groups of undergraduate medical students who were selected by stratified random technique from a cohort undertaking the Musculoskeletal module. The control group C (n = 102) was subjected to a paper-based formative MCQ test. The experimental group E (n = 65) was provided three online formative MCQ tests with automated feedback. The summative MCQ test scores for both these groups were collected after the completion of the module. In this study, no significant difference was observed between the mean summative scores of the two groups. However, Band 1 students from group E with higher entry qualification showed higher mean score in the summative assessment. A trivial, but significant and positive correlation (r² = +0.328) was observed between the online formative test scores and summative assessment scores of group E. The proportionate increase of performance in group E was found to be almost double that in group C. The use of computer-based formative tests with automated feedback improved the performance of the students with better academic background in the summative assessment. Computer-based formative tests can be explored as an optional addition to the curriculum of pre-clinical integrated medical program to improve the performance of the students with higher academic ability.
Unravelling the influence of mild traumatic brain injury (MTBI) on cognitive-linguistic processing: a comparative group analysis.

PubMed

Barwood, Caroline H S; Murdoch, Bruce E

2013-06-01

Cognitive-linguistic deficits often accompany traumatic brain injury (TBI) and can negatively impact communicative competency. The linguistic sequelae underpinning mild TBI (MTBI) remain largely unexplored in contemporary literature. The present research aims to provide group evidence pertaining to the influence of MTBI on linguistic and higher-level language processing. Extrapolating from the findings of recent case reports, it is hypothesized that performance of the MTBI patients will be significantly reduced compared to normal controls' performance on the employed high-level linguistic tasks. Sixteen patients with MTBI and 16 age- and education-matched normal control participants were assessed using a comprehensive battery of cognitive-linguistic assessments. The results demonstrated statistically significant differences between MTBI and normal control group performance across a number of higher-level linguistic, general cognitive and general language tasks. MTBI group performance was significantly lower than the normal control group on tasks requiring complex lexical semantic operations and memory demands, including: recall, organization, making inferences, naming and perception/discrimination. These outcomes indicate that post-MTBI, cognitive, high-level language and isolated general language performance (e.g. naming) is significantly reduced in MTBI patients, compared to normal controls. Furthermore, the detailed cognitive-linguistic profile offered provides a necessary direction for the identification of areas of linguistic decline in MTBI and targets for therapeutic intervention of impaired cognitive-linguistic processes to ultimately improve communicative outcomes in MTBI.
Comparison of virtual patient simulation with mannequin-based simulation for improving clinical performances in assessing and managing clinical deterioration: randomized controlled trial.

PubMed

Liaw, Sok Ying; Chan, Sally Wai-Chi; Chen, Fun-Gee; Hooi, Shing Chuan; Siau, Chiang

2014-09-17

Virtual patient simulation has grown substantially in health care education. A virtual patient simulation was developed as a refresher training course to reinforce nursing clinical performance in assessing and managing deteriorating patients. The objective of this study was to describe the development of the virtual patient simulation and evaluate its efficacy, by comparing with a conventional mannequin-based simulation, for improving the nursing students' performances in assessing and managing patients with clinical deterioration. A randomized controlled study was conducted with 57 third-year nursing students who were recruited through email. After a baseline evaluation of all participants' clinical performance in a simulated environment, the experimental group received a 2-hour fully automated virtual patient simulation while the control group received 2-hour facilitator-led mannequin-based simulation training. All participants were then re-tested one day (first posttest) and 2.5 months (second posttest) after the intervention. The participants from the experimental group completed a survey to evaluate their learning experiences with the newly developed virtual patient simulation. Compared to their baseline scores, both experimental and control groups demonstrated significant improvements (P<.001) in first and second post-test scores. While the experimental group had significantly lower (P<.05) second post-test scores compared with the first post-test scores, no significant difference (P=.94) was found between these two scores for the control group. The scores between groups did not differ significantly over time (P=.17). The virtual patient simulation was rated positively. A virtual patient simulation for a refresher training course on assessing and managing clinical deterioration was developed. Although the randomized controlled study did not show that the virtual patient simulation was superior to mannequin-based simulation, both simulations were shown to be effective refresher learning strategies for improving nursing students' clinical performance. Given the greater resource requirements of mannequin-based simulation, the virtual patient simulation provides a more promising alternative learning strategy to mitigate the decay of clinical performance over time.
ATHLI16: the ATHens Lidar Intercomparison campaign

NASA Astrophysics Data System (ADS)

Amodeo, Aldo; D'Amico, Giuseppe; Giunta, Aldo; Papagiannopoulos, Nikolaos; Papayannis, Alex; Argyrouli, Athina; Mylonaki, Maria; Tsaknakis, Georgios; Kokkalis, Panos; Soupiona, Ourania; Tzanis, Chris

2018-04-01

The results of the ATHLI16 (ATHens Lidar Intercomparison) campaign, held in Athens from 26/09 to 07/10 2016 are presented. The campaign was performed within the Lidar Calibration Centre activities (EU H2020 ACTRIS-2 project) to assess the performance of the EOLE lidar system (NTUA, Athens, Greece), operating within EARLINET, by comparing against the EARLINET reference lidar system MUSA (CNR-IMAA, Potenza, Italy). For both lidars only products retrieved by the EARLINET Single Calculus Chain have been compared.
Assessing teamwork performance in obstetrics: A systematic search and review of validated tools.

PubMed

Fransen, Annemarie F; de Boer, Liza; Kienhorst, Dieneke; Truijens, Sophie E; van Runnard Heimel, Pieter J; Oei, S Guid

2017-09-01

Teamwork performance is an essential component for the clinical efficiency of multi-professional teams in obstetric care. As patient safety is related to teamwork performance, it has become an important learning goal in simulation-based education. In order to improve teamwork performance, reliable assessment tools are required. These can be used to provide feedback during training courses, or to compare learning effects between different types of training courses. The aim of the current study is to (1) identify the available assessment tools to evaluate obstetric teamwork performance in a simulated environment, and (2) evaluate their psychometric properties in order to identify the most valuable tool(s) to use. We performed a systematic search in PubMed, MEDLINE, and EMBASE to identify articles describing assessment tools for the evaluation of obstetric teamwork performance in a simulated environment. In order to evaluate the quality of the identified assessment tools the standards and grading rules have been applied as recommended by the Accreditation Council for Graduate Medical Education (ACGME) Committee on Educational Outcomes. The included studies were also assessed according to the Oxford Centre for Evidence Based Medicine (OCEBM) levels of evidence. This search resulted in the inclusion of five articles describing the following six tools: Clinical Teamwork Scale, Human Factors Rating Scale, Global Rating Scale, Assessment of Obstetric Team Performance, Global Assessment of Obstetric Team Performance, and the Teamwork Measurement Tool. Based on the ACGME guidelines we assigned a Class 3, level C of evidence, to all tools. Regarding the OCEBM levels of evidence, a level 3b was assigned to two studies and a level 4 to four studies. The Clinical Teamwork Scale demonstrated the most comprehensive validation, and the Teamwork Measurement Tool demonstrated promising results, however it is recommended to further investigate its reliability. Copyright © 2017. 
Published by Elsevier B.V.
Comparison of mathematic models for assessment of glomerular filtration rate with electron-beam CT in pigs.

PubMed

Daghini, Elena; Juillard, Laurent; Haas, John A; Krier, James D; Romero, Juan C; Lerman, Lilach O

2007-02-01

To prospectively compare in pigs three mathematic models for assessment of glomerular filtration rate (GFR) on electron-beam (EB) computed tomographic (CT) images, with concurrent inulin clearance serving as the reference standard. This study was approved by the institutional animal care and use committee. Inulin clearance was measured in nine pigs (18 kidneys) and compared with single-kidney GFR assessed from renal time-attenuation curves (TACs) obtained with EB CT before and after infusion of the vasodilator acetylcholine. CT-derived GFR was calculated with the original and modified Patlak methods and with previously validated extended gamma variate modeling of first-pass cortical TACs. Statistical analysis was performed to assess correlation between CT methods and inulin clearance for estimation of GFR with least-squares regression analysis and Bland-Altman graphical representation. Comparisons within groups were performed with a paired t test. GFR assessed with the original Patlak method indicated poor correlation with inulin clearance, whereas GFR assessed with the modified Patlak method (P < .001, r = 0.75) and with gamma variate modeling (P < .001, r = 0.79) correlated significantly with inulin clearance and indicated an increase in response to acetylcholine. CT-derived estimates of GFR can be significantly improved by modifications in image analysis methods (eg, use of a cortical region of interest). (c) RSNA, 2007.
The importance of quality control in validating concentrations ...

EPA Pesticide Factsheets

A national-scale survey of 247 contaminants of emerging concern (CECs), including organic and inorganic chemical compounds, and microbial contaminants, was conducted in source and treated drinking water samples from 25 treatment plants across the United States. Multiple methods were used to determine these CECs, including six analytical methods to measure 174 pharmaceuticals, personal care products, and pesticides. A three-component quality assurance/quality control (QA/QC) program was designed for the subset of 174 CECs which allowed us to assess and compare performances of the methods used. The three components included: 1) a common field QA/QC protocol and sample design, 2) individual investigator-developed method-specific QA/QC protocols, and 3) a suite of 46 method comparison analytes that were determined in two or more analytical methods. Overall method performance for the 174 organic chemical CECs was assessed by comparing spiked recoveries in reagent, source, and treated water over a two-year period. In addition to the 247 CECs reported in the larger drinking water study, another 48 pharmaceutical compounds measured did not consistently meet predetermined quality standards. Methodologies that did not seem suitable for these analytes are overviewed. The need to exclude analytes based on method performance demonstrates the importance of additional QA/QC protocols. This paper compares the method performance of six analytical methods used to measure 174 emer
Simulation-trained junior residents perform better than general surgeons on advanced laparoscopic cases.

PubMed

Boza, Camilo; León, Felipe; Buckel, Erwin; Riquelme, Arnoldo; Crovari, Fernando; Martínez, Jorge; Aggarwal, Rajesh; Grantcharov, Teodor; Jarufe, Nicolás; Varas, Julián

2017-01-01

Multiple simulation training programs have demonstrated that effective transfer of skills can be attained and applied in a more complex scenario, but evidence regarding transfer to the operating room is limited. The aim was to assess junior residents trained with simulation performing an advanced laparoscopic procedure in the OR and compare results to those of general surgeons without simulation training and expert laparoscopic surgeons. Experimental study: After a validated 16-session advanced laparoscopy simulation training program, junior trainees were compared to general surgeons (GS) with no simulation training and expert bariatric surgeons (BS) in performing a stapled jejuno-jejunostomy (JJO) in the OR. Global rating scale (GRS) and specific rating scale scores, operative time, and the distance traveled by both hands measured with a tracking device were assessed. In addition, all perioperative and immediate postoperative morbidities were registered. Ten junior trainees, 12 GS and 5 BS experts were assessed performing a JJO in the OR. All trainees completed the entire JJO in the OR without any takeovers by the BS. Six (50%) BS takeovers took place in the GS group. Trainees had significantly better results in all measured outcomes when compared to GS, with a considerably higher GRS median [19.5 (18.8-23.5) vs. 12 (9-13.8), p < 0.001] and lower operative time. One morbidity was registered; a patient in the trainees group was readmitted at postoperative day 10 for mechanical ileus that resolved with medical treatment. This study demonstrated transfer of advanced laparoscopic skills acquired through a simulated training program in novice surgical residents to the OR.
Assessing hemispheric specialization for processing arithmetic skills in adults: A functional transcranial doppler ultrasonography (fTCD) study.

PubMed

Connaughton, Veronica M; Amiruddin, Azhani; Clunies-Ross, Karen L; French, Noel; Fox, Allison M

2017-05-01

A major model of the cerebral circuits that underpin arithmetic calculation is the triple-code model of numerical processing. This model proposes that the lateralization of mathematical operations is organized across three circuits: a left-hemispheric dominant verbal code; a bilateral magnitude representation of numbers and a bilateral Arabic number code. This study simultaneously measured the blood flow of both middle cerebral arteries using functional transcranial Doppler ultrasonography to assess hemispheric specialization during the performance of both language and arithmetic tasks. The propositions of the triple-code model were assessed in a non-clinical adult group by measuring cerebral blood flow during the performance of multiplication and subtraction problems. Participants were 17 adults aged between 18-27 years. We obtained laterality indices for each type of mathematical operation and compared these in participants with left-hemispheric language dominance. It was hypothesized that blood flow would lateralize to the left hemisphere during the performance of multiplication operations, but would not lateralize during the performance of subtraction operations. Hemispheric blood flow was significantly left lateralized during the multiplication task, but was not lateralized during the subtraction task. Compared to high spatial resolution neuroimaging techniques previously used to measure cerebral lateralization, functional transcranial Doppler ultrasonography is a cost-effective measure that provides a superior temporal representation of arithmetic cognition. These results provide support for the triple-code model of arithmetic processing and offer complementary evidence that multiplication operations are processed differently in the adult brain compared to subtraction operations. Copyright © 2017 Elsevier B.V. All rights reserved.
Mathematics deficits in adolescents with bipolar I disorder.

PubMed

Lagace, Diane C; Kutcher, Stanley P; Robertson, Heather A

2003-01-01

This study examined mathematical ability in adolescents with bipolar I disorder, compared to adolescents with major depressive disorder and psychiatrically healthy comparison subjects. Participants (N=119) included adolescents in remission from bipolar disorder (N=44) or major depressive disorder (N=30), as well as comparison subjects (N=45) with no psychiatric history. Participants were assessed with the following measures: the Wide-Range Achievement Test, Revised 2 (WRAT-R2), Peabody Individual Achievement Test, Bay Area Functional Performance Evaluation Task-Oriented Assessment (functional mathematics subtest), Test of Nonverbal Intelligence-2, and a self-report of mathematics performance. WRAT-R2 and Peabody Individual Achievement Test scores for spelling, mathematics, and reading revealed that adolescents with bipolar disorder had significantly lower achievement in mathematics, compared to subjects with major depressive disorder and comparison subjects. Results for the Test of Nonverbal Intelligence-2 were not significantly different between groups. Adolescents with bipolar disorder took significantly longer to complete the Bay Area Functional Performance Evaluation mathematics task. Significantly fewer adolescents with bipolar disorder (9%) reported above-average mathematics performance, compared with the other groups. Adolescents with remitted bipolar disorder have a specific profile of mathematics difficulties that differentiates them from both adolescents with unipolar depression and psychiatrically healthy comparison subjects. These mathematics deficits may not derive simply from more global deficits in nonverbal intelligence or executive functioning, but may be associated with neuroanatomical abnormalities that result in cognitive deficits, including a slowed response time. These deficits suggest the need for specialized assessment of mathematics as part of a comprehensive clinical follow-up treatment plan.
Predicting in-patient falls in a geriatric clinic: a clinical study combining assessment data and simple sensory gait measurements.

PubMed

Marschollek, M; Nemitz, G; Gietzelt, M; Wolf, K H; Meyer Zu Schwabedissen, H; Haux, R

2009-08-01

Falls are among the predominant causes for morbidity and mortality in elderly persons and occur most often in geriatric clinics. Despite several studies that have identified parameters associated with elderly patients' fall risk, prediction models -- e.g., based on geriatric assessment data -- are currently not used on a regular basis. Furthermore, technical aids to objectively assess mobility-associated parameters are currently not used. To assess group differences in clinical as well as common geriatric assessment data and sensory gait measurements between fallers and non-fallers in a geriatric sample, and to derive and compare two prediction models based on assessment data alone (model #1) and added sensory measurement data (model #2). For a sample of n=110 geriatric in-patients (81 women, 29 men) the following fall risk-associated assessments were performed: Timed 'Up & Go' (TUG) test, STRATIFY score and Barthel index. During the TUG test the subjects wore a triaxial accelerometer, and sensory gait parameters were extracted from the data recorded. Group differences between fallers (n=26) and non-fallers (n=84) were compared using Student's t-test. Two classification tree prediction models were computed and compared. Significant differences between the two groups were found for the following parameters: time to complete the TUG test, transfer item (Barthel), recent falls (STRATIFY), pelvic sway while walking and step length. Prediction model #1 (using common assessment data only) showed a sensitivity of 38.5% and a specificity of 97.6%, prediction model #2 (assessment data plus sensory gait parameters) performed with 57.7% and 100%, respectively. Significant differences between fallers and non-fallers among geriatric in-patients can be detected for several assessment subscores as well as parameters recorded by simple accelerometric measurements during a common mobility test. Existing geriatric assessment data may be used for falls prediction on a regular basis. 
Adding sensory data improves the specificity of our test markedly.
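The sensitivity and specificity figures above can be checked with a short worked example. This is a minimal sketch: the group sizes (26 fallers, 84 non-fallers) come from the abstract, but the per-model counts of true and false classifications are not reported there and are back-calculated here as an assumption from the stated percentages.

```python
def sensitivity(tp: int, fn: int) -> float:
    """Share of actual fallers that the model flags as fallers."""
    return tp / (tp + fn)


def specificity(tn: int, fp: int) -> float:
    """Share of actual non-fallers that the model clears."""
    return tn / (tn + fp)


# Group sizes reported in the abstract.
FALLERS = 26
NON_FALLERS = 84

# ASSUMPTION: counts back-calculated from the reported percentages,
# not taken from the study's confusion matrices.
model_1 = {"tp": 10, "fn": FALLERS - 10, "tn": 82, "fp": NON_FALLERS - 82}
model_2 = {"tp": 15, "fn": FALLERS - 15, "tn": 84, "fp": NON_FALLERS - 84}

for name, m in (("model #1", model_1), ("model #2", model_2)):
    sens = 100 * sensitivity(m["tp"], m["fn"])
    spec = 100 * specificity(m["tn"], m["fp"])
    print(f"{name}: sensitivity {sens:.1f}%, specificity {spec:.1f}%")
```

Under these assumed counts, model #2 correctly identifies five more fallers than model #1 and clears the two non-fallers that model #1 misclassified.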

How Strong and Weak Readers Perform on the Developmental Eye Movement Test (DEM): Norms for Latvian School-Aged Children

ERIC Educational Resources Information Center

Serdjukova, Jelena; Ekimane, Lasma; Valeinis, Janis; Skilters, Jurgis; Krumina, Gunta

2017-01-01

The aim of our study was to determine DEM test performance norms for school-aged children in Latvia, assess how DEM test results correlate with children's reading rates, and compare test performance between strong and weak readers. A modified DEM test and a newly developed reading test were administered to 1487 children during a screening survey. Our…
A Comparative Evaluation of Group IV Personnel Assigned to the U.S.S. Catskill; Followup Performance Evaluation.

ERIC Educational Resources Information Center

Van Matre, Nicholas H.; Harrigan, Robert J.

A followup performance evaluation was conducted on a sample of Group 4 (low ability) personnel who had served 14 months aboard the mine countermeasures support ship U.S.S. Catskill (MCS-1). Shipboard assessments were made of the Group 4 sample and the non-Group 4 comparison sample in terms of performance test proficiency, supervisors' ratings, and…
Comparative Performance of Ground vs. Aerially Assessed RGB and Multispectral Indices for Early-Growth Evaluation of Maize Performance under Phosphorus Fertilization

PubMed Central

Gracia-Romero, Adrian; Kefauver, Shawn C.; Vergara-Díaz, Omar; Zaman-Allah, Mainassara A.; Prasanna, Boddupalli M.; Cairns, Jill E.; Araus, José L.

2017-01-01

Low soil fertility is one of the factors most limiting agricultural production, with phosphorus deficiency being among the main factors, particularly in developing countries. To deal with such environmental constraints, remote sensing measurements can be used to rapidly assess crop performance and to phenotype a large number of plots in a rapid and cost-effective way. We evaluated the performance of a set of remote sensing indices derived from Red-Green-Blue (RGB) images and multispectral (visible and infrared) data as phenotypic traits and crop monitoring tools for early assessment of maize performance under phosphorus fertilization. Thus, a set of 26 maize hybrids grown under field conditions in Zimbabwe was assayed under contrasting phosphorus fertilization conditions. Remote sensing measurements were conducted in seedlings at two different levels: at the ground and from an aerial platform. Within a particular phosphorus level, some of the RGB indices strongly correlated with grain yield. In general, RGB indices assessed at both ground and aerial levels correlated in a comparable way with grain yield except for indices a* and u*, which correlated better when assessed at the aerial level than at ground level and Greener Area (GGA) which had the opposite correlation. The Normalized Difference Vegetation Index (NDVI) evaluated at ground level with an active sensor also correlated better with grain yield than the NDVI derived from the multispectral camera mounted in the aerial platform. Other multispectral indices like the Soil Adjusted Vegetation Index (SAVI) performed very similarly to NDVI assessed at the aerial level but overall, they correlated in a weaker manner with grain yield than the best RGB indices. This study clearly illustrates the advantage of RGB-derived indices over the more costly and time-consuming multispectral indices. Moreover, the indices best correlated with GY were in general those best correlated with leaf phosphorous content. 
However, these correlations were clearly weaker than against grain yield and only under low phosphorus conditions. This work reinforces the effectiveness of canopy remote sensing for plant phenotyping and crop management of maize under different phosphorus nutrient conditions and suggests that the RGB indices are the best option. PMID:29230230
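For readers unfamiliar with the two multispectral indices named in this abstract, the sketch below shows their standard textbook definitions. The reflectance values and the soil factor L = 0.5 are illustrative assumptions, not data or settings from the study, and the study's own processing pipeline is not described in the abstract.

```python
def ndvi(nir: float, red: float) -> float:
    """Normalized Difference Vegetation Index: (NIR - Red) / (NIR + Red)."""
    return (nir - red) / (nir + red)


def savi(nir: float, red: float, soil_factor: float = 0.5) -> float:
    """Soil Adjusted Vegetation Index.

    SAVI = (1 + L) * (NIR - Red) / (NIR + Red + L), where L is the soil
    brightness correction factor (0.5 is a commonly used default).
    """
    return (1 + soil_factor) * (nir - red) / (nir + red + soil_factor)


# Hypothetical canopy reflectances (fractions between 0 and 1).
nir_reflectance = 0.5
red_reflectance = 0.1

print(f"NDVI = {ndvi(nir_reflectance, red_reflectance):.3f}")
print(f"SAVI = {savi(nir_reflectance, red_reflectance):.3f}")
```

Because SAVI is NDVI with a soil correction term added, the two indices tend to track each other closely, which is consistent with the similar performance reported above.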
The SAFE-T assessment tool: derivation and validation of a web-based application for point-of-care evaluation of gastroenterology fellow performance in colonoscopy.

PubMed

Kumar, Navin L; Kugener, Guillaume; Perencevich, Molly L; Saltzman, John R

2018-01-01

Attending assessment is a critical part of endoscopic education for gastroenterology fellows. The aim of this study was to develop and validate a concise assessment tool to evaluate real-time fellow performance in colonoscopy administered via a web-based application. The Skill Assessment in Fellow Endoscopy Training (SAFE-T) tool was derived as a novel 5-question evaluation tool that captures both summative and formative feedback adapted into a web-based application. A prospective study of 15 gastroenterology fellows (5 fellows each from years 1 to 3 of training) was performed using the SAFE-T tool. An independent reviewer evaluated a subset of these procedures and completed the SAFE-T tool and Mayo Colonoscopy Skills Assessment Tool (MCSAT) for reliability testing. Twenty-six faculty completed 350 SAFE-T evaluations of the 15 fellows in the study. The mean SAFE-T overall score (year 1, 2.00; year 2, 3.84; year 3, 4.28) differentiated each sequential fellow year of training (P < .0001). The mean SAFE-T overall score decreased with increasing case complexity score, with straightforward cases compared with average cases (4.07 vs 3.50, P < .0001), and average cases compared with challenging cases (3.50 vs 3.08, P = .0134). In dual-observed procedures, the SAFE-T tool showed excellent inter-rater reliability with a kappa agreement statistic of 0.898 (P < .0001). Correlation of the SAFE-T overall score with the MCSAT overall hands-on and individual motor scores was excellent (each r > 0.90, P < .0001). We developed and validated the SAFE-T assessment tool, a concise and web-based means of assessing real-time gastroenterology fellow performance in colonoscopy. Copyright © 2018 American Society for Gastrointestinal Endoscopy. Published by Elsevier Inc. All rights reserved.
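The inter-rater reliability reported above is expressed as a kappa statistic. The sketch below computes the unweighted Cohen's kappa for two raters. It is an illustration only: the abstract does not say whether a weighted or unweighted kappa was used, and the scores below are hypothetical, not study data.

```python
from collections import Counter
from typing import Hashable, Sequence


def cohen_kappa(rater_a: Sequence[Hashable], rater_b: Sequence[Hashable]) -> float:
    """Unweighted Cohen's kappa: (p_observed - p_expected) / (1 - p_expected)."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("both raters must score the same non-empty set of cases")
    n = len(rater_a)
    # Proportion of cases on which the two raters gave the same score.
    p_observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Agreement expected by chance, from each rater's marginal frequencies.
    counts_a, counts_b = Counter(rater_a), Counter(rater_b)
    categories = set(counts_a) | set(counts_b)
    p_expected = sum(counts_a[c] * counts_b[c] for c in categories) / (n * n)
    if p_expected == 1.0:
        raise ValueError("kappa is undefined when both raters use a single category")
    return (p_observed - p_expected) / (1 - p_expected)


# Hypothetical overall scores (1-5) from a faculty rater and an
# independent reviewer on the same eight procedures.
faculty = [2, 3, 4, 4, 5, 3, 2, 4]
reviewer = [2, 3, 4, 4, 5, 3, 3, 4]
print(f"kappa = {cohen_kappa(faculty, reviewer):.3f}")
```

Kappa corrects raw agreement for the agreement expected by chance, so it is a stricter measure than the simple percentage of matching scores.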
Prefrontal Neural Activity When Feedback Is Not Relevant to Adjust Performance

PubMed Central

Özyurt, Jale; Rietze, Mareike; Thiel, Christiane M.

2012-01-01

It has been shown that the rostral cingulate zone (RCZ) in humans uses both positive and negative feedback to evaluate performance and to flexibly adjust behaviour. Less is known on how the feedback types are processed by the RCZ and other prefrontal brain areas, when feedback can only be used to evaluate performance, but cannot be used to adjust behaviour. The present fMRI study aimed at investigating feedback that can only be used to evaluate performance in a word-learning paradigm. One group of volunteers (N = 17) received informative, performance-dependent positive or negative feedback after each trial. Since new words had to be learnt in each trial, the feedback could not be used for task-specific adaptations. The other group (N = 17) always received non-informative feedback, providing neither information about performance nor about possible task-specific adaptations. Effects of the informational value of feedback were assessed between-subjects, comparing trials with positive and negative informative feedback to non-informative feedback. Effects of feedback valence were assessed by comparing neural activity to positive and negative feedback within the informative-feedback group. Our results show that several prefrontal regions, including the pre-SMA, the inferior frontal cortex and the insula were sensitive to both, the informational value and the valence aspect of the feedback with stronger activations to informative as compared to non-informative feedback and to informative negative compared to informative positive feedback. The only exception was RCZ which was sensitive to the informational value of the feedback, but not to feedback valence. The findings indicate that outcome information per se is sufficient to activate prefrontal brain regions, with the RCZ being the only prefrontal brain region which is equally sensitive to positive and negative feedback. PMID:22615774
Comparison of Supraglottic Activity and Spectral Slope Between Theater Actors and Vocally Untrained Subjects.

PubMed

Guzman, Marco; Ortega, Andres; Olavarria, Christian; Muñoz, Daniel; Cortés, Pedro; Azocar, Maria Josefina; Cayuleo, David; Quintana, Felipe; Silva, Catalina

2016-11-01

The present study aimed to assess supraglottic activity in theater actors and to observe whether they present differences compared with subjects with no voice training. Acoustic and perceptual analyses were also performed. A total of 20 participants were divided into two groups: an experimental group of trained theater actors, and a comparative group of subjects with no voice training. Absence of laryngeal pathology was confirmed by rigid videostroboscopy. Flexible laryngoscopy was performed to assess supraglottic activity during speaking phonatory tasks. Voice recording was also carried out. Four blinded judges were asked to assess laryngoscopic and perceptual variables using a visual analog scale. A comparison between groups, phonatory tasks, and loudness levels was performed. Multivariate linear regression showed that trained participants had a higher degree of both laryngeal and pharyngeal activities compared with untrained participants. Moreover, phonatory tasks at high intensity showed higher activity than those at medium and low intensities for most phonatory tasks and laryngoscopic parameters. Vocally trained participants evidenced higher values for all spectral variables compared with untrained participants. Actors have a greater degree of both laryngeal and pharyngeal activities than vocally untrained subjects. Apparently, this higher activity is associated with speaking voice training and not with a hyperfunctional vocal behavior. Anterior-posterior laryngeal compression is greater than medial compression. Intensity and phonatory tasks have an effect on all laryngoscopic variables. Supraglottic activity during professional speaking voice may not necessarily be a hyperfunctional behavior, but a strategy to avoid vocal fold damage while producing the desired voice quality. Copyright © 2016 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Accuracy of 3D white light scanning of abutment teeth impressions: evaluation of trueness and precision.

PubMed

Jeon, Jin-Hun; Kim, Hae-Young; Kim, Ji-Hwan; Kim, Woong-Chul

2014-12-01

This study aimed to evaluate the accuracy of digitizing dental impressions of abutment teeth using a white light scanner and to compare the findings among teeth types. To assess precision, impressions of the canine, premolar, and molar prepared to receive all-ceramic crowns were repeatedly scanned to obtain five sets of 3-D data (STL files). Point clouds were compared and error sizes were measured (n=10 per type). Next, to evaluate trueness, impressions of teeth were rotated by 10°-20° and scanned. The obtained data were compared with the first set of data for precision assessment, and the error sizes were measured (n=5 per type). The Kruskal-Wallis test was performed to evaluate precision and trueness among three teeth types, and post-hoc comparisons were performed using the Mann-Whitney U test with Bonferroni correction (α=.05). Precision discrepancies for the canine, premolar, and molar were 3.7 µm, 3.2 µm, and 7.3 µm, respectively, indicating the poorest precision for the molar (P<.001). Trueness discrepancies for teeth types were 6.2 µm, 11.2 µm, and 21.8 µm, respectively, indicating the poorest trueness for the molar (P=.007). In respect to accuracy the molar showed the largest discrepancies compared with the canine and premolar. Digitizing of dental impressions of abutment teeth using a white light scanner was assessed to be a highly accurate method and provided discrepancy values in a clinically acceptable range. Further study is needed to improve digitizing performance of white light scanning in axial wall.
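The post-hoc step described above can be illustrated with the usual form of the Bonferroni correction, in which the overall significance level is divided by the number of pairwise comparisons. This is a sketch under that assumption; the abstract does not state exactly how the correction was applied, and no study data are used.

```python
from itertools import combinations

ALPHA = 0.05  # overall significance level stated in the abstract
tooth_types = ["canine", "premolar", "molar"]

# All pairwise post-hoc comparisons among the three tooth types.
pairs = list(combinations(tooth_types, 2))

# ASSUMPTION: standard Bonferroni correction, alpha divided by the
# number of comparisons.
per_comparison_alpha = ALPHA / len(pairs)

for first, second in pairs:
    print(f"{first} vs {second}: significant if P < {per_comparison_alpha:.4f}")
```

With three tooth types there are three pairwise comparisons, so each Mann-Whitney U test would be judged against a threshold of roughly .0167 rather than .05.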
‘Below average’ Self-Assessed School Performance and Alzheimer's disease in the Aging, Demographics and Memory Study

PubMed Central

Mehta, Kala M.; Stewart, Anita L.; Langa, Kenneth M.; Yaffe, Kristine; Moody-Ayers, Sandra; Williams, Brie A.; Covinsky, Kenneth E.

2009-01-01

Background: Low formal education level is becoming accepted as a risk factor for Alzheimer's disease (AD). Though increasing attention has been paid to educational quality differences, no prior studies have addressed participants' own characterization of their overall performance in school. We examined whether self-assessed school performance is associated with AD beyond the effects of educational level alone. Methods: Participants were drawn from the population-representative Aging, Demographics and Memory Study (ADAMS), 2000-2002. ADAMS participants were asked about their performance in school; possible response options were ‘above average,’ ‘average,’ or ‘below average’. ADAMS participants also had a full neuropsychological battery and were given a research diagnosis of possible/probable AD. Results: The 725 participants (mean age 81.8 years, 59% female, and 16% African-American) varied in their educational performance: 29% reported ‘above average’; 64% ‘average’; and 7% reported ‘below average’ school performance. Participants with lower self-assessed school performance had higher proportions of AD: 11% of participants with ‘above average’ self-assessed performance had AD, compared with 12% of participants with ‘average’ performance and 26% of participants with ‘below average’ performance (p<0.001). After controlling for subjects' years in school, literacy test score (W-RAT), age, sex, race/ethnicity, ApoE-ε4 status, socioeconomic status, and self-reported comorbidity, respondents with ‘below average’ self-assessed school performance were 4 times more likely to have AD compared to those who had average performance (OR 4.0; 95% CI 1.2-14). ‘Above average’ and ‘average’ self-assessed school performance did not increase or decrease the odds of AD (OR 0.9; 95% CI 0.5-1.7). Conclusion: We suggest an association between ‘below average’ self-assessed school performance and AD beyond the known association with formal education. 
Efforts to increase cognitive reserve through better school performance in addition to increasing the number of years of formal education in early life may be important to reduce vulnerability throughout the life course. PMID:19751917
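As a rough illustration of how an odds ratio is formed, the sketch below computes the crude (unadjusted) odds ratio from the AD proportions reported in the abstract for the ‘below average’ (26%) and ‘average’ (12%) groups. It does not reproduce the study's adjusted OR of 4.0, which comes from a multivariable model whose inputs are not available in the abstract.

```python
def odds(proportion: float) -> float:
    """Convert a proportion (0 < p < 1) into odds, p / (1 - p)."""
    if not 0.0 < proportion < 1.0:
        raise ValueError("proportion must lie strictly between 0 and 1")
    return proportion / (1.0 - proportion)


def odds_ratio(p_exposed: float, p_reference: float) -> float:
    """Crude odds ratio of the exposed group relative to the reference group."""
    return odds(p_exposed) / odds(p_reference)


# Proportions with AD as reported in the abstract.
P_AD_BELOW_AVERAGE = 0.26
P_AD_AVERAGE = 0.12

crude_or = odds_ratio(P_AD_BELOW_AVERAGE, P_AD_AVERAGE)
print(f"crude OR (below average vs average) = {crude_or:.2f}")
```

The crude value differs from the adjusted estimate because the latter accounts for years in school, literacy score, age, sex, race/ethnicity, ApoE-ε4 status, socioeconomic status, and comorbidity.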
Student-led tutorials in problem-based learning: educational outcomes and students' perceptions.

PubMed

Kassab, Salah; Abu-Hijleh, Marwan F; Al-Shboul, Qasim; Hamdy, Hossam

2005-09-01

The aim of this study was to examine the effectiveness of using students as tutors in a problem-based learning (PBL) medical curriculum. Ninety-one third-year medical students were divided into ten tutorial groups. The groups were randomly allocated into student-led tutorials (SLT) (five groups, n = 44 students) and faculty-led tutorials (FLT) (five groups, n = 47 students). Outcome measurements included assessment of students' performance in tutorials individually and as a group, end-unit examinations scores, assessment of tutoring skills and identifying students' perceptions about peer tutoring. Student tutors were perceived better in providing feedback and in understanding the difficulties students face in tutorials. Tutorial atmosphere, decision-making and support for the group leader were better in SLT compared with FLT groups. Self-assessment of student performance in SLT was not different from FLT. Student scores in the written and practical examinations were comparable in both groups. However, SLT groups found difficulties in analysis of problems presented in the first tutorial session. We conclude that the impact of peer tutoring on student performance in tutorials, group dynamics, and student achievement in examinations is positive overall. However, student tutors require special training before adopting this approach in PBL programs.
Comparison of two simulation systems to support robotic-assisted surgical training: a pilot study (Swine model).

PubMed

Whitehurst, Sabrina V; Lockrow, Ernest G; Lendvay, Thomas S; Propst, Anthony M; Dunlow, Susan G; Rosemeyer, Christopher J; Gobern, Joseph M; White, Lee W; Skinner, Anna; Buller, Jerome L

2015-01-01

To compare the efficacy of simulation-based training between the Mimic dV-Trainer and traditional dry lab da Vinci robot training. A prospective randomized study analyzing the performance of 20 robotics-naive participants. Participants were enrolled in an online da Vinci Intuitive Surgical didactic training module, followed by training in use of the da Vinci standard surgical robot. Spatial ability tests were performed as well. Participants were randomly assigned to 1 of 2 training conditions: performance of 3 Fundamentals of Laparoscopic Surgery dry lab tasks using the da Vinci or performance of 4 dV-Trainer tasks. Participants in both groups performed all tasks to empirically establish proficiency criterion. Participants then performed the transfer task, a cystotomy closure using the da Vinci robot on a live animal (swine) model. The performance of robotic tasks was blindly assessed by a panel of experienced surgeons using objective tracking data and using the validated Global Evaluative Assessment of Robotic Surgery (GEARS), a structured assessment tool. No statistically significant difference in surgeon performance was found between the 2 training conditions, dV-Trainer and da Vinci robot. Analysis of a 95% confidence interval for the difference in means (-0.803 to 0.543) indicated that the 2 methods are unlikely to differ to an extent that would be clinically meaningful. Based on the results of this study, a curriculum on the dV-Trainer was shown to be comparable to traditional da Vinci robot training. Therefore, we have identified that training on a virtual reality system may be an alternative to live animal training for future robotic surgeons. Published by Elsevier Inc.
The daily life of patients with dementia: A comparative study between the information provided by the caregiver and direct patient assessment

PubMed Central

Bressan, Lucia Aparecida; Vale, Francisco de Assis Carvalho; Speciali, José Geraldo

2007-01-01

The functionality concept is very important, as the diagnosis of dementia presupposes the existence of functional impairment. Instruments assessing functional performance present some limitations. In most cases, the assessment is based on the caregiver’s report. Some studies in international literature have evaluated this issue and concluded that a difference exists between the caregiver’s report and direct patient assessment. American and European caregivers tend to underestimate the patient’s functional limitations. However, this issue has hitherto not been investigated in our context. Objective: To compare the caregiver’s information with direct assessment of the patient’s performance based on the same functional assessment questionnaire. Methods: Seventy-two patients and caregivers were attended by the Occupational Therapy service of the Behavioral Neurology Outpatient Clinic between 1999 and 2001, 25 of whom fulfilled the inclusion criteria: having a confirmed diagnosis of dementia according to the DSM-IV; having attended three or more return appointments, and where the caregiver belonged to the patient’s family nucleus. The remaining subjects were excluded because of non-adherence to treatment or refusal to participate in the study. The Functional Activities Questionnaire by Pfeffer et al., 1982 was applied to patients in a laboratory simulation, while another evaluator interviewed the respective caregivers. The data were analyzed based on the weighted Kappa coefficient and the Wilcoxon test. Results: There were significant differences between caregivers’ answers and direct observation of the patient’s performance. The information provided by the caregivers proved unreliable since caregivers underestimated the patient’s functional capacity. PMID:29213403
GFO and JASON Altimeter Engineering Assessment Report. Update: GFO-Acceptance to End of Mission on October 22, 2008, JASON-Acceptance to September 29, 2008

NASA Technical Reports Server (NTRS)

Conger, A. M.; Hancock, D. W., III; Hayne, G. S.; Brooks, R. L.

2009-01-01

The purpose of this document is to present and document GEOSAT Follow-On (GFO) performance analyses and results. This is the ninth Assessment Report since the initial report and is our final one. This report extends the performance assessment since acceptance on November 29, 2000 to the end of mission (EOM) on October 22, 2008. Since launch, February 10, 1998 to the EOM, we performed a variety of GFO performance studies; Appendix A provides an accumulative index of those studies. We began the inclusion of analyses of the JASON altimeter after the end of the Topographic Experiment (TOPEX) mission. Prior to this, JASON and TOPEX were compared during our assessment of the TOPEX altimeter. With the end of the TOPEX mission, we developed methods to report on JASON as it related to GFO. It should be noted the GFO altimeter, after operating for over 7 years, was power cycled off to on and on to off approximately 14 times a day for over 18 months in space with no failure. The GFO altimeter proved to be a remarkable instrument providing stable ocean surface measurements for nearly eight years. This report completes our GFO altimeter performance assessment.
Assessment of Age-Related Differences in Functional Capacity Using the Virtual Reality Functional Capacity Assessment Tool (VRFCAT)

PubMed Central

Atkins, A.S.; Stroescu, I.; Spagnola, N.B.; Davis, V.G.; Patterson, T.D.; Narasimhan, M.; Harvey, P.D.; Keefe, R.S.E.

2015-01-01

Clinical trials for primary prevention and early intervention in preclinical AD require measures of functional capacity with improved sensitivity to deficits in healthier, non-demented individuals. To this end, the Virtual Reality Functional Capacity Assessment Tool (VRFCAT) was developed as a direct performance-based assessment of functional capacity that is sensitive to changes in function across multiple populations. Using a realistic virtual reality environment, the VRFCAT assesses a subject's ability to complete instrumental activities associated with a shopping trip. The present investigation represents an initial evaluation of the VRFCAT as a potential co-primary measure of functional capacity in healthy aging and preclinical MCI/AD by examining test-retest reliability and associations with cognitive performance in healthy young and older adults. The VRFCAT was compared and contrasted with the UPSA-2-VIM, a traditional performance-based assessment utilizing physical props. Results demonstrated strong age-related differences in performance on each VRFCAT outcome measure, including total completion time, total errors, and total forced progressions. VRFCAT performance showed strong correlations with cognitive performance across both age groups. VRFCAT Total Time demonstrated good test-retest reliability (ICC=.80 in young adults; ICC=.64 in older adults) and insignificant practice effects, indicating the measure is suitable for repeated testing in healthy populations. Taken together, these results provide preliminary support for the VRFCAT as a potential measure of functionally relevant change in primary prevention and preclinical AD/MCI trials. PMID:26618145
Self-assessment in schizophrenia: Accuracy of evaluation of cognition and everyday functioning.

PubMed

Gould, Felicia; McGuire, Laura Stone; Durand, Dante; Sabbag, Samir; Larrauri, Carlos; Patterson, Thomas L; Twamley, Elizabeth W; Harvey, Philip D

2015-09-01

Self-assessment deficits, often referred to as impaired insight or unawareness of illness, are well established in people with schizophrenia. There are multiple levels of awareness, including awareness of symptoms, functional deficits, cognitive impairments, and the ability to monitor cognitive and functional performance in an ongoing manner. The present study aimed to evaluate the comparative predictive value of each aspect of awareness on the levels of everyday functioning in people with schizophrenia. We examined multiple aspects of self-assessment of functioning in 214 people with schizophrenia. We also collected information on everyday functioning rated by high contact clinicians and examined the importance of self-assessment for the prediction of real-world functional outcomes. The relative impact of performance-based measures of cognition, functional capacity, and metacognitive performance on everyday functioning was also examined. Misestimation of ability emerged as the strongest predictor of real-world functioning and exceeded the influences of cognitive performance, functional capacity performance, and performance-based assessment of metacognitive monitoring. The relative contribution of the factors other than self-assessment varied according to which domain of everyday functioning was being examined, but, in all cases, accounted for less predictive variance. These results underscore the functional impact of misestimating one's current functioning and relative level of ability. These findings are consistent with the use of insight-focused treatments and compensatory strategies designed to increase self-awareness in multiple functional domains. (c) 2015 APA, all rights reserved.
Self Assessment in Schizophrenia: Accuracy of Evaluation of Cognition and Everyday Functioning

PubMed Central

Gould, Felicia; McGuire, Laura Stone; Durand, Dante; Sabbag, Samir; Larrauri, Carlos; Patterson, Thomas L.; Twamley, Elizabeth W.; Harvey, Philip D.

2015-01-01

Objective: Self-assessment deficits, often referred to as impaired insight or unawareness of illness, are well established in people with schizophrenia. There are multiple levels of awareness, including awareness of symptoms, functional deficits, cognitive impairments, and the ability to monitor cognitive and functional performance in an ongoing manner. The present study aimed to evaluate the comparative predictive value of each aspect of awareness on the levels of everyday functioning in people with schizophrenia. Method: We examined multiple aspects of self-assessment of functioning in 214 people with schizophrenia. We also collected information on everyday functioning rated by high contact clinicians and examined the importance of self-assessment for the prediction of real world functional outcomes. The relative impact of performance based measures of cognition, functional capacity, and metacognitive performance on everyday functioning was also examined. Results: Misestimation of ability emerged as the strongest predictor of real world functioning and exceeded the influences of cognitive performance, functional capacity performance, and performance-based assessment of metacognitive monitoring. The relative contribution of the factors other than self-assessment varied according to which domain of everyday functioning was being examined, but in all cases, accounted for less predictive variance. Conclusions: These results underscore the functional impact of misestimating one’s current functioning and relative level of ability. These findings are consistent with the use of insight-focused treatments and compensatory strategies designed to increase self-awareness in multiple functional domains. PMID:25643212
Health system frameworks and performance indicators in eight countries: A comparative international analysis

PubMed Central

Braithwaite, Jeffrey; Hibbert, Peter; Blakely, Brette; Plumb, Jennifer; Hannaford, Natalie; Long, Janet Cameron; Marks, Danielle

2017-01-01

Objectives: Performance indicators are a popular mechanism for measuring the quality of healthcare to facilitate both quality improvement and systems management. Few studies make comparative assessments of different countries’ performance indicator frameworks. This study identifies and compares frameworks and performance indicators used in selected Organisation for Economic Co-operation and Development health systems to measure and report on the performance of healthcare organisations and local health systems. Countries involved are Australia, Canada, Denmark, England, the Netherlands, New Zealand, Scotland and the United States. Methods: Identification of comparable international indicators and analyses of their characteristics and of their broader national frameworks and contexts were undertaken. Two dimensions of indicators – that they are nationally consistent (used across the country rather than just regionally) and locally relevant (measured and reported publicly at a local level, for example, a health service) – were deemed important. Results: The most commonly used domains in performance frameworks were safety, effectiveness and access. The search found 401 indicators that fulfilled the ‘nationally consistent and locally relevant’ criteria. Of these, 45 indicators are reported in more than one country. Cardiovascular, surgery and mental health were the most frequently reported disease groups. Conclusion: These comparative data inform researchers and policymakers internationally when designing health performance frameworks and indicator sets. PMID:28228948
Sensor fusion display evaluation using information integration models in enhanced/synthetic vision applications

NASA Technical Reports Server (NTRS)

Foyle, David C.

1993-01-01

Based on existing integration models in the psychological literature, an evaluation framework is developed to assess sensor fusion displays as might be implemented in an enhanced/synthetic vision system. The proposed evaluation framework for evaluating the operator's ability to use such systems is a normative approach: The pilot's performance with the sensor fusion image is compared to models' predictions based on the pilot's performance when viewing the original component sensor images prior to fusion. This allows for the determination as to when a sensor fusion system leads to: poorer performance than one of the original sensor displays, clearly an undesirable system in which the fused sensor system causes some distortion or interference; better performance than with either single sensor system alone, but at a sub-optimal level compared to model predictions; optimal performance compared to model predictions; or, super-optimal performance, which may occur if the operator were able to use some highly diagnostic 'emergent features' in the sensor fusion display, which were unavailable in the original sensor displays.
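The four outcome categories described in this abstract can be read as a simple decision rule. The Python sketch below is illustrative only: the function name, the `tol` tolerance, and the assumption that higher scores mean better performance are not from the report, and the model prediction is taken as a given number rather than derived from any particular integration model.

```python
def classify_fusion(fused, sensor_a, sensor_b, predicted, tol=0.0):
    """Label fused-display performance relative to the single-sensor
    displays and to a model prediction (hypothetical helper).

    Assumes higher values mean better performance and that `predicted`
    is at least as high as the better single-sensor score.
    """
    best_single = max(sensor_a, sensor_b)
    if fused < best_single - tol:
        # Fusion made things worse than one of the original displays.
        return "poorer than a single sensor"
    if fused > predicted + tol:
        # Operator did better than the integration model predicts.
        return "super-optimal"
    if abs(fused - predicted) <= tol:
        return "optimal"
    # At or above the best single sensor, but below the prediction.
    return "sub-optimal"


print(classify_fusion(0.80, 0.70, 0.50, 0.90))
```

With real data, `tol` would more plausibly come from a statistical test than from a fixed number; the fixed tolerance is a simplification for the sketch.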
Initial construct validity evidence of a virtual human application for competency assessment in breaking bad news to a cancer patient.

PubMed

Guetterman, Timothy C; Kron, Frederick W; Campbell, Toby C; Scerbo, Mark W; Zelenski, Amy B; Cleary, James F; Fetters, Michael D

2017-01-01

Despite interest in using virtual humans (VHs) for assessing health care communication, evidence of validity is limited. We evaluated the validity of a VH application, MPathic-VR, for assessing performance-based competence in breaking bad news (BBN) to a VH patient. We used a two-group quasi-experimental design, with residents participating in a 3-hour seminar on BBN. Group A (n=15) completed the VH simulation before and after the seminar, and Group B (n=12) completed the VH simulation only after the BBN seminar to avoid the possibility that testing alone affected performance. Pre- and postseminar differences for Group A were analyzed with a paired t-test, and comparisons between Groups A and B were analyzed with an independent t-test. Compared to the preseminar result, Group A's postseminar scores improved significantly, indicating that the VH program was sensitive to differences in assessing performance-based competence in BBN. Postseminar scores of Group A and Group B were not significantly different, indicating that both groups performed similarly on the VH program. Improved pre-post scores demonstrate acquisition of skills in BBN to a VH patient. Pretest sensitization did not appear to influence posttest assessment. These results provide initial construct validity evidence that the VH program is effective for assessing BBN performance-based communication competence.
Initial construct validity evidence of a virtual human application for competency assessment in breaking bad news to a cancer patient

PubMed Central

Guetterman, Timothy C; Kron, Frederick W; Campbell, Toby C; Scerbo, Mark W; Zelenski, Amy B; Cleary, James F; Fetters, Michael D

2017-01-01

Background: Despite interest in using virtual humans (VHs) for assessing health care communication, evidence of validity is limited. We evaluated the validity of a VH application, MPathic-VR, for assessing performance-based competence in breaking bad news (BBN) to a VH patient. Methods: We used a two-group quasi-experimental design, with residents participating in a 3-hour seminar on BBN. Group A (n=15) completed the VH simulation before and after the seminar, and Group B (n=12) completed the VH simulation only after the BBN seminar to avoid the possibility that testing alone affected performance. Pre- and postseminar differences for Group A were analyzed with a paired t-test, and comparisons between Groups A and B were analyzed with an independent t-test. Results: Compared to the preseminar result, Group A’s postseminar scores improved significantly, indicating that the VH program was sensitive to differences in assessing performance-based competence in BBN. Postseminar scores of Group A and Group B were not significantly different, indicating that both groups performed similarly on the VH program. Conclusion: Improved pre–post scores demonstrate acquisition of skills in BBN to a VH patient. Pretest sensitization did not appear to influence posttest assessment. These results provide initial construct validity evidence that the VH program is effective for assessing BBN performance-based communication competence. PMID:28794664
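The two comparisons in this design use different test statistics: the pre/post comparison within Group A is paired, while the Group A versus Group B comparison is between independent samples. A minimal sketch of both, using only the Python standard library, is given below. The function names are hypothetical, and the pooled-variance (Student) form of the independent test is an assumption, since the abstract does not say which variant was used.

```python
import math
import statistics


def paired_t(pre, post):
    """t statistic for paired scores: mean difference over its standard error."""
    diffs = [after - before for before, after in zip(pre, post)]
    n = len(diffs)
    return statistics.mean(diffs) / (statistics.stdev(diffs) / math.sqrt(n))


def independent_t(group_a, group_b):
    """Pooled-variance (Student) t statistic for two independent groups."""
    na, nb = len(group_a), len(group_b)
    pooled_var = (
        (na - 1) * statistics.variance(group_a)
        + (nb - 1) * statistics.variance(group_b)
    ) / (na + nb - 2)
    std_err = math.sqrt(pooled_var * (1 / na + 1 / nb))
    return (statistics.mean(group_a) - statistics.mean(group_b)) / std_err


# Made-up scores, purely to exercise the functions.
print(paired_t([1, 2, 3], [2, 4, 6]))
print(independent_t([2, 4, 6], [1, 3, 5]))
```

Turning either statistic into a p-value requires the t distribution with n-1 (paired) or na+nb-2 (pooled) degrees of freedom, which is left out here to keep the sketch dependency-free.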
Promotion Factors For Enlisted Infantry Marines

DTIC Science & Technology

2017-06-01

description, billet accomplishments, mission accomplishment, individual character, leadership, intellect and wisdom, fulfillment of evaluation, RS...staff sergeant. To assess which ranks proportionally promote more high-quality Marines, we compare two performance evaluation methods: proficiency and...adverse fitness reports. From the two performance evaluation methods we find that the Marine Corps promotes proportionally more high-quality Marines

Evolutionary space platform concept study. Volume 2, part B: Manned space platform concepts

NASA Technical Reports Server (NTRS)

1982-01-01

Logical, cost-effective steps in the evolution of manned space platforms are investigated and assessed. Tasks included the analysis of requirements for a manned space platform, identifying alternative concepts, performing system analysis and definition of the concepts, comparing the concepts and performing programmatic analysis for a reference concept.
An Evaluation of the IntelliMetric[SM] Essay Scoring System

ERIC Educational Resources Information Center

Rudner, Lawrence M.; Garcia, Veronica; Welch, Catherine

2006-01-01

This report provides a two-part evaluation of the IntelliMetric[SM] automated essay scoring system based on its performance scoring essays from the Analytic Writing Assessment of the Graduate Management Admission Test[TM] (GMAT[TM]). The IntelliMetric system performance is first compared to that of individual human raters, a Bayesian system…
Monitoring the Performance of Human and Automated Scores for Spoken Responses

ERIC Educational Resources Information Center

Wang, Zhen; Zechner, Klaus; Sun, Yu

2018-01-01

As automated scoring systems for spoken responses are increasingly used in language assessments, testing organizations need to analyze their performance, as compared to human raters, across several dimensions, for example, on individual items or based on subgroups of test takers. In addition, there is a need in testing organizations to establish…
The Relative Performance of Female and Male Students in Accounting Principles Classes.

ERIC Educational Resources Information Center

Bouillon, Marvin L.; Doran, B. Michael

1992-01-01

The performance of female and male students in Accounting Principles (AP) I and II was compared by using multiple regression techniques to assess the incremental explanatory effects of gender. Males significantly outperformed females in AP I, contradicting earlier studies. Similar gender of instructor and student was insignificant. (JOW)
Linking TIMSS and NAEP Assessments to Evaluate International Trends in Achievement

ERIC Educational Resources Information Center

Lim, Hwanggyu; Sireci, Stephen G.

2017-01-01

The Trends in International Mathematics and Science Study (TIMSS) makes it possible to compare the performance of students in the US in Mathematics and Science to the performance of students in other countries. TIMSS uses four international benchmarks for describing student achievement: Low, Intermediate, High, and Advanced. In this study, we…
Performance of Building Technology Graduates in the Construction Industry in Ghana

ERIC Educational Resources Information Center

Ayarkwa, J.; Dansoh, Ayirebi; Adinyira, E.; Amoah, P.

2011-01-01

Purpose: This paper aims to assess the perception of the Ghanaian construction industry of the performance of entry-level building technology graduates. Also, other non-technical skills or attributes expected from building technology graduates are to be compared with the actual proficiency of the graduates. Design/methodology/approach: The…
The objective assessment of experts' and novices' suturing skills using an image analysis program.

PubMed

Frischknecht, Adam C; Kasten, Steven J; Hamstra, Stanley J; Perkins, Noel C; Gillespie, R Brent; Armstrong, Thomas J; Minter, Rebecca M

2013-02-01

To objectively assess suturing performance using an image analysis program and to provide validity evidence for this assessment method by comparing experts' and novices' performance. In 2009, the authors used an image analysis program to extract objective variables from digital images of suturing end products obtained during a previous study involving third-year medical students (novices) and surgical faculty and residents (experts). Variables included number of stitches, stitch length, total bite size, travel, stitch orientation, total bite-size-to-travel ratio, and symmetry across the incision ratio. The authors compared all variables between groups to detect significant differences and two variables (total bite-size-to-travel ratio and symmetry across the incision ratio) to ideal values. Five experts and 15 novices participated. Experts' and novices' performances differed significantly (P < .05) with large effect sizes attributable to experience (Cohen d > 0.8) for total bite size (P = .009, d = 1.5), travel (P = .045, d = 1.1), total bite-size-to-travel ratio (P < .0001, d = 2.6), stitch orientation (P = .014, d = 1.4), and symmetry across the incision ratio (P = .022, d = 1.3). The authors found that a simple computer algorithm can extract variables from digital images of a running suture and rapidly provide quantitative summative assessment feedback. The significant differences found between groups confirm that this system can discriminate between skill levels. This image analysis program represents a viable training tool for objectively assessing trainees' suturing, a foundational skill for many medical specialties.
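The effect sizes reported here are Cohen's d values, with d > 0.8 treated as large. As a rough illustration of how such a value is obtained, the sketch below computes d as the difference in group means divided by the pooled standard deviation. The pooled-SD variant and the function name are assumptions, as the abstract does not state which formula the authors used, and the numbers are made up.

```python
import math
import statistics


def cohens_d(group_a, group_b):
    """Cohen's d using the pooled sample standard deviation (assumed variant)."""
    na, nb = len(group_a), len(group_b)
    pooled_var = (
        (na - 1) * statistics.variance(group_a)
        + (nb - 1) * statistics.variance(group_b)
    ) / (na + nb - 2)
    return (statistics.mean(group_a) - statistics.mean(group_b)) / math.sqrt(pooled_var)


# Made-up measurements, not data from the study.
d = cohens_d([2, 4, 6], [1, 3, 5])
print(d, "large" if abs(d) > 0.8 else "not large")
```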
Does feedback matter? Practice-based learning for medical students after a multi-institutional clinical performance examination.

PubMed

Srinivasan, Malathi; Hauer, Karen E; Der-Martirosian, Claudia; Wilkes, Michael; Gesundheit, Neil

2007-09-01

Achieving competence in 'practice-based learning' implies that doctors can accurately self-assess their clinical skills to identify behaviours that need improvement. This study examines the impact of receiving feedback via performance benchmarks on medical students' self-assessment after a clinical performance examination (CPX). The authors developed a practice-based learning exercise at 3 institutions following a required 8-station CPX for medical students at the end of Year 3. Standardised patients (SPs) scored students after each station using checklists developed by experts. Students assessed their own performance immediately after the CPX (Phase 1). One month later, students watched their videotaped performance and reassessed (Phase 2). Some students received performance benchmarks (their scores, plus normative class data) before the video review. Pearson's correlations between self-ratings and SP ratings were calculated for overall performance and specific skill areas (history taking, physical examination, doctor-patient communication) for Phase 1 and Phase 2. The 2 correlations were then compared for each student group (i.e. those who received and those who did not receive feedback). A total of 280 students completed both study phases. Mean CPX scores ranged from 51% to 71% of items correct overall and for each skill area. Phase 1 self-assessment correlated weakly with SP ratings of student performance (r = 0.01-0.16). Without feedback, Phase 2 correlations remained weak (r = 0.13-0.18; n = 109). With feedback, Phase 2 correlations improved significantly (r = 0.26-0.47; n = 171). Low-performing students showed the greatest improvement after receiving feedback. The accuracy of student self-assessment was poor after a CPX, but improved significantly with performance feedback (scores and benchmarks). Videotape review alone (without feedback) did not improve self-assessment accuracy. Practice-based learning exercises that incorporate feedback to medical students hold promise to improve self-assessment skills.
Energy and exergy assessments for an enhanced use of energy in buildings

NASA Astrophysics Data System (ADS)

Goncalves, Pedro Manuel Ferreira

Exergy analysis has been found to be a useful method for improving the conversion efficiency of energy resources, since it helps to identify locations, types and true magnitudes of wastes and losses. It has also been applied for other purposes, such as distinguishing high- from low-quality energy sources or defining the engineering technological limits in designing more energy-efficient systems. In this doctoral thesis, the exergy analysis is widely applied in order to highlight and demonstrate it as a significant method of performing energy assessments of buildings and related energy supply systems. It aims to make the concept more familiar and accessible for building professionals and to encourage its wider use in engineering practice. Case study I aims to show the importance of exergy analysis in the energy performance assessment of eight space heating building options evaluated under different outdoor environmental conditions. This study is concerned with the so-called "reference state", which in this study is calculated using the average outdoor temperature for a given period of analysis. Primary energy and related exergy ratios are assessed and compared. Higher primary exergy ratios are obtained for low outdoor temperatures, while the primary energy ratios are assumed as constant for the same scenarios. The outcomes of this study demonstrate the significance of exergy analysis in comparison with energy analysis when different reference states are compared. Case study II and Case study III present two energy and exergy assessment studies applied to a hotel and a student accommodation building, respectively. Case study II compares the energy and exergy performance of the main end uses of a hotel building located in Coimbra in central Portugal, using data derived from an energy audit. Case study III uses data collected from energy utilities bills to estimate the energy and exergy performance associated with each building end use. Additionally, a set of energy supply options are proposed and assessed in terms of primary energy demand and exergy efficiency, showing it as a possible benchmarking method for future legislative frameworks regarding the energy performance assessment of buildings. Case study IV proposes a set of complementary indicators for comparing cogeneration and separate heat and electricity production systems. It aims to identify the advantages of exergy analysis relative to energy analysis, giving particular examples where these advantages are significant. The results demonstrate that exergy analysis can reveal meaningful information that might not be accessible using a conventional energy analysis approach, which is particularly evident when cogeneration and separated systems provide heat at very different temperatures. Case study V follows the exergy analysis method to evaluate the energy and exergy performance of a desiccant cooling system, aiming to assess and locate sources of irreversibility. The results reveal that the natural gas boiler is the most inefficient component of the plant in question, followed by the chiller and heating coil. A set of alternative heating supply options for desiccant wheel regeneration is proposed, showing that, while some renewables may effectively reduce the primary energy demand of the plant, this may not correspond to the optimum level of exergy efficiency. The thermal and chemical exergy components of moist air are also evaluated, as well as the influence of outdoor environmental conditions on the energy/exergy performance of the plant. This research provides knowledge that is essential for the future development of complementary energy- and exergy-based indicators, helping to improve the current methodologies on performance assessments of buildings, cogeneration and desiccant cooling systems. The significance of exergy analysis is demonstrated for different types of buildings, which may be located in different climates (reference states) and be supplied by different types of energy sources. (Abstract shortened by ProQuest.).
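The dependence of exergy on the reference state, which Case study I turns on, can be illustrated with the standard textbook expression for the exergy of heat: heat Q supplied at absolute temperature T carries exergy Q(1 - T0/T), where T0 is the reference (outdoor) temperature. The sketch below uses that general relation only. The function name and the numbers are illustrative assumptions, and the thesis's own formulas and data are not given in the abstract.

```python
def heat_exergy(q, t_supply_k, t_ref_k):
    """Exergy of heat q delivered at t_supply_k, relative to reference t_ref_k.

    Temperatures are in kelvin; the result has the same unit as q.
    Uses the Carnot factor (1 - T0/T), a textbook relation rather than
    a formula taken from the thesis.
    """
    if t_supply_k <= 0 or t_ref_k <= 0:
        raise ValueError("temperatures must be positive absolute values")
    return q * (1.0 - t_ref_k / t_supply_k)


# Illustrative case: 1000 units of heat delivered to a room at 20 degrees C.
room_k = 293.15
for outdoor_c in (0.0, 10.0):
    exergy = heat_exergy(1000.0, room_k, outdoor_c + 273.15)
    print(f"outdoor {outdoor_c:4.1f} C -> exergy {exergy:.1f}")
```

For the same heat demand, the colder reference state gives the larger exergy value, which is in line with the abstract's remark that exergy ratios rise at low outdoor temperatures while energy ratios stay constant.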
Subjective and objective quantification of physician's workload and performance during radiation therapy planning tasks.

PubMed

Mazur, Lukasz M; Mosaly, Prithima R; Hoyle, Lesley M; Jones, Ellen L; Marks, Lawrence B

2013-01-01

To quantify, and compare, workload for several common physician-based treatment planning tasks using objective and subjective measures of workload. To assess the relationship between workload and performance to define workload levels where performance could be expected to decline. Nine physicians performed the same 3 tasks on each of 2 cases ("easy" vs "hard"). Workload was assessed objectively throughout the tasks (via monitoring of pupil size and blink rate), and subjectively at the end of each case (via National Aeronautics and Space Administration Task Load Index; NASA-TLX). NASA-TLX assesses the 6 dimensions (mental, physical, and temporal demands, frustration, effort, and performance); scores > or ≈ 50 are associated with reduced performance in other industries. Performance was measured using participants' stated willingness to approve the treatment plan. Differences in subjective and objective workload between cases, tasks, and experience were assessed using analysis of variance (ANOVA). The correlations between subjective and objective workload measures were assessed via the Pearson correlation test. The relationships between workload and performance measures were assessed using the t test. Eighteen case-wise and 54 task-wise assessments were obtained. Subjective NASA-TLX scores (P < .001), but not time-weighted averages of objective scores (P > .1), were significantly lower for the easy vs hard case. Most correlations between the subjective and objective measures were not significant, except between average blink rate and NASA-TLX scores (r = -0.34, P = .02), for task-wise assessments. Performance appeared to decline at NASA-TLX scores of ≥55. The NASA-TLX may provide a reasonable method to quantify subjective workload for broad activities, and objective physiologic eye-based measures may be useful to monitor workload for more granular tasks within activities. The subjective and objective measures, as herein quantified, do not necessarily track each other, and more work is needed to assess their utilities. From a series of controlled experiments, we found that performance appears to decline at subjective workload levels ≥55 (as measured via NASA-TLX), which is consistent with findings from other industries. Copyright © 2013 American Society for Radiation Oncology. Published by Elsevier Inc. All rights reserved.
Assessment of motor and process skills: assessing client work performance in Belgium.

PubMed

Vandamme, Dirk

2010-01-01

The aim of this study is to establish whether the Assessment of Motor and Process Skills (AMPS) is an appropriate tool to evaluate the quality of work performance by comparing clients' results on the AMPS with the quality of the skills that they demonstrate on the shop floor. A convenience sample of chronically unemployed (vocationally disabled) participants (N=139) with no formal training who were seeking unskilled work through Jobcentrum West-Vlaanderen (West Flanders Job Centre, Belgium) was used. Results demonstrated that in 75.2% of cases the prediction of employment outcome was correct; it is suggested that an AMPS motor score < 2.5 and a process score < 1.2 is insufficient for regular employment, while a motor score > 3.1 and process score > 1.5 indicates that regular employment is a realistic goal. The quality of the motor skills measured by the AMPS and measured on the shop floor are comparable, but little similarity was found in the measurement of process skills.
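The cut-off values suggested in this abstract amount to a three-way rule, which the sketch below writes out. It is a reading of the abstract rather than the study's scoring procedure: the function name and the "indeterminate" label for scores between the cut-offs are hypothetical, and requiring both scores to fall below (or above) their cut-offs is an assumption about how the "and" in the abstract should be interpreted.

```python
def amps_employment_outlook(motor, process):
    """Apply the AMPS cut-offs quoted in the abstract (hypothetical helper).

    motor < 2.5 and process < 1.2 -> insufficient for regular employment
    motor > 3.1 and process > 1.5 -> regular employment is a realistic goal
    anything else                 -> not covered by the quoted cut-offs
    """
    if motor < 2.5 and process < 1.2:
        return "insufficient for regular employment"
    if motor > 3.1 and process > 1.5:
        return "regular employment realistic"
    return "indeterminate"


print(amps_employment_outlook(3.4, 1.8))
```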
Technical and Economic Assessment of Span-Loaded Cargo Aircraft Concepts

NASA Technical Reports Server (NTRS)

1976-01-01

The benefits of span distributed loading concepts as applied to future commercial air cargo operations are assessed. A two-phased program is used to perform this assessment. The first phase consists of selected parametric studies to define significant configuration, performance, and economic trends. The second phase consists of more detailed engineering design, analysis, and economic evaluations to define the technical and economic feasibility of a selected spanloader design. A conventional all-cargo aircraft of comparable technology and size is used as a comparator system. The technical feasibility of the spanloader concept is demonstrated, with no new major technology efforts required to implement the system. However, certain high pay-off technologies such as winglets, airfoil design, and advanced structural materials and manufacturing techniques need refinement and definition prior to application. In addition, further structural design analysis could establish the techniques and criteria necessary to fully capitalize upon the high degree of structural commonality and simplicity inherent in the spanloader concept.
Crowd-sourced assessment of technical skills: an adjunct to urology resident surgical simulation training.

PubMed

Holst, Daniel; Kowalewski, Timothy M; White, Lee W; Brand, Timothy C; Harper, Jonathan D; Sorenson, Mathew D; Kirsch, Sarah; Lendvay, Thomas S

2015-05-01

Crowdsourcing is the practice of obtaining services from a large group of people, typically an online community. Validated methods of evaluating surgical video are time-intensive, expensive, and involve participation of multiple expert surgeons. We sought to obtain valid performance scores of urologic trainees and faculty on a dry-laboratory robotic surgery task module by using crowdsourcing through a web-based grading tool called Crowd Sourced Assessment of Technical Skill (CSATS). IRB approval was granted to test the technical skills grading accuracy of Amazon.com Mechanical Turk™ crowd-workers compared to three expert faculty surgeon graders. The two groups assessed dry-laboratory robotic surgical suturing performances of three urology residents (PGY-2, -4, -5) and two faculty using three performance domains from the validated Global Evaluative Assessment of Robotic Skills assessment tool. After an average of 2 hours 50 minutes, each of the five videos received 50 crowd-worker assessments. The inter-rater reliability (IRR) between the surgeons and crowd was 0.91 using Cronbach's alpha statistic (confidence intervals=0.20-0.92), indicating an agreement level between the two groups of "excellent." The crowds were able to discriminate the surgical level, and both the crowds and the expert faculty surgeon graders scored one senior trainee's performance above a faculty's performance. Surgery-naive crowd-workers can rapidly assess varying levels of surgical skill accurately relative to a panel of faculty raters. The crowds provided rapid feedback and were inexpensive. CSATS may be a valuable adjunct to surgical simulation training as requirements for more granular and iterative performance tracking of trainees become mandated and commonplace.
Carbon-Carbon Recuperators in Closed-Brayton-Cycle Nuclear Space Power Systems: A Feasibility Assessment

NASA Technical Reports Server (NTRS)

Barrett, Michael J.; Johnson, Paul K.

2004-01-01

The feasibility of using carbon-carbon recuperators in closed-Brayton-cycle (CBC) nuclear space power conversion systems (PCS) was assessed. Recuperator performance expectations were forecast based on projected thermodynamic cycle state values for a planetary mission. Resulting thermal performance, mass and volume for a plate-fin carbon-carbon recuperator were estimated and quantitatively compared with values for a conventional offset-strip-fin metallic design. Material compatibility issues regarding carbon-carbon surfaces exposed to the working fluid in the CBC PCS were also discussed.
A Study of Grid Resolution, Transition and Turbulence Model Using the Transonic Simple Straked Delta Wing

NASA Technical Reports Server (NTRS)

Bartels, Robert E.

2001-01-01

Three-dimensional transonic flow over a delta wing is investigated using several turbulence models. The performance of linear eddy viscosity models and an explicit algebraic stress model is assessed at the start of vortex flow, and the results compared with experimental data. To assess the effect of transition location, computations that either fix transition aft of the leading edge or are fully turbulent are performed. These computations show that grid resolution, transition location and turbulence model significantly affect the 3D flowfield.
Performance of Four Frailty Classifications in Older Patients With Cancer: Prospective Elderly Cancer Patients Cohort Study.

PubMed

Ferrat, Emilie; Paillaud, Elena; Caillet, Philippe; Laurent, Marie; Tournigand, Christophe; Lagrange, Jean-Léon; Droz, Jean-Pierre; Balducci, Lodovico; Audureau, Etienne; Canouï-Poitrine, Florence; Bastuji-Garin, Sylvie

2017-03-01

Purpose Frailty classifications of older patients with cancer have been developed to assist physicians in selecting cancer treatments and geriatric interventions. They have not been compared, and their performance in predicting outcomes has not been assessed. Our objectives were to assess agreement among four classifications and to compare their predictive performance in a large cohort of in- and outpatients with various cancers. Patients and Methods We prospectively included 1,021 patients age 70 years or older who had solid or hematologic malignancies and underwent a geriatric assessment in one of two French teaching hospitals between 2007 and 2012. Among them, 763 were assessed using four classifications: Balducci, International Society of Geriatric Oncology (SIOG) 1, SIOG2, and a latent class typology. Agreement was assessed using the κ statistic. Outcomes were 1-year mortality and 6-month unscheduled admissions. Results All four classifications had good discrimination for 1-year mortality (C-index ≥ 0.70); discrimination was best with SIOG1. For 6-month unscheduled admissions, discrimination was good with all four classifications (C-index ≥ 0.70). For classification into three (fit, vulnerable, or frail) or two categories (fit v vulnerable or frail and fit or vulnerable v frail), agreement among the four classifications ranged from very poor (κ ≤ 0.20) to good (0.60 < κ ≤ 0.80). Agreement was best between SIOG1 and the latent class typology and between SIOG1 and Balducci. Conclusion These four frailty classifications have good prognostic performance among older in- and outpatients with various cancers. They may prove useful in decision making about cancer treatments and geriatric interventions and/or in stratifying older patients with cancer in clinical trials.
Comparative performance assessment of point-of-care testing devices for measuring glucose and ketones at the patient bedside.

PubMed

Ceriotti, Ferruccio; Kaczmarek, Ewa; Guerra, Elena; Mastrantonio, Fabrizio; Lucarelli, Fausto; Valgimigli, Francesco; Mosca, Andrea

2015-03-01

Point-of-care (POC) testing devices for monitoring glucose and ketones can play a key role in the management of dysglycemia in hospitalized diabetes patients. The accuracy of glucose devices can be influenced by biochemical changes that commonly occur in critically ill hospital patients and by the medication prescribed. Little is known about the influence of these factors on ketone POC measurements. The aim of this study was to assess the analytical performance of POC hospital whole-blood glucose and ketone meters and the extent of glucose interference factors on the design and accuracy of ketone results. StatStrip glucose/ketone, Optium FreeStyle glucose/ketone, and Accu-Chek Performa glucose were also assessed and results compared to a central laboratory reference method. The analytical evaluation was performed according to Clinical and Laboratory Standards Institute (CLSI) protocols for precision, linearity, method comparison, and interference. The interferences assessed included acetoacetate, acetaminophen, ascorbic acid, galactose, maltose, uric acid, and sodium. The accuracies of both Optium ketone and glucose measurements were significantly influenced by varying levels of hematocrit and ascorbic acid. StatStrip ketone and glucose measurements were unaffected by the interferences tested with exception of ascorbic acid, which reduced the higher level ketone value. The accuracy of Accu-Chek glucose measurements was affected by hematocrit, by ascorbic acid, and significantly by galactose. The method correlation assessment indicated differences between the meters in compliance to ISO 15197 and CLSI 12-A3 performance criteria. Combined POC glucose/ketone methods are now available. The use of these devices in a hospital setting requires careful consideration with regard to the selection of instruments not sensitive to hematocrit variation and presence of interfering substances. © 2014 Diabetes Technology Society.
Incorporation of core competency questions into an annual national self-assessment examination for residents in physical medicine and rehabilitation: results and implications.

PubMed

Webster, Joseph B

2009-03-01

To determine the performance and change over time when incorporating questions in the core competency domains of practice-based learning and improvement (PBLI), systems-based practice (SBP), and professionalism (PROF) into the national PM&R Self-Assessment Examination for Residents (SAER). Prospective, longitudinal analysis. The national Self-Assessment Examination for Residents (SAER) in Physical Medicine and Rehabilitation, which is administered annually. Approximately 1100 PM&R residents who take the examination annually. Inclusion of progressively more challenging questions in the core competency domains of PBLI, SBP, and PROF. Individual test item level of difficulty (P value) and discrimination (point biserial index). Compared with the overall test, questions in the subtopic areas of PBLI, SBP, and PROF were relatively easier and less discriminating (correlation of resident performance on these domains compared with that on the total test). These differences became smaller during the 3-year time period. The difficulty level of the questions in each of the subtopic domains was raised during the 3 year period to a level close to the overall exam. Discrimination of the test items improved or remained stable. This study demonstrates that, with careful item writing and review, multiple-choice items in the PBLI, SBP, and PROF domains can be successfully incorporated into an annual, national self-assessment examination for residents. The addition of these questions had value in assessing competency while not compromising the overall validity and reliability of the exam. It is yet to be determined if resident performance on these questions corresponds to performance on other measures of competency in the areas of PBLI, SBP, and PROF.
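The two item statistics named in this abstract can be sketched as follows. In item analysis the "P value" is the proportion of examinees answering the item correctly (not a significance level), and the point-biserial index is the Pearson correlation between the 0/1 item score and the total test score. The function names and the sample data are hypothetical, and the sketch assumes the total score is used uncorrected (the item is not removed from the total).

```python
from math import sqrt


def item_difficulty(item_scores):
    """Proportion of examinees who answered the item correctly (item P value)."""
    return sum(item_scores) / len(item_scores)


def point_biserial(item_scores, total_scores):
    """Pearson correlation between a 0/1 item score and the total test score."""
    n = len(item_scores)
    mean_item = sum(item_scores) / n
    mean_total = sum(total_scores) / n
    cov = sum((i - mean_item) * (t - mean_total)
              for i, t in zip(item_scores, total_scores))
    var_item = sum((i - mean_item) ** 2 for i in item_scores)
    var_total = sum((t - mean_total) ** 2 for t in total_scores)
    return cov / sqrt(var_item * var_total)


# Hypothetical data: five examinees, one item, and their total test scores.
item = [1, 1, 0, 1, 0]
totals = [9, 8, 4, 7, 5]
print(item_difficulty(item), round(point_biserial(item, totals), 3))
```

A higher difficulty value means an easier item, and a point-biserial near zero means the item does little to separate stronger from weaker examinees, which matches the "easier and less discriminating" pattern the abstract reports for the early competency questions.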
Cross-cultural standardization of the South Texas Assessment of Neurocognition in India.

PubMed

Cherkil, S; Satish, S; Mathew, S S; Dinesh, N; Kumar, C T S; Lombardo, L E; Glahn, D C; Frangou, S

2012-08-01

Despite the central role of cognition for mental disorders most studies have been conducted in western countries. Similar research from other parts of the world, particularly India, is very limited. As a first step in closing this gap this cross-cultural comparability study of the South Texas Assessment of Neurocognition (STAN) battery was conducted between USA and India. One hundred healthy adults from Kerala, India, were administered six language independent subtests of the Java Neuropsychological Test (JANET) version of the STAN, assessing aspects of general intellectual ability (Matrix Reasoning), attention (Identical Pairs Continuous Performance, 3 Symbol Version Test; IPCPTS), working memory (Spatial Capacity Delayed Response Test; SCAP), response inhibition (Stop Signal Reaction Time; SSRT), Emotional Recognition and Risk taking (Balloon Analogue Risk Task; BART). Test results were compared to a demographically matched US sample. Overall test performance in the Kerala sample was comparable to that of the US sample and commensurate to that generally described in studies from western countries. Our results support the metric equivalence of currently available cognitive test batteries developed in western countries for use in India. However, the sample was restricted to individuals who were literate and had completed basic primary and secondary education.
Environmental and economic comparisons of the satellite power system and six alternative energy technologies

DOE Office of Scientific and Technical Information (OSTI.GOV)

Whitfield, R.G.; Habegger, L.J.; Levine, E.P.

1981-04-01

The objective of the comparative assessment is to provide an initial, traceable and consistent comparison of the SPS and selected current, near-term, and advanced energy technologies. Terrestrial alternatives were selected, and their cost, performance, and environmental and societal attributes were specified for use in the comparison with the SPS in the post-2000 era. The framework for comparisons was established. The SPS was compared with alternative systems in terms of key issues such as life-cycle cost and environmental impacts. The results of the assessments were assembled and integrated into a consistent comparative assessment. Environmental and economic effects are evaluated, which were subdivided into the following issue areas: human health and safety, environmental welfare, resources (land, materials, energy, water, labor), macroeconomics, socioeconomics, and institutional. These evaluations were based on technology characterization data and alternative futures scenarios, which were developed as part of CDEP by supporting studies. The technologies and the scenarios are described. An additional major issue area concerned the cost and performance of the SPS and the alternative technologies: results in this area provided part of the basis of the macroeconomic analyses. 159 references.

Design and evaluation of a miniature laser speckle imaging device to assess gingival health

PubMed Central

Regan, Caitlin; White, Sean M.; Yang, Bruce Y.; Takesh, Thair; Ho, Jessica; Wink, Cherie; Wilder-Smith, Petra; Choi, Bernard

2016-01-01

Current methods used to assess gingivitis are qualitative and subjective. We hypothesized that gingival perfusion measurements could provide a quantitative metric of disease severity. We constructed a compact laser speckle imaging (LSI) system that could be mounted in custom-made oral molds. Rigid fixation of the LSI system in the oral cavity enabled measurement of blood flow in the gingiva. In vitro validation performed in controlled flow phantoms demonstrated that the compact LSI system had comparable accuracy and linearity compared to a conventional bench-top LSI setup. In vivo validation demonstrated that the compact LSI system was capable of measuring expected blood flow dynamics during a standard postocclusive reactive hyperemia and that the compact LSI system could be used to measure gingival blood flow repeatedly without significant variation in measured blood flow values (p<0.05). Finally, compact LSI system measurements were collected from the interdental papilla of nine subjects and compared to a clinical assessment of gingival bleeding on probing. A statistically significant correlation (ρ=0.53; p<0.005) was found between these variables, indicating that quantitative gingival perfusion measurements performed using our system may aid in the diagnosis and prognosis of periodontal disease. PMID:27787545
Design and evaluation of a miniature laser speckle imaging device to assess gingival health

NASA Astrophysics Data System (ADS)

Regan, Caitlin; White, Sean M.; Yang, Bruce Y.; Takesh, Thair; Ho, Jessica; Wink, Cherie; Wilder-Smith, Petra; Choi, Bernard

2016-10-01

Current methods used to assess gingivitis are qualitative and subjective. We hypothesized that gingival perfusion measurements could provide a quantitative metric of disease severity. We constructed a compact laser speckle imaging (LSI) system that could be mounted in custom-made oral molds. Rigid fixation of the LSI system in the oral cavity enabled measurement of blood flow in the gingiva. In vitro validation performed in controlled flow phantoms demonstrated that the compact LSI system had comparable accuracy and linearity compared to a conventional bench-top LSI setup. In vivo validation demonstrated that the compact LSI system was capable of measuring expected blood flow dynamics during a standard postocclusive reactive hyperemia and that the compact LSI system could be used to measure gingival blood flow repeatedly without significant variation in measured blood flow values (p<0.05). Finally, compact LSI system measurements were collected from the interdental papilla of nine subjects and compared to a clinical assessment of gingival bleeding on probing. A statistically significant correlation (ρ=0.53; p<0.005) was found between these variables, indicating that quantitative gingival perfusion measurements performed using our system may aid in the diagnosis and prognosis of periodontal disease.
Comparing Models of Spontaneous Variations, Maneuvers and Indexes to Assess Dynamic Cerebral Autoregulation.

PubMed

Chacón, Max; Noh, Sun-Ho; Landerretche, Jean; Jara, José L

2018-01-01

We analyzed the performance of linear and nonlinear models to assess dynamic cerebral autoregulation (dCA) from spontaneous variations in healthy subjects and compared it with the use of two known maneuvers to abruptly change arterial blood pressure (BP): thigh cuffs and sit-to-stand. Cerebral blood flow velocity and BP were measured simultaneously at rest and while the maneuvers were performed in 20 healthy subjects. To analyze the spontaneous variations, we implemented two types of models using support vector machine (SVM): linear and nonlinear finite impulse response models. The classic autoregulation index (ARI) and the more recently proposed model-free ARI (mfARI) were used as measures of dCA. An ANOVA analysis was applied to compare the different methods and the coefficient of variation was calculated to evaluate their variability. There are differences between indexes, but not between models and maneuvers. The mfARI index with the sit-to-stand maneuver shows the least variability. Support vector machine modeling of spontaneous variation with the mfARI index could be used for the assessment of dCA as an alternative to maneuvers to introduce large BP fluctuations.
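The variability measure mentioned in this abstract, the coefficient of variation, is the standard deviation expressed as a fraction of the mean. A minimal sketch follows; the function name and the index values are hypothetical, and the use of the sample (n-1) standard deviation is an assumption, since the abstract does not say which estimator was used.

```python
from statistics import mean, stdev


def coefficient_of_variation(values):
    """Sample standard deviation divided by the mean (dimensionless)."""
    return stdev(values) / mean(values)


# Hypothetical repeated index values for one subject and one method.
index_values = [5.0, 6.0, 7.0]
print(round(coefficient_of_variation(index_values), 4))
```

Because the result is dimensionless, it allows indexes measured on different scales to be compared for relative spread, which is how the abstract uses it to rank methods by variability.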
Comparative Study on Code-based Linear Evaluation of an Existing RC Building Damaged during 1998 Adana-Ceyhan Earthquake

NASA Astrophysics Data System (ADS)

Toprak, A. Emre; Gülay, F. Gülten; Ruge, Peter

2008-07-01

Determination of the seismic performance of existing buildings has become one of the key concepts in structural analysis after recent earthquakes (e.g. the Izmit and Duzce Earthquakes in 1999, the Kobe Earthquake in 1995 and the Northridge Earthquake in 1994). Considering the need for precise assessment tools to determine seismic performance level, most earthquake-prone countries try to include performance-based assessment in their seismic codes. Recently, Turkish Earthquake Code 2007 (TEC'07), which was put into effect in March 2007, also introduced linear and non-linear assessment procedures to be applied prior to building retrofitting. In this paper, a comparative study is performed on the code-based seismic assessment of RC buildings with linear static methods of analysis, selecting an existing RC building. The basic principles of the seismic performance evaluation procedures for existing RC buildings according to Eurocode 8 and TEC'07 are outlined and compared. The procedure is then applied to a real case-study building that was exposed to the 1998 Adana-Ceyhan Earthquake in Turkey, a seismic action of Ms = 6.3 with a maximum ground acceleration of 0.28 g. It is a six-storey RC residential building with a total height of 14.65 m, composed of orthogonal frames, symmetrical in the y direction and without any significant structural irregularities. The rectangular plan dimensions are 16.40 m × 7.80 m = 127.90 m², with five spans in the x direction and two spans in the y direction. It was reported that the building had been moderately damaged during the 1998 earthquake, and the authorities suggested retrofitting by adding shear walls to the system. The computations show that linear methods of analysis using either Eurocode 8 or TEC'07 independently produce similar performance levels of collapse for the critical storey of the structure. The computed base shear value according to Eurocode 8 is much higher than that required by the Turkish Earthquake Code, although the selected ground conditions represent the same characteristics. The main reason is that the ordinate of the horizontal elastic response spectrum for Eurocode 8 is increased by the soil factor. In TEC'07 force-based linear assessment, the seismic demands at cross-sections are to be checked against residual moment capacities; however, the chord rotations of primary ductile elements must be checked for Eurocode safety verifications. On the other hand, the demand curvatures from the linear methods of analysis of Eurocode 8 and TEC'07 are very similar.
Attention deficits after aneurysmal subarachnoid hemorrhage measured using the test of variables of attention.

PubMed

Wallmark, Svante; Lundström, Erik; Wikström, Johan; Ronne-Engström, Elisabeth

2015-05-01

The aim of this pilot study was to assess attention deficits in patients with aneurysmal subarachnoid hemorrhage using the test of variables of attention (TOVA). This is a computer-based continuous performance test providing objective measures of attention. We also compared the TOVA results with the attention and concentration domains of Montgomery Åsberg Depression Rating Scale and Montreal cognitive assessment, 2 examiner-administrated neuropsychological instruments. Nineteen patients with moderate to good recovery (Glasgow outcome scale, 4-5) were assessed using the TOVA, Montgomery Åsberg Depression Rating Scale, and Montreal cognitive assessment. The measurements were done when the patients visited the hospital for a routine magnetic resonance imaging control of the aneurysm. TOVA performance was pathological in 58%. The dominating pattern was a worsening of performance in the second half of the test, commonly a failing to react to correct stimuli. We found no correlation between TOVA and the performance in concentration and attention domains of Montgomery Åsberg Depression Rating Scale and Montreal cognitive assessment. Attention deficits, measured by the TOVA, were common after subarachnoid hemorrhage. This should be further studied to improve outcome. © 2015 American Heart Association, Inc.
Comparative ergonomic assessment of manual wheelchairs by paraplegic users.

PubMed

Gil-Agudo, Angel; Solís-Mozos, Marta; del-Ama, Antonio J; Crespo-Ruiz, Beatriz; de la Peña-González, Ana Isabel; Pérez-Nombela, Soraya

2013-07-01

The aim of the present study was to describe and test the reliability of a comprehensive product-centered approach to assessing functional performance and wheelchair user perceptions of device ergonomics and satisfaction with performance. A pilot study was implemented using this approach to evaluate differences among four manual wheelchairs. Six wheelchair users with complete spinal cord injury (SCI) at the thoracic level and with no previous upper limb impairment were recruited for this study. After finishing circuit tasks, subjects were asked to complete a questionnaire about ergonomic wheelchair characteristics (manoeuvrability, stability, comfort and ease of propulsion) and satisfaction with task performance. In addition, objective data were recorded during user performance: the time required to complete each test, kinetic wheelchair propulsion data obtained with two SMARTWheels® and physiological parameters (heart rate and physiological index). Kuschall Champion® and Otto Bock Voyage® wheelchairs were ranked best for most ergonomic aspects, especially manoeuvrability (p < 0.05). Less time was required to execute most of the circuit tasks in both wheelchair models (p < 0.05). The proposed approach highlights the importance of considering both kinds of information, user perception and user functional performance, when evaluating a wheelchair or comparing across devices.
Transforming Biology Assessment with Machine Learning: Automated Scoring of Written Evolutionary Explanations

NASA Astrophysics Data System (ADS)

Nehm, Ross H.; Ha, Minsu; Mayfield, Elijah

2012-02-01

This study explored the use of machine learning to automatically evaluate the accuracy of students' written explanations of evolutionary change. Performance of the Summarization Integrated Development Environment (SIDE) program was compared to human expert scoring using a corpus of 2,260 evolutionary explanations written by 565 undergraduate students in response to two different evolution instruments (the EGALT-F and EGALT-P) that contained prompts that differed in various surface features (such as species and traits). We tested human-SIDE scoring correspondence under a series of different training and testing conditions, using Kappa inter-rater agreement values of greater than 0.80 as a performance benchmark. In addition, we examined the effects of response length on scoring success; that is, whether SIDE scoring models functioned with comparable success on short and long responses. We found that SIDE performance was most effective when scoring models were built and tested at the individual item level and that performance degraded when suites of items or entire instruments were used to build and test scoring models. Overall, SIDE was found to be a powerful and cost-effective tool for assessing student knowledge and performance in a complex science domain.
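The agreement benchmark described in this abstract can be illustrated with a small kappa calculation. The sketch assumes Cohen's kappa for two raters (human and machine) over categorical scores; the abstract says only "Kappa inter-rater agreement", so the exact variant is an assumption, and the function name and the scores below are hypothetical.

```python
from collections import Counter


def cohens_kappa(rater_a, rater_b):
    """Chance-corrected agreement between two raters over the same items.

    Assumes the raters do not both assign a single identical label to
    every item (expected agreement of 1 would divide by zero).
    """
    n = len(rater_a)
    # Observed agreement: share of items given the same label by both raters.
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Expected agreement if each rater labelled independently at their own rates.
    freq_a = Counter(rater_a)
    freq_b = Counter(rater_b)
    expected = sum(freq_a[label] * freq_b[label] for label in freq_a) / (n * n)
    return (observed - expected) / (1 - expected)


# Hypothetical binary scores (1 = concept present) for ten responses.
human = [1, 1, 0, 0, 1, 0, 1, 1, 0, 0]
machine = [1, 1, 0, 0, 1, 0, 1, 0, 0, 0]
kappa = cohens_kappa(human, machine)
print(round(kappa, 2), kappa > 0.80)
```

In this made-up example the raters agree on nine of ten responses, yet kappa lands at the benchmark rather than above it, which shows why raw percent agreement overstates correspondence when chance agreement is high.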
Social competence and learning difficulties: Teacher perceptions.

PubMed

Wight, Megan; Chapparo, Christine

2008-12-01

Social competence has been linked to children's classroom performance with three out of four children with learning difficulties reported to have problems with social skills. Social participation remains a predominant childhood occupation and a key indicator of school performance. Occupational therapists work with teachers to accurately assess the social performance of children in context and to provide targeted intervention. There is limited research about what teachers perceive are the specific nature of social difficulties experienced by children with learning difficulties in the classroom. This study investigated teacher perceptions of the social competence of a small sample of Australian boys with learning difficulties within the classroom context. The Teacher Skillstreaming Checklist was used to investigate teacher perceptions of the social abilities of 21 primary school aged boys with learning difficulties compared to a control group. A correlational analysis was used to examine the relationship. The study identified that the boys with learning difficulties were perceived by their teachers as having poorer social performance across multiple domains when compared to their typically developing peers. Implications of these findings are that children's social performance may negatively impact learning and classroom participation and that for some children, social competence should be a focus of occupational therapy assessment and treatment.
Persian version of frontal assessment battery: Correlations with formal measures of executive functioning and providing normative data for Persian population.

PubMed

Asaadi, Sina; Ashrafi, Farzad; Omidbeigi, Mahmoud; Nasiri, Zahra; Pakdaman, Hossein; Amini-Harandi, Ali

2016-01-05

Cognitive impairment in patients with Parkinson's disease (PD) mainly involves executive function (EF). The frontal assessment battery (FAB) is an efficient tool for the assessment of EFs. The aims of this study were to determine the validity and reliability of the psychometric properties of the Persian version of the FAB and assess its correlation with formal measures of EFs to provide normative data for the Persian version of the FAB in patients with PD. The study recruited 149 healthy participants and 49 patients with idiopathic PD. In PD patients, FAB results were compared to their performance on EF tests. Reliability analysis involved test-retest reliability and internal consistency, whereas validity analysis involved a convergent validity approach. FAB scores were compared between normal controls and PD patients matched for age, education, and Mini-Mental State Examination (MMSE) score. In PD patients, FAB scores were significantly decreased compared to normal controls, and correlated with the Stroop test and the Wisconsin Card Sorting Test (WCST). In healthy subjects, FAB scores varied according to age, education, and MMSE. In the FAB subtest analysis, the performance of PD patients was worse than that of the healthy participants on similarities, fluency tasks, and Luria's motor series. The Persian version of the FAB could be used as a reliable scale for the assessment of frontal lobe functions in Iranian patients with PD. Furthermore, normative data provided for the Persian version of this test improve the accuracy and confidence in the clinical application of the FAB.
Communities of Practice and PISA for Schools: Comparative Learning or a Mode of Educational Governance?

ERIC Educational Resources Information Center

Lewis, Steven

2017-01-01

This paper examines the Organization for Economic Cooperation and Development's (OECD) "PISA for Schools," a new variant of the Programme for International Student Assessment (PISA) that compares school-level performance on reading, math and science with international schooling systems (e.g., Shanghai-China, Finland). Specifically, I…
Making Sense of Student Performance Data: Data Use Logics and Mathematics Teachers' Learning Opportunities

ERIC Educational Resources Information Center

Horn, Ilana Seidel; Kane, Britnie Delinger; Wilson, Jonee

2015-01-01

In the accountability era, educators are pressed to use evidence-based practice. In this comparative case study, we examine the learning opportunities afforded by teachers' data use conversations. Using situated discourse analysis, we compare two middle school mathematics teacher workgroups interpreting data from the same district assessment.…
Does the Podcast Video Playback Speed Affect Comprehension for Novel Curriculum Delivery? A Randomized Trial.

PubMed

Song, Kristine; Chakraborty, Amit; Dawson, Matthew; Dugan, Adam; Adkins, Brian; Doty, Christopher

2018-01-01

Medical education is a rapidly evolving field that has been using new technology to improve how medical students learn. One of the recent implementations in medical education is the recording of lectures for the purpose of playback at various speeds. Though previous studies done via surveys have shown a subjective increase in the rate of knowledge acquisition when learning from sped-up lectures, no quantitative studies have measured information retention. The purpose of this study was to compare mean test scores on written assessments to objectively determine if watching a video of a recorded lecture at 1.5× speed was significantly different than 1.0× speed for the immediate retention of novel material. Fifty-four University of Kentucky medical students volunteered to participate in this study. The subjects were divided into two separate groups: Group A and Group B. Each group watched two separate videos, the first at 1.5× speed and the second at 1.0× speed, then completed assessments following each. The topics of the two videos were ultrasonography artifacts and transducers. Group A watched the artifacts video first at 1.5× speed followed by the transducers video at 1.0× speed. Group B watched the transducers video first at 1.5× speed followed by the artifacts video at 1.0× speed. The percentage correct on the written assessment was calculated for each subject at each video speed. The mean and standard deviation were also calculated, and a t-test was used to determine if there was a significant difference in assessment scores between 1.5× and 1.0× speeds. There was a significant (p=0.0188) detriment in performance on the artifacts quiz at 1.5× speed (mean 61.4; 95% confidence interval [CI] 53.9, 68.9) compared to the control group at normal speed (mean 72.7; 95% CI 66.8, 78.6). On the transducers assessment, there was not a significant (p=0.1365) difference in performance in the 1.5× speed group (mean 66.9; CI 59.8, 74.0) compared to the control group (mean 73.8; CI 67.7, 79.8). These findings suggest that, unlike previously published studies that showed subjective improvement in performance with sped-up video-recorded lectures compared to normal speed, objective performance may be worse.
Improving patient safety culture in Saudi Arabia (2012-2015): trending, improvement and benchmarking.

PubMed

Alswat, Khalid; Abdalla, Rawia Ahmad Mustafa; Titi, Maher Abdelraheim; Bakash, Maram; Mehmood, Faiza; Zubairi, Beena; Jamal, Diana; El-Jardali, Fadi

2017-08-02

Measuring patient safety culture can provide insight into areas for improvement and help monitor changes over time. This study details the findings of a re-assessment of patient safety culture in a multi-site Medical City in Riyadh, Kingdom of Saudi Arabia (KSA). Results were compared to an earlier assessment conducted in 2012 and benchmarked with regional and international studies. Such assessments can provide hospital leadership with insight on how their hospital is performing on patient safety culture composites as a result of quality improvement plans. This paper also explored the association between patient safety culture predictors and patient safety grade, perception of patient safety, frequency of events reported and number of events reported. We utilized a customized version of the patient safety culture survey developed by the Agency for Healthcare Research and Quality. The Medical City is a tertiary care teaching facility composed of two sites (total capacity of 904 beds). Data were analyzed using SPSS 24 at a significance level of 0.05. A t-test was used to compare results from the 2012 survey to those from the survey conducted in 2015. Two adopted Generalized Estimating Equations in addition to two linear models were used to assess the association between composites and patient safety culture outcomes. Results were also benchmarked against similar initiatives in Lebanon, Palestine and the USA. Areas of strength in 2015 included Teamwork within units, and Organizational Learning-Continuous Improvement; areas requiring improvement included Non-Punitive Response to Error, and Staffing. Comparing results to the 2012 survey revealed improvement in some areas, but Non-Punitive Response to Error and Staffing remained the lowest scoring composites in 2015. Regression highlighted significant associations between managerial support, organizational learning and feedback and improved survey outcomes.
Comparison to international benchmarks revealed that the hospital is performing at or better than benchmark on several composites. The Medical City has made significant progress on several of the patient safety culture composites despite still having areas requiring additional improvement. Patient safety culture outcomes are evidently linked to better performance on specific composites. While results are comparable with regional and international benchmarks, findings confirm that regular assessment can allow hospitals to better understand and visualize changes in their performance and identify additional areas for improvement.
Development of bimanual performance in young children with cerebral palsy.

PubMed

Klevberg, Gunvor L; Elvrum, Ann-Kristin G; Zucknick, Manuela; Elkjaer, Sonja; Østensjø, Sigrid; Krumlinde-Sundholm, Lena; Kjeken, Ingvild; Jahnsen, Reidun

2018-05-01

To describe the development of bimanual performance among young children with unilateral or bilateral cerebral palsy (CP). A population-based sample of 102 children (53 males, 49 females), median age 28.5 months (interquartile range [IQR] 16mo) at first assessment and 47 months (IQR 18mo) at last assessment, was assessed half-yearly with the Assisting Hand Assessment (AHA) or the Both Hands Assessment (BoHA) for a total of 329 assessments. Developmental limits and rates were estimated by nonlinear mixed-effects models. Developmental trajectories were compared between levels of manual ability (Mini-Manual Ability Classification System [Mini-MACS] and MACS) and AHA or BoHA performance at 18 months of age (AHA-18/BoHA-18) for both CP subgroups, and additionally between children with bilateral CP with symmetric or asymmetric hand use. For both CP subgroups, children classified in Mini-MACS/MACS level I, and those with high AHA-18 or BoHA-18 reached the highest limits of performance. For children with bilateral CP the developmental change was small, and children with symmetric hand use reached the highest limits. Mini-MACS/MACS levels and AHA-18 or BoHA-18 distinguished between various developmental trajectories both for children with unilateral and bilateral CP. Children with bilateral CP changed their performance to a smaller extent than children with unilateral CP. Manual Ability Classification System levels and Assisting Hand Assessment/Both Hands Assessment performance at 18 months are important predictors of hand use development in cerebral palsy (CP). Children with bilateral CP improved less than those with unilateral CP. Children with bilateral CP and symmetric hand use reached higher limits than those with asymmetry. © 2018 Mac Keith Press.
Enabling performance skills: Assessment in engineering education

NASA Astrophysics Data System (ADS)

Ferrone, Jenny Kristina

Current reform in engineering education is part of a national trend emphasizing student learning as well as accountability in instruction. Assessing student performance to demonstrate accountability has become a necessity in academia. Under newly adopted criteria proposed by the Accreditation Board for Engineering and Technology (ABET), undergraduates are expected to demonstrate proficiency in outcomes considered essential for graduating engineers. The case study was designed as a formative evaluation of freshman engineering students to assess the perceived effectiveness of performance skills in a design laboratory environment. The mixed methodology used both quantitative and qualitative approaches to assess students' performance skills and congruency among the respondents, based on individual, team, and faculty perceptions of team effectiveness in three ABET areas: Communications Skills, Design Skills, and Teamwork. The findings of the research were used to address future use of the assessment tool and process. The study found statistically significant differences in perceptions of Teamwork Skills (p < .05). When groups composed of students and professors were compared, professors were less likely to perceive students' teaming skills as effective. The study indicated the need to: (1) improve non-technical performance skills, such as teamwork, among freshman engineering students; (2) incorporate feedback into the learning process; (3) strengthen the assessment process with a follow-up plan that specifically targets performance skill deficiencies; and (4) integrate the assessment instrument and practice with ongoing curriculum development. The findings generated by this study give engineering departments engaged in assessment activity an opportunity to reflect on, refine, and develop their programs as that activity continues.
It also extends research on ABET competencies of engineering students in an under-investigated topic of factors correlated with team processes, behavior, and student learning.
A structured framework improves clinical patient assessment and nontechnical skills of early career emergency nurses: a pre-post study using full immersion simulation.

PubMed

Munroe, Belinda; Curtis, Kate; Murphy, Margaret; Strachan, Luke; Considine, Julie; Hardy, Jennifer; Wilson, Mark; Ruperto, Kate; Fethney, Judith; Buckley, Thomas

2016-08-01

The aim of this study was to evaluate the effect of the new evidence-informed nursing assessment framework HIRAID (History, Identify Red flags, Assessment, Interventions, Diagnostics, reassessment and communication) on the quality of patient assessment and fundamental nontechnical skills including communication, decision making, task management and situational awareness. Assessment is a core component of nursing practice and underpins clinical decisions and the safe delivery of patient care. Yet there is no universal or validated system used to teach emergency nurses how to comprehensively assess and care for patients. A pre-post design was used. The performance of thirty-eight emergency nurses from five Australian hospitals was evaluated before and after undertaking education in the application of the HIRAID assessment framework. Video recordings of participant performance in immersive simulations of common presentations to the emergency department were evaluated, as well as participant documentation during the simulations. Paired parametric and nonparametric tests were used to compare changes from pre- to postintervention. From pre- to postintervention, participant performance increases were observed in the percentage of patient history elements collected, critical indicators of urgency collected and reported to medical officers, and patient reassessments performed. Participants also demonstrated improvement in each of the four nontechnical skills categories: communication, decision making, task management and situational awareness. The HIRAID assessment framework improves clinical patient assessments performed by emergency nurses and has the potential to enhance patient care. HIRAID should be considered for integration into clinical practice to provide nurses with a systematic approach to patient assessment and potentially improve the delivery of safe patient care. © 2016 John Wiley & Sons Ltd.
Personality Assessment Inventory Profiles of Deployed Combat Troops: An Empirical Investigation of Normative Performance

ERIC Educational Resources Information Center

Morey, Leslie C.; Lowmaster, Sara E.; Coldren, Rodney L.; Kelly, Mark P.; Parish, Robert V.; Russell, Michael L.

2011-01-01

The present study examined the normative scores and psychometric properties of the Personality Assessment Inventory (PAI; Morey, 1991) within a non-treatment-seeking sample of soldiers deployed to combat zones in Iraq, compared with a sample of community adults matched with respect to age and gender. Results indicate the scores and properties of…
The Cost-Effectiveness of Replacing the Bottom Quartile of Novice Teachers through Value-Added Teacher Assessment

ERIC Educational Resources Information Center

Yeh, Stuart S.; Ritter, Joseph

2009-01-01

A cost-effectiveness analysis was conducted of Gordon, Kane, and Staiger's (2006) proposal to raise student achievement by identifying and replacing the bottom quartile of novice teachers, using value-added assessment of teacher performance. The cost effectiveness of this proposal was compared to the cost effectiveness of voucher programs, charter…
Psychometric Comparisons of Three Measures for Assessing Motor Functions in Preschoolers with Intellectual Disabilities

ERIC Educational Resources Information Center

Wuang, Y-P.; Su, C-Y.; Huang, M-H.

2012-01-01

Background: Deficit in motor performance is common in children with intellectual disabilities (ID). A motor function measure with sound psychometric properties is indispensable for clinical and research use. The purpose of this study was to compare the psychometric properties of three commonly used clinical measures for assessing motor function in…
Correspondence between Gonadal Steroid Hormone Concentrations and Secondary Sexual Characteristics Assessed by Clinicians, Adolescents, and Parents

ERIC Educational Resources Information Center

Huang, Bin; Hillman, Jennifer; Biro, Frank M.; Ding, Lili; Dorn, Lorah D.; Susman, Elizabeth J.

2012-01-01

Adolescent sexual maturation is staged using Tanner criteria assessed by clinicians, parents, or adolescents. The physiology of sexual maturation is driven by gonadal hormones. We investigate Tanner stage progression as a function of increasing gonadal hormone concentration and compare performances of different raters. Fifty-six boys (mean age,…
