ERIC Educational Resources Information Center
Shriver, Edgar L.; Foley, John P., Jr.
A battery of criterion referenced Job Task Performance Tests (JTPT) was developed because paper and pencil tests of job knowledge and electronic theory had very poor criterion-related or empirical validity with respect to the ability of electronic maintenance men to perform their job. Although the original JTPT required the use of actual…
ERIC Educational Resources Information Center
Beretvas, S. Natasha; Murphy, Daniel L.
2013-01-01
The authors assessed correct model identification rates of Akaike's information criterion (AIC), corrected criterion (AICC), consistent AIC (CAIC), Hannon and Quinn's information criterion (HQIC), and Bayesian information criterion (BIC) for selecting among cross-classified random effects models. Performance of default values for the 5…
Criterion vs. Norm-referenced Testing.
ERIC Educational Resources Information Center
Pimsleur, Paul
1975-01-01
A norm-referenced evaluation system, which evaluates the student in comparison to his peers, is rejected in favor of a criterion-referenced system. The latter, which rates the performance of a student on an absolute standard, makes for an individualized approach. Two kinds of tests are distinguished, the formative, administered during the course…
Performance index and meta-optimization of a direct search optimization method
NASA Astrophysics Data System (ADS)
Krus, P.; Ölvander, J.
2013-10-01
Design optimization is becoming an increasingly important tool for design, often using simulation as part of the evaluation of the objective function. A measure of the efficiency of an optimization algorithm is of great importance when comparing methods. The main contribution of this article is the introduction of a singular performance criterion, the entropy rate index based on Shannon's information theory, taking both reliability and rate of convergence into account. It can also be used to characterize the difficulty of different optimization problems. Such a performance criterion can also be used for optimization of the optimization algorithms itself. In this article the Complex-RF optimization method is described and its performance evaluated and optimized using the established performance criterion. Finally, in order to be able to predict the resources needed for optimization an objective function temperament factor is defined that indicates the degree of difficulty of the objective function.
Evaluation of Self-Perceptions of Creativity: Is It a Useful Criterion?
ERIC Educational Resources Information Center
Reiter-Palmon, Roni; Robinson-Morral, Erika J.; Kaufman, James C.; Santo, Jonathan B.
2012-01-01
Self-evaluations or self-perceptions of creativity have been used in the past both as predictors of creative performance and as criteria. Four measures utilizing self-perceptions of creativity were assessed for their usefulness as criterion measures of creativity. Analyses provided evidence of domain specificity of self-perceptions. The scales…
Evaluation of Regression Models of Balance Calibration Data Using an Empirical Criterion
NASA Technical Reports Server (NTRS)
Ulbrich, Norbert; Volden, Thomas R.
2012-01-01
An empirical criterion for assessing the significance of individual terms of regression models of wind tunnel strain gage balance outputs is evaluated. The criterion is based on the percent contribution of a regression model term. It considers a term to be significant if its percent contribution exceeds the empirical threshold of 0.05%. The criterion has the advantage that it can easily be computed using the regression coefficients of the gage outputs and the load capacities of the balance. First, a definition of the empirical criterion is provided. Then, it is compared with an alternate statistical criterion that is widely used in regression analysis. Finally, calibration data sets from a variety of balances are used to illustrate the connection between the empirical and the statistical criterion. A review of these results indicated that the empirical criterion seems to be suitable for a crude assessment of the significance of a regression model term as the boundary between a significant and an insignificant term cannot be defined very well. Therefore, regression model term reduction should only be performed by using the more universally applicable statistical criterion.
ERIC Educational Resources Information Center
Shriver, Edgar L.; And Others
This document furnishes a complete copy of the Test Subject's Instructions and the Test Administrator's Handbook for a battery of criterion referenced Job Task Performance Tests (JTPT) for electronic maintenance. General information is provided on soldering, Radar Set AN/APN-147(v), Radar Set Special Equipment, Radar Set Bench Test Set-Up, and…
ERIC Educational Resources Information Center
Tibbetts, Katherine A.; And Others
This paper describes the development of a criterion-referenced, performance-based measure of third grade reading comprehension. The primary purpose of the assessment is to contribute unique and valid information for use in the formative evaluation of a whole literacy program. A secondary purpose is to supplement other program efforts to…
Failure Study of Composite Materials by the Yeh-Stratton Criterion
NASA Technical Reports Server (NTRS)
Yeh, Hsien-Yang; Richards, W. Lance
1997-01-01
The newly developed Yeh-Stratton (Y-S) Strength Criterion was used to study the failure of composite materials with central holes and normal cracks. To evaluate the interaction parameters for the Y-S failure theory, it is necessary to perform several biaxial loading tests. However, it is indisputable that the inhomogeneous and anisotropic nature of composite materials have made their own contribution to the complication of the biaxial testing problem. To avoid the difficulties of performing many biaxial tests and still consider the effects of the interaction term in the Y-S Criterion, a simple modification of the Y-S Criterion was developed. The preliminary predictions by the modified Y-S Criterion were relatively conservative compared to the testing data. Thus, the modified Y-S Criterion could be used as a design tool. To further understand the composite failure problem, an investigation of the damage zone in front of the crack tip coupled with the Y-S Criterion is imperative.
ERIC Educational Resources Information Center
Shriver, Edgar L.; Foley, John P., Jr.
A battery of criterion referenced job task performance tests (JIPT) for typical electronic maintenance activities were developed. The construction of a battery of such tests together with an appropriate scoring for reporting the results is detailed. The development of a Test Administrators Handbook also is described. This battery is considered to…
NASA Astrophysics Data System (ADS)
Huang, Xiangsheng; Zhong, Mingqiu; Li, Ying; Yang, Hongyuan
2018-05-01
High-power of the offshore wind turbine is in the early stage of development, then how to establish a scientific and impartial performance evaluation system of the offshore wind turbine becomes the key to the health development of the industry. This paper adopts the method of multi-level analysis and site testing, which can reduce the impact of human factors on evaluation to the most extent. A more reasonable judging criterion with the relative importance of different factors of the same criterion level is also put forward, which constructs a more scientific and fair evaluation system of the high-power offshore wind turbine.
NASA Astrophysics Data System (ADS)
Lehmann, Thomas M.
2002-05-01
Reliable evaluation of medical image processing is of major importance for routine applications. Nonetheless, evaluation is often omitted or methodically defective when novel approaches or algorithms are introduced. Adopted from medical diagnosis, we define the following criteria to classify reference standards: 1. Reliance, if the generation or capturing of test images for evaluation follows an exactly determined and reproducible protocol. 2. Equivalence, if the image material or relationships considered within an algorithmic reference standard equal real-life data with respect to structure, noise, or other parameters of importance. 3. Independence, if any reference standard relies on a different procedure than that to be evaluated, or on other images or image modalities than that used routinely. This criterion bans the simultaneous use of one image for both, training and test phase. 4. Relevance, if the algorithm to be evaluated is self-reproducible. If random parameters or optimization strategies are applied, reliability of the algorithm must be shown before the reference standard is applied for evaluation. 5. Significance, if the number of reference standard images that are used for evaluation is sufficient large to enable statistically founded analysis. We demand that a true gold standard must satisfy the Criteria 1 to 3. Any standard only satisfying two criteria, i.e., Criterion 1 and Criterion 2 or Criterion 1 and Criterion 3, is referred to as silver standard. Other standards are termed to be from plastic. Before exhaustive evaluation based on gold or silver standards is performed, its relevance must be shown (Criterion 4) and sufficient tests must be carried out to found statistical analysis (Criterion 5). In this paper, examples are given for each class of reference standards.
NASA Technical Reports Server (NTRS)
Parada, N. D. J. (Principal Investigator); Dutra, L. V.; Mascarenhas, N. D. A.; Mitsuo, Fernando Augusta, II
1984-01-01
A study area near Ribeirao Preto in Sao Paulo state was selected, with predominance in sugar cane. Eight features were extracted from the 4 original bands of LANDSAT image, using low-pass and high-pass filtering to obtain spatial features. There were 5 training sites in order to acquire the necessary parameters. Two groups of four channels were selected from 12 channels using JM-distance and entropy criterions. The number of selected channels was defined by physical restrictions of the image analyzer and computacional costs. The evaluation was performed by extracting the confusion matrix for training and tests areas, with a maximum likelihood classifier, and by defining performance indexes based on those matrixes for each group of channels. Results show that in spatial features and supervised classification, the entropy criterion is better in the sense that allows a more accurate and generalized definition of class signature. On the other hand, JM-distance criterion strongly reduces the misclassification within training areas.
Reliability and Validity of the Professional Counseling Performance Evaluation
ERIC Educational Resources Information Center
Shepherd, J. Brad; Britton, Paula J.; Kress, Victoria E.
2008-01-01
The definition and measurement of counsellor trainee competency is an issue that has received increased attention yet lacks quantitative study. This research evaluates item responses, scale reliability and intercorrelations, interrater agreement, and criterion-related validity of the Professional Performance Fitness Evaluation/Professional…
Improved optical design of nontracking concentrators
NASA Astrophysics Data System (ADS)
Kwan, B. M.; Bannerot, R. B.
1984-08-01
Optical designs based on a two reflections or less criterion have been developed for one and two-facet trapezoidal concentrators. Collector designs resulting from this criterion have been evaluated with the aid of a ray-trace computer simulation which includes the effects of nonideal reflectors. Results indicate a marked increase in performance, particularly for the one-facet designs, as compared to the collectors previously designed with the one reflection or less criterion. A significant result is that when a proper accounting is made for the actual acceptance angle for the concentrators, the performances of the optimal one and two-facet designs become nearly identical, indicating that the previously held contention that improved performance could be achieved with multifaceted reflectors (geometrically approaching the compound parabolic shape) may be incorrect.
12 CFR 228.41 - Assessment area delineation.
Code of Federal Regulations, 2010 CFR
2010-01-01
... does not evaluate the bank's delineation of its assessment area(s) as a separate performance criterion..., such as those consumer loans on which the bank elects to have its performance assessed). (d... area(s) delineated by a bank in its evaluation of the bank's CRA performance unless the Board...
Physical employment standards for U.K. fire and rescue service personnel.
Blacker, S D; Rayson, M P; Wilkinson, D M; Carter, J M; Nevill, A M; Richmond, V L
2016-01-01
Evidence-based physical employment standards are vital for recruiting, training and maintaining the operational effectiveness of personnel in physically demanding occupations. (i) Develop criterion tests for in-service physical assessment, which simulate the role-related physical demands of UK fire and rescue service (UK FRS) personnel. (ii) Develop practical physical selection tests for FRS applicants. (iii) Evaluate the validity of the selection tests to predict criterion test performance. Stage 1: we conducted a physical demands analysis involving seven workshops and an expert panel to document the key physical tasks required of UK FRS personnel and to develop 'criterion' and 'selection' tests. Stage 2: we measured the performance of 137 trainee and 50 trained UK FRS personnel on selection, criterion and 'field' measures of aerobic power, strength and body size. Statistical models were developed to predict criterion test performance. Stage 3: matter experts derived minimum performance standards. We developed single person simulations of the key physical tasks required of UK FRS personnel as criterion and selection tests (rural fire, domestic fire, ladder lift, ladder extension, ladder climb, pump assembly, enclosed space search). Selection tests were marginally stronger predictors of criterion test performance (r = 0.88-0.94, 95% Limits of Agreement [LoA] 7.6-14.0%) than field test scores (r = 0.84-0.94, 95% LoA 8.0-19.8%) and offered greater face and content validity and more practical implementation. This study outlines the development of role-related, gender-free physical employment tests for the UK FRS, which conform to equal opportunities law. © The Author 2015. Published by Oxford University Press on behalf of the Society of Occupational Medicine. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Final Pilot Performance Rating Scales.
ERIC Educational Resources Information Center
Horner, Walter R.; And Others
These rating scales are intended for evaluation of student pilot performance. Each student is evaluated individually on the basis of video recordings of the student in flight. Ten point rating lines are used for the ten criterion performance elements of each of three maneuvers, (1) Final Turn to Landing, (2) Lazy Eight, and (3) Vertical S "A".…
The cross-validated AUC for MCP-logistic regression with high-dimensional data.
Jiang, Dingfeng; Huang, Jian; Zhang, Ying
2013-10-01
We propose a cross-validated area under the receiving operator characteristic (ROC) curve (CV-AUC) criterion for tuning parameter selection for penalized methods in sparse, high-dimensional logistic regression models. We use this criterion in combination with the minimax concave penalty (MCP) method for variable selection. The CV-AUC criterion is specifically designed for optimizing the classification performance for binary outcome data. To implement the proposed approach, we derive an efficient coordinate descent algorithm to compute the MCP-logistic regression solution surface. Simulation studies are conducted to evaluate the finite sample performance of the proposed method and its comparison with the existing methods including the Akaike information criterion (AIC), Bayesian information criterion (BIC) or Extended BIC (EBIC). The model selected based on the CV-AUC criterion tends to have a larger predictive AUC and smaller classification error than those with tuning parameters selected using the AIC, BIC or EBIC. We illustrate the application of the MCP-logistic regression with the CV-AUC criterion on three microarray datasets from the studies that attempt to identify genes related to cancers. Our simulation studies and data examples demonstrate that the CV-AUC is an attractive method for tuning parameter selection for penalized methods in high-dimensional logistic regression models.
Generalized Majority Logic Criterion to Analyze the Statistical Strength of S-Boxes
NASA Astrophysics Data System (ADS)
Hussain, Iqtadar; Shah, Tariq; Gondal, Muhammad Asif; Mahmood, Hasan
2012-05-01
The majority logic criterion is applicable in the evaluation process of substitution boxes used in the advanced encryption standard (AES). The performance of modified or advanced substitution boxes is predicted by processing the results of statistical analysis by the majority logic criteria. In this paper, we use the majority logic criteria to analyze some popular and prevailing substitution boxes used in encryption processes. In particular, the majority logic criterion is applied to AES, affine power affine (APA), Gray, Lui J, residue prime, S8 AES, Skipjack, and Xyi substitution boxes. The majority logic criterion is further extended into a generalized majority logic criterion which has a broader spectrum of analyzing the effectiveness of substitution boxes in image encryption applications. The integral components of the statistical analyses used for the generalized majority logic criterion are derived from results of entropy analysis, contrast analysis, correlation analysis, homogeneity analysis, energy analysis, and mean of absolute deviation (MAD) analysis.
Link, W.A.; Armitage, Peter; Colton, Theodore
1998-01-01
Unbiasedness is probably the best known criterion for evaluating the performance of estimators. This note describes unbiasedness, demonstrating various failings of the criterion. It is shown that unbiased estimators might not exist, or might not be unique; an example of a unique but clearly unacceptable unbiased estimator is given. It is shown that unbiased estimators are not translation invariant. Various alternative criteria are described, and are illustrated through examples.
The Arthroscopic Surgical Skill Evaluation Tool (ASSET).
Koehler, Ryan J; Amsdell, Simon; Arendt, Elizabeth A; Bisson, Leslie J; Braman, Jonathan P; Bramen, Jonathan P; Butler, Aaron; Cosgarea, Andrew J; Harner, Christopher D; Garrett, William E; Olson, Tyson; Warme, Winston J; Nicandri, Gregg T
2013-06-01
Surgeries employing arthroscopic techniques are among the most commonly performed in orthopaedic clinical practice; however, valid and reliable methods of assessing the arthroscopic skill of orthopaedic surgeons are lacking. The Arthroscopic Surgery Skill Evaluation Tool (ASSET) will demonstrate content validity, concurrent criterion-oriented validity, and reliability when used to assess the technical ability of surgeons performing diagnostic knee arthroscopic surgery on cadaveric specimens. Cross-sectional study; Level of evidence, 3. Content validity was determined by a group of 7 experts using the Delphi method. Intra-articular performance of a right and left diagnostic knee arthroscopic procedure was recorded for 28 residents and 2 sports medicine fellowship-trained attending surgeons. Surgeon performance was assessed by 2 blinded raters using the ASSET. Concurrent criterion-oriented validity, interrater reliability, and test-retest reliability were evaluated. Content validity: The content development group identified 8 arthroscopic skill domains to evaluate using the ASSET. Concurrent criterion-oriented validity: Significant differences in the total ASSET score (P < .05) between novice, intermediate, and advanced experience groups were identified. Interrater reliability: The ASSET scores assigned by each rater were strongly correlated (r = 0.91, P < .01), and the intraclass correlation coefficient between raters for the total ASSET score was 0.90. Test-retest reliability: There was a significant correlation between ASSET scores for both procedures attempted by each surgeon (r = 0.79, P < .01). The ASSET appears to be a useful, valid, and reliable method for assessing surgeon performance of diagnostic knee arthroscopic surgery in cadaveric specimens. Studies are ongoing to determine its generalizability to other procedures as well as to the live operating room and other simulated environments.
Integration of an EEG biomarker with a clinician's ADHD evaluation
Snyder, Steven M; Rugino, Thomas A; Hornig, Mady; Stein, Mark A
2015-01-01
Background This study is the first to evaluate an assessment aid for attention-deficit/hyperactivity disorder (ADHD) according to both Class-I evidence standards of American Academy of Neurology and De Novo requirements of US Food and Drug Administration. The assessment aid involves a method to integrate an electroencephalographic (EEG) biomarker, theta/beta ratio (TBR), with a clinician's ADHD evaluation. The integration method is intended as a step to help improve certainty with criterion E (i.e., whether symptoms are better explained by another condition). Methods To evaluate the assessment aid, investigators conducted a prospective, triple-blinded, 13-site, clinical cohort study. Comprehensive clinical evaluation data were obtained from 275 children and adolescents presenting with attentional and behavioral concerns. A qualified clinician at each site performed differential diagnosis. EEG was collected by separate teams. The reference standard was consensus diagnosis by an independent, multidisciplinary team (psychiatrist, psychologist, and neurodevelopmental pediatrician), which is well-suited to evaluate criterion E in a complex clinical population. Results Of 209 patients meeting ADHD criteria per a site clinician's judgment, 93 were separately found by the multidisciplinary team to be less likely to meet criterion E, implying possible overdiagnosis by clinicians in 34% of the total clinical sample (93/275). Of those 93, 91% were also identified by EEG, showing a relatively lower TBR (85/93). Further, the integration method was in 97% agreement with the multidisciplinary team in the resolution of a clinician's uncertain cases (35/36). TBR showed statistical power specific to supporting certainty of criterion E per the multidisciplinary team (Cohen's d, 1.53). Patients with relatively lower TBR were more likely to have other conditions that could affect criterion E certainty (10 significant results; P ≤ 0.05). Integration of this information with a clinician's ADHD evaluation could help improve diagnostic accuracy from 61% to 88%. Conclusions The EEG-based assessment aid may help improve accuracy of ADHD diagnosis by supporting greater criterion E certainty. PMID:25798338
Ethical leadership: meta-analytic evidence of criterion-related and incremental validity.
Ng, Thomas W H; Feldman, Daniel C
2015-05-01
This study examines the criterion-related and incremental validity of ethical leadership (EL) with meta-analytic data. Across 101 samples published over the last 15 years (N = 29,620), we observed that EL demonstrated acceptable criterion-related validity with variables that tap followers' job attitudes, job performance, and evaluations of their leaders. Further, followers' trust in the leader mediated the relationships of EL with job attitudes and performance. In terms of incremental validity, we found that EL significantly, albeit weakly in some cases, predicted task performance, citizenship behavior, and counterproductive work behavior-even after controlling for the effects of such variables as transformational leadership, use of contingent rewards, management by exception, interactional fairness, and destructive leadership. The article concludes with a discussion of ways to strengthen the incremental validity of EL. (PsycINFO Database Record (c) 2015 APA, all rights reserved).
ERIC Educational Resources Information Center
Fagan, W. T.
1978-01-01
The Canadian Institute for Research in Behavioral and Social Sciences of Calgary was awarded a contract by the Provincial Government of Alberta to assess student skills and knowledge in reading and written composition. Here evaluation is defined and the use of standardized and criterion referenced tests for evaluating reading performance are…
CA-125 AUC as a predictor for epithelial ovarian cancer relapse.
Mano, António; Falcão, Amílcar; Godinho, Isabel; Santos, Jorge; Leitão, Fátima; de Oliveira, Carlos; Caramona, Margarida
2008-01-01
The aim of the present work was to evaluate the usefulness of CA-125 normalized in time area under the curve (CA-125 AUC) to signalise epithelial ovarian cancer relapse. Data from a hundred and eleven patients were submitted to two different approaches based on CA-125 AUC increase values to predict patient relapse. In Criterion A total CA-125 AUC normalized in time value (AUC(i)) was compared with the immediately previous one (AUC(i-1)) using the formulae AUC(i) > or = F * AUC(i-1) (several F values were tested) to find the appropriate close related increment associated to patient relapse. In Criterion B total CA-125 AUC normalised in time was calculated and several cut-off values were correlated with patient relapse prediction capacity. In Criterion A the best accuracy was achieved with a factor (F) of 1.25 (increment of 25% from the previous status), while in Criterion B the best accuracies were achieved with cut-offs of 25, 50, 75 and 100 IU/mL. The mean lead time to relapse achieved with Criterion A was 181 days, while with Criterion B they were, respectively, 131, 111, 63 and 11 days. Based on our results we believe that conjugation and sequential application of both criteria in patient relapse detection should be highly advisable. CA-125 AUC rapid burst in asymptomatic patients should be firstly evaluated using Criterion A with a high accuracy (0.85) and with a substantial mean lead time to relapse (181 days). If a negative answer was obtained then Criterion B should performed to confirm the absence of relapse.
The Arthroscopic Surgical Skill Evaluation Tool (ASSET)
Koehler, Ryan J.; Amsdell, Simon; Arendt, Elizabeth A; Bisson, Leslie J; Braman, Jonathan P; Butler, Aaron; Cosgarea, Andrew J; Harner, Christopher D; Garrett, William E; Olson, Tyson; Warme, Winston J.; Nicandri, Gregg T.
2014-01-01
Background Surgeries employing arthroscopic techniques are among the most commonly performed in orthopaedic clinical practice however, valid and reliable methods of assessing the arthroscopic skill of orthopaedic surgeons are lacking. Hypothesis The Arthroscopic Surgery Skill Evaluation Tool (ASSET) will demonstrate content validity, concurrent criterion-oriented validity, and reliability, when used to assess the technical ability of surgeons performing diagnostic knee arthroscopy on cadaveric specimens. Study Design Cross-sectional study; Level of evidence, 3 Methods Content validity was determined by a group of seven experts using a Delphi process. Intra-articular performance of a right and left diagnostic knee arthroscopy was recorded for twenty-eight residents and two sports medicine fellowship trained attending surgeons. Subject performance was assessed by two blinded raters using the ASSET. Concurrent criterion-oriented validity, inter-rater reliability, and test-retest reliability were evaluated. Results Content validity: The content development group identified 8 arthroscopic skill domains to evaluate using the ASSET. Concurrent criterion-oriented validity: Significant differences in total ASSET score (p<0.05) between novice, intermediate, and advanced experience groups were identified. Inter-rater reliability: The ASSET scores assigned by each rater were strongly correlated (r=0.91, p <0.01) and the intra-class correlation coefficient between raters for the total ASSET score was 0.90. Test-retest reliability: there was a significant correlation between ASSET scores for both procedures attempted by each individual (r = 0.79, p<0.01). Conclusion The ASSET appears to be a useful, valid, and reliable method for assessing surgeon performance of diagnostic knee arthroscopy in cadaveric specimens. Studies are ongoing to determine its generalizability to other procedures as well as to the live OR and other simulated environments. PMID:23548808
Soft Clustering Criterion Functions for Partitional Document Clustering
2004-05-26
in the clus- ter that it already belongs to. The refinement phase ends, as soon as we perform an iteration in which no documents moved between...for failing to comply with a collection of information if it does not display a currently valid OMB control number. 1. REPORT DATE 26 MAY 2004 2... it with the one obtained by the hard criterion functions. We present a comprehensive experimental evaluation involving twelve differ- ent datasets
3-D Mixed Mode Delamination Fracture Criteria - An Experimentalist's Perspective
NASA Technical Reports Server (NTRS)
Reeder, James R.
2006-01-01
Many delamination failure criteria based on fracture toughness have been suggested over the past few decades, but most only covered the region containing mode I and mode II components of loading because that is where toughness data existed. With new analysis tools, more 3D analyses are being conducted that capture a mode III component of loading. This has increased the need for a fracture criterion that incorporates mode III loading. The introduction of a pure mode III fracture toughness test has also produced data on which to base a full 3D fracture criterion. In this paper, a new framework for visualizing 3D fracture criteria is introduced. The common 2D power law fracture criterion was evaluated to produce unexpected predictions with the introduction of mode III and did not perform well in the critical high mode I region. Another 2D criterion that has been shown to model a wide range of materials well was used as the basis for a new 3D criterion. The new criterion is based on assumptions that the relationship between mode I and mode III toughness is similar to the relation between mode I and mode II and that a linear interpolation can be used between mode II and mode III. Until mixed-mode data exists with a mode III component of loading, 3D fracture criteria cannot be properly evaluated, but these assumptions seem reasonable.
Prescott-Clements, L E; van der Vleuten, C P M; Schuwirth, L; Gibb, E; Hurst, Y; Rennie, J S
2011-08-01
For health professionals, the development of insight into their performance is vital for safe practice, professional development and self-regulation. This study investigates whether the development of dental trainees' insight, when provided with external feedback on performance, can be assessed using a single criterion on a simple global ratings form such as the Longitudinal Evaluation of Performance or Mini Clinical Evaluation Exercise. Postgraduate dental trainees (N = 139) were assessed using this tool on a weekly basis for 6 months. Regression analysis of the data was carried out using SPSS, and a short trainer questionnaire was implemented to investigate feasibility. Ratings for insight were shown to increase with time in a similar manner to the growth observed in other essential skills. The gradient of the slope for growth of insight was slightly less than that of the other observed skills. Trainers were mostly positive about the new criterion assessing trainees' insight, although the importance of training for trainers in this process was highlighted. Our data suggest that practitioners' insight into their performance can be developed with experience and regular feedback. However, this is most likely a complex skill dependent on a number of intrinsic and external factors. The development of trainees' insight into their performance can be assessed using a single criterion on a simple global ratings form. The process involves no additional burden on evaluators in terms of their time or cost, and promotes best practice in the provision of feedback for trainees. © 2011 John Wiley & Sons A/S.
Fertigation uniformity under sprinkler irrigation: evaluation and analysis
USDA-ARS?s Scientific Manuscript database
n modern farming systems, fertigation is widely practiced as a cost effective and convenient method for applying soluble fertilizers to crops. Along with efficiency and adequacy, uniformity is an important fertigation performance evaluation criterion. Fertigation uniformity is defined here as a comp...
12 CFR 345.41 - Assessment area delineation.
Code of Federal Regulations, 2010 CFR
2010-01-01
... the bank's delineation of its assessment area(s) as a separate performance criterion, but the FDIC..., such as those consumer loans on which the bank elects to have its performance assessed). (d... area(s) delineated by a bank in its evaluation of the bank's CRA performance unless the FDIC determines...
ERIC Educational Resources Information Center
Moore, Charles G.; And Others
This guide provides job-related tasks, performance objectives, performance guides, resources, teaching activities, evaluation standards, and criterion-referenced measures in three units of a welding course. Through the curriculum content of the welding course, the guide helps teachers lead students through the learning process, including the…
Hsu, Bing-Cheng
2018-01-01
Waxing is an important aspect of automobile detailing, aimed at protecting the finish of the car and preventing rust. At present, this delicate work is conducted manually due to the need for iterative adjustments to achieve acceptable quality. This paper presents a robotic waxing system in which surface images are used to evaluate the quality of the finish. An RGB-D camera is used to build a point cloud that details the sheet metal components to enable path planning for a robot manipulator. The robot is equipped with a multi-axis force sensor to measure and control the forces involved in the application and buffing of wax. Images of sheet metal components that were waxed by experienced car detailers were analyzed using image processing algorithms. A Gaussian distribution function and its parameterized values were obtained from the images for use as a performance criterion in evaluating the quality of surfaces prepared by the robotic waxing system. Waxing force and dwell time were optimized using a mathematical model based on the image-based criterion used to measure waxing performance. Experimental results demonstrate the feasibility of the proposed robotic waxing system and image-based performance evaluation scheme. PMID:29757940
Lin, Chi-Ying; Hsu, Bing-Cheng
2018-05-14
Waxing is an important aspect of automobile detailing, aimed at protecting the finish of the car and preventing rust. At present, this delicate work is conducted manually due to the need for iterative adjustments to achieve acceptable quality. This paper presents a robotic waxing system in which surface images are used to evaluate the quality of the finish. An RGB-D camera is used to build a point cloud that details the sheet metal components to enable path planning for a robot manipulator. The robot is equipped with a multi-axis force sensor to measure and control the forces involved in the application and buffing of wax. Images of sheet metal components that were waxed by experienced car detailers were analyzed using image processing algorithms. A Gaussian distribution function and its parameterized values were obtained from the images for use as a performance criterion in evaluating the quality of surfaces prepared by the robotic waxing system. Waxing force and dwell time were optimized using a mathematical model based on the image-based criterion used to measure waxing performance. Experimental results demonstrate the feasibility of the proposed robotic waxing system and image-based performance evaluation scheme.
Sun, Min; Wong, David; Kronenfeld, Barry
2016-01-01
Despite conceptual and technology advancements in cartography over the decades, choropleth map design and classification fail to address a fundamental issue: estimates that are statistically indifferent may be assigned to different classes on maps or vice versa. Recently, the class separability concept was introduced as a map classification criterion to evaluate the likelihood that estimates in two classes are statistical different. Unfortunately, choropleth maps created according to the separability criterion usually have highly unbalanced classes. To produce reasonably separable but more balanced classes, we propose a heuristic classification approach to consider not just the class separability criterion but also other classification criteria such as evenness and intra-class variability. A geovisual-analytic package was developed to support the heuristic mapping process to evaluate the trade-off between relevant criteria and to select the most preferable classification. Class break values can be adjusted to improve the performance of a classification. PMID:28286426
Science and Art of Setting Performance Standards and Cutoff Scores in Kinesiology
ERIC Educational Resources Information Center
Zhu, Weimo
2013-01-01
Setting standards and cutoff scores is essential to any measurement and evaluation practice. Two evaluation frameworks, norm-referenced (NR) and criterion-referenced (CR), have often been used for setting standards. Although setting fitness standards based on the NR evaluation is relatively easy as long as a nationally representative sample can be…
Improvement and Extension of Shape Evaluation Criteria in Multi-Scale Image Segmentation
NASA Astrophysics Data System (ADS)
Sakamoto, M.; Honda, Y.; Kondo, A.
2016-06-01
From the last decade, the multi-scale image segmentation is getting a particular interest and practically being used for object-based image analysis. In this study, we have addressed the issues on multi-scale image segmentation, especially, in improving the performances for validity of merging and variety of derived region's shape. Firstly, we have introduced constraints on the application of spectral criterion which could suppress excessive merging between dissimilar regions. Secondly, we have extended the evaluation for smoothness criterion by modifying the definition on the extent of the object, which was brought for controlling the shape's diversity. Thirdly, we have developed new shape criterion called aspect ratio. This criterion helps to improve the reproducibility on the shape of object to be matched to the actual objectives of interest. This criterion provides constraint on the aspect ratio in the bounding box of object by keeping properties controlled with conventional shape criteria. These improvements and extensions lead to more accurate, flexible, and diverse segmentation results according to the shape characteristics of the target of interest. Furthermore, we also investigated a technique for quantitative and automatic parameterization in multi-scale image segmentation. This approach is achieved by comparing segmentation result with training area specified in advance by considering the maximization of the average area in derived objects or satisfying the evaluation index called F-measure. Thus, it has been possible to automate the parameterization that suited the objectives especially in the view point of shape's reproducibility.
NASA Astrophysics Data System (ADS)
Vorotnikov, A. A.; Klimov, D. D.; Romash, E. V.; Bashevskaya, O. S.; Poduraev, Yu. V.; Bazykyan, E. A.; Chunihin, A. A.
2018-03-01
Industrial robots perform technological operations, such as spot and arc welding, machining and laser cutting along different trajectories within their performance characteristics. The evaluation of these characteristics is carried out according to the criteria of the standard ISO 9283. The criteria of this standard are applicable in industrial manufacturing, but not in the medical industry, as they are not developed in the framework of medical tasks. Therefore, it is necessary to evaluate according to criteria built on different principles. In this article, the question of comparative evaluation of trajectories from program movements of a robot and manual movements of a surgeon, arising during the development of robotic medical complexes using industrial robots, is considered. A comparative evaluation is required to prove the expediency of automating medical operations in maxillofacial surgery. This study focuses on the estimation of velocity accuracy of a medical instrument. To obtain the velocity of the medical instrument, coordinates of the trajectory points from the program movements of the robot KUKA LWR4+ and trajectories from the manual movements of a professional surgeon have been measured. The measurement was carried out using a coordinate measuring machine, the laser tracker Leica LTD800. The accuracy estimation was carried out by two criteria: the criterion set out in the ISO 9283 standard, and the developed alternative criterion, the description of which is presented in this article. A quantitative comparative evaluation of the trajectories of a robot and a surgeon was obtained.
A new approach using coagulation rate constant for evaluation of turbidity removal
NASA Astrophysics Data System (ADS)
Al-Sameraiy, Mukheled
2017-06-01
Coagulation-flocculation-sedimentation processes for treating three levels of bentonite synthetic turbid water using date seeds (DS) and alum (A) coagulants were investigated in the previous research work. In the current research, the same experimental results were used to adopt a new approach on a basis of using coagulation rate constant as an investigating parameter to identify optimum doses of these coagulants. Moreover, the performance of these coagulants to meet (WHO) turbidity standard was assessed by introducing a new evaluating criterion in terms of critical coagulation rate constant (kc). Coagulation rate constants (k2) were mathematically calculated in second order form of coagulation process for each coagulant. The maximum (k2) values corresponded to doses, which were obviously to be considered as optimum doses. The proposed criterion to assess the performance of coagulation process of these coagulants was based on the mathematical representation of (WHO) turbidity guidelines in second order form of coagulation process stated that (k2) for each coagulant should be ≥ (kc) for each level of synthetic turbid water. For all tested turbid water, DS coagulant could not satisfy it. While, A coagulant could satisfy it. The results obtained in the present research are exactly in agreement with the previous published results in terms of finding optimum doses for each coagulant and assessing their performances. On the whole, it is recommended considering coagulation rate constant to be a new approach as an indicator for investigating optimum doses and critical coagulation rate constant to be a new evaluating criterion to assess coagulants' performance.
Corner-point criterion for assessing nonlinear image processing imagers
NASA Astrophysics Data System (ADS)
Landeau, Stéphane; Pigois, Laurent; Foing, Jean-Paul; Deshors, Gilles; Swiathy, Greggory
2017-10-01
Range performance modeling of optronics imagers attempts to characterize the ability to resolve details in the image. Today, digital image processing is systematically used in conjunction with the optoelectronic system to correct its defects or to exploit tiny detection signals to increase performance. In order to characterize these processing having adaptive and non-linear properties, it becomes necessary to stimulate the imagers with test patterns whose properties are similar to the actual scene image ones, in terms of dynamic range, contours, texture and singular points. This paper presents an approach based on a Corner-Point (CP) resolution criterion, derived from the Probability of Correct Resolution (PCR) of binary fractal patterns. The fundamental principle lies in the respectful perception of the CP direction of one pixel minority value among the majority value of a 2×2 pixels block. The evaluation procedure considers the actual image as its multi-resolution CP transformation, taking the role of Ground Truth (GT). After a spatial registration between the degraded image and the original one, the degradation is statistically measured by comparing the GT with the degraded image CP transformation, in terms of localized PCR at the region of interest. The paper defines this CP criterion and presents the developed evaluation techniques, such as the measurement of the number of CP resolved on the target, the transformation CP and its inverse transform that make it possible to reconstruct an image of the perceived CPs. Then, this criterion is compared with the standard Johnson criterion, in the case of a linear blur and noise degradation. The evaluation of an imaging system integrating an image display and a visual perception is considered, by proposing an analysis scheme combining two methods: a CP measurement for the highly non-linear part (imaging) with real signature test target and conventional methods for the more linear part (displaying). The application to color imaging is proposed, with a discussion about the choice of the working color space depending on the type of image enhancement processing used.
MSFC Skylab structures and mechanical systems mission evaluation
NASA Technical Reports Server (NTRS)
1974-01-01
A performance analysis for structural and mechanical major hardware systems and components is presented. Development background testing, modifications, and requirement adjustments are included. Functional narratives are provided for comparison purposes as are predicted design performance criterion. Each item is evaluated on an individual basis: that is, (1) history (requirements, design, manufacture, and test); (2) in-orbit performance (description and analysis); and (3) conclusions and recommendations regarding future space hardware application. Overall, the structural and mechanical performance of the Skylab hardware was outstanding.
Park, Jee Won; Seo, Eun Ji; You, Mi-Ae; Song, Ju-Eun
2016-03-01
Program outcome evaluation is important because it is an indicator for good quality of education. Course-embedded assessment is one of the program outcome evaluation methods. However, it is rarely used in Korean nursing education. The study purpose was to develop and apply preliminarily a course-embedded assessment system to evaluate one program outcome and to share our experiences. This was a methodological study to develop and apply the course-embedded assessment system based on the theoretical framework in one nursing program in South Korea. Scores for 77 students generated from the three practicum courses were used. The course-embedded assessment system was developed following the six steps suggested by Han's model as follows. 1) One program outcome in the undergraduate program, "nursing process application ability", was selected and 2) the three clinical practicum courses related to the selected program outcome were identified. 3) Evaluation tools including rubric and items were selected for outcome measurement and 4) performance criterion, the educational goal level for the program, was established. 5) Program outcome was actually evaluated using the rubric and evaluation items in the three practicum courses and 6) the obtained scores were analyzed to identify the achievement rate, which was compared with the performance criterion. Achievement rates for the selected program outcome in adult, maternity, and pediatric nursing practicum were 98.7%, 100%, and 66.2% in the case report and 100% for all three in the clinical practice, and 100%, 100%, and 87% respectively for the conference. These are considered as satisfactory levels when compared with the performance criterion of "at least 60% or more". Course-embedded assessment can be used as an effective and economic method to evaluate the program outcome without running an integrative course additionally. Further studies to develop course-embedded assessment systems for other program outcomes in nursing education are needed. Copyright © 2016 Elsevier Ltd. All rights reserved.
Clinical evaluation of two packable posterior composites: 2-year follow-up.
Fagundes, T C; Barata, T J E; Bresciani, E; Cefaly, D F G; Jorge, M F F; Navarro, M F L
2006-09-01
The clinical performance of two packable posterior composites, Alert (A)-Jeneric/Pentron and SureFil (S)-Dentsply, was evaluated in 33 patients. Each patient received one A and one S restoration, resulting in a total of 66 restorations. The restorations were placed by one operator according to the manufacturer's specifications and were finished and polished after 1 week. Photographs were taken at baseline and after 2 years. Two independent evaluators conducted the clinical evaluation by using modified United States Public Health Service criteria. After 2 years, 60 restorations (30 A and 30 S), 27 class I (16 A and 11 S) and 33 class II (14 A and 19 S) were evaluated in 30 patients. Criterion A for recurrent caries, vitality, and retention was applicable to all 60 restorations. Criterion B was distributed among 40 restorations as follows: surface texture (15 A; 2 S), color (5 A; 6 S), postoperative sensitivity (1 S), marginal discoloration (8 A), marginal adaptation (3 A), and wear resistance (2 A). Data were analyzed using the Exact Fisher and McNemar tests. After 2 years, S showed a significantly better performance than A with respect to surface texture and marginal discoloration. The clinical performance of both materials was considered acceptable over the 2-year period. Further evaluations are necessary for a more in-depth analysis.
Code of Federal Regulations, 2012 CFR
2012-01-01
... 12 Banks and Banking 1 2012-01-01 2012-01-01 false Lending test. 195.22 Section 195.22 Banks and... Assessing Performance § 195.22 Lending test. (a) Scope of test. (1) The lending test evaluates a savings... lending test except the community development lending criterion. (b) Performance criteria. The appropriate...
DOT National Transportation Integrated Search
2010-06-01
The performance of flexible pavements relies heavily on the final quality of the hot-mix asphalt concrete (HMAC) as it : is produced and placed in the field. To account for production and construction variability while ensuring the quality of the : H...
Quick test for durability factor estimation.
DOT National Transportation Integrated Search
2010-03-01
The Missouri Department of Transportation (MoDOT) is considering the use of the AASHTO T 161 Durability Factor (DF) as an endresult : performance specification criterion for evaluation of paving concrete. However, the test method duration can exceed ...
Large Area Crop Inventory Experiment (LACIE). YES phase 1 yield feasibility report
NASA Technical Reports Server (NTRS)
1977-01-01
The author has identified the following significant results. Each state model was separately evaluated to determine if a projected performance to the country level would satisfy a 90/90 criterion. All state models, except the North Dakota and Kansas models, satisfied that criterion both for district estimates aggregated to the state level and for state estimates directly from the models. In addition to the tests of the 90/90 criterion, the models were examined for their ability to adequately respond to fluctuations in weather. This portion of the analysis was based on a subjective interpretation of values of certain description statistics. As a result, 10 of the 12 models were judged to respond inadequately to variation in weather-related variables.
Analysis of augmented aircraft flying qualities through application of the Neal-Smith criterion
NASA Technical Reports Server (NTRS)
Bailey, R. E.; Smith, R. E.
1981-01-01
The Neal-Smith criterion is examined for possible applications in the evaluation of augmented fighter aircraft flying qualities. Longitudinal and lateral flying qualities are addressed. Based on the application of several longitudinal flying qualities data bases, revisions are proposed to the original criterion. Examples are given which show the revised criterion to be a good discriminator of pitch flying qualities. Initial results of lateral flying qualities evaluation through application of the Neal-Smith criterion are poor. Lateral aircraft configurations whose flying qualities are degraded by roll ratcheting effects map into the Level 1 region of the criterion. A third dimension of the criterion for flying qualities specification is evident. Additional criteria are proposed to incorporate this dimension into the criterion structure for flying qualities analysis.
High blood Pressure in children and its correlation with three definitions of obesity in childhood
de Moraes, Leonardo Iezzi; Nicola, Thaís Coutinho; de Jesus, Julyanna Silva Araújo; Alves, Eduardo Roberty Badiani; Giovaninni, Nayara Paula Bernurdes; Marcato, Daniele Gasparini; Sampaio, Jéssica Dutra; Fuly, Jeanne Teixeira Bessa; Costalonga, Everlayny Fiorot
2014-01-01
Background Several authors have correlated the increase of cardiovascular risk with the nutritional status, however there are different criteria for the classification of overweight and obesity in children. Objectives To evaluate the performance of three nutritional classification criteria in children, as definers of the presence of obesity and predictors of high blood pressure in schoolchildren. Methods Eight hundred and seventeen children ranging 6 to 13 years old, enrolled in public schools in the municipality of Vila Velha (ES) were submitted to anthropometric evaluation and blood pressure measurement. The classification of the nutritional status was established by two international criteria (CDC/NCHS 2000 and IOTF 2000) and one Brazilian criterion (Conde e Monteiro 2006). Results The prevalence of overweight was higher when the criterion of Conde e Monteiro (27%) was used, and inferior by the IOTF (15%) criteria. High blood pressure was observed in 7.3% of children. It was identified a strong association between the presence of overweight and the occurrence of high blood pressure, regardless of the test used (p < 0.001). The test showing the highest sensitivity in predicting elevated BP was the Conde e Monteiro (44%), while the highest specificity (94%) and greater overall accuracy (63%), was the CDC criterion. Conclusions The prevalence of overweight in Brazilian children is higher when using the classification criterion of Conde e Monteiro, and lower when the criterion used is IOTF. The Brazilian classification criterion proved to be the most sensitive predictor of high BP risk in this sample. PMID:24676372
NASA Astrophysics Data System (ADS)
Aldossari, M.; Alfalou, A.; Brosseau, C.
2017-08-01
In an earlier study [Opt. Express 22, 22349-22368 (2014)], a compression and encryption method that simultaneous compress and encrypt closely resembling images was proposed and validated. This multiple-image optical compression and encryption (MIOCE) method is based on a special fusion of the different target images spectra in the spectral domain. Now for the purpose of assessing the capacity of the MIOCE method, we would like to evaluate and determine the influence of the number of target images. This analysis allows us to evaluate the performance limitation of this method. To achieve this goal, we use a criterion based on the root-mean-square (RMS) [Opt. Lett. 35, 1914-1916 (2010)] and compression ratio to determine the spectral plane area. Then, the different spectral areas are merged in a single spectrum plane. By choosing specific areas, we can compress together 38 images instead of 26 using the classical MIOCE method. The quality of the reconstructed image is evaluated by making use of the mean-square-error criterion (MSE).
Industry Software Trustworthiness Criterion Research Based on Business Trustworthiness
NASA Astrophysics Data System (ADS)
Zhang, Jin; Liu, Jun-fei; Jiao, Hai-xing; Shen, Yi; Liu, Shu-yuan
To industry software Trustworthiness problem, an idea aiming to business to construct industry software trustworthiness criterion is proposed. Based on the triangle model of "trustworthy grade definition-trustworthy evidence model-trustworthy evaluating", the idea of business trustworthiness is incarnated from different aspects of trustworthy triangle model for special industry software, power producing management system (PPMS). Business trustworthiness is the center in the constructed industry trustworthy software criterion. Fusing the international standard and industry rules, the constructed trustworthy criterion strengthens the maneuverability and reliability. Quantitive evaluating method makes the evaluating results be intuitionistic and comparable.
NASA Astrophysics Data System (ADS)
Noble, Clifford Elliott, II
2002-09-01
The problem. The purpose of this study was to investigate the ability of three single-task instruments---(a) the Test of English as a Foreign Language, (b) the Aviation Test of Spoken English, and (c) the Single Manual-Tracking Test---and three dual-task instruments---(a) the Concurrent Manual-Tracking and Communication Test, (b) the Certified Flight Instructor's Test, and (c) the Simulation-Based English Test---to predict the language performance of 10 Chinese student pilots speaking English as a second language when operating single-engine and multiengine aircraft within American airspace. Method. This research implemented a correlational design to investigate the ability of the six described instruments to predict the mean score of the criterion evaluation, which was the Examiner's Test. This test assessed the oral communication skill of student pilots on the flight portion of the terminal checkride in the Piper Cadet, Piper Seminole, and Beechcraft King Air airplanes. Results. Data from the Single Manual-Tracking Test, as well as the Concurrent Manual-Tracking and Communication Test, were discarded due to performance ceiling effects. Hypothesis 1, which stated that the average correlation between the mean scores of the dual-task evaluations and that of the Examiner's Test would predict the mean score of the criterion evaluation with a greater degree of accuracy than that of single-task evaluations, was not supported. Hypothesis 2, which stated that the correlation between the mean scores of the participants on the Simulation-Based English Test and the Examiner's Test would predict the mean score of the criterion evaluation with a greater degree of accuracy than that of all single- and dual-task evaluations, was also not supported. The findings suggest that single- and dual-task assessments administered after initial flight training are equivalent predictors of language performance when piloting single-engine and multiengine aircraft.
Halimi, C; Montembault, A; Guerry, A; Delair, T; Viguier, E; Fulchiron, R; David, L
2015-01-01
A new generation of dermal filler for wrinkle filler based on chitosan was compared to current hyaluronic acid-based dermal fillers by using a new rheological performance criterion based on viscosity during injection related to Newtonian viscosity. In addition an in vivo evaluation was performed for preclinical evidence of chitosan use as dermal filler. In this way, biocompatibility and dermis reconstruction was evaluated on a pig model.
The Concept of Performance Levels in Criterion-Referenced Assessment.
ERIC Educational Resources Information Center
Hewitson, Mal
The concept of performance levels in criterion-referenced assessment is explored by applying the idea to different types of tests commonly used in schools, mastery tests (including diagnostic tests) and achievement tests. In mastery tests, a threshold performance standard must be established for each criterion. Attainment of this threshold…
Commercial Carpentry: Instructional Units.
ERIC Educational Resources Information Center
Diehl, Donald W.; Penner, Wayman R.
This manual contains instructional materials which measure student performance on commercial carpentry behavioral objectives; criterion-referenced evaluation instruments are also included. Each of the manual's eleven sections consists of one or more units of instruction. Each instructional unit includes behavioral objectives, suggested activities…
Hygrothermal Performance of West Coast Wood Deck Roofing System
DOE Office of Scientific and Technical Information (OSTI.GOV)
Pallin, Simon B.; Kehrer, Manfred; Desjarlais, Andre Omer
2014-02-01
Simulations of roofing assemblies are necessary in order to understand and adequately predict actual the hygrothermal performance. At the request of GAF, simulations have been setup to verify the difference in performance between white and black roofing membrane colors in relation to critical moisture accumulation for traditional low slope wood deck roofing systems typically deployed in various western U.S. Climate Zones. The performance of these roof assemblies has been simulated in the hygrothermal calculation tool of WUFI, from which the result was evaluated based on a defined criterion for moisture safety. The criterion was defined as the maximum accepted watermore » content for wood materials and the highest acceptable moisture accumulation rate in relation to the risk of rot. Based on the criterion, the roof assemblies were certified as being either safe, risky or assumed to fail. The roof assemblies were simulated in different western climates, with varying insulation thicknesses, two different types of wooden decking, applied with varying interior moisture load and with either a high or low solar absorptivity at the roof surface (black or white surface color). The results show that the performance of the studied roof assemblies differs with regard to all of the varying parameters, especially the climate and the indoor moisture load.« less
Evaluation of volatile organic emissions from hazardous waste incinerators.
Sedman, R M; Esparza, J R
1991-01-01
Conventional methods of risk assessment typically employed to evaluate the impact of hazardous waste incinerators on public health must rely on somewhat speculative emissions estimates or on complicated and expensive sampling and analytical methods. The limited amount of toxicological information concerning many of the compounds detected in stack emissions also complicates the evaluation of the public health impacts of these facilities. An alternative approach aimed at evaluating the public health impacts associated with volatile organic stack emissions is presented that relies on a screening criterion to evaluate total stack hydrocarbon emissions. If the concentration of hydrocarbons in ambient air is below the screening criterion, volatile emissions from the incinerator are judged not to pose a significant threat to public health. Both the screening criterion and a conventional method of risk assessment were employed to evaluate the emissions from 20 incinerators. Use of the screening criterion always yielded a substantially greater estimate of risk than that derived by the conventional method. Since the use of the screening criterion always yielded estimates of risk that were greater than that determined by conventional methods and measuring total hydrocarbon emissions is a relatively simple analytical procedure, the use of the screening criterion would appear to facilitate the evaluation of operating hazardous waste incinerators. PMID:1954928
A Universal Model for Evaluating Basic Electronic Courses in Terms of Field Utilization of Training.
ERIC Educational Resources Information Center
Air Force Occupational Measurement Center, Lackland AFB, TX.
The main purpose of the Air Force project was to develop a universal model to evaluate usage of basic electronic principles training. The criterion used by the model to evaluate electronic theory training is a determination of the usefulness of the training vis-a-vis the performance of assigned tasks in the various electronic career fields. Data…
Evidence for the Criterion Validity and Clinical Utility of the Pathological Narcissism Inventory
ERIC Educational Resources Information Center
Thomas, Katherine M.; Wright, Aidan G. C.; Lukowitsky, Mark R.; Donnellan, M. Brent; Hopwood, Christopher J.
2012-01-01
In this study, the authors evaluated aspects of criterion validity and clinical utility of the grandiosity and vulnerability components of the Pathological Narcissism Inventory (PNI) using two undergraduate samples (N = 299 and 500). Criterion validity was assessed by evaluating the correlations of narcissistic grandiosity and narcissistic…
The Impact of Social Cues on Children's Behavior
ERIC Educational Resources Information Center
Dweck, Carol S.; And Others
1976-01-01
Introduces purpose of symposium: to discuss research which explores the factors determining how a child, faced with obtaining some goal or fulfilling some criterion of performance, responds to given instructional or evaluative cues. Delineates variety of research strategies employed. (JH)
Soble, Jason R; Bain, Kathleen M; Bailey, K Chase; Kirton, Joshua W; Marceaux, Janice C; Critchfield, Edan A; McCoy, Karin J M; O'Rourke, Justin J F
2018-01-08
Embedded performance validity tests (PVTs) allow for continuous assessment of invalid performance throughout neuropsychological test batteries. This study evaluated the utility of the Wechsler Memory Scale-Fourth Edition (WMS-IV) Logical Memory (LM) Recognition score as an embedded PVT using the Advanced Clinical Solutions (ACS) for WAIS-IV/WMS-IV Effort System. This mixed clinical sample was comprised of 97 total participants, 71 of whom were classified as valid and 26 as invalid based on three well-validated, freestanding criterion PVTs. Overall, the LM embedded PVT demonstrated poor concordance with the criterion PVTs and unacceptable psychometric properties using ACS validity base rates (42% sensitivity/79% specificity). Moreover, 15-39% of participants obtained an invalid ACS base rate despite having a normatively-intact age-corrected LM Recognition total score. Receiving operating characteristic curve analysis revealed a Recognition total score cutoff of < 61% correct improved specificity (92%) while sensitivity remained weak (31%). Thus, results indicated the LM Recognition embedded PVT is not appropriate for use from an evidence-based perspective, and that clinicians may be faced with reconciling how a normatively intact cognitive performance on the Recognition subtest could simultaneously reflect invalid performance validity.
Mayorga-Vega, Daniel; Bocanegra-Parrilla, Raúl; Ornelas, Martha; Viciana, Jesús
2016-01-01
The main purpose of the present meta-analysis was to examine the criterion-related validity of the distance- and time-based walk/run tests for estimating cardiorespiratory fitness among apparently healthy children and adults. Relevant studies were searched from seven electronic bibliographic databases up to August 2015 and through other sources. The Hunter-Schmidt's psychometric meta-analysis approach was conducted to estimate the population criterion-related validity of the following walk/run tests: 5,000 m, 3 miles, 2 miles, 3,000 m, 1.5 miles, 1 mile, 1,000 m, ½ mile, 600 m, 600 yd, ¼ mile, 15 min, 12 min, 9 min, and 6 min. From the 123 included studies, a total of 200 correlation values were analyzed. The overall results showed that the criterion-related validity of the walk/run tests for estimating maximum oxygen uptake ranged from low to moderate (rp = 0.42-0.79), with the 1.5 mile (rp = 0.79, 0.73-0.85) and 12 min walk/run tests (rp = 0.78, 0.72-0.83) having the higher criterion-related validity for distance- and time-based field tests, respectively. The present meta-analysis also showed that sex, age and maximum oxygen uptake level do not seem to affect the criterion-related validity of the walk/run tests. When the evaluation of an individual's maximum oxygen uptake attained during a laboratory test is not feasible, the 1.5 mile and 12 min walk/run tests represent useful alternatives for estimating cardiorespiratory fitness. As in the assessment with any physical fitness field test, evaluators must be aware that the performance score of the walk/run field tests is simply an estimation and not a direct measure of cardiorespiratory fitness.
QRS detection based ECG quality assessment.
Hayn, Dieter; Jammerbund, Bernhard; Schreier, Günter
2012-09-01
Although immediate feedback concerning ECG signal quality during recording is useful, up to now not much literature describing quality measures is available. We have implemented and evaluated four ECG quality measures. Empty lead criterion (A), spike detection criterion (B) and lead crossing point criterion (C) were calculated from basic signal properties. Measure D quantified the robustness of QRS detection when applied to the signal. An advanced Matlab-based algorithm combining all four measures and a simplified algorithm for Android platforms, excluding measure D, were developed. Both algorithms were evaluated by taking part in the Computing in Cardiology Challenge 2011. Each measure's accuracy and computing time was evaluated separately. During the challenge, the advanced algorithm correctly classified 93.3% of the ECGs in the training-set and 91.6 % in the test-set. Scores for the simplified algorithm were 0.834 in event 2 and 0.873 in event 3. Computing time for measure D was almost five times higher than for other measures. Required accuracy levels depend on the application and are related to computing time. While our simplified algorithm may be accurate for real-time feedback during ECG self-recordings, QRS detection based measures can further increase the performance if sufficient computing power is available.
Systems Engineering Management Guide,
1990-01-01
6•’-&-S- A -i-2-- -4-$-6-7-6-I SPEED AN0 INDURA6Ca CARGO CAPACITY ,- .- ,-,-,-$-- -,-$-4-,-,-7-,-U LOOISTICSR&M CARGO CAPACITY -- - - - -3.2- - 3-4...only subjective and to predict a level of performance with ( high , medium, low) evaluation is possible. respect to each attribute for each alternative For...criterion; evaluated as having an expected speed of however, some fixed plan for scoring 31.5 knots would receive a score of .50, while performance
A Controlled Evaluation of the Distress Criterion for Binge Eating Disorder
ERIC Educational Resources Information Center
Grilo, Carlos M.; White, Marney A.
2011-01-01
Objective: Research has examined various aspects of the validity of the research criteria for binge eating disorder (BED) but has yet to evaluate the utility of Criterion C, "marked distress about binge eating." This study examined the significance of the marked distress criterion for BED using 2 complementary comparison groups. Method:…
Tseng, Yi-Ju; Wu, Jung-Hsuan; Ping, Xiao-Ou; Lin, Hui-Chi; Chen, Ying-Yu; Shang, Rung-Ji; Chen, Ming-Yuan; Lai, Feipei
2012-01-01
Background The emergence and spread of multidrug-resistant organisms (MDROs) are causing a global crisis. Combating antimicrobial resistance requires prevention of transmission of resistant organisms and improved use of antimicrobials. Objectives To develop a Web-based information system for automatic integration, analysis, and interpretation of the antimicrobial susceptibility of all clinical isolates that incorporates rule-based classification and cluster analysis of MDROs and implements control chart analysis to facilitate outbreak detection. Methods Electronic microbiological data from a 2200-bed teaching hospital in Taiwan were classified according to predefined criteria of MDROs. The numbers of organisms, patients, and incident patients in each MDRO pattern were presented graphically to describe spatial and time information in a Web-based user interface. Hierarchical clustering with 7 upper control limits (UCL) was used to detect suspicious outbreaks. The system’s performance in outbreak detection was evaluated based on vancomycin-resistant enterococcal outbreaks determined by a hospital-wide prospective active surveillance database compiled by infection control personnel. Results The optimal UCL for MDRO outbreak detection was the upper 90% confidence interval (CI) using germ criterion with clustering (area under ROC curve (AUC) 0.93, 95% CI 0.91 to 0.95), upper 85% CI using patient criterion (AUC 0.87, 95% CI 0.80 to 0.93), and one standard deviation using incident patient criterion (AUC 0.84, 95% CI 0.75 to 0.92). The performance indicators of each UCL were statistically significantly higher with clustering than those without clustering in germ criterion (P < .001), patient criterion (P = .04), and incident patient criterion (P < .001). Conclusion This system automatically identifies MDROs and accurately detects suspicious outbreaks of MDROs based on the antimicrobial susceptibility of all clinical isolates. PMID:23195868
Advanced training of specialists in area of fiber-optic communication lines maintenance
NASA Astrophysics Data System (ADS)
Andreev, Vladimir A.; Voronkov, Andrey A.; Bukashkin, Sergey A.; Buzova, Maria A.
2017-04-01
The paper considers the concept of fiber-optic communication lines (FOCL) maintenance. Performance criterion of FOCL technical maintenance was proposed. For the first time the algorithm for evaluation of the FOCL maintenance efficiency at telecommunication specialists training was applied.
Small craft ID criteria (N50/V50) for short wave infrared sensors in maritime security
NASA Astrophysics Data System (ADS)
Krapels, Keith; Driggers, Ronald G.; Larson, Paul; Garcia, Jose; Walden, Barry; Agheera, Sameer; Deaver, Dawne; Hixson, Jonathan; Boettcher, Evelyn
2008-04-01
The need for Anti-Terrorism and Force Protection (AT/FP), for both shore and sea platform protection, has resulted in a need for imager design and evaluation tools which can predict field performance against maritime asymmetric threats. In the design of tactical imaging systems for target acquisition, a discrimination criterion is required for successful sensor realization. It characterizes the difficulty of the task being performed by the observer and varies for different target sets. This criterion is used in both assessment of existing infrared sensor and in the design of new conceptual sensors. In this experiment, we collected 8 small craft signatures (military and civilian) in the short wave infrared (SWIR) band during the day. These signatures were processed to determine the targets' characteristic dimension and contrast. They were also processed to bandlimit the signature's spatial information content (simulating longer range) and a perception experiment was performed to determine the task difficulty (N50 and V50). The results are presented in this paper and can be used for maritime security imaging sensor design and evaluation.
The Proposed MACRA/MIPS Threshold for Patient-Facing Encounters: What It Means for Radiologists.
Rosenkrantz, Andrew B; Hirsch, Joshua A; Allen, Bibb; Wang, Wenyi; Hughes, Danny R; Nicola, Gregory N
2017-03-01
In implementing the Merit-Based Incentive Payment System (MIPS), CMS will provide special considerations to physicians with infrequent face-to-face patient encounters by reweighting MIPS performance categories to account for the unique circumstances facing these providers. The aim of this study was to determine the impact of varying criteria on the fraction of radiologists who are likely to receive special considerations for performance assessment under MIPS. Data from the 2014 Medicare Physician and Other Supplier file for 28,710 diagnostic radiologists were used to determine the fraction of radiologists meeting various proposed criteria for receiving special considerations. For each definition, the fraction of patient-facing encounters among all billed codes was determined for those radiologists not receiving special considerations. When using the criterion proposed by CMS that physicians will receive special considerations if billing ≤25 evaluation and management services or surgical codes, 72.0% of diagnostic radiologists would receive special considerations, though such encounters would represent only 2.1% of billed codes among remaining diagnostic radiologists without special considerations. If CMS were to apply an alternative criterion of billing ≤100 evaluation and management codes exclusively, 98.8% of diagnostic radiologists would receive special considerations. At this threshold, patient-facing encounters would represent approximately 10% of billed codes among remaining radiologists without special considerations. The current CMS proposed criterion for special considerations would result in a considerable fraction of radiologists being evaluated on the basis of measures that are not reflective of their practice and beyond their direct control. Alternative criteria could help ensure that radiologists are provided a fair opportunity for success in performance review under the MIPS. Copyright © 2016 American College of Radiology. Published by Elsevier Inc. All rights reserved.
ERIC Educational Resources Information Center
Oakland, Thomas
New strategies for evaluation criterion referenced measures (CRM) are discussed. These strategies examine the following issues: (1) the use of normed referenced measures (NRM) as CRM and then estimating the reliability and validity of such measures in terms of variance from an arbitrarily specified criterion score, (2) estimation of the…
ERIC Educational Resources Information Center
Oxford, Rebecca L.; And Others
The Washington state Title I Migrant Program Evaluation project is a feasibility study designed to assess the suitability of existing normed criterion referenced tests to measuring mathematics achievement at grades four, five, and six. Objectives include judging the technical qualities and content of several normed criterion referenced tests;…
The Educational Warranty: Redesigning the Profession.
ERIC Educational Resources Information Center
Antonelli, George A.
Teacher education programs which guarantee the effectiveness of their graduates may help to redesign the image and substance of the teaching profession. Doane College (Nebraska), one of the pioneers with educational warranties, bases its program on previous concepts of performance contracting and criterion referenced evaluation. Doane's beginning…
Validation of powder X-ray diffraction following EN ISO/IEC 17025.
Eckardt, Regina; Krupicka, Erik; Hofmeister, Wolfgang
2012-05-01
Powder X-ray diffraction (PXRD) is used widely in forensic science laboratories with the main focus of qualitative phase identification. Little is found in literature referring to the topic of validation of PXRD in the field of forensic sciences. According to EN ISO/IEC 17025, the method has to be tested for several parameters. Trueness, specificity, and selectivity of PXRD were tested using certified reference materials or a combination thereof. All three tested parameters showed the secure performance of the method. Sample preparation errors were simulated to evaluate the robustness of the method. These errors were either easily detected by the operator or nonsignificant for phase identification. In case of the detection limit, a statistical evaluation of the signal-to-noise ratio showed that a peak criterion of three sigma is inadequate and recommendations for a more realistic peak criterion are given. Finally, the results of an international proficiency test showed the secure performance of PXRD. © 2012 American Academy of Forensic Sciences.
NASA Astrophysics Data System (ADS)
Krapels, Keith; Driggers, Ronald G.; Deaver, Dawne; Moker, Steven K.; Palmer, John
2007-10-01
The new emphasis on Anti-Terrorism and Force Protection (AT/FP), for both shore and sea platform protection, has resulted in a need for infrared imager design and evaluation tools that demonstrate field performance against U.S. Navy AT/FP requirements. In the design of infrared imaging systems for target acquisition, a discrimination criterion is required for successful sensor realization. It characterizes the difficulty of the task being performed by the observer and varies for different target sets. This criterion is used in both assessment of existing infrared sensor and in the design of new conceptual sensors. We collected 12 small craft signatures (military and civilian) in the visible band during the day and the long-wave and midwave infrared spectra in both the day and the night environments. These signatures were processed to determine the targets' characteristic dimension and contrast. They were also processed to band limit the signature's spatial information content (simulating longer range), and a perception experiment was performed to determine the task difficulty (N50 and V50). The results are presented and can be used for Navy and Coast Guard imaging infrared sensor design and evaluation.
An Elasto-Plastic Damage Model for Rocks Based on a New Nonlinear Strength Criterion
NASA Astrophysics Data System (ADS)
Huang, Jingqi; Zhao, Mi; Du, Xiuli; Dai, Feng; Ma, Chao; Liu, Jingbo
2018-05-01
The strength and deformation characteristics of rocks are the most important mechanical properties for rock engineering constructions. A new nonlinear strength criterion is developed for rocks by combining the Hoek-Brown (HB) criterion and the nonlinear unified strength criterion (NUSC). The proposed criterion takes account of the intermediate principal stress effect against HB criterion, as well as being nonlinear in the meridian plane against NUSC. Only three parameters are required to be determined by experiments, including the two HB parameters σ c and m i . The failure surface of the proposed criterion is continuous, smooth and convex. The proposed criterion fits the true triaxial test data well and performs better than the other three existing criteria. Then, by introducing the Geological Strength Index, the proposed criterion is extended to rock masses and predicts the test data well. Finally, based on the proposed criterion, a triaxial elasto-plastic damage model for intact rock is developed. The plastic part is based on the effective stress, whose yield function is developed by the proposed criterion. For the damage part, the evolution function is assumed to have an exponential form. The performance of the constitutive model shows good agreement with the results of experimental tests.
Lu, Dan; Ye, Ming; Meyer, Philip D.; Curtis, Gary P.; Shi, Xiaoqing; Niu, Xu-Feng; Yabusaki, Steve B.
2013-01-01
When conducting model averaging for assessing groundwater conceptual model uncertainty, the averaging weights are often evaluated using model selection criteria such as AIC, AICc, BIC, and KIC (Akaike Information Criterion, Corrected Akaike Information Criterion, Bayesian Information Criterion, and Kashyap Information Criterion, respectively). However, this method often leads to an unrealistic situation in which the best model receives overwhelmingly large averaging weight (close to 100%), which cannot be justified by available data and knowledge. It was found in this study that this problem was caused by using the covariance matrix, CE, of measurement errors for estimating the negative log likelihood function common to all the model selection criteria. This problem can be resolved by using the covariance matrix, Cek, of total errors (including model errors and measurement errors) to account for the correlation between the total errors. An iterative two-stage method was developed in the context of maximum likelihood inverse modeling to iteratively infer the unknown Cek from the residuals during model calibration. The inferred Cek was then used in the evaluation of model selection criteria and model averaging weights. While this method was limited to serial data using time series techniques in this study, it can be extended to spatial data using geostatistical techniques. The method was first evaluated in a synthetic study and then applied to an experimental study, in which alternative surface complexation models were developed to simulate column experiments of uranium reactive transport. It was found that the total errors of the alternative models were temporally correlated due to the model errors. The iterative two-stage method using Cekresolved the problem that the best model receives 100% model averaging weight, and the resulting model averaging weights were supported by the calibration results and physical understanding of the alternative models. Using Cek obtained from the iterative two-stage method also improved predictive performance of the individual models and model averaging in both synthetic and experimental studies.
Criterion-Referenced Testing in Foreign Language Teaching.
ERIC Educational Resources Information Center
Takala, Sauli
A review of literature serves as the basis for a discussion of various aspects of criterion-referenced tests. The aspects discussed are: teaching and evaluation objectives, criterion- and norm-referenced measurement, stages in construction of criterion-referenced tests, construction and selection of items, test validity, and test reliability.…
Evaluation of Criterion Validity for Scales with Congeneric Measures
ERIC Educational Resources Information Center
Raykov, Tenko
2007-01-01
A method for estimating criterion validity of scales with homogeneous components is outlined. It accomplishes point and interval estimation of interrelationship indices between composite scores and criterion variables and is useful for testing hypotheses about criterion validity of measurement instruments. The method can also be used with missing…
The Effectiveness of Circular Equating as a Criterion for Evaluating Equating.
ERIC Educational Resources Information Center
Wang, Tianyou; Hanson, Bradley A.; Harris, Deborah J.
Equating a test form to itself through a chain of equatings, commonly referred to as circular equating, has been widely used as a criterion to evaluate the adequacy of equating. This paper uses both analytical methods and simulation methods to show that this criterion is in general invalid in serving this purpose. For the random groups design done…
RLS Channel Estimation with Adaptive Forgetting Factor for DS-CDMA Frequency-Domain Equalization
NASA Astrophysics Data System (ADS)
Kojima, Yohei; Tomeba, Hiromichi; Takeda, Kazuaki; Adachi, Fumiyuki
Frequency-domain equalization (FDE) based on the minimum mean square error (MMSE) criterion can increase the downlink bit error rate (BER) performance of DS-CDMA beyond that possible with conventional rake combining in a frequency-selective fading channel. FDE requires accurate channel estimation. Recently, we proposed a pilot-assisted channel estimation (CE) based on the MMSE criterion. Using MMSE-CE, the channel estimation accuracy is almost insensitive to the pilot chip sequence, and a good BER performance is achieved. In this paper, we propose a channel estimation scheme using one-tap recursive least square (RLS) algorithm, where the forgetting factor is adapted to the changing channel condition by the least mean square (LMS)algorithm, for DS-CDMA with FDE. We evaluate the BER performance using RLS-CE with adaptive forgetting factor in a frequency-selective fast Rayleigh fading channel by computer simulation.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Nie, K; Pouliot, J; Smith, E
Purpose: To evaluate the performance variations in commercial deformable image registration (DIR) tools for adaptive radiation therapy. Methods: Representative plans from three different anatomical sites, prostate, head-and-neck (HN) and cranial spinal irradiation (CSI) with L-spine boost, were included. Computerized deformed CT images were first generated using virtual DIR QA software (ImSimQA) for each case. The corresponding transformations served as the “reference”. Three commercial software packages MIMVista v5.5 and MIMMaestro v6.0, VelocityAI v2.6.2, and OnQ rts v2.1.15 were tested. The warped contours and doses were compared with the “reference” and among each other. Results: The performance in transferring contours was comparablemore » among all three tools with an average DICE coefficient of 0.81 for all the organs. However, the performance of dose warping accuracy appeared to rely on the evaluation end points. Volume based DVH comparisons were not sensitive enough to illustrate all the detailed variations while isodose assessment on a slice-by-slice basis could be tedious. Point-based evaluation was over-sensitive by having up to 30% hot/cold-spot differences. If adapting the 3mm/3% gamma analysis into the evaluation of dose warping, all three algorithms presented a reasonable level of equivalency. One algorithm had over 10% of the voxels not meeting this criterion for the HN case while another showed disagreement for the CSI case. Conclusion: Overall, our results demonstrated that evaluation based only on the performance of contour transformation could not guarantee the accuracy in dose warping. However, the performance of dose warping accuracy relied on the evaluation methodologies. Nevertheless, as more DIR tools are available for clinical use, the performance could vary at certain degrees. A standard quality assurance criterion with clinical meaning should be established for DIR QA, similar to the gamma index concept, in the near future.« less
NASA Astrophysics Data System (ADS)
Han, Hyung-Suk
2012-12-01
The indoor noise of a ship is usually determined using the A-weighted sound pressure level. However, in order to better understand this phenomenon, evaluation parameters that more accurately reflect the human sense of hearing are required. To find the level of the satisfaction index of the noise inside a naval vessel such as "Loudness" and "Annoyance", psycho-acoustic evaluation of various sound recordings from the naval vessel was performed in a laboratory. The objective of this paper is to develop a single index of "Loudness" and "Annoyance" for noise inside a naval vessel according to a psycho-acoustic evaluation by using psychological responses such as Noise Rating (NR), Noise Criterion (NC), Room Criterion (RC), Preferred Speech Interference Level (PSIL) and loudness level. Additionally, in order to determine a single index of satisfaction for noise such as "Loudness" and "Annoyance", with respect to a human's sense of hearing, a back-propagation neural network is applied.
NASA Technical Reports Server (NTRS)
Gupta, Hoshin V.; Kling, Harald; Yilmaz, Koray K.; Martinez-Baquero, Guillermo F.
2009-01-01
The mean squared error (MSE) and the related normalization, the Nash-Sutcliffe efficiency (NSE), are the two criteria most widely used for calibration and evaluation of hydrological models with observed data. Here, we present a diagnostically interesting decomposition of NSE (and hence MSE), which facilitates analysis of the relative importance of its different components in the context of hydrological modelling, and show how model calibration problems can arise due to interactions among these components. The analysis is illustrated by calibrating a simple conceptual precipitation-runoff model to daily data for a number of Austrian basins having a broad range of hydro-meteorological characteristics. Evaluation of the results clearly demonstrates the problems that can be associated with any calibration based on the NSE (or MSE) criterion. While we propose and test an alternative criterion that can help to reduce model calibration problems, the primary purpose of this study is not to present an improved measure of model performance. Instead, we seek to show that there are systematic problems inherent with any optimization based on formulations related to the MSE. The analysis and results have implications to the manner in which we calibrate and evaluate environmental models; we discuss these and suggest possible ways forward that may move us towards an improved and diagnostically meaningful approach to model performance evaluation and identification.
An investigation of the effects of pitch-roll (de)-coupling on helicopter handling qualities
NASA Technical Reports Server (NTRS)
Ockier, C. J.; Pausder, H. J.; Blanken, C. L.
1995-01-01
An investigation of the effects of pitch-roll coupling on helicopter handling qualities was performed by the US Army and DLR, using a NASA ground-based and a DLR inflight simulator. Over 90 different coupling configurations were evaluated using a roll-axis tracking task. The results show that although the current ADS-33C coupling criterion discriminates against those types of coupling typical of conventionally controlled helicopters, it not always suited for the prediction of handling qualities of helicopters with modern control systems. Based on the observation that high frequency inputs during tracking are used to alleviate coupling, a frequency domain pitch-roll coupling criterion that uses the average coupling ratio between the bandwidth and neutral stability frequency is formulated. This criterion provides a more comprehensive coverage with respect to the different types of coupling and shows excellent consistency.
Repeated readings and science: Fluency with expository passages
NASA Astrophysics Data System (ADS)
Kostewicz, Douglas E.
The current study investigated the effects of repeated readings to a fluency criterion (RRFC) for seven students with disabilities using science text. The study employed a single subject design, specifically, two multiple probe multiple baselines across subjects, to evaluate the effects of the RRFC intervention. Results indicated that students met criterion (200 or more correct words per minute with 2 or fewer errors) on four consecutive passages. A majority of students displayed accelerations to correct words per minute and decelerations to incorrect words per minute on successive initial, intervention readings suggesting reading transfer. Students' reading scores during posttest and maintenance out performed pre-test and baseline readings provided additional measures of reading transfer. For a relationship to comprehension, students scored higher on oral retell measures after meeting criterion as compared to initial readings. Overall, the research findings suggested that the RRFC intervention improves science reading fluency for students with disabilities, and may also indirectly benefit comprehension.
Cook, Karon F; Kallen, Michael A; Bombardier, Charles; Bamer, Alyssa M; Choi, Seung W; Kim, Jiseon; Salem, Rana; Amtmann, Dagmar
2017-01-01
To evaluate whether items of three measures of depressive symptoms function differently in persons with spinal cord injury (SCI) than in persons from a primary care sample. This study was a retrospective analysis of responses to the Patient Health Questionnaire depression scale, the Center for Epidemiological Studies Depression scale, and the National Institutes of Health Patient-Reported Outcomes Measurement Information System (PROMIS ® ) version 1.0 eight-item depression short form 8b (PROMIS-D). The presence of differential item function (DIF) was evaluated using ordinal logistic regression. No items of any of the three target measures were flagged for DIF based on standard criteria. In a follow-up sensitivity analyses, the criterion was changed to make the analysis more sensitive to potential DIF. Scores were corrected for DIF flagged under this criterion. Minimal differences were found between the original scores and those corrected for DIF under the sensitivity criterion. The three depression screening measures evaluated in this study did not perform differently in samples of individuals with SCI compared to general and community samples. Transdiagnostic symptoms did not appear to spuriously inflate depression severity estimates when administered to people with SCI.
ERIC Educational Resources Information Center
Shriver, Edgar L.; And Others
This volume reports an effort to use the video media as an approach for the preparation of a battery of symbolic tests that would be empirically valid substitutes for criterion referenced Job Task Performance Tests. The graphic symbolic tests require the storage of a large amount of pictorial information which must be searched rapidly for display.…
NASA Astrophysics Data System (ADS)
Diamant, Idit; Shalhon, Moran; Goldberger, Jacob; Greenspan, Hayit
2016-03-01
Classification of clustered breast microcalcifications into benign and malignant categories is an extremely challenging task for computerized algorithms and expert radiologists alike. In this paper we present a novel method for feature selection based on mutual information (MI) criterion for automatic classification of microcalcifications. We explored the MI based feature selection for various texture features. The proposed method was evaluated on a standardized digital database for screening mammography (DDSM). Experimental results demonstrate the effectiveness and the advantage of using the MI-based feature selection to obtain the most relevant features for the task and thus to provide for improved performance as compared to using all features.
NASA Technical Reports Server (NTRS)
Homem De Mello, Luiz S.; Sanderson, Arthur C.
1991-01-01
The authors introduce two criteria for the evaluation and selection of assembly plans. The first criterion is to maximize the number of different sequences in which the assembly tasks can be executed. The second criterion is to minimize the total assembly time through simultaneous execution of assembly tasks. An algorithm that performs a heuristic search for the best assembly plan over the AND/OR graph representation of assembly plans is discussed. Admissible heuristics for each of the two criteria introduced are presented. Some implementation issues that affect the computational efficiency are addressed.
In this paper, the methodological concept of landscape optimization presented by Seppelt and Voinov [Ecol. Model. 151 (2/3) (2002) 125] is analyzed. Two aspects are chosen for detailed study. First, we generalize the performance criterion to assess a vector of ecosystem functi...
Computation of Anisotropic Bi-Material Interfacial Fracture Parameters and Delamination Creteria
NASA Technical Reports Server (NTRS)
Chow, W-T.; Wang, L.; Atluri, S. N.
1998-01-01
This report documents the recent developments in methodologies for the evaluation of the integrity and durability of composite structures, including i) the establishment of a stress-intensity-factor based fracture criterion for bimaterial interfacial cracks in anisotropic materials (see Sec. 2); ii) the development of a virtual crack closure integral method for the evaluation of the mixed-mode stress intensity factors for a bimaterial interfacial crack (see Sec. 3). Analytical and numerical results show that the proposed fracture criterion is a better fracture criterion than the total energy release rate criterion in the characterization of the bimaterial interfacial cracks. The proposed virtual crack closure integral method is an efficient and accurate numerical method for the evaluation of mixed-mode stress intensity factors.
Klußmann, André; Gebhardt, Hansjürgen; Rieger, Monika; Liebers, Falk; Steinberg, Ulf
2012-01-01
Upper extremity musculoskeletal symptoms and disorders are common in the working population. The economic and social impact of such disorders is considerable. Long-time, dynamic repetitive exposure of the hand-arm system during manual handling operations (MHO) alone or in combination with static and postural effort are recognised as causes of musculoskeletal symptoms and disorders. The assessment of these manual work tasks is crucial to estimate health risks of exposed employees. For these work tasks, a new method for the assessment of the working conditions was developed and a validation study was performed. The results suggest satisfying criterion validity and moderate objectivity of the KIM-MHO draft 2007. The method was modified and evaluated again. It is planned to release a new version of KIM-MHO in spring 2012.
Failure prediction of thin beryllium sheets used in spacecraft structures
NASA Technical Reports Server (NTRS)
Roschke, Paul N.; Mascorro, Edward; Papados, Photios; Serna, Oscar R.
1991-01-01
The primary objective of this study is to develop a method for prediction of failure of thin beryllium sheets that undergo complex states of stress. Major components of the research include experimental evaluation of strength parameters for cross-rolled beryllium sheet, application of the Tsai-Wu failure criterion to plate bending problems, development of a high order failure criterion, application of the new criterion to a variety of structures, and incorporation of both failure criteria into a finite element code. A Tsai-Wu failure model for SR-200 sheet material is developed from available tensile data, experiments carried out by NASA on two circular plates, and compression and off-axis experiments performed in this study. The failure surface obtained from the resulting criterion forms an ellipsoid. By supplementing experimental data used in the the two-dimensional criterion and modifying previously suggested failure criteria, a multi-dimensional failure surface is proposed for thin beryllium structures. The new criterion for orthotropic material is represented by a failure surface in six-dimensional stress space. In order to determine coefficients of the governing equation, a number of uniaxial, biaxial, and triaxial experiments are required. Details of these experiments and a complementary ultrasonic investigation are described in detail. Finally, validity of the criterion and newly determined mechanical properties is established through experiments on structures composed of SR200 sheet material. These experiments include a plate-plug arrangement under a complex state of stress and a series of plates with an out-of-plane central point load. Both criteria have been incorporated into a general purpose finite element analysis code. Numerical simulation incrementally applied loads to a structural component that is being designed and checks each nodal point in the model for exceedance of a failure criterion. If stresses at all locations do not exceed the failure criterion, the load is increased and the process is repeated. Failure results for the plate-plug and clamped plate tests are accurate to within 2 percent.
Mayorga-Vega, Daniel; Bocanegra-Parrilla, Raúl; Ornelas, Martha; Viciana, Jesús
2016-01-01
Objectives The main purpose of the present meta-analysis was to examine the criterion-related validity of the distance- and time-based walk/run tests for estimating cardiorespiratory fitness among apparently healthy children and adults. Materials and Methods Relevant studies were searched from seven electronic bibliographic databases up to August 2015 and through other sources. The Hunter-Schmidt’s psychometric meta-analysis approach was conducted to estimate the population criterion-related validity of the following walk/run tests: 5,000 m, 3 miles, 2 miles, 3,000 m, 1.5 miles, 1 mile, 1,000 m, ½ mile, 600 m, 600 yd, ¼ mile, 15 min, 12 min, 9 min, and 6 min. Results From the 123 included studies, a total of 200 correlation values were analyzed. The overall results showed that the criterion-related validity of the walk/run tests for estimating maximum oxygen uptake ranged from low to moderate (rp = 0.42–0.79), with the 1.5 mile (rp = 0.79, 0.73–0.85) and 12 min walk/run tests (rp = 0.78, 0.72–0.83) having the higher criterion-related validity for distance- and time-based field tests, respectively. The present meta-analysis also showed that sex, age and maximum oxygen uptake level do not seem to affect the criterion-related validity of the walk/run tests. Conclusions When the evaluation of an individual’s maximum oxygen uptake attained during a laboratory test is not feasible, the 1.5 mile and 12 min walk/run tests represent useful alternatives for estimating cardiorespiratory fitness. As in the assessment with any physical fitness field test, evaluators must be aware that the performance score of the walk/run field tests is simply an estimation and not a direct measure of cardiorespiratory fitness. PMID:26987118
Schiffman, Eric L; Truelove, Edmond L; Ohrbach, Richard; Anderson, Gary C; John, Mike T; List, Thomas; Look, John O
2010-01-01
The purpose of the Research Diagnostic Criteria for Temporomandibular Disorders (RDC/TMD) Validation Project was to assess the diagnostic validity of this examination protocol. The aim of this article is to provide an overview of the project's methodology, descriptive statistics, and data for the study participant sample. This article also details the development of reliable methods to establish the reference standards for assessing criterion validity of the Axis I RDC/TMD diagnoses. The Axis I reference standards were based on the consensus of two criterion examiners independently performing a comprehensive history, clinical examination, and evaluation of imaging. Intersite reliability was assessed annually for criterion examiners and radiologists. Criterion examination reliability was also assessed within study sites. Study participant demographics were comparable to those of participants in previous studies using the RDC/TMD. Diagnostic agreement of the criterion examiners with each other and with the consensus-based reference standards was excellent with all kappas > or = 0.81, except for osteoarthrosis (moderate agreement, k = 0.53). Intrasite criterion examiner agreement with reference standards was excellent (k > or = 0.95). Intersite reliability of the radiologists for detecting computed tomography-disclosed osteoarthrosis and magnetic resonance imaging-disclosed disc displacement was good to excellent (k = 0.71 and 0.84, respectively). The Validation Project study population was appropriate for assessing the reliability and validity of the RDC/TMD Axis I and II. The reference standards used to assess the validity of Axis I TMD were based on reliable and clinically credible methods.
A Five-Year Evaluation of Examination Structure in a Cardiovascular Pharmacotherapy Course
Kolar, Claire; Janke, Kristin K.
2015-01-01
Objective. To evaluate the composition and effectiveness as an assessment tool of a criterion-referenced examination comprised of clinical cases tied to practice decisions, to examine the effect of varying audience response system (ARS) questions on student examination preparation, and to articulate guidelines for structuring examinations to maximize evaluation of student learning. Design. Multiple-choice items developed over 5 years were evaluated using Bloom’s Taxonomy classification, point biserial correlation, item difficulty, and grade distribution. In addition, examination items were classified into categories based on similarity to items used in ARS preparation. Assessment. As the number of items directly tied to clinical practice rose, Bloom’s Taxonomy level and item difficulty also rose. In examination years where Bloom’s levels were high but preparation was minimal, average grade distribution was lower compared with years in which student preparation was higher. Conclusion. Criterion-referenced examinations can benefit from systematic evaluation of their composition and effectiveness as assessment tools. Calculated design and delivery of classroom preparation is an asset in improving examination performance on rigorous, practice-relevant examinations. PMID:27168611
NASA Astrophysics Data System (ADS)
Krapels, Keith; Deaver, Dawne; Driggers, Ronald
2006-09-01
The new emphasis on Anti-Terrorism and Force Protection (AT/FP), for both shore and sea platform protection, has resulted in a need for infrared imager design and evaluation tools which demonstrate field performance against U.S. Navy AT/FP requirements. In the design of infrared imaging systems for target acquisition, a discrimination criterion is required for successful sensor realization. It characterizes the difficulty of the task being performed by the observer and varies for different target sets. This criterion is used in both assessment of existing infrared sensor and in the design of new conceptual sensors. In this experiment, we collected 12 small craft signatures (military and civilian) in the visible band during the day and the LWIR and MWIR spectra in both the day and the night environments. These signatures were processed to determine the targets' characteristic dimension and contrast. They were also processed to bandlimit the signature's spatial information content (simulating longer range) and a perception experiment was performed to determine the task difficulty (N 50 and V 50). The results are presented in this paper and can be used for Navy and Coast Guard imaging infrared sensor design and evaluation.
Yu, Fang; Chen, Ming-Hui; Kuo, Lynn; Talbott, Heather; Davis, John S
2015-08-07
Recently, the Bayesian method becomes more popular for analyzing high dimensional gene expression data as it allows us to borrow information across different genes and provides powerful estimators for evaluating gene expression levels. It is crucial to develop a simple but efficient gene selection algorithm for detecting differentially expressed (DE) genes based on the Bayesian estimators. In this paper, by extending the two-criterion idea of Chen et al. (Chen M-H, Ibrahim JG, Chi Y-Y. A new class of mixture models for differential gene expression in DNA microarray data. J Stat Plan Inference. 2008;138:387-404), we propose two new gene selection algorithms for general Bayesian models and name these new methods as the confident difference criterion methods. One is based on the standardized differences between two mean expression values among genes; the other adds the differences between two variances to it. The proposed confident difference criterion methods first evaluate the posterior probability of a gene having different gene expressions between competitive samples and then declare a gene to be DE if the posterior probability is large. The theoretical connection between the proposed first method based on the means and the Bayes factor approach proposed by Yu et al. (Yu F, Chen M-H, Kuo L. Detecting differentially expressed genes using alibrated Bayes factors. Statistica Sinica. 2008;18:783-802) is established under the normal-normal-model with equal variances between two samples. The empirical performance of the proposed methods is examined and compared to those of several existing methods via several simulations. The results from these simulation studies show that the proposed confident difference criterion methods outperform the existing methods when comparing gene expressions across different conditions for both microarray studies and sequence-based high-throughput studies. A real dataset is used to further demonstrate the proposed methodology. In the real data application, the confident difference criterion methods successfully identified more clinically important DE genes than the other methods. The confident difference criterion method proposed in this paper provides a new efficient approach for both microarray studies and sequence-based high-throughput studies to identify differentially expressed genes.
Larrabee, Glenn J
2014-01-01
Bilder, Sugar, and Hellemann (2014 this issue) contend that empirical support is lacking for use of multiple performance validity tests (PVTs) in evaluation of the individual case, differing from the conclusions of Davis and Millis (2014), and Larrabee (2014), who found no substantial increase in false positive rates using a criterion of failure of ≥ 2 PVTs and/or Symptom Validity Tests (SVTs) out of multiple tests administered. Reconsideration of data presented in Larrabee (2014) supports a criterion of ≥ 2 out of up to 7 PVTs/SVTs, as keeping false positive rates close to and in most cases below 10% in cases with bona fide neurologic, psychiatric, and developmental disorders. Strategies to minimize risk of false positive error are discussed, including (1) adjusting individual PVT cutoffs or criterion for number of PVTs failed, for examinees who have clinical histories placing them at risk for false positive identification (e.g., severe TBI, schizophrenia), (2) using the history of the individual case to rule out conditions known to result in false positive errors, (3) using normal performance in domains mimicked by PVTs to show that sufficient native ability exists for valid performance on the PVT(s) that have been failed, and (4) recognizing that as the number of PVTs/SVTs failed increases, the likelihood of valid clinical presentation decreases, with a corresponding increase in the likelihood of invalid test performance and symptom report.
Evaluation of Measurement Instrument Criterion Validity in Finite Mixture Settings
ERIC Educational Resources Information Center
Raykov, Tenko; Marcoulides, George A.; Li, Tenglong
2016-01-01
A method for evaluating the validity of multicomponent measurement instruments in heterogeneous populations is discussed. The procedure can be used for point and interval estimation of criterion validity of linear composites in populations representing mixtures of an unknown number of latent classes. The approach permits also the evaluation of…
ERIC Educational Resources Information Center
Cory, Charles H.
This report presents data concerning the validity of a set of experimental computerized and paper-and-pencil tests for measures of on-job performance on global and job elements. It reports on the usefulness of 30 experimental and operational variables for predicting marks on 42 job elements and on a global criterion for Electrician's Mate,…
Criterion-Referenced Item Banking in Electronics: Appendix G. Final Report.
ERIC Educational Resources Information Center
Gorth, William Phillip; Swaminathan, Hariharan
This is one of the outcomes of the work of the Massachusetts Evaluation Service Center for Occupational Education (ESCOE). After an overview of the Performance Test Development Project, a summary of the major products and byproducts is presented. The major products are: (1) a set of clearly defined, well-structured, and consistent behavioral…
Evaluation Criterion for Quality Assessment of E-Learning Content
ERIC Educational Resources Information Center
Al-Alwani, Abdulkareem
2014-01-01
Research trends related to e-learning systems are oriented towards increasing the efficiency and capacity of the systems, thus they reflect a large variance in performance when considering content conformity and quality standards. The Framework related to standardisation of digital content for e-learning systems is likely to play a significant…
Federal Register 2010, 2011, 2012, 2013, 2014
2010-09-09
... integral part of the evaluative process used by the agency to ensure the operational safety performance of... the AARM. The reason this additional criterion has been added is to allow NRC's senior management to..., Rockville, Maryland. NRC's Agencywide Documents Access and Management System (ADAMS): Publicly available...
Boutet, Isabelle; Collin, Charles A; MacLeod, Lindsey S; Messier, Claude; Holahan, Matthew R; Berry-Kravis, Elizabeth; Gandhi, Reno M; Kogan, Cary S
2018-01-01
To generate meaningful information, translational research must employ paradigms that allow extrapolation from animal models to humans. However, few studies have evaluated translational paradigms on the basis of defined validation criteria. We outline three criteria for validating translational paradigms. We then evaluate the Hebb-Williams maze paradigm (Hebb and Williams, 1946; Rabinovitch and Rosvold, 1951) on the basis of these criteria using Fragile X syndrome (FXS) as model disease. We focused on this paradigm because it allows direct comparison of humans and animals on tasks that are behaviorally equivalent (criterion #1) and because it measures spatial information processing, a cognitive domain for which FXS individuals and mice show impairments as compared to controls (criterion #2). We directly compared the performance of affected humans and mice across different experimental conditions and measures of behavior to identify which conditions produce comparable patterns of results in both species. Species differences were negligible for Mazes 2, 4, and 5 irrespective of the presence of visual cues, suggesting that these mazes could be used to measure spatial learning in both species. With regards to performance on the first trial, which reflects visuo-spatial problem solving, Mazes 5 and 9 without visual cues produced the most consistent results. We conclude that the Hebb-Williams mazes paradigm has the potential to be utilized in translational research to measure comparable cognitive functions in FXS humans and animals (criterion #3).
Fentz, Hanne N; Arendt, Mikkel; O'Toole, Mia S; Hoffart, Asle; Hougaard, Esben
2014-09-01
Cognitive models of panic disorder (PD) with or without agoraphobia have stressed the role of catastrophic beliefs of bodily symptoms as a central mediating variable of the efficacy of cognitive behavioral therapy (CBT). Perceived ability to cope with or control panic attacks, panic self-efficacy, has also been proposed to play a key role in therapeutic change; however, this cognitive factor has received much less attention in research. The aim of the present review is to evaluate panic self-efficacy as a mediator of therapeutic outcome in CBT for PD using descriptive and meta-analytic procedures. We performed systematic literature searches, and included and evaluated 33 studies according to four criteria for establishing mediation. Twenty-eight studies, including nine randomized waitlist-controlled studies, showed strong support for CBT improving panic self-efficacy (criterion 1); ten showed an association between change in panic self-efficacy and change in outcome during therapy (criterion 2); three tested, and one established formal statistical mediation of panic self-efficacy (criterion 3); while four tested and three found change in panic self-efficacy occurring before the reduction of panic severity (criterion 4). Although none of the studies fulfilled all of the four criteria, results provide some support for panic self-efficacy as a mediator of outcome in CBT for PD, generally on par with catastrophic beliefs in the reviewed studies. Copyright © 2014 Elsevier Ltd. All rights reserved.
Medical privacy protection based on granular computing.
Wang, Da-Wei; Liau, Churn-Jung; Hsu, Tsan-Sheng
2004-10-01
Based on granular computing methodology, we propose two criteria to quantitatively measure privacy invasion. The total cost criterion measures the effort needed for a data recipient to find private information. The average benefit criterion measures the benefit a data recipient obtains when he received the released data. These two criteria remedy the inadequacy of the deterministic privacy formulation proposed in Proceedings of Asia Pacific Medical Informatics Conference, 2000; Int J Med Inform 2003;71:17-23. Granular computing methodology provides a unified framework for these quantitative measurements and previous bin size and logical approaches. These two new criteria are implemented in a prototype system Cellsecu 2.0. Preliminary system performance evaluation is conducted and reviewed.
Saraf, Sanatan; Mathew, Thomas; Roy, Anindya
2015-01-01
For the statistical validation of surrogate endpoints, an alternative formulation is proposed for testing Prentice's fourth criterion, under a bivariate normal model. In such a setup, the criterion involves inference concerning an appropriate regression parameter, and the criterion holds if the regression parameter is zero. Testing such a null hypothesis has been criticized in the literature since it can only be used to reject a poor surrogate, and not to validate a good surrogate. In order to circumvent this, an equivalence hypothesis is formulated for the regression parameter, namely the hypothesis that the parameter is equivalent to zero. Such an equivalence hypothesis is formulated as an alternative hypothesis, so that the surrogate endpoint is statistically validated when the null hypothesis is rejected. Confidence intervals for the regression parameter and tests for the equivalence hypothesis are proposed using bootstrap methods and small sample asymptotics, and their performances are numerically evaluated and recommendations are made. The choice of the equivalence margin is a regulatory issue that needs to be addressed. The proposed equivalence testing formulation is also adopted for other parameters that have been proposed in the literature on surrogate endpoint validation, namely, the relative effect and proportion explained.
A Joint Optimization Criterion for Blind DS-CDMA Detection
NASA Astrophysics Data System (ADS)
Durán-Díaz, Iván; Cruces-Alvarez, Sergio A.
2006-12-01
This paper addresses the problem of the blind detection of a desired user in an asynchronous DS-CDMA communications system with multipath propagation channels. Starting from the inverse filter criterion introduced by Tugnait and Li in 2001, we propose to tackle the problem in the context of the blind signal extraction methods for ICA. In order to improve the performance of the detector, we present a criterion based on the joint optimization of several higher-order statistics of the outputs. An algorithm that optimizes the proposed criterion is described, and its improved performance and robustness with respect to the near-far problem are corroborated through simulations. Additionally, a simulation using measurements on a real software-radio platform at 5 GHz has also been performed.
ERIC Educational Resources Information Center
Messick, Samuel
Cognitive styles--defined as information processing habits--should be considered as a criterion variable in the evaluation of instruction. Research findings identify the characteristics of different cognitive stles. Used in educational practice and evaluation, cognitive styles would be new process variables extending the assessment of mental…
1992-01-01
aircraft it repairs, LA tracks negotiated flow versus actual flow by tail number and the number of days delivered early or late. This directorate, as...elements are defined as follows: Performance criterion: The relative element used to evaluate macro, micro, long -term, short-term, flow, static, functional...constraint is defined as "anything that limits the system from achieving higher performance versus its goal" (Goldratt, 1989, p. 1). The following
Choi, Sang Hyun; Byun, Jae Ho; Lim, Young-Suk; Yu, Eunsil; Lee, So Jung; Kim, So Yeon; Won, Hyung Jin; Shin, Yong Moon; Kim, Pyo Nyun
2016-05-01
Current diagnostic imaging criteria for hepatocellular carcinoma (HCC) are dedicated to imaging with nonspecific extracellular contrast agents. This study aimed to evaluate diagnostic criteria for HCC ⩽3 cm on magnetic resonance imaging (MRI) with a hepatocyte-specific contrast agent through an inception cohort study. Of 291 patients with chronic liver disease and new nodules of 1-3 cm in diameter at surveillance ultrasonography, 295 solid nodules (194 HCCs, 98 benign nodules, and three other malignancies) in 198 patients with a confirmed final diagnosis or ⩾24 months follow-up were evaluated on gadoxetic acid-enhanced MRI. Through univariate and multivariate logistic regression analyses, various diagnostic criteria were developed by combining significant MRI findings for diagnosing HCC. The diagnostic performance of each criterion was compared with that of the European Association for the Study of the Liver (EASL) criteria. Four MRI findings (arterial-phase hyperintensity, transitional-phase hypointensity, hepatobiliary-phase hypointensity, and rim enhancement) were independently significant for diagnosis of HCC ⩽3 cm. For whole nodules, EASL criteria showed the best performance for diagnosing HCC (sensitivity, 83.5%; specificity, 81.2%). For nodules ⩽2 cm in diameter, a new criterion (arterial-phase hyperintensity and hepatobiliary-phase hypointensity) showed a significantly higher sensitivity than that of the EASL criteria (83.0% vs. 74.5%, p=0.008), without a significantly different specificity (76.7% vs. 81.1%, p=0.125). EASL criteria exhibit the best diagnostic performance for HCC ⩽3 cm on hepatocyte-specific contrast-enhanced MRI. A newly identified criterion (arterial-phase hyperintensity and hepatobiliary-phase hypointensity) may increase the diagnostic sensitivity of small (⩽2 cm) HCC. Copyright © 2016 European Association for the Study of the Liver. Published by Elsevier B.V. All rights reserved.
Zhuang, Ziqing; Bergman, Michael; Lei, Zhipeng; Niezgoda, George; Shaffer, Ronald
2017-01-01
This study assessed key test parameters and pass/fail criteria options for developing a respirator fit capability (RFC) test for half-mask air-purifying particulate respirators. Using a 25-subject test panel, benchmark RFC data were collected for 101 National Institute for Occupational Safety and Health-certified respirator models. These models were further grouped into 61 one-, two-, or three-size families. Fit testing was done using a PortaCount® Plus with N95-Companion accessory and an Occupational Safety and Health Administration-accepted quantitative fit test protocol. Three repeated tests (donnings) per subject/respirator model combination were performed. The panel passing rate (PPR) (number or percentage of the 25-subject panel achieving acceptable fit) was determined for each model using five different alternative criteria for determining acceptable fit. When the 101 models are evaluated individually (i.e., not grouped by families), the percentages of models capable of fitting >75% (19/25 subjects) of the panel were 29% and 32% for subjects achieving a fit factor ≥100 for at least one of the first two donnings and at least one of three donnings, respectively. When the models are evaluated grouped into families and using >75% of panel subjects achieving a fit factor ≥100 for at least one of two donnings as the PPR pass/fail criterion, 48% of all models can pass. When >50% (13/25 subjects) of panel subjects was the PPR criterion, the percentage of passing models increased to 70%. Testing respirators grouped into families and evaluating the first two donnings for each of two respirator sizes provided the best balance between meeting end user expectations and creating a performance bar for manufacturers. Specifying the test criterion for a subject obtaining acceptable fit as achieving a fit factor ≥100 on at least one out of the two donnings is reasonable because a majority of existing respirator families can achieve an PPR of >50% using this criterion. The different test criteria can be considered by standards development organizations when developing standards. PMID:28278067
Zhuang, Ziqing; Bergman, Michael; Lei, Zhipeng; Niezgoda, George; Shaffer, Ronald
2017-06-01
This study assessed key test parameters and pass/fail criteria options for developing a respirator fit capability (RFC) test for half-mask air-purifying particulate respirators. Using a 25-subject test panel, benchmark RFC data were collected for 101 National Institute for Occupational Safety and Health-certified respirator models. These models were further grouped into 61 one-, two-, or three-size families. Fit testing was done using a PortaCount® Plus with N95-Companion accessory and an Occupational Safety and Health Administration-accepted quantitative fit test protocol. Three repeated tests (donnings) per subject/respirator model combination were performed. The panel passing rate (PPR) (number or percentage of the 25-subject panel achieving acceptable fit) was determined for each model using five different alternative criteria for determining acceptable fit. When the 101 models are evaluated individually (i.e., not grouped by families), the percentages of models capable of fitting >75% (19/25 subjects) of the panel were 29% and 32% for subjects achieving a fit factor ≥100 for at least one of the first two donnings and at least one of three donnings, respectively. When the models are evaluated grouped into families and using >75% of panel subjects achieving a fit factor ≥100 for at least one of two donnings as the PPR pass/fail criterion, 48% of all models can pass. When >50% (13/25 subjects) of panel subjects was the PPR criterion, the percentage of passing models increased to 70%. Testing respirators grouped into families and evaluating the first two donnings for each of two respirator sizes provided the best balance between meeting end user expectations and creating a performance bar for manufacturers. Specifying the test criterion for a subject obtaining acceptable fit as achieving a fit factor ≥100 on at least one out of the two donnings is reasonable because a majority of existing respirator families can achieve an PPR of >50% using this criterion. The different test criteria can be considered by standards development organizations when developing standards.
Landing flying qualities evaluation criteria for augmented aircraft
NASA Technical Reports Server (NTRS)
Radford, R. C.; Smith, R.; Bailey, R.
1980-01-01
The criteria evaluated were: Calspan Neal-Smith; Onstott (Northrop Time Domain); McDonnell-Douglas Equivalent System Approach; R. H. Smith Criterion. Each criterion was applied to the same set of longitudinal approach and landing flying qualities data. A revised version of the Neal-Smith criterion which is applicable to the landing task was developed and tested against other landing flying qualities data. Results indicated that both the revised Neal-Smith criterion and the Equivalent System Approach are good discriminators of pitch landing flying qualities; Neal-Smith has particular merit as a design guide, while the Equivalent System Approach is well suited for development of appropriate military specification requirements applicable to highly augmented aircraft.
Schiffman, Eric L.; Truelove, Edmond L.; Ohrbach, Richard; Anderson, Gary C.; John, Mike T.; List, Thomas; Look, John O.
2011-01-01
AIMS The purpose of the Research Diagnostic Criteria for Temporomandibular Disorders (RDC/TMD) Validation Project was to assess the diagnostic validity of this examination protocol. An overview is presented, including Axis I and II methodology and descriptive statistics for the study participant sample. This paper details the development of reliable methods to establish the reference standards for assessing criterion validity of the Axis I RDC/TMD diagnoses. Validity testing for the Axis II biobehavioral instruments was based on previously validated reference standards. METHODS The Axis I reference standards were based on the consensus of 2 criterion examiners independently performing a comprehensive history, clinical examination, and evaluation of imaging. Intersite reliability was assessed annually for criterion examiners and radiologists. Criterion exam reliability was also assessed within study sites. RESULTS Study participant demographics were comparable to those of participants in previous studies using the RDC/TMD. Diagnostic agreement of the criterion examiners with each other and with the consensus-based reference standards was excellent with all kappas ≥ 0.81, except for osteoarthrosis (moderate agreement, k = 0.53). Intrasite criterion exam agreement with reference standards was excellent (k ≥ 0.95). Intersite reliability of the radiologists for detecting computed tomography-disclosed osteoarthrosis and magnetic resonance imaging-disclosed disc displacement was good to excellent (k = 0.71 and 0.84, respectively). CONCLUSION The Validation Project study population was appropriate for assessing the reliability and validity of the RDC/TMD Axis I and II. The reference standards used to assess the validity of Axis I TMD were based on reliable and clinically credible methods. PMID:20213028
NASA Astrophysics Data System (ADS)
Huang, H. E.; Liang, C. P.; Jang, C. S.; Chen, J. S.
2015-12-01
Land subsidence due to groundwater exploitation is an urgent environmental problem in Choushui river alluvial fan in Taiwan. Aquifer storage and recovery (ASR), where excess surface water is injected into subsurface aquifers for later recovery, is one promising strategy for managing surplus water and may overcome water shortages. The performance of an ASR scheme is generally evaluated in terms of recovery efficiency, which is defined as percentage of water injected in to a system in an ASR site that fulfills the targeted water quality criterion. Site selection of an ASR scheme typically faces great challenges, due to the spatial variability of groundwater quality and hydrogeological condition. This study proposes a novel method for the ASR site selection based on drinking quality criterion. Simplified groundwater flow and contaminant transport model spatial distributions of the recovery efficiency with the help of the groundwater quality, hydrological condition, ASR operation. The results of this study may provide government administrator for establishing reliable ASR scheme.
The SEER Readability Technique: How Practicable is It?
ERIC Educational Resources Information Center
Duffelmeyer, Frederick A.
1982-01-01
Evaluates the practicability of the Singer Eyeball Estimate of Readability (SEER) techniques with 32 college students. Reveals that only two of the students met SEER's criterion for being considered acceptable judges. Concludes that the criterion is overly stringent and proposes a revised criterion designed to make the SEER technique more…
Standards for Evaluating Criterion-Referenced Tests.
ERIC Educational Resources Information Center
Walker, Clinton B.
Standards for evaluating criterion-referenced tests are presented. Twenty-one standards, grouped in three categories, are discussed. Category one is defined as measurement properties and is comprised of conceptual validity, including description of the domain, test item agreement with objectives, and item representativeness of the objectives; and…
Fernández-Friera, Leticia; García-Ruiz, José Manuel; García-Álvarez, Ana; Fernández-Jiménez, Rodrigo; Sánchez-González, Javier; Rossello, Xavier; Gómez-Talavera, Sandra; López-Martín, Gonzalo J; Pizarro, Gonzalo; Fuster, Valentín; Ibáñez, Borja
2017-05-01
Area at risk (AAR) quantification is important to evaluate the efficacy of cardioprotective therapies. However, postinfarction AAR assessment could be influenced by the infarcted coronary territory. Our aim was to determine the accuracy of T 2 -weighted short tau triple-inversion recovery (T 2 W-STIR) cardiac magnetic resonance (CMR) imaging for accurate AAR quantification in anterior, lateral, and inferior myocardial infarctions. Acute reperfused myocardial infarction was experimentally induced in 12 pigs, with 40-minute occlusion of the left anterior descending (n = 4), left circumflex (n = 4), and right coronary arteries (n = 4). Perfusion CMR was performed during selective intracoronary gadolinium injection at the coronary occlusion site (in vivo criterion standard) and, additionally, a 7-day CMR, including T 2 W-STIR sequences, was performed. Finally, all animals were sacrificed and underwent postmortem Evans blue staining (classic criterion standard). The concordance between the CMR-based criterion standard and T 2 W-STIR to quantify AAR was high for anterior and inferior infarctions (r = 0.73; P = .001; mean error = 0.50%; limits = -12.68%-13.68% and r = 0.87; P = .001; mean error = -1.5%; limits = -8.0%-5.8%, respectively). Conversely, the correlation for the circumflex territories was poor (r = 0.21, P = .37), showing a higher mean error and wider limits of agreement. A strong correlation between pathology and the CMR-based criterion standard was observed (r = 0.84, P < .001; mean error = 0.91%; limits = -7.55%-9.37%). T 2 W-STIR CMR sequences are accurate to determine the AAR for anterior and inferior infarctions; however, their accuracy for lateral infarctions is poor. These findings may have important implications for the design and interpretation of clinical trials evaluating the effectiveness of cardioprotective therapies. Copyright © 2016 Sociedad Española de Cardiología. Published by Elsevier España, S.L.U. All rights reserved.
When is hub gene selection better than standard meta-analysis?
Langfelder, Peter; Mischel, Paul S; Horvath, Steve
2013-01-01
Since hub nodes have been found to play important roles in many networks, highly connected hub genes are expected to play an important role in biology as well. However, the empirical evidence remains ambiguous. An open question is whether (or when) hub gene selection leads to more meaningful gene lists than a standard statistical analysis based on significance testing when analyzing genomic data sets (e.g., gene expression or DNA methylation data). Here we address this question for the special case when multiple genomic data sets are available. This is of great practical importance since for many research questions multiple data sets are publicly available. In this case, the data analyst can decide between a standard statistical approach (e.g., based on meta-analysis) and a co-expression network analysis approach that selects intramodular hubs in consensus modules. We assess the performance of these two types of approaches according to two criteria. The first criterion evaluates the biological insights gained and is relevant in basic research. The second criterion evaluates the validation success (reproducibility) in independent data sets and often applies in clinical diagnostic or prognostic applications. We compare meta-analysis with consensus network analysis based on weighted correlation network analysis (WGCNA) in three comprehensive and unbiased empirical studies: (1) Finding genes predictive of lung cancer survival, (2) finding methylation markers related to age, and (3) finding mouse genes related to total cholesterol. The results demonstrate that intramodular hub gene status with respect to consensus modules is more useful than a meta-analysis p-value when identifying biologically meaningful gene lists (reflecting criterion 1). However, standard meta-analysis methods perform as good as (if not better than) a consensus network approach in terms of validation success (criterion 2). The article also reports a comparison of meta-analysis techniques applied to gene expression data and presents novel R functions for carrying out consensus network analysis, network based screening, and meta analysis.
ERIC Educational Resources Information Center
Woodburn, Jim; Sutcliffe, Nick
1996-01-01
The Objective Structured Clinical Examination (OSCE), initially developed for undergraduate medical education, has been adapted for assessment of clinical skills in podiatry students. A 12-month pilot study found the test had relatively low levels of reliability, high construct and criterion validity, and good stability of performance over time.…
Issues in Evaluating Importance Weighting in Quality of Life Measures
ERIC Educational Resources Information Center
Hsieh, Chang-ming
2013-01-01
For most empirical research investigating the topic of importance weighting in quality of life (QoL) measures, the prevailing approach has been to use (1) a limited choice of global QoL measures as criterion variables (often a single one) to determine the performance of importance weighting, (2) a limited option of weighting methods to develop…
Diagnostic Group Differences in Parent and Teacher Ratings on the BRIEF and Conners' Scales
ERIC Educational Resources Information Center
Sullivan, Jeremy R.; Riccio, Cynthia A.
2007-01-01
Objective: Behavioral rating scales are common instruments used in evaluations of ADHD and executive function. It is important to explore how different diagnostic groups perform on these measures, as this information can be used to provide criterion-related validity evidence for the measures. Method: Data from 92 children and adolescents were used…
12 CFR 563e.22 - Lending test.
Code of Federal Regulations, 2012 CFR
2012-01-01
... 12 Banks and Banking 6 2012-01-01 2012-01-01 false Lending test. 563e.22 Section 563e.22 Banks and... Assessing Performance § 563e.22 Lending test. (a) Scope of test. (1) The lending test evaluates a savings... section. The OTS will not consider these loans under any criterion of the lending test except the...
12 CFR 563e.22 - Lending test.
Code of Federal Regulations, 2011 CFR
2011-01-01
... 12 Banks and Banking 5 2011-01-01 2011-01-01 false Lending test. 563e.22 Section 563e.22 Banks and... Assessing Performance § 563e.22 Lending test. (a) Scope of test. (1) The lending test evaluates a savings... section. The OTS will not consider these loans under any criterion of the lending test except the...
ERIC Educational Resources Information Center
Elias, Maurice J.; White, Gwyne; Stepney, Cesalie
2014-01-01
While educators and policy makers have an intuitive understanding of the influence of socioeconomic factors and race on student achievement, these factors make the current emphasis on standardized test scores as a primary criterion for evaluating schools and teachers indefensible and ineffective. The research presented illustrates the limits of…
ERIC Educational Resources Information Center
Levy, Deborah L.; Bowman, Elizabeth A.; Abel, Larry; Krastoshevsky, Olga; Krause, Verena; Mendell, Nancy R.
2008-01-01
The "co-familiality" criterion for an endophenotype has two requirements: (1) clinically unaffected relatives as a group should show both a shift in mean performance and an increase in variance compared with controls; (2) performance scores should be heritable. Performance on the antisaccade task is one of several candidate endophenotypes for…
Procedures for Constructing and Using Criterion-Referenced Performance Tests.
ERIC Educational Resources Information Center
Campbell, Clifton P.; Allender, Bill R.
1988-01-01
Criterion-referenced performance tests (CRPT) provide a realistic method for objectively measuring task proficiency against predetermined attainment standards. This article explains the procedures of constructing, validating, and scoring CRPTs and includes a checklist for a welding test. (JOW)
Neuropathological diagnostic criteria for Alzheimer's disease.
Murayama, Shigeo; Saito, Yuko
2004-09-01
Neuropathological diagnostic criteria for Alzheimer's disease (AD) are based on tau-related pathology: NFT or neuritic plaques (NP). The Consortium to Establish a Registry for Alzheimer's disease (CERAD) criterion evaluates the highest density of neocortical NP from 0 (none) to C (abundant). Clinical documentation of dementia and NP stage A in younger cases, B in young old cases and C in older cases fulfils the criterion of AD. The CERAD criterion is most frequently used in clinical outcome studies because of its inclusion of clinical information. Braak and Braak's criterion evaluates the density and distribution of NFT and classifies them into: I/II, entorhinal; III/IV, limbic; and V/VI, neocortical stage. These three stages correspond to normal cognition, cognitive impairment and dementia, respectively. As Braak's criterion is based on morphological evaluation of the brain alone, this criterion is usually adopted in the research setting. The National Institute for Aging and Ronald and Nancy Reagan Institute of the Alzheimer's Association criterion combines these two criteria and categorizes cases into NFT V/VI and NP C, NFT III/IV and NP B, and NFT I/II and NP A, corresponding to high, middle and low probability of AD, respectively. As most AD cases in the aged population are categorized into Braak tangle stage IV and CERAD stage C, the usefulness of this criterion has not yet been determined. The combination of Braak's NFT stage equal to or above IV and Braak's senile plaque Stage C provides, arguably, the highest sensitivity and specificity. In future, the criteria should include in vivo dynamic neuropathological data, including 3D MRI, PET scan and CSF biomarkers, as well as more sensitive and specific immunohistochemical and immunochemical grading of AD.
Baumann, Martin; Keinath, Andreas; Krems, Josef F; Bengler, Klaus
2004-05-01
Despite the usefulness of new on-board information systems one has to be concerned about the potential distraction effects that they impose on the driver. Therefore, methods and procedures are necessary to assess the visual demand that is connected to the usage of an on-board system. The occlusion-method is considered a strong candidate as a procedure for evaluating display designs with regard to their visual demand. This paper reports results from two experimental studies conducted to further evaluate this method. In the first study, performance in using an in-car navigation system was measured under three conditions: static (parking lot), occlusion (shutter glasses), and driving. The results show that the occlusion-procedure can be used to simulate visual requirements of real traffic conditions. In a second study the occlusion method was compared to a global evaluation criterion based on the total task time. It can be demonstrated that the occlusion method can identify tasks which meet this criterion, but are yet irresolvable under driving conditions. It is concluded that the occlusion technique seems to be a reliable and valid method for evaluating visual and dialogue aspects of in-car information systems.
Criterion-Related Validity: Assessing the Value of Subscores
ERIC Educational Resources Information Center
Davison, Mark L.; Davenport, Ernest C., Jr.; Chang, Yu-Feng; Vue, Kory; Su, Shiyang
2015-01-01
Criterion-related profile analysis (CPA) can be used to assess whether subscores of a test or test battery account for more criterion variance than does a single total score. Application of CPA to subscore evaluation is described, compared to alternative procedures, and illustrated using SAT data. Considerations other than validity and reliability…
ERIC Educational Resources Information Center
Ding, Cody S.; Davison, Mark L.
2010-01-01
Akaike's information criterion is suggested as a tool for evaluating fit and dimensionality in metric multidimensional scaling that uses least squares methods of estimation. This criterion combines the least squares loss function with the number of estimated parameters. Numerical examples are presented. The results from analyses of both simulation…
Problems in Criterion-Referenced Measurement. CSE Monograph Series in Evaluation, 3.
ERIC Educational Resources Information Center
Harris, Chester W., Ed.; And Others
Six essays on technical measurement problems in criterion referenced tests and four essays by psychometricians proposing solutions are presented: (1) "Criterion-Referenced Measurement" and Other Such Terms, by Marvin C. Alkin which is an overview of the first six papers; (2) Selecting Objectives and Generating Test Items for Objectives-Based…
Link, William; Sauer, John R.
2016-01-01
The analysis of ecological data has changed in two important ways over the last 15 years. The development and easy availability of Bayesian computational methods has allowed and encouraged the fitting of complex hierarchical models. At the same time, there has been increasing emphasis on acknowledging and accounting for model uncertainty. Unfortunately, the ability to fit complex models has outstripped the development of tools for model selection and model evaluation: familiar model selection tools such as Akaike's information criterion and the deviance information criterion are widely known to be inadequate for hierarchical models. In addition, little attention has been paid to the evaluation of model adequacy in context of hierarchical modeling, i.e., to the evaluation of fit for a single model. In this paper, we describe Bayesian cross-validation, which provides tools for model selection and evaluation. We describe the Bayesian predictive information criterion and a Bayesian approximation to the BPIC known as the Watanabe-Akaike information criterion. We illustrate the use of these tools for model selection, and the use of Bayesian cross-validation as a tool for model evaluation, using three large data sets from the North American Breeding Bird Survey.
A proposed criterion for aircraft flight in turbulence
NASA Technical Reports Server (NTRS)
Porter, R. F.; Robinson, A. C.
1971-01-01
A proposed criterion for aircraft flight in turbulent conditions is presented. Subjects discussed are: (1) the problem of flight safety in turbulence, (2) new criterion for turbulence flight where existing ones seem adequate, and (3) computational problems associated with new criterion. Primary emphasis is placed on catastrophic occurrences in subsonic cruise with the aircraft under automatic control. A Monte Carlo simulation is used in the formulation and evaluation of probabilities of survival of an encounter with turbulence.
Gallagher, A G; Lederman, A B; McGlade, K; Satava, R M; Smith, C D
2004-04-01
Increasing constraints on the time and resources needed to train surgeons have led to a new emphasis on finding innovative ways to teach surgical skills outside the operating room. Virtual reality training has been proposed as a method to both instruct surgical students and evaluate the psychomotor components of minimally invasive surgery ex vivo. The performance of 100 laparoscopic novices was compared to that of 12 experienced (>50 minimally invasive procedures) and 12 inexperienced (<10 minimally invasive procedures) laparoscopic surgeons. The values of the experienced surgeons' performance were used as benchmark comparators (or criterion measures). Each subject completed six tasks on the Minimally Invasive Surgical Trainer-Virtual Reality (MIST-VR) three times. The outcome measures were time to complete the task, number of errors, economy of instrument movement, and economy of diathermy. After three trials, the mean performance of the medical students approached that of the experienced surgeons. However, 7-27% of the scores of the students fell more than two SD below the mean scores of the experienced surgeons (the criterion level). The MIST-VR system is capable of evaluating the psychomotor skills necessary in laparoscopic surgery and discriminating between experts and novices. Furthermore, although some novices improved their skills quickly, a subset had difficulty acquiring the psychomotor skills. The MIST-VR may be useful in identifying that subset of novices.
Sanchez-Armass, Omar; Raffaelli, Marcela; Andrade, Flavia Cristina Drumond; Wiley, Angela R; Noyola, Aida Nacielli Morales; Arguelles, Alejandra Cepeda; Aradillas-Garcia, Celia
2017-03-01
To evaluate the criterion validity and diagnostic utility of the SCOFF, a brief eating disorder (ED) screening instrument, in a Mexican sample. The study was conducted in two phases in 2012. Phase I involved the administration of self-report measures [the SCOFF and the Eating Disorder Inventory-2, (EDI-2)] to 1057 students aged 17-56 years (M age = 21.0, SD = 3.4; 67 % female) from three colleges at the Universidad Autónoma de San Luis Potosí, Mexico. In Phase II, a random subsample of these students (n = 104) participated in the eating disorder examination, a structured interview that yields ED diagnoses. Analyses were conducted to evaluate the SCOFF's criterion validity by examining (a) correlations between scores on the SCOFF and the EDI-2 and (b) the SCOFF's ability to differentiate diagnosed ED cases and non-cases. EDI-2 subscales showed high correlations with the SCOFF scores proving initial evidence of criterion validity. A score of two points on the SCOFF optimized the sensitivity (78 %) and specificity (84 %). With this cutoff, the SCOFF correctly classified over half the cases (PPV = 58 %) and screened out the majority of non-cases (NPV = 93 %) providing further evidence of criterion validity. Analyses were repeated separately for men and women, yielding gender-specific information on the SCOFF's performance. Taken as a whole, results indicated that the SCOFF can be a useful tool for identifying Mexican university students who are at risk of eating disorders.
Parrozzani, Raffaele; Clementi, Maurizio; Frizziero, Luisa; Miglionico, Giacomo; Perrini, Pierdavide; Cavarzeran, Fabiano; Kotsafti, Olympia; Comacchio, Francesco; Trevisson, Eva; Convento, Enrica; Fusetti, Stefano; Midena, Edoardo
2015-09-01
To evaluate the feasibility of near-infrared (NIR) imaging acquisition in a large sample of consecutive pediatric patients with neurofibromatosis type 1 (NF1), to evaluate the diagnostic performance of NF1-related choroidal abnormalities as a diagnostic criterion of the disease, and to compare this criterion with other standard National Institutes of Health (NIH) diagnostic criteria. A total of 140 consecutive pediatric patients (0-16 years old) affected by NF1 (at least two diagnostic criteria), 59 suspected (a single diagnostic criterion), and 42 healthy subjects (no diagnostic criterion) were consecutively included. Each patient underwent genetic, dermatologic, and ophthalmologic examination to evaluate the presence/absence of each NIH diagnostic criterion. The presence of NF1-related choroidal abnormalities was investigated using NIR confocal ophthalmoscopy. Two masked operators assessed Lisch nodules and NF1-related choroidal abnormalities. Neurofibromatosis type 1-related choroidal abnormalities were detected in 72 affected (60.5%) and 1 suspected (2.4%) child. No healthy subject had choroidal abnormalities. Feasibility rate of this sign was 82%. Sensitivity, specificity, and positive and negative predictive values of NF1-related choroidal abnormalities were 0.60, 0.97, 0.98, and 0.46, respectively. Compared with standard NIH criteria, the presence of NF1-related choroidal abnormalities was the third parameter for positive predictive value and the fourth for sensitivity, specificity, and negative predictive value. Compared with Lisch nodules, NF1-related choroidal abnormalities were characterized by higher specificity and positive predictive value. The interoperator agreement for Lisch nodules and NF1-related choroidal abnormalities was 0.67 (substantial) and 0.97 (almost perfect), respectively. The use of this sign moved one patient from the suspected to the affected group (0.5%). Neurofibromatosis type 1-related choroidal abnormalities represent a new diagnostic sign in NF1 children. The main advantage of this sign seems the theoretical possibility to anticipate NF1 diagnosis, whereas the main obstacle is the cooperation required by very young patients.
Criteria to Evaluate Interpretive Guides for Criterion-Referenced Tests
ERIC Educational Resources Information Center
Trapp, William J.
2007-01-01
This project provides a list of criteria for which the contents of interpretive guides written for customized, criterion-referenced tests can be evaluated. The criteria are based on the "Standards for Educational and Psychological Testing" (1999) and examine the content breadth of interpretive guides. Interpretive guides written for…
NASA Technical Reports Server (NTRS)
Yorchak, J. P.; Hartley, C. S.; Hinman, E.
1985-01-01
The use of aptitude tests and questionnaries to evaluate an individuals aptitude for teleoperation is studied. The Raven Progressive Matrices Test and Differential Aptitude Tests, and a 16-item questionnaire for assessing the subject's interests, academic background, and previous experience are described. The Proto-Flight Manipulator Arm, cameras, console, hand controller, and task board utilized by the 17 engineers are examined. The correlation between aptitude scores and questionnaire responses, and operator performance is investigated. Multiple regression data reveal that the eight predictor variables are not individually significant for evaluating operator performance; however, the complete test battery is applicable for predicting 49 percent of subject variance on the criterion task.
NASA Astrophysics Data System (ADS)
Iyyappan, I.; Ponmurugan, M.
2017-09-01
We study the performance of a three-terminal thermoelectric device such as heat engine and refrigerator with broken time-reversal symmetry by applying the unified trade-off figure of merit (\\dotΩ criterion) which accounts for both useful energy and losses. For the heat engine, we find that a thermoelectric device working under the maximum \\dotΩ criterion gives a significantly better performance than a device working at maximum power output. Within the framework of linear irreversible thermodynamics such a direct comparison is not possible for refrigerators, however, our study indicates that, for refrigerator, the maximum cooling load gives a better performance than the maximum \\dotΩ criterion for a larger asymmetry. Our results can be useful to choose a suitable optimization criterion for operating a real thermoelectric device with broken time-reversal symmetry.
Uehara, Kosuke; Ogura, Koichi; Akiyama, Toru; Shinoda, Yusuke; Iwata, Shintaro; Kobayashi, Eisuke; Tanzawa, Yoshikazu; Yonemoto, Tsukasa; Kawano, Hirotaka; Kawai, Akira
2017-09-01
The Musculoskeletal Tumor Society (MSTS) scoring system developed in 1993 is a widely used disease-specific evaluation tool for assessment of physical function in patients with musculoskeletal tumors; however, only a few studies have confirmed its reliability and validity. The aim of this study was to validate the MSTS scoring system for the upper extremity (MSTS-UE) in Japanese patients with musculoskeletal tumors for use by others in research. Does the MSTS-UE have: (1) sufficient reliability and internal consistency; (2) adequate construct validity; and (3) reasonable criterion validity in comparison to the Toronto Extremity Salvage Score (TESS) or SF-36? Reliability was performed using test-retest analysis, and internal consistency was evaluated with Cronbach's alpha coefficient. Construct validity was evaluated using a scree plot to confirm the construct number and the Akaike information criterion network. Criterion validity was evaluated by comparing the MSTS-UE with the TESS and SF-36. The test-retest reliability with intraclass correlation coefficient (0.95; 95% CI, 0.91-0.97) was excellent, and internal consistency with Cronbach's α (0.7; 95% CI, 0.53-0.81) was acceptable. There were no ceiling and floor effects. The Akaike Information Criterion network showed that lifting ability, pain, and dexterity played central roles among the components. The MSTS-UE showed substantial correlation with the TESS scoring scale (r = 0.75; p < 0.001) and fair correlation with the SF-36 physical component summary (r = 0.37; p = 0.007). Although the MSTS-UE showed slight correlation with the SF-36 mental component summary, the emotional acceptance component of the MSTS-UE showed fair correlation (r = 0.29; p = 0.039). We can conclude that the MSTS is not an adequate measure of general health-related quality of life; however, this system was designed mainly to be a simple measure of function in a single extremity. To evaluate the mental state of patients with musculoskeletal tumors in the upper extremity, further study is needed.
ERIC Educational Resources Information Center
Brink, Carole Sanger
2011-01-01
In 2007, Georgia developed a comprehensive framework to define what students need to know. One component of this framework emphasizes the use of both formative and summative assessments as part of an integral and specific component of the teachers. performance evaluation. Georgia administers the Criterion-Referenced Competency Test (CRCT) to every…
ERIC Educational Resources Information Center
Williams, Ed
2009-01-01
This study examines fourth grade student achievement in relation to teacher perceptions of principal leadership and other selected variables in a large urban school district in Georgia. Student achievement was measured by performance on the Georgia Criterion-Referenced Competency Tests (CRCT) during the 2004-05 and 2005-06 school years. The…
NASA Technical Reports Server (NTRS)
Hopkins, W. D.; Washburn, D. A.; Hyatt, C. W.; Rumbaugh, D. M. (Principal Investigator)
1996-01-01
This study describes video-task acquisition in two nonhuman primate species. The subjects were seven rhesus monkeys (Macaca mulatta) and seven chimpanzees (Pan troglodytes). All subjects were trained to manipulate a joystick which controlled a cursor displayed on a computer monitor. Two criterion levels were used: one based on conceptual knowledge of the task and one based on motor performance. Chimpanzees and rhesus monkeys attained criterion in a comparable number of trials using a conceptually based criterion. However, using a criterion based on motor performance, chimpanzees reached criterion significantly faster than rhesus monkeys. Analysis of error patterns and latency indicated that the rhesus monkeys had a larger asymmetry in response bias and were significantly slower in responding than the chimpanzees. The results are discussed in terms of the relation between object manipulation skills and video-task acquisition.
Algorithmic Classification of Five Characteristic Types of Paraphasias.
Fergadiotis, Gerasimos; Gorman, Kyle; Bedrick, Steven
2016-12-01
This study was intended to evaluate a series of algorithms developed to perform automatic classification of paraphasic errors (formal, semantic, mixed, neologistic, and unrelated errors). We analyzed 7,111 paraphasias from the Moss Aphasia Psycholinguistics Project Database (Mirman et al., 2010) and evaluated the classification accuracy of 3 automated tools. First, we used frequency norms from the SUBTLEXus database (Brysbaert & New, 2009) to differentiate nonword errors and real-word productions. Then we implemented a phonological-similarity algorithm to identify phonologically related real-word errors. Last, we assessed the performance of a semantic-similarity criterion that was based on word2vec (Mikolov, Yih, & Zweig, 2013). Overall, the algorithmic classification replicated human scoring for the major categories of paraphasias studied with high accuracy. The tool that was based on the SUBTLEXus frequency norms was more than 97% accurate in making lexicality judgments. The phonological-similarity criterion was approximately 91% accurate, and the overall classification accuracy of the semantic classifier ranged from 86% to 90%. Overall, the results highlight the potential of tools from the field of natural language processing for the development of highly reliable, cost-effective diagnostic tools suitable for collecting high-quality measurement data for research and clinical purposes.
Tendency for interlaboratory precision in the GMO analysis method based on real-time PCR.
Kodama, Takashi; Kurosawa, Yasunori; Kitta, Kazumi; Naito, Shigehiro
2010-01-01
The Horwitz curve estimates interlaboratory precision as a function only of concentration, and is frequently used as a method performance criterion in food analysis with chemical methods. The quantitative biochemical methods based on real-time PCR require an analogous criterion to progressively promote method validation. We analyzed the tendency of precision using a simplex real-time PCR technique in 53 collaborative studies of seven genetically modified (GM) crops. Reproducibility standard deviation (SR) and repeatability standard deviation (Sr) of the genetically modified organism (GMO) amount (%) was more or less independent of GM crops (i.e., maize, soybean, cotton, oilseed rape, potato, sugar beet, and rice) and evaluation procedure steps. Some studies evaluated whole steps consisting of DNA extraction and PCR quantitation, whereas others focused only on the PCR quantitation step by using DNA extraction solutions. Therefore, SR and Sr for GMO amount (%) are functions only of concentration similar to the Horwitz curve. We proposed S(R) = 0.1971C 0.8685 and S(r) = 0.1478C 0.8424, where C is the GMO amount (%). We also proposed a method performance index in GMO quantitative methods that is analogous to the Horwitz Ratio.
Performance of electrolyte measurements assessed by a trueness verification program.
Ge, Menglei; Zhao, Haijian; Yan, Ying; Zhang, Tianjiao; Zeng, Jie; Zhou, Weiyan; Wang, Yufei; Meng, Qinghui; Zhang, Chuanbao
2016-08-01
In this study, we analyzed frozen sera with known commutabilities for standardization of serum electrolyte measurements in China. Fresh frozen sera were sent to 187 clinical laboratories in China for measurement of four electrolytes (sodium, potassium, calcium, and magnesium). Target values were assigned by two reference laboratories. Precision (CV), trueness (bias), and accuracy [total error (TEa)] were used to evaluate measurement performance, and the tolerance limit derived from the biological variation was used as the evaluation criterion. About half of the laboratories used a homogeneous system (same manufacturer for instrument, reagent and calibrator) for calcium and magnesium measurement, and more than 80% of laboratories used a homogeneous system for sodium and potassium measurement. More laboratories met the tolerance limit of imprecision (coefficient of variation [CVa]) than the tolerance limits of trueness (biasa) and TEa. For sodium, calcium, and magnesium, the minimal performance criterion derived from biological variation was used, and the pass rates for total error were approximately equal to the bias (<50%). For potassium, the pass rates for CV and TE were more than 90%. Compared with the non homogeneous system, the homogeneous system was superior for all three quality specifications. The use of commutable proficiency testing/external quality assessment (PT/EQA) samples with values assigned by reference methods can monitor performance and provide reliable data for improving the performance of laboratory electrolyte measurement. The homogeneous systems were superior to the non homogeneous systems, whereas accuracy of assigned values of calibrators and assay stability remained challenges.
Lau, Lily; Basso, Michael R; Estevis, Eduardo; Miller, Ashley; Whiteside, Douglas M; Combs, Dennis; Arentsen, Timothy J
2017-11-01
Performance validity tests (PVTs) and symptom validity tests (SVTs) are often administered during neuropsychological evaluations. Examinees may be coached to avoid detection by measures of response validity. Relatively little research has evaluated whether graduated levels of coaching has differential effects upon PVT and SVT performance. Accordingly, the present experiment evaluated the effect of graduated levels of coaching upon the classification accuracy of commonly used PVTs and SVTs and the currently accepted criterion of failing two or more PVTs or SVTs. Participants simulated symptoms associated with mild traumatic brain injury (TBI). One group was provided superficial information concerning cognitive, emotional, and physical symptoms. Another group was provided detailed information about such symptoms. A third group was provided detailed information about symptoms and guidance how to evade detection by PVTs. These groups were compared to an honest-responding group. Extending prior experiments, stand-alone and embedded PVT measures were administered in addition to SVTs. The three simulator groups were readily identified by PVTs and SVTs, but a meaningful minority of those provided test-taking strategies eluded detection. The Word Memory Test emerged as the most sensitive indicator of simulated mild TBI symptoms. PVTs achieved more sensitive detection of simulated head injury status than SVTs. Individuals coached to modify test-taking performance were marginally more successful in eluding detection by PVTs and SVTs than those coached with respect to TBI symptoms only. When the criterion of failing two or more PVTs or SVTs was applied, only 5% eluded detection.
Evaluation of Weighted Scale Reliability and Criterion Validity: A Latent Variable Modeling Approach
ERIC Educational Resources Information Center
Raykov, Tenko
2007-01-01
A method is outlined for evaluating the reliability and criterion validity of weighted scales based on sets of unidimensional measures. The approach is developed within the framework of latent variable modeling methodology and is useful for point and interval estimation of these measurement quality coefficients in counseling and education…
Toro, Brigitte; Nester, Christopher J; Farren, Pauline C
2007-03-01
To develop the construct, content, and criterion validity of the Salford Gait Tool (SF-GT) and to evaluate agreement between gait observations using the SF-GT and kinematic gait data. Tool development and comparative evaluation. University in the United Kingdom. For designing construct and content validity, convenience samples of 10 children with hemiplegic, diplegic, and quadriplegic cerebral palsy (CP) and 152 physical therapy students and 4 physical therapists were recruited. For developing criterion validity, kinematic gait data of 13 gait clusters containing 56 children with hemiplegic, diplegic, and quadriplegic CP and 11 neurologically intact children was used. For clinical evaluation, a convenience sample of 23 pediatric physical therapists participated. We developed a sagittal plane observational gait assessment tool through a series of design, test, and redesign iterations. The tool's grading system was calibrated using kinematic gait data of 13 gait clusters and was evaluated by comparing the agreement of gait observations using the SF-GT with kinematic gait data. Criterion standard kinematic gait data. There was 58% mean agreement based on grading categories and 80% mean agreement based on degree estimations evaluated with the least significant difference method. The new SF-GT has good concurrent criterion validity.
Kaneko, Hiromasa; Funatsu, Kimito
2013-09-23
We propose predictive performance criteria for nonlinear regression models without cross-validation. The proposed criteria are the determination coefficient and the root-mean-square error for the midpoints between k-nearest-neighbor data points. These criteria can be used to evaluate predictive ability after the regression models are updated, whereas cross-validation cannot be performed in such a situation. The proposed method is effective and helpful in handling big data when cross-validation cannot be applied. By analyzing data from numerical simulations and quantitative structural relationships, we confirm that the proposed criteria enable the predictive ability of the nonlinear regression models to be appropriately quantified.
Scaling study of the combustion performance of gas—gas rocket injectors
NASA Astrophysics Data System (ADS)
Wang, Xiao-Wei; Cai, Guo-Biao; Jin, Ping
2011-10-01
To obtain the key subelements that may influence the scaling of gas—gas injector combustor performance, the combustion performance subelements in a liquid propellant rocket engine combustor are initially analysed based on the results of a previous study on the scaling of a gas—gas combustion flowfield. Analysis indicates that inner wall friction loss and heat-flux loss are two key issues in gaining the scaling criterion of the combustion performance. The similarity conditions of the inner wall friction loss and heat-flux loss in a gas—gas combustion chamber are obtained by theoretical analyses. Then the theoretical scaling criterion was obtained for the combustion performance, but it proved to be impractical. The criterion conditions, the wall friction and the heat flux are further analysed in detail to obtain the specific engineering scaling criterion of the combustion performance. The results indicate that when the inner flowfields in the combustors are similar, the combustor wall shear stress will have similar distributions qualitatively and will be directly proportional to pc0.8dt-0.2 quantitatively. In addition, the combustion peformance will remain unchanged. Furthermore, multi-element injector chambers with different geometric sizes and at different pressures are numerically simulated and the wall shear stress and combustion efficiencies are solved and compared with each other. A multielement injector chamber is designed and hot-fire tested at several chamber pressures and the combustion performances are measured in a total of nine hot-fire tests. The numerical and experimental results verified the similarities among combustor wall shear stress and combustion performances at different chamber pressures and geometries, with the criterion applied.
Ranking Schools' Academic Performance Using a Fuzzy VIKOR
NASA Astrophysics Data System (ADS)
Musani, Suhaina; Aziz Jemain, Abdul
2015-06-01
Determination rank is structuring alternatives in order of priority. It is based on the criteria determined for each alternative involved. Evaluation criteria are performed and then a composite index composed of each alternative for the purpose of arranging in order of preference alternatives. This practice is known as multiple criteria decision making (MCDM). There are several common approaches to MCDM, one of the practice is known as VIKOR (Multi-criteria Optimization and Compromise Solution). The objective of this study is to develop a rational method for school ranking based on linguistic information of a criterion. The school represents an alternative, while the results for a number of subjects as the criterion. The results of the examination for a course, is given according to the student percentage of each grade. Five grades of excellence, honours, average, pass and fail is used to indicate a level of achievement in linguistics. Linguistic variables are transformed to fuzzy numbers to form a composite index of school performance. Results showed that fuzzy set theory can solve the limitations of using MCDM when there is uncertainty problems exist in the data.
ERIC Educational Resources Information Center
Meredith, Keith E.; Sabers, Darrell L.
Data required for evaluating a Criterion Referenced Measurement (CRM) is described with a matrix. The information within the matrix consists of the "pass-fail" decisions of two CRMs. By differentially defining these two CRMs, different concepts of reliability and validity can be examined. Indices suggested for analyzing the matrix are listed with…
Linking Health Concepts in the Assessment and Evaluation of Water Distribution Systems
ERIC Educational Resources Information Center
Karney, Bryan W.; Filion, Yves R.
2005-01-01
The concept of health is not only a specific criterion for evaluation of water quality delivered by a distribution system but also a suitable paradigm for overall functioning of the hydraulic and structural components of the system. This article views health, despite its complexities, as the only criterion with suitable depth and breadth to allow…
An investigation of the effects of pitch-roll (de)coupling on helicopter handling qualities
NASA Technical Reports Server (NTRS)
Blanken, C. L.; Pausder, H. J.; Ockier, C. J.
1995-01-01
An extensive investigation of the effects of pitch-roll coupling on helicopter handling qualities was performed by the U.S. Army and Deutsche Forschungsanstalt fur Luft- und Raumfahrt (DLR), using a NASA ground-based and a DLR in-flight simulator. Over 90 different coupling configurations were evaluated using a high gain roll-axis tracking task. The results show that although the current ADS-33C coupling criterion discriminates against those types of coupling typical of conventionally controlled helicopters, it is not always suited for the prediction of handling qualities of helicopters with modern control systems. Based on the observation that high frequency inputs during tracking are used to alleviate coupling, a frequency domain pitch-roll coupling criterion that uses the average coupling ratio between the bandwidth and neutral stability frequency is formulated. This criterion provides a more comprehensive coverage with respect to the different types of coupling, shows excellent consistency, and has the additional benefit that compliance testing data are obtained from the bandwidth/phase delay tests, so that no additional flight testing is needed.
Development of The Science Processes Test.
ERIC Educational Resources Information Center
Ludeman, Robert R.
Presented is a description and copy of a test manual developed to include items in the test on the basis of children's performance; each item correlated highly with performance on an external criterion. The external criterion was the Individual Competency Measures of the elementary science program Science - A Process Approach (SAPA). The test…
Sarkar, Sudipto; Kamilya, Dibyendu; Mal, B C
2007-03-01
Inclined plate settlers are used in treating wastewater due to their low space requirement and high removal rates. The prediction of sedimentation efficiency of these settlers is essential for their performance evaluation. In the present study, the technique of dimensional analysis was applied to predict the sedimentation efficiency of these inclined plate settlers. The effect of various geometric parameters namely, distance between plates (w(p)), plate angle (alpha), length of plate (l(p)), plate roughness (epsilon(p)), number of plates (n(p)) and particle diameter (d(s)) on the dynamic conditions, influencing the sedimentation process was studied. From the study it was established that neither the Reynolds criterion nor the Froude criterion was singularly valid to simulate the sedimentation efficiency (E) for different values of w(p) and flow velocity (v(f)). Considering the prevalent scale effect, simulation equations were developed to predict E at different dynamic conditions. The optimum dynamic condition producing the maximum E is also discussed.
Is the Federal Government Jumping on the Criterion-Referenced Testing Bandwagon?
ERIC Educational Resources Information Center
Buck, Lawrence S.
The increasing use of criterion referenced testing (CRT) among the various branches of the federal government is described. The requirements of the merit system have tended to promote the use of norm referenced tests except for uses such as pass/fail performance tests. The two areas in which criterion-referenced tests have been most useful are…
Organizational Productivity Measurement: The Development and Evaluation of an Integrated Approach.
1987-07-01
measurement and aggregation strategy also has applications in management r information systems, performance appraisal , and other situations where multiple...larger organizational units. The basic measurement and aggregation strategy also has applications in manage- "".". ment information systems, criterion...much has been written on the subject of organizational productiv- ity, there is little consensus concerning its definition ( Tuttle , 1983). Such a lack
Baten, Verena; Busch, Hans-Jörg; Busche, Caroline; Schmid, Bonaventura; Heupel-Reuter, Miriam; Perlov, Evgeniy; Brich, Jochen; Klöppel, Stefan
2018-05-08
Delirium is frequent in elderly patients presenting in the emergency department (ED). Despite the severe prognosis, the majority of delirium cases remain undetected by emergency physicians (EPs). At the time of our study there was no valid delirium screening tool available for EDs in German-speaking regions. We aimed to evaluate the brief Confusion Assessment Method (bCAM) for a German ED during the daily work routine. We implemented the bCAM into practice in a German interdisciplinary high-volume ED and evaluated the bCAM's validity in a convenience sample of medical patients aged ≥ 70 years. The bCAM, which assesses four core features of delirium, was performed by EPs during their daily work routine and compared to a criterion standard based on the criteria for delirium as described in the Diagnostic and Statistical Manual of Mental Disorders, Fifth Edition. Compared to the criterion standard, delirium was found to be present in 46 (16.0%) of the 288 nonsurgical patients enrolled. The bCAM showed 93.8% specificity (95% confidence interval [CI] = 90.0%-96.5%) and 65.2% sensitivity (95% CI = 49.8%-78.7%). Positive and negative likelihood ratios were 10.5 and 0.37, respectively, while the odds ratio was 28.4. Delirium was missed in 10 of 16 cases, since the bCAM did not indicate altered levels of consciousness and disorganized thinking. The level of agreement with the criterion standard increased for patients with low cognitive performance. This was the first study evaluating the bCAM for a German ED and when performed by EPs during routine work. The bCAM showed good specificity, but only moderate sensitivity. Nevertheless, application of the bCAM most likely improves the delirium detection rate in German EDs. However, it should only be applied by trained physicians to maximize diagnostic accuracy and hence improve the bCAM's sensitivity. Future studies should refine the bCAM. © 2018 by the Society for Academic Emergency Medicine.
Spathis, Jemima Grace; Connick, Mark James; Beckman, Emma Maree; Newcombe, Peter Anthony; Tweedy, Sean Michael
2015-01-01
Paralympic throwing events for athletes with physical impairments comprise seated and standing javelin, shot put, discus and seated club throwing. Identification of talented throwers would enable prediction of future success and promote participation; however, a valid and reliable talent identification battery for Paralympic throwing has not been reported. This study evaluates the reliability and validity of a talent identification battery for Paralympic throws. Participants were non-disabled so that impairment would not confound analyses, and results would provide an indication of normative performance. Twenty-eight non-disabled participants (13 M; 15 F) aged 23.6 years (±5.44) performed five kinematically distinct criterion throws (three seated, two standing) and nine talent identification tests (three anthropometric, six motor); 23 were tested a second time to evaluate test-retest reliability. Talent identification test-retest reliability was evaluated using Intra-class Correlation Coefficient (ICC) and Bland-Altman plots (Limits of Agreement). Spearman's correlation assessed strength of association between criterion throws and talent identification tests. Reliability was generally acceptable (mean ICC = 0.89), but two seated talent identification tests require more extensive familiarisation. Correlation strength (mean rs = 0.76) indicated that the talent identification tests can be used to validly identify individuals with competitively advantageous attributes for each of the five kinematically distinct throwing activities. Results facilitate further research in this understudied area.
ERIC Educational Resources Information Center
Lin, Keh-chung; Chen, Hui-fang; Chen, Chia-ling; Wang, Tien-ni; Wu, Ching-yi; Hsieh, Yu-wei; Wu, Li-ling
2012-01-01
This study examined criterion-related validity and clinimetric properties of the Pediatric Motor Activity Log (PMAL) in children with cerebral palsy. Study participants were 41 children (age range: 28-113 months) and their parents. Criterion-related validity was evaluated by the associations between the PMAL and criterion measures at baseline and…
The Effects of Performance Fatigability on Postural Control and Rehabilitation in the Older Patient
Hassan, Mahdi; Bugnariu, Nicoleta
2016-01-01
Fatigue is common in older adults and has a significant effect on quality of life. Despite the high prevalence of fatigue in older individuals, several aspects are poorly understood. It is important to differentiate subjective fatigue complaints from fatigability of motor performance because the two are independent constructs with potentially distinct consequences on mobility. Performance fatigability is the magnitude of change in a performance criterion over a given time of task performance. Performance fatigability is a compulsory element of any strength training program, yet strength training is an important component of rehabilitation programs for older adults. The consequences of fatigability for older adults suggest that acute exercise of various types may result in acute impairments in postural control. The effects of performance fatigability on postural control in older adults are evaluated here to aid the rehabilitation clinician in making recommendations for evaluation of fall risks and exercise prescription. PMID:28154794
The Effects of Performance Fatigability on Postural Control and Rehabilitation in the Older Patient.
Papa, Evan V; Hassan, Mahdi; Bugnariu, Nicoleta
2016-09-01
Fatigue is common in older adults and has a significant effect on quality of life. Despite the high prevalence of fatigue in older individuals, several aspects are poorly understood. It is important to differentiate subjective fatigue complaints from fatigability of motor performance because the two are independent constructs with potentially distinct consequences on mobility. Performance fatigability is the magnitude of change in a performance criterion over a given time of task performance. Performance fatigability is a compulsory element of any strength training program, yet strength training is an important component of rehabilitation programs for older adults. The consequences of fatigability for older adults suggest that acute exercise of various types may result in acute impairments in postural control. The effects of performance fatigability on postural control in older adults are evaluated here to aid the rehabilitation clinician in making recommendations for evaluation of fall risks and exercise prescription.
Blanck, Oliver; Masi, Laura; Chan, Mark K H; Adamczyk, Sebastian; Albrecht, Christian; Damme, Marie-Christin; Loutfi-Krauss, Britta; Alraun, Manfred; Fehr, Roman; Ramm, Ulla; Siebert, Frank-Andre; Stelljes, Tenzin Sonam; Poppinga, Daniela; Poppe, Björn
2016-06-01
High precision radiosurgery demands comprehensive delivery-quality-assurance techniques. The use of a liquid-filled ion-chamber-array for robotic-radiosurgery delivery-quality-assurance was investigated and validated using several test scenarios and routine patient plans. Preliminary evaluation consisted of beam profile validation and analysis of source-detector-distance and beam-incidence-angle response dependence. The delivery-quality-assurance analysis is performed in four steps: (1) Array-to-plan registration, (2) Evaluation with standard Gamma-Index criteria (local-dose-difference⩽2%, distance-to-agreement⩽2mm, pass-rate⩾90%), (3) Dose profile alignment and dose distribution shift until maximum pass-rate is found, and (4) Final evaluation with 1mm distance-to-agreement criterion. Test scenarios consisted of intended phantom misalignments, dose miscalibrations, and undelivered Monitor Units. Preliminary method validation was performed on 55 clinical plans in five institutions. The 1000SRS profile measurements showed sufficient agreement compared with a microDiamond detector for all collimator sizes. The relative response changes can be up to 2.2% per 10cm source-detector-distance change, but remains within 1% for the clinically relevant source-detector-distance range. Planned and measured dose under different beam-incidence-angles showed deviations below 1% for angles between 0° and 80°. Small-intended errors were detected by 1mm distance-to-agreement criterion while 2mm criteria failed to reveal some of these deviations. All analyzed delivery-quality-assurance clinical patient plans were within our tight tolerance criteria. We demonstrated that a high-resolution liquid-filled ion-chamber-array can be suitable for robotic radiosurgery delivery-quality-assurance and that small errors can be detected with tight distance-to-agreement criterion. Further improvement may come from beam specific correction for incidence angle and source-detector-distance response. Copyright © 2016 Associazione Italiana di Fisica Medica. Published by Elsevier Ltd. All rights reserved.
Tsugawa, Yusuke; Ohbu, Sadayoshi; Cruess, Richard; Cruess, Sylvia; Okubo, Tomoya; Takahashi, Osamu; Tokuda, Yasuharu; Heist, Brian S; Bito, Seiji; Itoh, Toshiyuki; Aoki, Akiko; Chiba, Tsutomu; Fukui, Tsuguya
2011-08-01
Despite the growing importance of and interest in medical professionalism, there is no standardized tool for its measurement. The authors sought to verify the validity, reliability, and generalizability of the Professionalism Mini-Evaluation Exercise (P-MEX), a previously developed and tested tool, in the context of Japanese hospitals. A multicenter, cross-sectional evaluation study was performed to investigate the validity, reliability, and generalizability of the P-MEX in seven Japanese hospitals. In 2009-2010, 378 evaluators (attending physicians, nurses, peers, and junior residents) completed 360-degree assessments of 165 residents and fellows using the P-MEX. The content validity and criterion-related validity were examined, and the construct validity of the P-MEX was investigated by performing confirmatory factor analysis through a structural equation model. The reliability was tested using generalizability analysis. The contents of the P-MEX achieved good acceptance in a preliminary working group, and the poststudy survey revealed that 302 (79.9%) evaluators rated the P-MEX items as appropriate, indicating good content validity. The correlation coefficient between P-MEX scores and external criteria was 0.78 (P < .001), demonstrating good criterion-related validity. Confirmatory factor analysis verified high path coefficient (0.60-0.99) and adequate goodness of fit of the model. The generalizability analysis yielded a high dependability coefficient, suggesting good reliability, except when evaluators were peers or junior residents. Findings show evidence of adequate validity, reliability, and generalizability of the P-MEX in Japanese hospital settings. The P-MEX is the only evaluation tool for medical professionalism verified in both a Western and East Asian cultural context.
On the predictive information criteria for model determination in seismic hazard analysis
NASA Astrophysics Data System (ADS)
Varini, Elisa; Rotondi, Renata
2016-04-01
Many statistical tools have been developed for evaluating, understanding, and comparing models, from both frequentist and Bayesian perspectives. In particular, the problem of model selection can be addressed according to whether the primary goal is explanation or, alternatively, prediction. In the former case, the criteria for model selection are defined over the parameter space whose physical interpretation can be difficult; in the latter case, they are defined over the space of the observations, which has a more direct physical meaning. In the frequentist approaches, model selection is generally based on an asymptotic approximation which may be poor for small data sets (e.g. the F-test, the Kolmogorov-Smirnov test, etc.); moreover, these methods often apply under specific assumptions on models (e.g. models have to be nested in the likelihood ratio test). In the Bayesian context, among the criteria for explanation, the ratio of the observed marginal densities for two competing models, named Bayes Factor (BF), is commonly used for both model choice and model averaging (Kass and Raftery, J. Am. Stat. Ass., 1995). But BF does not apply to improper priors and, even when the prior is proper, it is not robust to the specification of the prior. These limitations can be extended to two famous penalized likelihood methods as the Akaike Information Criterion (AIC) and the Bayesian Information Criterion (BIC), since they are proved to be approximations of -2log BF . In the perspective that a model is as good as its predictions, the predictive information criteria aim at evaluating the predictive accuracy of Bayesian models or, in other words, at estimating expected out-of-sample prediction error using a bias-correction adjustment of within-sample error (Gelman et al., Stat. Comput., 2014). In particular, the Watanabe criterion is fully Bayesian because it averages the predictive distribution over the posterior distribution of parameters rather than conditioning on a point estimate, but it is hardly applicable to data which are not independent given parameters (Watanabe, J. Mach. Learn. Res., 2010). A solution is given by Ando and Tsay criterion where the joint density may be decomposed into the product of the conditional densities (Ando and Tsay, Int. J. Forecast., 2010). The above mentioned criteria are global summary measures of model performance, but more detailed analysis could be required to discover the reasons for poor global performance. In this latter case, a retrospective predictive analysis is performed on each individual observation. In this study we performed the Bayesian analysis of Italian data sets by four versions of a long-term hazard model known as the stress release model (Vere-Jones, J. Physics Earth, 1978; Bebbington and Harte, Geophys. J. Int., 2003; Varini and Rotondi, Environ. Ecol. Stat., 2015). Then we illustrate the results on their performance evaluated by Bayes Factor, predictive information criteria and retrospective predictive analysis.
NASA Astrophysics Data System (ADS)
Yan, Peng; Lu, Wenbo; Zhang, Jing; Zou, Yujun; Chen, Ming
2017-04-01
Ground vibration, as the most critical public hazard of blasting, has received much attention from the community. Many countries established national standards to suppress vibration impact on structures, but a world-accepted blasting vibration criterion on human safety is still missing. In order to evaluate human response to the vibration from blasting excavation of a large-scale rock slope in China, this study aims to suggest a revised criterion. The vibration frequency was introduced to improve the existing single-factor (peak particle velocity) standard recommended by the United States Bureau of Mines (USBM). The feasibility of the new criterion was checked based on field vibration monitoring and investigation of human reactions. Moreover, the air overpressure or blast effects on human beings have also been discussed. The result indicates that the entire zone of influence can be divided into three subzones: severe-annoyance, light-annoyance and perception zone according to the revised safety standard. Both the construction company and local residents have provided positive comments on this influence degree assessment, which indicates that the presented criterion is suitable for evaluating human response to nearby blasts. Nevertheless, this specific criterion needs more field tests and verifications before it can be
Wireless sensor placement for structural monitoring using information-fusing firefly algorithm
NASA Astrophysics Data System (ADS)
Zhou, Guang-Dong; Yi, Ting-Hua; Xie, Mei-Xi; Li, Hong-Nan
2017-10-01
Wireless sensor networks (WSNs) are promising technology in structural health monitoring (SHM) applications for their low cost and high efficiency. The limited wireless sensors and restricted power resources in WSNs highlight the significance of optimal wireless sensor placement (OWSP) during designing SHM systems to enable the most useful information to be captured and to achieve the longest network lifetime. This paper presents a holistic approach, including an optimization criterion and a solution algorithm, for optimally deploying self-organizing multi-hop WSNs on large-scale structures. The combination of information effectiveness represented by the modal independence and the network performance specified by the network connectivity and network lifetime is first formulated to evaluate the performance of wireless sensor configurations. Then, an information-fusing firefly algorithm (IFFA) is developed to solve the OWSP problem. The step sizes drawn from a Lévy distribution are adopted to drive fireflies toward brighter individuals. Following the movement with Lévy flights, information about the contributions of wireless sensors to the objective function as carried by the fireflies is fused and applied to move inferior wireless sensors to better locations. The reliability of the proposed approach is verified via a numerical example on a long-span suspension bridge. The results demonstrate that the evaluation criterion provides a good performance metric of wireless sensor configurations, and the IFFA outperforms the simple discrete firefly algorithm.
Le Pen, C; Priol, G; Lilliu, H
2003-01-01
The criteria for the registration of new drugs may differ from the criteria for drug reimbursement. In 2000 the French government entrusted the French Medicines Agency with determining the "medical service rendered" (MSR) for each reimbursable drug. The goal was to determine which drugs could be classified with an "insufficient" MSR and therefore should be taken out of the scope of health insurance. We analyze the concepts and methods used for this evaluation and the kind of results that are obtained. We collected data on the result of MSR classification and the criteria used to perform this classification (efficacy-security, severity of the disease,place in the therapeutic strategy, existence of therapeutic alternative, public health value) for a sample of 1453 drugs belonging to five therapeutic areas. We used statistical analysis to determine what were the most influential criteria. Only two criteria - efficacy and disease severity - suffice to very largely explain the MSR classification. The other criteria contribute little added value. Some of these criteria clearly suffer from a lack of clarification, leading to different interpretations according to therapeutic class or even according to drug or drug family. The evaluation procedure differs between therapeutic classes, at least at intermediate MSR levels. Analysis of the French experience with MSR shows that the evaluation procedure has not succeeded in completely breaking away from the traditional logic of the marketing authorization and registration, as witnessed by the importance of the "efficacy/safety" criterion, the absence of an economic criterion, and the vagueness of the "public health value" criterion, which one would have thought would instead be decisive.
Ramírez, David; Caballero, Julio
2018-04-28
Molecular docking is the most frequently used computational method for studying the interactions between organic molecules and biological macromolecules. In this context, docking allows predicting the preferred pose of a ligand inside a receptor binding site. However, the selection of the “best” solution is not a trivial task, despite the widely accepted selection criterion that the best pose corresponds to the best energy score. Here, several rigid-target docking methods were evaluated on the same dataset with respect to their ability to reproduce crystallographic binding orientations, to test if the best energy score is a reliable criterion for selecting the best solution. For this, two experiments were performed: (A) to reconstruct the ligand-receptor complex by performing docking of the ligand in its own crystal structure receptor (defined as self-docking), and (B) to reconstruct the ligand-receptor complex by performing docking of the ligand in a crystal structure receptor that contains other ligand (defined as cross-docking). Root-mean square deviation (RMSD) was used to evaluate how different the obtained docking orientation is from the corresponding co-crystallized pose of the same ligand molecule. We found that docking score function is capable of predicting crystallographic binding orientations, but the best ranked solution according to the docking energy is not always the pose that reproduces the experimental binding orientation. This happened when self-docking was achieved, but it was critical in cross-docking. Taking into account that docking is typically used with predictive purposes, during cross-docking experiments, our results indicate that the best energy score is not a reliable criterion to select the best solution in common docking applications. It is strongly recommended to choose the best docking solution according to the scoring function along with additional structural criteria described for analogue ligands to assure the selection of a correct docking solution.
A Controlled Evaluation of the Distress Criterion for Binge Eating Disorder
Grilo, Carlos M.; White, Marney A.
2012-01-01
Objective Research has examined various aspects of the validity of the research criteria for binge eating disorder (BED) but has yet to evaluate the utility of criterion C “marked distress about binge eating.” This study examined the significance of the marked distress criterion for BED using two complementary comparisons groups. Method A total of 1075 community volunteers completed a battery of self-report instruments as part of an internet study. Analyses compared body mass index (BMI), eating-disorder psychopathology, and depressive levels in four groups: 97 participants with BED except for the distress criterion (BED-ND), 221 participants with BED including the distress criterion (BED), 79 participants with bulimia nervosa (BN), and 489 obese participants without binge-eating or purging (NBPO). Parallel analyses compared these study groups using the broadened frequency criterion (i.e., once-weekly for binge/purge behaviors) proposed for DSM-5 and the DSM-IV twice-weekly frequency criterion. Results The BED group had significantly greater eating-disorder psychopathology and depressive levels than the BED-ND group. The BED group, but not the BED-ND group, had significantly greater eating-disorder psychopathology than the NBPO comparison group. The BN group had significantly greater eating-disorder psychopathology and depressive levels than all three other groups. The group differences existed even after controlling for depression levels, BMI, and demographic variables, although some differences between the BN and BED groups were attenuated when controlling for depression levels. Conclusions These findings provide support for the validity of the “marked distress” criterion for the diagnosis of BED. PMID:21707133
ERIC Educational Resources Information Center
Stewart, Kelise K.; Carr, James E.; Brandt, Charles W.; McHenry, Meade M.
2007-01-01
The present study evaluated the effects of both a traditional lecture and the conservative dual-criterion (CDC) judgment aid on the ability of 6 university students to visually inspect AB-design line graphs. The traditional lecture reliably failed to improve visual inspection accuracy, whereas the CDC method substantially improved the performance…
Criterion-Referenced Job Proficiency Testing: A Large Scale Application. Research Report 1193.
ERIC Educational Resources Information Center
Maier, Milton H.; Hirshfeld, Stephen F.
The Army Skill Qualification Tests (SQT's) were designed to determine levels of competence in performance of the tasks crucial to an enlisted soldier's occupational specialty. SQT's are performance-based, criterion-referenced measures which offer two advantages over traditional proficiency and achievement testing programs: test content can be made…
Personal Career Orientation. Performance Objectives. Criterion Measures. Home Economics.
ERIC Educational Resources Information Center
Allen, Alveta; And Others
Several intermediate performance objectives and corresponding criterion measures are listed for each of six terminal objectives for a personal career orientation course for seventh grade students. This 6- to 9-week course is designed to acquaint the student with personal qualities and characteristics necessary for success in the world of work.…
Generalization of von Neumann analysis for a model of two discrete half-spaces: The acoustic case
Haney, M.M.
2007-01-01
Evaluating the performance of finite-difference algorithms typically uses a technique known as von Neumann analysis. For a given algorithm, application of the technique yields both a dispersion relation valid for the discrete time-space grid and a mathematical condition for stability. In practice, a major shortcoming of conventional von Neumann analysis is that it can be applied only to an idealized numerical model - that of an infinite, homogeneous whole space. Experience has shown that numerical instabilities often arise in finite-difference simulations of wave propagation at interfaces with strong material contrasts. These interface instabilities occur even though the conventional von Neumann stability criterion may be satisfied at each point of the numerical model. To address this issue, I generalize von Neumann analysis for a model of two half-spaces. I perform the analysis for the case of acoustic wave propagation using a standard staggered-grid finite-difference numerical scheme. By deriving expressions for the discrete reflection and transmission coefficients, I study under what conditions the discrete reflection and transmission coefficients become unbounded. I find that instabilities encountered in numerical modeling near interfaces with strong material contrasts are linked to these cases and develop a modified stability criterion that takes into account the resulting instabilities. I test and verify the stability criterion by executing a finite-difference algorithm under conditions predicted to be stable and unstable. ?? 2007 Society of Exploration Geophysicists.
Simulated Driving Assessment (SDA) for Teen Drivers: Results from a Validation Study
McDonald, Catherine C.; Kandadai, Venk; Loeb, Helen; Seacrist, Thomas S.; Lee, Yi-Ching; Winston, Zachary; Winston, Flaura K.
2015-01-01
Background Driver error and inadequate skill are common critical reasons for novice teen driver crashes, yet few validated, standardized assessments of teen driving skills exist. The purpose of this study was to evaluate the construct and criterion validity of a newly developed Simulated Driving Assessment (SDA) for novice teen drivers. Methods The SDA's 35-minute simulated drive incorporates 22 variations of the most common teen driver crash configurations. Driving performance was compared for 21 inexperienced teens (age 16–17 years, provisional license ≤90 days) and 17 experienced adults (age 25–50 years, license ≥5 years, drove ≥100 miles per week, no collisions or moving violations ≤3 years). SDA driving performance (Error Score) was based on driving safety measures derived from simulator and eye-tracking data. Negative driving outcomes included simulated collisions or run-off-the-road incidents. A professional driving evaluator/instructor reviewed videos of SDA performance (DEI Score). Results The SDA demonstrated construct validity: 1.) Teens had a higher Error Score than adults (30 vs. 13, p=0.02); 2.) For each additional error committed, the relative risk of a participant's propensity for a simulated negative driving outcome increased by 8% (95% CI: 1.05–1.10, p<0.01). The SDA demonstrated criterion validity: Error Score was correlated with DEI Score (r=−0.66, p<0.001). Conclusions This study supports the concept of validated simulated driving tests like the SDA to assess novice driver skill in complex and hazardous driving scenarios. The SDA, as a standard protocol to evaluate teen driver performance, has the potential to facilitate screening and assessment of teen driving readiness and could be used to guide targeted skill training. PMID:25740939
Evaluation of a strapless heart rate monitor during simulated flight tasks.
Wang, Zhen; Fu, Shan
2016-01-01
Pilots are under high task demands during flight. Monitoring pilot's physiological status is very important in the evaluation of pilot's workload and flight safety. Recently, physiological status monitor (PSM) has been embedded into a watch that can be used without a conventional chest strap. This makes it possible to unobtrusively monitor, log and transmit pilot's physiological measurements such as heart rate (HR) during flight tasks. The purpose of this study is to validate HR recorded by a strapless heart rate watch against criterion ECG-derived HR. Ten commercial pilots (mean ± SD : age: 39.1 ± 7.8 years; total flight hours 7173.2 ± 5270.9 hr) performed three routinely trained flight tasks in a full flight simulator: wind shear go-around (WG), takeoff and climb (TC), and hydraulic failure (HF). For all tasks combined (overall) and for each task, differences between the heart rate watch measurements and the criterion data were small (mean difference [95% CI]: overall: -0.71 beats/min [-0.85, -0.57]; WG: -0.90 beats/min [-1.15, -0.65]; TC: -0.69 beats/min [-0.98, -0.40]; HF: -0.61 beats/min [-0.80, -0.42]). There were high correlations between the heart rate watch measurements and the ECG-derived HR for all tasks (r ≥ 0.97, SEE < 3). Bland-Altman plots also show high agreements between the watch measurements and the criterion HR. These results suggest that the strapless heart rate watch provides valid measurements of HR during simulated flight tasks and could be a useful tool for pilot workload evaluation.
Angers, Magalie; Svotelis, Amy; Balg, Frederic; Allard, Jean-Pascal
2016-04-01
The Ankle Osteoarthritis Scale (AOS) is a self-administered score specific for ankle osteoarthritis (OA) with excellent reliability and strong construct and criterion validity. Many recent randomized multicentre trials have used the AOS, and the involvement of the French-speaking population is limited by the absence of a French version. Our goal was to develop a French version and validate the psychometric properties to assure equivalence to the original English version. Translation was performed according to American Association of Orthopaedic Surgeons (AAOS) 2000 guidelines for cross-cultural adaptation. Similar to the validation process of the English AOS, we evaluated the psychometric properties of the French version (AOS-Fr): criterion validity (AOS-Fr v. Western Ontario and McMaster Universities Arthritis Index [WOMAC] and SF-36 scores), construct validity (AOS-Fr correlation to single heel-lift test), and reliability (AOS-Fr test-retest). Sixty healthy individuals tested a prefinal version of the AOS-Fr for comprehension, leading to modifications and a final version that was approved by C. Saltzman, author of the AOS. We then recruited patients with ankle OA for evaluation of the AOS-Fr psychometric properties. Twenty-eight patients with ankle OA participated in the evaluation. The AOS-Fr showed strong criterion validity (AOS:WOMAC r = 0.709 and AOS:SF-36 r = -0.654) and construct validity (r = 0.664) and proved to be reliable (test-retest intraclass correlation coefficient = 0.922). The AOS-Fr is a reliable and valid score equivalent to the English version in terms of psychometric properties, thus is available for use in multicentre trials.
Comparison of Nurse Staffing Measurements in Staffing-Outcomes Research.
Park, Shin Hye; Blegen, Mary A; Spetz, Joanne; Chapman, Susan A; De Groot, Holly A
2015-01-01
Investigators have used a variety of operational definitions of nursing hours of care in measuring nurse staffing for health services research. However, little is known about which approach is best for nurse staffing measurement. To examine whether various nursing hours measures yield different model estimations when predicting patient outcomes and to determine the best method to measure nurse staffing based on the model estimations. We analyzed data from the University HealthSystem Consortium for 2005. The sample comprised 208 hospital-quarter observations from 54 hospitals, representing information on 971 adult-care units and about 1 million inpatient discharges. We compared regression models using different combinations of staffing measures based on productive/nonproductive and direct-care/indirect-care hours. Akaike Information Criterion and Bayesian Information Criterion were used in the assessment of staffing measure performance. The models that included the staffing measure calculated from productive hours by direct-care providers were best, in general. However, the Akaike Information Criterion and Bayesian Information Criterion differences between models were small, indicating that distinguishing nonproductive and indirect-care hours from productive direct-care hours does not substantially affect the approximation of the relationship between nurse staffing and patient outcomes. This study is the first to explicitly evaluate various measures of nurse staffing. Productive hours by direct-care providers are the strongest measure related to patient outcomes and thus should be preferred in research on nurse staffing and patient outcomes.
Criterion-Referenced Testing for College-Level General Education: Some Problems and Recommendations.
ERIC Educational Resources Information Center
Benoist, Howard
1979-01-01
The adoption of a criterion-referenced assessment system and the resulting disadvantages of this form of evaluation for the college general education program are discussed, including problems in identifying assessment validation procedures. (RAO)
San Francisco floating STOLport study
NASA Technical Reports Server (NTRS)
1974-01-01
The operational, economic, environmental, social and engineering feasibility of utilizing deactivated maritime vessels as a waterfront quiet short takeoff and landing facility to be located near the central business district of San Francisco was investigated. Criteria were developed to evaluate each site, and minimum standards were established for each criterion. Predicted conditions at the two sites were compared to the requirements for each of the 11 criteria as a means of evaluating site performance. Criteria include land use, community structure, economic impact, access, visual character, noise, air pollution, natural environment, weather, air traffic, and terminal design.
Guidance strategies and analysis for low thrust navigation
NASA Technical Reports Server (NTRS)
Jacobson, R. A.
1973-01-01
A low-thrust guidance algorithm suitable for operational use was formulated. A constrained linear feedback control law was obtained using a minimum terminal miss criterion and restricting control corrections to constant changes for specified time periods. Both fixed- and variable-time-of-arrival guidance were considered. The performance of the guidance law was evaluated by applying it to the approach phase of the 1980 rendezvous mission with the comet Encke.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Pilch, M.M.; Allen, M.D.; Klamerus, E.W.
1996-02-01
This report uses the scenarios described in NUREG/CR-6075 and NUREG/CR-6075, Supplement 1, to address the direct containment heating (DCH) issue for all Westinghouse plants with large dry or subatmospheric containments. DCH is considered resolved if the conditional containment failure probability (CCFP) is less than 0.1. Loads versus strength evaluations of the CCFP were performed for each plant using plant-specific information. The DCH issue is considered resolved for a plant if a screening phase results in a CCFP less than 0.01, which is more stringent than the overall success criterion. If the screening phase CCFP for a plant is greater thanmore » 0.01, then refined containment loads evaluations must be performed and/or the probability of high pressure at vessel breach must be analyzed. These analyses could be used separately or could be integrated together to recalculate the CCFP for an individual plant to reduce the CCFP to meet the overall success criterion of less than 0.1. The CCFPs for all of the Westinghouse plants with dry containments were less than 0.01 at the screening phase, and thus, the DCH issue is resolved for these plants based on containment loads alone. No additional analyses are required.« less
The Counselor Evaluation Rating Scale: A Valid Criterion of Counselor Effectiveness?
ERIC Educational Resources Information Center
Jones, Lawrence K.
1974-01-01
The validity of recent recommendations regarding the use of certain factors of the 16 Personality Factor Questionnaire (16PF) to select persons for counselor training programs, where the CERS was the criterion measure, is challenged. (Author)
ERIC Educational Resources Information Center
Maljaars, Jarymke; Noens, Ilse; Scholte, Evert; van Berckelaer-Onnes, Ina
2012-01-01
The Diagnostic Interview for Social and Communication Disorders (DISCO; Wing, 2006) is a standardized, semi-structured and interviewer-based schedule for diagnosis of autism spectrum disorder (ASD). The objective of this study was to evaluate the criterion and convergent validity of the DISCO-11 ICD-10 algorithm in young and low-functioning…
He, Bosheng; Gu, Jinhua; Huang, Sheng; Gao, Xuesong; Fan, Jinhe; Sheng, Meihong; Wang, Lin; Gong, Shenchu
2017-02-01
This study was performed to evaluate the diagnostic performance of multi-slice CT angiography combined with enterography in determining the cause and location of obstruction as well as intestinal ischaemia in patients with small bowel obstruction (SBO). This study retrospectively summarized the image data of 57 SBO patients who received both multi-slice CT angiography and enterography examination between December 2012 and May 2013. The CT diagnoses of SBO and intestinal ischaemia were correlated with the findings at surgery or digital subtraction angiography, which were set as standard references. Multi-slice CT angiography and enterography indicated that the cause of SBO in three patients was misjudged, suggesting a diagnostic accuracy of 94.7%. In one patient the level of obstruction was incorrect, demonstrating a diagnostic accuracy of 98.2%. Based on the results of the receiver operating characteristic (ROC) curve analysis, the diagnostic criterion for ischaemic SBO was at least two of the four CT signs (circumferential bowel wall thickening, reduced enhancement of the intestinal wall, mesenteric oedema and mesenteric vascular engorgement). The criterion yielded a sensitivity of 94.4%, a specificity of 92.3%, a positive predicted value of 85.0% and a negative predicted value of 97.3%, and the area under curve (AUC) was 0.92 (95% CI, 0.85-0.99). Multi-slice CT angiography and enterography have high diagnostic value in identifying the cause and site of SBO. In addition, the suggested diagnostic criterion using CT signs is helpful for diagnosing intestinal ischaemia in SBO patients. © 2016 The Royal Australian and New Zealand College of Radiologists.
Wei, Guo-Zhen; Lu, Xia; Ke, Fu-Sheng; Huang, Ling; Li, Jun-Tao; Wang, Zhao-Xiang; Zhou, Zhi-You; Sun, Shi-Gang
2010-10-15
A cathode for high-rate performance lithium-ion batteries (LIBs) has been developed from a crystal habit-tuned nanoplate Li(Li(0.17)Ni(0.25)Mn(0.58))O₂ material, in which the proportion of (010) nanoplates (see figure) has been significantly increased. The results demonstrate that the fraction of the surface that is electrochemically active for Li(+) transportation is a key criterion for evaluating the different nanostructures of potential LIB materials.
ERIC Educational Resources Information Center
Duval County School Board, Jacksonville, FL.
Several intermediate performance objectives and corresponding criterion measures are presented for each of five terminal objectives for a 12- to 18-week course designed to provide students in grades 8 or 9 with opportunities to explore a broad range of clothing management, production, and service occupations. The course was designed to provide…
Consumer Education--Home Economics. Performance Objectives. Criterion Measures. Home Economics.
ERIC Educational Resources Information Center
Duval County School Board, Jacksonville, FL.
Several intermediate performance objectives and corresponding criterion measures are listed for each of six terminal objectives for an 18-week consumer education-home economics course for 10th, 11th, and 12th grade students. Purposes listed for the course are to develop an understanding of the American market system, and how the individual affects…
The Development of a Criterion Instrument for Counselor Selection.
ERIC Educational Resources Information Center
Remer, Rory; Sease, William
A measure of potential performance as a counselor is needed as an adjunct to the information presently employed in selection decisions. This article deals with one possible method of development of such a potential performance criterion and the steps taken, to date, in the attempt to validate it. It includes: the overall effectiveness of the…
Model selection for the North American Breeding Bird Survey: A comparison of methods
Link, William; Sauer, John; Niven, Daniel
2017-01-01
The North American Breeding Bird Survey (BBS) provides data for >420 bird species at multiple geographic scales over 5 decades. Modern computational methods have facilitated the fitting of complex hierarchical models to these data. It is easy to propose and fit new models, but little attention has been given to model selection. Here, we discuss and illustrate model selection using leave-one-out cross validation, and the Bayesian Predictive Information Criterion (BPIC). Cross-validation is enormously computationally intensive; we thus evaluate the performance of the Watanabe-Akaike Information Criterion (WAIC) as a computationally efficient approximation to the BPIC. Our evaluation is based on analyses of 4 models as applied to 20 species covered by the BBS. Model selection based on BPIC provided no strong evidence of one model being consistently superior to the others; for 14/20 species, none of the models emerged as superior. For the remaining 6 species, a first-difference model of population trajectory was always among the best fitting. Our results show that WAIC is not reliable as a surrogate for BPIC. Development of appropriate model sets and their evaluation using BPIC is an important innovation for the analysis of BBS data.
SU-F-T-272: Patient Specific Quality Assurance of Prostate VMAT Plans with Portal Dosimetry
DOE Office of Scientific and Technical Information (OSTI.GOV)
Darko, J; Osei, E; University of Waterloo, Waterloo, ON
Purpose: To evaluate the effectiveness of using the Portal Dosimetry (PD) method for patient specific quality assurance of prostate VMAT plans. Methods: As per institutional protocol all VMAT plans were measured using the Varian Portal Dosimetry (PD) method. A gamma evaluation criterion of 3%-3mm with a minimum area gamma pass rate (gamma <1) of 95% is used clinically for all plans. We retrospectively evaluated the portal dosimetry results for 170 prostate patients treated with VMAT technique. Three sets of criterions were adopted for re-evaluating the measurements; 3%-3mm, 2%-2mm and 1%-1mm. For all criterions two areas, Field+1cm and MLC-CIAO were analysed.Tomore » ascertain the effectiveness of the portal dosimetry technique in determining the delivery accuracy of prostate VMAT plans, 10 patients previously measured with portal dosimetry, were randomly selected and their measurements repeated using the ArcCHECK method. The same criterion used in the analysis of PD was used for the ArcCHECK measurements. Results: All patient plans reviewed met the institutional criteria for Area Gamma pass rate. Overall, the gamma pass rate (gamma <1) decreases for 3%-3mm, 2%-2mm and 1%-1mm criterion. For each criterion the pass rate was significantly reduced when the MLC-CIAO was used instead of FIELD+1cm. There was noticeable change in sensitivity for MLC-CIAO with 2%-2mm criteria and much more significant reduction at 1%-1mm. Comparable results were obtained for the ArcCHECK measurements. Although differences were observed between the clockwise verses the counter clockwise plans in both the PD and ArcCHECK measurements, this was not deemed to be statistically significant. Conclusion: This work demonstrates that Portal Dosimetry technique can be effectively used for quality assurance of VMAT plans. Results obtained show similar sensitivity compared to ArcCheck. To reveal certain delivery inaccuracies, the use of a combination of criterions may provide an effective way in improving the overall sensitivity of PD. Funding provided in part by the Prostate Ride for Dad, Kitchener-Waterloo, Canada.« less
Measures and Interpretations of Vigilance Performance: Evidence Against the Detection Criterion
NASA Technical Reports Server (NTRS)
Balakrishnan, J. D.
1998-01-01
Operators' performance in a vigilance task is often assumed to depend on their choice of a detection criterion. When the signal rate is low this criterion is set high, causing the hit and false alarm rates to be low. With increasing time on task the criterion presumably tends to increase even further, thereby further decreasing the hit and false alarm rates. Virtually all of the empirical evidence for this simple interpretation is based on estimates of the bias measure Beta from signal detection theory. In this article, I describe a new approach to studying decision making that does not require the technical assumptions of signal detection theory. The results of this new analysis suggest that the detection criterion is never biased toward either response, even when the signal rate is low and the time on task is long. Two modifications of the signal detection theory framework are considered to account for this seemingly paradoxical result. The first assumes that the signal rate affects the relative sizes of the variances of the information distributions; the second assumes that the signal rate affects the logic of the operator's stopping rule. Actual or potential applications of this research include the improved training and performance assessment of operators in areas such as product quality control, air traffic control, and medical and clinical diagnosis.
Validation of Cost-Effectiveness Criterion for Evaluating Noise Abatement Measures
DOT National Transportation Integrated Search
1999-04-01
This project will provide the Texas Department of Transportation (TxDOT)with information about the effects of the current cost-effectiveness criterion. The project has reviewed (1) the cost-effectiveness criteria used by other states, (2) the noise b...
Dahlke, Jeffrey A; Kostal, Jack W; Sackett, Paul R; Kuncel, Nathan R
2018-05-03
We explore potential explanations for validity degradation using a unique predictive validation data set containing up to four consecutive years of high school students' cognitive test scores and four complete years of those students' college grades. This data set permits analyses that disentangle the effects of predictor-score age and timing of criterion measurements on validity degradation. We investigate the extent to which validity degradation is explained by criterion dynamism versus the limited shelf-life of ability scores. We also explore whether validity degradation is attributable to fluctuations in criterion variability over time and/or GPA contamination from individual differences in course-taking patterns. Analyses of multiyear predictor data suggest that changes to the determinants of performance over time have much stronger effects on validity degradation than does the shelf-life of cognitive test scores. The age of predictor scores had only a modest relationship with criterion-related validity when the criterion measurement occasion was held constant. Practical implications and recommendations for future research are discussed. (PsycINFO Database Record (c) 2018 APA, all rights reserved).
Above-real-time training (ARTT) improves transfer to a simulated flight control task.
Donderi, D C; Niall, Keith K; Fish, Karyn; Goldstein, Benjamin
2012-06-01
The aim of this study was to measure the effects of above-real-time-training (ARTT) speed and screen resolution on a simulated flight control task. ARTT has been shown to improve transfer to the criterion task in some military simulation experiments. We tested training speed and screen resolution in a project, sponsored by Defence Research and Development Canada, to develop components for prototype air mission simulators. For this study, 54 participants used a single-screen PC-based flight simulation program to learn to chase and catch an F-18A fighter jet with another F-18A while controlling the chase aircraft with a throttle and side-stick controller. Screen resolution was varied between participants, and training speed was varied factorially across two sessions within participants. Pretest and posttest trials were at high resolution and criterion (900 knots) speed. Posttest performance was best with high screen resolution training and when one ARTT training session was followed by a session of criterion speed training. ARTT followed by criterion training improves performance on a visual-motor coordination task. We think that ARTT influences known facilitators of transfer, including similarity to the criterion task and contextual interference. Use high-screen resolution, start with ARTT, and finish with criterion speed training when preparing a mission simulation.
Brinkman, Willem M; Luursema, Jan-Maarten; Kengen, Bas; Schout, Barbara M A; Witjes, J Alfred; Bekkers, Ruud L
2013-03-01
To answer 2 research questions: what are the learning curve patterns of novices on the da Vinci skills simulator parameters and what parameters are appropriate for criterion-based robotic training. A total of 17 novices completed 2 simulator sessions within 3 days. Each training session consisted of a warming-up exercise, followed by 5 repetitions of the "ring and rail II" task. Expert participants (n = 3) performed a warming-up exercise and 3 repetitions of the "ring and rail II" task on 1 day. We analyzed all 9 parameters of the simulator. Significant learning occurred on 5 parameters: overall score, time to complete, instrument collision, instruments out of view, and critical errors within 1-10 repetitions (P <.05). Economy of motion and excessive instrument force only showed improvement within the first 5 repetitions. No significant learning on the parameter drops and master workspace range was found. Using the expert overall performance score (n = 3) as a criterion (overall score 90%), 9 of 17 novice participants met the criterion within 10 repetitions. Most parameters showed that basic robotic skills are learned relatively quickly using the da Vinci skills simulator, but that 10 repetitions were not sufficient for most novices to reach an expert level. Some parameters seemed inappropriate for expert-based criterion training because either no learning occurred or the novice performance was equal to expert performance. Copyright © 2013 Elsevier Inc. All rights reserved.
On the evidence for species coexistence: a critique of the coexistence program.
Siepielski, Adam M; McPeek, Mark A
2010-11-01
A major challenge in ecology is to understand how the millions of species on Earth are organized into biological communities. Mechanisms promoting coexistence are one such class of organizing processes, which allow multiple species to persist in the same trophic level of a given web of species interactions. If some mechanism promotes the coexistence of two or more species, each species must be able to increase when it is rare and the others are at their typical abundances; this invasibility criterion is fundamental evidence for species coexistence regardless of the mechanism. In an attempt to evaluate the level of empirical support for coexistence mechanisms in nature, we surveyed the literature for empirical studies of coexistence at a local scale (i.e., species found living together in one place) to determine whether these studies satisfied the invasibility criterion. In our survey, only seven of 323 studies that drew conclusions about species coexistence evaluated invasibility in some way in either observational or experimental studies. In addition, only three other studies evaluated necessary but not sufficient conditions for invasibility (i.e., negative density dependence and a trade-off in performance that influences population regulation). These results indicate that, while species coexistence is a prevalent assumption for why species are able to live together in one place, critical empirical tests of this fundamental assumption of community structure are rarely performed. These tests are central to developing a more robust understanding of the relative contributions of both deterministic and stochastic processes structuring biological communities.
NASA Astrophysics Data System (ADS)
Guo, Ning; Yang, Zhichun; Wang, Le; Ouyang, Yan; Zhang, Xinping
2018-05-01
Aiming at providing a precise dynamic structural finite element (FE) model for dynamic strength evaluation in addition to dynamic analysis. A dynamic FE model updating method is presented to correct the uncertain parameters of the FE model of a structure using strain mode shapes and natural frequencies. The strain mode shape, which is sensitive to local changes in structure, is used instead of the displacement mode for enhancing model updating. The coordinate strain modal assurance criterion is developed to evaluate the correlation level at each coordinate over the experimental and the analytical strain mode shapes. Moreover, the natural frequencies which provide the global information of the structure are used to guarantee the accuracy of modal properties of the global model. Then, the weighted summation of the natural frequency residual and the coordinate strain modal assurance criterion residual is used as the objective function in the proposed dynamic FE model updating procedure. The hybrid genetic/pattern-search optimization algorithm is adopted to perform the dynamic FE model updating procedure. Numerical simulation and model updating experiment for a clamped-clamped beam are performed to validate the feasibility and effectiveness of the present method. The results show that the proposed method can be used to update the uncertain parameters with good robustness. And the updated dynamic FE model of the beam structure, which can correctly predict both the natural frequencies and the local dynamic strains, is reliable for the following dynamic analysis and dynamic strength evaluation.
NASA Astrophysics Data System (ADS)
Wu, Hsin-Hung; Tsai, Ya-Ning
2012-11-01
This study uses both analytic hierarchy process (AHP) and decision-making trial and evaluation laboratory (DEMATEL) methods to evaluate the criteria in auto spare parts industry in Taiwan. Traditionally, AHP does not consider indirect effects for each criterion and assumes that criteria are independent without further addressing the interdependence between or among the criteria. Thus, the importance computed by AHP can be viewed as short-term improvement opportunity. On the contrary, DEMATEL method not only evaluates the importance of criteria but also depicts the causal relations of criteria. By observing the causal diagrams, the improvement based on cause-oriented criteria might improve the performance effectively and efficiently for the long-term perspective. As a result, the major advantage of integrating AHP and DEMATEL methods is that the decision maker can continuously improve suppliers' performance from both short-term and long-term viewpoints.
NASA Astrophysics Data System (ADS)
Alahmadi, F.; Rahman, N. A.; Abdulrazzak, M.
2014-09-01
Rainfall frequency analysis is an essential tool for the design of water related infrastructure. It can be used to predict future flood magnitudes for a given magnitude and frequency of extreme rainfall events. This study analyses the application of rainfall partial duration series (PDS) in the vast growing urban Madinah city located in the western part of Saudi Arabia. Different statistical distributions were applied (i.e. Normal, Log Normal, Extreme Value type I, Generalized Extreme Value, Pearson Type III, Log Pearson Type III) and their distribution parameters were estimated using L-moments methods. Also, different selection criteria models are applied, e.g. Akaike Information Criterion (AIC), Corrected Akaike Information Criterion (AICc), Bayesian Information Criterion (BIC) and Anderson-Darling Criterion (ADC). The analysis indicated the advantage of Generalized Extreme Value as the best fit statistical distribution for Madinah partial duration daily rainfall series. The outcome of such an evaluation can contribute toward better design criteria for flood management, especially flood protection measures.
Jafarzadeh, S Reza; Johnson, Wesley O; Gardner, Ian A
2016-03-15
The area under the receiver operating characteristic (ROC) curve (AUC) is used as a performance metric for quantitative tests. Although multiple biomarkers may be available for diagnostic or screening purposes, diagnostic accuracy is often assessed individually rather than in combination. In this paper, we consider the interesting problem of combining multiple biomarkers for use in a single diagnostic criterion with the goal of improving the diagnostic accuracy above that of an individual biomarker. The diagnostic criterion created from multiple biomarkers is based on the predictive probability of disease, conditional on given multiple biomarker outcomes. If the computed predictive probability exceeds a specified cutoff, the corresponding subject is allocated as 'diseased'. This defines a standard diagnostic criterion that has its own ROC curve, namely, the combined ROC (cROC). The AUC metric for cROC, namely, the combined AUC (cAUC), is used to compare the predictive criterion based on multiple biomarkers to one based on fewer biomarkers. A multivariate random-effects model is proposed for modeling multiple normally distributed dependent scores. Bayesian methods for estimating ROC curves and corresponding (marginal) AUCs are developed when a perfect reference standard is not available. In addition, cAUCs are computed to compare the accuracy of different combinations of biomarkers for diagnosis. The methods are evaluated using simulations and are applied to data for Johne's disease (paratuberculosis) in cattle. Copyright © 2015 John Wiley & Sons, Ltd.
Increasing money-counting skills with a student with brain injury: skill and performance deficits.
Fienup, Daniel M; Mudgal, Dipti; Pace, Gary
2013-01-01
Two studies examined the effectiveness of interventions designed to increase money-counting skills of a student with brain injury. Both skill and performance hypotheses were examined. Single subject designs were used to evaluate interventions, including a multiple-baseline across counting paper and coin money (study 1) and a changing criterion design (study 2). In study 1, it was hypothesized that the student had a skill deficit; thus, the participant was taught organizational strategies for counting money. In study 2, a performance deficit was hypothesized and the effects of contingent rewards were evaluated. In study 1, organizational strategies increased organized counting of money, but did not affect counting accuracy. In study 2, contingent rewards increased accurate money counting. When dealing with multi-step behaviours, different components of behaviour can be controlled by different variables, such as skill and performance deficits. Effective academic interventions may need to consider both types of deficits.
A new criterion needed to evaluate reliability of digital protective relays
NASA Astrophysics Data System (ADS)
Gurevich, Vladimir
2012-11-01
There is a wide range of criteria and features for evaluating reliability in engineering; but as many as there are, only one of them has been chosen to evaluate reliability of Digital Protective Relays (DPR) in the technical documentation: Mean (operating) Time Between Failures (MTBF), which has gained universal currency and has been specified in technical manuals, information sheets, tender documentation as the key indicator of DPR reliability. But is the choice of this criterion indeed wise? The answer to this question is being sought by the author of this article.
Donders, Jacobus; Janke, Kelly
2008-07-01
The performance of 40 children with complicated mild to severe traumatic brain injury on the Wechsler Intelligence Scale for Children-Fourth Edition (WISC-IV; Wechsler, 2003) was compared with that of 40 demographically matched healthy controls. Of the four WISC-IV factor index scores, only Processing Speed yielded a statistically significant group difference (p < .001) as well as a statistically significant negative correlation with length of coma (p < .01). Logistic regression, using Processing Speed to classify individual children, yielded a sensitivity of 72.50% and a specificity of 62.50%, with false positive and false negative rates both exceeding 30%. We conclude that Processing Speed has acceptable criterion validity in the evaluation of children with complicated mild to severe traumatic brain injury but that the WISC-IV should be supplemented with other measures to assure sufficient accuracy in the diagnostic process.
Working memory training in older adults: evidence of transfer and maintenance effects.
Borella, Erika; Carretti, Barbara; Riboldi, Francesco; De Beni, Rossana
2010-12-01
Few studies have examined working memory (WM) training-related gains and their transfer and maintenance effects in older adults. This present research investigates the efficacy of a verbal WM training program in adults aged 65-75 years, considering specific training gains on a verbal WM (criterion) task as well as transfer effects on measures of visuospatial WM, short-term memory, inhibition, processing speed, and fluid intelligence. Maintenance of training benefits was evaluated at 8-month follow-up. Trained older adults showed higher performance than did controls on the criterion task and maintained this benefit after 8 months. Substantial general transfer effects were found for the trained group, but not for the control one. Transfer maintenance gains were found at follow-up, but only for fluid intelligence and processing speed tasks. The results are discussed in terms of cognitive plasticity in older adults. (c) 2010 APA, all rights reserved).
/sup 99m/Tc-methylene diphosphonate bone imaging in the evaluation of total hip prostheses
DOE Office of Scientific and Technical Information (OSTI.GOV)
Weiss, P.E.; Mall, J.C.; Hoffer, P.B.
1979-12-01
A retrospective study was performed to determine the accuracy of /sup 99m/Tc-methylene diphosphonate bone imaging in the evaluation of total hip arthroplasty for loosening and/or infection. Using focally increased activity at the tip of the femoral component or in the region of the acetabular component as a criterion, the examination was 77% specific and 100% sensitive for loosening and/or infection. A possible explanation for the increased uptake at the tip of the femoral component and the role of this examination in the management of a painful total hip prosthesis are discussed.
Fuzzy approaches to supplier selection problem
NASA Astrophysics Data System (ADS)
Ozkok, Beyza Ahlatcioglu; Kocken, Hale Gonce
2013-09-01
Supplier selection problem is a multi-criteria decision making problem which includes both qualitative and quantitative factors. In the selection process many criteria may conflict with each other, therefore decision-making process becomes complicated. In this study, we handled the supplier selection problem under uncertainty. In this context; we used minimum criterion, arithmetic mean criterion, regret criterion, optimistic criterion, geometric mean and harmonic mean. The membership functions created with the help of the characteristics of used criteria, and we tried to provide consistent supplier selection decisions by using these memberships for evaluating alternative suppliers. During the analysis, no need to use expert opinion is a strong aspect of the methodology used in the decision-making.
Latent Class Analysis of Incomplete Data via an Entropy-Based Criterion
Larose, Chantal; Harel, Ofer; Kordas, Katarzyna; Dey, Dipak K.
2016-01-01
Latent class analysis is used to group categorical data into classes via a probability model. Model selection criteria then judge how well the model fits the data. When addressing incomplete data, the current methodology restricts the imputation to a single, pre-specified number of classes. We seek to develop an entropy-based model selection criterion that does not restrict the imputation to one number of clusters. Simulations show the new criterion performing well against the current standards of AIC and BIC, while a family studies application demonstrates how the criterion provides more detailed and useful results than AIC and BIC. PMID:27695391
Development of a new instrument for determining the level of chewing function in children.
Serel Arslan, S; Demir, N; Barak Dolgun, A; Karaduman, A A
2016-07-01
This study aimed to develop a chewing performance scale that classifies chewing from normal to severely impaired and to investigate its validity and reliability. The study included the developmental phase and reported the content, structural, criterion validity, interobserver and intra-observer reliability of the chewing performance scale, which was called the Karaduman Chewing Performance Scale (KCPS). A dysphagia literature review, other questionnaires and clinical experiences were used in the developmental phase. Seven experts assessed the steps for content validity over two Delphi rounds. To test structural, criterion validity, interobserver and intra-observer reliability, two swallowing therapists evaluated chewing videos of 144 children (Group I: 61 healthy children without chewing disorders, mean age of 42·38 ± 9·36 months; Group II: 83 children with cerebral palsy who have chewing disorders, mean age of 39·09 ± 22·95 months) using KCPS. The Behavioral Pediatrics Feeding Assessment Scale (BPFAS) was used for criterion validity. The KCPS steps arranged between 0-4 were found to be necessary. The content validity index was 0·885. The KCPS levels were found to be different between groups I and II (χ(2) = 123·286, P < 0·001). A moderately strong positive correlation was found between the KCPS and the subscales of the BPFAS (r = 0·444-0·773, P < 0·001). An excellent positive correlation was detected between two swallowing therapists and between two examinations of one swallowing therapist (r = 0·962, P < 0·001; r = 0·990, P < 0·001, respectively). The KCPS is a valid, reliable, quick and clinically easy-to-use functional instrument for determining the level of chewing function in children. © 2016 John Wiley & Sons Ltd.
[Evaluation and improvement of the management of informed consent in the emergency department].
del Pozo, P; García, J A; Escribano, M; Soria, V; Campillo-Soto, A; Aguayo-Albasini, J L
2009-01-01
To assess the preoperative management in our emergency surgical service and to improve the quality of the care provided to patients. In order to find the causes of non-compliance, the Ishikawa Fishbone diagram was used and eight assessment criteria were chosen. The first assessment includes 120 patients operated on from January to April 2007. Corrective measures were implemented, which consisted of meetings and conferences with doctors and nurses, insisting on the importance of the informed consent as a legal document which must be signed by patients, and the obligation of giving a copy to patients or relatives. The second assessment includes the period from July to October 2007 (n=120). We observed a high non-compliance of C1 signing of surgical consent (CRITERION 1: all patients or relatives have to sign the surgical informed consent for the operation to be performed [27.5%]) and C2 giving a copy of the surgical consent (CRITERION 2: all patients or relatives must have received a copy of the surgical informed consent for the Surgery to be performed [72.5%]) and C4 anaesthetic consent copy (CRITERION 4: all patients or relatives must have received a copy of the Anaesthesia informed consent corresponding to the operation performed [90%]). After implementing corrective measures a significant improvement was observed in the compliance of C2 and C4. In C1 there was an improvement without statistical significance. The carrying out of an improvement cycle enabled the main objective of this paper to be achieved: to improve the management of informed consent and the quality of the care and information provided to our patients.
Predicting space telerobotic operator training performance from human spatial ability assessment
NASA Astrophysics Data System (ADS)
Liu, Andrew M.; Oman, Charles M.; Galvan, Raquel; Natapoff, Alan
2013-11-01
Our goal was to determine whether existing tests of spatial ability can predict an astronaut's qualification test performance after robotic training. Because training astronauts to be qualified robotics operators is so long and expensive, NASA is interested in tools that can predict robotics performance before training begins. Currently, the Astronaut Office does not have a validated tool to predict robotics ability as part of its astronaut selection or training process. Commonly used tests of human spatial ability may provide such a tool to predict robotics ability. We tested the spatial ability of 50 active astronauts who had completed at least one robotics training course, then used logistic regression models to analyze the correlation between spatial ability test scores and the astronauts' performance in their evaluation test at the end of the training course. The fit of the logistic function to our data is statistically significant for several spatial tests. However, the prediction performance of the logistic model depends on the criterion threshold assumed. To clarify the critical selection issues, we show how the probability of correct classification vs. misclassification varies as a function of the mental rotation test criterion level. Since the costs of misclassification are low, the logistic models of spatial ability and robotic performance are reliable enough only to be used to customize regular and remedial training. We suggest several changes in tracking performance throughout robotics training that could improve the range and reliability of predictive models.
Development and psychometric testing of the Cancer Knowledge Scale for Elders.
Su, Ching-Ching; Chen, Yuh-Min; Kuo, Bo-Jein
2009-03-01
To develop the Cancer Knowledge Scale for Elders and test its validity and reliability. The number of elders suffering from cancer is increasing. To facilitate cancer prevention behaviours among elders, they shall be educated about cancer-related knowledge. Prior to designing a programme that would respond to the special needs of elders, understanding the cancer-related knowledge within this population was necessary. However, extensive review of the literature revealed a lack of appropriate instruments for measuring cancer-related knowledge. A valid and reliable cancer knowledge scale for elders is necessary. A non-experimental methodological design was used to test the psychometric properties of the Cancer Knowledge Scale for Elders. Item analysis was first performed to screen out items that had low corrected item-total correlation coefficients. Construct validity was examined with a principle component method of exploratory factor analysis. Cancer-related health behaviour was used as the criterion variable to evaluate criterion-related validity. Internal consistency reliability was assessed by the KR-20. Stability was determined by two-week test-retest reliability. The factor analysis yielded a four-factor solution accounting for 49.5% of the variance. For criterion-related validity, cancer knowledge was positively correlated with cancer-related health behaviour (r = 0.78, p < 0.001). The KR-20 coefficients of each factor were 0.85, 0.76, 0.79 and 0.67 and 0.87 for the total scale. Test-retest reliability over a two-week period was 0.83 (p < 0.001). This study provides evidence for content validity, construct validity, criterion-related validity, internal consistency and stability of the Cancer Knowledge Scale for Elders. The results show that this scale is an easy-to-use instrument for elders and has adequate validity and reliability. The scale can be used as an assessment instrument when implementing cancer education programmes for elders. It can also be used to evaluate the effects of education programmes.
Guidelines for Interpreting and Reporting Subscores
ERIC Educational Resources Information Center
Feinberg, Richard A.; Jurich, Daniel P.
2017-01-01
Recent research has proposed a criterion to evaluate the reportability of subscores. This criterion is a value-added ratio ("VAR"), where values greater than 1 suggest that the true subscore is better approximated by the observed subscore than by the total score. This research extends the existing literature by quantifying statistical…
Evaluation of Validity and Reliability for Hierarchical Scales Using Latent Variable Modeling
ERIC Educational Resources Information Center
Raykov, Tenko; Marcoulides, George A.
2012-01-01
A latent variable modeling method is outlined, which accomplishes estimation of criterion validity and reliability for a multicomponent measuring instrument with hierarchical structure. The approach provides point and interval estimates for the scale criterion validity and reliability coefficients, and can also be used for testing composite or…
ERIC Educational Resources Information Center
Burton, Nancy W.
2011-01-01
The educational technologies of the past quarter century--from teaching machines to minimal competency testing--all share the general purpose of helping educators make better, or more uniform, decisions. The currently favored technique for shaping local decisions is criterion-referenced testing. Some criterion-referenced testers first find out how…
Selection and use of TLDS for high precision NERVA shielding measurements
NASA Technical Reports Server (NTRS)
Woodsum, H. C.
1972-01-01
An experimental evaluation of thermoluminescent dosimeters was performed in order to select high precision dosimeters for a study whose purpose is to measure gamma streaming through the coolant passages of a simulated flight type internal NERVA reactor shield. Based on this study, the CaF2 chip TLDs are the most reproducible dosimeters with reproducibility generally within a few percent, but none of the TLDs tested met the reproducibility criterion of plus or minus 2%.
Hsu, Pi-Fang; Wu, Cheng-Ru; Li, Ya-Ting
2008-01-01
While Taiwanese hospitals dispose of large amounts of medical waste to ensure sanitation and personal hygiene, doing so inefficiently creates potential environmental hazards and increases operational expenses. However, hospitals lack objective criteria to select the most appropriate waste disposal firm and evaluate its performance, instead relying on their own subjective judgment and previous experiences. Therefore, this work presents an analytic hierarchy process (AHP) method to objectively select medical waste disposal firms based on the results of interviews with experts in the field, thus reducing overhead costs and enhancing medical waste management. An appropriate weight criterion based on AHP is derived to assess the effectiveness of medical waste disposal firms. The proposed AHP-based method offers a more efficient and precise means of selecting medical waste firms than subjective assessment methods do, thus reducing the potential risks for hospitals. Analysis results indicate that the medical sector selects the most appropriate infectious medical waste disposal firm based on the following rank: matching degree, contractor's qualifications, contractor's service capability, contractor's equipment and economic factors. By providing hospitals with an effective means of evaluating medical waste disposal firms, the proposed AHP method can reduce overhead costs and enable medical waste management to understand the market demand in the health sector. Moreover, performed through use of Expert Choice software, sensitivity analysis can survey the criterion weight of the degree of influence with an alternative hierarchy.
Effects of task-irrelevant grouping on visual selection in partial report.
Lunau, Rasmus; Habekost, Thomas
2017-07-01
Perceptual grouping modulates performance in attention tasks such as partial report and change detection. Specifically, grouping of search items according to a task-relevant feature improves the efficiency of visual selection. However, the role of task-irrelevant feature grouping is not clearly understood. In the present study, we investigated whether grouping of targets by a task-irrelevant feature influences performance in a partial-report task. In this task, participants must report as many target letters as possible from a briefly presented circular display. The crucial manipulation concerned the color of the elements in these trials. In the sorted-color condition, the color of the display elements was arranged according to the selection criterion, and in the unsorted-color condition, colors were randomly assigned. The distractor cost was inferred by subtracting performance in partial-report trials from performance in a control condition that had no distractors in the display. Across five experiments, we manipulated trial order, selection criterion, and exposure duration, and found that attentional selectivity was improved in sorted-color trials when the exposure duration was 200 ms and the selection criterion was luminance. This effect was accompanied by impaired selectivity in unsorted-color trials. Overall, the results suggest that the benefit of task-irrelevant color grouping of targets is contingent on the processing locus of the selection criterion.
Villettaz Robichaud, M; Rushen, J; de Passillé, A M; Vasseur, E; Haley, D B; Pellerin, D
2018-03-01
In order for dairy producers to comply with animal welfare recommendations, financial investments may be required. In Canada, a new dairy animal care assessment program is currently being implemented under the proAction Initiative to determine the extent to which certain aspects of the Code of Practice are being followed and to assess the care and well-being of dairy cattle on farm. The aim of the current study was to evaluate the association between meeting the proAction animal-based and the electric trainer placement criteria and certain aspects of productivity and profitability on tiestall dairy farms. The results of a previous on-farm cow comfort assessment conducted on 100 Canadian tiestall farms were used to simulate the results of a part of the proAction Animal Care assessment on these farms. Each farm's productivity and profitability data were retrieved from the regional dairy herd improvement associations. Univariable and multivariable linear regressions were used to evaluate the associations between meeting these proAction criteria and the farms' average yearly: corrected milk production, somatic cell count (SCC), calving interval, number of breedings/cow, culling rate, prevalence of cows in third or higher lactation, and margins per cow and per kilogram of quota calculated over replacement costs. The association between milk production and the proAction lameness criterion was moderated through an interaction with the milk production genetic index which resulted in an increase in milk production per year with increasing genetic index that was steeper in farms that met the proAction lameness criterion compared with farms that did not. Meeting the proAction body condition score criterion was associated with reduced SCC and meeting the proAction electric trainer placement criterion was associated with SCC through an interaction with the farms' average SCC genetic index. The increase in SCC with increasing SCC genetic index was milder in farms that met this criterion compared with farms that did not. Farms that met the proAction electric trainer placement criterion had 4.6% more cows in their third or greater lactation. These results suggest that some associations exist between the productivity of Canadian tiestall farms and meeting several parameters of the proAction Animal Care assessment. Meeting these criteria is unlikely to impose any economic burden to the dairy industry as a whole. Copyright © 2018 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
Hardesty, Samantha L; Hagopian, Louis P; McIvor, Melissa M; Wagner, Leaora L; Sigurdsson, Sigurdur O; Bowman, Lynn G
2014-09-01
The present study isolated the effects of frequently used staff training intervention components to increase communication between direct care staff and clinicians working on an inpatient behavioral unit. Written "protocol review" quizzes developed by clinicians were designed to assess knowledge about a patient's behavioral protocols. Direct care staff completed these at the beginning of each day and evening shift. Clinicians were required to score and discuss these protocol reviews with direct care staff for at least 75% of shifts over a 2-week period. During baseline, only 21% of clinicians met this requirement. Completing and scoring of protocol reviews did not improve following additional in-service training (M = 15%) or following an intervention aimed at decreasing response effort combined with prompting (M = 28%). After implementing an intervention involving specified performance criterion and performance feedback, 86% of clinicians reached the established goal. Results of a component analysis suggested that the presentation of both the specified performance criterion and supporting contingencies was necessary to maintain acceptable levels of performance. © The Author(s) 2014.
Algin, Oktay
2018-05-21
Phase-contrast cine magnetic resonance imaging (PC-MRI) is a widely used technique for determination of possible communication of arachnoid cysts (ACs). Three-dimensional (3D) sampling perfection with application-optimized contrasts using different flip-angle evolutions (3D-SPACE) technique is a relatively new method for 3D isotropic scanning of the entire cranium within a short time. In this research, the usage of the 3D-SPACE technique in differentiation of communicating or noncommunicating type ACs was evaluated. Thirty-five ACs in 34 patients were retrospectively examined. The 3D-SPACE, PC-MRI, and contrast material-enhanced cisternography (if present) images of the patients were analyzed. Each cyst was described according to cyst size/location, third ventricle diameter, Evans index, and presence of hydrocephalus. Communication was defined as absent (score 0), suspected (score 1), or present (score 2) on each sequence. Results of PC-MRI or cisternography (if available) examinations were used as criterion standard techniques to categorize all cysts as communicating or noncommunicating type. The results of 3D-SPACE were compared with criterion standard techniques. The comparisons between groups were performed using Mann-Whitney and Fisher exact tests. For demonstration of communication status of the cysts, criterion standard test results and 3D-SPACE findings were almost in perfect harmony (κ[95% confidence interval: 0.94]; P < 0.001). When evaluating the communicative properties, 3D-SPACE findings correlated with other final results at a rate of 97%. There is a positive correlation with third ventricular diameters and Evans index for all patients (r = 0.77, P < 0.001). For other analyzed variables, there is no significant difference or correlation between the groups. The 3D-SPACE technique is an easy, useful, and noninvasive alternative for the evaluation of morphology, topographical relationships, and communication status of ACs.
Effects of literacy on semantic verbal fluency in an immigrant population.
Nielsen, T Rune; Waldemar, Gunhild
2016-09-01
A significant impact of limited schooling and illiteracy has been found on numerous neuropsychological tests, which may partly be due to the ecological relevance of the tests in the context of illiteracy. The aims of this study were to compare the performance of illiterate and literate immigrants on two semantic criteria for the verbal fluency test, and examine the influence of acculturation on test performances. Performances of 20 cognitively unimpaired illiterate and 21 literate Turkish immigrants aged ≥50 years were compared on an animal and supermarket criterion for the semantic verbal fluency test. Also, the influence of acculturation on test performances was examined. Significantly poorer performance of the illiterate compared to the literate group was found for the animal criterion, whereas no differences were found for the supermarket criterion that was considered more ecologically relevant for illiterate individuals. A significant interaction effect was found between the semantic criteria and literacy group, which was mainly related to a large effect of semantic criteria within the illiterate group. Adjusting for years of residence in Denmark and acculturation score did not affect this interaction effect. Overall, our results are in line with previous studies comparing semantic fluency in illiterate and literate individuals. The results lend further support to the strong associations between literacy, semantic verbal fluency performance and ecological relevance of the semantic criterion and extend previous findings to immigrants with different cultural experiences related to the acculturation process.
Precoded spatial multiplexing MIMO system with spatial component interleaver.
Gao, Xiang; Wu, Zhanji
In this paper, the performance of precoded bit-interleaved coded modulation (BICM) spatial multiplexing multiple-input multiple-output (MIMO) system with spatial component interleaver is investigated. For the ideal precoded spatial multiplexing MIMO system with spatial component interleaver based on singular value decomposition (SVD) of the MIMO channel, the average pairwise error probability (PEP) of coded bits is derived. Based on the PEP analysis, the optimum spatial Q-component interleaver design criterion is provided to achieve the minimum error probability. For the limited feedback precoded proposed scheme with linear zero forcing (ZF) receiver, in order to minimize a bound on the average probability of a symbol vector error, a novel effective signal-to-noise ratio (SNR)-based precoding matrix selection criterion and a simplified criterion are proposed. Based on the average mutual information (AMI)-maximization criterion, the optimal constellation rotation angles are investigated. Simulation results indicate that the optimized spatial multiplexing MIMO system with spatial component interleaver can achieve significant performance advantages compared to the conventional spatial multiplexing MIMO system.
Weykamp, Cas; John, Garry; Gillery, Philippe; English, Emma; Ji, Linong; Lenters-Westra, Erna; Little, Randie R.; Roglic, Gojka; Sacks, David B.; Takei, Izumi
2016-01-01
Background A major objective of the IFCC Task Force on implementation of HbA1c standardization is to develop a model to define quality targets for HbA1c. Methods Two generic models, the Biological Variation and Sigma-metrics model, are investigated. Variables in the models were selected for HbA1c and data of EQA/PT programs were used to evaluate the suitability of the models to set and evaluate quality targets within and between laboratories. Results In the biological variation model 48% of individual laboratories and none of the 26 instrument groups met the minimum performance criterion. In the Sigma-metrics model, with a total allowable error (TAE) set at 5 mmol/mol (0.46% NGSP) 77% of the individual laboratories and 12 of 26 instrument groups met the 2 sigma criterion. Conclusion The Biological Variation and Sigma-metrics model were demonstrated to be suitable for setting and evaluating quality targets within and between laboratories. The Sigma-metrics model is more flexible as both the TAE and the risk of failure can be adjusted to requirements related to e.g. use for diagnosis/monitoring or requirements of (inter)national authorities. With the aim of reaching international consensus on advice regarding quality targets for HbA1c, the Task Force suggests the Sigma-metrics model as the model of choice with default values of 5 mmol/mol (0.46%) for TAE, and risk levels of 2 and 4 sigma for routine laboratories and laboratories performing clinical trials, respectively. These goals should serve as a starting point for discussion with international stakeholders in the field of diabetes. PMID:25737535
Functional Quality Criterion of Rock Handling Mechanization at Open-pit Mines
NASA Astrophysics Data System (ADS)
Voronov, Yuri; Voronov, Artyoni
2017-11-01
Overburden and mining operations at open-pit mines are performed mainly by powerful shovel-truck systems (STSs). One of the main problems of the STSs is a rather low level of their operating quality, mainly due to unjustified over-trucking. In this article, a functional criterion for assessing the qualify of the STS operation at open-pit mines is formulated, derived and analyzed. We introduce the rationale and general principles for the functional criterion formation, its general form, as well as variations for various STS structures: a mixed truck fleet and a homogeneous shovel fleet, a mixed shove! fleet and a homogeneous truck fleet, mixed truck and shovel fleets. The possibility of assessing the quality of the STS operation is of great importance for identifying the main directions for improving their operational performance and operating quality, optimizing the main performance indicators by the qualify criterion, and. as a result, for possible saving of material and technical resources for open-pit mining. Improvement of the quality of the STS operation also allows increasing the mining safety and decreasing the atmosphere pollution - by means of possible reducing of the number of the operating trucks.
Evaluation of a Progressive Failure Analysis Methodology for Laminated Composite Structures
NASA Technical Reports Server (NTRS)
Sleight, David W.; Knight, Norman F., Jr.; Wang, John T.
1997-01-01
A progressive failure analysis methodology has been developed for predicting the nonlinear response and failure of laminated composite structures. The progressive failure analysis uses C plate and shell elements based on classical lamination theory to calculate the in-plane stresses. Several failure criteria, including the maximum strain criterion, Hashin's criterion, and Christensen's criterion, are used to predict the failure mechanisms. The progressive failure analysis model is implemented into a general purpose finite element code and can predict the damage and response of laminated composite structures from initial loading to final failure.
An error criterion for determining sampling rates in closed-loop control systems
NASA Technical Reports Server (NTRS)
Brecher, S. M.
1972-01-01
The determination of an error criterion which will give a sampling rate for adequate performance of linear, time-invariant closed-loop, discrete-data control systems was studied. The proper modelling of the closed-loop control system for characterization of the error behavior, and the determination of an absolute error definition for performance of the two commonly used holding devices are discussed. The definition of an adequate relative error criterion as a function of the sampling rate and the parameters characterizing the system is established along with the determination of sampling rates. The validity of the expressions for the sampling interval was confirmed by computer simulations. Their application solves the problem of making a first choice in the selection of sampling rates.
The transformation of the tender evaluation process in public procurement in Poland
NASA Astrophysics Data System (ADS)
Plebankiewicz, E.; Kozik, R.
2017-10-01
Procedures regarding the evaluation of tenders have been changed since the public procurement law was enacted (it came into force in January 1, 1995). The contracting authority could apply both the criteria related to the qualities of the contractor and those related to the to the subject - matter of public contract. Two extensive amendments in 2001 and a government project introduced vital regulations and excluded the possibility of applying criteria related to the qualities of the contractor. Act of 29 January 2004 Public Procurement Law allowed to use price as the sole contract award criterion. The changes in the Law in 2014 restricted that possibility to the situation in which the subject matter of a contract is commonly available and has established quality standards. The Act of 22 June 2016 amending the Public Procurement Law Act and some other laws introduced the new criteria list and limited the importance of the price criterion in the certain situations. Instead of price, the cost can also be a criterion for tender evaluation. The cost criterion can be determined using life cycle costing. In the paper, based on contract notices of open tendering published in the Public Procurement Bulletin, the criteria of construction contract selection will be analysed. In particular the effectiveness of changes in the Procurement Law will be researched.
Boubouchairopoulou, N; Kollias, A; Chiu, B; Chen, B; Lagou, S; Anestis, P; Stergiou, G S
2017-07-01
A pocket-size cuffless electronic device for self-measurement of blood pressure (BP) has been developed (Freescan, Maisense Inc., Zhubei, Taiwan). The device estimates BP within 10 s using three embedded electrodes and one force sensor that is applied over the radial pulse to evaluate the pulse wave. Before use, basic anthropometric characteristics are recorded on the device, and individualized initial calibration is required based on a standard BP measurement performed using an upper-arm BP monitor. The device performance in providing valid BP readings was evaluated in 313 normotensive and hypertensive adults in three study phases during which the device sensor was upgraded. A formal validation study of a prototype device against mercury sphygmomanometer was performed according to the American National Standards Institute/Association for the Advancement of Medical Instrumentation/International Organization for Standardization (ANSI/AAMI/ISO) 2013 protocol. The test device succeeded in obtaining a valid BP measurement (three successful readings within up to five attempts) in 55-72% of the participants, which reached 87% with device sensor upgrade. For the validation study, 125 adults were recruited and 85 met the protocol requirements for inclusion. The mean device-observers BP difference was 3.2±6.7 (s.d.) mm Hg for systolic and 2.6±4.6 mm Hg for diastolic BP (criterion 1). The estimated s.d. (inter-subject variability) were 5.83 and 4.17 mm Hg respectively (criterion 2). These data suggest that this prototype cuffless BP monitor provides valid self-measurements in the vast majority of adults, and satisfies the BP measurement accuracy criteria of the ANSI/AAMI/ISO 2013 validation protocol.
Simulated Driving Assessment (SDA) for teen drivers: results from a validation study.
McDonald, Catherine C; Kandadai, Venk; Loeb, Helen; Seacrist, Thomas S; Lee, Yi-Ching; Winston, Zachary; Winston, Flaura K
2015-06-01
Driver error and inadequate skill are common critical reasons for novice teen driver crashes, yet few validated, standardised assessments of teen driving skills exist. The purpose of this study is to evaluate the construct and criterion validity of a newly developed Simulated Driving Assessment (SDA) for novice teen drivers. The SDA's 35 min simulated drive incorporates 22 variations of the most common teen driver crash configurations. Driving performance was compared for 21 inexperienced teens (age 16-17 years, provisional license ≤90 days) and 17 experienced adults (age 25-50 years, license ≥5 years, drove ≥100 miles per week, no collisions or moving violations ≤3 years). SDA driving performance (Error Score) was based on driving safety measures derived from simulator and eye-tracking data. Negative driving outcomes included simulated collisions or run-off-the-road incidents. A professional driving evaluator/instructor (DEI Score) reviewed videos of SDA performance. The SDA demonstrated construct validity: (1) teens had a higher Error Score than adults (30 vs. 13, p=0.02); (2) For each additional error committed, the RR of a participant's propensity for a simulated negative driving outcome increased by 8% (95% CI 1.05 to 1.10, p<0.01). The SDA-demonstrated criterion validity: Error Score was correlated with DEI Score (r=-0.66, p<0.001). This study supports the concept of validated simulated driving tests like the SDA to assess novice driver skill in complex and hazardous driving scenarios. The SDA, as a standard protocol to evaluate teen driver performance, has the potential to facilitate screening and assessment of teen driving readiness and could be used to guide targeted skill training. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions.
Contrasting Norm Referenced and Criterion Referenced Measures.
ERIC Educational Resources Information Center
Randall, Robert S.
Differences in design between norm referenced measures (NRM) and criterion referenced measures (CRM) are reviewed, and some of the procedures proposed on designing and evaluating CRM are examined. Differences in design of NRM and CRM are said to arise from the different purposes that underlie each measure. In addition, there are differences among…
ERIC Educational Resources Information Center
Mooney, Paul; Lastrapes, Renée E.
2016-01-01
The amount of research evaluating the technical merits of general outcome measures of science and social studies achievement is growing. This study targeted criterion validity for critical content monitoring. Questions addressed the concurrent criterion validity of alternate presentation formats of critical content monitoring and the measure's…
Characterizing the functional MRI response using Tikhonov regularization.
Vakorin, Vasily A; Borowsky, Ron; Sarty, Gordon E
2007-09-20
The problem of evaluating an averaged functional magnetic resonance imaging (fMRI) response for repeated block design experiments was considered within a semiparametric regression model with autocorrelated residuals. We applied functional data analysis (FDA) techniques that use a least-squares fitting of B-spline expansions with Tikhonov regularization. To deal with the noise autocorrelation, we proposed a regularization parameter selection method based on the idea of combining temporal smoothing with residual whitening. A criterion based on a generalized chi(2)-test of the residuals for white noise was compared with a generalized cross-validation scheme. We evaluated and compared the performance of the two criteria, based on their effect on the quality of the fMRI response. We found that the regularization parameter can be tuned to improve the noise autocorrelation structure, but the whitening criterion provides too much smoothing when compared with the cross-validation criterion. The ultimate goal of the proposed smoothing techniques is to facilitate the extraction of temporal features in the hemodynamic response for further analysis. In particular, these FDA methods allow us to compute derivatives and integrals of the fMRI signal so that fMRI data may be correlated with behavioral and physiological models. For example, positive and negative hemodynamic responses may be easily and robustly identified on the basis of the first derivative at an early time point in the response. Ultimately, these methods allow us to verify previously reported correlations between the hemodynamic response and the behavioral measures of accuracy and reaction time, showing the potential to recover new information from fMRI data. 2007 John Wiley & Sons, Ltd
Genomic selection in a commercial winter wheat population.
He, Sang; Schulthess, Albert Wilhelm; Mirdita, Vilson; Zhao, Yusheng; Korzun, Viktor; Bothe, Reiner; Ebmeyer, Erhard; Reif, Jochen C; Jiang, Yong
2016-03-01
Genomic selection models can be trained using historical data and filtering genotypes based on phenotyping intensity and reliability criterion are able to increase the prediction ability. We implemented genomic selection based on a large commercial population incorporating 2325 European winter wheat lines. Our objectives were (1) to study whether modeling epistasis besides additive genetic effects results in enhancement on prediction ability of genomic selection, (2) to assess prediction ability when training population comprised historical or less-intensively phenotyped lines, and (3) to explore the prediction ability in subpopulations selected based on the reliability criterion. We found a 5 % increase in prediction ability when shifting from additive to additive plus epistatic effects models. In addition, only a marginal loss from 0.65 to 0.50 in accuracy was observed using the data collected from 1 year to predict genotypes of the following year, revealing that stable genomic selection models can be accurately calibrated to predict subsequent breeding stages. Moreover, prediction ability was maximized when the genotypes evaluated in a single location were excluded from the training set but subsequently decreased again when the phenotyping intensity was increased above two locations, suggesting that the update of the training population should be performed considering all the selected genotypes but excluding those evaluated in a single location. The genomic prediction ability was substantially higher in subpopulations selected based on the reliability criterion, indicating that phenotypic selection for highly reliable individuals could be directly replaced by applying genomic selection to them. We empirically conclude that there is a high potential to assist commercial wheat breeding programs employing genomic selection approaches.
Development and evaluation of a gyroscope-based wheel rotation monitor for manual wheelchair users.
Hiremath, Shivayogi V; Ding, Dan; Cooper, Rory A
2013-07-01
To develop and evaluate a wireless gyroscope-based wheel rotation monitor (G-WRM) that can estimate speeds and distances traveled by wheelchair users during regular wheelchair propulsion as well as wheelchair sports such as handcycling, and provide users with real-time feedback through a smartphone application. The speeds and the distances estimated by the G-WRM were compared with the criterion measures by calculating absolute difference, mean difference, and percentage errors during a series of laboratory-based tests. Intraclass correlations (ICC) and the Bland-Altman plots were also used to assess the agreements between the G-WRM and the criterion measures. In addition, battery life and wireless data transmission tests under a number of usage conditions were performed. The percentage errors for the angular velocities, speeds, and distances obtained from three prototype G-WRMs were less than 3% for all the test trials. The high ICC values (ICC (3,1) > 0.94) and the Bland-Altman plots indicate excellent agreement between the estimated speeds and distances by the G-WRMs and the criterion measures. The battery life tests showed that the device could last for 35 hours in wireless mode and 139 hours in secure digital card mode. The wireless data transmission tests indicated less than 0.3% of data loss. The results indicate that the G-WRM is an appropriate tool for tracking a spectrum of wheelchair-related activities from regular wheelchair propulsion to wheelchair sports such as handcycling. The real-time feedback provided by the G-WRM can help wheelchair users self-monitor their everyday activities.
Salimi, Fereshteh; Shahabi, Shahab; Talebzadeh, Hamid; Keshavarzian, Amir; Pourfakharan, Mohammad; Safaei, Mansour
2017-01-01
Fistulas are the preferred permanent hemodialysis vascular access, but a significant obstacle to increasing their prevalence is the fistula's high "failure to mature" (FTM) rate. This study aimed to identify postoperative clinical characteristics that are predictive of fistula FTM. This descriptive cross-sectional study was performed on 80 end-stage renal disease patients who referred to Al Zahra Hospital, Isfahan, for brachiocephalic fistula placement. After 4 weeks, the clinical criteria (trill, firmness, vein length, and venous engorgement) examined and the fistulas situation divided to favorable or unfavorable by each criterion, and the results comprised with dialysis possibility. Data were analyzed with SPSS version 21. Diagnostic index for CLINICAL examination was calculated. Among the 80 cases, 25 (31.2%) female and 55 (68.8%) male were studied with the mean age of 51.9 (standard deviation = 17) year ranged between 18 and 86 years old. Sixty-two (77.5%) cases had successful hemodialysis. All four clinical assessments were significantly more acceptable in patients with successful dialysis ( P < 0.001). According to the results of our study, the accuracy of all physical assessments was above 70% and except vein length other criteria had a sensitivity and negative predictive value of 100%. In this study, firmness of vein has highest specificity and positive predictive value (83.9% and 64.3%, respectively). Results of our study showed that high sensitivity and relatively low specificity of the clinical criterion. It means that unfavorable results of each clinical criterion predict unfavorable dialysis. Clinical evaluation of a newly created fistula 4-6 weeks after surgery should be considered mandatory.
Lansing, Amy E.; Plante, Wendy Y.; Beck, Audrey N.
2016-01-01
Despite growing recognition that cumulative adversity (total stressor exposure), including complex trauma, increases the risk for psychopathology and impacts development, assessment strategies lag behind: Trauma-related mental health needs (symptoms, functional impairment, maladaptive coping) are typically assessed in response to only one qualifying Criterion-A event. This is especially problematic for youth at-risk for health and academic disparities who experience cumulative adversity, including non-qualifying events (parental separations) which may produce more impairing symptomatology. Data from 118 delinquent girls demonstrate: 1) an average of 14 adverse Criterion-A and non-Criterion event exposures; 2) serious maladaptive coping strategies (self-injury) directly in response to cumulative adversity; 3) more cumulative adversity-related than worst-event related symptomatology and functional impairment; and 4) comparable symptomatology, but greater functional impairment, in response to non-Criterion events. These data support the evaluation of mental health needs in response to cumulative adversity for optimal identification and tailoring of services in high-risk populations to reduce disparities. PMID:27745922
Lansing, Amy E; Plante, Wendy Y; Beck, Audrey N
2017-05-01
Despite growing recognition that cumulative adversity (total stressor exposure, including complex trauma), increases the risk for psychopathology and impacts development, assessment strategies lag behind: Adversity-related mental health needs (symptoms, functional impairment, maladaptive coping) are typically assessed in response to only one qualifying Criterion-A traumatic event. This is especially problematic for youth at-risk for health and academic disparities who experience cumulative adversity, including non-qualifying events (separation from caregivers) which may produce more impairing symptomatology. Data from 118 delinquent girls demonstrate: (1) an average of 14 adverse Criterion-A and non-Criterion event exposures; (2) serious maladaptive coping strategies (self-injury) directly in response to cumulative adversity; (3) more cumulative adversity-related than worst-event related symptomatology and functional impairment; and (4) comparable symptomatology, but greater functional impairment, in response to non-Criterion events. These data support the evaluation of mental health needs in response to cumulative adversity for optimal identification and tailoring of services in high-risk populations to reduce disparities. Copyright © 2016 Elsevier Ltd. All rights reserved.
Ultrasound biofeedback treatment for persisting childhood apraxia of speech.
Preston, Jonathan L; Brick, Nickole; Landi, Nicole
2013-11-01
The purpose of this study was to evaluate the efficacy of a treatment program that includes ultrasound biofeedback for children with persisting speech sound errors associated with childhood apraxia of speech (CAS). Six children ages 9-15 years participated in a multiple baseline experiment for 18 treatment sessions during which treatment focused on producing sequences involving lingual sounds. Children were cued to modify their tongue movements using visual feedback from real-time ultrasound images. Probe data were collected before, during, and after treatment to assess word-level accuracy for treated and untreated sound sequences. As participants reached preestablished performance criteria, new sequences were introduced into treatment. All participants met the performance criterion (80% accuracy for 2 consecutive sessions) on at least 2 treated sound sequences. Across the 6 participants, performance criterion was met for 23 of 31 treated sequences in an average of 5 sessions. Some participants showed no improvement in untreated sequences, whereas others showed generalization to untreated sequences that were phonetically similar to the treated sequences. Most gains were maintained 2 months after the end of treatment. The percentage of phonemes correct increased significantly from pretreatment to the 2-month follow-up. A treatment program including ultrasound biofeedback is a viable option for improving speech sound accuracy in children with persisting speech sound errors associated with CAS.
Validation of X1 motorcycle model in industrial plant layout by using WITNESSTM simulation software
NASA Astrophysics Data System (ADS)
Hamzas, M. F. M. A.; Bareduan, S. A.; Zakaria, M. Z.; Tan, W. J.; Zairi, S.
2017-09-01
This paper demonstrates a case study on simulation, modelling and analysis for X1 Motorcycles Model. In this research, a motorcycle assembly plant has been selected as a main place of research study. Simulation techniques by using Witness software were applied to evaluate the performance of the existing manufacturing system. The main objective is to validate the data and find out the significant impact on the overall performance of the system for future improvement. The process of validation starts when the layout of the assembly line was identified. All components are evaluated to validate whether the data is significance for future improvement. Machine and labor statistics are among the parameters that were evaluated for process improvement. Average total cycle time for given workstations is used as criterion for comparison of possible variants. From the simulation process, the data used are appropriate and meet the criteria for two-sided assembly line problems.
Criterion-based laparoscopic training reduces total training time.
Brinkman, Willem M; Buzink, Sonja N; Alevizos, Leonidas; de Hingh, Ignace H J T; Jakimowicz, Jack J
2012-04-01
The benefits of criterion-based laparoscopic training over time-oriented training are unclear. The purpose of this study is to compare these types of training based on training outcome and time efficiency. During four training sessions within 1 week (one session per day) 34 medical interns (no laparoscopic experience) practiced on two basic tasks on the Simbionix LAP Mentor virtual-reality (VR) simulator: 'clipping and grasping' and 'cutting'. Group C (criterion-based) (N = 17) trained to reach predefined criteria and stopped training in each session when these criteria were met, with a maximum training time of 1 h. Group T (time-based) (N = 17) trained for a fixed time of 1 h each session. Retention of skills was assessed 1 week after training. In addition, transferability of skills was established using the Haptica ProMIS augmented-reality simulator. Both groups improved their performance significantly over the course of the training sessions (Wilcoxon signed ranks, P < 0.05). Both groups showed skill transferability and skill retention. When comparing the performance parameters of group C and group T, their performances in the first, the last and the retention training sessions did not differ significantly (Mann-Whitney U test, P > 0.05). The average number of repetitions needed to meet the criteria also did not differ between the groups. Overall, group C spent less time training on the simulator than did group T (74:48 and 120:10 min, respectively; P < 0.001). Group C performed significantly fewer repetitions of each task, overall and in session 2, 3 and 4. Criterion-based training of basic laparoscopic skills can reduce the overall training time with no impact on training outcome, transferability or retention of skills. Criterion-based should be the training of choice in laparoscopic skills curricula.
A multiple maximum scatter difference discriminant criterion for facial feature extraction.
Song, Fengxi; Zhang, David; Mei, Dayong; Guo, Zhongwei
2007-12-01
Maximum scatter difference (MSD) discriminant criterion was a recently presented binary discriminant criterion for pattern classification that utilizes the generalized scatter difference rather than the generalized Rayleigh quotient as a class separability measure, thereby avoiding the singularity problem when addressing small-sample-size problems. MSD classifiers based on this criterion have been quite effective on face-recognition tasks, but as they are binary classifiers, they are not as efficient on large-scale classification tasks. To address the problem, this paper generalizes the classification-oriented binary criterion to its multiple counterpart--multiple MSD (MMSD) discriminant criterion for facial feature extraction. The MMSD feature-extraction method, which is based on this novel discriminant criterion, is a new subspace-based feature-extraction method. Unlike most other subspace-based feature-extraction methods, the MMSD computes its discriminant vectors from both the range of the between-class scatter matrix and the null space of the within-class scatter matrix. The MMSD is theoretically elegant and easy to calculate. Extensive experimental studies conducted on the benchmark database, FERET, show that the MMSD out-performs state-of-the-art facial feature-extraction methods such as null space method, direct linear discriminant analysis (LDA), eigenface, Fisherface, and complete LDA.
Rikli, Roberta E; Jones, C Jessie
2013-04-01
To develop and validate criterion-referenced fitness standards for older adults that predict the level of capacity needed for maintaining physical independence into later life. The proposed standards were developed for use with a previously validated test battery for older adults-the Senior Fitness Test (Rikli, R. E., & Jones, C. J. (2001). Development and validation of a functional fitness test for community--residing older adults. Journal of Aging and Physical Activity, 6, 127-159; Rikli, R. E., & Jones, C. J. (1999a). Senior fitness test manual. Champaign, IL: Human Kinetics.). A criterion measure to assess physical independence was identified. Next, scores from a subset of 2,140 "moderate-functioning" older adults from a larger cross-sectional database, together with findings from longitudinal research on physical capacity and aging, were used as the basis for proposing fitness standards (performance cut points) associated with having the ability to function independently. Validity and reliability analyses were conducted to test the standards for their accuracy and consistency as predictors of physical independence. Performance standards are presented for men and women ages 60-94 indicating the level of fitness associated with remaining physically independent until late in life. Reliability and validity indicators for the standards ranged between .79 and .97. The proposed standards provide easy-to-use, previously unavailable methods for evaluating physical capacity in older adults relative to that associated with physical independence. Most importantly, the standards can be used in planning interventions that target specific areas of weakness, thus reducing risk for premature loss of mobility and independence.
Cuesta-Vargas, Antonio Ignacio; González-Sánchez, Manuel
2014-10-29
Spanish is one of the five most spoken languages in the world. There is currently no published Spanish version of the Örebro Musculoskeletal Pain Questionnaire (OMPQ). The aim of the present study is to describe the process of translating the OMPQ into Spanish and to perform an analysis of reliability, internal structure, internal consistency and concurrent criterion-related validity. Translation and psychometric testing. Two independent translators translated the OMPQ into Spanish. From both translations a consensus version was achieved. A backward translation was made to verify and resolve any semantic or conceptual problems. A total of 104 patients (67 men/37 women) with a mean age of 53.48 (±11.63), suffering from chronic musculoskeletal disorders, twice completed a Spanish version of the OMPQ. Statistical analysis was performed to evaluate the reliability, the internal structure, internal consistency and concurrent criterion-related validity with reference to the gold standard questionnaire SF-12v2. All variables except "Coping" showed a rate above 0.85 on reliability. The internal structure calculation through exploratory factor analysis indicated that 75.2% of the variance can be explained with six components with an eigenvalue higher than 1 and 52.1% with only three components higher than 10% of variance explained. In the concurrent criterion-related validity, several significant correlations were seen close to 0.6, exceeding that value in the correlation between general health and total value of the OMPQ. The Spanish version of the screening questionnaire OMPQ can be used to identify Spanish patients with musculoskeletal pain at risk of developing a chronic disability.
On the measurement of criterion noise in signal detection theory: the case of recognition memory.
Kellen, David; Klauer, Karl Christoph; Singmann, Henrik
2012-07-01
Traditional approaches within the framework of signal detection theory (SDT; Green & Swets, 1966), especially in the field of recognition memory, assume that the positioning of response criteria is not a noisy process. Recent work (Benjamin, Diaz, & Wee, 2009; Mueller & Weidemann, 2008) has challenged this assumption, arguing not only for the existence of criterion noise but also for its large magnitude and substantive contribution to individuals' performance. A review of these recent approaches for the measurement of criterion noise in SDT identifies several shortcomings and confoundings. A reanalysis of Benjamin et al.'s (2009) data sets as well as the results from a new experimental method indicate that the different forms of criterion noise proposed in the recognition memory literature are of very low magnitudes, and they do not provide a significant improvement over the account already given by traditional SDT without criterion noise. Copyright 2012 APA, all rights reserved.
Morgado, José Mário T; Sánchez-Muñoz, Laura; Teodósio, Cristina G; Jara-Acevedo, Maria; Alvarez-Twose, Iván; Matito, Almudena; Fernández-Nuñez, Elisa; García-Montero, Andrés; Orfao, Alberto; Escribano, Luís
2012-04-01
Aberrant expression of CD2 and/or CD25 by bone marrow, peripheral blood or other extracutaneous tissue mast cells is currently used as a minor World Health Organization diagnostic criterion for systemic mastocytosis. However, the diagnostic utility of CD2 versus CD25 expression by mast cells has not been prospectively evaluated in a large series of systemic mastocytosis. Here we evaluate the sensitivity and specificity of CD2 versus CD25 expression in the diagnosis of systemic mastocytosis. Mast cells from a total of 886 bone marrow and 153 other non-bone marrow extracutaneous tissue samples were analysed by multiparameter flow cytometry following the guidelines of the Spanish Network on Mastocytosis at two different laboratories. The 'CD25+ and/or CD2+ bone marrow mast cells' World Health Organization criterion showed an overall sensitivity of 100% with 99.0% specificity for the diagnosis of systemic mastocytosis whereas CD25 expression alone presented a similar sensitivity (100%) with a slightly higher specificity (99.2%). Inclusion of CD2 did not improve the sensitivity of the test and it decreased its specificity. In tissues other than bone marrow, the mast cell phenotypic criterion revealed to be less sensitive. In summary, CD2 expression does not contribute to improve the diagnosis of systemic mastocytosis when compared with aberrant CD25 expression alone, which supports the need to update and replace the minor World Health Organization 'CD25+ and/or CD2+' mast cell phenotypic diagnostic criterion by a major criterion based exclusively on CD25 expression.
Setting Meaningful Criterion-Reference Cut Scores as an Effective Professional Development
ERIC Educational Resources Information Center
Munyofu, Paul
2010-01-01
The state of Pennsylvania, like many organizations interested in performance improvement, routinely engages in professional development activities. Educators in this hands-on activity engaged in setting meaningful criterion-referenced cut scores for career and technical education assessments using two methods. The main purposes of this study were…
Machine Shop. Criterion-Referenced Test (CRT) Item Bank.
ERIC Educational Resources Information Center
Davis, Diane, Ed.
This drafting criterion-referenced test item bank is keyed to the machine shop competency profile developed by industry and education professionals in Missouri. The 16 references used for drafting the test items are listed. Test items are arranged under these categories: orientation to machine shop; performing mathematical calculations; performing…
NASA Astrophysics Data System (ADS)
Kongar, N. Elif
2004-12-01
Today, since customers are able to obtain similar-quality products for similar prices, the lead time has become the only preference criterion for most of the consumers. Therefore, it is crucial that the lead time, i.e., the time spent from the raw material phase till the manufactured good reaches the customer, is minimized. This issue can be investigated under the title of Supply Chain Management (SCM). An efficiently managed supply chain can lead to reduced response time for customers. To achieve this, continuous observation of supply chain efficiency, i.e., a constant performance evaluation of the current SCM is required. Widely used conventional performance measurement methods lack the ability to evaluate a SCM since the supply chain is a dynamic system that requires a more thorough and flexible performance measurement technique. Balanced Scorecard (BS) is an efficient tool for measuring the performance of dynamic systems and has a proven capability of providing the decision makers with the appropriate feedback data. In addition to SCM, a relatively new management field, namely reverse supply chain management (RSCM), also necessitates an appropriate evaluation approach. RSCM differs from SCM in many aspects, i.e., the criteria used for evaluation, the high level of uncertainty involved etc., not allowing the usage of identical evaluation techniques used for SCM. This study proposes a generic Balanced Scorecard to measure the performance of supply chain management while defining the appropriate performance measures for SCM. A scorecard prototype, ESCAPE, is presented to demonstrate the evaluation process.
Fereday, Jennifer; Muir-Cochrane, Eimear
2004-01-01
Performance feedback is information provided to employees about how well they are performing in their work role. The nursing profession has a long history of providing formal, written performance reviews, traditionally from a manager to subordinate, with less formal feedback sources including peers, clients and multidisciplinary team members. This paper is based on one aspect of a PhD research study exploring the dynamics of performance feedback primarily from the nursing clinicians' perspective. The research reported here discusses the impact of the social relationship (between the source and recipient of performance feedback) on the recipient's evaluation of feedback as being 'credible' and 'useful' for self-assessment. Focus group interviews were utilised to ascertain the nursing clinicians' perspectives of performance feedback. Thematic analysis of the data was informed by the Social Phenomenology of Alfred Schutz (1967) specifically his theories of intersubjective understanding. Findings supported the level of familiarity between the feedback source and the nursing clinician as a significant criterion influencing the acceptance or rejection of feedback. Implications for the selection of performance feedback sources and processes within nursing are discussed.
Performance indicators for public mental healthcare: a systematic international inventory
2012-01-01
Background The development and use of performance indicators (PI) in the field of public mental health care (PMHC) has increased rapidly in the last decade. To gain insight in the current state of PI for PMHC in nations and regions around the world, we conducted a structured review of publications in scientific peer-reviewed journals supplemented by a systematic inventory of PI published in policy documents by (non-) governmental organizations. Methods Publications on PI for PMHC were identified through database- and internet searches. Final selection was based on review of the full content of the publications. Publications were ordered by nation or region and chronologically. Individual PI were classified by development method, assessment level, care domain, performance dimension, diagnostic focus, and data source. Finally, the evidence on feasibility, data reliability, and content-, criterion-, and construct validity of the PI was evaluated. Results A total of 106 publications were included in the sample. The majority of the publications (n = 65) were peer-reviewed journal articles and 66 publications specifically dealt with performance of PMHC in the United States. The objectives of performance measurement vary widely from internal quality improvement to increasing transparency and accountability. The characteristics of 1480 unique PI were assessed. The majority of PI is based on stakeholder opinion, assesses care processes, is not specific to any diagnostic group, and utilizes administrative data sources. The targeted quality dimensions varied widely across and within nations depending on local professional or political definitions and interests. For all PI some evidence for the content validity and feasibility has been established. Data reliability, criterion- and construct validity have rarely been assessed. Only 18 publications on criterion validity were included. These show significant associations in the expected direction on the majority of PI, but mixed results on a noteworthy number of others. Conclusions PI have been developed for a broad range of care levels, domains, and quality dimensions of PMHC. To ensure their usefulness for the measurement of PMHC performance and advancement of transparency, accountability and quality improvement in PMHC, future research should focus on assessment of the psychometric properties of PI. PMID:22433251
Zoder-Martell, Kimberly A; Dufrene, Brad A; Tingstrom, Daniel H; Olmi, D Joe; Jordan, Sara S; Biskie, Erika M; Sherman, Julie C
2014-09-01
This study tested the effects of direct training on direct care staff's initiation of positive interactions with individuals with developmental disabilities who resided in an intermediate care facility. Participants included four direct care staff and their residents. Direct training included real-time prompts delivered via a one-way radio, and data were collected for immediate and sustained increases in rates of direct care staff's positive interactions. Additionally, this study evaluated the link between increased rates of positive interactions and concomitant decreases in residents' challenging behaviors. A multiple baseline design across participants was used and results indicated that all direct care staff increased their rates of positive interactions during direct training. Moreover, all but one participant continued to engage residents in positive interactions at levels above the criterion during the maintenance phase and follow-up phases. The direct care staff member who did not initially meet the criterion improved to adequate levels following one brief performance feedback session. With regard to residents' challenging behaviors, across phases, residents engaged in low levels of challenging behaviors making those results difficult to evaluate. However, improvements in residents' rate of positive interactions were noted. Copyright © 2014 Elsevier Ltd. All rights reserved.
Spatial Map of Synthesized Criteria for the Redundancy Resolution of Human Arm Movements.
Li, Zhi; Milutinovic, Dejan; Rosen, Jacob
2015-11-01
The kinematic redundancy of the human arm enables the elbow position to rotate about the axis going through the shoulder and wrist, which results in infinite possible arm postures when the arm reaches to a target in a 3-D workspace. To infer the control strategy the human motor system uses to resolve redundancy in reaching movements, this paper compares five redundancy resolution criteria and evaluates their arm posture prediction performance using data on healthy human motion. Two synthesized criteria are developed to provide better real-time arm posture prediction than the five individual criteria. Of these two, the criterion synthesized using an exponential method predicts the arm posture more accurately than that using a least squares approach, and therefore is preferable for inferring the contributions of the individual criteria to motor control during reaching movements. As a methodology contribution, this paper proposes a framework to compare and evaluate redundancy resolution criteria for arm motion control. A cluster analysis which associates criterion contributions with regions of the workspace provides a guideline for designing a real-time motion control system applicable to upper-limb exoskeletons for stroke rehabilitation.
[Evaluation of Suicide Risk Levels in Hospitals: Validity and Reliability Tests].
Macagnino, Sandro; Steinert, Tilman; Uhlmann, Carmen
2018-05-01
Examination of in-hospital suicide risk levels concerning their validity and their reliability. The internal suicide risk levels were evaluated in a cross sectional study of in 163 inpatients. A reliability check was performed via determining interrater-reliability of senior physician, therapist and the responsible nurse. Within the scope of the validity check, we conducted analyses of criterion validity and construct validity. For the total sample an "acceptable" to "good" interrater-reliability (Kendalls W = .77) of suicide risk levels were obtained. Schizophrenic disorders showed the lowest values, for personality disorders we found the highest level of interrater-reliability. When examining the criterion validity, Item-9 of the BDI-II is substantial correlated to our suicide risk levels (ρ m = .54, p < .01). Within the scope of construct validity check, affective disorders showed the highest correlation (ρ = .77), compatible also with "convergent validity". They differed with schizophrenic disorders which showed the least concordance (ρ = .43). In-hospital suicide risk levels may represent an important contribution to the assessment of suicidal behavior of inpatients experiencing psychiatric treatment due to their overall good validity and reliability. © Georg Thieme Verlag KG Stuttgart · New York.
Quality of Recovery Evaluation of the Protection Schemes for Fiber-Wireless Access Networks
NASA Astrophysics Data System (ADS)
Fu, Minglei; Chai, Zhicheng; Le, Zichun
2016-03-01
With the rapid development of fiber-wireless (FiWi) access network, the protection schemes have got more and more attention due to the risk of huge data loss when failures occur. However, there are few studies on the performance evaluation of the FiWi protection schemes by the unified evaluation criterion. In this paper, quality of recovery (QoR) method was adopted to evaluate the performance of three typical protection schemes (MPMC scheme, OBOF scheme and RPMF scheme) against the segment-level failure in FiWi access network. The QoR models of the three schemes were derived in terms of availability, quality of backup path, recovery time and redundancy. To compare the performance of the three protection schemes comprehensively, five different classes of network services such as emergency service, prioritized elastic service, conversational service, etc. were utilized by means of assigning different QoR weights. Simulation results showed that, for the most service cases, RPMF scheme was proved to be the best solution to enhance the survivability when planning the FiWi access network.
ERIC Educational Resources Information Center
Proger, Barton B.; And Others
Criterion-referenced measurement (CRM) has received increasing attention in regular education. However, it is in education for handicapped children that CRM's flexibility for individualization of both instruction and evaluation become even more fully realized. Research is described on one of the first CRM systems (Individual Achievement Monitoring…
ERIC Educational Resources Information Center
Evers, Kathinka; Kilander, Lena; Lindau, Maria
2007-01-01
The objective of this study was to suggest a new formulation of the core research diagnostic consensus criterion ''loss of insight'' in frontotemporal dementia (FTD). Eight patients with FTD (diagnoses made by interviews, medical and neuropsychological examination, CT scan, and regional cerebral glucose metabolism measured by positron emission…
Do Right- and Left-Handed Monkeys Differ on Cognitive Measures?
NASA Technical Reports Server (NTRS)
Hopkins, William D.; Washburn, David A.
1994-01-01
Twelve left- and 14 right-handed monkeys were compared on 6 measures of cognitive performance (2 maze-solving tasks, matching-to-sample, delayed matching-to-sample, delayed response using spatial cues, and delayed response using form cues). The dependent variable was trials-to-training criterion for each of the 6 tasks. Significant differences were found between left- and right-handed monkeys on the 2 versions of the delayed response task. Right-handed monkeys reached criterion significantly faster on the form cue version of the task, whereas left-handed monkeys reached criterion significantly faster on delayed response for spatial position (p less than .05). The results suggest that sensitive hand preference measures of laterality can reveal differences in cognitive performance, which in turn may reflect underlying laterality in functional organization of the nervous system.
NASA Astrophysics Data System (ADS)
Lian, J.; Ahn, D. C.; Chae, D. C.; Münstermann, S.; Bleck, W.
2016-08-01
Experimental and numerical investigations on the characterisation and prediction of cold formability of a ferritic steel sheet are performed in this study. Tensile tests and Nakajima tests were performed for the plasticity characterisation and the forming limit diagram determination. In the numerical prediction, the modified maximum force criterion is selected as the localisation criterion. For the plasticity model, a non-associated formulation of the Hill48 model is employed. With the non-associated flow rule, the model can result in a similar predictive capability of stress and r-value directionality to the advanced non-quadratic associated models. To accurately characterise the anisotropy evolution during hardening, the anisotropic hardening is also calibrated and implemented into the model for the prediction of the formability.
Optimization of Multi-Fidelity Computer Experiments via the EQIE Criterion
DOE Office of Scientific and Technical Information (OSTI.GOV)
He, Xu; Tuo, Rui; Jeff Wu, C. F.
Computer experiments based on mathematical models are powerful tools for understanding physical processes. This article addresses the problem of kriging-based optimization for deterministic computer experiments with tunable accuracy. Our approach is to use multi- delity computer experiments with increasing accuracy levels and a nonstationary Gaussian process model. We propose an optimization scheme that sequentially adds new computer runs by following two criteria. The first criterion, called EQI, scores candidate inputs with given level of accuracy, and the second criterion, called EQIE, scores candidate combinations of inputs and accuracy. Here, from simulation results and a real example using finite element analysis,more » our method out-performs the expected improvement (EI) criterion which works for single-accuracy experiments.« less
Optimization of Multi-Fidelity Computer Experiments via the EQIE Criterion
He, Xu; Tuo, Rui; Jeff Wu, C. F.
2017-01-31
Computer experiments based on mathematical models are powerful tools for understanding physical processes. This article addresses the problem of kriging-based optimization for deterministic computer experiments with tunable accuracy. Our approach is to use multi- delity computer experiments with increasing accuracy levels and a nonstationary Gaussian process model. We propose an optimization scheme that sequentially adds new computer runs by following two criteria. The first criterion, called EQI, scores candidate inputs with given level of accuracy, and the second criterion, called EQIE, scores candidate combinations of inputs and accuracy. Here, from simulation results and a real example using finite element analysis,more » our method out-performs the expected improvement (EI) criterion which works for single-accuracy experiments.« less
Revision of the criterion to avoid electron heating during laser aided plasma diagnostics (LAPD)
NASA Astrophysics Data System (ADS)
Carbone, E. A. D.; Palomares, J. M.; Hübner, S.; Iordanova, E.; van der Mullen, J. J. A. M.
2012-01-01
A criterion is given for the laser fluency (in J/m2) such that, when satisfied, disturbance of the plasma by the laser is avoided. This criterion accounts for laser heating of the electron gas intermediated by electron-ion (ei) and electron-atom (ea) interactions. The first heating mechanism is well known and was extensively dealt with in the past. The second is often overlooked but of importance for plasmas of low degree of ionization. It is especially important for cold atmospheric plasmas, plasmas that nowadays stand in the focus of attention. The new criterion, based on the concerted action of both ei and ea interactions is validated by Thomson scattering experiments performed on four different plasmas.
Rhodes, Matthew G; Jacoby, Larry L
2007-03-01
The authors examined whether participants can shift their criterion for recognition decisions in response to the probability that an item was previously studied. Participants in 3 experiments were given recognition tests in which the probability that an item was studied was correlated with its location during the test. Results from all 3 experiments indicated that participants' response criteria were sensitive to the probability that an item was previously studied and that shifts in criterion were robust. In addition, awareness of the bases for criterion shifts and feedback on performance were key factors contributing to the observed shifts in decision criteria. These data suggest that decision processes can operate in a dynamic fashion, shifting from item to item.
Multi-Optimisation Consensus Clustering
NASA Astrophysics Data System (ADS)
Li, Jian; Swift, Stephen; Liu, Xiaohui
Ensemble Clustering has been developed to provide an alternative way of obtaining more stable and accurate clustering results. It aims to avoid the biases of individual clustering algorithms. However, it is still a challenge to develop an efficient and robust method for Ensemble Clustering. Based on an existing ensemble clustering method, Consensus Clustering (CC), this paper introduces an advanced Consensus Clustering algorithm called Multi-Optimisation Consensus Clustering (MOCC), which utilises an optimised Agreement Separation criterion and a Multi-Optimisation framework to improve the performance of CC. Fifteen different data sets are used for evaluating the performance of MOCC. The results reveal that MOCC can generate more accurate clustering results than the original CC algorithm.
Lassau, Nathalie; Bonastre, Julia; Kind, Michèle; Vilgrain, Valérie; Lacroix, Joëlle; Cuinet, Marie; Taieb, Sophie; Aziza, Richard; Sarran, Antony; Labbe-Devilliers, Catherine; Gallix, Benoit; Lucidarme, Olivier; Ptak, Yvette; Rocher, Laurence; Caquot, Louis-Michel; Chagnon, Sophie; Marion, Denis; Luciani, Alain; Feutray, Sylvaine; Uzan-Augui, Joëlle; Coiffier, Benedicte; Benastou, Baya; Koscielny, Serge
2014-12-01
Dynamic contrast-enhanced ultrasound (DCE-US) has been used in single-center studies to evaluate tumor response to antiangiogenic treatments: the change of area under the perfusion curve (AUC), a criterion linked to blood volume, was consistently correlated with the Response Evaluation Criteria in Solid Tumors response. The main objective here was to do a multicentric validation of the use of DCE-US to evaluate tumor response in different solid tumor types treated by several antiangiogenic agents. A secondary objective was to evaluate the costs of the procedure. This prospective study included patients from 2007 to 2010 in 19 centers (8 teaching hospitals and 11 comprehensive cancer centers). All patients treated with antiangiogenic therapy were eligible. Dynamic contrast-enhanced ultrasound examinations were performed at baseline as well as on days 7, 15, 30, and 60. For each examination, a perfusion curve was recorded during 3 minutes after injection of a contrast agent. Change from baseline at each time point was estimated for each of 7 fitted criteria. The main end point was freedom from progression (FFP). Criterion/time-point combinations with the strongest correlation with FFP were analyzed further to estimate an optimal cutoff point. A total of 1968 DCE-US examinations in 539 patients were analyzed. The median follow-up was 1.65 years. Variations from baseline were significant at day 30 for several criteria, with AUC having the most significant association with FFP (P = 0.00002). Patients with a greater than 40% decrease in AUC at day 30 had better FFP (P = 0.005) and overall survival (P = 0.05). The mean cost of each DCE-US was 180&OV0556;, which corresponds to $250 using the current exchange rate. Dynamic contrast-enhanced ultrasound is a new functional imaging technique that provides a validated criterion, namely, the change of AUC from baseline to day 30, which is predictive of tumor progression in a large multicenter cohort. Because of its low cost, it should be considered in the routine evaluation of solid tumors treated with antiangiogenic therapy.
Vaughn, Kalif E; Rawson, Katherine A
2011-09-01
Previous research has shown that increasing the criterion level (i.e., the number of times an item must be correctly retrieved during practice) improves subsequent memory, but which specific components of memory does increased criterion level enhance? In two experiments, we examined the extent to which the criterion level affects associative memory, target memory, and cue memory. Participants studied Lithuanian-English word pairs via cued recall with restudy until items were correctly recalled one to five times. In Experiment 1, participants took one of four recall tests and one of three recognition tests after a 2-day delay. In Experiment 2, participants took only recognition tests after a 1-week delay. In both experiments, increasing the criterion level enhanced associative memory, as indicated by enhanced performance on forward and backward cued-recall tests and on tests of associative recognition. An increased criterion level also improved target memory, as indicated by enhanced free recall and recognition of targets, and improved cue memory, as indicated by enhanced free recall and recognition of cues.
An Adaptive Reputation-Based Algorithm for Grid Virtual Organization Formation
NASA Astrophysics Data System (ADS)
Cui, Yongrui; Li, Mingchu; Ren, Yizhi; Sakurai, Kouichi
A novel adaptive reputation-based virtual organization formation is proposed. It restrains the bad performers effectively based on the consideration of the global experience of the evaluator and evaluates the direct trust relation between two grid nodes accurately by consulting the previous trust value rationally. It also consults and improves the reputation evaluation process in PathTrust model by taking account of the inter-organizational trust relationship and combines it with direct and recommended trust in a weighted way, which makes the algorithm more robust against collusion attacks. Additionally, the proposed algorithm considers the perspective of the VO creator and takes required VO services as one of the most important fine-grained evaluation criterion, which makes the algorithm more suitable for constructing VOs in grid environments that include autonomous organizations. Simulation results show that our algorithm restrains the bad performers and resists against fake transaction attacks and badmouth attacks effectively. It provides a clear advantage in the design of a VO infrastructure.
Study on the criterion to determine the bottom deployment modes of a coilable mast
NASA Astrophysics Data System (ADS)
Ma, Haibo; Huang, Hai; Han, Jianbin; Zhang, Wei; Wang, Xinsheng
2017-12-01
A practical design criterion that allows the coilable mast bottom to deploy in local coil mode was proposed. The criterion was defined with initial bottom helical angle and obtained by bottom deformation analyses. Discretizing the longerons into short rods, analyses were conducted based on the cylinder assumption and Kirchhoff's kinetic analogy theory. Then, iterative calculations aiming at the bottom four rods were carried out. A critical bottom helical angle was obtained while the angle changing rate equaled to zero. The critical value was defined as a criterion for judgement of bottom deployment mode. Subsequently, micro-gravity deployment tests were carried out and bottom deployment simulations based on finite element method were developed. Through comparisons of bottom helical angles in critical state, the proposed criterion was evaluated and modified, that is, an initial bottom helical angle less than critical value with a design margin of -13.7% could ensure the mast bottom deploying in local coil mode, and further determine a successful local coil deployment of entire coilable mast.
Bus, Sicco A.; Haspels, Rob; Busch-Westbroek, Tessa E.
2011-01-01
OBJECTIVE Therapeutic footwear for diabetic foot patients aims to reduce the risk of ulceration by relieving mechanical pressure on the foot. However, footwear efficacy is generally not assessed in clinical practice. The purpose of this study was to assess the value of in-shoe plantar pressure analysis to evaluate and optimize the pressure-reducing effects of diabetic therapeutic footwear. RESEARCH DESIGN AND METHODS Dynamic in-shoe plantar pressure distribution was measured in 23 neuropathic diabetic foot patients wearing fully customized footwear. Regions of interest (with peak pressure >200 kPa) were selected and targeted for pressure optimization by modifying the shoe or insole. After each of a maximum of three rounds of modifications, the effect on in-shoe plantar pressure was measured. Successful optimization was achieved with a peak pressure reduction of >25% (criterion A) or below an absolute level of 200 kPa (criterion B). RESULTS In 35 defined regions, mean peak pressure was significantly reduced from 303 (SD 77) to 208 (46) kPa after an average 1.6 rounds of footwear modifications (P < 0.001). This result constitutes a 30.2% pressure relief (range 18–50% across regions). All regions were successfully optimized: 16 according to criterion A, 7 to criterion B, and 12 to criterion A and B. Footwear optimization lasted on average 53 min. CONCLUSIONS These findings suggest that in-shoe plantar pressure analysis is an effective and efficient tool to evaluate and guide footwear modifications that significantly reduce pressure in the neuropathic diabetic foot. This result provides an objective approach to instantly improve footwear quality, which should reduce the risk for pressure-related plantar foot ulcers. PMID:21610125
Development of a Work Sample Criterion for General Vehicle Mechanic.
ERIC Educational Resources Information Center
Engel, John D.
A work sample criterion test was developed for General Vehicle Repairman, MOS 63C30 and 63C40. Test items covered three task categories: troubleshooting, corrective action, and preventive maintenance. Thirty-eight organizational mechanics were tested at Fort Knox, Kentucky. Data were also collected on the quality of performance, for example, use…
Food and Nutrition (Intermediate). Performance Objectives and Criterion-Referenced Test Items.
ERIC Educational Resources Information Center
Missouri Univ., Columbia. Instructional Materials Lab.
This document contains competencies and criterion-referenced test items for the Intermediate Food and Nutrition semester course in Missouri that were derived from the duties and tasks of the Missouri homemaker and identified and validated by home economics teachers and subject matter specialists. The guide is designed to assist home economics…
Chen, Liang-Hsuan; Hsueh, Chan-Ching
2007-06-01
Fuzzy regression models are useful to investigate the relationship between explanatory and response variables with fuzzy observations. Different from previous studies, this correspondence proposes a mathematical programming method to construct a fuzzy regression model based on a distance criterion. The objective of the mathematical programming is to minimize the sum of distances between the estimated and observed responses on the X axis, such that the fuzzy regression model constructed has the minimal total estimation error in distance. Only several alpha-cuts of fuzzy observations are needed as inputs to the mathematical programming model; therefore, the applications are not restricted to triangular fuzzy numbers. Three examples, adopted in the previous studies, and a larger example, modified from the crisp case, are used to illustrate the performance of the proposed approach. The results indicate that the proposed model has better performance than those in the previous studies based on either distance criterion or Kim and Bishu's criterion. In addition, the efficiency and effectiveness for solving the larger example by the proposed model are also satisfactory.
Development of an updated tensile neck injury criterion.
Parr, Jeffrey C; Miller, Michael E; Schubert Kabban, Christine M; Pellettiere, Joseph A; Perry, Chris E
2014-10-01
Ejection neck safety remains a concern in military aviation with the growing use of helmet mounted displays (HMDs) worn for entire mission durations. The original USAF tensile neck injury criterion proposed by Carter et al. (4) is updated and an injury protection limit for tensile loading is presented to evaluate escape system and HMD safety. An existent tensile neck injury criterion was updated through the addition of newer post mortem human subject (PMHS) tensile loading and injury data and the application of Survival Analysis to account for censoring in this data. The updated risk function was constructed with a combined human subject (N = 208) and PMHS (N = 22) data set. An updated AIS 3+ tensile neck injury criterion is proposed based upon human and PMHS data. This limit is significantly more conservative than the criterion proposed by Carter in 2000, yielding a 5% risk of AIS 3+ injury at a force of 1136 N as compared to a corresponding force of 1559 N. The inclusion of recent PMHS data into the original tensile neck injury criterion results in an injury protection limit that is significantly more conservative, as recent PMHS data is substantially less censored than the PMHS data included in the earlier criterion. The updated tensile risk function developed in this work is consistent with the tensile risk function published by the Federal Aviation Administration used as the basis for their neck injury criterion for side facing aircraft seats.
López, Mariana B; Conde, Karina; Cremonte, Mariana
The evidence of important problems related to prenatal alcohol exposure has faced researchers with the problem of understanding and screening alcohol use in this population. Although any alcohol use should be considered risky during pregnancy, identifying alcohol-drinking problems (ADPs) could be especially important because women with ADPs could not benefit from a simple advice of abstinence and because their offsprings are subjected to a higher risk of problems related with prenatal alcohol exposure. In this context, we aim to study the prevalence and characteristics of ADPs in pregnant women, evaluating the performance of different diagnostic systems in this population. The aims of the study were to describe the prevalence of ADPs obtained with the criteria of the Diagnostic and Statistical Manual of Mental Disorders in its fourth (DSM-IV) and fifth edition (DSM-5), and the International Classification of Diseases (ICD)-10, in Argentinean females aged 13 to 44 years, 12 months before delivery; to evaluate the level of agreement between these classification systems; and to analyze the performance of each diagnosis criterion in this population. Data were collected through personal interviews of a probability sample of puerperal women (N = 641) in the city of Santa Fe (Argentina), between October 2010 and February 2011. Diagnoses compatible with DSM-IV, DSM-5, and ICD-10 were obtained through the Composite International Diagnostic Interview. Agreement among diagnostic systems was measured through Cohen kappa. Diagnosis criteria performance were analyzed considering their prevalence and discriminating ability (D value). Total ADP prevalence was 6.4% for DSM-IV (4.2% abuse and 2.2% dependence), 8.1% for DSM-5 (6.4% mild, 0.8% moderate, and 0.9% severe alcohol use disorder), and 14.1% for the ICD-10 (11.9% harmful use and 2.2% dependence). DSM-5 modifications improved agreement between DSM and ICD. The least prevalent and worst discriminating ability diagnostic criterion was "legal problems." The most prevalent and 1 of the best discriminating ability diagnostic criterion was '"health issues." DSM-IV and ICD-10 dependence prevalence was similar to that of previous studies in pregnant women, whereas abuse prevalence was surprisingly higher. Our results indicate a better performance of the DSM-5 alcohol use disorder category relative to the DSM-IV dual categorization. Nevertheless, the poor diagnostic performance of some DSM-5 criteria in this population could evidence their intercultural variability.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hsu, P.-F.; Wu, C.-R.; Li, Y.-T.
2008-07-01
While Taiwanese hospitals dispose of large amounts of medical waste to ensure sanitation and personal hygiene, doing so inefficiently creates potential environmental hazards and increases operational expenses. However, hospitals lack objective criteria to select the most appropriate waste disposal firm and evaluate its performance, instead relying on their own subjective judgment and previous experiences. Therefore, this work presents an analytic hierarchy process (AHP) method to objectively select medical waste disposal firms based on the results of interviews with experts in the field, thus reducing overhead costs and enhancing medical waste management. An appropriate weight criterion based on AHP is derivedmore » to assess the effectiveness of medical waste disposal firms. The proposed AHP-based method offers a more efficient and precise means of selecting medical waste firms than subjective assessment methods do, thus reducing the potential risks for hospitals. Analysis results indicate that the medical sector selects the most appropriate infectious medical waste disposal firm based on the following rank: matching degree, contractor's qualifications, contractor's service capability, contractor's equipment and economic factors. By providing hospitals with an effective means of evaluating medical waste disposal firms, the proposed AHP method can reduce overhead costs and enable medical waste management to understand the market demand in the health sector. Moreover, performed through use of Expert Choice software, sensitivity analysis can survey the criterion weight of the degree of influence with an alternative hierarchy.« less
Feiler, Ute; Ratte, Monika; Arts, Gertie; Bazin, Christine; Brauer, Frank; Casado, Carmen; Dören, Laszlo; Eklund, Britta; Gilberg, Daniel; Grote, Matthias; Gonsior, Guido; Hafner, Christoph; Kopf, Willi; Lemnitzer, Bernd; Liedtke, Anja; Matthias, Uwe; Okos, Ewa; Pandard, Pascal; Scheerbaum, Dirk; Schmitt-Jansen, Mechthild; Stewart, Kathleen; Teodorovic, Ivana; Wenzel, Andrea; Pluta, Hans-Jürgen
2014-03-01
A whole-sediment toxicity test with Myriophyllum aquaticum has been developed by the German Federal Institute of Hydrology and standardized within the International Organization for Standardization (ISO; ISO 16191). An international ring-test was performed to evaluate the precision of the test method. Four sediments (artificial, natural) were tested. Test duration was 10 d, and test endpoint was inhibition of growth rate (r) based on fresh weight data. Eighteen of 21 laboratories met the validity criterion of r ≥ 0.09 d(-1) in the control. Results from 4 tests that did not conform to test-performance criteria were excluded from statistical evaluation. The inter-laboratory variability of growth rates (20.6%-25.0%) and inhibition (26.6%-39.9%) was comparable with the variability of other standardized bioassays. The mean test-internal variability of the controls was low (7% [control], 9.7% [solvent control]), yielding a high discriminatory power of the given test design (median minimum detectable differences [MDD] 13% to 15%). To ensure these MDDs, an additional validity criterion of CV ≤ 15% of the growth rate in the controls was recommended. As a positive control, 90 mg 3,5-dichlorophenol/kg sediment dry mass was tested. The range of the expected growth inhibition was proposed to be 35 ± 15%. The ring test results demonstrated the reliability of the ISO 16191 toxicity test and its suitability as a tool to assess the toxicity of sediment and dredged material. © 2013 SETAC.
Hansarikit, Jarunee; Manotaya, Saknan
2011-05-01
To study the sensitivity and specificity of the modified 100-g oral glucose tolerance test for diagnosis of gestational diabetes mellitus (GDM). Medical records of pregnant women attending the antenatal clinic of King Chulalongkorn Memorial Hospital, Thailand, who underwent a 100-g oral glucose tolerance test (OGTT) during March 2004 to September 2009, were retrospectively reviewed. Three modified criteria were proposed for diagnosis of GDM. The screening efficacy of the modified criteria were assessed, using the National Diabetes Data Group (NDDG) criterion as gold standard. A total of 729 records were reviewed, 511 were included for analysis. Using the NDDG criterion as the gold standard, the modified II criterion has the highest sensitivity of 96.8%, and the highest accuracy of 90.8%. The modified II criterion can detect the same proportion of maternal and neonatal complications, compared to the NDDG criterion. The modified II criterion, using the fasting plasma glucose and 2-hour plasma glucose measurements, showed high sensitivity and accuracy, with moderate specificity for diagnosis of GDM. Its potential use as an alternative to standard 100-g OGTT should be evaluated in the prospective study.
Villettaz Robichaud, M; Rushen, J; de Passillé, A M; Vasseur, E; Haley, D; Orsel, K; Pellerin, D
2018-03-01
Improving animal welfare on farm can sometimes require substantial financial investments. The Canadian dairy industry recently updated their Code of Practice for the care of dairy animals and created a mandatory on-farm animal care assessment (proAction Animal Care). Motivating dairy farmers to follow the recommendations of the Code of Practice and successfully meet the targets of the on-farm assessment can be enhanced by financial gain associated with improved animal welfare. The aim of the current study was to evaluate the association between meeting or not meeting several criteria from an on-farm animal welfare assessment and the farms' productivity and profitability indicators. Data from 130 freestall farms (20 using automatic milking systems) were used to calculate the results of the animal care assessment. Productivity and profitability indicators, including milk production, somatic cell count, reproduction, and longevity, were retrieved from the regional dairy herd improvement association databases. Economic margins over replacement costs were also calculated. Univariable and multivariable linear regression models were used to evaluate the associations between welfare and productivity and profitability indicators. The proportion of automatic milking system farms that met the proAction criterion for hock lesions was higher compared with parlor farms and lower for the neck lesion criterion. The proAction criterion for lameness prevalence was significantly associated with average corrected milk production per year. Average days in milk (DIM) at first breeding acted as an effect modifier for this association, resulting in a steeper increase of milk production in farms that met the criterion with increasing average DIM at first breeding. The reproduction and longevity indicators studied were not significantly associated with meeting or not meeting the proAction criteria investigated in this study. Meeting the proAction lameness prevalence parameter was associated with an increased profitability margin per cow over replacement cost by $236 compared with farms that did not. These results suggest that associations are present between meeting the lameness prevalence benchmark of the Animal Care proAction Initiative and freestall farms' productivity and profitability. Overall, meeting the animal-based criteria evaluated in this study was not detrimental to freestall farms' productivity and profitability. Copyright © 2018 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
Ranganathan, Rajiv; Wieser, Jon; Mosier, Kristine M; Mussa-Ivaldi, Ferdinando A; Scheidt, Robert A
2014-06-11
Prior learning of a motor skill creates motor memories that can facilitate or interfere with learning of new, but related, motor skills. One hypothesis of motor learning posits that for a sensorimotor task with redundant degrees of freedom, the nervous system learns the geometric structure of the task and improves performance by selectively operating within that task space. We tested this hypothesis by examining if transfer of learning between two tasks depends on shared dimensionality between their respective task spaces. Human participants wore a data glove and learned to manipulate a computer cursor by moving their fingers. Separate groups of participants learned two tasks: a prior task that was unique to each group and a criterion task that was common to all groups. We manipulated the mapping between finger motions and cursor positions in the prior task to define task spaces that either shared or did not share the task space dimensions (x-y axes) of the criterion task. We found that if the prior task shared task dimensions with the criterion task, there was an initial facilitation in criterion task performance. However, if the prior task did not share task dimensions with the criterion task, there was prolonged interference in learning the criterion task due to participants finding inefficient task solutions. These results show that the nervous system learns the task space through practice, and that the degree of shared task space dimensionality influences the extent to which prior experience transfers to subsequent learning of related motor skills. Copyright © 2014 the authors 0270-6474/14/348289-11$15.00/0.
NASA Astrophysics Data System (ADS)
Rolita, Lisa; Surarso, Bayu; Gernowo, Rahmat
2018-02-01
In order to improve airport safety management system (SMS) performance, an evaluation system is required to improve on current shortcomings and maximize safety. This study suggests the integration of the DEMATEL and ANP methods in decision making processes by analyzing causal relations between the relevant criteria and taking effective analysis-based decision. The DEMATEL method builds on the ANP method in identifying the interdependencies between criteria. The input data consists of questionnaire data obtained online and then stored in an online database. Furthermore, the questionnaire data is processed using DEMATEL and ANP methods to obtain the results of determining the relationship between criteria and criteria that need to be evaluated. The study cases on this evaluation system were Adi Sutjipto International Airport, Yogyakarta (JOG); Ahmad Yani International Airport, Semarang (SRG); and Adi Sumarmo International Airport, Surakarta (SOC). The integration grades SMS performance criterion weights in a descending order as follow: safety and destination policy, safety risk management, healthcare, and safety awareness. Sturges' formula classified the results into nine grades. JOG and SMG airports were in grade 8, while SOG airport was in grade 7.
Evaluer la competence de communication (Evaluating Communicative Competence).
ERIC Educational Resources Information Center
Hediard, Marie
1988-01-01
The structure of a course designed to teach oral communicative competence is outlined, and the approach to evaluation is discussed. Evaluation includes both a criterion test and a specific oral task that students must accomplish. (MSE)
Uhler, Kristin M; Baca, Rosalinda; Dudas, Emily; Fredrickson, Tammy
2015-01-01
Speech perception measures have long been considered an integral piece of the audiological assessment battery. Currently, a prelinguistic, standardized measure of speech perception is missing in the clinical assessment battery for infants and young toddlers. Such a measure would allow systematic assessment of speech perception abilities of infants as well as the potential to investigate the impact early identification of hearing loss and early fitting of amplification have on the auditory pathways. To investigate the impact of sensation level (SL) on the ability of infants with normal hearing (NH) to discriminate /a-i/ and /ba-da/ and to determine if performance on the two contrasts are significantly different in predicting the discrimination criterion. The design was based on a survival analysis model for event occurrence and a repeated measures logistic model for binary outcomes. The outcome for survival analysis was the minimum SL for criterion and the outcome for the logistic regression model was the presence/absence of achieving the criterion. Criterion achievement was designated when an infant's proportion correct score was >0.75 on the discrimination performance task. Twenty-two infants with NH sensitivity participated in this study. There were 9 males and 13 females, aged 6-14 mo. Testing took place over two to three sessions. The first session consisted of a hearing test, threshold assessment of the two speech sounds (/a/ and /i/), and if time and attention allowed, visual reinforcement infant speech discrimination (VRISD). The second session consisted of VRISD assessment for the two test contrasts (/a-i/ and /ba-da/). The presentation level started at 50 dBA. If the infant was unable to successfully achieve criterion (>0.75) at 50 dBA, the presentation level was increased to 70 dBA followed by 60 dBA. Data examination included an event analysis, which provided the probability of criterion distribution across SL. The second stage of the analysis was a repeated measures logistic regression where SL and contrast were used to predict the likelihood of speech discrimination criterion. Infants were able to reach criterion for the /a-i/ contrast at statistically lower SLs when compared to /ba-da/. There were six infants who never reached criterion for /ba-da/ and one never reached criterion for /a-i/. The conditional probability of not reaching criterion by 70 dB SL was 0% for /a-i/ and 21% for /ba-da/. The predictive logistic regression model showed that children were more likely to discriminate the /a-i/ even when controlling for SL. Nearly all normal-hearing infants can demonstrate discrimination criterion of a vowel contrast at 60 dB SL, while a level of ≥70 dB SL may be needed to allow all infants to demonstrate discrimination criterion of a difficult consonant contrast. American Academy of Audiology.
The Information a Test Provides on an Ability Parameter. Research Report. ETS RR-07-18
ERIC Educational Resources Information Center
Haberman, Shelby J.
2007-01-01
In item-response theory, if a latent-structure model has an ability variable, then elementary information theory may be employed to provide a criterion for evaluation of the information the test provides concerning ability. This criterion may be considered even in cases in which the latent-structure model is not valid, although interpretation of…
Mooney, Robert; Corley, Gavin; Godfrey, Alan; Osborough, Conor; ÓLaighin, Gearóid
2017-01-01
Aims The study aims were to evaluate the validity of two commercially available swimming activity monitors for quantifying temporal and kinematic swimming variables. Methods Ten national level swimmers (5 male, 5 female; 15.3±1.3years; 164.8±12.9cm; 62.4±11.1kg; 425±66 FINA points) completed a set protocol comprising 1,500m of swimming involving all four competitive swimming strokes. Swimmers wore the Finis Swimsense and the Garmin Swim activity monitors throughout. The devices automatically identified stroke type, swim distance, lap time, stroke count, stroke rate, stroke length and average speed. Video recordings were also obtained and used as a criterion measure to evaluate performance. Results A significant positive correlation was found between the monitors and video for the identification of each of the four swim strokes (Garmin: X2 (3) = 31.292, p<0.05; Finis:X2 (3) = 33.004, p<0.05). No significant differences were found for swim distance measurements. Swimming laps performed in the middle of a swimming interval showed no significant difference from the criterion (Garmin: bias -0.065, 95% confidence intervals -3.828–6.920; Finis bias -0.02, 95% confidence intervals -3.095–3.142). However laps performed at the beginning and end of an interval were not as accurately timed. Additionally, a statistical difference was found for stroke count measurements in all but two occasions (p<0.05). These differences affect the accuracy of stroke rate, stroke length and average speed scores reported by the monitors, as all of these are derived from lap times and stroke counts. Conclusions Both monitors were found to operate with a relatively similar performance level and appear suited for recreational use. However, issues with feature detection accuracy may be related to individual variances in stroke technique. It is reasonable to expect that this level of error would increase when the devices are used by recreational swimmers rather than elite swimmers. Further development to improve accuracy of feature detection algorithms, specifically for lap time and stroke count, would also increase their suitability within competitive settings. PMID:28178301
Mooney, Robert; Quinlan, Leo R; Corley, Gavin; Godfrey, Alan; Osborough, Conor; ÓLaighin, Gearóid
2017-01-01
The study aims were to evaluate the validity of two commercially available swimming activity monitors for quantifying temporal and kinematic swimming variables. Ten national level swimmers (5 male, 5 female; 15.3±1.3years; 164.8±12.9cm; 62.4±11.1kg; 425±66 FINA points) completed a set protocol comprising 1,500m of swimming involving all four competitive swimming strokes. Swimmers wore the Finis Swimsense and the Garmin Swim activity monitors throughout. The devices automatically identified stroke type, swim distance, lap time, stroke count, stroke rate, stroke length and average speed. Video recordings were also obtained and used as a criterion measure to evaluate performance. A significant positive correlation was found between the monitors and video for the identification of each of the four swim strokes (Garmin: X2 (3) = 31.292, p<0.05; Finis:X2 (3) = 33.004, p<0.05). No significant differences were found for swim distance measurements. Swimming laps performed in the middle of a swimming interval showed no significant difference from the criterion (Garmin: bias -0.065, 95% confidence intervals -3.828-6.920; Finis bias -0.02, 95% confidence intervals -3.095-3.142). However laps performed at the beginning and end of an interval were not as accurately timed. Additionally, a statistical difference was found for stroke count measurements in all but two occasions (p<0.05). These differences affect the accuracy of stroke rate, stroke length and average speed scores reported by the monitors, as all of these are derived from lap times and stroke counts. Both monitors were found to operate with a relatively similar performance level and appear suited for recreational use. However, issues with feature detection accuracy may be related to individual variances in stroke technique. It is reasonable to expect that this level of error would increase when the devices are used by recreational swimmers rather than elite swimmers. Further development to improve accuracy of feature detection algorithms, specifically for lap time and stroke count, would also increase their suitability within competitive settings.
Wilson, G. Terence; Sysko, Robyn
2013-01-01
Objective In DSM-IV, to be diagnosed with Bulimia Nervosa (BN) or the provisional diagnosis of Binge Eating Disorder (BED), an individual must experience episodes of binge eating is “at least twice a week” on average, for three or six months respectively. The purpose of this review was to examine the validity and utility of the frequency criterion for BN and BED. Method Published studies evaluating the frequency criterion were reviewed. Results Our review found little evidence to support the validity or utility of the DSM-IV frequency criterion of twice a week binge eating; however, the number of studies available for our review was limited. Conclusion A number of options are available for the frequency criterion in DSM-V, and the optimal diagnostic threshold for binge eating remains to be determined. PMID:19610014
Nozari, Nazbanou; Hepner, Christopher R
2018-06-05
Competitive accounts of lexical selection propose that the activation of competitors slows down the selection of the target. Non-competitive accounts, on the other hand, posit that target response latencies are independent of the activation of competing items. In this paper, we propose a signal detection framework for lexical selection and show how a flexible selection criterion affects claims of competitive selection. Specifically, we review evidence from neurotypical and brain-damaged speakers and demonstrate that task goals and the state of the production system determine whether a competitive or a non-competitive selection profile arises. We end by arguing that there is conclusive evidence for a flexible criterion in lexical selection, and that integrating criterion shifts into models of language production is critical for evaluating theoretical claims regarding (non-)competitive selection.
Physical Employment Standards for UK Firefighters
Stevenson, Richard D.M.; Siddall, Andrew G.; Turner, Philip F.J.; Bilzon, James L.J.
2017-01-01
Objective: The aim of this study was to assess sensitivity and specificity of surrogate physical ability tests as predictors of criterion firefighting task performance and to identify corresponding minimum muscular strength and endurance standards. Methods: Fifty-one (26 male; 25 female) participants completed three criterion tasks (ladder lift, ladder lower, ladder extension) and three corresponding surrogate tests [one-repetition maximum (1RM) seated shoulder press; 1RM seated rope pull-down; repeated 28 kg seated rope pull-down]. Surrogate test standards were calculated that best identified individuals who passed (sensitivity; true positives) and failed (specificity; true negatives) criterion tasks. Results: Best sensitivity/specificity achieved were 1.00/1.00 for a 35 kg seated shoulder press, 0.79/0.92 for a 60 kg rope pull-down, and 0.83/0.93 for 23 repetitions of the 28 kg rope pull-down. Conclusions: These standards represent performance on surrogate tests commensurate with minimum acceptable performance of essential strength-based occupational tasks in UK firefighters. PMID:28045801
Validation of GPU based TomoTherapy dose calculation engine.
Chen, Quan; Lu, Weiguo; Chen, Yu; Chen, Mingli; Henderson, Douglas; Sterpin, Edmond
2012-04-01
The graphic processing unit (GPU) based TomoTherapy convolution/superposition(C/S) dose engine (GPU dose engine) achieves a dramatic performance improvement over the traditional CPU-cluster based TomoTherapy dose engine (CPU dose engine). Besides the architecture difference between the GPU and CPU, there are several algorithm changes from the CPU dose engine to the GPU dose engine. These changes made the GPU dose slightly different from the CPU-cluster dose. In order for the commercial release of the GPU dose engine, its accuracy has to be validated. Thirty eight TomoTherapy phantom plans and 19 patient plans were calculated with both dose engines to evaluate the equivalency between the two dose engines. Gamma indices (Γ) were used for the equivalency evaluation. The GPU dose was further verified with the absolute point dose measurement with ion chamber and film measurements for phantom plans. Monte Carlo calculation was used as a reference for both dose engines in the accuracy evaluation in heterogeneous phantom and actual patients. The GPU dose engine showed excellent agreement with the current CPU dose engine. The majority of cases had over 99.99% of voxels with Γ(1%, 1 mm) < 1. The worst case observed in the phantom had 0.22% voxels violating the criterion. In patient cases, the worst percentage of voxels violating the criterion was 0.57%. For absolute point dose verification, all cases agreed with measurement to within ±3% with average error magnitude within 1%. All cases passed the acceptance criterion that more than 95% of the pixels have Γ(3%, 3 mm) < 1 in film measurement, and the average passing pixel percentage is 98.5%-99%. The GPU dose engine also showed similar degree of accuracy in heterogeneous media as the current TomoTherapy dose engine. It is verified and validated that the ultrafast TomoTherapy GPU dose engine can safely replace the existing TomoTherapy cluster based dose engine without degradation in dose accuracy.
Fruit Phenolic Profiling: A New Selection Criterion in Olive Breeding Programs
Pérez, Ana G.; León, Lorenzo; Sanz, Carlos; de la Rosa, Raúl
2018-01-01
Olive growing is mainly based on traditional varieties selected by the growers across the centuries. The few attempts so far reported to obtain new varieties by systematic breeding have been mainly focused on improving the olive adaptation to different growing systems, the productivity and the oil content. However, the improvement of oil quality has rarely been considered as selection criterion and only in the latter stages of the breeding programs. Due to their health promoting and organoleptic properties, phenolic compounds are one of the most important quality markers for Virgin olive oil (VOO) although they are not commonly used as quality traits in olive breeding programs. This is mainly due to the difficulties for evaluating oil phenolic composition in large number of samples and the limited knowledge on the genetic and environmental factors that may influence phenolic composition. In the present work, we propose a high throughput methodology to include the phenolic composition as a selection criterion in olive breeding programs. For that purpose, the phenolic profile has been determined in fruits and oils of several breeding selections and two varieties (“Picual” and “Arbequina”) used as control. The effect of three different environments, typical for olive growing in Andalusia, Southern Spain, was also evaluated. A high genetic effect was observed on both fruit and oil phenolic profile. In particular, the breeding selection UCI2-68 showed an optimum phenolic profile, which sums up to a good agronomic performance previously reported. A high correlation was found between fruit and oil total phenolic content as well as some individual phenols from the two different matrices. The environmental effect on phenolic compounds was also significant in both fruit and oil, although the low genotype × environment interaction allowed similar ranking of genotypes on the different environments. In summary, the high genotypic variance and the simplified procedure of the proposed methodology for fruit phenol evaluation seems to be convenient for breeding programs aiming at obtaining new cultivars with improved phenolic profile. PMID:29535752
Fruit Phenolic Profiling: A New Selection Criterion in Olive Breeding Programs.
Pérez, Ana G; León, Lorenzo; Sanz, Carlos; de la Rosa, Raúl
2018-01-01
Olive growing is mainly based on traditional varieties selected by the growers across the centuries. The few attempts so far reported to obtain new varieties by systematic breeding have been mainly focused on improving the olive adaptation to different growing systems, the productivity and the oil content. However, the improvement of oil quality has rarely been considered as selection criterion and only in the latter stages of the breeding programs. Due to their health promoting and organoleptic properties, phenolic compounds are one of the most important quality markers for Virgin olive oil (VOO) although they are not commonly used as quality traits in olive breeding programs. This is mainly due to the difficulties for evaluating oil phenolic composition in large number of samples and the limited knowledge on the genetic and environmental factors that may influence phenolic composition. In the present work, we propose a high throughput methodology to include the phenolic composition as a selection criterion in olive breeding programs. For that purpose, the phenolic profile has been determined in fruits and oils of several breeding selections and two varieties ("Picual" and "Arbequina") used as control. The effect of three different environments, typical for olive growing in Andalusia, Southern Spain, was also evaluated. A high genetic effect was observed on both fruit and oil phenolic profile. In particular, the breeding selection UCI2-68 showed an optimum phenolic profile, which sums up to a good agronomic performance previously reported. A high correlation was found between fruit and oil total phenolic content as well as some individual phenols from the two different matrices. The environmental effect on phenolic compounds was also significant in both fruit and oil, although the low genotype × environment interaction allowed similar ranking of genotypes on the different environments. In summary, the high genotypic variance and the simplified procedure of the proposed methodology for fruit phenol evaluation seems to be convenient for breeding programs aiming at obtaining new cultivars with improved phenolic profile.
Anthropometry as a predictor of bench press performance done at different loads.
Caruso, John F; Taylor, Skyler T; Lutz, Brant M; Olson, Nathan M; Mason, Melissa L; Borgsmiller, Jake A; Riner, Rebekah D
2012-09-01
The purpose of our study was to examine the ability of anthropometric variables (body mass, total arm length, biacromial width) to predict bench press performance at both maximal and submaximal loads. Our methods required 36 men to visit our laboratory and submit to anthropometric measurements, followed by lifting as much weight as possible in good form one time (1 repetition maximum, 1RM) in the exercise. They made 3 more visits in which they performed 4 sets of bench presses to volitional failure at 1 of 3 (40, 55, or 75% 1RM) submaximal loads. An accelerometer (Myotest Inc., Royal Oak MI) measured peak force, velocity, and power after each submaximal load set. With stepwise multivariate regression, our 3 anthropometric variables attempted to explain significant amounts of variance for 13 bench press performance indices. For criterion measures that reached significance, separate Pearson product moment correlation coefficients further assessed if the strength of association each anthropometric variable had with the criterion was also significant. Our analyses showed that anthropometry explained significant amounts (p < 0.05) of variance for 8 criterion measures. It was concluded that body mass had strong univariate correlations with 1RM and force-related measures, total arm length was moderately associated with 1RM and criterion variables at the lightest load, whereas biacromial width had an inverse relationship with the peak number of repetitions performed per set at the 2 lighter loads. Practical applications suggest results may help coaches and practitioners identify anthropometric features that may best predict various measures of bench press prowess in athletes.
Evaluation schemes for video and image anomaly detection algorithms
NASA Astrophysics Data System (ADS)
Parameswaran, Shibin; Harguess, Josh; Barngrover, Christopher; Shafer, Scott; Reese, Michael
2016-05-01
Video anomaly detection is a critical research area in computer vision. It is a natural first step before applying object recognition algorithms. There are many algorithms that detect anomalies (outliers) in videos and images that have been introduced in recent years. However, these algorithms behave and perform differently based on differences in domains and tasks to which they are subjected. In order to better understand the strengths and weaknesses of outlier algorithms and their applicability in a particular domain/task of interest, it is important to measure and quantify their performance using appropriate evaluation metrics. There are many evaluation metrics that have been used in the literature such as precision curves, precision-recall curves, and receiver operating characteristic (ROC) curves. In order to construct these different metrics, it is also important to choose an appropriate evaluation scheme that decides when a proposed detection is considered a true or a false detection. Choosing the right evaluation metric and the right scheme is very critical since the choice can introduce positive or negative bias in the measuring criterion and may favor (or work against) a particular algorithm or task. In this paper, we review evaluation metrics and popular evaluation schemes that are used to measure the performance of anomaly detection algorithms on videos and imagery with one or more anomalies. We analyze the biases introduced by these by measuring the performance of an existing anomaly detection algorithm.
Understanding protocol performance: impact of test performance.
Turner, Robert G
2013-01-01
This is the second of two articles that examine the factors that determine protocol performance. The objective of these articles is to provide a general understanding of protocol performance that can be used to estimate performance, establish limits on performance, decide if a protocol is justified, and ultimately select a protocol. The first article was concerned with protocol criterion and test correlation. It demonstrated the advantages and disadvantages of different criterion when all tests had the same performance. It also examined the impact of increasing test correlation on protocol performance and the characteristics of the different criteria. To examine the impact on protocol performance when individual tests in a protocol have different performance. This is evaluated for different criteria and test correlations. The results of the two articles are combined and summarized. A mathematical model is used to calculate protocol performance for different protocol criteria and test correlations when there are small to large variations in the performance of individual tests in the protocol. The performance of the individual tests that make up a protocol has a significant impact on the performance of the protocol. As expected, the better the performance of the individual tests, the better the performance of the protocol. Many of the characteristics of the different criteria are relatively independent of the variation in the performance of the individual tests. However, increasing test variation degrades some criteria advantages and causes a new disadvantage to appear. This negative impact increases as test variation increases and as more tests are added to the protocol. Best protocol performance is obtained when individual tests are uncorrelated and have the same performance. In general, the greater the variation in the performance of tests in the protocol, the more detrimental this variation is to protocol performance. Since this negative impact is increased as more tests are added to the protocol, greater test variation indicates using fewer tests in the protocol. American Academy of Audiology.
Examination of DSM-5 Section III avoidant personality disorder in a community sample.
Sellbom, Martin; Carmichael, Kieran L C; Liggett, Jacqueline
2017-11-01
The current research evaluated the continuity between DSM-5 Section II and Section III diagnostic operationalizations of avoidant personality disorder (AvPD). More specifically, the study had three aims: (1) to examine which personality constructs comprise the optimal trait constellation for AvPD; (2) to investigate the utility of the proposed structure of the Section III AvPD diagnosis, in regard to combining functional impairment (criterion A) and a dimensional measure of personality (criterion B) variables; and (3) to determine whether AvPD-specific impairment confers incremental meaningful contribution above and beyond general impairment in personality functioning. A mixed sample of 402 university and community participants was recruited, and they were administered multiple measures of Section II PD, personality traits, and personality impairment. A latent measurement model approach was used to analyse data. Results supported the general continuity between Section II and Section III of the DSM-5; however, three of the four main criterion B traits were the stronger predictors. There was also some support for the trait unassertiveness augmenting the criterion B trait profile. The combination of using functional impairment criteria (criterion A) and dimensional personality constructs (criterion B) in operationalizing AvPD was supported; however, the reliance of disorder-specific over general impairment for criterion A was not supported. Copyright © 2017 John Wiley & Sons, Ltd. Copyright © 2017 John Wiley & Sons, Ltd.
Hodge, Megan; Gotzke, Carrie Lynne
2014-08-01
To evaluate the criterion-related validity of the TOCS+ sentence measure (TOCS+, Hodge, Daniels & Gotzke, 2009 ) for children with dysarthria and CP by comparing intelligibility and rate scores obtained concurrently from the TOCS+ and from a conversational sample. Twenty children (3 to 10 years old) diagnosed with spastic cerebral palsy (CP) participated. Nineteen children also had a confirmed diagnosis of dysarthria. Children's intelligibility and speaking rate scores obtained from the TOCS+, which uses imitation of sets of randomly selected items ranging from 2-7 words (80 words in total) and from a contiguous 100-word conversational speech were compared. Mean intelligibility scores were 46.5% (SD = 26.4%) and 50.9% (SD = 19.1%) and mean rates in words per minute (WPM) were 90.2 (SD = 22.3) and 94.1 (SD = 25.6), respectively, for the TOCS+ and conversational samples. No significant differences were found between the two conditions for intelligibility or rate scores. Strong correlations were found between the TOCS+ and conversational samples for intelligibility (r = 0.86; p < 0.001) and WPM (r = 0.77; p < 0.001), supporting the criterion validity of the TOCS+ sentence task as a time efficient procedure for measuring intelligibility and rate in children with CP, with and without confirmed dysarthria. The results support the criterion validity of the TOCS+ sentence task as a time efficient procedure for measuring intelligibility and rate in children with CP, with and without confirmed dysarthria. Children varied in their relative performance on the two speaking tasks, reflecting the complexity of factors that influence intelligibility and rate scores.
Becker, Daniel F; Añez, Luis Miguel; Paris, Manuel; Bedregal, Luis; Grilo, Carlos M
2009-01-01
This study examined the internal consistency, factor structure, and diagnostic efficiency of the Diagnostic and Statistical Manual of Mental Disorders, Fourth Edition (DSM-IV), criteria for avoidant personality disorder (AVPD) and the extent to which these metrics may be affected by sex. Subjects were 130 monolingual Hispanic adults (90 men, 40 women) who had been admitted to a specialty clinic that provides psychiatric and substance abuse services to Spanish-speaking patients. All were reliably assessed with the Spanish-Language Version of the Diagnostic Interview for DSM-IV Personality Disorders. The AVPD diagnosis was determined by the best-estimate method. After evaluating internal consistency of the AVPD criterion set, an exploratory factor analysis was performed using principal components extraction. Afterward, diagnostic efficiency indices were calculated for all AVPD criteria. Subsequent analyses examined men and women separately. For the overall group, internal consistency of AVPD criteria was good. Exploratory factor analysis revealed a 1-factor solution (accounting for 70% of the variance), supporting the unidimensionality of the AVPD criterion set. The best inclusion criterion was "reluctance to take risks," whereas "interpersonally inhibited" was the best exclusion criterion and the best predictor overall. When men and women were examined separately, similar results were obtained for both internal consistency and factor structure, with slight variations noted between sexes in the patterning of diagnostic efficiency indices. These psychometric findings, which were similar for men and women, support the construct validity of the DSM-IV criteria for AVPD and may also have implications for the treatment of this particular clinical population.
Proposals or findings for a new approach about how to define and diagnose premature ejaculation.
Wang, Weifu; Kumar, Pardeep; Minhas, Suks; Ralph, David
2005-09-01
To review and present the proposals or findings for a new approach about how to define and diagnose premature ejaculation (PE). Using Medline to search for international peer reviewed manuscripts published from 1996 to 2004 about the definition and diagnosis of PE. PE, to date, has not a universally agreed definition and diagnostic criterion. Many definitions are partial, subjective and nonspecific. An ideal definition or diagnostic criterion should consist of intravaginal ejaculatory latency time (IELT), the ability to control over ejaculation, the extent of male sexual satisfaction, the extent of female sexual satisfaction, the frequency of female sexual partner reaching orgasm and the extent of psychological and pathological factors. Therefore, the Chinese Index of Premature Ejaculation (CIPE) seems an ideal tool and criterion used to diagnose PE due to including all the elements above. In the majority of cases, PE is the result of a mix of psychogenic, physiological and organic factors. So, besides some routine tests such as urine routine test, endocrine hormone assay, psychosexual counseling, couple evaluation and physical examination, prostate examination, serum leptin assay, semen magnesium assessment and glans hypersensitivity measurement, are suggested to be performed in the diagnosis of PE. Although elucidated by two clinical trials and further confirmed, serum leptin assay seems a promising and objective marker to diagnose PE because it is related to the serotonergic system whose disorder has been confirmed to contribute to the etiology of PE. None of these definitions and diagnoses has been accepted as a universal agreement of PE. CIPE seems an ideal tool and criterion used to diagnose PE and leptin maybe become a promising and objective marker for PE.
Decohesion models informed by first-principles calculations: The ab initio tensile test
NASA Astrophysics Data System (ADS)
Enrique, Raúl A.; Van der Ven, Anton
2017-10-01
Extreme deformation and homogeneous fracture can be readily studied via ab initio methods by subjecting crystals to numerical "tensile tests", where the energy of locally stable crystal configurations corresponding to elongated and fractured states are evaluated by means of density functional method calculations. The information obtained can then be used to construct traction curves of cohesive zone models in order to address fracture at the macroscopic scale. In this work, we perform an in depth analysis of traction curves and how ab initio calculations must be interpreted to rigorously parameterize an atomic scale cohesive zone model, using crystalline Ag as an example. Our analysis of traction curves reveal the existence of two qualitatively distinct decohesion criteria: (i) an energy criterion whereby the released elastic energy equals the energy cost of creating two new surfaces and (ii) an instability criterion that occurs at a higher and size independent stress than that of the energy criterion. We find that increasing the size of the simulation cell renders parts of the traction curve inaccessible to ab initio calculations involving the uniform decohesion of the crystal. We also find that the separation distance below which a crack heals is not a material parameter as has been proposed in the past. Finally, we show that a large energy barrier separates the uniformly stressed crystal from the decohered crystal, resolving a paradox predicted by a scaling law based on the energy criterion that implies that large crystals will decohere under vanishingly small stresses. This work clarifies confusion in the literature as to how a cohesive zone model is to be parameterized with ab initio "tensile tests" in the presence of internal relaxations.
The Impact of Various Class-Distinction Features on Model Selection in the Mixture Rasch Model
ERIC Educational Resources Information Center
Choi, In-Hee; Paek, Insu; Cho, Sun-Joo
2017-01-01
The purpose of the current study is to examine the performance of four information criteria (Akaike's information criterion [AIC], corrected AIC [AICC] Bayesian information criterion [BIC], sample-size adjusted BIC [SABIC]) for detecting the correct number of latent classes in the mixture Rasch model through simulations. The simulation study…
Effect of Single Setting versus Multiple Setting Training on Learning to Shop in a Department Store.
ERIC Educational Resources Information Center
Westling, David L.; And Others
1990-01-01
Fifteen students, age 13-21, with moderate to profound mental retardation received shopping skills training in either 1 or 3 department stores. A study of operational behaviors, social behaviors, number of settings in which criterion performance was achieved, and number of sessions required to achieve criterion found no significant differences…
ERIC Educational Resources Information Center
Roth, Philip L.; Buster, Maury A.; Bobko, Philip
2011-01-01
A number of applied psychologists have suggested that trainability test Black-White ethnic group differences are low or relatively low (e.g., Siegel & Bergman, 1975), though data are scarce. Likewise, there are relatively few estimates of criterion-related validity for trainability tests predicting job performance (cf. Robertson & Downs,…
ERIC Educational Resources Information Center
Murray, Gregory V.; Moyer-Packenham, Patricia S.
2014-01-01
One option for length of individual mathematics class periods is the schedule type selected for Algebra I classes. This study examined the relationship between student achievement, as indicated by Algebra I Criterion-Referenced Test scores, and the schedule type for Algebra I classes. Data obtained from the Utah State Office of Education included…
Vortex identification from local properties of the vorticity field
NASA Astrophysics Data System (ADS)
Elsas, J. H.; Moriconi, L.
2017-01-01
A number of systematic procedures for the identification of vortices/coherent structures have been developed as a way to address their possible kinematical and dynamical roles in structural formulations of turbulence. It has been broadly acknowledged, however, that vortex detection algorithms, usually based on linear-algebraic properties of the velocity gradient tensor, can be plagued with severe shortcomings and may become, in practical terms, dependent on the choice of subjective threshold parameters in their implementations. In two-dimensions, a large class of standard vortex identification prescriptions turn out to be equivalent to the "swirling strength criterion" (λc i-criterion), which is critically revisited in this work. We classify the instances where the accuracy of the λc i-criterion is affected by nonlinear superposition effects and propose an alternative vortex detection scheme based on the local curvature properties of the vorticity graph (x ,y ,ω ) —the "vorticity curvature criterion" (λω-criterion)—which improves over the results obtained with the λc i-criterion in controlled Monte Carlo tests. A particularly problematic issue, given its importance in wall-bounded flows, is the eventual inadequacy of the λc i-criterion for many-vortex configurations in the presence of strong background shear. We show that the λω-criterion is able to cope with these cases as well, if a subtraction of the mean velocity field background is performed, in the spirit of the Reynolds decomposition procedure. A realistic comparative study for vortex identification is then carried out for a direct numerical simulation of a turbulent channel flow, including a three-dimensional extension of the λω-criterion. In contrast to the λc i-criterion, the λω-criterion indicates in a consistent way the existence of small scale isotropic turbulent fluctuations in the logarithmic layer, in consonance with long-standing assumptions commonly taken in turbulent boundary layer phenomenology.
Hierarchical semi-numeric method for pairwise fuzzy group decision making.
Marimin, M; Umano, M; Hatono, I; Tamura, H
2002-01-01
Gradual improvements to a single-level semi-numeric method, i.e., linguistic labels preference representation by fuzzy sets computation for pairwise fuzzy group decision making are summarized. The method is extended to solve multiple criteria hierarchical structure pairwise fuzzy group decision-making problems. The problems are hierarchically structured into focus, criteria, and alternatives. Decision makers express their evaluations of criteria and alternatives based on each criterion by using linguistic labels. The labels are converted into and processed in triangular fuzzy numbers (TFNs). Evaluations of criteria yield relative criteria weights. Evaluations of the alternatives, based on each criterion, yield a degree of preference for each alternative or a degree of satisfaction for each preference value. By using a neat ordered weighted average (OWA) or a fuzzy weighted average operator, solutions obtained based on each criterion are aggregated into final solutions. The hierarchical semi-numeric method is suitable for solving a larger and more complex pairwise fuzzy group decision-making problem. The proposed method has been verified and applied to solve some real cases and is compared to Saaty's (1996) analytic hierarchy process (AHP) method.
Methods of evaluating the effects of coding on SAR data
NASA Technical Reports Server (NTRS)
Dutkiewicz, Melanie; Cumming, Ian
1993-01-01
It is recognized that mean square error (MSE) is not a sufficient criterion for determining the acceptability of an image reconstructed from data that has been compressed and decompressed using an encoding algorithm. In the case of Synthetic Aperture Radar (SAR) data, it is also deemed to be insufficient to display the reconstructed image (and perhaps error image) alongside the original and make a (subjective) judgment as to the quality of the reconstructed data. In this paper we suggest a number of additional evaluation criteria which we feel should be included as evaluation metrics in SAR data encoding experiments. These criteria have been specifically chosen to provide a means of ensuring that the important information in the SAR data is preserved. The paper also presents the results of an investigation into the effects of coding on SAR data fidelity when the coding is applied in (1) the signal data domain, and (2) the image domain. An analysis of the results highlights the shortcomings of the MSE criterion, and shows which of the suggested additional criterion have been found to be most important.
360-degree suture trabeculotomy ab interno to treat open-angle glaucoma: 2-year outcomes
Sato, Tomoki; Kawaji, Takahiro; Hirata, Akira; Mizoguchi, Takanori
2018-01-01
Purpose The purpose of this study was to evaluate the efficacy of 360-degree suture trabeculotomy (360S-LOT) ab interno for treating open-angle glaucoma (OAG). Risk factors of surgical failure were examined. Patients and methods 360S-LOT ab interno alone was performed for patients with uncontrolled OAG, and combined 360S-LOT ab interno/phacoemulsification was performed for patients with controlled OAG with a visually significant cataract between March 2014 and September 2015 at a single center. The patients were prospectively followed for 2 years. The main outcome measures included 2-year intraocular pressure (IOP), number of anti-glaucoma medications used, postoperative complications, and predictive factors of surgical failure. Kaplan–Meier analysis was performed, with surgical success (with or without medication use) defined as postoperative IOP ≤15 mmHg and IOP reduction ≥20% (criterion A) or IOP ≤12 mmHg and IOP reduction ≥30% (criterion B). Predictive factors were evaluated using Cox proportional hazard ratios. Results A total of 64 eyes of 64 patients were included, and 50 (78%) eyes of 64 eyes underwent a phacoemulsification combination procedure. Surgery significantly reduced IOP from 18.4 ± 2.9 mmHg before surgery to 13.4 ± 3.0 mmHg after surgery (P < 0.001). Patients used an average of 1.8 ± 1.5 medications before surgery and 1.3 ± 1.5 medications after surgery (P = 0.101). No serious postoperative complications were observed. The probability of surgical success was 49.2% and 16.0% using criteria A and B, respectively. No risk factors of surgical failure were identified. Conclusion The 360S-LOT ab interno procedure is a favorable option for treating eyes with mild or moderate OAG. PMID:29844656
2013-01-01
Background The quality of the parent–child relationship has an important effect on a wide range of child outcomes. The evaluation of interventions to promote healthy parenting and family relationships is dependent on outcome measures which can quantify the quality of parent–child relationships. Between the Mothers’ Object Relations – Short Form (MORS-SF) scale for babies and the Child–parent Relationship Scale (C-PRS) there is an age gap where no validated scales are available. We report the development and testing of an adaptation of the MORS-SF; the MORS (Child) scale and its use in children from the age of 2 years to 4 years. This scale aims to capture the nature of the parent–child relationship in a form which is short enough to be used in population surveys and intervention evaluations. Methods Construct and criterion validity, item salience and internal consistency were assessed in a sample of 166 parents of children aged 2–4 years old and compared with that of the C-PRS. The performance of the MORS (Child) as part of a composite measure with the HOME inventory was compared with that of the C-PRS using data collected in a randomised controlled trial and the national evaluation of Sure Start. Results MORS (Child) performed well in children aged 2–4 with high construct and criterion validity, item salience and internal consistency. One item in the C-PRS failed to load on either subscale and parents found this scale slightly more difficult to complete than the MORS (Child). The two measures performed very similarly in a factor analysis with the HOME inventory producing almost identical loadings. Conclusions Adapting the MORS-SF for children aged 2–4 years old produces a scale to assess parent–child relationships that is easy to use and outperforms the more commonly used C-PRS in several respects. PMID:23518176
Hu, Rongrong; Wang, Chenkun; Gu, Yangshun; Racette, Lyne
2016-01-01
Abstract Detection of progression is paramount to the clinical management of glaucoma. Our goal is to compare the performance of standard automated perimetry (SAP), short-wavelength automated perimetry (SWAP), and frequency-doubling technology (FDT) perimetry in monitoring glaucoma progression. Longitudinal data of paired SAP, SWAP, and FDT from 113 eyes with primary open-angle glaucoma enrolled in the Diagnostic Innovations in Glaucoma Study or the African Descent and Glaucoma Evaluation Study were included. Data from all tests were expressed in comparable units by converting the sensitivity from decibels to unitless contrast sensitivity and by expressing sensitivity values in percent of mean normal based on an independent dataset of 207 healthy eyes with aging deterioration taken into consideration. Pointwise linear regression analysis was performed and 3 criteria (conservative, moderate, and liberal) were used to define progression and improvement. Global mean sensitivity (MS) was fitted with linear mixed models. No statistically significant difference in the proportion of progressing and improving eyes was observed across tests using the conservative criterion. Fewer eyes showed improvement on SAP compared to SWAP and FDT using the moderate criterion; and FDT detected less progressing eyes than SAP and SWAP using the liberal criterion. The agreement between these test types was poor. The linear mixed model showed a progressing trend of global MS overtime for SAP and SWAP, but not for FDT. The baseline estimate of SWAP MS was significantly lower than SAP MS by 21.59% of mean normal. FDT showed comparable estimation of baseline MS with SAP. SWAP and FDT do not appear to have significant benefits over SAP in monitoring glaucoma progression. SAP, SWAP, and FDT may, however, detect progression in different glaucoma eyes. PMID:26886602
NASA Astrophysics Data System (ADS)
Ma, Yuanxu; Huang, He Qing
2016-07-01
Accurate estimation of flow resistance is crucial for flood routing, flow discharge and velocity estimation, and engineering design. Various empirical and semiempirical flow resistance models have been developed during the past century; however, a universal flow resistance model for varying types of rivers has remained difficult to be achieved to date. In this study, hydrometric data sets from six stations in the lower Yellow River during 1958-1959 are used to calibrate three empirical flow resistance models (Eqs. (5)-(7)) and evaluate their predictability. A group of statistical measures have been used to evaluate the goodness of fit of these models, including root mean square error (RMSE), coefficient of determination (CD), the Nash coefficient (NA), mean relative error (MRE), mean symmetry error (MSE), percentage of data with a relative error ≤ 50% and 25% (P50, P25), and percentage of data with overestimated error (POE). Three model selection criterions are also employed to assess the model predictability: Akaike information criterion (AIC), Bayesian information criterion (BIC), and a modified model selection criterion (MSC). The results show that mean flow depth (d) and water surface slope (S) can only explain a small proportion of variance in flow resistance. When channel width (w) and suspended sediment concentration (SSC) are involved, the new model (7) achieves a better performance than the previous ones. The MRE of model (7) is generally < 20%, which is apparently better than that reported by previous studies. This model is validated using the data sets from the corresponding stations during 1965-1966, and the results show larger uncertainties than the calibrating model. This probably resulted from the temporal shift of dominant controls caused by channel change resulting from varying flow regime. With the advancements of earth observation techniques, information about channel width, mean flow depth, and suspended sediment concentration can be effectively extracted from multisource satellite images. We expect that the empirical methods developed in this study can be used as an effective surrogate in estimation of flow resistance in the large sand-bed rivers like the lower Yellow River.
Place and direction learning in a spatial T-maze task by neonatal piglets
Elmore, Monica R. P.; Dilger, Ryan N.; Johnson, Rodney W.
2013-01-01
Pigs are a valuable animal model for studying neurodevelopment in humans due to similarities in brain structure and growth. The development and validation of behavioral tests to assess learning and memory in neonatal piglets are needed. The present study evaluated the capability of 2-wk old piglets to acquire a novel place and direction learning spatial T-maze task. Validity of the task was assessed by the administration of scopolamine, an anti-cholinergic drug that acts on the hippocampus and other related structures, to impair spatial memory. During acquisition, piglets were trained to locate a milk reward in a constant place in space, as well as direction (east or west), in a plus-shaped maze using extra-maze visual cues. Following acquisition, reward location was reversed and piglets were re-tested to assess learning and working memory. The performance of control piglets in the maze improved over time (P < 0.0001), reaching performance criterion (80% correct) on day 5 of acquisition. Correct choices decreased in the reversal phase (P < 0.0001), but improved over time. In a separate study, piglets were injected daily with either phosphate buffered saline (PBS; control) or scopolamine prior to testing. Piglets administered scopolamine showed impaired performance in the maze compared to controls (P = 0.03), failing to reach performance criterion after 6 days of acquisition testing. Collectively, these data demonstrate that neonatal piglets can be tested in a spatial T-maze task to assess hippocampal-dependent learning and memory. PMID:22526690
ERIC Educational Resources Information Center
Phemister, Art W.
2010-01-01
The purpose of this study was to evaluate the effectiveness of the Georgia's Choice reading curriculum on third grade science scores on the Georgia Criterion Referenced Competency Test from 2002 to 2008. In assessing the effectiveness of the Georgia's Choice curriculum model this causal comparative study examined the 105 elementary schools that…
Michael S. Williams; Kenneth L. Cormier; Ronald G. Briggs; Donald L. Martinez
1999-01-01
Calibrated Barr & Stroud FP15 and Criterion 400 laser dendrometers were tested for reliability in measuring upper stem diameters and heights under typical field conditions. Data were collected in the Black Hills National Forest, which covers parts of South Dakota and Wyoming in the United States. Mixed effects models were employed to account for differences between...
ERIC Educational Resources Information Center
Abdekhodaie, Zahra; Tabatabaei, Seyed Mahmood; Gholizadeh, Mortaza
2012-01-01
In this study, the prevalence of attention-deficit hyperactivity disorder (ADHD) in kindergarten children in northeast Iran was investigated, and the criterion validity of Conners' parent-teacher questionnaire was evaluated through the use of clinical interviews. This study was a cross-sectional descriptive research project with children in…
ERIC Educational Resources Information Center
Al-Habashneh, Maher Hussein; Najjar, Nabil Juma
2017-01-01
This study aimed at constructing a criterion-reference test to measure the research and statistical competencies of graduate students at the Jordanian governmental universities, the test has to be in its first form of (50) multiple choice items, then the test was introduced to (5) arbitrators with competence in measurement and evaluation to…
[Criterion of dental treatment for the disabled].
Huchun, Wan; Zheng, Yang; Hongkun, Wu; Jianguo, Liu; Jin, Zhao; Xiaoping, Ji; Lin, Zhu; Deqin, Yang; Xuedong, Zhou
2017-08-01
The number of disabled persons increases in the course of human life and in the aging population. The high prevalence, low treatment rate, long therapy period, and sophisticated procedures prevent most of disabled individuals from availing dental services. Moreover, special dental institutions for the disabled are insufficient, and a certain treatment standard is commonly not complied. This study performed analysis and evaluation, including treatment features, pretreatment procedures, patient communication, treatment factors, and treatment standards to provide a targeted solution for the special requirements of the oral therapy for disabled patients.
Tomatis, Laura; Krebs, Andreas; Siegenthaler, Jessica; Murer, Kurt; de Bruin, Eling D
2015-01-01
Health is closely linked to physical activity and fitness. It is therefore important to monitor fitness in children. Although many reports on physical tests have been published, data comparison between studies is an issue. This study reports Swiss first grade norm values of fitness tests and compares these with criterion reference data. A total of 10,565 boys (7.18 ± 0.42 years) and 10,204 girls (7.14 ± 0.41 years) were tested for standing long jump, plate tapping, 20-m shuttle run, lateral jump and 20-m sprint. Average values for six-, seven- and eight-year-olds were analysed and reference curves for age were constructed. Z-values were generated for comparisons with criterion references reported in the literature. Results were better for all disciplines in seven-year-old first grade children compared to six-year-old children (p < 0.01). Eight-year-old children did not perform better compared to seven-year-old children in the sprint run (p = 0.11), standing long jump (p > 0.99) and shuttle run (p = 0.43), whereas they were better in all other disciplines compared to their younger peers. The average performance of boys was better than girls except for tapping at the age of 8 (p = 0.06). Differences in performance due to testing protocol and setting must be considered when test values from a first grade setting are compared to criterion-based benchmarks. In a classroom setting, younger children tended to have better results and older children tended to have worse outcomes when compared to their age group criterion reference values. Norm reference data are valid allowing comparison with other data generated by similar test protocols applied in a classroom setting.
Creating ensembles of decision trees through sampling
Kamath, Chandrika; Cantu-Paz, Erick
2005-08-30
A system for decision tree ensembles that includes a module to read the data, a module to sort the data, a module to evaluate a potential split of the data according to some criterion using a random sample of the data, a module to split the data, and a module to combine multiple decision trees in ensembles. The decision tree method is based on statistical sampling techniques and includes the steps of reading the data; sorting the data; evaluating a potential split according to some criterion using a random sample of the data, splitting the data, and combining multiple decision trees in ensembles.
AHP for Risk Management Based on Expected Utility Theory
NASA Astrophysics Data System (ADS)
Azuma, Rumiko; Miyagi, Hayao
This paper presents a model of decision-making considering the risk assessment. The conventional evaluation in AHP is considered to be a kind of utility. When dealing with the risk, however, it is necessary to consider the probability of damage. In order to take risk into decision-making problem, we construct AHP based on expected utility. The risk is considered as a related element of criterion rather than criterion itself. The expected utility is integrated, considering that satisfaction is positive utility and damage by risk is negative utility. Then, evaluation in AHP is executed using the expected utility.
Severity of illness index for surgical departments in a Cuban hospital: a revalidation study.
Armas-Bencomo, Amadys; Tamargo-Barbeito, Teddy Osmin; Fuentes-Valdés, Edelberto; Jiménez-Paneque, Rosa Eugenia
2017-03-08
In the context of the evaluation of hospital services, the incorporation of severity indices allows an essential control variable for performance comparisons in time and space through risk adjustment. The severity index for surgical services was developed in 1999 and validated as a general index for surgical services. Sixteen years later the hospital context is different in many ways and a revalidation was considered necessary to guarantee its current usefulness. To evaluate the validity and reliability of the surgical services severity index to warrant its reasonable use under current conditions. A descriptive study was carried out in the General Surgery service of the "Hermanos Ameijeiras" Clinical Surgical Hospital of Havana, Cuba during the second half of 2010. We reviewed the medical records of 511 patients discharged from this service. Items were the same as the original index as were their weighted values. Conceptual or construct validity, criterion validity and inter-rater reliability as well as internal consistency of the proposed index were evaluated. Construct validity was expressed as a significant association between the value of the severity index for surgical services and discharge status. A significant association was also found, although weak, with length of hospital stay. Criterion validity was demonstrated through the correlations between the severity index for surgical services and other similar indices. Regarding criterion validity, the Horn index showed a correlation of 0.722 (95% CI: 0.677-0.761) with our index. With the POSSUM score, correlation was 0.454 (95% CI: 0.388-0.514) with mortality risk and 0.539 (95% CI: 0.462-0.607) with morbidity risk. Internal consistency yielded a standardized Cronbach's alpha of 0.8; inter-rater reliability resulted in a reliability coefficient of 0.98 for the quantitative index and a weighted global Kappa coefficient of 0.87 for the ordinal surgical index of severity for surgical services (IGQ). The validity and reliability of the proposed index was satisfactory in all aspects evaluated. The surgical services severity index may be used in the original context and is easily adaptable to other contexts as well.
Shimamune, Satoru; Jitsumori, Masako
1999-01-01
In a computer-assisted sentence completion task, the effects of grammar instruction and fluency training on learning the use of the definite and indefinite articles of English were examined. Forty-eight native Japanese-speaking students were assigned to four groups: with grammar/accuracy (G/A), without grammar/accuracy (N/A), with grammar/fluency (G/F), and without grammar/fluency (N/F). In the G/A and N/A groups, training continued until performance reached 100% accuracy (accuracy criterion). In the G/F and N/F groups, training continued until 100% accuracy was reached and the correct responses were made at a high speed (fluency criterion). Grammar instruction was given to participants in the G/A and G/F groups but not to those in the N/A and N/F groups. Generalization to new sentences was tested immediately after reaching the required criterion. High levels of generalization occurred, regardless of the type of mastery criterion and whether the grammar instruction was given. Retention tests were conducted 4, 6, and 8 weeks after training. Fluency training effectively improved retention of the performance attained without the grammar instruction. This effect was diminished when grammar instruction was given during training. Learning grammatical rules was not necessary for the generalized use of appropriate definite and indefinite articles or for the maintenance of the performance attained through fluency training. PMID:22477154
Evaluation of failure criterion for graphite/epoxy fabric laminates
NASA Technical Reports Server (NTRS)
Tennyson, R. C.; Wharram, G. E.
1985-01-01
The development and application of the tensor polynomial failure criterion for composite laminate analysis is described. Emphasis is given to the fabrication and testing of Narmco Rigidite 5208-WT300, a plain weave fabric of Thornel 300 Graphite fibers impregnated with Narmco 5208 Resin. The quadratic-failure criterion with F sub 12=0 provides accurate estimates of failure stresses for the graphite/epoxy investigated. The cubic failure criterion was recast into an operationally easier form, providing design curves that can be applied to laminates fabricated from orthotropic woven fabric prepregs. In the form presented, no interaction strength tests are required, although recourse to the quadratic model and the principal strength parameters is necessary. However, insufficient test data exist at present to generalize this approach for all prepreg constructions, and its use must be restricted to the generic materials and configurations investigated to date.
NASA Astrophysics Data System (ADS)
Pommatau, Gilles
2014-06-01
The present paper deals with the industrial application, via a software developed by Thales Alenia Space, of a new failure criterion named "Tsai-Hill equivalent criterion" for composite structural parts of satellites. The first part of the paper briefly describes the main hypothesis and the possibilities in terms of failure analysis of the software. The second parts reminds the quadratic and conservative nature of the new failure criterion, already presented in ESA conference in a previous paper. The third part presents the statistical calculation possibilities of the software, and the associated sensitivity analysis, via results obtained on different composites. Then a methodology, proposed to customers and agencies, is presented with its limitations and advantages. It is then conclude that this methodology is an efficient industrial way to perform mechanical analysis on quasi-isotropic composite parts.
A comparison of two microscale laboratory reporting methods in a secondary chemistry classroom
NASA Astrophysics Data System (ADS)
Martinez, Lance Michael
This study attempted to determine if there was a difference between the laboratory achievement of students who used a modified reporting method and those who used traditional laboratory reporting. The study also determined the relationships between laboratory performance scores and the independent variables score on the Group Assessment of Logical Thinking (GALT) test, chronological age in months, gender, and ethnicity for each of the treatment groups. The study was conducted using 113 high school students who were enrolled in first-year general chemistry classes at Pueblo South High School in Colorado. The research design used was the quasi-experimental Nonequivalent Control Group Design. The statistical treatment consisted of the Multiple Regression Analysis and the Analysis of Covariance. Based on the GALT, students in the two groups were generally in the concrete and transitional stages of the Piagetian cognitive levels. The findings of the study revealed that the traditional and the modified methods of laboratory reporting did not have any effect on the laboratory performance outcome of the subjects. However, the students who used the traditional method of reporting showed a higher laboratory performance score when evaluation was conducted using the New Standards rubric recommended by the state. Multiple Regression Analysis revealed that there was a significant relationship between the criterion variable student laboratory performance outcome of individuals who employed traditional laboratory reporting methods and the composite set of predictor variables. On the contrary, there was no significant relationship between the criterion variable student laboratory performance outcome of individuals who employed modified laboratory reporting methods and the composite set of predictor variables.
The Validation of a Case-Based, Cumulative Assessment and Progressions Examination
Coker, Adeola O.; Copeland, Jeffrey T.; Gottlieb, Helmut B.; Horlen, Cheryl; Smith, Helen E.; Urteaga, Elizabeth M.; Ramsinghani, Sushma; Zertuche, Alejandra; Maize, David
2016-01-01
Objective. To assess content and criterion validity, as well as reliability of an internally developed, case-based, cumulative, high-stakes third-year Annual Student Assessment and Progression Examination (P3 ASAP Exam). Methods. Content validity was assessed through the writing-reviewing process. Criterion validity was assessed by comparing student scores on the P3 ASAP Exam with the nationally validated Pharmacy Curriculum Outcomes Assessment (PCOA). Reliability was assessed with psychometric analysis comparing student performance over four years. Results. The P3 ASAP Exam showed content validity through representation of didactic courses and professional outcomes. Similar scores on the P3 ASAP Exam and PCOA with Pearson correlation coefficient established criterion validity. Consistent student performance using Kuder-Richardson coefficient (KR-20) since 2012 reflected reliability of the examination. Conclusion. Pharmacy schools can implement internally developed, high-stakes, cumulative progression examinations that are valid and reliable using a robust writing-reviewing process and psychometric analyses. PMID:26941435
A new tracer‐density criterion for heterogeneous porous media
Barth, Gilbert R.; Illangasekare, Tissa H.; Hill, Mary C.; Rajaram, Harihar
2001-01-01
Tracer experiments provide information about aquifer material properties vital for accurate site characterization. Unfortunately, density‐induced sinking can distort tracer movement, leading to an inaccurate assessment of material properties. Yet existing criteria for selecting appropriate tracer concentrations are based on analysis of homogeneous media instead of media with heterogeneities typical of field sites. This work introduces a hydraulic‐gradient correction for heterogeneous media and applies it to a criterion previously used to indicate density‐induced instabilities in homogeneous media. The modified criterion was tested using a series of two‐dimensional heterogeneous intermediate‐scale tracer experiments and data from several detailed field tracer tests. The intermediate‐scale experimental facility (10.0×1.2×0.06 m) included both homogeneous and heterogeneous (σln k2 = 1.22) zones. The field tracer tests were less heterogeneous (0.24 < σln k2 < 0.37), but measurements were sufficient to detect density‐induced sinking. Evaluation of the modified criterion using the experiments and field tests demonstrates that the new criterion appears to account for the change in density‐induced sinking due to heterogeneity. The criterion demonstrates the importance of accounting for heterogeneity to predict density‐induced sinking and differences in the onset of density‐induced sinking in two‐ and three‐dimensional systems.
ERIC Educational Resources Information Center
Anselmo, Giancarlo A.; Yarbrough, Jamie L.; Kovaleski, Joseph F.; Tran, Vi N.
2017-01-01
This study analyzed the relationship between benchmark scores from two curriculum-based measurement probes in mathematics (M-CBM) and student performance on a state-mandated high-stakes test. Participants were 298 students enrolled in grades 7 and 8 in a rural southeastern school. Specifically, we calculated the criterion-related and predictive…
ERIC Educational Resources Information Center
Kettler, Ryan J.; Elliott, Stephen N.; Davies, Michael; Griffin, Patrick
2012-01-01
This study addresses the predictive validity of results from a screening system of academic enablers, with a sample of Australian elementary school students, when the criterion variable is end-of-year achievement. The investigation included (a) comparing the predictive validity of a brief criterion-referenced nomination system with more…
Social influences on adaptive criterion learning.
Cassidy, Brittany S; Dubé, Chad; Gutchess, Angela H
2015-07-01
People adaptively shift decision criteria when given biased feedback encouraging specific types of errors. Given that work on this topic has been conducted in nonsocial contexts, we extended the literature by examining adaptive criterion learning in both social and nonsocial contexts. Specifically, we compared potential differences in criterion shifting given performance feedback from social sources varying in reliability and from a nonsocial source. Participants became lax when given false positive feedback for false alarms, and became conservative when given false positive feedback for misses, replicating prior work. In terms of a social influence on adaptive criterion learning, people became more lax in response style over time if feedback was provided by a nonsocial source or by a social source meant to be perceived as unreliable and low-achieving. In contrast, people adopted a more conservative response style over time if performance feedback came from a high-achieving and reliable source. Awareness that a reliable and high-achieving person had not provided their feedback reduced the tendency to become more conservative, relative to those unaware of the source manipulation. Because teaching and learning often occur in a social context, these findings may have important implications for many scenarios in which people fine-tune their behaviors, given cues from others.
ATR evaluation through the synthesis of multiple performance measures
NASA Astrophysics Data System (ADS)
Bassham, Christopher B.; Klimack, William K.; Bauer, Kenneth W., Jr.
2002-07-01
This research demonstrates the application of decision analysis (DA) techniques to decisions made within Automatic Target Recognition (ATR) technology development. This work is accomplished to improve the means by which ATR technologies are evaluated. The first step in this research was to create a flexible decision analysis framework that could be applied to several decisions across different ATR programs evaluated by the Comprehensive ATR Scientific Evaluation (COMPASE) Center of the Air Force Research Laboratory (AFRL). For the purposes of this research, a single COMPASE Center representative provided the value, utility, and preference functions for the DA framework. The DA framework employs performance measures collected during ATR classification system (CS) testing to calculate value and utility scores. The authors gathered data from the Moving and Stationary Target Acquisition and Recognition (MSTAR) program to demonstrate how the decision framework could be used to evaluate three different ATR CSs. A decision-maker may use the resultant scores to gain insight into any of the decisions that occur throughout the lifecycle of ATR technologies. Additionally, a means of evaluating ATR CS self-assessment ability is presented. This represents a new criterion that emerged from this study, and no present evaluation metric is known.
Sheffer, C E; Penn, D L; Cassisi, J E
2001-01-01
The effects of self-presentation demands were evaluated through conversational probe (CP) role-play tasks. Participants (N = 29) were required to manage their self-presentations (i.e., the impression they made, in each of two conditions). During high impression management (IM) demand, participants were evaluated on their performance. During Low IM demand, participants evaluated a confederate's performance. The High IM demand condition produced significantly higher heart rate (HR) and self-reported anxiety. HR and self-reported anxiety accounted for a significant amount of the variance in criterion measures of social competence. Greater social competence during High IM was associated with higher HR. Greater social competence during Low IM was associated with lower HR and lower self-reported anxiety. Although preliminary, these results suggest that uncontrolled IM demands contributed to mixed results found within and between social anxiety studies in the literature. Implications for the treatment of social anxiety are discussed.
Coupled Multi-Disciplinary Optimization for Structural Reliability and Affordability
NASA Technical Reports Server (NTRS)
Abumeri, Galib H.; Chamis, Christos C.
2003-01-01
A computational simulation method is presented for Non-Deterministic Multidisciplinary Optimization of engine composite materials and structures. A hypothetical engine duct made with ceramic matrix composites (CMC) is evaluated probabilistically in the presence of combined thermo-mechanical loading. The structure is tailored by quantifying the uncertainties in all relevant design variables such as fabrication, material, and loading parameters. The probabilistic sensitivities are used to select critical design variables for optimization. In this paper, two approaches for non-deterministic optimization are presented. The non-deterministic minimization of combined failure stress criterion is carried out by: (1) performing probabilistic evaluation first and then optimization and (2) performing optimization first and then probabilistic evaluation. The first approach shows that the optimization feasible region can be bounded by a set of prescribed probability limits and that the optimization follows the cumulative distribution function between those limits. The second approach shows that the optimization feasible region is bounded by 0.50 and 0.999 probabilities.
Obtaining systematic teacher reports of disruptive behavior disorders utilizing DSM-IV.
Wolraich, M L; Feurer, I D; Hannah, J N; Baumgaertel, A; Pinnock, T Y
1998-04-01
This study examines the psychometric properties of the Vanderbilt AD/HD Diagnostic Teacher Rating Scale (VADTRS) and provides preliminary normative data from a large, geographically defined population. The VADTRS consists of the complete list of DSM-IV AD/HD symptoms, a screen for other disruptive behavior disorders, anxiety and depression, and ratings of academic and classroom behavior performance. Teachers in one suburban county completed the scale for their students during 2 consecutive years. Statistical methods included (a) exploratory and confirmatory latent variable analyses of item data, (b) evaluation of the internal consistency of the latent dimensions, (c) evaluation of latent structure concordance between school year samples, and (d) preliminary evaluation of criterion-related validity. The instrument comprises four behavioral dimensions and two performance dimensions. The behavioral dimensions were concordant between school years and were consistent with a priori DSM-IV diagnostic criteria. Correlations between latent dimensions and relevant, known disorders or problems varied from .25 to .66.
Evaluation of on-board hydrogen storage methods f or high-speed aircraft
NASA Technical Reports Server (NTRS)
Akyurtlu, Ates; Akyurtlu, Jale F.
1991-01-01
Hydrogen is the fuel of choice for hypersonic vehicles. Its main disadvantage is its low liquid and solid density. This increases the vehicle volume and hence the drag losses during atmospheric flight. In addition, the dry mass of the vehicle is larger due to larger vehicle structure and fuel tankage. Therefore it is very desirable to find a fuel system with smaller fuel storage requirements without deteriorating the vehicle performance substantially. To evaluate various candidate fuel systems, they were first screened thermodynamically with respect to their energy content and cooling capacities. To evaluate the vehicle performance with different fuel systems, a simple computer model is developed to compute the vehicle parameters such as the vehicle volume, dry mass, effective specific impulse, and payload capacity. The results indicate that if the payload capacity (or the gross lift-off mass) is the most important criterion, only slush hydrogen and liquid hydrogen - liquid methane gel shows better performance than the liquid hydrogen vehicle. If all the advantages of a smaller vehicle are considered and a more accurate mass analysis can be performed, other systems using endothermic fuels such as cyclohexane, and some boranes may prove to be worthy of further consideration.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Vogt, B.A.; Gabriel, M.; Vogt, L.J.
Training-induced neuronal activity develops in the mammalian limbic system during discriminative avoidance conditioning. This study explores behaviorally relevant changes in muscarinic ACh receptor binding in 52 rabbits that were trained to one of five stages of conditioned response acquisition. Sixteen naive and 10 animals yoked to criterion performance served as control cases. Upon reaching a particular stage of training, the brains were removed and autoradiographically assayed for 3H-oxotremorine-M binding with 50 nM pirenzepine (OxO-M/PZ) or for 3H-pirenzepine binding in nine limbic thalamic nuclei and cingulate cortex. Specific OxO-M/PZ binding increased in the parvocellular division of the anterodorsal nucleus early inmore » training when the animals were first exposed to pairing of the conditional and unconditional stimuli. Elevated binding in this nucleus was maintained throughout subsequent training. In the parvocellular division of the anteroventral nucleus (AVp), OxO-M/PZ binding progressively increased throughout training, reached a peak at the criterion stage of performance, and returned to control values during extinction sessions. Peak OxO-M/PZ binding in AVp was significantly elevated over that for cases yoked to criterion performance. In the magnocellular division of the anteroventral nucleus (AVm), OxO-M/PZ binding was elevated only during criterion performance of the task, and it was unaltered in any other limbic thalamic nuclei. Specific OxO-M/PZ binding was also elevated in most layers in rostral area 29c when subjects first performed a significant behavioral discrimination. Training-induced alterations in OxO-M/PZ binding in AVp and layer Ia of area 29c were similar and highly correlated.« less
Sabatini, Angelo Maria
2011-01-01
In this paper we present a quaternion-based Extended Kalman Filter (EKF) for estimating the three-dimensional orientation of a rigid body. The EKF exploits the measurements from an Inertial Measurement Unit (IMU) that is integrated with a tri-axial magnetic sensor. Magnetic disturbances and gyro bias errors are modeled and compensated by including them in the filter state vector. We employ the observability rank criterion based on Lie derivatives to verify the conditions under which the nonlinear system that describes the process of motion tracking by the IMU is observable, namely it may provide sufficient information for performing the estimation task with bounded estimation errors. The observability conditions are that the magnetic field, perturbed by first-order Gauss-Markov magnetic variations, and the gravity vector are not collinear and that the IMU is subject to some angular motions. Computer simulations and experimental testing are presented to evaluate the algorithm performance, including when the observability conditions are critical. PMID:22163689
Eleventh interim status report: Model 9975 O-Ring fixture long-term leak performance
DOE Office of Scientific and Technical Information (OSTI.GOV)
Daugherty, W.
2016-08-01
A series of experiments to monitor the aging performance of Viton® GLT O-rings used in the Model 9975 package has been ongoing since 2004 at the Savannah River National Laboratory. One approach has been to periodically evaluate the leak performance of O-rings being aged in mock-up 9975 Primary Containment Vessels (PCVs) at elevated temperature. Other methods such as compression-stress relaxation (CSR) tests and field surveillance are also on-going to evaluate O-ring behavior. Seventy tests using PCV mock-ups were assembled and heated to temperatures ranging from 200 to 450 ºF. They were leak-tested initially and have been tested periodically to determinemore » if they continue to meet the leak-tightness criterion defined in ANSI standard N14.5-97. Due to material substitution, fourteen additional tests were initiated in 2008 with GLT-S O-rings heated to temperatures ranging from 200 to 400 ºF.« less
Tenth interim status report: Model 9975 O-ring fixture long-term leak performance
DOE Office of Scientific and Technical Information (OSTI.GOV)
Daugherty, W. L.
2015-08-26
A series of experiments to monitor the aging performance of Viton ® GLT O-rings used in the Model 9975 package has been ongoing since 2004 at the Savannah River National Laboratory. One approach has been to periodically evaluate the leak performance of O-rings being aged in mock-up 9975 Primary Containment Vessels (PCVs) at elevated temperatures. Other methods such as compression-stress relaxation (CSR) tests and field surveillance are also on-going to evaluate O-ring behavior. Seventy tests using PCV mock-ups were assembled and heated to temperatures ranging from 200 to 450 °F. They were leak-tested initially and have been tested periodically tomore » determine if they continue to meet the leak-tightness criterion defined in ANSI standard N14.5-97. Due to material substitution, fourteen additional tests were initiated in 2008 with GLT-S O-rings heated to temperatures ranging from 200 to 400 °F.« less
Selecting among competing models of electro-optic, infrared camera system range performance
Nichols, Jonathan M.; Hines, James E.; Nichols, James D.
2013-01-01
Range performance is often the key requirement around which electro-optical and infrared camera systems are designed. This work presents an objective framework for evaluating competing range performance models. Model selection based on the Akaike’s Information Criterion (AIC) is presented for the type of data collected during a typical human observer and target identification experiment. These methods are then demonstrated on observer responses to both visible and infrared imagery in which one of three maritime targets was placed at various ranges. We compare the performance of a number of different models, including those appearing previously in the literature. We conclude that our model-based approach offers substantial improvements over the traditional approach to inference, including increased precision and the ability to make predictions for some distances other than the specific set for which experimental trials were conducted.
Antiwindup analysis and design approaches for MIMO systems
NASA Technical Reports Server (NTRS)
Marcopoli, Vincent R.; Phillips, Stephen M.
1994-01-01
Performance degradation of multiple-input multiple-output (MIMO) control systems having limited actuators is often handled by augmenting the controller with an antiwindup mechanism, which attempts to maintain system performance when limits are encountered. The goals of this paper are: (1) To develop a method to analyze antiwindup systems to determine precisely what stability and performance degradation is incurred under limited conditions. It is shown that by reformulating limited actuator commands as resulting from multiplicative perturbations to the corresponding controller requests, mu-analysis tools can be utilized to obtain quantitative measures of stability and performance degradation. (2) To propose a linear, time invariant (LTI) criterion on which to base the antiwindup design. These analysis and design methods are illustrated through the evaluation of two competing antiwindup schemes augmenting the controller of a Short Take-Off and Vertical Landing (STOVL) aircraft in transition flight.
Antiwindup analysis and design approaches for MIMO systems
NASA Technical Reports Server (NTRS)
Marcopoli, Vincent R.; Phillips, Stephen M.
1993-01-01
Performance degradation of multiple-input multiple-output (MIMO) control systems having limited actuators is often handled by augmenting the controller with an antiwindup mechanism, which attempts to maintain system performance when limits are encountered. The goals of this paper are: 1) to develop a method to analyze antiwindup systems to determine precisely what stability and performance degradation is incurred under limited conditions. It is shown that by reformulating limited actuator commands as resulting from multiplicative perturbations to the corresponding controller requests, mu-analysis tools can be utilized to obtain quantitative measures of stability and performance degradation. 2) To propose a linear, time invariant (LTI) criterion on which to base the antiwindup design. These analysis and design methods are illustrated through the evaluation of two competing antiwindup schemes augmenting the controller of a Short Take-Off and Vertical Landing (STOVL) aircraft in transition flight.
Evaluation of Hierarchical Clustering Algorithms for Document Datasets
2002-06-03
link, complete-link, and group average ( UPGMA )) and a new set of merging criteria derived from the six partitional criterion functions. Overall, we...used the single-link, complete-link, and UPGMA schemes, as well as, the various partitional criterion functions described in Section 3.1. The single-link...other (complete-link approach). The UPGMA scheme [16] (also known as group average) overcomes these problems by measuring the similarity of two clusters
A thermodynamic analysis of a novel bidirectional district heating and cooling network
Zarin Pass, R.; Wetter, M.; Piette, M. A.
2017-11-29
In this study, we evaluate an ambient, bidirectional thermal network, which uses a single circuit for both district heating and cooling. When in net more cooling is needed than heating, the system circulates from a central plant in one direction. When more heating is needed, the system circulates in the opposite direction. A large benefit of this design is that buildings can recover waste heat from each other directly. We analyze the thermodynamic performance of the bidirectional system. Because the bidirectional system represents the state-of-the-art in design for district systems, its peak energy efficiency represents an upper bound on themore » thermal performance of any district heating and cooling system. However, because any network has mechanical and thermal distribution losses, we develop a diversity criterion to understand when the bidirectional system may be a more energy-efficient alternative to modern individual-building systems. We show that a simple model of a low-density, high-distribution loss network is more efficient than aggregated individual buildings if there is at least 1 unit of cooling energy per 5.7 units of simultaneous heating energy (or vice versa). We apply this criterion to reference building profiles in three cities to look for promising clusters.« less
Feng, Jia; Kramer, Michael R; Dever, Bridget V; Dunlop, Anne L; Williams, Bryan; Jain, Lucky
2013-05-01
Maternal smoking during pregnancy (MSDP) has been reported to be associated with impaired measures of cognitive function, but it remains unclear whether exposure to MSDP has an impact upon offspring school performance. We examined the association between MSDP and failure of the Criterion-Referenced Competency Tests (CRCT) among Georgia first grade students. A retrospective cohort was created by deterministically linking 331 531 children born in Georgia from 1998 to 2002 (inclusive) to their individual CRCT education records from 2005 to 2009. We evaluated the association between MSDP (yes/no) and failure of the CRCT Reading, English/Language Arts (ELA), and Mathematics tests, with adjustment for maternal and child sociodemographic characteristics and birth outcomes. Log-binomial models estimated the risk ratios and 95% confidence intervals. Conditional models were fitted to paired sibling data. MSDP was associated with CRCT failure with an adjusted risk ratios for Reading: 1.16 [95% CI 1.12, 1.21]; ELA: 1.12 [95%CI 1.10, 1.15]; and Mathematics: 1.13 [95%CI 1.10, 1.16]. The association remained significant in paired sibling analyses. MSDP may have independent long-term effects on offspring school performance, which does not appear to be through smoking-related adverse birth outcomes. © 2013 Blackwell Publishing Ltd.
A thermodynamic analysis of a novel bidirectional district heating and cooling network
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zarin Pass, R.; Wetter, M.; Piette, M. A.
In this study, we evaluate an ambient, bidirectional thermal network, which uses a single circuit for both district heating and cooling. When in net more cooling is needed than heating, the system circulates from a central plant in one direction. When more heating is needed, the system circulates in the opposite direction. A large benefit of this design is that buildings can recover waste heat from each other directly. We analyze the thermodynamic performance of the bidirectional system. Because the bidirectional system represents the state-of-the-art in design for district systems, its peak energy efficiency represents an upper bound on themore » thermal performance of any district heating and cooling system. However, because any network has mechanical and thermal distribution losses, we develop a diversity criterion to understand when the bidirectional system may be a more energy-efficient alternative to modern individual-building systems. We show that a simple model of a low-density, high-distribution loss network is more efficient than aggregated individual buildings if there is at least 1 unit of cooling energy per 5.7 units of simultaneous heating energy (or vice versa). We apply this criterion to reference building profiles in three cities to look for promising clusters.« less
Mosavi, Firas; Laurell, Anna; Ahlström, Håkan
2015-11-01
Whole body (WB) magnetic resonance imaging (MRI), including diffusion-weighted imaging (DWI) has become increasingly utilized in cancer imaging, yet the clinical utility of these techniques in follow-up of testicular cancer patients has not been evaluated. The purpose of this study was to evaluate the feasibility of WB MRI with continuous table movement (CTM) technique, including multistep DWI in follow-up of patients with testicular cancer. WB MRI including DWI was performed in follow-up of 71 consecutive patients (median age, 37 years; range 19-84) with histologically confirmed testicular cancer. WB MRI protocol included axial T1-Dixon and T2-BLADE sequences using CTM technique. Furthermore, multi-step DWI was performed using b-value 50 and 1000 s/mm(2). One criterion for feasibility was patient tolerance and satisfactory image quality. Another criterion was the accuracy in detection of any pathological mass, compared to standard of reference. Signal intensity in DWI was used for evaluation of residual mass activity. Clinical, laboratory and imaging follow-up were applied as standard of reference for the evaluation of WB MRI. WB MRI was tolerated in nearly all patients (69/71 patients, 97%) and the image quality was satisfactory. Metal artifacts deteriorated the image quality in six patients, but it did not influence the overall results. No case of clinical relapse was observed during the follow-up time. There was a good agreement between conventional WB MRI and standard of reference in all patients. Three patients showed residual masses and DWI signal was not restricted in these patients. Furthermore, DWI showed abnormally high signal intensity in a normal-sized retroperitoneal lymph node indicating metastasis. The subsequent (18)F-FDG PET/CT could verify the finding. WB MRI with CTM technique including multi-step DWI is feasible in follow-up of patients with testicular cancer. DWI may contribute to important added-value data to conventional MRI sequences regarding the activity of residual masses.
Multi-objective experimental design for (13)C-based metabolic flux analysis.
Bouvin, Jeroen; Cajot, Simon; D'Huys, Pieter-Jan; Ampofo-Asiama, Jerry; Anné, Jozef; Van Impe, Jan; Geeraerd, Annemie; Bernaerts, Kristel
2015-10-01
(13)C-based metabolic flux analysis is an excellent technique to resolve fluxes in the central carbon metabolism but costs can be significant when using specialized tracers. This work presents a framework for cost-effective design of (13)C-tracer experiments, illustrated on two different networks. Linear and non-linear optimal input mixtures are computed for networks for Streptomyces lividans and a carcinoma cell line. If only glucose tracers are considered as labeled substrate for a carcinoma cell line or S. lividans, the best parameter estimation accuracy is obtained by mixtures containing high amounts of 1,2-(13)C2 glucose combined with uniformly labeled glucose. Experimental designs are evaluated based on a linear (D-criterion) and non-linear approach (S-criterion). Both approaches generate almost the same input mixture, however, the linear approach is favored due to its low computational effort. The high amount of 1,2-(13)C2 glucose in the optimal designs coincides with a high experimental cost, which is further enhanced when labeling is introduced in glutamine and aspartate tracers. Multi-objective optimization gives the possibility to assess experimental quality and cost at the same time and can reveal excellent compromise experiments. For example, the combination of 100% 1,2-(13)C2 glucose with 100% position one labeled glutamine and the combination of 100% 1,2-(13)C2 glucose with 100% uniformly labeled glutamine perform equally well for the carcinoma cell line, but the first mixture offers a decrease in cost of $ 120 per ml-scale cell culture experiment. We demonstrated the validity of a multi-objective linear approach to perform optimal experimental designs for the non-linear problem of (13)C-metabolic flux analysis. Tools and a workflow are provided to perform multi-objective design. The effortless calculation of the D-criterion can be exploited to perform high-throughput screening of possible (13)C-tracers, while the illustrated benefit of multi-objective design should stimulate its application within the field of (13)C-based metabolic flux analysis. Copyright © 2015 Elsevier Inc. All rights reserved.
Kurzeja, Patrick
2016-05-01
Modern imaging techniques, increased simulation capabilities and extended theoretical frameworks, naturally drive the development of multiscale modelling by the question: which new information should be considered? Given the need for concise constitutive relationships and efficient data evaluation; however, one important question is often neglected: which information is sufficient? For this reason, this work introduces the formalized criterion of subscale sufficiency. This criterion states whether a chosen constitutive relationship transfers all necessary information from micro to macroscale within a multiscale framework. It further provides a scheme to improve constitutive relationships. Direct application to static capillary pressure demonstrates usefulness and conditions for subscale sufficiency of saturation and interfacial areas.
A generic bio-economic farm model for environmental and economic assessment of agricultural systems.
Janssen, Sander; Louhichi, Kamel; Kanellopoulos, Argyris; Zander, Peter; Flichman, Guillermo; Hengsdijk, Huib; Meuter, Eelco; Andersen, Erling; Belhouchette, Hatem; Blanco, Maria; Borkowski, Nina; Heckelei, Thomas; Hecker, Martin; Li, Hongtao; Oude Lansink, Alfons; Stokstad, Grete; Thorne, Peter; van Keulen, Herman; van Ittersum, Martin K
2010-12-01
Bio-economic farm models are tools to evaluate ex-post or to assess ex-ante the impact of policy and technology change on agriculture, economics and environment. Recently, various BEFMs have been developed, often for one purpose or location, but hardly any of these models are re-used later for other purposes or locations. The Farm System Simulator (FSSIM) provides a generic framework enabling the application of BEFMs under various situations and for different purposes (generating supply response functions and detailed regional or farm type assessments). FSSIM is set up as a component-based framework with components representing farmer objectives, risk, calibration, policies, current activities, alternative activities and different types of activities (e.g., annual and perennial cropping and livestock). The generic nature of FSSIM is evaluated using five criteria by examining its applications. FSSIM has been applied for different climate zones and soil types (criterion 1) and to a range of different farm types (criterion 2) with different specializations, intensities and sizes. In most applications FSSIM has been used to assess the effects of policy changes and in two applications to assess the impact of technological innovations (criterion 3). In the various applications, different data sources, level of detail (e.g., criterion 4) and model configurations have been used. FSSIM has been linked to an economic and several biophysical models (criterion 5). The model is available for applications to other conditions and research issues, and it is open to be further tested and to be extended with new components, indicators or linkages to other models.
A Generic Bio-Economic Farm Model for Environmental and Economic Assessment of Agricultural Systems
Louhichi, Kamel; Kanellopoulos, Argyris; Zander, Peter; Flichman, Guillermo; Hengsdijk, Huib; Meuter, Eelco; Andersen, Erling; Belhouchette, Hatem; Blanco, Maria; Borkowski, Nina; Heckelei, Thomas; Hecker, Martin; Li, Hongtao; Oude Lansink, Alfons; Stokstad, Grete; Thorne, Peter; van Keulen, Herman; van Ittersum, Martin K.
2010-01-01
Bio-economic farm models are tools to evaluate ex-post or to assess ex-ante the impact of policy and technology change on agriculture, economics and environment. Recently, various BEFMs have been developed, often for one purpose or location, but hardly any of these models are re-used later for other purposes or locations. The Farm System Simulator (FSSIM) provides a generic framework enabling the application of BEFMs under various situations and for different purposes (generating supply response functions and detailed regional or farm type assessments). FSSIM is set up as a component-based framework with components representing farmer objectives, risk, calibration, policies, current activities, alternative activities and different types of activities (e.g., annual and perennial cropping and livestock). The generic nature of FSSIM is evaluated using five criteria by examining its applications. FSSIM has been applied for different climate zones and soil types (criterion 1) and to a range of different farm types (criterion 2) with different specializations, intensities and sizes. In most applications FSSIM has been used to assess the effects of policy changes and in two applications to assess the impact of technological innovations (criterion 3). In the various applications, different data sources, level of detail (e.g., criterion 4) and model configurations have been used. FSSIM has been linked to an economic and several biophysical models (criterion 5). The model is available for applications to other conditions and research issues, and it is open to be further tested and to be extended with new components, indicators or linkages to other models. PMID:21113782
Eblen, Matthew K; Wagner, Robin M; RoyChowdhury, Deepshikha; Patel, Katherine C; Pearson, Katrina
2016-01-01
Understanding the factors associated with successful funding outcomes of research project grant (R01) applications is critical for the biomedical research community. R01 applications are evaluated through the National Institutes of Health (NIH) peer review system, where peer reviewers are asked to evaluate and assign scores to five research criteria when assessing an application's scientific and technical merit. This study examined the relationship of the five research criterion scores to the Overall Impact score and the likelihood of being funded for over 123,700 competing R01 applications for fiscal years 2010 through 2013. The relationships of other application and applicant characteristics, including demographics, to scoring and funding outcomes were studied as well. The analyses showed that the Approach and, to a lesser extent, the Significance criterion scores were the main predictors of an R01 application's Overall Impact score and its likelihood of being funded. Applicants might consider these findings when submitting future R01 applications to NIH.
Eblen, Matthew K.; Wagner, Robin M.; RoyChowdhury, Deepshikha; Patel, Katherine C.; Pearson, Katrina
2016-01-01
Understanding the factors associated with successful funding outcomes of research project grant (R01) applications is critical for the biomedical research community. R01 applications are evaluated through the National Institutes of Health (NIH) peer review system, where peer reviewers are asked to evaluate and assign scores to five research criteria when assessing an application’s scientific and technical merit. This study examined the relationship of the five research criterion scores to the Overall Impact score and the likelihood of being funded for over 123,700 competing R01 applications for fiscal years 2010 through 2013. The relationships of other application and applicant characteristics, including demographics, to scoring and funding outcomes were studied as well. The analyses showed that the Approach and, to a lesser extent, the Significance criterion scores were the main predictors of an R01 application’s Overall Impact score and its likelihood of being funded. Applicants might consider these findings when submitting future R01 applications to NIH. PMID:27249058
Gutiérrez Sánchez, Daniel; Cuesta-Vargas, Antonio I
2018-04-01
Many measurements have been developed to assess the quality of death (QoD). Among these, the Quality of Dying and Death Questionnaire (QODD) is the most widely studied and best validated. Informal carers and health professionals who care for the patient during their last days of life can complete this assessment tool. The aim of the study is to carry out a cross-cultural adaptation and a psychometric analysis of the QODD for the Spanish population. The translation was performed using a double forward and backward method. An expert panel evaluated the content validity. The questionnaire was tested in a sample of 72 Spanish-speaking adult carers of deceased cancer patients. A psychometric analysis was performed to evaluate internal consistency, divergent criterion-related validity with the Mini-Suffering State Examination (MSSE) and concurrent criterion-related validity with the Palliative Outcome Scale (POS). Some items were deleted and modified to create the Spanish version of the QODD (QODD-ESP-26). The instrument was readable and acceptable. The content validity index was 0.96, suggesting that all items are relevant for the measure of the QoD. This questionnaire showed high internal consistency (Cronbach's α coefficient = 0.88). Divergent validity with MSSE (r = -0.64) and convergent validity with POS (r = -0.61) were also demonstrated. The QODD-ESP-26 is a valid and reliable instrument for the assessment of the QoD of deceased cancer patients that can be used in a clinical and research setting. Copyright © 2018 Elsevier Ltd. All rights reserved.
MSPocket: an orientation-independent algorithm for the detection of ligand binding pockets.
Zhu, Hongbo; Pisabarro, M Teresa
2011-02-01
Identification of ligand binding pockets on proteins is crucial for the characterization of protein functions. It provides valuable information for protein-ligand docking and rational engineering of small molecules that regulate protein functions. A major number of current prediction algorithms of ligand binding pockets are based on cubic grid representation of proteins and, thus, the results are often protein orientation dependent. We present the MSPocket program for detecting pockets on the solvent excluded surface of proteins. The core algorithm of the MSPocket approach does not use any cubic grid system to represent proteins and is therefore independent of protein orientations. We demonstrate that MSPocket is able to achieve an accuracy of 75% in predicting ligand binding pockets on a test dataset used for evaluating several existing methods. The accuracy is 92% if the top three predictions are considered. Comparison to one of the recently published best performing methods shows that MSPocket reaches similar performance with the additional feature of being protein orientation independent. Interestingly, some of the predictions are different, meaning that the two methods can be considered complementary and combined to achieve better prediction accuracy. MSPocket also provides a graphical user interface for interactive investigation of the predicted ligand binding pockets. In addition, we show that overlap criterion is a better strategy for the evaluation of predicted ligand binding pockets than the single point distance criterion. The MSPocket source code can be downloaded from http://appserver.biotec.tu-dresden.de/MSPocket/. MSPocket is also available as a PyMOL plugin with a graphical user interface.
Kinematic evaluation of the classical ballet step "plié".
Gontijo, Kaanda Nabilla Souza; Candotti, Cláudia Tarragô; Feijó, Grace Dos Santos; Ribeiro, Lais Paixão; Loss, Jefferson Fagundes
2015-06-01
Lack of alignment between the lowerlimb structures, such as the hips, knees, and longitudinal arches of the feet, has been described as an important predisposing factor in musculoskeletal injury among classical ballet dancers. However, no studies were found that analyzed basic ballet movements with quantification of objective criteria of the movements. The purposes of this study were: 1. to establish a methodology to quantify, using kinematic evaluation, the technical criteria that guide the correct execution of all phases of the plié (simultaneous flexion of the hips, knees, and ankle joints); and 2. to explore whether experienced ballet dancers respect those criteria when performing the plié. The technical criteria considered were the following: 1. midfoot stability; 2. pelvic positioning in a neutral alignment; 3. pelvic stability, represented by pelvic angle variation; and 4. vertical alignment of the knee joint with the second toe of the ipsilateral foot. Twenty dancers from Porto Alegre, Brazil, with 18 years of uninterrupted ballet training, were filmed while performing plié using four synchronized cameras. The descriptive statistical analysis involved calculating the median, minimum, and maximum of each of the technical criteria. Results showed that for criterion 1, the 20 dancers showed great stabilization of the midfoot; for criteria 2 and 3, 18 dancers displayed pelvic instability tending toward retroversion throughout execution of the plié; and for criterion 4, 13 dancers presented with medial misalignment of the knees at all phases of the plié. Using these criteria, it was possible to characterize the plié from a kinematic point of view.
Using Curriculum-Based Measurements for Program Evaluation: Expanding Roles for School Psychologists
ERIC Educational Resources Information Center
Tusing, Mary E.; Breikjern, Nicholle A.
2017-01-01
Educators increasingly need to evaluate schoolwide reform efforts; however, complex program evaluations often are not feasible in schools. Through a case example, we provide a heuristic for program evaluation that is easily replicated in schools. Criterion-referenced interpretations of schoolwide screening data were used to evaluate outcomes…
Model selection for multi-component frailty models.
Ha, Il Do; Lee, Youngjo; MacKenzie, Gilbert
2007-11-20
Various frailty models have been developed and are now widely used for analysing multivariate survival data. It is therefore important to develop an information criterion for model selection. However, in frailty models there are several alternative ways of forming a criterion and the particular criterion chosen may not be uniformly best. In this paper, we study an Akaike information criterion (AIC) on selecting a frailty structure from a set of (possibly) non-nested frailty models. We propose two new AIC criteria, based on a conditional likelihood and an extended restricted likelihood (ERL) given by Lee and Nelder (J. R. Statist. Soc. B 1996; 58:619-678). We compare their performance using well-known practical examples and demonstrate that the two criteria may yield rather different results. A simulation study shows that the AIC based on the ERL is recommended, when attention is focussed on selecting the frailty structure rather than the fixed effects.
40 CFR 35.937-4 - Solicitation and evaluation of proposals.
Code of Federal Regulations, 2010 CFR
2010-07-01
... relative importance attached to each criterion (a numerical weighted formula need not be utilized). (c) All... subpart. The grantee shall also evaluate the candidate's proposed method to accomplish the work required...
New developments in supra-threshold perimetry.
Henson, David B; Artes, Paul H
2002-09-01
To describe a series of recent enhancements to supra-threshold perimetry. Computer simulations were used to develop an improved algorithm (HEART) for the setting of the supra-threshold test intensity at the beginning of a field test, and to evaluate the relationship between various pass/fail criteria and the test's performance (sensitivity and specificity) and how they compare with modern threshold perimetry. Data were collected in optometric practices to evaluate HEART and to assess how the patient's response times can be analysed to detect false positive response errors in visual field test results. The HEART algorithm shows improved performance (reduced between-eye differences) over current algorithms. A pass/fail criterion of '3 stimuli seen of 3-5 presentations' at each test location reduces test/retest variability and combines high sensitivity and specificity. A large percentage of false positive responses can be detected by comparing their latencies to the average response time of a patient. Optimised supra-threshold visual field tests can perform as well as modern threshold techniques. Such tests may be easier to perform for novice patients, compared with the more demanding threshold tests.
Zarb, Francis; McEntee, Mark F; Rainford, Louise
2015-06-01
To evaluate visual grading characteristics (VGC) and ordinal regression analysis during head CT optimisation as a potential alternative to visual grading assessment (VGA), traditionally employed to score anatomical visualisation. Patient images (n = 66) were obtained using current and optimised imaging protocols from two CT suites: a 16-slice scanner at the national Maltese centre for trauma and a 64-slice scanner in a private centre. Local resident radiologists (n = 6) performed VGA followed by VGC and ordinal regression analysis. VGC alone indicated that optimised protocols had similar image quality as current protocols. Ordinal logistic regression analysis provided an in-depth evaluation, criterion by criterion allowing the selective implementation of the protocols. The local radiology review panel supported the implementation of optimised protocols for brain CT examinations (including trauma) in one centre, achieving radiation dose reductions ranging from 24 % to 36 %. In the second centre a 29 % reduction in radiation dose was achieved for follow-up cases. The combined use of VGC and ordinal logistic regression analysis led to clinical decisions being taken on the implementation of the optimised protocols. This improved method of image quality analysis provided the evidence to support imaging protocol optimisation, resulting in significant radiation dose savings. • There is need for scientifically based image quality evaluation during CT optimisation. • VGC and ordinal regression analysis in combination led to better informed clinical decisions. • VGC and ordinal regression analysis led to dose reductions without compromising diagnostic efficacy.
On the vibration properties of composite materials and structures
NASA Astrophysics Data System (ADS)
Lu, Y. P.; Neilson, H. C.; Roscoe, A. J.
1993-01-01
In recent years, there has been a widespread assumption that composite materials and structures offer enhanced vibration and acoustic properties. This assumption has to be evaluated or validated. The objective of this article is to address the subject of vibration characteristics and the related force transmissibility properties of composite structures. For a given composite beam made of Hercules AS4/3501-6 graphite/epoxy with a layered structure sequence of (0,0,30,-30)(sub 6S), resonance frequencies, structural damping, responses, impedances, and force transmissibility properties are determined, discussed, and compared with those of a steel beam. This article proposes a procedure to evaluate the vibration properties of individual composites. The criterion defined for performance comparison between composite materials and conventional materials is also discussed.
Spatial effect of new municipal solid waste landfill siting using different guidelines.
Ahmad, Siti Zubaidah; Ahamad, Mohd Sanusi S; Yusoff, Mohd Suffian
2014-01-01
Proper implementation of landfill siting with the right regulations and constraints can prevent undesirable long-term effects. Different countries have respective guidelines on criteria for new landfill sites. In this article, we perform a comparative study of municipal solid waste landfill siting criteria stated in the policies and guidelines of eight different constitutional bodies from Malaysia, Australia, India, U.S.A., Europe, China and the Middle East, and the World Bank. Subsequently, a geographic information system (GIS) multi-criteria evaluation model was applied to determine new suitable landfill sites using different criterion parameters using a constraint mapping technique and weighted linear combination. Application of Macro Modeler provided in the GIS-IDRISI Andes software helps in building and executing multi-step models. In addition, the analytic hierarchy process technique was included to determine the criterion weight of the decision maker's preferences as part of the weighted linear combination procedure. The differences in spatial results of suitable sites obtained signifies that dissimilarity in guideline specifications and requirements will have an effect on the decision-making process.
Complex motion measurement using genetic algorithm
NASA Astrophysics Data System (ADS)
Shen, Jianjun; Tu, Dan; Shen, Zhenkang
1997-12-01
Genetic algorithm (GA) is an optimization technique that provides an untraditional approach to deal with many nonlinear, complicated problems. The notion of motion measurement using genetic algorithm arises from the fact that the motion measurement is virtually an optimization process based on some criterions. In the paper, we propose a complex motion measurement method using genetic algorithm based on block-matching criterion. The following three problems are mainly discussed and solved in the paper: (1) apply an adaptive method to modify the control parameters of GA that are critical to itself, and offer an elitism strategy at the same time (2) derive an evaluate function of motion measurement for GA based on block-matching technique (3) employ hill-climbing (HC) method hybridly to assist GA's search for the global optimal solution. Some other related problems are also discussed. At the end of paper, experiments result is listed. We employ six motion parameters for measurement in our experiments. Experiments result shows that the performance of our GA is good. The GA can find the object motion accurately and rapidly.
NASA Astrophysics Data System (ADS)
Han, Yu-Yan; Gong, Dunwei; Sun, Xiaoyan
2015-07-01
A flow-shop scheduling problem with blocking has important applications in a variety of industrial systems but is underrepresented in the research literature. In this study, a novel discrete artificial bee colony (ABC) algorithm is presented to solve the above scheduling problem with a makespan criterion by incorporating the ABC with differential evolution (DE). The proposed algorithm (DE-ABC) contains three key operators. One is related to the employed bee operator (i.e. adopting mutation and crossover operators of discrete DE to generate solutions with good quality); the second is concerned with the onlooker bee operator, which modifies the selected solutions using insert or swap operators based on the self-adaptive strategy; and the last is for the local search, that is, the insert-neighbourhood-based local search with a small probability is adopted to improve the algorithm's capability in exploitation. The performance of the proposed DE-ABC algorithm is empirically evaluated by applying it to well-known benchmark problems. The experimental results show that the proposed algorithm is superior to the compared algorithms in minimizing the makespan criterion.
Feldhaus, Charles R; Wolter, Robert M; Hundley, Stephen P; Diemer, Tim
2006-04-01
This paper details efforts by the Purdue School of Engineering and Technology at Indiana University Purdue University Indianapolis (IUPUI) to create a single instrument for honors science, technology, engineering and mathematics (STEM) students wishing to demonstrate competence in the IUPUI Principles of Undergraduate Learning (PUL's) and Accreditation Board for Engineering and Technology (ABET) Engineering Accreditation Criterion (EAC) and Technology Accreditation Criterion (TAC) 2, a through k. Honors courses in Human Behavior, Ethical Decision-Making, Applied Leadership, International Issues and Leadership Theories and Processes were created along with a specific menu of activities and an assessment rubric based on PUL's and ABET criteria to evaluate student performance in the aforementioned courses. Students who complete the series of 18 Honors Credit hours are eligible for an Honors Certificate in Leadership Studies from the Department of Organizational Leadership and Supervision. Finally, an accounting of how various university assessment criteria, in this case the IUPUI Principles of Undergraduate Learning, can be linked to ABET outcomes and prove student competence in both, using the aforementioned courses, menu of items, and assessment rubrics; these will be analyzed and discussed.
Leckman, James F.; Denys, Damiaan; Simpson, H. Blair; Mataix-Cols, David; Hollander, Eric; Saxena, Sanjaya; Miguel, Euripedes C.; Rauch, Scott L.; Goodman, Wayne K.; Phillips, Katharine A.; Stein, Dan J.
2014-01-01
Background Since the publication of the DSM-IV in 1994, research on obsessive–compulsive disorder (OCD) has continued to expand. It is timely to reconsider the nosology of this disorder, assessing whether changes to diagnostic criteria as well as subtypes and specifiers may improve diagnostic validity and clinical utility. Methods The existing criteria were evaluated. Key issues were identified. Electronic databases of PubMed, ScienceDirect, and PsycINFO were searched for relevant studies. Results This review presents a number of options and preliminary recommendations to be considered for DSM-V. These include: (1) clarifying and simplifying the definition of obsessions and compulsions(criterion A); (2) possibly deleting the requirement that people recognize that their obsessions or compulsions are excessive or unreasonable (criterion B); (3) rethinking the clinical significance criterion (criterion C) and, in the interim, possibly adjusting what is considered “time-consuming” for OCD; (4) listing additional disorders to help with the differential diagnosis (criterion D); (5) rethinking the medical exclusion criterion (criterion E) and clarifying what is meant by a “general medical condition”; (6) revising the specifiers (i.e., clarifying that OCD can involve a range of insight, in addition to “poor insight,” and adding “tic-related OCD”); and (7) highlighting in the DSM-V text important clinical features of OCD that are not currently mentioned in the criteria (e.g., the major symptom dimensions). Conclusions A number of changes to the existing diagnostic criteria for OCD are proposed. These proposed criteria may change as the DSM-V process progresses. PMID:20217853
Leckman, James F; Denys, Damiaan; Simpson, H Blair; Mataix-Cols, David; Hollander, Eric; Saxena, Sanjaya; Miguel, Euripedes C; Rauch, Scott L; Goodman, Wayne K; Phillips, Katharine A; Stein, Dan J
2010-06-01
Since the publication of the DSM-IV in 1994, research on obsessive-compulsive disorder (OCD) has continued to expand. It is timely to reconsider the nosology of this disorder, assessing whether changes to diagnostic criteria as well as subtypes and specifiers may improve diagnostic validity and clinical utility. The existing criteria were evaluated. Key issues were identified. Electronic databases of PubMed, ScienceDirect, and PsycINFO were searched for relevant studies. This review presents a number of options and preliminary recommendations to be considered for DSM-V. These include: (1) clarifying and simplifying the definition of obsessions and compulsions (criterion A); (2) possibly deleting the requirement that people recognize that their obsessions or compulsions are excessive or unreasonable (criterion B); (3) rethinking the clinical significance criterion (criterion C) and, in the interim, possibly adjusting what is considered "time-consuming" for OCD; (4) listing additional disorders to help with the differential diagnosis (criterion D); (5) rethinking the medical exclusion criterion (criterion E) and clarifying what is meant by a "general medical condition"; (6) revising the specifiers (i.e., clarifying that OCD can involve a range of insight, in addition to "poor insight," and adding "tic-related OCD"); and (7) highlighting in the DSM-V text important clinical features of OCD that are not currently mentioned in the criteria (e.g., the major symptom dimensions). A number of changes to the existing diagnostic criteria for OCD are proposed. These proposed criteria may change as the DSM-V process progresses. (c) 2010 Wiley-Liss, Inc.
Strömberg, Eric A; Nyberg, Joakim; Hooker, Andrew C
2016-12-01
With the increasing popularity of optimal design in drug development it is important to understand how the approximations and implementations of the Fisher information matrix (FIM) affect the resulting optimal designs. The aim of this work was to investigate the impact on design performance when using two common approximations to the population model and the full or block-diagonal FIM implementations for optimization of sampling points. Sampling schedules for two example experiments based on population models were optimized using the FO and FOCE approximations and the full and block-diagonal FIM implementations. The number of support points was compared between the designs for each example experiment. The performance of these designs based on simulation/estimations was investigated by computing bias of the parameters as well as through the use of an empirical D-criterion confidence interval. Simulations were performed when the design was computed with the true parameter values as well as with misspecified parameter values. The FOCE approximation and the Full FIM implementation yielded designs with more support points and less clustering of sample points than designs optimized with the FO approximation and the block-diagonal implementation. The D-criterion confidence intervals showed no performance differences between the full and block diagonal FIM optimal designs when assuming true parameter values. However, the FO approximated block-reduced FIM designs had higher bias than the other designs. When assuming parameter misspecification in the design evaluation, the FO Full FIM optimal design was superior to the FO block-diagonal FIM design in both of the examples.
Lin, Keh-chung; Chen, Hui-fang; Chen, Chia-ling; Wang, Tien-ni; Wu, Ching-yi; Hsieh, Yu-wei; Wu, Li-ling
2012-01-01
This study examined criterion-related validity and clinimetric properties of the Pediatric Motor Activity Log (PMAL) in children with cerebral palsy. Study participants were 41 children (age range: 28-113 months) and their parents. Criterion-related validity was evaluated by the associations between the PMAL and criterion measures at baseline and posttreatment, including the self-care, mobility, and cognition subscale, the total performance of the Functional Independence Measure in children (WeeFIM), and the grasping and visual-motor integration of the Peabody Developmental Motor Scales. Pearson correlation coefficients were calculated. Responsiveness was examined using the paired t test and the standardized response mean, the minimal detectable change was captured at the 90% confidence level, and the minimal clinically important change was estimated using anchor-based and distribution-based approaches. The PMAL-QOM showed fair concurrent validity at pretreatment and posttreatment and predictive validity, whereas the PMAL-AOU had fair concurrent validity at posttreatment only. The PMAL-AOU and PMAL-QOM were both markedly responsive to change after treatment. Improvement of at least 0.67 points on the PMAL-AOU and 0.66 points on the PMAL-QOM can be considered as a true change, not measurement error. A mean change has to exceed the range of 0.39-0.94 on the PMAL-AOU and the range of 0.38-0.74 on the PMAL-QOM to be regarded as clinically important change. Copyright © 2011 Elsevier Ltd. All rights reserved.
Modeling of cw OIL energy performance based on similarity criteria
NASA Astrophysics Data System (ADS)
Mezhenin, Andrey V.; Pichugin, Sergey Y.; Azyazov, Valeriy N.
2012-01-01
A simplified two-level generation model predicts that power extraction from an cw oxygen-iodine laser (OIL) with stable resonator depends on three similarity criteria. Criterion τd is the ratio of the residence time of active medium in the resonator to the O2(1Δ) reduction time at the infinitely large intraresonator intensity. Criterion Π is small-signal gain to the threshold ratio. Criterion Λ is the relaxation to excitation rate ratio for the electronically excited iodine atoms I(2P1/2). Effective power extraction from a cw OIL is achieved when the values of the similarity criteria are located in the intervals: τd=5-8, Π=3-8 and Λ<=0.01.
42 CFR 421.122 - Performance standards.
Code of Federal Regulations, 2010 CFR
2010-10-01
... performance, application of acceptable statistical measures of variation to nationwide intermediary experience... or criterion. (b) Factors beyond intermediary's control. To identify measurable factors that significantly affect an intermediary's performance, but that are not within the intermediary's control, CMS will...
THESEUS: maximum likelihood superpositioning and analysis of macromolecular structures
Theobald, Douglas L.; Wuttke, Deborah S.
2008-01-01
Summary THESEUS is a command line program for performing maximum likelihood (ML) superpositions and analysis of macromolecular structures. While conventional superpositioning methods use ordinary least-squares (LS) as the optimization criterion, ML superpositions provide substantially improved accuracy by down-weighting variable structural regions and by correcting for correlations among atoms. ML superpositioning is robust and insensitive to the specific atoms included in the analysis, and thus it does not require subjective pruning of selected variable atomic coordinates. Output includes both likelihood-based and frequentist statistics for accurate evaluation of the adequacy of a superposition and for reliable analysis of structural similarities and differences. THESEUS performs principal components analysis for analyzing the complex correlations found among atoms within a structural ensemble. PMID:16777907
2016-01-01
Modern imaging techniques, increased simulation capabilities and extended theoretical frameworks, naturally drive the development of multiscale modelling by the question: which new information should be considered? Given the need for concise constitutive relationships and efficient data evaluation; however, one important question is often neglected: which information is sufficient? For this reason, this work introduces the formalized criterion of subscale sufficiency. This criterion states whether a chosen constitutive relationship transfers all necessary information from micro to macroscale within a multiscale framework. It further provides a scheme to improve constitutive relationships. Direct application to static capillary pressure demonstrates usefulness and conditions for subscale sufficiency of saturation and interfacial areas. PMID:27279769
Study of Multimission Modular Spacecraft (MMS) propulsion requirements
NASA Technical Reports Server (NTRS)
Fischer, N. H.; Tischer, A. E.
1977-01-01
The cost effectiveness of various propulsion technologies for shuttle-launched multimission modular spacecraft (MMS) missions was determined with special attention to the potential role of ion propulsion. The primary criterion chosen for comparison for the different types of propulsion technologies was the total propulsion related cost, including the Shuttle charges, propulsion module costs, upper stage costs, and propulsion module development. In addition to the cost comparison, other criteria such as reliability, risk, and STS compatibility are examined. Topics covered include MMS mission models, propulsion technology definition, trajectory/performance analysis, cost assessment, program evaluation, sensitivity analysis, and conclusions and recommendations.
Lassau, Nathalie; Chapotot, Louis; Benatsou, Baya; Vilgrain, Valérie; Kind, Michèle; Lacroix, Joëlle; Cuinet, Marie; Taieb, Sophie; Aziza, Richard; Sarran, Antony; Labbe, Catherine; Gallix, Benoît; Lucidarme, Olivier; Ptak, Yvette; Rocher, Laurence; Caquot, Louis Michel; Chagnon, Sophie; Marion, Denis; Luciani, Alain; Uzan-Augui, Joëlle; Koscielny, Serge
2012-12-01
The objectives of this study are to describe the standardization and dissemination of dynamic contrast-enhanced ultrasound (DCE-US) for the evaluation of antiangiogenic treatments in solid tumors across 19 oncology centers in France and to define a quality score to account for the variability of the evaluation criteria used to collect DCE-US data. This prospective Soutien aux Techniques Innovantes Coûteuses (Support for Innovative and Expensive Techniques) DCE-US study included patients with metastatic breast cancer, melanoma, colon cancer, gastrointestinal stromal tumors, renal cell carcinoma and patients with primary hepatocellular carcinoma tumors treated with antiangiogenic therapy. The DCE-US method was made available across 19 oncology centers in France. Overall, 2339 DCE-US examinations were performed by 65 radiologists in 539 patients.One target site per patient was studied. Standardized DCE-US examinations were performed before treatment (day 0) and at days 7, 15, 30, and 60. Dynamic contrast-enhanced ultrasound data were transferred from the different sites to the main study center at the Institut Gustave-Roussy for analysis. Quantitative analyses were performed with a mathematical model to determine 7 DCE-US functional parameters using raw linear data. Radiologists had to evaluate 6 criteria that were potentially linked to the precision of the evaluation of these parameters: lesion size, target motion, loss of target, clear borders, total acquisition of wash-in, and vascular recognition imaging window adapted to the lesion size.Eighteen DCE-US examinations were randomly selected from the Soutien aux Techniques Innovantes Coûteuses (Support for Innovative and Expensive Techniques) database. Each examination was quantified twice by 8 engineers/radiologists trained to evaluate the perfusion parameters. The intraobserver variability was estimated on the basis of differences between examinations performed by the same radiologist. The mean coefficient of variability associated with each quality criterion was estimated. The final quality score, ranging from 0 to 5, was defined according to the value of coefficient of variability for each criterion. A total of 2062 examinations were stored with raw linear data. Five criteria were found to have a major impact on quality: lesion size, motion, loss of target, borders, and total acquisition of wash-in. Only 3% of the examinations were of poor quality (quality of 0); quality was correlated with the radiologists' experience, such that it was significantly higher for radiologists who had performed more than 60 DCE-US examinations (P < 0.0001). The DCE-US methodology has been successfully provided to several centers across France together with strict rules for quality assessment. Only 3% of examinations carried out at these centers were considered not interpretable.
Ductile Crack Initiation Criterion with Mismatched Weld Joints Under Dynamic Loading Conditions.
An, Gyubaek; Jeong, Se-Min; Park, Jeongung
2018-03-01
Brittle failure of high toughness steel structures tends to occur after ductile crack initiation/propagation. Damages to steel structures were reported in the Hanshin Great Earthquake. Several brittle failures were observed in beam-to-column connection zones with geometrical discontinuity. It is widely known that triaxial stresses accelerate the ductile fracture of steels. The study examined the effects of geometrical heterogeneity and strength mismatches (both of which elevate plastic constraints due to heterogeneous plastic straining) and loading rate on critical conditions initiating ductile fracture. This involved applying the two-parameter criterion (involving equivalent plastic strain and stress triaxiality) to estimate ductile cracking for strength mismatched specimens under static and dynamic tensile loading conditions. Ductile crack initiation testing was conducted under static and dynamic loading conditions using circumferentially notched specimens (Charpy type) with/without strength mismatches. The results indicated that the condition for ductile crack initiation using the two parameter criterion was a transferable criterion to evaluate ductile crack initiation independent of the existence of strength mismatches and loading rates.
Yu, Yuncui; Jia, Lulu; Meng, Yao; Hu, Lihua; Liu, Yiwei; Nie, Xiaolu; Zhang, Meng; Zhang, Xuan; Han, Sheng; Peng, Xiaoxia; Wang, Xiaoling
2018-04-01
Establishing a comprehensive clinical evaluation system is critical in enacting national drug policy and promoting rational drug use. In China, the 'Clinical Comprehensive Evaluation System for Pediatric Drugs' (CCES-P) project, which aims to compare drugs based on clinical efficacy and cost effectiveness to help decision makers, was recently proposed; therefore, a systematic and objective method is required to guide the process. An evidence-based multi-criteria decision analysis model that involved an analytic hierarchy process (AHP) was developed, consisting of nine steps: (1) select the drugs to be reviewed; (2) establish the evaluation criterion system; (3) determine the criterion weight based on the AHP; (4) construct the evidence body for each drug under evaluation; (5) select comparative measures and calculate the original utility score; (6) place a common utility scale and calculate the standardized utility score; (7) calculate the comprehensive utility score; (8) rank the drugs; and (9) perform a sensitivity analysis. The model was applied to the evaluation of three different inhaled corticosteroids (ICSs) used for asthma management in children (a total of 16 drugs with different dosage forms and strengths or different manufacturers). By applying the drug analysis model, the 16 ICSs under review were successfully scored and evaluated. Budesonide suspension for inhalation (drug ID number: 7) ranked the highest, with comprehensive utility score of 80.23, followed by fluticasone propionate inhaled aerosol (drug ID number: 16), with a score of 79.59, and budesonide inhalation powder (drug ID number: 6), with a score of 78.98. In the sensitivity analysis, the ranking of the top five and lowest five drugs remains unchanged, suggesting this model is generally robust. An evidence-based drug evaluation model based on AHP was successfully developed. The model incorporates sufficient utility and flexibility for aiding the decision-making process, and can be a useful tool for the CCES-P.
CRITICALITY SAFETY CONTROLS AND THE SAFETY BASIS AT PFP
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kessler, S
2009-04-21
With the implementation of DOE Order 420.1B, Facility Safety, and DOE-STD-3007-2007, 'Guidelines for Preparing Criticality Safety Evaluations at Department of Energy Non-Reactor Nuclear Facilities', a new requirement was imposed that all criticality safety controls be evaluated for inclusion in the facility Documented Safety Analysis (DSA) and that the evaluation process be documented in the site Criticality Safety Program Description Document (CSPDD). At the Hanford site in Washington State the CSPDD, HNF-31695, 'General Description of the FH Criticality Safety Program', requires each facility develop a linking document called a Criticality Control Review (CCR) to document performance of these evaluations. Chapter 5,more » Appendix 5B of HNF-7098, Criticality Safety Program, provided an example of a format for a CCR that could be used in lieu of each facility developing its own CCR. Since the Plutonium Finishing Plant (PFP) is presently undergoing Deactivation and Decommissioning (D&D), new procedures are being developed for cleanout of equipment and systems that have not been operated in years. Existing Criticality Safety Evaluations (CSE) are revised, or new ones written, to develop the controls required to support D&D activities. Other Hanford facilities, including PFP, had difficulty using the basic CCR out of HNF-7098 when first implemented. Interpretation of the new guidelines indicated that many of the controls needed to be elevated to TSR level controls. Criterion 2 of the standard, requiring that the consequence of a criticality be examined for establishing the classification of a control, was not addressed. Upon in-depth review by PFP Criticality Safety staff, it was not clear that the programmatic interpretation of criterion 8C could be applied at PFP. Therefore, the PFP Criticality Safety staff decided to write their own CCR. The PFP CCR provides additional guidance for the evaluation team to use by clarifying the evaluation criteria in DOE-STD-3007-2007. In reviewing documents used in classifying controls for Nuclear Safety, it was noted that DOE-HDBK-1188, 'Glossary of Environment, Health, and Safety Terms', defines an Administrative Control (AC) in terms that are different than typically used in Criticality Safety. As part of this CCR, a new term, Criticality Administrative Control (CAC) was defined to clarify the difference between an AC used for criticality safety and an AC used for nuclear safety. In Nuclear Safety terms, an AC is a provision relating to organization and management, procedures, recordkeeping, assessment, and reporting necessary to ensure safe operation of a facility. A CAC was defined as an administrative control derived in a criticality safety analysis that is implemented to ensure double contingency. According to criterion 2 of Section IV, 'Linkage to the Documented Safety Analysis', of DOESTD-3007-2007, the consequence of a criticality should be examined for the purposes of classifying the significance of a control or component. HNF-PRO-700, 'Safety Basis Development', provides control selection criteria based on consequence and risk that may be used in the development of a Criticality Safety Evaluation (CSE) to establish the classification of a component as a design feature, as safety class or safety significant, i.e., an Engineered Safety Feature (ESF), or as equipment important to safety; or merely provides defense-in-depth. Similar logic is applied to the CACs. Criterion 8C of DOE-STD-3007-2007, as written, added to the confusion of using the basic CCR from HNF-7098. The PFP CCR attempts to clarify this criterion by revising it to say 'Programmatic commitments or general references to control philosophy (e.g., mass control or spacing control or concentration control as an overall control strategy for the process without specific quantification of individual limits) is included in the PFP DSA'. Table 1 shows the PFP methodology for evaluating CACs. This evaluation process has been in use since February of 2008 and has proven to be simple and effective. Each control identified in the applicable new/revised CSE is evaluated via the table. The results of this evaluation are documented in tables attached to the CCR as an appendix, for each CSE, to the base document.« less
Decision Criterion Dynamics in Animals Performing an Auditory Detection Task
Mill, Robert W.; Alves-Pinto, Ana; Sumner, Christian J.
2014-01-01
Classical signal detection theory attributes bias in perceptual decisions to a threshold criterion, against which sensory excitation is compared. The optimal criterion setting depends on the signal level, which may vary over time, and about which the subject is naïve. Consequently, the subject must optimise its threshold by responding appropriately to feedback. Here a series of experiments was conducted, and a computational model applied, to determine how the decision bias of the ferret in an auditory signal detection task tracks changes in the stimulus level. The time scales of criterion dynamics were investigated by means of a yes-no signal-in-noise detection task, in which trials were grouped into blocks that alternately contained easy- and hard-to-detect signals. The responses of the ferrets implied both long- and short-term criterion dynamics. The animals exhibited a bias in favour of responding “yes” during blocks of harder trials, and vice versa. Moreover, the outcome of each single trial had a strong influence on the decision at the next trial. We demonstrate that the single-trial and block-level changes in bias are a manifestation of the same criterion update policy by fitting a model, in which the criterion is shifted by fixed amounts according to the outcome of the previous trial and decays strongly towards a resting value. The apparent block-level stabilisation of bias arises as the probabilities of outcomes and shifts on single trials mutually interact to establish equilibrium. To gain an intuition into how stable criterion distributions arise from specific parameter sets we develop a Markov model which accounts for the dynamic effects of criterion shifts. Our approach provides a framework for investigating the dynamics of decisions at different timescales in other species (e.g., humans) and in other psychological domains (e.g., vision, memory). PMID:25485733
Quantitative criteria for assessment of gamma-ray imager performance
NASA Astrophysics Data System (ADS)
Gottesman, Steve; Keller, Kristi; Malik, Hans
2015-08-01
In recent years gamma ray imagers such as the GammaCamTM and Polaris have demonstrated good imaging performance in the field. Imager performance is often summarized as "resolution", either angular, or spatial at some distance from the imager, however the definition of resolution is not always related to the ability to image an object. It is difficult to quantitatively compare imagers without a common definition of image quality. This paper examines three categories of definition: point source; line source; and area source. It discusses the details of those definitions and which ones are more relevant for different situations. Metrics such as Full Width Half Maximum (FWHM), variations on the Rayleigh criterion, and some analogous to National Imagery Interpretability Rating Scale (NIIRS) are discussed. The performance against these metrics is evaluated for a high resolution coded aperture imager modeled using Monte Carlo N-Particle (MCNP), and for a medium resolution imager measured in the lab.
van Wesel, Maarten
2016-02-01
Criteria for the evaluation of most scholars' work have recently received wider attention due to high-profile cases of scientific misconduct which are perceived to be linked to these criteria. However, in the competition for career advancement and funding opportunities almost all scholars are subjected to the same criteria. Therefore these evaluation criteria act as 'switchmen', determining the tracks along which scholarly work is pushed by the dynamic interplay of interests of both scholars and their institutions. Currently one of the most important criteria is the impact of publications. In this research, the extent to which publish or perish, a long standing evaluation criterion, led to scientific misconduct is examined briefly. After this the strive for high impact publications will be examined, firstly by identifying the period in which this became an important evaluation criterion, secondly by looking at variables contributing to the impact of scholarly papers by means of a non-structured literature study, and lastly by combining these data into a quantitative analysis.
Abbas, Ismail; Rovira, Joan; Casanovas, Josep
2006-12-01
To develop and validate a model of a clinical trial that evaluates the changes in cholesterol level as a surrogate marker for lipodystrophy in HIV subjects under alternative antiretroviral regimes, i.e., treatment with Protease Inhibitors vs. a combination of nevirapine and other antiretroviral drugs. Five simulation models were developed based on different assumptions, on treatment variability and pattern of cholesterol reduction over time. The last recorded cholesterol level, the difference from the baseline, the average difference from the baseline and level evolution, are the considered endpoints. Specific validation criteria based on a 10% minus or plus standardized distance in means and variances were used to compare the real and the simulated data. The validity criterion was met by all models for considered endpoints. However, only two models met the validity criterion when all endpoints were considered. The model based on the assumption that within-subjects variability of cholesterol levels changes over time is the one that minimizes the validity criterion, standardized distance equal to or less than 1% minus or plus. Simulation is a useful technique for calibration, estimation, and evaluation of models, which allows us to relax the often overly restrictive assumptions regarding parameters required by analytical approaches. The validity criterion can also be used to select the preferred model for design optimization, until additional data are obtained allowing an external validation of the model.
Comer, Jonathan S; Pincus, Donna B; Hofmann, Stefan G
2012-12-01
A current proposal for the DSM-5 general anxiety disorder (GAD) definition is to remove fatigue, difficulty concentrating, irritability, and sleep disturbance from the list of associated symptoms, and to require the presence of one of two retained symptoms (restlessness or muscle tension) for diagnosis. Relevant evaluations in youth to support such a change are sparse. The present study evaluated patterns and correlates of the DSM-IV GAD associated symptoms in a large outpatient sample of anxious youth (N = 650) to empirically consider how the proposed diagnostic change might impact the prevalence and sample composition of GAD in children. Logistic regression found irritability to be the most associated, and restlessness to be the least associated, with GAD diagnosis. Fatigue, difficulty concentrating, and sleep disturbances-which have each been suggested to be nonspecific to GAD due to their prevalence in depression-showed sizable associations with GAD even after accounting for depression and attention problems. Among GAD youth, 10.9% would not meet the proposed DSM-5 associated symptoms criterion. These children were comparable to GAD youth who would meet the proposed criteria with regard to clinical severity, symptomatology, and functioning. A substantial proportion of youth with excessive, clinically impairing worry may be left unclassified by the DSM-5 if the proposed GAD associated symptoms criterion is adopted. Despite support for the proposed criterion change in adult samples, the present findings suggest that in children it may increase the false negative rate. This calls into question whether the proposed associated symptoms criterion is optimal for defining childhood GAD. © 2012 Wiley Periodicals, Inc.
[Acoustic conditions in open plan offices - Pilot test results].
Mikulski, Witold
The main source of noise in open plan office are conversations. Office work standards in such premises are attained by applying specific acoustic adaptation. This article presents the results of pilot tests and acoustic evaluation of open space rooms. Acoustic properties of 6 open plan office rooms were the subject of the tests. Evaluation parameters, measurement methods and criterial values were adopted according to the following standards: PN-EN ISO 3382- 3:2012, PN-EN ISO 3382-2:2010, PN-B-02151-4:2015-06 and PN-B-02151-3:2015-10. The reverberation time was 0.33- 0.55 s (maximum permissible value in offices - 0.6 s; the criterion was met), sound absorption coefficient in relation to 1 m2 of the room's plan was 0.77-1.58 m2 (minimum permissible value - 1.1 m2; 2 out of 6 rooms met the criterion), distraction distance was 8.5-14 m (maximum permissible value - 5 m; none of the rooms met the criterion), A-weighted sound pressure level of speech at a distance of 4 m was 43.8-54.7 dB (maximum permissible value - 48 dB; 2 out of 6 rooms met the criterion), spatial decay rate of the speech was 1.8-6.3 dB (minimum permissible value - 7 dB; none of the rooms met the criterion). Standard acoustic treatment, containing sound absorbing suspended ceiling, sound absorbing materials on the walls, carpet flooring and sound absorbing workplace barriers, is not sufficient. These rooms require specific advanced acoustic solutions. Med Pr 2016;67(5):653-662. This work is available in Open Access model and licensed under a CC BY-NC 3.0 PL license.
Comer, Jonathan S.; Pincus, Donna B.; Hofmann, Stefan G.
2012-01-01
Background A current proposal for the DSM-5 generalized anxiety disorder (GAD) definition is to remove fatigue, difficulty concentrating, irritability, and sleep disturbance from the list of associated symptoms, and to require the presence of one of two retained symptoms (restlessness or muscle tension) for diagnosis. Relevant evaluations in youth to support such a change are sparse. Methods The present study evaluated patterns and correlates of the DSM-IV GAD associated symptoms in a large outpatient sample of anxious youth (N=650) to empirically consider how the proposed diagnostic change might impact the prevalence and sample composition of GAD in children. Results Logistic regression found irritability to be the most associated, and restlessness to be the least associated, with GAD diagnosis. Fatigue, difficulty concentrating, and sleep disturbances—which have each been suggested to be nonspecific to GAD due to their prevalence in depression—showed sizable associations with GAD even after accounting for depression and attention problems. Among GAD youth, 10.9% would not meet the proposed DSM-5 associated symptoms criterion. These children were comparable to GAD youth who would meet the proposed criteria with regard to clinical severity, symptomatology, and functioning. Conclusions A substantial proportion of youth with excessive, clinically impairing worry may be left unclassified by the DSM-5 if the proposed GAD associated symptoms criterion is adopted. Despite support for the proposed criterion change in adult samples, the present findings suggest that in children it may increase the false negative rate. This calls into question whether the proposed associated symptoms criterion is optimal for defining childhood GAD. PMID:22952043
Acquisition of control skill with delayed and compensated displays.
Ricard, G L
1995-09-01
The difficulty of mastering a two-axis, compensatory, manual control task was manipulated by introducing transport delays into the feedback loop of the controlled element. Realistic aircraft dynamics were used. Subjects' display was a simulation of an "inside-out" artificial horizon instrument perturbed by atmospheric turbulence. The task was to maintain straight and level flight, and delays tested were representative of those found in current training simulators. Delay compensations in the form of first-order lead and first-order lead/lag transfer functions, along with an uncompensated condition, were factorially combined with added delays. Subjects were required to meet a relatively strict criterion for performance. Control activity showed no differences during criterion performance, but the trials needed to achieve the criterion were linearly related to the magnitude of the delay and the compensation condition. These data were collected in the context of aircraft attitude control, but the results can be applied to the simulation of other vehicles, to remote manipulation, and to maneuvering in graphical environments.
NASA Astrophysics Data System (ADS)
Kou, Jiaqing; Le Clainche, Soledad; Zhang, Weiwei
2018-01-01
This study proposes an improvement in the performance of reduced-order models (ROMs) based on dynamic mode decomposition to model the flow dynamics of the attractor from a transient solution. By combining higher order dynamic mode decomposition (HODMD) with an efficient mode selection criterion, the HODMD with criterion (HODMDc) ROM is able to identify dominant flow patterns with high accuracy. This helps us to develop a more parsimonious ROM structure, allowing better predictions of the attractor dynamics. The method is tested in the solution of a NACA0012 airfoil buffeting in a transonic flow, and its good performance in both the reconstruction of the original solution and the prediction of the permanent dynamics is shown. In addition, the robustness of the method has been successfully tested using different types of parameters, indicating that the proposed ROM approach is a tool promising for using in both numerical simulations and experimental data.
Joven, Beatriz E; Escribano, Pilar; Andreu, Jose Luis; Loza, Estibaliz; Jimenez, Carmen; de Yebenes, M Jesus Garcia; Ruiz-Cano, M Jose; Carmona, Loreto; Carreira, Patricia E
2018-06-01
To analyze the performance of the 1980 ACR and new 2013 ACR/EULAR criteria for systemic sclerosis (SSc) in cutaneous SSc (lcSSc) patients, especially those affected by lcSSc and pulmonary arterial hypertension (PAH). All patients with a clinical lcSSc diagnosis from a prospective observational SSc cohort were included. Sociodemographic and disease-related variables were collected, and PAH confirmed by right heart catheterization (RHC). Performance of the 2013 and 1980 SSc criteria was analyzed in terms of clinical diagnosis. Descriptive and between-group analyses were performed as to the fulfillment of criterion sets, including comparison of survival. Overall, 321 patients were included, 63% of whom fulfilled the 1980 ACR and 93% the 2013 ACR/EULAR criteria. Agreement between both criteria sets proved poor (κ = 0.23). LcSSC patients fulfilling both criterion sets were significantly younger at diagnosis, whilst presenting organ involvement, calcinosis, fingertip digital ulcers, and pitting scars more frequently than those who met the 2013 criteria only. Patients who fulfilled the 2013 but not the 1980 criteria presented a higher degree of ACA positivity and PAH. Nearly 12% of patients developed PAH. Patients who did not meet the 1980 criteria were affected by a milder disease from but demonstrated higher pulmonary vascular resistance and lower cardiac index than those fulfilling both criterion sets. Whereas patients with PAH met the 2013 criteria, only 47% fulfilled the 1980 criteria. Regardless of criterion set fulfillment, high mortality was observed in PAH patients, with no significant between-patient difference based on criterion set. The new 2013 ARC/EULAR criteria prove more accurate than the former 1980 ACR criteria in identifying and differentiating patients with lcSSc, especially those with associated PAH. Since PAH exhibits a better prognosis if treated early, all SSc patients should undergo PAH screening. Copyright © 2018 Elsevier Inc. All rights reserved.
Towards a new tool for the evaluation of the quality of ultrasound compressed images.
Delgorge, Cécile; Rosenberger, Christophe; Poisson, Gérard; Vieyres, Pierre
2006-11-01
This paper presents a new tool for the evaluation of ultrasound image compression. The goal is to measure the image quality as easily as with a statistical criterion, and with the same reliability as the one provided by the medical assessment. An initial experiment is proposed to medical experts and represents our reference value for the comparison of evaluation criteria. Twenty-one statistical criteria are selected from the literature. A cumulative absolute similarity measure is defined as a distance between the criterion to evaluate and the reference value. A first fusion method based on a linear combination of criteria is proposed to improve the results obtained by each of them separately. The second proposed approach combines different statistical criteria and uses the medical assessment in a training phase with a support vector machine. Some experimental results are given and show the benefit of fusion.
Isolating and Examining Sources of Suppression and Multicollinearity in Multiple Linear Regression.
Beckstead, Jason W
2012-03-30
The presence of suppression (and multicollinearity) in multiple regression analysis complicates interpretation of predictor-criterion relationships. The mathematical conditions that produce suppression in regression analysis have received considerable attention in the methodological literature but until now nothing in the way of an analytic strategy to isolate, examine, and remove suppression effects has been offered. In this article such an approach, rooted in confirmatory factor analysis theory and employing matrix algebra, is developed. Suppression is viewed as the result of criterion-irrelevant variance operating among predictors. Decomposition of predictor variables into criterion-relevant and criterion-irrelevant components using structural equation modeling permits derivation of regression weights with the effects of criterion-irrelevant variance omitted. Three examples with data from applied research are used to illustrate the approach: the first assesses child and parent characteristics to explain why some parents of children with obsessive-compulsive disorder accommodate their child's compulsions more so than do others, the second examines various dimensions of personal health to explain individual differences in global quality of life among patients following heart surgery, and the third deals with quantifying the relative importance of various aptitudes for explaining academic performance in a sample of nursing students. The approach is offered as an analytic tool for investigators interested in understanding predictor-criterion relationships when complex patterns of intercorrelation among predictors are present and is shown to augment dominance analysis.
Using histograms to introduce randomization in the generation of ensembles of decision trees
Kamath, Chandrika; Cantu-Paz, Erick; Littau, David
2005-02-22
A system for decision tree ensembles that includes a module to read the data, a module to create a histogram, a module to evaluate a potential split according to some criterion using the histogram, a module to select a split point randomly in an interval around the best split, a module to split the data, and a module to combine multiple decision trees in ensembles. The decision tree method includes the steps of reading the data; creating a histogram; evaluating a potential split according to some criterion using the histogram, selecting a split point randomly in an interval around the best split, splitting the data, and combining multiple decision trees in ensembles.
The role of public and private transfers in the cost-benefit analysis of mental health programs.
Brent, Robert J
2004-11-01
This paper revisits the issue of whether to include maintenance costs in an economic evaluation in mental health. The source of these maintenance costs may be public or private transfers. The issue is discussed in terms of a formal cost-benefit criterion. It is shown that, when transfers have productivity effects, income distribution is important, and one recognizes that public transfers have tax implications, transfers can have real resource effects and cannot be ignored. The criterion is then applied to an evaluation of three case management programs in California that sought to reduce the intensive hospitalization of the severely mentally ill. 2004 John Wiley & Sons, Ltd.
A survey of quality measures for gray-scale image compression
NASA Technical Reports Server (NTRS)
Eskicioglu, Ahmet M.; Fisher, Paul S.
1993-01-01
Although a variety of techniques are available today for gray-scale image compression, a complete evaluation of these techniques cannot be made as there is no single reliable objective criterion for measuring the error in compressed images. The traditional subjective criteria are burdensome, and usually inaccurate or inconsistent. On the other hand, being the most common objective criterion, the mean square error (MSE) does not have a good correlation with the viewer's response. It is now understood that in order to have a reliable quality measure, a representative model of the complex human visual system is required. In this paper, we survey and give a classification of the criteria for the evaluation of monochrome image quality.
Zhuang, Xiahai; Bai, Wenjia; Song, Jingjing; Zhan, Songhua; Qian, Xiaohua; Shi, Wenzhe; Lian, Yanyun; Rueckert, Daniel
2015-07-01
Cardiac computed tomography (CT) is widely used in clinical diagnosis of cardiovascular diseases. Whole heart segmentation (WHS) plays a vital role in developing new clinical applications of cardiac CT. However, the shape and appearance of the heart can vary greatly across different scans, making the automatic segmentation particularly challenging. The objective of this work is to develop and evaluate a multiatlas segmentation (MAS) scheme using a new atlas ranking and selection algorithm for automatic WHS of CT data. Research on different MAS strategies and their influence on WHS performance are limited. This work provides a detailed comparison study evaluating the impacts of label fusion, atlas ranking, and sizes of the atlas database on the segmentation performance. Atlases in a database were registered to the target image using a hierarchical registration scheme specifically designed for cardiac images. A subset of the atlases were selected for label fusion, according to the authors' proposed atlas ranking criterion which evaluated the performance of each atlas by computing the conditional entropy of the target image given the propagated atlas labeling. Joint label fusion was used to combine multiple label estimates to obtain the final segmentation. The authors used 30 clinical cardiac CT angiography (CTA) images to evaluate the proposed MAS scheme and to investigate different segmentation strategies. The mean WHS Dice score of the proposed MAS method was 0.918 ± 0.021, and the mean runtime for one case was 13.2 min on a workstation. This MAS scheme using joint label fusion generated significantly better Dice scores than the other label fusion strategies, including majority voting (0.901 ± 0.276, p < 0.01), locally weighted voting (0.905 ± 0.0247, p < 0.01), and probabilistic patch-based fusion (0.909 ± 0.0249, p < 0.01). In the atlas ranking study, the proposed criterion based on conditional entropy yielded a performance curve with higher WHS Dice scores compared to the conventional schemes (p < 0.03). In the atlas database study, the authors showed that the MAS using larger atlas databases generated better performance curves than the MAS using smaller ones, indicating larger atlas databases could produce more accurate segmentation. The authors have developed a new MAS framework for automatic WHS of CTA and investigated alternative implementations of MAS. With the proposed atlas ranking algorithm and joint label fusion, the MAS scheme is able to generate accurate segmentation within practically acceptable computation time. This method can be useful for the development of new clinical applications of cardiac CT.
Renormalization group naturalness of GUT Higgs potentials
NASA Astrophysics Data System (ADS)
Allanach, B. C.; Amelino-Camelia, G.; Philipsen, O.; Pisanti, O.; Rosa, L.
1999-01-01
We analyze the symmetry-breaking patterns of grand unified theories from the point of view of a recently proposed criterion of renormalization-group naturalness. We perform the analysis on simple non-SUSY SU(5) and SO(10) and SUSY SU(5) GUTs. We find that the naturalness criterion can favor spontaneous symmetry breaking in the direction of the smallest of the maximal little groups. Some differences between theories with and without supersymmetry are also emphasized.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Yang, Jun; Department of Oncology, First Affiliated Hospital of Xinxiang Medical University, 88 Jiankang Road, Weihui, Henan, 453100; Ma, Lin
2016-07-01
To investigate the dosimetric characteristics of 4 SBRT-capable dose delivery systems, CyberKnife (CK), Helical TomoTherapy (HT), Volumetric Modulated Arc Therapy (VMAT) by Varian RapidArc (RA), and segmental step-and-shoot intensity-modulated radiation therapy (IMRT) by Elekta, on isolated thoracic spinal lesions. CK, HT, RA, and IMRT planning were performed simultaneously for 10 randomly selected patients with 6 body types and 6 body + pedicle types with isolated thoracic lesions. The prescription was set with curative intent and dose of either 33 Gy in 3 fractions (3F) or 40 Gy in 5F to cover at least 90% of the planning target volume (PTV),more » correspondingly. Different dosimetric indices, beam-on time, and monitor units (MUs) were evaluated to compare the advantages/disadvantages of each delivery modality. In ensuring the dose-volume constraints for cord and esophagus of the premise, CK, HT, and RA all achieved a sharp conformity index (CI) and a small penumbra volume compared to IMRT. RA achieved a CI comparable to those from CK, HT, and IMRT. CK had a heterogeneous dose distribution in the target as its radiosurgical nature with less dose uniformity inside the target. CK had the longest beam-on time and the largest MUs, followed by HT and RA. IMRT presented the shortest beam-on time and the least MUs delivery. For the body-type lesions, CK, HT, and RA satisfied the target coverage criterion in 6 cases, but the criterion was satisfied in only 3 (50%) cases with the IMRT technique. For the body + pedicle-type lesions, HT satisfied the criterion of the target coverage of ≥90% in 4 of the 6 cases, and reached a target coverage of 89.0% in another case. However, the criterion of the target coverage of ≥90% was reached in 2 cases by CK and RA, and only in 1 case by IMRT. For curative-intent SBRT of isolated thoracic spinal lesions, RA is the first choice for the body-type lesions owing to its delivery efficiency (time); the second choice is CK or HT; HT is the preferential choice for the body + pedicle-type lesions. This study suggests further clinical investigations with longer follow-up for these studied cases.« less
Training set optimization under population structure in genomic selection.
Isidro, Julio; Jannink, Jean-Luc; Akdemir, Deniz; Poland, Jesse; Heslot, Nicolas; Sorrells, Mark E
2015-01-01
Population structure must be evaluated before optimization of the training set population. Maximizing the phenotypic variance captured by the training set is important for optimal performance. The optimization of the training set (TRS) in genomic selection has received much interest in both animal and plant breeding, because it is critical to the accuracy of the prediction models. In this study, five different TRS sampling algorithms, stratified sampling, mean of the coefficient of determination (CDmean), mean of predictor error variance (PEVmean), stratified CDmean (StratCDmean) and random sampling, were evaluated for prediction accuracy in the presence of different levels of population structure. In the presence of population structure, the most phenotypic variation captured by a sampling method in the TRS is desirable. The wheat dataset showed mild population structure, and CDmean and stratified CDmean methods showed the highest accuracies for all the traits except for test weight and heading date. The rice dataset had strong population structure and the approach based on stratified sampling showed the highest accuracies for all traits. In general, CDmean minimized the relationship between genotypes in the TRS, maximizing the relationship between TRS and the test set. This makes it suitable as an optimization criterion for long-term selection. Our results indicated that the best selection criterion used to optimize the TRS seems to depend on the interaction of trait architecture and population structure.
PKIX Certificate Status in Hybrid MANETs
NASA Astrophysics Data System (ADS)
Muñoz, Jose L.; Esparza, Oscar; Gañán, Carlos; Parra-Arnau, Javier
Certificate status validation is a hard problem in general but it is particularly complex in Mobile Ad-hoc Networks (MANETs) because we require solutions to manage both the lack of fixed infrastructure inside the MANET and the possible absence of connectivity to trusted authorities when the certification validation has to be performed. In this sense, certificate acquisition is usually assumed as an initialization phase. However, certificate validation is a critical operation since the node needs to check the validity of certificates in real-time, that is, when a particular certificate is going to be used. In such MANET environments, it may happen that the node is placed in a part of the network that is disconnected from the source of status data at the moment the status checking is required. Proposals in the literature suggest the use of caching mechanisms so that the node itself or a neighbour node has some status checking material (typically on-line status responses or lists of revoked certificates). However, to the best of our knowledge the only criterion to evaluate the cached (obsolete) material is the time. In this paper, we analyse how to deploy a certificate status checking PKI service for hybrid MANET and we propose a new criterion based on risk to evaluate cached status data that is much more appropriate and absolute than time because it takes into account the revocation process.
NASA Astrophysics Data System (ADS)
Engeland, Kolbjorn; Steinsland, Ingelin
2014-05-01
This study introduces a methodology for the construction of probabilistic inflow forecasts for multiple catchments and lead times, and investigates criterions for evaluation of multi-variate forecasts. A post-processing approach is used, and a Gaussian model is applied for transformed variables. The post processing model has two main components, the mean model and the dependency model. The mean model is used to estimate the marginal distributions for forecasted inflow for each catchment and lead time, whereas the dependency models was used to estimate the full multivariate distribution of forecasts, i.e. co-variances between catchments and lead times. In operational situations, it is a straightforward task to use the models to sample inflow ensembles which inherit the dependencies between catchments and lead times. The methodology was tested and demonstrated in the river systems linked to the Ulla-Førre hydropower complex in southern Norway, where simultaneous probabilistic forecasts for five catchments and ten lead times were constructed. The methodology exhibits sufficient flexibility to utilize deterministic flow forecasts from a numerical hydrological model as well as statistical forecasts such as persistent forecasts and sliding window climatology forecasts. It also deals with variation in the relative weights of these forecasts with both catchment and lead time. When evaluating predictive performance in original space using cross validation, the case study found that it is important to include the persistent forecast for the initial lead times and the hydrological forecast for medium-term lead times. Sliding window climatology forecasts become more important for the latest lead times. Furthermore, operationally important features in this case study such as heteroscedasticity, lead time varying between lead time dependency and lead time varying between catchment dependency are captured. Two criterions were used for evaluating the added value of the dependency model. The first one was the Energy score (ES) that is a multi-dimensional generalization of continuous rank probability score (CRPS). ES was calculated for all lead-times and catchments together, for each catchment across all lead times and for each lead time across all catchments. The second criterion was to use CRPS for forecasted inflows accumulated over several lead times and catchments. The results showed that ES was not very sensitive to correct covariance structure, whereas CRPS for accumulated flows where more suitable for evaluating the dependency model. This indicates that it is more appropriate to evaluate relevant univariate variables that depends on the dependency structure then to evaluate the multivariate forecast directly.
Boni, Robson Aparecido Dos Santos; Paiva, Carlos Eduardo; de Oliveira, Marco Antonio; Lucchetti, Giancarlo; Fregnani, José Humberto Tavares Guerreiro; Paiva, Bianca Sakamoto Ribeiro
2018-01-01
To evaluate the prevalence and possible factors associated with the development of burnout among medical students in the first years of undergraduate school. A cross-sectional study was conducted at the Barretos School of Health Sciences, Dr. Paulo Prata. A total of 330 students in the first four years of medical undergraduate school were invited to participate in responding to the sociodemographic and Maslach Burnout Inventory-Student Survey (MBI-SS) questionnaires. The first-year group consisted of 150 students, followed by the second-, third-, and fourth-year groups, with 60 students each. Data from 265 students who answered at least the sociodemographic questionnaire and the MBI-SS were analyzed (response rate = 80.3%). One (n = 1, 0.3%) potential participant viewed the Informed Consent Form but did not agree to participate in the study. A total of 187 students (187/265, 70.6%) presented high levels of emotional exhaustion, 140 (140/265, 52.8%) had high cynicism, and 129 (129/265, 48.7%) had low academic efficacy. The two-dimensional criterion indicated that 119 (44.9%) students experienced burnout. Based on the three-dimensional criterion, 70 students (26.4%) presented with burnout. The year with the highest frequency of affected students for both criteria was the first year (p = 0.001). Personal attributes were able to explain 11% (ΔR = 0.11) of the variability of burnout under the two-dimensional criterion and 14.4% (R2 = 0.144) under the three-dimensional criterion. This study showed a high prevalence of burnout among medical students in a private school using active teaching methodologies. In the first years of graduation, students' personal attributes (optimism and self-perception of health) and school attributes (motivation and routine of the exhaustive study) were associated with higher levels of burnout. These findings reinforce the need to establish preventive measures focused on the personal attributes of first-year students, providing better performance, motivation, optimism, and empathy in the subsequent stages of the course.
Tightening the Dutch coffee shop policy: Evaluation of the private club and the residence criterion.
van Ooyen-Houben, Marianne M J; Bieleman, Bert; Korf, Dirk J
2016-05-01
The Dutch coffee shop policy was tightened in 2012. Two additional criteria that coffee shops must adhere to in order for them to be tolerated came into force: the private club and the residence criterion. Coffee shops were only permitted to give access to members and only residents of the Netherlands were permitted to become a member. This tightened policy sought to make coffee shops smaller and more controllable, to reduce the nuisance associated with coffee shops and to reduce the number of foreign visitors attracted by the coffee shops. Enforcement began in the southern provinces. The private club criterion was abolished at the end of 2012. A sample of fourteen municipalities with coffee shops was drawn. Seven in the south were treated as an 'experimental group' and the others as 'comparison group'. A baseline assessment and follow-ups at six and 18 months were performed. A combination of methods was applied: interviews with local experts, surveys with neighbourhood residents, coffee shop visitors and cannabis users, and ethnographic field work. Drugs tourism to coffee shops swiftly declined in 2012. The coffee shops also lost a large portion of their local customers, since users did not want to register as a member. The illegal market expanded. Neighbourhood residents experienced a greater amount of nuisance caused by dealer activities. After abolishment of the private club criterion, residents of the Netherlands largely returned to the coffee shops. Drug tourists still remained largely absent. Neighbourhood residents experienced more nuisance from coffee shops again. Illegal cannabis sale was tempered. No effect on cannabis use was found. The quick and robust shifts in the users' market in reaction to the policy changes illustrate the power of policy, but also the limitations caused by the dynamic and resilient nature of the Dutch cannabis supply market. Copyright © 2016 Elsevier B.V. All rights reserved.
Serel Arslan, S; Demir, N; Karaduman, A A
2017-02-01
This study aimed to develop a scale called Tongue Thrust Rating Scale (TTRS), which categorised tongue thrust in children in terms of its severity during swallowing, and to investigate its validity and reliability. The study describes the developmental phase of the TTRS and presented its content and criterion-based validity and interobserver and intra-observer reliability. For content validation, seven experts assessed the steps in the scale over two Delphi rounds. Two physical therapists evaluated videos of 50 children with cerebral palsy (mean age, 57·9 ± 16·8 months), using the TTRS to test criterion-based validity, interobserver and intra-observer reliability. The Karaduman Chewing Performance Scale (KCPS) and Drooling Severity and Frequency Scale (DSFS) were used for criterion-based validity. All the TTRS steps were deemed necessary. The content validity index was 0·857. A very strong positive correlation was found between two examinations by one physical therapist, which indicated intra-observer reliability (r = 0·938, P < 0·001). A very strong positive correlation was also found between the TTRS scores of two physical therapists, indicating interobserver reliability (r = 0·892, P < 0·001). There was also a strong positive correlation between the TTRS and KCPS (r = 0·724, P < 0·001) and a very strong positive correlation between the TTRS scores and DSFS (r = 0·822 and r = 0·755; P < 0·001). These results demonstrated the criterion-based validity of the TTRS. The TTRS is a valid, reliable and clinically easy-to-use functional instrument to document the severity of tongue thrust in children. © 2016 John Wiley & Sons Ltd.
Predictability of Seasonal Rainfall over the Greater Horn of Africa
NASA Astrophysics Data System (ADS)
Ngaina, J. N.
2016-12-01
The El Nino-Southern Oscillation (ENSO) is a primary mode of climate variability in the Greater of Africa (GHA). The expected impacts of climate variability and change on water, agriculture, and food resources in GHA underscore the importance of reliable and accurate seasonal climate predictions. The study evaluated different model selection criteria which included the Coefficient of determination (R2), Akaike's Information Criterion (AIC), Bayesian Information Criterion (BIC), and the Fisher information approximation (FIA). A forecast scheme based on the optimal model was developed to predict the October-November-December (OND) and March-April-May (MAM) rainfall. The predictability of GHA rainfall based on ENSO was quantified based on composite analysis, correlations and contingency tables. A test for field-significance considering the properties of finiteness and interdependence of the spatial grid was applied to avoid correlations by chance. The study identified FIA as the optimal model selection criterion. However, complex model selection criteria (FIA followed by BIC) performed better compared to simple approach (R2 and AIC). Notably, operational seasonal rainfall predictions over the GHA makes of simple model selection procedures e.g. R2. Rainfall is modestly predictable based on ENSO during OND and MAM seasons. El Nino typically leads to wetter conditions during OND and drier conditions during MAM. The correlations of ENSO indices with rainfall are statistically significant for OND and MAM seasons. Analysis based on contingency tables shows higher predictability of OND rainfall with the use of ENSO indices derived from the Pacific and Indian Oceans sea surfaces showing significant improvement during OND season. The predictability based on ENSO for OND rainfall is robust on a decadal scale compared to MAM. An ENSO-based scheme based on an optimal model selection criterion can thus provide skillful rainfall predictions over GHA. This study concludes that the negative phase of ENSO (La Niña) leads to dry conditions while the positive phase of ENSO (El Niño) anticipates enhanced wet conditions
A Systematic Quantitative-Qualitative Model: How To Evaluate Professional Services
ERIC Educational Resources Information Center
Yoda, Koji
1973-01-01
The proposed evaluation model provides for the assignment of relative weights to each criterion, and establishes a weighting system for calculating a quantitative-qualitative raw score for each service activity of a faculty member being reviewed. (Author)
Note on Professor Sizer's Paper.
ERIC Educational Resources Information Center
Balderston, Frederick E.
1979-01-01
Issues suggested by John Sizer's paper, an overview of the assessment of institutional performance, include: the efficient-frontier approach, multiple-criterion decision-making models, performance analysis approached as path analysis, and assessment of academic quality. (JMD)
Jordana-Lluch, Elena; Giménez, Montserrat; Quesada, M Dolores; Rivaya, Belén; Marcó, Clara; Domínguez, M Jesús; Arméstar, Fernando; Martró, Elisa; Ausina, Vicente
2015-01-01
Rapid identification of the etiological agent in bloodstream infections is of vital importance for the early administration of the most appropriate antibiotic therapy. Molecular methods may offer an advantage to current culture-based microbiological diagnosis. The goal of this study was to evaluate the performance of IRIDICA, a platform based on universal genetic amplification followed by mass spectrometry (PCR/ESI-MS) for the molecular diagnosis of sepsis-related pathogens directly from the patient's blood. A total of 410 whole blood specimens from patients admitted to Emergency Room (ER) and Intensive Care Unit (ICU) with clinical suspicion of sepsis were tested with the IRIDICA BAC BSI Assay (broad identification of bacteria and Candida spp.). Microorganisms grown in culture and detected by IRIDICA were compared considering blood culture as gold standard. When discrepancies were found, clinical records and results from other cultures were taken into consideration (clinical infection criterion). The overall positive and negative agreement of IRIDICA with blood culture in the analysis by specimen was 74.8% and 78.6%, respectively, rising to 76.9% and 87.2% respectively, when compared with the clinical infection criterion. Interestingly, IRIDICA detected 41 clinically significant microorganisms missed by culture, most of them from patients under antimicrobial treatment. Of special interest were the detections of one Mycoplasma hominis and two Mycobacterium simiae in immunocompromised patients. When ICU patients were analyzed separately, sensitivity, specificity, positive and negative predictive values compared with blood culture were 83.3%, 78.6%, 33.9% and 97.3% respectively, and 90.5%, 87.2%, 64.4% and 97.3% respectively, in comparison with the clinical infection criterion. IRIDICA is a promising technology that offers an early and reliable identification of a wide variety of pathogens directly from the patient's blood within 6h, which brings the opportunity to improve management of septic patients, especially for those critically ill admitted to the ICU.
Peer Review of Grant Applications: Criteria Used and Qualitative Study of Reviewer Practices
Abdoul, Hendy; Perrey, Christophe; Amiel, Philippe; Tubach, Florence; Gottot, Serge; Durand-Zaleski, Isabelle; Alberti, Corinne
2012-01-01
Background Peer review of grant applications has been criticized as lacking reliability. Studies showing poor agreement among reviewers supported this possibility but usually focused on reviewers’ scores and failed to investigate reasons for disagreement. Here, our goal was to determine how reviewers rate applications, by investigating reviewer practices and grant assessment criteria. Methods and Findings We first collected and analyzed a convenience sample of French and international calls for proposals and assessment guidelines, from which we created an overall typology of assessment criteria comprising nine domains relevance to the call for proposals, usefulness, originality, innovativeness, methodology, feasibility, funding, ethical aspects, and writing of the grant application. We then performed a qualitative study of reviewer practices, particularly regarding the use of assessment criteria, among reviewers of the French Academic Hospital Research Grant Agencies (Programmes Hospitaliers de Recherche Clinique, PHRCs). Semi-structured interviews and observation sessions were conducted. Both the time spent assessing each grant application and the assessment methods varied across reviewers. The assessment criteria recommended by the PHRCs were listed by all reviewers as frequently evaluated and useful. However, use of the PHRC criteria was subjective and varied across reviewers. Some reviewers gave the same weight to each assessment criterion, whereas others considered originality to be the most important criterion (12/34), followed by methodology (10/34) and feasibility (4/34). Conceivably, this variability might adversely affect the reliability of the review process, and studies evaluating this hypothesis would be of interest. Conclusions Variability across reviewers may result in mistrust among grant applicants about the review process. Consequently, ensuring transparency is of the utmost importance. Consistency in the review process could also be improved by providing common definitions for each assessment criterion and uniform requirements for grant application submissions. Further research is needed to assess the feasibility and acceptability of these measures. PMID:23029386
NASA Astrophysics Data System (ADS)
Susanti, Hesty; Suprijanto, Kurniadi, Deddy
2018-02-01
Needle visibility in ultrasound-guided technique has been a crucial factor for successful interventional procedure. It has been affected by several factors, i.e. puncture depth, insertion angle, needle size and material, and imaging technology. The influences of those factors made the needle not always well visible. 20 G needles of 15 cm length (Nano Line, facet) were inserted into water bath with variation of insertion angles and depths. Ultrasound measurements are performed with BK-Medical Flex Focus 800 using 12 MHz linear array and 5 MHz curved array in Ultrasound Guided Regional Anesthesia mode. We propose 3 criteria to evaluate needle visibility, i.e. maximum intensity, mean intensity, and the ratio between minimum and maximum intensity. Those criteria were then depicted into representative maps for practical purpose. The best criterion candidate for representing the needle visibility was criterion 1. Generally, the appearance pattern of the needle from this criterion was relatively consistent, i.e. for linear array, it was relatively poor visibility in the middle part of the shaft, while for curved array, it is relatively better visible toward the end of the shaft. With further investigations, for example with the use of tissue-mimicking phantom, the representative maps can be built for future practical purpose, i.e. as a tool for clinicians to ensure better needle placement in clinical application. It will help them to avoid the "dead" area where the needle is not well visible, so it can reduce the risks of vital structures traversing and the number of required insertion, resulting in less patient morbidity. Those simple criteria and representative maps can be utilized to evaluate general visibility patterns of the needle in vast range of needle types and sizes in different insertion media. This information is also important as an early investigation for future research of needle visibility improvement, i.e. the development of beamforming strategies and ultrasound enhanced (echogenic) needle.
Bernardo, Maria S; Lapa, N; Barbosa, R; Gonçalves, M; Mendes, B; Pinto, F; Gulyurtlu, I
2009-07-15
A mixture of 70% (w/w) pine biomass and 30% (w/w) plastics (mixture of polypropylene, polyethylene, and polystyrene) was subjected to pyrolysis at 400 degrees C, for 15 min, with an initial pressure of 40 MPa. Part of the solid residue produced was subjected to extraction with dichloromethane (DCM). The extracted residue (residue A) and raw residue (residue B) were analyzed by weight loss combustion and submitted to the leaching test ISO/TS 21268-2 using two different leachants: DCM (0.2%, v/v) and calcium chloride (0.001 mol/L). The concentrations of the heavy metals Cd, Cr, Ni, Zn, Pb and Cu were determined in the eluates and in the two residues. The eluates were further characterized by determining their pH and the concentrations of benzene, toluene, ethylbenzene and xylenes (BTEX). The presence of other organic contaminants in the eluates was qualitatively evaluated by gas chromatography, coupled with mass spectrometry. An ecotoxicological characterization was also performed by using the bio-indicator Vibrio fischeri. The chemical and ecotoxicological results were analyzed according to the French proposal of Criteria on the Evaluation Methods of Waste Ecotoxicity (CEMWE). Residue A was not considered to be ecotoxic by the ecotoxicological criterion (EC(50) (30 min) >or=10%), but it was considered to be ecotoxic by the chemical criterion (Ni>or=0.5mg/L). Residue B was considered to be ecotoxic by the ecotoxicological criterion: EC(50) (30 min)
Kojima, Motohiro; Shimazaki, Hideyuki; Iwaya, Keiichi; Kage, Masayoshi; Akiba, Jun; Ohkura, Yasuo; Horiguchi, Shinichiro; Shomori, Kohei; Kushima, Ryoji; Ajioka, Yoichi; Nomura, Shogo; Ochiai, Atsushi
2013-07-01
The goal of this study is to create an objective pathological diagnostic system for blood and lymphatic vessel invasion (BLI). 1450 surgically resected colorectal cancer specimens from eight hospitals were reviewed. Our first step was to compare the current practice of pathology assessment among eight hospitals. Then, H&E stained slides with or without histochemical/immunohistochemical staining were assessed by eight pathologists and concordance of BLI diagnosis was checked. In addition, histological findings associated with BLI having good concordance were reviewed. Based on these results, framework for developing diagnostic criterion was developed, using the Delphi method. The new criterion was evaluated using 40 colorectal cancer specimens. Frequency of BLI diagnoses, number of blocks obtained and stained for assessment of BLI varied among eight hospitals. Concordance was low for BLI diagnosis and was not any better when histochemical/immunohistochemical staining was provided. All histological findings associated with BLI from H&E staining were poor in agreement. However, observation of elastica-stained internal elastic membrane covering more than half of the circumference surrounding the tumour cluster as well as the presence of D2-40-stained endothelial cells covering more than half of the circumference surrounding the tumour cluster showed high concordance. Based on this observation, we developed a framework for pathological diagnostic criterion, using the Delphi method. This criterion was found to be useful in improving concordance of BLI diagnosis. A framework for pathological diagnostic criterion was developed by reviewing concordance and using the Delphi method. The criterion developed may serve as the basis for creating a standardised procedure for pathological diagnosis.
Kojima, Motohiro; Shimazaki, Hideyuki; Iwaya, Keiichi; Kage, Masayoshi; Akiba, Jun; Ohkura, Yasuo; Horiguchi, Shinichiro; Shomori, Kohei; Kushima, Ryoji; Ajioka, Yoichi; Nomura, Shogo; Ochiai, Atsushi
2013-01-01
Aims The goal of this study is to create an objective pathological diagnostic system for blood and lymphatic vessel invasion (BLI). Methods 1450 surgically resected colorectal cancer specimens from eight hospitals were reviewed. Our first step was to compare the current practice of pathology assessment among eight hospitals. Then, H&E stained slides with or without histochemical/immunohistochemical staining were assessed by eight pathologists and concordance of BLI diagnosis was checked. In addition, histological findings associated with BLI having good concordance were reviewed. Based on these results, framework for developing diagnostic criterion was developed, using the Delphi method. The new criterion was evaluated using 40 colorectal cancer specimens. Results Frequency of BLI diagnoses, number of blocks obtained and stained for assessment of BLI varied among eight hospitals. Concordance was low for BLI diagnosis and was not any better when histochemical/immunohistochemical staining was provided. All histological findings associated with BLI from H&E staining were poor in agreement. However, observation of elastica-stained internal elastic membrane covering more than half of the circumference surrounding the tumour cluster as well as the presence of D2-40-stained endothelial cells covering more than half of the circumference surrounding the tumour cluster showed high concordance. Based on this observation, we developed a framework for pathological diagnostic criterion, using the Delphi method. This criterion was found to be useful in improving concordance of BLI diagnosis. Conclusions A framework for pathological diagnostic criterion was developed by reviewing concordance and using the Delphi method. The criterion developed may serve as the basis for creating a standardised procedure for pathological diagnosis. PMID:23592799
Robust signal recovery using the prolate spherical wave functions and maximum correntropy criterion
NASA Astrophysics Data System (ADS)
Zou, Cuiming; Kou, Kit Ian
2018-05-01
Signal recovery is one of the most important problem in signal processing. This paper proposes a novel signal recovery method based on prolate spherical wave functions (PSWFs). PSWFs are a kind of special functions, which have been proved having good performance in signal recovery. However, the existing PSWFs based recovery methods used the mean square error (MSE) criterion, which depends on the Gaussianity assumption of the noise distributions. For the non-Gaussian noises, such as impulsive noise or outliers, the MSE criterion is sensitive, which may lead to large reconstruction error. Unlike the existing PSWFs based recovery methods, our proposed PSWFs based recovery method employs the maximum correntropy criterion (MCC), which is independent of the noise distribution. The proposed method can reduce the impact of the large and non-Gaussian noises. The experimental results on synthetic signals with various types of noises show that the proposed MCC based signal recovery method has better robust property against various noises compared to other existing methods.
The effect of suspended particles on Jean's criterion for gravitational instability
NASA Technical Reports Server (NTRS)
Wollkind, David J.; Yates, Kemble R.
1990-01-01
The effect that the proper inclusion of suspended particles has on Jeans' criterion for the self-gravitational instability of an unbounded nonrotating adiabatic gas cloud is examined by formulating the appropriate model system, introducing particular physically plausible equations of state and constitutive relations, performing a linear stability analysis of a uniformly expanding exact solution to these governing equations, and exploiting the fact that there exists a natural small material parameter for this problem given by N sub 1/n sub 1, the ratio of the initial number density for the particles to that for the gas. The main result of this investigation is the derivation of an altered criterion which can substantially reduce Jeans' original critical wavelength for instability. It is then shown that the existing discrepancy between Jeans' theoretical prediction using and actual observational data relevant to the Andromeda nebula M31 can be accounted for by this new criterion of assuming suspended particles of a reasonable grain size and distribution to be present.
Physical mechanism and numerical simulation of the inception of the lightning upward leader
DOE Office of Scientific and Technical Information (OSTI.GOV)
Li Qingmin; Lu Xinchang; Shi Wei
2012-12-15
The upward leader is a key physical process of the leader progression model of lightning shielding. The inception mechanism and criterion of the upward leader need further understanding and clarification. Based on leader discharge theory, this paper proposes the critical electric field intensity of the stable upward leader (CEFISUL) and characterizes it by the valve electric field intensity on the conductor surface, E{sub L}, which is the basis of a new inception criterion for the upward leader. Through numerical simulation under various physical conditions, we verified that E{sub L} is mainly related to the conductor radius, and data fitting yieldsmore » the mathematical expression of E{sub L}. We further establish a computational model for lightning shielding performance of the transmission lines based on the proposed CEFISUL criterion, which reproduces the shielding failure rate of typical UHV transmission lines. The model-based calculation results agree well with the statistical data from on-site operations, which show the effectiveness and validity of the CEFISUL criterion.« less
Meeting the criteria of a nursing diagnosis classification: Evaluation of ICNP, ICF, NANDA and ZEFP.
Müller-Staub, Maria; Lavin, Mary Ann; Needham, Ian; van Achterberg, Theo
2007-07-01
Few studies described nursing diagnosis classification criteria and how classifications meet these criteria. The purpose was to identify criteria for nursing diagnosis classifications and to assess how these criteria are met by different classifications. First, a literature review was conducted (N=50) to identify criteria for nursing diagnoses classifications and to evaluate how these criteria are met by the International Classification of Nursing Practice (ICNP), the International Classification of Functioning, Disability and Health (ICF), the International Nursing Diagnoses Classification (NANDA), and the Nursing Diagnostic System of the Centre for Nursing Development and Research (ZEFP). Using literature review based general and specific criteria, the principal investigator evaluated each classification, applying a matrix. Second, a convenience sample of 20 nursing experts from different Swiss care institutions answered standardized interview forms, querying current national and international classification state and use. The first general criterion is that a diagnosis classification should describe the knowledge base and subject matter for which the nursing profession is responsible. ICNP) and NANDA meet this goal. The second general criterion is that each class fits within a central concept. The ICF and NANDA are the only two classifications built on conceptually driven classes. The third general classification criterion is that each diagnosis possesses a description, diagnostic criteria, and related etiologies. Although ICF and ICNP describe diagnostic terms, only NANDA fulfils this criterion. The analysis indicated that NANDA fulfilled most of the specific classification criteria in the matrix. The nursing experts considered NANDA to be the best-researched and most widely implemented classification in Switzerland and internationally. The international literature and the opinion of Swiss expert nurses indicate that-from the perspective of classifying comprehensive nursing diagnoses-NANDA should be recommended for nursing practice and electronic nursing documentation. Study limitations and future research needs are discussed.
Newgard, Craig D; Kampp, Michael; Nelson, Maria; Holmes, James F; Zive, Dana; Rea, Thomas; Bulger, Eileen M; Liao, Michael; Sherck, John; Hsia, Renee Y; Wang, N Ewen; Fleischman, Ross J; Barton, Erik D; Daya, Mohamud; Heineman, John; Kuppermann, Nathan
2012-05-01
"Emergency medical services (EMS) provider judgment" was recently added as a field triage criterion to the national guidelines, yet its predictive value and real world application remain unclear. We examine the use and independent predictive value of EMS provider judgment in identifying seriously injured persons. We analyzed a population-based retrospective cohort, supplemented by qualitative analysis, of injured children and adults evaluated and transported by 47 EMS agencies to 94 hospitals in five regions across the Western United States from 2006 to 2008. We used logistic regression models to evaluate the independent predictive value of EMS provider judgment for Injury Severity Score ≥ 16. EMS narratives were analyzed using qualitative methods to assess and compare common themes for each step in the triage algorithm, plus EMS provider judgment. 213,869 injured patients were evaluated and transported by EMS over the 3-year period, of whom 41,191 (19.3%) met at least one of the field triage criteria. EMS provider judgment was the most commonly used triage criterion (40.0% of all triage-positive patients; sole criterion in 21.4%). After accounting for other triage criteria and confounders, the adjusted odds ratio of Injury Severity Score ≥ 16 for EMS provider judgment was 1.23 (95% confidence interval, 1.03-1.47), although there was variability in predictive value across sites. Patients meeting EMS provider judgment had concerning clinical presentations qualitatively similar to those meeting mechanistic and other special considerations criteria. Among this multisite cohort of trauma patients, EMS provider judgment was the most commonly used field trauma triage criterion, independently associated with serious injury, and useful in identifying high-risk patients missed by other criteria. However, there was variability in predictive value between sites.
The Gideon Criterion: The Effects of Selection Criteria on Soldier Capabilities and Battle Results
1982-01-01
United States Army Recruiting Command RESEARCH MEMORANDUM 82-1 AD______ I I THE GIDEON CRITERION: THE EFFECTS OF SELECTION CRITERIA ON SOLDIER...and Evaluation Directorate Fort Sheridan, Illinois 60037 83 05 09 056 ii 1 DISCLAIMER NOTICE THIS DOCUMENT IS BEST QUALITY PRACTICABLE. THE COPY...FURNISHED TO DTIC CONTAINED A SIGNIFICANT NUMBER OF PAGES WHICH DO NOT REPRODUCE LEGIBLY. j1 ... 4 ’ t c " " .. THE GIDEON CR17RION’. THE EFFECTS OF
Evaluation of New Reverse Osmosis Membranes for the Separation of Toxic Compounds from Wastewater
1976-06-01
limited theoretical work being done regarding the separation of inorganic salts. Glueckauf (1967) has analyzed the repulsive forces between ions and a... inorganic ions by cellulose acetate membrane is in the order of the lyotropic series of ions. However, this criterion of separation has a few...12). The objective of this work was to study the criterion for the separa- tion of inorganic ions with the NS-1O0 membrane. Hopefully, it can be used
NASA Astrophysics Data System (ADS)
Kukunda, Collins B.; Duque-Lazo, Joaquín; González-Ferreiro, Eduardo; Thaden, Hauke; Kleinn, Christoph
2018-03-01
Distinguishing tree species is relevant in many contexts of remote sensing assisted forest inventory. Accurate tree species maps support management and conservation planning, pest and disease control and biomass estimation. This study evaluated the performance of applying ensemble techniques with the goal of automatically distinguishing Pinus sylvestris L. and Pinus uncinata Mill. Ex Mirb within a 1.3 km2 mountainous area in Barcelonnette (France). Three modelling schemes were examined, based on: (1) high-density LiDAR data (160 returns m-2), (2) Worldview-2 multispectral imagery, and (3) Worldview-2 and LiDAR in combination. Variables related to the crown structure and height of individual trees were extracted from the normalized LiDAR point cloud at individual-tree level, after performing individual tree crown (ITC) delineation. Vegetation indices and the Haralick texture indices were derived from Worldview-2 images and served as independent spectral variables. Selection of the best predictor subset was done after a comparison of three variable selection procedures: (1) Random Forests with cross validation (AUCRFcv), (2) Akaike Information Criterion (AIC) and (3) Bayesian Information Criterion (BIC). To classify the species, 9 regression techniques were combined using ensemble models. Predictions were evaluated using cross validation and an independent dataset. Integration of datasets and models improved individual tree species classification (True Skills Statistic, TSS; from 0.67 to 0.81) over individual techniques and maintained strong predictive power (Relative Operating Characteristic, ROC = 0.91). Assemblage of regression models and integration of the datasets provided more reliable species distribution maps and associated tree-scale mapping uncertainties. Our study highlights the potential of model and data assemblage at improving species classifications needed in present-day forest planning and management.
Gisbert, Javier P; Marín, Alicia C; Chaparro, María
2016-05-01
To perform a meta-analysis of the risk of relapse after discontinuation of anti-tumor necrosis factor (anti-TNF) therapy in patients with Crohn's disease (CD) and ulcerative colitis (UC), to evaluate risk factors for relapse, and to assess the response to retreatment with the same anti-TNF. Studies evaluating the incidence of relapse after anti-TNF discontinuation in patients with CD or UC who reached clinical remission with anti-TNFs were included. Bibliographies up to January 2015 were searched. Frequency of relapse after discontinuation of anti-TNF agents was determined; meta-analyses were performed using the inverse-variance method. We included 27 studies (21 infliximab and 6 infliximab/adalimumab). The overall risk of relapse after discontinuation of anti-TNF therapy was 44% for CD (95% confidence interval (CI) 36-51%; I(2)=79%; 912 patients) and 38% for UC (23-52%; I(2)=82%; 266 patients). In CD, the relapse rate was 38% at 6 months after discontinuation (short term), 40% at 12 months (medium term), and 49% at >25 months (long term). In UC, 28% of patients relapsed at 12 months. In CD, when clinical remission was the only criterion for stopping anti-TNF therapy, the relapse rate after 1 year was 42%, which decreased to 26% when endoscopic remission was also required. Retreatment with the same anti-TNF induced remission again in 80% of cases (68-91%). Approximately one-third of patients with inflammatory bowel disease in remission under anti-TNF treatment relapsed 1 year after discontinuation. This proportion increased to half in the long term. In CD patients, the risk of relapse was lower when the criterion for discontinuation was endoscopic remission and not only clinical remission. Response to retreatment with the same anti-TNF agent was favorable.
Measurement properties of depression questionnaires in patients with diabetes: a systematic review.
van Dijk, Susan E M; Adriaanse, Marcel C; van der Zwaan, Lennart; Bosmans, Judith E; van Marwijk, Harm W J; van Tulder, Maurits W; Terwee, Caroline B
2018-06-01
To conduct a systematic review on measurement properties of questionnaires measuring depressive symptoms in adult patients with type 1 or type 2 diabetes. A systematic review of the literature in MEDLINE, EMbase and PsycINFO was performed. Full text, original articles, published in any language up to October 2016 were included. Eligibility for inclusion was independently assessed by three reviewers who worked in pairs. Methodological quality of the studies was evaluated by two independent reviewers using the COnsensus-based Standards for the selection of health Measurement INstruments (COSMIN) checklist. Quality of the questionnaires was rated per measurement property, based on the number and quality of the included studies and the reported results. Of 6286 unique hits, 21 studies met our criteria evaluating nine different questionnaires in multiple settings and languages. The methodological quality of the included studies was variable for the different measurement properties: 9/15 studies scored 'good' or 'excellent' on internal consistency, 2/5 on reliability, 0/1 on content validity, 10/10 on structural validity, 8/11 on hypothesis testing, 1/5 on cross-cultural validity, and 4/9 on criterion validity. For the CES-D, there was strong evidence for good internal consistency, structural validity, and construct validity; moderate evidence for good criterion validity; and limited evidence for good cross-cultural validity. The PHQ-9 and WHO-5 also performed well on several measurement properties. However, the evidence for structural validity of the PHQ-9 was inconclusive. The WHO-5 was less extensively researched and originally not developed to measure depression. Currently, the CES-D is best supported for measuring depressive symptoms in diabetes patients.
Boström, O; Fredriksson, R; Håland, Y; Jakobsson, L; Krafft, M; Lövsund, P; Muser, M H; Svensson, M Y
2000-03-01
Long-term whiplash associated disorders (WAD) 1-3 sustained in low velocity rear-end impacts is the most common disability injury in Sweden. Therefore, to determine neck injury mechanisms and develop methods to measure neck-injury related parameters are of importance for current crash-safety research. A new neck injury criterion (NIC) has previously been proposed and evaluated by means of dummy, human and mathematical rear-impact simulations. So far, the criterion appears to be sensitive to the major car and collision related risk factors for injuries with long-term consequences. To further evaluate the applicability of NIC, four seats were tested according to a recently proposed sled-test procedure. 'Good' as well as 'bad' seats were chosen on the basis of a recently presented disability risk ranking list. The dummy used in the current tests was the Biofidelic Rear Impact Dummy (BioRID). The results of this study showed that NICmax values were generally related to the real-world risk of long-term WAD 1-3. Furthermore, these results suggested that NICmax calculated from sled tests using the BioRID dummy can be used for evaluating the neck injury risk of different car seats.
Wavelength selection in injection-driven Hele-Shaw flows: A maximum amplitude criterion
NASA Astrophysics Data System (ADS)
Dias, Eduardo; Miranda, Jose
2013-11-01
As in most interfacial flow problems, the standard theoretical procedure to establish wavelength selection in the viscous fingering instability is to maximize the linear growth rate. However, there are important discrepancies between previous theoretical predictions and existing experimental data. In this work we perform a linear stability analysis of the radial Hele-Shaw flow system that takes into account the combined action of viscous normal stresses and wetting effects. Most importantly, we introduce an alternative selection criterion for which the selected wavelength is determined by the maximum of the interfacial perturbation amplitude. The effectiveness of such a criterion is substantiated by the significantly improved agreement between theory and experiments. We thank CNPq (Brazilian Sponsor) for financial support.
NASA Astrophysics Data System (ADS)
Shen, Fuhui; Lian, Junhe; Münstermann, Sebastian
2018-05-01
Experimental and numerical investigations on the forming limit diagram (FLD) of a ferritic stainless steel were performed in this study. The FLD of this material was obtained by Nakajima tests. Both the Marciniak-Kuczynski (MK) model and the modified maximum force criterion (MMFC) were used for the theoretical prediction of the FLD. From the results of uniaxial tensile tests along different loading directions with respect to the rolling direction, strong anisotropic plastic behaviour was observed in the investigated steel. A recently proposed anisotropic evolving non-associated Hill48 (enHill48) plasticity model, which was developed from the conventional Hill48 model based on the non-associated flow rule with evolving anisotropic parameters, was adopted to describe the anisotropic hardening behaviour of the investigated material. In the previous study, the model was coupled with the MMFC for FLD prediction. In the current study, the enHill48 was further coupled with the MK model. By comparing the predicted forming limit curves with the experimental results, the influences of anisotropy in terms of flow rule and evolving features on the forming limit prediction were revealed and analysed. In addition, the forming limit predictive performances of the MK and the MMFC models in conjunction with the enHill48 plasticity model were compared and evaluated.
Multimodel predictive system for carbon dioxide solubility in saline formation waters.
Wang, Zan; Small, Mitchell J; Karamalidis, Athanasios K
2013-02-05
The prediction of carbon dioxide solubility in brine at conditions relevant to carbon sequestration (i.e., high temperature, pressure, and salt concentration (T-P-X)) is crucial when this technology is applied. Eleven mathematical models for predicting CO(2) solubility in brine are compared and considered for inclusion in a multimodel predictive system. Model goodness of fit is evaluated over the temperature range 304-433 K, pressure range 74-500 bar, and salt concentration range 0-7 m (NaCl equivalent), using 173 published CO(2) solubility measurements, particularly selected for those conditions. The performance of each model is assessed using various statistical methods, including the Akaike Information Criterion (AIC) and the Bayesian Information Criterion (BIC). Different models emerge as best fits for different subranges of the input conditions. A classification tree is generated using machine learning methods to predict the best-performing model under different T-P-X subranges, allowing development of a multimodel predictive system (MMoPS) that selects and applies the model expected to yield the most accurate CO(2) solubility prediction. Statistical analysis of the MMoPS predictions, including a stratified 5-fold cross validation, shows that MMoPS outperforms each individual model and increases the overall accuracy of CO(2) solubility prediction across the range of T-P-X conditions likely to be encountered in carbon sequestration applications.
Visual properties and memorising scenes: Effects of image-space sparseness and uniformity.
Lukavský, Jiří; Děchtěrenko, Filip
2017-10-01
Previous studies have demonstrated that humans have a remarkable capacity to memorise a large number of scenes. The research on memorability has shown that memory performance can be predicted by the content of an image. We explored how remembering an image is affected by the image properties within the context of the reference set, including the extent to which it is different from its neighbours (image-space sparseness) and if it belongs to the same category as its neighbours (uniformity). We used a reference set of 2,048 scenes (64 categories), evaluated pairwise scene similarity using deep features from a pretrained convolutional neural network (CNN), and calculated the image-space sparseness and uniformity for each image. We ran three memory experiments, varying the memory workload with experiment length and colour/greyscale presentation. We measured the sensitivity and criterion value changes as a function of image-space sparseness and uniformity. Across all three experiments, we found separate effects of 1) sparseness on memory sensitivity, and 2) uniformity on the recognition criterion. People better remembered (and correctly rejected) images that were more separated from others. People tended to make more false alarms and fewer miss errors in images from categorically uniform portions of the image-space. We propose that both image-space properties affect human decisions when recognising images. Additionally, we found that colour presentation did not yield better memory performance over grayscale images.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Klawitter, A.L.; Hoak, T.E.; Decker, A.D.
In 1993, the San Juan Basin accounted for approximately 605 Bcf of the 740 Bcf of all coalbed gas produced in the United States. The San Juan {open_quotes}cavitation fairway{close_quotes} in which production occurs in open-hole cavity completions, is responsible for over 60% of all U.S. coalbed methane production. Perhaps most striking is the fact that over 17,000 wells had penetrated the Fruitland formation in the San Juan Basin prior to recognition of the coalbed methan potential. To understand the dynamic cavity fairway reservoir in the San Juan Basin, an exploration rationale for coalbed methan was developed that permits a sequentialmore » reduction in total basin exploration area based on four primary exploration criteria. One of the most significant criterion is the existence of thick, thermally mature, friable coals. A second criterion is the existence of fully gas-charged coals. Evaluation of this criterion requires reservoir geochemical data to delineate zones of meteoric influx where breaching has occurred. A third criterion is the presence of adequate reservoir permeability. Natural fracturing in coals is due to cleating and tectonic processes. Because of the general relationship between coal cleating and coal rank, coal cleating intensity can be estimated by analysis of regional coal rank maps. The final criterion is determining whether natural fractures are open or closed. To make this determination, remote sensing imagery interpretation is supported by ancillary data compiled from regional tectonic studies. Application of these four criteria to the San Juan Basin in a heuristic, stepwise process resulted in an overall 94% reduction in total basin exploration area. Application of the first criterion reduced the total basin exploration area by 80%. Application of the second criterion further winnows this area by an addition 9%. Application of the third criterion reduces the exploration area to 6% of the total original exploration area.« less
Luiselli, J K
2000-07-01
A 3-year-old child with multiple medical disorders and chronic food refusal was treated successfully using a program that incorporated antecedent control procedures combined with positive reinforcement. The antecedent manipulations included visual cueing of a criterion number of self-feeding responses that were required during meals to receive reinforcement and a gradual increase in the imposed criterion (demand fading) that was based on improved frequency of oral consumption. As evaluated in a changing criterion design, the child learned to feed himself as an outcome of treatment. One year following intervention, he was consuming a variety of foods and had gained weight. Advantages of antecedent control methods for the treatment of chronic food refusal are discussed.
An evaluation system for financial compensation in traditional Chinese medicine services.
Dou, Lei; Yin, Ai-Tian; Hao, Mo; Lu, Jun
2015-10-01
To describe the major factors influencing financial compensation in traditional Chinese medicine (TCM) and prioritize what TCM services should be compensated for. Two structured questionnaires-a TCM service baseline questionnaire and a service cost questionnaire-were used to collect information from TCM public hospitals on TCM services provided in certain situations and service cost accounting. The cross-sectional study examined 110 TCM services provided in four county TCM public hospitals in Shandong province. From the questionnaire data, a screening index system was established via expert consultation and brainstorming. Comprehensive evaluation of TCM services was performed using the analytic hierarchy process method. Weighted coefficients were used to measure the importance of each criterion, after which comprehensive evaluation scores for each service were ranked to indicate what services should receive priority for financial compensation. Economy value, social value, and efficacy value were the three main criteria for screening for what TCM services should be compensated for. The economy value local weight had the highest value (0.588), of which the profit sub-criterion (0.278) was the most important for TCM financial compensation. Moxibustion was tied for the highest comprehensive evaluation scores, at 0.65 while Acupuncture and Massage Therapy were tied for the second and third highest, with 0.63 and 0.58, respectively. Government and policymakers should consider offer financial compensation to Moxibustion, Acupuncture, Massage Therapy, and TCM Orthopedics as priority services. In the meanwhile, it is essential to correct the unreasonable pricing, explore compensation methods, objects and payment, and revise and improve the accounting system for the costs of TCM services. Copyright © 2015 Elsevier Ltd. All rights reserved.
[Information value of "additional tasks" method to evaluate pilot's work load].
Gorbunov, V V
2005-01-01
"Additional task" method was used to evaluate pilot's work load in prolonged flight. Calculated through durations of latent periods of motor responses, quantitative criterion of work load is more informative for objective evaluation of pilot's involvement in his piloting functions rather than of other registered parameters.
Watershed health: An evaluation index for New Mexico
Bill Fleming
1999-01-01
Although watersheds are not equally healthy, there are no generally accepted criteria for evaluating and comparing them. This paper suggests several criteria which numerically evaluate watersheds in four ways: (1) riparian health, (2) aquatic macroinvertebrate biodiversity, (3) hillslope soil loss and (4) upland land use/flood peak potential. Each criterion is...
76 FR 78823 - Schedule for Rating Disabilities; Evaluation of Amyotrophic Lateral Sclerosis
Federal Register 2010, 2011, 2012, 2013, 2014
2011-12-20
... revising the disability evaluation criterion provided for amyotrophic lateral sclerosis (ALS) to provide an evaluation of 100 percent for any veteran with service-connected ALS. This change is necessary to adequately... to provide a total disability rating for any veteran with service-connected ALS. DATES: Effective...
75 FR 35711 - Schedule for Rating Disabilities; Evaluation of Amyotrophic Lateral Sclerosis
Federal Register 2010, 2011, 2012, 2013, 2014
2010-06-23
... revising the evaluation criterion for amyotrophic lateral sclerosis (ALS) to provide a 100-percent evaluation for any veteran with service-connected ALS. This change is necessary to adequately compensate... provide a total disability rating for any veteran with service- connected ALS. DATES: Comments must be...
Oliveira, Lanuza Borges; Soares, Fernanda Amaral; Silveira, Marise Fagundes; de Pinho, Lucinéia; Caldeira, Antônio Prates; Leite, Maísa Tavares de Souza
2016-01-01
ABSTRACT Objective: to develop and validate an instrument to evaluate the knowledge of health professionals about domestic violence on children. Method: this was a study conducted with 194 physicians, nurses and dentists. A literature review was performed for preparation of the items and identification of the dimensions. Apparent and content validation was performed using analysis of three experts and 27 professors of the pediatric health discipline. For construct validation, Cronbach's alpha was used, and the Kappa test was applied to verify reproducibility. The criterion validation was conducted using the Student's t-test. Results: the final instrument included 56 items; the Cronbach alpha was 0.734, the Kappa test showed a correlation greater than 0.6 for most items, and the Student t-test showed a statistically significant value to the level of 5% for the two selected variables: years of education and using the Family Health Strategy. Conclusion: the instrument is valid and can be used as a promising tool to develop or direct actions in public health and evaluate knowledge about domestic violence on children. PMID:27556878
An evaluation of noise and its effects on shuttle crewmembers during STS-50/USML-1
NASA Technical Reports Server (NTRS)
Koros, Anton; Wheelwright, Charles; Adam, Susan
1993-01-01
High noise levels can lead to physiological, psychological, and performance effects in man, ranging from irritability, annoyance, and sleep interference to interference with verbal communication and fatigue, and to temporary or permanent threshold shift at more extreme levels. The current study evaluated the acoustic environment of the STS50/USML-1 mission. The major objectives were to gain subjective assessments of the STS-50 noise levels, document impacts of noise upon crewmember performance, collect inflight sound level measurements, compare noise levels across missions, evaluate the current Shuttle acoustic criterion, and to make recommendations regarding noise specifications for SSF and other long-duration manned space missions. Sound measurements indicated that background noise levels were 60, 64, and 61 A-weighted decibels, respectively, on the Orbiter middeck, flight deck, and Space lab. All levels were rated acceptable, with the Spacelab environment rated the most favorably. Sleep stations afforded attenuation from airborne noise sources, although all crewmembers reported being awakened by crew activity on the middeck. Models of distance for acceptable speech communications were generated, identifying situations of compromised verbal communications to be avoided.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gehin, Jess C; Oakley, Brian; Worrall, Andrew
2015-01-01
Abstract One of the key objectives of the U.S. Department of Energy (DOE) Nuclear Energy R&D Roadmap is the development of sustainable nuclear fuel cycles that can improve natural resource utilization and provide solutions to the management of nuclear wastes. Recently, an evaluation and screening (E&S) of fuel cycle systems has been conducted to identify those options that provide the best opportunities for obtaining such improvements and also to identify the required research and development activities that can support the development of advanced fuel cycle options. In order to evaluate and screen the E&S study included nine criteria including Developmentmore » and Deployment Risk (D&DR). More specifically, this criterion was represented by the following metrics: Development time, development cost, deployment cost from prototypic validation to first-of-a-kind commercial, compatibility with the existing infrastructure, existence of regulations for the fuel cycle and familiarity with licensing, and existence of market incentives and/or barriers to commercial implementation of fuel cycle processes. Given the comprehensive nature of the study, a systematic approach was needed to determine metric data for the D&DR criterion, and is presented here. As would be expected, the Evaluation Group representing the once-through use of uranium in thermal reactors is always the highest ranked fuel cycle Evaluation Group for this D&DR criterion. Evaluation Groups that consist of once-through fuel cycles that use existing reactor types are consistently ranked very high. The highest ranked limited and continuous recycle fuel cycle Evaluation Groups are those that recycle Pu in thermal reactors. The lowest ranked fuel cycles are predominately continuous recycle single stage and multi-stage fuel cycles that involve TRU and/or U-233 recycle.« less
Palm, Peter; Josephson, Malin; Mathiassen, Svend Erik; Kjellberg, Katarina
2016-06-01
We evaluated the intra- and inter-observer reliability and criterion validity of an observation protocol, developed in an iterative process involving practicing ergonomists, for assessment of working technique during cash register work for the purpose of preventing upper extremity symptoms. Two ergonomists independently assessed 17 15-min videos of cash register work on two occasions each, as a basis for examining reliability. Criterion validity was assessed by comparing these assessments with meticulous video-based analyses by researchers. Intra-observer reliability was acceptable (i.e. proportional agreement >0.7 and kappa >0.4) for 10/10 questions. Inter-observer reliability was acceptable for only 3/10 questions. An acceptable inter-observer reliability combined with an acceptable criterion validity was obtained only for one working technique aspect, 'Quality of movements'. Thus, major elements of the cashiers' working technique could not be assessed with an acceptable accuracy from short periods of observations by one observer, such as often desired by practitioners. Practitioner Summary: We examined an observation protocol for assessing working technique in cash register work. It was feasible in use, but inter-observer reliability and criterion validity were generally not acceptable when working technique aspects were assessed from short periods of work. We recommend the protocol to be used for educational purposes only.
Chalam, Kakarla V.; Lin, Selina; Murthy, Ravi K.; Brar, Vikram S.; Gupta, Shailesh K.; Radhakrishnan, Ravi
2011-01-01
Purpose: The purpose was to determine if birth weight (BW) alone can be the sole criterion for screening infants at risk for retinopathy of prematurity (ROP). Materials and Methods: In this retrospective, observational case series, 208 infants were screened for ROP using the American Association for Pediatric Ophthalmology and Strabismus (AAPOS) Guidelines (1997). Variables examined included gestational age (GA), birth weight (BW), and a composite variable BWGA Index [(grams × weeks)/1000], which takes into consideration both the birth weight and gestational age of the infant. Infants were divided into two groups: Group 1, BW ≤1250 g, and Group 2, BW >1250 g. Multivariate analysis was performed to detect factors predictive of ROP. Receiver operator characteristic (ROC) curves were generated to determine the efficacy of screening using the BW, GA, and BWGA Index. Statistical analyses were performed with logistic regression with a P-value of 0.05 or less indicating statistical significance. Results: Varying stages of ROP were present in 116 of 416 eyes. Of the 105 eyes in Group 2, only 1 eye developed stage 1 ROP. Only Group 1 eyes developed stage 3 or higher ROP. The ROC curve for BW alone gave an area under the curve (AUC) of 0.797 (standard error [SE] = 0.0329, P < 0.0001); for GA, AUC was 0.801 (SE = 0.0340, P < 0.0001) and for the BWGA Index, the AUC was 0.808 (SE = 0.0324, P < 0.0001). Using 1250-g BW as a criterion for ROP screening would have decreased the number of screenings by 24%, and did not exclude any ROP higher than stage 1. Conclusion: Data from our neonatal intensive care unit suggest that birth weight ≤ 1250 g alone is an adequate parameter to identify premature infants at risk for ROP. PMID:21887076
Evaluation of a wearable physiological status monitor during simulated fire fighting activities.
Smith, Denise L; Haller, Jeannie M; Dolezal, Brett A; Cooper, Christopher B; Fehling, Patricia C
2014-01-01
A physiological status monitor (PSM) has been embedded in a fire-resistant shirt. The purpose of this research study was to examine the ability of the PSM-shirt to accurately detect heart rate (HR) and respiratory rate (RR) when worn under structural fire fighting personal protective equipment (PPE) during the performance of various activities relevant to fire fighting. Eleven healthy, college-aged men completed three activities (walking, searching/crawling, and ascending/descending stairs) that are routinely performed during fire fighting operations while wearing the PSM-shirt under structural fire fighting PPE. Heart rate and RR recorded by the PSM-shirt were compared to criterion values measured concurrently with an ECG and portable metabolic measurement system, respectively. For all activities combined (overall) and for each activity, small differences were found between the PSM-shirt and ECG (mean difference [95% CI]: overall: -0.4 beats/min [-0.8, -0.1]; treadmill: -0.4 beats/min [-0.7, -0.1]; search: -1.7 beats/min [-3.1, -.04]; stairs: 0.4 beats/min [0.04, 0.7]). Standard error of the estimate was 3.5 beats/min for all tasks combined and 1.9, 5.9, and 1.9 beats/min for the treadmill walk, search, and stair ascent/descent, respectively. Correlations between the PSM-shirt and criterion heart rates were high (r = 0.95 to r = 0.99). The mean difference between RR recorded by the PSM-shirt and criterion overall was 1.1 breaths/min (95% CI: -1.9 to -0.4). The standard error of the estimate for RR ranged from 4.2 breaths/min (treadmill) to 8.2 breaths/min (search), with an overall value of 6.2 breaths/min. These findings suggest that the PSM-shirt provides valid measures of HR and useful approximations of RR when worn during fire fighting duties.
Maryland School Performance Assessment Program (MSPAP), 1999. Technical Report.
ERIC Educational Resources Information Center
Maryland State Dept. of Education, Baltimore.
Maryland School Performance Assessment Program (MSPAP) assessments are criterion-referenced performance tests designed, developed, and implemented by the Maryland State Department of Education in collaboration with classroom teachers and other Maryland educators. MSPAP is the major strategy for implementing Maryland's educational reform…
Ornamental Horticulture Production Occupations. Curriculum Guide.
ERIC Educational Resources Information Center
Reneau, Fred; And Others
This curriculum guide contains guidesheets for the ornamental horticulture production occupations. Each guidesheet provides a job-relevant task; performance objective, with task, performance standard, source of standard, and conditions for performance of task; enabling objectives; a list of resources; teaching activities; a criterion-referenced…
Bashiri, Azadeh; Shahmoradi, Leila; Beigy, Hamid; Savareh, Behrouz A; Nosratabadi, Masood; N Kalhori, Sharareh R; Ghazisaeedi, Marjan
2018-06-01
Quantitative EEG gives valuable information in the clinical evaluation of psychological disorders. The purpose of the present study is to identify the most prominent features of quantitative electroencephalography (QEEG) that affect attention and response control parameters in children with attention deficit hyperactivity disorder. The QEEG features and the Integrated Visual and Auditory-Continuous Performance Test ( IVA-CPT) of 95 attention deficit hyperactivity disorder subjects were preprocessed by Independent Evaluation Criterion for Binary Classification. Then, the importance of selected features in the classification of desired outputs was evaluated using the artificial neural network. Findings uncovered the highest rank of QEEG features in each IVA-CPT parameters related to attention and response control. Using the designed model could help therapists to determine the existence or absence of defects in attention and response control relying on QEEG.
Cheng, Ruey-Kuang; MacDonald, Christopher J.; Williams, Christina L.; Meck, Warren H.
2008-01-01
Choline availability in the maternal diet has a lasting effect on brain and behavior of the offspring. To further delineate the impact of early nutritional status, we examined effects of prenatal-choline supplementation on timing, emotion, and memory performance of adult male and female rats. Rats that were given sufficient choline (CON: 1.1 g/kg) or supplemental choline (SUP: 5.0 g/kg) during embryonic days (ED) 12–17 were trained with a differential reinforcement of low-rate (DRL) schedule that was gradually transitioned through 5-, 10-, 18-, 36-, and 72-sec criterion times. We observed that SUP-females emitted more reinforced responses than CON-females, which were more efficient than both groups of males. In addition, SUP-males and SUP-females exhibited a reduction in burst responding (response latencies <2 sec) compared with both groups of CON rats. Furthermore, despite a reduced level of burst responding, the SUP-males made more nonreinforced responses prior to the DRL criterion as a result of maintaining the previous DRL criterion following transition to a new criterion. In summary, long-lasting effects of prenatal-choline supplementation were exhibited by reduced frustrative DRL responding in conjunction with the persistence of temporal memory in SUP-males and enhanced temporal exploration and response efficiency in SUP-females. PMID:18323570
NASA Astrophysics Data System (ADS)
Tan, Maxine; Li, Zheng; Moore, Kathleen; Thai, Theresa; Ding, Kai; Liu, Hong; Zheng, Bin
2016-03-01
Ovarian cancer is the second most common cancer amongst gynecologic malignancies, and has the highest death rate. Since the majority of ovarian cancer patients (>75%) are diagnosed in the advanced stage with tumor metastasis, chemotherapy is often required after surgery to remove the primary ovarian tumors. In order to quickly assess patient response to the chemotherapy in the clinical trials, two sets of CT examinations are taken pre- and post-therapy (e.g., after 6 weeks). Treatment efficacy is then evaluated based on Response Evaluation Criteria in Solid Tumors (RECIST) guideline, whereby tumor size is measured by the longest diameter on one CT image slice and only a subset of selected tumors are tracked. However, this criterion cannot fully represent the volumetric changes of the tumors and might miss potentially problematic unmarked tumors. Thus, we developed a new CAD approach to measure and analyze volumetric tumor growth/shrinkage using a cubic B-spline deformable image registration method. In this initial study, on 14 sets of pre- and post-treatment CT scans, we registered the two consecutive scans using cubic B-spline registration in a multiresolution (from coarse to fine) framework. We used Mattes mutual information metric as the similarity criterion and the L-BFGS-B optimizer. The results show that our method can quantify volumetric changes in the tumors more accurately than RECIST, and also detect (highlight) potentially problematic regions that were not originally targeted by radiologists. Despite the encouraging results of this preliminary study, further validation of scheme performance is required using large and diverse datasets in future.
Canny edge-based deformable image registration
NASA Astrophysics Data System (ADS)
Kearney, Vasant; Huang, Yihui; Mao, Weihua; Yuan, Baohong; Tang, Liping
2017-02-01
This work focuses on developing a 2D Canny edge-based deformable image registration (Canny DIR) algorithm to register in vivo white light images taken at various time points. This method uses a sparse interpolation deformation algorithm to sparsely register regions of the image with strong edge information. A stability criterion is enforced which removes regions of edges that do not deform in a smooth uniform manner. Using a synthetic mouse surface ground truth model, the accuracy of the Canny DIR algorithm was evaluated under axial rotation in the presence of deformation. The accuracy was also tested using fluorescent dye injections, which were then used for gamma analysis to establish a second ground truth. The results indicate that the Canny DIR algorithm performs better than rigid registration, intensity corrected Demons, and distinctive features for all evaluation matrices and ground truth scenarios. In conclusion Canny DIR performs well in the presence of the unique lighting and shading variations associated with white-light-based image registration.
Optimal firing rate estimation
NASA Technical Reports Server (NTRS)
Paulin, M. G.; Hoffman, L. F.
2001-01-01
We define a measure for evaluating the quality of a predictive model of the behavior of a spiking neuron. This measure, information gain per spike (Is), indicates how much more information is provided by the model than if the prediction were made by specifying the neuron's average firing rate over the same time period. We apply a maximum Is criterion to optimize the performance of Gaussian smoothing filters for estimating neural firing rates. With data from bullfrog vestibular semicircular canal neurons and data from simulated integrate-and-fire neurons, the optimal bandwidth for firing rate estimation is typically similar to the average firing rate. Precise timing and average rate models are limiting cases that perform poorly. We estimate that bullfrog semicircular canal sensory neurons transmit in the order of 1 bit of stimulus-related information per spike.
NASA Technical Reports Server (NTRS)
Heidmann, M F
1957-01-01
Characteristic exhaust velocity of a 200-pound-thrust rocket engine was evaluated for fuel temperatures of -90 degrees, and 200 degrees f with a spray formed by two impinging heptane jets reacting in a highly atomized oxygen atmosphere. Tests covered a range of mixture ratios and chamber lengths. The characteristic exhaust-velocity efficiency increased 2 percent for a 290 degree f increase in fuel temperature. This increase in performance can be compared with that obtained by increasing chamber length by about 1/2 inch. The result agrees with the fuel-temperature effect predicted from an analysis based on droplet evaporation theory. Mixture ratio markedly affected characteristic exhaust velocity efficiency, but total flow rate and fuel temperature did not.
Reum, J C P
2011-12-01
Three lipid correction models were evaluated for liver and white dorsal muscle from Squalus acanthias. For muscle, all three models performed well, based on the Akaike Information Criterion value corrected for small sample sizes (AIC(c) ), and predicted similar lipid corrections to δ(13) C that were up to 2.8 ‰ higher than those predicted using previously published models based on multispecies data. For liver, which possessed higher bulk C:N values compared to that of white muscle, all three models performed poorly and lipid-corrected δ(13) C values were best approximated by simply adding 5.74 ‰ to bulk δ(13) C values. © 2011 The Author. Journal of Fish Biology © 2011 The Fisheries Society of the British Isles.
Carballeira, C; Ramos-Gómez, J; Martín-Díaz, L; DelValls, T A
2012-06-01
Standard toxicity screening tests are useful tools in the management of impacted coastal ecosystems. To our knowledge, this is the first time that the sea urchin embryo development test has been used to evaluate the potential impact of effluents from land-based aquaculture farms in coastal areas. The toxicity of effluents from 8 land-based turbot farms was determined by calculating the percentage of abnormal larvae, according to two criteria: (a) standard, considering as normal pyramid-shaped larvae with differentiated components, and (b) skeletal, a new criterion that considers detailed skeletal characteristics. The skeletal criterion appeared to be more sensitive and enabled calculation of effective concentrations EC(5), EC(10), EC(20) and EC(50), unlike the classical criterion. Inclusion of the skeleton criterion in the sea urchin embryo development test may be useful for categorizing the relatively low toxicity of discharges from land-based marine fish farms. Further studies are encouraged to establish any causative relationships between pollutants and specific larval deformities. Copyright © 2012 Elsevier Ltd. All rights reserved.
Zubeidat, Ihab; Salinas, José María; Sierra, Juan Carlos; Fernández-Parra, Antonio
2007-01-01
In this study, we analyzed the reliability and validity of the Social Interaction Anxiety Scale (SIAS) and propose a separation criterion between youths with specific and generalized social anxiety and youths without social anxiety. A sample of 1012 Spanish youths attending school completed the SIAS, the Liebowitz Social Anxiety Scale, the Social Avoidance and Distress Scale, the Fear of Negative Evaluation Scale, the Youth Self-Report for Ages 11-18 and the Minnesota Multiphasic Personality Inventory-Adolescent. The factor analysis suggests the existence of three factors in the SIAS, the first two of which explain most of the variance of the construct assessed. Internal consistency is adequate in the first two factors. The SIAS features an adequate theoretical validity with the scores of different variables related to social interaction. Analysis of the criterion scores yields three groups pertaining to three clearly differentiated clusters. In the third cluster, two of social anxiety groups - specific and generalized - have been identified by means of a quantitative separation criterion.
The criterion for time symmetry of probabilistic theories and the reversibility of quantum mechanics
NASA Astrophysics Data System (ADS)
Holster, A. T.
2003-10-01
Physicists routinely claim that the fundamental laws of physics are 'time symmetric' or 'time reversal invariant' or 'reversible'. In particular, it is claimed that the theory of quantum mechanics is time symmetric. But it is shown in this paper that the orthodox analysis suffers from a fatal conceptual error, because the logical criterion for judging the time symmetry of probabilistic theories has been incorrectly formulated. The correct criterion requires symmetry between future-directed laws and past-directed laws. This criterion is formulated and proved in detail. The orthodox claim that quantum mechanics is reversible is re-evaluated. The property demonstrated in the orthodox analysis is shown to be quite distinct from time reversal invariance. The view of Satosi Watanabe that quantum mechanics is time asymmetric is verified, as well as his view that this feature does not merely show a de facto or 'contingent' asymmetry, as commonly supposed, but implies a genuine failure of time reversal invariance of the laws of quantum mechanics. The laws of quantum mechanics would be incompatible with a time-reversed version of our universe.
Sainz de Baranda, Pilar; Rodríguez-Iniesta, María; Ayala, Francisco; Santonja, Fernando; Cejudo, Antonio
2014-07-01
To examine the criterion-related validity of the horizontal hip joint angle (H-HJA) test and vertical hip joint angle (V-HJA) test for estimating hamstring flexibility measured through the passive straight-leg raise (PSLR) test using contemporary statistical measures. Validity study. Controlled laboratory environment. One hundred thirty-eight professional trampoline gymnasts (61 women and 77 men). Hamstring flexibility. Each participant performed 2 trials of H-HJA, V-HJA, and PSLR tests in a randomized order. The criterion-related validity of H-HJA and V-HJA tests was measured through the estimation equation, typical error of the estimate (TEEST), validity correlation (β), and their respective confidence limits. The findings from this study suggest that although H-HJA and V-HJA tests showed moderate to high validity scores for estimating hamstring flexibility (standardized TEEST = 0.63; β = 0.80), the TEEST statistic reported for both tests was not narrow enough for clinical purposes (H-HJA = 10.3 degrees; V-HJA = 9.5 degrees). Subsequently, the predicted likely thresholds for the true values that were generated were too wide (H-HJA = predicted value ± 13.2 degrees; V-HJA = predicted value ± 12.2 degrees). The results suggest that although the HJA test showed moderate to high validity scores for estimating hamstring flexibility, the prediction intervals between the HJA and PSLR tests are not strong enough to suggest that clinicians and sport medicine practitioners should use the HJA and PSLR tests interchangeably as gold standard measurement tools to evaluate and detect short hamstring muscle flexibility.
An Approach to the Evaluation of Hypermedia.
ERIC Educational Resources Information Center
Knussen, Christina; And Others
1991-01-01
Discusses methods that may be applied to the evaluation of hypermedia, based on six models described by Lawton. Techniques described include observation, self-report measures, interviews, automated measures, psychometric tests, checklists and criterion-based techniques, process models, Experimentally Measuring Usability (EMU), and a naturalistic…
ERIC Educational Resources Information Center
Kern, Richard
1985-01-01
A computer-based interactive system for diagnosing academic and school behavior problems is described. Elements include criterion-referenced testing, an instructional management system, and a behavior evaluation tool developed by the author. (JW)
Schemes for efficient transmission of encoded video streams on high-speed networks
NASA Astrophysics Data System (ADS)
Ramanathan, Srinivas; Vin, Harrick M.; Rangan, P. Venkat
1994-04-01
In this paper, we argue that significant performance benefits can accrue if integrated networks implement application-specific mechanisms that account for the diversities in media compression schemes. Towards this end, we propose a simple, yet effective, strategy called Frame Induced Packet Discarding (FIPD), in which, upon detection of loss of a threshold number (determined by an application's video encoding scheme) of packets belonging to a video frame, the network attempts to discard all the remaining packets of that frame. In order to analytically quantify the performance of FIPD so as to obtain fractional frame losses that can be guaranteed to video channels, we develop a finite state, discrete time markov chain model of the FIPD strategy. The fractional frame loss thus computed can serve as the criterion for admission control at the network. Performance evaluations demonstrate the utility of the FIPD strategy.
Assessment of Communications-related Admissions Criteria in a Three-year Pharmacy Program
Tejada, Frederick R.; Lang, Lynn A.; Purnell, Miriam; Acedera, Lisa; Ngonga, Ferdinand
2015-01-01
Objective. To determine if there is a correlation between TOEFL and other admissions criteria that assess communications skills (ie, PCAT variables: verbal, reading, essay, and composite), interview, and observational scores and to evaluate TOEFL and these admissions criteria as predictors of academic performance. Methods. Statistical analyses included two sample t tests, multiple regression and Pearson’s correlations for parametric variables, and Mann-Whitney U for nonparametric variables, which were conducted on the retrospective data of 162 students, 57 of whom were foreign-born. Results. The multiple regression model of the other admissions criteria on TOEFL was significant. There was no significant correlation between TOEFL scores and academic performance. However, significant correlations were found between the other admissions criteria and academic performance. Conclusion. Since TOEFL is not a significant predictor of either communication skills or academic success of foreign-born PharmD students in the program, it may be eliminated as an admissions criterion. PMID:26430273
Assessment of Communications-related Admissions Criteria in a Three-year Pharmacy Program.
Parmar, Jayesh R; Tejada, Frederick R; Lang, Lynn A; Purnell, Miriam; Acedera, Lisa; Ngonga, Ferdinand
2015-08-25
To determine if there is a correlation between TOEFL and other admissions criteria that assess communications skills (ie, PCAT variables: verbal, reading, essay, and composite), interview, and observational scores and to evaluate TOEFL and these admissions criteria as predictors of academic performance. Statistical analyses included two sample t tests, multiple regression and Pearson's correlations for parametric variables, and Mann-Whitney U for nonparametric variables, which were conducted on the retrospective data of 162 students, 57 of whom were foreign-born. The multiple regression model of the other admissions criteria on TOEFL was significant. There was no significant correlation between TOEFL scores and academic performance. However, significant correlations were found between the other admissions criteria and academic performance. Since TOEFL is not a significant predictor of either communication skills or academic success of foreign-born PharmD students in the program, it may be eliminated as an admissions criterion.
A Shot Number Based Approach to Performance Analysis in Table Tennis
Yoshida, Kazuto; Yamada, Koshi
2017-01-01
Abstract The current study proposes a novel approach that improves the conventional performance analysis in table tennis by introducing the concept of frequency, or the number of shots, of each shot number. The improvements over the conventional method are as follows: better accuracy of the evaluation of skills and tactics of players, additional insights into scoring and returning skills and ease of understanding the results with a single criterion. The performance analysis of matches played at the 2012 Summer Olympics in London was conducted using the proposed method. The results showed some effects of the shot number and gender differences in table tennis. Furthermore, comparisons were made between Chinese players and players from other countries, what threw light on the skills and tactics of the Chinese players. The present findings demonstrate that the proposed method provides useful information and has some advantages over the conventional method. PMID:28210334
Retrospective voting and party support at elections: credit and blame for government and opposition
Plescia, Carolina; Kritzinger, Sylvia
2017-01-01
ABSTRACT Retrospective voting is arguably one of the most important mechanisms of representative democracy, and whether or not the public holds the government accountable for its policy performance has been extensively studied. In this paper, we test whether retrospective voting extends to parties in the opposition, that is whether and how parties’ past performance evaluations affect their vote, regardless of whether they were in government or in opposition. Taking advantage of a rich set of questions embedded in a representative German national elections panel, we update our knowledge on the retrospective voting mechanism by modeling retrospective voting at the party level. The findings indicate that the incumbent status is not the only criterion for retrospective voting, ultimately suggesting that both government and opposition parties can expect credit and blame for their conduct and this should provide some impetus for responsive performance of all parties. PMID:28515772
Practical Study for the Properties of Hueckel Edge Detection Operator
NASA Astrophysics Data System (ADS)
Jabbar, Hameed M. Abdul; Hatem, Amal J.; Ameer, Inbethaq M. A. Abdul
2018-05-01
The first practical study for the Hueckel edge detection operator was presented in this research, where it is tested on standard step edge set images. A number of criteria were adopted to evaluate its practical performance, which is the accuracy in detecting the edges direction, the error in the edges location (dislocation), edges width, the calculated edge goodness criterion and the consumed execution time. These criteria were studied with the edge direction and the used disk radius of the Hueckel edge detection operator. Important notes were recorded for the performance of this operator depending on the direction of the edge and/or with the radius of the used disk. There is a variation in the performance of the operator in terms of precision in detecting of the edges direction and position. A discussion was presented for the all criteria adopted in the research.
Particle-size distribution models for the conversion of Chinese data to FAO/USDA system.
Shangguan, Wei; Dai, YongJiu; García-Gutiérrez, Carlos; Yuan, Hua
2014-01-01
We investigated eleven particle-size distribution (PSD) models to determine the appropriate models for describing the PSDs of 16349 Chinese soil samples. These data are based on three soil texture classification schemes, including one ISSS (International Society of Soil Science) scheme with four data points and two Katschinski's schemes with five and six data points, respectively. The adjusted coefficient of determination r (2), Akaike's information criterion (AIC), and geometric mean error ratio (GMER) were used to evaluate the model performance. The soil data were converted to the USDA (United States Department of Agriculture) standard using PSD models and the fractal concept. The performance of PSD models was affected by soil texture and classification of fraction schemes. The performance of PSD models also varied with clay content of soils. The Anderson, Fredlund, modified logistic growth, Skaggs, and Weilbull models were the best.
A Pilot Opinion Study of Lateral Control Requirements for Fighter-Type Aircraft
NASA Technical Reports Server (NTRS)
Creer, Brent Y.; Stewart, John D.; Merrick, Robert B.; Drinkwater, Fred J., III
1959-01-01
As part of a continuing NASA program of research on airplane handling qualities, a pilot opinion investigation has been made on the lateral control requirements of fighter aircraft flying in their combat speed range. The investigation was carried out using a stationary flight simulator and a moving flight simulator, and the flight simulator results were supplemented by research tests in actual flight. The flight simulator study was based on the presumption that the pilot rates the roll control of an airplane primarily on a single-degree-of-freedom basis; that is, control of angle of roll about the aircraft body axis being of first importance. From the assumption of a single degree of freedom system it follows that there are two fundamental parameters which govern the airplane roll response, namely the roll damping expressed as a time constant and roll control power in terms of roll acceleration. The simulator study resulted in a criterion in terms of these two parameters which defines satisfactory, unsatisfactory, and unacceptable roll performance from a pilot opinion standpoint. The moving simulator results were substantiated by the in-flight investigation. The derived criterion was compared with the roll performance criterion based upon wing tip helix angle and also with other roll performance concepts which currently influence the roll performance design of military fighter aircraft flying in their combat speed range.
NASA Technical Reports Server (NTRS)
Mikulas, Martin M., Jr.; Sumpter, Rod
1999-01-01
In a previous paper, a new merit function for determining the strength performance of flawed composite laminates was presented. This previous analysis was restricted to circular hole flaws that were large enough that failure could be predicted using the laminate stress concentration factor. In this paper, the merit function is expanded to include the flaw cases of an arbitrary size circular hole or a center crack. Failure prediction for these cases is determined using the point stress criterion. An example application of the merit function is included for a wide range of graphite/epoxy laminates.
NASA Technical Reports Server (NTRS)
Martin, Mikulas M., Jr.; Sumpter, Rod
2000-01-01
In a previous paper, a new merit function for determining the strength performance of flawed composite laminates was presented. This previous analysis was restricted to circular hole flaws that were large enough that failure could be predicted using the laminate stress concentration factor. In this paper, the merit function is expanded to include the flaw cases of an arbitrary size circular hole or center crack. Failure prediction for these cases is determined using the point stress criterion. An example application of the merit function is included for a wide range of graphite/epoxy laminates.
NASA Technical Reports Server (NTRS)
Mikulas, Martin M., Jr.; Sumpter, Rod
1997-01-01
In a previous paper, a new merit function for determining the strength performance of flawed composite laminates was presented. This previous analysis was restricted to circular hole flaws that were large enough that failure could be predicted using the laminate stress concentration factor. In this paper, the merit function is expanded to include the flaw cases of an arbitrary size circular hole or a center crack. Failure prediction for these cases is determined using the point stress criterion. An example application of the merit function is included for a wide range of graphite/epoxy laminates.
Contamination of commercial cane sugars by some organic acids and some inorganic anions.
Wojtczak, Maciej; Antczak, Aneta; Lisik, Krystyna
2013-01-01
The aim of the paper was the identification and the quantitative evaluation of the following inorganic anions: chloride, phosphate, nitrate, nitrite, sulphate and the following organic acids: lactic, acetic, formic, malic and citric in commercial "unrefined" brown cane sugars and in cane raw sugars. The determination was carried out by high performance anion exchange chromatography with conductivity detector HPAEC-CD. The conducted analyses have shown that the content of some inorganic anions and organic acids in cane sugars may be an important criterion of the quality of commercial "unrefined" brown cane sugars. Copyright © 2012 Elsevier Ltd. All rights reserved.
Global Education Implications of the Foreign Pharmacy Graduate Equivalency Examination
Clauson, Kevin A.; Latif, David A.; Al-Rousan, Rabaa M.
2010-01-01
Although the Foreign Pharmacy Graduate Equivalency Examination (FPGEE) is not intended to measure educational outcomes or institutional effectiveness, it may be a reliable and valid criterion to assess the quality or success of international pharmacy programs. This comprehensive review describes the evolution and historical milestones of the FPGEE, along with trends in structure, administration, and passing rates, and the impact of country of origin on participant performance. Similarities between the FPGEE and the Pharmacy Curriculum Outcomes Assessment (PCOA) are also explored. This paper aims to provide a global prospective and insight for foreign academic institutions into parameters for evaluating their students' educational capabilities. PMID:20798798
Trellis coding techniques for mobile communications
NASA Technical Reports Server (NTRS)
Divsalar, D.; Simon, M. K.; Jedrey, T.
1988-01-01
A criterion for designing optimum trellis codes to be used over fading channels is given. A technique is shown for reducing certain multiple trellis codes, optimally designed for the fading channel, to conventional (i.e., multiplicity one) trellis codes. The computational cutoff rate R0 is evaluated for MPSK transmitted over fading channels. Examples of trellis codes optimally designed for the Rayleigh fading channel are given and compared with respect to R0. Two types of modulation/demodulation techniques are considered, namely coherent (using pilot tone-aided carrier recovery) and differentially coherent with Doppler frequency correction. Simulation results are given for end-to-end performance of two trellis-coded systems.
Clinical evaluation of melanomas and common nevi by spectral imaging
Diebele, Ilze; Kuzmina, Ilona; Lihachev, Alexey; Kapostinsh, Janis; Derjabo, Alexander; Valeine, Lauma; Spigulis, Janis
2012-01-01
A clinical trial on multi-spectral imaging of malignant and non-malignant skin pathologies comprising 17 melanomas and 65 pigmented common nevi was performed. Optical density data of skin pathologies were obtained in the spectral range 450–950 nm using the multispectral camera Nuance EX. An image parameter and maps capable of distinguishing melanoma from pigmented nevi were proposed. The diagnostic criterion is based on skin optical density differences at three fixed wavelengths: 540nm, 650nm and 950nm. The sensitivity and specificity of this method were estimated to be 94% and 89%, respectively. The proposed methodology and potential clinical applications are discussed. PMID:22435095
Quadrotor trajectory tracking using PID cascade control
NASA Astrophysics Data System (ADS)
Idres, M.; Mustapha, O.; Okasha, M.
2017-12-01
Quadrotors have been applied to collect information for traffic, weather monitoring, surveillance and aerial photography. In order to accomplish their mission, quadrotors have to follow specific trajectories. This paper presents proportional-integral-derivative (PID) cascade control of a quadrotor for path tracking problem when velocity and acceleration are small. It is based on near hover controller for small attitude angles. The integral of time-weighted absolute error (ITAE) criterion is used to determine the PID gains as a function of quadrotor modeling parameters. The controller is evaluated in three-dimensional environment in Simulink. Overall, the tracking performance is found to be excellent for small velocity condition.
Entanglement-enhanced Neyman-Pearson target detection using quantum illumination
NASA Astrophysics Data System (ADS)
Zhuang, Quntao; Zhang, Zheshen; Shapiro, Jeffrey H.
2017-08-01
Quantum illumination (QI) provides entanglement-based target detection---in an entanglement-breaking environment---whose performance is significantly better than that of optimum classical-illumination target detection. QI's performance advantage was established in a Bayesian setting with the target presumed equally likely to be absent or present and error probability employed as the performance metric. Radar theory, however, eschews that Bayesian approach, preferring the Neyman-Pearson performance criterion to avoid the difficulties of accurately assigning prior probabilities to target absence and presence and appropriate costs to false-alarm and miss errors. We have recently reported an architecture---based on sum-frequency generation (SFG) and feedforward (FF) processing---for minimum error-probability QI target detection with arbitrary prior probabilities for target absence and presence. In this paper, we use our results for FF-SFG reception to determine the receiver operating characteristic---detection probability versus false-alarm probability---for optimum QI target detection under the Neyman-Pearson criterion.
Refining a health-related quality of life assessment strategy for solid organ transplant patients.
Feurer, Irene D; Moore, Derek E; Speroff, Theodore; Liu, Hongxia; Payne, Jerita; Harrison, Connie; Pinson, C Wright
2004-01-01
The psychometric properties of generic health-related quality of life (HRQOL) assessment instruments were evaluated to identify a reliable, valid, and non-redundant battery to measure longitudinal outcomes in organ transplant patients. Objective functional performance and subjective HRQOL were assessed in 371 solid organ (liver, heart, kidney, lung) transplant patients using the Karnofsky scale, the SF-36 Health Survey (SF-36), and Psychosocial Adjustment to Illness Scale (PAIS). The surveys' internal-consistency reliability, criterion-related validity, and redundancy were tested. The SF-36 mental (MCS) and physical components (PCS), and PAIS summary scales were internally consistent (all alpha > or = 0.83). Four out of seven PAIS scales (vocational, domestic, sexual, social) were collectively associated with the PCS (R = 0.65, P < 0.001), as was functional performance (r = 0.52, P < 0.001). Three PAIS scales (family, social, psychological distress) were associated with the MCS (R = 0.72, P < 0.001). Only the PAIS healthcare orientation (satisfaction) scale was not associated with the SF-36((R)). The relationship between functional performance and the PCS is stronger (r = 0.52, P < 0.001) than with the MCS (r = 0.25, P < 0.001) and the PAIS global score (r = 0.37, P < 0.001). The SF-36 and PAIS are internally consistent and exhibit divergent criterion-related validity but, with the exception of the PAIS healthcare orientation scale, are statistically redundant. The advantages of the SF-36 include wider use, more norms, and a lesser response burden. A transplant-specific patient satisfaction inventory was indicated and was developed.
NASA Astrophysics Data System (ADS)
Zhang, Z. Fred
2016-06-01
A surface barrier is a commonly used technology for isolation of subsurface contaminants. Surface barriers for isolating radioactive waste are expected to perform for centuries to millennia, yet there are very few data for field-scale surface barriers for periods approaching a decade or longer. The Prototype Hanford Barrier (PHB) with a design life of 1000 years was constructed over an existing radioactive waste site in 1994 to demonstrate its long-term performance. The primary element of the PHB is an evapotranspiration-capillary (ETC) barrier in which precipitation water is stored in a fine-textured soil layer and later released to the atmosphere via evapotranspiration. To address the barrier performance under extreme conditions, this study included an enhanced precipitation stress test from 1995 to 1997 to determine barrier response to extreme precipitation events. During this period a 1000 year 24 h return rainstorm was simulated in March every year. The loss of vegetation on barrier hydrology was tested with a controlled fire test in 2008. The 19 year monitoring record shows that the store-and-release mechanism worked as well as or better than the design criterion. Average drainage from the ETC barrier amounted to an average of 0.005 mm yr-1, which is well below the design criterion of 0.5 mm yr-1. After a simulated wildfire, the naturally reestablished vegetation and increased evaporation combined to release the stored water and summer precipitation to the atmosphere such that drainage did not occur in the 5 years subsequent to the fire.
Zhong, Shangping; Chen, Tianshun; He, Fengying; Niu, Yuzhen
2014-09-01
For a practical pattern classification task solved by kernel methods, the computing time is mainly spent on kernel learning (or training). However, the current kernel learning approaches are based on local optimization techniques, and hard to have good time performances, especially for large datasets. Thus the existing algorithms cannot be easily extended to large-scale tasks. In this paper, we present a fast Gaussian kernel learning method by solving a specially structured global optimization (SSGO) problem. We optimize the Gaussian kernel function by using the formulated kernel target alignment criterion, which is a difference of increasing (d.i.) functions. Through using a power-transformation based convexification method, the objective criterion can be represented as a difference of convex (d.c.) functions with a fixed power-transformation parameter. And the objective programming problem can then be converted to a SSGO problem: globally minimizing a concave function over a convex set. The SSGO problem is classical and has good solvability. Thus, to find the global optimal solution efficiently, we can adopt the improved Hoffman's outer approximation method, which need not repeat the searching procedure with different starting points to locate the best local minimum. Also, the proposed method can be proven to converge to the global solution for any classification task. We evaluate the proposed method on twenty benchmark datasets, and compare it with four other Gaussian kernel learning methods. Experimental results show that the proposed method stably achieves both good time-efficiency performance and good classification performance. Copyright © 2014 Elsevier Ltd. All rights reserved.
Viyanchi, Amir; Rajabzadeh Ghatari, Ali; Rasekh, Hamid Reza; SafiKhani, HamidReza
2016-01-01
The purposes of our study were to identify a drug entry process, collect, and prioritize criteria for selecting drugs for the list of basic health insurance commitments to prepare an "evidence based reimbursement eligibility plan" in Iran. The 128 noticeable criteria were found when studying the health insurance systems of developed countries. Four parts (involving criteria) formed the first questionnaire: evaluation of evidences quality, clinical evaluation, economic evaluation, and managerial appraisal. The 85 experts (purposed sampling) were asked to mark the importance of each criterion from 1 to 100 as 1 representing the least and 100 the most important criterion and 45 out of them replied completely. Then, in the next questionnaire, we evaluated the 48 remainder criteria by the same45 participants under four sub-criteria (Cost calculation simplicity, Interpretability, Precision, and Updating capability of a criterion). After collecting the replies, the remainder criteria were ranked by TOPSIS method. Softwares "SPSS" 17 and Excel 2007 were used. The ranks of the five most important criteria which were found for drug approval based on TOPSIS are as follows: 1-domestic production (0.556), 2-duration of using (0.399), 3-independence of the assessment group (0.363) 4-impact budgeting (0.362) 5-decisions of other countries about the same drug (0.358). The numbers in parenthesis are relative closeness alternatives in relation to the ideal solution. This model gave a scientific model for judging fairly on the acceptance of novelty medicines.
The performance of trellis coded multilevel DPSK on a fading mobile satellite channel
NASA Technical Reports Server (NTRS)
Simon, Marvin K.; Divsalar, Dariush
1987-01-01
The performance of trellis coded multilevel differential phase-shift-keying (MDPSK) over Rician and Rayleigh fading channels is discussed. For operation at L-Band, this signalling technique leads to a more robust system than the coherent system with dual pilot tone calibration previously proposed for UHF. The results are obtained using a combination of analysis and simulation. The analysis shows that the design criterion for trellis codes to be operated on fading channels with interleaving/deinterleaving is no longer free Euclidean distance. The correct design criterion for optimizing bit error probability of trellis coded MDPSK over fading channels will be presented along with examples illustrating its application.
EPA conducted a study to evaluate the effect of coatings on dislodgeable arsenic, chromium, and copper residues on the surfaces of chromated copper arsenate (CAA) treated wood. Dislodgeable CCA, determined by wipe sampling the wood surfaces, was the primary evaluation criterion f...
Jaman, Ajmery; Latif, Mahbub A H M; Bari, Wasimul; Wahed, Abdus S
2016-05-20
In generalized estimating equations (GEE), the correlation between the repeated observations on a subject is specified with a working correlation matrix. Correct specification of the working correlation structure ensures efficient estimators of the regression coefficients. Among the criteria used, in practice, for selecting working correlation structure, Rotnitzky-Jewell, Quasi Information Criterion (QIC) and Correlation Information Criterion (CIC) are based on the fact that if the assumed working correlation structure is correct then the model-based (naive) and the sandwich (robust) covariance estimators of the regression coefficient estimators should be close to each other. The sandwich covariance estimator, used in defining the Rotnitzky-Jewell, QIC and CIC criteria, is biased downward and has a larger variability than the corresponding model-based covariance estimator. Motivated by this fact, a new criterion is proposed in this paper based on the bias-corrected sandwich covariance estimator for selecting an appropriate working correlation structure in GEE. A comparison of the proposed and the competing criteria is shown using simulation studies with correlated binary responses. The results revealed that the proposed criterion generally performs better than the competing criteria. An example of selecting the appropriate working correlation structure has also been shown using the data from Madras Schizophrenia Study. Copyright © 2015 John Wiley & Sons, Ltd.
Promoted Combustion Test Data Re-Examined
NASA Technical Reports Server (NTRS)
Lewis, Michelle; Jeffers, Nathan; Stoltzfus, Joel
2010-01-01
Promoted combustion testing of metallic materials has been performed by NASA since the mid-1980s to determine the burn resistance of materials in oxygen-enriched environments. As the technolo gy has advanced, the method of interpreting, presenting, and applying the promoted combustion data has advanced as well. Recently NASA changed the bum criterion from 15 cm (6 in.) to 3 cm (1.2 in.). This new burn criterion was adopted for ASTM G 124, Standard Test Method for Determining the Combustion Behavior- of Metallic Materials in Oxygen-Enriched Atmospheres. Its effect on the test data and the latest method to display the test data will be discussed. Two specific examples that illustrate how this new criterion affects the burn/no-bum thresholds of metal alloys will also be presented.
Defect specific maintenance of SG tubes -- How safe is it?
DOE Office of Scientific and Technical Information (OSTI.GOV)
Cizelj, L.; Mavko, B.; Dvorsek, T.
1997-02-01
The efficiency of the defect specific plugging criterion for outside diameter stress corrosion cracking at tube support plates is assessed. The efficiency is defined by three parameters: (1) number of plugged tubes, (2) probability of steam generator tube rupture and (3) predicted accidental leak rate through the defects. A probabilistic model is proposed to quantify the probability of tube rupture, while procedures available in literature were used to define the accidental leak rates. The defect specific plugging criterion was then compared to the performance of traditional (45%) plugging criterion using realistic data from Krsko nuclear power plant. Advantages of themore » defect specific approach over the traditional one are clearly shown. Some hints on the optimization of safe life of steam generator are also given.« less
Verbalizing facial memory: criterion effects in verbal overshadowing.
Clare, Joseph; Lewandowsky, Stephan
2004-07-01
This article investigated the role of the recognition criterion in the verbal overshadowing effect (VOE). In 3 experiments, people witnessed an event, verbally described a perpetrator, and then attempted identification. The authors found in Experiment 1, which included a "not present" response option and both perpetrator-present (PP) and perpetrator-absent (PA) lineups, an increased reluctance to identify a person from both lineup types after verbalization. Experiment 2 incorporated a forced-choice procedure, and the authors found no effect of verbalization on identification performance. Experiment 3 replicated the essential aspects of these results. Consequently, the VOE may reflect a change in recognition criterion rather than a changed processing style or alteration of the underlying memory trace. This conclusion was confirmed by computational modeling of the data. Copyright 2004 APA, all rights reserved
NASA Astrophysics Data System (ADS)
Rashidi Moghaddam, M.; Ayatollahi, M. R.; Berto, F.
2018-01-01
The values of mode II fracture toughness reported in the literature for several rocks are studied theoretically by using a modified criterion based on strain energy density averaged over a control volume around the crack tip. The modified criterion takes into account the effect of T-stress in addition to the singular terms of stresses/strains. The experimental results are related to mode II fracture tests performed on the semicircular bend and Brazilian disk specimens. There are good agreements between theoretical predictions using the generalized averaged strain energy density criterion and the experimental results. The theoretical results reveal that the value of mode II fracture toughness is affected by the size of control volume around the crack tip and also the magnitude and sign of T-stress.
Obuchowski, N A
2001-10-15
Electronic medical images are an efficient and convenient format in which to display, store and transmit radiographic information. Before electronic images can be used routinely to screen and diagnose patients, however, it must be shown that readers have the same diagnostic performance with this new format as traditional hard-copy film. Currently, there exist no suitable definitions of diagnostic equivalence. In this paper we propose two criteria for diagnostic equivalence. The first criterion ('population equivalence') considers the variability between and within readers, as well as the mean reader performance. This criterion is useful for most applications. The second criterion ('individual equivalence') involves a comparison of the test results for individual patients and is necessary when patients are followed radiographically over time. We present methods for testing both individual and population equivalence. The properties of the proposed methods are assessed in a Monte Carlo simulation study. Data from a mammography screening study is used to illustrate the proposed methods and compare them with results from more conventional methods of assessing equivalence and inter-procedure agreement. Copyright 2001 John Wiley & Sons, Ltd.
Fatigue and Fracture-Toughness Characterization of SAW and SMA A537 Class I Ship-Steel Weldments.
1981-12-01
Charpy criterion and proposed NDT-DT criterion of Rolfe . Recommendations are made and further research is suggested to help clarify the assessment of...acceptable performance at -60aF. Likewise, at -60OF the NDT and DT data for these weldments marginally exceed the criteria proposed by Rolfe when the...exceed the CVN values equivalent to the 5/8 DT values required by Rolfe . The 5/8-inch dynamic-tear specimen is not recommended as a quality-control test
Evaluation of light-emitting diodes for signage applications
NASA Astrophysics Data System (ADS)
Freyssinier, Jean Paul; Zhou, Yutao; Ramamurthy, Vasudha; Bierman, Andrew; Bullough, John D.; Narendran, Nadarajah
2004-01-01
This paper outlines two parts of a study designed to evaluate the use of light-emitting diodes (LEDs) in channel-letter signs. The first part of the study evaluated the system performance of red LED signs and white LED signs against reference neon and cold-cathode signs. The results show a large difference between the actual performance and potential savings from red and white LEDs. Depending on the configuration, a red LED sign could use 20% to 60% less power than a neon sign at the same light output. The light output of the brightest white LED sign tested was 15% lower than the cold-cathode reference, but its power was 53% higher. It appears from this study that the most efficient white LED system is still 40% less efficient than the cold-cathode system tested. One area that offers a great potential for further energy savings is the acrylic diffuser of the signs. The acrylic diffusers measured absorb between 60% and 66% of the light output produced by the sign. Qualitative factors are also known to play an important role in signage systems. One of the largest issues with any new lighting technology is its acceptance by the end user. Consistency of light output and color among LEDs, even from the same manufacturing batch, and over time, are two of the major issues that also could affect the advantages of LEDs for signage applications. To evaluate different signage products and to identify the suitability of LEDs for this application, it is important to establish a criterion for brightness uniformity. Building upon this information, the second part of the study used human factors evaluations to determine a brightness-uniformity criterion for channel-letter signs. The results show that the contrast modulation between bright and dark areas within a sign seems to elicit the strongest effect on how people perceive uniformity. A strong monotonic relationship between modulation and acceptability was found in this evaluation. The effect of contrast seems to be stronger than that of spatial frequency or background luminance, particularly for contrast modulation values of less than 0.20 or greater than 0.60. A sign with luminance variations of less than 20% would be accepted by at least 80% of the population in any given context.
NASA Astrophysics Data System (ADS)
Paprocka, I.; Kempa, W. M.; Grabowik, C.; Kalinowski, K.; Krenczyk, D.
2016-08-01
In the paper a survey of predictive and reactive scheduling methods is done in order to evaluate how the ability of prediction of reliability characteristics influences over robustness criteria. The most important reliability characteristics are: Mean Time to Failure, Mean Time of Repair. Survey analysis is done for a job shop scheduling problem. The paper answers the question: what method generates robust schedules in the case of a bottleneck failure occurrence before, at the beginning of planned maintenance actions or after planned maintenance actions? Efficiency of predictive schedules is evaluated using criteria: makespan, total tardiness, flow time, idle time. Efficiency of reactive schedules is evaluated using: solution robustness criterion and quality robustness criterion. This paper is the continuation of the research conducted in the paper [1], where the survey of predictive and reactive scheduling methods is done only for small size scheduling problems.
A Simple Criterion to Estimate Performance of Pulse Jet Mixed Vessels
DOE Office of Scientific and Technical Information (OSTI.GOV)
Pease, Leonard F.; Bamberger, Judith A.; Mahoney, Lenna A.
Pulse jet mixed process vessels comprise a key element of the U.S. Department of Energy’s strategy to process millions of gallons of legacy nuclear waste slurries. Slurry suctioned into a pulse jet mixer (PJM) tube at the end of one pulse is pneumatically driven from the PJM toward the bottom of the vessel at the beginning of the next pulse, forming a jet. The jet front traverses the distance from nozzle outlet to the bottom of the vessel and spreads out radially. Varying numbers of PJMs are typically arranged in a ring configuration within the vessel at a selected radiusmore » and operated concurrently. Centrally directed radial flows from neighboring jets collide to create a central upwell that elevates the solids in the center of the vessel when the PJM tubes expel their contents. An essential goal of PJM operation is to elevate solids to the liquid surface to minimize stratification. Solids stratification may adversely affect throughput of the waste processing plant. Unacceptably high slurry densities at the base of the vessel may plug the pipeline through which the slurry exits the vessel. Additionally, chemical reactions required for processing may not achieve complete conversion. To avoid these conditions, a means of predicting the elevation to which the solids rise in the central upwell that can be used during vessel design remains essential. In this paper we present a simple criterion to evaluate the extent of solids elevation achieved by a turbulent upwell jet. The criterion asserts that at any location in the central upwell the local velocity must be in excess of a cutoff velocity to remain turbulent. We find that local velocities in excess of 0.6 m/s are necessary for turbulent jet flow through both Newtonian and yield stress slurries. By coupling this criterion with the free jet velocity equation relating the local velocity to elevation in the central upwell, we estimate the elevation at which turbulence fails, and consequently the elevation at which the upwell fails to further lift the slurry. Comparing this elevation to the vessel fill level predicts whether the jet flow will achieve the full vertical extent of the vessel at the center. This simple local-velocity criterion determines a minimum PJM nozzle velocity at which the full vertical extent of the central upwell in PJM vessels will be turbulent. The criterion determines a minimum because flow in regions peripheral to the central upwelling jet may not be turbulent, even when the center of the vessel in the upwell is turbulent, if the jet pulse duration is too short. The local-velocity criterion ensures only that there is sufficient wherewithal for the turbulent jet flow to drive solids to the surface in the center of the vessel in the central upwell.« less
Kernel learning at the first level of inference.
Cawley, Gavin C; Talbot, Nicola L C
2014-05-01
Kernel learning methods, whether Bayesian or frequentist, typically involve multiple levels of inference, with the coefficients of the kernel expansion being determined at the first level and the kernel and regularisation parameters carefully tuned at the second level, a process known as model selection. Model selection for kernel machines is commonly performed via optimisation of a suitable model selection criterion, often based on cross-validation or theoretical performance bounds. However, if there are a large number of kernel parameters, as for instance in the case of automatic relevance determination (ARD), there is a substantial risk of over-fitting the model selection criterion, resulting in poor generalisation performance. In this paper we investigate the possibility of learning the kernel, for the Least-Squares Support Vector Machine (LS-SVM) classifier, at the first level of inference, i.e. parameter optimisation. The kernel parameters and the coefficients of the kernel expansion are jointly optimised at the first level of inference, minimising a training criterion with an additional regularisation term acting on the kernel parameters. The key advantage of this approach is that the values of only two regularisation parameters need be determined in model selection, substantially alleviating the problem of over-fitting the model selection criterion. The benefits of this approach are demonstrated using a suite of synthetic and real-world binary classification benchmark problems, where kernel learning at the first level of inference is shown to be statistically superior to the conventional approach, improves on our previous work (Cawley and Talbot, 2007) and is competitive with Multiple Kernel Learning approaches, but with reduced computational expense. Copyright © 2014 Elsevier Ltd. All rights reserved.
Henry, Sharon M; Westervelt, Karen C
2005-06-01
Randomized controlled trial. To determine if supplementing typical clinical instruction with real-time ultrasound feedback facilitates performance and retention of the abdominal hollowing exercise (AHE). Increasingly clinicians are using real-time ultrasound imaging as a form of feedback when teaching patients trunk stabilization exercises; however, there has been no justification for this practice. Forty-eight subjects were divided randomly into 3 groups that received different types of feedback: group 1 received minimal verbal feedback, group 2 received verbal and palpatory feedback, and group 3 received real-time ultrasound, verbal, and palpatory feedback. If the subject performed 3 consecutive correct AHEs during the initial session, she/he returned for a retention test. The performance of 3 consecutive, correct AHEs was the criterion measure; the number of trials to criterion was also recorded during the initial and retention test sessions. The ability to perform the AHE differed among groups (P<.001). During the initial session, 12.5% of subjects in group 1, 50.0% of subjects in group 2, and 87.5% of subjects in group 3 were able to perform 3 consecutive AHEs. Group 3 subjects achieved the criterion in fewer trials than the other 2 groups (P = .0006). No differences among groups were found for the retention testing; however, low power due to fewer subjects precluded a strong interpretation of this finding. Real-time ultrasound feedback can decrease the number of trials needed to consistently perform the AHE; however, the data are inconclusive with regard to retention of this skill.
Comparison of two methods for detection of strain localization in sheet forming
NASA Astrophysics Data System (ADS)
Lumelskyj, Dmytro; Lazarescu, Lucian; Banabic, Dorel; Rojek, Jerzy
2018-05-01
This paper presents a comparison of two criteria of strain localization in experimental research and numerical simulation of sheet metal forming. The first criterion is based on the analysis of the through-thickness thinning (through-thickness strain) and its first time derivative in the most strained zone. The limit strain in the second method is determined by the maximum of the strain acceleration. Experimental and numerical investigation have been carried out for the Nakajima test performed for different specimens of the DC04 grade steel sheet. The strain localization has been identified by analysis of experimental and numerical curves showing the evolution of strains and their derivatives in failure zones. The numerical and experimental limit strains calculated from both criteria have been compared with the experimental FLC evaluated according to the ISO 12004-2 norm. It has been shown that the first method predicts formability limits closer to the experimental FLC. The second criterion predicts values of strains higher than FLC determined according to ISO norm. These values are closer to the strains corresponding to the fracture limit. The results show that analysis of strain evolution allows us to determine strain localization in numerical simulation and experimental studies.
Gerschutz, Maria J; Haynes, Michael L; Nixon, Derek; Colvin, James M
2012-01-01
A prosthesis encounters loading through forces and torques exerted by the person with amputation. International Organization for Standardization (ISO) standard 10328 was designed to test most lower-limb prosthetic components. However, this standard does not include prosthetic sockets. We measured static failure loads of prosthetic sockets using a modified ISO 10328 and then compared them with the criteria set by this standard for other components. Check socket (CS) strengths were influenced by thickness, material choice, and fabrication method. Copolymer socket (CP) strengths depended on thickness and fabrication methods. A majority of the CSs and all of the CPs failed to pass the ISO 10328 ductile loading criterion. In contrast, the strengths of definitive laminated sockets (DLs) were influenced more by construction material and technique. A majority of the DLs failed to pass the ISO 10328 brittle loading criterion. Analyzing prosthetic sockets from a variety of facilities demonstrated that socket performance varies considerably between and within facilities. The results from this article provide a foundation for understanding the quality of prosthetic sockets, some insight into possible routes for improving the current care delivered to patients, and a comparative basis for future technology.
Evaluation of purchase intention of customers in two wheeler automobile segment: AHP and TOPSIS
NASA Astrophysics Data System (ADS)
Sri Yogi, Kottala
2018-03-01
Winning heart of customers is preeminent main design of any business organization in global business environment. This paper explored customer’s priorities while purchasing a two wheeler automobile segment using Analytical Hierarchy Process (AHP) and Technique for Order Preference by Similarity to Ideal Solution (TOPSIS) as a multi criteria decision making tools to accomplish the research objectives. Study has been done to analyze different criteria to be considered during purchasing of two wheeler automobiles among respondents using structured questionnaire based on SAATY scale. Based on our previous work on empirical & fuzzy logic approach to product quality and purchase intention of customers in two wheeler- operational, performance, economic, brand value and maintenance aspects are considered as decision criteria of customers while purchasing a two wheeler. The study suggests high pick up during overtaking, petrol saving, reasonable spare parts price, unique in design and identity and easy to change gear as main criterion in purchasing process. We also found some leading two wheeler automobiles models available in Indian market using some objective function criterion in choosing some important characteristics like price, cylinder capacity, brake horse power and weight during purchasing process of two wheeler automobile in Indian market based on respondents perception.
Dispersant approval procedures in France and Italy: A comparative ecotoxicity study.
Manfra, Loredana; Tornambè, Andrea; Guyomarch, Julien; Le Guerrogue, Pascale; Kerambrun, Loïc; Rotini, Alice; Savorelli, Federica; Onorati, Fulvio; Magaletti, Erika
2017-09-01
A research project has been performed to the request of the RAMOGE Executive Secretariat to identify differences between dispersant approval procedures in France and Italy and propose ways to harmonize them. A collaborative study has been conducted by CEDRE (Centre of Documentation, Research and Experimentation on Accidental Water Pollution) and ISPRA (Italian Institute for Environmental Protection and Research) to: a) compare current approval procedures in Italy and France with identification of differences and commonalities; b) carry out toxicity tests using both procedures on two selected dispersants; c) propose a common approach between Italy and France. The results showed that, because of the differences in ecotoxicological tests and in the evaluation criteria used, the outcomes on the same products could be different in Italy and in France. Both tested dispersants met the French requirements for approval (LC 50 ≥ 10 times reference toxicant), while only one dispersant met the Italian approval criterion (EC 50 > 10mg/L). A possible way of harmonizing the approval procedures could be to increase the number of test organisms in the French procedure, which currently only uses one crustacean species. Furthermore, a common criterion for toxicity assessment should be discussed and agreed. Copyright © 2017. Published by Elsevier Inc.
Chagas disease in bone marrow transplantation: an approach to preemptive therapy.
Altclas, J; Sinagra, A; Dictar, M; Luna, C; Verón, M T; De Rissio, A M; García, M M; Salgueira, C; Riarte, A
2005-07-01
The efficacy of preemptive therapy was evaluated in bone marrow transplantation (BMT) recipients associated with Chagas disease (CD). The criterion to include patients in the protocol was the serological reactivity for CD in recipients and/or donors before transplant. After BMT, the monitoring was performed using the direct Strout method (SM), which detects clinical levels of Trypanosome cruzi parasitemia, and CD conventional serological tests. Monitoring took place during 60 days in ABMT and throughout the immunosuppressive period in allogeneic BMT. Reactivation of CD was diagnosed by detecting T. cruzi parasites in blood or tissues. In primary T. cruzi infection, an additional diagnostic criterion was the serological conversion. A total of 25 CD-BMT patients were included. Two ABMT and four allogeneic BMT recipients showed CD recurrences diagnosed by SM. One patient also showed skin lesions with T. cruzi amastigotes. Benznidazole treatment (Roche Lab), an antiparasitic drug, was prescribed at a dose of 5 mg/kg/day during 4-8 weeks with recovery of patients. Primary T. cruzi infection was not observed. This report proves the relevance of monitoring CD in BMT patients and demonstrates that preemptive therapy was able to abrogate the development of clinical and systemic disease.