Validation of a Scalable Solar Sailcraft
NASA Technical Reports Server (NTRS)
Murphy, D. M.
2006-01-01
The NASA In-Space Propulsion (ISP) program sponsored intensive solar sail technology and systems design, development, and hardware demonstration activities over the past 3 years. Efforts to validate a scalable solar sail system by functional demonstration in relevant environments, together with test-analysis correlation activities on a scalable solar sail system have recently been successfully completed. A review of the program, with descriptions of the design, results of testing, and analytical model validations of component and assembly functional, strength, stiffness, shape, and dynamic behavior are discussed. The scaled performance of the validated system is projected to demonstrate the applicability to flight demonstration and important NASA road-map missions.
The Deaf Acculturation Scale (DAS): Development and Validation of a 58-Item Measure
Maxwell-McCaw, Deborah; Zea, Maria Cecilia
2011-01-01
This study involved the development and validation of the Deaf Acculturation Scale (DAS), a new measure of cultural identity for Deaf and hard-of-hearing (hh) populations. Data for this study were collected online and involved a nation-wide sample of 3,070 deaf/hh individuals. Results indicated strong internal reliabilities for all the subscales, and construct validity was established by demonstrating that the DAS could discriminate groups based on parental hearing status, school background, and use of self-labels. Construct validity was further demonstrated through factorial analyses, and findings resulted in a final 58-item measure. Directions for future research are discussed. PMID:21263041
Paliwal, Nikhil; Damiano, Robert J; Varble, Nicole A; Tutino, Vincent M; Dou, Zhongwang; Siddiqui, Adnan H; Meng, Hui
2017-12-01
Computational fluid dynamics (CFD) is a promising tool to aid in clinical diagnoses of cardiovascular diseases. However, it uses assumptions that simplify the complexities of the real cardiovascular flow. Due to high-stakes in the clinical setting, it is critical to calculate the effect of these assumptions in the CFD simulation results. However, existing CFD validation approaches do not quantify error in the simulation results due to the CFD solver's modeling assumptions. Instead, they directly compare CFD simulation results against validation data. Thus, to quantify the accuracy of a CFD solver, we developed a validation methodology that calculates the CFD model error (arising from modeling assumptions). Our methodology identifies independent error sources in CFD and validation experiments, and calculates the model error by parsing out other sources of error inherent in simulation and experiments. To demonstrate the method, we simulated the flow field of a patient-specific intracranial aneurysm (IA) in the commercial CFD software star-ccm+. Particle image velocimetry (PIV) provided validation datasets for the flow field on two orthogonal planes. The average model error in the star-ccm+ solver was 5.63 ± 5.49% along the intersecting validation line of the orthogonal planes. Furthermore, we demonstrated that our validation method is superior to existing validation approaches by applying three representative existing validation techniques to our CFD and experimental dataset, and comparing the validation results. Our validation methodology offers a streamlined workflow to extract the "true" accuracy of a CFD solver.
Space Technology 5 - A Successful Micro-Satellite Constellation Mission
NASA Technical Reports Server (NTRS)
Carlisle, Candace; Webb, Evan H.
2007-01-01
The Space Technology 5 (ST5) constellation of three micro-satellites was launched March 22, 2006. During the three-month flight demonstration phase, the ST5 team validated key technologies that will make future low-cost micro-sat constellations possible, demonstrated operability concepts for future micro-sat science constellation missions, and demonstrated the utility of a micro-satellite constellation to perform research-quality science. The ST5 mission was successfully completed in June 2006, demonstrating high-quality science and technology validation results.
Cross-Validation of the Africentrism Scale.
ERIC Educational Resources Information Center
Kwate, Naa Oyo A.
2003-01-01
Cross-validated the Africentrism Scale, investigating the relationship between Africentrism and demographic variables in a diverse sample of individuals of African descent. Results indicated that the scale demonstrated solid internal consistency and convergent validity. Age and education related to Africentrism, with younger and less educated…
NASA Technical Reports Server (NTRS)
Litt, Jonathan S.; Sowers, T Shane; Liu, Yuan; Owen, A. Karl; Guo, Ten-Huei
2015-01-01
The National Aeronautics and Space Administration (NASA) has developed independent airframe and engine models that have been integrated into a single real-time aircraft simulation for piloted evaluation of propulsion control algorithms. In order to have confidence in the results of these evaluations, the integrated simulation must be validated to demonstrate that its behavior is realistic and that it meets the appropriate Federal Aviation Administration (FAA) certification requirements for aircraft. The paper describes the test procedures and results, demonstrating that the integrated simulation generally meets the FAA requirements and is thus a valid testbed for evaluation of propulsion control modes.
Validation of an Empathy Scale in Pharmacy and Nursing Students
Chen, Aleda M. H.; Yehle, Karen S.; Plake, Kimberly S.
2013-01-01
Objective. To validate an empathy scale to measure empathy in pharmacy and nursing students. Methods. A 15-item instrument comprised of the cognitive and affective empathy domains, was created. Each item was rated using a 7-point Likert scale, ranging from strongly disagree to strongly agree. Concurrent validity was demonstrated with the Jefferson Scale of Empathy – Health Professional Students (JSE-HPS). Results. Reliability analysis of data from 216 students (pharmacy, N=158; nursing, N=58) showed that scores on the empathy scale were positively associated with JSE-HPS scores (p<0.001). Factor analysis confirmed that 14 of the 15 items were significantly associated with their respective domain, but the overall instrument had limited goodness of fit. Conclusions. Results of this study demonstrate the reliability and validity of a new scale for evaluating student empathy. Further testing of the scale at other universities is needed to establish validity. PMID:23788805
Construct Validation of the Piers-Harris Children's Self Concept Scale.
ERIC Educational Resources Information Center
Franklin, Melvin R., Jr.; And Others
1981-01-01
Results indicated that the Piers-Harris Children's Self Concept Scale demonstrates both convergent and discriminant validity in an assessment of a relatively stable and internally consistent construct. (Author/BW)
Methodology, Methods, and Metrics for Testing and Evaluating Augmented Cognition Systems
DOE Office of Scientific and Technical Information (OSTI.GOV)
Greitzer, Frank L.
The augmented cognition research community seeks cognitive neuroscience-based solutions to improve warfighter performance by applying and managing mitigation strategies to reduce workload and improve the throughput and quality of decisions. The focus of augmented cognition mitigation research is to define, demonstrate, and exploit neuroscience and behavioral measures that support inferences about the warfighter’s cognitive state that prescribe the nature and timing of mitigation. A research challenge is to develop valid evaluation methodologies, metrics and measures to assess the impact of augmented cognition mitigations. Two considerations are external validity, which is the extent to which the results apply to operational contexts;more » and internal validity, which reflects the reliability of performance measures and the conclusions based on analysis of results. The scientific rigor of the research methodology employed in conducting empirical investigations largely affects the validity of the findings. External validity requirements also compel us to demonstrate operational significance of mitigations. Thus it is important to demonstrate effectiveness of mitigations under specific conditions. This chapter reviews some cognitive science and methodological considerations in designing augmented cognition research studies and associated human performance metrics and analysis methods to assess the impact of augmented cognition mitigations.« less
O'Connor, Peter; Nguyen, Jessica; Anglim, Jeromy
2017-01-01
In this study, we investigated the validity of the Trait Emotional Intelligence Questionnaire-Short Form (TEIQue-SF; Petrides, 2009) in the context of task-induced stress. We used a total sample of 225 volunteers to investigate (a) the incremental validity of the TEIQue-SF over other predictors of coping with task-induced stress, and (b) the construct validity of the TEIQue-SF by examining the mechanisms via which scores from the TEIQue-SF predict coping outcomes. Results demonstrated that the TEIQue-SF possessed incremental validity over the Big Five personality traits in the prediction of emotion-focused coping. Results also provided support for the construct validity of the TEIQue-SF by demonstrating that this measure predicted adaptive coping via emotion-focused channels. Specifically, results showed that, following a task stressor, the TEIQue-SF predicted low negative affect and high task performance via high levels of emotion-focused coping. Consistent with the purported theoretical nature of the trait emotional intelligence (EI) construct, trait EI as assessed by the TEIQue-SF primarily enhances affect and performance in stressful situations by regulating negative emotions.
Validation of the Intrinsic Spirituality Scale (ISS) with Muslims.
Hodge, David R; Zidan, Tarek; Husain, Altaf
2015-12-01
This study validates an existing spirituality measure--the intrinsic spirituality scale (ISS)--for use with Muslims in the United States. A confirmatory factor analysis was conducted with a diverse sample of self-identified Muslims (N = 281). Validity and reliability were assessed along with criterion and concurrent validity. The measurement model fit the data well, normed χ2 = 2.50, CFI = 0.99, RMSEA = 0.07, and SRMR = 0.02. All 6 items that comprise the ISS demonstrated satisfactory levels of validity (λ > .70) and reliability (R2 > .50). The Cronbach's alpha obtained with the present sample was .93. Appropriate correlations with theoretically linked constructs demonstrated criterion and concurrent validity. The results suggest the ISS is a valid measure of spirituality in clinical settings with the rapidly growing Muslim population. The ISS may, for instance, provide an efficient screening tool to identify Muslims that are particularly likely to benefit from spiritually accommodative treatments. (c) 2015 APA, all rights reserved).
Kumar, Y Kiran; Mehta, Shashi Bhushan; Ramachandra, Manjunath
2017-01-01
The purpose of this work is to provide some validation methods for evaluating the hemodynamic assessment of Cerebral Arteriovenous Malformation (CAVM). This article emphasizes the importance of validating noninvasive measurements for CAVM patients, which are designed using lumped models for complex vessel structure. The validation of the hemodynamics assessment is based on invasive clinical measurements and cross-validation techniques with the Philips proprietary validated software's Qflow and 2D Perfursion. The modeling results are validated for 30 CAVM patients for 150 vessel locations. Mean flow, diameter, and pressure were compared between modeling results and with clinical/cross validation measurements, using an independent two-tailed Student t test. Exponential regression analysis was used to assess the relationship between blood flow, vessel diameter, and pressure between them. Univariate analysis is used to assess the relationship between vessel diameter, vessel cross-sectional area, AVM volume, AVM pressure, and AVM flow results were performed with linear or exponential regression. Modeling results were compared with clinical measurements from vessel locations of cerebral regions. Also, the model is cross validated with Philips proprietary validated software's Qflow and 2D Perfursion. Our results shows that modeling results and clinical results are nearly matching with a small deviation. In this article, we have validated our modeling results with clinical measurements. The new approach for cross-validation is proposed by demonstrating the accuracy of our results with a validated product in a clinical environment.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Tyler Gray; Jeremy Diez; Jeffrey Wishart
2013-07-01
The intent of the electric Ground Support Equipment (eGSE) demonstration is to evaluate the day-to-day vehicle performance of electric baggage tractors using two advanced battery technologies to demonstrate possible replacements for the flooded lead-acid (FLA) batteries utilized throughout the industry. These advanced battery technologies have the potential to resolve barriers to the widespread adoption of eGSE deployment. Validation testing had not previously been performed within fleet operations to determine if the performance of current advanced batteries is sufficient to withstand the duty cycle of electric baggage tractors. This report summarizes the work performed and data accumulated during this demonstration inmore » an effort to validate the capabilities of advanced battery technologies. This report summarizes the work performed and data accumulated during this demonstration in an effort to validate the capabilities of advanced battery technologies. The demonstration project also grew the relationship with Southwest Airlines (SWA), our demonstration partner at Ontario International Airport (ONT), located in Ontario, California. The results of this study have encouraged a proposal for a future demonstration project with SWA.« less
Richter, Tobias; Schroeder, Sascha; Wöhrmann, Britta
2009-03-01
In social cognition, knowledge-based validation of information is usually regarded as relying on strategic and resource-demanding processes. Research on language comprehension, in contrast, suggests that validation processes are involved in the construction of a referential representation of the communicated information. This view implies that individuals can use their knowledge to validate incoming information in a routine and efficient manner. Consistent with this idea, Experiments 1 and 2 demonstrated that individuals are able to reject false assertions efficiently when they have validity-relevant beliefs. Validation processes were carried out routinely even when individuals were put under additional cognitive load during comprehension. Experiment 3 demonstrated that the rejection of false information occurs automatically and interferes with affirmative responses in a nonsemantic task (epistemic Stroop effect). Experiment 4 also revealed complementary interference effects of true information with negative responses in a nonsemantic task. These results suggest the existence of fast and efficient validation processes that protect mental representations from being contaminated by false and inaccurate information.
Formiga, Magno F; Roach, Kathryn E; Vital, Isabel; Urdaneta, Gisel; Balestrini, Kira; Calderon-Candelario, Rafael A
2018-01-01
Purpose The Test of Incremental Respiratory Endurance (TIRE) provides a comprehensive assessment of inspiratory muscle performance by measuring maximal inspiratory pressure (MIP) over time. The integration of MIP over inspiratory duration (ID) provides the sustained maximal inspiratory pressure (SMIP). Evidence on the reliability and validity of these measurements in COPD is not currently available. Therefore, we assessed the reliability, responsiveness and construct validity of the TIRE measures of inspiratory muscle performance in subjects with COPD. Patients and methods Test–retest reliability, known-groups and convergent validity assessments were implemented simultaneously in 81 male subjects with mild to very severe COPD. TIRE measures were obtained using the portable PrO2 device, following standard guidelines. Results All TIRE measures were found to be highly reliable, with SMIP demonstrating the strongest test–retest reliability with a nearly perfect intraclass correlation coefficient (ICC) of 0.99, while MIP and ID clustered closely together behind SMIP with ICC values of about 0.97. Our findings also demonstrated known-groups validity of all TIRE measures, with SMIP and ID yielding larger effect sizes when compared to MIP in distinguishing between subjects of different COPD status. Finally, our analyses confirmed convergent validity for both SMIP and ID, but not MIP. Conclusion The TIRE measures of MIP, SMIP and ID have excellent test–retest reliability and demonstrated known-groups validity in subjects with COPD. SMIP and ID also demonstrated evidence of moderate convergent validity and appear to be more stable measures in this patient population than the traditional MIP. PMID:29805255
Validity and reliability of the NAB Naming Test.
Sachs, Bonnie C; Rush, Beth K; Pedraza, Otto
2016-05-01
Confrontation naming is commonly assessed in neuropsychological practice, but few standardized measures of naming exist and those that do are susceptible to the effects of education and culture. The Neuropsychological Assessment Battery (NAB) Naming Test is a 31-item measure used to assess confrontation naming. Despite adequate psychometric information provided by the test publisher, there has been limited independent validation of the test. In this study, we investigated the convergent and discriminant validity, internal consistency, and alternate forms reliability of the NAB Naming Test in a sample of adults (Form 1: n = 247, Form 2: n = 151) clinically referred for neuropsychological evaluation. Results indicate adequate-to-good internal consistency and alternate forms reliability. We also found strong convergent validity as demonstrated by relationships with other neurocognitive measures. We found preliminary evidence that the NAB Naming Test demonstrates a more pronounced ceiling effect than other commonly used measures of naming. To our knowledge, this represents the largest published independent validation study of the NAB Naming Test in a clinical sample. Our findings suggest that the NAB Naming Test demonstrates adequate validity and reliability and merits consideration in the test arsenal of clinical neuropsychologists.
External validation of a Cox prognostic model: principles and methods
2013-01-01
Background A prognostic model should not enter clinical practice unless it has been demonstrated that it performs a useful role. External validation denotes evaluation of model performance in a sample independent of that used to develop the model. Unlike for logistic regression models, external validation of Cox models is sparsely treated in the literature. Successful validation of a model means achieving satisfactory discrimination and calibration (prediction accuracy) in the validation sample. Validating Cox models is not straightforward because event probabilities are estimated relative to an unspecified baseline function. Methods We describe statistical approaches to external validation of a published Cox model according to the level of published information, specifically (1) the prognostic index only, (2) the prognostic index together with Kaplan-Meier curves for risk groups, and (3) the first two plus the baseline survival curve (the estimated survival function at the mean prognostic index across the sample). The most challenging task, requiring level 3 information, is assessing calibration, for which we suggest a method of approximating the baseline survival function. Results We apply the methods to two comparable datasets in primary breast cancer, treating one as derivation and the other as validation sample. Results are presented for discrimination and calibration. We demonstrate plots of survival probabilities that can assist model evaluation. Conclusions Our validation methods are applicable to a wide range of prognostic studies and provide researchers with a toolkit for external validation of a published Cox model. PMID:23496923
Cooperative Collision Avoidance Technology Demonstration Data Analysis Report
NASA Technical Reports Server (NTRS)
2007-01-01
This report details the National Aeronautics and Space Administration (NASA) Access 5 Project Office Cooperative Collision Avoidance (CCA) Technology Demonstration for unmanned aircraft systems (UAS) conducted from 21 to 28 September 2005. The test platform chosen for the demonstration was the Proteus Optionally Piloted Vehicle operated by Scaled Composites, LLC, flown out of the Mojave Airport, Mojave, CA. A single intruder aircraft, a NASA Gulf stream III, was used during the demonstration to execute a series of near-collision encounter scenarios. Both aircraft were equipped with Traffic Alert and Collision Avoidance System-II (TCAS-II) and Automatic Dependent Surveillance Broadcast (ADS-B) systems. The objective of this demonstration was to collect flight data to support validation efforts for the Access 5 CCA Work Package Performance Simulation and Systems Integration Laboratory (SIL). Correlation of the flight data with results obtained from the performance simulation serves as the basis for the simulation validation. A similar effort uses the flight data to validate the SIL architecture that contains the same sensor hardware that was used during the flight demonstration.
Does virtual reality simulation have a role in training trauma and orthopaedic surgeons?
Bartlett, J D; Lawrence, J E; Stewart, M E; Nakano, N; Khanduja, V
2018-05-01
Aims The aim of this study was to assess the current evidence relating to the benefits of virtual reality (VR) simulation in orthopaedic surgical training, and to identify areas of future research. Materials and Methods A literature search using the MEDLINE, Embase, and Google Scholar databases was performed. The results' titles, abstracts, and references were examined for relevance. Results A total of 31 articles published between 2004 and 2016 and relating to the objective validity and efficacy of specific virtual reality orthopaedic surgical simulators were identified. We found 18 studies demonstrating the construct validity of 16 different orthopaedic virtual reality simulators by comparing expert and novice performance. Eight studies have demonstrated skill acquisition on a simulator by showing improvements in performance with repeated use. A further five studies have demonstrated measurable improvements in operating theatre performance following a period of virtual reality simulator training. Conclusion The demonstration of 'real-world' benefits from the use of VR simulation in knee and shoulder arthroscopy is promising. However, evidence supporting its utility in other forms of orthopaedic surgery is lacking. Further studies of validity and utility should be combined with robust analyses of the cost efficiency of validated simulators to justify the financial investment required for their use in orthopaedic training. Cite this article: Bone Joint J 2018;100-B:559-65.
Joint Test Protocol for Validation of Alternatives to Aliphatic Isocyanate Polyurethanes
NASA Technical Reports Server (NTRS)
Lewis, Pattie
2005-01-01
The primary objective of this effort is to demonstrate and validate alternatives to aliphatic isocyanate polyurethanes. Successful completion of this project will result in one or more isocyanate-free coatings qualified for use at AFSPC and NASA installations participating in this project.
The development and validation of the Physical Appearance Comparison Scale-Revised (PACS-R).
Schaefer, Lauren M; Thompson, J Kevin
2014-04-01
The Physical Appearance Comparison Scale (PACS; Thompson, Heinberg, & Tantleff, 1991) was revised to assess appearance comparisons relevant to women and men in a wide variety of contexts. The revised scale (Physical Appearance Comparison Scale-Revised, PACS-R) was administered to 1176 college females. In Study 1, exploratory factor analysis and parallel analysis using one half of the sample suggested a single factor structure for the PACS-R. Study 2 utilized the remaining half of the sample to conduct confirmatory factor analysis, item analysis, and to examine the convergent validity of the scale. These analyses resulted in an 11-item measure that demonstrated excellent internal consistency and convergent validity with measures of body satisfaction, eating pathology, sociocultural influences on appearance, and self-esteem. Regression analyses demonstrated the utility of the PACS-R in predicting body satisfaction and eating pathology. Overall, results indicate that the PACS-R is a reliable and valid tool for assessing appearance comparison tendencies in women. Copyright © 2014. Published by Elsevier Ltd.
[Nursing on the Web: the creation and validation process of a web site on coronary artery disease].
Marques, Isaac Rosa; Marin, Heimar de Fátima
2002-01-01
The World Wide Web is an important health information research source. A challenge for the Brazilian Nursing Informatics area is to use its potential to promote health education. This paper aims to present a developing and validating model used in an educational Web site, named CardioSite, which subject is Coronary Heart Disease. In its creation it was adopted a method with phases of conceptual modeling, development, implementation, and evaluation. In the evaluation phase, the validation was performed through an online informatics and health experts panel. The results demonstrated that information was reliable and valid. Considering that national official systems are not available to that approach, this model demonstrated effectiveness in assessing the quality of the Web site content.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lundstrom, Blake; Chakraborty, Sudipta; Lauss, Georg
This paper presents a concise description of state-of-the-art real-time simulation-based testing methods and demonstrates how they can be used independently and/or in combination as an integrated development and validation approach for smart grid DERs and systems. A three-part case study demonstrating the application of this integrated approach at the different stages of development and validation of a system-integrated smart photovoltaic (PV) inverter is also presented. Laboratory testing results and perspectives from two international research laboratories are included in the case study.
Martins, Cátia; Ferreira, Paulo Miguel; Carvalho, Raquel; Costa, Sandra Cristina; Farinha, Carlos; Azevedo, Luísa; Amorim, António; Oliveira, Manuela
2018-02-01
Obtaining a genetic profile from pieces of evidence collected at a crime scene is the primary objective of forensic laboratories. New procedures, methods, kits, software or equipment must be carefully evaluated and validated before its implementation. The constant development of new methodologies for DNA testing leads to a steady process of validation, which consists of demonstrating that the technology is robust, reproducible, and reliable throughout a defined range of conditions. The present work aims to internally validate two new retrotransposon-based kits (InnoQuant ® HY and InnoTyper ® 21), under the working conditions of the Laboratório de Polícia Científica da Polícia Judiciária (LPC-PJ). For the internal validation of InnoQuant ® HY and InnoTyper ® 21 sensitivity, repeatability, reproducibility, and mixture tests and a concordance study between these new kits and those currently in use at LPC-PJ (Quantifiler ® Duo and GlobalFiler™) were performed. The results obtained for sensitivity, repeatability, and reproducibility tests demonstrated that both InnoQuant ® HY and InnoTyper ® 21 are robust, reproducible, and reliable. The results of the concordance studies demonstrate that InnoQuant ® HY produced quantification results in nearly 29% more than Quantifiler ® Duo (indicating that this new kit is more effective in challenging samples), while the differences observed between InnoTyper ® 21 and GlobalFiler™ are not significant. Therefore, the utility of InnoTyper ® 21 has been proven, especially by the successful amplification of a greater number of complete genetic profiles (27 vs. 21). The results herein presented allowed the internal validation of both InnoQuant ® HY and InnoTyper ® 21, and their implementation in the LPC-PJ laboratory routine for the treatment of challenging samples. Copyright © 2017 Elsevier B.V. All rights reserved.
Guidelines To Validate Control of Cross-Contamination during Washing of Fresh-Cut Leafy Vegetables.
Gombas, D; Luo, Y; Brennan, J; Shergill, G; Petran, R; Walsh, R; Hau, H; Khurana, K; Zomorodi, B; Rosen, J; Varley, R; Deng, K
2017-02-01
The U.S. Food and Drug Administration requires food processors to implement and validate processes that will result in significantly minimizing or preventing the occurrence of hazards that are reasonably foreseeable in food production. During production of fresh-cut leafy vegetables, microbial contamination that may be present on the product can spread throughout the production batch when the product is washed, thus increasing the risk of illnesses. The use of antimicrobials in the wash water is a critical step in preventing such water-mediated cross-contamination; however, many factors can affect antimicrobial efficacy in the production of fresh-cut leafy vegetables, and the procedures for validating this key preventive control have not been articulated. Producers may consider three options for validating antimicrobial washing as a preventive control for cross-contamination. Option 1 involves the use of a surrogate for the microbial hazard and the demonstration that cross-contamination is prevented by the antimicrobial wash. Option 2 involves the use of antimicrobial sensors and the demonstration that a critical antimicrobial level is maintained during worst-case operating conditions. Option 3 validates the placement of the sensors in the processing equipment with the demonstration that a critical antimicrobial level is maintained at all locations, regardless of operating conditions. These validation options developed for fresh-cut leafy vegetables may serve as examples for validating processes that prevent cross-contamination during washing of other fresh produce commodities.
Validity of Computer Adaptive Tests of Daily Routines for Youth with Spinal Cord Injury
Haley, Stephen M.
2013-01-01
Objective: To evaluate the accuracy of computer adaptive tests (CATs) of daily routines for child- and parent-reported outcomes following pediatric spinal cord injury (SCI) and to evaluate the validity of the scales. Methods: One hundred ninety-six daily routine items were administered to 381 youths and 322 parents. Pearson correlations, intraclass correlation coefficients (ICC), and 95% confidence intervals (CI) were calculated to evaluate the accuracy of simulated 5-item, 10-item, and 15-item CATs against the full-item banks and to evaluate concurrent validity. Independent samples t tests and analysis of variance were used to evaluate the ability of the daily routine scales to discriminate between children with tetraplegia and paraplegia and among 5 motor groups. Results: ICC and 95% CI demonstrated that simulated 5-, 10-, and 15-item CATs accurately represented the full-item banks for both child- and parent-report scales. The daily routine scales demonstrated discriminative validity, except between 2 motor groups of children with paraplegia. Concurrent validity of the daily routine scales was demonstrated through significant relationships with the FIM scores. Conclusion: Child- and parent-reported outcomes of daily routines can be obtained using CATs with the same relative precision of a full-item bank. Five-item, 10-item, and 15-item CATs have discriminative and concurrent validity. PMID:23671380
NASA Technical Reports Server (NTRS)
Price J. M.; Ortega, R.
1998-01-01
Probabilistic method is not a universally accepted approach for the design and analysis of aerospace structures. The validity of this approach must be demonstrated to encourage its acceptance as it viable design and analysis tool to estimate structural reliability. The objective of this Study is to develop a well characterized finite population of similar aerospace structures that can be used to (1) validate probabilistic codes, (2) demonstrate the basic principles behind probabilistic methods, (3) formulate general guidelines for characterization of material drivers (such as elastic modulus) when limited data is available, and (4) investigate how the drivers affect the results of sensitivity analysis at the component/failure mode level.
The Predictive Validity of the Metropolitan Readiness Tests, 1976 Edition.
ERIC Educational Resources Information Center
Nagle, Richard J.
1979-01-01
A sample of 176 first-grade children was tested on the Metropolitan Readiness Tests, 1976 Edition (MRT), during the initial month of school and was retested eight months later on the Stanford Achievement Test. Results demonstrated substantial validity of the MRT for predicting first-grade achievement. (Author/CTM)
Oren, Carmel; Kennet-Cohen, Tamar; Turvall, Elliot; Allalouf, Avi
2014-01-01
The Psychometric Entrance Test (PET), used for admission to higher education in Israel together with the Matriculation (Bagrut), had in the past one general (total) score in which the weights for its domains: Verbal, Quantitative and English, were 2:2:1, respectively. In 2011, two additional total scores were introduced, with different weights for the Verbal and the Quantitative domains. This study compares the predictive validity of the three general scores of PET, and demonstrates validity in terms of utility. 100,863 freshmen students of all Israeli universities over the classes of 2005-2009. Regression weights and correlations of the predictors with FYGPA were computed. Simulations based on these results supplied the utility estimates. On average, PET is slightly more predictive than the Bagrut; using them both yields a better tool than either of them alone. Assigning differential weights to the components in the respective schools further improves the validity. The introduction of the new general scores of PET is validated by gathering and analyzing evidence based on relations of test scores to other variables. The utility of using the test can be demonstrated in ways different from correlations.
The Spatial Power Motivation Scale: a semi-implicit measure of situational power motivation.
Schoel, Christiane; Zimmer, Katharina; Stahlberg, Dagmar
2015-01-01
We introduce a new nonverbal and unobtrusive measure to assess power motive activation, the Spatial Power Motivation Scale (SPMS). The unique features of this instrument are that it is (a) very simple and economical, (b) reliable and valid, and (c) sensitive to situational changes. Study 1 demonstrates the instrument's convergent and discriminant validity with explicit measures. Study 2 demonstrates the instrument's responsiveness to situational power motive salience: anticipating and winning competition versus losing competition and watching television. Studies 3 and 4 demonstrate that thoughts of competition result in higher power motivation specifically for individuals with a high dispositional power motive.
Kim, Eun-Mi; Kim, Sun-Aee; Lee, Ju-Ry; Burlison, Jonathan D; Oh, Eui Geum
2018-02-13
"Second victims" are defined as healthcare professionals whose wellness is influenced by adverse clinical events. The Second Victim Experience and Support Tool (SVEST) was used to measure the second-victim experience and quality of support resources. Although the reliability and validity of the original SVEST have been validated, those for the Korean tool have not been validated. The aim of the study was to evaluate the psychometric properties of the Korean version of the SVEST. The study included 305 clinical nurses as participants. The SVEST was translated into Korean via back translation. Content validity was assessed by seven experts, and test-retest reliability was evaluated by 30 clinicians. Internal consistency and construct validity were assessed via confirmatory factor analysis. The analyses were performed using SPSS 23.0 and STATA 13.0 software. The content validity index value demonstrated validity; item- and scale-level content validity index values were both 0.95. Test-retest reliability and internal consistency reliability were satisfactory: the intraclass consistent coefficient was 0.71, and Cronbach α values ranged from 0.59 to 0.87. The CFA showed a significantly good fit for an eight-factor structure (χ = 578.21, df = 303, comparative fit index = 0.92, Tucker-Lewis index = 0.90, root mean square error of approximation = 0.05). The K-SVEST demonstrated good psychometric properties and adequate validity and reliability. The results showed that the Korean version of SVEST demonstrated the extent of second victimhood and support resources in Korean healthcare workers and could aid in the development of support programs and evaluation of their effectiveness.
Validity and Reliability of the 8-Item Work Limitations Questionnaire.
Walker, Timothy J; Tullar, Jessica M; Diamond, Pamela M; Kohl, Harold W; Amick, Benjamin C
2017-12-01
Purpose To evaluate factorial validity, scale reliability, test-retest reliability, convergent validity, and discriminant validity of the 8-item Work Limitations Questionnaire (WLQ) among employees from a public university system. Methods A secondary analysis using de-identified data from employees who completed an annual Health Assessment between the years 2009-2015 tested research aims. Confirmatory factor analysis (CFA) (n = 10,165) tested the latent structure of the 8-item WLQ. Scale reliability was determined using a CFA-based approach while test-retest reliability was determined using the intraclass correlation coefficient. Convergent/discriminant validity was tested by evaluating relations between the 8-item WLQ with health/performance variables for convergent validity (health-related work performance, number of chronic conditions, and general health) and demographic variables for discriminant validity (gender and institution type). Results A 1-factor model with three correlated residuals demonstrated excellent model fit (CFI = 0.99, TLI = 0.99, RMSEA = 0.03, and SRMR = 0.01). The scale reliability was acceptable (0.69, 95% CI 0.68-0.70) and the test-retest reliability was very good (ICC = 0.78). Low-to-moderate associations were observed between the 8-item WLQ and the health/performance variables while weak associations were observed between the demographic variables. Conclusions The 8-item WLQ demonstrated sufficient reliability and validity among employees from a public university system. Results suggest the 8-item WLQ is a usable alternative for studies when the more comprehensive 25-item WLQ is not available.
Validation of Solar Sail Simulations for the NASA Solar Sail Demonstration Project
NASA Technical Reports Server (NTRS)
Braafladt, Alexander C.; Artusio-Glimpse, Alexandra B.; Heaton, Andrew F.
2014-01-01
NASA's Solar Sail Demonstration project partner L'Garde is currently assembling a flight-like sail assembly for a series of ground demonstration tests beginning in 2015. For future missions of this sail that might validate solar sail technology, it is necessary to have an accurate sail thrust model. One of the primary requirements of a proposed potential technology validation mission will be to demonstrate solar sail thrust over a set time period, which for this project is nominally 30 days. This requirement would be met by comparing a L'Garde-developed trajectory simulation to the as-flown trajectory. The current sail simulation baseline for L'Garde is a Systems Tool Kit (STK) plug-in that includes a custom-designed model of the L'Garde sail. The STK simulation has been verified for a flat plate model by comparing it to the NASA-developed Solar Sail Spaceflight Simulation Software (S5). S5 matched STK with a high degree of accuracy and the results of the validation indicate that the L'Garde STK model is accurate enough to meet the potential future mission requirements. Additionally, since the L'Garde sail deviates considerably from a flat plate, a force model for a non-flat sail provided by L'Garde sail was also tested and compared to a flat plate model in S5. This result will be used in the future as a basis of comparison to the non-flat sail model being developed for STK.
Factor complexity of crash occurrence: An empirical demonstration using boosted regression trees.
Chung, Yi-Shih
2013-12-01
Factor complexity is a characteristic of traffic crashes. This paper proposes a novel method, namely boosted regression trees (BRT), to investigate the complex and nonlinear relationships in high-variance traffic crash data. The Taiwanese 2004-2005 single-vehicle motorcycle crash data are used to demonstrate the utility of BRT. Traditional logistic regression and classification and regression tree (CART) models are also used to compare their estimation results and external validities. Both the in-sample cross-validation and out-of-sample validation results show that an increase in tree complexity provides improved, although declining, classification performance, indicating a limited factor complexity of single-vehicle motorcycle crashes. The effects of crucial variables including geographical, time, and sociodemographic factors explain some fatal crashes. Relatively unique fatal crashes are better approximated by interactive terms, especially combinations of behavioral factors. BRT models generally provide improved transferability than conventional logistic regression and CART models. This study also discusses the implications of the results for devising safety policies. Copyright © 2012 Elsevier Ltd. All rights reserved.
Talip, Whadi-ah; Steyn, Nelia P; Visser, Marianne; Charlton, Karen E; Temple, Norman
2003-09-01
We wanted to develop and validate a test that assesses the knowledge and practices of health professionals (HPs) with regard to the role of nutrition, physical activity, and smoking cessation (lifestyle modification) in chronic diseases of lifestyle. A descriptive cross-sectional validation study was carried out. The validation design consisted of two phases, namely 1) test planning and development and 2) test evaluation. The study sample consisted of five groups of HPs: dietitians, dietetic interns, general practitioners, medical students, and nurses. The overall response rate was 58%, resulting in a sample size of 186 participants. A test was designed to evaluate the knowledge and practices of HPs. The test was first evaluated by an expert group to ensure content, construct, and face validity. Thereafter, the questionnaire was tested on five groups of HPs to test for criterion validity. Internal consistency was evaluated by Cronbach's alpha. An expert panel ensured content, construct, and face validity of the test. Groups with the most training and exposure to nutrition (dietitians and dietetic interns) had the highest group mean score, ranging from 61% to 88%, whereas those with limited nutrition training (general practitioners, medical students, and nurses) had significantly lower scores, ranging from 26% to 80%. This result demonstrated criterion validity. Internal consistency of the overall test demonstrated a Cronbach's alpha of 0.99. Most HPs identified the mass media as their main source of information on lifestyle modification. These HPs also identified lack of time, lack of patient compliance, and lack of knowledge as barriers that prevent them from providing counseling on lifestyle modification. The results of this study showed that this test instrument identifies groups of health professionals with adequate training (knowledge) in lifestyle modification and those who require further training (knowledge).
Assessment of MARMOT Grain Growth Model
DOE Office of Scientific and Technical Information (OSTI.GOV)
Fromm, B.; Zhang, Y.; Schwen, D.
2015-12-01
This report assesses the MARMOT grain growth model by comparing modeling predictions with experimental results from thermal annealing. The purpose here is threefold: (1) to demonstrate the validation approach of using thermal annealing experiments with non-destructive characterization, (2) to test the reconstruction capability and computation efficiency in MOOSE, and (3) to validate the grain growth model and the associated parameters that are implemented in MARMOT for UO 2. To assure a rigorous comparison, the 2D and 3D initial experimental microstructures of UO 2 samples were characterized using non-destructive Synchrotron x-ray. The same samples were then annealed at 2273K for grainmore » growth, and their initial microstructures were used as initial conditions for simulated annealing at the same temperature using MARMOT. After annealing, the final experimental microstructures were characterized again to compare with the results from simulations. So far, comparison between modeling and experiments has been done for 2D microstructures, and 3D comparison is underway. The preliminary results demonstrated the usefulness of the non-destructive characterization method for MARMOT grain growth model validation. A detailed analysis of the 3D microstructures is in progress to fully validate the current model in MARMOT.« less
Generalizing disease management program results: how to get from here to there.
Linden, Ariel; Adams, John L; Roberts, Nancy
2004-07-01
For a disease management (DM) program, the ability to generalize results from the intervention group to the population, to other populations, or to other diseases is as important as demonstrating internal validity. This article provides an overview of the threats to external validity of DM programs, and offers methods to improve the capability for generalizing results obtained through the program. The external validity of DM programs must be evaluated even before program selection and implementation are begun with a prospective new client. Any fundamental differences in characteristics between individuals in an established DM program and in a new population/environment may limit the ability to generalize.
Renshaw, Tyler L; Long, Anna C J; Cook, Clayton R
2015-06-01
This study reports on the initial development and validation of the Teacher Subjective Wellbeing Questionnaire (TSWQ) with 2 samples of educators-a general sample of 185 elementary and middle school teachers, and a target sample of 21 elementary school teachers experiencing classroom management challenges. The TSWQ is an 8-item self-report instrument for assessing teachers' subjective wellbeing, which is operationalized via subscales measuring school connectedness and teaching efficacy. The conceptualization and development processes underlying the TSWQ are described, and results from a series of preliminary psychometric and exploratory analyses are reported to establish initial construct validity. Findings indicated that the TSWQ was characterized by 2 conceptually sound latent factors, that both subscales and the composite scale demonstrated strong internal consistency, and that all scales demonstrated convergent validity with self-reported school supports and divergent validity with self-reported stress and emotional burnout. Furthermore, results indicated that TSWQ scores did not differ according to teachers' school level (i.e., elementary vs. middle), but that they did differ according to unique school environment (e.g., 1 middle school vs. another middle school) and teacher stressors (i.e., general teachers vs. teachers experiencing classroom management challenges). Results also indicated that, for teachers experiencing classroom challenges, the TSWQ had strong short-term predictive validity for psychological distress, accounting for approximately half of the variance in teacher stress and emotional burnout. Implications for theory, research, and the practice of school psychology are discussed. (c) 2015 APA, all rights reserved).
Construct Validity and Reliability of College Students' Responses to the Reasons for Smoking Scale
ERIC Educational Resources Information Center
Fiala, Kelly Ann; D'Abundo, Michelle Lee; Marinaro, Laura Marie
2010-01-01
When utilizing self-assessments to determine motives for health behaviors, it is essential that the resulting data demonstrate sound psychometric properties. The purpose of this research was to assess the reliability and construct validity of college students' responses to the Reasons for Smoking Scale (RFS). Confirmatory factor analyses and…
DEMONSTRATION OF RADON RESISTANT CONSTRUCTION TECHNIQUES - PHASE II. FINAL REPORT
The report gives results of a demonstration of radon resistant construction techniques. Sub-slab mitigation systems were installed (in accordance with draft standards) in 15 new Florida houses in 1992, and these houses have undergone extensive testing to validate techniques used ...
NASA Technical Reports Server (NTRS)
Williams, Daniel M.
2006-01-01
Described is the research process that NASA researchers used to validate the Small Aircraft Transportation System (SATS) Higher Volume Operations (HVO) concept. The four phase building-block validation and verification process included multiple elements ranging from formal analysis of HVO procedures to flight test, to full-system architecture prototype that was successfully shown to the public at the June 2005 SATS Technical Demonstration in Danville, VA. Presented are significant results of each of the four research phases that extend early results presented at ICAS 2004. HVO study results have been incorporated into the development of the Next Generation Air Transportation System (NGATS) vision and offer a validated concept to provide a significant portion of the 3X capacity improvement sought after in the United States National Airspace System (NAS).
Quantitative bioanalysis of strontium in human serum by inductively coupled plasma-mass spectrometry
Somarouthu, Srikanth; Ohh, Jayoung; Shaked, Jonathan; Cunico, Robert L; Yakatan, Gerald; Corritori, Suzana; Tami, Joe; Foehr, Erik D
2015-01-01
Aim: A bioanalytical method using inductively-coupled plasma-mass spectrometry to measure endogenous levels of strontium in human serum was developed and validated. Results & methodology: This article details the experimental procedures used for the method development and validation thus demonstrating the application of the inductively-coupled plasma-mass spectrometry method for quantification of strontium in human serum samples. The assay was validated for specificity, linearity, accuracy, precision, recovery and stability. Significant endogenous levels of strontium are present in human serum samples ranging from 19 to 96 ng/ml with a mean of 34.6 ± 15.2 ng/ml (SD). Discussion & conclusion: Calibration procedures and sample pretreatment were simplified for high throughput analysis. The validation demonstrates that the method was sensitive, selective for quantification of strontium (88Sr) and is suitable for routine clinical testing of strontium in human serum samples. PMID:28031925
Validation of the Short Form of the Academic Procrastination Scale.
Yockey, Ronald D
2016-02-01
The factor structure, internal consistency reliability, and convergent validity of the five-item Academic Procrastination Scale-Short Form was investigated on an ethnically diverse sample of college students. The results provided support for the Academic Procrastination Scale-Short Form as a unidimensional measure of academic procrastination, which possessed good internal consistency reliability in this sample of 282 students. The scale also demonstrated good convergent validity, with moderate to large correlations with both the Procrastination Assessment Scale-Students and the Tuckman Procrastination Scale. Implications of the results are discussed and recommendations for future work provided.
High dynamic GPS receiver validation demonstration
NASA Technical Reports Server (NTRS)
Hurd, W. J.; Statman, J. I.; Vilnrotter, V. A.
1985-01-01
The Validation Demonstration establishes that the high dynamic Global Positioning System (GPS) receiver concept developed at JPL meets the dynamic tracking requirements for range instrumentation of missiles and drones. It was demonstrated that the receiver can track the pseudorange and pseudorange rate of vehicles with acceleration in excess of 100 g and jerk in excess of 100 g/s, dynamics ten times more severe than specified for conventional High Dynamic GPS receivers. These results and analytic extensions to a complete system configuration establish that all range instrumentation requirements can be met. The receiver can be implemented in the 100 cu in volume required by all missiles and drones, and is ideally suited for transdigitizer or translator applications.
Müller-Engelmann, Meike; Schnyder, Ulrich; Dittmann, Clara; Priebe, Kathlen; Bohus, Martin; Thome, Janine; Fydrich, Thomas; Pfaltz, Monique C; Steil, Regina
2018-05-01
The Clinician-Administered PTSD Scale (CAPS) is a widely used diagnostic interview for posttraumatic stress disorder (PTSD). Following fundamental modifications in the Diagnostic and Statistical Manual of Mental Disorders, Fifth Edition ( DSM-5), the CAPS had to be revised. This study examined the psychometric properties (internal consistency, interrater reliability, convergent and discriminant validity, and structural validity) of the German version of the CAPS-5 in a trauma-exposed sample ( n = 223 with PTSD; n =51 without PTSD). The results demonstrated high internal consistency (αs = .65-.93) and high interrater reliability (ICCs = .81-.89). With regard to convergent and discriminant validity, we found high correlations between the CAPS severity score and both the Posttraumatic Diagnostic Scale sum score ( r = .87) and the Beck Depression Inventory total score ( r = .72). Regarding the underlying factor structure, the hybrid model demonstrated the best fit, followed by the anhedonia model. However, we encountered some nonpositive estimates for the correlations of the latent variables (factors) for both models. The model with the best fit without methodological problems was the externalizing behaviors model, but the results also supported the DSM-5 model. Overall, the results demonstrate that the German version of the CAPS-5 is a psychometrically sound measure.
Validation of the Spanish Addiction Severity Index Multimedia Version (S-ASI-MV).
Butler, Stephen F; Redondo, José Pedro; Fernandez, Kathrine C; Villapiano, Albert
2009-01-01
This study aimed to develop and test the reliability and validity of a Spanish adaptation of the ASI-MV, a computer administered version of the Addiction Severity Index, called the S-ASI-MV. Participants were 185 native Spanish-speaking adult clients from substance abuse treatment facilities serving Spanish-speaking clients in Florida, New Mexico, California, and Puerto Rico. Participants were administered the S-ASI-MV as well as Spanish versions of the general health subscale of the SF-36, the work and family unit subscales of the Social Adjustment Scale Self-Report, the Michigan Alcohol Screening Test, the alcohol and drug subscales of the Personality Assessment Inventory, and the Hopkins Symptom Checklist-90. Three-to-five-day test-retest reliability was examined along with criterion validity, convergent/discriminant validity, and factorial validity. Measurement invariance between the English and Spanish versions of the ASI-MV was also examined. The S-ASI-MV demonstrated good test-retest reliability (ICCs for composite scores between .59 and .93), criterion validity (rs for composite scores between .66 and .87), and convergent/discriminant validity. Factorial validity and measurement invariance were demonstrated. These results compared favorably with those reported for the original interviewer version of the ASI and the English version of the ASI-MV.
Lau, Nathan; Jamieson, Greg A; Skraaning, Gyrd
2016-03-01
The Process Overview Measure is a query-based measure developed to assess operator situation awareness (SA) from monitoring process plants. A companion paper describes how the measure has been developed according to process plant properties and operator cognitive work. The Process Overview Measure demonstrated practicality, sensitivity, validity and reliability in two full-scope simulator experiments investigating dramatically different operational concepts. Practicality was assessed based on qualitative feedback of participants and researchers. The Process Overview Measure demonstrated sensitivity and validity by revealing significant effects of experimental manipulations that corroborated with other empirical results. The measure also demonstrated adequate inter-rater reliability and practicality for measuring SA in full-scope simulator settings based on data collected on process experts. Thus, full-scope simulator studies can employ the Process Overview Measure to reveal the impact of new control room technology and operational concepts on monitoring process plants. Practitioner Summary: The Process Overview Measure is a query-based measure that demonstrated practicality, sensitivity, validity and reliability for assessing operator situation awareness (SA) from monitoring process plants in representative settings.
ERIC Educational Resources Information Center
Godfrey, Kelly E.; Jagesic, Sanja
2016-01-01
The College-Level Examination Program® (CLEP®) is a computer-based prior-learning assessment that allows examinees the opportunity to demonstrate mastery of knowledge and skills necessary to earn postsecondary course credit in higher education. Currently, there are 33 exams in five subject areas: composition and literature, world languages,…
Preliminary Validity of the Eyberg Child Behavior Inventory With Filipino Immigrant Parents
Coffey, Dean M.; Javier, Joyce R.; Schrager, Sheree M.
2016-01-01
Filipinos are an understudied minority affected by significant behavioral health disparities. We evaluate evidence for the reliability, construct validity, and convergent validity of the Eyberg Child Behavior Inventory (ECBI) in 6- to 12- year old Filipino children (N = 23). ECBI scores demonstrated high internal consistency, supporting a single-factor model (pre-intervention α =.91; post-intervention α =.95). Results document convergent validity with the Child Behavior Checklist Externalizing scale at pretest (r = .54, p < .01) and posttest (r = .71, p < .001). We conclude that the ECBI is a promising tool to measure behavior problems in Filipino children. PMID:27087739
Preliminary Validity of the Eyberg Child Behavior Inventory With Filipino Immigrant Parents.
Coffey, Dean M; Javier, Joyce R; Schrager, Sheree M
Filipinos are an understudied minority affected by significant behavioral health disparities. We evaluate evidence for the reliability, construct validity, and convergent validity of the Eyberg Child Behavior Inventory (ECBI) in 6- to 12- year old Filipino children ( N = 23). ECBI scores demonstrated high internal consistency, supporting a single-factor model (pre-intervention α =.91; post-intervention α =.95). Results document convergent validity with the Child Behavior Checklist Externalizing scale at pretest ( r = .54, p < .01) and posttest ( r = .71, p < .001). We conclude that the ECBI is a promising tool to measure behavior problems in Filipino children.
NASA Technical Reports Server (NTRS)
Chen, R. T. N.; Daughaday, H.; Andrisani, D., II; Till, R. D.; Weingarten, N. C.
1975-01-01
The results of a feasibility study and preliminary design for active control research and validation using the Total In-Flight Simulator (TIFS) aircraft are documented. Active control functions which can be demonstrated on the TIFS aircraft and the cost of preparing, equipping, and operating the TIFS aircraft for active control technology development are determined. It is shown that the TIFS aircraft is as a suitable test bed for inflight research and validation of many ACT concepts.
NDARC - NASA Design and Analysis of Rotorcraft Validation and Demonstration
NASA Technical Reports Server (NTRS)
Johnson, Wayne
2010-01-01
Validation and demonstration results from the development of the conceptual design tool NDARC (NASA Design and Analysis of Rotorcraft) are presented. The principal tasks of NDARC are to design a rotorcraft to satisfy specified design conditions and missions, and then analyze the performance of the aircraft for a set of off-design missions and point operating conditions. The aircraft chosen as NDARC development test cases are the UH-60A single main-rotor and tail-rotor helicopter, the CH-47D tandem helicopter, the XH-59A coaxial lift-offset helicopter, and the XV-15 tiltrotor. These aircraft were selected because flight performance data, a weight statement, detailed geometry information, and a correlated comprehensive analysis model are available for each. Validation consists of developing the NDARC models for these aircraft by using geometry and weight information, airframe wind tunnel test data, engine decks, rotor performance tests, and comprehensive analysis results; and then comparing the NDARC results for aircraft and component performance with flight test data. Based on the calibrated models, the capability of the code to size rotorcraft is explored.
Xu, Yuanxin; Theobald, Valerie; Sung, Crystal; DePalma, Kathleen; Atwater, Laura; Seiger, Keirsten; Perricone, Michael A; Richards, Susan M
2008-01-01
Background HLA-A2 tetramer flow cytometry, IFNγ real time RT-PCR and IFNγ ELISPOT assays are commonly used as surrogate immunological endpoints for cancer immunotherapy. While these are often used as research assays to assess patient's immunologic response, assay validation is necessary to ensure reliable and reproducible results and enable more accurate data interpretation. Here we describe a rigorous validation approach for each of these assays prior to their use for clinical sample analysis. Methods Standard operating procedures for each assay were established. HLA-A2 (A*0201) tetramer assay specific for gp100209(210M) and MART-126–35(27L), IFNγ real time RT-PCR and ELISPOT methods were validated using tumor infiltrating lymphocyte cell lines (TIL) isolated from HLA-A2 melanoma patients. TIL cells, specific for gp100 (TIL 1520) or MART-1 (TIL 1143 and TIL1235), were used alone or spiked into cryopreserved HLA-A2 PBMC from healthy subjects. TIL/PBMC were stimulated with peptides (gp100209, gp100pool, MART-127–35, or influenza-M1 and negative control peptide HIV) to further assess assay performance characteristics for real time RT-PCR and ELISPOT methods. Validation parameters included specificity, accuracy, precision, linearity of dilution, limit of detection (LOD) and limit of quantification (LOQ). In addition, distribution was established in normal HLA-A2 PBMC samples. Reference ranges for assay controls were established. Results The validation process demonstrated that the HLA-A2 tetramer, IFNγ real time RT-PCR, and IFNγ ELISPOT were highly specific for each antigen, with minimal cross-reactivity between gp100 and MelanA/MART-1. The assays were sensitive; detection could be achieved at as few as 1/4545–1/6667 cells by tetramer analysis, 1/50,000 cells by real time RT-PCR, and 1/10,000–1/20,000 by ELISPOT. The assays met criteria for precision with %CV < 20% (except ELISPOT using high PBMC numbers with %CV < 25%) although flow cytometric assays and cell based functional assays are known to have high assay variability. Most importantly, assays were demonstrated to be effective for their intended use. A positive IFNγ response (by RT-PCR and ELISPOT) to gp100 was demonstrated in PBMC from 3 melanoma patients. Another patient showed a positive MART-1 response measured by all 3 validated methods. Conclusion Our results demonstrated the tetramer flow cytometry assay, IFNγ real-time RT-PCR, and INFγ ELISPOT met validation criteria. Validation approaches provide a guide for others in the field to validate these and other similar assays for assessment of patient T cell response. These methods can be applied not only to cancer vaccines but to other therapeutic proteins as part of immunogenicity and safety analyses. PMID:18945350
MacEwan, Matthew J; Dudek, Nancy L; Wood, Timothy J; Gofton, Wade T
2016-01-01
CONSTRUCT: The Ottawa Surgical Competency Operating Room Evaluation (O-SCORE) is a 9-item surgical evaluation tool designed to assess technical competence in surgical trainees using behavioral anchors. The initial development of the O-SCORE produced evidence for valid results. Further work is required to determine if the use of a single surgeon or an unblinded rater introduces bias. In addition, the relationship of the O-SCORE to other currently used technical assessment tools should be explored to provide validity evidence related to the relationship to other measures. We have designed this project to provide continued validity evidence for the O-SCORE related to these two issues. Nineteen residents and 2 staff Orthopedic Surgeons from the University of Ottawa volunteered to participate in a 2-part OSCE style station. Participants completed a written questionnaire followed by a videotaped 10-minute simulated open reduction and internal fixation of a midshaft radius fracture. Videos were rated individually by 2 blinded staff orthopedic surgeons using an Objective Structured Assessment of Technical Skills (OSATS) global rating scale, an OSATS checklist, and the O-SCORE in random order. O-SCORE results appeared sensitive to surgical training level even when raters were blinded. In addition, strong agreement between two independent observers using the O-SCORE suggests that the measure captures a performance easily recognized by surgical observers. Ratings on the O-SCORE also were strongly associated with global ratings on the currently most validated technical evaluation tool (OSATS). Collectively, these results suggest that the O-SCORE generates accurate, reproducible, and meaningful results when used in a randomized and blinded fashion, providing continued validity evidence for using this tool to evaluate surgical trainee competence. The O-SCORE was able to differentiate surgical trainee level using blinded raters providing further evidence of validity for the O-SCORE. There was strong agreement between two independent observers using the O-SCORE. Ratings on the O-SCORE also demonstrated equivalence to scores on the most validated technical evaluation tool (OSATS). These results suggest that the O-SCORE demonstrates accurate and reproducible results when used in a randomized and blinded fashion providing continued validity evidence for this tool in the evaluation of surgical competence in the trainees.
A Design to Improve Internal Validity of Assessments of Teaching Demonstrations
ERIC Educational Resources Information Center
Bartsch, Robert A.; Engelhardt Bittner, Wendy M.; Moreno, Jesse E., Jr.
2008-01-01
Internal validity is important in assessing teaching demonstrations both for one's knowledge and for quality assessment demanded by outside sources. We describe a method to improve the internal validity of assessments of teaching demonstrations: a 1-group pretest-posttest design with alternative forms. This design is often more practical and…
López, Diego M; Blobel, Bernd; Gonzalez, Carolina
2010-01-01
Requirement analysis, design, implementation, evaluation, use, and maintenance of semantically interoperable Health Information Systems (HIS) have to be based on eHealth standards. HIS-DF is a comprehensive approach for HIS architectural development based on standard information models and vocabulary. The empirical validity of HIS-DF has not been demonstrated so far. Through an empirical experiment, the paper demonstrates that using HIS-DF and HL7 information models, semantic quality of HIS architecture can be improved, compared to architectures developed using traditional RUP process. Semantic quality of the architecture has been measured in terms of model's completeness and validity metrics. The experimental results demonstrated an increased completeness of 14.38% and an increased validity of 16.63% when using the HIS-DF and HL7 information models in a sample HIS development project. Quality assurance of the system architecture in earlier stages of HIS development presumes an increased quality of final HIS systems, which supposes an indirect impact on patient care.
Nursing students' confidence in medication calculations predicts math exam performance.
Andrew, Sharon; Salamonson, Yenna; Halcomb, Elizabeth J
2009-02-01
The aim of this study was to examine the psychometric properties, including predictive validity, of the newly-developed nursing self-efficacy for mathematics (NSE-Math). The NSE-Math is a 12 item scale that comprises items related to mathematic and arithmetic concepts underpinning medication calculations. The NSE-Math instrument was administered to second year Bachelor of Nursing students enrolled in a nursing practice subject. Students' academic results for a compulsory medication calculation examination for this subject were collected. One-hundred and twelve students (73%) completed both the NSE-Math instrument and the drug calculation assessment task. The NSE-Math demonstrated two factors 'Confidence in application of mathematic concepts to nursing practice' and 'Confidence in arithmetic concepts' with 63.5% of variance explained. Cronbach alpha for the scale was 0.90. The NSE-Math demonstrated predictive validity with the medication calculation examination results (p=0.009). Psychometric testing suggests the NSE-Math is a valid measure of mathematics self-efficacy of second year nursing students.
Selecting the "Best" Factor Structure and Moving Measurement Validation Forward: An Illustration.
Schmitt, Thomas A; Sass, Daniel A; Chappelle, Wayne; Thompson, William
2018-04-09
Despite the broad literature base on factor analysis best practices, research seeking to evaluate a measure's psychometric properties frequently fails to consider or follow these recommendations. This leads to incorrect factor structures, numerous and often overly complex competing factor models and, perhaps most harmful, biased model results. Our goal is to demonstrate a practical and actionable process for factor analysis through (a) an overview of six statistical and psychometric issues and approaches to be aware of, investigate, and report when engaging in factor structure validation, along with a flowchart for recommended procedures to understand latent factor structures; (b) demonstrating these issues to provide a summary of the updated Posttraumatic Stress Disorder Checklist (PCL-5) factor models and a rationale for validation; and (c) conducting a comprehensive statistical and psychometric validation of the PCL-5 factor structure to demonstrate all the issues we described earlier. Considering previous research, the PCL-5 was evaluated using a sample of 1,403 U.S. Air Force remotely piloted aircraft operators with high levels of battlefield exposure. Previously proposed PCL-5 factor structures were not supported by the data, but instead a bifactor model is arguably more statistically appropriate.
Experimental investigation of an RNA sequence space
NASA Technical Reports Server (NTRS)
Lee, Youn-Hyung; Dsouza, Lisa; Fox, George E.
1993-01-01
Modern rRNAs are the historic consequence of an ongoing evolutionary exploration of a sequence space. These extant sequences belong to a special subset of the sequence space that is comprised only of those primary sequences that can validly perform the biological function(s) required of the particular RNA. If it were possible to readily identify all such valid sequences, stochastic predictions could be made about the relative likelihood of various evolutionary pathways available to an RNA. Herein an experimental system which can assess whether a particular sequence is likely to have validity as a eubacterial 5S rRNA is described. A total of ten naturally occurring, and hence known to be valid, sequences and two point mutants of unknown validity were used to test the usefulness of the approach. Nine of the ten valid sequences tested positive whereas both mutants tested as clearly defective. The tenth valid sequence gave results that would be interpreted as reflecting a borderline status were the answer not known. These results demonstrate that it is possible to experimentally determine which sequences in local regions of the sequence space are potentially valid 5S rRNAs.
Tsuji, Naoko; Kakee, Naoko; Ishida, Yasushi; Asami, Keiko; Tabuchi, Ken; Nakadate, Hisaya; Iwai, Tsuyako; Maeda, Miho; Okamura, Jun; Kazama, Takuro; Terao, Yoko; Ohyama, Wataru; Yuza, Yuki; Kaneko, Takashi; Manabe, Atsushi; Kobayashi, Kyoko; Kamibeppu, Kiyoko; Matsushima, Eisuke
2011-04-10
The PedsQL 3.0 Cancer Module is a widely used instrument to measure pediatric cancer specific health-related quality of life (HRQOL) for children aged 2 to 18 years. We developed the Japanese version of the PedsQL Cancer Module and investigated its reliability and validity among Japanese children and their parents. Participants were 212 children with cancer and 253 of their parents. Reliability was determined by internal consistency using Cronbach's coefficient alpha and test-retest reliability using intra-class correlation coefficient (ICC). Validity was assessed through factor validity, convergent and discriminant validity, concurrent validity, and clinical validity. Factor validity was examined by exploratory factor analysis. Convergent and discriminant validity were examined by multitrait scaling analysis. Concurrent validity was assessed using Spearman's correlation coefficients between the Cancer Module and Generic Core Scales, and the comparison of the scores of child self-reports with those of other self-rating depression scales for children. Clinical validity was assessed by comparing the on- and off- treatment scores using Kruskal-Wallis and Mann-Whitney U tests. Cronbach's coefficient alpha was over 0.70 for the total scale and over 0.60 for each subscale by age except for the 'pain and hurt' subscale for children aged 5 to 7 years. For test-retest reliability, the ICC exceeded 0.70 for the total scale for each age. Exploratory factor analysis demonstrated sufficient factorial validity. Multitrait scaling analysis showed high success rates. Strong correlations were found between the reports by children and their parents, and the scores of the Cancer Module and the Generic Core Scales except for 'treatment anxiety' subscales for child reports. The Depression Self-Rating Scale for Children (DSRS-C) scores were significantly correlated with emotional domains and the total score of the cancer module. Children who had been off treatment over 12 months demonstrated significantly higher scores than those on treatment. The results demonstrate the reliability and validity of the Japanese version of the PedsQL Cancer Module among Japanese children.
Frederick, R I
2000-01-01
Mixed group validation (MGV) is offered as an alternative to criterion group validation (CGV) to estimate the true positive and false positive rates of tests and other diagnostic signs. CGV requires perfect confidence about each research participant's status with respect to the presence or absence of pathology. MGV determines diagnostic efficiencies based on group data; knowing an individual's status with respect to pathology is not required. MGV can use relatively weak indicators to validate better diagnostic signs, whereas CGV requires perfect diagnostic signs to avoid error in computing true positive and false positive rates. The process of MGV is explained, and a computer simulation demonstrates the soundness of the procedure. MGV of the Rey 15-Item Memory Test (Rey, 1958) for 723 pre-trial criminal defendants resulted in higher estimates of true positive rates and lower estimates of false positive rates as compared with prior research conducted with CGV. The author demonstrates how MGV addresses all the criticisms Rogers (1997b) outlined for differential prevalence designs in malingering detection research. Copyright 2000 John Wiley & Sons, Ltd.
Hagemeier, Nicholas E; Murawski, Matthew M
2014-02-12
To develop and validate an instrument to assess subjective ratings of the perceived value of various postgraduate training paths followed using expectancy-value as a theoretical framework; and to explore differences in value beliefs across type of postgraduate training pursued and type of pharmacy training completed prior to postgraduate training. A survey instrument was developed to sample 4 theoretical domains of subjective task value: intrinsic value, attainment value, utility value, and perceived cost. Retrospective self-report methodology was employed to examine respondents' (N=1,148) subjective task value beliefs specific to their highest level of postgraduate training completed. Exploratory and confirmatory factor analytic techniques were used to evaluate and validate value belief constructs. Intrinsic, attainment, utility, cost, and financial value constructs resulted from exploratory factor analysis. Cross-validation resulted in a 26-item instrument that demonstrated good model fit. Differences in value beliefs were noted across type of postgraduate training pursued and pharmacy training characteristics. The Postgraduate Training Value Instrument demonstrated evidence of reliability and construct validity. The survey instrument can be used to assess value beliefs regarding multiple postgraduate training options in pharmacy and potentially inform targeted recruiting of individuals to those paths best matching their own value beliefs.
Ribeiro de Oliveira, Marcelo Magaldi; Nicolato, Arthur; Santos, Marcilea; Godinho, Joao Victor; Brito, Rafael; Alvarenga, Alexandre; Martins, Ana Luiza Valle; Prosdocimi, André; Trivelato, Felipe Padovani; Sabbagh, Abdulrahman J; Reis, Augusto Barbosa; Maestro, Rolando Del
2016-05-01
OBJECT The development of neurointerventional treatments of central nervous system disorders has resulted in the need for adequate training environments for novice interventionalists. Virtual simulators offer anatomical definition but lack adequate tactile feedback. Animal models, which provide more lifelike training, require an appropriate infrastructure base. The authors describe a training model for neurointerventional procedures using the human placenta (HP), which affords haptic training with significantly fewer resource requirements, and discuss its validation. METHODS Twelve HPs were prepared for simulated endovascular procedures. Training exercises performed by interventional neuroradiologists and novice fellows were placental angiography, stent placement, aneurysm coiling, and intravascular liquid embolic agent injection. RESULTS The endovascular training exercises proposed can be easily reproduced in the HP. Face, content, and construct validity were assessed by 6 neurointerventional radiologists and 6 novice fellows in interventional radiology. CONCLUSIONS The use of HP provides an inexpensive training model for the training of neurointerventionalists. Preliminary validation results show that this simulation model has face and content validity and has demonstrated construct validity for the interventions assessed in this study.
ERIC Educational Resources Information Center
Wu, Amery D.; Stone, Jake E.; Liu, Yan
2016-01-01
This article proposes and demonstrates a methodology for test score validation through abductive reasoning. It describes how abductive reasoning can be utilized in support of the claims made about test score validity. This methodology is demonstrated with a real data example of the Canadian English Language Proficiency Index Program…
Safipour, Jalal; Tessma, Mesfin Kassaye; Higginbottom, Gina; Emami, Azita
2010-12-01
The objective of the study is to translate and examine the reliability and validity of the Jessor and Jessor Social Alienation Scale for use in a Swedish context. The study involved four phases of testing: (1) Translation and back-translation; (2) a pilot test to evaluate the translation; (3) reliability testing; and (4) a validity test. Main participants of this study were 446 students (Age = 15-19, SD = 1.01, Mean = 17). Results from the reliability test showed high internal consistency and stability. Face, content and construct validity were demonstrated using experts and confirmatory factor analysis. The results of testing the Swedish version of the alienation scale revealed an acceptable level of reliability and validity, and is appropriate for use in the Swedish context. © 2010 The Authors. Scandinavian Journal of Psychology © 2010 The Scandinavian Psychological Associations.
Assessing attitude toward same-sex marriage: scale development and validation.
Lannutti, Pamela J; Lachlan, Kenneth A
2007-01-01
This paper reports the results of three studies conducted to develop, refine, and validate a scale which assessed heterosexual adults' attitudes toward same-sex marriage, the Attitude Toward Same-Sex Marriage Scale (ASSMS). The need for such a scale is evidenced in the increasing importance of same-sex marriage in the political arena of the United States and other nations, as well as the growing body of empirical research examining same-sex marriage and related issues (e.g., Lannutti, 2005; Solomon, Rothblum, & Balsam, 2004). The results demonstrate strong reliability, convergent validity, and predictive validity for the ASSMS and suggest that the ASSMS may be adapted to measure attitudes toward civil unions and other forms of relational recognition for same-sex couples. Gender comparisons using the validated scale showed that in college and non-college samples, women had a significantly more positive attitude toward same-sex marriage than did men.
McDonald, Scott D; Beckham, Jean C; Morey, Rajendra A; Calhoun, Patrick S
2009-03-01
The present study examined the psychometric properties and diagnostic efficiency of the Davidson Trauma Scale (DTS), a self-report measure of posttraumatic stress disorder (PTSD) symptoms. Participants included 158 U.S. military veterans who have served since September 11, 2001 (post-9/11). Results support the DTS as a valid self-report measure of PTSD symptoms. The DTS demonstrated good internal consistency, concurrent validity, and convergent and divergent validity. Diagnostic efficiency was excellent when discriminating between veterans with PTSD and veterans with no Axis I diagnosis. However, although satisfactory by conventional standards, efficiency was substantially attenuated when discriminating between PTSD and other Axis I diagnoses. Thus, results illustrate that potency of the DTS as a diagnostic aid was highly dependent on the comparison group used for analyses. Results are discussed in terms of applications to clinical practice and research.
Fatigue Failure of Space Shuttle Main Engine Turbine Blades
NASA Technical Reports Server (NTRS)
Swanson, Gregrory R.; Arakere, Nagaraj K.
2000-01-01
Experimental validation of finite element modeling of single crystal turbine blades is presented. Experimental results from uniaxial high cycle fatigue (HCF) test specimens and full scale Space Shuttle Main Engine test firings with the High Pressure Fuel Turbopump Alternate Turbopump (HPFTP/AT) provide the data used for the validation. The conclusions show the significant contribution of the crystal orientation within the blade on the resulting life of the component, that the analysis can predict this variation, and that experimental testing demonstrates it.
Validation of the M. D. Anderson Symptom Inventory multiple myeloma module
2013-01-01
Background The symptom burden associated with multiple myeloma (MM) is often severe. Presently, no instrument comprehensively assesses disease-related and treatment-related symptoms in patients with MM. We sought to validate a module of the M. D. Anderson Symptom Inventory (MDASI) developed specifically for patients with MM (MDASI-MM). Methods The MDASI-MM was developed with clinician input, cognitive debriefing, and literature review, and administered to 132 patients undergoing induction chemotherapy or stem cell transplantation. We demonstrated the MDASI-MM’s reliability (Cronbach α values); criterion validity (item and subscale correlations between the MDASI-MM and the European Organization for Research and Treatment of Cancer Quality of Life Questionnaire (EORTC QLQ-C30) and the EORTC MM module (QLQ-MY20)), and construct validity (differences between groups by performance status). Ratings from transplant patients were examined to demonstrate the MDASI-MM’s sensitivity in detecting the acute worsening of symptoms post-transplantation. Results The MDASI-MM demonstrated excellent correlations with subscales of the 2 EORTC instruments, strong ability to distinguish clinically different patient groups, high sensitivity in detecting change in patients’ performance status, and high reliability. Cognitive debriefing confirmed that the MDASI-MM encompasses the breadth of symptoms relevant to patients with MM. Conclusion The MDASI-MM is a valid, reliable, comprehensive-yet-concise tool that is recommended as a uniform symptom assessment instrument for patients with MM. PMID:23384030
Cappelleri, J C; Althof, S E; Siegel, R L; Shpilsky, A; Bell, S S; Duttagupta, S
2004-02-01
Development and validation of a patient-reported measure of psychosocial variables in men with erectile dysfunction (ED) is described. Literature review, focus groups, and medical specialists identified 86 potential items. Redundant, ambiguous, or low item-to-total correlation items were removed. Data from 98 men reporting diagnosed ED and 94 controls assisted in final item selection and psychometric evaluation. Treatment responsiveness was evaluated in 93 men with ED in a 10-week open-label trial of sildenafil citrate (Viagra). The 14 chosen items resolved into two domains: Sexual Relationship (eight items) and Confidence (six items), the latter comprising Self-Esteem (four items) and Overall Relationship (two items) subscales. The resulting Self-Esteem And Relationship (SEAR) questionnaire demonstrated validity and reliability. The intervention study demonstrated responsiveness to beneficial treatment with significant improvement in scores (P=0.0001). The SEAR questionnaire possesses strong psychometric properties that support its validity and reliability for measuring sexual relationship, confidence, and particularly self-esteem.
Cloud computing and validation of expandable in silico livers.
Ropella, Glen E P; Hunt, C Anthony
2010-12-03
In Silico Livers (ISLs) are works in progress. They are used to challenge multilevel, multi-attribute, mechanistic hypotheses about the hepatic disposition of xenobiotics coupled with hepatic responses. To enhance ISL-to-liver mappings, we added discrete time metabolism, biliary elimination, and bolus dosing features to a previously validated ISL and initiated re-validated experiments that required scaling experiments to use more simulated lobules than previously, more than could be achieved using the local cluster technology. Rather than dramatically increasing the size of our local cluster we undertook the re-validation experiments using the Amazon EC2 cloud platform. So doing required demonstrating the efficacy of scaling a simulation to use more cluster nodes and assessing the scientific equivalence of local cluster validation experiments with those executed using the cloud platform. The local cluster technology was duplicated in the Amazon EC2 cloud platform. Synthetic modeling protocols were followed to identify a successful parameterization. Experiment sample sizes (number of simulated lobules) on both platforms were 49, 70, 84, and 152 (cloud only). Experimental indistinguishability was demonstrated for ISL outflow profiles of diltiazem using both platforms for experiments consisting of 84 or more samples. The process was analogous to demonstration of results equivalency from two different wet-labs. The results provide additional evidence that disposition simulations using ISLs can cover the behavior space of liver experiments in distinct experimental contexts (there is in silico-to-wet-lab phenotype similarity). The scientific value of experimenting with multiscale biomedical models has been limited to research groups with access to computer clusters. The availability of cloud technology coupled with the evidence of scientific equivalency has lowered the barrier and will greatly facilitate model sharing as well as provide straightforward tools for scaling simulations to encompass greater detail with no extra investment in hardware.
Choo, Min Soo; Jeong, Seong Jin; Cho, Sung Yong; Yoo, Changwon; Jeong, Chang Wook; Ku, Ja Hyeon; Oh, Seung-June
2017-04-01
We aimed to externally validate the prediction model we developed for having bladder outlet obstruction (BOO) and requiring prostatic surgery using 2 independent data sets from tertiary referral centers, and also aimed to validate a mobile app for using this model through usability testing. Formulas and nomograms predicting whether a subject has BOO and needs prostatic surgery were validated with an external validation cohort from Seoul National University Bundang Hospital and Seoul Metropolitan Government-Seoul National University Boramae Medical Center between January 2004 and April 2015. A smartphone-based app was developed, and 8 young urologists were enrolled for usability testing to identify any human factor issues of the app. A total of 642 patients were included in the external validation cohort. No significant differences were found in the baseline characteristics of major parameters between the original (n=1,179) and the external validation cohort, except for the maximal flow rate. Predictions of requiring prostatic surgery in the validation cohort showed a sensitivity of 80.6%, a specificity of 73.2%, a positive predictive value of 49.7%, and a negative predictive value of 92.0%, and area under receiver operating curve of 0.84. The calibration plot indicated that the predictions have good correspondence. The decision curve showed also a high net benefit. Similar evaluation results using the external validation cohort were seen in the predictions of having BOO. Overall results of the usability test demonstrated that the app was user-friendly with no major human factor issues. External validation of these newly developed a prediction model demonstrated a moderate level of discrimination, adequate calibration, and high net benefit gains for predicting both having BOO and requiring prostatic surgery. Also a smartphone app implementing the prediction model was user-friendly with no major human factor issue.
Validation of new psychosocial factors questionnaires: a Colombian national study.
Villalobos, Gloria H; Vargas, Angélica M; Rondón, Martin A; Felknor, Sarah A
2013-01-01
The study of workers' health problems possibly associated with stressful conditions requires valid and reliable tools for monitoring risk factors. The present study validates two questionnaires to assess psychosocial risk factors for stress-related illnesses within a sample of Colombian workers. The validation process was based on a representative sample survey of 2,360 Colombian employees, aged 18-70 years. Worker response rate was 90%; 46% of the responders were women. Internal consistency was calculated, construct validity was tested with factor analysis and concurrent validity was tested with Spearman correlations. The questionnaires demonstrated adequate reliability (0.88-0.95). Factor analysis confirmed the dimensions proposed in the measurement model. Concurrent validity resulted in significant correlations with stress and health symptoms. "Work and Non-work Psychosocial Factors Questionnaires" were found to be valid and reliable for the assessment of workers' psychosocial factors, and they provide information for research and intervention. Copyright © 2012 Wiley Periodicals, Inc.
Multiple Versus Single Set Validation of Multivariate Models to Avoid Mistakes.
Harrington, Peter de Boves
2018-01-02
Validation of multivariate models is of current importance for a wide range of chemical applications. Although important, it is neglected. The common practice is to use a single external validation set for evaluation. This approach is deficient and may mislead investigators with results that are specific to the single validation set of data. In addition, no statistics are available regarding the precision of a derived figure of merit (FOM). A statistical approach using bootstrapped Latin partitions is advocated. This validation method makes an efficient use of the data because each object is used once for validation. It was reviewed a decade earlier but primarily for the optimization of chemometric models this review presents the reasons it should be used for generalized statistical validation. Average FOMs with confidence intervals are reported and powerful, matched-sample statistics may be applied for comparing models and methods. Examples demonstrate the problems with single validation sets.
NASA Astrophysics Data System (ADS)
Herrera-Basurto, R.; Mercader-Trejo, F.; Muñoz-Madrigal, N.; Juárez-García, J. M.; Rodriguez-López, A.; Manzano-Ramírez, A.
2016-07-01
The main goal of method validation is to demonstrate that the method is suitable for its intended purpose. One of the advantages of analytical method validation is translated into a level of confidence about the measurement results reported to satisfy a specific objective. Elemental composition determination by wavelength dispersive spectrometer (WDS) microanalysis has been used over extremely wide areas, mainly in the field of materials science, impurity determinations in geological, biological and food samples. However, little information is reported about the validation of the applied methods. Herein, results of the in-house method validation for elemental composition determination by WDS are shown. SRM 482, a binary alloy Cu-Au of different compositions, was used during the validation protocol following the recommendations for method validation proposed by Eurachem. This paper can be taken as a reference for the evaluation of the validation parameters more frequently requested to get the accreditation under the requirements of the ISO/IEC 17025 standard: selectivity, limit of detection, linear interval, sensitivity, precision, trueness and uncertainty. A model for uncertainty estimation was proposed including systematic and random errors. In addition, parameters evaluated during the validation process were also considered as part of the uncertainty model.
Design and validation of a real-time spiking-neural-network decoder for brain-machine interfaces
NASA Astrophysics Data System (ADS)
Dethier, Julie; Nuyujukian, Paul; Ryu, Stephen I.; Shenoy, Krishna V.; Boahen, Kwabena
2013-06-01
Objective. Cortically-controlled motor prostheses aim to restore functions lost to neurological disease and injury. Several proof of concept demonstrations have shown encouraging results, but barriers to clinical translation still remain. In particular, intracortical prostheses must satisfy stringent power dissipation constraints so as not to damage cortex. Approach. One possible solution is to use ultra-low power neuromorphic chips to decode neural signals for these intracortical implants. The first step is to explore in simulation the feasibility of translating decoding algorithms for brain-machine interface (BMI) applications into spiking neural networks (SNNs). Main results. Here we demonstrate the validity of the approach by implementing an existing Kalman-filter-based decoder in a simulated SNN using the Neural Engineering Framework (NEF), a general method for mapping control algorithms onto SNNs. To measure this system’s robustness and generalization, we tested it online in closed-loop BMI experiments with two rhesus monkeys. Across both monkeys, a Kalman filter implemented using a 2000-neuron SNN has comparable performance to that of a Kalman filter implemented using standard floating point techniques. Significance. These results demonstrate the tractability of SNN implementations of statistical signal processing algorithms on different monkeys and for several tasks, suggesting that a SNN decoder, implemented on a neuromorphic chip, may be a feasible computational platform for low-power fully-implanted prostheses. The validation of this closed-loop decoder system and the demonstration of its robustness and generalization hold promise for SNN implementations on an ultra-low power neuromorphic chip using the NEF.
van Rossum, Huub H; Kemperman, Hans
2017-02-01
To date, no practical tools are available to obtain optimal settings for moving average (MA) as a continuous analytical quality control instrument. Also, there is no knowledge of the true bias detection properties of applied MA. We describe the use of bias detection curves for MA optimization and MA validation charts for validation of MA. MA optimization was performed on a data set of previously obtained consecutive assay results. Bias introduction and MA bias detection were simulated for multiple MA procedures (combination of truncation limits, calculation algorithms and control limits) and performed for various biases. Bias detection curves were generated by plotting the median number of test results needed for bias detection against the simulated introduced bias. In MA validation charts the minimum, median, and maximum numbers of assay results required for MA bias detection are shown for various bias. Their use was demonstrated for sodium, potassium, and albumin. Bias detection curves allowed optimization of MA settings by graphical comparison of bias detection properties of multiple MA. The optimal MA was selected based on the bias detection characteristics obtained. MA validation charts were generated for selected optimal MA and provided insight into the range of results required for MA bias detection. Bias detection curves and MA validation charts are useful tools for optimization and validation of MA procedures.
The NASTRAN demonstration program manual (level 16.0)
NASA Technical Reports Server (NTRS)
1976-01-01
The types of problems that can be solved with NASTRAN are presented. The nature of the problem, the underlying theory, the specific geometric and physical input quanties, and the comparison of theoretical and NASTRAN results are discussed. At least one problem for each of the rigid formats and nearly all of the elements or provided. The features of NASTRAN demonstrated by specific problems are described. The results obtained are valid.
Cloud computing and validation of expandable in silico livers
2010-01-01
Background In Silico Livers (ISLs) are works in progress. They are used to challenge multilevel, multi-attribute, mechanistic hypotheses about the hepatic disposition of xenobiotics coupled with hepatic responses. To enhance ISL-to-liver mappings, we added discrete time metabolism, biliary elimination, and bolus dosing features to a previously validated ISL and initiated re-validated experiments that required scaling experiments to use more simulated lobules than previously, more than could be achieved using the local cluster technology. Rather than dramatically increasing the size of our local cluster we undertook the re-validation experiments using the Amazon EC2 cloud platform. So doing required demonstrating the efficacy of scaling a simulation to use more cluster nodes and assessing the scientific equivalence of local cluster validation experiments with those executed using the cloud platform. Results The local cluster technology was duplicated in the Amazon EC2 cloud platform. Synthetic modeling protocols were followed to identify a successful parameterization. Experiment sample sizes (number of simulated lobules) on both platforms were 49, 70, 84, and 152 (cloud only). Experimental indistinguishability was demonstrated for ISL outflow profiles of diltiazem using both platforms for experiments consisting of 84 or more samples. The process was analogous to demonstration of results equivalency from two different wet-labs. Conclusions The results provide additional evidence that disposition simulations using ISLs can cover the behavior space of liver experiments in distinct experimental contexts (there is in silico-to-wet-lab phenotype similarity). The scientific value of experimenting with multiscale biomedical models has been limited to research groups with access to computer clusters. The availability of cloud technology coupled with the evidence of scientific equivalency has lowered the barrier and will greatly facilitate model sharing as well as provide straightforward tools for scaling simulations to encompass greater detail with no extra investment in hardware. PMID:21129207
Junghaenel, Doerte U.; Schneider, Stefan; Stone, Arthur A.; Christodoulou, Christopher; Broderick, Joan E.
2014-01-01
Objective This study examined the ecological validity and clinical utility of NIH Patient Reported-Outcomes Measurement Information System (PROMIS®) instruments for anger, depression, and fatigue in women with premenstrual symptoms. Methods One-hundred women completed daily diaries and weekly PROMIS assessments over 4 weeks. Weekly assessments were administered through Computerized Adaptive Testing (CAT). Weekly CATs and corresponding daily scores were compared to evaluate ecological validity. To test clinical utility, we examined if CATs could detect changes in symptom levels, if these changes mirrored those obtained from daily scores, and if CATs could identify clinically meaningful premenstrual symptom change. Results PROMIS CAT scores were higher in the pre-menstrual than the baseline (ps < .0001) and post-menstrual (ps < .0001) weeks. The correlations between CATs and aggregated daily scores ranged from .73 to .88 supporting ecological validity. Mean CAT scores showed systematic changes in accordance with the menstrual cycle and the magnitudes of the changes were similar to those obtained from the daily scores. Finally, Receiver Operating Characteristic (ROC) analyses demonstrated the ability of the CATs to discriminate between women with and without clinically meaningful premenstrual symptom change. Conclusions PROMIS CAT instruments for anger, depression, and fatigue demonstrated validity and utility in premenstrual symptom assessment. The results provide encouraging initial evidence of the utility of PROMIS instruments for the measurement of affective premenstrual symptoms. PMID:24630180
Development and Validation of a Fatigue Assessment Scale for U.S. Construction Workers
Zhang, Mingzong; Sparer, Emily H.; Murphy, Lauren A.; Dennerlein, Jack T.; Fang, Dongping; Katz, Jeffrey N.; Caban-Martinez, Alberto J.
2015-01-01
Objective To develop a fatigue assessment scale and test its reliability and validity for commercial construction workers. Methods Using a two-phased approach, we first identified items for the development of a Fatigue Assessment Scale for Construction Workers (FASCW) through review of existing scales in the scientific literature, key informant interviews (n=11) and focus groups (3 groups with 6 workers each) with construction workers. The second phase included assessment for the reliability, validity and sensitivity of the new scale using a repeated-measures study design with a convenience sample of construction workers (n=144). Results Phase one resulted in a 16-item preliminary scale that after factor analysis yielded a final 10-item scale with two sub-scales (“Lethargy” and “Bodily Ailment”).. During phase two, the FASCW and its subscales demonstrated satisfactory internal consistency (alpha coefficients were FASCW (0.91), Lethargy (0.86) and Bodily Ailment (0.84)) and acceptable test-retest reliability (Pearson Correlations Coefficients: 0.59–0.68; Intraclass Correlation Coefficients: 0.74–0.80). Correlation analysis substantiated concurrent and convergent validity. A discriminant analysis demonstrated that the FASCW differentiated between groups with arthritis status and different work hours. Conclusions The 10-item FASCW with good reliability and validity is an effective tool for assessing the severity of fatigue among construction workers. PMID:25603944
Broderick, Joan E.; Schneider, Stefan; Junghaenel, Doerte U.; Schwartz, Joseph E.; Stone, Arthur A.
2013-01-01
Objective Evaluation of known group validity, ecological validity, and test-retest reliability of four domain instruments from the Patient Reported Outcomes Measurement System (PROMIS) in osteoarthritis (OA) patients. Methods Recruitment of an osteoarthritis sample and a comparison general population (GP) through an Internet survey panel. Pain intensity, pain interference, physical functioning, and fatigue were assessed for 4 consecutive weeks with PROMIS short forms on a daily basis and compared with same-domain Computer Adaptive Test (CAT) instruments that use a 7-day recall. Known group validity (comparison of OA and GP), ecological validity (comparison of aggregated daily measures with CATs), and test-retest reliability were evaluated. Results The recruited samples matched (age, sex, race, ethnicity) the demographic characteristics of the U.S. sample for arthritis and the 2009 Census for the GP. Compliance with repeated measurements was excellent: > 95%. Known group validity for CATs was demonstrated with large effect sizes (pain intensity: 1.42, pain interference: 1.25, and fatigue: .85). Ecological validity was also established through high correlations between aggregated daily measures and weekly CATs (≥ .86). Test-retest validity (7-day) was very good (≥ .80). Conclusion PROMIS CAT instruments demonstrated known group and ecological validity in a comparison of osteoarthritis patients with a general population sample. Adequate test-retest reliability was also observed. These data provide encouraging initial data on the utility of these PROMIS instruments for clinical and research outcomes in osteoarthritis patients. PMID:23592494
Scaglione, John M.; Mueller, Don E.; Wagner, John C.
2014-12-01
One of the most important remaining challenges associated with expanded implementation of burnup credit in the United States is the validation of depletion and criticality calculations used in the safety evaluation—in particular, the availability and use of applicable measured data to support validation, especially for fission products (FPs). Applicants and regulatory reviewers have been constrained by both a scarcity of data and a lack of clear technical basis or approach for use of the data. In this study, this paper describes a validation approach for commercial spent nuclear fuel (SNF) criticality safety (k eff) evaluations based on best-available data andmore » methods and applies the approach for representative SNF storage and transport configurations/conditions to demonstrate its usage and applicability, as well as to provide reference bias results. The criticality validation approach utilizes not only available laboratory critical experiment (LCE) data from the International Handbook of Evaluated Criticality Safety Benchmark Experiments and the French Haut Taux de Combustion program to support validation of the principal actinides but also calculated sensitivities, nuclear data uncertainties, and limited available FP LCE data to predict and verify individual biases for relevant minor actinides and FPs. The results demonstrate that (a) sufficient critical experiment data exist to adequately validate k eff calculations via conventional validation approaches for the primary actinides, (b) sensitivity-based critical experiment selection is more appropriate for generating accurate application model bias and uncertainty, and (c) calculated sensitivities and nuclear data uncertainties can be used for generating conservative estimates of bias for minor actinides and FPs. Results based on the SCALE 6.1 and the ENDF/B-VII.0 cross-section libraries indicate that a conservative estimate of the bias for the minor actinides and FPs is 1.5% of their worth within the application model. Finally, this paper provides a detailed description of the approach and its technical bases, describes the application of the approach for representative pressurized water reactor and boiling water reactor safety analysis models, and provides reference bias results based on the prerelease SCALE 6.1 code package and ENDF/B-VII nuclear cross-section data.« less
Davenport, Todd E; Stevens, Staci R; Baroni, Katie; Van Ness, J Mark; Snell, Christopher R
2011-01-01
To determine the validity and reliability of Short Form 36 Version 2 (SF36v2) in sub-groups of individuals with fatigue. Thirty subjects participated in this study, including n = 16 subjects who met case definition criteria for chronic fatigue syndrome (CFS) and n = 14 non-disabled sedentary matched control subjects. SF36v2 and Multidimensional Fatigue Inventory (MFI-20) were administered before two maximal cardiopulmonary exercise tests (CPETs) administered 24 h apart and an open-ended recovery questionnaire was administered 7 days after CPET challenge. The main outcome measures were self-reported time to recover to pre-challenge functional and symptom status, frequency of post-exertional symptoms and SF36v2 sub-scale scores. Individuals with CFS demonstrated significantly lower SF36v2 and MFI-20 sub-scale scores prior to CPET. Between-group differences remained significant post-CPET, however, there were no significant group by test interaction effects. Subjects with CFS reported significantly more total symptoms (p < 0.001), as well as reports of fatigue (p < 0.001), neuroendocrine (p < 0.001), immune (p < 0.01), pain (p < 0.01) and sleep disturbance (p < 0.01) symptoms than control subjects as a result of CPET. Many symptom counts demonstrated significant relationships with SF36v2 sub-scale scores (p < 0.05). SF36v2 and MFI-20 sub-scale scores demonstrated significant correlations (p < 0.05). Various SF36v2 sub-scale scores demonstrated significant predictive validity to identify subjects who recovered from CPET challenge within 1 day and 7 days (p < 0.05). Potential floor effects were observed for both questionnaires for individuals with CFS. Various sub-scales of SF36v2 demonstrated adequate reliability and validity for clinical and research applications. Adequacy of sensitivity to change of SF36v2 as a result of a fatiguing stressor should be the subject of additional study.
Validation of the ArthroS virtual reality simulator for arthroscopic skills.
Stunt, J J; Kerkhoffs, G M M J; van Dijk, C N; Tuijthof, G J M
2015-11-01
Virtual reality simulator training has become important for acquiring arthroscopic skills. A new simulator for knee arthroscopy ArthroS™ has been developed. The purpose of this study was to demonstrate face and construct validity, executed according to a protocol used previously to validate arthroscopic simulators. Twenty-seven participants were divided into three groups having different levels of arthroscopic experience. Participants answered questions regarding general information and the outer appearance of the simulator for face validity. Construct validity was assessed with one standardized navigation task. Face validity, educational value and user friendliness were further determined by giving participants three exercises and by asking them to fill out the questionnaire. Construct validity was demonstrated between experts and beginners. Median task times were not significantly different for all repetitions between novices and intermediates, and between intermediates and experts. Median face validity was 8.3 for the outer appearance, 6.5 for the intra-articular joint and 4.7 for surgical instruments. Educational value and user friendliness were perceived as nonsatisfactory, especially because of the lack of tactile feedback. The ArthroS™ demonstrated construct validity between novices and experts, but did not demonstrate full face validity. Future improvements should be mainly focused on the development of tactile feedback. It is necessary that a newly presented simulator is validated to prove it actually contributes to proficiency of skills.
Psychometric Evaluation of the PSQI in U.S. College Students
Dietch, Jessica R.; Taylor, Daniel J.; Sethi, Kevin; Kelly, Kimberly; Bramoweth, Adam D.; Roane, Brandy M.
2016-01-01
Study Objectives: Examine the psychometric properties of the PSQI in two U.S. college samples. Methods: Study I assessed convergent and divergent validity in 866 undergraduates who completed a sleep diary, PSQI, and other sleep and psychosocial measures. Study II assessed PSQI insomnia diagnostic accuracy in a separate sample of 147 healthy undergraduates with and without insomnia. Results: The PSQI global score had only moderate convergent validity with sleep diary sleep efficiency (prospective global measure of sleep continuity; r = 0.53), the Insomnia Severity Index (r = 0.63), and fatigue (r = 0.44). The PSQI global score demonstrated good divergent validity with measures of excessive daytime sleepiness (r = 0.18), circadian preference (r = −0.08), alcohol (r = 0.08) and marijuana (r = 0.05) abuse scales, and poor divergent validity with depression (r = 0.48), anxiety (r = 0.40), and perceived stress (r = 0.33). Examination of other analogous PSQI and sleep diary components showed low to moderate convergent validity: sleep latency (r = 0.70), wake after sleep onset (r = 0.37), sleep duration (r = 0.51), and sleep efficiency (r = −0.32). Diagnostic accuracy of the PSQI to detect insomnia was very high (area under the curve = 0.999). Sensitivity and specificity were maximized at a cutoff of 6. Conclusions: The PSQI demonstrated moderate convergent validity compared to measures of insomnia and fatigue and good divergent validity with measures of daytime sleepiness, circadian phase preference, and alcohol and marijuana use. The PSQI demonstrated considerable overlap with depression, anxiety, and perceived stress. Therefore, caution should be used with interpretation. Citation: Dietch JR, Taylor DJ, Sethi K, Kelly K, Bramoweth AD, Roane BM. Psychometric evaluation of the PSQI in U.S. college students. J Clin Sleep Med 2016;12(8):1121–1129. PMID:27166299
2017-01-01
Objective To perform a translation and cross-cultural adaptation of the Cardiac Rehabilitation Barriers Scale (CRBS) for use in Korea, followed by psychometric validation. The CRBS was developed to assess patients' perception of the degree to which patient, provider and health system-level barriers affect their cardiac rehabilitation (CR) participation. Methods The CRBS consists of 21 items (barriers to adherence) rated on a 5-point Likert scale. The first phase was to translate and cross-culturally adapt the CRBS to the Korean language. After back-translation, both versions were reviewed by a committee. The face validity was assessed in a sample of Korean patients (n=53) with history of acute myocardial infarction that did not participate in CR through semi-structured interviews. The second phase was to assess the construct and criterion validity of the Korean translation as well as internal reliability, through administration of the translated version in 104 patients, principle component analysis with varimax rotation and cross-referencing against CR use, respectively. Results The length, readability, and clarity of the questionnaire were rated well, demonstrating face validity. Analysis revealed a six-factor solution, demonstrating construct validity. Cronbach's alpha was greater than 0.65. Barriers rated highest included not knowing about CR and not being contacted by a program. The mean CRBS score was significantly higher among non-attendees (2.71±0.26) than CR attendees (2.51±0.18) (p<0.01). Conclusion The Korean version of CRBS has demonstrated face, content and criterion validity, suggesting it may be useful for assessing barriers to CR utilization in Korea. PMID:29201826
Ely, E Wesley; Truman, Brenda; Shintani, Ayumi; Thomason, Jason W W; Wheeler, Arthur P; Gordon, Sharon; Francis, Joseph; Speroff, Theodore; Gautam, Shiva; Margolin, Richard; Sessler, Curtis N; Dittus, Robert S; Bernard, Gordon R
2003-06-11
Goal-directed delivery of sedative and analgesic medications is recommended as standard care in intensive care units (ICUs) because of the impact these medications have on ventilator weaning and ICU length of stay, but few of the available sedation scales have been appropriately tested for reliability and validity. To test the reliability and validity of the Richmond Agitation-Sedation Scale (RASS). Prospective cohort study. Adult medical and coronary ICUs of a university-based medical center. Thirty-eight medical ICU patients enrolled for reliability testing (46% receiving mechanical ventilation) from July 21, 1999, to September 7, 1999, and an independent cohort of 275 patients receiving mechanical ventilation were enrolled for validity testing from February 1, 2000, to May 3, 2001. Interrater reliability of the RASS, Glasgow Coma Scale (GCS), and Ramsay Scale (RS); validity of the RASS correlated with reference standard ratings, assessments of content of consciousness, GCS scores, doses of sedatives and analgesics, and bispectral electroencephalography. In 290-paired observations by nurses, results of both the RASS and RS demonstrated excellent interrater reliability (weighted kappa, 0.91 and 0.94, respectively), which were both superior to the GCS (weighted kappa, 0.64; P<.001 for both comparisons). Criterion validity was tested in 411-paired observations in the first 96 patients of the validation cohort, in whom the RASS showed significant differences between levels of consciousness (P<.001 for all) and correctly identified fluctuations within patients over time (P<.001). In addition, 5 methods were used to test the construct validity of the RASS, including correlation with an attention screening examination (r = 0.78, P<.001), GCS scores (r = 0.91, P<.001), quantity of different psychoactive medication dosages 8 hours prior to assessment (eg, lorazepam: r = - 0.31, P<.001), successful extubation (P =.07), and bispectral electroencephalography (r = 0.63, P<.001). Face validity was demonstrated via a survey of 26 critical care nurses, which the results showed that 92% agreed or strongly agreed with the RASS scoring scheme, and 81% agreed or strongly agreed that the instrument provided a consensus for goal-directed delivery of medications. The RASS demonstrated excellent interrater reliability and criterion, construct, and face validity. This is the first sedation scale to be validated for its ability to detect changes in sedation status over consecutive days of ICU care, against constructs of level of consciousness and delirium, and correlated with the administered dose of sedative and analgesic medications.
The design of a joined wing flight demonstrator aircraft
NASA Technical Reports Server (NTRS)
Smith, S. C.; Cliff, S. E.; Kroo, I. M.
1987-01-01
A joined-wing flight demonstrator aircraft has been developed at the NASA Ames Research Center in collaboration with ACA Industries. The aircraft is designed to utilize the fuselage, engines, and undercarriage of the existing NASA AD-1 flight demonstrator aircraft. The design objectives, methods, constraints, and the resulting aircraft design, called the JW-1, are presented. A wind-tunnel model of the JW-1 was tested in the NASA Ames 12-foot wind tunnel. The test results indicate that the JW-1 has satisfactory flying qualities for a flight demonstrator aircraft. Good agreement of test results with design predictions confirmed the validity of the design methods used for application to joined-wing configurations.
Out-of-Plane Continuous Electrostatic Micro-Power Generators
Mahmoud, M. A. E.; Abdel-Rahman, E. M.; Mansour, R. R.; El-Saadany, E. F.
2017-01-01
This paper presents an out-of-plane electrostatic micro-power generator (MPG). Electret-based continuous MPGs with different gaps and masses are fabricated to demonstrate the merits of this topology. Experimental results of the MPG demonstrate output power of 1 mW for a base acceleration amplitude and frequency of 0.08 g and 86 Hz. The MPGs also demonstrate a wideband harvesting bandwidth reaching up to 9 Hz. A free-flight and an impact mode model of electrostatic MPGs are also derived and validated by comparison to experimental results. PMID:28420151
Analysis of model development strategies: predicting ventral hernia recurrence.
Holihan, Julie L; Li, Linda T; Askenasy, Erik P; Greenberg, Jacob A; Keith, Jerrod N; Martindale, Robert G; Roth, J Scott; Liang, Mike K
2016-11-01
There have been many attempts to identify variables associated with ventral hernia recurrence; however, it is unclear which statistical modeling approach results in models with greatest internal and external validity. We aim to assess the predictive accuracy of models developed using five common variable selection strategies to determine variables associated with hernia recurrence. Two multicenter ventral hernia databases were used. Database 1 was randomly split into "development" and "internal validation" cohorts. Database 2 was designated "external validation". The dependent variable for model development was hernia recurrence. Five variable selection strategies were used: (1) "clinical"-variables considered clinically relevant, (2) "selective stepwise"-all variables with a P value <0.20 were assessed in a step-backward model, (3) "liberal stepwise"-all variables were included and step-backward regression was performed, (4) "restrictive internal resampling," and (5) "liberal internal resampling." Variables were included with P < 0.05 for the Restrictive model and P < 0.10 for the Liberal model. A time-to-event analysis using Cox regression was performed using these strategies. The predictive accuracy of the developed models was tested on the internal and external validation cohorts using Harrell's C-statistic where C > 0.70 was considered "reasonable". The recurrence rate was 32.9% (n = 173/526; median/range follow-up, 20/1-58 mo) for the development cohort, 36.0% (n = 95/264, median/range follow-up 20/1-61 mo) for the internal validation cohort, and 12.7% (n = 155/1224, median/range follow-up 9/1-50 mo) for the external validation cohort. Internal validation demonstrated reasonable predictive accuracy (C-statistics = 0.772, 0.760, 0.767, 0.757, 0.763), while on external validation, predictive accuracy dipped precipitously (C-statistic = 0.561, 0.557, 0.562, 0.553, 0.560). Predictive accuracy was equally adequate on internal validation among models; however, on external validation, all five models failed to demonstrate utility. Future studies should report multiple variable selection techniques and demonstrate predictive accuracy on external data sets for model validation. Copyright © 2016 Elsevier Inc. All rights reserved.
Fearon, A M; Ganderton, C; Scarvell, J M; Smith, P N; Neeman, T; Nash, C; Cook, J L
2015-12-01
Greater trochanteric pain syndrome (GTPS) is common, resulting in significant pain and disability. There is no condition specific outcome score to evaluate the degree of severity of disability associated with GTPS in patients with this condition. To develop a reliable and valid outcome measurement capable of evaluating the severity of disability associated with GTPS. A phenomenological framework using in-depth semi structured interviews of patients and medical experts, and focus groups of physiotherapists was used in the item generation. Item and format clarification was undertaken via piloting. Multivariate analysis provided the basis for item reduction. The resultant VISA-G was tested for reliability with the inter class co-efficient (ICC), internal consistency (Cronbach's Alpha), and construct validity (correlation co-efficient) on 52 naïve participants with GTPS and 31 asymptomatic participants. The resultant outcome measurement tool is consistent in style with existing tendinopathy outcome measurement tools, namely the suite of VISA scores. The VISA-G was found to be have a test-retest reliability of ICC2,1 (95% CI) of 0.827 (0.638-0.923). Internal consistency was high with a Cronbach's Alpha of 0.809. Construct validity was demonstrated: the VISA-G measures different constructs than tools previously used in assessing GTPS, the Harris Hip Score and the Oswestry Disability Index (Spearman Rho:0.020 and 0.0205 respectively). The VISA-G did not demonstrate any floor or ceiling effect in symptomatic participants. The VISA-G is a reliable and valid score for measuring the severity of disability associated GTPS. Copyright © 2015 Elsevier Ltd. All rights reserved.
The Development and Validation of the Rational and Intuitive Decision Styles Scale.
Hamilton, Katherine; Shih, Shin-I; Mohammed, Susan
2016-01-01
Decision styles reflect the typical manner by which individuals make decisions. The purpose of this research was to develop and validate a decision style scale that addresses conceptual and psychometric problems with current measures. The resulting 10-item scale captures a broad range of the rational and intuitive styles construct domain. Results from 5 independent samples provide initial support for the dimensionality and reliability of the new scale, as demonstrated by a clear factor structure and high internal consistency. In addition, our results show evidence of convergent and discriminant validity through expected patterns of correlations across decision-making individual differences and the International Personality Item Pool (IPIP) Big Five traits. Research domains that would benefit from incorporating the concept of decision styles are discussed.
AIRS Retrieval Validation During the EAQUATE
NASA Technical Reports Server (NTRS)
Zhou, Daniel K.; Smith, William L.; Cuomo, Vincenzo; Taylor, Jonathan P.; Barnet, Christopher D.; DiGirolamo, Paolo; Pappalardo, Gelsomina; Larar, Allen M.; Liu, Xu; Newman, Stuart M.
2006-01-01
Atmospheric and surface thermodynamic parameters retrieved with advanced hyperspectral remote sensors of Earth observing satellites are critical for weather prediction and scientific research. The retrieval algorithms and retrieved parameters from satellite sounders must be validated to demonstrate the capability and accuracy of both observation and data processing systems. The European AQUA Thermodynamic Experiment (EAQUATE) was conducted mainly for validation of the Atmospheric InfraRed Sounder (AIRS) on the AQUA satellite, but also for assessment of validation systems of both ground-based and aircraft-based instruments which will be used for other satellite systems such as the Infrared Atmospheric Sounding Interferometer (IASI) on the European MetOp satellite, the Cross-track Infrared Sounder (CrIS) from the NPOESS Preparatory Project and the following NPOESS series of satellites. Detailed inter-comparisons were conducted and presented using different retrieval methodologies: measurements from airborne ultraspectral Fourier transform spectrometers, aircraft in-situ instruments, dedicated dropsondes and radiosondes, and ground based Raman Lidar, as well as from the European Center for Medium range Weather Forecasting (ECMWF) modeled thermal structures. The results of this study not only illustrate the quality of the measurements and retrieval products but also demonstrate the capability of these validation systems which are put in place to validate current and future hyperspectral sounding instruments and their scientific products.
Chen, Y-W; HajGhanbari, B; Road, J D; Coxson, H O; Camp, P G; Reid, W D
2018-06-08
Pain is prevalent in chronic obstructive pulmonary disease (COPD) and the Brief Pain Inventory (BPI) appears to be a feasible questionnaire to assess this symptom. However, the reliability and validity of the BPI have not been determined in individuals with COPD. This study aimed to determine the internal consistency, test-retest reliability and validity (construct, convergent, divergent and discriminant) of the BPI in individuals with COPD. In order to examine the test-retest reliability, individuals with COPD were recruited from pulmonary rehabilitation programmes to complete the BPI twice 1 week apart. In order to investigate validity, de-identified data was retrieved from two previous studies, including forced expiratory volume in 1-s, age, sex and data from four questionnaires: the BPI, short-form McGill Pain Questionnaire (SF-MPQ), 36-Item Short Form Survey (SF-36) and Community Health Activities Model Program for Seniors (CHAMPS) questionnaire. In total, 123 participants were included in the analyses (eligible data were retrieved from 86 participants and additional 37 participants were recruited). The BPI demonstrated excellent internal consistency and test-retest reliability. It also showed convergent validity with the SF-MPQ and divergent validity with the SF-36. The factor analysis yielded two factors of the BPI, which demonstrated that the two domains of the BPI measure the intended constructs. The BPI can also discriminate pain levels among COPD patients with varied levels of quality of life (SF-36) and physical activity (CHAMPS). The BPI is a reliable and valid pain questionnaire that can be used to evaluate pain in COPD. This study formally established the reliability and validity of the BPI in individuals with COPD, which have not been determined in this patient group. The results of this study provide strong evidence that assessment results from this pain questionnaire are reliable and valid. © 2018 European Pain Federation - EFIC®.
Design and validation of a real-time spiking-neural-network decoder for brain-machine interfaces.
Dethier, Julie; Nuyujukian, Paul; Ryu, Stephen I; Shenoy, Krishna V; Boahen, Kwabena
2013-06-01
Cortically-controlled motor prostheses aim to restore functions lost to neurological disease and injury. Several proof of concept demonstrations have shown encouraging results, but barriers to clinical translation still remain. In particular, intracortical prostheses must satisfy stringent power dissipation constraints so as not to damage cortex. One possible solution is to use ultra-low power neuromorphic chips to decode neural signals for these intracortical implants. The first step is to explore in simulation the feasibility of translating decoding algorithms for brain-machine interface (BMI) applications into spiking neural networks (SNNs). Here we demonstrate the validity of the approach by implementing an existing Kalman-filter-based decoder in a simulated SNN using the Neural Engineering Framework (NEF), a general method for mapping control algorithms onto SNNs. To measure this system's robustness and generalization, we tested it online in closed-loop BMI experiments with two rhesus monkeys. Across both monkeys, a Kalman filter implemented using a 2000-neuron SNN has comparable performance to that of a Kalman filter implemented using standard floating point techniques. These results demonstrate the tractability of SNN implementations of statistical signal processing algorithms on different monkeys and for several tasks, suggesting that a SNN decoder, implemented on a neuromorphic chip, may be a feasible computational platform for low-power fully-implanted prostheses. The validation of this closed-loop decoder system and the demonstration of its robustness and generalization hold promise for SNN implementations on an ultra-low power neuromorphic chip using the NEF.
A psychometric evaluation of an advanced pharmacy practice experience clinical competency framework.
Douglas Ried, L; Doty, Randell E; Nemire, Ruth E
2015-03-25
To assess the psychometric properties of the clinical competency framework known as the System of Universal Clinical Competency Evaluation in the Sunshine State (SUCCESS), including its internal consistency and content, construct, and criterion validity. Sub-competency items within each hypothesized competency pair were subjected to principal components factor analysis to demonstrate convergent and discriminant validity. Varimax rotation was conducted for each competency pair (eg, competency 1 vs competency 2, competency 1 vs competency 3, competency 2 vs competency 3). Internal consistency was evaluated using Cronbach alpha. Of the initial 78 pairings, 44 (56%) demonstrated convergent and discriminant validity. Five pairs of competencies were unidimensional. Of the 34 pairs where at least 1 competency was multidimensional, most (91%) were from competencies 7, 11, and 12, indicating modifications were warranted in those competencies. After reconfiguring the competencies, 76 (94%) of the 81 pairs resulted in 2 factors as required. A unidimensional factor emerged when all 13 of the competencies were entered into a factor analysis. The internal consistency of all of the competencies was satisfactory. Psychometric evaluation shows the SUCCESS framework demonstrates adequate reliability and validity for most competencies. However, it also provides guidance where improvements are needed as part of a continuous quality improvement program.
Neto, Jose Osni Bruggemann; Gesser, Rafael Lehmkuhl; Steglich, Valdir; Bonilauri Ferreira, Ana Paula; Gandhi, Mihir; Vissoci, João Ricardo Nickenig; Pietrobon, Ricardo
2013-01-01
Background The validation of widely used scales facilitates the comparison across international patient samples. The objective of this study was to translate, culturally adapt and validate the Simple Shoulder Test into Brazilian Portuguese. Also we test the stability of factor analysis across different cultures. Objective The objective of this study was to translate, culturally adapt and validate the Simple Shoulder Test into Brazilian Portuguese. Also we test the stability of factor analysis across different cultures. Methods The Simple Shoulder Test was translated from English into Brazilian Portuguese, translated back into English, and evaluated for accuracy by an expert committee. It was then administered to 100 patients with shoulder conditions. Psychometric properties were analyzed including factor analysis, internal reliability, test-retest reliability at seven days, and construct validity in relation to the Short Form 36 health survey (SF-36). Results Factor analysis demonstrated a three factor solution. Cronbach’s alpha was 0.82. Test-retest reliability index as measured by intra-class correlation coefficient (ICC) was 0.84. Associations were observed in the hypothesized direction with all subscales of SF-36 questionnaire. Conclusion The Simple Shoulder Test translation and cultural adaptation to Brazilian-Portuguese demonstrated adequate factor structure, internal reliability, and validity, ultimately allowing for its use in the comparison with international patient samples. PMID:23675436
How to test validity in orthodontic research: a mixed dentition analysis example.
Donatelli, Richard E; Lee, Shin-Jae
2015-02-01
The data used to test the validity of a prediction method should be different from the data used to generate the prediction model. In this study, we explored whether an independent data set is mandatory for testing the validity of a new prediction method and how validity can be tested without independent new data. Several validation methods were compared in an example using the data from a mixed dentition analysis with a regression model. The validation errors of real mixed dentition analysis data and simulation data were analyzed for increasingly large data sets. The validation results of both the real and the simulation studies demonstrated that the leave-1-out cross-validation method had the smallest errors. The largest errors occurred in the traditional simple validation method. The differences between the validation methods diminished as the sample size increased. The leave-1-out cross-validation method seems to be an optimal validation method for improving the prediction accuracy in a data set with limited sample sizes. Copyright © 2015 American Association of Orthodontists. Published by Elsevier Inc. All rights reserved.
2004-01-01
Background Evaluation is a challenging but necessary part of the development cycle of clinical information systems like the electronic medical records (EMR) system. It is believed that such evaluations should include multiple perspectives, be comparative and employ both qualitative and quantitative methods. Self-administered questionnaires are frequently used as a quantitative evaluation method in medical informatics, but very few validated questionnaires address clinical use of EMR systems. Methods We have developed a task-oriented questionnaire for evaluating EMR systems from the clinician's perspective. The key feature of the questionnaire is a list of 24 general clinical tasks. It is applicable to physicians of most specialties and covers essential parts of their information-oriented work. The task list appears in two separate sections, about EMR use and task performance using the EMR, respectively. By combining these sections, the evaluator may estimate the potential impact of the EMR system on health care delivery. The results may also be compared across time, site or vendor. This paper describes the development, performance and validation of the questionnaire. Its performance is shown in two demonstration studies (n = 219 and 80). Its content is validated in an interview study (n = 10), and its reliability is investigated in a test-retest study (n = 37) and a scaling study (n = 31). Results In the interviews, the physicians found the general clinical tasks in the questionnaire relevant and comprehensible. The tasks were interpreted concordant to their definitions. However, the physicians found questions about tasks not explicitly or only partially supported by the EMR systems difficult to answer. The two demonstration studies provided unambiguous results and low percentages of missing responses. In addition, criterion validity was demonstrated for a majority of task-oriented questions. Their test-retest reliability was generally high, and the non-standard scale was found symmetric and ordinal. Conclusion This questionnaire is relevant for clinical work and EMR systems, provides reliable and interpretable results, and may be used as part of any evaluation effort involving the clinician's perspective of an EMR system. PMID:15018620
2011-02-01
UNCLASSIFIED: Approved for public release; distribution unlimited. Laboratory Validation and Demonstrations of Non-Hexavalent Chromium Conversion...00-00-2011 4. TITLE AND SUBTITLE Laboratory Validation and Demonstrations of Non-Hexavalent Chromium Conversion Coatings for Steel Substrates 5a...Coatings for HHA • SurTec 650 - ChromitAL TCP - Trivalent Chrome Pretreatment Developed by NAVAIR for Aluminum. • Chemetall Oxsilan 9810/2 - Non-chrome
Development, reliability, and validity of the My Child's Play (MCP) questionnaire.
Schneider, Eleanor; Rosenblum, Sara
2014-01-01
This article describes the development, reliability, and validity of My Child's Play (MCP), a parent questionnaire designed to evaluate the play of children ages 3-9 yr. The first phase of the study determined the questionnaire's content and face validity. Subsequently, the internal reliability consistency and construct and concurrent validity were demonstrated using 334 completed questionnaires. The MCP showed good internal consistency (α = .86). The factor analysis revealed four distinct factors with acceptable levels of internal reliability (Cronbach's αs = .63-.81) and gender- and age-related differences in play characteristics; both findings attest to the tool's construct validity. Significant correlations (r = .33, p < .0001) with the Parent as a Teacher Inventory demonstrate the MCP's concurrent validity. The MCP demonstrated acceptable reliability and validity. It appears to be a promising standardized assessment tool for use in research and practice to promote understanding of a child's play. Copyright © 2014 by the American Occupational Therapy Association, Inc.
ERIC Educational Resources Information Center
Hohlfeld, Tina N.; Ritzhaupt, Albert D.; Barron, Ann E.
2013-01-01
This paper examines gender differences related to Information and Communication Technology (ICT) literacy using two valid and internally consistent measures with eighth grade students (N = 1,513) from Florida public schools. The results of t test statistical analyses, which examined only gender differences in demonstrated and perceived ICT skills,…
Optical Closed-Loop Propulsion Control System Development
NASA Technical Reports Server (NTRS)
Poppel, Gary L.
1998-01-01
The overall objective of this program was to design and fabricate the components required for optical closed-loop control of a F404-400 turbofan engine, by building on the experience of the NASA Fiber Optic Control System Integration (FOCSI) program. Evaluating the performance of fiber optic technology at the component and system levels will result in helping to validate its use on aircraft engines. This report includes descriptions of three test plans. The EOI Acceptance Test is designed to demonstrate satisfactory functionality of the EOI, primarily fail-safe throughput of the F404 sensor signals in the normal mode, and validation, switching, and output of the five analog sensor signals as generated from validated optical sensor inputs, in the optical mode. The EOI System Test is designed to demonstrate acceptable F404 ECU functionality as interfaced with the EOI, making use of a production ECU test stand. The Optical Control Engine Test Request describes planned hardware installation, optical signal calibrations, data system coordination, test procedures, and data signal comparisons for an engine test demonstration of the optical closed-loop control.
Quek, June; Brauer, Sandra G; Treleaven, Julia; Pua, Yong-Hao; Mentiplay, Benjamin; Clark, Ross Allan
2014-04-17
Concurrent validity and intra-rater reliability using a customized Android phone application to measure cervical-spine range-of-motion (ROM) has not been previously validated against a gold-standard three-dimensional motion analysis (3DMA) system. Twenty-one healthy individuals (age:31 ± 9.1 years, male:11) participated, with 16 re-examined for intra-rater reliability 1-7 days later. An Android phone was fixed on a helmet, which was then securely fastened on the participant's head. Cervical-spine ROM in flexion, extension, lateral flexion and rotation were performed in sitting with concurrent measurements obtained from both a 3DMA system and the phone.The phone demonstrated moderate to excellent (ICC = 0.53-0.98, Spearman ρ = 0.52-0.98) concurrent validity for ROM measurements in cervical flexion, extension, lateral-flexion and rotation. However, cervical rotation demonstrated both proportional and fixed bias. Excellent intra-rater reliability was demonstrated for cervical flexion, extension and lateral flexion (ICC = 0.82-0.90), but poor for right- and left-rotation (ICC = 0.05-0.33) using the phone. Possible reasons for the outcome are that flexion, extension and lateral-flexion measurements are detected by gravity-dependent accelerometers while rotation measurements are detected by the magnetometer which can be adversely affected by surrounding magnetic fields. The results of this study demonstrate that the tested Android phone application is valid and reliable to measure ROM of the cervical-spine in flexion, extension and lateral-flexion but not in rotation likely due to magnetic interference. The clinical implication of this study is that therapists should be mindful of the plane of measurement when using the Android phone to measure ROM of the cervical-spine.
2014-01-01
Background Concurrent validity and intra-rater reliability using a customized Android phone application to measure cervical-spine range-of-motion (ROM) has not been previously validated against a gold-standard three-dimensional motion analysis (3DMA) system. Findings Twenty-one healthy individuals (age:31 ± 9.1 years, male:11) participated, with 16 re-examined for intra-rater reliability 1–7 days later. An Android phone was fixed on a helmet, which was then securely fastened on the participant’s head. Cervical-spine ROM in flexion, extension, lateral flexion and rotation were performed in sitting with concurrent measurements obtained from both a 3DMA system and the phone. The phone demonstrated moderate to excellent (ICC = 0.53-0.98, Spearman ρ = 0.52-0.98) concurrent validity for ROM measurements in cervical flexion, extension, lateral-flexion and rotation. However, cervical rotation demonstrated both proportional and fixed bias. Excellent intra-rater reliability was demonstrated for cervical flexion, extension and lateral flexion (ICC = 0.82-0.90), but poor for right- and left-rotation (ICC = 0.05-0.33) using the phone. Possible reasons for the outcome are that flexion, extension and lateral-flexion measurements are detected by gravity-dependent accelerometers while rotation measurements are detected by the magnetometer which can be adversely affected by surrounding magnetic fields. Conclusion The results of this study demonstrate that the tested Android phone application is valid and reliable to measure ROM of the cervical-spine in flexion, extension and lateral-flexion but not in rotation likely due to magnetic interference. The clinical implication of this study is that therapists should be mindful of the plane of measurement when using the Android phone to measure ROM of the cervical-spine. PMID:24742001
ERIC Educational Resources Information Center
Rueger, Sandra Yu; Haines, Beth A.; Malecki, Christine Kerres
2010-01-01
The psychometric properties of two paper-and-pencil versions of the Children's Attributional Style Interview (i.e., CASI-I and CASI-II) were evaluated in a sample of 166 third and fourth graders and a sample of 245 sixth and seventh graders. The results demonstrated strong internal consistency reliability, convergent validity, and a factor…
DOE Office of Scientific and Technical Information (OSTI.GOV)
Scaglione, John M; Mueller, Don; Wagner, John C
2011-01-01
One of the most significant remaining challenges associated with expanded implementation of burnup credit in the United States is the validation of depletion and criticality calculations used in the safety evaluation - in particular, the availability and use of applicable measured data to support validation, especially for fission products. Applicants and regulatory reviewers have been constrained by both a scarcity of data and a lack of clear technical basis or approach for use of the data. U.S. Nuclear Regulatory Commission (NRC) staff have noted that the rationale for restricting their Interim Staff Guidance on burnup credit (ISG-8) to actinide-only ismore » based largely on the lack of clear, definitive experiments that can be used to estimate the bias and uncertainty for computational analyses associated with using burnup credit. To address the issue of validation, the NRC initiated a project with the Oak Ridge National Laboratory to (1) develop and establish a technically sound validation approach (both depletion and criticality) for commercial spent nuclear fuel (SNF) criticality safety evaluations based on best-available data and methods and (2) apply the approach for representative SNF storage and transport configurations/conditions to demonstrate its usage and applicability, as well as to provide reference bias results. The purpose of this paper is to describe the criticality (k{sub eff}) validation approach, and resulting observations and recommendations. Validation of the isotopic composition (depletion) calculations is addressed in a companion paper at this conference. For criticality validation, the approach is to utilize (1) available laboratory critical experiment (LCE) data from the International Handbook of Evaluated Criticality Safety Benchmark Experiments and the French Haut Taux de Combustion (HTC) program to support validation of the principal actinides and (2) calculated sensitivities, nuclear data uncertainties, and the limited available fission product LCE data to predict and verify individual biases for relevant minor actinides and fission products. This paper (1) provides a detailed description of the approach and its technical bases, (2) describes the application of the approach for representative pressurized water reactor and boiling water reactor safety analysis models to demonstrate its usage and applicability, (3) provides reference bias results based on the prerelease SCALE 6.1 code package and ENDF/B-VII nuclear cross-section data, and (4) provides recommendations for application of the results and methods to other code and data packages.« less
On the accuracy of aerosol photoacoustic spectrometer calibrations using absorption by ozone
NASA Astrophysics Data System (ADS)
Davies, Nicholas W.; Cotterell, Michael I.; Fox, Cathryn; Szpek, Kate; Haywood, Jim M.; Langridge, Justin M.
2018-04-01
In recent years, photoacoustic spectroscopy has emerged as an invaluable tool for the accurate measurement of light absorption by atmospheric aerosol. Photoacoustic instruments require calibration, which can be achieved by measuring the photoacoustic signal generated by known quantities of gaseous ozone. Recent work has questioned the validity of this approach at short visible wavelengths (404 nm), indicating systematic calibration errors of the order of a factor of 2. We revisit this result and test the validity of the ozone calibration method using a suite of multipass photoacoustic cells operating at wavelengths 405, 514 and 658 nm. Using aerosolised nigrosin with mobility-selected diameters in the range 250-425 nm, we demonstrate excellent agreement between measured and modelled ensemble absorption cross sections at all wavelengths, thus demonstrating the validity of the ozone-based calibration method for aerosol photoacoustic spectroscopy at visible wavelengths.
Correlates of the MMPI-2-RF in a college setting.
Forbey, Johnathan D; Lee, Tayla T C; Handel, Richard W
2010-12-01
The current study examined empirical correlates of scores on Minnesota Multiphasic Personality Inventory-2-Restructured Form (MMPI-2-RF; A. Tellegen & Y. S. Ben-Porath, 2008; Y. S. Ben-Porath & A. Tellegen, 2008) scales in a college setting. The MMPI-2-RF and six criterion measures (assessing anger, assertiveness, sex roles, cognitive failures, social avoidance, and social fear) were administered to 846 college students (nmen = 264, nwomen = 582) to examine the convergent and discriminant validity of scores on the MMPI-2-RF Specific Problems and Interest scales. Results demonstrated evidence of generally good convergent score validity for the selected MMPI-2-RF scales, reflected in large effect size correlations with criterion measure scores. Further, MMPI-2-RF scale scores demonstrated adequate discriminant validity, reflected in relatively low comparative median correlations between scores on MMPI-2-RF substantive scale sets and criterion measures. Limitations and future directions are discussed.
Macarthur, Roy; Feinberg, Max; Bertheau, Yves
2010-01-01
A method is presented for estimating the size of uncertainty associated with the measurement of products derived from genetically modified organisms (GMOs). The method is based on the uncertainty profile, which is an extension, for the estimation of uncertainty, of a recent graphical statistical tool called an accuracy profile that was developed for the validation of quantitative analytical methods. The application of uncertainty profiles as an aid to decision making and assessment of fitness for purpose is also presented. Results of the measurement of the quantity of GMOs in flour by PCR-based methods collected through a number of interlaboratory studies followed the log-normal distribution. Uncertainty profiles built using the results generally give an expected range for measurement results of 50-200% of reference concentrations for materials that contain at least 1% GMO. This range is consistent with European Network of GM Laboratories and the European Union (EU) Community Reference Laboratory validation criteria and can be used as a fitness for purpose criterion for measurement methods. The effect on the enforcement of EU labeling regulations is that, in general, an individual analytical result needs to be < 0.45% to demonstrate compliance, and > 1.8% to demonstrate noncompliance with a labeling threshold of 0.9%.
The Validity of Conscientiousness Is Overestimated in the Prediction of Job Performance
2015-01-01
Introduction Sensitivity analyses refer to investigations of the degree to which the results of a meta-analysis remain stable when conditions of the data or the analysis change. To the extent that results remain stable, one can refer to them as robust. Sensitivity analyses are rarely conducted in the organizational science literature. Despite conscientiousness being a valued predictor in employment selection, sensitivity analyses have not been conducted with respect to meta-analytic estimates of the correlation (i.e., validity) between conscientiousness and job performance. Methods To address this deficiency, we reanalyzed the largest collection of conscientiousness validity data in the personnel selection literature and conducted a variety of sensitivity analyses. Results Publication bias analyses demonstrated that the validity of conscientiousness is moderately overestimated (by around 30%; a correlation difference of about .06). The misestimation of the validity appears to be due primarily to suppression of small effects sizes in the journal literature. These inflated validity estimates result in an overestimate of the dollar utility of personnel selection by millions of dollars and should be of considerable concern for organizations. Conclusion The fields of management and applied psychology seldom conduct sensitivity analyses. Through the use of sensitivity analyses, this paper documents that the existing literature overestimates the validity of conscientiousness in the prediction of job performance. Our data show that effect sizes from journal articles are largely responsible for this overestimation. PMID:26517553
Reliability and validity of the Wolfram Unified Rating Scale (WURS)
2012-01-01
Background Wolfram syndrome (WFS) is a rare, neurodegenerative disease that typically presents with childhood onset insulin dependent diabetes mellitus, followed by optic atrophy, diabetes insipidus, deafness, and neurological and psychiatric dysfunction. There is no cure for the disease, but recent advances in research have improved understanding of the disease course. Measuring disease severity and progression with reliable and validated tools is a prerequisite for clinical trials of any new intervention for neurodegenerative conditions. To this end, we developed the Wolfram Unified Rating Scale (WURS) to measure the severity and individual variability of WFS symptoms. The aim of this study is to develop and test the reliability and validity of the Wolfram Unified Rating Scale (WURS). Methods A rating scale of disease severity in WFS was developed by modifying a standardized assessment for another neurodegenerative condition (Batten disease). WFS experts scored the representativeness of WURS items for the disease. The WURS was administered to 13 individuals with WFS (6-25 years of age). Motor, balance, mood and quality of life were also evaluated with standard instruments. Inter-rater reliability, internal consistency reliability, concurrent, predictive and content validity of the WURS were calculated. Results The WURS had high inter-rater reliability (ICCs>.93), moderate to high internal consistency reliability (Cronbach’s α = 0.78-0.91) and demonstrated good concurrent and predictive validity. There were significant correlations between the WURS Physical Assessment and motor and balance tests (rs>.67, p<.03), between the WURS Behavioral Scale and reports of mood and behavior (rs>.76, p<.04) and between WURS Total scores and quality of life (rs=-.86, p=.001). The WURS demonstrated acceptable content validity (Scale-Content Validity Index=0.83). Conclusions These preliminary findings demonstrate that the WURS has acceptable reliability and validity and captures individual differences in disease severity in children and young adults with WFS. PMID:23148655
Cross-validation pitfalls when selecting and assessing regression and classification models.
Krstajic, Damjan; Buturovic, Ljubomir J; Leahy, David E; Thomas, Simon
2014-03-29
We address the problem of selecting and assessing classification and regression models using cross-validation. Current state-of-the-art methods can yield models with high variance, rendering them unsuitable for a number of practical applications including QSAR. In this paper we describe and evaluate best practices which improve reliability and increase confidence in selected models. A key operational component of the proposed methods is cloud computing which enables routine use of previously infeasible approaches. We describe in detail an algorithm for repeated grid-search V-fold cross-validation for parameter tuning in classification and regression, and we define a repeated nested cross-validation algorithm for model assessment. As regards variable selection and parameter tuning we define two algorithms (repeated grid-search cross-validation and double cross-validation), and provide arguments for using the repeated grid-search in the general case. We show results of our algorithms on seven QSAR datasets. The variation of the prediction performance, which is the result of choosing different splits of the dataset in V-fold cross-validation, needs to be taken into account when selecting and assessing classification and regression models. We demonstrate the importance of repeating cross-validation when selecting an optimal model, as well as the importance of repeating nested cross-validation when assessing a prediction error.
Brief reasons for living inventory: a psychometric investigation.
Cwik, Jan Christopher; Siegmann, Paula; Willutzki, Ulrike; Nyhuis, Peter; Wolter, Marcus; Forkmann, Thomas; Glaesmer, Heide; Teismann, Tobias
2017-11-06
The present study aimed at validating the German version of the Brief Reasons for Living inventory (BRFL). Validity and reliability were established in a community (n = 339) and a clinical sample (n = 272). Convergent and discriminant validity were investigated, and confirmatory factor analyses were conducted for the complete BRFL as well as for a 10-item version excluding conditional items on child-related concerns. Furthermore, it was assessed how BRFL scores moderate the association between depression and suicide ideation. Results indicated an adequate fit of the data to the original factor structure. The total scale and the subscales of the German version of the BRFL had sufficient internal consistency, as well as good convergent and divergent validity. The BRFL demonstrated clinical utility by differentiating between participants with vs. without suicide ideation. Reasons for living proved to moderate the association between depression and suicide ideation. Results provide preliminary evidence that the BRFL may be a reliable and valid measure of adaptive reasons for living that can be used in clinic and research settings.
de Fouchier, Capucine; Blanchet, Alain; Hopkins, William; Bui, Eric; Ait-Aoudia, Malik; Jehel, Louis
2012-01-01
Background To date no validated instrument in the French language exists to screen for posttraumatic stress disorder (PTSD) in survivors of torture and organized violence. Objective The aim of this study is to adapt and validate the Harvard Trauma Questionnaire (HTQ) to this population. Method The adapted version was administered to 52 French-speaking torture survivors, originally from sub-Saharan African countries, receiving psychological treatment in specialized treatment centers. A structured clinical interview for DSM was also conducted in order to assess if they met criteria for PTSD. Results Cronbach's alpha coefficient for the HTQ Part 4 was adequate (0.95). Criterion validity was evaluated using receiver operating characteristic curve analysis that generated good classification accuracy for PTSD (0.83). At the original cut-off score of 2.5, the HTQ demonstrated high sensitivity and specificity (0.87 and 0.73, respectively). Conclusion Results support the reliability and validity of the French version of the HTQ. PMID:23233870
Inter-Rater Reliability and Validity of the Australian Football League’s Kicking and Handball Tests
Cripps, Ashley J.; Hopper, Luke S.; Joyce, Christopher
2015-01-01
Talent identification tests used at the Australian Football League’s National Draft Combine assess the capacities of athletes to compete at a professional level. Tests created for the National Draft Combine are also commonly used for talent identification and athlete development in development pathways. The skills tests created by the Australian Football League required players to either handball (striking the ball with the hand) or kick to a series of 6 randomly generated targets. Assessors subjectively rate each skill execution giving a 0-5 score for each disposal. This study aimed to investigate the inter-rater reliability and validity of the skills tests at an adolescent sub-elite level. Male Australian footballers were recruited from sub-elite adolescent teams (n = 121, age = 15.7 ± 0.3 years, height = 1.77 ± 0.07 m, mass = 69.17 ± 8.08 kg). The coaches (n = 7) of each team were also recruited. Inter-rater reliability was assessed using Inter-class correlations (ICC) and Limits of Agreement statistics. Both the kicking (ICC = 0.96, p < .01) and handball tests (ICC = 0.89, p < .01) demonstrated strong reliability and acceptable levels of absolute agreement. Content validity was determined by examining the test scores sensitivity to laterality and distance. Concurrent validity was assessed by comparing coaches’ perceptions of skill to actual test outcomes. Multivariate analysis of variance (MANOVA) examined the main effect of laterality, with scores on the dominant hand (p = .04) and foot (p < .01) significantly higher compared to the non-dominant side. Follow-up univariate analysis reported significant differences at every distance in the kicking test. A poor correlation was found between coaches’ perceptions of skill and testing outcomes. The results of this study demonstrate both skill tests demonstrate acceptable inter-rater reliable. Partial content validity was confirmed for the kicking test, however further research is required to confirm validity of the handball test. Key points The skill tests created by the AFL demonstrated acceptable levels of relative and absolute inter-rater reliability. Both the AFL’s skills tests are able to differentiate between athletes dominant and non-dominant limbs. However, only the kicking test could consistently differentiated between score outcomes over a range of Australian Football specific disposal distances. Both tests demonstrated poor concurrent validity, with no correlation found between coaches’ perceptions of technical skills and actual skill outcomes measured. PMID:26336356
Jacob, Robin; Somers, Marie-Andree; Zhu, Pei; Bloom, Howard
2016-06-01
In this article, we examine whether a well-executed comparative interrupted time series (CITS) design can produce valid inferences about the effectiveness of a school-level intervention. This article also explores the trade-off between bias reduction and precision loss across different methods of selecting comparison groups for the CITS design and assesses whether choosing matched comparison schools based only on preintervention test scores is sufficient to produce internally valid impact estimates. We conduct a validation study of the CITS design based on the federal Reading First program as implemented in one state using results from a regression discontinuity design as a causal benchmark. Our results contribute to the growing base of evidence regarding the validity of nonexperimental designs. We demonstrate that the CITS design can, in our example, produce internally valid estimates of program impacts when multiple years of preintervention outcome data (test scores in the present case) are available and when a set of reasonable criteria are used to select comparison organizations (schools in the present case). © The Author(s) 2016.
NASA Technical Reports Server (NTRS)
Ku, Jentung; Ottenstein, Laura; Douglas, Donya; Hoang, Triem
2010-01-01
Under NASA s New Millennium Program Space Technology 8 (ST 8) Project, Goddard Space Fight Center has conducted a Thermal Loop experiment to advance the maturity of the Thermal Loop technology from proof of concept to prototype demonstration in a relevant environment , i.e. from a technology readiness level (TRL) of 3 to a level of 6. The thermal Loop is an advanced thermal control system consisting of a miniature loop heat pipe (MLHP) with multiple evaporators and multiple condensers designed for future small system applications requiring low mass, low power, and compactness. The MLHP retains all features of state-of-the-art loop heat pipes (LHPs) and offers additional advantages to enhance the functionality, performance, versatility, and reliability of the system. An MLHP breadboard was built and tested in the laboratory and thermal vacuum environments for the TRL 4 and TRL 5 validations, respectively, and an MLHP proto-flight unit was built and tested in a thermal vacuum chamber for the TRL 6 validation. In addition, an analytical model was developed to simulate the steady state and transient behaviors of the MLHP during various validation tests. The MLHP demonstrated excellent performance during experimental tests and the analytical model predictions agreed very well with experimental data. All success criteria at various TRLs were met. Hence, the Thermal Loop technology has reached a TRL of 6. This paper presents the validation results, both experimental and analytical, of such a technology development effort.
The Alcohol Relapse Situation Appraisal Questionnaire: Development and Validation
Martin, Rosemarie A.; MacKinnon, Selene M.; Johnson, Jennifer E.; Myers, Mark G.; Cook, Travis A. R.; Rohsenow, Damaris J.
2011-01-01
Background The role of cognitive appraisal of the threat of alcohol relapse has received little attention. A previous instrument, the Relapse Situation Appraisal Questionnaire (RSAQ), was developed to assess cocaine users’ primary appraisal of the threat of situations posing a high risk for cocaine relapse. The purpose of the present study was to modify the RSAQ in order to measure primary appraisal in situations involving a high risk for alcohol relapse. Methods The development and psychometric properties of this instrument, the Alcohol Relapse Situation Appraisal Questionnaire (A-RSAQ), were examined with two samples of abstinent adults with alcohol abuse or dependence. Factor structure and validity were examined in Study 1 (N=104). Confirmation of the factor structure and predictive validity were assessed in Study 2 (N=161). Results Results demonstrated construct, discriminant and predictive validity and reliability of the A-RSAQ. Discussion Results support the important role of primary appraisal of degree of risk in alcohol relapse situations. PMID:21237586
Verification and Validation of Adaptive and Intelligent Systems with Flight Test Results
NASA Technical Reports Server (NTRS)
Burken, John J.; Larson, Richard R.
2009-01-01
F-15 IFCS project goals are: a) Demonstrate Control Approaches that can Efficiently Optimize Aircraft Performance in both Normal and Failure Conditions [A] & [B] failures. b) Advance Neural Network-Based Flight Control Technology for New Aerospace Systems Designs with a Pilot in the Loop. Gen II objectives include; a) Implement and Fly a Direct Adaptive Neural Network Based Flight Controller; b) Demonstrate the Ability of the System to Adapt to Simulated System Failures: 1) Suppress Transients Associated with Failure; 2) Re-Establish Sufficient Control and Handling of Vehicle for Safe Recovery. c) Provide Flight Experience for Development of Verification and Validation Processes for Flight Critical Neural Network Software.
Crary, Michael A.; Carnaby, Giselle D.; Sia, Isaac
2017-01-01
Background The aim of this study was to compare spontaneous swallow frequency analysis (SFA) with clinical screening protocols for identification of dysphagia in acute stroke. Methods In all, 62 patients with acute stroke were evaluated for spontaneous swallow frequency rates using a validated acoustic analysis technique. Independent of SFA, these same patients received a routine nurse-administered clinical dysphagia screening as part of standard stroke care. Both screening tools were compared against a validated clinical assessment of dysphagia for acute stroke. In addition, psychometric properties of SFA were compared against published, validated clinical screening protocols. Results Spontaneous SFA differentiates patients with versus without dysphagia after acute stroke. Using a previously identified cut point based on swallows per minute, spontaneous SFA demonstrated superior ability to identify dysphagia cases compared with a nurse-administered clinical screening tool. In addition, spontaneous SFA demonstrated equal or superior psychometric properties to 4 validated, published clinical dysphagia screening tools. Conclusions Spontaneous SFA has high potential to identify dysphagia in acute stroke with psychometric properties equal or superior to clinical screening protocols. PMID:25088166
Measurement uncertainty analysis techniques applied to PV performance measurements
NASA Astrophysics Data System (ADS)
Wells, C.
1992-10-01
The purpose of this presentation is to provide a brief introduction to measurement uncertainty analysis, outline how it is done, and illustrate uncertainty analysis with examples drawn from the PV field, with particular emphasis toward its use in PV performance measurements. The uncertainty information we know and state concerning a PV performance measurement or a module test result determines, to a significant extent, the value and quality of that result. What is measurement uncertainty analysis? It is an outgrowth of what has commonly been called error analysis. But uncertainty analysis, a more recent development, gives greater insight into measurement processes and tests, experiments, or calibration results. Uncertainty analysis gives us an estimate of the interval about a measured value or an experiment's final result within which we believe the true value of that quantity will lie. Why should we take the time to perform an uncertainty analysis? A rigorous measurement uncertainty analysis: Increases the credibility and value of research results; allows comparisons of results from different labs; helps improve experiment design and identifies where changes are needed to achieve stated objectives (through use of the pre-test analysis); plays a significant role in validating measurements and experimental results, and in demonstrating (through the post-test analysis) that valid data have been acquired; reduces the risk of making erroneous decisions; demonstrates quality assurance and quality control measures have been accomplished; define Valid Data as data having known and documented paths of: Origin, including theory; measurements; traceability to measurement standards; computations; uncertainty analysis of results.
Kubayi, Alliance; Toriola, Abel; Didymus, Faye
2018-06-01
The aim of this series of studies was to develop and initially validate an instrument to assess stressors among South African sports coaches. In study one, a preliminary pool of 45 items was developed based on existing literature and an expert panel was employed to assess the content validity and applicability of these items. In study two, the 32 items that were retained after study one were analysed using principal component analysis (PCA). The resultant factorial structure comprised four components: environmental stressors, performance stressors, task-related stressors, and athlete stressors. These four components were made up of 26 items and, together, the components and items comprised the provisional Stressors in Sports Coaching Questionnaire (SSCQ). The results show that the SSCQ demonstrates acceptable internal consistency (.73-.89). The findings provide preliminary evidence that SSCQ is a valid tool to assess stressors among South African sports coaches.
Validation of an automated mite counter for Dermanyssus gallinae in experimental laying hen cages.
Mul, Monique F; van Riel, Johan W; Meerburg, Bastiaan G; Dicke, Marcel; George, David R; Groot Koerkamp, Peter W G
2015-08-01
For integrated pest management (IPM) programs to be maximally effective, monitoring of the growth and decline of the pest populations is essential. Here, we present the validation results of a new automated monitoring device for the poultry red mite (Dermanyssus gallinae), a serious pest in laying hen facilities world-wide. This monitoring device (called an "automated mite counter") was validated in experimental laying hen cages with live birds and a growing population of D. gallinae. This validation study resulted in 17 data points of 'number of mites counted' by the automated mite counter and the 'number of mites present' in the experimental laying hen cages. The study demonstrated that the automated mite counter was able to track the D. gallinae population effectively. A wider evaluation showed that this automated mite counter can become a useful tool in IPM of D. gallinae in laying hen facilities.
Armistead-Jehle, Patrick; Cole, Wesley R; Stegman, Robert L
2018-02-01
The study was designed to replicate and extend pervious findings demonstrating the high rates of invalid neuropsychological testing in military service members (SMs) with a history of mild traumatic brain injury (mTBI) assessed in the context of a medical evaluation board (MEB). Two hundred thirty-one active duty SMs (61 of which were undergoing an MEB) underwent neuropsychological assessment. Performance validity (Word Memory Test) and symptom validity (MMPI-2-RF) test data were compared across those evaluated within disability (MEB) and clinical contexts. As with previous studies, there were significantly more individuals in an MEB context that failed performance (MEB = 57%, non-MEB = 31%) and symptom validity testing (MEB = 57%, non-MEB = 22%) and performance validity testing had a notable affect on cognitive test scores. Performance and symptom validity test failure rates did not vary as a function of the reason for disability evaluation when divided into behavioral versus physical health conditions. These data are consistent with past studies, and extends those studies by including symptom validity testing and investigating the effect of reason for MEB. This and previous studies demonstrate that more than 50% of SMs seen in the context of an MEB will fail performance validity tests and over-report on symptom validity measures. These results emphasize the importance of using both performance and symptom validity testing when evaluating SMs with a history of mTBI, especially if they are being seen for disability evaluations, in order to ensure the accuracy of cognitive and psychological test data. Published by Oxford University Press 2017. This work is written by (a) US Government employee(s) and is in the public domain in the US.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Radulescu, Georgeta; Gauld, Ian C; Ilas, Germina
2011-01-01
The expanded use of burnup credit in the United States (U.S.) for storage and transport casks, particularly in the acceptance of credit for fission products, has been constrained by the availability of experimental fission product data to support code validation. The U.S. Nuclear Regulatory Commission (NRC) staff has noted that the rationale for restricting the Interim Staff Guidance on burnup credit for storage and transportation casks (ISG-8) to actinide-only is based largely on the lack of clear, definitive experiments that can be used to estimate the bias and uncertainty for computational analyses associated with using burnup credit. To address themore » issues of burnup credit criticality validation, the NRC initiated a project with the Oak Ridge National Laboratory to (1) develop and establish a technically sound validation approach for commercial spent nuclear fuel (SNF) criticality safety evaluations based on best-available data and methods and (2) apply the approach for representative SNF storage and transport configurations/conditions to demonstrate its usage and applicability, as well as to provide reference bias results. The purpose of this paper is to describe the isotopic composition (depletion) validation approach and resulting observations and recommendations. Validation of the criticality calculations is addressed in a companion paper at this conference. For isotopic composition validation, the approach is to determine burnup-dependent bias and uncertainty in the effective neutron multiplication factor (keff) due to bias and uncertainty in isotopic predictions, via comparisons of isotopic composition predictions (calculated) and measured isotopic compositions from destructive radiochemical assay utilizing as much assay data as is available, and a best-estimate Monte Carlo based method. This paper (1) provides a detailed description of the burnup credit isotopic validation approach and its technical bases, (2) describes the application of the approach for representative pressurized water reactor and boiling water reactor safety analysis models to demonstrate its usage and applicability, (3) provides reference bias and uncertainty results based on a quality-assurance-controlled prerelease version of the Scale 6.1 code package and the ENDF/B-VII nuclear cross section data.« less
YAKHFOROSHHA, AFSANEH; SHIRAZI, MANDANA; YOUSEFZADEH, NASER; GHANBARNEJAD, AMIN; CHERAGHI, MOHAMMADALI; MOJTAHEDZADEH, RITA; MAHMOODI-BAKHTIARI, BEHROOZ; EMAMI, SEYED AMIR HOSSEIN
2018-01-01
Introduction: Communication skill (CS) has been regarded as one of the fundamental competencies for medical and other health care professionals. Student's attitude toward learning CS is a key factor in designing educational interventions. The original CSAS, as positive and negative subscales, was developed in the UK; however, there is no scale to measure these attitudes in Iran. The aim of this study was to assess the psychometric characteristic of the Communication Skills Attitude Scale (CSAS), in an Iranian context and to understand if it is a valid tool to assess attitude toward learning communication skills among health care professionals. Methods: Psychometric characteristics of the CSAS were assessed by using a cross-sectional design. In the current study, 410 medical students were selected using stratified sampling framework. The face validity of the scale was estimated through students and experts’ opinion. Content validity of CSAS was assessed qualitatively and quantitatively. Reliability was examined through two methods including Chronbach’s alpha coefficient and Intraclass Correlation of Coefficient (ICC). Construct validity of CSAS was assessed using confirmatory factor analysis (CFA) and explanatory factor analysis (PCA) followed by varimax rotation. Convergent and discriminant validity of the scale was measured through Spearman correlation. Statistical analysis was performed using SPSS 19 and EQS, 6.1. Results: The internal consistency and reproducibility of the total CSAS score were 0.84 (Cronbach’s alpha) and 0.81, which demonstrates an acceptable reliability of the questionnaire. The item-level content validity index (I-CVI) and the scale-level content validity index (S-CVI/Ave) demonstrated appropriate results: 0.97 and 0.94, respectively. An exploratory factor analysis (EFA) on the 25 items of the CSAS revealed 4-factor structure that all together explained %55 of the variance. Results of the confirmatory factor analysis indicated an acceptable goodness-of-fit between the model and the observed data. [χ2/df = 2.36, Comparative Fit Index (CFI) = 0.95, the GFI=0.96, Root Mean Square Error of Approximation (RMSEA) = 0.05]. Conclusion: The Persian version of CSAS is a multidimensional, valid and reliable tool for assessing attitudes towards communication skill among medical students. PMID:29344525
Environmental education curriculum evaluation questionnaire: A reliability and validity study
NASA Astrophysics Data System (ADS)
Minner, Daphne Diane
The intention of this research project was to bridge the gap between social science research and application to the environmental domain through the development of a theoretically derived instrument designed to give educators a template by which to evaluate environmental education curricula. The theoretical base for instrument development was provided by several developmental theories such as Piaget's theory of cognitive development, Developmental Systems Theory, Life-span Perspective, as well as curriculum research within the area of environmental education. This theoretical base fueled the generation of a list of components which were then translated into a questionnaire with specific questions relevant to the environmental education domain. The specific research question for this project is: Can a valid assessment instrument based largely on human development and education theory be developed that reliably discriminates high, moderate, and low quality in environmental education curricula? The types of analyses conducted to answer this question were interrater reliability (percent agreement, Cohen's Kappa coefficient, Pearson's Product-Moment correlation coefficient), test-retest reliability (percent agreement, correlation), and criterion-related validity (correlation). Face validity and content validity were also assessed through thorough reviews. Overall results indicate that 29% of the questions on the questionnaire demonstrated a high level of interrater reliability and 43% of the questions demonstrated a moderate level of interrater reliability. Seventy-one percent of the questions demonstrated a high test-retest reliability and 5% a moderate level. Fifty-five percent of the questions on the questionnaire were reliable (high or moderate) both across time and raters. Only eight questions (8%) did not show either interrater or test-retest reliability. The global overall rating of high, medium, or low quality was reliable across both coders and time, indicating that the questionnaire can discriminate differences in quality of environmental education curricula. Of the 35 curricula evaluated, 6 were high quality, 14 were medium quality and 15 were low quality. The criterion-related validity of the instrument is at current time unable to be established due to the lack of comparable measures or a concretely usable set of multidisciplinary standards. Face and content validity were sufficiently demonstrated.
Li, Wylie Wai Yee; Lam, Wendy Wing Tak; Shun, Shiow-Ching; Lai, Yeur-Hur; Law, Wai-Lun; Poon, Jensen; Fielding, Richard
2013-01-01
Background Accurate assessment of unmet supportive care needs is essential for optimal cancer patient care. This study used confirmatory factor analysis (CFA) to test the known factor structures of the short form of Supportive Care Need Survey (SCNS-34) in Hong Kong and Taiwan Chinese patients diagnosed with colorectal cancer (CRC). Methods 360 Hong Kong and 263 Taiwanese Chinese CRC patients completed the Chinese version of SCNS-SF34. Comparative measures (patient satisfaction, anxiety, depression, and symptom distress) tested convergent validity while known group differences were examined to test discriminant validity. Results The original 5-factor and recent 4-factor models of the SCNS demonstrated poor data fit using CFA in both Hong Kong and Taiwan samples. Subsequently a modified five-factor model with correlated residuals demonstrated acceptable fit in both samples. Correlations demonstrated convergent and divergent validity and known group differences were observed. Conclusions While the five-factor model demonstrated a better fit for data from Chinese colorectal cancer patients, some of the items within its domain overlapped, suggesting item redundancy. The five-factor model showed good psychometric properties in these samples but also suggests conceptualization of unmet supportive care needs are currently inadequate. PMID:24146774
2017-05-01
Protecting And Bonding Reinforcing Steel In Cement -Based Composites, Corrosion 2009, Atlanta, GA, 22-26 March 2009. 7. Hock, V., O. Marshall, S...ER D C/ CE RL T R- 17 -1 3 DoD Corrosion Prevention and Control Program Demonstration and Validation of Stainless Steel Materials for...ERDC/CERL TR-17-13 May 2017 Demonstration and Validation of Stainless Steel Materials for Critical Above-Grade Piping in Highly Corrosive
Odegaard, Justin I; Vincent, John J; Mortimer, Stefanie; Vowles, James V; Ulrich, Bryan C; Banks, Kimberly C; Fairclough, Stephen R; Zill, Oliver A; Sikora, Marcin; Mokhtari, Reza; Abdueva, Diana; Nagy, Rebecca J; Lee, Christine E; Kiedrowski, Lesli A; Paweletz, Cloud P; Eltoukhy, Helmy; Lanman, Richard B; Chudova, Darya I; Talasaz, AmirAli
2018-04-24
Purpose: To analytically and clinically validate a circulating cell-free tumor DNA sequencing test for comprehensive tumor genotyping and demonstrate its clinical feasibility. Experimental Design: Analytic validation was conducted according to established principles and guidelines. Blood-to-blood clinical validation comprised blinded external comparison with clinical droplet digital PCR across 222 consecutive biomarker-positive clinical samples. Blood-to-tissue clinical validation comprised comparison of digital sequencing calls to those documented in the medical record of 543 consecutive lung cancer patients. Clinical experience was reported from 10,593 consecutive clinical samples. Results: Digital sequencing technology enabled variant detection down to 0.02% to 0.04% allelic fraction/2.12 copies with ≤0.3%/2.24-2.76 copies 95% limits of detection while maintaining high specificity [prevalence-adjusted positive predictive values (PPV) >98%]. Clinical validation using orthogonal plasma- and tissue-based clinical genotyping across >750 patients demonstrated high accuracy and specificity [positive percent agreement (PPAs) and negative percent agreement (NPAs) >99% and PPVs 92%-100%]. Clinical use in 10,593 advanced adult solid tumor patients demonstrated high feasibility (>99.6% technical success rate) and clinical sensitivity (85.9%), with high potential actionability (16.7% with FDA-approved on-label treatment options; 72.0% with treatment or trial recommendations), particularly in non-small cell lung cancer, where 34.5% of patient samples comprised a directly targetable standard-of-care biomarker. Conclusions: High concordance with orthogonal clinical plasma- and tissue-based genotyping methods supports the clinical accuracy of digital sequencing across all four types of targetable genomic alterations. Digital sequencing's clinical applicability is further supported by high rates of technical success and biomarker target discovery. Clin Cancer Res; 1-11. ©2018 AACR. ©2018 American Association for Cancer Research.
Jean-Pierre, Pascal; Fiscella, Kevin; Winters, Paul C; Paskett, Electra; Wells, Kristen; Battaglia, Tracy
2012-09-01
Patient satisfaction (PS), a key measure of quality of cancer care, is a core study outcome of the multi-site National Cancer Institute-funded Patient Navigation Research Program. Despite large numbers of underserved monolingual Spanish speakers (MSS) residing in USA, there is no validated Spanish measure of PS that spans the whole spectrum of cancer-related care. The present study reports on the validation of the Patient Satisfaction with Cancer Care (PSCC) measure for Spanish (PSCC-Sp) speakers receiving diagnostic and therapeutic cancer-related care. Original PSCC items were professionally translated and back translated to ensure cultural appropriateness, meaningfulness, and equivalence. Then, the resulting 18-item PSCC-Sp measure was administered to 285 MSS. We evaluated latent structure and internal consistency of the PSCC-Sp using principal components analysis (PCA) and Cronbach coefficient alpha (α). We used correlation analyses to demonstrate divergence and convergence of the PSCC-Sp with a Spanish version of the Patient Satisfaction with Interpersonal Relationship with Navigator (PSN-I-Sp) measure and patients' demographics. The PCA revealed a coherent set of items that explicates 47% of the variance in PS. Reliability assessment demonstrated that the PSCC-Sp had high internal consistency (α = 0.92). The PSCC-Sp demonstrated good face validity and convergent and divergent validities as indicated by moderate correlations with the PSN-I-Sp (p = 0.003) and nonsignificant correlations with marital status and household income (all p(s) > 0.05). The PSCC-Sp is a valid and reliable measure of PS and should be tested in other MSS populations.
Schaefer, Lauren M; Harriger, Jennifer A; Heinberg, Leslie J; Soderberg, Taylor; Kevin Thompson, J
2017-02-01
The Sociocultural Attitudes Toward Appearance Questionnaire-4 (SATAQ-4) is a measure of internalization of appearance ideals (i.e., personal acceptance of societal ideals) and appearance pressures (i.e., pressures to achieve the societal ideal). The current study sought to address limitations of the scale in order to increase precision in the measurement of muscular ideal internalization, include an assessment of one's desire for attractiveness, and broaden the measurement of appearance-related pressures. The factor structure, reliability and construct validity of the SATAQ-4-Revised were examined among college women (N = 1,114) in Study 1, adolescent girls (N = 275) in Study 2, and college men (N = 290) in Study 3. Factor analysis among college women indicated a 7-factor 31-item scale, labeled the SATAQ-4R-Female: (1) Internalization: Thin/Low Body Fat, (2) Internalization: Muscular, (3) Internalization: General Attractiveness, (4) Pressures: Family, (5) Pressures: Media, (6) Pressures: Peers, and (7) Pressures: Significant Others. SATAQ-4R-Female subscales demonstrated good reliability and construct validity among college women. Examination of the SATAQ-4R-Female among adolescent girls suggested a six-factor scale in which peer and significant others items comprised a single subscale. The scale demonstrated good reliability and construct validity in adolescent girls. Examination of the SATAQ-4R among men produced a 28-item scale with seven factors paralleling the factors identified among college women. This scale, labeled the SATAQ-4R-Male, demonstrated good reliability and construct validity. Results support the reliability and validity of SATAQ-4R-Female in college women and adolescent girls, and the SATAQ-4R-Male in college men. © 2016 Wiley Periodicals, Inc.(Int J Eat Disord 2017; 50:104-117). © 2016 Wiley Periodicals, Inc.
Peck, Michelle A; Sturk-Andreaggi, Kimberly; Thomas, Jacqueline T; Oliver, Robert S; Barritt-Ross, Suzanne; Marshall, Charla
2018-05-01
Generating mitochondrial genome (mitogenome) data from reference samples in a rapid and efficient manner is critical to harnessing the greater power of discrimination of the entire mitochondrial DNA (mtDNA) marker. The method of long-range target enrichment, Nextera XT library preparation, and Illumina sequencing on the MiSeq is a well-established technique for generating mitogenome data from high-quality samples. To this end, a validation was conducted for this mitogenome method processing up to 24 samples simultaneously along with analysis in the CLC Genomics Workbench and utilizing the AQME (AFDIL-QIAGEN mtDNA Expert) tool to generate forensic profiles. This validation followed the Federal Bureau of Investigation's Quality Assurance Standards (QAS) for forensic DNA testing laboratories and the Scientific Working Group on DNA Analysis Methods (SWGDAM) validation guidelines. The evaluation of control DNA, non-probative samples, blank controls, mixtures, and nonhuman samples demonstrated the validity of this method. Specifically, the sensitivity was established at ≥25 pg of nuclear DNA input for accurate mitogenome profile generation. Unreproducible low-level variants were observed in samples with low amplicon yields. Further, variant quality was shown to be a useful metric for identifying sequencing error and crosstalk. Success of this method was demonstrated with a variety of reference sample substrates and extract types. These studies further demonstrate the advantages of using NGS techniques by highlighting the quantitative nature of heteroplasmy detection. The results presented herein from more than 175 samples processed in ten sequencing runs, show this mitogenome sequencing method and analysis strategy to be valid for the generation of reference data. Copyright © 2018 Elsevier B.V. All rights reserved.
Asadi-Lari, Mohsen; Ahmadi Pishkuhi, Mahin; Almasi-Hashiani, Amir; Safiri, Saeid; Sepidarkish, Mahdi
2015-07-01
Developing a tool for measuring patient's needs is a vital step in the process of cancer treatment and research. In recent years, the European Organization for Research and Treatment of Cancer (EORTC) made a questionnaire to measure cancer patients' received information. Since validity and reliability of any instrument should be evaluated in the new environment and culture, the aim of this study was to assess the validity and reliability of the EORTC QLQ-INFO25 in Iranian cancer patients. One hundred seventy-three patients with different stages of cancer filled questionnaire EORTC QLQ-INFO25, EORTC QLQ-C30, and EORTC IN-PATSAT32. Twenty-five patients answered the questionnaire twice at an interval of 2 weeks. Reliability and validity of the questionnaire was measured by Cronbach's alpha, interclass correlation, test retest, inter-rater agreement (IRA), and exploratory factorial analyses. Using a conservative approach, the IRA for the overall relevancy and clarity of the tool was 87/86% and 83.33%, respectively. Overall appropriateness and clarity were 94.13 and 91.87%, respectively. Overall integrity of the instrument was determined to be 85%. Cronbach's alpha coefficients for all domains and total inventory were top 70 and 90%, respectively. Interclass correlation index ranges between 0.708 and 0.965. Exploratory factorial analyses demonstrate six fields suitable for instrument. Correlation between areas of the questionnaires EORTC QLQ-INFO25 and EORTC in-Patsat32 represents the convergent validity of the questionnaire. Also, results show a standard divergent validity in all domains of the questionnaire (Rho <0.3). Low correlation between the areas of the questionnaires EORTC QLQ-INFO25 and EORTC QLQ-C30 (<0.3) demonstrates the divergence validity of the questionnaire. The results showed that Persian version of the questionnaire EORTC QLQ-INFO25 is a reliable and valid instrument for measuring the perception of information in cancer patients.
Li, Zhao-Liang
2018-01-01
Few studies have examined hyperspectral remote-sensing image classification with type-II fuzzy sets. This paper addresses image classification based on a hyperspectral remote-sensing technique using an improved interval type-II fuzzy c-means (IT2FCM*) approach. In this study, in contrast to other traditional fuzzy c-means-based approaches, the IT2FCM* algorithm considers the ranking of interval numbers and the spectral uncertainty. The classification results based on a hyperspectral dataset using the FCM, IT2FCM, and the proposed improved IT2FCM* algorithms show that the IT2FCM* method plays the best performance according to the clustering accuracy. In this paper, in order to validate and demonstrate the separability of the IT2FCM*, four type-I fuzzy validity indexes are employed, and a comparative analysis of these fuzzy validity indexes also applied in FCM and IT2FCM methods are made. These four indexes are also applied into different spatial and spectral resolution datasets to analyze the effects of spectral and spatial scaling factors on the separability of FCM, IT2FCM, and IT2FCM* methods. The results of these validity indexes from the hyperspectral datasets show that the improved IT2FCM* algorithm have the best values among these three algorithms in general. The results demonstrate that the IT2FCM* exhibits good performance in hyperspectral remote-sensing image classification because of its ability to handle hyperspectral uncertainty. PMID:29373548
Demonstrating Experimenter "Ineptitude" as a Means of Teaching Internal and External Validity
ERIC Educational Resources Information Center
Treadwell, Kimberli R.H.
2008-01-01
Internal and external validity are key concepts in understanding the scientific method and fostering critical thinking. This article describes a class demonstration of a "botched" experiment to teach validity to undergraduates. Psychology students (N = 75) completed assessments at the beginning of the semester, prior to and immediately following…
Simulation-based training for prostate surgery.
Khan, Raheej; Aydin, Abdullatif; Khan, Muhammad Shamim; Dasgupta, Prokar; Ahmed, Kamran
2015-10-01
To identify and review the currently available simulators for prostate surgery and to explore the evidence supporting their validity for training purposes. A review of the literature between 1999 and 2014 was performed. The search terms included a combination of urology, prostate surgery, robotic prostatectomy, laparoscopic prostatectomy, transurethral resection of the prostate (TURP), simulation, virtual reality, animal model, human cadavers, training, assessment, technical skills, validation and learning curves. Furthermore, relevant abstracts from the American Urological Association, European Association of Urology, British Association of Urological Surgeons and World Congress of Endourology meetings, between 1999 and 2013, were included. Only studies related to prostate surgery simulators were included; studies regarding other urological simulators were excluded. A total of 22 studies that carried out a validation study were identified. Five validated models and/or simulators were identified for TURP, one for photoselective vaporisation of the prostate, two for holmium enucleation of the prostate, three for laparoscopic radical prostatectomy (LRP) and four for robot-assisted surgery. Of the TURP simulators, all five have demonstrated content validity, three face validity and four construct validity. The GreenLight laser simulator has demonstrated face, content and construct validities. The Kansai HoLEP Simulator has demonstrated face and content validity whilst the UroSim HoLEP Simulator has demonstrated face, content and construct validity. All three animal models for LRP have been shown to have construct validity whilst the chicken skin model was also content valid. Only two robotic simulators were identified with relevance to robot-assisted laparoscopic prostatectomy, both of which demonstrated construct validity. A wide range of different simulators are available for prostate surgery, including synthetic bench models, virtual-reality platforms, animal models, human cadavers, distributed simulation and advanced training programmes and modules. The currently validated simulators can be used by healthcare organisations to provide supplementary training sessions for trainee surgeons. Further research should be conducted to validate simulated environments, to determine which simulators have greater efficacy than others and to assess the cost-effectiveness of the simulators and the transferability of skills learnt. With surgeons investigating new possibilities for easily reproducible and valid methods of training, simulation offers great scope for implementation alongside traditional methods of training. © 2014 The Authors BJU International © 2014 BJU International Published by John Wiley & Sons Ltd.
Marketing Plan for Demonstration and Validation Assets
DOE Office of Scientific and Technical Information (OSTI.GOV)
None, None
The National Security Preparedness Project (NSPP), is to be sustained by various programs, including technology demonstration and evaluation (DEMVAL). This project assists companies in developing technologies under the National Security Technology Incubator program (NSTI) through demonstration and validation of technologies applicable to national security created by incubators and other sources. The NSPP also will support the creation of an integrated demonstration and validation environment. This report documents the DEMVAL marketing and visibility plan, which will focus on collecting information about, and expanding the visibility of, DEMVAL assets serving businesses with national security technology applications in southern New Mexico.
Development and Validation of the Food Liking Questionnaire in a French-Canadian Population
Carbonneau, Elise; Bradette-Laplante, Maude; Lamarche, Benoît; Provencher, Véronique; Bégin, Catherine; Robitaille, Julie; Desroches, Sophie; Corneau, Louise; Lemieux, Simone
2017-01-01
The purpose of this study was to develop and validate a questionnaire assessing food liking in a French-Canadian population. A questionnaire was developed, in which participants were asked to rate their degree of liking of 50 food items. An expert panel evaluated the content validity. For the validation study, 150 men and women completed the questionnaire twice. An Exploratory Factor Analysis (EFA) was performed to assess the number of subscales of the questionnaire. Internal consistency and test-retest reliability of the subscales were evaluated. Concurrent validity was assessed through correlations between liking scores and self-reported frequencies of consumption. Comments from the experts led to changes in the list of foods included in the questionnaire. The EFA revealed a two-factor structure for the questionnaire (i.e., savory and sweet foods) and led to the removal of nine items, resulting in a 32-item questionnaire. The two subscales revealed good internal consistency (Cronbach alphas: 0.85 and 0.89) and test-retest reliability (p = 0.84 and 0.86). The questionnaire demonstrated adequate concurrent validity, with moderate correlations between food liking and self-reported frequency of consumption (r = 0.19–0.39, ps < 0.05). This new Food Liking Questionnaire assessing liking of a variety of savory and sweet foods demonstrated good psychometric properties in every validation step. This questionnaire will be useful to explore the role of food liking and its interactions with other factors in predicting eating behaviors and energy intake. PMID:29292754
Development and Validation of the Food Liking Questionnaire in a French-Canadian Population.
Carbonneau, Elise; Bradette-Laplante, Maude; Lamarche, Benoît; Provencher, Véronique; Bégin, Catherine; Robitaille, Julie; Desroches, Sophie; Vohl, Marie-Claude; Corneau, Louise; Lemieux, Simone
2017-12-08
The purpose of this study was to develop and validate a questionnaire assessing food liking in a French-Canadian population. A questionnaire was developed, in which participants were asked to rate their degree of liking of 50 food items. An expert panel evaluated the content validity. For the validation study, 150 men and women completed the questionnaire twice. An Exploratory Factor Analysis (EFA) was performed to assess the number of subscales of the questionnaire. Internal consistency and test-retest reliability of the subscales were evaluated. Concurrent validity was assessed through correlations between liking scores and self-reported frequencies of consumption. Comments from the experts led to changes in the list of foods included in the questionnaire. The EFA revealed a two-factor structure for the questionnaire (i.e., savory and sweet foods) and led to the removal of nine items, resulting in a 32-item questionnaire. The two subscales revealed good internal consistency (Cronbach alphas: 0.85 and 0.89) and test-retest reliability ( p = 0.84 and 0.86). The questionnaire demonstrated adequate concurrent validity, with moderate correlations between food liking and self-reported frequency of consumption ( r = 0.19-0.39, p s < 0.05). This new Food Liking Questionnaire assessing liking of a variety of savory and sweet foods demonstrated good psychometric properties in every validation step. This questionnaire will be useful to explore the role of food liking and its interactions with other factors in predicting eating behaviors and energy intake.
2014-01-01
Background Because body proportions in childhood are different to those in adulthood, children have a relatively higher centre of mass location. This biomechanical difference and the fact that children’s movements have not yet fully matured result in different sway performances in children and adults. When assessing static balance, it is essential to use objective, sensitive tools, and these types of measurement have previously been performed in laboratory settings. However, the emergence of technologies like the Nintendo Wii Board (NWB) might allow balance assessment in field settings. As the NWB has only been validated and tested for reproducibility in adults, the purpose of this study was to examine reproducibility and validity of the NWB in a field setting, in a population of children. Methods Fifty-four 10–14 year-olds from the CHAMPS-Study DK performed four different balance tests: bilateral stance with eyes open (1), unilateral stance on dominant (2) and non-dominant leg (3) with eyes open, and bilateral stance with eyes closed (4). Three rounds of the four tests were completed with the NWB and with a force platform (AMTI). To assess reproducibility, an intra-day test-retest design was applied with a two-hour break between sessions. Results Bland-Altman plots supplemented by Minimum Detectable Change (MDC) and concordance correlation coefficient (CCC) demonstrated satisfactory reproducibility for the NWB and the AMTI (MDC: 26.3-28.2%, CCC: 0.76-0.86) using Centre Of Pressure path Length as measurement parameter. Bland-Altman plots demonstrated satisfactory concurrent validity between the NWB and the AMTI, supplemented by satisfactory CCC in all tests (CCC: 0.74-0.87). The ranges of the limits of agreement in the validity study were comparable to the limits of agreement of the reproducibility study. Conclusion Both NWB and AMTI have satisfactory reproducibility for testing static balance in a population of children. Concurrent validity of NWB compared with AMTI was satisfactory. Furthermore, the results from the concurrent validity study were comparable to the reproducibility results of the NWB and the AMTI. Thus, NWB has the potential to replace the AMTI in field settings in studies including children. Future studies are needed to examine intra-subject variability and to test the predictive validity of NWB. PMID:24913461
Optimal test selection for prediction uncertainty reduction
Mullins, Joshua; Mahadevan, Sankaran; Urbina, Angel
2016-12-02
Economic factors and experimental limitations often lead to sparse and/or imprecise data used for the calibration and validation of computational models. This paper addresses resource allocation for calibration and validation experiments, in order to maximize their effectiveness within given resource constraints. When observation data are used for model calibration, the quality of the inferred parameter descriptions is directly affected by the quality and quantity of the data. This paper characterizes parameter uncertainty within a probabilistic framework, which enables the uncertainty to be systematically reduced with additional data. The validation assessment is also uncertain in the presence of sparse and imprecisemore » data; therefore, this paper proposes an approach for quantifying the resulting validation uncertainty. Since calibration and validation uncertainty affect the prediction of interest, the proposed framework explores the decision of cost versus importance of data in terms of the impact on the prediction uncertainty. Often, calibration and validation tests may be performed for different input scenarios, and this paper shows how the calibration and validation results from different conditions may be integrated into the prediction. Then, a constrained discrete optimization formulation that selects the number of tests of each type (calibration or validation at given input conditions) is proposed. Furthermore, the proposed test selection methodology is demonstrated on a microelectromechanical system (MEMS) example.« less
Validation of a scenario-based assessment of critical thinking using an externally validated tool.
Buur, Jennifer L; Schmidt, Peggy; Smylie, Dean; Irizarry, Kris; Crocker, Carlos; Tyler, John; Barr, Margaret
2012-01-01
With medical education transitioning from knowledge-based curricula to competency-based curricula, critical thinking skills have emerged as a major competency. While there are validated external instruments for assessing critical thinking, many educators have created their own custom assessments of critical thinking. However, the face validity of these assessments has not been challenged. The purpose of this study was to compare results from a custom assessment of critical thinking with the results from a validated external instrument of critical thinking. Students from the College of Veterinary Medicine at Western University of Health Sciences were administered a custom assessment of critical thinking (ACT) examination and the externally validated instrument, California Critical Thinking Skills Test (CCTST), in the spring of 2011. Total scores and sub-scores from each exam were analyzed for significant correlations using Pearson correlation coefficients. Significant correlations between ACT Blooms 2 and deductive reasoning and total ACT score and deductive reasoning were demonstrated with correlation coefficients of 0.24 and 0.22, respectively. No other statistically significant correlations were found. The lack of significant correlation between the two examinations illustrates the need in medical education to externally validate internal custom assessments. Ultimately, the development and validation of custom assessments of non-knowledge-based competencies will produce higher quality medical professionals.
Evaluation of the Validity and Reliability of the Waterlow Pressure Ulcer Risk Assessment Scale
Charalambous, Charalambos; Koulori, Agoritsa; Vasilopoulos, Aristidis; Roupa, Zoe
2018-01-01
Introduction Prevention is the ideal strategy to tackle the problem of pressure ulcers. Pressure ulcer risk assessment scales are one of the most pivotal measures applied to tackle the problem, much criticisms has been developed regarding the validity and reliability of these scales. Objective To investigate the validity and reliability of the Waterlow pressure ulcer risk assessment scale. Method The methodology used is a narrative literature review, the bibliography was reviewed through Cinahl, Pubmed, EBSCO, Medline and Google scholar, 26 scientific articles where identified. The articles where chosen due to their direct correlation with the objective under study and their scientific relevance. Results The construct and face validity of the Waterlow appears adequate, but with regards to content validity changes in the category age and gender can be beneficial. The concurrent validity cannot be assessed. The predictive validity of the Waterlow is characterized by high specificity and low sensitivity. The inter-rater reliability has been demonstrated to be inadequate, this may be due to lack of clear definitions within the categories and differentiating level of knowledge between the users. Conclusion Due to the limitations presented regarding the validity and reliability of the Waterlow pressure ulcer risk assessment scale, the scale should be used in conjunction with clinical assessment to provide optimum results. PMID:29736104
Simulated Driving Assessment (SDA) for Teen Drivers: Results from a Validation Study
McDonald, Catherine C.; Kandadai, Venk; Loeb, Helen; Seacrist, Thomas S.; Lee, Yi-Ching; Winston, Zachary; Winston, Flaura K.
2015-01-01
Background Driver error and inadequate skill are common critical reasons for novice teen driver crashes, yet few validated, standardized assessments of teen driving skills exist. The purpose of this study was to evaluate the construct and criterion validity of a newly developed Simulated Driving Assessment (SDA) for novice teen drivers. Methods The SDA's 35-minute simulated drive incorporates 22 variations of the most common teen driver crash configurations. Driving performance was compared for 21 inexperienced teens (age 16–17 years, provisional license ≤90 days) and 17 experienced adults (age 25–50 years, license ≥5 years, drove ≥100 miles per week, no collisions or moving violations ≤3 years). SDA driving performance (Error Score) was based on driving safety measures derived from simulator and eye-tracking data. Negative driving outcomes included simulated collisions or run-off-the-road incidents. A professional driving evaluator/instructor reviewed videos of SDA performance (DEI Score). Results The SDA demonstrated construct validity: 1.) Teens had a higher Error Score than adults (30 vs. 13, p=0.02); 2.) For each additional error committed, the relative risk of a participant's propensity for a simulated negative driving outcome increased by 8% (95% CI: 1.05–1.10, p<0.01). The SDA demonstrated criterion validity: Error Score was correlated with DEI Score (r=−0.66, p<0.001). Conclusions This study supports the concept of validated simulated driving tests like the SDA to assess novice driver skill in complex and hazardous driving scenarios. The SDA, as a standard protocol to evaluate teen driver performance, has the potential to facilitate screening and assessment of teen driving readiness and could be used to guide targeted skill training. PMID:25740939
ERIC Educational Resources Information Center
El-Bassel, Nabila; Schilling, Robert; Ivanoff, Andre; Chen, Duan-Rung; Hanson, Meredith
1998-01-01
Describes the results of administering the World Health Organization's Alcohol Use Disorder Identification Test (AUDIT) to 400 incarcerated drug-using women. Reports on AUDIT's utility, validity, and reliability. Results demonstrate that AUDIT can be used to identify problem drinkers among incarcerated, drug-using women. (MKA)
Revalidation of the NASA Ames 11-by 11-Foot Transonic Wind Tunnel with a Commercial Airplane Model
NASA Technical Reports Server (NTRS)
Kmak, Frank J.; Hudgins, M.; Hergert, D.; George, Michael W. (Technical Monitor)
2001-01-01
The 11-By 11-Foot Transonic leg of the Unitary Plan Wind Tunnel (UPWT) was modernized to improve tunnel performance, capability, productivity, and reliability. Wind tunnel tests to demonstrate the readiness of the tunnel for a return to production operations included an Integrated Systems Test (IST), calibration tests, and airplane validation tests. One of the two validation tests was a 0.037-scale Boeing 777 model that was previously tested in the 11-By 11-Foot tunnel in 1991. The objective of the validation tests was to compare pre-modernization and post-modernization results from the same airplane model in order to substantiate the operational readiness of the facility. Evaluation of within-test, test-to-test, and tunnel-to-tunnel data repeatability were made to study the effects of the tunnel modifications. Tunnel productivity was also evaluated to determine the readiness of the facility for production operations. The operation of the facility, including model installation, tunnel operations, and the performance of tunnel systems, was observed and facility deficiency findings generated. The data repeatability studies and tunnel-to-tunnel comparisons demonstrated outstanding data repeatability and a high overall level of data quality. Despite some operational and facility problems, the validation test was successful in demonstrating the readiness of the facility to perform production airplane wind tunnel%, tests.
Alwaal, Amjad; Al-Qaoud, Talal M; Haddad, Richard L; Alzahrani, Tarek M; Delisle, Josee; Anidjar, Maurice
2015-01-01
Assessing the predictive validity of the LapSim simulator within a urology residency program. Twelve urology residents at McGill University were enrolled in the study between June 2008 and December 2011. The residents had weekly training on the LapSim that consisted of 3 tasks (cutting, clip-applying, and lifting and grasping). They underwent monthly assessment of their LapSim performance using total time, tissue damage and path length among other parameters as surrogates for their economy of movement and respect for tissue. The last residents' LapSim performance was compared with their first performance of radical nephrectomy on anesthetized porcine models in their 4(th) year of training. Two independent urologic surgeons rated the resident performance on the porcine models, and kappa test with standardized weight function was used to assess for inter-observer bias. Nonparametric spearman correlation test was used to compare each rater's cumulative score with the cumulative score obtained on the porcine models in order to test the predictive validity of the LapSim simulator. The kappa results demonstrated acceptable agreement between the two observers among all domains of the rating scale of performance except for confidence of movement and efficiency. In addition, poor predictive validity of the LapSim simulator was demonstrated. Predictive validity was not demonstrated for the LapSim simulator in the context of a urology residency training program.
2012-01-01
Background Technological advances have enabled the widespread use of video cases via web-streaming and online download as an educational medium. The use of real subjects to demonstrate acute pathology should aid the education of health care professionals. However, the methodology by which this effect may be tested is not clear. Methods We undertook a literature review of major databases, found relevant articles relevant to using patient video cases as educational interventions, extracted the methodologies used and assessed these methods for internal and construct validity. Results A review of 2532 abstracts revealed 23 studies meeting the inclusion criteria and a final review of 18 of relevance. Medical students were the most commonly studied group (10 articles) with a spread of learner satisfaction, knowledge and behaviour tested. Only two of the studies fulfilled defined criteria on achieving internal and construct validity. The heterogeneity of articles meant it was not possible to perform any meta-analysis. Conclusions Previous studies have not well classified which facet of training or educational outcome the study is aiming to explore and had poor internal and construct validity. Future research should aim to validate a particular outcome measure, preferably by reproducing previous work rather than adopting new methods. In particular cognitive processing enhancement, demonstrated in a number of the medical student studies, should be tested at a postgraduate level. PMID:23256787
Autonomous formation flying based on GPS — PRISMA flight results
NASA Astrophysics Data System (ADS)
D'Amico, Simone; Ardaens, Jean-Sebastien; De Florio, Sergio
2013-01-01
This paper presents flight results from the early harvest of the Spaceborne Autonomous Formation Flying Experiment (SAFE) conducted in the frame of the Swedish PRISMA technology demonstration mission. SAFE represents one of the first demonstrations in low Earth orbit of an advanced guidance, navigation and control system for dual-spacecraft formations. Innovative techniques based on differential GPS-based navigation and relative orbital elements control are validated and tuned in orbit to fulfill the typical requirements of future distributed scientific instruments for remote sensing.
Substance versus style: a new look at social desirability in motivating contexts.
Smith, D Brent; Ellingson, Jill E
2002-04-01
Although there is an emerging consensus that social desirability does not meaningfully affect criterion-related validity, several researchers have reaffirmed the argument that social desirability degrades the construct validity of personality measures. Yet, most research demonstrating the adverse consequences of faking for construct validity uses a fake-good instruction set. The consequence of such a manipulation is to exacerbate the effects of response distortion beyond what would be expected under realistic circumstances (e.g., an applicant setting). The research reported in this article was designed to assess these issues by using real-world contexts not influenced by artificial instructions. Results suggest that response distortion has little impact on the construct validity of personality measures used in selection contexts.
NASA Astrophysics Data System (ADS)
Hawes, Frederick T.; Berk, Alexander; Richtsmeier, Steven C.
2016-05-01
A validated, polarimetric 3-dimensional simulation capability, P-MCScene, is being developed by generalizing Spectral Sciences' Monte Carlo-based synthetic scene simulation model, MCScene, to include calculation of all 4 Stokes components. P-MCScene polarimetric optical databases will be generated by a new version (MODTRAN7) of the government-standard MODTRAN radiative transfer algorithm. The conversion of MODTRAN6 to a polarimetric model is being accomplished by (1) introducing polarimetric data, by (2) vectorizing the MODTRAN radiation calculations and by (3) integrating the newly revised and validated vector discrete ordinate model VDISORT3. Early results, presented here, demonstrate a clear pathway to the long-term goal of fully validated polarimetric models.
Laerum, Hallvard; Faxvaag, Arild
2004-02-09
Evaluation is a challenging but necessary part of the development cycle of clinical information systems like the electronic medical records (EMR) system. It is believed that such evaluations should include multiple perspectives, be comparative and employ both qualitative and quantitative methods. Self-administered questionnaires are frequently used as a quantitative evaluation method in medical informatics, but very few validated questionnaires address clinical use of EMR systems. We have developed a task-oriented questionnaire for evaluating EMR systems from the clinician's perspective. The key feature of the questionnaire is a list of 24 general clinical tasks. It is applicable to physicians of most specialties and covers essential parts of their information-oriented work. The task list appears in two separate sections, about EMR use and task performance using the EMR, respectively. By combining these sections, the evaluator may estimate the potential impact of the EMR system on health care delivery. The results may also be compared across time, site or vendor. This paper describes the development, performance and validation of the questionnaire. Its performance is shown in two demonstration studies (n = 219 and 80). Its content is validated in an interview study (n = 10), and its reliability is investigated in a test-retest study (n = 37) and a scaling study (n = 31). In the interviews, the physicians found the general clinical tasks in the questionnaire relevant and comprehensible. The tasks were interpreted concordant to their definitions. However, the physicians found questions about tasks not explicitly or only partially supported by the EMR systems difficult to answer. The two demonstration studies provided unambiguous results and low percentages of missing responses. In addition, criterion validity was demonstrated for a majority of task-oriented questions. Their test-retest reliability was generally high, and the non-standard scale was found symmetric and ordinal. This questionnaire is relevant for clinical work and EMR systems, provides reliable and interpretable results, and may be used as part of any evaluation effort involving the clinician's perspective of an EMR system.
Evaluation results for intelligent transportation systems
DOT National Transportation Integrated Search
2000-11-09
This presentation covers the methods of evaluation set out for EC-funded ITS research and demonstration projects, known as the CONVERGE validation quality process and the lessons learned from that approach. The new approach to appraisal, which is bei...
Implementation of clinical decision rules in the emergency department.
Stiell, Ian G; Bennett, Carol
2007-11-01
Clinical decision rules (CDRs) are tools designed to help clinicians make bedside diagnostic and therapeutic decisions. The development of a CDR involves three stages: derivation, validation, and implementation. Several criteria need to be considered when designing and evaluating the results of an implementation trial. In this article, the authors review the results of implementation studies evaluating the effect of four CDRs: the Ottawa Ankle Rules, the Ottawa Knee Rule, the Canadian C-Spine Rule, and the Canadian CT Head Rule. Four implementation studies demonstrated that the implementation of CDRs in the emergency department (ED) safely reduced the use of radiography for ankle, knee, and cervical spine injuries. However, a recent trial failed to demonstrate an impact on computed tomography imaging rates. Well-developed and validated CDRs can be successfully implemented into practice, efficiently standardizing ED care. However, further research is needed to identify barriers to implementation in order to achieve improved uptake in the ED.
L(sub 1) Adaptive Flight Control System: Flight Evaluation and Technology Transition
NASA Technical Reports Server (NTRS)
Xargay, Enric; Hovakimyan, Naira; Dobrokhodov, Vladimir; Kaminer, Isaac; Gregory, Irene M.; Cao, Chengyu
2010-01-01
Certification of adaptive control technologies for both manned and unmanned aircraft represent a major challenge for current Verification and Validation techniques. A (missing) key step towards flight certification of adaptive flight control systems is the definition and development of analysis tools and methods to support Verification and Validation for nonlinear systems, similar to the procedures currently used for linear systems. In this paper, we describe and demonstrate the advantages of L(sub l) adaptive control architectures for closing some of the gaps in certification of adaptive flight control systems, which may facilitate the transition of adaptive control into military and commercial aerospace applications. As illustrative examples, we present the results of a piloted simulation evaluation on the NASA AirSTAR flight test vehicle, and results of an extensive flight test program conducted by the Naval Postgraduate School to demonstrate the advantages of L(sub l) adaptive control as a verifiable robust adaptive flight control system.
Corno, Giulia; Molinari, Guadalupe; Baños, Rosa Maria
2016-01-01
The aim of this study is to explore the psychometric properties of an affect scale, the Scale of Positive and Negative Experience (SPANE), in an Italian-speaking population. The results of this study demonstrate that the Italian version of the SPANE has psychometric properties similar to those shown by the original and previous versions, and it presents satisfactory reliability and factorial validity. The results of the Confirmatory Factor Analysis support the expected two-factor structure, positive and negative feeling, which characterized the previous versions. As expected, measures of negative affect, anxiety, negative future expectances, and depression correlated positively with the negative experiences SPANE subscale, and negatively with the positive experiences SPANE subscale. Results of this study demonstrate that the Italian version of the SPANE has psychometric properties similar to those shown by the original and previous versions, and it presents satisfactory reliability and factorial validity. The use of this instrument provides clinically useful information about a person’s overall emotional experience and it is an indicator of well-being. Although further studies are required to confirm the psychometric characteristics of the scale, the SPANE Italian version is expected to improve theoretical and empirical research on the well-being of the Italian population.
Design and Validation of an Augmented Reality System for Laparoscopic Surgery in a Real Environment
López-Mir, F.; Naranjo, V.; Fuertes, J. J.; Alcañiz, M.; Bueno, J.; Pareja, E.
2013-01-01
Purpose. This work presents the protocol carried out in the development and validation of an augmented reality system which was installed in an operating theatre to help surgeons with trocar placement during laparoscopic surgery. The purpose of this validation is to demonstrate the improvements that this system can provide to the field of medicine, particularly surgery. Method. Two experiments that were noninvasive for both the patient and the surgeon were designed. In one of these experiments the augmented reality system was used, the other one was the control experiment, and the system was not used. The type of operation selected for all cases was a cholecystectomy due to the low degree of complexity and complications before, during, and after the surgery. The technique used in the placement of trocars was the French technique, but the results can be extrapolated to any other technique and operation. Results and Conclusion. Four clinicians and ninety-six measurements obtained of twenty-four patients (randomly assigned in each experiment) were involved in these experiments. The final results show an improvement in accuracy and variability of 33% and 63%, respectively, in comparison to traditional methods, demonstrating that the use of an augmented reality system offers advantages for trocar placement in laparoscopic surgery. PMID:24236293
Adkins, Jennifer W; Weathers, Frank W; McDevitt-Murphy, Meghan; Daniels, Jennifer B
2008-12-01
In this study psychometric properties of seven self-report measures of posttraumatic stress disorder (PTSD) were compared. The seven scales evaluated were the Davidson Trauma Scale (DTS), the PTSD Checklist (PCL), the Posttraumatic Stress Diagnostic Scale (PDS), the Civilian Mississippi Scale (CMS), the Impact of Event Scale-Revised (IES-R), the Penn Inventory for Posttraumatic Stress Disorder (Penn), and the PK scale of the MMPI-2 (PK). Participants were 239 (79 male and 160 female) trauma-exposed undergraduates. All seven measures exhibited good test-retest reliability and internal consistency. The PDS, PCL and DTS demonstrated the best convergent validity; the IES-R, PDS, and PCL demonstrated the best discriminant validity; and the PDS, PCL, and IES-R demonstrated the best diagnostic utility. Overall, results most strongly support the use of the PDS and the PCL for the assessment of PTSD in this population.
The Validity of Conscientiousness Is Overestimated in the Prediction of Job Performance.
Kepes, Sven; McDaniel, Michael A
2015-01-01
Sensitivity analyses refer to investigations of the degree to which the results of a meta-analysis remain stable when conditions of the data or the analysis change. To the extent that results remain stable, one can refer to them as robust. Sensitivity analyses are rarely conducted in the organizational science literature. Despite conscientiousness being a valued predictor in employment selection, sensitivity analyses have not been conducted with respect to meta-analytic estimates of the correlation (i.e., validity) between conscientiousness and job performance. To address this deficiency, we reanalyzed the largest collection of conscientiousness validity data in the personnel selection literature and conducted a variety of sensitivity analyses. Publication bias analyses demonstrated that the validity of conscientiousness is moderately overestimated (by around 30%; a correlation difference of about .06). The misestimation of the validity appears to be due primarily to suppression of small effects sizes in the journal literature. These inflated validity estimates result in an overestimate of the dollar utility of personnel selection by millions of dollars and should be of considerable concern for organizations. The fields of management and applied psychology seldom conduct sensitivity analyses. Through the use of sensitivity analyses, this paper documents that the existing literature overestimates the validity of conscientiousness in the prediction of job performance. Our data show that effect sizes from journal articles are largely responsible for this overestimation.
Barrett, Frederick S; Johnson, Matthew W; Griffiths, Roland R
2015-11-01
The 30-item revised Mystical Experience Questionnaire (MEQ30) was previously developed within an online survey of mystical-type experiences occasioned by psilocybin-containing mushrooms. The rated experiences occurred on average eight years before completion of the questionnaire. The current paper validates the MEQ30 using data from experimental studies with controlled doses of psilocybin. Data were pooled and analyzed from five laboratory experiments in which participants (n=184) received a moderate to high oral dose of psilocybin (at least 20 mg/70 kg). Results of confirmatory factor analysis demonstrate the reliability and internal validity of the MEQ30. Structural equation models demonstrate the external and convergent validity of the MEQ30 by showing that latent variable scores on the MEQ30 positively predict persisting change in attitudes, behavior, and well-being attributed to experiences with psilocybin while controlling for the contribution of the participant-rated intensity of drug effects. These findings support the use of the MEQ30 as an efficient measure of individual mystical experiences. A method to score a "complete mystical experience" that was used in previous versions of the mystical experience questionnaire is validated in the MEQ30, and a stand-alone version of the MEQ30 is provided for use in future research. © The Author(s) 2015.
Validation of the revised Mystical Experience Questionnaire in experimental sessions with psilocybin
Barrett, Frederick S; Johnson, Matthew W; Griffiths, Roland R
2016-01-01
The 30-item revised Mystical Experience Questionnaire (MEQ30) was previously developed within an online survey of mystical-type experiences occasioned by psilocybin-containing mushrooms. The rated experiences occurred on average eight years before completion of the questionnaire. The current paper validates the MEQ30 using data from experimental studies with controlled doses of psilocybin. Data were pooled and analyzed from five laboratory experiments in which participants (n=184) received a moderate to high oral dose of psilocybin (at least 20 mg/70 kg). Results of confirmatory factor analysis demonstrate the reliability and internal validity of the MEQ30. Structural equation models demonstrate the external and convergent validity of the MEQ30 by showing that latent variable scores on the MEQ30 positively predict persisting change in attitudes, behavior, and well-being attributed to experiences with psilocybin while controlling for the contribution of the participant-rated intensity of drug effects. These findings support the use of the MEQ30 as an efficient measure of individual mystical experiences. A method to score a “complete mystical experience” that was used in previous versions of the mystical experience questionnaire is validated in the MEQ30, and a stand-alone version of the MEQ30 is provided for use in future research. PMID:26442957
Development and validation of a fatigue assessment scale for U.S. construction workers.
Zhang, Mingzong; Sparer, Emily H; Murphy, Lauren A; Dennerlein, Jack T; Fang, Dongping; Katz, Jeffrey N; Caban-Martinez, Alberto J
2015-02-01
To develop a fatigue assessment scale and test its reliability and validity for commercial construction workers. Using a two-phased approach, we first identified items (first phase) for the development of a Fatigue Assessment Scale for Construction Workers (FASCW) through review of existing scales in the scientific literature, key informant interviews (n = 11) and focus groups (three groups with six workers each) with construction workers. The second phase included assessment for the reliability, validity, and sensitivity of the new scale using a repeated-measures study design with a convenience sample of construction workers (n = 144). Phase one resulted in a 16-item preliminary scale that after factor analysis yielded a final 10-item scale with two sub-scales ("Lethargy" and "Bodily Ailment"). During phase two, the FASCW and its subscales demonstrated satisfactory internal consistency (alpha coefficients were FASCW [0.91], Lethargy [0.86] and Bodily Ailment [0.84]) and acceptable test-retest reliability (Pearson Correlations Coefficients: 0.59-0.68; Intraclass Correlation Coefficients: 0.74-0.80). Correlation analysis substantiated concurrent and convergent validity. A discriminant analysis demonstrated that the FASCW differentiated between groups with arthritis status and different work hours. The 10-item FASCW with good reliability and validity is an effective tool for assessing the severity of fatigue among construction workers. © 2015 Wiley Periodicals, Inc.
Mahony, Mary C; Patterson, Patricia; Hayward, Brooke; North, Robert; Green, Dawne
2015-05-01
To demonstrate, using human factors engineering (HFE), that a redesigned, pre-filled, ready-to-use, pre-asembled follitropin alfa pen can be used to administer prescribed follitropin alfa doses safely and accurately. A failure modes and effects analysis identified hazards and harms potentially caused by use errors; risk-control measures were implemented to ensure acceptable device use risk management. Participants were women with infertility, their significant others, and fertility nurse (FN) professionals. Preliminary testing included 'Instructions for Use' (IFU) and pre-validation studies. Validation studies used simulated injections in a representative use environment; participants received prior training on pen use. User performance in preliminary testing led to IFU revisions and a change to outer needle cap design to mitigate needle stick potential. In the first validation study (49 users, 343 simulated injections), in the FN group, one observed critical use error resulted in a device design modification and another in an IFU change. A second validation study tested the mitigation strategies; previously reported use errors were not repeated. Through an iterative process involving a series of studies, modifications were made to the pen design and IFU. Simulated-use testing demonstrated that the redesigned pen can be used to administer follitropin alfa effectively and safely.
PedsQL™ Multidimensional Fatigue Scale in Sickle Cell Disease: Feasibility, Reliability and Validity
Panepinto, Julie A.; Torres, Sylvia; Bendo, Cristiane B.; McCavit, Timothy L.; Dinu, Bogdan; Sherman-Bien, Sandra; Bemrich-Stolz, Christy; Varni, James W.
2013-01-01
Background Sickle cell disease (SCD) is an inherited blood disorder characterized by a chronic hemolytic anemia that can contribute to fatigue and global cognitive impairment in patients. The study objective was to report on the feasibility, reliability, and validity of the PedsQL™ Multidimensional Fatigue Scale in SCD for pediatric patient self-report ages 5–18 years and parent proxy-report for ages 2–18 years. Procedure This was a cross-sectional multi-site study whereby 240 pediatric patients with SCD and 303 parents completed the 18-item PedsQL™ Multidimensional Fatigue Scale. Participants also completed the PedsQL™ 4.0 Generic Core Scales. Results The PedsQL™ Multidimensional Fatigue Scale evidenced excellent feasibility, excellent reliability for the Total Scale Scores (patient self-report α = 0.90; parent proxy-report α = 0.95), and acceptable reliability for the three individual scales (patient self-report α = 0.77–0.84; parent proxy-report α = 0.90–0.97). Intercorrelations of the PedsQL™ Multidimensional Fatigue Scale with the PedsQL™ Generic Core Scales were predominantly in the large (≥ 0.50) range, supporting construct validity. PedsQL™ Multidimensional Fatigue Scale Scores were significantly worse with large effects sizes (≥0.80) for patients with SCD than for a comparison sample of healthy children, supporting known-groups discriminant validity. Confirmatory factor analysis demonstrated an acceptable to excellent model fit in SCD. Conclusions The PedsQL™ Multidimensional Fatigue Scale demonstrated acceptable to excellent measurement properties in SCD. The results demonstrate the relative severity of fatigue symptoms in pediatric patients with SCD, indicating the potential clinical utility of multidimensional assessment of fatigue in patients with SCD in clinical research and practice. PMID:24038960
1997-09-01
Illinois Institute of Technology Research Institute (IITRI) calibrated seven parametric models including SPQR /20, the forerunner of CHECKPOINT. The...a semicolon); thus, SPQR /20 was calibrated using SLOC sizing data (IITRI, 1989: 3-4). The results showed only slight overall improvements in accuracy...even when validating the calibrated models with the same data sets. The IITRI study demonstrated SPQR /20 to be one of two models that were most
NASA Technical Reports Server (NTRS)
Fahrenthold, Eric P.; Shivarama, Ravishankar
2004-01-01
The hybrid particle-finite element method of Fahrenthold and Horban, developed for the simulation of hypervelocity impact problems, has been extended to include new formulations of the particle-element kinematics, additional constitutive models, and an improved numerical implementation. The extended formulation has been validated in three dimensional simulations of published impact experiments. The test cases demonstrate good agreement with experiment, good parallel speedup, and numerical convergence of the simulation results.
2011-07-01
10%. These results demonstrate that the IOP-based BRDF correction scheme (which is composed of the R„ model along with the IOP retrieval...distribution was averaged over 10 min 5. Validation of the lOP-Based BRDF Correction Scheme The IOP-based BRDF correction scheme is applied to both...oceanic and coastal waters were very consistent qualitatively and quantitatively and thus validate the IOP- based BRDF correction system, at least
Dynamic Forces in Spur Gears - Measurement, Prediction, and Code Validation
NASA Technical Reports Server (NTRS)
Oswald, Fred B.; Townsend, Dennis P.; Rebbechi, Brian; Lin, Hsiang Hsi
1996-01-01
Measured and computed values for dynamic loads in spur gears were compared to validate a new version of the NASA gear dynamics code DANST-PC. Strain gage data from six gear sets with different tooth profiles were processed to determine the dynamic forces acting between the gear teeth. Results demonstrate that the analysis code successfully simulates the dynamic behavior of the gears. Differences between analysis and experiment were less than 10 percent under most conditions.
Airborne Validation of Spatial Properties Measured by the CALIPSO Lidar
NASA Technical Reports Server (NTRS)
McGill, Matthew J.; Vaughan, Mark A.; Trepte, Charles Reginald; Hart, William D.; Hlavka, Dennis L.; Winker, David M.; Keuhn, Ralph
2007-01-01
The primary payload onboard the Cloud-Aerosol Lidar Infrared Pathfinder Satellite Observations (CALIPSO) satellite is a dual-wavelength backscatter lidar designed to provide vertical profiling of clouds and aerosols. Launched in April 2006, the first data from this new satellite was obtained in June 2006. As with any new satellite measurement capability, an immediate post-launch requirement is to verify that the data being acquired is correct lest scientific conclusions begin to be drawn based on flawed data. A standard approach to verifying satellite data is to take a similar, or validation, instrument and fly it onboard a research aircraft. Using an aircraft allows the validation instrument to get directly under the satellite so that both the satellite instrument and the aircraft instrument are sensing the same region of the atmosphere. Although there are almost always some differences in the sampling capabilities of the two instruments, it is nevertheless possible to directly compare the measurements. To validate the measurements from the CALIPSO lidar, a similar instrument, the Cloud Physics Lidar, was flown onboard the NASA high-altitude ER-2 aircraft during July- August 2006. This paper presents results to demonstrate that the CALIPSO lidar is properly calibrated and the CALIPSO Level 1 data products are correct. The importance of the results is to demonstrate to the research community that CALIPSO Level 1 data can be confidently used for scientific research.
Ràfols, Clara; Bosch, Elisabeth; Barbas, Rafael; Prohens, Rafel
2016-07-01
A study about the suitability of the chelation reaction of Ca(2+)with ethylenediaminetetraacetic acid (EDTA) as a validation standard for Isothermal Titration Calorimeter measurements has been performed exploring the common experimental variables (buffer, pH, ionic strength and temperature). Results obtained in a variety of experimental conditions have been amended according to the side reactions involved in the main process and to the experimental ionic strength and, finally, validated by contrast with the potentiometric reference values. It is demonstrated that the chelation reaction performed in acetate buffer 0.1M and 25°C shows accurate and precise results and it is robust enough to be adopted as a standard calibration process. Copyright © 2016 Elsevier B.V. All rights reserved.
Towards Automatic Validation and Healing of Citygml Models for Geometric and Semantic Consistency
NASA Astrophysics Data System (ADS)
Alam, N.; Wagner, D.; Wewetzer, M.; von Falkenhausen, J.; Coors, V.; Pries, M.
2013-09-01
A steadily growing number of application fields for large 3D city models have emerged in recent years. Like in many other domains, data quality is recognized as a key factor for successful business. Quality management is mandatory in the production chain nowadays. Automated domain-specific tools are widely used for validation of business-critical data but still common standards defining correct geometric modeling are not precise enough to define a sound base for data validation of 3D city models. Although the workflow for 3D city models is well-established from data acquisition to processing, analysis and visualization, quality management is not yet a standard during this workflow. Processing data sets with unclear specification leads to erroneous results and application defects. We show that this problem persists even if data are standard compliant. Validation results of real-world city models are presented to demonstrate the potential of the approach. A tool to repair the errors detected during the validation process is under development; first results are presented and discussed. The goal is to heal defects of the models automatically and export a corrected CityGML model.
Evolving Improvements to TRMM Ground Validation Rainfall Estimates
NASA Technical Reports Server (NTRS)
Robinson, M.; Kulie, M. S.; Marks, D. A.; Wolff, D. B.; Ferrier, B. S.; Amitai, E.; Silberstein, D. S.; Fisher, B. L.; Wang, J.; Einaudi, Franco (Technical Monitor)
2000-01-01
The primary function of the TRMM Ground Validation (GV) Program is to create GV rainfall products that provide basic validation of satellite-derived precipitation measurements for select primary sites. Since the successful 1997 launch of the TRMM satellite, GV rainfall estimates have demonstrated systematic improvements directly related to improved radar and rain gauge data, modified science techniques, and software revisions. Improved rainfall estimates have resulted in higher quality GV rainfall products and subsequently, much improved evaluation products for the satellite-based precipitation estimates from TRMM. This presentation will demonstrate how TRMM GV rainfall products created in a semi-automated, operational environment have evolved and improved through successive generations. Monthly rainfall maps and rainfall accumulation statistics for each primary site will be presented for each stage of GV product development. Contributions from individual product modifications involving radar reflectivity (Ze)-rain rate (R) relationship refinements, improvements in rain gauge bulk-adjustment and data quality control processes, and improved radar and gauge data will be discussed. Finally, it will be demonstrated that as GV rainfall products have improved, rainfall estimation comparisons between GV and satellite have converged, lending confidence to the satellite-derived precipitation measurements from TRMM.
Test and Demonstration Assets of New Mexico
DOE Office of Scientific and Technical Information (OSTI.GOV)
None
This document was developed by the Arrowhead Center of New Mexico State University as part of the National Security Preparedness Project (NSPP), funded by a DOE/NNSA grant. The NSPP has three primary components: business incubation, workforce development, and technology demonstration and validation. The document contains a survey of test and demonstration assets in New Mexico available for external users such as small businesses with security technologies under development. Demonstration and validation of national security technologies created by incubator sources, as well as other sources, are critical phases of technology development. The NSPP will support the utilization of an integrated demonstrationmore » and validation environment.« less
Real-Time Onboard Global Nonlinear Aerodynamic Modeling from Flight Data
NASA Technical Reports Server (NTRS)
Brandon, Jay M.; Morelli, Eugene A.
2014-01-01
Flight test and modeling techniques were developed to accurately identify global nonlinear aerodynamic models onboard an aircraft. The techniques were developed and demonstrated during piloted flight testing of an Aermacchi MB-326M Impala jet aircraft. Advanced piloting techniques and nonlinear modeling techniques based on fuzzy logic and multivariate orthogonal function methods were implemented with efficient onboard calculations and flight operations to achieve real-time maneuver monitoring and analysis, and near-real-time global nonlinear aerodynamic modeling and prediction validation testing in flight. Results demonstrated that global nonlinear aerodynamic models for a large portion of the flight envelope were identified rapidly and accurately using piloted flight test maneuvers during a single flight, with the final identified and validated models available before the aircraft landed.
An Automated, Adaptive Framework for Optimizing Preprocessing Pipelines in Task-Based Functional MRI
Churchill, Nathan W.; Spring, Robyn; Afshin-Pour, Babak; Dong, Fan; Strother, Stephen C.
2015-01-01
BOLD fMRI is sensitive to blood-oxygenation changes correlated with brain function; however, it is limited by relatively weak signal and significant noise confounds. Many preprocessing algorithms have been developed to control noise and improve signal detection in fMRI. Although the chosen set of preprocessing and analysis steps (the “pipeline”) significantly affects signal detection, pipelines are rarely quantitatively validated in the neuroimaging literature, due to complex preprocessing interactions. This paper outlines and validates an adaptive resampling framework for evaluating and optimizing preprocessing choices by optimizing data-driven metrics of task prediction and spatial reproducibility. Compared to standard “fixed” preprocessing pipelines, this optimization approach significantly improves independent validation measures of within-subject test-retest, and between-subject activation overlap, and behavioural prediction accuracy. We demonstrate that preprocessing choices function as implicit model regularizers, and that improvements due to pipeline optimization generalize across a range of simple to complex experimental tasks and analysis models. Results are shown for brief scanning sessions (<3 minutes each), demonstrating that with pipeline optimization, it is possible to obtain reliable results and brain-behaviour correlations in relatively small datasets. PMID:26161667
Validation of Robotic Surgery Simulator (RoSS).
Kesavadas, Thenkurussi; Stegemann, Andrew; Sathyaseelan, Gughan; Chowriappa, Ashirwad; Srimathveeravalli, Govindarajan; Seixas-Mikelus, Stéfanie; Chandrasekhar, Rameella; Wilding, Gregory; Guru, Khurshid
2011-01-01
Recent growth of daVinci Robotic Surgical System as a minimally invasive surgery tool has led to a call for better training of future surgeons. In this paper, a new virtual reality simulator, called RoSS is presented. Initial results from two studies - face and content validity, are very encouraging. 90% of the cohort of expert robotic surgeons felt that the simulator was excellent or somewhat close to the touch and feel of the daVinci console. Content validity of the simulator received 90% approval in some cases. These studies demonstrate that RoSS has the potential of becoming an important training tool for the daVinci surgical robot.
Accelerating cross-validation with total variation and its application to super-resolution imaging
NASA Astrophysics Data System (ADS)
Obuchi, Tomoyuki; Ikeda, Shiro; Akiyama, Kazunori; Kabashima, Yoshiyuki
2017-12-01
We develop an approximation formula for the cross-validation error (CVE) of a sparse linear regression penalized by ℓ_1-norm and total variation terms, which is based on a perturbative expansion utilizing the largeness of both the data dimensionality and the model. The developed formula allows us to reduce the necessary computational cost of the CVE evaluation significantly. The practicality of the formula is tested through application to simulated black-hole image reconstruction on the event-horizon scale with super resolution. The results demonstrate that our approximation reproduces the CVE values obtained via literally conducted cross-validation with reasonably good precision.
Meunier, Jean-Christophe; Roskam, Isabelle
2009-01-01
This study presents a validation of a scale that assesses parents' childrearing behavior toward young children. The scale was validated on 565 parents of 2- to 7-year-old children. The current results replicated the factor solution of the original scale designed for parents of school-aged children. The scale demonstrated good psychometric properties: moderate to high internal consistency, the expected relations with criterion variables (parental self-efficacy beliefs, child's behavior and personality), and discriminative properties according to the parents' gender and educational level, the child's age and gender, and the difference between referred and nonreferred children.
CosmoQuest:Using Data Validation for More Than Just Data Validation
NASA Astrophysics Data System (ADS)
Lehan, C.; Gay, P.
2016-12-01
It is often taken for granted that different scientists completing the same task (e.g. mapping geologic features) will get the same results, and data validation is often skipped or under-utilized due to time and funding constraints. Robbins et. al (2014), however, demonstrated that this is a needed step, as large variation can exist even among collaborating team members completing straight-forward tasks like marking craters. Data Validation should be much more than a simple post-project verification of results. The CosmoQuest virtual research facility employs regular data-validation for a variety of benefits, including real-time user feedback, real-time tracking to observe user activity while it's happening, and using pre-solved data to analyze users' progress and to help them retain skills. Some creativity in this area can drastically improve project results. We discuss methods of validating data in citizen science projects and outline the variety of uses for validation, which, when used properly, improves the scientific output of the project and the user experience for the citizens doing the work. More than just a tool for scientists, validation can assist users in both learning and retaining important information and skills, improving the quality and quantity of data gathered. Real-time analysis of user data can give key information in the effectiveness of the project that a broad glance would miss, and properly presenting that analysis is vital. Training users to validate their own data, or the data of others, can significantly improve the accuracy of misinformed or novice users.
Evaluation results for intelligent transport systems (ITS) : abstract
DOT National Transportation Integrated Search
2000-11-09
This paper summarizes the methods of evaluation set out for EC-funded ITS research and demonstration projects, known as the CONVERGE validation quality process and the lessons learned from that approach. The new approach to appraisal, which is being ...
Determining Prevalence of Acute Bilirubin Encephalopathy in Developing Countries
2015-11-11
Demonstrate BIND II Score of >=5, is Valid for Detecting Moderate to Severe ABE in Neonates <14 Days Old.; Demonstrate Community-BIND Instrument, a Modified BIND II, is a Valid and Reliable Tool for Detecting ABE.; Demonstrate That Community-BIND Can be Used for Acquiring Population-based Prevalence of ABE in the Community.
Hebert, Jeffrey J; Koppenhaver, Shane L; Teyhen, Deydre S; Walker, Bruce F; Fritz, Julie M
2015-06-01
The lumbar multifidus muscle provides an important contribution to lumbar spine stability, and the restoration of lumbar multifidus function is a frequent goal of rehabilitation. Currently, there are no reliable and valid physical examination procedures available to assess lumbar multifidus function among patients with low back pain. To examine the inter-rater reliability and concurrent validity of the multifidus lift test (MLT) to identify lumbar multifidus dysfunction among patients with low back pain. A cross-sectional analysis of reliability and concurrent validity performed in a university outpatient research facility. Thirty-two persons aged 18 to 60 years with current low back pain and a minimum modified Oswestry disability score of 20%. Study participants were excluded if they reported a history of lumbar spine surgery, lumbar radiculopathy, medical red flags, osteoporosis, or had recently been treated with spinal manipulation or trunk stabilization exercises. Concurrent measures of lumbar multifidus muscle function at the L4-L5 and L5-S1 levels were obtained with the MLT (index test) and real-time ultrasound imaging (reference standard). The inter-rater reliability of the MLT was examined by measuring the level of agreement between two blinded examiners. Concurrent validity of the MLT was investigated by comparing clinicians' judgments with real-time ultrasound imaging measures of lumbar multifidus function. Inter-rater reliability of the MLT was substantial to excellent (κ=0.75 to 0.81, p≤.01) and free from errors of bias and prevalence. When performed at L4-L5 or L5-S1, the MLT demonstrated evidence of concurrent validity through its relationship with the reference standard results at L4-L5 (rbis=0.59-0.73, p≤.01). The MLT generally failed to demonstrate a relationship with the reference standard results from the L5-S1 level. Our results provide preliminary evidence supporting the reliability and validity of the MLT to assess lumbar multifidus function at the L4-L5 spinal level. Additional research examining the measurement properties and utility of this test should be undertaken before confident implementation with patients. Copyright © 2015 Elsevier Inc. All rights reserved.
Validation of the Implementation Leadership Scale (ILS) with Supervisors' Self-Ratings.
Torres, Elisa M; Ehrhart, Mark G; Beidas, Rinad S; Farahnak, Lauren R; Finn, Natalie K; Aarons, Gregory A
2018-01-01
Although often discussed, there is a lack of empirical research on the role of leadership in the management and delivery of health services. The implementation leadership scale (ILS) assesses the degree to which leaders are knowledgeable, proactive, perseverant, and supportive during evidence-based practice (EBP) implementation. The purpose of this study was to examine the psychometric properties of the ILS for leaders' self-ratings using a sample of mental health clinic supervisors (N = 119). Supervisors (i.e., leaders) completed surveys including self-ratings of their implementation leadership. Confirmatory factor analysis, reliability, and validity of the ILS were evaluated. The ILS factor structure was supported in the sample of supervisors. Results demonstrated internal consistency reliability and validity. Cronbach alpha's ranged from 0.92 to 0.96 for the ILS subscales and 0.95 for the ILS overall scale. The factor structure replication and reliability of the ILS in a sample of supervisors demonstrates its applicability with employees across organizational levels.
Preliminary Face and Construct Validation Study of a Virtual Basic Laparoscopic Skill Trainer
Sankaranarayanan, Ganesh; Lin, Henry; Arikatla, Venkata S.; Mulcare, Maureen; Zhang, Likun; Derevianko, Alexandre; Lim, Robert; Fobert, David; Cao, Caroline; Schwaitzberg, Steven D.; Jones, Daniel B.
2010-01-01
Abstract Background The Virtual Basic Laparoscopic Skill Trainer (VBLaST™) is a developing virtual-reality–based surgical skill training system that incorporates several of the tasks of the Fundamentals of Laparoscopic Surgery (FLS) training system. This study aimed to evaluate the face and construct validity of the VBLaST™ system. Materials and Methods Thirty-nine subjects were voluntarily recruited at the Beth Israel Deaconess Medical Center (Boston, MA) and classified into two groups: experts (PGY 5, fellow and practicing surgeons) and novice (PGY 1–4). They were then asked to perform three FLS tasks, consisting of peg transfer, pattern cutting, and endoloop, on both the VBLaST and FLS systems. The VBLaST performance scores were automatically computed, while the FLS scores were rated by a trained evaluator. Face validity was assessed using a 5-point Likert scale, varying from not realistic/useful (1) to very realistic/useful (5). Results Face-validity scores showed that the VBLaST system was significantly realistic in portraying the three FLS tasks (3.95 ± 0.909), as well as the reality in trocar placement and tool movements (3.67 ± 0.874). Construct-validity results show that VBLaST was able to differentiate between the expert and novice group (P = 0.015). However, of the two tasks used for evaluating VBLaST, only the peg-transfer task showed a significant difference between the expert and novice groups (P = 0.003). Spearman correlation coefficient analysis between the two scores showed significant correlation for the peg-transfer task (Spearman coefficient 0.364; P = 0.023). Conclusions VBLaST demonstrated significant face and construct validity. A further set of studies, involving improvement to the current VBLaST system, is needed to thoroughly demonstrate face and construct validity for all the tasks. PMID:20201683
Integration and Validation of Hysteroscopy Simulation in the Surgical Training Curriculum.
Elessawy, Mohamed; Skrzipczyk, Moritz; Eckmann-Scholz, Christel; Maass, Nicolai; Mettler, Liselotte; Guenther, Veronika; van Mackelenbergh, Marion; Bauerschlag, Dirk O; Alkatout, Ibrahim
The primary objective of our study was to test the construct validity of the HystSim hysteroscopic simulator to determine whether simulation training can improve the acquisition of hysteroscopic skills regardless of the previous levels of experience of the participants. The secondary objective was to analyze the performance of a selected task, using specially designed scoring charts to help reduce the learning curve for both novices and experienced surgeons. The teaching of hysteroscopic intervention has received only scant attention, focusing mainly on the development of physical models and box simulators. This encouraged our working group to search for a suitable hysteroscopic simulator module and to test its validation. We decided to use the HystSim hysteroscopic simulator, which is one of the few such simulators that has already completed a validation process, with high ratings for both realism and training capacity. As a testing tool for our study, we selected the myoma resection task. We analyzed the results using the multimetric score system suggested by HystSim, allowing a more precise interpretation of the results. Between June 2014 and May 2015, our group collected data on 57 participants of minimally invasive surgical training courses at the Kiel School of Gynecological Endoscopy, Department of Gynecology and Obstetrics, University Hospitals Schleswig-Holstein, Campus Kiel. The novice group consisted of 42 medical students and residents with no prior experience in hysteroscopy, whereas the expert group consisted of 15 participants with more than 2 years of experience of advanced hysteroscopy operations. The overall results demonstrated that all participants attained significant improvements between their pretest and posttests, independent of their previous levels of experience (p < 0.002). Those in the expert group demonstrated statistically significant, superior scores in the pretest and posttests (p = 0.001, p = 0.006). Regarding visualization and ergonomics, the novices showed a better pretest value than the experts; however, the experts were able to improve significantly during the posttest. These precise findings demonstrated that the multimetric scoring system achieved several important objectives, including clinical relevance, critical relevance, and training motivation. All participants demonstrated improvements in their hysteroscopic skills, proving an adequate construct validation of the HystSim. Using the multimetric scoring system enabled a more accurate analysis of the performance of the participants independent of their levels of experience which could be an important key for streamlining the learning curve. Future studies testing the predictive validation of the simulator and frequency of the training intervals are necessary before the introduction of the simulator into the standard surgical training curriculum. Copyright © 2016 Association of Program Directors in Surgery. Published by Elsevier Inc. All rights reserved.
Morizot, Julien
2014-10-01
While there are a number of short personality trait measures that have been validated for use with adults, few are specifically validated for use with adolescents. To trust such measures, it must be demonstrated that they have adequate construct validity. According to the view of construct validity as a unifying form of validity requiring the integration of different complementary sources of information, this article reports the evaluation of content, factor, convergent, and criterion validities as well as reliability of adolescents' self-reported personality traits. Moreover, this study sought to address an inherent potential limitation of short personality trait measures, namely their limited conceptual breadth. In this study, starting with items from a known measure, after the language-level was adjusted for use with adolescents, items tapping fundamental primary traits were added to determine the impact of added conceptual breadth on the psychometric properties of the scales. The resulting new measure was named the Big Five Personality Trait Short Questionnaire (BFPTSQ). A group of expert judges considered the items to have adequate content validity. Using data from a community sample of early adolescents, the results confirmed the factor validity of the Big Five structure in adolescence as well as its measurement invariance across genders. More important, the added items did improve the convergent and criterion validities of the scales, but did not negatively affect their reliability. This study supports the construct validity of adolescents' self-reported personality traits and points to the importance of conceptual breadth in short personality measures. © The Author(s) 2014.
The psychometric properties of the 5-item gratitude questionnaire in Chinese adolescents.
Zeng, Y; Ling, Y; Huebner, E S; He, Y; Lei, X
2017-05-01
WHAT IS KNOWN ON THE SUBJECT?: The GQ-6 is one of the most widely used self-report questionnaires to evaluate the level of gratitude among adults. The GQ-5 appears suitable for adolescents. WHAT THIS PAPER ADDS TO EXISTING KNOWLEDGE?: We developed a Chinese version of the GQ-5 and examined evidence for its reliability and validity. Results demonstrated adequate reliability and validity, indicating that it is appropriate for the assessment of gratitude in Chinese adolescents. In addition, Chinese early adolescent females reported higher gratitude than adolescent males. WHAT ARE THE IMPLICATIONS FOR PRACTICE?: Screening adolescents who have lower levels of gratitude through the GQ-5 could help identify students who may benefit from empirically validated interventions to promote higher levels of gratitude in an effort to promote positive psychosocial and academic outcomes. Background This study was conducted to evaluate the psychometric properties of the Chinese version of the 5-item Gratitude Questionnaire (GQ-5). Method The sample consisted of 2093 middle school students (46.8% males) in mainland China. Confirmatory factor analysis and multigroup confirmatory factor analysis were performed to examine the factor structure and the measurement equivalence across gender. The convergent validity, Cronbach's α and mean interitem correlations of the GQ-5 were also evaluated. Results The results provided evidence of internal consistency reliability through a Cronbach's α of 0.812 and a mean interitem correlation of 0.463 for the total sample. The results also supported a one-dimensional factor structure. In addition, convergent validity was assessed by statistically significant positive correlations between the GQ-5 and the two subscales of the Children's Hope Scale (CHS) and the Brief Multidimensional Students' Life Satisfaction Scale (BMSLSS) total score. Finally, multigroup confirmatory factor analysis also demonstrated measurement equivalence across gender. Subsequent analyses of latent mean revealed gender differences in early adolescent male and female students. Conclusions The Chinese version of the GQ-5 appears to be a reliable and valid measure of gratitude among Chinese early adolescents. Early adolescent female students reported higher gratitude than early adolescent male students. © 2017 John Wiley & Sons Ltd.
Adaptive finite element methods for two-dimensional problems in computational fracture mechanics
NASA Technical Reports Server (NTRS)
Min, J. B.; Bass, J. M.; Spradley, L. W.
1994-01-01
Some recent results obtained using solution-adaptive finite element methods in two-dimensional problems in linear elastic fracture mechanics are presented. The focus is on the basic issue of adaptive finite element methods for validating the new methodology by computing demonstration problems and comparing the stress intensity factors to analytical results.
Demonstration of automated proximity and docking technologies
NASA Astrophysics Data System (ADS)
Anderson, Robert L.; Tsugawa, Roy K.; Bryan, Thomas C.
An autodock was demonstrated using straightforward techniques and real sensor hardware. A simulation testbed was established and validated. The sensor design was refined with improved optical performance and image processing noise mitigation techniques, and the sensor is ready for production from off-the-shelf components. The autonomous spacecraft architecture is defined. The areas of sensors, docking hardware, propulsion, and avionics are included in the design. The Guidance Navigation and Control architecture and requirements are developed. Modular structures suitable for automated control are used. The spacecraft system manager functions including configuration, resource, and redundancy management are defined. The requirements for autonomous spacecraft executive are defined. High level decisionmaking, mission planning, and mission contingency recovery are a part of this. The next step is to do flight demonstrations. After the presentation the following question was asked. How do you define validation? There are two components to validation definition: software simulation with formal and vigorous validation, and hardware and facility performance validated with respect to software already validated against analytical profile.
[Validation and verfication of microbiology methods].
Camaró-Sala, María Luisa; Martínez-García, Rosana; Olmos-Martínez, Piedad; Catalá-Cuenca, Vicente; Ocete-Mochón, María Dolores; Gimeno-Cardona, Concepción
2015-01-01
Clinical microbiologists should ensure, to the maximum level allowed by the scientific and technical development, the reliability of the results. This implies that, in addition to meeting the technical criteria to ensure their validity, they must be performed with a number of conditions that allows comparable results to be obtained, regardless of the laboratory that performs the test. In this sense, the use of recognized and accepted reference methodsis the most effective tool for these guarantees. The activities related to verification and validation of analytical methods has become very important, as there is continuous development, as well as updating techniques and increasingly complex analytical equipment, and an interest of professionals to ensure quality processes and results. The definitions of validation and verification are described, along with the different types of validation/verification, and the types of methods, and the level of validation necessary depending on the degree of standardization. The situations in which validation/verification is mandatory and/or recommended is discussed, including those particularly related to validation in Microbiology. It stresses the importance of promoting the use of reference strains as controls in Microbiology and the use of standard controls, as well as the importance of participation in External Quality Assessment programs to demonstrate technical competence. The emphasis is on how to calculate some of the parameters required for validation/verification, such as the accuracy and precision. The development of these concepts can be found in the microbiological process SEIMC number 48: «Validation and verification of microbiological methods» www.seimc.org/protocols/microbiology. Copyright © 2013 Elsevier España, S.L.U. y Sociedad Española de Enfermedades Infecciosas y Microbiología Clínica. All rights reserved.
Validation of alternative methods for toxicity testing.
Bruner, L H; Carr, G J; Curren, R D; Chamberlain, M
1998-01-01
Before nonanimal toxicity tests may be officially accepted by regulatory agencies, it is generally agreed that the validity of the new methods must be demonstrated in an independent, scientifically sound validation program. Validation has been defined as the demonstration of the reliability and relevance of a test method for a particular purpose. This paper provides a brief review of the development of the theoretical aspects of the validation process and updates current thinking about objectively testing the performance of an alternative method in a validation study. Validation of alternative methods for eye irritation testing is a specific example illustrating important concepts. Although discussion focuses on the validation of alternative methods intended to replace current in vivo toxicity tests, the procedures can be used to assess the performance of alternative methods intended for other uses. Images Figure 1 PMID:9599695
Linton, Steven J; Flink, Ida K; Nilsson, Emma; Edlund, Sara
2017-05-01
Patient-centered, empathetic communication has been recommended as a means for improving the health care of patients suffering pain. However, a problem has been training health care providers since programs may be time-consuming and difficult to learn. Validation, a form of empathetic response that communicates that what a patient experiences is accepted as true, has been suggested as an appropriate method for improving communication with patients suffering pain. We study the immediate effects of providing medical students with a 2-session (45-minute duration each) program in validation skills on communication. A one group, pretest vs posttest design was employed with 22 volunteer medical students. To control patient variables, actors simulated 1 of 2 patient scenarios (randomly provided at pretest and posttest). Video recordings were blindly evaluated. Self-ratings of validation and satisfaction were also employed. Observed validation responses increased significantly after training and corresponded to significant reductions in invalidating responses. Both the patient simulators and the medical students were significantly more satisfied after the training. We demonstrated that training empathetic validation results in improved communication thus extending previous findings to a medical setting with patients suffering pain. Our results suggest that it would be feasible to provide validation training for health care providers and this warrants further investigation in controlled studies.
Korjus, Kristjan; Hebart, Martin N.; Vicente, Raul
2016-01-01
Supervised machine learning methods typically require splitting data into multiple chunks for training, validating, and finally testing classifiers. For finding the best parameters of a classifier, training and validation are usually carried out with cross-validation. This is followed by application of the classifier with optimized parameters to a separate test set for estimating the classifier’s generalization performance. With limited data, this separation of test data creates a difficult trade-off between having more statistical power in estimating generalization performance versus choosing better parameters and fitting a better model. We propose a novel approach that we term “Cross-validation and cross-testing” improving this trade-off by re-using test data without biasing classifier performance. The novel approach is validated using simulated data and electrophysiological recordings in humans and rodents. The results demonstrate that the approach has a higher probability of discovering significant results than the standard approach of cross-validation and testing, while maintaining the nominal alpha level. In contrast to nested cross-validation, which is maximally efficient in re-using data, the proposed approach additionally maintains the interpretability of individual parameters. Taken together, we suggest an addition to currently used machine learning approaches which may be particularly useful in cases where model weights do not require interpretation, but parameters do. PMID:27564393
Korjus, Kristjan; Hebart, Martin N; Vicente, Raul
2016-01-01
Supervised machine learning methods typically require splitting data into multiple chunks for training, validating, and finally testing classifiers. For finding the best parameters of a classifier, training and validation are usually carried out with cross-validation. This is followed by application of the classifier with optimized parameters to a separate test set for estimating the classifier's generalization performance. With limited data, this separation of test data creates a difficult trade-off between having more statistical power in estimating generalization performance versus choosing better parameters and fitting a better model. We propose a novel approach that we term "Cross-validation and cross-testing" improving this trade-off by re-using test data without biasing classifier performance. The novel approach is validated using simulated data and electrophysiological recordings in humans and rodents. The results demonstrate that the approach has a higher probability of discovering significant results than the standard approach of cross-validation and testing, while maintaining the nominal alpha level. In contrast to nested cross-validation, which is maximally efficient in re-using data, the proposed approach additionally maintains the interpretability of individual parameters. Taken together, we suggest an addition to currently used machine learning approaches which may be particularly useful in cases where model weights do not require interpretation, but parameters do.
Personality traits in companion dogs-Results from the VIDOPET.
Turcsán, Borbála; Wallis, Lisa; Virányi, Zsófia; Range, Friederike; Müller, Corsin A; Huber, Ludwig; Riemer, Stefanie
2018-01-01
Individual behavioural differences in pet dogs are of great interest from a basic and applied research perspective. Most existing dog personality tests have specific (practical) goals in mind and so focused only on a limited aspect of dogs' personality, such as identifying problematic (aggressive or fearful) behaviours, assessing suitability as working dogs, or improving the results of adoption. Here we aimed to create a comprehensive test of personality in pet dogs that goes beyond traditional practical evaluations by exposing pet dogs to a range of situations they might encounter in everyday life. The Vienna Dog Personality Test (VIDOPET) consists of 15 subtests and was performed on 217 pet dogs. A two-step data reduction procedure (principal component analysis on each subtest followed by an exploratory factor analysis on the subtest components) yielded five factors: Sociability-obedience, Activity-independence, Novelty seeking, Problem orientation, and Frustration tolerance. A comprehensive evaluation of reliability and validity measures demonstrated excellent inter- and intra-observer reliability and adequate internal consistency of all factors. Moreover the test showed good temporal consistency when re-testing a subsample of dogs after an average of 3.8 years-a considerably longer test-retest interval than assessed for any other dog personality test, to our knowledge. The construct validity of the test was investigated by analysing the correlations between the results of video coding and video rating methods and the owners' assessment via a dog personality questionnaire. The results demonstrated good convergent as well as discriminant validity. To conclude, the VIDOPET is not only a highly reliable and valid tool for measuring dog personality, but also the first test to show consistent behavioural traits related to problem solving ability and frustration tolerance in pet dogs.
Personality traits in companion dogs—Results from the VIDOPET
Wallis, Lisa; Virányi, Zsófia; Range, Friederike; Müller, Corsin A.; Huber, Ludwig; Riemer, Stefanie
2018-01-01
Individual behavioural differences in pet dogs are of great interest from a basic and applied research perspective. Most existing dog personality tests have specific (practical) goals in mind and so focused only on a limited aspect of dogs’ personality, such as identifying problematic (aggressive or fearful) behaviours, assessing suitability as working dogs, or improving the results of adoption. Here we aimed to create a comprehensive test of personality in pet dogs that goes beyond traditional practical evaluations by exposing pet dogs to a range of situations they might encounter in everyday life. The Vienna Dog Personality Test (VIDOPET) consists of 15 subtests and was performed on 217 pet dogs. A two-step data reduction procedure (principal component analysis on each subtest followed by an exploratory factor analysis on the subtest components) yielded five factors: Sociability-obedience, Activity-independence, Novelty seeking, Problem orientation, and Frustration tolerance. A comprehensive evaluation of reliability and validity measures demonstrated excellent inter- and intra-observer reliability and adequate internal consistency of all factors. Moreover the test showed good temporal consistency when re-testing a subsample of dogs after an average of 3.8 years—a considerably longer test-retest interval than assessed for any other dog personality test, to our knowledge. The construct validity of the test was investigated by analysing the correlations between the results of video coding and video rating methods and the owners’ assessment via a dog personality questionnaire. The results demonstrated good convergent as well as discriminant validity. To conclude, the VIDOPET is not only a highly reliable and valid tool for measuring dog personality, but also the first test to show consistent behavioural traits related to problem solving ability and frustration tolerance in pet dogs. PMID:29634747
Quadruplex digital flight control system assessment
NASA Technical Reports Server (NTRS)
Mulcare, D. B.; Downing, L. E.; Smith, M. K.
1988-01-01
Described are the development and validation of a double fail-operational digital flight control system architecture for critical pitch axis functions. Architectural tradeoffs are assessed, system simulator modifications are described, and demonstration testing results are critiqued. Assessment tools and their application are also illustrated. Ultimately, the vital role of system simulation, tailored to digital mechanization attributes, is shown to be essential to validating the airworthiness of full-time critical functions such as augmented fly-by-wire systems for relaxed static stability airplanes.
Development and validation of the Overall Depression Severity and Impairment Scale.
Bentley, Kate H; Gallagher, Matthew W; Carl, Jenna R; Barlow, David H
2014-09-01
The need to capture severity and impairment of depressive symptomatology is widespread. Existing depression scales are lengthy and largely focus on individual symptoms rather than resulting impairment. The Overall Depression Severity and Impairment Scale (ODSIS) is a 5-item, continuous measure designed for use across heterogeneous mood disorders and with subthreshold depressive symptoms. This study examined the psychometric properties of the ODSIS in outpatients in a clinic for emotional disorders (N = 100), undergraduate students (N = 566), and community-based adults (N = 189). Internal consistency, latent structure, item response theory, classification accuracy, convergent and discriminant validity, and differential item functioning analyses were conducted. ODSIS scores exhibited excellent internal consistency, and confirmatory factor analyses supported a unidimensional structure. Item response theory results demonstrated that the ODSIS provides more information about individuals with high levels of depression than those with low levels of depression. Responses on the ODSIS discriminated well between individuals with and without a mood disorder and depression-related severity across clinical and subclinical levels. A cut score of 8 correctly classified 82% of outpatients as with or without a mood disorder; it evidenced a favorable balance of sensitivity and specificity and of positive and negative predictive values. The ODSIS demonstrated good convergent and discriminant validity, and results indicate that items function similarly across clinical and nonclinical samples. Overall, findings suggest that the ODSIS is a valid tool for measuring depression-related severity and impairment. The brevity and ease of use of the ODSIS support its utility for screening and monitoring treatment response across a variety of settings. PsycINFO Database Record (c) 2014 APA, all rights reserved.
Validation of Bayesian analysis of compartmental kinetic models in medical imaging.
Sitek, Arkadiusz; Li, Quanzheng; El Fakhri, Georges; Alpert, Nathaniel M
2016-10-01
Kinetic compartmental analysis is frequently used to compute physiologically relevant quantitative values from time series of images. In this paper, a new approach based on Bayesian analysis to obtain information about these parameters is presented and validated. The closed-form of the posterior distribution of kinetic parameters is derived with a hierarchical prior to model the standard deviation of normally distributed noise. Markov chain Monte Carlo methods are used for numerical estimation of the posterior distribution. Computer simulations of the kinetics of F18-fluorodeoxyglucose (FDG) are used to demonstrate drawing statistical inferences about kinetic parameters and to validate the theory and implementation. Additionally, point estimates of kinetic parameters and covariance of those estimates are determined using the classical non-linear least squares approach. Posteriors obtained using methods proposed in this work are accurate as no significant deviation from the expected shape of the posterior was found (one-sided P>0.08). It is demonstrated that the results obtained by the standard non-linear least-square methods fail to provide accurate estimation of uncertainty for the same data set (P<0.0001). The results of this work validate new methods for a computer simulations of FDG kinetics. Results show that in situations where the classical approach fails in accurate estimation of uncertainty, Bayesian estimation provides an accurate information about the uncertainties in the parameters. Although a particular example of FDG kinetics was used in the paper, the methods can be extended for different pharmaceuticals and imaging modalities. Copyright © 2016 Associazione Italiana di Fisica Medica. Published by Elsevier Ltd. All rights reserved.
Measuring Sexual Motives: A Test of the Psychometric Properties of the Sexual Motivations Scale.
Jardin, Charles; Garey, Lorra; Zvolensky, Michael J
2017-01-01
Sexual motives refer to functions served by sexual behavior. The Sex Motivations Scale (SMS) has frequently been used to assess sexual motives. At its development, the SMS demonstrated good internal consistency; convergent, divergent, and criterion validity; and configural invariance across sex, age, and Caucasians and African Americans. Yet the metric and scalar invariance of the SMS has not been examined, nor has the measurement invariance of the SMS across Hispanic and Asian Americans, sexual minority status, and relationship status been tested. The criterion validity of the SMS also has yet to be examined for nonintercourse sexual behaviors, such as sexting. The present study aimed to address these gaps in a diverse sample of 2,201 college students (77.60% female; M age = 22.06; 27.84% Caucasian). Results further affirmed the configural, metric, and scalar invariance of the SMS. The convergent and divergent validity of the SMS was supported in relation to positive and negative affect and attachment patterns; and specific SMS subscales demonstrated associations with sexual intercourse behaviors and sexting, supporting the criterion validity of the SMS. These findings suggest the relevance of the SMS in assessing sexual motives across diverse populations and behaviors.
Formiga, Magno F; Roach, Kathryn E; Vital, Isabel; Urdaneta, Gisel; Balestrini, Kira; Calderon-Candelario, Rafael A; Campos, Michael A; Cahalin, Lawrence P
2018-01-01
The Test of Incremental Respiratory Endurance (TIRE) provides a comprehensive assessment of inspiratory muscle performance by measuring maximal inspiratory pressure (MIP) over time. The integration of MIP over inspiratory duration (ID) provides the sustained maximal inspiratory pressure (SMIP). Evidence on the reliability and validity of these measurements in COPD is not currently available. Therefore, we assessed the reliability, responsiveness and construct validity of the TIRE measures of inspiratory muscle performance in subjects with COPD. Test-retest reliability, known-groups and convergent validity assessments were implemented simultaneously in 81 male subjects with mild to very severe COPD. TIRE measures were obtained using the portable PrO2 device, following standard guidelines. All TIRE measures were found to be highly reliable, with SMIP demonstrating the strongest test-retest reliability with a nearly perfect intraclass correlation coefficient (ICC) of 0.99, while MIP and ID clustered closely together behind SMIP with ICC values of about 0.97. Our findings also demonstrated known-groups validity of all TIRE measures, with SMIP and ID yielding larger effect sizes when compared to MIP in distinguishing between subjects of different COPD status. Finally, our analyses confirmed convergent validity for both SMIP and ID, but not MIP. The TIRE measures of MIP, SMIP and ID have excellent test-retest reliability and demonstrated known-groups validity in subjects with COPD. SMIP and ID also demonstrated evidence of moderate convergent validity and appear to be more stable measures in this patient population than the traditional MIP.
Warkentin, Sarah; Mais, Laís Amaral; Latorre, Maria do Rosário Dias de Oliveira; Carnell, Susan; Taddei, José Augusto de Aguiar Carrazedo
2016-07-19
Recent national surveys in Brazil have demonstrated a decrease in the consumption of traditional food and a parallel increase in the consumption of ultra-processed food, which has contributed to a rise in obesity prevalence in all age groups. Environmental factors, especially familial factors, have a strong influence on the food intake of preschool children, and this has led to the development of psychometric scales to measure parents' feeding practices. The aim of this study was to test the validity of a translated and adapted Comprehensive Feeding Practices Questionnaire in a sample of Brazilian preschool-aged children enrolled in private schools. A transcultural adaptation process was performed in order to develop a modified questionnaire (43 items). After piloting, the questionnaire was sent to parents, along with additional questions about family characteristics. Test-retest reliability was assessed in one of the schools. Factor analysis with oblique rotation was performed. Internal reliability was tested using Cronbach's alpha and correlations between factors, discriminant validity using marker variables of child's food intake, and convergent validity via correlations with parental perceptions of perceived responsibility for feeding and concern about the child's weight were also performed. The final sample consisted of 402 preschool children. Factor analysis resulted in a final questionnaire of 43 items distributed over 6 factors. Cronbach alpha values were adequate (0.74 to 0.88), between-factor correlations were low, and discriminant validity and convergent validity were acceptable. The modified CFPQ demonstrated significant internal reliability in this urban Brazilian sample. Scale validation within different cultures is essential for a more comprehensive understanding of parental feeding practices for preschoolers.
Translation and validation of the Canadian diabetes risk assessment questionnaire in China.
Guo, Jia; Shi, Zhengkun; Chen, Jyu-Lin; Dixon, Jane K; Wiley, James; Parry, Monica
2018-01-01
To adapt the Canadian Diabetes Risk Assessment Questionnaire for the Chinese population and to evaluate its psychometric properties. A cross-sectional study was conducted with a convenience sample of 194 individuals aged 35-74 years from October 2014 to April 2015. The Canadian Diabetes Risk Assessment Questionnaire was adapted and translated for the Chinese population. Test-retest reliability was conducted to measure stability. Criterion and convergent validity of the adapted questionnaire were assessed using 2-hr 75 g oral glucose tolerance tests and the Finnish Diabetes Risk Scores, respectively. Sensitivity and specificity were evaluated to establish its predictive validity. The test-retest reliability was 0.988. Adequate validity of the adapted questionnaire was demonstrated by positive correlations found between the scores and 2-hr 75 g oral glucose tolerance tests (r = .343, p < .001) and with the Finnish Diabetes Risk Scores (r = .738, p < .001). The area under receiver operating characteristic curve was 0.705 (95% CI .632, .778), demonstrating moderate diagnostic value at a cutoff score of 30. The sensitivity was 73%, with a positive predictive value of 57% and negative predictive value of 78%. Our results provided evidence supporting the translation consistency, content validity, convergent validity, criterion validity, sensitivity, and specificity of the translated Canadian Diabetes Risk Assessment Questionnaire with minor modifications. This paper provides clinical, practical, and methodological information on how to adapt a diabetes risk calculator between cultures for public health nurses. © 2017 Wiley Periodicals, Inc.
Dahm, Jane; Wong, Dana; Ponsford, Jennie
2013-10-01
Anxiety and depression following traumatic brain injury (TBI) are associated with poorer outcomes. A brief self-report questionnaire would assist in identifying those at risk, however validity of such measures is complicated by confounding symptoms of the injury. This study investigated the validity of the Depression Anxiety Stress Scales (DASS) and Hospital Anxiety and Depression Scale (HADS), in screening for clinical diagnoses of anxiety and mood disorders following TBI. One hundred and twenty-three participants with mild to severe TBI were interviewed using the SCID (Axis I) and completed the DASS and HADS. The DASS, DASS21 and HADS scales demonstrated validity compared with SCID diagnoses of anxiety and mood disorders as measured by Area Under ROC Curve, sensitivity and specificity. Validity of the DASS depression scale benefited from items reflecting symptoms of devaluation of life, self-deprecation, and hopelessness that are not present on the HADS. Validity of the HADS anxiety scale benefited from items reflecting symptoms of tension and worry that are measured separately for the DASS on the stress scale. Participants were predominantly drawn from a rehabilitation centre which may limit the extent to which results can be generalized. Scores for the DASS21 were derived from the DASS rather than being administered separately. The DASS, DASS21 and HADS demonstrated validity as screening measures of anxiety and mood disorders in this TBI sample. The findings support use of these self-report questionnaires for individuals with TBI to identify those who should be referred for clinical diagnostic follow-up. © 2013 Elsevier B.V. All rights reserved.
LeDell, Erin; Petersen, Maya; van der Laan, Mark
In binary classification problems, the area under the ROC curve (AUC) is commonly used to evaluate the performance of a prediction model. Often, it is combined with cross-validation in order to assess how the results will generalize to an independent data set. In order to evaluate the quality of an estimate for cross-validated AUC, we obtain an estimate of its variance. For massive data sets, the process of generating a single performance estimate can be computationally expensive. Additionally, when using a complex prediction method, the process of cross-validating a predictive model on even a relatively small data set can still require a large amount of computation time. Thus, in many practical settings, the bootstrap is a computationally intractable approach to variance estimation. As an alternative to the bootstrap, we demonstrate a computationally efficient influence curve based approach to obtaining a variance estimate for cross-validated AUC.
Development and Validation of the Elder Learning Barriers Scale Among Older Chinese Adults.
Wang, Renfeng; De Donder, Liesbeth; De Backer, Free; He, Tao; Van Regenmortel, Sofie; Li, Shihua; Lombaerts, Koen
2017-12-01
This study describes the development and validation of the Elder Learning Barriers (ELB) scale, which seeks to identify the obstacles that affect the level of educational participation of older adults. The process of item pool design and scale development is presented, as well as the testing and scale refinement procedure. The data were collected from a sample of 579 older Chinese adults (aged over 55) in the Xi'an region of China. After randomly splitting the sample for cross-validation purposes, the construct validity of the ELB scale was confirmed containing five dimensions: dispositional, informational, physical, situational, and institutional barriers. Furthermore, developmental differences in factor structure have been examined among older age groups. The results indicated that the scale demonstrated good reliability and validity. We conclude in general that the ELB scale appears to be a valuable instrument for examining the learning barriers that older Chinese citizens experience for participating in organized educational activities.
Petersen, Maya; van der Laan, Mark
2015-01-01
In binary classification problems, the area under the ROC curve (AUC) is commonly used to evaluate the performance of a prediction model. Often, it is combined with cross-validation in order to assess how the results will generalize to an independent data set. In order to evaluate the quality of an estimate for cross-validated AUC, we obtain an estimate of its variance. For massive data sets, the process of generating a single performance estimate can be computationally expensive. Additionally, when using a complex prediction method, the process of cross-validating a predictive model on even a relatively small data set can still require a large amount of computation time. Thus, in many practical settings, the bootstrap is a computationally intractable approach to variance estimation. As an alternative to the bootstrap, we demonstrate a computationally efficient influence curve based approach to obtaining a variance estimate for cross-validated AUC. PMID:26279737
Herrero-Hahn, Raquel; Rojas, Juan Guillermo; Ospina-Díaz, Juan Manuel; Montoya-Juárez, Rafael; Restrepo-Medrano, Juan Carlos; Hueso-Montoro, César
2017-03-01
The level of cultural self-efficacy indicates the degree of confidence nursing professionals possess for their ability to provide culturally competent care. Cultural adaptation and validation of the Cultural Self-Efficacy Scale was performed for nursing professionals in Colombia. A scale validation study was conducted. Cultural adaptation and validation of the Cultural Self-Efficacy Scale was performed using a sample of 190 nurses in Colombia, between September 2013 and April 2014. This sample was chosen via systematic random sampling from a finite population. The scale was culturally adapted. Cronbach's alpha for the revised scale was .978. Factor analysis revealed the existence of six factors grouped in three dimensions that explained 68% of the variance. The results demonstrated that the version of the Cultural Self-Efficacy Scale adapted to the Colombian context is a valid and reliable instrument for determining the level of cultural self-efficacy of nursing professionals.
Validation of the Japanese Version of the Body Vigilance Scale.
Saigo, Tatsuo; Takebayashi, Yoshitake; Tayama, Jun; Bernick, Peter J; Schmidt, Norman B; Shirabe, Susumu; Sakano, Yuji
2016-06-01
The Body Vigilance Scale is a self-report measure of attention to bodily sensations. The measure was translated into Japanese and its reliability, validity, and factor structure were verified. Participants comprised 286 university students (age: 19 ± 1 years). All participants were administered the scale, along with several indices of anxiety (i.e., Anxiety Sensitivity Index, Short Health Anxiety Inventory Illness Likelihood Scale, Social Interaction Anxiety Scale, and Hospital Anxiety and Depression Scale). The Japanese version of the Body Vigilance Scale exhibited a unidimensional factor structure and strong internal consistency. Construct validity was demonstrated by significant correlations with the above measures. Results suggest that the Japanese version of the scale is a reliable, valid tool for measuring body vigilance in Japanese university students. © The Author(s) 2016.
Briley, Daniel A.; Domiteaux, Matthew; Tucker-Drob, Elliot M.
2014-01-01
Many achievement-relevant personality measures (APMs) have been developed, but the interrelations among APMs or associations with the broader personality landscape are not well-known. In Study 1, 214 participants were measured on 36 APMs and a measure of the Big Five. Factor analytic results supported the convergent and discriminant validity of five latent dimensions: performance, mastery, self-doubt, effort, and intellectual investment. Conscientiousness, neuroticism, and openness to experience had the most consistent associations with APMs. We constructed a more efficient scale– the Multidimensional Achievement-Relevant Personality Scale (MAPS). In Study 2, we replicated the factor structure and external correlates of the MAPS in a sample of 359 individuals. Finally, we validated the MAPS with four indicators of academic performance and demonstrated incremental validity. PMID:24839374
Demonstration of innovative techniques for work zone safety data analysis
DOT National Transportation Integrated Search
2009-07-15
Based upon the results of the simulator data analysis, additional future research can be : identified to validate the driving simulator in terms of similarities with Ohio work zones. For : instance, the speeds observed in the simulator were greater f...
Elfenbein, Hillary Anger; Jang, Daisung; Sharma, Sudeep; Sanchez-Burks, Jeffrey
2017-03-01
Emotional intelligence (EI) has captivated researchers and the public alike, but it has been challenging to establish its components as objective abilities. Self-report scales lack divergent validity from personality traits, and few ability tests have objectively correct answers. We adapt the Stroop task to introduce a new facet of EI called emotional attention regulation (EAR), which involves focusing emotion-related attention for the sake of information processing rather than for the sake of regulating one's own internal state. EAR includes 2 distinct components. First, tuning in to nonverbal cues involves identifying nonverbal cues while ignoring alternate content, that is, emotion recognition under conditions of distraction by competing stimuli. Second, tuning out of nonverbal cues involves ignoring nonverbal cues while identifying alternate content, that is, the ability to interrupt emotion recognition when needed to focus attention elsewhere. An auditory test of valence included positive and negative words spoken in positive and negative vocal tones. A visual test of approach-avoidance included green- and red-colored facial expressions depicting happiness and anger. The error rates for incongruent trials met the key criteria for establishing the validity of an EI test, in that the measure demonstrated test-retest reliability, convergent validity with other EI measures, divergent validity from factors such as general processing speed and mostly personality, and predictive validity in this case for well-being. By demonstrating that facets of EI can be validly theorized and empirically assessed, results also speak to the validity of EI more generally. (PsycINFO Database Record (c) 2017 APA, all rights reserved).
Willis, Brian H; Riley, Richard D
2017-09-20
An important question for clinicians appraising a meta-analysis is: are the findings likely to be valid in their own practice-does the reported effect accurately represent the effect that would occur in their own clinical population? To this end we advance the concept of statistical validity-where the parameter being estimated equals the corresponding parameter for a new independent study. Using a simple ('leave-one-out') cross-validation technique, we demonstrate how we may test meta-analysis estimates for statistical validity using a new validation statistic, Vn, and derive its distribution. We compare this with the usual approach of investigating heterogeneity in meta-analyses and demonstrate the link between statistical validity and homogeneity. Using a simulation study, the properties of Vn and the Q statistic are compared for univariate random effects meta-analysis and a tailored meta-regression model, where information from the setting (included as model covariates) is used to calibrate the summary estimate to the setting of application. Their properties are found to be similar when there are 50 studies or more, but for fewer studies Vn has greater power but a higher type 1 error rate than Q. The power and type 1 error rate of Vn are also shown to depend on the within-study variance, between-study variance, study sample size, and the number of studies in the meta-analysis. Finally, we apply Vn to two published meta-analyses and conclude that it usefully augments standard methods when deciding upon the likely validity of summary meta-analysis estimates in clinical practice. © 2017 The Authors. Statistics in Medicine published by John Wiley & Sons Ltd. © 2017 The Authors. Statistics in Medicine published by John Wiley & Sons Ltd.
Ávila, Christiane Wahast; Riegel, Barbara; Pokorski, Simoni Chiarelli; Camey, Suzi; Silveira, Luana Claudia Jacoby; Rabelo-Silva, Eneida Rejane
2013-01-01
Objective. To adapt and evaluate the psychometric properties of the Brazilian version of the SCHFI v 6.2. Methods. With the approval of the original author, we conducted a complete cross-cultural adaptation of the instrument (translation, synthesis, back translation, synthesis of back translation, expert committee review, and pretesting). The adapted version was named Brazilian version of the self-care of heart failure index v 6.2. The psychometric properties assessed were face validity and content validity (by expert committee review), construct validity (convergent validity and confirmatory factor analysis), and reliability. Results. Face validity and content validity were indicative of semantic, idiomatic, experimental, and conceptual equivalence. Convergent validity was demonstrated by a significant though moderate correlation (r = −0.51) on comparison with equivalent question scores of the previously validated Brazilian European heart failure self-care behavior scale. Confirmatory factor analysis supported the original three-factor model as having the best fit, although similar results were obtained for inadequate fit indices. The reliability of the instrument, as expressed by Cronbach's alpha, was 0.40, 0.82, and 0.93 for the self-care maintenance, self-care management, and self-care confidence scales, respectively. Conclusion. The SCHFI v 6.2 was successfully adapted for use in Brazil. Nevertheless, further studies should be carried out to improve its psychometric properties. PMID:24163765
De Caluwé, Elien; Verbeke, Lize; van Aken, Marcel; van der Heijden, Paul T; De Clercq, Barbara
2018-02-22
The inclusion of a dimensional trait model of personality pathology in DSM-5 creates new opportunities for research on developmental antecedents of personality pathology. The traits of this model can be measured with the Personality Inventory for DSM-5 (PID-5), initially developed for adults, but also demonstrating validity in adolescents. The present study adds to the growing body of literature on the psychometrics of the PID-5, by examining its structure, validity, and reliability in 187 psychiatric-referred late adolescents and emerging adults. PID-5, Big Five Inventory, and Kidscreen self-reports were provided, and 88 non-clinical matched controls completed the PID-5. Results confirm the PID-5's five-factor structure, indicate adequate psychometric properties, and underscore the construct and criterion validity, showing meaningful associations with adaptive traits and quality of life. Results are discussed in terms of the PID-5's applicability in vulnerable populations who are going through important developmental transition phases, such as the step towards early adulthood.
Hall, William J
2016-11-01
This article describes the development and preliminary validation of the Bullying, Harassment, and Aggression Receipt Measure (BullyHARM). The development of the BullyHARM involved a number of steps and methods, including a literature review, expert review, cognitive testing, readability testing, data collection from a large sample, reliability testing, and confirmatory factor analysis. A sample of 275 middle school students was used to examine the psychometric properties and factor structure of the BullyHARM, which consists of 22 items and 6 subscales: physical bullying, verbal bullying, social/relational bullying, cyber-bullying, property bullying, and sexual bullying. First-order and second-order factor models were evaluated. Results demonstrate that the first-order factor model had superior fit. Results of reliability testing indicate that the BullyHARM scale and subscales have very good internal consistency reliability. Findings indicate that the BullyHARM has good properties regarding content validation and respondent-related validation and is a promising instrument for measuring bullying victimization in school.
Hall, William J.
2017-01-01
This article describes the development and preliminary validation of the Bullying, Harassment, and Aggression Receipt Measure (BullyHARM). The development of the BullyHARM involved a number of steps and methods, including a literature review, expert review, cognitive testing, readability testing, data collection from a large sample, reliability testing, and confirmatory factor analysis. A sample of 275 middle school students was used to examine the psychometric properties and factor structure of the BullyHARM, which consists of 22 items and 6 subscales: physical bullying, verbal bullying, social/relational bullying, cyber-bullying, property bullying, and sexual bullying. First-order and second-order factor models were evaluated. Results demonstrate that the first-order factor model had superior fit. Results of reliability testing indicate that the BullyHARM scale and subscales have very good internal consistency reliability. Findings indicate that the BullyHARM has good properties regarding content validation and respondent-related validation and is a promising instrument for measuring bullying victimization in school. PMID:28194041
NASA Astrophysics Data System (ADS)
Pearlman, Aaron J.; Padula, Francis; Shao, Xi; Cao, Changyong; Goodman, Steven J.
2016-09-01
One of the main objectives of the Geostationary Operational Environmental Satellite R-Series (GOES-R) field campaign is to validate the SI traceability of the Advanced Baseline Imager. The campaign plans include a feasibility demonstration study for new near surface unmanned aircraft system (UAS) measurement capability that is being developed to meet the challenges of validating geostationary sensors. We report our progress in developing our initial systems by presenting the design and preliminary characterization results of the sensor suite. The design takes advantage of off-the-shelf technologies and fiber-based optical components to make hemispheric directional measurements from a UAS. The characterization results - including laboratory measurements of temperature effects and polarization sensitivity - are used to refine the radiometric uncertainty budget towards meeting the validation objectives for the campaign. These systems will foster improved validation capabilities for the GOES-R field campaign and other next generation satellite systems.
NASA Technical Reports Server (NTRS)
Ray, Ronald J.
1994-01-01
New flight test maneuvers and analysis techniques for evaluating the dynamic response of in-flight thrust models during throttle transients have been developed and validated. The approach is based on the aircraft and engine performance relationship between thrust and drag. Two flight test maneuvers, a throttle step and a throttle frequency sweep, were developed and used in the study. Graphical analysis techniques, including a frequency domain analysis method, were also developed and evaluated. They provide quantitative and qualitative results. Four thrust calculation methods were used to demonstrate and validate the test technique. Flight test applications on two high-performance aircraft confirmed the test methods as valid and accurate. These maneuvers and analysis techniques were easy to implement and use. Flight test results indicate the analysis techniques can identify the combined effects of model error and instrumentation response limitations on the calculated thrust value. The methods developed in this report provide an accurate approach for evaluating, validating, or comparing thrust calculation methods for dynamic flight applications.
Fan, Leimin; Lee, Jacob; Hall, Jeffrey; Tolentino, Edward J; Wu, Huaiqin; El-Shourbagy, Tawakol
2011-06-01
This article describes validation work for analysis of an Abbott investigational drug (Compound A) in monkey whole blood with dried blood spots (DBS). The impact of DBS spotting volume on analyte concentration was investigated. The quantitation range was between 30.5 and 10,200 ng/ml. Accuracy and precision of quality controls, linearity of calibration curves, matrix effect, selectivity, dilution, recovery and multiple stabilities were evaluated in the validation, and all demonstrated acceptable results. Incurred sample reanalysis was performed with 57 out of 58 samples having a percentage difference (versus the mean value) less than 20%. A linear relationship between the spotting volume and the spot area was drawn. The influence of spotting volume on concentration was discussed. All validation results met good laboratory practice acceptance requirements. Radial spreading of blood on DBS cards can be a factor in DBS concentrations at smaller spotting volumes.
Comprehensive Calibration and Validation Site for Information Remote Sensing
NASA Astrophysics Data System (ADS)
Li, C. R.; Tang, L. L.; Ma, L. L.; Zhou, Y. S.; Gao, C. X.; Wang, N.; Li, X. H.; Wang, X. H.; Zhu, X. H.
2015-04-01
As a naturally part of information technology, Remote Sensing (RS) is strongly required to provide very precise and accurate information product to serve industry, academy and the public at this information economic era. To meet the needs of high quality RS product, building a fully functional and advanced calibration system, including measuring instruments, measuring approaches and target site become extremely important. Supported by MOST of China via national plan, great progress has been made to construct a comprehensive calibration and validation (Cal&Val) site, which integrates most functions of RS sensor aviation testing, EO satellite on-orbit caration and performance assessment and RS product validation at this site located in Baotou, 600km west of Beijing. The site is equipped with various artificial standard targets, including portable and permanent targets, which supports for long-term calibration and validation. A number of fine-designed ground measuring instruments and airborne standard sensors are developed for realizing high-accuracy stepwise validation, an approach in avoiding or reducing uncertainties caused from nonsynchronized measurement. As part of contribution to worldwide Cal&Val study coordinated by CEOS-WGCV, Baotou site is offering its support to Radiometric Calibration Network of Automated Instruments (RadCalNet), with an aim of providing demonstrated global standard automated radiometric calibration service in cooperation with ESA, NASA, CNES and NPL. Furthermore, several Cal&Val campaigns have been performed during the past years to calibrate and validate the spaceborne/airborne optical and SAR sensors, and the results of some typical demonstration are discussed in this study.
Ego-Dissolution and Psychedelics: Validation of the Ego-Dissolution Inventory (EDI)
Nour, Matthew M.; Evans, Lisa; Nutt, David; Carhart-Harris, Robin L.
2016-01-01
Aims: The experience of a compromised sense of “self”, termed ego-dissolution, is a key feature of the psychedelic experience. This study aimed to validate the Ego-Dissolution Inventory (EDI), a new 8-item self-report scale designed to measure ego-dissolution. Additionally, we aimed to investigate the specificity of the relationship between psychedelics and ego-dissolution. Method: Sixteen items relating to altered ego-consciousness were included in an internet questionnaire; eight relating to the experience of ego-dissolution (comprising the EDI), and eight relating to the antithetical experience of increased self-assuredness, termed ego-inflation. Items were rated using a visual analog scale. Participants answered the questionnaire for experiences with classical psychedelic drugs, cocaine and/or alcohol. They also answered the seven questions from the Mystical Experiences Questionnaire (MEQ) relating to the experience of unity with one’s surroundings. Results: Six hundred and ninety-one participants completed the questionnaire, providing data for 1828 drug experiences (1043 psychedelics, 377 cocaine, 408 alcohol). Exploratory factor analysis demonstrated that the eight EDI items loaded exclusively onto a single common factor, which was orthogonal to a second factor comprised of the items relating to ego-inflation (rho = −0.110), demonstrating discriminant validity. The EDI correlated strongly with the MEQ-derived measure of unitive experience (rho = 0.735), demonstrating convergent validity. EDI internal consistency was excellent (Cronbach’s alpha 0.93). Three analyses confirmed the specificity of ego-dissolution for experiences occasioned by psychedelic drugs. Firstly, EDI score correlated with drug-dose for psychedelic drugs (rho = 0.371), but not for cocaine (rho = 0.115) or alcohol (rho = −0.055). Secondly, the linear regression line relating the subjective intensity of the experience to ego-dissolution was significantly steeper for psychedelics (unstandardized regression coefficient = 0.701) compared with cocaine (0.135) or alcohol (0.144). Ego-inflation, by contrast, was specifically associated with cocaine experiences. Finally, a binary Support Vector Machine classifier identified experiences occasioned by psychedelic drugs vs. cocaine or alcohol with over 85% accuracy using ratings of ego-dissolution and ego-inflation alone. Conclusion: Our results demonstrate the psychometric structure, internal consistency and construct validity of the EDI. Moreover, we demonstrate the close relationship between ego-dissolution and the psychedelic experience. The EDI will facilitate the study of the neuronal correlates of ego-dissolution, which is relevant for psychedelic-assisted psychotherapy and our understanding of psychosis. PMID:27378878
Varni, James W; Burwinkle, Tasha M; Katz, Ernest R; Meeske, Kathy; Dickinson, Paige
2002-04-01
The Pediatric Quality of Life Inventory (PedsQL) is a modular instrument designed to measure health-related quality of life (HRQOL) in children and adolescents ages 2-18 years. The PedsQL 4.0 Generic Core Scales are multidimensional child self-report and parent proxy-report scales developed as the generic core measure to be integrated with the PedsQL disease specific modules. The PedsQL Multidimensional Fatigue Scale was designed to measure fatigue in pediatric patients. The PedsQL 3.0 Cancer Module was designed to measure pediatric cancer specific HRQOL. The PedsQL Generic Core Scales, Multidimensional Fatigue Scale, and Cancer Module were administered to 339 families (220 child self-reports; 337 parent proxy-reports). Internal consistency reliability for the PedsQL Generic Core Total Scale Score (alpha = 0.88 child, 0.93 parent report), Multidimensional Fatigue Total Scale Score (alpha = 0.89 child, 0.92 parent report) and most Cancer Module Scales (average alpha = 0.72 child, 0.87 parent report) demonstrated reliability acceptable for group comparisons. Validity was demonstrated using the known-groups method. The PedsQL distinguished between healthy children and children with cancer as a group, and among children on-treatment versus off-treatment. The validity of the PedsQL Multidimensional Fatigue Scale was further demonstrated through hypothesized intercorrelations with dimensions of generic and cancer specific HRQOL. The results demonstrate the reliability and validity of the PedsQL Generic Core Scales, Multidimensional Fatigue Scale, and Cancer Module in pediatric cancer. The PedsQL may be utilized as an outcome measure in clinical trials, research, and clinical practice. Copyright 2002 American Cancer Society.
A Serious Game for Clinical Assessment of Cognitive Status: Validation Study
Chignell, Mark; Tierney, Mary C.; Lee, Jacques
2016-01-01
Background We propose the use of serious games to screen for abnormal cognitive status in situations where it may be too costly or impractical to use standard cognitive assessments (eg, emergency departments). If validated, serious games in health care could enable broader availability of efficient and engaging cognitive screening. Objective The objective of this work is to demonstrate the feasibility of a game-based cognitive assessment delivered on tablet technology to a clinical sample and to conduct preliminary validation against standard mental status tools commonly used in elderly populations. Methods We carried out a feasibility study in a hospital emergency department to evaluate the use of a serious game by elderly adults (N=146; age: mean 80.59, SD 6.00, range 70-94 years). We correlated game performance against a number of standard assessments, including the Mini-Mental State Examination (MMSE), Montreal Cognitive Assessment (MoCA), and the Confusion Assessment Method (CAM). Results After a series of modifications, the game could be used by a wide range of elderly patients in the emergency department demonstrating its feasibility for use with these users. Of 146 patients, 141 (96.6%) consented to participate and played our serious game. Refusals to play the game were typically due to concerns of family members rather than unwillingness of the patient to play the game. Performance on the serious game correlated significantly with the MoCA (r=–.339, P <.001) and MMSE (r=–.558, P <.001), and correlated (point-biserial correlation) with the CAM (r=.565, P <.001) and with other cognitive assessments. Conclusions This research demonstrates the feasibility of using serious games in a clinical setting. Further research is required to demonstrate the validity and reliability of game-based assessments for clinical decision making. PMID:27234145
Larsen, Lisbeth Runge; Jørgensen, Martin Grønbech; Junge, Tina; Juul-Kristensen, Birgit; Wedderkopp, Niels
2014-06-10
Because body proportions in childhood are different to those in adulthood, children have a relatively higher centre of mass location. This biomechanical difference and the fact that children's movements have not yet fully matured result in different sway performances in children and adults. When assessing static balance, it is essential to use objective, sensitive tools, and these types of measurement have previously been performed in laboratory settings. However, the emergence of technologies like the Nintendo Wii Board (NWB) might allow balance assessment in field settings. As the NWB has only been validated and tested for reproducibility in adults, the purpose of this study was to examine reproducibility and validity of the NWB in a field setting, in a population of children. Fifty-four 10-14 year-olds from the CHAMPS-Study DK performed four different balance tests: bilateral stance with eyes open (1), unilateral stance on dominant (2) and non-dominant leg (3) with eyes open, and bilateral stance with eyes closed (4). Three rounds of the four tests were completed with the NWB and with a force platform (AMTI). To assess reproducibility, an intra-day test-retest design was applied with a two-hour break between sessions. Bland-Altman plots supplemented by Minimum Detectable Change (MDC) and concordance correlation coefficient (CCC) demonstrated satisfactory reproducibility for the NWB and the AMTI (MDC: 26.3-28.2%, CCC: 0.76-0.86) using Centre Of Pressure path Length as measurement parameter. Bland-Altman plots demonstrated satisfactory concurrent validity between the NWB and the AMTI, supplemented by satisfactory CCC in all tests (CCC: 0.74-0.87). The ranges of the limits of agreement in the validity study were comparable to the limits of agreement of the reproducibility study. Both NWB and AMTI have satisfactory reproducibility for testing static balance in a population of children. Concurrent validity of NWB compared with AMTI was satisfactory. Furthermore, the results from the concurrent validity study were comparable to the reproducibility results of the NWB and the AMTI. Thus, NWB has the potential to replace the AMTI in field settings in studies including children. Future studies are needed to examine intra-subject variability and to test the predictive validity of NWB.
Measurement Invariance of the Social Phobia and Anxiety Inventory
Bunnell, Brian E.; Joseph, Dana L.; Beidel, Deborah C.
2012-01-01
The Social Phobia and Anxiety Inventory (SPAI) is a commonly used self-report measure of social phobia that has demonstrated adequate reliability, convergent validity, discriminant validity, and criterion-related validity. However, research has yet to address whether this measure functions equivalently in (a) individuals with and without a diagnosis of social phobia and (b) males and females. Evaluating measurement equivalence is necessary in order to determine that the construct of social anxiety is conceptually understood invariantly across these populations. The results of the current investigation, using a series of nested factorial models proposed by Vandenberg and Lance (2000), provide evidence for strong equivalence across 420 individuals with and without diagnoses of social anxiety disorder and across male and female samples. Accordingly, these results provide psychometric justification for comparison of SPAI scores across the symptom continuum and sexes. PMID:23247204
Sharif Nia, Hamid; Pahlevan Sharif, Saeed; Boyle, Christopher; Yaghoobzadeh, Ameneh; Tahmasbi, Bahram; Rassool, G Hussein; Taebei, Mozhgan; Soleimani, Mohammad Ali
2018-04-01
This study aimed to determine the factor structure of the spiritual well-being among a sample of the Iranian veterans. In this methodological research, 211 male veterans of Iran-Iraq warfare completed the Paloutzian and Ellison spiritual well-being scale. Maximum likelihood (ML) with oblique rotation was used to assess domain structure of the spiritual well-being. The construct validity of the scale was assessed using confirmatory factor analysis (CFA), convergent validity, and discriminant validity. Reliability was evaluated with Cronbach's alpha, Theta (θ), and McDonald Omega (Ω) coefficients, intra-class correlation coefficient (ICC), and construct reliability (CR). Results of ML and CFA suggested three factors which were labeled "relationship with God," "belief in fate and destiny," and "life optimism." The ICC, coefficients of the internal consistency, and CR were >.7 for the factors of the scale. Convergent validity and discriminant validity did not fulfill the requirements. The Persian version of spiritual well-being scale demonstrated suitable validity and reliability among the veterans of Iran-Iraq warfare.
Peters, Kurt R; Gawronski, Bertram
2011-04-01
Research has demonstrated that implicit and explicit evaluations of the same object can diverge. Explanations of such dissociations frequently appeal to dual-process theories, such that implicit evaluations are assumed to reflect object-valence contingencies independent of their perceived validity, whereas explicit evaluations reflect the perceived validity of object-valence contingencies. Although there is evidence supporting these assumptions, it remains unclear if dissociations can arise in situations in which object-valence contingencies are judged to be true or false during the learning of these contingencies. Challenging dual-process accounts that propose a simultaneous operation of two parallel learning mechanisms, results from three experiments showed that the perceived validity of evaluative information about social targets qualified both explicit and implicit evaluations when validity information was available immediately after the encoding of the valence information; however, delaying the presentation of validity information reduced its qualifying impact for implicit, but not explicit, evaluations.
Wyant, Tim; Estevam, Jose; Yang, Lili; Rosario, Maria
2016-03-01
Vedolizumab is a monoclonal antibody approved for use in ulcerative colitis and Crohn's disease. By specifically binding to α4 β7 integrin, vedolizumab prevents trafficking of lymphocytes to the gut, thereby interfering with disease pathology. During the clinical development program, the pharmacodynamic effect of vedolizumab was evaluated by 2 flow cytometry receptor occupancy assays: act-1 (ACT-1) and mucosal addressin cell adhesion molecule-1 (MAdCAM-1). Here we describe the development and validation of these assays. The ACT-1 assay is a receptor occupancy free-site assay that uses a monoclonal antibody with the same binding epitope as vedolizumab to detect free (unbound) sites on α4 β7 integrin. The MAdCAM-1 assay used a soluble version of the natural ligand for α4 β7 integrin to detect free sites. The assays were validated using a fit-for-purpose approach throughout the clinical development of vedolizumab. Both the ACT-1 assay and the MAdCAM-1 assay demonstrated acceptable reproducibility and repeatability. The assays were sufficiently stable to allow for clinical use. During clinical testing the assays demonstrated that vedolizumab was able to saturate peripheral cells at all doses tested. Two pharmacodynamic receptor occupancy assays were developed and validated to assess the effect of vedolizumab on peripheral blood cells. The results of these assays demonstrated the practical use of flow cytometry to examine pharmacodynamic response in clinical trials. © 2015 International Clinical Cytometry Society.
Objectifying Content Validity: Conducting a Content Validity Study in Social Work Research.
ERIC Educational Resources Information Center
Rubio, Doris McGartland; Berg-Weger, Marla; Tebb, Susan S.; Lee, E. Suzanne; Rauch, Shannon
2003-01-01
The purpose of this article is to demonstrate how to conduct a content validity study. Instructions on how to calculate a content validity index, factorial validity index, and an interrater reliability index and guide for interpreting these indices are included. Implications regarding the value of conducting a content validity study for…
Brunckhorst, Oliver; Shahid, Shahab; Aydin, Abdullatif; McIlhenny, Craig; Khan, Shahid; Raza, Syed Johar; Sahai, Arun; Brewin, James; Bello, Fernando; Kneebone, Roger; Khan, Muhammad Shamim; Dasgupta, Prokar; Ahmed, Kamran
2015-09-01
Current training modalities within ureteroscopy have been extensively validated and must now be integrated within a comprehensive curriculum. Additionally, non-technical skills often cause surgical error and little research has been conducted to combine this with technical skills teaching. This study therefore aimed to develop and validate a curriculum for semi-rigid ureteroscopy, integrating both technical and non-technical skills teaching within the programme. Delphi methodology was utilised for curriculum development and content validation, with a randomised trial then conducted (n = 32) for curriculum evaluation. The developed curriculum consisted of four modules; initially developing basic technical skills and subsequently integrating non-technical skills teaching. Sixteen participants underwent the simulation-based curriculum and were subsequently assessed, together with the control cohort (n = 16) within a full immersion environment. Both technical (Time to completion, OSATS and a task specific checklist) and non-technical (NOTSS) outcome measures were recorded with parametric and non-parametric analyses used depending on the distribution of our data as evaluated by a Shapiro-Wilk test. Improvements within the intervention cohort demonstrated educational value across all technical and non-technical parameters recorded, including time to completion (p < 0.01), OSATS scores (p < 0.001), task specific checklist scores (p = 0.011) and NOTSS scores (p < 0.001). Content validity, feasibility and acceptability were all demonstrated through curriculum development and post-study questionnaire results. The current developed curriculum demonstrates that integrating both technical and non-technical skills teaching is both educationally valuable and feasible. Additionally, the curriculum offers a validated simulation-based training modality within ureteroscopy and a framework for the development of other simulation-based programmes.
Duruöz, M T; Unal, C; Toprak, C Sanal; Sezer, I; Yilmaz, F; Ulutatar, F; Atagündüz, P; Baklacioglu, H S
2017-12-01
Background Systemic lupus erythematosus (SLE) may have a profound impact on quality of life. There is increasing interest in measuring quality of life in lupus patients. The purpose of this study was to investigate the validity and reliability of SLE Quality of Life Questionnaire (L-QoL) in Turkish SLE patients. Methods SLE according to 2012 Systemic Lupus International Collaborating Clinics Classification Criteria were recruited into the study. Demographic data, clinical parameters and disease activity measured with the Systemic Lupus Erythematosus Disease Activity Index-2000 (SLEDAI-2K); were noted. Nottingham Health Profile and Health Assessment Questionnaire were filled out in addition to the Turkish L-QoL (LQoL-TR). Internal consistency, test-retest reliability, and convergent and discriminant validity were evaluated. Results The mean age of participants was 43.55 ± 14.33 years and the mean disease duration was 89.8 ± 92.1 months. The patients filled out LQoL-TR in 2.5 min. Strong correlation of LQoL-TR with all subgroups of the Nottingham Health Profile and the Health Assessment Questionnaire were established showing the convergent validity. The highest correlation was demonstrated with emotional reactions (rho = 0.72) and sleep component (rho = 0.65) of the Nottingham Health Profile scale ( p < 0.0001). Its poor and not significant correlation with nonfunctional parameters (age, disease duration, perceived general health, SLEDAI-2K) showed its discriminative properties. LQoL-TR demonstrated good internal reliability with a Cronbach's α of 0.93 and test-retest reliability with intraclass correlation coefficient of 0.87. Conclusion The LQoL-TR is a practical and useful tool which demonstrates good validity and reliability.
Using Neural Networks for Sensor Validation
NASA Technical Reports Server (NTRS)
Mattern, Duane L.; Jaw, Link C.; Guo, Ten-Huei; Graham, Ronald; McCoy, William
1998-01-01
This paper presents the results of applying two different types of neural networks in two different approaches to the sensor validation problem. The first approach uses a functional approximation neural network as part of a nonlinear observer in a model-based approach to analytical redundancy. The second approach uses an auto-associative neural network to perform nonlinear principal component analysis on a set of redundant sensors to provide an estimate for a single failed sensor. The approaches are demonstrated using a nonlinear simulation of a turbofan engine. The fault detection and sensor estimation results are presented and the training of the auto-associative neural network to provide sensor estimates is discussed.
Validation and Inter-Comparison of Limb Sounding Profiles from MRO/MCS and MGS/TES
NASA Astrophysics Data System (ADS)
Shirley, J. H.; McConnochie, T. M.; Kass, D. M.; Kleinböhl, A.; Schofield, J. T.; Heavens, N. G.; McCleese, D. J.; Benson, J.; Hinson, D. P.; Bandfield, J. L.
2014-07-01
Nighttime atmospheric temperatures in northern middle latitudes during Mars' aphelion season obtained by MGS/TES and MRO/MCS are compared with MGS radio science results. Profile mean Δ Ts of <= 2 K demonstrate consistency of retrieved temperatures.
Pestle, Sarah L; Chorpita, Bruce F; Schiffman, Jason
2008-04-01
The Penn State Worry Questionnaire for Children (PSWQ-C; Chorpita, Tracey, Brown, Collica, & Barlow, 1997) is a 14-item self-report measure of worry in children and adolescents. Although the PSWQ-C has demonstrated favorable psychometric properties in small clinical and large community samples, this study represents the first psychometric evaluation of the PSWQ-C in a large clinical sample (N = 491). Factor analysis indicated a two-factor structure, in contrast to all previously published findings on the measure. The PSWQ-C demonstrated favorable psychometric properties in this sample, including high internal consistency, high convergent validity with related constructs, and acceptable discriminative validity between diagnostic categories. The performance of the 3 reverse-scored items was closely examined, and results indicated retaining all 14 items.
Validation of a unique concept for a low-cost, lightweight space-deployable antenna structure
NASA Technical Reports Server (NTRS)
Freeland, R. E.; Bilyeu, G. D.; Veal, G. R.
1993-01-01
An experiment conducted in the framework of a NASA In-Space Technology Experiments Program based on a concept of inflatable deployable structures is described. The concept utilizes very low inflation pressure to maintain the required geometry on orbit and gravity-induced deflection of the structure precludes any meaningful ground-based demonstrations of functions performance. The experiment is aimed at validating and characterizing the mechanical functional performance of a 14-m-diameter inflatable deployable reflector antenna structure in the orbital operational environment. Results of the experiment are expected to significantly reduce the user risk associated with using large space-deployable antennas by demonstrating the functional performance of a concept that meets the criteria for low-cost, lightweight, and highly reliable space-deployable structures.
Smeraglia, John; Silva, John-Paul; Jones, Kieran
2017-08-01
In order to evaluate placental transfer of certolizumab pegol (CZP), a more sensitive and selective bioanalytical assay was required to accurately measure low CZP concentrations in infant and umbilical cord blood. Results & methodology: A new electrochemiluminescence immunoassay was developed to measure CZP levels in human plasma. Validation experiments demonstrated improved selectivity (no matrix interference observed) and a detection range of 0.032-5.0 μg/ml. Accuracy and precision met acceptance criteria (mean total error ≤20.8%). Dilution linearity and sample stability were acceptable and sufficient to support the method. The electrochemiluminescence immunoassay was validated for measuring low CZP concentrations in human plasma. The method demonstrated a more than tenfold increase in sensitivity compared with previous assays, and improved selectivity for intact CZP.
Ginsburg, Liane; Berta, Whitney; Baumbusch, Jennifer; Rohit Dass, Adrian; Laporte, Audrey; Reid, R Colin; Squires, Janet; Taylor, Deanne
2016-04-01
Health care aides (HCAs) provide most direct care in long-term care (LTC) and home and community care (HCC) settings but are understudied. We validate three key work attitude measures to better understand HCAs' work experiences: work engagement (WEng), psychological empowerment (PE), and organizational citizenship behavior (OCB-O). Data were collected from 306 HCAs working in LTC and HCC, using survey items for WEng, PE, and OCB-O adapted for HCAs. Psychometric evaluation involved confirmatory factor analysis (CFA). Predictive validity (correlations with measures of job satisfaction and turnover intention) and internal consistency reliability were examined. CFA supported a one-factor model of WEng, a four-factor model of PE, and a one-factor model of OCB-O. HCC workers scored higher than LTC workers on Self-determination (PE) and lower on Impact, demonstrating concurrent validity. WEng and PE correlated with worker outcomes (job satisfaction, turnover intention, and OCB-O), demonstrating predictive validity. Reliability and validity analyses indicated sound psychometric properties overall. Study results support psychometric properties of measures of WEng, PE, and OCB-O for HCAs. Knowledge of HCAs' work attitudes and behaviors can inform recruitment programs, incentive systems, and retention/training strategies for this vital group of care providers. © The Author 2016. Published by Oxford University Press on behalf of The Gerontological Society of America. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Identifying and classifying hyperostosis frontalis interna via computerized tomography.
May, Hila; Peled, Nathan; Dar, Gali; Hay, Ori; Abbas, Janan; Masharawi, Youssef; Hershkovitz, Israel
2010-12-01
The aim of this study was to recognize the radiological characteristics of hyperostosis frontalis interna (HFI) and to establish a valid and reliable method for its identification and classification. A reliability test was carried out on 27 individuals who had undergone a head computerized tomography (CT) scan. Intra-observer reliability was obtained by examining the images three times, by the same researcher, with a 2-week interval between each sample ranking. The inter-observer test was performed by three independent researchers. A validity test was carried out using two methods for identifying and classifying HFI: 46 cadaver skullcaps were ranked twice via computerized tomography scans and then by direct observation. Reliability and validity were calculated using Kappa test (SPSS 15.0). Reliability tests of ranking HFI via CT scans demonstrated good results (K > 0.7). As for validity, a very good consensus was obtained between the CT and direct observation, when moderate and advanced types of HFI were present (K = 0.82). The suggested classification method for HFI, using CT, demonstrated a sensitivity of 84%, specificity of 90.5%, and positive predictive value of 91.3%. In conclusion, volume rendering is a reliable and valid tool for identifying HFI. The suggested three-scale classification is most suitable for radiological diagnosis of the phenomena. Considering the increasing awareness of HFI as an early indicator of a developing malady, this study may assist radiologists in identifying and classifying the phenomena.
Faber, Irene R; Nijhuis-Van Der Sanden, Maria W G; Elferink-Gemser, Marije T; Oosterveld, Frits G J
2015-01-01
A motor skills assessment could be helpful in talent development by estimating essential perceptuo-motor skills of young players, which are considered requisite to develop excellent technical and tactical qualities. The Netherlands Table Tennis Association uses a motor skills assessment in their talent development programme consisting of eight items measuring perceptuo-motor skills specific to table tennis under varying conditions. This study aimed to investigate this assessment regarding its reproducibility, internal consistency, underlying dimensions and concurrent validity in 113 young table tennis players (6-10 years). Intraclass correlation coefficients of six test items met the criteria of 0.7 with coefficients of variation between 3% and 8%. Cronbach's alpha valued 0.853 for internal consistency. The principal components analysis distinguished two conceptually meaningful factors: "ball control" and "gross motor function." Concurrent validity analyses demonstrated moderate associations between the motor skills assessment's results and national ranking; boys r = -0.53 (P < 0.001) and girls r = -0.45 (P = 0.015). In conclusion, this evaluation demonstrated six test items with acceptable reproducibility, good internal consistency and good prospects for validity. Two test items need revision to upgrade reproducibility. Since the motor skills assessment seems to be a reproducible, objective part of a talent development programme, more longitudinal studies are required to investigate its predictive validity.
Active Aeroelastic Wing Aerodynamic Model Development and Validation for a Modified F/A-18A Airplane
NASA Technical Reports Server (NTRS)
Cumming, Stephen B.; Diebler, Corey G.
2005-01-01
A new aerodynamic model has been developed and validated for a modified F/A-18A airplane used for the Active Aeroelastic Wing (AAW) research program. The goal of the program was to demonstrate the advantages of using the inherent flexibility of an aircraft to enhance its performance. The research airplane was an F/A-18A with wings modified to reduce stiffness and a new control system to increase control authority. There have been two flight phases. Data gathered from the first flight phase were used to create the new aerodynamic model. A maximum-likelihood output-error parameter estimation technique was used to obtain stability and control derivatives. The derivatives were incorporated into the National Aeronautics and Space Administration F-18 simulation, validated, and used to develop new AAW control laws. The second phase of flights was used to evaluate the handling qualities of the AAW airplane and the control law design process, and to further test the accuracy of the new model. The flight test envelope covered Mach numbers between 0.85 and 1.30 and dynamic pressures from 600 to 1250 pound-force per square foot. The results presented in this report demonstrate that a thorough parameter identification analysis can be used to improve upon models that were developed using other means. This report describes the parameter estimation technique used, details the validation techniques, discusses differences between previously existing F/A-18 models, and presents results from the second phase of research flights.
Construct validity of the PROMIS® sexual function and satisfaction measures in patients with cancer
2013-01-01
Background With data from a diverse sample of patients either in treatment for cancer or post-treatment for cancer, we examine inter-domain and cross-domain correlations among the core domains of the Patient-Reported Outcomes Measurement Information System Sexual Function and Satisfaction measures (PROMIS® SexFS) and the corresponding domains from conceptually-similar measures of sexual function, the International Index of Erectile Function and the Female Sexual Function Index. Findings Men (N=389) and women (N=430) were recruited from a tumor registry, oncology clinics, and an internet panel. The PROMIS SexFS, International Index of Erectile Function, and Female Sexual Function Index were used to collect participants’ self-reported sexual function. The domains shared among the measures include desire/interest in sexual activity, lubrication and vaginal discomfort/pain (women), erectile function (men), orgasm, and satisfaction. We examined correlations among different domains within the same instrument (discriminant validity) and correlations among similar domains measured by different instruments (convergent validity). Correlations demonstrating discriminant validity ranged from 0.38 to 0.73 for men and 0.48 to 0.74 for women, while correlations demonstrating convergent validity ranged from 0.62 to 0.83 for men and 0.71 to 0.92 for women. As expected, correlations demonstrating convergent validity were higher than correlations demonstrating discriminant validity, with one exception (orgasm for men). Conclusions Construct validity was supported by convergent and discriminant validity in a diverse sample of patients with cancer. For patients with cancer who may or may not have sexual dysfunction, the PROMIS SexFS measures provide a comprehensive assessment of key domains of sexual function and satisfaction. PMID:23497200
Assessing Perceptions AbouT Hazardous Substances (PATHS): The PATHS questionnaire
Amlôt, Richard; Page, Lisa; Pearce, Julia; Wessely, Simon
2013-01-01
How people perceive the nature of a hazardous substance may determine how they respond when potentially exposed to it. We tested a new Perceptions AbouT Hazardous Substances (PATHS) questionnaire. In Study 1 (N = 21), we assessed the face validity of items concerning perceptions about eight properties of a hazardous substance. In Study 2 (N = 2030), we tested the factor structure, reliability and validity of the PATHS questionnaire across four qualitatively different substances. In Study 3 (N = 760), we tested the impact of information provision on Perceptions AbouT Hazardous Substances scores. Our results showed that our eight measures demonstrated good reliability and validity when used for non-contagious hazards. PMID:23104995
Solution-adaptive finite element method in computational fracture mechanics
NASA Technical Reports Server (NTRS)
Min, J. B.; Bass, J. M.; Spradley, L. W.
1993-01-01
Some recent results obtained using solution-adaptive finite element method in linear elastic two-dimensional fracture mechanics problems are presented. The focus is on the basic issue of adaptive finite element method for validating the applications of new methodology to fracture mechanics problems by computing demonstration problems and comparing the stress intensity factors to analytical results.
Wenborn, Jennifer; Challis, David; Pool, Jackie; Burgess, Jane; Elliott, Nicola; Orrell, Martin
2008-03-01
Activity is key to maintaining physical and mental health and well-being. However, as dementia affects the ability to engage in activity, care-givers can find it difficult to provide appropriate activities. The Pool Activity Level (PAL) Checklist guides the selection of appropriate, personally meaningful activities. The aim of this study was to assess the reliability and validity of the PAL Checklist when used with older people with dementia. A postal questionnaire sent to activity providers assessed content validity. Validity and reliability were measured in a sample of 60 older people with dementia. The questionnaire response rate was 83% (102/122). Most respondents felt no important items were missing. Seven of the nine activities were ranked as 'very important' or 'essential' by at least 77% of the sample, indicating very good content validity. Correlation with measures of cognition, severity of dementia and activity performance demonstrated strong concurrent validity. Inter-item correlation indicated strong construct validity. Cronbach's alpha coefficient measured internal consistency as excellent (0.95). All items achieved acceptable test-retest reliability, and the majority demonstrated acceptable inter-rater reliability. We conclude that the PAL Checklist demonstrates adequate validity and reliability when used with older people with dementia and appears a useful tool for a variety of care settings.
An integrated bioanalytical method development and validation approach: case studies.
Xue, Y-J; Melo, Brian; Vallejo, Martha; Zhao, Yuwen; Tang, Lina; Chen, Yuan-Shek; Keller, Karin M
2012-10-01
We proposed an integrated bioanalytical method development and validation approach: (1) method screening based on analyte's physicochemical properties and metabolism information to determine the most appropriate extraction/analysis conditions; (2) preliminary stability evaluation using both quality control and incurred samples to establish sample collection, storage and processing conditions; (3) mock validation to examine method accuracy and precision and incurred sample reproducibility; and (4) method validation to confirm the results obtained during method development. This integrated approach was applied to the determination of compound I in rat plasma and compound II in rat and dog plasma. The effectiveness of the approach was demonstrated by the superior quality of three method validations: (1) a zero run failure rate; (2) >93% of quality control results within 10% of nominal values; and (3) 99% incurred sample within 9.2% of the original values. In addition, rat and dog plasma methods for compound II were successfully applied to analyze more than 900 plasma samples obtained from Investigational New Drug (IND) toxicology studies in rats and dogs with near perfect results: (1) a zero run failure rate; (2) excellent accuracy and precision for standards and quality controls; and (3) 98% incurred samples within 15% of the original values. Copyright © 2011 John Wiley & Sons, Ltd.
A Note on Economic Content and Test Validity.
ERIC Educational Resources Information Center
Soper, John C.; Brenneke, Judith Staley
1987-01-01
Offers practical tips on how teachers can determine whether classroom tests are actually measuring what they are designed to measure. Discusses criterion-related validity, construct validity, and content validity. Demonstrates how to determine the degree of content validity a particular test may have for a particular course or unit. (Author/DH)
Crestani, Anelise Henrich; Moraes, Anaelena Bragança de; Souza, Ana Paula Ramos de
2017-08-10
To analyze the results of the validation of building enunciative signs of language acquisition for children aged 3 to 12 months. The signs were built based on mechanisms of language acquisition in an enunciative perspective and on clinical experience with language disorders. The signs were submitted to judgment of clarity and relevance by a sample of six experts, doctors in linguistic in with knowledge of psycholinguistics and language clinic. In the validation of reliability, two judges/evaluators helped to implement the instruments in videos of 20% of the total sample of mother-infant dyads using the inter-evaluator method. The method known as internal consistency was applied to the total sample, which consisted of 94 mother-infant dyads to the contents of the Phase 1 (3-6 months) and 61 mother-infant dyads to the contents of Phase 2 (7 to 12 months). The data were collected through the analysis of mother-infant interaction based on filming of dyads and application of the parameters to be validated according to the child's age. Data were organized in a spreadsheet and then converted to computer applications for statistical analysis. The judgments of clarity/relevance indicated no modifications to be made in the instruments. The reliability test showed an almost perfect agreement between judges (0.8 ≤ Kappa ≥ 1.0); only the item 2 of Phase 1 showed substantial agreement (0.6 ≤ Kappa ≥ 0.79). The internal consistency for Phase 1 had alpha = 0.84, and Phase 2, alpha = 0.74. This demonstrates the reliability of the instruments. The results suggest adequacy as to content validity of the instruments created for both age groups, demonstrating the relevance of the content of enunciative signs of language acquisition.
Oufir, Mouhssin; Sampath, Chethan; Butterweck, Veronika; Hamburger, Matthias
2012-08-01
The natural product (E,Z)-3-(4-hydroxy-3,5-dimethoxybenzylidene)indolin-2-one (indolinone) was identified some years ago as a nanomolar inhibitor of FcɛRI-receptor dependent mast cell degranulation. To further explore the potential of the compound, we established an UPLC-MS/MS assay for dosage in rat plasma. The method was fully validated according to FDA Guidance for industry. Results of this validation and long term stability study demonstrate that the method in lithium heparinized rat plasma is specific, accurate, precise and capable of producing reliable results according to recommendations of international guidelines. The method was validated with a LLOQ of 30.0 ng/mL and an ULOQ of 3000 ng/mL. The response versus concentration data were fitted with a first order polynomial with 1/X(2) weighting. No matrix effect was observed when using three independent sources of rat plasma. The average extraction recovery was consistent over the investigated range. This validation in rat plasma demonstrated that indolinone was stable for 190 days when stored below -65 °C; for 4 days at 10 °C in the autosampler; for 4h at RT, and during three successive freeze/thaw cycles at -65 °C. Preliminary pharmacokinetic data were obtained in male Sprague-Dawley rats (2 mg/kg BW i.v.). Blood samples taken from 0 to 12 h after injection were collected, and data analyzed with WinNonlin. A short half-life (4.30±0.14 min) and a relatively high clearance (3.83±1.46 L/h/kg) were found. Copyright © 2012 Elsevier B.V. All rights reserved.
Demonstration of Corrosion-Resistant Hybrid Composite Bridge Beams for Structural Applications
2016-09-01
result of corrosion of the steel support structures or the reinforcing bar in the concrete. The application of corrosion-resistant technology can...demonstrated and validated a corrosion-resistant hybrid-composite beam (HCB) for the reconstruction of a one span of a traditional steel and...concrete bridge at Fort Knox, Kentucky. The HCBs were installed on half of the bridge, and conventional steel beams were installed on the other half
Joint Service Solvent Substitution (JS3)
2012-05-01
Process Evaluation Acceptance Criteria Market Research Demonstration Plan Demonstration Validate Implementation Approval Start Approval...PRF-63460D Mg (AZ 31B-H24) mg/cm^2 0.7 Mg (SAE AMS 4377) “ Al (AMS-QQ-A-250) “ Al (7075-T6) “ 0.49 Ti (AMS 4911, 6AL- 4V ...properties – Evaluation of vendor test results – Industry experience – DOD Aerospace & Shipbuilding NESHAP experience Market Research 21 Approved for
Measuring Nutrition Literacy in Spanish-Speaking Latinos: An Exploratory Validation Study.
Gibbs, Heather D; Camargo, Juliana M T B; Owens, Sarah; Gajewski, Byron; Cupertino, Ana Paula
2017-11-21
Nutrition is important for preventing and treating chronic diseases highly prevalent among Latinos, yet no tool exists for measuring nutrition literacy among Spanish speakers. This study aimed to adapt the validated Nutrition Literacy Assessment Instrument for Spanish-speaking Latinos. This study was developed in two phases: adaptation and validity testing. Adaptation included translation, expert item content review, and interviews with Spanish speakers. For validity testing, 51 participants completed the Short Assessment of Health Literacy-Spanish (SAHL-S), the Nutrition Literacy Assessment Instrument in Spanish (NLit-S), and socio-demographic questionnaire. Validity and reliability statistics were analyzed. Content validity was confirmed with a Scale Content Validity Index of 0.96. Validity testing demonstrated NLit-S scores were strongly correlated with SAHL-S scores (r = 0.52, p < 0.001). Entire reliability was substantial at 0.994 (CI 0.992-0.996) and internal consistency was excellent (Cronbach's α = 0.92). The NLit-S demonstrates validity and reliability for measuring nutrition literacy among Spanish-speakers.
Patel, Alpesh A; Dodwad, Shah-Nawaz M; Boody, Barrett S; Bhatt, Surabhi; Savage, Jason W; Hsu, Wellington K; Rothrock, Nan E
2018-03-19
Prospective, cohort study. Demonstrate validity of PROMIS physical function, pain interference, and pain behavior computer adaptive tests (CATs) in surgically treated lumbar stenosis patients. There has been increasing attention given to patient reported outcomes associated with spinal interventions. Historical patient outcome measures have inadequate validation, demonstrate floor/ceiling effects, and infrequently used due to time constraints. PROMIS is an adaptive, responsive NIH assessment tool that measures patient-reported health status. 98 consecutive patients were surgically treated for lumbar spinal stenosis and were assessed using PROMIS CATs, ODI, ZCQ and SF-12. Prior lumbar surgery, history of scoliosis, cancer, trauma, or infection were excluded. Completion time, preoperative assessment, 6 week and 3 month postoperative scores were collected. At baseline, 49%, 79%, and 81% of patients had PROMIS PB, PI, and PF scores greater than 1 SD worse than the general population. 50.6% were categorized as severely disabled, crippled, or bed bound by ODI. PROMIS CATs demonstrated convergent validity through moderate to high correlations with legacy measures (r = 0.35-0.73). PROMIS CATs demonstrated known groups validity when stratified by ODI levels of disability. ODI improvements of at least 10 points on average had changes in PROMIS scores in the expected direction (PI = -12.98, PB = -9.74, PF = 7.53). PROMIS CATs demonstrated comparable responsiveness to change when evaluated against legacy measures. PROMIS PB and PI decreased 6.66 and 9.62 and PROMIS PF increased 6.8 points between baseline and 3-months post-op (p < 0.001). Completion time for the PROMIS CATs (2.6 minutes) compares favorably to ODI, ZCQ, and SF-12 scores (3.1, 3.6, and 3.0 minutes). PROMIS CATs demonstrate convergent validity, known groups validity, and responsiveness for surgically treated patients with lumbar stenosis to detect change over time and are more efficient than legacy instruments. 2.
V.C.3 Technology Validation : Fuel Cell Bus Evaluations
DOT National Transportation Integrated Search
2005-01-06
Based on the results of this analysis and the response from the project partners, the SunLine demonstration was deemed to be a success. Although it was a prototype (or pre-commercial) vehicle, the ThunderPower bus operated in revenue service at a rel...
Continuing Validation of the Teaching Autonomy Scale
ERIC Educational Resources Information Center
Pearson, L. Carolyn; Moomaw, William
2006-01-01
Although researchers have demonstrated a link between teacher autonomy and teacher motivation, job satisfaction, stress (burnout), professionalism, and empowerment, the task of identifying the underlying theoretical dimensions of teacher autonomy has met with varied results. The authors verified the existing 2-factor structure of the Teaching…
Field Test of Route Planning Software for Lunar Polar Missions
NASA Astrophysics Data System (ADS)
Horchler, A. D.; Cunningham, C.; Jones, H. L.; Arnett, D.; Fang, E.; Amoroso, E.; Otten, N.; Kitchell, F.; Holst, I.; Rock, G.; Whittaker, W.
2017-10-01
A novel field test paradigm has been developed to demonstrate and validate route planning software in the stark low-angled light and sweeping shadows a rover would experience at the poles of the Moon. Software, ConOps, and test results are presented.
NASA Technical Reports Server (NTRS)
Nemeth, Michael P.; Mikulas, Martin M., Jr.
2009-01-01
Simple formulas for the buckling stress of homogeneous, specially orthotropic, laminated-composite cylinders are presented. The formulas are obtained by using nondimensional parameters and equations that facilitate general validation, and are validated against the exact solution for a wide range of cylinder geometries and laminate constructions. Results are presented that establish the ranges of the nondimensional parameters and coefficients used. General results, given in terms of the nondimensional parameters, are presented that encompass a wide range of geometries and laminate constructions. These general results also illustrate a wide spectrum of behavioral trends. Design-oriented results are also presented that provide a simple, clear indication of laminate composition on critical stress, critical strain, and axial stiffness. An example is provided to demonstrate the application of these results to thin-walled column designs.
Network Security Validation Using Game Theory
NASA Astrophysics Data System (ADS)
Papadopoulou, Vicky; Gregoriades, Andreas
Non-functional requirements (NFR) such as network security recently gained widespread attention in distributed information systems. Despite their importance however, there is no systematic approach to validate these requirements given the complexity and uncertainty characterizing modern networks. Traditionally, network security requirements specification has been the results of a reactive process. This however, limited the immunity property of the distributed systems that depended on these networks. Security requirements specification need a proactive approach. Networks' infrastructure is constantly under attack by hackers and malicious software that aim to break into computers. To combat these threats, network designers need sophisticated security validation techniques that will guarantee the minimum level of security for their future networks. This paper presents a game-theoretic approach to security requirements validation. An introduction to game theory is presented along with an example that demonstrates the application of the approach.
Gill, Stephen D; de Morton, Natalie A; Mc Burney, Helen
2012-10-01
To assess and compare the validity of six physical function measures in people awaiting hip or knee joint replacement. Eighty-two people awaiting hip or knee replacement were assessed using six physical function measures including the WOMAC Function scale, SF-36 Physical Function scale, SF-36 Physical Component Summary scale, Patient Specific Functional Scale, 30-second chair stand test, and 50-foot timed walk. Validity was assessed using a head-to-head comparison design. Convergent validity was demonstrated with significant correlations between most measures (Spearman's rho 0.22 to 0.71). The Patient Specific Functional Scale had the lowest correlations with other measures of physical function. Discriminant validity was demonstrated with low correlations between mental health and physical function scores (Spearman's rho -0.12 to 0.33). Only the WOMAC Function scale, 30-second chair stand test, and 50-foot timed walk demonstrated known groups validity when scores for participants who walked with a gait aid were compared with those who did not. Standardized response means and Guyatt's responsiveness indexes indicated that the SF-36 was the least responsive measure. For those awaiting joint replacement surgery of the hip or knee, the current investigation found that the WOMAC Function scale, 30-second chair stand test, and 50-foot timed walk demonstrated the most evidence of validity. The Patient Specific Functional Scale might complement other measures by capturing a different aspect of physical function.
Symbolic control of visual attention: semantic constraints on the spatial distribution of attention.
Gibson, Bradley S; Scheutz, Matthias; Davis, Gregory J
2009-02-01
Humans routinely use spatial language to control the spatial distribution of attention. In so doing, spatial information may be communicated from one individual to another across opposing frames of reference, which in turn can lead to inconsistent mappings between symbols and directions (or locations). These inconsistencies may have important implications for the symbolic control of attention because they can be translated into differences in cue validity, a manipulation that is known to influence the focus of attention. This differential validity hypothesis was tested in Experiment 1 by comparing spatial word cues that were predicted to have high learned spatial validity ("above/below") and low learned spatial validity ("left/right"). Consistent with this prediction, when two measures of selective attention were used, the results indicated that attention was less focused in response to "left/right" cues than in response to "above/below" cues, even when the actual validity of each of the cues was equal. In addition, Experiment 2 predicted that spatial words such as "left/right" would have lower spatial validity than would other directional symbols that specify direction along the horizontal axis, such as "<--/-->" cues. The results were also consistent with this hypothesis. Altogether, the present findings demonstrate important semantic-based constraints on the spatial distribution of attention.
The City of Hope-Quality of Life-Ostomy Questionnaire: Persian Translation and Validation
Anaraki, F; Vafaie, M; Behboo, R; Esmaeilpour, S; Maghsoodi, N; Safaee, A; Grant, M
2014-01-01
Background: Since there is no disease-specific instrument for measuring quality-of-life (QOL) in Ostomy patients in Persian language. Aim: This study was designed to translate and evaluate the validity and reliability of City of Hope-quality of life-Ostomy questionnaire (COH-QOL-Ostomy questionnaire). Subjects and Methods: This study was designed as cross-sectional study. Reliability of the subscales and the summary scores were demonstrated by intra-class correlation coefficients. Pearson's correlations of an item with its own scale and other scales were calculated to evaluated convergent and discriminant validity. Clinical validity was also evaluated by known-group comparisons. Results: Cronbach's alpha coefficient for all subscales was about 0.70 or higher. Results of interscale correlation were satisfactory and each subscale only measured a single and specified trait. All subscales met the standards of convergent and discriminant validity. Known group comparison analysis showed significant differences in social and spiritual well-being. Conclusion: The findings confirmed the reliability and validity of Persian version of COH-QOL-Ostomy questionnaire. The instrument was also well received by the Iranian patients. It can be considered as a valuable instrument to assess the different aspects of health related quality-of-life in Ostomy patients and used in clinical research in the future. PMID:25221719
High-speed civil transport issues and technology program
NASA Technical Reports Server (NTRS)
Hewett, Marle D.
1992-01-01
A strawman program plan is presented, consisting of technology developments and demonstrations required to support the construction of a high-speed civil transport. The plan includes a compilation of technology issues related to the development of a transport. The issues represent technical areas in which research and development are required to allow airframe manufacturers to pursue an HSCT development. The vast majority of technical issues presented require flight demonstrated and validated solutions before a transport development will be undertaken by the industry. The author believes that NASA is the agency best suited to address flight demonstration issues in a concentrated effort. The new Integrated Test Facility at NASA Dryden Flight Research Facility is considered ideally suited to the task of supporting ground validations of proof-of-concept and prototype system demonstrations before night demonstrations. An elaborate ground hardware-in-the-loop (iron bird) simulation supported in this facility provides a viable alternative to developing an expensive fill-scale prototype transport technology demonstrator. Drygen's SR-71 assets, modified appropriately, are a suitable test-bed for supporting flight demonstrations and validations of certain transport technology solutions. A subscale, manned or unmanned flight demonstrator is suitable for flight validation of transport technology solutions, if appropriate structural similarity relationships can be established. The author contends that developing a full-scale prototype transport technology demonstrator is the best alternative to ensuring that a positive decision to develop a transport is reached by the United States aerospace industry.
NASA Astrophysics Data System (ADS)
Wimmer, Werenfrid
2016-08-01
The Infrared Sea surface temperature Autonomous Radiometer (ISAR) was developed to provide reference data for the validation of satellite Sea Surface Temperature at the Skin interface (SSTskin) temperature data products, particularly the Advanced Along Track Scanning Radiometer (AATSR). Since March 2004 ISAR instruments have been deployed nearly continuously on ferries crossing the English Channel and the Bay of Biscay, between Portsmouth (UK) and Bilbao/Santander (Spain). The resulting twelve years of ISAR data, including an individual uncertainty estimate for each SST record, are calibrated with traceability to national standards (National Institute of Standards and Technology, USA (NIST) and National Physical Laboratory, Teddigton, UK (NPL), Fiducial Reference Measurements for satellite derived surface temperature product validation (FRM4STS)). They provide a unique independent in situ reference dataset against which to validate satellite derived products. We present results of the AATSR validation, and show the use of ISAR fiducial reference measurements as a common traceable validation data source for both AATSR and Sea and Land Surface Temperature Radiometer (SLSTR). ISAR data were also used to review performance of the Operational Sea Surface Temperature and Sea Ice Analysis (OSTIA) Sea Surface Temperature (SST) analysis before and after the demise of ESA Environmental Satellite (Envisat) when AATSR inputs ceased This demonstrates use of the ISAR reference data set for validating the SST climatologies that will bridge the data gap between AATSR and SLSTR.
Martin, RobRoy L.
2012-01-01
Purpose/Background: The purpose of this study was to systematically review the literature for functional performance tests with evidence of reliability and validity that could be used for a young, athletic population with hip dysfunction. Methods: A search of PubMed and SPORTDiscus databases were performed to identify movement, balance, hop/jump, or agility functional performance tests from the current peer-reviewed literature used to assess function of the hip in young, athletic subjects. Results: The single-leg stance, deep squat, single-leg squat, and star excursion balance tests (SEBT) demonstrated evidence of validity and normative data for score interpretation. The single-leg stance test and SEBT have evidence of validity with association to hip abductor function. The deep squat test demonstrated evidence as a functional performance test for evaluating femoroacetabular impingement. Hop/Jump tests and agility tests have no reported evidence of reliability or validity in a population of subjects with hip pathology. Conclusions: Use of functional performance tests in the assessment of hip dysfunction has not been well established in the current literature. Diminished squat depth and provocation of pain during the single-leg balance test have been associated with patients diagnosed with FAI and gluteal tendinopathy, respectively. The SEBT and single-leg squat tests provided evidence of convergent validity through an analysis of kinematics and muscle function in normal subjects. Reliability of functional performance tests have not been established on patients with hip dysfunction. Further study is needed to establish reliability and validity of functional performance tests that can be used in a young, athletic population with hip dysfunction. Level of Evidence: 2b (Systematic Review of Literature) PMID:22893860
Song, Rhayun; Oh, Hyunkyoung; Ahn, Sukhee; Moorhead, Sue
2018-02-01
The purpose of this study was to validate the Cardiac Health Behavior Scale for Korean adults (CHB-K) to determine its validity and reliability. Cardiovascular diseases (CVDs) are one of the most important chronic diseases due to their high prevalence and mortality rates. Patients with cardiovascular risks or diseases need to perform appropriate cardiac health behaviors that help to prevent the progression of the disease and improve their health status. This secondary analysis obtained data from two clinical trials of cardiac rehabilitation. Data from 298 patients with cardiovascular risks or diseases were analyzed for validation. Data analyses included correlation coefficients, t-tests, and exploratory and confirmatory factor analyses using SPSS (version WIN 22.0) and AMOS (version 20.0). The Self-Efficacy Scale was used to assess convergent validity, while reliability was assessed using Cronbach's alpha coefficients. Five main factors were verified: health responsibility, physical activity, diet habit (eating habit and food choice), stress management, and smoking cessation. A set of 21 items from the 25-item scale was verified after performing item analysis, factor analyses, and critical evaluation of the statistical results. The 21-item CHB-K (CHB-K21) exhibited acceptable validity, and the model of the CHB-K21 provided a good fit to the data. Most of the factors were found to be moderately correlated with SES scores (r=0.45-0.52, p<0.001). The CHB-K21 also demonstrated acceptable reliability (Cronbach's alpha=0.83). The CHB-K21 demonstrates strong validity and reliability. It can be used to assess cardiac health behaviors in Korean adults with cardiovascular risks or diseases. Copyright © 2017 Elsevier Inc. All rights reserved.
Riley, Richard D.
2017-01-01
An important question for clinicians appraising a meta‐analysis is: are the findings likely to be valid in their own practice—does the reported effect accurately represent the effect that would occur in their own clinical population? To this end we advance the concept of statistical validity—where the parameter being estimated equals the corresponding parameter for a new independent study. Using a simple (‘leave‐one‐out’) cross‐validation technique, we demonstrate how we may test meta‐analysis estimates for statistical validity using a new validation statistic, Vn, and derive its distribution. We compare this with the usual approach of investigating heterogeneity in meta‐analyses and demonstrate the link between statistical validity and homogeneity. Using a simulation study, the properties of Vn and the Q statistic are compared for univariate random effects meta‐analysis and a tailored meta‐regression model, where information from the setting (included as model covariates) is used to calibrate the summary estimate to the setting of application. Their properties are found to be similar when there are 50 studies or more, but for fewer studies Vn has greater power but a higher type 1 error rate than Q. The power and type 1 error rate of Vn are also shown to depend on the within‐study variance, between‐study variance, study sample size, and the number of studies in the meta‐analysis. Finally, we apply Vn to two published meta‐analyses and conclude that it usefully augments standard methods when deciding upon the likely validity of summary meta‐analysis estimates in clinical practice. © 2017 The Authors. Statistics in Medicine published by John Wiley & Sons Ltd. PMID:28620945
McElhone, Kathleen; Abbott, Janice; Shelmerdine, Joanna; Bruce, Ian N; Ahmad, Yasmeen; Gordon, Caroline; Peers, Kate; Isenberg, David; Ferenkeh-Koroma, Ada; Griffiths, Bridget; Akil, Mohamed; Maddison, Peter; Teh, Lee-Suan
2007-08-15
To develop and validate a disease-specific health-related quality of life (HRQOL) instrument for adults with systemic lupus erythematosus (SLE). The work consisted of 6 stages. Stage 1 included item generation for questionnaire content from semistructured interviews with SLE patients. In stage 2 item selection for the draft questionnaire was performed by thematic analysis of the patient interview transcripts and expert panel agreement. In stage 3 the content validity of the draft questionnaire was assessed by patients completing the questionnaire and providing critical feedback. In stages 4 and 5 construct validity and internal reliability of the 3 versions of the LupusQoL were evaluated using principal component analysis with varimax rotation and Cronbach's alpha coefficients, respectively. In stage 6 discriminatory validity, concurrent validity, and test-retest reliability were evaluated. Stages 1, 2, and 3 resulted in a preliminary instrument containing 63 items. In stage 4, 8 domains were identified. This factor structure, accounting for 82% of the variance, was confirmed in stage 5. The domains and Cronbach's alpha coefficients were physical health (0.94), emotional health (0.94), body image (0.89), pain (0.92), planning (0.93), fatigue (0.88), intimate relationships (0.96), and burden to others (0.94). Discriminant validity was demonstrated for different levels of disease activity (British Isles Lupus Assessment Group Index) and damage (Systemic Lupus International Collaborating Clinics/American College of Rheumatology Damage Index). High correlations (r = 0.71-0.79) between comparable domains of the Short Form 36 and the LupusQoL assured acceptable concurrent validity. Good test-retest reliability (r = 0.72-0.93) was demonstrated. The LupusQoL is a validated SLE-specific HRQOL instrument with 34 items across 8 domains defined by patients as being important.
Tabanfar, Reza; Chan, Harley H L; Lin, Vincent; Le, Trung; Irish, Jonathan C
To develop and validate a smartphone based Virtual Reality Epley Maneuver System (VREMS) for home use. A smartphone application was designed to produce stereoscopic views of a Virtual Reality (VR) environment, which when viewed after placing a smartphone in a virtual reality headset, allowed the user to be guided step-by-step through the Epley maneuver in a VR environment. Twenty healthy participants were recruited and randomized to undergo either assisted Epleys or self-administered Epleys following reading instructions from an Instructional Handout (IH). All participants were filmed and two expert Otologists reviewed the videos, assigning each participant a score (out of 10) for performance on each step. Participants rated their perceived workload by completing a validated task-load questionnaire (NASA Task Load Index) and averages for both groups were calculated. Twenty participants were evaluated with average age 26.4±7.12years old in the VREMS group and 26.1±7.72 in the IH group. The VR assisted group achieved an average score of 7.78±0.99 compared to 6.65±1.72 in the IH group. This result was statistically significant with p=0.0001 and side dominance did not appear to play a factor. Analyzing each step of the Epley maneuver demonstrated that assisted Epleys were done more accurately with statically significant results in steps 2-4. Results of the NASA-TLX scores were variable with no significant findings. We have developed and demonstrated face validity for VREMS through our randomized controlled trial. The VREMS platform is promising technology, which may improve the accuracy and effectiveness of home Epley treatments. N/A. Copyright © 2017 Elsevier Inc. All rights reserved.
da Silva, Fabiana Alves; Vidal, Cláudia Fernanda de Lacerda; de Araújo, Ednaldo Cavalcante
2015-01-01
Abstract Objective: to validate the content of the prevention protocol for early sepsis caused by Streptococcus agalactiaein newborns. Method: a transversal, descriptive and methodological study, with a quantitative approach. The sample was composed of 15 judges, 8 obstetricians and 7 pediatricians. The validation occurred through the assessment of the content of the protocol by the judges that received the instrument for data collection - checklist - which contained 7 items that represent the requisites to be met by the protocol. The validation of the content was achieved by applying the Content Validity Index. Result: in the judging process, all the items that represented requirements considered by the protocol obtained concordance within the established level (Content Validity Index > 0.75). Of 7 items, 6 have obtained full concordance (Content Validity Index 1.0) and the feasibility item obtained a Content Validity Index of 0.93. The global assessment of the instruments obtained a Content Validity Index of 0.99. Conclusion: the validation of content that was done was an efficient tool for the adjustment of the protocol, according to the judgment of experienced professionals, which demonstrates the importance of conducting a previous validation of the instruments. It is expected that this study will serve as an incentive for the adoption of universal tracking by other institutions through validated protocols. PMID:26444165
A Spanish Validation of the Canadian Adolescent Gambling Inventory (CAGI).
Jiménez-Murcia, Susana; Granero, Roser; Stinchfield, Randy; Tremblay, Joël; Del Pino-Gutiérrez, Amparo; Moragas, Laura; Savvidou, Lamprini G; Fernández-Aranda, Fernando; Aymamí, Neus; Gómez-Peña, Mónica; Tárrega, Salomé; Gunnard, Katarina; Martín-Romera, Virginia; Steward, Trevor; Mestre-Bach, Gemma; Menchón, José M
2017-01-01
Aims: Large-scale epidemiological studies show a significant prevalence of gambling disorder (GD) during adolescence and emerging adulthood, and highlight the need to identify gambling-related behaviors at early ages. However, there are only a handful of screening instruments for this population and many studies measuring youth gambling problems use adult instruments that may not be developmentally appropriate. The aim of this study was to validate a Spanish version of the Canadian Adolescent Gambling Inventory (CAGI) among late adolescent and young adults and to explore its psychometric properties. Methods: The sample (16-29 years old) included a clinical group ( n = 55) with GD patients and a control group ( n = 340). Results: Exploratory factor analysis yielded one factor as the best model. This 24-item scale demonstrated satisfactory reliability (internal consistency, Cronbach's alpha, α = 0.91), satisfactory convergent validity as measured by correlation with South Oaks Gambling Screen ( r = 0.74), and excellent classification accuracy (AUC = 0.99; sensitivity = 0.98; and specificity = 0.99). Conclusion: Our results provide empirical support for our validation of the Spanish version of the CAGI. We uphold that the Spanish CAGI can be used as a brief, reliable, and valid instrument to assess gambling problems in Spanish youth.
Using entropy measures to characterize human locomotion.
Leverick, Graham; Szturm, Tony; Wu, Christine Q
2014-12-01
Entropy measures have been widely used to quantify the complexity of theoretical and experimental dynamical systems. In this paper, the value of using entropy measures to characterize human locomotion is demonstrated based on their construct validity, predictive validity in a simple model of human walking and convergent validity in an experimental study. Results show that four of the five considered entropy measures increase meaningfully with the increased probability of falling in a simple passive bipedal walker model. The same four entropy measures also experienced statistically significant increases in response to increasing age and gait impairment caused by cognitive interference in an experimental study. Of the considered entropy measures, the proposed quantized dynamical entropy (QDE) and quantization-based approximation of sample entropy (QASE) offered the best combination of sensitivity to changes in gait dynamics and computational efficiency. Based on these results, entropy appears to be a viable candidate for assessing the stability of human locomotion.
Jacchia, Sara; Nardini, Elena; Bassani, Niccolò; Savini, Christian; Shim, Jung-Hyun; Trijatmiko, Kurniawan; Kreysa, Joachim; Mazzara, Marco
2015-05-27
This article describes the international validation of the quantitative real-time polymerase chain reaction (PCR) detection method for Golden Rice 2. The method consists of a taxon-specific assay amplifying a fragment of rice Phospholipase D α2 gene, and an event-specific assay designed on the 3' junction between transgenic insert and plant DNA. We validated the two assays independently, with absolute quantification, and in combination, with relative quantification, on DNA samples prepared in haploid genome equivalents. We assessed trueness, precision, efficiency, and linearity of the two assays, and the results demonstrate that both the assays independently assessed and the entire method fulfill European and international requirements for methods for genetically modified organism (GMO) testing, within the dynamic range tested. The homogeneity of the results of the collaborative trial between Europe and Asia is a good indicator of the robustness of the method.
In-vitro Equilibrium Phosphate Binding Study of Sevelamer Carbonate by UV-Vis Spectrophotometry.
Prasaja, Budi; Syabani, M Maulana; Sari, Endah; Chilmi, Uci; Cahyaningsih, Prawitasari; Kosasih, Theresia Weliana
2018-06-12
Sevelamer carbonate is a cross-linked polymeric amine; it is the active ingredient in Renvela ® tablets. US FDA provides recommendation for demonstrating bioequivalence for the development of a generic product of sevelamer carbonte using in-vitro equilibrium binding study. A simple UV-vis spectrophotometry method was developed and validated for quantification of free phosphate to determine the binding parameter constant of sevelamer. The method validation demonstrated the specificity, limit of quantification, accuracy and precision of measurements. The validated method has been successfully used to analyze samples in in-vitro equilibrium binding study for demonstrating bioequivalence. © Georg Thieme Verlag KG Stuttgart · New York.
NASA Technical Reports Server (NTRS)
Vroman, G. A.
1975-01-01
The capability of shallow-notched, round-bar, tensile specimens for screening critical environments as they affect the material fracture properties of the space shuttle main engine was tested and analyzed. Specimens containing a 0.050-inch-deep circumferential sharp notch were cyclically loaded in a 5000-psi hydrogen environment at temperatures of +70 and -15 F. Replication of test results and a marked change in cyclic life because of temperature variation demonstrated the validity of the specimen type to be utilized for screening tests.
ELISA: a cryocooled 10 GHz oscillator with 10(-15) frequency stability.
Grop, S; Bourgeois, P Y; Bazin, N; Kersalé, Y; Rubiola, E; Langham, C; Oxborrow, M; Clapton, D; Walker, S; De Vicente, J; Giordano, V
2010-02-01
This article reports the design, the breadboarding, and the validation of an ultrastable cryogenic sapphire oscillator operated in an autonomous cryocooler. The objective of this project was to demonstrate the feasibility of a frequency stability of 3x10(-15) between 1 and 1000 s for the European Space Agency deep space stations. This represents the lowest fractional frequency instability ever achieved with cryocoolers. The preliminary results presented in this paper validate the design we adopted for the sapphire resonator, the cold source, and the oscillator loop.
NASA Technical Reports Server (NTRS)
Woodard, Paul R.; Batina, John T.; Yang, Henry T. Y.
1992-01-01
Quality assessment procedures are described for two-dimensional unstructured meshes. The procedures include measurement of minimum angles, element aspect ratios, stretching, and element skewness. Meshes about the ONERA M6 wing and the Boeing 747 transport configuration are generated using an advancing front method grid generation package of programs. Solutions of Euler's equations for these meshes are obtained at low angle-of-attack, transonic conditions. Results for these cases, obtained as part of a validation study demonstrate accuracy of an implicit upwind Euler solution algorithm.
Design and landing dynamic analysis of reusable landing leg for a near-space manned capsule
NASA Astrophysics Data System (ADS)
Yue, Shuai; Nie, Hong; Zhang, Ming; Wei, Xiaohui; Gan, Shengyong
2018-06-01
To improve the landing performance of a near-space manned capsule under various landing conditions, a novel landing system is designed that employs double chamber and single chamber dampers in the primary and auxiliary struts, respectively. A dynamic model of the landing system is established, and the damper parameters are determined by employing the design method. A single-leg drop test with different initial pitch angles is then conducted to compare and validate the simulation model. Based on the validated simulation model, seven critical landing conditions regarding nine crucial landing responses are found by combining the radial basis function (RBF) surrogate model and adaptive simulated annealing (ASA) optimization method. Subsequently, the adaptability of the landing system under critical landing conditions is analyzed. The results show that the simulation effectively results match the test results, which validates the accuracy of the dynamic model. In addition, all of the crucial responses under their corresponding critical landing conditions satisfy the design specifications, demonstrating the feasibility of the landing system.
Experimental Validation of L1 Adaptive Control: Rohrs' Counterexample in Flight
NASA Technical Reports Server (NTRS)
Xargay, Enric; Hovakimyan, Naira; Dobrokhodov, Vladimir; Kaminer, Issac; Kitsios, Ioannis; Cao, Chengyu; Gregory, Irene M.; Valavani, Lena
2010-01-01
The paper presents new results on the verification and in-flight validation of an L1 adaptive flight control system, and proposes a general methodology for verification and validation of adaptive flight control algorithms. The proposed framework is based on Rohrs counterexample, a benchmark problem presented in the early 80s to show the limitations of adaptive controllers developed at that time. In this paper, the framework is used to evaluate the performance and robustness characteristics of an L1 adaptive control augmentation loop implemented onboard a small unmanned aerial vehicle. Hardware-in-the-loop simulations and flight test results confirm the ability of the L1 adaptive controller to maintain stability and predictable performance of the closed loop adaptive system in the presence of general (artificially injected) unmodeled dynamics. The results demonstrate the advantages of L1 adaptive control as a verifiable robust adaptive control architecture with the potential of reducing flight control design costs and facilitating the transition of adaptive control into advanced flight control systems.
Ahlander, Britt-Marie; Årestedt, Kristofer; Engvall, Jan; Maret, Eva; Ericsson, Elisabeth
2016-06-01
To develop and validate a new instrument measuring patient anxiety during Magnetic Resonance Imaging examinations, Magnetic Resonance Imaging- Anxiety Questionnaire. Questionnaires measuring patients' anxiety during Magnetic Resonance Imaging examinations have been the same as used in a wide range of conditions. To learn about patients' experience during examination and to evaluate interventions, a specific questionnaire measuring patient anxiety during Magnetic Resonance Imaging is needed. Psychometric cross-sectional study with test-retest design. A new questionnaire, Magnetic Resonance Imaging-Anxiety Questionnaire, was designed from patient expressions of anxiety in Magnetic Resonance Imaging-scanners. The sample was recruited between October 2012-October 2014. Factor structure was evaluated with exploratory factor analysis and internal consistency with Cronbach's alpha. Criterion-related validity, known-group validity and test-retest was calculated. Patients referred for Magnetic Resonance Imaging of either the spine or the heart, were invited to participate. The development and validation of Magnetic Resonance Imaging-Anxiety Questionnaire resulted in 15 items consisting of two factors. Cronbach's alpha was found to be high. Magnetic Resonance Imaging-Anxiety Questionnaire correlated higher with instruments measuring anxiety than with depression scales. Known-group validity demonstrated a higher level of anxiety for patients undergoing Magnetic Resonance Imaging scan of the heart than for those examining the spine. Test-retest reliability demonstrated acceptable level for the scale. Magnetic Resonance Imaging-Anxiety Questionnaire bridges a gap among existing questionnaires, making it a simple and useful tool for measuring patient anxiety during Magnetic Resonance Imaging examinations. © 2016 The Authors. Journal of Advanced Nursing Published by John Wiley & Sons Ltd.
Mills, Whitney L.; Regev, Tziona; Kunik, Mark E.; Wilson, Nancy L.; Moye, Jennifer; McCullough, Laurence B.; Naik, Aanand D.
2017-01-01
Objectives Older adults prefer to remain in their own homes for as long as possible. The purpose of this article is to describe the development and preliminary validation of Making and Executing Decisions for Safe and Independent Living (MED-SAIL), a brief screening tool for capacity to live safely and independently in the community. Design Prospective preliminary validation study. Setting Outpatient geriatrics clinic located in a community-based hospital. Participants Forty-nine community-dwelling older adults referred to the clinic for a comprehensive capacity assessment. Measurements We examined internal consistency, criterion-based validity, concurrent validity, and accuracy of classification for MED-SAIL. Results The items included in MED-SAIL demonstrated internal consistency (5 items; α = 0.85). MED-SAIL was significantly correlated with the Independent Living Scales (r = 0.573, p ≤ 0.001) and instrumental activities of daily living (r = 0.440, p ≤ 0.01). The Mann-Whitney U test revealed significant differences between the no capacity and partial/full capacity classifications on MED-SAIL (U(48) = 60.5, Z = −0.38, p <0.0001). The area under the curve was 0.864 (95% confidence interval: 0.84–0.99). Conclusions This study demonstrated the validity of MED-SAIL as a brief screening tool to identify older adults with impaired capacity for remaining safe and independent in their current living environment. MED-SAIL is useful tool for health and social service providers in the community for the purpose of referral for definitive capacity evaluation. PMID:23567420
Reliable and valid assessment of point-of-care ultrasonography.
Todsen, Tobias; Tolsgaard, Martin Grønnebæk; Olsen, Beth Härstedt; Henriksen, Birthe Merete; Hillingsø, Jens Georg; Konge, Lars; Jensen, Morten Lind; Ringsted, Charlotte
2015-02-01
To explore the reliability and validity of the Objective Structured Assessment of Ultrasound Skills (OSAUS) scale for point-of-care ultrasonography (POC US) performance. POC US is increasingly used by clinicians and is an essential part of the management of acute surgical conditions. However, the quality of performance is highly operator-dependent. Therefore, reliable and valid assessment of trainees' ultrasonography competence is needed to ensure patient safety. Twenty-four physicians, representing novices, intermediates, and experts in POC US, scanned 4 different surgical patient cases in a controlled set-up. All ultrasound examinations were video-recorded and assessed by 2 blinded radiologists using OSAUS. Reliability was examined using generalizability theory. Construct validity was examined by comparing performance scores between the groups and by correlating physicians' OSAUS scores with diagnostic accuracy. The generalizability coefficient was high (0.81) and a D-study demonstrated that 1 assessor and 5 cases would result in similar reliability. The construct validity of the OSAUS scale was supported by a significant difference in the mean scores between the novice group (17.0; SD 8.4) and the intermediate group (30.0; SD 10.1), P = 0.007, as well as between the intermediate group and the expert group (72.9; SD 4.4), P = 0.04, and by a high correlation between OSAUS scores and diagnostic accuracy (Spearman ρ correlation coefficient = 0.76; P < 0.001). This study demonstrates high reliability as well as evidence of construct validity of the OSAUS scale for assessment of POC US competence. Hence, the OSAUS scale may be suitable for both in-training as well as end-of-training assessment.
Giusti, P.; Mancini, V.; Grosso, M.; Barillari, M.R.; Bastiani, L.; Molinaro, S.; Nacci, A.
2016-01-01
SUMMARY The purpose of this study was to compare videofluoroscopy (VFS), fiberoptic endoscopic evaluation of swallowing (FEES) and oro-pharyngo- oesophageal scintigraphy (OPES) with regards to premature spillage, post-swallowing residue and aspiration to assess the reliability of these tests for detection of oro-pharyngeal dysphagia. Sixty patients affected with dysphagia of various origin were enrolled in the study and submitted to VFS, FEES and OPES using a liquid and semi-solid bolus. As a reference, we used VFS. Both the FEES and the OPES showed good sensitivity with high overall values (≥ 80% and ≥ 90% respectively). The comparison between FEES vs VFS concerning drop before swallowing showed good specificity (84.4% for semi-solids and 86.7% for liquids). In the case of post-swallowing residue, FEES vs VFS revealed good overall validity (75% for semi-solids) with specificity and sensitivity well balanced for the semi-solids. OPES vs. VFS demonstrated good sensitivity (88.6%) and overall validity (76.7%) for liquids. The analysis of FEES vs. VFS for aspiration showed that the overall validity was low (≤ 65%). On the other hand, OPES demonstrated appreciable overall validity (71.7%). VFS, FEES and OPES are capable of detecting oro-pharyngeal dysphagia. FEES gave significant results in the evaluation of post-swallowing residues. PMID:27958600
Development and Validation of the Survey of Organizational Research Climate (SORC)
Martinson, Brian C.; Thrush, Carol R.; Crain, A. Lauren
2012-01-01
Background Development and targeting efforts by academic organizations to effectively promote research integrity can be enhanced if they are able to collect reliable data to benchmark baseline conditions, to assess areas needing improvement, and to subsequently assess the impact of specific initiatives. To date, no standardized and validated tool has existed to serve this need. Methods A web- and mail-based survey was administered in the second half of 2009 to 2,837 randomly selected biomedical and social science faculty and postdoctoral fellows at 40 academic health centers in top-tier research universities in the United States. Measures included the Survey of Organizational Research Climate (SORC) as well as measures of perceptions of organizational justice. Results Exploratory and confirmatory factor analyses yielded seven subscales of organizational research climate, all of which demonstrated acceptable internal consistency (Cronbach’s α ranging from 0.81 to 0.87) and adequate test-retest reliability (Pearson r ranging from 0.72 to 0.83). A broad range of correlations between the seven subscales and five measures of organizational justice (unadjusted regression coefficients ranging from .13 to .95) document both construct and discriminant validity of the instrument. Conclusions The SORC demonstrates good internal (alpha) and external reliability (test-retest) as well as both construct and discriminant validity. PMID:23096775
The report gives results of activities relating to the Advanced Utility Simulation Model (AUSM): sensitivity testing. comparison with a mature electric utility model, and calibration to historical emissions. The activities were aimed at demonstrating AUSM's validity over input va...
Correlates of the Rosenberg Self-Esteem Scale Method Effects
ERIC Educational Resources Information Center
Quilty, Lena C.; Oakman, Jonathan M.; Risko, Evan
2006-01-01
Investigators of personality assessment are becoming aware that using positively and negatively worded items in questionnaires to prevent acquiescence may negatively impact construct validity. The Rosenberg Self-Esteem Scale (RSES) has demonstrated a bifactorial structure typically proposed to result from these method effects. Recent work suggests…
Validity of Beck's Cognitive Theory of Depression with Nonreferred Adolescents.
ERIC Educational Resources Information Center
Moilanen, Donna L.
1995-01-01
Examined Beck's cognitive theory by analyzing relationships between depressive symptomatology and various measures of distorted, negative cognitive processes. Results demonstrated that high school students' greater levels of depressive symptomatology on the Beck Depression Inventory were most significantly associated with higher scores on both the…
Computer-Aided Techniques for Providing Operator Performance Measures.
ERIC Educational Resources Information Center
Connelly, Edward M.; And Others
This report documents the theory, structure, and implementation of a performance processor (written in FORTRAN IV) that can accept performance demonstration data representing various levels of operator's skill and, under user control, analyze data to provide candidate performance measures and validation test results. The processor accepts two…
77 FR 72226 - Picoxystrobin; Pesticide Tolerances
Federal Register 2010, 2011, 2012, 2013, 2014
2012-12-05
... one of the following methods: Federal eRulemaking Portal: http://www.regulations.gov . Follow the... validity, completeness, and reliability as well as the relationship of the results of the studies to human... as demonstrated by the severe eye irritation effect seen in the primary eye irritation study on...
Social Networks and Mourning: A Comparative Approach.
ERIC Educational Resources Information Center
Rubin, Nissan
1990-01-01
Suggests using social network theory to explain varieties of mourning behavior in different societies. Compares participation in funeral ceremonies of members of different social circles in American society and Israeli kibbutz. Concludes that results demonstrated validity of concepts deriving from social network analysis in study of bereavement,…
The Consolidation/Transition Model in Moral Reasoning Development.
ERIC Educational Resources Information Center
Walker, Lawrence J.; Gustafson, Paul; Hennig, Karl H.
2001-01-01
This longitudinal study with 62 children and adolescents examined the validity of the consolidation/transition model in the context of moral reasoning development. Results of standard statistical and Bayesian techniques supported the hypotheses regarding cyclical patterns of change and predictors of stage transition, and demonstrated the utility…
Schut, Henk; Stroebe, Margaret S.; Wilson, Stewart; Birrell, John
2016-01-01
Objective This study assessed the validity of the Indicator of Bereavement Adaptation Cruse Scotland (IBACS). Designed for use in clinical and non-clinical settings, the IBACS measures severity of grief symptoms and risk of developing complications. Method N = 196 (44 male, 152 female) help-seeking, bereaved Scottish adults participated at two timepoints: T1 (baseline) and T2 (after 18 months). Four validated assessment instruments were administered: CORE-R, ICG-R, IES-R, SCL-90-R. Discriminative ability was assessed using ROC curve analysis. Concurrent validity was tested through correlation analysis at T1. Predictive validity was assessed using correlation analyses and ROC curve analysis. Optimal IBACS cutoff values were obtained by calculating a maximal Youden index J in ROC curve analysis. Clinical implications were compared across instruments. Results ROC curve analysis results (AUC = .84, p < .01, 95% CI between .77 and .90) indicated the IBACS is a good diagnostic instrument for assessing complicated grief. Positive correlations (p < .01, 2-tailed) with all four instruments at T1 demonstrated the IBACS' concurrent validity, strongest with complicated grief measures (r = .82). Predictive validity was shown to be fair in T2 ROC curve analysis results (n = 67, AUC = .78, 95% CI between .65 and .92; p < .01). Predictive validity was also supported by stable positive correlations between IBACS and other instruments at T2. Clinical indications were found not to differ across instruments. Conclusions The IBACS offers effective grief symptom and risk assessment for use by non-clinicians. Indications are sufficient to support intake assessment for a stepped model of bereavement intervention. PMID:27741246
MIXING STUDY FOR JT-71/72 TANKS
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lee, S.
2013-11-26
All modeling calculations for the mixing operations of miscible fluids contained in HBLine tanks, JT-71/72, were performed by taking a three-dimensional Computational Fluid Dynamics (CFD) approach. The CFD modeling results were benchmarked against the literature results and the previous SRNL test results to validate the model. Final performance calculations were performed by using the validated model to quantify the mixing time for the HB-Line tanks. The mixing study results for the JT-71/72 tanks show that, for the cases modeled, the mixing time required for blending of the tank contents is no more than 35 minutes, which is well below 2.5more » hours of recirculation pump operation. Therefore, the results demonstrate the adequacy of 2.5 hours’ mixing time of the tank contents by one recirculation pump to get well mixed.« less
High Precision Optical Observations of Space Debris in the Geo Ring from Venezuela
NASA Astrophysics Data System (ADS)
Lacruz, E.; Abad, C.; Downes, J. J.; Casanova, D.; Tresaco, E.
2018-01-01
We present preliminary results to demonstrate that our method for detection and location of Space Debris (SD) in the geostationary Earth orbit (GEO) ring, based on observations at the OAN of Venezuela is of high astrometric precision. A detailed explanation of the method, its validation and first results is available in (Lacruz et al. 2017).
A Novel Cost Based Model for Energy Consumption in Cloud Computing
Horri, A.; Dastghaibyfard, Gh.
2015-01-01
Cloud data centers consume enormous amounts of electrical energy. To support green cloud computing, providers also need to minimize cloud infrastructure energy consumption while conducting the QoS. In this study, for cloud environments an energy consumption model is proposed for time-shared policy in virtualization layer. The cost and energy usage of time-shared policy were modeled in the CloudSim simulator based upon the results obtained from the real system and then proposed model was evaluated by different scenarios. In the proposed model, the cache interference costs were considered. These costs were based upon the size of data. The proposed model was implemented in the CloudSim simulator and the related simulation results indicate that the energy consumption may be considerable and that it can vary with different parameters such as the quantum parameter, data size, and the number of VMs on a host. Measured results validate the model and demonstrate that there is a tradeoff between energy consumption and QoS in the cloud environment. Also, measured results validate the model and demonstrate that there is a tradeoff between energy consumption and QoS in the cloud environment. PMID:25705716
A novel cost based model for energy consumption in cloud computing.
Horri, A; Dastghaibyfard, Gh
2015-01-01
Cloud data centers consume enormous amounts of electrical energy. To support green cloud computing, providers also need to minimize cloud infrastructure energy consumption while conducting the QoS. In this study, for cloud environments an energy consumption model is proposed for time-shared policy in virtualization layer. The cost and energy usage of time-shared policy were modeled in the CloudSim simulator based upon the results obtained from the real system and then proposed model was evaluated by different scenarios. In the proposed model, the cache interference costs were considered. These costs were based upon the size of data. The proposed model was implemented in the CloudSim simulator and the related simulation results indicate that the energy consumption may be considerable and that it can vary with different parameters such as the quantum parameter, data size, and the number of VMs on a host. Measured results validate the model and demonstrate that there is a tradeoff between energy consumption and QoS in the cloud environment. Also, measured results validate the model and demonstrate that there is a tradeoff between energy consumption and QoS in the cloud environment.
Hales, M.; Biros, E.
2015-01-01
Background: Since 1982, the International Standards for Neurological Classification of Spinal Cord Injury (ISNCSCI) has been used to classify sensation of spinal cord injury (SCI) through pinprick and light touch scores. The absence of proprioception, pain, and temperature within this scale creates questions about its validity and accuracy. Objectives: To assess whether the sensory component of the ISNCSCI represents a reliable and valid measure of classification of SCI. Methods: A systematic review of studies examining the reliability and validity of the sensory component of the ISNCSCI published between 1982 and February 2013 was conducted. The electronic databases MEDLINE via Ovid, CINAHL, PEDro, and Scopus were searched for relevant articles. A secondary search of reference lists was also completed. Chosen articles were assessed according to the Oxford Centre for Evidence-Based Medicine hierarchy of evidence and critically appraised using the McMasters Critical Review Form. A statistical analysis was conducted to investigate the variability of the results given by reliability studies. Results: Twelve studies were identified: 9 reviewed reliability and 3 reviewed validity. All studies demonstrated low levels of evidence and moderate critical appraisal scores. The majority of the articles (~67%; 6/9) assessing the reliability suggested that training was positively associated with better posttest results. The results of the 3 studies that assessed the validity of the ISNCSCI scale were confounding. Conclusions: Due to the low to moderate quality of the current literature, the sensory component of the ISNCSCI requires further revision and investigation if it is to be a useful tool in clinical trials. PMID:26363591
Validity and reliability of the robotic objective structured assessment of technical skills
Siddiqui, Nazema Y.; Galloway, Michael L.; Geller, Elizabeth J.; Green, Isabel C.; Hur, Hye-Chun; Langston, Kyle; Pitter, Michael C.; Tarr, Megan E.; Martino, Martin A.
2015-01-01
Objective Objective structured assessments of technical skills (OSATS) have been developed to measure the skill of surgical trainees. Our aim was to develop an OSATS specifically for trainees learning robotic surgery. Study Design This is a multi-institutional study in eight academic training programs. We created an assessment form to evaluate robotic surgical skill through five inanimate exercises. Obstetrics/gynecology, general surgery, and urology residents, fellows, and faculty completed five robotic exercises on a standard training model. Study sessions were recorded and randomly assigned to three blinded judges who scored performance using the assessment form. Construct validity was evaluated by comparing scores between participants with different levels of surgical experience; inter- and intra-rater reliability were also assessed. Results We evaluated 83 residents, 9 fellows, and 13 faculty, totaling 105 participants; 88 (84%) were from obstetrics/gynecology. Our assessment form demonstrated construct validity, with faculty and fellows performing significantly better than residents (mean scores: 89 ± 8 faculty; 74 ± 17 fellows; 59 ± 22 residents, p<0.01). In addition, participants with more robotic console experience scored significantly higher than those with fewer prior console surgeries (p<0.01). R-OSATS demonstrated good inter-rater reliability across all five drills (mean Cronbach's α: 0.79 ± 0.02). Intra-rater reliability was also high (mean Spearman's correlation: 0.91 ± 0.11). Conclusions We developed an assessment form for robotic surgical skill that demonstrates construct validity, inter- and intra-rater reliability. When paired with standardized robotic skill drills this form may be useful to distinguish between levels of trainee performance. PMID:24807319
The development and initial validation of the Decent Work Scale.
Duffy, Ryan D; Allan, Blake A; England, Jessica W; Blustein, David L; Autin, Kelsey L; Douglass, Richard P; Ferreira, Joaquim; Santos, Eduardo J R
2017-03-01
Decent work is positioned as the centerpiece of the recently developed Psychology of Working Theory (PWT; Duffy, Blustein, Diemer, & Autin, 2016). However, to date, no instrument exists which assesses all 5 components of decent work from a psychological perspective. In the current study, we developed the Decent Work Scale (DWS) and demonstrated several aspects of validity with 2 samples of working adults. In Study 1 (N = 275), a large pool of items were developed and exploratory factor analysis was conducted resulting in a final 15-item scale with 5 factors/subscales corresponding to the 5 components of decent work: (a) physically and interpersonally safe working conditions, (b) access to health care, (c) adequate compensation, (d) hours that allow for free time and rest, and (e) organizational values that complement family and social values. In Study 2 (N = 589), confirmatory factor analysis demonstrated that a 5-factor, bifactor model offered the strongest and most parsimonious fit to the data. Configural, metric, and scalar invariance models were tested demonstrating that the structure of the instrument did not differ across gender, income, social class, and majority/minority racial/ethnic groups. Finally, the overall scale score and 5 subscale scores correlated in the expected directions with similar constructs supporting convergent and discriminant evidence of validity, and subscale scores evidenced predictive validity in the prediction of job satisfaction, work meaning, and withdrawal intentions. The development of this scale provides a useful tool for researchers and practitioners seeking to assess the attainment of decent work among employed adults. (PsycINFO Database Record (c) 2017 APA, all rights reserved).
Skeletal age assessment in children using an open compact MRI system.
Terada, Yasuhiko; Kono, Saki; Tamada, Daiki; Uchiumi, Tomomi; Kose, Katsumi; Miyagi, Ryo; Yamabe, Eiko; Yoshioka, Hiroshi
2013-06-01
MRI may be a noninvasive and alternative tool for skeletal age assessment in children, although few studies have reported on this topic. In this article, skeletal age was assessed over a wide range of ages using an open, compact MRI optimized for the imaging of a child's hand and wrist, and its validity was evaluated. MR images and their three-dimensional segmentation visualized detailed skeletal features of each bone in the hand and wrist. Skeletal age was then independently scored from the MR images by two raters, according to the Tanner-Whitehouse Japan system. The skeletal age assessed by MR rating demonstrated a strong positive correlation with chronological age. The intrarater and inter-rater reproducibilities were significantly high. These results demonstrate the validity and reliability of skeletal age assessment using MRI. Copyright © 2012 Wiley Periodicals, Inc.
A Valid Demonstration of the Missing Fundamental Illusion.
ERIC Educational Resources Information Center
Larsen, Janet D.; Fritsch, Klaus
1998-01-01
Identifies the "missing fundamental illusion" as that which occurs when two tones are heard together and the listener hears a third tone with a pitch corresponding to the difference in their frequencies. Describes an inexpensive and valid demonstration of the missing fundamental using a British police whistle. (MJP)
Fixed gain and adaptive techniques for rotorcraft vibration control
NASA Technical Reports Server (NTRS)
Roy, R. H.; Saberi, H. A.; Walker, R. A.
1985-01-01
The results of an analysis effort performed to demonstrate the feasibility of employing approximate dynamical models and frequency shaped cost functional control law desgin techniques for helicopter vibration suppression are presented. Both fixed gain and adaptive control designs based on linear second order dynamical models were implemented in a detailed Rotor Systems Research Aircraft (RSRA) simulation to validate these active vibration suppression control laws. Approximate models of fuselage flexibility were included in the RSRA simulation in order to more accurately characterize the structural dynamics. The results for both the fixed gain and adaptive approaches are promising and provide a foundation for pursuing further validation in more extensive simulation studies and in wind tunnel and/or flight tests.
Validation of a clinical critical thinking skills test in nursing
2015-01-01
Purpose: The purpose of this study was to develop a revised version of the clinical critical thinking skills test (CCTS) and to subsequently validate its performance. Methods: This study is a secondary analysis of the CCTS. Data were obtained from a convenience sample of 284 college students in June 2011. Thirty items were analyzed using item response theory and test reliability was assessed. Test-retest reliability was measured using the results of 20 nursing college and graduate school students in July 2013. The content validity of the revised items was analyzed by calculating the degree of agreement between instrument developer intention in item development and the judgments of six experts. To analyze response process validity, qualitative data related to the response processes of nine nursing college students obtained through cognitive interviews were analyzed. Results: Out of initial 30 items, 11 items were excluded after the analysis of difficulty and discrimination parameter. When the 19 items of the revised version of the CCTS were analyzed, levels of item difficulty were found to be relatively low and levels of discrimination were found to be appropriate or high. The degree of agreement between item developer intention and expert judgments equaled or exceeded 50%. Conclusion: From above results, evidence of the response process validity was demonstrated, indicating that subjects respondeds as intended by the test developer. The revised 19-item CCTS was found to have sufficient reliability and validity and will therefore represents a more convenient measurement of critical thinking ability. PMID:25622716
VALFAST: Secure Probabilistic Validation of Hundreds of Kepler Planet Candidates
NASA Astrophysics Data System (ADS)
Morton, Tim; Petigura, E.; Johnson, J. A.; Howard, A.; Marcy, G. W.; Baranec, C.; Law, N. M.; Riddle, R. L.; Ciardi, D. R.; Robo-AO Team
2014-01-01
The scope, scale, and tremendous success of the Kepler mission has necessitated the rapid development of probabilistic validation as a new conceptual framework for analyzing transiting planet candidate signals. While several planet validation methods have been independently developed and presented in the literature, none has yet come close to addressing the entire Kepler survey. I present the results of applying VALFAST---a planet validation code based on the methodology described in Morton (2012)---to every Kepler Object of Interest. VALFAST is unique in its combination of detail, completeness, and speed. Using the transit light curve shape, realistic population simulations, and (optionally) diverse follow-up observations, it calculates the probability that a transit candidate signal is the result of a true transiting planet or any of a number of astrophysical false positive scenarios, all in just a few minutes on a laptop computer. In addition to efficiently validating the planetary nature of hundreds of new KOIs, this broad application of VALFAST also demonstrates its ability to reliably identify likely false positives. This extensive validation effort is also the first to incorporate data from all of the largest Kepler follow-up observing efforts: the CKS survey of ~1000 KOIs with Keck/HIRES, the Robo-AO survey of >1700 KOIs, and high-resolution images obtained through the Kepler Follow-up Observing Program. In addition to enabling the core science that the Kepler mission was designed for, this methodology will be critical to obtain statistical results from future surveys such as TESS and PLATO.
NASA Astrophysics Data System (ADS)
Singh, Sarabjeet; Howard, Carl Q.; Hansen, Colin H.; Köpke, Uwe G.
2018-03-01
In this paper, numerically modelled vibration response of a rolling element bearing with a localised outer raceway line spall is presented. The results were obtained from a finite element (FE) model of the defective bearing solved using an explicit dynamics FE software package, LS-DYNA. Time domain vibration signals of the bearing obtained directly from the FE modelling were processed further to estimate time-frequency and frequency domain results, such as spectrogram and power spectrum, using standard signal processing techniques pertinent to the vibration-based monitoring of rolling element bearings. A logical approach to analyses of the numerically modelled results was developed with an aim to presenting the analytical validation of the modelled results. While the time and frequency domain analyses of the results show that the FE model generates accurate bearing kinematics and defect frequencies, the time-frequency analysis highlights the simulation of distinct low- and high-frequency characteristic vibration signals associated with the unloading and reloading of the rolling elements as they move in and out of the defect, respectively. Favourable agreement of the numerical and analytical results demonstrates the validation of the results from the explicit FE modelling of the bearing.
A systematic review of validated sinus surgery simulators.
Stew, B; Kao, S S-T; Dharmawardana, N; Ooi, E H
2018-06-01
Simulation provides a safe and effective opportunity to develop surgical skills. A variety of endoscopic sinus surgery (ESS) simulators has been described in the literature. Validation of these simulators allows for effective utilisation in training. To conduct a systematic review of the published literature to analyse the evidence for validated ESS simulation. Pubmed, Embase, Cochrane and Cinahl were searched from inception of the databases to 11 January 2017. Twelve thousand five hundred and sixteen articles were retrieved of which 10 112 were screened following the removal of duplicates. Thirty-eight full-text articles were reviewed after meeting search criteria. Evidence of face, content, construct, discriminant and predictive validity was extracted. Twenty articles were included in the analysis describing 12 ESS simulators. Eleven of these simulators had undergone validation: 3 virtual reality, 7 physical bench models and 1 cadaveric simulator. Seven of the simulators were shown to have face validity, 7 had construct validity and 1 had predictive validity. None of the simulators demonstrated discriminate validity. This systematic review demonstrates that a number of ESS simulators have been comprehensively validated. Many of the validation processes, however, lack standardisation in outcome reporting, thus limiting a meta-analysis comparison between simulators. © 2017 John Wiley & Sons Ltd.
Gower, Jared R; Moyer-Mileur, Laurie J; Wilkinson, Robert D; Slater, Hillarie; Jordan, Kristine C
2010-03-01
Limited surveys are available to assess the nutrition knowledge of children. The goals of this study were to test the validity and reliability of a computer nutrition knowledge survey for elementary school students and to evaluate the impact of the "Fit Kids 'r' Healthy Kids" nutrition intervention via the knowledge survey. During survey development, a sample (n=12) of health educators, elementary school teachers, and registered dietitians assessed the survey. The target population consisted of first- through fourth-grade students from Salt Lake City, UT, metropolitan area schools. Participants were divided into reliability (n=68), intervention (n=74), and control groups (n=59). The reliability group took the survey twice (2 weeks apart); the intervention and control groups also took the survey twice, but at pre- and post-intervention (4 weeks later). Only students from the intervention group participated in four weekly nutrition classes. Reliability was assessed by Pearson's correlation coefficients for knowledge scores. Results demonstrated appropriate content validity, as indicated by expert peer ratings. Test-retest reliability correlations were found to be significant for the overall survey (r=0.54; P<0.001) and for all subscales: food groups, healthful foods, and food functions (r=0.51, 0.65, and 0.49, respectively; P<0.001). Nutrition knowledge was assessed upon program completion with paired samples t tests. Students from the intervention group demonstrated improvement in nutrition knowledge (12.2+/-1.9 to 13.5+/-1.6; P<0.001), while scores for the control group remained unchanged. The difference in total scores from pre- to post-intervention between the two groups was significant (P<0.001). These results suggest that the computerized nutrition survey demonstrated content validity and test-retest reliability for first- through fourth-grade elementary school children. Also, the study results imply that the Fit Kids 'r' Healthy Kids intervention promoted gains in nutrition knowledge. Overall, the computer survey shows promise as an appealing medium for assessing nutrition knowledge in children. Copyright 2010 American Dietetic Association. Published by Elsevier Inc. All rights reserved.
McAllister, Sue; Lincoln, Michelle; Ferguson, Allison; McAllister, Lindy
2013-01-01
Valid assessment of health science students' ability to perform in the real world of workplace practice is critical for promoting quality learning and ultimately certifying students as fit to enter the world of professional practice. Current practice in performance assessment in the health sciences field has been hampered by multiple issues regarding assessment content and process. Evidence for the validity of scores derived from assessment tools are usually evaluated against traditional validity categories with reliability evidence privileged over validity, resulting in the paradoxical effect of compromising the assessment validity and learning processes the assessments seek to promote. Furthermore, the dominant statistical approaches used to validate scores from these assessments fall under the umbrella of classical test theory approaches. This paper reports on the successful national development and validation of measures derived from an assessment of Australian speech pathology students' performance in the workplace. Validation of these measures considered each of Messick's interrelated validity evidence categories and included using evidence generated through Rasch analyses to support score interpretation and related action. This research demonstrated that it is possible to develop an assessment of real, complex, work based performance of speech pathology students, that generates valid measures without compromising the learning processes the assessment seeks to promote. The process described provides a model for other health professional education programs to trial.
Reliability and Validity of the Korean Version of the Cancer Stigma Scale.
So, Hyang Sook; Chae, Myeong Jeong; Kim, Hye Young
2017-02-01
In this study the reliability and validity of the Korean version of the Cancer Stigma Scale (KCSS) was evaluated. The KCSS was formed through translation and modification of Cataldo Lung Cancer Stigma Scale. The KCSS, Psychological Symptom Inventory (PSI), and European Organization for Research and Treatment of Cancer Quality of Life Questionnaire - Core 30 (EORTC QLQ-C30) were administered to 247 men and women diagnosed with one of the five major cancers. Construct validity, item convergent and discriminant validity, concurrent validity, known-group validity, and internal consistency reliability of the KCSS were evaluated. Exploratory factor analysis supported the construct validity with a six-factor solution; that explained 65.7% of the total variance. The six-factor model was validated by confirmatory factor analysis (Q (χ²/df)= 2.28, GFI=.84, AGFI=.81, NFI=.80, TLI=.86, RMR=.03, and RMSEA=.07). Concurrent validity was demonstrated with the QLQ-C30 (global: r=-.44; functional: r=-.19; symptom: r=.42). The KCSS had known-group validity. Cronbach's alpha coefficient for the 24 items was .89. The results of this study suggest that the 24-item KCSS has relatively acceptable reliability and validity and can be used in clinical research to assess cancer stigma and its impacts on health-related quality of life in Korean cancer patients. © 2017 Korean Society of Nursing Science
Translation, Cultural Adaptation and Validation of the Simple Shoulder Test to Spanish
Arcuri, Francisco; Barclay, Fernando; Nacul, Ivan
2015-01-01
Background: The validation of widely used scales facilitates the comparison across international patient samples. Objective: The objective was to translate, culturally adapt and validate the Simple Shoulder Test into Argentinian Spanish. Methods: The Simple Shoulder Test was translated from English into Argentinian Spanish by two independent translators, translated back into English and evaluated for accuracy by an expert committee to correct the possible discrepancies. It was then administered to 50 patients with different shoulder conditions.Psycometric properties were analyzed including internal consistency, measured with Cronbach´s Alpha, test-retest reliability at 15 days with the interclass correlation coefficient. Results: The internal consistency, validation, was an Alpha of 0,808, evaluated as good. The test-retest reliability index as measured by intra-class correlation coefficient (ICC) was 0.835, evaluated as excellent. Conclusion: The Simple Shoulder Test translation and it´s cultural adaptation to Argentinian-Spanish demonstrated adequate internal reliability and validity, ultimately allowing for its use in the comparison with international patient samples.
Development and Validation of a Measure of Quality of Life for the Young Elderly in Sri Lanka.
de Silva, Sudirikku Hennadige Padmal; Jayasuriya, Anura Rohan; Rajapaksa, Lalini Chandika; de Silva, Ambepitiyawaduge Pubudu; Barraclough, Simon
2016-01-01
Sri Lanka has one of the fastest aging populations in the world. Measurement of quality of life (QoL) in the elderly needs instruments developed that encompass the sociocultural settings. An instrument was developed to measure QoL in the young elderly in Sri Lanka (QLI-YES), using accepted methods to generate and reduce items. The measure was validated using a community sample. Construct, criterion and predictive validity and reliability were tested. A first-order model of 24 items with 6 domains was found to have good fit indices (CMIN/df = 1.567, RMR = 0.05, CFI = 0.95, and RMSEA = 0.053). Both criterion and predictive validity were demonstrated. Good internal consistency reliability (Cronbach's α = 0.93) was shown. The development of the QLI-YES using a societal perspective relevant to the social and cultural beliefs has resulted in a robust and valid instrument to measure QoL for the young elderly in Sri Lanka. © 2015 APJPH.
Jung, Kyoung-Sim; Jung, Jin-Hwa; In, Tae-Sung; Cho, Hwi-Young
2016-09-01
[Purpose] The purpose of this study was to establish the reliability and validity of the Short Musculoskeletal Function Assessment questionnaire, which was translated into Korean, for patients with musculoskeletal disorder. [Subjects and Methods] Fifty-five subjects (26 males and 29 females) with musculoskeletal diseases participated in the study. The Short Musculoskeletal Function Assessment questionnaire focuses on a limited range of physical functions and includes a dysfunction index and a bother index. Reliability was determined using the intraclass correlation coefficient, and validity was examined by correlating short musculoskeletal function assessment scores with the 36-item Short-Form Health Survey (SF-36) score. [Results] The reliability was 0.97 for the dysfunction index and 0.94 for the bother index. Validity was established by comparison with Korean version of the SF-36. [Conclusion] This study demonstrated that the Korean version of the Short Musculoskeletal Function Assessment questionnaire is a reliable and valid instrument for the assessment of musculoskeletal disorders.
A Comprehensive Validation Methodology for Sparse Experimental Data
NASA Technical Reports Server (NTRS)
Norman, Ryan B.; Blattnig, Steve R.
2010-01-01
A comprehensive program of verification and validation has been undertaken to assess the applicability of models to space radiation shielding applications and to track progress as models are developed over time. The models are placed under configuration control, and automated validation tests are used so that comparisons can readily be made as models are improved. Though direct comparisons between theoretical results and experimental data are desired for validation purposes, such comparisons are not always possible due to lack of data. In this work, two uncertainty metrics are introduced that are suitable for validating theoretical models against sparse experimental databases. The nuclear physics models, NUCFRG2 and QMSFRG, are compared to an experimental database consisting of over 3600 experimental cross sections to demonstrate the applicability of the metrics. A cumulative uncertainty metric is applied to the question of overall model accuracy, while a metric based on the median uncertainty is used to analyze the models from the perspective of model development by analyzing subsets of the model parameter space.
Attitudes Toward Transgender Men and Women: Development and Validation of a New Measure
Billard, Thomas J
2018-01-01
A series of three studies were conducted to generate, develop, and validate the Attitudes toward Transgender Men and Women (ATTMW) scale. In Study 1, 120 American adults responded to an open-ended questionnaire probing various dimensions of their perceptions of transgender individuals and identity. Qualitative thematic analysis generated 200 items based on their responses. In Study 2, 238 American adults completed a questionnaire consisting of the generated items. Exploratory factor analysis (EFA) revealed two non-identical 12-item subscales (ATTM and ATTW) of the full 24-item scale. In Study 3, 150 undergraduate students completed a survey containing the ATTMW and a number of validity-testing variables. Confirmatory factor analysis (CFA) verified the single-factor structures of the ATTM and ATTW subscales, and the convergent, discriminant, predictive, and concurrent validities of the ATTMW were also established. Together, our results demonstrate that the ATTMW is a reliable and valid measure of attitudes toward transgender individuals. PMID:29666595
Development and Validation of Triarchic Construct Scales from the Psychopathic Personality Inventory
Hall, Jason R.; Drislane, Laura E.; Patrick, Christopher J.; Morano, Mario; Lilienfeld, Scott O.; Poythress, Norman G.
2014-01-01
The Triarchic model of psychopathy describes this complex condition in terms of distinct phenotypic components of boldness, meanness, and disinhibition. Brief self-report scales designed specifically to index these psychopathy facets have thus far demonstrated promising construct validity. The present study sought to develop and validate scales for assessing facets of the Triarchic model using items from a well-validated existing measure of psychopathy—the Psychopathic Personality Inventory (PPI). A consensus rating approach was used to identify PPI items relevant to each Triarchic facet, and the convergent and discriminant validity of the resulting PPI-based Triarchic scales were evaluated in relation to multiple criterion variables (i.e., other psychopathy inventories, antisocial personality disorder features, personality traits, psychosocial functioning) in offender and non-offender samples. The PPI-based Triarchic scales showed good internal consistency and related to criterion variables in ways consistent with predictions based on the Triarchic model. Findings are discussed in terms of implications for conceptualization and assessment of psychopathy. PMID:24447280
Hall, Jason R; Drislane, Laura E; Patrick, Christopher J; Morano, Mario; Lilienfeld, Scott O; Poythress, Norman G
2014-06-01
The Triarchic model of psychopathy describes this complex condition in terms of distinct phenotypic components of boldness, meanness, and disinhibition. Brief self-report scales designed specifically to index these psychopathy facets have thus far demonstrated promising construct validity. The present study sought to develop and validate scales for assessing facets of the Triarchic model using items from a well-validated existing measure of psychopathy-the Psychopathic Personality Inventory (PPI). A consensus-rating approach was used to identify PPI items relevant to each Triarchic facet, and the convergent and discriminant validity of the resulting PPI-based Triarchic scales were evaluated in relation to multiple criterion variables (i.e., other psychopathy inventories, antisocial personality disorder features, personality traits, psychosocial functioning) in offender and nonoffender samples. The PPI-based Triarchic scales showed good internal consistency and related to criterion variables in ways consistent with predictions based on the Triarchic model. Findings are discussed in terms of implications for conceptualization and assessment of psychopathy.
Individualism: a valid and important dimension of cultural differences between nations.
Schimmack, Ulrich; Oishi, Shigehiro; Diener, Ed
2005-01-01
Oyserman, Coon, and Kemmelmeier's (2002) meta-analysis suggested problems in the measurement of individualism and collectivism. Studies using Hofstede's individualism scores show little convergent validity with more recent measures of individualism and collectivism. We propose that the lack of convergent validity is due to national differences in response styles. Whereas Hofstede statistically controlled for response styles, Oyserman et al.'s meta-analysis relied on uncorrected ratings. Data from an international student survey demonstrated convergent validity between Hofstede's individualism dimension and horizontal individualism when response styles were statistically controlled, whereas uncorrected scores correlated highly with the individualism scores in Oyserman et al.'s meta-analysis. Uncorrected horizontal individualism scores and meta-analytic individualism scores did not correlate significantly with nations' development, whereas corrected horizontal individualism scores and Hofstede's individualism dimension were significantly correlated with development. This pattern of results suggests that individualism is a valid construct for cross-cultural comparisons, but that the measurement of this construct needs improvement.
Reliability and Validity of the Lichtenberg Financial Decision Screening Scale.
Lichtenberg, Peter A; Teresi, Jeanne A; Ocepek-Welikson, Katja; Eimicke, Joseph P
2017-03-01
The scarcity of empirically validated assessment instruments continues to impede the work of professionals in a number of fields, including medicine, finance, and estate planning; adult protective services; and criminal justice-and, more importantly, it impedes their ability to effectively assist and, in some case, protect their clients. Other professionals (e.g. legal, financial, medical, mental health services) are in a position to prevent financial exploitation and would benefit from access to new instruments. The Lichtenberg Financial Decision Screening Scale (LFDSS) was introduced in 2016, along with evidence for its convergent validity (Lichtenberg et al., 2016). Using a sample of 213 participants, this study investigated the internal consistency of the LFDSS and its criterion validity based on ratings by professionals using the scale. Results demonstrate that the LFDSS has excellent internal consistency and clinical utility properties. This paper provides support for use of the LFDSS as a reliable and valid instrument. The LFDSS and instructions for its use are included in the article, along with information about online tools and support.
Development and Validation of a Measure of Quality of Life for the Young Elderly in Sri Lanka
de Silva, Sudirikku Hennadige Padmal; Jayasuriya, Anura Rohan; Rajapaksa, Lalini Chandika; de Silva, Ambepitiyawaduge Pubudu; Barraclough, Simon
2016-01-01
Sri Lanka has one of the fastest aging populations in the world. Measurement of quality of life (QoL) in the elderly needs instruments developed that encompass the sociocultural settings. An instrument was developed to measure QoL in the young elderly in Sri Lanka (QLI-YES), using accepted methods to generate and reduce items. The measure was validated using a community sample. Construct, criterion and predictive validity and reliability were tested. A first-order model of 24 items with 6 domains was found to have good fit indices (CMIN/df = 1.567, RMR = 0.05, CFI = 0.95, and RMSEA = 0.053). Both criterion and predictive validity were demonstrated. Good internal consistency reliability (Cronbach’s α = 0.93) was shown. The development of the QLI-YES using a societal perspective relevant to the social and cultural beliefs has resulted in a robust and valid instrument to measure QoL for the young elderly in Sri Lanka. PMID:26712893
Objective validation of central sensitization in the rat UVB and heat rekindling model
Weerasinghe, NS; Lumb, BM; Apps, R; Koutsikou, S; Murrell, JC
2014-01-01
Background The UVB and heat rekindling (UVB/HR) model shows potential as a translatable inflammatory pain model. However, the occurrence of central sensitization in this model, a fundamental mechanism underlying chronic pain, has been debated. Face, construct and predictive validity are key requisites of animal models; electromyogram (EMG) recordings were utilized to objectively demonstrate validity of the rat UVB/HR model. Methods The UVB/HR model was induced on the heel of the hind paw under anaesthesia. Mechanical withdrawal thresholds (MWTs) were obtained from biceps femoris EMG responses to a gradually increasing pinch at the mid hind paw region under alfaxalone anaesthesia, 96 h after UVB irradiation. MWT was compared between UVB/HR and SHAM-treated rats (anaesthetic only). Underlying central mechanisms in the model were pharmacologically validated by MWT measurement following intrathecal N-methyl-d-aspartate (NMDA) receptor antagonist, MK-801, or saline. Results Secondary hyperalgesia was confirmed by a significantly lower pre-drug MWT {mean [±standard error of the mean (SEM)]} in UVB/HR [56.3 (±2.1) g/mm2, n = 15] compared with SHAM-treated rats [69.3 (±2.9) g/mm2, n = 8], confirming face validity of the model. Predictive validity was demonstrated by the attenuation of secondary hyperalgesia by MK-801, where mean (±SEM) MWT was significantly higher [77.2 (±5.9) g/mm2 n = 7] in comparison with pre-drug [57.8 (±3.5) g/mm2 n = 7] and saline [57.0 (±3.2) g/mm2 n = 8] at peak drug effect. The occurrence of central sensitization confirmed construct validity of the UVB/HR model. Conclusions This study used objective outcome measures of secondary hyperalgesia to validate the rat UVB/HR model as a translational model of inflammatory pain. What's already known about this topic? Most current animal chronic pain models lack translatability to human subjects. Primary hyperalgesia is an established feature of the UVB/heat rekindling inflammatory pain model in rodents and humans, but the presence of secondary hyperalgesia, a hallmark feature of central sensitization and thus chronic pain, is contentious. What does this study add? Secondary hyperalgesia was demonstrated in the rat UVB/heat rekindling model using an objective outcome measure (electromyogram), overcoming the subjective limitations of previous behavioural studies. PMID:24590815
NG, Chong Guan; CHIN, Soo Cheng; YEE, Anne Hway Ann; LOH, Huai Seng; SULAIMAN, Ahmad Hatim; Sherianne Sook Kuan, WONG; HABIL, Mohamed Hussain
2014-01-01
Background: The Snaith-Hamilton Pleasure Scale (SHAPS) is a self-assessment scale designed to evaluate anhedonia in various psychiatric disorders. In order to facilitate its use in Malaysian settings, our current study aimed to examine the validity of a Malay-translated version of the SHAPS (SHAPS-M). Methods: In this cross-sectional study, a total of 44 depressed patients and 82 healthy subjects were recruited from a university out-patient clinic. All participants were given both the Malay and English versions of the SHAPS, Fawcett-Clark Pleasure Scale (FCPS), General Health Questionnaire 12 (GHQ-12), and the Beck Depression Inventory (BDI) to assess their hedonic state, general mental health condition and levels of depression. Results: The results showed that the SHAPS-M has impressive internal consistency (α = 0.96), concurrent validity and good parallel-form reliability (intraclass coefficient, ICC = 0.65). Conclusion: In addition to demonstrating good psychometric properties, the SHAPS-M is easy to administer. Therefore, it is a valid, reliable, and suitable questionnaire for assessing anhedonia among depressed patients in Malaysia. PMID:25246837
Dynamic testing in schizophrenia: does training change the construct validity of a test?
Wiedl, Karl H; Schöttke, Henning; Green, Michael F; Nuechterlein, Keith H
2004-01-01
Dynamic testing typically involves specific interventions for a test to assess the extent to which test performance can be modified, beyond level of baseline (static) performance. This study used a dynamic version of the Wisconsin Card Sorting Test (WCST) that is based on cognitive remediation techniques within a test-training-test procedure. From results of previous studies with schizophrenia patients, we concluded that the dynamic and static versions of the WCST should have different construct validity. This hypothesis was tested by examining the patterns of correlations with measures of executive functioning, secondary verbal memory, and verbal intelligence. Results demonstrated a specific construct validity of WCST dynamic (i.e., posttest) scores as an index of problem solving (Tower of Hanoi) and secondary verbal memory and learning (Auditory Verbal Learning Test), whereas the impact of general verbal capacity and selective attention (Verbal IQ, Stroop Test) was reduced. It is concluded that the construct validity of the test changes with dynamic administration and that this difference helps to explain why the dynamic version of the WCST predicts functional outcome better than the static version.
Apostol, Izydor; Kelner, Drew; Jiang, Xinzhao Grace; Huang, Gang; Wypych, Jette; Zhang, Xin; Gastwirt, Jessica; Chen, Kenneth; Fodor, Szilan; Hapuarachchi, Suminda; Meriage, Dave; Ye, Frank; Poppe, Leszek; Szpankowski, Wojciech
2012-12-01
To predict precision and other performance characteristics of chromatographic purity methods, which represent the most widely used form of analysis in the biopharmaceutical industry. We have conducted a comprehensive survey of purity methods, and show that all performance characteristics fall within narrow measurement ranges. This observation was used to develop a model called Uncertainty Based on Current Information (UBCI), which expresses these performance characteristics as a function of the signal and noise levels, hardware specifications, and software settings. We applied the UCBI model to assess the uncertainty of purity measurements, and compared the results to those from conventional qualification. We demonstrated that the UBCI model is suitable to dynamically assess method performance characteristics, based on information extracted from individual chromatograms. The model provides an opportunity for streamlining qualification and validation studies by implementing a "live validation" of test results utilizing UBCI as a concurrent assessment of measurement uncertainty. Therefore, UBCI can potentially mitigate the challenges associated with laborious conventional method validation and facilitates the introduction of more advanced analytical technologies during the method lifecycle.
Monzani, Dario; Steca, Patrizia; Greco, Andrea
2014-02-01
Dispositional optimism is an individual difference promoting psychosocial adjustment and well-being during adolescence. Dispositional optimism was originally defined as a one-dimensional construct; however, empirical evidence suggests two correlated factors in the Life Orientation Test - Revised (LOT-R). The main aim of the study was to evaluate the dimensionality of the LOT-R. This study is the first attempt to identify the best factor structure, comparing congeneric, two correlated-factor, and two orthogonal-factor models in a sample of adolescents. Concurrent validity was also assessed. The results demonstrated the superior fit of the two orthogonal-factor model thus reconciling the one-dimensional definition of dispositional optimism with the bi-dimensionality of the LOT-R. Moreover, the results of correlational analyses proved the concurrent validity of this self-report measure: optimism is moderately related to indices of psychosocial adjustment and well-being. Thus, the LOT-R is a useful, valid, and reliable self-report measure to properly assess optimism in adolescence. Copyright © 2013 The Foundation for Professionals in Services for Adolescents. Published by Elsevier Ltd. All rights reserved.
Aristotle Meets Zeno: Psychophysiological Evidence
Papageorgiou, Charalabos; Stachtea, Xanthi; Papageorgiou, Panos; Alexandridis, Antonio T.
2016-01-01
This study, a tribute to Aristotle's 2400 years, used a juxtaposition of valid Aristotelian arguments to the paradoxes formulated by Zeno the Eleatic, in order to investigate the electrophysiological correlates of attentional and /or memory processing effects in the course of deductive reasoning. Participants undertook reasoning tasks based on visually presented arguments which were either (a) valid (Aristotelian) statements or (b) paradoxes. We compared brain activation patterns while participants maintained the premises / conclusions of either the valid statements or the paradoxes in working memory (WM). Event-related brain potentials (ERPs), specifically the P300 component of ERPs, were recorded during the WM phase, during which participants were required to draw a logical conclusion regarding the correctness of the valid syllogisms or the paradoxes. During the processing of paradoxes, results demonstrated a more positive event-related potential deflection (P300) across frontal regions, whereas processing of valid statements was associated with noticeable P300 amplitudes across parieto-occipital regions. These findings suggest that paradoxes mobilize frontal attention mechanisms, while valid deduction promotes parieto-occipital activity associated with attention and/or subsequent memory processing. PMID:28033333
Aristotle Meets Zeno: Psychophysiological Evidence.
Papageorgiou, Charalabos; Stachtea, Xanthi; Papageorgiou, Panos; Alexandridis, Antonio T; Tsaltas, Eleftheria; Angelopoulos, Elias
2016-01-01
This study, a tribute to Aristotle's 2400 years, used a juxtaposition of valid Aristotelian arguments to the paradoxes formulated by Zeno the Eleatic, in order to investigate the electrophysiological correlates of attentional and /or memory processing effects in the course of deductive reasoning. Participants undertook reasoning tasks based on visually presented arguments which were either (a) valid (Aristotelian) statements or (b) paradoxes. We compared brain activation patterns while participants maintained the premises / conclusions of either the valid statements or the paradoxes in working memory (WM). Event-related brain potentials (ERPs), specifically the P300 component of ERPs, were recorded during the WM phase, during which participants were required to draw a logical conclusion regarding the correctness of the valid syllogisms or the paradoxes. During the processing of paradoxes, results demonstrated a more positive event-related potential deflection (P300) across frontal regions, whereas processing of valid statements was associated with noticeable P300 amplitudes across parieto-occipital regions. These findings suggest that paradoxes mobilize frontal attention mechanisms, while valid deduction promotes parieto-occipital activity associated with attention and/or subsequent memory processing.
Junghaenel, Doerte U; Schneider, Stefan; Stone, Arthur A; Christodoulou, Christopher; Broderick, Joan E
2014-04-01
This study examined the ecological validity and clinical utility of NIH Patient Reported-Outcomes Measurement Information System (PROMIS®) instruments for anger, depression, and fatigue in women with premenstrual symptoms. One-hundred women completed daily diaries and weekly PROMIS assessments over 4weeks. Weekly assessments were administered through Computerized Adaptive Testing (CAT). Weekly CATs and corresponding daily scores were compared to evaluate ecological validity. To test clinical utility, we examined if CATs could detect changes in symptom levels, if these changes mirrored those obtained from daily scores, and if CATs could identify clinically meaningful premenstrual symptom change. PROMIS CAT scores were higher in the pre-menstrual than the baseline (ps<.0001) and post-menstrual (ps<.0001) weeks. The correlations between CATs and aggregated daily scores ranged from .73 to .88 supporting ecological validity. Mean CAT scores showed systematic changes in accordance with the menstrual cycle and the magnitudes of the changes were similar to those obtained from the daily scores. Finally, Receiver Operating Characteristic (ROC) analyses demonstrated the ability of the CATs to discriminate between women with and without clinically meaningful premenstrual symptom change. PROMIS CAT instruments for anger, depression, and fatigue demonstrated validity and utility in premenstrual symptom assessment. The results provide encouraging initial evidence of the utility of PROMIS instruments for the measurement of affective premenstrual symptoms. Copyright © 2014 Elsevier Inc. All rights reserved.
Ego-Dissolution and Psychedelics: Validation of the Ego-Dissolution Inventory (EDI).
Nour, Matthew M; Evans, Lisa; Nutt, David; Carhart-Harris, Robin L
2016-01-01
The experience of a compromised sense of "self", termed ego-dissolution, is a key feature of the psychedelic experience. This study aimed to validate the Ego-Dissolution Inventory (EDI), a new 8-item self-report scale designed to measure ego-dissolution. Additionally, we aimed to investigate the specificity of the relationship between psychedelics and ego-dissolution. Sixteen items relating to altered ego-consciousness were included in an internet questionnaire; eight relating to the experience of ego-dissolution (comprising the EDI), and eight relating to the antithetical experience of increased self-assuredness, termed ego-inflation. Items were rated using a visual analog scale. Participants answered the questionnaire for experiences with classical psychedelic drugs, cocaine and/or alcohol. They also answered the seven questions from the Mystical Experiences Questionnaire (MEQ) relating to the experience of unity with one's surroundings. Six hundred and ninety-one participants completed the questionnaire, providing data for 1828 drug experiences (1043 psychedelics, 377 cocaine, 408 alcohol). Exploratory factor analysis demonstrated that the eight EDI items loaded exclusively onto a single common factor, which was orthogonal to a second factor comprised of the items relating to ego-inflation (rho = -0.110), demonstrating discriminant validity. The EDI correlated strongly with the MEQ-derived measure of unitive experience (rho = 0.735), demonstrating convergent validity. EDI internal consistency was excellent (Cronbach's alpha 0.93). Three analyses confirmed the specificity of ego-dissolution for experiences occasioned by psychedelic drugs. Firstly, EDI score correlated with drug-dose for psychedelic drugs (rho = 0.371), but not for cocaine (rho = 0.115) or alcohol (rho = -0.055). Secondly, the linear regression line relating the subjective intensity of the experience to ego-dissolution was significantly steeper for psychedelics (unstandardized regression coefficient = 0.701) compared with cocaine (0.135) or alcohol (0.144). Ego-inflation, by contrast, was specifically associated with cocaine experiences. Finally, a binary Support Vector Machine classifier identified experiences occasioned by psychedelic drugs vs. cocaine or alcohol with over 85% accuracy using ratings of ego-dissolution and ego-inflation alone. Our results demonstrate the psychometric structure, internal consistency and construct validity of the EDI. Moreover, we demonstrate the close relationship between ego-dissolution and the psychedelic experience. The EDI will facilitate the study of the neuronal correlates of ego-dissolution, which is relevant for psychedelic-assisted psychotherapy and our understanding of psychosis.
Validity of the Neurology Quality of Life (Neuro-QoL) Measurement System in Adult Epilepsy
Victorson, David; Cavazos, Jose E.; Holmes, Gregory L.; Reder, Anthony T.; Wojna, Valerie; Nowinski, Cindy; Miller, Deborah; Buono, Sarah; Mueller, Allison; Moy, Claudia; Cella, David
2014-01-01
Epilepsy is a chronic neurological disorder that results in recurring seizures and can have a significant adverse effect on health related quality of life (HRQL). Neuro-QoL is an NINDS-funded system of patient reported outcome measures for neurology clinical research, which was designed to provide a precise and standardized way to measure HRQL in epilepsy and other neurological disorders. Using mixed-methods and item response theory-based approaches, we developed generic item banks and targeted scales for adults and children with major neurological disorders. This paper provides empirical results from a clinical validation study with a sample of adults diagnosed with epilepsy. One hundred twenty one people diagnosed with epilepsy participated, of which the majority were male (62%), Caucasian (95%), with a mean age of 47.3 (SD=16.9). Baseline assessments included Neuro-QoL short forms and general and external validity measures. Neuro-QoL short forms that are not typically found in other epilepsy-specific HRQL instruments include Stigma, Sleep Disturbance, Emotional and Behavioral Dyscontrol and Positive Affect & Well-being. Neuro-QoL short forms demonstrated adequate reliability (internal consistency range = .86–.96; test-retest range = .57–.89). Pearson correlations (p<.01) between Neuro-QoL forms of emotional distress (Anxiety, Depression, Stigma) and the QOLIE-31 Emotional Well-being Subscale were in the moderate to strong range (r’s = .66, .71 & .53, respectively), as were relations with the PROMIS Global Mental Health subscale (r’s = .59, .74 & .52, respectively). Moderate correlations were observed between Neuro-QoL Social Role Performance and Satisfaction and the QOLIE-31 Social Function (r’s = .58 & .52, respectively). In measuring aspects of physical function, the Neuro-QoL Mobility and Upper Extremity forms demonstrated moderate associations with the PROMIS Global Physical Function Subscale (r’s = .60 & .61, respectively). Neuro-QoL measures of perceived cognitive function (executive function and general concerns) produced moderate to strong correlations with the QOLIE-31 Cognition subscale (r’s = .65 & .75, respectively) and moderate relations with the Liverpool Adverse Events scale (r’s = .51 & .69, respectively). Finally, the Neuro-QoL Fatigue measure demonstrated moderate associations with the QOLIE-31 Energy/Fatigue subscale (r=−.65), Liverpool Adverse Events Scale (r=.69) and the Liverpool Seizure Severity Scale (r=.50). Five Neuro-QoL short forms demonstrated statistically significant responsiveness to change at 5–7 months, including Fatigue, Sleep Disturbance, Depression, Positive Affect & Well-being, and Emotional and Behavioral Dyscontrol. Overall, Neuro-QoL instruments showed good evidence for internal consistency, test-retest reliability, convergent validity and responsiveness to change over several months. These results support the validity of Neuro-QoL to measure HRQL in adults with epilepsy. PMID:24361767
ERIC Educational Resources Information Center
Berg, Craig; Boote, Stacy
2017-01-01
Prior graphing research has demonstrated that clinical interviews and free-response instruments produce very different results than multiple-choice instruments, indicating potential validity problems when using multiple-choice instruments to assess graphing skills (Berg & Smith in "Science Education," 78(6), 527-554, 1994). Extending…
Experimental evidence for partial spatial coherence in imaging Mueller polarimetry.
Ossikovski, Razvigor; Arteaga, Oriol; Yoo, Sang Hyuk; Garcia-Caurel, Enric; Hingerl, Kurt
2017-11-15
We demonstrate experimentally the validity of the partial spatial coherence formalism in Mueller polarimetry and show that, in a finite spatial resolution experiment, the measured response is obtained through convolving the theoretical one with the instrument function. The reported results are of primary importance for Mueller imaging systems.
77 FR 67771 - Flonicamid; Pesticide Tolerances
Federal Register 2010, 2011, 2012, 2013, 2014
2012-11-14
... one of the following methods: Federal eRulemaking Portal: http://www.regulations.gov . Follow the... validity, completeness, and reliability as well as the relationship of the results of the studies to human... TFNA-OH, also demonstrated low toxicity in acute oral toxicity studies. In the 28-day dermal study with...
Sensory Integration and Ego Development in a Schizophrenic Adolescent Male.
ERIC Educational Resources Information Center
Pettit, Karen A.
1987-01-01
A retrospective study compared hours spent by a schizophrenic adolescent in "time out" before and after initiation of treatment. The study evaluated the effects of sensory integrative treatment on the ability to handle anger and frustration. Results demonstrate the utility of statistical analysis versus visual comparison to validate effectiveness…
Validation of Biofeedback Wearables for Photoplethysmographic Heart Rate Tracking
Jo, Edward; Lewis, Kiana; Directo, Dean; Kim, Michael J.; Dolezal, Brett A.
2016-01-01
The purpose of this study was to examine the validity of HR measurements by two commercial-use activity trackers in comparison to ECG. Twenty-four healthy participants underwent the same 77-minute protocol during a single visit. Each participant completed an initial rest period of 15 minutes followed by 5 minute periods of each of the following activities: 60W and 120W cycling, walking, jogging, running, resisted arm raises, resisted lunges, and isometric plank. In between each exercise task was a 5-minute rest period. Each subject wore a Basis Peak (BPk) on one wrist and a Fitbit Charge HR (FB) on the opposite wrist. Criterion measurement of HR was administered by 12-lead ECG. Time synced data from each device and ECG were concurrently and electronically acquired throughout the entire 77-minute protocol. When examining data in aggregate, there was a strong correlation between BPk and ECG for HR (r = 0.92, p < 0.001) with a mean bias of -2.5 bpm (95% LoA 19.3, -24.4). The FB demonstrated a moderately strong correlation with ECG for HR (r = 0.83, p < 0.001) with an average mean bias of -8.8 bpm (95% LoA 24.2, -41.8). During physical efforts eliciting ECG HR > 116 bpm, the BPk demonstrated an r = 0.77 and mean bias = -4.9 bpm (95% LoA 21.3, -31.0) while the FB demonstrated an r = 0.58 and mean bias = -12.7 bpm (95% LoA 28.6, -54.0). The BPk satisfied validity criteria for HR monitors, however showed a marginal decline in accuracy with increasing physical effort (ECG HR > 116 bpm). The FB failed to satisfy validity criteria and demonstrated a substantial decrease in accuracy during higher exercise intensities. Key points Modern day wearable multi-sensor activity trackers incorporate reflective photoplethymography (PPG) for heart rate detection and monitoring at the dorsal wrist. This study examined the validity of two PPG-based activity trackers, the Basis Peak and Fitbit Charge HR. The Basis Peak performed with accuracy compared with ECG and results substantiate validation of heart rate measurements. There was a slight decrease in performance during higher levels of physical exertion. The Fitbit Charge HR performed with poor accuracy compared with ECG especially during higher physical exertion and specific exercise tasks. The Fitbit Charge HR was not validated for heart rate monitoring, although better accuracy was observed during resting or recovery conditions. PMID:27803634
Vlachopoulos, Symeon P; Gigoudi, Maria A
2008-07-01
This article reports on the development and initial validation of the Amotivation Toward Exercise Scale (ATES), which reflects a taxonomy of older adults' reasons to refrain from exercise. Drawing on work by Pelletier, Dion, Tuson, and Green-Demers (1999) and Legault, Green-Demers, and Pelletier (2006), these dimensions were the outcome beliefs, capacity beliefs, effort beliefs, and value amotivation beliefs toward exercise. The results supported a 4-factor correlated model that fit the data better than either a unidimensional model or a 4-factor uncorrelated model or a hierarchical model with strong internal reliability for all the subscales. Evidence also emerged for the discriminant validity of the subscale scores. Furthermore, the predictive validity of the subscale scores was supported, and satisfactory measurement invariance was demonstrated across the calibration and validation samples, supporting the generalizability of the scale's measurement properties.
NASA Astrophysics Data System (ADS)
Morton, Timothy D.; Bryson, Stephen T.; Coughlin, Jeffrey L.; Rowe, Jason F.; Ravichandran, Ganesh; Petigura, Erik A.; Haas, Michael R.; Batalha, Natalie M.
2016-05-01
We present astrophysical false positive probability calculations for every Kepler Object of Interest (KOI)—the first large-scale demonstration of a fully automated transiting planet validation procedure. Out of 7056 KOIs, we determine that 1935 have probabilities <1% of being astrophysical false positives, and thus may be considered validated planets. Of these, 1284 have not yet been validated or confirmed by other methods. In addition, we identify 428 KOIs that are likely to be false positives, but have not yet been identified as such, though some of these may be a result of unidentified transit timing variations. A side product of these calculations is full stellar property posterior samplings for every host star, modeled as single, binary, and triple systems. These calculations use vespa, a publicly available Python package that is able to be easily applied to any transiting exoplanet candidate.
The Integrated Airframe/Propulsion Control System Architecture program (IAPSA)
NASA Technical Reports Server (NTRS)
Palumbo, Daniel L.; Cohen, Gerald C.; Meissner, Charles W.
1990-01-01
The Integrated Airframe/Propulsion Control System Architecture program (IAPSA) is a two-phase program which was initiated by NASA in the early 80s. The first phase, IAPSA 1, studied different architectural approaches to the problem of integrating engine control systems with airframe control systems in an advanced tactical fighter. One of the conclusions of IAPSA 1 was that the technology to construct a suitable system was available, yet the ability to create these complex computer architectures has outpaced the ability to analyze the resulting system's performance. With this in mind, the second phase of IAPSA approached the same problem with the added constraint that the system be designed for validation. The intent of the design for validation requirement is that validation requirements should be shown to be achievable early in the design process. IAPSA 2 has demonstrated that despite diligent efforts, integrated systems can retain characteristics which are difficult to model and, therefore, difficult to validate.
Brown, J. B.; Schmidt, G.; Lent, B.; Sas, G.; Lemelin, J.
2001-01-01
OBJECTIVE: To replicate, in a Francophone community, our prior work determining the reliability and validity of the full Woman Abuse Screening Tool (WAST) and a two-item version (WAST-Short). DESIGN: Questionnaires completed by abused and nonabused women. SETTING: Two women's shelters in Francophone communities in Ontario and Quebec and participants' homes or workplaces. PARTICIPANTS: A convenience sample of 25 abused women currently residing in two women's shelters and a convenience sample of 21 women who reported they were not abused. MAIN OUTCOME MEASURES: Women's responses to French versions of the WAST, the Abuse Risk Inventory (ARI), and comfort in answering the questions were compared. Also, the reliability and validity of French versions of WAST and WAST-Short were assessed. RESULTS: Abused (n = 23) and not abused (n = 21) women were demographically similar. A strong single-factor structure that accounted for 81% of total variance in the French WAST items was identified. The French WAST was found to be highly reliable with a coefficient alpha of .95 and demonstrated construct and discriminant validity. The WAST-Short correctly classified all the nonabused women and 78.7% of the abused women. The abused women reported feeling less comfortable responding to the WAST questions than the nonabused women. CONCLUSION: The French version of the WAST demonstrated good reliability and validity and discriminated between known samples of abused and nonabused women. Even though the French WAST-Short did not perform as well as the English version, results of this study support further evaluation of the WAST for screening women in Francophone or bilingual family practice settings. PMID:11398732
Validity for What? The Peril of Overclarifying
ERIC Educational Resources Information Center
Murphy, Kevin R.
2012-01-01
As Paul Newton so ably demonstrates, the concept of validity is both important and problematic. Over the last several decades, a consensus definition of validity has emerged; the current edition of "Standards for Educational and Psychological Testing" notes, "Validity refers to the degree to which evidence and theory support the interpretations of…
DOE Office of Scientific and Technical Information (OSTI.GOV)
Liu, Huiqiang; Wu, Xizeng, E-mail: xwu@uabmc.edu, E-mail: tqxiao@sinap.ac.cn; Xiao, Tiqiao, E-mail: xwu@uabmc.edu, E-mail: tqxiao@sinap.ac.cn
Purpose: Propagation-based phase-contrast CT (PPCT) utilizes highly sensitive phase-contrast technology applied to x-ray microtomography. Performing phase retrieval on the acquired angular projections can enhance image contrast and enable quantitative imaging. In this work, the authors demonstrate the validity and advantages of a novel technique for high-resolution PPCT by using the generalized phase-attenuation duality (PAD) method of phase retrieval. Methods: A high-resolution angular projection data set of a fish head specimen was acquired with a monochromatic 60-keV x-ray beam. In one approach, the projection data were directly used for tomographic reconstruction. In two other approaches, the projection data were preprocessed bymore » phase retrieval based on either the linearized PAD method or the generalized PAD method. The reconstructed images from all three approaches were then compared in terms of tissue contrast-to-noise ratio and spatial resolution. Results: The authors’ experimental results demonstrated the validity of the PPCT technique based on the generalized PAD-based method. In addition, the results show that the authors’ technique is superior to the direct PPCT technique as well as the linearized PAD-based PPCT technique in terms of their relative capabilities for tissue discrimination and characterization. Conclusions: This novel PPCT technique demonstrates great potential for biomedical imaging, especially for applications that require high spatial resolution and limited radiation exposure.« less
2003-11-25
alkyd resin enamel was 14 Figure 1 -- Test Panels Before Application of WD CARC at MCB Figure 2 -- Test Panels with WD CARC Applied at MCB...WD) Chemical Agent Resistant Coating (CARC) patented (#5,691,410) by the Army Research Laboratory (ARL) has undergone technology Demonstration...Resistant Coating (CARC) patented (#5,691,410) by the Army Research Laboratory (ARL) has undergone technology Demonstration/Validation (Dem/Val) testing
Validation of asthma recording in electronic health records: a systematic review
Nissen, Francis; Quint, Jennifer K; Wilkinson, Samantha; Mullerova, Hana; Smeeth, Liam; Douglas, Ian J
2017-01-01
Objective To describe the methods used to validate asthma diagnoses in electronic health records and summarize the results of the validation studies. Background Electronic health records are increasingly being used for research on asthma to inform health services and health policy. Validation of the recording of asthma diagnoses in electronic health records is essential to use these databases for credible epidemiological asthma research. Methods We searched EMBASE and MEDLINE databases for studies that validated asthma diagnoses detected in electronic health records up to October 2016. Two reviewers independently assessed the full text against the predetermined inclusion criteria. Key data including author, year, data source, case definitions, reference standard, and validation statistics (including sensitivity, specificity, positive predictive value [PPV], and negative predictive value [NPV]) were summarized in two tables. Results Thirteen studies met the inclusion criteria. Most studies demonstrated a high validity using at least one case definition (PPV >80%). Ten studies used a manual validation as the reference standard; each had at least one case definition with a PPV of at least 63%, up to 100%. We also found two studies using a second independent database to validate asthma diagnoses. The PPVs of the best performing case definitions ranged from 46% to 58%. We found one study which used a questionnaire as the reference standard to validate a database case definition; the PPV of the case definition algorithm in this study was 89%. Conclusion Attaining high PPVs (>80%) is possible using each of the discussed validation methods. Identifying asthma cases in electronic health records is possible with high sensitivity, specificity or PPV, by combining multiple data sources, or by focusing on specific test measures. Studies testing a range of case definitions show wide variation in the validity of each definition, suggesting this may be important for obtaining asthma definitions with optimal validity. PMID:29238227
Pat, Lucio; Ali, Bassam; Guerrero, Armando; Córdova, Atl V.; Garduza, José P.
2016-01-01
Attenuated total reflectance-Fourier transform infrared spectrometry and chemometrics model was used for determination of physicochemical properties (pH, redox potential, free acidity, electrical conductivity, moisture, total soluble solids (TSS), ash, and HMF) in honey samples. The reference values of 189 honey samples of different botanical origin were determined using Association Official Analytical Chemists, (AOAC), 1990; Codex Alimentarius, 2001, International Honey Commission, 2002, methods. Multivariate calibration models were built using partial least squares (PLS) for the measurands studied. The developed models were validated using cross-validation and external validation; several statistical parameters were obtained to determine the robustness of the calibration models: (PCs) optimum number of components principal, (SECV) standard error of cross-validation, (R 2 cal) coefficient of determination of cross-validation, (SEP) standard error of validation, and (R 2 val) coefficient of determination for external validation and coefficient of variation (CV). The prediction accuracy for pH, redox potential, electrical conductivity, moisture, TSS, and ash was good, while for free acidity and HMF it was poor. The results demonstrate that attenuated total reflectance-Fourier transform infrared spectrometry is a valuable, rapid, and nondestructive tool for the quantification of physicochemical properties of honey. PMID:28070445
Overgaauw, Sandy; Rieffe, Carolien; Broekhof, Evelien; Crone, Eveline A.; Güroğlu, Berna
2017-01-01
Empathy plays a crucial role in healthy social functioning and in maintaining positive social relationships. In this study, 1250 children and adolescents (10–15 year olds) completed the newly developed Empathy Questionnaire for Children and Adolescents (EmQue-CA) that was tested on reliability, construct validity, convergent validity, and concurrent validity. The EmQue-CA aims to assess empathy using the following scales: affective empathy, cognitive empathy, and intention to comfort. A Principal Components Analysis, which was directly tested with a Confirmatory Factor Analysis, confirmed the proposed three-factor model resulting in 14 final items. Reliability analyses demonstrated high internal consistency of the scales. Furthermore, the scales showed high convergent validity, as they were positively correlated with related scales of the Interpersonal Reactivity Index (Davis, 1983). With regard to concurrent validity, higher empathy was related to more attention to others’ emotions, higher friendship quality, less focus on own affective state, and lower levels of bullying behavior. Taken together, we show that the EmQue-CA is a reliable and valid instrument to measure empathy in typically developing children and adolescents aged 10 and older. PMID:28611713
The Hyper-X Flight Systems Validation Program
NASA Technical Reports Server (NTRS)
Redifer, Matthew; Lin, Yohan; Bessent, Courtney Amos; Barklow, Carole
2007-01-01
For the Hyper-X/X-43A program, the development of a comprehensive validation test plan played an integral part in the success of the mission. The goal was to demonstrate hypersonic propulsion technologies by flight testing an airframe-integrated scramjet engine. Preparation for flight involved both verification and validation testing. By definition, verification is the process of assuring that the product meets design requirements; whereas validation is the process of assuring that the design meets mission requirements for the intended environment. This report presents an overview of the program with emphasis on the validation efforts. It includes topics such as hardware-in-the-loop, failure modes and effects, aircraft-in-the-loop, plugs-out, power characterization, antenna pattern, integration, combined systems, captive carry, and flight testing. Where applicable, test results are also discussed. The report provides a brief description of the flight systems onboard the X-43A research vehicle and an introduction to the ground support equipment required to execute the validation plan. The intent is to provide validation concepts that are applicable to current, follow-on, and next generation vehicles that share the hybrid spacecraft and aircraft characteristics of the Hyper-X vehicle.
Validity and reliability of a scale to measure genital body image.
Zielinski, Ruth E; Kane-Low, Lisa; Miller, Janis M; Sampselle, Carolyn
2012-01-01
Women's body image dissatisfaction extends to body parts usually hidden from view--their genitals. Ability to measure genital body image is limited by lack of valid and reliable questionnaires. We subjected a previously developed questionnaire, the Genital Self Image Scale (GSIS) to psychometric testing using a variety of methods. Five experts determined the content validity of the scale. Then using four participant groups, factor analysis was performed to determine construct validity and to identify factors. Further construct validity was established using the contrasting groups approach. Internal consistency and test-retest reliability was determined. Twenty one of 29 items were considered content valid. Two items were added based on expert suggestions. Factor analysis was undertaken resulting in four factors, identified as Genital Confidence, Appeal, Function, and Comfort. The revised scale (GSIS-20) included 20 items explaining 59.4% of the variance. Women indicating an interest in genital cosmetic surgery exhibited significantly lower scores on the GSIS-20 than those who did not. The final 20 item scale exhibited internal reliability across all sample groups as well as test-retest reliability. The GSIS-20 provides a measure of genital body image demonstrating reliability and validity across several populations of women.
Post mitigation impact risk analysis for asteroid deflection demonstration missions
NASA Astrophysics Data System (ADS)
Eggl, Siegfried; Hestroffer, Daniel; Thuillot, William; Bancelin, David; Cano, Juan L.; Cichocki, Filippo
2015-08-01
Even though mankind believes to have the capabilities to avert potentially disastrous asteroid impacts, only the realization of mitigation demonstration missions can validate this claim. Such a deflection demonstration attempt has to be cost effective, easy to validate, and safe in the sense that harmless asteroids must not be turned into potentially hazardous objects. Uncertainties in an asteroid's orbital and physical parameters as well as those additionally introduced during a mitigation attempt necessitate an in depth analysis of deflection mission designs in order to dispel planetary safety concerns. We present a post mitigation impact risk analysis of a list of potential kinetic impactor based deflection demonstration missions proposed in the framework of the NEOShield project. Our results confirm that mitigation induced uncertainties have a significant influence on the deflection outcome. Those cannot be neglected in post deflection impact risk studies. We show, furthermore, that deflection missions have to be assessed on an individual basis in order to ensure that asteroids are not inadvertently transported closer to the Earth at a later date. Finally, we present viable targets and mission designs for a kinetic impactor test to be launched between the years 2025 and 2032.
Mean Flow and Noise Prediction for a Separate Flow Jet With Chevron Mixers
NASA Technical Reports Server (NTRS)
Koch, L. Danielle; Bridges, James; Khavaran, Abbas
2004-01-01
Experimental and numerical results are presented here for a separate flow nozzle employing chevrons arranged in an alternating pattern on the core nozzle. Comparisons of these results demonstrate that the combination of the WIND/MGBK suite of codes can predict the noise reduction trends measured between separate flow jets with and without chevrons on the core nozzle. Mean flow predictions were validated against Particle Image Velocimetry (PIV), pressure, and temperature data, and noise predictions were validated against acoustic measurements recorded in the NASA Glenn Aeroacoustic Propulsion Lab. Comparisons are also made to results from the CRAFT code. The work presented here is part of an on-going assessment of the WIND/MGBK suite for use in designing the next generation of quiet nozzles for turbofan engines.
Fast Whole-Engine Stirling Analysis
NASA Technical Reports Server (NTRS)
Dyson, Rodger W.; Wilson, Scott D.; Tew, Roy C.; Demko, Rikako
2005-01-01
An experimentally validated approach is described for fast axisymmetric Stirling engine simulations. These simulations include the entire displacer interior and demonstrate it is possible to model a complete engine cycle in less than an hour. The focus of this effort was to demonstrate it is possible to produce useful Stirling engine performance results in a time-frame short enough to impact design decisions. The combination of utilizing the latest 64-bit Opteron computer processors, fiber-optical Myrinet communications, dynamic meshing, and across zone partitioning has enabled solution times at least 240 times faster than previous attempts at simulating the axisymmetric Stirling engine. A comparison of the multidimensional results, calibrated one-dimensional results, and known experimental results is shown. This preliminary comparison demonstrates that axisymmetric simulations can be very accurate, but more work remains to improve the simulations through such means as modifying the thermal equilibrium regenerator models, adding fluid-structure interactions, including radiation effects, and incorporating mechanodynamics.
Fast Whole-Engine Stirling Analysis
NASA Technical Reports Server (NTRS)
Dyson, Rodger W.; Wilson, Scott D.; Tew, Roy C.; Demko, Rikako
2007-01-01
An experimentally validated approach is described for fast axisymmetric Stirling engine simulations. These simulations include the entire displacer interior and demonstrate it is possible to model a complete engine cycle in less than an hour. The focus of this effort was to demonstrate it is possible to produce useful Stirling engine performance results in a time-frame short enough to impact design decisions. The combination of utilizing the latest 64-bit Opteron computer processors, fiber-optical Myrinet communications, dynamic meshing, and across zone partitioning has enabled solution times at least 240 times faster than previous attempts at simulating the axisymmetric Stirling engine. A comparison of the multidimensional results, calibrated one-dimensional results, and known experimental results is shown. This preliminary comparison demonstrates that axisymmetric simulations can be very accurate, but more work remains to improve the simulations through such means as modifying the thermal equilibrium regenerator models, adding fluid-structure interactions, including radiation effects, and incorporating mechanodynamics.
Strategic marketing applications of conjoint analysis: an HMO perspective.
Rosko, M D; DeVita, M; McKenna, W F; Walker, L R
1985-01-01
The purpose of this article is to demonstrate how data from a conjoint analysis study can be used to help determine the most appropriate marketing mix for an operational HMO which is entering a new market--the geriatric population. Included are two features which are absent in previous articles on health care applications of conjoint analysis: external validation of results, and a demonstration of how conjoint analysis can be used to simulate market responses to changes in the provider's marketing mix.
Relativity, anomalies and objectivity loophole in recent tests of local realism
NASA Astrophysics Data System (ADS)
Bednorz, Adam
2017-11-01
Local realism is in conflict with special quantum Bell-type models. Recently, several experiments have demonstrated violation of local realism if we trust their setup assuming special relativity valid. In this paper we question the assumption of relativity, point out not commented anomalies and show that the experiments have not closed objectivity loophole because clonability of the result has not been demonstrated. We propose several improvements in further experimental tests of local realism make the violation more convincing.
Maples-Keller, Jessica L; Williamson, Rachel L; Sleep, Chelsea E; Carter, Nathan T; Campbell, W Keith; Miller, Joshua D
2017-10-31
Given advantages of freely available and modifiable measures, an increase in the use of measures developed from the International Personality Item Pool (IPIP), including the 300-item representation of the Revised NEO Personality Inventory (NEO PI-R; Costa & McCrae, 1992a ) has occurred. The focus of this study was to use item response theory to develop a 60-item, IPIP-based measure of the Five-Factor Model (FFM) that provides equal representation of the FFM facets and to test the reliability and convergent and criterion validity of this measure compared to the NEO Five Factor Inventory (NEO-FFI). In an undergraduate sample (n = 359), scores from the NEO-FFI and IPIP-NEO-60 demonstrated good reliability and convergent validity with the NEO PI-R and IPIP-NEO-300. Additionally, across criterion variables in the undergraduate sample as well as a community-based sample (n = 757), the NEO-FFI and IPIP-NEO-60 demonstrated similar nomological networks across a wide range of external variables (r ICC = .96). Finally, as expected, in an MTurk sample the IPIP-NEO-60 demonstrated advantages over the Big Five Inventory-2 (Soto & John, 2017 ; n = 342) with regard to the Agreeableness domain content. The results suggest strong reliability and validity of the IPIP-NEO-60 scores.
Boody, Barrett S; Bhatt, Surabhi; Mazmudar, Aditya S; Hsu, Wellington K; Rothrock, Nan E; Patel, Alpesh A
2018-03-01
OBJECTIVE The Patient-Reported Outcomes Measurement Information System (PROMIS), which is funded by the National Institutes of Health, is a set of adaptive, responsive assessment tools that measures patient-reported health status. PROMIS measures have not been validated for surgical patients with cervical spine disorders. The objective of this project is to evaluate the validity (e.g., convergent validity, known-groups validity, responsiveness to change) of PROMIS computer adaptive tests (CATs) for pain behavior, pain interference, and physical function in patients undergoing cervical spine surgery. METHODS The legacy outcome measures Neck Disability Index (NDI) and SF-12 were used as comparisons with PROMIS measures. PROMIS CATs, NDI-10, and SF-12 measures were administered prospectively to 59 consecutive tertiary hospital patients who were treated surgically for degenerative cervical spine disorders. A subscore of NDI-5 was calculated from NDI-10 by eliminating the lifting, headaches, pain intensity, reading, and driving sections and multiplying the final score by 4. Assessments were administered preoperatively (baseline) and postoperatively at 6 weeks and 3 months. Patients presenting for revision surgery, tumor, infection, or trauma were excluded. Participants completed the measures in Assessment Center, an online data collection tool accessed by using a secure login and password on a tablet computer. Subgroup analysis was also performed based on a primary diagnosis of either cervical radiculopathy or cervical myelopathy. RESULTS Convergent validity for PROMIS CATs was supported with multiple statistically significant correlations with the existing legacy measures, NDI and SF-12, at baseline. Furthermore, PROMIS CATs demonstrated known-group validity and identified clinically significant improvements in all measures after surgical intervention. In the cervical radiculopathy and myelopathic cohorts, the PROMIS measures demonstrated similar responsiveness to the SF-12 and NDI scores in the patients who self-identified as having postoperative clinical improvement. PROMIS CATs required a mean total of 3.2 minutes for PROMIS pain behavior (mean ± SD 0.9 ± 0.5 minutes), pain interference (1.2 ± 1.9 minutes), and physical function (1.1 ± 1.4 minutes) and compared favorably with 3.4 minutes for NDI and 4.1 minutes for SF-12. CONCLUSIONS This study verifies that PROMIS CATs demonstrate convergent and known-groups validity and comparable responsiveness to change as existing legacy measures. The PROMIS measures required less time for completion than legacy measures. The validity and efficiency of the PROMIS measures in surgical patients with cervical spine disorders suggest an improvement over legacy measures and an opportunity for incorporation into clinical practice.
Della Manna, Angelo; Nye, Jeffrey V; Carney, Christopher; Hammons, Jennifer S; Mann, Michael; Al Shamali, Farida; Vallone, Peter M; Romsos, Erica L; Marne, Beth Ann; Tan, Eugene; Turingan, Rosemary S; Hogan, Catherine; Selden, Richard F; French, Julie L
2016-11-01
Since the implementation of forensic DNA typing in labs more than 20 years ago, the analysis procedures and data interpretation have always been conducted in a laboratory by highly trained and qualified scientific personnel. Rapid DNA technology has the potential to expand testing capabilities within forensic laboratories and to allow forensic STR analysis to be performed outside the physical boundaries of the traditional laboratory. The developmental validation of the DNAscan/ANDE Rapid DNA Analysis System was completed using a BioChipSet™ Cassette consumable designed for high DNA content samples, such as single source buccal swabs. A total of eight laboratories participated in the testing which totaled over 2300 swabs, and included nearly 1400 unique individuals. The goal of this extensive study was to obtain, document, analyze, and assess DNAscan and its internal Expert System to reliably genotype reference samples in a manner compliant with the FBI's Quality Assurance Standards (QAS) and the NDIS Operational Procedures. The DNAscan System provided high quality, concordant results for reference buccal swabs, including automated data analysis with an integrated Expert System. Seven external laboratories and NetBio, the developer of the technology, participated in the validation testing demonstrating the reproducibility and reliability of the system and its successful use in a variety of settings by numerous operators. The DNAscan System demonstrated limited cross reactivity with other species, was resilient in the presence of numerous inhibitors, and provided reproducible results for both buccal and purified DNA samples with sensitivity at a level appropriate for buccal swabs. The precision and resolution of the system met industry standards for detection of micro-variants and displayed single base resolution. PCR-based studies provided confidence that the system was robust and that the amplification reaction had been optimized to provide high quality results. The DNAscan integrated Expert System was examined as part of the Developmental Validation and successfully interpreted the over 2000 samples tested with over 99.998% concordant alleles. The system appropriately flagged samples for human review and failed both mixed samples and samples with insufficient genetic information. These results demonstrated the integrated Expert System makes correct allele calls without human intervention. Copyright © 2016 The Authors. Published by Elsevier Ireland Ltd.. All rights reserved.
Mirza, Tahseen; Liu, Qian Julie; Vivilecchia, Richard; Joshi, Yatindra
2009-03-01
There has been a growing interest during the past decade in the use of fiber optics dissolution testing. Use of this novel technology is mainly confined to research and development laboratories. It has not yet emerged as a tool for end product release testing despite its ability to generate in situ results and efficiency improvement. One potential reason may be the lack of clear validation guidelines that can be applied for the assessment of suitability of fiber optics. This article describes a comprehensive validation scheme and development of a reliable, robust, reproducible and cost-effective dissolution test using fiber optics technology. The test was successfully applied for characterizing the dissolution behavior of a 40-mg immediate-release tablet dosage form that is under development at Novartis Pharmaceuticals, East Hanover, New Jersey. The method was validated for the following parameters: linearity, precision, accuracy, specificity, and robustness. In particular, robustness was evaluated in terms of probe sampling depth and probe orientation. The in situ fiber optic method was found to be comparable to the existing manual sampling dissolution method. Finally, the fiber optic dissolution test was successfully performed by different operators on different days, to further enhance the validity of the method. The results demonstrate that the fiber optics technology can be successfully validated for end product dissolution/release testing. (c) 2008 Wiley-Liss, Inc. and the American Pharmacists Association
An Approach to Comprehensive and Sustainable Solar Wind Model Validation
NASA Astrophysics Data System (ADS)
Rastaetter, L.; MacNeice, P. J.; Mays, M. L.; Boblitt, J. M.; Wiegand, C.
2017-12-01
The number of models of the corona and inner heliosphere and of their updates and upgrades grows steadily, as does the number and character of the model inputs. Maintaining up to date validation of these models, in the face of this constant model evolution, is a necessary but very labor intensive activity. In the last year alone, both NASA's LWS program and the CCMC's ongoing support of model forecasting activities at NOAA SWPC have sought model validation reports on the quality of all aspects of the community's coronal and heliospheric models, including both ambient and CME related wind solutions at L1. In this presentation I will give a brief review of the community's previous model validation results of L1 wind representation. I will discuss the semi-automated web based system we are constructing at the CCMC to present comparative visualizations of all interesting aspects of the solutions from competing models.This system is designed to be easily queried to provide the essential comprehensive inputs to repeat andupdate previous validation studies and support extensions to them. I will illustrate this by demonstrating how the system is being used to support the CCMC/LWS Model Assessment Forum teams focused on the ambient and time dependent corona and solar wind, including CME arrival time and IMF Bz.I will also discuss plans to extend the system to include results from the Forum teams addressing SEP model validation.
Wang, Yao; Xiao, Lily Dongxia; He, Guo-Ping
2015-02-01
Suboptimal care for people with dementia in hospital settings has been reported and is attributed to the lack of knowledge and inadequate attitudes in dementia care among health professionals. Educational interventions have been widely used to improve care outcomes; however, Chinese-language instruments used in dementia educational interventions for health professionals are lacking. The aims of this study were to select, translate and evaluate instruments used in dementia educational interventions for Chinese health professionals in acute-care hospitals. A cross-sectional study design was used. A modified stratified random sampling was used to recruit 442 participants from different levels of hospitals in Changsha, China. Dementia care competence was used as a framework for the selection and evaluation of Alzheimer's Disease Knowledge Scale and Dementia Care Attitudes Scale for health professionals in the study. These two scales were translated into Chinese using forward and back translation method. Content validity, test-retest reliability and internal consistency were assessed. Construct validity was tested using exploratory factor analysis. Known-group validity was established by comparing scores of Alzheimer's Disease Knowledge Scale and Dementia Care Attitudes Scale in two sub-groups. A person-centred care scale was utilised as a gold standard to establish concurrent validity of these two scales. Results demonstrated acceptable content validity, internal consistency, test-retest reliability and concurrent validity. Exploratory factor analysis presented a single-factor structure of the Chinese Alzheimer's Disease Knowledge Scale and a two-factor structure of the Chinese Dementia Care Attitudes Scale, supporting the conceptual dimensions of the original scales. The Chinese Alzheimer's Disease Knowledge Scale and Chinese Dementia Care Attitudes Scale demonstrated known-group validity evidenced by significantly higher scores identified from the sub-group with a longer work experience compared to those in the sub-group with less work experience. The use of dementia care competence as a framework to inform the selection and evaluation of instruments used in dementia educational interventions for health professionals has wide applicability in other areas. The results support that Chinese Alzheimer's Disease Knowledge Scale and Chinese Dementia Care Attitudes Scale are reliable and valid instruments for health professionals to use in acute-care settings. Copyright © 2014 Elsevier Ltd. All rights reserved.
Crouse, Cecelia A; Yeung, Stephanie; Greenspoon, Susan; McGuckian, Amy; Sikorsky, Julie; Ban, Jeff; Mathies, Richard
2005-08-01
To present validation studies performed for the implementation of existing and new technologies to increase the efficiency in the forensic DNA Section of the Palm Beach County Sheriff's Office (PBSO) Crime Laboratory. Using federally funded grants, internal support, and an external Process Mapping Team, the PBSO collaborated with forensic vendors, universities, and other forensic laboratories to enhance DNA testing procedures, including validation of the DNA IQ magnetic bead extraction system, robotic DNA extraction using the BioMek2000, the ABI7000 Sequence Detection System, and is currently evaluating a micro Capillary Array Electrophoresis device. The PBSO successfully validated and implemented both manual and automated Promega DNA IQ magnetic bead extractions system, which have increased DNA profile results from samples with low DNA template concentrations. The Beckman BioMek2000 DNA robotic workstation has been validated for blood, tissue, bone, hair, epithelial cells (touch evidence), and mixed stains such as semen. There has been a dramatic increase in the number of samples tested per case since implementation of the robotic extraction protocols. The validation of the ABI7000 real-time quantitative polymerase chain reaction (qPCR) technology and the single multiplex short tandem repeat (STR) PowerPlex16 BIO amplification system has provided both a time and a financial benefit. In addition, the qPCR system allows more accurate DNA concentration data and the PowerPlex 16 BIO multiplex generates DNA profiles data in half the time when compared to PowerPlex1.1 and PowerPlex2.1 STR systems. The PBSO's future efficiency requirements are being addressed through collaboration with the University of California at Berkeley and the Virginia Division of Forensic Science to validate microcapillary array electrophoresis instrumentation. Initial data demonstrated the electrophoresis of 96 samples in less than twenty minutes. The PBSO demonstrated, through the validation of more efficient extraction and quantification technology, an increase in the number of evidence samples tested using robotic/DNA IQ magnetic bead DNA extraction, a decrease in the number of negative samples amplified due to qPCR and implementation of a single multiplex amplification system. In addition, initial studies show the microcapillary array electrophoresis device (microCAE) evaluation results provide greater sensitivity and faster STR analysis output than current platforms.
[Reliability and validity of depression scales of Chinese version: a systematic review].
Sun, X Y; Li, Y X; Yu, C Q; Li, L M
2017-01-10
Objective: Through systematically reviewing the reliability and validity of depression scales of Chinese version in adults in China to evaluate the psychometric properties of depression scales for different groups. Methods: Eligible studies published before 6 May 2016 were retrieved from the following database: CNKI, Wanfang, PubMed and Embase. The HSROC model of the diagnostic test accuracy (DTA) for Meta-analysis was used to calculate the pooled sensitivity and specificity of the PHQ-9. Results: A total of 44 papers evaluating the performance of depression scales were included. Results showed that the reliability and validity of the common depression scales were eligible, including the Beck depression inventory (BDI), the Hamilton depression scale (HAMD), the center epidemiological studies depression scale (CES-D), the patient health questionnaire (PHQ) and the Geriatric depression scale (GDS). The Cronbach' s coefficient of most tools were larger than 0.8, while the test-retest reliability and split-half reliability were larger than 0.7, indicating good internal consistency and stability. The criterion validity, convergent validity, discrimination validity and screening validity were acceptable though different cut-off points were recommended by different studies. The pooled sensitivity of the 11 studies evaluating PHQ-9 was 0.88 (95 %CI : 0.85-0.91) while the pooled specificity was 0.89 (95 %CI : 0.82-0.94), which demonstrated the applicability of PHQ-9 in screening depression. Conclusion: The reliability and validity of different depression scales of Chinese version are acceptable. The characteristics of different tools and study population should be taken into consideration when choosing a specific scale.
Electron Beam-Cure Polymer Matrix Composites: Processing and Properties
NASA Technical Reports Server (NTRS)
Wrenn, G.; Frame, B.; Jensen, B.; Nettles, A.
2001-01-01
Researchers from NASA and Oak Ridge National Laboratory are evaluating a series of electron beam curable composites for application in reusable launch vehicle airframe and propulsion systems. Objectives are to develop electron beam curable composites that are useful at cryogenic to elevated temperatures (-217 C to 200 C), validate key mechanical properties of these composites, and demonstrate cost-saving fabrication methods at the subcomponent level. Electron beam curing of polymer matrix composites is an enabling capability for production of aerospace structures in a non-autoclave process. Payoffs of this technology will be fabrication of composite structures at room temperature, reduced tooling cost and cure time, and improvements in component durability. This presentation covers the results of material property evaluations for electron beam-cured composites made with either unidirectional tape or woven fabric architectures. Resin systems have been evaluated for performance in ambient, cryogenic, and elevated temperature conditions. Results for electron beam composites and similar composites cured in conventional processes are reviewed for comparison. Fabrication demonstrations were also performed for electron beam-cured composite airframe and propulsion piping subcomponents. These parts have been built to validate manufacturing methods with electron beam composite materials, to evaluate electron beam curing processing parameters, and to demonstrate lightweight, low-cost tooling options.
2006-07-01
in bioremediation (such as lactate, citrate, benzoate , phenols, etc). Site Study Objectives • demonstrate and validate the PFM as an innovative...contaminants and alcohol tracers. However, organic acids (e.g., benzoate ) can be used as the PFM resident racers. We modified zeolites and GAC with a...bioremediation (such as lactate, citrate, benzoate , phenols, etc). 1.2. Objectives of the Demonstration The specific objectives of this
Janssen, Ellen M; Marshall, Deborah A; Hauber, A Brett; Bridges, John F P
2017-12-01
The recent endorsement of discrete-choice experiments (DCEs) and other stated-preference methods by regulatory and health technology assessment (HTA) agencies has placed a greater focus on demonstrating the validity and reliability of preference results. Areas covered: We present a practical overview of tests of validity and reliability that have been applied in the health DCE literature and explore other study qualities of DCEs. From the published literature, we identify a variety of methods to assess the validity and reliability of DCEs. We conceptualize these methods to create a conceptual model with four domains: measurement validity, measurement reliability, choice validity, and choice reliability. Each domain consists of three categories that can be assessed using one to four procedures (for a total of 24 tests). We present how these tests have been applied in the literature and direct readers to applications of these tests in the health DCE literature. Based on a stakeholder engagement exercise, we consider the importance of study characteristics beyond traditional concepts of validity and reliability. Expert commentary: We discuss study design considerations to assess the validity and reliability of a DCE, consider limitations to the current application of tests, and discuss future work to consider the quality of DCEs in healthcare.
Evaluation of the Validity and Reliability of the Waterlow Pressure Ulcer Risk Assessment Scale.
Charalambous, Charalambos; Koulori, Agoritsa; Vasilopoulos, Aristidis; Roupa, Zoe
2018-04-01
Prevention is the ideal strategy to tackle the problem of pressure ulcers. Pressure ulcer risk assessment scales are one of the most pivotal measures applied to tackle the problem, much criticisms has been developed regarding the validity and reliability of these scales. To investigate the validity and reliability of the Waterlow pressure ulcer risk assessment scale. The methodology used is a narrative literature review, the bibliography was reviewed through Cinahl, Pubmed, EBSCO, Medline and Google scholar, 26 scientific articles where identified. The articles where chosen due to their direct correlation with the objective under study and their scientific relevance. The construct and face validity of the Waterlow appears adequate, but with regards to content validity changes in the category age and gender can be beneficial. The concurrent validity cannot be assessed. The predictive validity of the Waterlow is characterized by high specificity and low sensitivity. The inter-rater reliability has been demonstrated to be inadequate, this may be due to lack of clear definitions within the categories and differentiating level of knowledge between the users. Due to the limitations presented regarding the validity and reliability of the Waterlow pressure ulcer risk assessment scale, the scale should be used in conjunction with clinical assessment to provide optimum results.
Validity of the Eating Attitudes Test and the Eating Disorders Inventory in Bulimia Nervosa.
ERIC Educational Resources Information Center
Gross, Janet; And Others
1986-01-01
Assessed criterion and concurrent validity of the Eating Attitudes Test and the Eating Disorder Inventory in 82 women with bulimia nervosa. Both tests demonstrated criterion validity by discriminating bulimia nervosa subjects from normals. Only weak support was found for concurrent validity within bulimia subjects. Recommends combination of…
Ovarian and cervical cancer awareness: development of two validated measurement tools
Simon, Alice E; Wardle, Jane; Grimmett, Chloe; Power, Emily; Corker, Elizabeth; Menon, Usha; Matheson, Lauren; Waller, Jo
2012-01-01
Background The aim of the study was to develop and validate measures of awareness of symptoms and risk factors for ovarian and cervical cancer (Ovarian and Cervical Cancer Awareness Measures). Methods Potentially relevant items were extracted from the literature and generated by experts. Four validation studies were carried out to establish reliability and validity. Women aged 21–67 years (n=146) and ovarian and cervical cancer experts (n=32) were included in the studies. Internal reliability was assessed psychometrically. Test-retest reliability was assessed over a 1-week interval. To establish construct validity, Cancer Awareness Measure (CAM) scores of cancer experts were compared with equally well-educated comparison groups. Sensitivity to change was tested by randomly assigning participants to read either a leaflet giving information about ovarian/cervical cancer or a leaflet with control information, and then completing the ovarian/cervical CAM. Results Internal reliability (Cronbach's α=0.88 for the ovarian CAM and α=0.84 for the cervical CAM) and test-retest reliability (r=0.84 and r=0.77 for the ovarian and cervical CAMs, respectively) were both high. Validity was demonstrated with cancer experts achieving higher scores than controls [ovarian CAM: t(36)= –5.6, p<0.001; cervical CAM: t(38)= –3.7, p=0.001], and volunteers who were randomised to read a cancer leaflet scored higher than those who received a control leaflet [ovarian CAM: t(49)=7.5, p<0.001; cervical CAM: t(48)= –5.5, p<0.001]. Conclusions This study demonstrates the psychometric properties of the ovarian and cervical CAMs and supports their utility in assessing ovarian and cervical cancer awareness in the general population. PMID:21933805
Cho, Hwayoung; Liu, Jianfang
2018-01-01
Background Mobile technology has become a ubiquitous technology and can be particularly useful in the delivery of health interventions. This technology can allow us to deliver interventions to scale, cover broad geographic areas, and deliver technologies in highly tailored ways based on the preferences or characteristics of users. The broad use of mobile technologies supports the need for usability assessments of these tools. Although there have been a number of usability assessment instruments developed, none have been validated for use with mobile technologies. Objective The goal of this work was to validate the Health Information Technology Usability Evaluation Scale (Health-ITUES), a customizable usability assessment instrument in a sample of community-dwelling adults who were testing the use of a new mobile health (mHealth) technology. Methods A sample of 92 community-dwelling adults living with HIV used a new mobile app for symptom self-management and completed the Health-ITUES to assess the usability of the app. They also completed the Post-Study System Usability Questionnaire (PSSUQ), a widely used and well-validated usability assessment tool. Correlations between these scales and each of the subscales were assessed. Results The subscales of the Health-ITUES showed high internal consistency reliability (Cronbach alpha=.85-.92). Each of the Health-ITUES subscales and the overall scale was moderately to strongly correlated with the PSSUQ scales (r=.46-.70), demonstrating the criterion validity of the Health-ITUES. Conclusions The Health-ITUES has demonstrated reliability and validity for use in assessing the usability of mHealth technologies in community-dwelling adults living with a chronic illness. PMID:29305343
Lin, Wen-Ye; Chang, Jung-Tzu; Chu, Chun-Feng
2017-01-01
Despite measures to reduce disease transmission, a risk can occur when blood glucose meters (BGMs) are used on multiple individuals or by caregivers assisting a patient. The laboratory and in-clinic performance of a BGM system before and after disinfection should be demonstrated to guarantee accurate readings and reliable control of blood glucose (BG) for patients. In this study, an effective disinfection procedure, conducting wiping 10 times to assure a one minute contact time of the disinfectant on contaminated surface, was first demonstrated using test samples of the meter housing materials, including acrylonitrile butadiene styrene (ABS), polymethyl methacrylate (PMMA), and polycarbonate (PC), in accordance with ISO 15197:2013. After bench studies comprising 10,000 disinfection cycles, the elemental compositions of the disinfected ABS, PMMA, and PC samples were almost the same as in the original samples, as indicated by electron spectroscopy for chemical analysis. Subsequently, the validated disinfection procedure was then directly applied to disinfect 5 commercial BGM systems composed of ABS, PMMA, or PC to observe the effect of the validated disinfection procedure on meter accuracy. The results of HBsAg values after treatment with HBV sera and disinfectant wipes for each material were less than the LoD of each material of 0.020 IU/mL. Before and after the multiple disinfection cycles, 900 of 900 samples (100%) were within the system accuracy requirements of ISO 15197:2013. All of the systems showed high performance before and after the series of disinfection cycles and met the ISO 15197:2013 requirements. In addition, our results demonstrated multiple cleaning and disinfection cycles that represented normal use over the lifetime of a meter of 3–5 years. Our validated cleaning and disinfection procedure can be directly applied to other registered disinfectants for cleaning commercial BGM products in the future. PMID:28683148
Lin, Shu-Ping; Lin, Wen-Ye; Chang, Jung-Tzu; Chu, Chun-Feng
2017-01-01
Despite measures to reduce disease transmission, a risk can occur when blood glucose meters (BGMs) are used on multiple individuals or by caregivers assisting a patient. The laboratory and in-clinic performance of a BGM system before and after disinfection should be demonstrated to guarantee accurate readings and reliable control of blood glucose (BG) for patients. In this study, an effective disinfection procedure, conducting wiping 10 times to assure a one minute contact time of the disinfectant on contaminated surface, was first demonstrated using test samples of the meter housing materials, including acrylonitrile butadiene styrene (ABS), polymethyl methacrylate (PMMA), and polycarbonate (PC), in accordance with ISO 15197:2013. After bench studies comprising 10,000 disinfection cycles, the elemental compositions of the disinfected ABS, PMMA, and PC samples were almost the same as in the original samples, as indicated by electron spectroscopy for chemical analysis. Subsequently, the validated disinfection procedure was then directly applied to disinfect 5 commercial BGM systems composed of ABS, PMMA, or PC to observe the effect of the validated disinfection procedure on meter accuracy. The results of HBsAg values after treatment with HBV sera and disinfectant wipes for each material were less than the LoD of each material of 0.020 IU/mL. Before and after the multiple disinfection cycles, 900 of 900 samples (100%) were within the system accuracy requirements of ISO 15197:2013. All of the systems showed high performance before and after the series of disinfection cycles and met the ISO 15197:2013 requirements. In addition, our results demonstrated multiple cleaning and disinfection cycles that represented normal use over the lifetime of a meter of 3-5 years. Our validated cleaning and disinfection procedure can be directly applied to other registered disinfectants for cleaning commercial BGM products in the future.
Photogrammetric Technique for Center of Gravity Determination
NASA Technical Reports Server (NTRS)
Jones, Thomas W.; Johnson, Thomas H.; Shemwell, Dave; Shreves, Christopher M.
2012-01-01
A new measurement technique for determination of the center of gravity (CG) for large scale objects has been demonstrated. The experimental method was conducted as part of an LS-DYNA model validation program for the Max Launch Abort System (MLAS) crew module. The test was conducted on the full scale crew module concept at NASA Langley Research Center. Multi-camera photogrammetry was used to measure the test article in several asymmetric configurations. The objective of these measurements was to provide validation of the CG as computed from the original mechanical design. The methodology, measurement technique, and measurement results are presented.
NASA Technical Reports Server (NTRS)
Salazar, Giovanni; Droba, Justin C.; Oliver, Brandon; Amar, Adam J.
2016-01-01
With the recent development of multi-dimensional thermal protection system (TPS) material response codes including the capabilities to account for radiative heating is a requirement. This paper presents the recent efforts to implement such capabilities in the CHarring Ablator Response (CHAR) code developed at NASA's Johnson Space Center. This work also describes the different numerical methods implemented in the code to compute view factors for radiation problems involving multiple surfaces. Furthermore, verification and validation of the code's radiation capabilities are demonstrated by comparing solutions to analytical results, to other codes, and to radiant test data.
Development and Validation of Accident Models for FeCrAl Cladding
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gamble, Kyle Allan Lawrence; Hales, Jason Dean
2016-08-01
The purpose of this milestone report is to present the work completed in regards to material model development for FeCrAl cladding and highlight the results of applying these models to Loss of Coolant Accidents (LOCA) and Station Blackouts (SBO). With the limited experimental data available (essentially only the data used to create the models) true validation is not possible. In the absence of another alternative, qualitative comparisons during postulated accident scenarios between FeCrAl and Zircaloy-4 cladded rods have been completed demonstrating the superior performance of FeCrAl.
NASA Technical Reports Server (NTRS)
Woodard, Paul R.; Yang, Henry T. Y.; Batina, John T.
1992-01-01
Quality assessment procedures are described for two-dimensional and three-dimensional unstructured meshes. The procedures include measurement of minimum angles, element aspect ratios, stretching, and element skewness. Meshes about the ONERA M6 wing and the Boeing 747 transport configuration are generated using an advancing front method grid generation package of programs. Solutions of Euler's equations for these meshes are obtained at low angle-of-attack, transonic conditions. Results for these cases, obtained as part of a validation study demonstrate the accuracy of an implicit upwind Euler solution algorithm.
Olino, Thomas M; Benini, Laura; Icenogle, Grace; Wilson, Sylia; Klein, Daniel N; Seeley, John R; Lewinsohn, Peter M
2017-08-01
Numerous studies have focused on characterizing personality differences between individuals with and without psychopathology. For drawing valid conclusions for these comparisons, the personality instruments used must demonstrate psychometric equivalence. However, we are unaware of any studies that examine measurement invariance in personality across individuals with and without psychopathology. This study conducted tests of measurement invariance for positive emotionality, negative emotionality, and disinhibition across individuals with and without histories of depressive, anxiety, and substance use disorders. We found consistent evidence that positive emotionality, negative emotionality, and disinhibition were assessed equivalently across all comparisons with each demonstrating strict invariance. Overall, results suggest that comparisons of personality measures between diagnostic groups satisfy the assumption of measurement invariance and these scales represent the same psychological constructs. Thus, mean-level comparisons across these groups are valid tests.
Potrzebowski, Wojciech; André, Ingemar
2015-07-01
For highly oriented fibrillar molecules, three-dimensional structures can often be determined from X-ray fiber diffraction data. However, because of limited information content, structure determination and validation can be challenging. We demonstrate that automated structure determination of protein fibers can be achieved by guiding the building of macromolecular models with fiber diffraction data. We illustrate the power of our approach by determining the structures of six bacteriophage viruses de novo using fiber diffraction data alone and together with solid-state NMR data. Furthermore, we demonstrate the feasibility of molecular replacement from monomeric and fibrillar templates by solving the structure of a plant virus using homology modeling and protein-protein docking. The generated models explain the experimental data to the same degree as deposited reference structures but with improved structural quality. We also developed a cross-validation method for model selection. The results highlight the power of fiber diffraction data as structural constraints.
NASA Technical Reports Server (NTRS)
Komendera, Erik E.; Dorsey, John T.
2017-01-01
Developing a capability for the assembly of large space structures has the potential to increase the capabilities and performance of future space missions and spacecraft while reducing their cost. One such application is a megawatt-class solar electric propulsion (SEP) tug, representing a critical transportation ability for the NASA lunar, Mars, and solar system exploration missions. A series of robotic assembly experiments were recently completed at Langley Research Center (LaRC) that demonstrate most of the assembly steps for the SEP tug concept. The assembly experiments used a core set of robotic capabilities: long-reach manipulation and dexterous manipulation. This paper describes cross-cutting capabilities and technologies for in-space assembly (ISA), applies the ISA approach to a SEP tug, describes the design and development of two assembly demonstration concepts, and summarizes results of two sets of assembly experiments that validate the SEP tug assembly steps.
NASA Astrophysics Data System (ADS)
Natali, Marco; Reggente, Melania; Passeri, Daniele; Rossi, Marco
2016-06-01
The development of polymer-based nanocomposites to be used in critical thermal environments requires the characterization of their mechanical properties, which are related to their chemical composition, size, morphology and operating temperature. Atomic force microscopy (AFM) has been proven to be a useful tool to develop techniques for the mechanical characterization of these materials, thanks to its nanometer lateral resolution and to the capability of exerting ultra-low loads, down to the piconewton range. In this work, we demonstrate two techniques, one quasi-static, i.e., AFM-based indentation (I-AFM), and one dynamic, i.e., contact resonance AFM (CR-AFM), for the mechanical characterization of compliant materials at variable temperature. A cross-validation of I-AFM and CR-AFM has been performed by comparing the results obtained on two reference materials, i.e., low-density polyethylene (LDPE) and polycarbonate (PC), which demonstrated the accuracy of the techniques.
A discrete event simulation tool to support and predict hospital and clinic staffing.
DeRienzo, Christopher M; Shaw, Ryan J; Meanor, Phillip; Lada, Emily; Ferranti, Jeffrey; Tanaka, David
2017-06-01
We demonstrate how to develop a simulation tool to help healthcare managers and administrators predict and plan for staffing needs in a hospital neonatal intensive care unit using administrative data. We developed a discrete event simulation model of nursing staff needed in a neonatal intensive care unit and then validated the model against historical data. The process flow was translated into a discrete event simulation model. Results demonstrated that the model can be used to give a respectable estimate of annual admissions, transfers, and deaths based upon two different staffing levels. The discrete event simulation tool model can provide healthcare managers and administrators with (1) a valid method of modeling patient mix, patient acuity, staffing needs, and costs in the present state and (2) a forecast of how changes in a unit's staffing, referral patterns, or patient mix would affect a unit in a future state.
Extension of HCDstruct for Transonic Aeroservoelastic Analysis of Unconventional Aircraft Concepts
NASA Technical Reports Server (NTRS)
Quinlan, Jesse R.; Gern, Frank H.
2017-01-01
A substantial effort has been made to implement an enhanced aerodynamic modeling capability in the Higher-fidelity Conceptual Design and structural optimization tool. This additional capability is needed for a rapid, physics-based method of modeling advanced aircraft concepts at risk of structural failure due to dynamic aeroelastic instabilities. To adequately predict these instabilities, in particular for transonic applications, a generalized aerodynamic matching algorithm was implemented to correct the doublet-lattice model available in Nastran using solution data from a priori computational fluid dynamics anal- ysis. This new capability is demonstrated for two tube-and-wing aircraft configurations, including a Boeing 737-200 for implementation validation and the NASA D8 as a first use case. Results validate the current implementation of the aerodynamic matching utility and demonstrate the importance of using such a method for aircraft configurations featuring fuselage-wing aerodynamic interaction.
Anderson, Ruth A.; Hsieh, Pi-Ching; Su, Hui Fang; Landerman, Lawrence R.; McDaniel, Reuben R.
2013-01-01
Objectives. To (1) describe participation in decision-making as a systems-level property of complex adaptive systems and (2) present empirical evidence of reliability and validity of a corresponding measure. Method. Study 1 was a mail survey of a single respondent (administrators or directors of nursing) in each of 197 nursing homes. Study 2 was a field study using random, proportionally stratified sampling procedure that included 195 organizations with 3,968 respondents. Analysis. In Study 1, we analyzed the data to reduce the number of scale items and establish initial reliability and validity. In Study 2, we strengthened the psychometric test using a large sample. Results. Results demonstrated validity and reliability of the participation in decision-making instrument (PDMI) while measuring participation of workers in two distinct job categories (RNs and CNAs). We established reliability at the organizational level aggregated items scores. We established validity of the multidimensional properties using convergent and discriminant validity and confirmatory factor analysis. Conclusions. Participation in decision making, when modeled as a systems-level property of organization, has multiple dimensions and is more complex than is being traditionally measured. Managers can use this model to form decision teams that maximize the depth and breadth of expertise needed and to foster connection among them. PMID:24349771
Prototypicality ratings of DSM-III criteria for personality disorders.
Livesley, W J; Reiffer, L I; Sheldon, A E; West, M
1987-07-01
Although DSM-III personality disorder criteria have demonstrated acceptable reliability, the question of validity has not been adequately addressed. A first step in establishing the validity of diagnoses is to establish the validity of the criteria used to assess each diagnosis. The content validity of diagnostic criteria was investigated in relation to the larger set of potential criteria culled from the psychiatric literature. For each DSM-III axis II diagnosis, a panel of clinicians rated how prototypical each potential criterion was of the diagnosis in question. The results reveal problems with the organization and content of the criteria for most diagnoses. Many DSM-III criteria are composed of several statements linked by conjunctions or disjunctions. These component statements often received markedly different ratings, suggesting that criteria should be single statements. For most diagnoses, traits not included in DSM-III received higher ratings than did some DSM-III criteria. Suggestions are made to improve the distinctiveness and content validity of paranoid, schizoid, antisocial, borderline, avoidant, dependent, and compulsive personality disorders. The results for schizotypal personality disorder suggest that many clinicians are uncertain about this diagnosis. These findings provide a systematic way to modify definitions that contrasts with the more arbitrary ways in which diagnoses have previously been defined and redefined.
Dong, Ren G; Welcome, Daniel E; McDowell, Thomas W; Wu, John Z
2013-11-25
The relationship between the vibration transmissibility and driving-point response functions (DPRFs) of the human body is important for understanding vibration exposures of the system and for developing valid models. This study identified their theoretical relationship and demonstrated that the sum of the DPRFs can be expressed as a linear combination of the transmissibility functions of the individual mass elements distributed throughout the system. The relationship is verified using several human vibration models. This study also clarified the requirements for reliably quantifying transmissibility values used as references for calibrating the system models. As an example application, this study used the developed theory to perform a preliminary analysis of the method for calibrating models using both vibration transmissibility and DPRFs. The results of the analysis show that the combined method can theoretically result in a unique and valid solution of the model parameters, at least for linear systems. However, the validation of the method itself does not guarantee the validation of the calibrated model, because the validation of the calibration also depends on the model structure and the reliability and appropriate representation of the reference functions. The basic theory developed in this study is also applicable to the vibration analyses of other structures.
Chae, Han; Lee, Siwoo; Park, Soo Hyun; Jang, Eunsu; Lee, Soo Jin
2012-01-01
Objective. Sasang typology is a traditional Korean medicine based on the biopsychosocial perspectives of Neo-Confucianism and utilizes medical herbs and acupuncture for type-specific treatment. This study was designed to develop and validate the Sasang Personality Questionnaire (SPQ) for future use in the assessment of personality based on Sasang typology. Design and Methods. We selected questionnaire items using internal consistency analysis and examined construct validity with explorative factor analysis using 245 healthy participants. Test-retest reliability as well as convergent validity were examined. Results. The 14-item SPQ showed acceptable internal consistency (Cronbach's alpha = .817) and test-retest reliability (r = .837). Three extracted subscales, SPQ-behavior, SPQ-emotionality, and SPQ-cognition, were found, explaining 55.77% of the total variance. The SPQ significantly correlated with Temperament and Character Inventory novelty seeking (r = .462), harm avoidance (r = −.390), and NEO Personality Inventory extraversion (r = .629). The SPQ score of the So-Eum (24.43 ± 4.93), Tae-Eum (27.33 ± 5.88), and So-Yang (30.90 ± 5.23) types were significantly different from each other (P < .01). Conclusion. Current results demonstrated the reliability and validity of the SPQ and its subscales that can be utilized as an objective instrument for conducting personalized medicine research incorporating the biopsychosocial perspective. PMID:22567034
A Spanish Validation of the Canadian Adolescent Gambling Inventory (CAGI)
Jiménez-Murcia, Susana; Granero, Roser; Stinchfield, Randy; Tremblay, Joël; del Pino-Gutiérrez, Amparo; Moragas, Laura; Savvidou, Lamprini G.; Fernández-Aranda, Fernando; Aymamí, Neus; Gómez-Peña, Mónica; Tárrega, Salomé; Gunnard, Katarina; Martín-Romera, Virginia; Steward, Trevor; Mestre-Bach, Gemma; Menchón, José M.
2017-01-01
Aims: Large-scale epidemiological studies show a significant prevalence of gambling disorder (GD) during adolescence and emerging adulthood, and highlight the need to identify gambling-related behaviors at early ages. However, there are only a handful of screening instruments for this population and many studies measuring youth gambling problems use adult instruments that may not be developmentally appropriate. The aim of this study was to validate a Spanish version of the Canadian Adolescent Gambling Inventory (CAGI) among late adolescent and young adults and to explore its psychometric properties. Methods: The sample (16–29 years old) included a clinical group (n = 55) with GD patients and a control group (n = 340). Results: Exploratory factor analysis yielded one factor as the best model. This 24-item scale demonstrated satisfactory reliability (internal consistency, Cronbach’s alpha, α = 0.91), satisfactory convergent validity as measured by correlation with South Oaks Gambling Screen (r = 0.74), and excellent classification accuracy (AUC = 0.99; sensitivity = 0.98; and specificity = 0.99). Conclusion: Our results provide empirical support for our validation of the Spanish version of the CAGI. We uphold that the Spanish CAGI can be used as a brief, reliable, and valid instrument to assess gambling problems in Spanish youth. PMID:28223961
Validation of the Vanderbilt Holistic Face Processing Test.
Wang, Chao-Chih; Ross, David A; Gauthier, Isabel; Richler, Jennifer J
2016-01-01
The Vanderbilt Holistic Face Processing Test (VHPT-F) is a new measure of holistic face processing with better psychometric properties relative to prior measures developed for group studies (Richler et al., 2014). In fields where psychologists study individual differences, validation studies are commonplace and the concurrent validity of a new measure is established by comparing it to an older measure with established validity. We follow this approach and test whether the VHPT-F measures the same construct as the composite task, which is group-based measure at the center of the large literature on holistic face processing. In Experiment 1, we found a significant correlation between holistic processing measured in the VHPT-F and the composite task. Although this correlation was small, it was comparable to the correlation between holistic processing measured in the composite task with the same faces, but different target parts (top or bottom), which represents a reasonable upper limit for correlations between the composite task and another measure of holistic processing. These results confirm the validity of the VHPT-F by demonstrating shared variance with another measure of holistic processing based on the same operational definition. These results were replicated in Experiment 2, but only when the demographic profile of our sample matched that of Experiment 1.
Validation of the Vanderbilt Holistic Face Processing Test
Wang, Chao-Chih; Ross, David A.; Gauthier, Isabel; Richler, Jennifer J.
2016-01-01
The Vanderbilt Holistic Face Processing Test (VHPT-F) is a new measure of holistic face processing with better psychometric properties relative to prior measures developed for group studies (Richler et al., 2014). In fields where psychologists study individual differences, validation studies are commonplace and the concurrent validity of a new measure is established by comparing it to an older measure with established validity. We follow this approach and test whether the VHPT-F measures the same construct as the composite task, which is group-based measure at the center of the large literature on holistic face processing. In Experiment 1, we found a significant correlation between holistic processing measured in the VHPT-F and the composite task. Although this correlation was small, it was comparable to the correlation between holistic processing measured in the composite task with the same faces, but different target parts (top or bottom), which represents a reasonable upper limit for correlations between the composite task and another measure of holistic processing. These results confirm the validity of the VHPT-F by demonstrating shared variance with another measure of holistic processing based on the same operational definition. These results were replicated in Experiment 2, but only when the demographic profile of our sample matched that of Experiment 1. PMID:27933014
Anderson, Ruth A; Plowman, Donde; Corazzini, Kirsten; Hsieh, Pi-Ching; Su, Hui Fang; Landerman, Lawrence R; McDaniel, Reuben R
2013-01-01
Objectives. To (1) describe participation in decision-making as a systems-level property of complex adaptive systems and (2) present empirical evidence of reliability and validity of a corresponding measure. Method. Study 1 was a mail survey of a single respondent (administrators or directors of nursing) in each of 197 nursing homes. Study 2 was a field study using random, proportionally stratified sampling procedure that included 195 organizations with 3,968 respondents. Analysis. In Study 1, we analyzed the data to reduce the number of scale items and establish initial reliability and validity. In Study 2, we strengthened the psychometric test using a large sample. Results. Results demonstrated validity and reliability of the participation in decision-making instrument (PDMI) while measuring participation of workers in two distinct job categories (RNs and CNAs). We established reliability at the organizational level aggregated items scores. We established validity of the multidimensional properties using convergent and discriminant validity and confirmatory factor analysis. Conclusions. Participation in decision making, when modeled as a systems-level property of organization, has multiple dimensions and is more complex than is being traditionally measured. Managers can use this model to form decision teams that maximize the depth and breadth of expertise needed and to foster connection among them.
Translating expert system rules into Ada code with validation and verification
NASA Technical Reports Server (NTRS)
Becker, Lee; Duckworth, R. James; Green, Peter; Michalson, Bill; Gosselin, Dave; Nainani, Krishan; Pease, Adam
1991-01-01
The purpose of this ongoing research and development program is to develop software tools which enable the rapid development, upgrading, and maintenance of embedded real-time artificial intelligence systems. The goals of this phase of the research were to investigate the feasibility of developing software tools which automatically translate expert system rules into Ada code and develop methods for performing validation and verification testing of the resultant expert system. A prototype system was demonstrated which automatically translated rules from an Air Force expert system was demonstrated which detected errors in the execution of the resultant system. The method and prototype tools for converting AI representations into Ada code by converting the rules into Ada code modules and then linking them with an Activation Framework based run-time environment to form an executable load module are discussed. This method is based upon the use of Evidence Flow Graphs which are a data flow representation for intelligent systems. The development of prototype test generation and evaluation software which was used to test the resultant code is discussed. This testing was performed automatically using Monte-Carlo techniques based upon a constraint based description of the required performance for the system.
Weathers, Frank W; Bovin, Michelle J; Lee, Daniel J; Sloan, Denise M; Schnurr, Paula P; Kaloupek, Danny G; Keane, Terence M; Marx, Brian P
2018-03-01
The Clinician-Administered PTSD Scale (CAPS) is an extensively validated and widely used structured diagnostic interview for posttraumatic stress disorder (PTSD). The CAPS was recently revised to correspond with PTSD criteria in the fifth edition of the Diagnostic and Statistical Manual of Mental Disorders (DSM-5; American Psychiatric Association, 2013). This article describes the development of the CAPS for DSM-5 (CAPS-5) and presents the results of an initial psychometric evaluation of CAPS-5 scores in 2 samples of military veterans (Ns = 165 and 207). CAPS-5 diagnosis demonstrated strong interrater reliability (к = .78 to 1.00, depending on the scoring rule) and test-retest reliability (к = .83), as well as strong correspondence with a diagnosis based on the CAPS for DSM-IV (CAPS-IV; к = .84 when optimally calibrated). CAPS-5 total severity score demonstrated high internal consistency (α = .88) and interrater reliability (ICC = .91) and good test-retest reliability (ICC = .78). It also demonstrated good convergent validity with total severity score on the CAPS-IV (r = .83) and PTSD Checklist for DSM-5 (r = .66) and good discriminant validity with measures of anxiety, depression, somatization, functional impairment, psychopathy, and alcohol abuse (rs = .02 to .54). Overall, these results indicate that the CAPS-5 is a psychometrically sound measure of DSM-5 PTSD diagnosis and symptom severity. Importantly, the CAPS-5 strongly corresponds with the CAPS-IV, which suggests that backward compatibility with the CAPS-IV was maintained and that the CAPS-5 provides continuity in evidence-based assessment of PTSD in the transition from DSM-IV to DSM-5 criteria. (PsycINFO Database Record (c) 2018 APA, all rights reserved).
40 CFR 63.5725 - What are the requirements for monitoring and demonstrating continuous compliance?
Code of Federal Regulations, 2014 CFR
2014-07-01
... Pollutants for Boat Manufacturing Demonstrating Compliance for Open Molding Operations Controlled by Add-on... successive cycles of operation to have a valid hour of data. (2) You must have valid data from at least 90... parameter monitoring system and collect emission capture system and add-on control device parameter data at...
40 CFR 63.5725 - What are the requirements for monitoring and demonstrating continuous compliance?
Code of Federal Regulations, 2013 CFR
2013-07-01
... Pollutants for Boat Manufacturing Demonstrating Compliance for Open Molding Operations Controlled by Add-on... successive cycles of operation to have a valid hour of data. (2) You must have valid data from at least 90... parameter monitoring system and collect emission capture system and add-on control device parameter data at...
Schry, Amie R; Roberson-Nay, Roxann; White, Susan W
2012-12-01
Social anxiety disorder (SAD) is 1 of the most prevalent psychological disorders, and among college students in particular, social anxiety has been associated with other problems such as substance use problems and increased vulnerability to other psychiatric disorders. The Social Phobia and Anxiety Inventory-23 (SPAI-23; Roberson-Nay, Strong, Nay, Beidel, & Turner, 2007) may be a useful, brief measure of problematic social anxiety in college students. Results from 4 studies (total n = 2,436) using the SPAI-23 with college student samples are presented. Scores on the SPAI-23 demonstrated strong convergent validity with other measures of social anxiety and discriminant validity as evidenced by lower correlations with measures of dissimilar constructs. Difference scores on the SPAI-23 also demonstrated adequate test-retest reliability over 5 ½ weeks (r = .72). Exploratory factor analysis suggested a two-factor structure: social anxiety and agoraphobia. Finally, differential item function analyses suggested that the items function similarly in men and women. In conclusion, the SPAI-23 demonstrated strong psychometric properties for use with college students.
Wang, Meng-Cheng; Gao, Yu; Deng, Jiaxin; Lai, Hongyu; Deng, Qiaowen; Armour, Cherie
2017-01-01
The current study assesses the factor structure and construct validity of the self-reported Inventory of Callous-Unemotional Traits (ICU) in 637 Chinese community adults (mean age = 25.98, SD = 5.79). A series of theoretical models proposed in previous studies were tested through confirmatory factor analyses. Results indicated that a shortened form that consists of 11 items (ICU-11) to assess callousness and uncaring factors has excellent overall fit. Additionally, correlations with a wide range of external variables demonstrated that this shortened form has similar construct validity compared to the original ICU. In conclusion, our findings suggest that the ICU-11 may be a promising self-report tool that could be a good substitute for the original form to assess callous-uncaring traits in adults.
NASA Astrophysics Data System (ADS)
Dufaux, Frederic
2011-06-01
The issue of privacy in video surveillance has drawn a lot of interest lately. However, thorough performance analysis and validation is still lacking, especially regarding the fulfillment of privacy-related requirements. In this paper, we first review recent Privacy Enabling Technologies (PET). Next, we discuss pertinent evaluation criteria for effective privacy protection. We then put forward a framework to assess the capacity of PET solutions to hide distinguishing facial information and to conceal identity. We conduct comprehensive and rigorous experiments to evaluate the performance of face recognition algorithms applied to images altered by PET. Results show the ineffectiveness of naïve PET such as pixelization and blur. Conversely, they demonstrate the effectiveness of more sophisticated scrambling techniques to foil face recognition.
NASA Technical Reports Server (NTRS)
Lyle, Karen H.
2015-01-01
Acceptance of new spacecraft structural architectures and concepts requires validated design methods to minimize the expense involved with technology demonstration via flight-testing. Hypersonic Inflatable Aerodynamic Decelerator (HIAD) architectures are attractive for spacecraft deceleration because they are lightweight, store compactly, and utilize the atmosphere to decelerate a spacecraft during entry. However, designers are hesitant to include these inflatable approaches for large payloads or spacecraft because of the lack of flight validation. This publication summarizes results comparing analytical results with test data for two concepts subjected to representative entry, static loading. The level of agreement and ability to predict the load distribution is considered sufficient to enable analytical predictions to be used in the design process.
Software development predictors, error analysis, reliability models and software metric analysis
NASA Technical Reports Server (NTRS)
Basili, Victor
1983-01-01
The use of dynamic characteristics as predictors for software development was studied. It was found that there are some significant factors that could be useful as predictors. From a study on software errors and complexity, it was shown that meaningful results can be obtained which allow insight into software traits and the environment in which it is developed. Reliability models were studied. The research included the field of program testing because the validity of some reliability models depends on the answers to some unanswered questions about testing. In studying software metrics, data collected from seven software engineering laboratory (FORTRAN) projects were examined and three effort reporting accuracy checks were applied to demonstrate the need to validate a data base. Results are discussed.
An empirical assessment of validation practices for molecular classifiers
Castaldi, Peter J.; Dahabreh, Issa J.
2011-01-01
Proposed molecular classifiers may be overfit to idiosyncrasies of noisy genomic and proteomic data. Cross-validation methods are often used to obtain estimates of classification accuracy, but both simulations and case studies suggest that, when inappropriate methods are used, bias may ensue. Bias can be bypassed and generalizability can be tested by external (independent) validation. We evaluated 35 studies that have reported on external validation of a molecular classifier. We extracted information on study design and methodological features, and compared the performance of molecular classifiers in internal cross-validation versus external validation for 28 studies where both had been performed. We demonstrate that the majority of studies pursued cross-validation practices that are likely to overestimate classifier performance. Most studies were markedly underpowered to detect a 20% decrease in sensitivity or specificity between internal cross-validation and external validation [median power was 36% (IQR, 21–61%) and 29% (IQR, 15–65%), respectively]. The median reported classification performance for sensitivity and specificity was 94% and 98%, respectively, in cross-validation and 88% and 81% for independent validation. The relative diagnostic odds ratio was 3.26 (95% CI 2.04–5.21) for cross-validation versus independent validation. Finally, we reviewed all studies (n = 758) which cited those in our study sample, and identified only one instance of additional subsequent independent validation of these classifiers. In conclusion, these results document that many cross-validation practices employed in the literature are potentially biased and genuine progress in this field will require adoption of routine external validation of molecular classifiers, preferably in much larger studies than in current practice. PMID:21300697
Clinical evaluation of a new noninvasive ankle arthrometer.
Nauck, Tanja; Lohrer, Heinz; Gollhofer, Albert
2010-06-01
A nonradiographic arthrometer was developed to objectively quantify anterior talar drawer instability in stable and unstable ankles. Diagnostic validity of this device was previously demonstrated in a cadaver study. The aim of the present study was to validate the ankle arthrometer in an in vivo setting. Twenty-three subjects participated in the study. An orthopedic surgeon first performed a manual anterior talar drawer test to classify the subjects' ankles as stable or unstable. The subjects were then evaluated using the ankle arthrometer, and filled out a validated self-reported questionnaire (German version of the Foot and Ankle Ability Measure [FAAM-G]). Ankle stiffness was calculated from the low linear region (40-60 N) of the load deformation curves obtained from the ankle arthrometer. Reliability testing of these stiffness values was done based on load deformation curves, with 150 and 200 N maximum anterior drawer loads applied in the ankle arthrometer. Using the manual anterior drawer test, 16 ankles were classified as stable and 7 were classified as unstable. Arthrometer stiffness analysis differentiated stable from unstable ankles (P = 0.00 and P = 0.01, respectively). Test-retest demonstrated an accurate reliability (intraclass correlation coefficient = 0.80). A significant correlation was found between both FAAM-G subscales and the arthrometer stiffness values (r = 0.43 and 0.54; P = 0.04 and 0.01). Discussion Subjects with and without mechanical ankle instability could be differentiated by ankle arthrometer stiffness analysis and the FAAM-G questionnaire results. This nonradiographic device may be relevant for screening athletes at risk for ankle injuries, for clinical follow-up studies, and implementing preventive strategies. Validity and reliability of the new ankle arthrometer is demonstrated in a small cohort in an in vivo setting.
NASA Astrophysics Data System (ADS)
Singh-Moon, Rajinder P.; Zaryab, Mohammad; Hendon, Christine P.
2017-02-01
Electroanatomical mapping (EAM) is an invaluable tool for guiding cardiac radiofrequency ablation (RFA) therapy. The principle roles of EAM is the identification of candidate ablation sites by detecting regions of abnormal electrogram activity and lesion validation subsequent to RF energy delivery. However, incomplete lesions may present interim electrical inactivity similar to effective treatment in the acute setting, despite efforts to reveal them with pacing or drugs, such as adenosine. Studies report that the misidentification and recovery of such lesions is a leading cause of arrhythmia recurrence and repeat procedures. In previous work, we demonstrated spectroscopic characterization of cardiac tissues using a fiber optic-integrated RF ablation catheter. In this work, we introduce OSAM (optical spectroscopic anatomical mapping), the application of this spectroscopic technique to obtain 2-dimensional biodistribution maps. We demonstrate its diagnostic potential as an auxiliary method for lesion validation in treated swine preparations. Endocardial lesion sets were created on fresh swine cardiac samples using a commercial RFA system. An optically-integrated catheter console fabricated in-house was used for measurement of tissue optical spectra between 600-1000nm. Three dimensional, Spatio-spectral datasets were generated by raster scanning of the optical catheter across the treated sample surface in the presence of whole blood. Tissue optical parameters were recovered at each spatial position using an inverse Monte Carlo method. OSAM biodistribution maps showed stark correspondence with gross examination of tetrazolium chloride stained tissue specimens. Specifically, we demonstrate the ability of OSAM to readily distinguish between shallow and deeper lesions, a limitation faced by current EAM techniques. These results showcase the OSAMs potential for lesion validation strategies for the treatment of cardiac arrhythmias.
Panepinto, Julie A; Torres, Sylvia; Bendo, Cristiane B; McCavit, Timothy L; Dinu, Bogdan; Sherman-Bien, Sandra; Bemrich-Stolz, Christy; Varni, James W
2014-01-01
Sickle cell disease (SCD) is an inherited blood disorder characterized by a chronic hemolytic anemia that can contribute to fatigue and global cognitive impairment in patients. The study objective was to report on the feasibility, reliability, and validity of the PedsQL™ Multidimensional Fatigue Scale in SCD for pediatric patient self-report ages 5-18 years and parent proxy-report for ages 2-18 years. This was a cross-sectional multi-site study whereby 240 pediatric patients with SCD and 303 parents completed the 18-item PedsQL™ Multidimensional Fatigue Scale. Participants also completed the PedsQL™ 4.0 Generic Core Scales. The PedsQL™ Multidimensional Fatigue Scale evidenced excellent feasibility, excellent reliability for the Total Scale Scores (patient self-report α = 0.90; parent proxy-report α = 0.95), and acceptable reliability for the three individual scales (patient self-report α = 0.77-0.84; parent proxy-report α = 0.90-0.97). Intercorrelations of the PedsQL™ Multidimensional Fatigue Scale with the PedsQL™ Generic Core Scales were predominantly in the large (≥0.50) range, supporting construct validity. PedsQL™ Multidimensional Fatigue Scale Scores were significantly worse with large effects sizes (≥0.80) for patients with SCD than for a comparison sample of healthy children, supporting known-groups discriminant validity. Confirmatory factor analysis demonstrated an acceptable to excellent model fit in SCD. The PedsQL™ Multidimensional Fatigue Scale demonstrated acceptable to excellent measurement properties in SCD. The results demonstrate the relative severity of fatigue symptoms in pediatric patients with SCD, indicating the potential clinical utility of multidimensional assessment of fatigue in patients with SCD in clinical research and practice. © 2013 Wiley Periodicals, Inc.
Can I Trust This Software Package? An Exercise in Validation of Computational Results
ERIC Educational Resources Information Center
Shacham, Mordechai; Brauner, Neima; Ashurst, W. Robert; Cutlip, Michael B.
2008-01-01
Mathematical software packages such as Polymath, MATLAB, and Mathcad are currently widely used for engineering problem solving. Applications of several of these packages to typical chemical engineering problems have been demonstrated by Cutlip, et al. The main characteristic of these packages is that they provide a "problem-solving environment…
ERIC Educational Resources Information Center
Dodrill, Carl B.; Clemmons, David
1984-01-01
Examined the validity of intellectual, neuropsychological, and emotional adjustment measures administered in high school in predicting vocational adjustment of 39 young adults with epilepsy. Results showed neuropsychological tests were the best predictors of later adjustment. Abilities were more related to final adjustment than variables…
ERIC Educational Resources Information Center
Elhai, Jon D.; Gray, Matthew J.; Naifeh, James A.; Butcher, Jimmie J.; Davis, Joanne L.; Falsetti, Sherry A.; Best, Connie L.
2005-01-01
The authors examined the Trauma Symptom Inventorys (TSI) ability to discriminate 88 student post-traumatic stress disorder (PTSD) simulators screened for genuine PTSD from 48 clinical PTSD-diagnosed outpatients. Results demonstrated between-group differences on several TSI clinical scales and the Atypical Response (ATR) validity scale.…
ERIC Educational Resources Information Center
Novotny, Eric; Rimland, Emily
2007-01-01
This article discusses a service quality study conducted in the Pennsylvania State University Libraries. The Wisconsin-Ohio Reference Evaluation Program survey was selected as a valid, standardized instrument. We present our results, highlighting the impact on reference training. A second survey a year later demonstrated that focusing on…
ERIC Educational Resources Information Center
Schneider, W. Joel; Roman, Zachary
2018-01-01
We used data simulations to test whether composites consisting of cohesive subtest scores are more accurate than composites consisting of divergent subtest scores. We demonstrate that when multivariate normality holds, divergent and cohesive scores are equally accurate. Furthermore, excluding divergent scores results in biased estimates of…
The Effect of Stakes on Accountability Test Scores and Pass Rates
ERIC Educational Resources Information Center
Steedle, Jeffrey T.; Grochowalski, Joseph
2017-01-01
Students may not fully demonstrate their knowledge and skills on accountability tests if there are no stakes attached to individual performance. In that case, assessment results may not accurately reflect student achievement, so the validity of score interpretations and uses suffers. For this study, matched samples of students taking state…
Multiple Subtypes among Vocationally Undecided College Students: A Model and Assessment Instrument.
ERIC Educational Resources Information Center
Jones, Lawrence K.; Chenery, Mary Faeth
1980-01-01
A model of vocational decision status was developed, and an instrument was constructed and used to assess its three dimensions. Results demonstrated the utility of the model, supported the reliability and validity of the instrument, and illustrated the value of viewing vocationally undecided students as multiple subtypes. (Author)
Shoemaker, Sarah J.; Wolf, Michael S.; Brach, Cindy
2016-01-01
Objective To develop a reliable and valid instrument to assess the understandability and actionability of print and audiovisual materials. Methods We compiled items from existing instruments/guides that the expert panel assessed for face/content validity. We completed four rounds of reliability testing, and produced evidence of construct validity with consumers and readability assessments. Results The experts deemed the PEMAT items face/content valid. Four rounds of reliability testing and refinement were conducted using raters untrained on the PEMAT. Agreement improved across rounds. The final PEMAT showed moderate agreement per Kappa (Average K = 0.57) and strong agreement per Gwet’s AC1 (Average = 0.74). Internal consistency was strong (α = 0.71; Average Item-Total Correlation = 0.62). For construct validation with consumers (n = 47), we found significant differences between actionable and poorly-actionable materials in comprehension scores (76% vs. 63%, p < 0.05) and ratings (8.9 vs. 7.7, p < 0.05). For understandability, there was a significant difference for only one of two topics on consumer numeric scores. For actionability, there were significant positive correlations between PEMAT scores and consumer-testing results, but no relationship for understandability. There were, however, strong, negative correlations between grade-level and both consumer-testing results and PEMAT scores. Conclusions The PEMAT demonstrated strong internal consistency, reliability, and evidence of construct validity. Practice implications The PEMAT can help professionals judge the quality of materials (available at: http://www.ahrq.gov/pemat). PMID:24973195
Espinosa-Montero, Juan; Monterrubio-Flores, Eric A.; Sanchez-Estrada, Marcela; Buendia-Jimenez, Inmaculada; Lieberman, Harris R.; Allaert, François-Andre; Barquera, Simon
2016-01-01
Background Ingestion of water has been associated with general wellbeing. When water intake is insufficient, symptoms such as thirst, fatigue and impaired memory result. Currently there are no instruments to assess water consumption associated with wellbeing. The objective of our study was to develop and validate such an instrument in urban, low socioeconomic, adult Mexican population. Methods To construct the Water Ingestion-Related Wellbeing Instrument (WIRWI), a qualitative study in which wellbeing related to everyday practices and experiences in water consumption were investigated. To validate the WIRWI a formal, five-process procedure was used. Face and content validation were addressed, consistency was assessed by exploratory and confirmatory psychometric factor analyses, repeatability, reproducibility and concurrent validity were assessed by conducting correlation tests with other measures of wellbeing such as a quality of life instrument, the SF-36, and objective parameters such as urine osmolality, 24-hour urine total volume and others. Results The final WIRWI is composed of 17 items assessing physical and mental dimensions. Items were selected based on their content and face validity. Exploratory and confirmatory factor analyses yielded Cronbach's alpha of 0.87 and 0.86, respectively. The final confirmatory factor analysis demonstrated that the model estimates were satisfactory for the constructs. Statistically significant correlations with the SF-36, total liquid consumption and simple water consumption were observed. Conclusion The resulting WIRWI is a reliable tool for assessing wellbeing associated with consumption of plain water in Mexican adults and could be useful for similar groups. PMID:27388902
Hales, M; Biros, E; Reznik, J E
2015-01-01
Since 1982, the International Standards for Neurological Classification of Spinal Cord Injury (ISNCSCI) has been used to classify sensation of spinal cord injury (SCI) through pinprick and light touch scores. The absence of proprioception, pain, and temperature within this scale creates questions about its validity and accuracy. To assess whether the sensory component of the ISNCSCI represents a reliable and valid measure of classification of SCI. A systematic review of studies examining the reliability and validity of the sensory component of the ISNCSCI published between 1982 and February 2013 was conducted. The electronic databases MEDLINE via Ovid, CINAHL, PEDro, and Scopus were searched for relevant articles. A secondary search of reference lists was also completed. Chosen articles were assessed according to the Oxford Centre for Evidence-Based Medicine hierarchy of evidence and critically appraised using the McMasters Critical Review Form. A statistical analysis was conducted to investigate the variability of the results given by reliability studies. Twelve studies were identified: 9 reviewed reliability and 3 reviewed validity. All studies demonstrated low levels of evidence and moderate critical appraisal scores. The majority of the articles (~67%; 6/9) assessing the reliability suggested that training was positively associated with better posttest results. The results of the 3 studies that assessed the validity of the ISNCSCI scale were confounding. Due to the low to moderate quality of the current literature, the sensory component of the ISNCSCI requires further revision and investigation if it is to be a useful tool in clinical trials.
The Chinese version of the Outcome Expectations for Exercise scale: validation study.
Lee, Ling-Ling; Chiu, Yu-Yun; Ho, Chin-Chih; Wu, Shu-Chen; Watson, Roger
2011-06-01
Estimates of the reliability and validity of the English nine-item Outcome Expectations for Exercise (OEE) scale have been tested and found to be valid for use in various settings, particularly among older people, with good internal consistency and validity. Data on the use of the OEE scale among older Chinese people living in the community and how cultural differences might affect the administration of the OEE scale are limited. To test the validity and reliability of the Chinese version of the Outcome Expectations for Exercise scale among older people. A cross-sectional validation study was designed to test the Chinese version of the OEE scale (OEE-C). Reliability was examined by testing both the internal consistency for the overall scale and the squared multiple correlation coefficient for the single item measure. The validity of the scale was tested on the basis of both a traditional psychometric test and a confirmatory factor analysis using structural equation modelling. The Mokken Scaling Procedure (MSP) was used to investigate if there were any hierarchical, cumulative sets of items in the measure. The OEE-C scale was tested in a group of older people in Taiwan (n=108, mean age=77.1). There was acceptable internal consistency (alpha=.85) and model fit in the scale. Evidence of the validity of the measure was demonstrated by the tests for criterion-related validity and construct validity. There was a statistically significant correlation between exercise outcome expectations and exercise self-efficacy (r=.34, p<.01). An analysis of the Mokken Scaling Procedure found that nine items of the scale were all retained in the analysis and the resulting scale was reliable and statistically significant (p=.0008). The results obtained in the present study provided acceptable levels of reliability and validity evidence for the Chinese Outcome Expectations for Exercise scale when used with older people in Taiwan. Future testing of the OEE-C scale needs to be carried out to see whether these results are generalisable to older Chinese people living in urban areas. Copyright © 2010 Elsevier Ltd. All rights reserved.
Can a low-cost webcam be used for a remote neurological exam?
Wood, Jeffrey; Wallin, Mitchell; Finkelstein, Joseph
2013-01-01
Multiple sclerosis (MS) is a demyelinating and axonal degenerative disease of the central nervous system. It is the most common progressive neurological disorder of young adults affecting over 1 million persons worldwide. Despite the increased use of neuroimaging and other tools to measure MS morbidity, the neurological examination remains the primary method to document relapses and progression in disease. The goal of this study was to demonstrate the feasibility and validity of using a low-cost webcam for remote neurological examination in home-setting for patients with MS. Using cross-over design, 20 MS patients were evaluated in-person and via remote televisit and results of the neurological evaluation were compared. Overall, we found that agreement between face-to-face and remote EDSS evaluation was sufficient to provide clinically valid information. Another important finding of this study was high acceptance of patients and their providers of using remote televisits for conducting neurological examinations at MS patient homes. The results of this study demonstrated potential of using low-cost webcams for remote neurological exam in patients with MS.
A Portuguese version of the student-teacher relationship scale - short form.
Patrício, Joana Nunes; Barata, M Clara; Calheiros, M Manuela; Graça, João
2015-05-20
Research consistently demonstrates that positive student-teacher relationships are fundamental to the healthy development of all students. However, we lack a Portuguese-validated measure of student-teacher relationships. In this article we present the adaptation procedures and the psychometric properties of a Portuguese version of the Student-Teacher Relationship Scale - Short Form (Pianta, 1992). Five hundred and thirty five teachers from 127 schools completed the STRS-SF. The results demonstrate that this adapted version of the STRS-SF has good psychometric properties, namely high reliability (α = .84 to .87) and expected construct validity, which were tested through exploratory and confirmatory factor analyses (χ2/df = 1.65, CFI = .96, GFI = .93, RMSEA = 0.05). This study also showed that the correlations of student-teacher relationship with students' demographic variables are consistent with the evidence in the literature about this construct. Finally, the study indicated that female teachers reported more closeness, t(530) = 4.06, p < .001 and better overall student-teacher relationships, t(530) = 4.90, p < .001. In the discussion, we analyze the implications of these results.
Comparison between Inbreeding Analyses Methodologies.
Esparza, Mireia; Martínez-Abadías, Neus; Sjøvold, Torstein; González-José, Rolando; Hernández, Miquel
2015-12-01
Surnames are widely used in inbreeding analysis, but the validity of results has often been questioned due to the failure to comply with the prerequisites of the method. Here we analyze inbreeding in Hallstatt (Austria) between the 17th and the 19th centuries both using genealogies and surnames. The high and significant correlation of the results obtained by both methods demonstrates the validity of the use of surnames in this kind of studies. On the other hand, the inbreeding values obtained (0.24 x 10⁻³ in the genealogies analysis and 2.66 x 10⁻³ in the surnames analysis) are lower than those observed in Europe for this period and for this kind of population, demonstrating the falseness of the apparent isolation of Hallstatt's population. The temporal trend of inbreeding in both analyses does not follow the European general pattern, but shows a maximum in 1850 with a later decrease along the second half of the 19th century. This is probably due to the high migration rate that is implied by the construction of transport infrastructures around the 1870's.
Development and Validation of the Primary Care Team Dynamics Survey
Song, Hummy; Chien, Alyna T; Fisher, Josephine; Martin, Julia; Peters, Antoinette S; Hacker, Karen; Rosenthal, Meredith B; Singer, Sara J
2015-01-01
Objective To develop and validate a survey instrument designed to measure team dynamics in primary care. Data Sources/Study Setting We studied 1,080 physician and nonphysician health care professionals working at 18 primary care practices participating in a learning collaborative aimed at improving team-based care. Study Design We developed a conceptual model and administered a cross-sectional survey addressing team dynamics, and we assessed reliability and discriminant validity of survey factors and the overall survey's goodness-of-fit using structural equation modeling. Data Collection We administered the survey between September 2012 and March 2013. Principal Findings Overall response rate was 68 percent (732 respondents). Results support a seven-factor model of team dynamics, suggesting that conditions for team effectiveness, shared understanding, and three supportive processes are associated with acting and feeling like a team and, in turn, perceived team effectiveness. This model demonstrated adequate fit (goodness-of-fit index: 0.91), scale reliability (Cronbach's alphas: 0.71–0.91), and discriminant validity (average factor correlations: 0.49). Conclusions It is possible to measure primary care team dynamics reliably using a 29-item survey. This survey may be used in ambulatory settings to study teamwork and explore the effect of efforts to improve team-based care. Future studies should demonstrate the importance of team dynamics for markers of team effectiveness (e.g., work satisfaction, care quality, clinical outcomes). PMID:25423886
Development and validation of the primary care team dynamics survey.
Song, Hummy; Chien, Alyna T; Fisher, Josephine; Martin, Julia; Peters, Antoinette S; Hacker, Karen; Rosenthal, Meredith B; Singer, Sara J
2015-06-01
To develop and validate a survey instrument designed to measure team dynamics in primary care. We studied 1,080 physician and nonphysician health care professionals working at 18 primary care practices participating in a learning collaborative aimed at improving team-based care. We developed a conceptual model and administered a cross-sectional survey addressing team dynamics, and we assessed reliability and discriminant validity of survey factors and the overall survey's goodness-of-fit using structural equation modeling. We administered the survey between September 2012 and March 2013. Overall response rate was 68 percent (732 respondents). Results support a seven-factor model of team dynamics, suggesting that conditions for team effectiveness, shared understanding, and three supportive processes are associated with acting and feeling like a team and, in turn, perceived team effectiveness. This model demonstrated adequate fit (goodness-of-fit index: 0.91), scale reliability (Cronbach's alphas: 0.71-0.91), and discriminant validity (average factor correlations: 0.49). It is possible to measure primary care team dynamics reliably using a 29-item survey. This survey may be used in ambulatory settings to study teamwork and explore the effect of efforts to improve team-based care. Future studies should demonstrate the importance of team dynamics for markers of team effectiveness (e.g., work satisfaction, care quality, clinical outcomes). © Health Research and Educational Trust.
Fattori, B; Giusti, P; Mancini, V; Grosso, M; Barillari, M R; Bastiani, L; Molinaro, S; Nacci, A
2016-10-01
The purpose of this study was to compare videofluoroscopy (VFS), fiberoptic endoscopic evaluation of swallowing (FEES) and oro-pharyngo- oesophageal scintigraphy (OPES) with regards to premature spillage, post-swallowing residue and aspiration to assess the reliability of these tests for detection of oro-pharyngeal dysphagia. Sixty patients affected with dysphagia of various origin were enrolled in the study and submitted to VFS, FEES and OPES using a liquid and semi-solid bolus. As a reference, we used VFS. Both the FEES and the OPES showed good sensitivity with high overall values (≥ 80% and ≥ 90% respectively). The comparison between FEES vs VFS concerning drop before swallowing showed good specificity (84.4% for semi-solids and 86.7% for liquids). In the case of post-swallowing residue, FEES vs VFS revealed good overall validity (75% for semi-solids) with specificity and sensitivity well balanced for the semi-solids. OPES vs. VFS demonstrated good sensitivity (88.6%) and overall validity (76.7%) for liquids. The analysis of FEES vs. VFS for aspiration showed that the overall validity was low (≤ 65%). On the other hand, OPES demonstrated appreciable overall validity (71.7%). VFS, FEES and OPES are capable of detecting oro-pharyngeal dysphagia. FEES gave significant results in the evaluation of post-swallowing residues. © Copyright by Società Italiana di Otorinolaringologia e Chirurgia Cervico-Facciale, Rome, Italy.
Development and Testing of a Method for Validating Chemical Inactivation of Ebola Virus.
Alfson, Kendra J; Griffiths, Anthony
2018-03-13
Complete inactivation of infectious Ebola virus (EBOV) is required before a sample may be removed from a Biosafety Level 4 laboratory. The United States Federal Select Agent Program regulations require that procedures used to demonstrate chemical inactivation must be validated in-house to confirm complete inactivation. The objective of this study was to develop a method for validating chemical inactivation of EBOV and then demonstrate the effectiveness of several commonly-used inactivation methods. Samples containing infectious EBOV ( Zaire ebolavirus ) in different matrices were treated, and the sample was diluted to limit the cytopathic effect of the inactivant. The presence of infectious virus was determined by assessing the cytopathic effect in Vero E6 cells. Crucially, this method did not result in a loss of infectivity in control samples, and we were able to detect less than five infectious units of EBOV ( Zaire ebolavirus ). We found that TRIzol LS reagent and RNA-Bee inactivated EBOV in serum; TRIzol LS reagent inactivated EBOV in clarified cell culture media; TRIzol reagent inactivated EBOV in tissue and infected Vero E6 cells; 10% neutral buffered formalin inactivated EBOV in tissue; and osmium tetroxide vapors inactivated EBOV on transmission electron microscopy grids. The methods described herein are easily performed and can be adapted to validate inactivation of viruses in various matrices and by various chemical methods.
Real-Time PCR Method for Detection of Salmonella spp. in Environmental Samples.
Kasturi, Kuppuswamy N; Drgon, Tomas
2017-07-15
The methods currently used for detecting Salmonella in environmental samples require 2 days to produce results and have limited sensitivity. Here, we describe the development and validation of a real-time PCR Salmonella screening method that produces results in 18 to 24 h. Primers and probes specific to the gene invA , group D, and Salmonella enterica serovar Enteritidis organisms were designed and evaluated for inclusivity and exclusivity using a panel of 329 Salmonella isolates representing 126 serovars and 22 non- Salmonella organisms. The invA - and group D-specific sets identified all the isolates accurately. The PCR method had 100% inclusivity and detected 1 to 2 copies of Salmonella DNA per reaction. Primers specific for Salmonella -differentiating fragment 1 (Sdf-1) in conjunction with the group D set had 100% inclusivity for 32 S Enteritidis isolates and 100% exclusivity for the 297 non-Enteritidis Salmonella isolates. Single-laboratory validation performed on 1,741 environmental samples demonstrated that the PCR method detected 55% more positives than the V itek i mmuno d iagnostic a ssay s ystem (VIDAS) method. The PCR results correlated well with the culture results, and the method did not report any false-negative results. The receiver operating characteristic (ROC) analysis documented excellent agreement between the results from the culture and PCR methods (area under the curve, 0.90; 95% confidence interval of 0.76 to 1.0) confirming the validity of the PCR method. IMPORTANCE This validated PCR method detects 55% more positives for Salmonella in half the time required for the reference method, VIDAS. The validated PCR method will help to strengthen public health efforts through rapid screening of Salmonella spp. in environmental samples.
Real-Time PCR Method for Detection of Salmonella spp. in Environmental Samples
Drgon, Tomas
2017-01-01
ABSTRACT The methods currently used for detecting Salmonella in environmental samples require 2 days to produce results and have limited sensitivity. Here, we describe the development and validation of a real-time PCR Salmonella screening method that produces results in 18 to 24 h. Primers and probes specific to the gene invA, group D, and Salmonella enterica serovar Enteritidis organisms were designed and evaluated for inclusivity and exclusivity using a panel of 329 Salmonella isolates representing 126 serovars and 22 non-Salmonella organisms. The invA- and group D-specific sets identified all the isolates accurately. The PCR method had 100% inclusivity and detected 1 to 2 copies of Salmonella DNA per reaction. Primers specific for Salmonella-differentiating fragment 1 (Sdf-1) in conjunction with the group D set had 100% inclusivity for 32 S. Enteritidis isolates and 100% exclusivity for the 297 non-Enteritidis Salmonella isolates. Single-laboratory validation performed on 1,741 environmental samples demonstrated that the PCR method detected 55% more positives than the Vitek immunodiagnostic assay system (VIDAS) method. The PCR results correlated well with the culture results, and the method did not report any false-negative results. The receiver operating characteristic (ROC) analysis documented excellent agreement between the results from the culture and PCR methods (area under the curve, 0.90; 95% confidence interval of 0.76 to 1.0) confirming the validity of the PCR method. IMPORTANCE This validated PCR method detects 55% more positives for Salmonella in half the time required for the reference method, VIDAS. The validated PCR method will help to strengthen public health efforts through rapid screening of Salmonella spp. in environmental samples. PMID:28500041
Mani, Suresh; Sharma, Shobha; Omar, Baharudin; Paungmali, Aatit; Joseph, Leonard
2017-04-01
Purpose The purpose of this review is to systematically explore and summarise the validity and reliability of telerehabilitation (TR)-based physiotherapy assessment for musculoskeletal disorders. Method A comprehensive systematic literature review was conducted using a number of electronic databases: PubMed, EMBASE, PsycINFO, Cochrane Library and CINAHL, published between January 2000 and May 2015. The studies examined the validity, inter- and intra-rater reliabilities of TR-based physiotherapy assessment for musculoskeletal conditions were included. Two independent reviewers used the Quality Appraisal Tool for studies of diagnostic Reliability (QAREL) and the Quality Assessment of Diagnostic Accuracy Studies (QUADAS) tool to assess the methodological quality of reliability and validity studies respectively. Results A total of 898 hits were achieved, of which 11 articles based on inclusion criteria were reviewed. Nine studies explored the concurrent validity, inter- and intra-rater reliabilities, while two studies examined only the concurrent validity. Reviewed studies were moderate to good in methodological quality. The physiotherapy assessments such as pain, swelling, range of motion, muscle strength, balance, gait and functional assessment demonstrated good concurrent validity. However, the reported concurrent validity of lumbar spine posture, special orthopaedic tests, neurodynamic tests and scar assessments ranged from low to moderate. Conclusion TR-based physiotherapy assessment was technically feasible with overall good concurrent validity and excellent reliability, except for lumbar spine posture, orthopaedic special tests, neurodynamic testa and scar assessment.
Biljak, Vanja Radisic; Ozvald, Ivan; Radeljak, Andrea; Majdenic, Kresimir; Lasic, Branka; Siftar, Zoran; Lovrencic, Marijana Vucic; Flegar-Mestric, Zlata
2012-01-01
Introduction The aim of the study was to present a protocol for laboratory information system (LIS) and hospital information system (HIS) validation at the Institute of Clinical Chemistry and Laboratory Medicine of the Merkur University Hospital, Zagreb, Croatia. Materials and methods: Validity of data traceability was checked by entering all test requests for virtual patient into HIS/LIS and printing corresponding barcoded labels that provided laboratory analyzers with the information on requested tests. The original printouts of the test results from laboratory analyzer(s) were compared with the data obtained from LIS and entered into the provided template. Transfer of data from LIS to HIS was examined by requesting all tests in HIS and creating real data in a finding generated in LIS. Data obtained from LIS and HIS were entered into a corresponding template. The main outcome measure was the accuracy of transfer obtained from laboratory analyzers and results transferred from LIS and HIS expressed as percentage (%). Results: The accuracy of data transfer from laboratory analyzers to LIS was 99.5% and of that from LIS to HIS 100%. Conclusion: We presented our established validation protocol for laboratory information system and demonstrated that a system meets its intended purpose. PMID:22384522
NASA Astrophysics Data System (ADS)
Nir, A.; Doughty, C.; Tsang, C. F.
Validation methods which developed in the context of deterministic concepts of past generations often cannot be directly applied to environmental problems, which may be characterized by limited reproducibility of results and highly complex models. Instead, validation is interpreted here as a series of activities, including both theoretical and experimental tests, designed to enhance our confidence in the capability of a proposed model to describe some aspect of reality. We examine the validation process applied to a project concerned with heat and fluid transport in porous media, in which mathematical modeling, simulation, and results of field experiments are evaluated in order to determine the feasibility of a system for seasonal thermal energy storage in shallow unsaturated soils. Technical details of the field experiments are not included, but appear in previous publications. Validation activities are divided into three stages. The first stage, carried out prior to the field experiments, is concerned with modeling the relevant physical processes, optimization of the heat-exchanger configuration and the shape of the storage volume, and multi-year simulation. Subjects requiring further theoretical and experimental study are identified at this stage. The second stage encompasses the planning and evaluation of the initial field experiment. Simulations are made to determine the experimental time scale and optimal sensor locations. Soil thermal parameters and temperature boundary conditions are estimated using an inverse method. Then results of the experiment are compared with model predictions using different parameter values and modeling approximations. In the third stage, results of an experiment performed under different boundary conditions are compared to predictions made by the models developed in the second stage. Various aspects of this theoretical and experimental field study are described as examples of the verification and validation procedure. There is no attempt to validate a specific model, but several models of increasing complexity are compared with experimental results. The outcome is interpreted as a demonstration of the paradigm proposed by van der Heijde, 26 that different constituencies have different objectives for the validation process and therefore their acceptance criteria differ also.
A high-performance spatial database based approach for pathology imaging algorithm evaluation
Wang, Fusheng; Kong, Jun; Gao, Jingjing; Cooper, Lee A.D.; Kurc, Tahsin; Zhou, Zhengwen; Adler, David; Vergara-Niedermayr, Cristobal; Katigbak, Bryan; Brat, Daniel J.; Saltz, Joel H.
2013-01-01
Background: Algorithm evaluation provides a means to characterize variability across image analysis algorithms, validate algorithms by comparison with human annotations, combine results from multiple algorithms for performance improvement, and facilitate algorithm sensitivity studies. The sizes of images and image analysis results in pathology image analysis pose significant challenges in algorithm evaluation. We present an efficient parallel spatial database approach to model, normalize, manage, and query large volumes of analytical image result data. This provides an efficient platform for algorithm evaluation. Our experiments with a set of brain tumor images demonstrate the application, scalability, and effectiveness of the platform. Context: The paper describes an approach and platform for evaluation of pathology image analysis algorithms. The platform facilitates algorithm evaluation through a high-performance database built on the Pathology Analytic Imaging Standards (PAIS) data model. Aims: (1) Develop a framework to support algorithm evaluation by modeling and managing analytical results and human annotations from pathology images; (2) Create a robust data normalization tool for converting, validating, and fixing spatial data from algorithm or human annotations; (3) Develop a set of queries to support data sampling and result comparisons; (4) Achieve high performance computation capacity via a parallel data management infrastructure, parallel data loading and spatial indexing optimizations in this infrastructure. Materials and Methods: We have considered two scenarios for algorithm evaluation: (1) algorithm comparison where multiple result sets from different methods are compared and consolidated; and (2) algorithm validation where algorithm results are compared with human annotations. We have developed a spatial normalization toolkit to validate and normalize spatial boundaries produced by image analysis algorithms or human annotations. The validated data were formatted based on the PAIS data model and loaded into a spatial database. To support efficient data loading, we have implemented a parallel data loading tool that takes advantage of multi-core CPUs to accelerate data injection. The spatial database manages both geometric shapes and image features or classifications, and enables spatial sampling, result comparison, and result aggregation through expressive structured query language (SQL) queries with spatial extensions. To provide scalable and efficient query support, we have employed a shared nothing parallel database architecture, which distributes data homogenously across multiple database partitions to take advantage of parallel computation power and implements spatial indexing to achieve high I/O throughput. Results: Our work proposes a high performance, parallel spatial database platform for algorithm validation and comparison. This platform was evaluated by storing, managing, and comparing analysis results from a set of brain tumor whole slide images. The tools we develop are open source and available to download. Conclusions: Pathology image algorithm validation and comparison are essential to iterative algorithm development and refinement. One critical component is the support for queries involving spatial predicates and comparisons. In our work, we develop an efficient data model and parallel database approach to model, normalize, manage and query large volumes of analytical image result data. Our experiments demonstrate that the data partitioning strategy and the grid-based indexing result in good data distribution across database nodes and reduce I/O overhead in spatial join queries through parallel retrieval of relevant data and quick subsetting of datasets. The set of tools in the framework provide a full pipeline to normalize, load, manage and query analytical results for algorithm evaluation. PMID:23599905
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ortiz-Ramŕez, Pablo, E-mail: rapeitor@ug.uchile.cl; Larroquette, Philippe; Camilla, S.
The intrinsic spatial efficiency method is a new absolute method to determine the efficiency of a gamma spectroscopy system for any extended source. In the original work the method was experimentally demonstrated and validated for homogeneous cylindrical sources containing {sup 137}Cs, whose sizes varied over a small range (29.5 mm radius and 15.0 to 25.9 mm height). In this work we present an extension of the validation over a wide range of sizes. The dimensions of the cylindrical sources vary between 10 to 40 mm height and 8 to 30 mm radius. The cylindrical sources were prepared using the referencemore » material IAEA-372, which had a specific activity of 11320 Bq/kg at july 2006. The obtained results were better for the sources with 29 mm radius showing relative bias lesser than 5% and for the sources with 10 mm height showing relative bias lesser than 6%. In comparison with the obtained results in the work where we present the method, the majority of these results show an excellent agreement.« less
Hamilton, Roy H; Stark, Marianna; Coslett, H Branch
2010-01-01
Debate continues regarding the mechanisms underlying covert shifts of visual attention. We examined the relationship between target eccentricity and the speed of covert shifts of attention in normal subjects and patients with brain lesions using a cued-response task in which cues and targets were presented at 2 degrees or 8 degrees lateral to the fixation point. Normal subjects were slower on invalid trials in the 8 degrees as compared to 2 degrees condition. Patients with right-hemisphere stroke with neglect were slower in their responses to left-sided invalid targets compared to valid targets, and demonstrated a significant increase in the effect of target validity as a function of target eccentricity. Additional data from one neglect patient (JM) demonstrated an exaggerated validity x eccentricity x side interaction for contralesional targets on a cued reaction time task with a central (arrow) cue. We frame these results in the context of a continuous 'moving spotlight' model of attention, and also consider the potential role of spatial saliency maps. By either account, we argue that neglect is characterized by an eccentricity-dependent deficit in the allocation of attention.
The reliability and validity of the Maryland Assessment of Recovery in Serious Mental Illness Scale.
Drapalski, Amy L; Medoff, Deborah; Dixon, Lisa; Bellack, Alan
2016-05-30
The current study aims to further evaluate the psychometric properties of the Maryland Assessment of Recovery in Serious Mental Illness (MARS), a relatively new instrument designed to assess personal recovery status in individuals with serious mental illness. Two hundred and fifty individuals with serious mental illness receiving outpatient mental health treatment completed a baseline assessment which included the MARS and measures to assess recovery-related constructs, clinical outcomes, and social and community functioning. The MARS demonstrated excellent internal consistency and test-retest reliability. Good construct validity was evidenced by strong positive relationships between the MARS and recovery-related constructs (e.g. hope, empowerment, self-efficacy, and personal agency) and a strong negative relationship with self-stigma. Divergent validity was demonstrated by weaker relationships with cognitive and social functioning. The confirmatory factor analysis did not confirm the unitary factor structure found in previous research. Given the equivocal result of the CFA, additional exploratory work is needed to determine if a more complex factor structure is present. This study provides addition support for the psychometric soundness of the MARS and subsequently, its potential use as a measure of personal recovery status in people with serious mental illness. Published by Elsevier Ireland Ltd.
Sridharan, Sarup S; Burrowes, Lindsay M; Bouwmeester, J Christopher; Wang, Jiun-Jr; Shrive, Nigel G; Tyberg, John V
2012-05-01
Our "reservoir-wave approach" to arterial hemodynamics holds that measured arterial pressure should be considered to be the sum of a volume-related pressure (i.e., reservoir pressure, P(reservoir)) and a wave-related pressure (P(excess)). Because some have questioned whether P(reservoir) (and, by extension, P(excess)) is a real component of measured physiological pressure, it was important to demonstrate that P(reservoir) is implicit in Westerhof's classical electrical and hydraulic models of the 3-element Windkessel. To test the validity of our P(reservoir) determinations, we studied a freeware simulation of the electrical model and a benchtop recreation of the hydraulic model, respectively, measuring the voltage and the pressure distal to the proximal resistance. These measurements were then compared with P(reservoir), as calculated from physiological data. Thus, the first objective of this study was to demonstrate that respective voltage and pressure changes could be measured that were similar to calculated physiological values of P(reservoir). The second objective was to confirm previous predictions with respect to the specific effects of systematically altering proximal resistance, distal resistance, and capacitance. The results of this study validate P(reservoir) and, thus, the reservoir-wave approach.
Testability of evolutionary game dynamics based on experimental economics data
NASA Astrophysics Data System (ADS)
Wang, Yijia; Chen, Xiaojie; Wang, Zhijian
In order to better understand the dynamic processes of a real game system, we need an appropriate dynamics model, so to evaluate the validity of a model is not a trivial task. Here, we demonstrate an approach, considering the dynamical macroscope patterns of angular momentum and speed as the measurement variables, to evaluate the validity of various dynamics models. Using the data in real time Rock-Paper-Scissors (RPS) games experiments, we obtain the experimental dynamic patterns, and then derive the related theoretical dynamic patterns from a series of typical dynamics models respectively. By testing the goodness-of-fit between the experimental and theoretical patterns, the validity of the models can be evaluated. One of the results in our study case is that, among all the nonparametric models tested, the best-known Replicator dynamics model performs almost worst, while the Projection dynamics model performs best. Besides providing new empirical macroscope patterns of social dynamics, we demonstrate that the approach can be an effective and rigorous tool to test game dynamics models. Fundamental Research Funds for the Central Universities (SSEYI2014Z) and the National Natural Science Foundation of China (Grants No. 61503062).
Nursing Care Interpersonal Relationship Questionnaire: elaboration and validation 1
Borges, José Wicto Pereira; Moreira, Thereza Maria Magalhães; de Andrade, Dalton Franscisco
2018-01-01
ABSTRACT Objective: to elaborate an instrument for the measurement of the interpersonal relationship in nursing care through the Item Response Theory, and the validation thereof. Method: methodological study, which followed the three poles of psychometry: theoretical, empirical and analytical. The Nursing Care Interpersonal Relationship Questionnaire was developed in light of the Imogene King’s Interpersonal Conceptual Model and the psychometric properties were studied through the Item Response Theory in a sample of 950 patients attended in Primary, Secondary and Tertiary Health Care. Results: the final instrument consisted of 31 items, with Cronbach’s alpha of 0.90 and McDonald’s Omega of 0.92. The parameters of the Item Response Theory demonstrated high discrimination in 28 items, being developed a five-level interpretive scale. At the first level, the communication process begins, gaining a wealth of interaction. Subsequent levels demonstrate qualitatively the points of effectiveness of the interpersonal relationship with the involvement of behaviors related to the concepts of transaction and interaction, followed by the concept of role. Conclusion: the instrument was created and proved to be consistent to measure interpersonal relationship in nursing care, as it presented adequate reliability and validity parameters. PMID:29319743
Basis Selection for Wavelet Regression
NASA Technical Reports Server (NTRS)
Wheeler, Kevin R.; Lau, Sonie (Technical Monitor)
1998-01-01
A wavelet basis selection procedure is presented for wavelet regression. Both the basis and the threshold are selected using cross-validation. The method includes the capability of incorporating prior knowledge on the smoothness (or shape of the basis functions) into the basis selection procedure. The results of the method are demonstrated on sampled functions widely used in the wavelet regression literature. The results of the method are contrasted with other published methods.
A validation framework for brain tumor segmentation.
Archip, Neculai; Jolesz, Ferenc A; Warfield, Simon K
2007-10-01
We introduce a validation framework for the segmentation of brain tumors from magnetic resonance (MR) images. A novel unsupervised semiautomatic brain tumor segmentation algorithm is also presented. The proposed framework consists of 1) T1-weighted MR images of patients with brain tumors, 2) segmentation of brain tumors performed by four independent experts, 3) segmentation of brain tumors generated by a semiautomatic algorithm, and 4) a software tool that estimates the performance of segmentation algorithms. We demonstrate the validation of the novel segmentation algorithm within the proposed framework. We show its performance and compare it with existent segmentation. The image datasets and software are available at http://www.brain-tumor-repository.org/. We present an Internet resource that provides access to MR brain tumor image data and segmentation that can be openly used by the research community. Its purpose is to encourage the development and evaluation of segmentation methods by providing raw test and image data, human expert segmentation results, and methods for comparing segmentation results.
A Confirmatory Factor Analysis of the Structure of Abbreviated Math Anxiety Scale
Farrokhi, Farahman
2011-01-01
Objective The aim of this study is to explore the confirmatory factor analysis results of the Persian adaptation of Abbreviated Math Anxiety Scale (AMAS), proposed by Hopko, Mahadevan, Bare & Hunt. Method The validity and reliability assessments of the scale were performed on 298 college students chosen randomly from Tabriz University in Iran. The confirmatory factor analysis (CFA) was carried out to determine the factor structures of the Persian version of AMAS. Results As expected, the two-factor solution provided a better fit to the data than a single factor. Moreover, multi-group analyses showed that this two-factor structure was invariant across sex. Hence, AMAS provides an equally valid measure for use among college students. Conclusions Brief AMAS demonstrates adequate reliability and validity. The AMAS scores can be used to compare symptoms of math anxiety between male and female students. The study both expands and adds support to the existing body of math anxiety literature. PMID:22952521
Katz, Andrea C; Hee, Danelle; Hooker, Christine I; Shankman, Stewart A
2017-10-03
In Section III of the DSM-5, the American Psychiatric Association (APA) proposes a pathological personality trait model of personality disorders. The recommended assessment instrument is the Personality Inventory for the DSM-5 (PID-5), an empirically derived scale that assesses personality pathology along five domains and 25 facets. Although the PID-5 demonstrates strong convergent validity with other personality measures, no study has examined whether it identifies traits that run in families, another important step toward validating the DSM-5's dimensional model. Using a family study method, we investigated familial associations of PID-5 domain and facet scores in 195 families, examining associations between parents and offspring and across siblings. The Psychoticism, Antagonism, and Detachment domains showed significant familial aggregation, as did facets of Negative Affect and Disinhibition. Results are discussed in the context of personality pathology and family study methodology. The results also help validate the PID-5, given the familial nature of personality traits.
Lee, Pei-Ling; Chen, Bo-Chia; Gollavelli, Ganesh; Shen, Sin-Yu; Yin, Yu-Sheng; Lei, Shiu-Ling; Jhang, Cian-Ling; Lee, Woan-Ruoh; Ling, Yong-Chien
2014-07-30
Zinc oxide nanoparticles (ZnO NPs) exhibit novel physiochemical properties and have found increasing use in sunscreen products and cosmetics. The potential toxicity is of increasing concern due to their close association with human skin. A time-of-flight secondary ion mass spectrometry (TOF-SIMS) and confocal laser scanning microscopy (CLSM) imaging method was developed and validated for rapid and sensitive cytotoxicity study of ZnO NPs using human skin equivalent HaCaT cells as a model system. Assorted material, chemical, and toxicological analysis methods were used to confirm their shape, size, crystalline structure, and aggregation properties as well as dissolution behavior and effect on HaCaT cell viability in the presence of various concentrations of ZnO NPs in aqueous media. Comparative and correlative analyses of aforementioned results with TOF-SIMS and CLSM imaging results exhibit reasonable and acceptable outcome. A marked drop in survival rate was observed with 50μg/ml ZnO NPs. The CLSM images reveal the absorption and localization of ZnO NPs in cytoplasm and nuclei. The TOF-SIMS images demonstrate elevated levels of intracellular ZnO concentration and associated Zn concentration-dependent (40)Ca/(39)K ratio, presumably caused by the dissolution behavior of ZnO NPs. Additional validation by using stable isotope-labeled (68)ZnO NPs as tracers under the same experimental conditions yields similar cytotoxicity effect. The imaging results demonstrate spatially-resolved cytotoxicity relationship between intracellular ZnO NPs, (40)Ca/(39)K ratio, phosphocholine fragments, and glutathione fragments. The trend of change in TOF-SIMS spectra and images of ZnO NPs treated HaCaT cells demonstrate the possible mode of actions by ZnO NP involves cell membrane disruption, cytotoxic response, and ROS mediated apoptosis. Copyright © 2014 Elsevier B.V. All rights reserved.
Abramyan, Tigran M.; Hyde-Volpe, David L.; Stuart, Steven J.; Latour, Robert A.
2017-01-01
The use of standard molecular dynamics simulation methods to predict the interactions of a protein with a material surface have the inherent limitations of lacking the ability to determine the most likely conformations and orientations of the adsorbed protein on the surface and to determine the level of convergence attained by the simulation. In addition, standard mixing rules are typically applied to combine the nonbonded force field parameters of the solution and solid phases the system to represent interfacial behavior without validation. As a means to circumvent these problems, the authors demonstrate the application of an efficient advanced sampling method (TIGER2A) for the simulation of the adsorption of hen egg-white lysozyme on a crystalline (110) high-density polyethylene surface plane. Simulations are conducted to generate a Boltzmann-weighted ensemble of sampled states using force field parameters that were validated to represent interfacial behavior for this system. The resulting ensembles of sampled states were then analyzed using an in-house-developed cluster analysis method to predict the most probable orientations and conformations of the protein on the surface based on the amount of sampling performed, from which free energy differences between the adsorbed states were able to be calculated. In addition, by conducting two independent sets of TIGER2A simulations combined with cluster analyses, the authors demonstrate a method to estimate the degree of convergence achieved for a given amount of sampling. The results from these simulations demonstrate that these methods enable the most probable orientations and conformations of an adsorbed protein to be predicted and that the use of our validated interfacial force field parameter set provides closer agreement to available experimental results compared to using standard CHARMM force field parameterization to represent molecular behavior at the interface. PMID:28514864
2012-01-01
Background Patients in sub-Saharan Africa commonly experience pain, which often is un-assessed and undertreated. One hindrance to routine pain assessment in these settings is the lack of a single-item pain rating scale validated for the particular context. The goal of this study was to examine the face validity and cultural acceptability of two single-item pain scales, the Numerical Rating Scale (NRS) and the Faces Pain Scale-Revised (FPS-R), in a population of patients on the medical, surgical, and pediatric wards of Moi Teaching and Referral Hospital in Kenya. Methods Swahili versions of the NRS and FPS-R were developed by standard translation and back-translation. Cognitive interviews were performed with 15 patients at Moi Teaching and Referral Hospital in Eldoret, Kenya. Interview transcripts were analyzed on a question-by-question basis to identify major themes revealed through the cognitive interviewing process and to uncover any significant problems participants encountered with understanding and using the pain scales. Results Cognitive interview analysis demonstrated that participants had good comprehension of both the NRS and the FPS-R and showed rational decision-making processes in choosing their responses. Participants felt that both scales were easy to use. The FPS-R was preferred almost unanimously to the NRS. Conclusions The face validity and acceptability of the Swahili versions of the NRS and FPS-R has been demonstrated for use in Kenyan patients. The broader application of these scales should be evaluated and may benefit patients who currently suffer from pain. PMID:22512923
Mental State Assessment and Validation Using Personalized Physiological Biometrics
Patel, Aashish N.; Howard, Michael D.; Roach, Shane M.; Jones, Aaron P.; Bryant, Natalie B.; Robinson, Charles S. H.; Clark, Vincent P.; Pilly, Praveen K.
2018-01-01
Mental state monitoring is a critical component of current and future human-machine interfaces, including semi-autonomous driving and flying, air traffic control, decision aids, training systems, and will soon be integrated into ubiquitous products like cell phones and laptops. Current mental state assessment approaches supply quantitative measures, but their only frame of reference is generic population-level ranges. What is needed are physiological biometrics that are validated in the context of task performance of individuals. Using curated intake experiments, we are able to generate personalized models of three key biometrics as useful indicators of mental state; namely, mental fatigue, stress, and attention. We demonstrate improvements to existing approaches through the introduction of new features. Furthermore, addressing the current limitations in assessing the efficacy of biometrics for individual subjects, we propose and employ a multi-level validation scheme for the biometric models by means of k-fold cross-validation for discrete classification and regression testing for continuous prediction. The paper not only provides a unified pipeline for extracting a comprehensive mental state evaluation from a parsimonious set of sensors (only EEG and ECG), but also demonstrates the use of validation techniques in the absence of empirical data. Furthermore, as an example of the application of these models to novel situations, we evaluate the significance of correlations of personalized biometrics to the dynamic fluctuations of accuracy and reaction time on an unrelated threat detection task using a permutation test. Our results provide a path toward integrating biometrics into augmented human-machine interfaces in a judicious way that can help to maximize task performance.
Mental State Assessment and Validation Using Personalized Physiological Biometrics.
Patel, Aashish N; Howard, Michael D; Roach, Shane M; Jones, Aaron P; Bryant, Natalie B; Robinson, Charles S H; Clark, Vincent P; Pilly, Praveen K
2018-01-01
Mental state monitoring is a critical component of current and future human-machine interfaces, including semi-autonomous driving and flying, air traffic control, decision aids, training systems, and will soon be integrated into ubiquitous products like cell phones and laptops. Current mental state assessment approaches supply quantitative measures, but their only frame of reference is generic population-level ranges. What is needed are physiological biometrics that are validated in the context of task performance of individuals. Using curated intake experiments, we are able to generate personalized models of three key biometrics as useful indicators of mental state; namely, mental fatigue, stress, and attention. We demonstrate improvements to existing approaches through the introduction of new features. Furthermore, addressing the current limitations in assessing the efficacy of biometrics for individual subjects, we propose and employ a multi-level validation scheme for the biometric models by means of k -fold cross-validation for discrete classification and regression testing for continuous prediction. The paper not only provides a unified pipeline for extracting a comprehensive mental state evaluation from a parsimonious set of sensors (only EEG and ECG), but also demonstrates the use of validation techniques in the absence of empirical data. Furthermore, as an example of the application of these models to novel situations, we evaluate the significance of correlations of personalized biometrics to the dynamic fluctuations of accuracy and reaction time on an unrelated threat detection task using a permutation test. Our results provide a path toward integrating biometrics into augmented human-machine interfaces in a judicious way that can help to maximize task performance.
Karalunas, Sarah L.; Fair, Damien; Musser, Erica D.; Aykes, Kamari; Iyer, Swathi P.; Nigg, Joel T.
2014-01-01
Importance Psychiatric nosology is limited by behavioral and biological heterogeneity within existing disorder categories. The imprecise nature of current nosological distinctions limits both mechanistic understanding and clinical prediction. Here, we demonstrate an approach consistent with the NIMH Research Domain Criteria (RDoC) initiative to identifying superior, neurobiologically-valid subgroups with better predictive capacity than existing psychiatric categories for childhood Attention-Deficit Hyperactivity Disorder (ADHD). Objective Refine subtyping of childhood ADHD by using biologically-based behavioral dimensions (i.e. temperament), novel classification algorithms, and multiple external validators. In doing so, we demonstrate how refined nosology is capable of improving on current predictive capacity of long-term outcomes relative to current DSM-based nosology. Design, Setting, Participants 437 clinically well-characterized, community-recruited children with and without ADHD participated in an on-going longitudinal study. Baseline data were used to classify children into subgroups based on temperament dimensions and to examine external validators including physiological and MRI measures. One-year longitudinal follow-up data are reported for a subgroup of the ADHD sample to address stability and clinical prediction. Main Outcome Measures Parent/guardian ratings of children on a measure of temperament were used as input features in novel community detection analyses to identify subgroups within the sample. Groups were validated using three widely-accepted external validators: peripheral physiology (cardiac measures of respiratory sinus arrhythmia and pre-ejection period), central nervous system functioning (via resting-state functional connectivity MRI), and clinical outcomes (at one-year longitudinal follow-up). Results The community detection algorithm suggested three novel types of ADHD, labeled as “Mild” (normative emotion regulation); “Surgent” (extreme levels of positive approach-motivation); and “Irritable” (extreme levels of negative emotionality, anger, and poor soothability). Types were independent of existing clinical demarcations, including DSM-5 presentations or symptom severity. These types showed stability over time and were distinguished by unique patterns of cardiac physiological response, resting-state functional brain connectivity, and clinical outcome one year later. Conclusions and Relevance Results suggest that a biologically-informed temperament-based typology, developed with a discovery-based community detection algorithm, provided a superior description of heterogeneity in the ADHD population than any current clinical nosology. This demonstration sets the stage for more aggressive attempts at a tractable, biologically-based nosology. PMID:25006969
Torfeh, Tarraf; Hammoud, Rabih; McGarry, Maeve; Al-Hammadi, Noora; Perkins, Gregory
2015-09-01
To develop and validate a large field of view phantom and quality assurance software tool for the assessment and characterization of geometric distortion in MRI scanners commissioned for radiation therapy planning. A purpose built phantom was developed consisting of 357 rods (6mm in diameter) of polymethyl-methacrylat separated by 20mm intervals, providing a three dimensional array of control points at known spatial locations covering a large field of view up to a diameter of 420mm. An in-house software module was developed to allow automatic geometric distortion assessment. This software module was validated against a virtual dataset of the phantom that reproduced the exact geometry of the physical phantom, but with known translational and rotational displacements and warping. For validation experiments, clinical MRI sequences were acquired with and without the application of a commercial 3D distortion correction algorithm (Gradwarp™). The software module was used to characterize and assess system-related geometric distortion in the sequences relative to a benchmark CT dataset, and the efficacy of the vendor geometric distortion correction algorithms (GDC) was also assessed. Results issued from the validation of the software against virtual images demonstrate the algorithm's ability to accurately calculate geometric distortion with sub-pixel precision by the extraction of rods and quantization of displacements. Geometric distortion was assessed for the typical sequences used in radiotherapy applications and over a clinically relevant 420mm field of view (FOV). As expected and towards the edges of the field of view (FOV), distortion increased with increasing FOV. For all assessed sequences, the vendor GDC was able to reduce the mean distortion to below 1mm over a field of view of 5, 10, 15 and 20cm radius respectively. Results issued from the application of the developed phantoms and algorithms demonstrate a high level of precision. The results indicate that this platform represents an important, robust and objective tool to perform routine quality assurance of MR-guided therapeutic applications, where spatial accuracy is paramount. Copyright © 2015 Elsevier Inc. All rights reserved.
Roberson, David W; Kentala, Erna; Forbes, Peter
2005-12-01
The goals of this project were 1) to develop and validate an objective instrument to measure surgical performance at tonsillectomy, 2) to assess its interobserver and interobservation reliability and construct validity, and 3) to select those items with best reliability and most independent information to design a simplified form suitable for routine use in otolaryngology surgical evaluation. Prospective, observational data collection for an educational quality improvement project. The evaluation instrument was based on previous instruments developed in general surgery with input from attending otolaryngologic surgeons and experts in medical education. It was pilot tested and subjected to iterative improvements. After the instrument was finalized, a total of 55 tonsillectomies were observed and scored during academic year 2002 to 2003: 45 cases by residents at different points during their rotation, 5 by fellows, and 5 by faculty. Results were assessed for interobserver reliability, interobservation reliability, and construct validity. Factor analysis was used to identify items with independent information. Interobserver and interobservation reliability was high. On technical items, faculty substantially outperformed fellows, who in turn outperformed residents (P < .0001 for both comparisons). On the "global" scale (overall assessment), residents improved an average of 1 full point (on a 5 point scale) during a 3 month rotation (P = .01). In the subscale of "patient care," results were less clear cut: fellows outperformed residents, who in turn outperformed faculty, but only the fellows to faculty comparison was statistically significant (P = .04), and residents did not clearly improve over time (P = .36). Factor analysis demonstrated that technical items and patient care items factor separately and thus represent separate skill domains in surgery. It is possible to objectively measure surgical skill at tonsillectomy with high reliability and good construct validity. Factor analysis demonstrated that patient care is a distinct domain in surgical skill. Although the interobserver reliability for some patient care items reached statistical significance, it was not high enough for "high stakes testing" purposes. Using reliability and factor analysis results, we propose a simplified instrument for use in evaluating trainees in otolaryngologic surgery.
Partridge, Roland W; Brown, Fraser S; Brennan, Paul M; Hennessey, Iain A M; Hughes, Mark A
2016-02-01
To assess the potential of the LEAP™ infrared motion tracking device to map laparoscopic instrument movement in a simulated environment. Simulator training is optimized when augmented by objective performance feedback. We explore the potential LEAP has to provide this in a way compatible with affordable take-home simulators. LEAP and the previously validated InsTrac visual tracking tool mapped expert and novice performances of a standardized simulated laparoscopic task. Ability to distinguish between the 2 groups (construct validity) and correlation between techniques (concurrent validity) were the primary outcome measures. Forty-three expert and 38 novice performances demonstrated significant differences in LEAP-derived metrics for instrument path distance (P < .001), speed (P = .002), acceleration (P < .001), motion smoothness (P < .001), and distance between the instruments (P = .019). Only instrument path distance demonstrated a correlation between LEAP and InsTrac tracking methods (novices: r = .663, P < .001; experts: r = .536, P < .001). Consistency of LEAP tracking was poor (average % time hands not tracked: 31.9%). The LEAP motion device is able to track the movement of hands using instruments in a laparoscopic box simulator. Construct validity is demonstrated by its ability to distinguish novice from expert performances. Only time and instrument path distance demonstrated concurrent validity with an existing tracking method however. A number of limitations to the tracking method used by LEAP have been identified. These need to be addressed before it can be considered an alternative to visual tracking for the delivery of objective performance metrics in take-home laparoscopic simulators. © The Author(s) 2015.
Lobo, Daniel; Morokuma, Junji; Levin, Michael
2016-09-01
Automated computational methods can infer dynamic regulatory network models directly from temporal and spatial experimental data, such as genetic perturbations and their resultant morphologies. Recently, a computational method was able to reverse-engineer the first mechanistic model of planarian regeneration that can recapitulate the main anterior-posterior patterning experiments published in the literature. Validating this comprehensive regulatory model via novel experiments that had not yet been performed would add in our understanding of the remarkable regeneration capacity of planarian worms and demonstrate the power of this automated methodology. Using the Michigan Molecular Interactions and STRING databases and the MoCha software tool, we characterized as hnf4 an unknown regulatory gene predicted to exist by the reverse-engineered dynamic model of planarian regeneration. Then, we used the dynamic model to predict the morphological outcomes under different single and multiple knock-downs (RNA interference) of hnf4 and its predicted gene pathway interactors β-catenin and hh Interestingly, the model predicted that RNAi of hnf4 would rescue the abnormal regenerated phenotype (tailless) of RNAi of hh in amputated trunk fragments. Finally, we validated these predictions in vivo by performing the same surgical and genetic experiments with planarian worms, obtaining the same phenotypic outcomes predicted by the reverse-engineered model. These results suggest that hnf4 is a regulatory gene in planarian regeneration, validate the computational predictions of the reverse-engineered dynamic model, and demonstrate the automated methodology for the discovery of novel genes, pathways and experimental phenotypes. michael.levin@tufts.edu. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Kaymak, Tugrul; Türker, Levent; Tulay, Hüseyin; Stroka, Joerg
2018-04-27
Background : Pekmez and pestil are traditional Turkish foods made from concentrated grapejuice, which can be contaminated with mycotoxins such as aflatoxins and ochratoxin A (OTA). Objective : To carry out a single-laboratory validation of a method to simultaneously determine aflatoxins B 1 , B₂, G 1 , and G₂ and ochratoxin A in pekmez and pestil. Methods : The homogenized sample is extracted with methanol-water (80 + 20) using a high-speed blender. The (sample) extract is filtered, diluted with phosphate-buffered saline solution, and applied to a multi-immunoaffinity column (AFLAOCHRA PREP®). Aflatoxins and ochratoxin A are removed with (neat) methanol and then directly analyzed by reversed-phase LC with fluorescence detection using post-column bromination (Kobra cell®). Results : Test portions of blank pekmez and pestil were spiked with a mixture of aflatoxins and ochratoxin A to give levels ranging from 2.6 to 10.4 μg/kg and 1.0-4.0 μg/kg, respectively. Recoveries for total aflatoxins and ochratoxin A ranged from 84 to 106% and 80-97%, respectively, for spiked samples. Based on results for spiked pekmez and pestil (30 replicates each at three levels), the repeatability RSD ranged from 1.6 to 12% and 2.7-11% for total aflatoxins and ochratoxin A, respectively. Conclusions : The method performance in terms of recovery, repeatability, and detection limits has been demonstrated to be suitable for use as an Official Method. Highlights : First immunoaffinity column method validated for simultaneous analysis of aflatoxins and ochratoxin A in pekmez and pestil. Suitability for use for official purposes in Turkey, demonstrated by single-laboratory validation. Co-occurrence of aflatoxins and OTA in mulberry and carob pekmez reported for the first time.
NASA Astrophysics Data System (ADS)
Sitnikov, Nikolay; Borisov, Yuriy; Akmulin, Dimitry; Chekulaev, Igor; Sitnikova, Vera; Ulanovsky, Alexey; Sokolov, Alexey
The results of development of instruments based on heterophase chemiluminescence for measurements of space distribution of ozone and nitrogen oxides concentrations on board of research aircrafts and unmanned aerial vehicles carried out in Central Aerological Observatory are presented. Some results of atmospheric investigations on board of research aircrafts M55 “Geophysica” (Russia) and “Falcon” (Germany) carried out using developed instruments in frame of international projects are demonstrated. Small and low power instruments based on chemiluminescent principle for UAV are developed. The results of measurements on board of UAV are shown. The development can be used for satellite data validation, as well as operative environmental monitoring of contaminated areas in particular, chemical plants, natural and industrial disasters territories, areas and facilities for space purposes etc.
Reducing tensor magnetic gradiometer data for unexploded ordnance detection
Bracken, Robert E.; Brown, Philip J.
2005-01-01
We performed a survey to demonstrate the effectiveness of a prototype tensor magnetic gradiometer system (TMGS) for detection of buried unexploded ordnance (UXO). In order to achieve a useful result, we designed a data-reduction procedure that resulted in a realistic magnetic gradient tensor and devised a simple way of viewing complicated tensor data, not only to assess the validity of the final resulting tensor, but also to preview the data at interim stages of processing. The final processed map of the surveyed area clearly shows a sharp anomaly that peaks almost directly over the target UXO. This map agrees well with a modeled map derived from dipolar sources near the known target locations. From this agreement, it can be deduced that the reduction process is valid, making the prototype TMGS a foundation for development of future systems and processes.
Heritage, Brody; Gilbert, Jessica M.; Roberts, Lynne D.
2016-01-01
Job embeddedness is a construct that describes the manner in which employees can be enmeshed in their jobs, reducing their turnover intentions. Recent questions regarding the properties of quantitative job embeddedness measures, and their predictive utility, have been raised. Our study compared two competing reflective measures of job embeddedness, examining their convergent, criterion, and incremental validity, as a means of addressing these questions. Cross-sectional quantitative data from 246 Australian university employees (146 academic; 100 professional) was gathered. Our findings indicated that the two compared measures of job embeddedness were convergent when total scale scores were examined. Additionally, job embeddedness was capable of demonstrating criterion and incremental validity, predicting unique variance in turnover intention. However, this finding was not readily apparent with one of the compared job embeddedness measures, which demonstrated comparatively weaker evidence of validity. We discuss the theoretical and applied implications of these findings, noting that job embeddedness has a complementary place among established determinants of turnover intention. PMID:27199817
Price, Erika; Ottati, Victor; Wilson, Chase; Kim, Soyeon
2015-11-01
The present research conceptualizes open-minded cognition as a cognitive style that influences how individuals select and process information. An open-minded cognitive style is marked by willingness to consider a variety of intellectual perspectives, values, opinions, or beliefs-even those that contradict the individual's opinion. An individual's level of cognitive openness is expected to vary across domains (such as politics and religion). Four studies develop and validate a novel measure of open-minded cognition, as well as two domain-specific measures of religious and political open-minded cognition. Exploratory and confirmatory factor analysis (controlling for acquiescence bias) are used to develop the scales in Studies 1 to 3. Study 4 demonstrates that these scales possess convergent and discriminant validity. Study 5 demonstrates the scale's unique predictive validity using the outcome of Empathic Concern (Davis, 1980). Study 6 demonstrates the scale's unique predictive validity using the outcomes of warmth toward racial, religious, and sexual minorities. © 2015 by the Society for Personality and Social Psychology, Inc.
Clinical validation of the Tempus xO assay
Beaubier, Nike; Tell, Robert; Huether, Robert; Bontrager, Martin; Bush, Stephen; Parsons, Jerod; Shah, Kaanan; Baker, Tim; Selkov, Gene; Taxter, Tim; Thomas, Amber; Bettis, Sam; Khan, Aly; Lau, Denise; Lee, Christina; Barber, Matthew; Cieslik, Marcin; Frankenberger, Casey; Franzen, Amy; Weiner, Ali; Palmer, Gary; Lonigro, Robert; Robinson, Dan; Wu, Yi-Mi; Cao, Xuhong; Lefkofsky, Eric; Chinnaiyan, Arul; White, Kevin P.
2018-01-01
We have developed a clinically validated NGS assay that includes tumor, germline and RNA sequencing. We apply this assay to clinical specimens and cell lines, and we demonstrate a clinical sensitivity of 98.4% and positive predictive value of 100% for the clinically actionable variants measured by the assay. We also demonstrate highly accurate copy number measurements and gene rearrangement identification. PMID:29899824
Validating Innovative Renewable Energy Technologies: ESTCP Demonstrations at Two DoD Facilities
2012-05-01
AND SUBTITLE Validating Innovative Renewable Energy Technologies: ESTCP Demonstrations at Two DoD Facilities 5a. CONTRACT NUMBER 5b. GRANT NUMBER...Southern Research Institute,Advanced Energy Department,2000 Ninth Avenue South,Birmingham,AL,35205-5305 8. PERFORMING ORGANIZATION REPORT NUMBER...AVAILABILITY STATEMENT Approved for public release; distribution unlimited 13. SUPPLEMENTARY NOTES Presented at the NDIA Environment, Energy Security
Quality of prenatal care questionnaire: instrument development and testing.
Heaman, Maureen I; Sword, Wendy A; Akhtar-Danesh, Noori; Bradford, Amanda; Tough, Suzanne; Janssen, Patricia A; Young, David C; Kingston, Dawn A; Hutton, Eileen K; Helewa, Michael E
2014-06-03
Utilization indices exist to measure quantity of prenatal care, but currently there is no published instrument to assess quality of prenatal care. The purpose of this study was to develop and test a new instrument, the Quality of Prenatal Care Questionnaire (QPCQ). Data for this instrument development study were collected in five Canadian cities. Items for the QPCQ were generated through interviews with 40 pregnant women and 40 health care providers and a review of prenatal care guidelines, followed by assessment of content validity and rating of importance of items. The preliminary 100-item QPCQ was administered to 422 postpartum women to conduct item reduction using exploratory factor analysis. The final 46-item version of the QPCQ was then administered to another 422 postpartum women to establish its construct validity, and internal consistency and test-retest reliability. Exploratory factor analysis reduced the QPCQ to 46 items, factored into 6 subscales, which subsequently were validated by confirmatory factor analysis. Construct validity was also demonstrated using a hypothesis testing approach; there was a significant positive association between women's ratings of the quality of prenatal care and their satisfaction with care (r = 0.81). Convergent validity was demonstrated by a significant positive correlation (r = 0.63) between the "Support and Respect" subscale of the QPCQ and the "Respectfulness/Emotional Support" subscale of the Prenatal Interpersonal Processes of Care instrument. The overall QPCQ had acceptable internal consistency reliability (Cronbach's alpha = 0.96), as did each of the subscales. The test-retest reliability result (Intra-class correlation coefficient = 0.88) indicated stability of the instrument on repeat administration approximately one week later. Temporal stability testing confirmed that women's ratings of their quality of prenatal care did not change as a result of giving birth or between the early postpartum period and 4 to 6 weeks postpartum. The QPCQ is a valid and reliable instrument that will be useful in future research as an outcome measure to compare quality of care across geographic regions, populations, and service delivery models, and to assess the relationship between quality of care and maternal and infant health outcomes.
Rasmussen, Trine Bernholdt; Berg, Selina Kikkenborg; Dixon, Jane; Moons, Philip; Konradsen, Hanne
2016-12-01
Negative body perception has been reported in a number of patient populations. No instrument in Danish for measuring body image-related concerns has been available. Without such an instrument, understanding of the phenomenon in Danish-speaking populations is limited. The purpose of the study was thus to translate and validate a Danish version of the Body Image Quality of Life Inventory (BIQLI), in order to obtain a valid instrument applicable for healthcare research. The study consisted of two phases: (i) instrument adaptation, including forward and back translation, expert committee comparisons and cognitive interviewing, and (ii) empirical testing of the Danish version (BIQLI-DA) with subsequent psychometric evaluation. Hypothesised correlations to other measures, including body mass index (BMI), Medical Outcome Short Form-8 (SF-8), Patient Health Questionnaire-9 (PHQ-9), General Anxiety Disorder-7 and Symptom Check List-90-Revised (SCL-90-R ® ) were tested. In addition, exploratory factor structure analysis (EFA) and internal consistency on item and scale level were performed. The adapted instrument was found to be semantically sound, yet concerns about face validity did arise through cognitive interviews. Danish college students (n = 189, 65 men, M age = 21.1 years) participated in the piloting of the BIQLI-DA. Convergent construct validity was demonstrated through associations to related constructs. Exploratory factor analysis revealed a potential subscale structure. Finally, results showed a high internal consistency (Cronbach's alpha = 0.92). Support for the validity of the BIQLI-DA might have been strengthened by repeating cognitive interviews after layout alterations, by piloting the instrument on a larger sample. This study demonstrated tentative support for the validity of the Danish Body Image Quality of Life (BIQLI-DA) and found the measure to be reliable in terms of internal consistency. Further exploration of response processes and construct validity is needed. © 2016 Nordic College of Caring Science.
Bryant, Elizabeth; Murtagh, Shemane; Finucane, Laura; McCrum, Carol; Mercer, Christopher; Smith, Toby; Canby, Guy; Rowe, David A; Moore, Ann P
2018-05-11
In response for the need of a freely available, stand-alone, validated outcome measure for use within musculoskeletal (MSK) physiotherapy practice, sensitive enough to measure clinical effectiveness, we developed an MSK patient reported outcome measure. This study examined the validity and reliability of the newly developed Brighton musculoskeletal Patient-Reported Outcome Measure (BmPROM) within physiotherapy outpatient settings. Two hundred twenty-four patients attending physiotherapy outpatient departments in South East England with an MSK condition participated in this study. The BmPROM was assessed for user friendliness (rated feedback, N = 224), reliability (internal consistency and test-retest reliability, n = 42), validity (internal and external construct validity, N = 224), and responsiveness (internal, n = 25). Exploratory factor analysis indicated that a two-factor model provides a good fit to the data. Factors were representative of "Functionality" and "Wellbeing". Correlations observed between the BmPROM and SF-36 domains provided evidence of convergent validity. Reliability results indicated that both subscales were internally consistent with alphas above the acceptable limits for both "Functionality" (α = .85, 95% CI [.81, .88]) and 'Wellbeing' (α = .80, 95% CI [.75, .84]). Test-retest analyses (n = 42) demonstrated a high degree of reliability between "Functionality" (ICC = .84; 95% CI [.72, .91]) and "Wellbeing" scores (ICC = .84; 95% CI [.72, .91]). Further examination of test-retest reliability through the Bland-Altman analysis demonstrated that the difference between "Functionality" and "Wellbeing" test scores did not vary as a function of absolute test score. Large treatment effect sizes were found for both subscales (Functionality d = 1.10; Wellbeing 1.03). The BmPROM is a reliable and valid outcome measure for use in evaluating physiotherapy treatment of MSK conditions. Copyright © 2018 John Wiley & Sons, Ltd.
Nascimento, Lucila Castanheira; Nunes, Michelle Darezzo Rodrigues; Rocha, Ester Leonardo; Bomfim, Emiliana Omena; Floria-Santos, Milena; Dos Santos, Claudia Benedita; Dos Santos, Danielle Maria de Souza Serio; de Lima, Regina Aparecida Garcia
2015-01-01
Among the main factors that affect patients' quality of life, fatigue is a significant symptom experienced by children during treatment. Despite the high incidence, there has been no validated scale to evaluate fatigue in children with cancer in Brazil. The purpose of this study was to examine the psychometric properties of the PedsQL™ Multidimensional Fatigue Scale, using self-reports of Brazilian children, 8 to 18 years of age, and proxy reports. A cross-sectional method was used to collect data from 216 subjects over an 18-month period. Reliability ranged from .70 to .90 except for sleep/rest fatigue, self-report (α = .55). No floor or ceiling effects were found in any dimension. Convergent validity was higher than .40 and divergent validity had 100% adjustment. The root mean square error of approximation was acceptable. The comparative fit index was lower than expected. The agreement between self and proxy responses was weak and moderate. The results demonstrate the reliability and validity of the Brazilian version in children with cancer. This is the first validated scale that assesses fatigue in Brazilian children and adolescents with cancer. © 2014 by Association of Pediatric Hematology/Oncology Nurses.
Soble, Jason R; Bain, Kathleen M; Bailey, K Chase; Kirton, Joshua W; Marceaux, Janice C; Critchfield, Edan A; McCoy, Karin J M; O'Rourke, Justin J F
2018-01-08
Embedded performance validity tests (PVTs) allow for continuous assessment of invalid performance throughout neuropsychological test batteries. This study evaluated the utility of the Wechsler Memory Scale-Fourth Edition (WMS-IV) Logical Memory (LM) Recognition score as an embedded PVT using the Advanced Clinical Solutions (ACS) for WAIS-IV/WMS-IV Effort System. This mixed clinical sample was comprised of 97 total participants, 71 of whom were classified as valid and 26 as invalid based on three well-validated, freestanding criterion PVTs. Overall, the LM embedded PVT demonstrated poor concordance with the criterion PVTs and unacceptable psychometric properties using ACS validity base rates (42% sensitivity/79% specificity). Moreover, 15-39% of participants obtained an invalid ACS base rate despite having a normatively-intact age-corrected LM Recognition total score. Receiving operating characteristic curve analysis revealed a Recognition total score cutoff of < 61% correct improved specificity (92%) while sensitivity remained weak (31%). Thus, results indicated the LM Recognition embedded PVT is not appropriate for use from an evidence-based perspective, and that clinicians may be faced with reconciling how a normatively intact cognitive performance on the Recognition subtest could simultaneously reflect invalid performance validity.
Using Focus Groups to Validate a Pharmacy Vaccination Training Program.
Bushell, Mary; Morrissey, Hana; Ball, Patrick
2015-06-12
Introduction: Focus group methodology is commonly used to quickly collate, integrated views from a variety of different stakeholders. This paper provides an example of how focus groups can be employed to collate expert opinion informing amendments on a newly developed training program for integration into undergraduate pharmacy curricula. Materials and methods: Four focus groups were conducted, across three continents, to determine the appropriateness and reliability of a developed vaccination training program with nested injection skills training. All focus groups were comprised of legitimate experts in the field of vaccination, medicine and/or pharmacy. Results: Themes that emerged across focus groups informed amendments giving rise to a validated version of a training program. Discussion : The rigorous validation of the vaccination training program offers generalizable lessons to inform the design and validation of future training programs intended for the health sector and or pharmacy curricula. Using the knowledge and experience of focus group participants fostered collaborative problem solving and validation of material and concept development. The group dynamics of a focus group allowed synthesis of feedback in an inter-professional manner. Conclusions : This paper provides a demonstration of how focus groups can be structured and used by health researchers to validate a newly developed training program.
NASA Technical Reports Server (NTRS)
Whiteman, D. N.; Russo, F.; Demoz, B.; Miloshevich, L. M.; Veselovskii, I.; Hannon, S.; Wang, Z.; Vomel, H.; Schmidlin, F.; Lesht, B.
2005-01-01
Early work within the Aqua validation activity revealed there to be large differences in water vapor measurement accuracy among the various technologies in use for providing validation data. The validation measurements were made at globally distributed sites making it difficult to isolate the sources of the apparent measurement differences among the various sensors, which included both Raman lidar and radiosonde. Because of this, the AIRS Water Vapor Experiment-Ground (AWEX-G) was held in October - November, 2003 with the goal of bringing validation technologies to a common site for intercomparison and resolution of the measurement discrepancies. Using the University of Colorado Cryogenic Frostpoint Hygrometer (CFH) as the water vapor reference, the AWEX-G field campaign resulted in new correction techniques for both Raman lidar, Vaisala RS80-H and RS90/92 measurements that significantly improve the absolute accuracy of those measurement systems particularly in the upper troposphere. Mean comparisons of radiosondes and lidar are performed demonstrating agreement between corrected sensors and the CFH to generally within 5% thereby providing data of sufficient accuracy for Aqua validation purposes. Examples of the use of the correction techniques in radiance and retrieval comparisons are provided and discussed.
The city of hope-quality of life-ostomy questionnaire: persian translation and validation.
Anaraki, F; Vafaie, M; Behboo, R; Esmaeilpour, S; Maghsoodi, N; Safaee, A; Grant, M
2014-07-01
Since there is no disease-specific instrument for measuring quality-of-life (QOL) in Ostomy patients in Persian language. This study was designed to translate and evaluate the validity and reliability of City of Hope-quality of life-Ostomy questionnaire (COH-QOL-Ostomy questionnaire). This study was designed as cross-sectional study. Reliability of the subscales and the summary scores were demonstrated by intra-class correlation coefficients. Pearson's correlations of an item with its own scale and other scales were calculated to evaluated convergent and discriminant validity. Clinical validity was also evaluated by known-group comparisons. Cronbach's alpha coefficient for all subscales was about 0.70 or higher. Results of interscale correlation were satisfactory and each subscale only measured a single and specified trait. All subscales met the standards of convergent and discriminant validity. Known group comparison analysis showed significant differences in social and spiritual well-being. The findings confirmed the reliability and validity of Persian version of COH-QOL-Ostomy questionnaire. The instrument was also well received by the Iranian patients. It can be considered as a valuable instrument to assess the different aspects of health related quality-of-life in Ostomy patients and used in clinical research in the future.
A Gigabit-per-Second Ka-Band Demonstration Using a Reconfigurable FPGA Modulator
NASA Technical Reports Server (NTRS)
Lee, Dennis; Gray, Andrew A.; Kang, Edward C.; Tsou, Haiping; Lay, Norman E.; Fong, Wai; Fisher, Dave; Hoy, Scott
2005-01-01
Gigabit-per-second communications have been a desired target for future NASA Earth science missions, and for potential manned lunar missions. Frequency bandwidth at S-band and X-band is typically insufficient to support missions at these high data rates. In this paper, we present the results of a 1 Gbps 32-QAM end-to-end experiment at Ka-band using a reconfigurable Field Programmable Gate Array (FPGA) baseband modulator board. Bit error rate measurements of the received signal using a software receiver demonstrate the feasibility of using ultra-high data rates at Ka-band, although results indicate that error correcting coding and/or modulator predistortion must be implemented in addition. Also, results of the demonstration validate the low-cost, MOS-based reconfigurable modulator approach taken to development of a high rate modulator, as opposed to more expensive ASIC or pure analog approaches.
Citera, Maryalice; Freeman, Phyllis R; Horowitz, Richard I
2017-01-01
Purpose Lyme disease is spreading worldwide, with multiple Borrelia species causing a broad range of clinical symptoms that mimic other illnesses. A validated Lyme disease screening questionnaire would be clinically useful for both providers and patients. Three studies evaluated such a screening tool, namely the Horowitz Multiple Systemic Infectious Disease Syndrome (MSIDS) Questionnaire. The purpose was to see if the questionnaire could accurately distinguish between Lyme patients and healthy individuals. Methods Study 1 examined the construct validity of the scale examining its factor structure and reliability of the questionnaire among 537 individuals being treated for Lyme disease. Study 2 involved an online sample of 999 participants, who self-identified as either healthy (N=217) or suffering from Lyme now (N=782) who completed the Horowitz MSIDS Questionnaire (HMQ) along with an outdoor activity survey. We examined convergent validity among components of the scale and evaluated discriminant validity with the Big Five personality characteristics. The third study compared a sample of 236 patients with confirmed Lyme disease with an online sample of 568 healthy individuals. Results Factor analysis results identified six underlying latent dimensions; four of these overlapped with critical symptoms identified by Horowitz – neuropathy, cognitive dysfunction, musculoskeletal pain, and fatigue. The HMQ showed acceptable levels of internal reliability using Cronbach’s coefficient alpha and exhibited evidence of convergent and divergent validity. Components of the HMQ correlated more highly with each other than with unrelated traits. Discussion The results consistently demonstrated that the HMQ accurately differentiated those with Lyme disease from healthy individuals. Three migratory pain survey items (persistent muscular pain, arthritic pain, and nerve pain/paresthesias) robustly identified individuals with verified Lyme disease. The results support the use of the HMQ as a valid, efficient, and low-cost screening tool for medical practitioners to decide if additional testing is warranted to distinguish between Lyme disease and other illnesses. PMID:28919803
Validation of Metagenomic Next-Generation Sequencing Tests for Universal Pathogen Detection.
Schlaberg, Robert; Chiu, Charles Y; Miller, Steve; Procop, Gary W; Weinstock, George
2017-06-01
- Metagenomic sequencing can be used for detection of any pathogens using unbiased, shotgun next-generation sequencing (NGS), without the need for sequence-specific amplification. Proof-of-concept has been demonstrated in infectious disease outbreaks of unknown causes and in patients with suspected infections but negative results for conventional tests. Metagenomic NGS tests hold great promise to improve infectious disease diagnostics, especially in immunocompromised and critically ill patients. - To discuss challenges and provide example solutions for validating metagenomic pathogen detection tests in clinical laboratories. A summary of current regulatory requirements, largely based on prior guidance for NGS testing in constitutional genetics and oncology, is provided. - Examples from 2 separate validation studies are provided for steps from assay design, and validation of wet bench and bioinformatics protocols, to quality control and assurance. - Although laboratory and data analysis workflows are still complex, metagenomic NGS tests for infectious diseases are increasingly being validated in clinical laboratories. Many parallels exist to NGS tests in other fields. Nevertheless, specimen preparation, rapidly evolving data analysis algorithms, and incomplete reference sequence databases are idiosyncratic to the field of microbiology and often overlooked.
Miciak, Jeremy; Fletcher, Jack M.; Stuebing, Karla; Vaughn, Sharon; Tolar, Tammy D.
2014-01-01
Purpose Few empirical investigations have evaluated LD identification methods based on a pattern of cognitive strengths and weaknesses (PSW). This study investigated the reliability and validity of two proposed PSW methods: the concordance/discordance method (C/DM) and cross battery assessment (XBA) method. Methods Cognitive assessment data for 139 adolescents demonstrating inadequate response to intervention was utilized to empirically classify participants as meeting or not meeting PSW LD identification criteria using the two approaches, permitting an analysis of: (1) LD identification rates; (2) agreement between methods; and (3) external validity. Results LD identification rates varied between the two methods depending upon the cut point for low achievement, with low agreement for LD identification decisions. Comparisons of groups that met and did not meet LD identification criteria on external academic variables were largely null, raising questions of external validity. Conclusions This study found low agreement and little evidence of validity for LD identification decisions based on PSW methods. An alternative may be to use multiple measures of academic achievement to guide intervention. PMID:24274155
Kadish, Navah Ester; Baumann, Matthias; Pietz, Joachim; Schubert-Bast, Susanne; Reuner, Gitta
2013-10-01
Our prospective study aimed at the validation of EpiTrack Junior, a neuropsychological screening tool for attention and executive functions in children with epilepsy. Twenty-two children with absence epilepsy aged 8-17 years underwent comprehensive neuropsychological evaluation including EpiTrack Junior and measures of intelligence, verbal and nonverbal memory, word fluency and visuoconstructive organization. Concurrent and discriminant validity of EpiTrack Junior subtests and total score as well as sensitivity and specificity of the total score were analyzed. EpiTrack Junior total score was impaired in 59% of participants. Concurrent validity was demonstrated in 4/6 subtests and for the total score. Discriminant validity was shown with respect to verbal and nonverbal long-term memory. Sensitivity was higher than specificity and highest for the "working memory index". EpiTrack Junior is recommended as a sensitive and time-efficient screening tool for attention and executive functions in children with epilepsy. Impaired results should be followed up with detailed evaluation including information from the parents and school as well as counseling where indicated. © 2013.
Liggett, Jacqueline; Carmichael, Kieran L C; Smith, Alexander; Sellbom, Martin
2017-01-01
This study examined the validity of newly developed disorder-specific impairment scales (IS), modeled on the Level of Personality Functioning Scale, for obsessive-compulsive (OCPD) and avoidant (AvPD) personality disorders. The IS focused on content validity (items directly reflected the disorder-specific impairments listed in DSM-5 Section III) and severity of impairment. A community sample of 313 adults completed personality inventories indexing the DSM-5 Sections II and III diagnostic criteria for OCPD and AvPD, as well as measures of impairment in the domains of self- and interpersonal functioning. Results indicated that both impairment measures (for AvPD in particular) showed promise in their ability to measure disorder-specific impairment, demonstrating convergent validity with their respective Section II counterparts and discriminant validity with their noncorresponding Section II disorder and with each other. The pattern of relationships between scores on the IS and scores on external measures of personality functioning, however, did not indicate that it is useful to maintain a distinction between impairment in the self- and interpersonal domains, at least for AvPD and OCPD.
Fitzgibbons, Patrick L; Goldsmith, Jeffrey D; Souers, Rhona J; Fatheree, Lisa A; Volmar, Keith E; Stuart, Lauren N; Nowak, Jan A; Astles, J Rex; Nakhleh, Raouf E
2017-09-01
- Laboratories must demonstrate analytic validity before any test can be used clinically, but studies have shown inconsistent practices in immunohistochemical assay validation. - To assess changes in immunohistochemistry analytic validation practices after publication of an evidence-based laboratory practice guideline. - A survey on current immunohistochemistry assay validation practices and on the awareness and adoption of a recently published guideline was sent to subscribers enrolled in one of 3 relevant College of American Pathologists proficiency testing programs and to additional nonsubscribing laboratories that perform immunohistochemical testing. The results were compared with an earlier survey of validation practices. - Analysis was based on responses from 1085 laboratories that perform immunohistochemical staining. Of 1057 responses, 65.4% (691) were aware of the guideline recommendations before this survey was sent and 79.9% (550 of 688) of those have already adopted some or all of the recommendations. Compared with the 2010 survey, a significant number of laboratories now have written validation procedures for both predictive and nonpredictive marker assays and specifications for the minimum numbers of cases needed for validation. There was also significant improvement in compliance with validation requirements, with 99% (100 of 102) having validated their most recently introduced predictive marker assay, compared with 74.9% (326 of 435) in 2010. The difficulty in finding validation cases for rare antigens and resource limitations were cited as the biggest challenges in implementing the guideline. - Dissemination of the 2014 evidence-based guideline validation practices had a positive impact on laboratory performance; some or all of the recommendations have been adopted by nearly 80% of respondents.
Can We Study Autonomous Driving Comfort in Moving-Base Driving Simulators? A Validation Study.
Bellem, Hanna; Klüver, Malte; Schrauf, Michael; Schöner, Hans-Peter; Hecht, Heiko; Krems, Josef F
2017-05-01
To lay the basis of studying autonomous driving comfort using driving simulators, we assessed the behavioral validity of two moving-base simulator configurations by contrasting them with a test-track setting. With increasing level of automation, driving comfort becomes increasingly important. Simulators provide a safe environment to study perceived comfort in autonomous driving. To date, however, no studies were conducted in relation to comfort in autonomous driving to determine the extent to which results from simulator studies can be transferred to on-road driving conditions. Participants ( N = 72) experienced six differently parameterized lane-change and deceleration maneuvers and subsequently rated the comfort of each scenario. One group of participants experienced the maneuvers on a test-track setting, whereas two other groups experienced them in one of two moving-base simulator configurations. We could demonstrate relative and absolute validity for one of the two simulator configurations. Subsequent analyses revealed that the validity of the simulator highly depends on the parameterization of the motion system. Moving-base simulation can be a useful research tool to study driving comfort in autonomous vehicles. However, our results point at a preference for subunity scaling factors for both lateral and longitudinal motion cues, which might be explained by an underestimation of speed in virtual environments. In line with previous studies, we recommend lateral- and longitudinal-motion scaling factors of approximately 50% to 60% in order to obtain valid results for both active and passive driving tasks.
Development of the Assessment of Belief Conflict in Relationship-14 (ABCR-14)
Kyougoku, Makoto; Teraoka, Mutsumi; Masuda, Noriko; Ooura, Mariko; Abe, Yasushi
2015-01-01
Purpose Nurses and other healthcare workers frequently experience belief conflict, one of the most important, new stress-related problems in both academic and clinical fields. Methods In this study, using a sample of 1,683 nursing practitioners, we developed The Assessment of Belief Conflict in Relationship-14 (ABCR-14), a new scale that assesses belief conflict in the healthcare field. Standard psychometric procedures were used to develop and test the scale, including a qualitative framework concept and item-pool development, item reduction, and scale development. We analyzed the psychometric properties of ABCR-14 according to entropy, polyserial correlation coefficient, exploratory factor analysis, confirmatory factor analysis, average variance extracted, Cronbach’s alpha, Pearson product-moment correlation coefficient, and multidimensional item response theory (MIRT). Results The results of the analysis supported a three-factor model consisting of 14 items. The validity and reliability of ABCR-14 was suggested by evidence from high construct validity, structural validity, hypothesis testing, internal consistency reliability, and concurrent validity. The result of the MIRT offered strong support for good item response of item slope parameters and difficulty parameters. However, the ABCR-14 Likert scale might need to be explored from the MIRT point of view. Yet, as mentioned above, there is sufficient evidence to support that ABCR-14 has high validity and reliability. Conclusion The ABCR-14 demonstrates good psychometric properties for nursing belief conflict. Further studies are recommended to confirm its application in clinical practice. PMID:26247356
Schleier, Jerome J.; Peterson, Robert K.D.; Irvine, Kathryn M.; Marshall, Lucy M.; Weaver, David K.; Preftakes, Collin J.
2012-01-01
One of the more effective ways of managing high densities of adult mosquitoes that vector human and animal pathogens is ultra-low-volume (ULV) aerosol applications of insecticides. The U.S. Environmental Protection Agency uses models that are not validated for ULV insecticide applications and exposure assumptions to perform their human and ecological risk assessments. Currently, there is no validated model that can accurately predict deposition of insecticides applied using ULV technology for adult mosquito management. In addition, little is known about the deposition and drift of small droplets like those used under conditions encountered during ULV applications. The objective of this study was to perform field studies to measure environmental concentrations of insecticides and to develop a validated model to predict the deposition of ULV insecticides. The final regression model was selected by minimizing the Bayesian Information Criterion and its prediction performance was evaluated using k-fold cross validation. Density of the formulation and the density and CMD interaction coefficients were the largest in the model. The results showed that as density of the formulation decreases, deposition increases. The interaction of density and CMD showed that higher density formulations and larger droplets resulted in greater deposition. These results are supported by the aerosol physics literature. A k-fold cross validation demonstrated that the mean square error of the selected regression model is not biased, and the mean square error and mean square prediction error indicated good predictive ability.
A Bacterial Glycoengineered Antigen for Improved Serodiagnosis of Porcine Brucellosis
Cortina, María E.; Balzano, Rodrigo E.; Rey Serantes, Diego A.; Caillava, Ana J.; Elena, Sebastián; Ferreira, A. C.; Nicola, Ana M.; Ugalde, Juan E.
2016-01-01
Brucellosis is a highly zoonotic disease that affects animals and human beings. Brucella suis is the etiological agent of porcine brucellosis and one of the major human brucellosis pathogens. Laboratory diagnosis of porcine brucellosis mainly relies on serological tests, and it has been widely demonstrated that serological assays based on the detection of anti O-polysaccharide antibodies are the most sensitive tests. Here, we validate a recombinant glycoprotein antigen, an N-formylperosamine O-polysaccharide–protein conjugate (OAg-AcrA), for diagnosis of porcine brucellosis. An indirect immunoassay based on the detection of anti-O-polysaccharide IgG antibodies was developed coupling OAg-AcrA to enzyme-linked immunosorbent assay plates (glyco-iELISA). To validate the assay, 563 serum samples obtained from experimentally infected and immunized pigs, as well as animals naturally infected with B. suis biovar 1 or 2, were tested. A receiver operating characteristic (ROC) analysis was performed, and based on this analysis, the optimum cutoff value was 0.56 (relative reactivity), which resulted in a diagnostic sensitivity and specificity of 100% and 99.7%, respectively. A cutoff value of 0.78 resulted in a test sensitivity of 98.4% and a test specificity of 100%. Overall, our results demonstrate that the glyco-iELISA is highly accurate for diagnosis of porcine brucellosis, improving the diagnostic performance of current serological tests. The recombinant glycoprotein OAg-AcrA can be produced in large homogeneous batches in a standardized way, making it an ideal candidate for further validation as a universal antigen for diagnosis of “smooth” brucellosis in animals and humans. PMID:26984975
Geiß, Cornelia; Ruppert, Katharina; Askem, Clare; Barroso, Carlos; Faber, Daniel; Ducrot, Virginie; Holbech, Henrik; Hutchinson, Thomas H; Kajankari, Paula; Kinnberg, Karin Lund; Lagadic, Laurent; Matthiessen, Peter; Morris, Steve; Neiman, Maurine; Penttinen, Olli-Pekka; Sanchez-Marin, Paula; Teigeler, Matthias; Weltje, Lennart; Oehlmann, Jörg
2017-04-01
The Organisation for Economic Cooperation and Development (OECD) provides several standard test methods for the environmental hazard assessment of chemicals, mainly based on primary producers, arthropods, and fish. In April 2016, two new test guidelines with two mollusc species representing different reproductive strategies were approved by OECD member countries. One test guideline describes a 28-day reproduction test with the parthenogenetic New Zealand mudsnail Potamopyrgus antipodarum. The main endpoint of the test is reproduction, reflected by the embryo number in the brood pouch per female. The development of a new OECD test guideline involves several phases including inter-laboratory validation studies to demonstrate the robustness of the proposed test design and the reproducibility of the test results. Therefore, a ring test of the reproduction test with P. antipodarum was conducted including eight laboratories with the test substances trenbolone and prochloraz and results are presented here. Most laboratories could meet test validity criteria, thus demonstrating the robustness of the proposed test protocol. Trenbolone did not have an effect on the reproduction of the snails at the tested concentration range (nominal: 10-1000 ng/L). For prochloraz, laboratories produced similar EC 10 and NOEC values, showing the inter-laboratory reproducibility of results. The average EC 10 and NOEC values for reproduction (with coefficient of variation) were 26.2 µg/L (61.7%) and 29.7 µg/L (32.9%), respectively. This ring test shows that the mudsnail reproduction test is a well-suited tool for use in the chronic aquatic hazard and risk assessment of chemicals.
Tomaszewski, Robert; Mitrushina, Maura
2016-01-01
To investigate utility of the Community Integration Questionnaire (CIQ) in a mixed sample of adults with neurological and neuropsychiatric disorders. Cross-sectional, interview-based study. Participants were community-dwelling adults with disabilities resulting from neurological and neuropsychiatric disorders (N = 54), who participated in a pre-vocational readiness and social skills training program. Psychometric properties of the Community Integration Questionnaire (CIQ) were assessed and validated against Mayo-Portland Adaptability Inventory (MPAI) and The Problem Checklist from the New York University Head Injury Family Interview (PCL). Based on the revised scoring procedures, psychometric properties of the CIQ Home Competency scale were excellent, followed by the Total score and Social Integration scale. Productive Activity scale had low content validity and a weak association with the total score. Convergent and discriminant validity of the CIQ were demonstrated by correlation patterns with MPAI scales in the expected direction. Significant relationship was found with PCL Physical/Dependency scale. Significant associations were found with sex, living status, and record of subsequent employment. The results provide support for the use of the CIQ as a measure of participation in individuals with neurological and neuropsychiatric diagnoses and resulting disabilities. An important goal of rehabilitation and training programs for individuals with dysfunction of the central nervous system is to promote their participation in social, vocational, and domestic activities. The Community Integration Questionnaire (CIQ) is a brief and efficient instrument for measuring these participation domains. This study demonstrated good psychometric properties and high utility of the CIQ in a sample of 54 individuals participating in a prevocational training program.
Biljak, Vanja Radisic; Ozvald, Ivan; Radeljak, Andrea; Majdenic, Kresimir; Lasic, Branka; Siftar, Zoran; Lovrencic, Marijana Vucic; Flegar-Mestric, Zlata
2012-01-01
The aim of the study was to present a protocol for laboratory information system (LIS) and hospital information system (HIS) validation at the Institute of Clinical Chemistry and Laboratory Medicine of the Merkur University Hospital, Zagreb, Croatia. Validity of data traceability was checked by entering all test requests for virtual patient into HIS/LIS and printing corresponding barcoded labels that provided laboratory analyzers with the information on requested tests. The original printouts of the test results from laboratory analyzer(s) were compared with the data obtained from LIS and entered into the provided template. Transfer of data from LIS to HIS was examined by requesting all tests in HIS and creating real data in a finding generated in LIS. Data obtained from LIS and HIS were entered into a corresponding template. The main outcome measure was the accuracy of transfer obtained from laboratory analyzers and results transferred from LIS and HIS expressed as percentage (%). The accuracy of data transfer from laboratory analyzers to LIS was 99.5% and of that from LIS to HIS 100%. We presented our established validation protocol for laboratory information system and demonstrated that a system meets its intended purpose.
Aeroacoustic Validation of Installed Low Noise Propulsion for NASA's N+2 Supersonic Airliner
NASA Technical Reports Server (NTRS)
Bridges, James
2018-01-01
An aeroacoustic test was conducted at NASA Glenn Research Center on an integrated propulsion system designed to meet noise regulations of ICAO Chapter 4 with 10EPNdB cumulative margin. The test had two objectives: to demonstrate that the aircraft design did meet the noise goal, and to validate the acoustic design tools used in the design. Variations in the propulsion system design and its installation were tested and the results compared against predictions. Far-field arrays of microphones measured the acoustic spectral directivity, which was transformed to full scale as noise certification levels. Phased array measurements confirmed that the shielding of the installation model adequately simulated the full aircraft and provided data for validating RANS-based noise prediction tools. Particle image velocimetry confirmed that the flow field around the nozzle on the jet rig mimicked that of the full aircraft and produced flow data to validate the RANS solutions used in the noise predictions. The far-field acoustic measurements confirmed the empirical predictions for the noise. Results provided here detail the steps taken to ensure accuracy of the measurements and give insights into the physics of exhaust noise from installed propulsion systems in future supersonic vehicles.
LoBue, Vanessa; Baker, Lewis; Thrasher, Cat
2017-08-10
Researchers have been interested in the perception of human emotional expressions for decades. Importantly, most empirical work in this domain has relied on controlled stimulus sets of adults posing for various emotional expressions. Recently, the Child Affective Facial Expression (CAFE) set was introduced to the scientific community, featuring a large validated set of photographs of preschool aged children posing for seven different emotional expressions. Although the CAFE set was extensively validated using adult participants, the set was designed for use with children. It is therefore necessary to verify that adult validation applies to child performance. In the current study, we examined 3- to 4-year-olds' identification of a subset of children's faces in the CAFE set, and compared it to adult ratings cited in previous research. Our results demonstrate an exceptionally strong relationship between adult ratings of the CAFE photos and children's ratings, suggesting that the adult validation of the set can be applied to preschool-aged participants. The results are discussed in terms of methodological implications for the use of the CAFE set with children, and theoretical implications for using the set to study the development of emotion perception in early childhood.
The VALiDATe29 MRI Based Multi-Channel Atlas of the Squirrel Monkey Brain.
Schilling, Kurt G; Gao, Yurui; Stepniewska, Iwona; Wu, Tung-Lin; Wang, Feng; Landman, Bennett A; Gore, John C; Chen, Li Min; Anderson, Adam W
2017-10-01
We describe the development of the first digital atlas of the normal squirrel monkey brain and present the resulting product, VALiDATe29. The VALiDATe29 atlas is based on multiple types of magnetic resonance imaging (MRI) contrast acquired on 29 squirrel monkeys, and is created using unbiased, nonlinear registration techniques, resulting in a population-averaged stereotaxic coordinate system. The atlas consists of multiple anatomical templates (proton density, T1, and T2* weighted), diffusion MRI templates (fractional anisotropy and mean diffusivity), and ex vivo templates (fractional anisotropy and a structural MRI). In addition, the templates are combined with histologically defined cortical labels, and diffusion tractography defined white matter labels. The combination of intensity templates and image segmentations make this atlas suitable for the fundamental atlas applications of spatial normalization and label propagation. Together, this atlas facilitates 3D anatomical localization and region of interest delineation, and enables comparisons of experimental data across different subjects or across different experimental conditions. This article describes the atlas creation and its contents, and demonstrates the use of the VALiDATe29 atlas in typical applications. The atlas is freely available to the scientific community.
NASA Astrophysics Data System (ADS)
Lin, Tzung-Jin; Tsai, Chin-Chung
2017-11-01
The purpose of this study was to develop and validate two survey instruments to evaluate high school students' scientific epistemic beliefs and goal orientations in learning science. The initial relationships between the sampled students' scientific epistemic beliefs and goal orientations in learning science were also investigated. A final valid sample of 600 volunteer Taiwanese high school students participated in this survey by responding to the Scientific Epistemic Beliefs Instrument (SEBI) and the Goal Orientations in Learning Science Instrument (GOLSI). Through both exploratory and confirmatory factor analyses, the SEBI and GOLSI were proven to be valid and reliable for assessing the participants' scientific epistemic beliefs and goal orientations in learning science. The path analysis results indicated that, by and large, the students with more sophisticated epistemic beliefs in various dimensions such as Development of Knowledge, Justification for Knowing, and Purpose of Knowing tended to adopt both Mastery-approach and Mastery-avoidance goals. Some interesting results were also found. For example, the students tended to set a learning goal to outperform others or merely demonstrate competence (Performance-approach) if they had more informed epistemic beliefs in the dimensions of Multiplicity of Knowledge, Uncertainty of Knowledge, and Purpose of Knowing.
FACTORIAL VALIDITY OF THE KOREAN VERSION OF THE EXERCISE DEPENDENCE SCALE-REVISED.
Shin, Kyulee; You, Sukkyung
2015-12-01
This study evaluated the psychometric properties of the Korean version of the 21-item Exercise Dependence Scale-Revised (EDS-R). The EDS-R was designed to measure the multidimensional aspects of exercise dependence symptoms such as withdrawal, continuance, tolerance, lack of control, reductions, time, and intention. Although the EDS-R has demonstrated sound psychometric properties, it has only been validated in Western samples. Cross-cultural validations of the instrument may increase the knowledge of exercise dependence. Therefore, this study aimed to contribute to the file by investigating the validity and utility of the construct of the EDS-R, using a non-Western sample. 402 adult participants who were over 18 years of age and who reported exercising at least once a week were asked to complete the EDS-R. The results from factor analyses supported that the seven-factor model of exercise dependence symptoms showed an adequate fit for both men and women. The EDS-R scores differentiated between samples, with varying amounts of exercise; 15.4% of the sample was classified as being at risk for exercise dependence. In sum, the results indicated that the EDS-R is a psychometrically reliable assessment tool for exercise dependence symptoms in Korea.
Costa, Sebastiano; Cuzzocrea, Francesca; Hausenblas, Heather A; Larcan, Rosalba; Oliva, Patrizia
2012-12-01
Background and aims The purpose of this study was to verify the factorial structure, internal validity, reliability, and criterion validity of the 21-item Exercise Dependence Scale-Revised (EDS-R) in an Italian sample. Methods Italian voluntary (N = 519) users of gyms who had a history of regular exercise for over a year completed the EDS-R and measures of exercise frequency. Results and conclusions Confirmatory factor analyses demonstrated a good fit to the hypothesized 7-factor model, and adequate internal consistency for the scale was evidenced. Criterion validity was evidenced by significant correlations among all the subscale of the EDS and exercise frequency. Finally, individuals at risk for exercise dependence reported more exercise behavior compared to the nondependent-symptomatic and nondependent-asymptomatic groups. These results suggest that the seven subscales of the Italian version of the EDS are measuring the construct of exercise dependence as defined by the DSM-IV criteria for substance dependence and also confirm previous research using the EDS-R in other languages. More research is needed to examine the psychometric properties of the EDS-R in diverse populations with various research designs.
Non-Technical Skills for Surgeons (NOTSS): Critical appraisal of its measurement properties.
Jung, James J; Borkhoff, Cornelia M; Jüni, Peter; Grantcharov, Teodor P
2018-02-17
To critically appraise the development and measurement properties, including sensibility, reliability, and validity of the Non-Technical Skills of Surgeons (NOTSS) system. Articles that described development process of the NOTSS system were identified. Relevant primary studies that presented evidence of reliability and validity were identified through a comprehensive literature review. NOTSS was developed through robust item generation and reduction strategies. It was shown to have good content validity, acceptability, and feasibility. Inter-rater reliability increased with greater expertise and number of assessors. Studies demonstrated evidence of cross-sectional construct validity, in that the tool was able to differentiate known groups of varied non-technical skill levels. Evidence of longitudinal construct validity also existed to demonstrate that NOTSS detected changes in non-technical skills before and after targeted training. In populations and settings presented in our critical appraisal, NOTSS provided reliable and valid measurements of intraoperative non-technical skills of surgeons. Copyright © 2018 Elsevier Inc. All rights reserved.
Grant, Jon E; Kim, Suck Won; McCabe, James S
2006-06-01
Kleptomania presents difficulties in diagnosis for clinicians. This study aimed to develop and test a DSM-IV-based diagnostic instrument for kleptomania. To assess for current kleptomania the Structured Clinical Interview for Kleptomania (SCI-K) was administered to 112 consecutive subjects requesting psychiatric outpatient treatment for a variety of disorders. Reliability and validity were determined. Classification accuracy was examined using the longitudinal course of illness. The SCI-K demonstrated excellent test-retest (Phi coefficient = 0.956 (95% CI = 0.937, 0.970)) and inter-rater reliability (phi coefficient = 0.718 (95% CI = 0.506, 0.848)) in the diagnosis of kleptomania. Concurrent validity was observed with a self-report measure using DSM-IV kleptomania criteria (phi coefficient = 0.769 (95% CI = 0.653, 0.850)). Discriminant validity was observed with a measure of depression (point biserial coefficient = -0.020 (95% CI = -0.205, 0.166)). The SCI-K demonstrated both high sensitivity and specificity based on longitudinal assessment. The SCI-K demonstrated excellent reliability and validity in diagnosing kleptomania in subjects presenting with various psychiatric problems. These findings require replication in larger groups, including non-psychiatric populations, to examine their generalizability. Copyright (c) 2006 John Wiley & Sons, Ltd.
Mahler, H I; Kulik, J A
1995-02-01
The purpose of this study was to demonstrate the validation of videotape interventions that were designed to prepare patients for coronary artery bypass graft (CABG) surgery. First, three videotapes were developed. Two of the tapes featured the experiences of three actual CABG patients and were constructed to present either an optimistic portrayal of the recovery period (mastery tape) or a portrayal designed to inoculate patients against potential problems (coping tape). The third videotape contained the more general nurse scenes and narration used in the other two tapes, but did not include the experiences of particular patients. We then conducted a study to establish the convergent and discriminant validity of the three tapes. That is, we sought to demonstrate both that the tapes did differ along the mastery-coping dimension, and that they did not differ in other respects (such as in the degree of information provided or the perceived credibility of the narrator). The validation study, conducted with 42 males who had previously undergone CABG, demonstrated that the intended equivalences and differences between the tapes were achieved. The importance of establishing the validity of health-related interventions is discussed.
Evaluating information skills training in health libraries: a systematic review.
Brettle, Alison
2007-12-01
Systematic reviews have shown that there is limited evidence to demonstrate that the information literacy training health librarians provide is effective in improving clinicians' information skills or has an impact on patient care. Studies lack measures which demonstrate validity and reliability in evaluating the impact of training. To determine what measures have been used; the extent to which they are valid and reliable; to provide guidance for health librarians who wish to evaluate the impact of their information skills training. Systematic review methodology involved searching seven databases, and personal files. Studies were included if they were about information skills training, used an objective measure to assess outcomes, and occurred in a health setting. Fifty-four studies were included in the review. Most outcome measures used in the studies were not tested for the key criteria of validity and reliability. Three tested for validity and reliability are described in more detail. Selecting an appropriate measure to evaluate the impact of training is a key factor in carrying out any evaluation. This systematic review provides guidance to health librarians by highlighting measures used in various circumstances, and those that demonstrate validity and reliability.
Siahaan, Laura A; Syam, Ari F; Simadibrata, Marcellus; Setiati, Siti
2017-01-01
to obtain a valid and reliable GERD-QOL questionnaire for Indonesian application. at the initial stage, the GERD-QOL questionnaire was first translated into Indonesian language and the translated questionnaire was subsequently translated back into the original language (back-to-back translation). The results were evaluated by the researcher team and therefore, an Indonesian version of GERD-QOL questionnaire was developed. Ninety-one patients who had been clinically diagnosed with GERD based on the Montreal criteria were interviewed using the Indonesian version of GERD-QOL questionnaire and the SF 36 questionnaire. The validity was evaluated using a method of construct validity and external validity, and reliability can be tested by the method of internal consistency and test retest. the Indonesian version of GERD-QOL questionnaire had a good internal consistency reliability with a Cronbach Alpha of 0.687-0.842 and a good test retest reliability with an intra-class correlation coefficient of 0.756-0.936; p<0.05). The questionnaire had also been demonstrated to have a good validity with a proven high correlation to each question of SF-36 (p<0.05). the Indonesian version of GERD-QOL questionnaire has been proven valid and reliable to evaluate the quality of life of GERD patients.
Dehghan, Parvin; Asghari-Jafarabadi, Mohammad; Salekzamani, Shabnam
2015-01-01
Background: The aim of this study was to assess the validity, reliability and feasibility of eating behavior pattern questionnaire (EBPQ) in female university students. Methods: In this study, after forward-backward translation, the questionnaire was reviewed by a panel of nutritionists and a psychologist and further thirty participants for the content validity measurement. The translated and modified questionnaire was completed by 225 female students of Tabriz University in 2013. Principle axis factoring, confirmatory factor analysis and known group analysis were conducted for construct, convergent and discriminant validity. Internal consistency and test–retest reliability were assessed by Cronbach’s α coefficient and intra-class correlation coefficient (ICC). Ceiling and floor effects were also performed for evaluating the feasibility of the instrument. Results: By using exploratory factor analysis, nine factors were extracted. Confirmatory factor analysis confirmed the convergent validity. Cronbach ’s αand ICC were ranged between 0.55 to 0.78 and 0.67 to 0.89, respectively. The significant difference for some three subscales between diabetes and healthy subjects determined the discriminant validity. No ceiling and floor effects were found. Conclusion: Our findings demonstrate the initial validity, reliability and feasibility of the Iranian version of EBPQ as a useful tool for eating behavior studies in young females. PMID:26290828
Validation of virtual reality as a tool to understand and prevent child pedestrian injury.
Schwebel, David C; Gaines, Joanna; Severson, Joan
2008-07-01
In recent years, virtual reality has emerged as an innovative tool for health-related education and training. Among the many benefits of virtual reality is the opportunity for novice users to engage unsupervised in a safe environment when the real environment might be dangerous. Virtual environments are only useful for health-related research, however, if behavior in the virtual world validly matches behavior in the real world. This study was designed to test the validity of an immersive, interactive virtual pedestrian environment. A sample of 102 children and 74 adults was recruited to complete simulated road-crossings in both the virtual environment and the identical real environment. In both the child and adult samples, construct validity was demonstrated via significant correlations between behavior in the virtual and real worlds. Results also indicate construct validity through developmental differences in behavior; convergent validity by showing correlations between parent-reported child temperament and behavior in the virtual world; internal reliability of various measures of pedestrian safety in the virtual world; and face validity, as measured by users' self-reported perception of realism in the virtual world. We discuss issues of generalizability to other virtual environments, and the implications for application of virtual reality to understanding and preventing pediatric pedestrian injuries.
Validation of the MISSCARE-BRASIL survey - A tool to assess missed nursing care.
Siqueira, Lillian Dias Castilho; Caliri, Maria Helena Larcher; Haas, Vanderlei José; Kalisch, Beatrice; Dantas, Rosana Aparecida Spadoti
2017-12-21
to analyze the metric validity and reliability properties of the MISSCARE-BRASIL survey. methodological research conducted by assessing construct validity and reliability via confirmatory factor analysis, known-groups validation, convergent construct validation, analysis of internal consistency and test-retest reliability. The sample consisted of 330 nursing professionals, of whom 86 participated in the retest phase. of the 330 participants, 39.7% were aides, 33% technicians, 20.9% nurses, and 6.4% nurses with administrative roles. Confirmatory factorial analysis demonstrated that the Brazilian Portuguese version of the instrument is adequately adjusted to the dimensional structure the scale authors originally proposed. The correlation between "satisfaction with position/role" and "satisfaction with teamwork" and the survey's missed care variables was moderate (Spearman's coefficient =0.35; p<0.001). The results of the Student's t-test indicated known-group validity. Professionals from closed units reported lower levels of missed care in comparison with the other units. The reliability showed a strong correlation, with the exception of "institutional management/leadership style" (intraclass correlation coefficient (ICC)=0.15; p=0.04). The internal consistency was adequate (Cronbach's alpha was greater than 0.70). the MISSCARE-BRASIL was valid and reliable in the group studied. The application of the MISSCARE-BRASIL can contribute to identifying solutions for missed nursing care.
The Leuven Embedded Figures Test (L-EFT): measuring perception, intelligence or executive function?
Van der Hallen, Ruth; Wagemans, Johan; de-Wit, Lee; Chamberlain, Rebecca
2018-01-01
Performance on the Embedded Figures Test (EFT) has been interpreted as a reflection of local/global perceptual style, weak central coherence and/or field independence, as well as a measure of intelligence and executive function. The variable ways in which EFT findings have been interpreted demonstrate that the construct validity of this measure is unclear. In order to address this lack of clarity, we investigated to what extent performance on a new Embedded Figures Test (L-EFT) correlated with measures of intelligence, executive functions and estimates of local/global perceptual styles. In addition, we compared L-EFT performance to the original group EFT to directly contrast both tasks. Taken together, our results indicate that performance on the L-EFT does not correlate strongly with estimates of local/global perceptual style, intelligence or executive functions. Additionally, the results show that performance on the L-EFT is similarly associated with memory span and fluid intelligence as the group EFT. These results suggest that the L-EFT does not reflect a general perceptual or cognitive style/ability. These results further emphasize that empirical data on the construct validity of a task do not always align with the face validity of a task. PMID:29607257
Validation of a highly integrated SiPM readout system with a TOF-PET demonstrator
NASA Astrophysics Data System (ADS)
Niknejad, T.; Setayeshi, S.; Tavernier, S.; Bugalho, R.; Ferramacho, L.; Di Francesco, A.; Leong, C.; Rolo, M. D.; Shamshirsaz, M.; Silva, J. C.; Silva, R.; Silveira, M.; Zorraquino, C.; Varela, J.
2016-12-01
We have developed a highly integrated, fast and compact readout electronics for Silicon Photomultiplier (SiPM) based Time of Flight Positron Emission Tomography (TOF-PET) scanners. The readout is based on the use of TOP-PET Application Specific Integrated Circuit (PETsys TOFPET1 ASIC) with 64 channels, each with its amplifier, discriminator, Time to Digital Converter (TDC) and amplitude determination using Time Over Threshold (TOT). The ASIC has 25 ps r.m.s. intrinsic time resolution and fully digital output. The system is optimised for high rates, good timing, low power consumption and low cost. For validating the readout electronics, we have built a technical PET scanner, hereafter called ``demonstrator'', with 2'048 SiPM channels. The PET demonstrator has 16 compact Detector Modules (DM). Each DM has two ASICs reading 128 SiPM pixels in one-to-one coupling to 128 Lutetium Yttrium Orthosilicate (LYSO) crystals measuring 3.1 × 3.1 × 15 mm3 each. The data acquisition system for the demonstrator has two Front End Boards type D (FEB/D), each collecting the data of 1'024 channels (8 DMs), and transmitting assembled data frames through a serial link (4.8 Gbps), to a single Data Acquisition (DAQ) board plugged into the Peripheral Component Interconnect Express (PCIe) bus of the data acquisition PC. Results obtained with this PET demonstrator are presented.
Riedl, Janet; Esslinger, Susanne; Fauhl-Hassek, Carsten
2015-07-23
Food fingerprinting approaches are expected to become a very potent tool in authentication processes aiming at a comprehensive characterization of complex food matrices. By non-targeted spectrometric or spectroscopic chemical analysis with a subsequent (multivariate) statistical evaluation of acquired data, food matrices can be investigated in terms of their geographical origin, species variety or possible adulterations. Although many successful research projects have already demonstrated the feasibility of non-targeted fingerprinting approaches, their uptake and implementation into routine analysis and food surveillance is still limited. In many proof-of-principle studies, the prediction ability of only one data set was explored, measured within a limited period of time using one instrument within one laboratory. Thorough validation strategies that guarantee reliability of the respective data basis and that allow conclusion on the applicability of the respective approaches for its fit-for-purpose have not yet been proposed. Within this review, critical steps of the fingerprinting workflow were explored to develop a generic scheme for multivariate model validation. As a result, a proposed scheme for "good practice" shall guide users through validation and reporting of non-targeted fingerprinting results. Furthermore, food fingerprinting studies were selected by a systematic search approach and reviewed with regard to (a) transparency of data processing and (b) validity of study results. Subsequently, the studies were inspected for measures of statistical model validation, analytical method validation and quality assurance measures. In this context, issues and recommendations were found that might be considered as an actual starting point for developing validation standards of non-targeted metabolomics approaches for food authentication in the future. Hence, this review intends to contribute to the harmonization and standardization of food fingerprinting, both required as a prior condition for the authentication of food in routine analysis and official control. Copyright © 2015 Elsevier B.V. All rights reserved.
Sugand, Kapil; Wescott, Robert A; Carrington, Richard; Hart, Alister; Van Duren, Bernard H
2018-05-10
Background and purpose - Simulation is an adjunct to surgical education. However, nothing can accurately simulate fluoroscopic procedures in orthopedic trauma. Current options for training with fluoroscopy are either intraoperative, which risks radiation, or use of expensive and unrealistic virtual reality simulators. We introduce FluoroSim, an inexpensive digital fluoroscopy simulator without the need for radiation. Patients and methods - This was a multicenter study with 26 surgeons in which everyone completed 1 attempt at inserting a guide-wire into a femoral dry bone using surgical equipment and FluoroSim. 5 objective performance metrics were recorded in real-time to assess construct validity. The surgeons were categorized based on the number of dynamic hip screws (DHS) performed: novices (< 10), intermediates (10-39) and experts (≥ 40). A 7-point Likert scale questionnaire assessed the face and content validity of FluoroSim. Results - Construct validity was present for 2 clinically validated metrics in DHS surgery. Experts and intermediates statistically significantly outperformed novices for tip-apex distance and for cut-out rate. Novices took the least number of radiographs. Face and content validity were also observed. Interpretation - FluoroSim discriminated between novice and intermediate or expert surgeons based on tip-apex distance and cut-out rate while demonstrating face and content validity. FluoroSim provides a useful adjunct to orthopedic training. Our findings concur with results from studies using other simulation modalities. FluoroSim can be implemented for education easily and cheaply away from theater in a safe and controlled environment.
Examining the Predictive Validity of NIH Peer Review Scores
Lindner, Mark D.; Nakamura, Richard K.
2015-01-01
The predictive validity of peer review at the National Institutes of Health (NIH) has not yet been demonstrated empirically. It might be assumed that the most efficient and expedient test of the predictive validity of NIH peer review would be an examination of the correlation between percentile scores from peer review and bibliometric indices of the publications produced from funded projects. The present study used a large dataset to examine the rationale for such a study, to determine if it would satisfy the requirements for a test of predictive validity. The results show significant restriction of range in the applications selected for funding. Furthermore, those few applications that are funded with slightly worse peer review scores are not selected at random or representative of other applications in the same range. The funding institutes also negotiate with applicants to address issues identified during peer review. Therefore, the peer review scores assigned to the submitted applications, especially for those few funded applications with slightly worse peer review scores, do not reflect the changed and improved projects that are eventually funded. In addition, citation metrics by themselves are not valid or appropriate measures of scientific impact. The use of bibliometric indices on their own to measure scientific impact would likely increase the inefficiencies and problems with replicability already largely attributed to the current over-emphasis on bibliometric indices. Therefore, retrospective analyses of the correlation between percentile scores from peer review and bibliometric indices of the publications resulting from funded grant applications are not valid tests of the predictive validity of peer review at the NIH. PMID:26039440
2011-01-01
Background Insight in children's energy balance-related behaviours (EBRBs) and their determinants is important to inform obesity prevention research. Therefore, reliable and valid tools to measure these variables in large-scale population research are needed. Objective To examine the test-retest reliability and construct validity of the child questionnaire used in the ENERGY-project, measuring EBRBs and their potential determinants among 10-12 year old children. Methods We collected data among 10-12 year old children (n = 730 in the test-retest reliability study; n = 96 in the construct validity study) in six European countries, i.e. Belgium, Greece, Hungary, the Netherlands, Norway, and Spain. Test-retest reliability was assessed using the intra-class correlation coefficient (ICC) and percentage agreement comparing scores from two measurements, administered one week apart. To assess construct validity, the agreement between questionnaire responses and a subsequent face-to-face interview was assessed using ICC and percentage agreement. Results Of the 150 questionnaire items, 115 (77%) showed good to excellent test-retest reliability as indicated by ICCs > .60 or percentage agreement ≥ 75%. Test-retest reliability was moderate for 34 items (23%) and poor for one item. Construct validity appeared to be good to excellent for 70 (47%) of the 150 items, as indicated by ICCs > .60 or percentage agreement ≥ 75%. From the other 80 items, construct validity was moderate for 39 (26%) and poor for 41 items (27%). Conclusions Our results demonstrate that the ENERGY-child questionnaire, assessing EBRBs of the child as well as personal, family, and school-environmental determinants related to these EBRBs, has good test-retest reliability and moderate to good construct validity for the large majority of items. PMID:22152048
Veldhuijzen van Zanten, Sophie E M; Lane, Adam; Heymans, Martijn W; Baugh, Joshua; Chaney, Brooklyn; Hoffman, Lindsey M; Doughman, Renee; Jansen, Marc H A; Sanchez, Esther; Vandertop, William P; Kaspers, Gertjan J L; van Vuurden, Dannis G; Fouladi, Maryam; Jones, Blaise V; Leach, James
2017-08-01
We aimed to perform external validation of the recently developed survival prediction model for diffuse intrinsic pontine glioma (DIPG), and discuss its utility. The DIPG survival prediction model was developed in a cohort of patients from the Netherlands, United Kingdom and Germany, registered in the SIOPE DIPG Registry, and includes age <3 years, longer symptom duration and receipt of chemotherapy as favorable predictors, and presence of ring-enhancement on MRI as unfavorable predictor. Model performance was evaluated by analyzing the discrimination and calibration abilities. External validation was performed using an unselected cohort from the International DIPG Registry, including patients from United States, Canada, Australia and New Zealand. Basic comparison with the results of the original study was performed using descriptive statistics, and univariate- and multivariable regression analyses in the validation cohort. External validation was assessed following a variety of analyses described previously. Baseline patient characteristics and results from the regression analyses were largely comparable. Kaplan-Meier curves of the validation cohort reproduced separated groups of standard (n = 39), intermediate (n = 125), and high-risk (n = 78) patients. This discriminative ability was confirmed by similar values for the hazard ratios across these risk groups. The calibration curve in the validation cohort showed a symmetric underestimation of the predicted survival probabilities. In this external validation study, we demonstrate that the DIPG survival prediction model has acceptable cross-cohort calibration and is able to discriminate patients with short, average, and increased survival. We discuss how this clinico-radiological model may serve a useful role in current clinical practice.
Küçükdeveci, Ayse A; Sahin, Hülya; Ataman, Sebnem; Griffiths, Bridget; Tennant, Alan
2004-02-15
Guidelines have been established for cross-cultural adaptation of outcome measures. However, invariance across cultures must also be demonstrated through analysis of Differential Item Functioning (DIF). This is tested in the context of a Turkish adaptation of the Health Assessment Questionnaire (HAQ). Internal construct validity of the adapted HAQ is assessed by Rasch analysis; reliability, by internal consistency and the intraclass correlation coefficient; external construct validity, by association with impairments and American College of Rheumatology functional stages. Cross-cultural validity is tested through DIF by comparison with data from the UK version of the HAQ. The adapted version of the HAQ demonstrated good internal construct validity through fit of the data to the Rasch model (mean item fit 0.205; SD 0.998). Reliability was excellent (alpha = 0.97) and external construct validity was confirmed by expected associations. DIF for culture was found in only 1 item. Cross-cultural validity was found to be sufficient for use in international studies between the UK and Turkey. Future adaptation of instruments should include analysis of DIF at the field testing stage in the adaptation process.
Gourmelon, Anne; Delrue, Nathalie
Ten years elapsed since the OECD published the Guidance document on the validation and international regulatory acceptance of test methods for hazard assessment. Much experience has been gained since then in validation centres, in countries and at the OECD on a variety of test methods that were subjected to validation studies. This chapter reviews validation principles and highlights common features that appear to be important for further regulatory acceptance across studies. Existing OECD-agreed validation principles will most likely generally remain relevant and applicable to address challenges associated with the validation of future test methods. Some adaptations may be needed to take into account the level of technique introduced in test systems, but demonstration of relevance and reliability will continue to play a central role as pre-requisite for the regulatory acceptance. Demonstration of relevance will become more challenging for test methods that form part of a set of predictive tools and methods, and that do not stand alone. OECD is keen on ensuring that while these concepts evolve, countries can continue to rely on valid methods and harmonised approaches for an efficient testing and assessment of chemicals.
Screening for colon cancer: A test for occult blood.
Khakimov, N; Khasanova, G; Ershova, K; Gibadullina, L; Vetkina, T; Lobisheva, G; Chumakova, A
2015-01-01
The relevance of the problem of colorectal cancer (CRC) is evident because of extremely high morbidity and mortality rates, associated with this disease. CRC is mostly diagnosed only at very advanced stages. The reduction of mortality can be achieved by the popularization of screening-methods for early identification of CRC and adenomatous polyps of the colon, which are proved to be precancerous condition. Fecal occult blood test is a well-known method of screening for CRC. The advantages of this method when compared, for example, with colonoscopy are its simplicity and cost-effectiveness.Two techniques are usually used for detection of occult blood in the stool: Hemoccult (Guaiac) test and immunochemical test for hemoglobin. There is no consensus among researchers regarding the validity of these tests for the diagnosis of colorectal cancer. For example, J.S. Mandel (1996) notes 60% sensitivity of Guaiac-test for the detection of the early forms of colorectal cancer, while O.I. Kit (2014) suggets that it is not higher than 30%. There are also various opinions about specificity of these two tests. To review the literature on the validity of the fecal occult blood tests for the diagnosis of CRC. We looked for articles (electronic versions) available for free in the full-text versions, published from June 1, 1990 to December 31, 2014 in Russian or English. The following databases were used for search: E-LIBRARY; Cochrane; MEDLINE; EMBASE; Google search. Only original research papers were analyzed. Literature reviews or systematic reviews were not taken for analyses. 1) use of Guaiac and/or immunochemical fecal occult blood test as screening-tests for the detection of colorectal cancer and/or colon polyps (1 cm or more in diameter) in people older than 45 years; 2) comparing of results with the results of colonoscopy (colonoscopy is counted by majority of the authors as a "gold standard" for the diagnosis of CRC and adenomatous polyps). Initial keyword search returned 803 000 results, of which 449 sources were selected. After reading the abstracts, 29 articles that met inclusion criteria were kept. 10 other articles were excluded after that because they did not contain enough data for extraction or did not contain a control group. At the final step 19 articles were used for meta-analysis.Forest plot and Rock curve, which were developed with inclusion of the data from all studies, showed heterogeneity of the data. Additional analyzes were performed in subgroups with different diagnoses and various tests.The sensitivity of the Guaiac test for the diagnosis of colorectal cancer varied from 0.13 to 1.00, and specificity - from 0.69 to 0.99. The sensitivity of the immunochemical test for the diagnosis of CRC ranged from 0.42 to 0.94 with specificity ranging from 0.40 to 1.00.The sensitivity of the Guaiac test for the diagnosis of the colon polyps was between 0.05 and 0.69, and its specificity - from 0.67 to 0.98. The sensitivity of the immunochemical test for the diagnosis of polyps was from 0.24 to 0.75, and its specificity - from 0.40 to 0.97.Bivariate analysis of the validity of Guaiac test and immunochemical method for the diagnosis of colorectal cancer showed better results for the immunochemical test compared to Guaiac test. The tests showed very similar results when used for the diagnosis of polyposis. Bivariate analysis, comparing the validity of tests for the diagnosis of colorectal cancer versus polyposis demonstrated better results for CRC.Multivariate analysis of the validity of the Guaiac and immunochemical tests for the diagnosis of colorectal cancer and polyps also showed better results for detection of colorectal cancer compared with the polyps for both tests. At the same time the highest validity for the diagnosis of CRC was demonstrated for immunochemical analysis. 1. The sensitivity of the Guaiac test for occult blood in stool is lower than its specificity.2. Broad dispersion of the validity characteristics of the fecal occult blood tests was observed.3. The validity of tests for occult blood was higher when they were used for detection of colorectal cancer than of colon polyposis.4. The highest validity rate has been demonstrated for the immunochemical test when it was used for colon cancer screening.
Murphy, Thomas; Schwedock, Julie; Nguyen, Kham; Mills, Anna; Jones, David
2015-01-01
New recommendations for the validation of rapid microbiological methods have been included in the revised Technical Report 33 release from the PDA. The changes include a more comprehensive review of the statistical methods to be used to analyze data obtained during validation. This case study applies those statistical methods to accuracy, precision, ruggedness, and equivalence data obtained using a rapid microbiological methods system being evaluated for water bioburden testing. Results presented demonstrate that the statistical methods described in the PDA Technical Report 33 chapter can all be successfully applied to the rapid microbiological method data sets and gave the same interpretation for equivalence to the standard method. The rapid microbiological method was in general able to pass the requirements of PDA Technical Report 33, though the study shows that there can be occasional outlying results and that caution should be used when applying statistical methods to low average colony-forming unit values. Prior to use in a quality-controlled environment, any new method or technology has to be shown to work as designed by the manufacturer for the purpose required. For new rapid microbiological methods that detect and enumerate contaminating microorganisms, additional recommendations have been provided in the revised PDA Technical Report No. 33. The changes include a more comprehensive review of the statistical methods to be used to analyze data obtained during validation. This paper applies those statistical methods to analyze accuracy, precision, ruggedness, and equivalence data obtained using a rapid microbiological method system being validated for water bioburden testing. The case study demonstrates that the statistical methods described in the PDA Technical Report No. 33 chapter can be successfully applied to rapid microbiological method data sets and give the same comparability results for similarity or difference as the standard method. © PDA, Inc. 2015.