Sample records for validation test plan

  1. Alphabus Mechanical Validation Plan and Test Campaign

    NASA Astrophysics Data System (ADS)

    Calvisi, G.; Bonnet, D.; Belliol, P.; Lodereau, P.; Redoundo, R.

    2012-07-01

    A joint team of the two leading European satellite companies (Astrium and Thales Alenia Space) worked with the support of ESA and CNES to define a product line able to efficiently address the upper segment of communications satellites: Alphabus. From 2009 to 2011, the mechanical validation of the Alphabus platform was obtained through static tests performed on a dedicated static model and environmental tests performed on the first satellite based on Alphabus, Alphasat I-XL. The mechanical validation of the Alphabus platform presented an excellent opportunity to improve the validation and qualification process with respect to static, sine vibration, acoustic and L/V shock environments, minimizing the recurrent cost of manufacturing, integration and testing. A main driver of mechanical testing is that mechanical acceptance testing at satellite level will be performed with empty tanks, due to technical constraints (limitation of existing vibration devices) and programmatic advantages (test risk reduction, test schedule minimization). In this paper the impacts that such testing logic has on the validation plan are briefly recalled and its actual application to the Alphasat PFM mechanical test campaign is detailed.

  2. WEC-SIM Validation Testing Plan FY14 Q4.

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Ruehl, Kelley Michelle

    2016-02-01

    The WEC-Sim project is currently on track, having met both the SNL and NREL FY14 Milestones, as shown in Table 1 and Table 2. This is also reflected in the Gantt chart uploaded to the WEC-Sim SharePoint site in the FY14 Q4 Deliverables folder. The work completed in FY14 includes code verification through code-to-code comparison (FY14 Q1 and Q2), preliminary code validation through comparison to experimental data (FY14 Q2 and Q3), presentation and publication of the WEC-Sim project at OMAE 2014 [1], [2], [3] and GMREC/METS 2014 [4] (FY14 Q3), WEC-Sim code development and public open-source release (FY14 Q3), and development of a preliminary WEC-Sim validation test plan (FY14 Q4). This report presents the preliminary Validation Testing Plan developed in FY14 Q4. The validation test effort started in FY14 Q4 and will continue through FY15. Thus far the team has developed a device selection method, selected a device, placed a contract with the testing facility, established several collaborations including industry contacts, and has working ideas on testing details such as scaling, device design, and test conditions.

  3. The Unified Language Testing Plan: Speaking Proficiency Test. Russian Pilot Validation Studies. Report Number 2.

    ERIC Educational Resources Information Center

    Thornton, Julie A.

    The report describes one segment of the Federal Language Testing Board's Unified Language Testing Plan (ULTP), the validation of the speaking proficiency test in Russian. The ULTP is a project to increase standardization of foreign language proficiency measurement and promote sharing of resources among testing programs in the federal government.…

  4. The Unified Language Testing Plan: Speaking Proficiency Test. Spanish and English Pilot Validation Studies. Report Number 1.

    ERIC Educational Resources Information Center

    Thornton, Julie A.

    This report describes one segment of the Federal Language Testing Board's Unified Language Testing Plan (ULTP), the validation of speaking proficiency tests in Spanish and English. The ULTP is a project to increase standardization of foreign language proficiency measurement and promote sharing of resources among testing programs in the federal…

  5. Specific test and evaluation plan

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Hays, W.H.

    1998-03-20

    The purpose of this Specific Test and Evaluation Plan (STEP) is to provide a detailed written plan for the systematic testing of modifications made to the 241-AX-B Valve Pit by the W-314 Project. The STEP develops the outline for test procedures that verify the system's performance to the established Project design criteria. The STEP is a lower tier document based on the W-314 Test and Evaluation Plan (TEP). Testing includes Validations and Verifications (e.g., Commercial Grade Item Dedication activities), Factory Acceptance Tests (FATs), installation tests and inspections, Construction Acceptance Tests (CATs), Acceptance Test Procedures (ATPs), Pre-Operational Test Procedures (POTPs), and Operational Test Procedures (OTPs). It should be noted that POTPs are not required for testing of the transfer line addition. The STEP will be utilized in conjunction with the TEP for verification and validation.

  6. Adaptation and validation of the Tower of London test of planning and problem solving in people with intellectual disabilities.

    PubMed

    Masson, J D; Dagnan, D; Evans, J

    2010-05-01

    There is a need for validated, standardised tools for the assessment of executive functions in adults with intellectual disabilities (ID). This study examines the validity of a test of planning and problem solving (Tower of London) with adults with ID. Participants completed an adapted version of the Tower of London (ToL) while day-centre staff completed adaptive function (Adaptive Behaviour Scale - Residential and Community: Second Edition, modified version) and dysexecutive function (DEX-Independent Rater) questionnaires for each participant. Correlation analyses of test and questionnaire variables were undertaken. The adapted ToL has a robust structure and shows significant associations with independent living skills, challenging behaviour and behaviours related to dysexecutive function. The adapted ToL is a valid test for use with people with ID. However, there is also a need to develop other ecologically valid tools based on everyday planning tasks undertaken by people with ID.

  7. Adaptation and Validation of the Tower of London Test of Planning and Problem Solving in People with Intellectual Disabilities

    ERIC Educational Resources Information Center

    Masson, J. D.; Dagnan, D.; Evans, J.

    2010-01-01

    Background: There is a need for validated, standardised tools for the assessment of executive functions in adults with intellectual disabilities (ID). This study examines the validity of a test of planning and problem solving (Tower of London) with adults with ID. Method: Participants completed an adapted version of the Tower of London (ToL) while…

  8. Supersonic Retropropulsion CFD Validation with Ames Unitary Plan Wind Tunnel Test Data

    NASA Technical Reports Server (NTRS)

    Schauerhamer, Daniel G.; Zarchi, Kerry A.; Kleb, William L.; Edquist, Karl T.

    2013-01-01

    A validation study of Computational Fluid Dynamics (CFD) for Supersonic Retropropulsion (SRP) was conducted using three Navier-Stokes flow solvers (DPLR, FUN3D, and OVERFLOW). The study compared results from the CFD codes to each other and also to wind tunnel test data obtained in the NASA Ames Research Center 9- by 7-foot Unitary Plan Wind Tunnel. Comparisons include surface pressure coefficient as well as unsteady plume effects, and cover a range of Mach numbers, levels of thrust, and angles of orientation. The comparisons show promising capability of CFD to simulate SRP, and best agreement with the tunnel data exists for the steadier cases of the 1-nozzle and high thrust 3-nozzle configurations.

  9. Implementation of the validation testing in MPPG 5.a "Commissioning and QA of treatment planning dose calculations-megavoltage photon and electron beams".

    PubMed

    Jacqmin, Dustin J; Bredfeldt, Jeremy S; Frigo, Sean P; Smilowitz, Jennifer B

    2017-01-01

    The AAPM Medical Physics Practice Guideline (MPPG) 5.a provides concise guidance on the commissioning and QA of beam modeling and dose calculation in radiotherapy treatment planning systems. This work discusses the implementation of the validation testing recommended in MPPG 5.a at two institutions. The two institutions worked collaboratively to create a common set of treatment fields and analysis tools to deliver and analyze the validation tests. This included the development of a novel, open-source software tool to compare scanning water tank measurements to 3D DICOM-RT Dose distributions. Dose calculation algorithms in both Pinnacle and Eclipse were tested with MPPG 5.a to validate the modeling of Varian TrueBeam linear accelerators. The validation process resulted in more than 200 water tank scans and more than 50 point measurements per institution, each of which was compared to a dose calculation from the institution's treatment planning system (TPS). Overall, the validation testing recommended in MPPG 5.a took approximately 79 person-hours for a machine with four photon and five electron energies for a single TPS. Of the 79 person-hours, 26 person-hours required time on the machine, and the remainder involved preparation and analysis. The basic photon, electron, and heterogeneity correction tests were evaluated with the tolerances in MPPG 5.a, and the tolerances were met for all tests. The MPPG 5.a evaluation criteria were used to assess the small field and IMRT/VMAT validation tests. Both institutions found the use of MPPG 5.a to be a valuable resource during the commissioning process. The validation testing in MPPG 5.a showed the strengths and limitations of the TPS models. In addition, the data collected during the validation testing is useful for routine QA of the TPS, validation of software upgrades, and commissioning of new algorithms. © 2016 The Authors. Journal of Applied Clinical Medical Physics published by Wiley Periodicals, Inc. on behalf of

  10. Some guidance on preparing validation plans for the DART Full System Models.

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Gray, Genetha Anne; Hough, Patricia Diane; Hills, Richard Guy

    2009-03-01

    Planning is an important part of computational model verification and validation (V&V) and the requisite planning document is vital for effectively executing the plan. The document provides a means of communicating intent to the typically large group of people, from program management to analysts to test engineers, who must work together to complete the validation activities. This report provides guidelines for writing a validation plan. It describes the components of such a plan and includes important references and resources. While the initial target audience is the DART Full System Model teams in the nuclear weapons program, the guidelines are generally applicable to other modeling efforts. Our goal in writing this document is to provide a framework for consistency in validation plans across weapon systems, different types of models, and different scenarios. Specific details contained in any given validation plan will vary according to application requirements and available resources.

  11. MISR - Science Data Validation Plan

    NASA Technical Reports Server (NTRS)

    Conel, J.; Ledeboer, W.; Ackerman, T.; Marchand, R.; Clothiaux, E.

    2000-01-01

    This Science Data Validation Plan describes the plans for validating a subset of the Multi-angle Imaging SpectroRadiometer (MISR) Level 2 algorithms and data products and supplying top-of-atmosphere (TOA) radiances to the In-flight Radiometric Calibration and Characterization (IFRCC) subsystem for vicarious calibration.

  12. Veggie and the VEG-01 Hardware Validation Test

    NASA Technical Reports Server (NTRS)

    Massa, Gioia; Wheeler, Ray; Smith, Trent

    2015-01-01

    This presentation gives a brief overview of KSC plant science hardware for space and then details the Veggie hardware and the VEG-01 hardware validation test. The test results and future plans are discussed.

  13. Validation of Mission Plans Through Simulation

    NASA Astrophysics Data System (ADS)

    St-Pierre, J.; Melanson, P.; Brunet, C.; Crabtree, D.

    2002-01-01

    The purpose of a spacecraft mission planning system is to automatically generate safe and optimized mission plans for a single spacecraft, or for several functioning in unison. The system verifies user input syntax, conformance to commanding constraints, absence of duty cycle violations, timing conflicts, state conflicts, etc. Present day constraint-based systems with state-based predictive models use verification rules derived from expert knowledge. A familiar solution found in Mission Operations Centers is to complement the planning system with a high fidelity spacecraft simulator. Often a dedicated workstation, the simulator is frequently used for operator training and procedure validation, and may be interfaced to actual control stations with command and telemetry links. While there are distinct advantages to having a planning system offer realistic operator training using the actual flight control console, physical verification of data transfer across layers and procedure validation, experience has revealed some drawbacks and inefficiencies in ground segment operations. With these considerations, two simulation-based mission plan validation projects are under way at the Canadian Space Agency (CSA): RVMP and ViSION. The tools proposed in these projects will automatically run scenarios and provide execution reports to operations planning personnel, prior to actual command upload. This can provide an important safeguard against system or human errors that can only be detected with high fidelity, interdependent spacecraft models running concurrently. The core element common to these projects is a spacecraft simulator, built with off-the-shelf components such as CAE's Real-Time Object-Based Simulation Environment (ROSE) technology, MathWorks' MATLAB/Simulink, and Analytical Graphics' Satellite Tool Kit (STK). To complement these tools, additional components were developed, such as an emulated Spacecraft Test and Operations Language (STOL) interpreter and CCSDS TM

  14. Pretest information for a test to validate plume simulation procedures (FA-17)

    NASA Technical Reports Server (NTRS)

    Hair, L. M.

    1978-01-01

    The results of an effort to plan a final verification wind tunnel test to validate the recommended correlation parameters and application techniques were presented. The test planning effort was complete except for test site finalization and the associated coordination. Two suitable test sites were identified. Desired test conditions were shown. Subsequent sections of this report present the selected model and test site, instrumentation of this model, planned test operations, and some concluding remarks.

  15. Valid methods: the quality assurance of test method development, validation, approval, and transfer for veterinary testing laboratories.

    PubMed

    Wiegers, Ann L

    2003-07-01

    Third-party accreditation is a valuable tool to demonstrate a laboratory's competence to conduct testing. Accreditation, internationally and in the United States, has been discussed previously. However, accreditation is only one part of establishing data credibility. A validated test method is the first component of a valid measurement system. Validation is defined as confirmation by examination and the provision of objective evidence that the particular requirements for a specific intended use are fulfilled. The international and national standard ISO/IEC 17025 recognizes the importance of validated methods and requires that laboratory-developed methods or methods adopted by the laboratory be appropriate for the intended use. Validated methods are therefore required and their use agreed to by the client (i.e., end users of the test results such as veterinarians, animal health programs, and owners). ISO/IEC 17025 also requires that the introduction of methods developed by the laboratory for its own use be a planned activity conducted by qualified personnel with adequate resources. This article discusses considerations and recommendations for the conduct of veterinary diagnostic test method development, validation, evaluation, approval, and transfer to the user laboratory in the ISO/IEC 17025 environment. These recommendations are based on those of nationally and internationally accepted standards and guidelines, as well as those of reputable and experienced technical bodies. They are also based on the author's experience in the evaluation of method development and transfer projects, validation data, and the implementation of quality management systems in the area of method development.

  16. RELAP-7 Software Verification and Validation Plan

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Smith, Curtis L.; Choi, Yong-Joon; Zou, Ling

    This INL plan comprehensively describes the software for RELAP-7 and documents the software, interface, and software design requirements for the application. The plan also describes the testing-based software verification and validation (SV&V) process: a set of specially designed software models used to test RELAP-7. The RELAP-7 (Reactor Excursion and Leak Analysis Program) code is a nuclear reactor system safety analysis code being developed at Idaho National Laboratory (INL). The code is based on INL's modern scientific software development framework, MOOSE (Multi-Physics Object-Oriented Simulation Environment). The overall design goal of RELAP-7 is to take advantage of the previous thirty years of advancements in computer architecture, software design, numerical integration methods, and physical models. The end result will be a reactor systems analysis capability that retains and improves upon RELAP5's capability and extends the analysis capability for all reactor system simulation scenarios.

  17. Reliability and Factorial Validity of Non-Specific and Tennis-Specific Pre-Planned Agility Tests; Preliminary Analysis

    PubMed Central

    Sekulic, Damir; Uljevic, Ognjen; Peric, Mia; Spasic, Miodrag; Kondric, Miran

    2017-01-01

    Agility is an important quality in tennis, yet there is an evident lack of studies focussing on the applicability of tennis-specific agility performances and comparing them to equivalent non-specific agility performances. The aim of this study was to evaluate the reliability and factorial validity of three tests of pre-planned agility, performed in specific (with a tennis racquet) and non-specific (without a tennis racquet) conditions. The sample consisted of 33 tennis players (13 males and 20 females; age: 18.3 ± 1.1 years and 18.6 ± 1.3 years; body height: 185.4 ± 51 cm and 169.3 ± 4.2 cm; body mass: 74.0 ± 4.4 kg and 61.2 ± 3.1 kg, respectively). The variables comprised three agility tests: a 20-yard test, a T-test and the Illinois test, all performed in both specific and non-specific conditions. Between-subject and within-subject reliability were found to be high (Cronbach Alpha: 0.93 to 0.98; Coefficient of Variation: 3 to 8%), with better within-subject reliability and stability of the measurement for specific tests. Pearson's product moment correlations between the non-specific and specific agility performances were high (r ≥ 0.84), while factor analysis extracted only one significant latent dimension on the basis of the Guttman-Kaiser criterion. The results of the 20-yard test were better when the test was conducted in the specific conditions (t-test = 2.66; p < 0.05). For the Illinois test, superior results were recorded in the non-specific conditions (t-test = 2.96; p < 0.05), which can be explained by the test duration (about 20 s) and non-specific locomotion forms such as rotational movements. Considering the findings of the present study, when testing tennis-specific pre-planned agility, we suggest using tests of short duration (less than 10 s) and sport-specific types of locomotion. PMID:28210343
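
    The two reliability statistics named in this abstract can be illustrated with a minimal sketch. It assumes the standard textbook formulas for Cronbach's alpha and the coefficient of variation; the function names and the layout of the input (one row per participant, one column per repeated trial) are hypothetical and are not taken from the study.

```python
from statistics import mean, stdev, variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a table of scores.

    scores: list of rows, one row per participant, one column per trial/item
    (hypothetical layout, not the study's data format).
    """
    k = len(scores[0])
    if k < 2:
        raise ValueError("need at least two trials/items")
    # Sample variance of each column (trial) across participants.
    item_vars = [variance([row[j] for row in scores]) for j in range(k)]
    # Sample variance of the per-participant totals.
    total_var = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


def coefficient_of_variation(values):
    """Within-subject coefficient of variation, in percent."""
    return stdev(values) / mean(values) * 100
```

    Called on a table of repeated trial times, cronbach_alpha returns a value approaching 1 when participants keep the same rank order across trials, while coefficient_of_variation applied to one participant's trials expresses the trial-to-trial spread as a percentage of that participant's mean.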

  18. On Validity Theory and Test Validation

    ERIC Educational Resources Information Center

    Sireci, Stephen G.

    2007-01-01

    Lissitz and Samuelsen (2007) propose a new framework for conceptualizing test validity that separates analysis of test properties from analysis of the construct measured. In response, the author of this article reviews fundamental characteristics of test validity, drawing largely from seminal writings as well as from the accepted standards. He…

  19. The development and validation of the advance care planning questionnaire in Malaysia.

    PubMed

    Lai, Pauline Siew Mei; Mohd Mudri, Salinah; Chinna, Karuthan; Othman, Sajaratulnisah

    2016-10-18

    Advance care planning is a voluntary process whereby individual preferences, values and beliefs are used to aid a person in planning for end-of-life care. Currently, there is no local instrument to assess an individual's awareness and attitude towards advance care planning. This study aimed to develop an Advance Care Planning Questionnaire and to determine its validity and reliability among older people in Malaysia. The Advance Care Planning Questionnaire was developed based on literature review. Face and content validity was verified by an expert panel, and piloted among 15 participants. Our study was conducted from October 2013 to February 2014, at an urban primary care clinic in Malaysia. Included were those aged >50 years, who could understand English. A retest was conducted 2 weeks after the first administration. Participants from the pilot study did not encounter any problems in answering the Advance Care Planning Questionnaire. Hence, no further modifications were made. Flesch reading ease was 71. The final version of the Advance Care Planning Questionnaire consists of 66 items: 30 items were measured on a nominal scale, whilst 36 items were measured on a Likert-like scale, of which we were only able to validate 22 items, as the remaining 14 items were descriptive in nature. A total of 245 eligible participants were approached, of which 230 agreed to participate (response rate = 93.9%). Factor analysis on the 22 items measured on a Likert scale revealed four domains: "feelings regarding advance care planning", "justifications for advance care planning", "justifications for not having advance care planning: fate and religion", and "justifications for not having advance care planning: avoid thinking about death". The Cronbach's alpha values for items in each domain ranged from 0.637 to 0.915. In test-retest, kappa values ranged from 0.738 to 0.947. The final Advance Care Planning Questionnaire consisted of 63 items and 4 domains. It was found to be a valid and

  20. Field Test of Route Planning Software for Lunar Polar Missions

    NASA Astrophysics Data System (ADS)

    Horchler, A. D.; Cunningham, C.; Jones, H. L.; Arnett, D.; Fang, E.; Amoroso, E.; Otten, N.; Kitchell, F.; Holst, I.; Rock, G.; Whittaker, W.

    2017-10-01

    A novel field test paradigm has been developed to demonstrate and validate route planning software in the stark low-angled light and sweeping shadows a rover would experience at the poles of the Moon. Software, ConOps, and test results are presented.

  1. Test Planning Approach and Lessons

    NASA Technical Reports Server (NTRS)

    Parkinson, Douglas A.; Brown, Kendall K.

    2004-01-01

    As NASA began technology risk reduction activities and planning for the next generation launch vehicle under the Space Launch Initiative (SLI), now the Next Generation Launch Technology (NGLT) Program, a review of past large liquid rocket engine development programs was performed. The intent of the review was to identify any significant lessons from the development testing programs that could be applied to current and future engine development programs. Because the primary prototype engine in design at the time of this study was the Boeing-Rocketdyne RS-84, the study was slightly biased towards LOX/RP-1 liquid propellant engines. However, the significant lessons identified are universal. It is anticipated that these lessons will serve as a reference for test planning in the Engine Systems Group at Marshall Space Flight Center (MSFC). Towards the end of F-1 and J-2 engine development testing, NASA/MSFC asked Rocketdyne to review those test programs. The result was a document titled Study to Accelerate Development by Test of a Rocket Engine (R-8099). The "intent (of this study) is to apply this thinking and learning to more efficiently develop rocket engines to high reliability with improved cost effectiveness." Additionally, several other engine programs, such as SSME, NSTS, STME, MC-1, and RS-83, were reviewed to support or refute R-8099. R-8099 revealed two primary lessons for test planning, which were supported by the other engine development programs. First, engine development programs can benefit from arranging the test program for engine system testing as early as feasible. The best test for determining environments is at the system level, the closest to the operational flight environment. Second, component testing, which tends to be elaborate, should instead be geared towards reducing risk to enable system test. Technical risk can be reduced at the component level, but the design can only be truly verified and validated after engine system testing.

  2. Project W-314 specific test and evaluation plan for AZ tank farm upgrades

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Hays, W.H.

    1998-08-12

    The purpose of this Specific Test and Evaluation Plan (STEP) is to provide a detailed written plan for the systematic testing of modifications made by the addition of the SN-631 transfer line from the AZ-01A pit to the AZ-02A pit by the W-314 Project. The STEP develops the outline for test procedures that verify the system's performance to the established Project design criteria. The STEP is a lower tier document based on the W-314 Test and Evaluation Plan (TEP). Testing includes Validations and Verifications (e.g., Commercial Grade Item Dedication activities), Factory Tests and Inspections (FTIs), installation tests and inspections, Construction Tests and Inspections (CTIs), Acceptance Test Procedures (ATPs), Pre-Operational Test Procedures (POTPs), and Operational Test Procedures (OTPs). The STEP will be utilized in conjunction with the TEP for verification and validation.

  3. RELAP-7 Software Verification and Validation Plan: Requirements Traceability Matrix (RTM) Part 1 – Physics and numerical methods

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Choi, Yong Joon; Yoo, Jun Soo; Smith, Curtis Lee

    2015-09-01

    This INL plan comprehensively describes the Requirements Traceability Matrix (RTM) for the main physics and numerical methods of RELAP-7. The plan also describes the testing-based software verification and validation (SV&V) process: a set of specially designed software models used to test RELAP-7.

  4. Effort, symptom validity testing, performance validity testing and traumatic brain injury.

    PubMed

    Bigler, Erin D

    2014-01-01

    To understand the neurocognitive effects of brain injury, valid neuropsychological test findings are paramount. This review examines the research on what has been referred to as symptom validity testing (SVT). Performance above a designated cut-score signifies a 'passing' SVT performance, which is likely the best indicator of valid neuropsychological test findings. Likewise, performance substantially below the cut-point that nears chance or is at chance signifies invalid test performance. Performance significantly below chance is the sine qua non neuropsychological indicator for malingering. However, the interpretative problems with SVT performance below the cut-point yet far above chance are substantial, as pointed out in this review. This intermediate, border-zone performance on SVT measures is where substantial interpretative challenges exist. Case studies are used to highlight the many areas where additional research is needed. Historical perspectives are reviewed along with the neurobiology of effort. Reasons why performance validity testing (PVT) may be a better term than SVT are reviewed. Advances in neuroimaging techniques may be key in better understanding the meaning of border-zone SVT failure. The review demonstrates the problems with rigidity in interpretation with established cut-scores. A better understanding of how certain types of neurological, neuropsychiatric and/or even test conditions may affect SVT performance is needed.
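
    The distinction this abstract draws between "below the cut-score" and "significantly below chance" can be made concrete with a small sketch. It assumes a two-alternative forced-choice format (chance = 0.5) and a 0.05 significance threshold; neither is stated in the abstract, and the function names and labels are hypothetical.

```python
from math import comb


def prob_at_or_below(correct, trials, p_chance=0.5):
    """Binomial probability of scoring `correct` or fewer by pure guessing.

    p_chance=0.5 assumes a two-alternative forced-choice test (an assumption).
    """
    return sum(
        comb(trials, i) * p_chance**i * (1 - p_chance) ** (trials - i)
        for i in range(correct + 1)
    )


def classify(correct, trials, cut_score, alpha=0.05):
    """Label a score using a hypothetical cut-score and significance level."""
    if correct > cut_score:
        return "pass"
    if prob_at_or_below(correct, trials) < alpha:
        return "significantly below chance"
    # The border zone: failed the cut-score but guessing alone cannot explain it.
    return "below cut-score, not below chance"
```

    Under these assumptions, a score that fails the cut-score but sits well above the guessing level falls into the third category, which is the border zone the review identifies as hardest to interpret.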

  5. Biodiesel Test Plan

    DTIC Science & Technology

    2014-07-01

    Biodiesel Test Plan. Report No. CG-D-07-14, July 2014. G. W. Johnson, et al. (CG-926 R&DC). Distribution Statement A: Approved for public release; distribution is unlimited.

  6. Strategies for Validation Testing of Ground Systems

    NASA Technical Reports Server (NTRS)

    Annis, Tammy; Sowards, Stephanie

    2009-01-01

    In order to accomplish the full Vision for Space Exploration announced by former President George W. Bush in 2004, NASA will have to develop a new space transportation system and supporting infrastructure. The main portion of this supporting infrastructure will reside at the Kennedy Space Center (KSC) in Florida and will either be newly developed or a modification of existing vehicle processing and launch facilities, including Ground Support Equipment (GSE). This type of large-scale launch site development is unprecedented since the time of the Apollo Program. In order to accomplish this successfully within the limited budget and schedule constraints, a combination of traditional and innovative strategies for Verification and Validation (V&V) has been developed. The core of these strategies consists of a building-block approach to V&V, starting with component V&V and ending with a comprehensive end-to-end validation test of the complete launch site, called a Ground Element Integration Test (GEIT). This paper will outline these strategies and provide the high level planning for meeting the challenges of implementing V&V on a large-scale development program. KEY WORDS: Systems, Elements, Subsystem, Integration Test, Ground Systems, Ground Support Equipment, Component, End Item, Test and Verification Requirements (TVR), Verification Requirements (VR)

  7. Test Plan - Solids Accumulation Scouting Studies

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Duignan, M. R.; Steeper, T. J.; Steimke, J. L.

    This plan documents the highlights of the Solids Accumulation Scouting Studies test, a project from Washington River Protection Solutions (WRPS) that began on February 1, 2012. During the last 12 weeks, considerable progress has been made to design and plan methods that will be used to estimate the concentration and distribution of heavy fissile solids in accumulated solids in the Hanford double-shell tank (DST) 241-AW-105 (AW-105), which is the primary goal of this task. This DST will be one of the several waste feed delivery staging tanks designated to feed the Pretreatment Facility (PTF) of the Waste Treatment and Immobilization Plant (WTP). Note that over the length of the waste feed delivery mission AW-105 is currently identified as having the most fill-empty cycles of any DST feed tank, which is the reason for modeling this particular tank. At SRNL an existing test facility, the Mixing Demonstration Tank, which will be modified for the present work, will use stainless steel particles in a simulant that represents Hanford waste to perform mock staging tank transfers that will allow solids to accumulate in the tank heel. The concentration and location of the mock fissile particles will be measured in these scoping studies to produce information that will be used to better plan larger scaled tests. Included in these studies is a secondary goal of developing measurement methods to accomplish the primary goal. These methods will be evaluated for use in the larger scale experiments. Included in this plan are the several pretest activities that will validate the measurement techniques that are currently in various phases of construction. Aspects of each technique, e.g., particle separations, volume determinations, topographical mapping, and core sampling, have been tested in bench-top trials, as discussed herein, but the actual equipment to be employed during the full test will need evaluation after fabrication and integration into the test facility.

  8. Top-Mounted Propulsion Test Plans (TMP17)

    NASA Technical Reports Server (NTRS)

    Bridges, James; Henderson, Brenda; Huff, Dennis

    2017-01-01

    NASA recently completed a study of propulsion cycles and nozzle types applicable to a 70-passenger, M1.6 supersonic airliner, paying especial attention to the noise produced during landing and take-off. The results of the study were validated in a model-scale test at NASA Glenn last summer. The findings of that study and test, along with other studies, have resulted in a new strategy for achieving the Commercial Supersonic Technology's goals for noise and performance. Key to that strategy is moving the propulsion to the top side of the vehicle and modifying the nozzle and inlet to maximally shield the propulsion noise while maintaining efficient operation. Installed exhaust configurations have been designed to minimize the exhaust noise using new acoustic design tools. A test planned for the fall of 2017 will validate both the new design tools and the low-noise concept using a new translating phased array. During the test, questions regarding modifications of convected waves in the jet near-field that are key to new understandings of aft jet noise will be addressed. Also, to better tie rig results to real-world measurements, a model-scale version of a nozzle that was flight tested by Glenn Research Center in 2001 will be tested.

  9. CASL Verification and Validation Plan

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Mousseau, Vincent Andrew; Dinh, Nam

    2016-06-30

    This report documents the Consortium for Advanced Simulation of LWRs (CASL) verification and validation plan. The document builds upon input from CASL subject matter experts, most notably the CASL Challenge Problem Product Integrators, CASL Focus Area leaders, and CASL code development and assessment teams. This document will be a living document that will track progress on CASL to do verification and validation for both the CASL codes (including MPACT, CTF, BISON, MAMBA) and for the CASL challenge problems (CIPS, PCI, DNB). The CASL codes and the CASL challenge problems are at differing levels of maturity with respect to validation and verification. The gap analysis will summarize additional work that needs to be done. Additional VVUQ work will be done as resources permit. This report is prepared for the Department of Energy’s (DOE’s) CASL program in support of milestone CASL.P13.02.

  10. Safety validation test equipment operation

    NASA Astrophysics Data System (ADS)

    Kurosaki, Tadaaki; Watanabe, Takashi

    1992-08-01

    An overview of the activities conducted on safety validation test equipment operation for materials used for NASA manned missions is presented. Safety validation tests, such as flammability, odor, offgassing, and so forth, were conducted in accordance with NASA-NHB-8060.1C using test subjects common with those used by NASA, and the equipment used was qualified for its functions and performance in accordance with NASDA-CR-99124 'Safety Validation Test Qualification Procedures.' Test procedure systems were established by preparing 'Common Procedures for Safety Validation Test' as well as test procedures for flammability, offgassing, and odor tests. The test operation organization chaired by the General Manager of the Parts and Material Laboratory of NASDA (National Space Development Agency of Japan) was established, and the test leaders and operators in the organization were qualified in accordance with the specified procedures. One-hundred-one tests had been conducted so far by the Parts and Material Laboratory according to the request submitted by the manufacturers through the Space Station Group and the Safety and Product Assurance for Manned Systems Office.

  11. Development of Level 1b Calibration and Validation Readiness, Implementation and Management Plans for GOES-R

    NASA Technical Reports Server (NTRS)

    Kunkee, David B.; Farley, Robert W.; Kwan, Betty P.; Hecht, James H.; Walterscheid, Richard L.; Claudepierre, Seth G.; Bishop, Rebecca L.; Gelinas, Lynette J.; Deluccia, Frank J.

    2017-01-01

    A complement of Readiness, Implementation and Management Plans (RIMPs) to facilitate management of post-launch product test activities for the official Geostationary Operational Environmental Satellite (GOES-R) Level 1b (L1b) products have been developed and documented. Separate plans have been created for each of the GOES-R sensors including: the Advanced Baseline Imager (ABI), the Extreme ultraviolet and X-ray Irradiance Sensors (EXIS), Geostationary Lightning Mapper (GLM), GOES-R Magnetometer (MAG), the Space Environment In-Situ Suite (SEISS), and the Solar Ultraviolet Imager (SUVI). The GOES-R program has implemented these RIMPs in order to address the full scope of CalVal activities required for a successful demonstration of GOES-R L1b data product quality throughout the three validation stages: Beta, Provisional and Full Validation. For each product maturity level, the RIMPs include specific performance criteria and required artifacts that provide evidence a given validation stage has been reached, the timing when each stage will be complete, a description of every applicable Post-Launch Product Test (PLPT), roles and responsibilities of personnel, upstream dependencies, and analysis methods and tools to be employed during validation. Instrument level Post-Launch Tests (PLTs) are also referenced and apply primarily to functional check-out of the instruments.

  12. ExEP yield modeling tool and validation test results

    NASA Astrophysics Data System (ADS)

    Morgan, Rhonda; Turmon, Michael; Delacroix, Christian; Savransky, Dmitry; Garrett, Daniel; Lowrance, Patrick; Liu, Xiang Cate; Nunez, Paul

    2017-09-01

    EXOSIMS is an open-source simulation tool for parametric modeling of the detection yield and characterization of exoplanets. EXOSIMS has been adopted by the Exoplanet Exploration Program's Standards Definition and Evaluation Team (ExSDET) as a common mechanism for comparison of exoplanet mission concept studies. To ensure trustworthiness of the tool, we developed a validation test plan that leverages the Python-language unit-test framework, utilizes integration tests for selected module interactions, and performs end-to-end cross-validation with other yield tools. This paper presents the test methods and results, with the physics-based tests such as photometry and integration time calculation treated in detail and the functional tests treated summarily. The test case utilized a 4m unobscured telescope with an idealized coronagraph and an exoplanet population from the IPAC radial velocity (RV) exoplanet catalog. The known RV planets were set at quadrature to allow deterministic validation of the calculation of physical parameters, such as working angle, photon counts and integration time. The observing keepout region was tested by generating plots and movies of the targets and the keepout zone over a year. Although the keepout integration test required the interpretation of a user, the test revealed problems in the L2 halo orbit and the parameterization of keepout applied to some solar system bodies, which the development team was able to address. The validation testing of EXOSIMS was performed iteratively with the developers of EXOSIMS and resulted in a more robust, stable, and trustworthy tool that the exoplanet community can use to simulate exoplanet direct-detection missions from probe class, to WFIRST, up to large mission concepts such as HabEx and LUVOIR.
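
    The idea of a deterministic physics check written against the Python unit-test framework can be illustrated with a minimal sketch. This is not EXOSIMS code: the function name working_angle_arcsec and the test class are hypothetical, and the sketch assumes a circular orbit, so that at quadrature the projected star-planet separation equals the semi-major axis. Under that assumption the small-angle relation gives the angular separation in arcseconds as the separation in AU divided by the distance in parsecs.

```python
import unittest


def working_angle_arcsec(semi_major_axis_au, distance_pc):
    """Angular star-planet separation at quadrature, in arcseconds.

    Hypothetical helper for illustration only. Assumes a circular orbit,
    so the projected separation at quadrature equals the semi-major axis.
    By the definition of the parsec, 1 AU seen from 1 pc subtends 1 arcsec.
    """
    if distance_pc <= 0:
        raise ValueError("distance_pc must be positive")
    return semi_major_axis_au / distance_pc


class QuadratureWorkingAngleTest(unittest.TestCase):
    """Deterministic checks: fixed inputs with a known analytic answer."""

    def test_one_au_at_ten_parsecs(self):
        self.assertAlmostEqual(working_angle_arcsec(1.0, 10.0), 0.1)

    def test_rejects_nonpositive_distance(self):
        with self.assertRaises(ValueError):
            working_angle_arcsec(1.0, 0.0)
```

    Saved to a file, the checks can be run with python -m unittest. Fixing the geometry removes randomness from the inputs, which is what lets the expected value be written down analytically.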

  13. On the Validity of Useless Tests

    ERIC Educational Resources Information Center

    Sireci, Stephen G.

    2016-01-01

    A misconception exists that validity may refer only to the "interpretation" of test scores and not to the "uses" of those scores. The development and evolution of validity theory illustrate test score interpretation was a primary focus in the earliest days of modern testing, and that validating interpretations derived from test…

  14. Psychometric Testing of the Self-Efficacy for Interdisciplinary Plans of Care Scale.

    PubMed

    Molle, Elizabeth; Froman, Robin

    2017-01-01

    Computerized interdisciplinary plans of care have revitalized nurse-centric care plans into dynamic and meaningful electronic documents. To maximize the benefits of these documents, it is important to understand healthcare professionals' attitudes, specifically their confidence, for making computerized interdisciplinary care plans useful and meaningful documents. The purpose of the study was to test the psychometric properties of the Self-Efficacy for Interdisciplinary Plans of Care instrument intended to measure healthcare professionals' self-efficacy for using such documents. Content validity was assessed by an expert review panel. Content validity indices ranged from 0.75 to 1.00, with a scale CVI of 0.94. A sample of 389 healthcare providers completed the 14-item instrument. Principal axis factoring was used to assess factor structure. The exploratory factor analysis yielded a single-factor structure accounting for 71.76% of covariance. Cronbach internal consistency coefficient for the single factor solution was .97. The corrected item-total correlations ranged from 0.71 to 0.90. The coefficient of stability, during a 2-week period, with a subset of the sample (n = 38), was estimated at 0.82. The results of this study suggest that the Self-Efficacy for Interdisciplinary Plans of Care has sturdy reliability and validity for measuring the self-efficacy of healthcare providers to make computerized interdisciplinary plans of care meaningful and useful documents.
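
    For readers unfamiliar with the internal consistency coefficient reported above, the following is a minimal sketch of the standard Cronbach's alpha formula, alpha = k/(k-1) * (1 - sum of item variances / variance of total scores), where k is the number of items. The function name and the toy scores are illustrative assumptions; they are not the study's data or analysis code.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondents, each a list of k item scores.

    Uses sample variances (n - 1 denominator) for the items and the totals.
    """
    if len(scores) < 2:
        raise ValueError("need at least two respondents")
    k = len(scores[0])
    if k < 2:
        raise ValueError("need at least two items")
    # Variance of each item across respondents.
    item_vars = [variance([row[i] for row in scores]) for i in range(k)]
    # Variance of each respondent's total score.
    total_var = variance([sum(row) for row in scores])
    if total_var == 0:
        raise ValueError("total scores have zero variance")
    return (k / (k - 1)) * (1 - sum(item_vars) / total_var)


# Toy data (hypothetical): three respondents, two items.
toy_scores = [[1, 2], [2, 1], [3, 3]]
alpha = cronbach_alpha(toy_scores)
```

    Items that rise and fall together across respondents push the total-score variance well above the sum of the item variances, which drives alpha toward 1.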

  15. Development and Initial Validation of an Instrument for Human Capital Planning

    ERIC Educational Resources Information Center

    Zula, Kenneth J.; Chermack, Thomas J.

    2008-01-01

    This article reports on development and validation of an instrument for use in human capital approaches for organizational planning. The article describes use of a team of subject matter experts in developing a measure of human capital planning, and use of exploratory factor analysis techniques to validate the resulting instrument. These data were…

  16. Validity evidence based on test content.

    PubMed

    Sireci, Stephen; Faulkner-Bond, Molly

    2014-01-01

    Validity evidence based on test content is one of the five forms of validity evidence stipulated in the Standards for Educational and Psychological Testing developed by the American Educational Research Association, American Psychological Association, and National Council on Measurement in Education. In this paper, we describe the logic and theory underlying such evidence and describe traditional and modern methods for gathering and analyzing content validity data. A comprehensive review of the literature and of the aforementioned Standards is presented. For educational tests and other assessments targeting knowledge and skill possessed by examinees, validity evidence based on test content is necessary for building a validity argument to support the use of a test for a particular purpose. By following the methods described in this article, practitioners have a wide arsenal of tools available for determining how well the content of an assessment is congruent with and appropriate for the specific testing purposes.

  17. Shuttle payload vibroacoustic test plan evaluation

    NASA Technical Reports Server (NTRS)

    Stahle, C. V.; Gongloff, H. R.; Young, J. P.; Keegan, W. B.

    1977-01-01

    Statistical decision theory is used to evaluate seven alternate vibro-acoustic test plans for Space Shuttle payloads; test plans include component, subassembly and payload testing and combinations of component and assembly testing. The optimum test levels and the expected cost are determined for each test plan. By including all of the direct cost associated with each test plan and the probabilistic costs due to ground test and flight failures, the test plans which minimize project cost are determined. The lowest cost approach eliminates component testing and maintains flight vibration reliability by performing subassembly tests at a relatively high acoustic level.
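
    The cost comparison described above can be sketched in a few lines. This is a simplified illustration, not the report's model: it treats each plan's expected cost as its direct cost plus the probability-weighted cost of ground-test and flight failures, and it does not optimize test levels. The class name, field names, and every number below are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Plan:
    """One candidate test plan; all fields are illustrative assumptions."""
    name: str
    direct_cost: float
    p_ground_failure: float
    ground_failure_cost: float
    p_flight_failure: float
    flight_failure_cost: float


def expected_cost(plan):
    """Direct cost plus probability-weighted failure costs."""
    return (plan.direct_cost
            + plan.p_ground_failure * plan.ground_failure_cost
            + plan.p_flight_failure * plan.flight_failure_cost)


def cheapest(plans):
    """Return the plan with the lowest expected cost."""
    return min(plans, key=expected_cost)


# Hypothetical inputs in arbitrary cost units.
candidates = [
    Plan("component + payload", 10.0, 0.2, 5.0, 0.01, 100.0),
    Plan("subassembly only", 6.0, 0.3, 5.0, 0.02, 100.0),
]
best = cheapest(candidates)
```

    With these made-up inputs the second plan wins because its lower direct cost outweighs its higher failure risk; different inputs can reverse the ranking.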

  18. Water NSTF Design, Instrumentation, and Test Planning

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Lisowski, Darius D.; Gerardi, Craig D.; Hu, Rui

    The following report serves as a formal introduction to the water-based Natural convection Shutdown heat removal Test Facility (NSTF) program at Argonne. Since 2005, this US Department of Energy (DOE) sponsored program has conducted large scale experimental testing to generate high-quality and traceable validation data for guiding design decisions of the Reactor Cavity Cooling System (RCCS) concept for advanced reactor designs. The most recent facility iteration, and focus of this report, is the operation of a 1/2 scale model of a water-RCCS concept. Several features of the NSTF prototype align with the conceptual design that has been publicly released for the AREVA 625 MWt SC-HTGR. The design of the NSTF also retains all aspects common to a fundamental boiling water thermosiphon, and thus is well poised to provide necessary experimental data to advance basic understanding of natural circulation phenomena and contribute to computer code validation. Overall, the NSTF program operates to support the DOE vision of aiding US vendors in design choices of future reactor concepts, advancing the maturity of codes for licensing, and ultimately developing safe and reliable reactor technologies. In this report, the top-level program objectives, testing requirements, and unique considerations for the water-cooled test assembly are discussed, and presented in sufficient depth to support defining the program’s overall scope and purpose. A discussion of the proposed 6-year testing program is then introduced, which outlines the specific strategy and testing plan for facility operations. The proposed testing plan has been developed to meet the top-level objective of conducting high-quality test operations that span across a broad range of single- and two-phase operating conditions. Details of characterization, baseline test cases, accident scenario, and parametric variations are provided, including discussions of later-stage test cases that examine the influence of

  19. Development and validation of a knowledge test for health professionals regarding lifestyle modification.

    PubMed

    Talip, Whadi-ah; Steyn, Nelia P; Visser, Marianne; Charlton, Karen E; Temple, Norman

    2003-09-01

    We wanted to develop and validate a test that assesses the knowledge and practices of health professionals (HPs) with regard to the role of nutrition, physical activity, and smoking cessation (lifestyle modification) in chronic diseases of lifestyle. A descriptive cross-sectional validation study was carried out. The validation design consisted of two phases, namely 1) test planning and development and 2) test evaluation. The study sample consisted of five groups of HPs: dietitians, dietetic interns, general practitioners, medical students, and nurses. The overall response rate was 58%, resulting in a sample size of 186 participants. A test was designed to evaluate the knowledge and practices of HPs. The test was first evaluated by an expert group to ensure content, construct, and face validity. Thereafter, the questionnaire was tested on five groups of HPs to test for criterion validity. Internal consistency was evaluated by Cronbach's alpha. An expert panel ensured content, construct, and face validity of the test. Groups with the most training and exposure to nutrition (dietitians and dietetic interns) had the highest group mean score, ranging from 61% to 88%, whereas those with limited nutrition training (general practitioners, medical students, and nurses) had significantly lower scores, ranging from 26% to 80%. This result demonstrated criterion validity. Internal consistency of the overall test demonstrated a Cronbach's alpha of 0.99. Most HPs identified the mass media as their main source of information on lifestyle modification. These HPs also identified lack of time, lack of patient compliance, and lack of knowledge as barriers that prevent them from providing counseling on lifestyle modification. The results of this study showed that this test instrument identifies groups of health professionals with adequate training (knowledge) in lifestyle modification and those who require further training (knowledge).

  20. Computerized Planning of Cryosurgery Using Bubble Packing: An Experimental Validation on a Phantom Material

    PubMed Central

    Rossi, Michael R.; Tanaka, Daigo; Shimada, Kenji; Rabin, Yoed

    2009-01-01

    The current study focuses on experimentally validating a planning scheme based on the so-called bubble-packing method. This study is a part of an ongoing effort to develop computerized planning tools for cryosurgery, where bubble packing has been previously developed as a means to find an initial, uniform distribution of cryoprobes within a given domain; the so-called force-field analogy was then used to move cryoprobes to their optimum layout. However, due to the high quality of the cryoprobes’ distribution, suggested by bubble packing and its low computational cost, it has been argued that a planning scheme based solely on bubble packing may be more clinically relevant. To test this argument, an experimental validation is performed on a simulated cross-section of the prostate, using gelatin solution as a phantom material, proprietary liquid-nitrogen based cryoprobes, and a cryoheater to simulate urethral warming. Experimental results are compared with numerically simulated temperature histories resulting from planning. Results indicate an average disagreement of 0.8 mm in identifying the freezing front location, which is an acceptable level of uncertainty in the context of prostate cryosurgery imaging. PMID:19885373

  1. Validation of alternative methods for toxicity testing.

    PubMed Central

    Bruner, L H; Carr, G J; Curren, R D; Chamberlain, M

    1998-01-01

    Before nonanimal toxicity tests may be officially accepted by regulatory agencies, it is generally agreed that the validity of the new methods must be demonstrated in an independent, scientifically sound validation program. Validation has been defined as the demonstration of the reliability and relevance of a test method for a particular purpose. This paper provides a brief review of the development of the theoretical aspects of the validation process and updates current thinking about objectively testing the performance of an alternative method in a validation study. Validation of alternative methods for eye irritation testing is a specific example illustrating important concepts. Although discussion focuses on the validation of alternative methods intended to replace current in vivo toxicity tests, the procedures can be used to assess the performance of alternative methods intended for other uses. PMID:9599695

  2. Validation of a Videoconferenced Speaking Test

    ERIC Educational Resources Information Center

    Kim, Jungtae; Craig, Daniel A.

    2012-01-01

    Videoconferencing offers new opportunities for language testers to assess speaking ability in low-stakes diagnostic tests. To be considered a trusted testing tool in language testing, a test should be examined employing appropriate validation processes [Chapelle, C.A., Jamieson, J., & Hegelheimer, V. (2003). "Validation of a web-based ESL…

  3. Development and Validation of a Questionnaire to Detect Behavior Change in Multiple Advance Care Planning Behaviors

    PubMed Central

    Sudore, Rebecca L.; Stewart, Anita L.; Knight, Sara J.; McMahan, Ryan D.; Feuz, Mariko; Miao, Yinghui; Barnes, Deborah E.

    2013-01-01

    Introduction Advance directives have traditionally been considered the gold standard for advance care planning. However, recent evidence suggests that advance care planning involves a series of multiple discrete behaviors for which people are in varying stages of behavior change. The goal of our study was to develop and validate a survey to measure the full advance care planning process. Methods The Advance Care Planning Engagement Survey assesses “Process Measures” of factors known from Behavior Change Theory to affect behavior (knowledge, contemplation, self-efficacy, and readiness, using 5-point Likert scales) and “Action Measures” (yes/no) of multiple behaviors related to surrogate decision makers, values and quality of life, flexibility for surrogate decision making, and informed decision making. We administered surveys at baseline and 1 week later to 50 diverse, older adults from San Francisco hospitals. Internal consistency reliability of Process Measures was assessed using Cronbach's alpha (only continuous variables) and test-retest reliability of Process and Action Measures was examined using intraclass correlations. For discriminant validity, we compared Process and Action Measure scores between this cohort and 20 healthy college students (mean age 23.2 years, SD 2.7). Results Mean age was 69.3 (SD 10.5) and 42% were non-White. The survey took a mean of 21.4 minutes (±6.2) to administer. The survey had good internal consistency (Process Measures Cronbach's alpha, 0.94) and test-retest reliability (Process Measures intraclass correlation, 0.70; Action Measures, 0.87). Both Process and Action Measure scores were higher in the older than younger group, p<.001. Conclusion A new Advance Care Planning Engagement Survey that measures behavior change (knowledge, contemplation, self-efficacy, and readiness) and multiple advance care planning actions demonstrates good reliability and validity. Further research is needed to assess whether survey scores

  4. 10 CFR 26.131 - Cutoff levels for validity screening and initial validity tests.

    Code of Federal Regulations, 2010 CFR

    2010-01-01

    ... 10 Energy 1 2010-01-01 2010-01-01 false Cutoff levels for validity screening and initial validity tests. 26.131 Section 26.131 Energy NUCLEAR REGULATORY COMMISSION FITNESS FOR DUTY PROGRAMS Licensee Testing Facilities § 26.131 Cutoff levels for validity screening and initial validity tests. (a) Each...

  5. 10 CFR 26.131 - Cutoff levels for validity screening and initial validity tests.

    Code of Federal Regulations, 2011 CFR

    2011-01-01

    ... 10 Energy 1 2011-01-01 2011-01-01 false Cutoff levels for validity screening and initial validity tests. 26.131 Section 26.131 Energy NUCLEAR REGULATORY COMMISSION FITNESS FOR DUTY PROGRAMS Licensee Testing Facilities § 26.131 Cutoff levels for validity screening and initial validity tests. (a) Each...

  6. Development, test-retest reliability and validity of the Pharmacy Value-Added Services Questionnaire (PVASQ)

    PubMed Central

    Tan, Christine L.; Hassali, Mohamed A.; Saleem, Fahad; Shafie, Asrul A.; Aljadhey, Hisham; Gan, Vincent B.

    2015-01-01

    Objective: (i) To develop the Pharmacy Value-Added Services Questionnaire (PVASQ) using emerging themes generated from interviews. (ii) To establish reliability and validity of questionnaire instrument. Methods: Using an extended Theory of Planned Behavior as the theoretical model, face-to-face interviews generated salient beliefs of pharmacy value-added services. The PVASQ was constructed initially in English incorporating important themes and later translated into the Malay language with forward and backward translation. Intention (INT) to adopt pharmacy value-added services is predicted by attitudes (ATT), subjective norms (SN), perceived behavioral control (PBC), knowledge and expectations. Using a 7-point Likert-type scale and a dichotomous scale, test-retest reliability (N=25) was assessed by administering the questionnaire instrument twice, one week apart. Internal consistency was measured by Cronbach’s alpha and construct validity between two administrations was assessed using the kappa statistic and the intraclass correlation coefficient (ICC). Confirmatory Factor Analysis, CFA (N=410) was conducted to assess construct validity of the PVASQ. Results: The kappa coefficients indicate a moderate to almost perfect strength of agreement between test and retest. The ICC for all scales tested for intra-rater (test-retest) reliability was good. The overall Cronbach’s alpha (N=25) is 0.912 and 0.908 for the two time points. The result of CFA (N=410) showed most items loaded strongly and correctly into corresponding factors. Only one item was eliminated. Conclusions: This study is the first to develop and establish the reliability and validity of the Pharmacy Value-Added Services Questionnaire instrument using the Theory of Planned Behavior as the theoretical model. The translated Malay language version of PVASQ is reliable and valid to predict Malaysian patients’ intention to adopt pharmacy value-added services to collect partial medicine

  7. Development, test-retest reliability and validity of the Pharmacy Value-Added Services Questionnaire (PVASQ).

    PubMed

    Tan, Christine L; Hassali, Mohamed A; Saleem, Fahad; Shafie, Asrul A; Aljadhey, Hisham; Gan, Vincent B

    2015-01-01

    (i) To develop the Pharmacy Value-Added Services Questionnaire (PVASQ) using emerging themes generated from interviews. (ii) To establish reliability and validity of questionnaire instrument. Using an extended Theory of Planned Behavior as the theoretical model, face-to-face interviews generated salient beliefs of pharmacy value-added services. The PVASQ was constructed initially in English incorporating important themes and later translated into the Malay language with forward and backward translation. Intention (INT) to adopt pharmacy value-added services is predicted by attitudes (ATT), subjective norms (SN), perceived behavioral control (PBC), knowledge and expectations. Using a 7-point Likert-type scale and a dichotomous scale, test-retest reliability (N=25) was assessed by administering the questionnaire instrument twice, one week apart. Internal consistency was measured by Cronbach's alpha and construct validity between two administrations was assessed using the kappa statistic and the intraclass correlation coefficient (ICC). Confirmatory Factor Analysis, CFA (N=410) was conducted to assess construct validity of the PVASQ. The kappa coefficients indicate a moderate to almost perfect strength of agreement between test and retest. The ICC for all scales tested for intra-rater (test-retest) reliability was good. The overall Cronbach's alpha (N=25) is 0.912 and 0.908 for the two time points. The result of CFA (N=410) showed most items loaded strongly and correctly into corresponding factors. Only one item was eliminated. This study is the first to develop and establish the reliability and validity of the Pharmacy Value-Added Services Questionnaire instrument using the Theory of Planned Behavior as the theoretical model. The translated Malay language version of PVASQ is reliable and valid to predict Malaysian patients' intention to adopt pharmacy value-added services to collect partial medicine supply.

  8. Marketing Plan for Demonstration and Validation Assets

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    None, None

    The National Security Preparedness Project (NSPP) is to be sustained by various programs, including technology demonstration and evaluation (DEMVAL). This project assists companies in developing technologies under the National Security Technology Incubator program (NSTI) through demonstration and validation of technologies applicable to national security created by incubators and other sources. The NSPP also will support the creation of an integrated demonstration and validation environment. This report documents the DEMVAL marketing and visibility plan, which will focus on collecting information about, and expanding the visibility of, DEMVAL assets serving businesses with national security technology applications in southern New Mexico.

  9. Testing and validating environmental models

    USGS Publications Warehouse

    Kirchner, J.W.; Hooper, R.P.; Kendall, C.; Neal, C.; Leavesley, G.

    1996-01-01

    Generally accepted standards for testing and validating ecosystem models would benefit both modellers and model users. Universally applicable test procedures are difficult to prescribe, given the diversity of modelling approaches and the many uses for models. However, the generally accepted scientific principles of documentation and disclosure provide a useful framework for devising general standards for model evaluation. Adequately documenting model tests requires explicit performance criteria, and explicit benchmarks against which model performance is compared. A model's validity, reliability, and accuracy can be most meaningfully judged by explicit comparison against the available alternatives. In contrast, current practice is often characterized by vague, subjective claims that model predictions show 'acceptable' agreement with data; such claims provide little basis for choosing among alternative models. Strict model tests (those that invalid models are unlikely to pass) are the only ones capable of convincing rational skeptics that a model is probably valid. However, 'false positive' rates as low as 10% can substantially erode the power of validation tests, making them insufficiently strict to convince rational skeptics. Validation tests are often undermined by excessive parameter calibration and overuse of ad hoc model features. Tests are often also divorced from the conditions under which a model will be used, particularly when it is designed to forecast beyond the range of historical experience. In such situations, data from laboratory and field manipulation experiments can provide particularly effective tests, because one can create experimental conditions quite different from historical data, and because experimental data can provide a more precisely defined 'target' for the model to hit. We present a simple demonstration showing that the two most common methods for comparing model predictions to environmental time series (plotting model time series

  10. 30 CFR 282.23 - Testing Plan.

    Code of Federal Regulations, 2010 CFR

    2010-07-01

    ... 30 Mineral Resources 2 2010-07-01 2010-07-01 false Testing Plan. 282.23 Section 282.23 Mineral... § 282.23 Testing Plan. All testing activities shall be conducted in accordance with a Testing Plan..., to carry out a pilot program to evaluate processing techniques or technology or mining equipment, or...

  11. Dosimetric validation for an automatic brain metastases planning software using single-isocenter dynamic conformal arcs.

    PubMed

    Liu, Haisong; Li, Jun; Pappas, Evangelos; Andrews, David; Evans, James; Werner-Wasik, Maria; Yu, Yan; Dicker, Adam; Shi, Wenyin

    2016-09-08

    An automatic brain-metastases planning (ABMP) software has been installed in our institution. It is dedicated to treating multiple brain metastases with radiosurgery on linear accelerators (linacs) using a single-setup isocenter with noncoplanar dynamic conformal arcs. This study validates the calculated absolute dose and dose distribution of ABMP. Three types of measurements were performed to validate the planning software: 1, dual micro ion chambers were used with an acrylic phantom to measure the absolute dose; 2, a 3D cylindrical phantom with dual diode array was used to evaluate 2D dose distribution and point dose for smaller targets; and 3, a 3D pseudo-in vivo patient-specific phantom filled with polymer gels was used to evaluate the accuracy of 3D dose distribution and radiation delivery. Micro chamber measurement of two targets (volumes of 1.2 cc and 0.9 cc, respectively) showed that the percentage differences of the absolute dose at both targets were less than 1%. Averaged GI passing rate of five different plans measured with the diode array phantom was above 98%, using criteria of 3% dose difference, 1 mm distance to agreement (DTA), and 10% low-dose threshold. 3D gel phantom measurement results demonstrated a 3D displacement of nine targets of 0.7 ± 0.4 mm (range 0.2 ~ 1.1 mm). The averaged two-dimensional (2D) GI passing rate for several regions of interest (ROIs) on axial slices that encompass each one of the nine targets was above 98% (5% dose difference, 2 mm DTA, and 10% low-dose threshold). Measured D95, the minimum dose that covers 95% of the target volume, of the nine targets was 0.7% less than the calculated D95. Three different types of dosimetric verification methods were used and proved the dose calculation of the new automatic brain metastases planning (ABMP) software was clinically acceptable. The 3D pseudo-in vivo patient-specific gel phantom test also served as an end-to-end test for validating not only the dose calculation, but the

  12. WE-DE-201-04: Cross Validation of Knowledge-Based Treatment Planning for Prostate LDR Brachytherapy Using Principle Component Analysis

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Roper, J; Ghavidel, B; Godette, K

    Purpose: To validate a knowledge-based algorithm for prostate LDR brachytherapy treatment planning. Methods: A dataset of 100 cases was compiled from an active prostate seed implant service. Cases were randomized into 10 subsets. For each subset, the 90 remaining library cases were registered to a common reference frame and then characterized on a point-by-point basis using principal component analysis (PCA). Each test case was converted to PCA vectors using the same process and compared with each library case using a Mahalanobis distance to evaluate similarity. Rank order PCA scores were used to select the best-matched library case. The seed arrangement was extracted from the best-matched case and used as a starting point for planning the test case. Any subsequent modifications that required input from a treatment planner to achieve V100>95%, V150<60%, V200<20% were recorded. To simulate operating-room planning constraints, seed activity was held constant, and the seed count could not increase. Results: The computational time required to register test-case contours and evaluate PCA similarity across the library was 10 s. Preliminary analysis of 2 subsets shows that 9 of 20 test cases did not require any seed modifications to obtain an acceptable plan. Five test cases required fewer than 10 seed modifications or a grid shift. Another 5 test cases required approximately 20 seed modifications. An acceptable plan was not achieved for 1 outlier, which was substantially larger than its best match. Modifications took between 5 s and 6 min. Conclusion: A knowledge-based treatment planning algorithm for prostate LDR brachytherapy is being cross validated using 100 prior cases. Preliminary results suggest that for this size library, acceptable plans can be achieved without planner input in about half of the cases while varying amounts of planner input are needed in remaining cases. Computational time and planning time are compatible with clinical practice.
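
    As an illustration of the matching step described in this abstract, the sketch below projects a library of cases onto principal components and picks the library case with the smallest Mahalanobis distance to a test case. It is a minimal sketch under stated assumptions, not the authors' implementation: the feature vectors, the number of retained components, and the function names `fit_pca` and `best_match` are all hypothetical, and the abstract does not say how the covariance was estimated.

```python
import numpy as np

def fit_pca(library, n_components):
    """Return the mean, principal axes, and per-component variances of the library."""
    mean = library.mean(axis=0)
    centered = library - mean
    # The rows of vt are the principal axes of the centered data.
    _, s, vt = np.linalg.svd(centered, full_matrices=False)
    axes = vt[:n_components]
    variances = (s[:n_components] ** 2) / (library.shape[0] - 1)
    return mean, axes, variances

def best_match(library, test_case, n_components=3):
    """Index of the library case closest to test_case, plus all distances."""
    mean, axes, variances = fit_pca(library, n_components)
    lib_scores = (library - mean) @ axes.T
    test_scores = (test_case - mean) @ axes.T
    # PCA scores are uncorrelated, so the Mahalanobis distance reduces to
    # a Euclidean distance with each component scaled by its variance.
    diff = lib_scores - test_scores
    distances = np.sqrt(np.sum(diff ** 2 / variances, axis=1))
    return int(np.argmin(distances)), distances

# Hypothetical data: 90 library cases, 12 features each (assumed sizes).
rng = np.random.default_rng(0)
library = rng.normal(size=(90, 12))
test_case = library[17] + 0.001 * rng.normal(size=12)
index, distances = best_match(library, test_case)
print(index)
```

    In a workflow like the one described, the seed arrangement of the case at the returned index would serve as the starting point for planning the test case.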

  13. TU-D-201-05: Validation of Treatment Planning Dose Calculations: Experience Working with MPPG 5.a

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Xue, J; Park, J; Kim, L

    2016-06-15

    Purpose: The newly published Medical Physics Practice Guideline (MPPG 5.a) has set the minimum requirements for commissioning and QA of treatment planning dose calculations. We present our experience in the validation of a commercial treatment planning system based on MPPG 5.a. Methods: In addition to tests traditionally performed to commission a model-based dose calculation algorithm, extensive tests were carried out at short and extended SSDs, various depths, oblique gantry angles and off-axis conditions to verify the robustness and limitations of a dose calculation algorithm. A comparison between measured and calculated dose was performed based on validation tests and evaluation criteria recommended by MPPG 5.a. An ion chamber was used for the measurement of dose at points of interest, and diodes were used for photon IMRT/VMAT validations. Dose profiles were measured with a three-dimensional scanning system and calculated in the TPS using a virtual water phantom. Results: Calculated and measured absolute dose profiles were compared at each specified SSD and depth for open fields. The disagreement is easily identifiable with the difference curve. Subtle discrepancies revealed the limitations of the measurement, e.g., a spike at the high dose region and an asymmetrical penumbra observed on the tests with an oblique MLC beam. The excellent results we had (> 98% pass rate on 3%/3 mm gamma index) on the end-to-end tests for both IMRT and VMAT are attributed to the quality beam data and the good understanding of the modeling. The limitation of the model and the uncertainty of measurement were considered when comparing the results. Conclusion: The extensive tests recommended by the MPPG encourage us to understand the accuracy and limitations of a dose algorithm as well as the uncertainty of measurement. Our experience has shown how the suggested tests can be performed effectively to validate dose calculation models.
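
    The gamma index pass rate quoted in this abstract (3% dose difference, 3 mm distance to agreement) can be illustrated with a one-dimensional sketch. This is an assumption-laden toy, not the analysis used in the study: the function name `gamma_pass_rate`, the global normalization to the maximum calculated dose, the 10% low-dose cutoff, and the brute-force search over grid points are all choices made here for illustration.

```python
import numpy as np

def gamma_pass_rate(x, measured, calculated,
                    dose_tol=0.03, dta_mm=3.0, low_dose_cut=0.10):
    """Percentage of measured points with gamma <= 1 on a 1D profile."""
    x = np.asarray(x, dtype=float)
    measured = np.asarray(measured, dtype=float)
    calculated = np.asarray(calculated, dtype=float)

    d_max = calculated.max()                  # assumed global normalization
    dose_crit = dose_tol * d_max              # absolute dose criterion
    evaluated = measured >= low_dose_cut * d_max  # skip low-dose points

    gammas = []
    for xi, mi in zip(x[evaluated], measured[evaluated]):
        # Gamma is the minimum combined distance/dose metric over all
        # calculated points, each term scaled by its acceptance criterion.
        dist_term = ((x - xi) / dta_mm) ** 2
        dose_term = ((calculated - mi) / dose_crit) ** 2
        gammas.append(np.sqrt(np.min(dist_term + dose_term)))

    return 100.0 * np.mean(np.array(gammas) <= 1.0)

# Hypothetical Gaussian profile sampled every 1 mm.
x = np.linspace(-50, 50, 101)
calc = np.exp(-(x / 20.0) ** 2)
shifted = np.exp(-((x - 1.0) / 20.0) ** 2)    # 1 mm spatial shift
scaled = 1.10 * calc                          # 10% dose error

rate_same = gamma_pass_rate(x, calc, calc)
rate_shifted = gamma_pass_rate(x, shifted, calc)
rate_scaled = gamma_pass_rate(x, scaled, calc)
```

    A 1 mm shift stays within the 3 mm distance criterion, whereas a uniform 10% dose error cannot be rescued by any spatial offset near the peak, so the pass rate drops for the scaled profile.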

  14. Global Precipitation Measurement (GPM) Ground Validation: Plans and Preparations

    NASA Technical Reports Server (NTRS)

    Schwaller, M.; Bidwell, S.; Durning, F. J.; Smith, E.

    2004-01-01

    The Global Precipitation Measurement (GPM) program is an international partnership led by the National Aeronautics and Space Administration (NASA) and the Japan Aerospace Exploration Agency (JAXA). GPM will improve climate, weather, and hydro-meteorological forecasts through more frequent and more accurate measurement of precipitation across the globe. This paper describes the concept, the planning, and the preparations for Ground Validation within the GPM program. Ground Validation (GV) plays an important role in the program by investigating and quantitatively assessing the errors within the satellite retrievals. These quantitative estimates of retrieval errors will assist the scientific community by bounding the errors within their research products. The two fundamental requirements of the GPM Ground Validation program are: (1) error characterization of the precipitation retrievals and (2) continual improvement of the satellite retrieval algorithms. These two driving requirements determine the measurements, instrumentation, and location for ground observations. This paper outlines GV plans for estimating the systematic and random components of retrieval error and for characterizing the spatial and temporal structure of the error and plans for algorithm improvement in which error models are developed and experimentally explored to uncover the physical causes of errors within the retrievals. This paper discusses NASA locations for GV measurements as well as anticipated locations from international GPM partners. NASA's primary locations for validation measurements are an oceanic site at Kwajalein Atoll in the Republic of the Marshall Islands and a continental site in north-central Oklahoma at the U.S. Department of Energy's Atmospheric Radiation Measurement Program site.

  15. Validation of Mars-GRAM and Planned New Features

    NASA Technical Reports Server (NTRS)

    Justus, C. G.; Duvall, Aleta; Keller, Vernon W.

    2004-01-01

    For altitudes below 80 km, Mars Global Reference Atmospheric Model (Mars-GRAM 2001) is based on output climatology from NASA Ames Mars General Circulation Model (MGCM). At COSPAR 2002, results were presented of validation tests of Mars-GRAM versus data from Mars Global Surveyor Thermal Emission Spectrometer (TES) and Radio Science (RS) experiment. Further validation tests are presented comparing Mars-GRAM densities with those from the European Mars Climate Database (MCD), and comparing densities from both Mars-GRAM and MCD against TES observations. Throughout most of the height and latitude range of TES data (0-40 km and 70S to 70N), good agreement is found between atmospheric densities from Mars-GRAM and MCD. However, at the season and latitude zone for Mars Phoenix arrival and landing (Ls = 65 to 80 degrees and latitude 65 to 75N), Mars-GRAM densities are about 30 to 45 percent higher than MCD densities near 40 km altitude. Further evaluation is warranted concerning potential impact of these model differences on planning for Phoenix entry and descent. Three planned features for Mars-GRAM update are also discussed: (1) new MGCM and Thermospheric General Circulation Model data sets to be used as a revised basis for Mars-GRAM mean atmosphere, (2) a new feature to represent planetary-scale traveling waves for upper altitude density variations (such as found during Mars Odyssey aerobraking), and (3) a new model for effects of high resolution topographic slope on winds near the surface (0 to 4.5 km above MOLA topography level). Mars-GRAM slope winds will be computed from a diagnostic (algebraic) relationship based on Ye, Segal, and Pielke (1990). This approach differs from mesoscale models (such as MRAMS and Mars MM5), which use prognostic, full-physics solutions of the time- and space-dependent differential equations of motion. As such, slope winds in Mars-GRAM will be consistent with its "engineering-level" approach, and will be extremely fast and easy to evaluate

  16. Integrating MBSE into Ongoing Projects: Requirements Validation and Test Planning for the ISS SAFER

    NASA Technical Reports Server (NTRS)

    Anderson, Herbert A.; Williams, Antony; Pierce, Gregory

    2016-01-01

    The International Space Station (ISS) Simplified Aid for Extra Vehicular Activity (EVA) Rescue (SAFER) is the spacewalking astronaut's final safety measure against separating from the ISS and being unable to return safely. Since the late 1990s, the SAFER has been a standard element of the spacewalking astronaut's equipment. The ISS SAFER project was chartered to develop a new block of SAFER units using a highly similar design to the legacy SAFER (known as the USA SAFER). An on-orbit test module was also included in the project to enable periodic maintenance/propulsion system checkout on the ISS SAFER. On the ISS SAFER project, model-based systems engineering (MBSE) was not the initial systems engineering (SE) approach, given the volume of heritage systems engineering and integration (SE&I) products. The initial emphasis was ensuring traceability to ISS program standards as well as to legacy USA SAFER requirements. The requirements management capabilities of the Cradle systems engineering tool were to be utilized to that end. During development, however, MBSE approaches were applied selectively to address specific challenges in requirements validation and test and verification (T&V) planning, which provided measurable efficiencies to the project. From an MBSE perspective, ISS SAFER development presented a challenge and an opportunity. Addressing the challenge first, the project was tasked to use the original USA SAFER operational and design requirements baseline, with a number of additional ISS program requirements to address evolving certification expectations for systems operating on the ISS. Additionally, a need to redesign the ISS SAFER avionics architecture resulted in a set of changes to the design requirements baseline. Finally, the project added an entirely new functionality for on-orbit maintenance. 
After initial requirements integration, the system requirements count was approaching 1000, which represented a growth of 4x over the original USA SAFER system

  17. Spanish Transcultural Adaptation and Validity of the Behavioral Inattention Test

    PubMed Central

    Sánchez-Cabeza, Ángel; Huertas-Hoyas, Elisabet; Máximo-Bocanegra, Nuria; Martínez-Piédrola, Rosa María; Pérez-de-Heredia-Torres, Marta

    2017-01-01

    Objective: To adapt, validate, and translate the Behavioral Inattention Test as an assessment tool for Spanish individuals with unilateral spatial neglect. Design: A cross-sectional descriptive study. Setting: University laboratories. Participants: A sample of 75 Spanish stroke patients and 18 healthy control subjects. Interventions: Not applicable. Main Outcome Measures: The Behavioral Inattention Test. Results: The Spanish version of the Behavioral Inattention Test shows a high degree of reliability both in the complete test (α = .90) and in the conventional (α = .93) and behavioral subtests (α = .75). The concurrent validity between the total conventional and behavioral scores was high (r = −.80; p < 0.001). Significant differences were found between patients with and without unilateral spatial neglect (p < 0.001). In the comparison between right and left damaged sides, differences were found in all items, except for article reading (p = 0.156) and card sorting (p = 0.117). Conclusions: This measure is a useful tool for evaluating unilateral spatial neglect as it provides information on everyday problems. The BIT discriminates between stroke patients with and without unilateral spatial neglect. This measure constitutes a reliable tool for the diagnosis, planning, performance, and design of specific treatment programs intended to improve the functionality and quality of life of people with unilateral spatial neglect. PMID:29097959
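
    The reliability coefficients reported above (α) are Cronbach's alpha values. As a hedged illustration of how such a coefficient is computed from a respondents-by-items score matrix, the sketch below applies the standard formula; the function name `cronbach_alpha` and the toy score matrices are hypothetical and are not data from the study.

```python
import numpy as np

def cronbach_alpha(scores):
    """Cronbach's alpha for a (respondents x items) score matrix."""
    scores = np.asarray(scores, dtype=float)
    k = scores.shape[1]                                # number of items
    item_var_sum = scores.var(axis=0, ddof=1).sum()    # sum of item variances
    total_var = scores.sum(axis=1).var(ddof=1)         # variance of total score
    return (k / (k - 1)) * (1.0 - item_var_sum / total_var)

# Toy example: three items that always agree give alpha = 1.
alpha_perfect = cronbach_alpha([[1, 1, 1], [2, 2, 2], [3, 3, 3]])

# Toy example: two items that agree only partially.
alpha_partial = cronbach_alpha([[1, 2], [2, 1], [3, 4], [4, 3]])
```

    Alpha approaches 1 as the items covary more strongly relative to their individual variances, which is why it is read as a measure of internal consistency.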

  18. Antenna Test Facility (ATF): User Test Planning Guide

    NASA Technical Reports Server (NTRS)

    Lin, Greg

    2011-01-01

    Test process, milestones and inputs are unknowns to first-time users of the ATF. The User Test Planning Guide aids in establishing expectations for both NASA and non-NASA facility customers. The potential audience for this guide includes both internal and commercial spaceflight hardware/software developers. It is intended to assist their test engineering personnel in test planning and execution. Material covered includes a roadmap of the test process, roles and responsibilities of facility and user, major milestones, facility capabilities, and inputs required by the facility. Samples of deliverables, test article interfaces, and inputs necessary to define test scope, cost, and schedule are included as an appendix to the guide.

  19. Structures Test Laboratory (STL). User Test Planning Guide

    NASA Technical Reports Server (NTRS)

    Zipay, John J.

    2011-01-01

    Test process, milestones and inputs are unknowns to first-time users of the STL. The User Test Planning Guide aids in establishing expectations for both NASA and non-NASA facility customers. The potential audience for this guide includes both internal and commercial spaceflight hardware/software developers. It is intended to assist their test engineering personnel in test planning and execution. Material covered includes a roadmap of the test process, roles and responsibilities of facility and user, major milestones, facility capabilities, and inputs required by the facility. Samples of deliverables, test article interfaces, and inputs necessary to define test scope, cost, and schedule are included as an appendix to the guide.

  20. Planning of reach-and-grasp movements: effects of validity and type of object information

    NASA Technical Reports Server (NTRS)

    Loukopoulos, L. D.; Engelbrecht, S. F.; Berthier, N. E.

    2001-01-01

    Individuals are assumed to plan reach-and-grasp movements by using two separate processes. In 1 of the processes, extrinsic (direction, distance) object information is used in planning the movement of the arm that transports the hand to the target location (transport planning); whereas in the other, intrinsic (shape) object information is used in planning the preshaping of the hand and the grasping of the target object (manipulation planning). In 2 experiments, the authors used primes to provide information to participants (N = 5, Experiment 1; N = 6, Experiment 2) about extrinsic and intrinsic object properties. The validity of the prime information was systematically varied. The primes were succeeded by a cue, which always correctly identified the location and shape of the target object. Reaction times were recorded. Four models of transport and manipulation planning were tested. The only model that was consistent with the data was 1 in which arm transport and object manipulation planning were postulated to be independent processes that operate partially in parallel. The authors suggest that the processes involved in motor planning before execution are primarily concerned with the geometric aspects of the upcoming movement but not with the temporal details of its execution.

  1. Initial Teacher Licensure Testing in Tennessee: Test Validation.

    ERIC Educational Resources Information Center

    Bowman, Harry L.; Petry, John R.

    In 1988 a study was conducted to determine the validity of candidate teacher licensure examinations for use in Tennessee under the 1984 Comprehensive Education Reform Act. The Department of Education conducted a study to determine the validity of 11 previously unvalidated or extensively revised tests for certification and to make recommendations…

  2. Evaluating Test Validity: Reprise and Progress

    ERIC Educational Resources Information Center

    Shepard, Lorrie A.

    2016-01-01

    The AERA, APA, NCME Standards define validity as "the degree to which evidence and theory support the interpretations of test scores for proposed uses of tests". A century of disagreement about validity does not mean that there has not been substantial progress. This consensus definition brings together interpretations and use so that it…

  3. Institutional Effectiveness: A Model for Planning, Assessment & Validation.

    ERIC Educational Resources Information Center

    Truckee Meadows Community Coll., Sparks, NV.

    The report presents Truckee Meadows Community College's (Nevada) model for assessing institutional effectiveness and validating the College's mission and vision, and the strategic plan for carrying out the institutional effectiveness model. It also outlines strategic goals for the years 1999-2001. From the system-wide directive that education…

  4. Construct Validity of Neuropsychological Tests in Schizophrenia.

    ERIC Educational Resources Information Center

    Allen, Daniel N.; Aldarondo, Felito; Goldstein, Gerald; Huegel, Stephen G.; Gilbertson, Mark; van Kammen, Daniel P.

    1998-01-01

    The construct validity of neuropsychological tests in patients with schizophrenia was studied with 39 patients who were evaluated with a battery of six tests assessing attention, memory, and abstract reasoning abilities. Results support the construct validity of the neuropsychological tests in patients with schizophrenia. (SLD)

  5. Highly Efficient Training, Refinement, and Validation of a Knowledge-based Planning Quality-Control System for Radiation Therapy Clinical Trials

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Li, Nan; Carmona, Ruben; Sirak, Igor

    original versus KBP_FINAL plans across the 35-patient validation set. Paired t tests were used to test differences between planning sets. Results: KBP_FINAL plans outperformed manual planning across the validation set in all protocol-specific DVH cutpoints. The mean normal tissue complication probability for gastrointestinal toxicity was lower for KBP_FINAL versus validation-set plans (48.7% vs 53.8%, P<.001). Similarly, the estimated mean white blood cell count nadir was higher (2.77 vs 2.49 k/mL, P<.001) with KBP_FINAL plans, indicating lowered probability of hematologic toxicity. Conclusions: This work demonstrates that a KBP system can be efficiently trained and refined for use in radiation therapy clinical trials with minimal effort. This patient-specific plan quality control resulted in improvements on protocol-specific dosimetric endpoints.

  6. Alternative Vocabularies in the Test Validity Literature

    ERIC Educational Resources Information Center

    Markus, Keith A.

    2016-01-01

    Justification of testing practice involves moving from one state of knowledge about the test to another. Theories of test validity can (a) focus on the beginning of the process, (b) focus on the end, or (c) encompass the entire process. Analyses of four case studies test and illustrate three claims: (a) restrictions on validity entail a supplement…

  7. Validation of US3D for Capsule Aerodynamics using 05-CA Wind Tunnel Test Data

    NASA Technical Reports Server (NTRS)

    Schwing, Alan

    2012-01-01

    Several comparisons of computational fluid dynamics to wind tunnel test data are shown for the purpose of code validation. The wind tunnel test, 05-CA, uses a 7.66% model of NASA's Multi-Purpose Crew Vehicle in the 11-foot test section of the Ames Unitary Plan Wind tunnel. A variety of freestream conditions over four Mach numbers and three angles of attack are considered. Test data comparisons include time-averaged integrated forces and moments, time-averaged static pressure ports on the surface, and Strouhal Number. The applicability of the US3D code to subsonic and transonic flow over a bluff body is assessed on a comprehensive data set. With close comparison, this work validates US3D for highly separated flows similar to those examined here.

  8. A Note on Economic Content and Test Validity.

    ERIC Educational Resources Information Center

    Soper, John C.; Brenneke, Judith Staley

    1987-01-01

    Offers practical tips on how teachers can determine whether classroom tests are actually measuring what they are designed to measure. Discusses criterion-related validity, construct validity, and content validity. Demonstrates how to determine the degree of content validity a particular test may have for a particular course or unit. (Author/DH)

  9. 14 CFR 437.25 - Flight test plan.

    Code of Federal Regulations, 2014 CFR

    2014-01-01

    ... 14 Aeronautics and Space 4 2014-01-01 2014-01-01 false Flight test plan. 437.25 Section 437.25... TRANSPORTATION LICENSING EXPERIMENTAL PERMITS Requirements to Obtain an Experimental Permit Flight Test Plan § 437.25 Flight test plan. An applicant must— (a) Describe any flight test program, including estimated...

  10. 14 CFR 437.25 - Flight test plan.

    Code of Federal Regulations, 2013 CFR

    2013-01-01

    ... 14 Aeronautics and Space 4 2013-01-01 2013-01-01 false Flight test plan. 437.25 Section 437.25... TRANSPORTATION LICENSING EXPERIMENTAL PERMITS Requirements to Obtain an Experimental Permit Flight Test Plan § 437.25 Flight test plan. An applicant must— (a) Describe any flight test program, including estimated...

  11. 14 CFR 437.25 - Flight test plan.

    Code of Federal Regulations, 2012 CFR

    2012-01-01

    ... 14 Aeronautics and Space 4 2012-01-01 2012-01-01 false Flight test plan. 437.25 Section 437.25... TRANSPORTATION LICENSING EXPERIMENTAL PERMITS Requirements to Obtain an Experimental Permit Flight Test Plan § 437.25 Flight test plan. An applicant must— (a) Describe any flight test program, including estimated...

  12. Validation of the Lollipop Test: A Diagnostic Screening Test of School Readiness.

    ERIC Educational Resources Information Center

    Chew, Alex L.; Morris, John D.

    1984-01-01

    The validity of the Lollipop Test: A Diagnostic Screening Test of School Readiness was examined using the Metropolitan Readiness Test (MRT), Level I, Form Q, as the criterion. Appreciable concurrent validity was found across test batteries. Implications for school readiness screening are discussed. (Author/BS)

  13. Single Event Effect (SEE) Test Planning 101

    NASA Technical Reports Server (NTRS)

    LaBel, Kenneth A.; Pellish, Jonathan; Berg, Melanie D.

    2011-01-01

    This is a course on SEE Test Plan development. It is an introductory discussion of the items that go into planning an SEE test that should complement the SEE test methodology used. Material will only cover heavy ion SEE testing and not proton, LASER, or other testing, though many of the discussed items may be applicable. While standards and guidelines for how to perform single event effects (SEE) testing have existed almost since the first cyclotron testing, guidance on the development of SEE test plans has not been as easy to find. In this section of the short course, we attempt to rectify this lack. We consider the approach outlined here as a "living" document: mission specific constraints and new technology related issues always need to be taken into account. We note that we will use the term "test planning" in the context of those items being included in a test plan.

  14. 14 CFR 91.1041 - Aircraft proving and validation tests.

    Code of Federal Regulations, 2014 CFR

    2014-01-01

    ... 14 Aeronautics and Space 2 2014-01-01 2014-01-01 false Aircraft proving and validation tests. 91... Ownership Operations Program Management § 91.1041 Aircraft proving and validation tests. (a) No program... tests. However, pilot flight training may be conducted during the proving tests. (d) Validation testing...

  15. 14 CFR 91.1041 - Aircraft proving and validation tests.

    Code of Federal Regulations, 2012 CFR

    2012-01-01

    ... 14 Aeronautics and Space 2 2012-01-01 2012-01-01 false Aircraft proving and validation tests. 91... Ownership Operations Program Management § 91.1041 Aircraft proving and validation tests. (a) No program... tests. However, pilot flight training may be conducted during the proving tests. (d) Validation testing...

  16. 14 CFR 91.1041 - Aircraft proving and validation tests.

    Code of Federal Regulations, 2013 CFR

    2013-01-01

    ... 14 Aeronautics and Space 2 2013-01-01 2013-01-01 false Aircraft proving and validation tests. 91... Ownership Operations Program Management § 91.1041 Aircraft proving and validation tests. (a) No program... tests. However, pilot flight training may be conducted during the proving tests. (d) Validation testing...

  17. 14 CFR 91.1041 - Aircraft proving and validation tests.

    Code of Federal Regulations, 2011 CFR

    2011-01-01

    ... 14 Aeronautics and Space 2 2011-01-01 2011-01-01 false Aircraft proving and validation tests. 91... Ownership Operations Program Management § 91.1041 Aircraft proving and validation tests. (a) No program... tests. However, pilot flight training may be conducted during the proving tests. (d) Validation testing...

  18. 14 CFR 91.1041 - Aircraft proving and validation tests.

    Code of Federal Regulations, 2010 CFR

    2010-01-01

    ... 14 Aeronautics and Space 2 2010-01-01 2010-01-01 false Aircraft proving and validation tests. 91... Ownership Operations Program Management § 91.1041 Aircraft proving and validation tests. (a) No program... tests. However, pilot flight training may be conducted during the proving tests. (d) Validation testing...

  19. Understanding pregnancy planning in a low-income country setting: validation of the London measure of unplanned pregnancy in Malawi.

    PubMed

    Hall, Jennifer; Barrett, Geraldine; Mbwana, Nicholas; Copas, Andrew; Malata, Address; Stephenson, Judith

    2013-11-05

    The London Measure of Unplanned Pregnancy (LMUP) is a new and psychometrically valid measure of pregnancy intention that was developed in the United Kingdom. An improved understanding of pregnancy intention in low-income countries, where unintended pregnancies are common and maternal and neonatal deaths are high, is necessary to inform policies to address the unmet need for family planning. To this end this research aimed to validate the LMUP for use in the Chichewa language in Malawi. Three Chichewa speakers translated the LMUP and one translation was agreed which was back-translated and pre-tested on five pregnant women using cognitive interviews. The measure was field tested with pregnant women who were recruited at antenatal clinics and data were analysed using classical test theory and hypothesis testing. 125 women aged 15-43 (median 23), with parities of 1-8 (median 2) completed the Chichewa LMUP. There were no missing data. The full range of LMUP scores was captured. In terms of reliability, the scale was internally consistent (Cronbach's alpha = 0.78) and test-retest data from 70 women showed good stability (weighted Kappa 0.80). In terms of validity, hypothesis testing confirmed that unmarried women (p = 0.003), women who had four or more children alive (p = 0.0051) and women who were below 20 or over 29 (p = 0.0115) were all more likely to have unintended pregnancies. Principal component analysis showed that five of the six items loaded onto one factor, with a further item borderline. A sensitivity analysis to assess the effect of the removal of the weakest item of the scale showed slightly improved performance but as the LMUP was not significantly adversely affected by its inclusion we recommend retaining the six-item score. The Chichewa LMUP is a valid and reliable measure of pregnancy intention in Malawi and can now be used in research and/or surveillance. This is the first validation of this tool in a low-income country, helping to

  20. Project W-314 specific test and evaluation plan for transfer line SN-633 (241-AX-B to 241-AY-02A)

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Hays, W.H.

    1998-03-20

    The purpose of this Specific Test and Evaluation Plan (STEP) is to provide a detailed written plan for the systematic testing of modifications made by the addition of the SN-633 transfer line by the W-314 Project. The STEP develops the outline for test procedures that verify the system's performance to the established Project design criteria. The STEP is a lower tier document based on the W-314 Test and Evaluation Plan (TEP). This STEP encompasses all testing activities required to demonstrate compliance to the project design criteria as it relates to the addition of transfer line SN-633. The Project Design Specifications (PDS) identify the specific testing activities required for the Project. Testing includes Validations and Verifications (e.g., Commercial Grade Item Dedication activities), Factory Acceptance Tests (FATs), installation tests and inspections, Construction Acceptance Tests (CATs), Acceptance Test Procedures (ATPs), Pre-Operational Test Procedures (POTPs), and Operational Test Procedures (OTPs). It should be noted that POTPs are not required for testing of the transfer line addition. The STEP will be utilized in conjunction with the TEP for verification and validation.

  1. Evidence of Construct Validity in Published Achievement Tests.

    ERIC Educational Resources Information Center

    Nolet, Victor; Tindal, Gerald

    Valid interpretation of test scores is the shared responsibility of the test designer and the test user. Test publishers must provide evidence of the validity of the decisions their tests are intended to support, while test users are responsible for analyzing this evidence and subsequently using the test in the manner indicated by the publisher.…

  2. Validation Test Report For The CRWMS Analysis and Logistics Visually Interactive Model Calvin Version 3.0, 10074-Vtr-3.0-00

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    S. Gillespie

    2000-07-27

    This report describes the tests performed to validate the CRWMS "Analysis and Logistics Visually Interactive" Model (CALVIN) Version 3.0 (V3.0) computer code (STN: 10074-3.0-00). To validate the code, a series of test cases was developed in the CALVIN V3.0 Validation Test Plan (CRWMS M&O 1999a) that exercises the principal calculation models and options of CALVIN V3.0. Twenty-five test cases were developed: 18 logistics test cases and 7 cost test cases. These cases test the features of CALVIN in a sequential manner, so that the validation of each test case is used to demonstrate the accuracy of the input to subsequent calculations. Where necessary, the test cases utilize reduced-size data tables to make the hand calculations used to verify the results more tractable, while still adequately testing the code's capabilities. Acceptance criteria were established for the logistics and cost test cases in the Validation Test Plan (CRWMS M&O 1999a). The Logistics test cases were developed to test the following CALVIN calculation models: Spent nuclear fuel (SNF) and reactivity calculations; Options for altering reactor life; Adjustment of commercial SNF (CSNF) acceptance rates for fiscal year calculations and mid-year acceptance start; Fuel selection, transportation cask loading, and shipping to the Monitored Geologic Repository (MGR); Transportation cask shipping to and storage at an Interim Storage Facility (ISF); Reactor pool allocation options; and Disposal options at the MGR. Two types of cost test cases were developed: cases to validate the detailed transportation costs, and cases to validate the costs associated with the Civilian Radioactive Waste Management System (CRWMS) Management and Operating Contractor (M&O) and Regional Servicing Contractors (RSCs). For each test case, values calculated using Microsoft Excel 97 worksheets were compared to CALVIN V3.0 scenarios with the same input data and assumptions. All of the test case results compare with the

  3. Prototyping and validating requirements of radiation and nuclear emergency plan simulator

    NASA Astrophysics Data System (ADS)

    Hamid, AHA.; Rozan, MZA.; Ibrahim, R.; Deris, S.; Selamat, A.

    2015-04-01

    Organizational incapability in developing unrealistic, impractical, inadequate and ambiguous mechanisms of radiological and nuclear emergency preparedness and response plan (EPR) causing emergency plan disorder and severe disasters. These situations resulting from 65.6% of poor definition and unidentified roles and duties of the disaster coordinator. Those unexpected conditions brought huge aftermath to the first responders, operators, workers, patients and community at large. Hence, in this report, we discuss prototyping and validating of Malaysia radiation and nuclear emergency preparedness and response plan simulation model (EPRM). A prototyping technique was required to formalize the simulation model requirements. Prototyping as systems requirements validation was carried on to endorse the correctness of the model itself against the stakeholder's intentions in resolving those organizational incapability. We have made assumptions for the proposed emergency preparedness and response model (EPRM) through the simulation software. Those assumptions provided a twofold of expected mechanisms, planning and handling of the respective emergency plan as well as in bringing off the hazard involved. This model called RANEPF (Radiation and Nuclear Emergency Planning Framework) simulator demonstrated the training emergency response prerequisites rather than the intervention principles alone. The demonstrations involved the determination of the casualties' absorbed dose range screening and the coordination of the capacity planning of the expected trauma triage. Through user-centred design and sociotechnical approach, RANEPF simulator was strategized and simplified, though certainly it is equally complex.

  4. GOSAT-2 : Science Plan, Products, Validation, and Application

    NASA Astrophysics Data System (ADS)

    Matsunaga, T.; Morino, I.; Yoshida, Y.; Saito, M.; Hiraki, K.; Yokota, Y.; Kamei, A.; Oishi, Y.; Dupuy, E.; Murakami, K.; Ninomiya, K.; Pang, J. S.; Yokota, T.; Maksyutov, S. S.; Machida, T.; Saigusa, N.; Mukai, H.; Nakajima, M.; Imasu, R.; Nakajima, T.

    2013-12-01

    Based on the success of Greenhouse Gases Observing Satellite (GOSAT) launched in 2009, Ministry of the Environment (MOE), Japan Aerospace Exploration Agency (JAXA), and National Institute for Environmental Studies (NIES) started the preparations for the follow-on satellite, GOSAT-2, in FY2011. The current target launch year of GOSAT-2 is FY2017. The objectives of GOSAT-2 include: 1) Continue and enhance spaceborne greenhouse gases observation started by GOSAT, 2) Improve our understanding of global and regional carbon cycles, and 3) Contribute to the climate change related policies as one of MRV (Measurement, Reporting, and Verification) tools for carbon emission reduction. As a scientific background/rationale of GOSAT-2, GOSAT-2 Science Plan is being edited by GOSAT-2 Science Team Preparation Committee. Not only carbon dioxide and methane but also carbon monoxide, tropospheric ozone, and aerosols are discussed in the plan. GOSAT-2 Level 2 (gas concentrations) and Level 4 (gas fluxes) products will be operationally generated at and distributed from GOSAT-2 Data Handling Facility located in NIES. In addition, a new supercomputer dedicated to GOSAT-2 research and development will also be installed in NIES. GOSAT-2 validation plan is also being discussed. Its baseline is similar to the current GOSAT. But various efforts will be made to extend the coverage of validation data for GOSAT-2. The efforts include the increased commercial passenger aircraft volunteering atmospheric measurements and additional ground-based Fourier transform spectrometers to be newly installed in Asian countries. In addition, a compact accelerator mass spectrometer is being introduced to NIES to investigate the contributions of anthropogenic emissions, which is important for GOSAT-2. Climate change related policies include JCM (Joint Crediting Mechanism) in which MRV plays a critical role. MRV tools used in the existing JCM projects are mostly ground-based and site-specific. Satellite atmospheric

  5. Validity and reliability of the NAB Naming Test.

    PubMed

    Sachs, Bonnie C; Rush, Beth K; Pedraza, Otto

    2016-05-01

    Confrontation naming is commonly assessed in neuropsychological practice, but few standardized measures of naming exist and those that do are susceptible to the effects of education and culture. The Neuropsychological Assessment Battery (NAB) Naming Test is a 31-item measure used to assess confrontation naming. Despite adequate psychometric information provided by the test publisher, there has been limited independent validation of the test. In this study, we investigated the convergent and discriminant validity, internal consistency, and alternate forms reliability of the NAB Naming Test in a sample of adults (Form 1: n = 247, Form 2: n = 151) clinically referred for neuropsychological evaluation. Results indicate adequate-to-good internal consistency and alternate forms reliability. We also found strong convergent validity as demonstrated by relationships with other neurocognitive measures. We found preliminary evidence that the NAB Naming Test demonstrates a more pronounced ceiling effect than other commonly used measures of naming. To our knowledge, this represents the largest published independent validation study of the NAB Naming Test in a clinical sample. Our findings suggest that the NAB Naming Test demonstrates adequate validity and reliability and merits consideration in the test arsenal of clinical neuropsychologists.

  6. Radiant Heat Test Facility (RHTF): User Test Planning Guide

    NASA Technical Reports Server (NTRS)

    DelPapa, Steven

    2011-01-01

    Test process, milestones and inputs are unknowns to first-time users of the RHTF. The User Test Planning Guide aids in establishing expectations for both NASA and non-NASA facility customers. The potential audience for this guide includes both internal and commercial spaceflight hardware/software developers. It is intended to assist their test engineering personnel in test planning and execution. Material covered includes a roadmap of the test process, roles and responsibilities of facility and user, major milestones, facility capabilities, and inputs required by the facility. Samples of deliverables, test article interfaces, and inputs necessary to define test scope, cost, and schedule are included as an appendix to the guide.

  7. Electronic Systems Test Laboratory (ESTL) User Test Planning Guide

    NASA Technical Reports Server (NTRS)

    Robinson, Neil

    2011-01-01

    Test process, milestones and inputs are unknowns to first-time users of the ESTL. The User Test Planning Guide aids in establishing expectations for both NASA and non-NASA facility customers. The potential audience for this guide includes both internal and commercial spaceflight hardware/software developers. It is intended to assist their test engineering personnel in test planning and execution. Material covered includes a roadmap of the test process, roles and responsibilities of facility and user, major milestones, facility capabilities, and inputs required by the facility. Samples of deliverables, test article interfaces, and inputs necessary to define test scope, cost, and schedule are included as an appendix to the guide.

  8. College Text Test Validity.

    ERIC Educational Resources Information Center

    McAfee, Donald C.

    1979-01-01

    A team of faculty members and graduate students identified major concepts and developed validated test questions for two widely used textbooks in personal hygiene classes in order to standardize norms for classes and supplement inadequate instructor's manuals. (JMF)

  9. Evaluating the Content Validity of Multistage-Adaptive Tests

    ERIC Educational Resources Information Center

    Crotts, Katrina; Sireci, Stephen G.; Zenisky, April

    2012-01-01

    Validity evidence based on test content is important for educational tests to demonstrate the degree to which they fulfill their purposes. Most content validity studies involve subject matter experts (SMEs) who rate items that comprise a test form. In computerized-adaptive testing, examinees take different sets of items and test "forms"…

  10. STAR-CCM+ Verification and Validation Plan

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Pointer, William David

    2016-09-30

    The commercial Computational Fluid Dynamics (CFD) code STAR-CCM+ provides general purpose finite volume method solutions for fluid dynamics and energy transport. This document defines plans for verification and validation (V&V) of the base code and models implemented within the code by the Consortium for Advanced Simulation of Light water reactors (CASL). The software quality assurance activities described herein are part of the overall software life cycle defined in the CASL Software Quality Assurance (SQA) Plan [Sieger, 2015]. STAR-CCM+ serves as the principal foundation for development of an advanced predictive multi-phase boiling simulation capability within CASL. The CASL Thermal Hydraulics Methods (THM) team develops advanced closure models required to describe the subgrid-resolution behavior of secondary fluids or fluid phases in multiphase boiling flows within the Eulerian-Eulerian framework of the code. These include wall heat partitioning models that describe the formation of vapor on the surface and the forces that define bubble/droplet dynamic motion. The CASL models are implemented as user coding or field functions within the general framework of the code. This report defines procedures and requirements for V&V of the multi-phase CFD capability developed by CASL THM. Results of V&V evaluations will be documented in a separate STAR-CCM+ V&V assessment report. This report is expected to be a living document and will be updated as additional validation cases are identified and adopted as part of the CASL THM V&V suite.

  11. Specialized Environmental Chamber Test Complex: User Test Planning Guide

    NASA Technical Reports Server (NTRS)

    Montz, Michael E.

    2011-01-01

    Test process, milestones and inputs are unknowns to first-time users of the Specialized Environmental Test Complex. The User Test Planning Guide aids in establishing expectations for both NASA and non-NASA facility customers. The potential audience for this guide includes both internal and commercial spaceflight hardware/software developers. It is intended to assist their test engineering personnel in test planning and execution. Material covered includes a roadmap of the test process, roles and responsibilities of facility and user, major milestones, facility capabilities, and inputs required by the facility. Samples of deliverables, test article interfaces, and inputs necessary to define test scope, cost, and schedule are included as an appendix to the guide.

  12. 15 CFR 995.27 - Format validation software testing.

    Code of Federal Regulations, 2013 CFR

    2013-01-01

    ... 15 Commerce and Foreign Trade 3 2013-01-01 2013-01-01 false Format validation software testing... of NOAA ENC Products § 995.27 Format validation software testing. Tests shall be performed verifying, as far as reasonable and practicable, that CEVAD's data testing software performs the checks, as...

  13. 15 CFR 995.27 - Format validation software testing.

    Code of Federal Regulations, 2014 CFR

    2014-01-01

    ... 15 Commerce and Foreign Trade 3 2014-01-01 2014-01-01 false Format validation software testing... of NOAA ENC Products § 995.27 Format validation software testing. Tests shall be performed verifying, as far as reasonable and practicable, that CEVAD's data testing software performs the checks, as...

  14. 15 CFR 995.27 - Format validation software testing.

    Code of Federal Regulations, 2012 CFR

    2012-01-01

    ... 15 Commerce and Foreign Trade 3 2012-01-01 2012-01-01 false Format validation software testing... of NOAA ENC Products § 995.27 Format validation software testing. Tests shall be performed verifying, as far as reasonable and practicable, that CEVAD's data testing software performs the checks, as...

  15. 15 CFR 995.27 - Format validation software testing.

    Code of Federal Regulations, 2011 CFR

    2011-01-01

    ... 15 Commerce and Foreign Trade 3 2011-01-01 2011-01-01 false Format validation software testing... of NOAA ENC Products § 995.27 Format validation software testing. Tests shall be performed verifying, as far as reasonable and practicable, that CEVAD's data testing software performs the checks, as...

  16. Validating the Electric Maze Task as a Measure of Planning

    ERIC Educational Resources Information Center

    Sheppard, Kelly W.; Cheatham, Carol L.

    2017-01-01

    The Electric Maze Task (EMT) is a novel planning task designed to allow flexible testing of planning abilities across a broad age range and to incorporate manipulations to test underlying planning abilities, such as working-memory and inhibitory control skills. The EMT was tested in a group of 63 typically developing 7- to 12-year-olds.…

  17. Development of Level 2 Calibration and Validation Plans for GOES-R; What is a RIMP?

    NASA Technical Reports Server (NTRS)

    Kopp, Thomas J.; Belsma, Leslie O.; Mollner, Andrew K.; Sun, Ziping; Deluccia, Frank

    2017-01-01

    Calibration and Validation (CalVal) plans for Geostationary Operational Environmental Satellite version R (GOES-R) Level 2 (L2) products were documented via Resource, Implementation, and Management Plans (RIMPs) for all of the official L2 products required from the GOES-R Advanced Baseline Imager (ABI). In 2015 the GOES-R program decided to replace the typical CalVal plans with RIMPs that covered, for a given L2 product, what was required from that product, how it would be validated, and what tools would be used to do so. Similar to Level 1b products, the intent was to cover the full spectrum of planning required for the CalVal of L2 ABI products. Instead of focusing on step-by-step procedures, the RIMPs concentrated on the criteria for each stage of the validation process (Beta, Provisional, and Full Validation) and the many elements required to prove when each stage was reached.

  18. Multiyear Plan for Validation of EnergyPlus Multi-Zone HVAC System Modeling using ORNL's Flexible Research Platform

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Im, Piljae; Bhandari, Mahabir S.; New, Joshua Ryan

    This document describes the Oak Ridge National Laboratory (ORNL) multiyear experimental plan for validation and uncertainty characterization of whole-building energy simulation for a multi-zone research facility using a traditional rooftop unit (RTU) as a baseline heating, ventilating, and air conditioning (HVAC) system. The project’s overarching objective is to increase the accuracy of energy simulation tools by enabling empirical validation of key inputs and algorithms. Doing so is required to inform the design of increasingly integrated building systems and to enable accountability for performance gaps between design and operation of a building. The project will produce documented data sets that can be used to validate key functionality in different energy simulation tools and to identify errors and inadequate assumptions in simulation engines so that developers can correct them. ASHRAE Standard 140, Method of Test for the Evaluation of Building Energy Analysis Computer Programs (ASHRAE 2004), currently consists primarily of tests to compare different simulation programs with one another. This project will generate sets of measured data to enable empirical validation, incorporate these test data sets in an extended version of Standard 140, and apply these tests to the Department of Energy’s (DOE) EnergyPlus software (EnergyPlus 2016) to initiate the correction of any significant deficiencies. The fitness-for-purpose of the key algorithms in EnergyPlus will be established and demonstrated, and vendors of other simulation programs will be able to demonstrate the validity of their products. The data set will be equally applicable to validation of other simulation engines as well.

  19. In-Space Structural Validation Plan for a Stretched-Lens Solar Array Flight Experiment

    NASA Technical Reports Server (NTRS)

    Pappa, Richard S.; Woods-Vedeler, Jessica A.; Jones, Thomas W.

    2001-01-01

    This paper summarizes in-space structural validation plans for a proposed Space Shuttle-based flight experiment. The test article is an innovative, lightweight solar array concept that uses pop-up, refractive stretched-lens concentrators to achieve a power/mass density of at least 175 W/kg, which is more than three times greater than current capabilities. The flight experiment will validate this new technology to retire the risk associated with its first use in space. The experiment includes structural diagnostic instrumentation to measure the deployment dynamics, static shape, and modes of vibration of the 8-meter-long solar array and several of its lenses. These data will be obtained by photogrammetry using the Shuttle payload-bay video cameras and miniature video cameras on the array. Six accelerometers are also included in the experiment to measure base excitations and small-amplitude tip motions.

  20. Prototyping and validating requirements of radiation and nuclear emergency plan simulator

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Hamid, AHA., E-mail: amyhamijah@nm.gov.my; Faculty of Computing, Universiti Teknologi Malaysia; Rozan, MZA.

    2015-04-29

    Organizational incapability in developing unrealistic, impractical, inadequate and ambiguous mechanisms of radiological and nuclear emergency preparedness and response plan (EPR) causing emergency plan disorder and severe disasters. These situations resulting from 65.6% of poor definition and unidentified roles and duties of the disaster coordinator. Those unexpected conditions brought huge aftermath to the first responders, operators, workers, patients and community at large. Hence, in this report, we discuss prototyping and validating of Malaysia radiation and nuclear emergency preparedness and response plan simulation model (EPRM). A prototyping technique was required to formalize the simulation model requirements. Prototyping as systems requirements validation was carried on to endorse the correctness of the model itself against the stakeholder’s intentions in resolving those organizational incapability. We have made assumptions for the proposed emergency preparedness and response model (EPRM) through the simulation software. Those assumptions provided a twofold of expected mechanisms, planning and handling of the respective emergency plan as well as in bringing off the hazard involved. This model called RANEPF (Radiation and Nuclear Emergency Planning Framework) simulator demonstrated the training emergency response prerequisites rather than the intervention principles alone. The demonstrations involved the determination of the casualties’ absorbed dose range screening and the coordination of the capacity planning of the expected trauma triage. Through user-centred design and sociotechnical approach, RANEPF simulator was strategized and simplified, though certainly it is equally complex.

  1. 15 CFR 995.27 - Format validation software testing.

    Code of Federal Regulations, 2010 CFR

    2010-01-01

    ... 15 Commerce and Foreign Trade 3 2010-01-01 2010-01-01 false Format validation software testing... CERTIFICATION REQUIREMENTS FOR NOAA HYDROGRAPHIC PRODUCTS AND SERVICES CERTIFICATION REQUIREMENTS FOR... of NOAA ENC Products § 995.27 Format validation software testing. Tests shall be performed verifying...

  2. 14 CFR 135.145 - Aircraft proving and validation tests.

    Code of Federal Regulations, 2011 CFR

    2011-01-01

    ... 14 Aeronautics and Space 3 2011-01-01 2011-01-01 false Aircraft proving and validation tests. 135... Aircraft and Equipment § 135.145 Aircraft proving and validation tests. (a) No certificate holder may...) Validation testing is required to determine that a certificate holder is capable of conducting operations...

  3. 14 CFR 135.145 - Aircraft proving and validation tests.

    Code of Federal Regulations, 2013 CFR

    2013-01-01

    ... 14 Aeronautics and Space 3 2013-01-01 2013-01-01 false Aircraft proving and validation tests. 135... Aircraft and Equipment § 135.145 Aircraft proving and validation tests. (a) No certificate holder may...) Validation testing is required to determine that a certificate holder is capable of conducting operations...

  4. 14 CFR 135.145 - Aircraft proving and validation tests.

    Code of Federal Regulations, 2010 CFR

    2010-01-01

    ... 14 Aeronautics and Space 3 2010-01-01 2010-01-01 false Aircraft proving and validation tests. 135... Aircraft and Equipment § 135.145 Aircraft proving and validation tests. (a) No certificate holder may...) Validation testing is required to determine that a certificate holder is capable of conducting operations...

  5. 14 CFR 135.145 - Aircraft proving and validation tests.

    Code of Federal Regulations, 2014 CFR

    2014-01-01

    ... 14 Aeronautics and Space 3 2014-01-01 2014-01-01 false Aircraft proving and validation tests. 135... Aircraft and Equipment § 135.145 Aircraft proving and validation tests. (a) No certificate holder may...) Validation testing is required to determine that a certificate holder is capable of conducting operations...

  6. 14 CFR 135.145 - Aircraft proving and validation tests.

    Code of Federal Regulations, 2012 CFR

    2012-01-01

    ... 14 Aeronautics and Space 3 2012-01-01 2012-01-01 false Aircraft proving and validation tests. 135... Aircraft and Equipment § 135.145 Aircraft proving and validation tests. (a) No certificate holder may...) Validation testing is required to determine that a certificate holder is capable of conducting operations...

  7. Energy Systems Test Area (ESTA) Battery Test Operations User Test Planning Guide

    NASA Technical Reports Server (NTRS)

    Salinas, Michael

    2012-01-01

    Test process, milestones and inputs are unknowns to first-time users of the ESTA Battery Test Operations. The User Test Planning Guide aids in establishing expectations for both NASA and non-NASA facility customers. The potential audience for this guide includes both internal and commercial spaceflight hardware/software developers. It is intended to assist their test engineering personnel in test planning and execution. Material covered includes a roadmap of the test process, roles and responsibilities of facility and user, major milestones, facility capabilities, and inputs required by the facility. Samples of deliverables, test article interfaces, and inputs necessary to define test scope, cost, and schedule are included as an appendix to the guide.

  8. Construct Validity of the Nepalese School Leaving English Reading Test

    ERIC Educational Resources Information Center

    Dawadi, Saraswati; Shrestha, Prithvi N.

    2018-01-01

    There has been a steady interest in investigating the validity of language tests in the last decades. Despite numerous studies on construct validity in language testing, there are not many studies examining the construct validity of a reading test. This paper reports on a study that explored the construct validity of the English reading test in…

  9. Validation of a novel robot-assisted 3DUS system for real-time planning and guidance of breast interstitial HDR brachytherapy

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Poulin, Eric; Beaulieu, Luc, E-mail: Luc.Beaulieu@phy.ulaval.ca; Gardi, Lori

    Purpose: In current clinical practice, there is no integrated 3D ultrasound (3DUS) guidance system clinically available for breast brachytherapy. In this study, the authors present a novel robot-assisted 3DUS system for real-time planning and guidance of breast interstitial high dose rate (HDR) brachytherapy treatment. Methods: For this work, a new computer controlled robotic 3DUS system was built to perform a hybrid motion scan, which is a combination of a 6 cm linear translation with a 30° rotation at both ends. The new 3DUS scanner was designed to fit on a modified Kuske assembly, keeping the current template grid configuration but modifying the frame to allow the mounting of the 3DUS system at several positions. A finer grid was also tested. A user interface was developed to perform image reconstruction, semiautomatic segmentation of the surgical bed as well as catheter reconstruction and tracking. A 3D string phantom was used to validate the geometric accuracy of the reconstruction. The volumetric accuracy of the system was validated with phantoms using magnetic resonance imaging (MRI) and computed tomography (CT) images. In order to accurately determine whether 3DUS can effectively replace CT for treatment planning, the authors have compared the 3DUS catheter reconstruction to the one obtained from CT images. In addition, in agarose-based phantoms, an end-to-end procedure was performed by executing six independent complete procedures with both 14 and 16 catheters, and for both standard and finer Kuske grids. Finally, in phantoms, five end-to-end procedures were performed with the final CT planning for the validation of 3DUS preplanning. Results: The 3DUS acquisition time is approximately 10 s. A paired Student t-test showed that there was no statistically significant difference between known and measured values of string separations in each direction. Both MRI and CT volume measurements were not statistically different from 3DUS volume (Student t-test

  10. Official Position of the American Academy of Clinical Neuropsychology Social Security Administration Policy on Validity Testing: Guidance and Recommendations for Change.

    PubMed

    Chafetz, M D; Williams, M A; Ben-Porath, Y S; Bianchini, K J; Boone, K B; Kirkwood, M W; Larrabee, G J; Ord, J S

    2015-01-01

    The milestone publication by Slick, Sherman, and Iverson (1999) of criteria for determining malingered neurocognitive dysfunction led to extensive research on validity testing. Position statements by the National Academy of Neuropsychology and the American Academy of Clinical Neuropsychology (AACN) recommended routine validity testing in neuropsychological evaluations. Despite this widespread scientific and professional support, the Social Security Administration (SSA) continued to discourage validity testing, a stance that led to a congressional initiative for SSA to reevaluate their position. In response, SSA commissioned the Institute of Medicine (IOM) to evaluate the science concerning the validation of psychological testing. The IOM concluded that validity assessment was necessary in psychological and neuropsychological examinations (IOM, 2015). The AACN sought to provide independent expert guidance and recommendations concerning the use of validity testing in disability determinations. A panel of contributors to the science of validity testing and its application to the disability process was charged with describing why the disability process for SSA needs improvement, and indicating the necessity for validity testing in disability exams. This work showed how the determination of malingering is a probability proposition, described how different types of validity tests are appropriate, provided evidence concerning non-credible findings in children and low-functioning individuals, and discussed the appropriate evaluation of pain disorders typically seen outside of mental consultations. A scientific plan for validity assessment that additionally protects test security is needed in disability determinations and in research on classification accuracy of disability decisions.

  11. Test Takers and the Validity of Score Interpretations

    ERIC Educational Resources Information Center

    Kopriva, Rebecca J.; Thurlow, Martha L.; Perie, Marianne; Lazarus, Sheryl S.; Clark, Amy

    2016-01-01

    This article argues that test takers are as integral to determining validity of test scores as defining target content and conditioning inferences on test use. A principled sustained attention to how students interact with assessment opportunities is essential, as is a principled sustained evaluation of evidence confirming the validity or calling…

  12. Design and validation of pictograms in a pediatric anaphylaxis action plan.

    PubMed

    Mok, Garrick; Vaillancourt, Régis; Irwin, Danica; Wong, Alexandre; Zemek, Roger; Alqurashi, Waleed

    2015-05-01

    Current anaphylaxis action plans (AAPs) are based on written instructions without inclusion of pictograms. To develop an AAP with pictorial aids and to prospectively validate the pictogram components of this plan. Participants recruited from the emergency department and allergy clinic participated in a questionnaire to validate pictograms depicting key counseling points of an anaphylactic reaction. Children ≥ 10 years of age and caregivers of children < 10 years with acute anaphylaxis or who carried epinephrine auto-injector for confirmed allergy were eligible. Guessability, translucency, and recall were assessed for 11 pictogram designs. Pictograms identified as correct or partially correct by at least 85% of participants were considered valid. Three independent reviewers assessed these outcome measures. Of the 115 total participants, 73 (63%) were female, 76 (66%) were parents/guardians, and 39 (34%) were children aged 10-17. Overall, 10 pictograms (91%) reached ≥ 85% for correct guessability, translucency, and recall. Four pictograms were redesigned to reach the preset validation target. One pictogram depicting symptom management (5-min wait time after first epinephrine treatment) reached 82% translucency after redesign. However, it reached 98% and 100% of correct guessability and recall, respectively. We prospectively designed and validated a set of pictograms to be included in an AAP. The incorporation of validated pictograms into an AAP may potentially increase comprehension of the triggers, signs and symptoms, and management of an anaphylactic reaction. © 2015 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  13. Predictive Validity Study of the APS Writing and Reading Tests [and] Validating Placement Rules for the APS Writing Test.

    ERIC Educational Resources Information Center

    College of the Canyons, Valencia, CA. Office of Institutional Development.

    California's College of the Canyons has used the College Board Assessment and Placement Services (APS) test to assess students' abilities in basic and college English since spring 1993. These two reports summarize data from a May 1994 study of the predictive validity of the APS writing and reading tests and a June 1994 effort to validate the cut…

  14. Validating use of a critical thinking test for the dental admission test.

    PubMed

    Tsai, Tsung-Hsun

    2014-04-01

    The purpose of this study was to validate the use of a test to assess dental school applicants' critical thinking abilities. The intent was to include this test on the Dental Admission Test (DAT) if it was shown to enhance the DAT's validity. Correlation and regression analyses of undergraduate and dental school performance with scores on each of the tests on the DAT battery and the California Critical Thinking Skills Test (CCTST) were performed. Data were collected from 439 third- and fourth-year dental students who consented to participate and were enrolled at one of the ten accredited dental schools included in the study. These ten dental schools were from most regions of the United States. This study concluded that including the CCTST on the DAT did not significantly enhance the DAT's validity.

  15. Vibration and Acoustic Test Facility (VATF): User Test Planning Guide

    NASA Technical Reports Server (NTRS)

    Fantasia, Peter M.

    2011-01-01

    Test process, milestones and inputs are unknowns to first-time users of the VATF. The User Test Planning Guide aids in establishing expectations for both NASA and non-NASA facility customers. The potential audience for this guide includes both internal and commercial spaceflight hardware/software developers. It is intended to assist their test engineering personnel in test planning and execution. Material covered includes a roadmap of the test process, roles and responsibilities of facility and user, major milestones, facility capabilities, and inputs required by the facility. Samples of deliverables, test article interfaces, and inputs necessary to define test scope, cost, and schedule are included as an appendix to the guide.

  16. The Concurrent Validity of Four Tests of Metalinguistic Awareness.

    ERIC Educational Resources Information Center

    Day, Kaaren C.; Day, H. D.

    1991-01-01

    Examines the concurrent validity of four metalinguistic awareness tests (Written Language Awareness Test, Test of Early Reading Ability, Linguistic Awareness in Reading Readiness Test, and the Concepts about Print Test). Finds rather low concurrent validity coefficients which suggests that further work is needed to clarify the operations required…

  17. Developing and testing new smoking measures for the Health Plan Employer Data and Information Set.

    PubMed

    Pbert, Lori; Vuckovic, Nancy; Ockene, Judith K; Hollis, Jack F; Riedlinger, Karen

    2003-04-01

    To develop and test items for the Health Plan Employer Data and Information Set (HEDIS) that assess delivery of the full range of provider-delivered tobacco interventions. The authors identified potential items via literature review; items were reviewed by national experts. Face validity of candidate items was tested in focus groups. The final survey was sent to a random sample of 1711 adult primary care patients; the re-test survey was sent to self-identified smokers. The process identified reliable items to capture provider assessment of motivation and provision of assistance and follow-up. One can reliably assess patient self-report of provider delivery of the full range of brief tobacco interventions. Such assessment and feedback to health plans and providers may increase use of evidence-based brief interventions.

  18. Validation of a pregnancy planning measure for Arabic-speaking women.

    PubMed

    Almaghaslah, Eman; Rochat, Roger; Farhat, Ghada

    2017-01-01

    The prevalence of unplanned pregnancy in Saudi Arabia has not been thoroughly investigated. To conduct a psychometric evaluation study of the Arabic version of the London Measure of Unplanned Pregnancy (LMUP). To evaluate the psychometric properties of the LMUP, we conducted a self-administered online survey among 796 ever-married Saudi women aged 20-49 years, and a re-test survey among 24 women. The psychometric properties evaluated included content validity measured by content validity index (CVI), structural validity assessed by exploratory factor analysis (EFA), substantive validity assessed by hypothesis testing, contextual stability for the test-retest assessed by weighted Kappa, and internal consistency assessed by Cronbach's alpha. The psychometric analysis of the Arabic version of LMUP exhibited valid and reliable properties. The CVIs for individual items and at the scale level were >0.7. EFA confirmed a unidimensional extraction of the scale item. Hypothesis testing confirmed expected associations. The tool was stable with weighted kappa = 0.78 and Cronbach's alpha = 0.88. In this study, the validity and reliability of the Arabic version of the LMUP were confirmed according to well-known psychometric criteria. This LMUP version can be used in research studies among Arabic-speaking women to measure unplanned pregnancy and investigate correlates and outcomes related to unplanned pregnancy.
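
    The internal-consistency statistic reported above, Cronbach's alpha, can be illustrated with a minimal sketch. The function name and the response matrix below are hypothetical and are not taken from the study; the sketch uses the standard formula alpha = k/(k-1) * (1 - sum of item variances / variance of total scores).

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondents, each a list of item scores.

    Uses sample variances (n - 1 denominator) for items and for total scores.
    """
    k = len(scores[0])  # number of items in the scale
    if k < 2 or len(scores) < 2:
        raise ValueError("need at least two items and two respondents")
    item_columns = list(zip(*scores))  # transpose: one tuple per item
    sum_item_var = sum(variance(col) for col in item_columns)
    total_var = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - sum_item_var / total_var)


if __name__ == "__main__":
    # Hypothetical responses: 5 respondents x 3 items (illustration only).
    responses = [
        [2, 2, 1],
        [0, 1, 0],
        [1, 1, 1],
        [2, 1, 2],
        [0, 0, 1],
    ]
    print(f"alpha = {cronbach_alpha(responses):.2f}")
```

    Values closer to 1 indicate that the items vary together; the threshold for an acceptable alpha is a convention chosen by the analyst, not something this sketch decides.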

  19. Test Series 2.4: detailed test plan

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Not Available

    Test Series 2.4 comprises the fourth sub-series of tests to be scheduled as a part of Test Series 2, the second stage of the combustion research program to be carried out at the Grimethorpe Experimental Pressurized Fluidized Bed Combustion Facility. Test Series 2.1, the first sub-series of tests, was completed in February 1983, and the first part of the second sub-series, Test Series 2.3, in October 1983. Test Series 2.2 was completed in February 1984 after which the second part of Test Series 2.3 commenced. The Plan for Test Series 2.4 consists of 350 data gathering hours to be completed within 520 coal burning hours. This document provides a brief description of the Facility and modifications which have been made following the completion of Test Series 2.1. No further modifications were made following the completion of the first part of Test Series 2.3 or Test Series 2.2. The operating requirements for Test Series 2.4 are specified. The tests will be performed using a UK coal (Lady Windsor), and a UK limestone (Middleton) both nominated by the FRG. Seven objectives are proposed which are to be fulfilled by thirteen test conditions. Six part load tests based on input supplied by Kraftwerk Union AG are included. The cascade is expected to be on line for each test condition and total cascade exposure is expected to be in excess of 450 hours. Details of sampling and special measurements are given. A test plan schedule envisages the full test series being completed within a two month calendar period. Finally, a number of contingency strategies are proposed. 3 figures, 14 tables.

  20. Validation studies and proficiency testing.

    PubMed

    Anklam, Elke; Heinze, Petra; Kay, Simon; Van den Eede, Guy; Popping, Bert

    2002-01-01

    Genetically modified organisms (GMOs) entered the European food market in 1996. Current legislation demands the labeling of food products if they contain >1% GMO, as assessed for each ingredient of the product. To create confidence in the testing methods and to complement enforcement requirements, there is an urgent need for internationally validated methods, which could serve as reference methods. To date, several methods have been submitted to validation trials at an international level; approaches now exist that can be used in different circumstances and for different food matrixes. Moreover, the requirement for the formal validation of methods is clearly accepted; several national and international bodies are active in organizing studies. Further validation studies, especially on the quantitative polymerase chain reaction methods, need to be performed to cover the rising demand for new extraction methods and other background matrixes, as well as for novel GMO constructs.

  1. 40 CFR 86.1341-90 - Test cycle validation criteria.

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    ... 40 Protection of Environment 19 2011-07-01 2011-07-01 false Test cycle validation criteria. 86... Procedures § 86.1341-90 Test cycle validation criteria. (a) To minimize the biasing effect of the time lag... brake horsepower-hour. (c) Regression line analysis to calculate validation statistics. (1) Linear...

  2. 40 CFR 86.1341-90 - Test cycle validation criteria.

    Code of Federal Regulations, 2013 CFR

    2013-07-01

    ... 40 Protection of Environment 20 2013-07-01 2013-07-01 false Test cycle validation criteria. 86... Procedures § 86.1341-90 Test cycle validation criteria. (a) To minimize the biasing effect of the time lag... brake horsepower-hour. (c) Regression line analysis to calculate validation statistics. (1) Linear...

  3. 40 CFR 86.1341-90 - Test cycle validation criteria.

    Code of Federal Regulations, 2012 CFR

    2012-07-01

    ... 40 Protection of Environment 20 2012-07-01 2012-07-01 false Test cycle validation criteria. 86... Procedures § 86.1341-90 Test cycle validation criteria. (a) To minimize the biasing effect of the time lag... brake horsepower-hour. (c) Regression line analysis to calculate validation statistics. (1) Linear...

  4. PDCI Wide-Area Damping Control: PSLF Simulations of the 2016 Open and Closed Loop Test Plan

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Wilches Bernal, Felipe; Pierre, Brian Joseph; Elliott, Ryan Thomas

    To demonstrate and validate the performance of the wide-area damping control system, the project plans to conduct closed-loop tests on the PDCI in summer/fall 2016. A test plan details the open and closed loop tests to be conducted on the PDCI using the wide-area damping control system. To ensure the appropriate level of preparedness, simulations were performed in order to predict and evaluate any possible unsafe operations before hardware experiments are attempted. This report contains the results from these simulations using the power system dynamics software PSLF (Power System Load Flow, trademark of GE). The simulations use the WECC (Western Electricity Coordinating Council) 2016 light summer and heavy summer base cases.

  5. Verification and Validation Plan for Flight Performance Requirements on the CEV Parachute Assembly System

    NASA Technical Reports Server (NTRS)

    Morris, Aaron L.; Olson, Leah M.

    2011-01-01

    The Crew Exploration Vehicle Parachute Assembly System (CPAS) is engaged in a multi-year design and test campaign aimed at qualifying a parachute recovery system for human use on the Orion Spacecraft. Orion has parachute flight performance requirements that will ultimately be verified through the use of Monte Carlo multi-degree of freedom flight simulations. These simulations will be anchored by real world flight test data and iteratively improved to provide a closer approximation to the real physics observed in the inherently chaotic inflation and steady state flight of the CPAS parachutes. This paper will examine the processes necessary to verify the flight performance requirements of the human rated spacecraft. The focus will be on the requirements verification and model validation planned on CPAS.

  6. 40 CFR 86.1341-98 - Test cycle validation criteria.

    Code of Federal Regulations, 2012 CFR

    2012-07-01

    ... 40 Protection of Environment 20 2012-07-01 2012-07-01 false Test cycle validation criteria. 86... Procedures § 86.1341-98 Test cycle validation criteria. Section 86.1341-98 includes text that specifies...-90 (d)(4), shall be excluded from both cycle validation and the integrated work used for emissions...

  7. 40 CFR 86.1341-98 - Test cycle validation criteria.

    Code of Federal Regulations, 2013 CFR

    2013-07-01

    ... 40 Protection of Environment 20 2013-07-01 2013-07-01 false Test cycle validation criteria. 86... Procedures § 86.1341-98 Test cycle validation criteria. Section 86.1341-98 includes text that specifies...-90 (d)(4), shall be excluded from both cycle validation and the integrated work used for emissions...

  8. 40 CFR 86.1341-98 - Test cycle validation criteria.

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    ... 40 Protection of Environment 19 2011-07-01 2011-07-01 false Test cycle validation criteria. 86... Procedures § 86.1341-98 Test cycle validation criteria. Section 86.1341-98 includes text that specifies...-90 (d)(4), shall be excluded from both cycle validation and the integrated work used for emissions...

  9. Students' Initial Knowledge State and Test Design: Towards a Valid and Reliable Test Instrument

    ERIC Educational Resources Information Center

    CoPo, Antonio Roland I.

    2015-01-01

    Designing a good test instrument involves specifications, test construction, validation, try-out, analysis and revision. The initial knowledge state of forty (40) tertiary students enrolled in a Business Statistics course was determined, and the same test instrument underwent validation. The designed test instrument did not only reveal the baseline…

  10. Empirical Tests of Acceptance Sampling Plans

    NASA Technical Reports Server (NTRS)

    White, K. Preston, Jr.; Johnson, Kenneth L.

    2012-01-01

    Acceptance sampling is a quality control procedure applied as an alternative to 100% inspection. A random sample of items is drawn from a lot to determine the fraction of items which have a required quality characteristic. Both the number of items to be inspected and the criterion for determining conformance of the lot to the requirement are given by an appropriate sampling plan with specified risks of Type I and Type II sampling errors. In this paper, we present the results of empirical tests of the accuracy of selected sampling plans reported in the literature. These plans are for measurable quality characteristics which are known to have either binomial, exponential, normal, gamma, Weibull, inverse Gaussian, or Poisson distributions. In the main, results support the accepted wisdom that variables acceptance plans are superior to attributes (binomial) acceptance plans, in the sense that these provide comparable protection against risks at reduced sampling cost. For the Gaussian and Weibull plans, however, there are ranges of the shape parameters for which the required sample sizes are in fact larger than the corresponding attributes plans, dramatically so for instances of large skew. Tests further confirm that the published inverse-Gaussian (IG) plan is flawed, as reported by White and Johnson (2011).
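
    For the attributes (binomial) case described above, the two sampling risks can be computed directly. The following is a minimal sketch, not the authors' procedure: the plan parameters (sample size n, acceptance number c) and the two quality levels are hypothetical values chosen for illustration, and the lot is assumed large enough for the binomial model to hold.

```python
from math import comb


def accept_probability(n, c, p):
    """Probability of accepting a lot: at most c nonconforming items in a
    random sample of n, when each item is nonconforming with probability p."""
    upper = min(c, n)  # cannot observe more nonconforming items than sampled
    return sum(comb(n, d) * p**d * (1 - p) ** (n - d) for d in range(upper + 1))


def plan_risks(n, c, good_level, bad_level):
    """Return (Type I risk, Type II risk) for an attributes plan.

    Type I (producer's) risk: rejecting a lot at the acceptable quality level.
    Type II (consumer's) risk: accepting a lot at the rejectable quality level.
    """
    type_1 = 1.0 - accept_probability(n, c, good_level)
    type_2 = accept_probability(n, c, bad_level)
    return type_1, type_2


if __name__ == "__main__":
    # Hypothetical plan: inspect 50 items, accept the lot if 2 or fewer fail.
    alpha, beta = plan_risks(n=50, c=2, good_level=0.01, bad_level=0.10)
    print(f"Type I risk = {alpha:.4f}, Type II risk = {beta:.4f}")
```

    Sweeping p from 0 to 1 in accept_probability traces the plan's operating characteristic curve, which is one way to compare candidate plans at a fixed sample size.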

  11. Development and Validation of an Algorithm to Identify Planned Readmissions From Claims Data.

    PubMed

    Horwitz, Leora I; Grady, Jacqueline N; Cohen, Dorothy B; Lin, Zhenqiu; Volpe, Mark; Ngo, Chi K; Masica, Andrew L; Long, Theodore; Wang, Jessica; Keenan, Megan; Montague, Julia; Suter, Lisa G; Ross, Joseph S; Drye, Elizabeth E; Krumholz, Harlan M; Bernheim, Susannah M

    2015-10-01

    It is desirable not to include planned readmissions in readmission measures because they represent deliberate, scheduled care. To develop an algorithm to identify planned readmissions, describe its performance characteristics, and identify improvements. Consensus-driven algorithm development and chart review validation study at 7 acute-care hospitals in 2 health systems. For development, all discharges qualifying for the publicly reported hospital-wide readmission measure. For validation, all qualifying same-hospital readmissions that were characterized by the algorithm as planned, and a random sampling of same-hospital readmissions that were characterized as unplanned. We calculated weighted sensitivity and specificity, and positive and negative predictive values of the algorithm (version 2.1), compared to gold standard chart review. In consultation with 27 experts, we developed an algorithm that characterizes 7.8% of readmissions as planned. For validation we reviewed 634 readmissions. The weighted sensitivity of the algorithm was 45.1% overall, 50.9% in large teaching centers and 40.2% in smaller community hospitals. The weighted specificity was 95.9%, positive predictive value was 51.6%, and negative predictive value was 94.7%. We identified 4 minor changes to improve algorithm performance. The revised algorithm had a weighted sensitivity 49.8% (57.1% at large hospitals), weighted specificity 96.5%, positive predictive value 58.7%, and negative predictive value 94.5%. Positive predictive value was poor for the 2 most common potentially planned procedures: diagnostic cardiac catheterization (25%) and procedures involving cardiac devices (33%). An administrative claims-based algorithm to identify planned readmissions is feasible and can facilitate public reporting of primarily unplanned readmissions. © 2015 Society of Hospital Medicine.
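
    The four performance measures quoted above follow from a 2x2 comparison of the algorithm against chart review. The sketch below shows the unweighted definitions only; the study reports weighted values because unplanned readmissions were sampled, and that weighting is not reproduced here. The counts are hypothetical.

```python
def classification_metrics(tp, fp, fn, tn):
    """Sensitivity, specificity, PPV and NPV from confusion-matrix counts.

    tp: flagged planned by the algorithm and planned on chart review
    fp: flagged planned by the algorithm but unplanned on chart review
    fn: flagged unplanned by the algorithm but planned on chart review
    tn: flagged unplanned by the algorithm and unplanned on chart review
    """
    return {
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
        "ppv": tp / (tp + fp),
        "npv": tn / (tn + fn),
    }


if __name__ == "__main__":
    # Hypothetical counts, for illustration only (not the study's data).
    for name, value in classification_metrics(tp=45, fp=10, fn=55, tn=90).items():
        print(f"{name}: {value:.3f}")
```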

  12. The Teenage Nonviolence Test: Concurrent and Discriminant Validity.

    ERIC Educational Resources Information Center

    Konen, Kristopher; Mayton, Daniel M., II; Delva, Zenita; Sonnen, Melinda; Dahl, William; Montgomery, Richard

    This study was designed to document the validity of the Teenage Nonviolence Test (TNT). In this study the concurrent validity of the TNT in various ways, the validity of the TNT using known groups, and the discriminant validity of the TNT by evaluating its relationships with other psychological constructs were assessed. The results showed that the…

  13. SeaSat-A Satellite Scatterometer (SASS) Validation and Experiment Plan

    NASA Technical Reports Server (NTRS)

    Schroeder, L. C. (Editor)

    1978-01-01

    This plan was generated by the SeaSat-A satellite scatterometer experiment team to define the pre- and post-launch activities necessary to conduct sensor validation and geophysical evaluation. Details included are an instrument and experiment description/performance requirements, success criteria, constraints, mission requirements, data processing requirement and data analysis responsibilities.

  14. Overview of the Exploration Exercise Device Validation Study Plans

    NASA Technical Reports Server (NTRS)

    DeWitt, J. K.; Swan, B. G.

    2018-01-01

    NASA has determined that a multi-functional exercise device will be developed for use during exploration missions. The device will allow for full body resistance and metabolic exercise necessary to minimize physiological losses during space flight and to maintain fitness necessary to perform critical mission tasks. Prior to implementation as an exercise device on an Exploration vehicle, there will be verification and validation testing completed to determine device efficacy at providing the necessary training stimuli to achieve desired goals. Because the exploration device will be a new device that has yet to be specified, specific Verification and Validation (V&V) protocols have yet to be developed. Upon delivery of an exploration exercise device training unit, stakeholders throughout NASA will develop V&V plans that include ground-based testing and testing on the International Space Station (ISS). Stakeholders will develop test protocols that include success criteria for the device. Ground tests will occur at NASA Johnson Space Center prior to flight testing. The intent of the ground tests is to allow crew, spaceflight medicine, science, engineering, Astronaut Strength, Conditioning, and Reconditioning staff, and others to gain experience in the best utilization of the device. The goal is to obtain an evidence base for recommending use of the device on the ISS. The developed protocol will be created to achieve multiple objectives, including determining if the device provides an adequate training stimulus for 5th - 95th percentile males and females, allows for exercise modalities that protect functional capability, and is robust and can withstand extensive human use. Although protocols are yet to be determined, current expectations include use of the device by test subjects and current crew in order to obtain quantitative and qualitative feedback. Information obtained during the ground tests may be used to influence device modifications

  15. The Hyper-X Flight Systems Validation Program

    NASA Technical Reports Server (NTRS)

    Redifer, Matthew; Lin, Yohan; Bessent, Courtney Amos; Barklow, Carole

    2007-01-01

    For the Hyper-X/X-43A program, the development of a comprehensive validation test plan played an integral part in the success of the mission. The goal was to demonstrate hypersonic propulsion technologies by flight testing an airframe-integrated scramjet engine. Preparation for flight involved both verification and validation testing. By definition, verification is the process of assuring that the product meets design requirements; whereas validation is the process of assuring that the design meets mission requirements for the intended environment. This report presents an overview of the program with emphasis on the validation efforts. It includes topics such as hardware-in-the-loop, failure modes and effects, aircraft-in-the-loop, plugs-out, power characterization, antenna pattern, integration, combined systems, captive carry, and flight testing. Where applicable, test results are also discussed. The report provides a brief description of the flight systems onboard the X-43A research vehicle and an introduction to the ground support equipment required to execute the validation plan. The intent is to provide validation concepts that are applicable to current, follow-on, and next generation vehicles that share the hybrid spacecraft and aircraft characteristics of the Hyper-X vehicle.

  16. Dynamic testing in schizophrenia: does training change the construct validity of a test?

    PubMed

    Wiedl, Karl H; Schöttke, Henning; Green, Michael F; Nuechterlein, Keith H

    2004-01-01

    Dynamic testing typically involves specific interventions for a test to assess the extent to which test performance can be modified, beyond level of baseline (static) performance. This study used a dynamic version of the Wisconsin Card Sorting Test (WCST) that is based on cognitive remediation techniques within a test-training-test procedure. From results of previous studies with schizophrenia patients, we concluded that the dynamic and static versions of the WCST should have different construct validity. This hypothesis was tested by examining the patterns of correlations with measures of executive functioning, secondary verbal memory, and verbal intelligence. Results demonstrated a specific construct validity of WCST dynamic (i.e., posttest) scores as an index of problem solving (Tower of Hanoi) and secondary verbal memory and learning (Auditory Verbal Learning Test), whereas the impact of general verbal capacity and selective attention (Verbal IQ, Stroop Test) was reduced. It is concluded that the construct validity of the test changes with dynamic administration and that this difference helps to explain why the dynamic version of the WCST predicts functional outcome better than the static version.

  17. Test/QA plan for the validation of the verification protocol for high speed pesticide spray drift reduction technologies for row and field crops

    EPA Science Inventory

    This test/QA plan for evaluating the generic test protocol for high speed wind tunnel, representing aerial application, pesticide spray drift reduction technologies (DRT) for row and field crops is in conformance with EPA Requirements for Quality Assurance Project Plans (EPA QA/R...

  18. Test/QA plan for the validation of the verification protocol for low speed pesticide spray drift reduction technologies for row and field crops

    EPA Science Inventory

    This test/QA plan for evaluating the generic test protocol for high speed wind tunnel, representing aerial application, pesticide spray drift reduction technologies (DRT) for row and field crops is in conformance with EPA Requirements for Quality Assurance Project Plans (EPA QA/R...

  19. Solar Dynamics Observatory On-Orbit Jitter Testing, Analysis, and Mitigation Plans

    NASA Technical Reports Server (NTRS)

    Liu, Kuo-Chia (Alice); Blaurock, Carl A.; Bourkland, Kristin L.; Morgenstern, Wendy M.; Maghami, Peiman G.

    2011-01-01

    The Solar Dynamics Observatory (SDO) was designed to understand the Sun and the Sun's influence on Earth. SDO was launched on February 11, 2010 carrying three scientific instruments: the Atmospheric Imaging Assembly (AIA), the Helioseismic and Magnetic Imager (HMI), and the Extreme Ultraviolet Variability Experiment (EVE). Both AIA and HMI are sensitive to high frequency pointing perturbations and have sub-arcsecond level line-of-sight (LOS) jitter requirements. Extensive modeling and analysis efforts were directed in estimating the amount of jitter disturbing the science instruments. To verify the disturbance models and to validate the jitter performance prior to launch, many jitter-critical components and subassemblies were tested either by the mechanism vendors or at the NASA Goddard Space Flight Center (GSFC). Although detailed analysis and assembly level tests were performed to obtain good jitter predictions, there were still several sources of uncertainties in the system. The structural finite element model did not have all the modes correlated to test data at high frequencies (greater than 50 Hz). The performance of the instrument stabilization system was not known exactly but was expected to be close to the analytical model. A true disturbance-to-LOS observatory level test was not available due to the tight schedule of the flight spacecraft, the cost in time and manpower, difficulties in creating gravity negation systems, and risks of damaging flight hardware. To protect the observatory jitter performance against model uncertainties, the SDO jitter team devised several on-orbit jitter reduction plans in addition to reserve margins on analysis results. Since some of these plans severely restricted the capabilities of several spacecraft components (e.g. wheels and High Gain Antennas), the SDO team performed on-orbit jitter tests to determine which jitter reduction plans, if any, were necessary to satisfy science LOS jitter requirements. The SDO on

  20. Embedded performance validity testing in neuropsychological assessment: Potential clinical tools.

    PubMed

    Rickards, Tyler A; Cranston, Christopher C; Touradji, Pegah; Bechtold, Kathleen T

    2018-01-01

    The article aims to suggest clinically-useful tools in neuropsychological assessment for efficient use of embedded measures of performance validity. To accomplish this, we integrated available validity-related and statistical research from the literature, consensus statements, and survey-based data from practicing neuropsychologists. We provide recommendations for use of 1) Cutoffs for embedded performance validity tests including Reliable Digit Span, California Verbal Learning Test (Second Edition) Forced Choice Recognition, Rey-Osterrieth Complex Figure Test Combination Score, Wisconsin Card Sorting Test Failure to Maintain Set, and the Finger Tapping Test; 2) Selecting number of performance validity measures to administer in an assessment; and 3) Hypothetical clinical decision-making models for use of performance validity testing in a neuropsychological assessment collectively considering behavior, patient reporting, and data indicating invalid or noncredible performance. Performance validity testing helps inform the clinician about an individual's general approach to tasks: response to failure, task engagement and persistence, compliance with task demands. Data-driven clinical suggestions provide a resource to clinicians and to instigate conversation within the field to make more uniform, testable decisions to further the discussion, and guide future research in this area.

  1. 40 CFR 86.1341-98 - Test cycle validation criteria.

    Code of Federal Regulations, 2010 CFR

    2010-07-01

    ... 40 Protection of Environment 19 2010-07-01 2010-07-01 false Test cycle validation criteria. 86...) Emission Regulations for New Otto-Cycle and Diesel Heavy-Duty Engines; Gaseous and Particulate Exhaust Test Procedures § 86.1341-98 Test cycle validation criteria. Section 86.1341-98 includes text that specifies...

  2. Methodology for testing and validating knowledge bases

    NASA Technical Reports Server (NTRS)

    Krishnamurthy, C.; Padalkar, S.; Sztipanovits, J.; Purves, B. R.

    1987-01-01

    A test and validation toolset developed for artificial intelligence programs is described. The basic premises of this method are: (1) knowledge bases have a strongly declarative character and represent mostly structural information about different domains, (2) the conditions for integrity, consistency, and correctness can be transformed into structural properties of knowledge bases, and (3) structural information and structural properties can be uniformly represented by graphs and checked by graph algorithms. The interactive test and validation environment has been implemented on a SUN workstation.
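
    As an illustration of premise (3), one structural property that a graph algorithm can check is acyclicity of a dependency relation. The sketch below is an assumption made for illustration, not the toolset's actual check: the relation, the component names, and the function name are all hypothetical.

```python
def has_cycle(edges):
    """Return True if the directed graph given as (source, target) pairs
    contains a cycle, using depth-first search with three node states."""
    graph = {}
    for src, dst in edges:
        graph.setdefault(src, []).append(dst)
        graph.setdefault(dst, [])

    UNSEEN, IN_PROGRESS, DONE = 0, 1, 2
    state = {node: UNSEEN for node in graph}

    def visit(node):
        state[node] = IN_PROGRESS
        for nxt in graph[node]:
            if state[nxt] == IN_PROGRESS:  # back edge closes a cycle
                return True
            if state[nxt] == UNSEEN and visit(nxt):
                return True
        state[node] = DONE
        return False

    return any(state[node] == UNSEEN and visit(node) for node in graph)


if __name__ == "__main__":
    # Hypothetical "depends-on" facts from a small knowledge base.
    facts = [("pump", "valve"), ("valve", "sensor"), ("sensor", "pump")]
    print("circular dependency found" if has_cycle(facts) else "no cycles")
```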

  3. Initiating the Validation of CCIM Processability for Multi-phase all Ceramic (SYNROC) HLW Form: Plan for Test BFY14CCIM-C

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Maio, Vince

    This plan covers test BFY14CCIM-C which will be a first-of-its-kind demonstration for the complete non-radioactive surrogate production of multi-phase ceramic (SYNROC) High Level Waste Forms (HLW) using Cold Crucible Induction Melting (CCIM) Technology. The test will occur in the Idaho National Laboratory’s (INL) CCIM Pilot Plant and is tentatively scheduled for the week of September 15, 2014. The purpose of the test is to begin collecting qualitative data for validating the ceramic HLW form processability advantages using CCIM technology, as opposed to existing ceramic-lined Joule Heated Melters (JHM) currently producing BSG HLW forms. The major objectives of BFY14CCIM-C are to complete crystalline melt initiation with a new joule-heated resistive starter ring, sustain inductive melting at temperatures between 1600 and 1700°C for two different relatively high conductive materials representative of the SYNROC ceramic formation inclusive of a HLW surrogate, complete melter tapping and pouring of molten ceramic material into a preheated 4 inch graphite canister and a similar canister at room temperature. Other goals include assessing the performance of a new crucible specially designed to accommodate the tapping and pouring of pure crystalline forms in contrast to less recalcitrant amorphous glass, assessing the overall operational effectiveness of melt initiation using a resistive starter ring with a dedicated power source, and observing the tapped molten flow and subsequent relatively quick crystallization behavior in pans with areas identical to standard HLW disposal canisters. Surrogate waste compositions with ceramic SYNROC forming additives and their measured properties for inductive melting, testing parameters, pre-test conditions and modifications, data collection requirements, and sampling/post-demonstration analysis requirements for the produced forms are provided and defined.

  4. 40 CFR 86.1341-90 - Test cycle validation criteria.

    Code of Federal Regulations, 2010 CFR

    2010-07-01

    ... 40 Protection of Environment 19 2010-07-01 2010-07-01 false Test cycle validation criteria. 86...) Emission Regulations for New Otto-Cycle and Diesel Heavy-Duty Engines; Gaseous and Particulate Exhaust Test Procedures § 86.1341-90 Test cycle validation criteria. (a) To minimize the biasing effect of the time lag...

  5. The validation index: a new metric for validation of segmentation algorithms using two or more expert outlines with application to radiotherapy planning.

    PubMed

    Juneja, Prabhjot; Evans, Philip M; Harris, Emma J

    2013-08-01

    Validation is required to ensure automated segmentation algorithms are suitable for radiotherapy target definition. In the absence of true segmentation, algorithmic segmentation is validated against expert outlining of the region of interest. Multiple experts are used to overcome inter-expert variability. Several approaches have been studied in the literature, but the most appropriate approach to combine the information from multiple expert outlines, to give a single metric for validation, is unclear. None consider a metric that can be tailored to case-specific requirements in radiotherapy planning. Validation index (VI), a new validation metric which uses experts' level of agreement was developed. A control parameter was introduced for the validation of segmentations required for different radiotherapy scenarios: for targets close to organs-at-risk and for difficult to discern targets, where large variation between experts is expected. VI was evaluated using two simulated idealized cases and data from two clinical studies. VI was compared with the commonly used Dice similarity coefficient (DSC pair-wise) and found to be more sensitive than the DSC pair-wise to the changes in agreement between experts. VI was shown to be adaptable to specific radiotherapy planning scenarios.
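
    The comparison metric named above, the Dice similarity coefficient, is DSC = 2|A ∩ B| / (|A| + |B|) for two segmentations A and B. The sketch below shows that standard definition and one plausible reading of the pair-wise variant, namely averaging the DSC of the algorithm's outline against each expert's outline; that averaging is an assumption, since the abstract does not define it. The VI itself is not reproduced. Segmentations are represented as sets of hypothetical pixel coordinates.

```python
def dice(a, b):
    """Dice similarity coefficient between two segmentations given as
    collections of pixel (or voxel) identifiers."""
    a, b = set(a), set(b)
    if not a and not b:
        return 1.0  # two empty segmentations agree trivially
    return 2 * len(a & b) / (len(a) + len(b))


def mean_pairwise_dice(candidate, expert_outlines):
    """Average DSC of a candidate segmentation against each expert outline.

    Assumed interpretation of a pair-wise DSC; not taken from the paper.
    """
    if not expert_outlines:
        raise ValueError("at least one expert outline is required")
    return sum(dice(candidate, e) for e in expert_outlines) / len(expert_outlines)


if __name__ == "__main__":
    # Hypothetical pixel sets for an algorithm and two experts.
    algorithm = {(0, 0), (0, 1), (1, 0), (1, 1)}
    experts = [
        {(0, 0), (0, 1), (1, 0)},
        {(0, 1), (1, 1), (2, 1)},
    ]
    print(f"mean pair-wise DSC = {mean_pairwise_dice(algorithm, experts):.3f}")
```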

  6. Joint Test Report for Validation of Alternative Low-Emission Surface Preparation/Depainting Technologies for Structural Steel

    NASA Technical Reports Server (NTRS)

    Lewis, Pattie

    2007-01-01

    Headquarters National Aeronautics and Space Administration (NASA) chartered the NASA Acquisition Pollution Prevention (AP2) Office to coordinate agency activities affecting pollution prevention issues identified during system and component acquisition and sustainment processes. The primary objectives of the AP2 Office are to: (1) Reduce or eliminate the use of hazardous materials or hazardous processes at manufacturing, remanufacturing, and sustainment locations. (2) Avoid duplication of effort in actions required to reduce or eliminate hazardous materials through joint center cooperation and technology sharing. The objective of this project was to qualify candidate alternative Low-Emission Surface Preparation/Depainting Technologies for Structural Steel applications at NASA facilities. This project compares the surface preparation/depainting performance of the proposed alternatives to existing surface preparation/depainting systems or standards. This Joint Test Report (JTR) contains the results of testing as per the outlines of the Joint Test Protocol (JTP), Joint Test Protocol for Validation of Alternative Low-Emission Surface Preparation/Depainting Technologies for Structural Steel, and the Field Test Plan (FTP), Field Evaluations Test Plan for Validation of Alternative Low-Emission Surface Preparation/Depainting Technologies for Structural Steel, for critical requirements and tests necessary to qualify alternatives for coating removal systems. These tests were derived from engineering, performance, and operational impact (supportability) requirements defined by a consensus of government and industry participants. This JTR documents the results of the testing as well as any test modifications made during the execution of the project. This JTR is made available as a reference for future pollution prevention endeavors by other NASA Centers, the Department of Defense and commercial users to minimize duplication of effort. The current coating removal processes

  7. Validating a Spanish Developmental Spelling Test.

    ERIC Educational Resources Information Center

    Ferroli, Lou; Krajenta, Marilyn

    The creation and validation of a Spanish version of an English developmental spelling test (DST) is described. An introductory section reviews related literature on the rationale for and construction of DSTs, spelling development in the early grades, and Spanish-English bilingual education. Differences between the English and Spanish test versions…

  8. Energy Systems Test Area (ESTA) Electrical Power Systems Test Operations: User Test Planning Guide

    NASA Technical Reports Server (NTRS)

    Salinas, Michael J.

    2012-01-01

    Test process, milestones and inputs are unknowns to first-time users of the ESTA Electrical Power Systems Test Laboratory. The User Test Planning Guide aids in establishing expectations for both NASA and non-NASA facility customers. The potential audience for this guide includes both internal and commercial spaceflight hardware/software developers. It is intended to assist their test engineering personnel in test planning and execution. Material covered includes a roadmap of the test process, roles and responsibilities of facility and user, major milestones, facility capabilities, and inputs required by the facility. Samples of deliverables, test article interfaces, and inputs necessary to define test scope, cost, and schedule are included as an appendix to the guide.

  9. Fabrication Control Plan for ORNL RH-LOCA ATF Test Specimens to be Irradiated in the ATR

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Field, Kevin G.; Howard, Richard; Teague, Michael

    2014-06-01

    The purpose of this fabrication plan is (1) to summarize the design of a set of rodlets that will be fabricated and then irradiated in the Advanced Test Reactor (ATR) and (2) to provide requirements for fabrication and acceptance criteria for inspections of the Light Water Reactor (LWR) – Accident Tolerant Fuels (ATF) rodlet components. The functional and operational (F&OR) requirements for the ATF program are identified in the ATF Test Plan. The scope of this document only covers fabrication and inspections of rodlet components detailed in drawings 604496 and 604497. It does not cover the assembly of these items to form a completed test irradiation assembly or the inspection of the final assembly, which will be included in a separate INL final test assembly specification/inspection document. The controls support the requirements that the test irradiations must be performed safely and that subsequent examinations must provide valid results.

  10. Testing and Validating Gadget2 for GPUs

    NASA Astrophysics Data System (ADS)

    Wibking, Benjamin; Holley-Bockelmann, K.; Berlind, A. A.

    2013-01-01

    We are currently upgrading a version of Gadget2 (Springel et al., 2005) that is optimized for NVIDIA's CUDA GPU architecture (Frigaard, unpublished) to work with the latest libraries and graphics cards. Preliminary tests of its performance indicate a ~40x speedup in the particle force tree approximation calculation, with overall speedup of 5-10x for cosmological simulations run with GPUs compared to running on the same CPU cores without GPU acceleration. We believe this speedup can be reasonably increased by an additional factor of two with further optimization, including overlap of computation on CPU and GPU. Tests of single-precision GPU numerical fidelity currently indicate accuracy of the mass function and the spectral power density to within a few percent of extended-precision CPU results with the unmodified form of Gadget. Additionally, we plan to test and optimize the GPU code for Millennium-scale "grand challenge" simulations of >10^9 particles, a scale that has been previously untested with this code, with the aid of the NSF XSEDE flagship GPU-based supercomputing cluster codenamed "Keeneland." Current work involves additional validation of numerical results, extending the numerical precision of the GPU calculations to double precision, and evaluating performance/accuracy tradeoffs. We believe that this project, if successful, will yield substantial computational performance benefits to the N-body research community as the next generation of GPU supercomputing resources becomes available, both increasing the electrical power efficiency of ever-larger computations (making simulations possible a decade from now at scales and resolutions unavailable today) and accelerating the pace of research in the field.
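
    The gap between the ~40x kernel speedup and the 5-10x overall speedup quoted above is what Amdahl's law would predict if only part of the runtime is accelerated. The sketch below works through that arithmetic under a simplifying assumption that is not stated in the abstract: the force tree calculation is the only accelerated portion and everything else runs at the original speed. The function names are hypothetical.

```python
def overall_speedup(accelerated_fraction, kernel_speedup):
    """Amdahl's law: overall speedup when a fraction of runtime is accelerated."""
    if not 0.0 <= accelerated_fraction <= 1.0:
        raise ValueError("accelerated_fraction must lie in [0, 1]")
    if kernel_speedup <= 0:
        raise ValueError("kernel_speedup must be positive")
    remaining = (1.0 - accelerated_fraction) + accelerated_fraction / kernel_speedup
    return 1.0 / remaining


def fraction_for_target(target_speedup, kernel_speedup):
    """Invert Amdahl's law: runtime fraction that must be accelerated."""
    return (1.0 - 1.0 / target_speedup) / (1.0 - 1.0 / kernel_speedup)


# Runtime fractions consistent with a 40x kernel and a 5x or 10x overall gain.
for target in (5.0, 10.0):
    frac = fraction_for_target(target, 40.0)
    print(f"overall {target:.0f}x needs about {frac:.1%} of runtime in the kernel")
```

    Under that assumption, the quoted range corresponds to the force calculation taking very roughly 82% to 92% of the original CPU runtime; overlapping CPU and GPU work, as the abstract proposes, falls outside this simple model.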

  11. Validation of a rapid conductimetric test for the measurement of wine tartaric stability.

    PubMed

    Bosso, Antonella; Motta, Silvia; Petrozziello, Maurizio; Guaita, Massimo; Asproudi, Andriani; Panero, Loretta

    2016-12-01

    This work was aimed at optimizing a rapid and reproducible conductivity test for the evaluation of wine tartaric stability, in order to improve the practices for the prevention of tartaric precipitations during bottle aging. The test consists in measuring the drop of conductivity in wines kept under stirring for a fixed time, at low temperature, after the addition of micronized potassium bitartrate crystals (KHT). An experimental design was planned to study three factors affecting the test: temperature, duration and dose of added potassium bitartrate. A standard protocol was defined to produce a micronized potassium bitartrate starting from available commercial products, since the dimensions of the crystals can affect the final conductivity values. After the choice of the best conditions the method was validated. Two different stability thresholds were defined for white wines and for red/rosé wines by comparing the results of the mini-contact test with those of the cold test. Copyright © 2016 Elsevier Ltd. All rights reserved.

  12. Advanced Materials Laboratory User Test Planning Guide

    NASA Technical Reports Server (NTRS)

    Orndoff, Evelyne

    2012-01-01

    Test process, milestones and inputs are unknowns to first-time users of the Advanced Materials Laboratory. The User Test Planning Guide aids in establishing expectations for both NASA and non-NASA facility customers. The potential audience for this guide includes both internal and commercial spaceflight hardware/software developers. It is intended to assist their test engineering personnel in test planning and execution. Material covered includes a roadmap of the test process, roles and responsibilities of facility and user, major milestones, facility capabilities, and inputs required by the facility. Samples of deliverables, test article interfaces, and inputs necessary to define test scope, cost, and schedule are included as an appendix to the guide.

  13. Crawler Acquisition and Testing Demonstration Project Management Plan

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    DEFIGH-PRICE, C.

    2000-10-23

    If the crawler based retrieval system is selected, this project management plan identifies the path forward for acquiring a crawler/track pump waste retrieval system, and completing sufficient testing to support deploying the crawler as part of a retrieval technology demonstration for Tank 241-C-104. In the balance of the document, these activities will be referred to as the Crawler Acquisition and Testing Demonstration. During recent Tri-Party Agreement negotiations, TPA milestones were proposed for a sludge/hard heel waste retrieval demonstration in tank C-104. Specifically one of the proposed milestones requires completion of a cold demonstration of sufficient scale to support final design and testing of the equipment (M-45-03G) by 6/30/2004. A crawler-based retrieval system was one of the two options evaluated during the pre-conceptual engineering for C-104 retrieval (RPP-6843 Rev. 0). The alternative technology procurement initiated by the Hanford Tanks Initiative (HTI) project, combined with the pre-conceptual engineering for C-104 retrieval provide an opportunity to achieve compliance with the proposed TPA milestone M-45-03H. This Crawler Acquisition and Testing Demonstration project management plan identifies the plans, organizational interfaces and responsibilities, management control systems, reporting systems, timeline and requirements for the acquisition and testing of the crawler based retrieval system. This project management plan is complementary to and supportive of the Project Management Plan for Retrieval of C-104 (RPP-6557). This project management plan focuses on utilizing and completing the efforts initiated under the Hanford Tanks Initiative (HTI) to acquire and cold test a commercial crawler based retrieval system. The crawler-based retrieval system will be purchased on a schedule to support design of the waste retrieval from tank C-104 (project W-523) and to meet the requirement of proposed TPA milestone M-45-03H. This Crawler

  14. Propulsion Ground Testing: Planning for the Future

    NASA Technical Reports Server (NTRS)

    Bruce, Robert

    2003-01-01

    Advanced planners are constantly being asked to plan for the provision of future test capability. Historically, this capability is provided either by substantial investment in new test facility capabilities, or in the substantial investment in the modification of pre-existing test capabilities. The key words in the previous sentence are "substantial investment". In the evolving environment of increasingly constrained resources, how is an advanced planner to plan for the provisions of such capabilities? Additionally, the conundrum exists that program formulation decisions are being made based upon both life cycle cost decisions in an environment in which the more immediate challenge of "front-end" capital investment oftentimes is the linchpin upon which early decisions are made. In such an environment, how are plans and decisions made? This paper cites examples of decisions made in the past in the area of both major test facility upgrades, as well as major new test facility investment.

  15. Propulsion Ground Testing: Planning for the Future

    NASA Technical Reports Server (NTRS)

    Bruce, Robert

    2003-01-01

    Advanced planners are constantly being asked to plan for the provision of future test capability. Historically, this capability is provided either by substantial investment in new test facility capabilities, or in the substantial investment in the modification of pre-existing test facilities. The key words in the previous sentence are 'substantial investment.' In the evolving environment of increasingly constrained resources, how is an advanced planner to plan for the provisions of such capabilities? Additionally, the conundrum exists that program formulation decisions are being made based on both life cycle cost decisions in an environment in which the more immediate challenge of front-end capital investment oftentimes is the linchpin upon which early decisions are made. In such an environment, how are plans and decisions made? This paper cites examples of decisions made in the past in the area of both major test facility upgrades, as well as major new test facility investment.

  16. Validation of a diabetes numeracy test in Arabic.

    PubMed

    Alghodaier, Hussah; Jradi, Hoda; Mohammad, Najwa Samantha; Bawazir, Amen

    2017-01-01

    The prevalence of diabetes mellitus in Saudi Arabia is 24%, ranking it among the top ten worldwide. Diabetes education focuses on self-management and relies on numeracy skills. Poor numeracy may go unrecognized and it is important to have an assessment tool in Arabic to measure such a skill in diabetes care. To validate a 15-item Diabetes Numeracy Test (DNT-15) in the Arabic language as a tool to assess the numeracy skills of patients with diabetes and to test its properties among Saudi patients with diabetes. A 15-question Arabic-language test to assess diabetes numeracy among patients with diabetes on the basis of the diabetes numeracy test (DNT-15) was validated among a sample of Arabic-speaking Saudi patients with diabetes. Data collection included patients' demographics, long-term glycemic control, diabetes type, duration, co-morbidities, and diabetes-related knowledge questions. Internal reliability was assessed using Kuder-Richardson Formula 20 (KR-20). The average score of the Arabic DNT-15 was 53.3%, and the test took an average of 30 minutes to complete. The scores significantly correlated with education, income, HbA1c, and diabetes knowledge (p<0.05). Content Validity Ratio (CVR) of 0.75 and Content Validity Index (CVI) of 0.89 supported good content validity. The Arabic DNT-15 also had good internal reliability (KR-20 = 0.90). Patients with diabetes need numeracy skills to manage their disease. Level of education does not reflect level of numeracy, and low numeracy skills might be unnoticed by health care providers. The Arabic DNT-15 is a valid and reliable scale to identify Arabic-speaking patients with difficulties in certain diabetes-related numeracy skills.
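
    For readers unfamiliar with the statistics named in this abstract, the sketch below shows how KR-20 and a Lawshe-style content validity ratio are commonly computed. It is a minimal illustration under stated assumptions: the function names, the four-respondent response matrix, and the eight-member expert panel are all hypothetical and are not the study's data, and the population variance of total scores is used, which is one of two conventions in circulation.

```python
def kr20(responses):
    """Kuder-Richardson Formula 20 for right/wrong (1/0) item responses.

    responses: one list per respondent, one 0/1 entry per item.
    """
    n = len(responses)
    k = len(responses[0])
    if k < 2:
        raise ValueError("KR-20 needs at least two items")
    totals = [sum(r) for r in responses]
    mean_total = sum(totals) / n
    # Population variance of total scores (assumed convention).
    var_total = sum((t - mean_total) ** 2 for t in totals) / n
    if var_total == 0:
        raise ValueError("total scores have zero variance")
    pq_sum = 0.0
    for i in range(k):
        p = sum(r[i] for r in responses) / n  # proportion answering item i correctly
        pq_sum += p * (1.0 - p)
    return (k / (k - 1)) * (1.0 - pq_sum / var_total)


def content_validity_ratio(n_essential, n_experts):
    """Lawshe's CVR for one item: (n_e - N/2) / (N/2)."""
    half = n_experts / 2.0
    return (n_essential - half) / half


# Hypothetical data: 4 respondents answering 3 items.
toy = [[1, 1, 1], [1, 1, 0], [1, 0, 0], [0, 0, 0]]
print(kr20(toy))
# Hypothetical panel: 7 of 8 experts rate an item "essential".
print(content_validity_ratio(7, 8))
```

    KR-20 rises toward 1 as items vary together across respondents, which is why a value such as the 0.90 reported above is read as good internal reliability.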

  17. Validation of the Sport Competition Anxiety Test.

    ERIC Educational Resources Information Center

    Cheatham, T.; Rosentswieg, J.

    1982-01-01

    Fifteen female varsity softball coaches were administered the Sport Competition Anxiety Test prior to competition. Their heart rates, continuously monitored by telemetry, did not relate significantly to the anxiety test data. The test does not appear to be a valid measure of trait anxiety for women softball coaches. (Author/PN)

  18. Educational testing validity and reliability in pharmacy and medical education literature.

    PubMed

    Hoover, Matthew J; Jung, Rose; Jacobs, David M; Peeters, Michael J

    2013-12-16

    To evaluate and compare the reliability and validity of educational testing reported in pharmacy education journals to medical education literature. Descriptions of validity evidence sources (content, construct, criterion, and reliability) were extracted from articles that reported educational testing of learners' knowledge, skills, and/or abilities. Using educational testing, the findings of 108 pharmacy education articles were compared to the findings of 198 medical education articles. For pharmacy educational testing, 14 articles (13%) reported more than 1 validity evidence source while 83 articles (77%) reported 1 validity evidence source and 11 articles (10%) did not have evidence. Among validity evidence sources, content validity was reported most frequently. Compared with pharmacy education literature, more medical education articles reported both validity and reliability (59%; p<0.001). While there were more scholarship of teaching and learning (SoTL) articles in pharmacy education compared to medical education, validity and reliability reporting was limited in the pharmacy education literature.

  19. Construct Validation of the Fairy Tale Test--Standardization Data.

    ERIC Educational Resources Information Center

    Coulacoglou, Carina

    2002-01-01

    Studied the construct validity of the Fairy Tale Test (C. Coulacoglu, 1993), a personality projective test for children, in a sample of 800 Greek children aged 8, 10, and 12. Factor analysis led to identification of eight primary factors, and correlations with other measures provide construct validity evidence. (SLD)

  20. NASA sea ice and snow validation plan for the Defense Meteorological Satellite Program special sensor microwave/imager

    NASA Technical Reports Server (NTRS)

    Cavalieri, Donald J. (Editor); Swift, Calvin T. (Editor)

    1987-01-01

    This document addresses the task of developing and executing a plan for validating the algorithm used for initial processing of sea ice data from the Special Sensor Microwave/Imager (SSMI). The document outlines a plan for monitoring the performance of the SSMI, for validating the derived sea ice parameters, and for providing quality data products before distribution to the research community. Because of recent advances in the application of passive microwave remote sensing to snow cover on land, the validation of snow algorithms is also addressed.

  1. Validating Test Score Meaning and Defending Test Score Use: Different Aims, Different Methods

    ERIC Educational Resources Information Center

    Cizek, Gregory J.

    2016-01-01

    Advances in validity theory and alacrity in validation practice have suffered because the term "validity" has been used to refer to two incompatible concerns: (1) the degree of support for specified interpretations of test scores (i.e. intended score meaning) and (2) the degree of support for specified applications (i.e. intended test…

  2. Design and Preliminary Testing Plan of Electronegative Ion Thruster

    NASA Technical Reports Server (NTRS)

    Schloeder, Natalie R.; Liu, Thomas M.; Walker, Mitchell L. R.; Polzin, Kurt A.; Dankanich, John W.; Aanesland, Ane

    2014-01-01

    Electronegative ion thrusters are a new iteration of existing gridded ion thruster technology differentiated by their ability to produce and accelerate both positive and negative ions. The primary motivations for electronegative ion thruster development include the elimination of lifetime-limiting cathodes from a thruster system and the ability to generate appreciable thrust through the acceleration of both positively and negatively charged ions. Proof-of-concept testing of the PEGASES (Plasma Propulsion with Electronegative GASES) thruster demonstrated the production of positively and negatively charged ions (argon and sulfur hexafluoride, respectively) in an RF discharge and the subsequent acceleration of each charge species through the application of a time-varying electric field to a pair of metallic grids similar to those found in gridded ion thrusters. Leveraging the knowledge gained through experiments with the PEGASES I and II prototypes, the MINT (Marshall's Ion-ioN Thruster) is being developed to provide a platform for additional electronegative thruster proof-of-concept validation testing including direct thrust measurements. The design criteria used in designing the MINT are outlined and the planned tests that will be used to characterize the performance of the prototype are described.

  3. Mars Exploration Rover Mission: Entry, Descent, and Landing System Validation

    NASA Technical Reports Server (NTRS)

    Mitcheltree, Robert A.; Lee, Wayne; Steltzner, Adam; SanMartin, Alejanhdro

    2004-01-01

    System validation for a Mars entry, descent, and landing system is not simply a demonstration that the electrical system functions in the associated environments. The function of this system is its interaction with the atmospheric and surface environment. Thus, in addition to traditional test-bed, hardware-in-the-loop, testing, a validation program that confirms the environmental interaction is required. Unfortunately, it is not possible to conduct a meaningful end-to-end test of a Mars landing system on Earth. The validation plan must be constructed from an interconnected combination of simulation, analysis and test. For the Mars Exploration Rover mission, this combination of activities and the logic of how they combined to the system's validation was explicitly stated, reviewed, and tracked as part of the development plan.

  4. Six-Degree-of-Freedom Dynamic Test System (SDTS) User Test Planning Guide

    NASA Technical Reports Server (NTRS)

    Stokes, LeBarian

    2012-01-01

    Test process, milestones and inputs are unknowns to first-time users of the SDTS. The User Test Planning Guide aids in establishing expectations for both NASA and non-NASA facility customers. The potential audience for this guide includes both internal and commercial spaceflight hardware/software developers. It is intended to assist their test engineering personnel in test planning and execution. Material covered includes a roadmap of the test process, roles and responsibilities of facility and user, major milestones, facility capabilities, and inputs required by the facility. Samples of deliverables, test article interfaces, and inputs necessary to define test scope, cost, and schedule are included as an appendix to the guide.

  5. Test Plan: WIPP bin-scale CH TRU waste tests

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Molecke, M.A.

    1990-08-01

    The WIPP Bin-Scale CH TRU Waste Test program described herein will provide relevant composition and kinetic rate data on gas generation and consumption resulting from TRU waste degradation, as impacted by synergistic interactions due to multiple degradation modes, waste form preparation, long-term repository environmental effects, engineered barrier materials, and, possibly, engineered modifications to be developed. Similar data on waste-brine leachate compositions and potentially hazardous volatile organic compounds released by the wastes will also be provided. The quantitative data output from these tests and associated technical expertise are required by the WIPP Performance Assessment (PA) program studies, and for the scientific benefit of the overall WIPP project. This Test Plan describes the necessary scientific and technical aspects, justifications, and rationale for successfully initiating and conducting the WIPP Bin-Scale CH TRU Waste Test program. This Test Plan is the controlling scientific design definition and overall requirements document for this WIPP in situ test, as defined by Sandia National Laboratories (SNL), scientific advisor to the US Department of Energy, WIPP Project Office (DOE/WPO). 55 refs., 16 figs., 19 tabs.

  6. Development and Validation of a Gender Ideology Scale for Family Planning Services in Rural China

    PubMed Central

    Yang, Xueyan; Li, Shuzhuo; Feldman, Marcus W.

    2013-01-01

    The objectives of this study are to develop a scale of gender role ideology appropriate for assessing Quality of Care in family planning services for rural China. Literature review, focus-group discussions and in-depth interviews with service providers and clients from two counties in eastern and western China, as well as experts’ assessments, were used to develop a scale for family planning services. Psychometric methodologies were applied to samples of 601 service clients and 541 service providers from a survey in a district in central China to validate its internal consistency, reliability, and construct validity with realistic and strategic dimensions. This scale is found to be reliable and valid, and has prospects for application both academically and practically in the field. PMID:23573222

  7. Using Optimization to Improve Test Planning

    DTIC Science & Technology

    2017-09-01

    ...make the input more user-friendly and to display the output differently, the test and evaluation test schedule optimization model would be a good tool for the test and... evaluation schedulers. Subject terms: schedule optimization, test planning. Number of pages: 223.

  8. Extended version of the "Sniffin' Sticks" identification test: test-retest reliability and validity.

    PubMed

    Sorokowska, A; Albrecht, E; Haehner, A; Hummel, T

    2015-03-30

    The extended, 32-item version of the Sniffin' Sticks identification test was developed in order to create a precise tool enabling repeated, longitudinal testing of individual olfactory subfunctions. Odors of the previous test version had to be changed for technical reasons, and the odor identification test needed re-investigation in terms of reliability, validity, and normative values. In our study we investigated olfactory abilities of a group of 100 patients with olfactory dysfunction and 100 controls. We reconfirmed the high test-retest reliability of the extended version of the Sniffin' Sticks identification test and high correlations between the new and the original part of this tool. In addition, we confirmed the validity of the test as it discriminated clearly between controls and patients with olfactory loss. The additional set of 16 odor identification sticks can be either included in the current olfactory test, thus creating a more detailed diagnosis tool, or it can be used separately, enabling to follow olfactory function over time. Additionally, the normative values presented in our paper might provide useful guidelines for interpretation of the extended identification test results. The revised version of the Sniffin' Sticks 32-item odor identification test is a reliable and valid tool for the assessment of olfactory function. Copyright © 2015 Elsevier B.V. All rights reserved.

  9. Optical Autocovariance Wind Lidar (OAWL): aircraft test-flight history and current plans

    NASA Astrophysics Data System (ADS)

    Tucker, Sara C.; Weimer, Carl; Adkins, Mike; Delker, Tom; Gleeson, David; Kaptchen, Paul; Good, Bill; Kaplan, Mike; Applegate, Jeff; Taudien, Glenn

    2015-09-01

    To address mission risk and cost limitations the US has faced in putting a much needed Doppler wind lidar into space, Ball Aerospace and Technologies Corp, with support from NASA's Earth Science Technology Office (ESTO), has developed the Optical Autocovariance Wind Lidar (OAWL), designed to measure winds from aerosol backscatter at the 355 nm or 532 nm wavelengths. Preliminary proof of concept hardware efforts started at Ball back in 2004. From 2008 to 2012, under an ESTO-funded Instrument Incubator Program, Ball incorporated the Optical Autocovariance (OA) interferometer receiver into a prototype breadboard lidar system by adding a laser, telescope, and COTS-based data system for operation at the 355 nm wavelength. In 2011, the prototype system underwent ground-based validation testing, and three months later, after hardware and software modifications to ensure autonomous operation and aircraft safety, it was flown on the NASA WB-57 aircraft. The history of the 2011 test flights is reviewed, including efforts to get the system qualified for aircraft flights, modifications made during the flight test period, and the final flight data results. We also present lessons learned and plans for the new, robust, two-wavelength, aircraft system with flight demonstrations planned for Spring 2016.

  10. Airborne Observations and Satellite Validation: INTEX-A Experience and INTEX-B Plans

    NASA Technical Reports Server (NTRS)

    Crawford, James H.; Singh, Hanwant B.; Brune, William H.; Jacob, Daniel J.

    2005-01-01

    Intercontinental Chemical Transport Experiment (INTEX; http://cloudl.arc.nasa.gov) is an ongoing two-phase integrated atmospheric field experiment being performed over North America (NA). Its first phase (INTEX-A) was performed in the summer of 2004 and the second phase (INTEX-B) is planned for the early spring of 2006. The main goal of INTEX-NA is to understand the transport and transformation of gases and aerosols on transcontinental/intercontinental scales and to assess their impact on air quality and climate. Central to achieving this goal is the need to relate space-based observations with those from airborne and surface platforms. During INTEX-A, NASA's DC-8 was joined by some dozen other aircraft from a large number of European and North American partners to focus on the outflow of pollution from NA to the Atlantic. Several instances of Asian pollution over NA were also encountered. INTEX-A flight planning extensively relied on satellite observations and in turn satellite validation (Terra, Aqua, and Envisat) was given high priority. Over 20 validation profiles were successfully carried out. DC-8 sampling of smoke from Alaskan fires and formaldehyde over forested regions, and simultaneous satellite observations of these provided excellent opportunities for the interplay of these platforms. The planning for INTEX-B is currently underway, and a vast majority of "standard" and "research" products to be retrieved from Aura instruments will be measured during INTEX-B throughout the troposphere. INTEX-B will focus on the inflow of pollution from Asia to North America and validation of satellite observations with emphasis on Aura. Several national and international partners are expected to coordinate activities with INTEX-B, and we expect its scope to expand in the coming months. An important new development involves partnership with an NSF-sponsored campaign called MIRAGE (Megacity Impacts on Regional and Global Environments- Mexico City Pollution Outflow Field

  11. Coverage of the Test of Memory Malingering, Victoria Symptom Validity Test, and Word Memory Test on the Internet: is test security threatened?

    PubMed

    Bauer, Lyndsey; McCaffrey, Robert J

    2006-01-01

    In forensic neuropsychological settings, maintaining test security has become critically important, especially in regard to symptom validity tests (SVTs). Coaching, which can entail providing patients or litigants with information about the cognitive sequelae of head injury, or teaching them test-taking strategies to avoid detection of symptom dissimulation has been examined experimentally in many research studies. Emerging evidence supports that coaching strategies affect psychological and neuropsychological test performance to differing degrees depending on the coaching paradigm and the tests administered. The present study sought to examine Internet coverage of SVTs because it is potentially another source of coaching, or information that is readily available. Google searches were performed on the Test of Memory Malingering, the Victoria Symptom Validity Test, and the Word Memory Test. Results indicated that there is a variable amount of information available about each test that could threaten test security and validity should inappropriately interested parties find it. Steps that could be taken to improve this situation and limitations to this exploration are discussed.

  12. Development and Validation of Diagnostic Economics Test for Secondary Schools

    ERIC Educational Resources Information Center

    Eleje, Lydia I.; Esomonu, Nkechi P. M.; Agu, Ngozi N.; Okoye, Romy O.; Obasi, Emma; Onah, Frederick E.

    2016-01-01

    A diagnostic test in economics to help teachers determine students' specific weak content areas was developed and validated. Five research questions guided the study. Preliminary validation was done by two experienced teachers in the content area of secondary economics and two experts in test construction. The pilot testing was conducted for…

  13. 10 CFR 26.139 - Reporting initial validity and drug test results.

    Code of Federal Regulations, 2014 CFR

    2014-01-01

    ... 10 Energy 1 2014-01-01 2014-01-01 false Reporting initial validity and drug test results. 26.139... § 26.139 Reporting initial validity and drug test results. (a) The licensee testing facility shall... permitted under § 26.75(h), positive test results from initial drug tests at the licensee testing facility...

  14. 10 CFR 26.139 - Reporting initial validity and drug test results.

    Code of Federal Regulations, 2012 CFR

    2012-01-01

    ... 10 Energy 1 2012-01-01 2012-01-01 false Reporting initial validity and drug test results. 26.139... § 26.139 Reporting initial validity and drug test results. (a) The licensee testing facility shall... permitted under § 26.75(h), positive test results from initial drug tests at the licensee testing facility...

  15. Validation of Clinical Testing for Warfarin Sensitivity

    PubMed Central

    Langley, Michael R.; Booker, Jessica K.; Evans, James P.; McLeod, Howard L.; Weck, Karen E.

    2009-01-01

    Responses to warfarin (Coumadin) anticoagulation therapy are affected by genetic variability in both the CYP2C9 and VKORC1 genes. Validation of pharmacogenetic testing for warfarin responses includes demonstration of analytical validity of testing platforms and of the clinical validity of testing. We compared four platforms for determining the relevant single nucleotide polymorphisms (SNPs) in both CYP2C9 and VKORC1 that are associated with warfarin sensitivity (Third Wave Invader Plus, ParagonDx/Cepheid Smart Cycler, Idaho Technology LightCycler, and AutoGenomics Infiniti). Each method was examined for accuracy, cost, and turnaround time. All genotyping methods demonstrated greater than 95% accuracy for identifying the relevant SNPs (CYP2C9 *2 and *3; VKORC1 −1639 or 1173). The ParagonDx and Idaho Technology assays had the shortest turnaround and hands-on times. The Third Wave assay was readily scalable to higher test volumes but had the longest hands-on time. The AutoGenomics assay interrogated the largest number of SNPs but had the longest turnaround time. Four published warfarin-dosing algorithms (Washington University, UCSF, Louisville, and Newcastle) were compared for accuracy for predicting warfarin dose in a retrospective analysis of a local patient population on long-term, stable warfarin therapy. The predicted doses from both the Washington University and UCSF algorithms demonstrated the best correlation with actual warfarin doses. PMID:19324988

  16. SU-E-T-129: Are Knowledge-Based Planning Dose Estimates Valid for Distensible Organs?

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Lalonde, R; Heron, D; Huq, M

    2015-06-15

    Purpose: Knowledge-based planning programs have become available to assist treatment planning in radiation therapy. Such programs can be used to generate estimated DVHs and planning constraints for organs at risk (OARs), based upon a model generated from previous plans. These estimates are based upon the planning CT scan. However, for distensible OARs like the bladder and rectum, daily variations in volume may make the dose estimates invalid. The purpose of this study is to determine whether knowledge-based DVH dose estimates may be valid for distensible OARs. Methods: The Varian RapidPlan™ knowledge-based planning module was used to generate OAR dose estimates and planning objectives for 10 prostate cases previously planned with VMAT, and final plans were calculated for each. Five weekly setup CBCT scans of each patient were then downloaded and contoured (assuming no change in size and shape of the target volume), and rectum and bladder DVHs were recalculated for each scan. Dose volumes were then compared at 75, 60, and 40 Gy for the bladder and rectum between the planning scan and the CBCTs. Results: Plan doses and estimates matched well at all dose points. Volumes of the rectum and bladder varied widely between planning CT and the CBCTs, ranging from 0.46 to 2.42 for the bladder and 0.71 to 2.18 for the rectum, causing relative dose volumes to vary between planning CT and CBCT, but absolute dose volumes were more consistent. The overall ratio of CBCT/plan dose volumes was 1.02 ± 0.27 for rectum and 0.98 ± 0.20 for bladder in these patients. Conclusion: Knowledge-based planning dose volume estimates for distensible OARs are still valid, in absolute volume terms, between treatment planning scans and CBCTs taken during daily treatment. Further analysis of the data is being undertaken to determine how differences depend upon rectum and bladder filling state. This work has been supported by Varian Medical Systems.

  17. A Comprehensive Plan for the Long-Term Calibration and Validation of Oceanic Biogeochemical Satellite Data

    NASA Technical Reports Server (NTRS)

    Hooker, Stanford B.; McClain, Charles R.; Mannino, Antonio

    2007-01-01

    The primary objective of this planning document is to establish a long-term capability for calibrating and validating oceanic biogeochemical satellite data. It is a pragmatic solution to a practical problem based primarily on the lessons learned from prior satellite missions. All of the plan's elements are seen to be interdependent, so a horizontal organizational scheme is anticipated wherein the overall leadership comes from the NASA Ocean Biology and Biogeochemistry (OBB) Program Manager and the entire enterprise is split into two components of equal stature: calibration and validation plus satellite data processing. The detailed elements of the activity are based on the basic tasks of the two main components plus the current objectives of the Carbon Cycle and Ecosystems Roadmap. The former is distinguished by an internal core set of responsibilities and the latter is facilitated through an external connecting-core ring of competed or contracted activities. The core elements for the calibration and validation component include a) publish protocols and performance metrics; b) verify uncertainty budgets; c) manage the development and evaluation of instrumentation; and d) coordinate international partnerships. The core elements for the satellite data processing component are e) process and reprocess multisensor data; f) acquire, distribute, and archive data products; and g) implement new data products. Both components have shared responsibilities for initializing and temporally monitoring satellite calibration. Connecting-core elements include (but are not restricted to) atmospheric correction and characterization, standards and traceability, instrument and analysis round robins, field campaigns and vicarious calibration sites, in situ database, bio-optical algorithm (and product) validation, satellite characterization and vicarious calibration, and image processing software. The plan also includes an accountability process, creating a Calibration and Validation Team (to help manage

  18. The Need, Development, and Validation of the Innovation Test Instrument

    ERIC Educational Resources Information Center

    Wheadon, Jacob; Wright, Geoff A.; West, Richard E.; Skaggs, Paul

    2017-01-01

    This study discusses the need, development, and validation of the Innovation Test Instrument (ITI). This article outlines how the researchers identified the content domain of the assessment and created test items. Then, it describes initial validation testing of the instrument. The findings suggest that the ITI is a good first step in creating an…

  19. Planned Comparisons as Better Alternatives to ANOVA Omnibus Tests.

    ERIC Educational Resources Information Center

    Benton, Roberta L.

    Analyses of data are presented to illustrate the advantages of using a priori or planned comparisons rather than omnibus analysis of variance (ANOVA) tests followed by post hoc or posteriori testing. The two types of planned comparisons considered are planned orthogonal non-trend coding contrasts and orthogonal polynomial or trend contrast coding.…
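
    As an illustration of the orthogonal trend contrasts mentioned above, the following is a minimal sketch, assuming four groups of equal size at equally spaced levels (the abstract does not state the actual design). The coefficients are the standard polynomial trend contrasts for four levels; the function names and the group means are hypothetical.

```python
from itertools import combinations

# Standard polynomial trend coefficients for four equally spaced levels.
TREND_CONTRASTS = {
    "linear": [-3, -1, 1, 3],
    "quadratic": [1, -1, -1, 1],
    "cubic": [-1, 3, -3, 1],
}


def is_contrast(coefs):
    """A contrast's coefficients must sum to zero."""
    return sum(coefs) == 0


def are_orthogonal(c1, c2):
    """With equal group sizes, two contrasts are orthogonal when the
    sum of the products of their coefficients is zero."""
    return sum(a * b for a, b in zip(c1, c2)) == 0


def all_pairwise_orthogonal(contrasts):
    """Check every pair in a set of contrasts."""
    return all(are_orthogonal(a, b) for a, b in combinations(contrasts, 2))


def contrast_estimate(coefs, group_means):
    """Weighted sum of group means for one planned comparison."""
    return sum(c * m for c, m in zip(coefs, group_means))


if __name__ == "__main__":
    means = [1.0, 2.0, 3.0, 4.0]  # hypothetical group means
    print(all_pairwise_orthogonal(list(TREND_CONTRASTS.values())))
    for name, coefs in TREND_CONTRASTS.items():
        print(name, contrast_estimate(coefs, means))
```

    With unequal group sizes the orthogonality check would need to weight each product by the inverse of the group size, which this sketch does not do.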

  20. Validation of the Simple Shoulder Test in a Portuguese-Brazilian population. Is the latent variable structure and validation of the Simple Shoulder Test Stable across cultures?

    PubMed

    Neto, Jose Osni Bruggemann; Gesser, Rafael Lehmkuhl; Steglich, Valdir; Bonilauri Ferreira, Ana Paula; Gandhi, Mihir; Vissoci, João Ricardo Nickenig; Pietrobon, Ricardo

    2013-01-01

    The validation of widely used scales facilitates the comparison across international patient samples. The objective of this study was to translate, culturally adapt and validate the Simple Shoulder Test into Brazilian Portuguese. We also test the stability of the factor analysis across different cultures. The Simple Shoulder Test was translated from English into Brazilian Portuguese, translated back into English, and evaluated for accuracy by an expert committee. It was then administered to 100 patients with shoulder conditions. Psychometric properties were analyzed including factor analysis, internal reliability, test-retest reliability at seven days, and construct validity in relation to the Short Form 36 health survey (SF-36). Factor analysis demonstrated a three factor solution. Cronbach's alpha was 0.82. Test-retest reliability index as measured by intra-class correlation coefficient (ICC) was 0.84. Associations were observed in the hypothesized direction with all subscales of SF-36 questionnaire. The Simple Shoulder Test translation and cultural adaptation to Brazilian-Portuguese demonstrated adequate factor structure, internal reliability, and validity, ultimately allowing for its use in the comparison with international patient samples.
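
    The internal reliability figure reported above (Cronbach's alpha) can be computed from item-level responses. The following is a minimal sketch of the textbook formula, not the authors' analysis code; the function name and the toy yes/no responses are hypothetical and are not study data.

```python
from statistics import variance


def cronbach_alpha(responses):
    """Cronbach's alpha for a list of respondents, each a list of k item scores.

    alpha = k / (k - 1) * (1 - sum(item variances) / variance(total scores))
    """
    k = len(responses[0])
    if k < 2:
        raise ValueError("at least two items are required")
    if any(len(row) != k for row in responses):
        raise ValueError("every respondent must answer every item")
    item_columns = list(zip(*responses))
    item_variance_sum = sum(variance(col) for col in item_columns)
    total_variance = variance([sum(row) for row in responses])
    return (k / (k - 1)) * (1 - item_variance_sum / total_variance)


if __name__ == "__main__":
    # Hypothetical yes/no (1/0) answers from five respondents to three items.
    toy = [
        [1, 1, 1],
        [1, 1, 0],
        [0, 0, 0],
        [1, 0, 1],
        [0, 0, 0],
    ]
    print(round(cronbach_alpha(toy), 3))
```

    Sample variances are used for both the items and the totals; using population variances throughout would give the same ratio.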

  1. Validation of the Simple Shoulder Test in a Portuguese-Brazilian Population. Is the Latent Variable Structure and Validation of the Simple Shoulder Test Stable across Cultures?

    PubMed Central

    Neto, Jose Osni Bruggemann; Gesser, Rafael Lehmkuhl; Steglich, Valdir; Bonilauri Ferreira, Ana Paula; Gandhi, Mihir; Vissoci, João Ricardo Nickenig; Pietrobon, Ricardo

    2013-01-01

    Background: The validation of widely used scales facilitates the comparison across international patient samples. Objective: The objective of this study was to translate, culturally adapt and validate the Simple Shoulder Test into Brazilian Portuguese. We also test the stability of the factor analysis across different cultures. Methods: The Simple Shoulder Test was translated from English into Brazilian Portuguese, translated back into English, and evaluated for accuracy by an expert committee. It was then administered to 100 patients with shoulder conditions. Psychometric properties were analyzed including factor analysis, internal reliability, test-retest reliability at seven days, and construct validity in relation to the Short Form 36 health survey (SF-36). Results: Factor analysis demonstrated a three factor solution. Cronbach’s alpha was 0.82. Test-retest reliability index as measured by intra-class correlation coefficient (ICC) was 0.84. Associations were observed in the hypothesized direction with all subscales of SF-36 questionnaire. Conclusion: The Simple Shoulder Test translation and cultural adaptation to Brazilian-Portuguese demonstrated adequate factor structure, internal reliability, and validity, ultimately allowing for its use in the comparison with international patient samples. PMID:23675436

  2. 49 CFR 232.505 - Pre-revenue service acceptance testing plan.

    Code of Federal Regulations, 2010 CFR

    2010-10-01

    ... 49 Transportation 4 2010-10-01 2010-10-01 false Pre-revenue service acceptance testing plan. 232... § 232.505 Pre-revenue service acceptance testing plan. (a) General; submission of plan. Except as... its system the operating railroad or railroads shall submit a pre-revenue service acceptance testing...

  3. Validation of a sampling plan to generate food composition data.

    PubMed

    Sammán, N C; Gimenez, M A; Bassett, N; Lobo, M O; Marcoleri, M E

    2016-02-15

    A methodology to develop systematic plans for food sampling was proposed. Long life whole and skimmed milk, and sunflower oil were selected to validate the methodology in Argentina. Fatty acid profile in all foods, and proximate composition and calcium content in milk, were determined with AOAC methods. The number of samples (n) was calculated applying Cochran's formula with variation coefficients ⩽12% and a maximum permissible estimation error (r) ⩽5% for calcium content in milks and unsaturated fatty acids in oil. The resulting n was 9, 11 and 21 for long life whole milk, skimmed milk, and sunflower oil, respectively. Sample units were randomly collected from production sites and sent to labs. Calculated r with experimental data was ⩽10%, indicating high accuracy in the determination of analyte content of greater variability and reliability of the proposed sampling plan. The methodology is an adequate and useful tool to develop sampling plans for food composition analysis. Copyright © 2015 Elsevier Ltd. All rights reserved.
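
    The sample-size step can be sketched with the commonly used relative-error form of Cochran's formula, n = (z · CV / r)². This is an assumption: the abstract does not give the exact variant, the confidence level, or the per-food variation coefficients, so the sketch below does not attempt to reproduce the reported n values. The function name and the default z = 1.96 (about 95% confidence) are hypothetical choices.

```python
import math


def cochran_sample_size(cv_percent, rel_error_percent, z=1.96):
    """Number of sample units needed to estimate a mean within a relative error.

    cv_percent:        coefficient of variation of the analyte, in percent
    rel_error_percent: maximum permissible relative error r, in percent
    z:                 standard normal quantile (1.96 is assumed here)
    """
    if cv_percent <= 0 or rel_error_percent <= 0:
        raise ValueError("cv_percent and rel_error_percent must be positive")
    return math.ceil((z * cv_percent / rel_error_percent) ** 2)


if __name__ == "__main__":
    # Upper bounds quoted in the abstract: CV of 12% and r of 5%.
    print(cochran_sample_size(12, 5))
```

    For small n, a Student t quantile is often substituted for z and the calculation iterated, which changes the result slightly.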

  4. Construction of Valid and Reliable Test for Assessment of Students

    ERIC Educational Resources Information Center

    Osadebe, P. U.

    2015-01-01

    The study was carried out to construct a valid and reliable test in Economics for secondary school students. Two research questions were drawn to guide the establishment of validity and reliability for the Economics Achievement Test (EAT). It is a multiple choice objective test of five options with 100 items. A sample of 1000 students was randomly…

  5. Validity and Reliability of the Arabic Token Test for Children

    ERIC Educational Resources Information Center

    Alkhamra, Rana A.; Al-Jazi, Aya B.

    2016-01-01

    Background: The Token Test for Children (2nd edition) (TTFC) is a measure for assessing receptive language. In this study we describe the translation process, validity and reliability of the Arabic Token Test for Children (A-TTFC). Aims: The aim of this study is to translate, validate and establish the reliability of the Arabic Token Test for…

  6. Conceptualizing Essay Tests' Reliability and Validity: From Research to Theory

    ERIC Educational Resources Information Center

    Badjadi, Nour El Imane

    2013-01-01

    The current paper on writing assessment surveys the literature on the reliability and validity of essay tests. The paper aims to examine the two concepts in relationship with essay testing as well as to provide a snapshot of the current understandings of the reliability and validity of essay tests as drawn in recent research studies. Bearing in…

  7. MCC/shuttle test plan. Volume 1: Philosophy and guidelines

    NASA Technical Reports Server (NTRS)

    1976-01-01

    The Mission Control Center/Shuttle Test Plan is defined from development through operations to a level of detail which will support the National Aeronautics and Space Administration and contractor management in the following areas: test management, test tool development, and resource and schedule planning.

  8. Photographic copy of site plan for proposed Test Stand "D" ...

    Library of Congress Historic Buildings Survey, Historic Engineering Record, Historic Landscapes Survey

    Photographic copy of site plan for proposed Test Stand "D" in 1958. The contemporary site plans of test stands "A," "B," and "C" are also visible, along with the interconnecting tunnel system. California Institute of Technology, Jet Propulsion Laboratory, Plant Engineering "Site Plan for Proposed Test Stand "D" - Edwards Test Station," drawing no. ESP/22-0, 14 November 1958 - Jet Propulsion Laboratory Edwards Facility, Test Stand D, Edwards Air Force Base, Boron, Kern County, CA

  9. Audio Development Laboratory (ADL) User Test Planning Guide

    NASA Technical Reports Server (NTRS)

    Romero, Andy

    2012-01-01

    Test process, milestones and inputs are unknowns to first-time users of the ADL. The User Test Planning Guide aids in establishing expectations for both NASA and non-NASA facility customers. The potential audience for this guide includes both internal and commercial spaceflight hardware/software developers. It is intended to assist their test engineering personnel in test planning and execution. Material covered includes a roadmap of the test process, roles and responsibilities of facility and user, major milestones, facility capabilities, and inputs required by the facility. Samples of deliverables, test article interfaces, and inputs necessary to define test scope, cost, and schedule are included as an appendix to the guide.

  10. Dosimetric accuracy of a treatment planning system for actively scanned proton beams and small target volumes: Monte Carlo and experimental validation

    NASA Astrophysics Data System (ADS)

    Magro, G.; Molinelli, S.; Mairani, A.; Mirandola, A.; Panizza, D.; Russo, S.; Ferrari, A.; Valvo, F.; Fossati, P.; Ciocca, M.

    2015-09-01

    This study was performed to evaluate the accuracy of a commercial treatment planning system (TPS) in optimising proton pencil beam dose distributions for small targets of different sizes (5-30 mm side) located at increasing depths in water. The TPS analytical algorithm was benchmarked against experimental data and the FLUKA Monte Carlo (MC) code, previously validated for the selected beam-line. We tested the Siemens syngo® TPS plan optimisation module for water cubes, fixing the configurable parameters at clinical standards, with homogeneous target coverage to a 2 Gy (RBE) dose prescription as the unique goal. Plans were delivered and the dose at each volume centre was measured in water with a calibrated PTW Advanced Markus® chamber. An EBT3® film was also positioned at the phantom entrance window for the acquisition of 2D dose maps. Discrepancies between TPS calculated and MC simulated values were mainly due to the different lateral spread modeling and were found to be related to the field-to-spot size ratio. The accuracy of the TPS proved clinically acceptable in all cases except very small and shallow volumes. In this context, the use of MC to validate TPS results proved to be a reliable procedure for pre-treatment plan verification.

  11. Dosimetric accuracy of a treatment planning system for actively scanned proton beams and small target volumes: Monte Carlo and experimental validation.

    PubMed

    Magro, G; Molinelli, S; Mairani, A; Mirandola, A; Panizza, D; Russo, S; Ferrari, A; Valvo, F; Fossati, P; Ciocca, M

    2015-09-07

    This study was performed to evaluate the accuracy of a commercial treatment planning system (TPS) in optimising proton pencil beam dose distributions for small targets of different sizes (5-30 mm side) located at increasing depths in water. The TPS analytical algorithm was benchmarked against experimental data and the FLUKA Monte Carlo (MC) code, previously validated for the selected beam-line. We tested the Siemens syngo® TPS plan optimisation module for water cubes, fixing the configurable parameters at clinical standards, with homogeneous target coverage to a 2 Gy (RBE) dose prescription as the unique goal. Plans were delivered and the dose at each volume centre was measured in water with a calibrated PTW Advanced Markus® chamber. An EBT3® film was also positioned at the phantom entrance window for the acquisition of 2D dose maps. Discrepancies between TPS calculated and MC simulated values were mainly due to the different lateral spread modeling and were found to be related to the field-to-spot size ratio. The accuracy of the TPS proved clinically acceptable in all cases except very small and shallow volumes. In this context, the use of MC to validate TPS results proved to be a reliable procedure for pre-treatment plan verification.

  12. 10 CFR 26.139 - Reporting initial validity and drug test results.

    Code of Federal Regulations, 2011 CFR

    2011-01-01

    ... 10 Energy 1 2011-01-01 2011-01-01 false Reporting initial validity and drug test results. 26.139 Section 26.139 Energy NUCLEAR REGULATORY COMMISSION FITNESS FOR DUTY PROGRAMS Licensee Testing Facilities § 26.139 Reporting initial validity and drug test results. (a) The licensee testing facility shall...

  13. 10 CFR 26.139 - Reporting initial validity and drug test results.

    Code of Federal Regulations, 2010 CFR

    2010-01-01

    ... 10 Energy 1 2010-01-01 2010-01-01 false Reporting initial validity and drug test results. 26.139 Section 26.139 Energy NUCLEAR REGULATORY COMMISSION FITNESS FOR DUTY PROGRAMS Licensee Testing Facilities § 26.139 Reporting initial validity and drug test results. (a) The licensee testing facility shall...

  14. 10 CFR 26.139 - Reporting initial validity and drug test results.

    Code of Federal Regulations, 2013 CFR

    2013-01-01

    ... 10 Energy 1 2013-01-01 2013-01-01 false Reporting initial validity and drug test results. 26.139 Section 26.139 Energy NUCLEAR REGULATORY COMMISSION FITNESS FOR DUTY PROGRAMS Licensee Testing Facilities § 26.139 Reporting initial validity and drug test results. (a) The licensee testing facility shall...

  15. Space Weather Model Testing And Validation At The Community Coordinated Modeling Center

    NASA Astrophysics Data System (ADS)

    Hesse, M.; Kuznetsova, M.; Rastaetter, L.; Falasca, A.; Keller, K.; Reitan, P.

    The Community Coordinated Modeling Center (CCMC) is a multi-agency partnership aimed at the creation of next generation space weather models. The goal of the CCMC is to undertake the research and developmental work necessary to substantially increase the present-day modeling capability for space weather purposes, and to provide models for transition to the rapid prototyping centers at the space weather forecast centers. This goal requires close collaborations with and substantial involvement of the research community. The physical regions to be addressed by CCMC-related activities range from the solar atmosphere to the Earth's upper atmosphere. The CCMC is an integral part of NASA's Living With a Star initiative, of the National Space Weather Program Implementation Plan, and of the Department of Defense Space Weather Transition Plan. CCMC includes a facility at NASA Goddard Space Flight Center, as well as distributed computing facilities provided by the Air Force. CCMC also provides, to the research community, access to state-of-the-art space research models. In this paper we will provide updates on CCMC status, on current plans, research and development accomplishments and goals, and on the model testing and validation process undertaken as part of the CCMC mandate.

  16. Eye-Tracking as a Tool in Process-Oriented Reading Test Validation

    ERIC Educational Resources Information Center

    Solheim, Oddny Judith; Uppstad, Per Henning

    2011-01-01

    The present paper addresses the continuous need for methodological reflection on how to validate inferences made on the basis of test scores. Validation is a process that requires many lines of evidence. In this article we discuss the potential of eye tracking methodology in process-oriented reading test validation. Methodological considerations…

  17. The validation of Huffaz Intelligence Test (HIT)

    NASA Astrophysics Data System (ADS)

    Rahim, Mohd Azrin Mohammad; Ahmad, Tahir; Awang, Siti Rahmah; Safar, Ajmain

    2017-08-01

    In general, a hafiz who can memorize the Quran has many specialties, especially with respect to academic performance. In this study, the theory of multiple intelligences introduced by Howard Gardner is embedded in a developed psychometric instrument, namely the Huffaz Intelligence Test (HIT). This paper presents the validation and the reliability of the HIT for tahfiz students in Malaysian Islamic schools. A pilot study was conducted involving 87 huffaz who were randomly selected to answer the items in the HIT. The analysis method used includes Partial Least Squares (PLS) on reliability and on convergent and discriminant validity. The study has validated nine intelligences. The findings also indicated that the composite reliabilities for the nine types of intelligences are greater than 0.8. Thus, the HIT is a valid and reliable instrument to measure the multiple intelligences among huffaz.

  18. Prospective clinical validation of independent DVH prediction for plan QA in automatic treatment planning for prostate cancer patients.

    PubMed

    Wang, Yibing; Heijmen, Ben J M; Petit, Steven F

    2017-12-01

    To prospectively investigate the use of an independent DVH prediction tool to detect outliers in the quality of fully automatically generated treatment plans for prostate cancer patients. A plan QA tool was developed to predict rectum, anus and bladder DVHs, based on overlap volume histograms and principal component analysis (PCA). The tool was trained with 22 automatically generated, clinical plans, and independently validated with 21 plans. Its use was prospectively investigated for 50 new plans by replanning in case of detected outliers. For rectum Dmean, V65Gy, V75Gy, anus Dmean, and bladder Dmean, the difference between predicted and achieved was within 0.4 Gy or 0.3% (SD within 1.8 Gy or 1.3%). Thirteen detected outliers were re-planned, leading to moderate but statistically significant improvements (mean, max): rectum Dmean (1.3 Gy, 3.4 Gy), V65Gy (2.7%, 4.2%), anus Dmean (1.6 Gy, 6.9 Gy), and bladder Dmean (1.5 Gy, 5.1 Gy). The rectum V75Gy of the new plans slightly increased (0.2%, p = 0.087). A high accuracy DVH prediction tool was developed and used for independent QA of automatically generated plans. In 28% of plans, minor dosimetric deviations were observed that could be improved by plan adjustments. Larger gains are expected for manually generated plans. Copyright © 2017 Elsevier B.V. All rights reserved.

  19. TESTING BALANCE AND FALL RISK IN PERSONS WITH PARKINSON DISEASE, AN ARGUMENT FOR ECOLOGICALLY VALID TESTING

    PubMed Central

    Foreman, K. Bo; Addison, Odessa; Kim, Han S.; Dibble, Leland E.

    2010-01-01

    Introduction: Despite clear deficits in postural control, most clinical examination tools lack accuracy in identifying persons with Parkinson disease (PD) who have fallen or are at risk for falls. We assert that this is in part due to the lack of ecological validity of the testing. Methods: To test this assertion, we examined the responsiveness and predictive validity of the Functional Gait Assessment (FGA), the Pull test, and the Timed Up and Go (TUG) during clinically defined ON and OFF medication states. To address responsiveness, ON/OFF medication performance was compared. To address predictive validity, areas under the curve (AUC) of receiver operating characteristic (ROC) curves were compared. Comparisons were made using separate non-parametric tests. Results: Thirty-six persons (24 male, 12 female) with PD (22 fallers, 14 non-fallers) participated. Only the FGA was able to detect differences between fallers and non-fallers for both ON/OFF medication testing. The predictive validity of the FGA and the TUG for fall identification was higher during OFF medication compared to ON medication testing. The predictive validity of the FGA was higher than the TUG and the Pull test during ON and OFF medication testing. Discussion: In order to most accurately identify fallers, clinicians should test persons with PD in ecologically relevant conditions and tasks. In this study, interpretation of the OFF medication performance and use of the FGA provided more accurate prediction of those who would fall. PMID:21215674
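
    The AUC used above as the measure of predictive validity can be computed directly from two groups of scores. The following is a minimal sketch using the rank-based (Mann-Whitney) definition: the probability that a randomly chosen faller scores higher than a randomly chosen non-faller, with ties counted as one half. The function name and the scores are hypothetical, and the sketch assumes scores are oriented so that higher values indicate greater fall risk (a test where low scores indicate risk would need to be reversed first).

```python
def roc_auc(scores_positive, scores_negative):
    """Area under the ROC curve via pairwise comparison of the two groups.

    Counts the fraction of (positive, negative) pairs in which the
    positive case has the higher score; ties contribute 0.5.
    """
    if not scores_positive or not scores_negative:
        raise ValueError("both groups must be non-empty")
    wins = 0.0
    for p in scores_positive:
        for n in scores_negative:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(scores_positive) * len(scores_negative))


if __name__ == "__main__":
    # Hypothetical risk scores, not study data.
    fallers = [14.2, 11.0, 16.5, 12.8]
    non_fallers = [9.1, 10.4, 12.8]
    print(roc_auc(fallers, non_fallers))
```

    An AUC of 0.5 corresponds to chance-level discrimination and 1.0 to perfect separation of the two groups.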

  20. Proof of concept test plan.

    DOT National Transportation Integrated Search

    2008-06-05

    This document is the Proof of Concept (POC) Test Plan and procedures that will be used : to verify that hardware and application functionality meet the requirements of the U.S. : Department of Transportation (USDOT) Next Generation 9-1-1 Initiative (...

  1. Face Validity of Test and Acceptance of Generalized Personality Interpretations

    ERIC Educational Resources Information Center

    Delprato, Dennis J.

    1975-01-01

    The degree to which variations in the face validity of psychological tests affected students' willingness to accept personality interpretations was studied. Acceptance of personality interpretations was compared for four types of tests which varied in face validity. The relationship between judged accuracy and rated likability of the…

  2. Avionics test bed development plan

    NASA Technical Reports Server (NTRS)

    Harris, L. H.; Parks, J. M.; Murdock, C. R.

    1981-01-01

    A development plan for a proposed avionics test bed facility for the early investigation and evaluation of new concepts for the control of large space structures, orbiter attached flex body experiments, and orbiter enhancements is presented. A distributed data processing facility that utilizes the current laboratory resources for the test bed development is outlined. Future studies required for implementation, the management system for project control, and the baseline system configuration are defined. A background analysis of the specific hardware system for the preliminary baseline avionics test bed system is included.

  3. Does Test Preparation Work? Implications for Score Validity

    ERIC Educational Resources Information Center

    Xie, Qin

    2013-01-01

    This article reports an empirical study that examined the pattern of test preparation for College English Test Band 4 (CET4) and the differential effects of test preparation practices on its scores, thereby drawing implications for CET4 score validity. Data collection involved 1,003 test takers of CET4. A pretest was administered at the beginning…

  4. Independent validation of the MMPI-2-RF Somatic/Cognitive and Validity scales in TBI Litigants tested for effort.

    PubMed

    Youngjohn, James R; Wershba, Rebecca; Stevenson, Matthew; Sturgeon, John; Thomas, Michael L

    2011-04-01

    The MMPI-2 Restructured Form (MMPI-2-RF; Ben-Porath & Tellegen, 2008) is replacing the MMPI-2 as the most widely used personality test in neuropsychological assessment, but additional validation studies are needed. Our study examines MMPI-2-RF Validity scales and the newly created Somatic/Cognitive scales in a recently reported sample of 82 traumatic brain injury (TBI) litigants who either passed or failed effort tests (Thomas & Youngjohn, 2009). The restructured Validity scales FBS-r (restructured symptom validity), F-r (restructured infrequent responses), and the newly created Fs (infrequent somatic responses) were not significant predictors of TBI severity. FBS-r was significantly related to passing or failing effort tests, and Fs and F-r showed non-significant trends in the same direction. Elevations on the Somatic/Cognitive scales profile (MLS-malaise, GIC-gastrointestinal complaints, HPC-head pain complaints, NUC-neurological complaints, and COG-cognitive complaints) were significant predictors of effort test failure. Additionally, HPC had the anticipated paradoxical inverse relationship with head injury severity. The Somatic/Cognitive scales as a group were better predictors of effort test failure than the RF Validity scales, which was an unexpected finding. MLS arose as the single best predictor of effort test failure of all RF Validity and Somatic/Cognitive scales. Item overlap analysis revealed that all MLS items are included in the original MMPI-2 Hy scale, making MLS essentially a subscale of Hy. This study validates the MMPI-2-RF as an effective tool for use in neuropsychological assessment of TBI litigants.

  5. Validation of Physics Standardized Test Items

    NASA Astrophysics Data System (ADS)

    Marshall, Jill

    2008-10-01

    The Texas Physics Assessment Team (TPAT) examined the Texas Assessment of Knowledge and Skills (TAKS) to determine whether it is a valid indicator of physics preparation for future course work and employment, and of the knowledge and skills needed to act as an informed citizen in a technological society. We categorized science items from the 2003 and 2004 10th and 11th grade TAKS by content area(s) covered, knowledge and skills required to select the correct answer, and overall quality. We also analyzed a 5000 student sample of item-level results from the 2004 11th grade exam using standard statistical methods employed by test developers (factor analysis and Item Response Theory). Triangulation of our results revealed strengths and weaknesses of the different methods of analysis. The TAKS was found to be only weakly indicative of physics preparation and we make recommendations for increasing the validity of standardized physics testing.

  6. Validation of antibiotic residue tests for dairy goats.

    PubMed

    Zeng, S S; Hart, S; Escobar, E N; Tesfai, K

    1998-03-01

    The SNAP test, LacTek test (B-L and CEF), Charm Bacillus stearothermophilus var. calidolactis disk assay (BsDA), and Charm II Tablet Beta-lactam sequential test were validated using antibiotic-fortified and -incurred goat milk following the protocol for test kit validations of the U.S. Food and Drug Administration Center for Veterinary Medicine. SNAP, Charm BsDA, and Charm II Tablet Sequential tests were sensitive and reliable in detecting antibiotic residues in goat milk. All three assays showed greater than 90% sensitivity and specificity at tolerance and detection levels. However, caution should be taken in interpreting test results at detection levels. Because of the high sensitivity of these three tests, false-violative results could be obtained in goat milk containing antibiotic residues below the tolerance level. Goat milk testing positive by these tests must be confirmed using a more sophisticated methodology, such as high-performance liquid chromatography, before the milk is condemned. LacTek B-L test did not detect several antibiotics, including penicillin G, in goat milk at tolerance levels. However, LacTek CEF was excellent in detecting ceftiofur residue in goat milk.

  7. The Validity of IQ Scores Derived from Readiness Screening Tests

    ERIC Educational Resources Information Center

    Telegdy, Gabriel A.

    1976-01-01

    The Screening Test of Academic Readiness (STAR) and the Peabody Picture Vocabulary Test (PPVT) were administered to 52 kindergarten children to reveal the convergent validity of IQ scores derived from the STAR. The findings raise doubts about the validity of the deviation IQs derived from the STAR. (Author)

  8. An exploratory study into the effect of time-restricted internet access on face-validity, construct validity and reliability of postgraduate knowledge progress testing

    PubMed Central

    2013-01-01

    Background Yearly formative knowledge testing (also known as progress testing) was shown to have a limited construct-validity and reliability in postgraduate medical education. One way to improve construct-validity and reliability is to improve the authenticity of a test. As easily accessible internet has become inseparably linked to daily clinical practice, we hypothesized that allowing internet access for a limited amount of time during the progress test would improve the perception of authenticity (face-validity) of the test, which would in turn improve the construct-validity and reliability of postgraduate progress testing. Methods Postgraduate trainees taking the yearly knowledge progress test were asked to participate in a study where they could access the internet for 30 minutes at the end of a traditional pen and paper test. Before and after the test they were asked to complete a short questionnaire regarding the face-validity of the test. Results Mean test scores increased significantly for all training years. Trainees indicated that the face-validity of the test improved with internet access and that they would like to continue to have internet access during future testing. Internet access did not improve the construct-validity or reliability of the test. Conclusion Improving the face-validity of postgraduate progress testing, by adding the possibility to search the internet for a limited amount of time, positively influences test performance and face-validity. However, it did not change the reliability or the construct-validity of the test. PMID:24195696

  9. Validation of Helicopter Gear Condition Indicators Using Seeded Fault Tests

    NASA Technical Reports Server (NTRS)

    Dempsey, Paula; Brandon, E. Bruce

    2013-01-01

    A "seeded fault test" in support of a rotorcraft condition based maintenance program (CBM) is an experiment in which a component is tested with a known fault while health monitoring data is collected. These tests are performed at operating conditions comparable to operating conditions the component would be exposed to while installed on the aircraft. Performance of seeded fault tests is one method used to provide evidence that a Health Usage Monitoring System (HUMS) can replace current maintenance practices required for aircraft airworthiness. Actual in-service experience of the HUMS detecting a component fault is another validation method. This paper will discuss a hybrid validation approach that combines in-service data with seeded fault tests. For this approach, existing in-service HUMS flight data from a naturally occurring component fault will be used to define a component seeded fault test. An example, using spiral bevel gears as the targeted component, will be presented. Since the U.S. Army has begun to develop standards for using seeded fault tests for HUMS validation, the hybrid approach will be mapped to the steps defined within their Aeronautical Design Standard Handbook for CBM. This paper will step through their defined processes, and identify additional steps that may be required when using component test rig fault tests to demonstrate helicopter CI performance. The discussion within this paper will provide the reader with a better appreciation for the challenges faced when defining a seeded fault test for HUMS validation.

  10. A Historical Overview on the Concept of Validity in Language Testing

    ERIC Educational Resources Information Center

    Hamavandy, Mehraban; Kiany, Gholam Reza

    2014-01-01

    This article provides an overview on language test validation theories, especially the Messickian view on construct validity and the way it's been translated into practice. First, a brief historical synopsis will be set forth, followed by recent views on test validity as advanced by Messick and Kane. The review goes on to lay out the similarities…

  11. Shifting the Focus of Validity for Test Use

    ERIC Educational Resources Information Center

    Moss, Pamela A.

    2016-01-01

    The conventional focus of validity in educational measurement has been on intended interpretations and uses of test scores. Empirical studies of test use by teachers, administrators and policy-makers show that actual interpretations and uses of test scores in context are invariably shaped by local users' questions, which frequently require…

  12. Validation of Fully Automated VMAT Plan Generation for Library-Based Plan-of-the-Day Cervical Cancer Radiotherapy

    PubMed Central

    Breedveld, Sebastiaan; Voet, Peter W. J.; Heijkoop, Sabrina T.; Mens, Jan-Willem M.; Hoogeman, Mischa S.; Heijmen, Ben J. M.

    2016-01-01

    Purpose To develop and validate fully automated generation of VMAT plan-libraries for plan-of-the-day adaptive radiotherapy in locally-advanced cervical cancer. Material and Methods Our framework for fully automated treatment plan generation (Erasmus-iCycle) was adapted to create dual-arc VMAT treatment plan libraries for cervical cancer patients. For each of 34 patients, automatically generated VMAT plans (autoVMAT) were compared to manually generated, clinically delivered 9-beam IMRT plans (CLINICAL), and to dual-arc VMAT plans generated manually by an expert planner (manVMAT). Furthermore, all plans were benchmarked against 20-beam equi-angular IMRT plans (autoIMRT). For all plans, a PTV coverage of 99.5% by at least 95% of the prescribed dose (46 Gy) had the highest planning priority, followed by minimization of V45Gy for small bowel (SB). Other OARs considered were bladder, rectum, and sigmoid. Results All plans had a highly similar PTV coverage, within the clinical constraints (above). After plan normalizations for exactly equal median PTV doses in corresponding plans, all evaluated OAR parameters in autoVMAT plans were on average lower than in the CLINICAL plans with an average reduction in SB V45Gy of 34.6% (p<0.001). For 41/44 autoVMAT plans, SB V45Gy was lower than for manVMAT (p<0.001, average reduction 30.3%), while SB V15Gy increased by 2.3% (p = 0.011). AutoIMRT reduced SB V45Gy by another 2.7% compared to autoVMAT, while also resulting in a 9.0% reduction in SB V15Gy (p<0.001), but with a prolonged delivery time. Differences between manVMAT and autoVMAT in bladder, rectal and sigmoid doses were ≤ 1%. Improvements in SB dose delivery with autoVMAT instead of manVMAT were higher for empty bladder PTVs compared to full bladder PTVs, due to differences in concavity of the PTVs. Conclusions Quality of automatically generated VMAT plans was superior to manually generated plans. Automatic VMAT plan generation for cervical cancer has been implemented in

  13. Validation of Fully Automated VMAT Plan Generation for Library-Based Plan-of-the-Day Cervical Cancer Radiotherapy.

    PubMed

    Sharfo, Abdul Wahab M; Breedveld, Sebastiaan; Voet, Peter W J; Heijkoop, Sabrina T; Mens, Jan-Willem M; Hoogeman, Mischa S; Heijmen, Ben J M

    2016-01-01

    To develop and validate fully automated generation of VMAT plan-libraries for plan-of-the-day adaptive radiotherapy in locally-advanced cervical cancer. Our framework for fully automated treatment plan generation (Erasmus-iCycle) was adapted to create dual-arc VMAT treatment plan libraries for cervical cancer patients. For each of 34 patients, automatically generated VMAT plans (autoVMAT) were compared to manually generated, clinically delivered 9-beam IMRT plans (CLINICAL), and to dual-arc VMAT plans generated manually by an expert planner (manVMAT). Furthermore, all plans were benchmarked against 20-beam equi-angular IMRT plans (autoIMRT). For all plans, a PTV coverage of 99.5% by at least 95% of the prescribed dose (46 Gy) had the highest planning priority, followed by minimization of V45Gy for small bowel (SB). Other OARs considered were bladder, rectum, and sigmoid. All plans had a highly similar PTV coverage, within the clinical constraints (above). After plan normalizations for exactly equal median PTV doses in corresponding plans, all evaluated OAR parameters in autoVMAT plans were on average lower than in the CLINICAL plans with an average reduction in SB V45Gy of 34.6% (p<0.001). For 41/44 autoVMAT plans, SB V45Gy was lower than for manVMAT (p<0.001, average reduction 30.3%), while SB V15Gy increased by 2.3% (p = 0.011). AutoIMRT reduced SB V45Gy by another 2.7% compared to autoVMAT, while also resulting in a 9.0% reduction in SB V15Gy (p<0.001), but with a prolonged delivery time. Differences between manVMAT and autoVMAT in bladder, rectal and sigmoid doses were ≤ 1%. Improvements in SB dose delivery with autoVMAT instead of manVMAT were higher for empty bladder PTVs compared to full bladder PTVs, due to differences in concavity of the PTVs. Quality of automatically generated VMAT plans was superior to manually generated plans. Automatic VMAT plan generation for cervical cancer has been implemented in our clinical routine. Due to the achieved workload

  14. The validity of three tests of temperament in guppies (Poecilia reticulata).

    PubMed

    Burns, James G

    2008-11-01

    Differences in temperament (consistent differences among individuals in behavior) can have important effects on fitness-related activities such as dispersal and competition. However, evolutionary ecologists have put limited effort into validating their tests of temperament. This article attempts to validate three standard tests of temperament in guppies: the open-field test, emergence test, and novel-object test. Through multiple reliability trials, and comparison of results between different types of test, this study establishes the confidence that can be placed in these temperament tests. The open-field test is shown to be a good test of boldness and exploratory behavior; the open-field test was reliable when tested in multiple ways. There were problems with the emergence test and novel-object test, which leads one to conclude that the protocols used in this study should not be considered valid tests for this species. (PsycINFO Database Record (c) 2008 APA, all rights reserved).

  15. Development and validation of a treatment planning model for magnetic nanoparticle hyperthermia cancer therapy

    NASA Astrophysics Data System (ADS)

    Stigliano, Robert Vincent

    The use of magnetic nanoparticles (mNPs) to induce local hyperthermia has been emerging in recent years as a promising cancer therapy, in both a stand-alone and combination treatment setting, including surgery, radiation, and chemotherapy. The mNP solution can be injected either directly into the tumor, or administered intravenously. Studies have shown that some cancer cells associate with, internalize, and aggregate mNPs more preferentially than normal cells, with and without antibody targeting. Once the mNPs are delivered inside the cells, a low frequency (30-300 kHz) alternating electromagnetic field is used to activate the mNPs. The nanoparticles absorb the applied field and provide localized heat generation at nano-micron scales. Treatment planning models have been shown to improve treatment efficacy in radiation therapy by limiting normal tissue damage while maximizing dose to the tumor. To date, there does not exist a clinical treatment planning model for magnetic nanoparticle hyperthermia which is robust, validated, and commercially available. The focus of this research is on the development and experimental validation of a treatment planning model, consisting of a coupled electromagnetic and thermal model that predicts dynamic thermal distributions during treatment. When allowed to incubate, the mNPs are often sequestered by cancer cells and packed into endosomes. The proximity of the mNPs has a strong influence on their ability to heat due to interparticle magnetic interaction effects. A model of mNP heating which takes into account the effects of magnetic interaction was developed, and validated against experimental data. An animal study in mice was conducted to determine the effects of mNP solution injection duration and PEGylation on macroscale mNP distribution within the tumor, in order to further inform the treatment planning model and future experimental technique. In clinical applications, a critical limiting factor for the maximum applied field is

  16. Vibroacoustic test plan evaluation: Parameter variation study

    NASA Technical Reports Server (NTRS)

    Stahle, C. V.; Gongloef, H. R.

    1976-01-01

    Statistical decision models are shown to provide a viable method of evaluating the cost effectiveness of alternate vibroacoustic test plans and the associated test levels. The methodology developed provides a major step toward the development of a realistic tool to quantitatively tailor test programs to specific payloads. Testing is considered at the no test, component, subassembly, or system level of assembly. Component redundancy and partial loss of flight data are considered. Most and probabilistic costs are considered, and incipient failures resulting from ground tests are treated. Optimums defining both component and assembly test levels are indicated for the modified test plans considered. Modeling simplifications must be considered in interpreting the results relative to a particular payload. New parameters introduced were a no test option, flight by flight failure probabilities, and a cost to design components for higher vibration requirements. Parameters varied were the shuttle payload bay internal acoustic environment, the STS launch cost, the component retest/repair cost, and the amount of redundancy in the housekeeping section of the payload reliability model.

  17. Space telescope observatory management system preliminary test and verification plan

    NASA Technical Reports Server (NTRS)

    Fritz, J. S.; Kaldenbach, C. F.; Williams, W. B.

    1982-01-01

    The preliminary plan for the Space Telescope Observatory Management System Test and Verification (TAV) is provided. Methodology, test scenarios, test plans and procedure formats, schedules, and the TAV organization are included. Supporting information is provided.

  18. The Air Force Officer Qualifying Test: Validity, Fairness, and Bias

    DTIC Science & Technology

    2010-01-01

    scores. The Standards for Educational and Psychological Testing (AERA, APA, and NCME, 1999) provides a set of guidelines published and endorsed by the...determining the validity and bias of selection tests falls upon professionals in the discipline of industrial/organizational psychology 20 See Roper v. Dep’t...i). 30 The Air Force Officer Qualifying Test : Validity, Fairness, and Bias and closely related fields (e.g., educational psychology and

  19. SU-F-T-617: Remotely Pre-Planned Stereotactic Ablative Radiation Therapy: Validation of Treatment Plan Quality

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Juang, T; Bush, K; Loo, B

    Purpose: We propose a workflow to improve access to stereotactic ablative radiation therapy (SABR) for rural patients. When implemented, a separate trip to the central facility for simulation can be eliminated. Two elements are required: (1) Fabrication of custom immobilization devices to match positioning on prior diagnostic CT (dxCT). (2) Remote radiation pre-planning on dxCT, with transfer of contours/plan to simulation CT (simCT) and initiation of treatment same-day or next day. In this retrospective study, we validated part 2 of the workflow using patients already treated with SABR for upper lobe lung tumors. Methods: Target/normal structures were contoured on dxCT; a plan was created and approved by the physician. Structures were transferred to simCT using deformable image registration and the plan was re-optimized on simCT. Plan quality was evaluated through comparison to gold-standard structures contoured on simCT and a gold-standard plan based on these structures. Workflow-generated plan quality in this study represents a worst-case scenario as these patients were not treated using custom immobilization to match dxCT position as would be done when the workflow is implemented clinically. Results: 5/6 plans created through the pre-planning workflow were clinically acceptable. For all six plans, the gold-standard GTV received full prescription dose, along with median PTV V95%=95.2% and median PTV D95%=95.4%. Median GTV DSC=0.80, indicating high degree of similarity between the deformed and gold-standard GTV contours despite small GTV sizes (mean=3.0cc). One outlier (DSC=0.49) resulted in inadequate PTV coverage (V95%=62.9%) in the workflow plan; in clinical practice, this mismatch between deformed/gold-standard GTV would be revised by the physician after deformable registration. For all patients, normal tissue doses were comparable to the gold-standard plan and well within constraints. Conclusion: Pre-planning SABR cases on diagnostic imaging

  20. How to test validity in orthodontic research: a mixed dentition analysis example.

    PubMed

    Donatelli, Richard E; Lee, Shin-Jae

    2015-02-01

    The data used to test the validity of a prediction method should be different from the data used to generate the prediction model. In this study, we explored whether an independent data set is mandatory for testing the validity of a new prediction method and how validity can be tested without independent new data. Several validation methods were compared in an example using the data from a mixed dentition analysis with a regression model. The validation errors of real mixed dentition analysis data and simulation data were analyzed for increasingly large data sets. The validation results of both the real and the simulation studies demonstrated that the leave-1-out cross-validation method had the smallest errors. The largest errors occurred in the traditional simple validation method. The differences between the validation methods diminished as the sample size increased. The leave-1-out cross-validation method seems to be an optimal validation method for improving the prediction accuracy in a data set with limited sample sizes. Copyright © 2015 American Association of Orthodontists. Published by Elsevier Inc. All rights reserved.
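
    As a rough illustration of the leave-1-out cross-validation idea described in this abstract, the sketch below refits a simple least-squares line once per observation, each time holding that observation out and scoring the prediction for it. The function names (fit_line, loocv_mae), the choice of mean absolute error as the error measure, and the sample numbers are all assumptions made for illustration; they are not taken from the study.

```python
from statistics import mean


def fit_line(xs, ys):
    """Ordinary least-squares fit of y = a + b*x; returns (a, b)."""
    mx, my = mean(xs), mean(ys)
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    b = sxy / sxx
    return my - b * mx, b


def loocv_mae(xs, ys):
    """Mean absolute prediction error under leave-one-out cross-validation."""
    errors = []
    for i in range(len(xs)):
        # Train on every observation except the i-th one.
        train_x = xs[:i] + xs[i + 1:]
        train_y = ys[:i] + ys[i + 1:]
        a, b = fit_line(train_x, train_y)
        # Score the model on the single held-out observation.
        errors.append(abs(ys[i] - (a + b * xs[i])))
    return mean(errors)


# Hypothetical predictor/outcome pairs, invented for this example only.
xs = [21.0, 22.5, 23.0, 24.0, 25.5, 26.0]
ys = [20.1, 21.0, 21.6, 22.3, 23.2, 23.9]
print(round(loocv_mae(xs, ys), 3))
```

    Every observation is used for testing exactly once and for training in all other rounds, which is why the approach suits small samples where setting aside a separate test set would be costly.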

  1. [Argumentation and construction of validity in Carlos Matus' situational strategic planning].

    PubMed

    Rivera, Francisco Javier Uribe

    2011-09-01

    This study analyzes the process of producing a situational plan according to a benchmark from the philosophy of language and argumentation theory. The basic approach used in the analysis was developed by Carlos Matus. Specifically, the study seeks to identify the inherent argumentative structure and patterns in the situational explanation and regulatory design in a plan's operations, taking argumentative approaches from pragma-dialectics and informal logic as the analytical parameters. The explanation of a health problem is used to illustrate the study. Methodologically, the study is based on the existing literature on the subject and case analyses. The study concludes with the proposition that the use of the specific references means introducing greater rigor into both the analysis of the validity of causal arguments and the design of proposals for interventions, in order for them to be more conclusive in achieving a plan's objectives.

  2. ASTM Validates Air Pollution Test Methods

    ERIC Educational Resources Information Center

    Chemical and Engineering News, 1973

    1973-01-01

    The American Society for Testing and Materials (ASTM) has validated six basic methods for measuring pollutants in ambient air as the first part of its Project Threshold. Aim of the project is to establish nationwide consistency in measuring pollutants; determining precision, accuracy and reproducibility of 35 standard measuring methods. (BL)

  3. Soil Moisture Active Passive (SMAP) Calibration and Validation Plan and Current Activities

    NASA Technical Reports Server (NTRS)

    Jackson, T. J.; Cosh, M.; Bindlish, R.; Crow, W.; Colliander, A.; Njoku, E.; McDonald, K.; Kimball, J.; Belair, S.; Walker, J.

    2010-01-01

    The primary objective of the SMAP calibration and validation (Cal/Val) program is demonstrating that the science requirements (product accuracy and bias) have been met over the mission life. This begins during pre-launch with activities that contribute to high quality products and establishing post-launch validation infrastructure and continues through the mission life. However, the major focus is on a relatively short Cal/Val period following launch. The general approach and elements of the SMAP Cal/Val plan will be described, along with details on several ongoing or recent field experiments designed to address both near- and long-term Cal/Val.

  4. An Integrated Approach to Establish Validity and Reliability of Reading Tests

    ERIC Educational Resources Information Center

    Razi, Salim

    2012-01-01

    This study presents the processes of developing and establishing reliability and validity of a reading test by administering an integrative approach, as conventional reliability and validity measures only superficially reveal the difficulty of a reading test. In this respect, analysing vocabulary frequency of the test is regarded as a more eligible way…

  5. Applicability of action planning and coping planning to dental flossing among Norwegian adults: a confirmatory factor analysis approach.

    PubMed

    Astrøm, Anne Nordrehaug

    2008-06-01

    Using a prospective design and a representative sample of 25-yr-old Norwegians, this study hypothesized that action planning and coping planning will add to the prediction of flossing at 4 wk of follow-up over and above the effect of intention and previous flossing. This study tested the validity of a proposed 3-factor structure of the measurement model of intention, action planning, and coping planning and for its invariance across gender. A survey was conducted in three Norwegian counties, and 1,509 out of 8,000 randomly selected individuals completed questionnaires assessing the constructs of action planning and coping planning related to daily flossing. A random subsample of 500 participants was followed up at 4 wk with a telephone interview to assess flossing. Confirmatory factor analysis (CFA) confirmed the proposed 3-factor model after respecification. Although the chi-square test was statistically significant [chi(2) = 58.501, degrees of freedom (d.f.) = 17], complementary fit indices were satisfactory [goodness-of-fit index (GFI) = 0.99, root mean squared error of approximation (RMSEA) = 0.04]. Multigroup CFA provided evidence of complete invariance of the measurement model across gender. After controlling for previous flossing, intention (beta = 0.08) and action planning (beta = 0.11) emerged as independent predictors of subsequent flossing, accounting for 2.3% of its variance. Factorial validity of intention, action planning and coping planning, and the validity of action planning in predicting flossing prospectively, was confirmed by the present study.
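
    The reported RMSEA can be roughly checked from the chi-square statistic and its degrees of freedom. The sketch below uses one common textbook form, sqrt(max(chi2 - df, 0) / (df * (N - 1))); some software divides by N instead of N - 1. It also assumes that all 1,509 respondents entered the CFA, which the abstract does not state explicitly. The function name rmsea is chosen here for illustration.

```python
import math


def rmsea(chi_square, df, n):
    """Root mean squared error of approximation (N - 1 convention)."""
    # A chi-square below its degrees of freedom is truncated to zero misfit.
    return math.sqrt(max(chi_square - df, 0.0) / (df * (n - 1)))


# Values from the abstract; n = 1509 is an assumption about the analysed sample.
print(round(rmsea(58.501, 17, 1509), 3))  # prints 0.04
```

    Under these assumptions the result agrees with the RMSEA of 0.04 given in the abstract.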

  6. The CPT Reading Comprehension Test: A Validity Study.

    ERIC Educational Resources Information Center

    Napoli, Anthony R.; Raymond, Lanette A.; Coffey, Cheryl A.; Bosco, Diane M.

    1998-01-01

    Describes a study done at Suffolk County Community College (New York) that assessed the validity of the College Board's Computerized Placement Test in Reading Comprehension (CPT-R) by comparing test results of 1,154 freshmen with the results of the Degree of Power Reading Test. Results confirmed the CPT-R's reliability in identifying basic…

  7. Validity and Reliability of a Medicine Ball Explosive Power Test.

    ERIC Educational Resources Information Center

    Stockbrugger, Barry A.; Haennel, Robert G.

    2001-01-01

    Evaluated the validity and reliability of a medicine ball throw test to evaluate explosive power. Data on competitive sand volleyball players who performed a medicine ball throw and a standard countermovement jump indicated that the medicine ball throw test was a valid and reliable way to assess explosive power for an analogous total-body movement…

  8. Physics Goals for the Planned Next Linear Collider Engineering Test Facility

    NASA Astrophysics Data System (ADS)

    Raubenheimer, T. O.

    2001-10-01

    The Next Linear Collider (NLC) Collaboration is planning to construct an Engineering Test Facility (ETF) at Fermilab. As presently envisioned, the ETF would comprise a fundamental unit of the NLC main linac to include X-band klystrons and modulators, a delay-line power-distribution system (DLDS), and NLC accelerating structures that serve as loads. The principal purpose of the ETF is to validate stable operation of the power-distribution system, first without beam, then with a beam having the NLC pulse structure. This paper concerns the possibility of configuring and using the ETF to accelerate beam with an NLC pulse structure, as well as of doing experiments to measure beam-induced wakefields in the rf structures and their influence back on the beam.

  9. 49 CFR 238.111 - Pre-revenue service acceptance testing plan.

    Code of Federal Regulations, 2010 CFR

    2010-10-01

    ... the times and places of the pre-revenue service tests to permit FRA observation of such tests. For... 49 Transportation 4 2010-10-01 2010-10-01 false Pre-revenue service acceptance testing plan. 238... and General Requirements § 238.111 Pre-revenue service acceptance testing plan. (a) Passenger...

  10. 49 CFR 238.111 - Pre-revenue service acceptance testing plan.

    Code of Federal Regulations, 2011 CFR

    2011-10-01

    ... the times and places of the pre-revenue service tests to permit FRA observation of such tests. For... 49 Transportation 4 2011-10-01 2011-10-01 false Pre-revenue service acceptance testing plan. 238... and General Requirements § 238.111 Pre-revenue service acceptance testing plan. (a) Passenger...

  11. 49 CFR 238.111 - Pre-revenue service acceptance testing plan.

    Code of Federal Regulations, 2012 CFR

    2012-10-01

    ... the times and places of the pre-revenue service tests to permit FRA observation of such tests. For... 49 Transportation 4 2012-10-01 2012-10-01 false Pre-revenue service acceptance testing plan. 238... and General Requirements § 238.111 Pre-revenue service acceptance testing plan. (a) Passenger...

  12. 49 CFR 238.111 - Pre-revenue service acceptance testing plan.

    Code of Federal Regulations, 2014 CFR

    2014-10-01

    ... the times and places of the pre-revenue service tests to permit FRA observation of such tests. For... 49 Transportation 4 2014-10-01 2014-10-01 false Pre-revenue service acceptance testing plan. 238... and General Requirements § 238.111 Pre-revenue service acceptance testing plan. (a) Passenger...

  13. 49 CFR 238.111 - Pre-revenue service acceptance testing plan.

    Code of Federal Regulations, 2013 CFR

    2013-10-01

    ... the times and places of the pre-revenue service tests to permit FRA observation of such tests. For... 49 Transportation 4 2013-10-01 2013-10-01 false Pre-revenue service acceptance testing plan. 238... and General Requirements § 238.111 Pre-revenue service acceptance testing plan. (a) Passenger...

  14. Test Plan for the Boiling Water Reactor Dry Cask Simulator

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Durbin, Samuel; Lindgren, Eric R.

    The thermal performance of commercial nuclear spent fuel dry storage casks is evaluated through detailed numerical analysis. These modeling efforts are completed by the vendor to demonstrate performance and regulatory compliance. The calculations are then independently verified by the Nuclear Regulatory Commission (NRC). Carefully measured data sets generated from testing of full sized casks or smaller cask analogs are widely recognized as vital for validating these models. Recent advances in dry storage cask designs have significantly increased the maximum thermal load allowed in a cask in part by increasing the efficiency of internal conduction pathways and by increasing the internal convection through greater canister helium pressure. These same vertical, canistered cask systems rely on ventilation between the canister and the overpack to convect heat away from the canister to the environment for both above- and below-ground configurations. While several testing programs have been previously conducted, these earlier validation attempts did not capture the effects of elevated helium pressures or accurately portray the external convection of above-ground and below-ground canistered dry cask systems. The purpose of the investigation described in this report is to produce a data set that can be used to test the validity of the assumptions associated with the calculations presently used to determine steady-state cladding temperatures in modern vertical, canistered dry cask systems. The BWR cask simulator (BCS) has been designed in detail for both the above-ground and below-ground venting configurations. The pressure vessel representing the canister has been designed, fabricated, and pressure tested for a maximum allowable working pressure (MAWP) rating of 24 bar at 400 deg C. An existing electrically heated but otherwise prototypic BWR Incoloy-clad test assembly is being deployed inside of a representative storage basket and cylindrical pressure vessel that

  15. Project Plan: Salt in Situ Heater Test.

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Kuhlman, Kristopher L.; Mills, Melissa Marie; Herrick, Courtney G.

    This project plan gives a high-level description of the US Department of Energy Office of Nuclear Energy (DOE-NE) Spent Fuel and Waste Disposition (SFWD) campaign in situ borehole heater test project being planned for the Waste Isolation Pilot Plant (WIPP) site. This plan provides an overview of the schedule and responsibilities of the parties involved. This project is a collaborative effort by Sandia, Los Alamos, and Lawrence Berkeley National Laboratories to execute a series of small-diameter borehole heater tests in salt for the DOE-NE SFWD campaign. Design of a heater test in salt at WIPP has evolved over several years. The current design was completed in fiscal year 2017 (FY17), an equipment shakedown experiment is underway in April FY18, and the test implementation will begin in summer of FY18. The project comprises a suite of modular tests, which consist of a group of nearby boreholes in the wall of drifts at WIPP. Each test is centered around a packer-isolated heated borehole (5" diameter) containing equipment for water-vapor collection and brine sampling, surrounded by smaller-diameter (2" diameter) satellite observation boreholes. Observation boreholes will contain temperature sensors, tracer release points, electrical resistivity tomography (ERT) sensors, fiber optic sensing, acoustic emission (AE) measurements, and sonic velocity sources and sensors. These satellite boreholes will also be used for plugging/sealing tests. The first two tests to be implemented will have the packer-isolated borehole heated to 120°C, with one observation borehole used to monitor changes. Follow-on tests will be designed using information gathered from the first two tests, will be conducted at other temperatures, will use multiple observation boreholes, and may include other measurement types and test designs.

  16. Solar Sail Models and Test Measurements Correspondence for Validation Requirements Definition

    NASA Technical Reports Server (NTRS)

    Ewing, Anthony; Adams, Charles

    2004-01-01

    Solar sails are being developed as a mission-enabling technology in support of future NASA science missions. Current efforts have advanced solar sail technology sufficient to justify a flight validation program. A primary objective of this activity is to test and validate solar sail models that are currently under development so that they may be used with confidence in future science mission development (e.g., scalable to larger sails). Both system and model validation requirements must be defined early in the program to guide design cycles and to ensure that relevant and sufficient test data will be obtained to conduct model validation to the level required. A process of model identification, model input/output documentation, model sensitivity analyses, and test measurement correspondence is required so that decisions can be made to satisfy validation requirements within program constraints.

  17. Construction and Evaluation of Reliability and Validity of Reasoning Ability Test

    ERIC Educational Resources Information Center

    Bhat, Mehraj A.

    2014-01-01

    This paper is based on the construction and evaluation of the reliability and validity of a reasoning ability test for secondary school students. In this paper an attempt was made to evaluate validity and reliability and to determine the appropriate standards to interpret the results of the reasoning ability test. The test includes 45 items to measure six types…

  18. Test Anxiety and the Validity of Cognitive Tests: A Confirmatory Factor Analysis Perspective and Some Empirical Findings

    ERIC Educational Resources Information Center

    Wicherts, Jelte M.; Scholten, Annemarie Zand

    2010-01-01

    The validity of cognitive ability tests is often interpreted solely as a function of the cognitive abilities that these tests are supposed to measure, but other factors may be at play. The effects of test anxiety on the criterion-related validity (CRV) of tests were the topic of a recent study by Reeve, Heggestad, and Lievens (2009) (Reeve, C. L.,…

  19. 40 CFR 610.24 - Validity of test data.

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    ... 40 Protection of Environment 30 2011-07-01 2011-07-01 false Validity of test data. 610.24 Section 610.24 Protection of Environment ENVIRONMENTAL PROTECTION AGENCY (CONTINUED) ENERGY POLICY FUEL ECONOMY RETROFIT DEVICES Test Procedures and Evaluation Criteria Evaluation Criteria for the Preliminary...

  20. 40 CFR 610.24 - Validity of test data.

    Code of Federal Regulations, 2012 CFR

    2012-07-01

    ... 40 Protection of Environment 31 2012-07-01 2012-07-01 false Validity of test data. 610.24 Section 610.24 Protection of Environment ENVIRONMENTAL PROTECTION AGENCY (CONTINUED) ENERGY POLICY FUEL ECONOMY RETROFIT DEVICES Test Procedures and Evaluation Criteria Evaluation Criteria for the Preliminary...

  1. 40 CFR 610.24 - Validity of test data.

    Code of Federal Regulations, 2014 CFR

    2014-07-01

    ... 40 Protection of Environment 30 2014-07-01 2014-07-01 false Validity of test data. 610.24 Section 610.24 Protection of Environment ENVIRONMENTAL PROTECTION AGENCY (CONTINUED) ENERGY POLICY FUEL ECONOMY RETROFIT DEVICES Test Procedures and Evaluation Criteria Evaluation Criteria for the Preliminary...

  2. 40 CFR 610.24 - Validity of test data.

    Code of Federal Regulations, 2013 CFR

    2013-07-01

    ... 40 Protection of Environment 31 2013-07-01 2013-07-01 false Validity of test data. 610.24 Section 610.24 Protection of Environment ENVIRONMENTAL PROTECTION AGENCY (CONTINUED) ENERGY POLICY FUEL ECONOMY RETROFIT DEVICES Test Procedures and Evaluation Criteria Evaluation Criteria for the Preliminary...

  3. 40 CFR 610.24 - Validity of test data.

    Code of Federal Regulations, 2010 CFR

    2010-07-01

    ... 40 Protection of Environment 29 2010-07-01 2010-07-01 false Validity of test data. 610.24 Section 610.24 Protection of Environment ENVIRONMENTAL PROTECTION AGENCY (CONTINUED) ENERGY POLICY FUEL ECONOMY RETROFIT DEVICES Test Procedures and Evaluation Criteria Evaluation Criteria for the Preliminary...

  4. Test validity and performance validity: considerations in providing a framework for development of an ability-focused neuropsychological test battery.

    PubMed

    Larrabee, Glenn J

    2014-11-01

    Literature on test validity and performance validity is reviewed to propose a framework for specification of an ability-focused battery (AFB). Factor analysis supports six domains of ability: first, verbal symbolic; secondly, visuoperceptual and visuospatial judgment and problem solving; thirdly, sensorimotor skills; fourthly, attention/working memory; fifthly, processing speed; finally, learning and memory (which can be divided into verbal and visual subdomains). The AFB should include at least three measures for each of the six domains, selected based on various criteria for validity including sensitivity to presence of disorder, sensitivity to severity of disorder, correlation with important activities of daily living, and containing embedded/derived measures of performance validity. Criterion groups should include moderate and severe traumatic brain injury, and Alzheimer's disease. Validation groups should also include patients with left and right hemisphere stroke, to determine measures sensitive to lateralized cognitive impairment and so that the moderating effects of auditory comprehension impairment and neglect can be analyzed on AFB measures. © The Author 2014. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  5. Field Evaluations Test Plan for Validation of Alternative Low-Emission Surface Preparation/Depainting Technologies for Structural Steel

    NASA Technical Reports Server (NTRS)

    Lewis, Pattie

    2005-01-01

    and approve alternative surface preparation technologies for use at NASA and AFSPC installations. Materials and processes will be evaluated with the goal of selecting those processes that will improve corrosion protection at critical systems, facilitate easier maintenance activity, extend maintenance cycles, eliminate flight hardware contamination and reduce the amount of hazardous waste generated. This Field Evaluations Test Plan defines the field evaluation and testing requirements for validating alternative surface preparation/depainting technologies and supplements the JTP. The field evaluations will be performed at Stennis Space Center, Mississippi, under the oversight of the Project Engineer. Additional field evaluations may be performed at other NASA centers or AFSPC facilities.

  6. An Investigation of the Effectiveness and Validity of Planning Time in Speaking Test Tasks

    ERIC Educational Resources Information Center

    Wigglesworth, Gillian; Elder, Cathie

    2010-01-01

    The study described in this article investigated the relationship between three variables in the IELTS oral module--planning, proficiency, and task--and was designed to enhance our understanding of how or whether these variables interact. The study aimed to determine whether differences in performance resulted from 1 or 2 min of planning time. It…

  7. Validation of Statistical Sampling Algorithms in Visual Sample Plan (VSP): Summary Report

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Nuffer, Lisa L; Sego, Landon H.; Wilson, John E.

    2009-02-18

    The U.S. Department of Homeland Security, Office of Technology Development (OTD) contracted with a set of U.S. Department of Energy national laboratories, including the Pacific Northwest National Laboratory (PNNL), to write a Remediation Guidance for Major Airports After a Chemical Attack. The report identifies key activities and issues that should be considered by a typical major airport following an incident involving release of a toxic chemical agent. Four experimental tasks were identified that would require further research in order to supplement the Remediation Guidance. One of the tasks, Task 4, OTD Chemical Remediation Statistical Sampling Design Validation, dealt with statistical sampling algorithm validation. This report documents the results of the sampling design validation conducted for Task 4. In 2005, the Government Accountability Office (GAO) performed a review of the past U.S. responses to Anthrax terrorist cases. Part of the motivation for this PNNL report was a major GAO finding that there was a lack of validated sampling strategies in the U.S. response to Anthrax cases. The report (GAO 2005) recommended that probability-based methods be used for sampling design in order to address confidence in the results, particularly when all sample results showed no remaining contamination. The GAO also expressed a desire that the methods be validated, which is the main purpose of this PNNL report. The objective of this study was to validate probability-based statistical sampling designs and the algorithms pertinent to within-building sampling that allow the user to prescribe or evaluate confidence levels of conclusions based on data collected as guided by the statistical sampling designs. Specifically, the designs found in the Visual Sample Plan (VSP) software were evaluated. VSP was used to calculate the number of samples and the sample location for a variety of sampling plans applied to an actual release site. Most of the sampling designs

  8. Application of validity theory and methodology to patient-reported outcome measures (PROMs): building an argument for validity.

    PubMed

    Hawkins, Melanie; Elsworth, Gerald R; Osborne, Richard H

    2018-07-01

    Data from subjective patient-reported outcome measures (PROMs) are now being used in the health sector to make or support decisions about individuals, groups and populations. Contemporary validity theorists define validity not as a statistical property of the test but as the extent to which empirical evidence supports the interpretation of test scores for an intended use. However, validity testing theory and methodology are rarely evident in the PROM validation literature. Application of this theory and methodology would provide structure for comprehensive validation planning to support improved PROM development and sound arguments for the validity of PROM score interpretation and use in each new context. This paper proposes the application of contemporary validity theory and methodology to PROM validity testing. The validity testing principles will be applied to a hypothetical case study with a focus on the interpretation and use of scores from a translated PROM that measures health literacy (the Health Literacy Questionnaire or HLQ). Although robust psychometric properties of a PROM are a pre-condition to its use, a PROM's validity lies in the sound argument that a network of empirical evidence supports the intended interpretation and use of PROM scores for decision making in a particular context. The health sector is yet to apply contemporary theory and methodology to PROM development and validation. The theoretical and methodological processes in this paper are offered as an advancement of the theory and practice of PROM validity testing in the health sector.

  9. Validation through Understanding Test-Taking Strategies: An Illustration With the CELPIP-General Reading Pilot Test Using Structural Equation Modeling

    ERIC Educational Resources Information Center

    Wu, Amery D.; Stone, Jake E.

    2016-01-01

    This article explores an approach for test score validation that examines test takers' strategies for taking a reading comprehension test. The authors formulated three working hypotheses about score validity pertaining to three types of test-taking strategy (comprehending meaning, test management, and test-wiseness). These hypotheses were…

  10. The Anomalous Sentences Repetition Test: Replication and Validation Study.

    ERIC Educational Resources Information Center

    Weeks, David J.

    1986-01-01

    Presents a brief clinical test, derived from earlier neuropsychological instruments, with evidence for its reliability, interscorer agreement, and validity. The latter is based upon correlations with both CAT scan measures of cortical atrophy and ventricular enlargement, as well as correlations with seven other previously validated cognitive…

  11. Reliability and Validity of the Inline Skating Skill Test

    PubMed Central

    Radman, Ivan; Ruzic, Lana; Padovan, Viktoria; Cigrovski, Vjekoslav; Podnar, Hrvoje

    2016-01-01

    This study aimed to examine the reliability and validity of the inline skating skill test. Based on previous skating experience forty-two skaters (26 female and 16 male) were randomized into two groups (competitive level vs. recreational level). They performed the test four times, with a recovery time of 45 minutes between sessions. Prior to testing, the participants rated their skating skill using a scale from 1 to 10. The protocol included performance time measurement through a course, combining different skating techniques. Trivial changes in performance time between the repeated sessions were determined in both competitive females/males and recreational females/males (-1.7% [95% CI: -5.8–2.6%] – 2.2% [95% CI: 0.0–4.5%]). In all four subgroups, the skill test had a low mean within-individual variation (1.6% [95% CI: 1.2–2.4%] – 2.7% [95% CI: 2.1–4.0%]) and high mean inter-session correlation (ICC = 0.97 [95% CI: 0.92–0.99] – 0.99 [95% CI: 0.98–1.00]). The comparison of detected typical errors and smallest worthwhile changes (calculated as standard deviations × 0.2) revealed that the skill test was able to track changes in skaters’ performances. Competitive-level skaters needed shorter time (24.4–26.4%, all p < 0.01) to complete the test in comparison to recreational-level skaters. Moreover, moderate correlation (ρ = 0.80–0.82; all p < 0.01) was observed between the participant’s self-rating and achieved performance times. In conclusion, the proposed test is a reliable and valid method to evaluate inline skating skills in amateur competitive and recreational level skaters. Further studies are needed to evaluate the reproducibility of this skill test in different populations including elite inline skaters. Key points: The study evaluated the reliability and construct validity of a newly developed inline skating skill test. The evaluated test is a first protocol designed to assess specific inline skating skill. Two groups of amateur skaters with
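
    The comparison of typical error and smallest worthwhile change mentioned in the record above can be illustrated with a short sketch. The 0.2 × standard deviation rule is taken from the abstract; the typical-error formula (standard deviation of the between-session differences divided by √2) is a commonly used definition and is an assumption here, since the abstract does not say how the authors computed it. The function names and the performance times are hypothetical.

```python
from math import sqrt
from statistics import stdev


def smallest_worthwhile_change(scores):
    """SWC as 0.2 x between-subject standard deviation (rule stated in the abstract)."""
    return 0.2 * stdev(scores)


def typical_error(session_a, session_b):
    """Typical error as SD of paired differences / sqrt(2) (assumed definition)."""
    diffs = [b - a for a, b in zip(session_a, session_b)]
    return stdev(diffs) / sqrt(2)


# Hypothetical performance times in seconds for five skaters over two sessions.
session_1 = [60.0, 65.0, 70.0, 75.0, 80.0]
session_2 = [60.5, 64.5, 70.5, 74.5, 80.5]

swc = smallest_worthwhile_change(session_1)
te = typical_error(session_1, session_2)

# A test is usually considered able to track meaningful change
# when its typical error does not exceed the smallest worthwhile change.
can_track_change = te <= swc
```

    With these made-up numbers the typical error is smaller than the smallest worthwhile change, which is the pattern the abstract reports for the skill test.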

  12. Performance Validity Testing in Neuropsychology: Scientific Basis and Clinical Application-A Brief Review.

    PubMed

    Greher, Michael R; Wodushek, Thomas R

    2017-03-01

    Performance validity testing refers to neuropsychologists' methodology for determining whether neuropsychological test performances completed in the course of an evaluation are valid (ie, the results of true neurocognitive function) or invalid (ie, overly impacted by the patient's effort/engagement in testing). This determination relies upon the use of either standalone tests designed for this sole purpose, or specific scores/indicators embedded within traditional neuropsychological measures that have demonstrated this utility. In response to a greater appreciation for the critical role that performance validity issues play in neuropsychological testing and the need to measure this variable to the best of our ability, the scientific base for performance validity testing has expanded greatly over the last 20 to 30 years. As such, the majority of current day neuropsychologists in the United States use a variety of measures for the purpose of performance validity testing as part of everyday forensic and clinical practice and address this issue directly in their evaluations. The following is the first article of a 2-part series that will address the evolution of performance validity testing in the field of neuropsychology, both in terms of the science as well as the clinical application of this measurement technique. The second article of this series will review performance validity tests in terms of methods for development of these measures, and maximizing of diagnostic accuracy.

  13. Validity of the Eating Attitude Test among Exercisers.

    PubMed

    Lane, Helen J; Lane, Andrew M; Matheson, Hilary

    2004-12-01

    Theory testing and construct measurement are inextricably linked. To date, no published research has looked at the factorial validity of an existing eating attitude inventory for use with exercisers. The Eating Attitude Test (EAT) is a 26-item measure that yields a single index of disordered eating attitudes. The original factor analysis showed three interrelated factors: dieting behavior (13 items), oral control (7 items), and bulimia nervosa-food preoccupation (6 items). The primary purpose of the study was to examine the factorial validity of the EAT among a sample of exercisers. The second purpose was to investigate relationships between eating attitudes scores and selected psychological constructs. In stage one, 598 regular exercisers completed the EAT. Confirmatory factor analysis (CFA) was used to test a single-factor model, a three-factor model, and a four-factor model, which distinguished bulimia from food pre-occupation. CFA of the single-factor model (RCFI = 0.66, RMSEA = 0.10) and the three-factor model (RCFI = 0.74; RMSEA = 0.09) showed poor model fit. There was marginal fit for the four-factor model (RCFI = 0.91, RMSEA = 0.06). Results indicated that five items showed poor factor loadings. After these five items were discarded, the three models were re-analyzed. CFA results indicated that the single-factor model (RCFI = 0.76, RMSEA = 0.10) and three-factor model (RCFI = 0.82, RMSEA = 0.08) showed poor fit. CFA results for the four-factor model showed acceptable fit indices (RCFI = 0.98, RMSEA = 0.06). Stage two explored relationships between EAT scores, mood, self-esteem, and motivational indices toward exercise in terms of self-determination, enjoyment and competence. Correlation results indicated that depressed mood scores positively correlated with bulimia and dieting scores. Further, dieting was inversely related with self-determination toward exercising. Collectively, findings suggest that a 21-item four-factor model shows promising validity coefficients among

  14. Validity and Reliability of Baseline Testing in a Standardized Environment.

    PubMed

    Higgins, Kathryn L; Caze, Todd; Maerlender, Arthur

    2017-08-11

    The Immediate Postconcussion Assessment and Cognitive Testing (ImPACT) is a computerized neuropsychological test battery commonly used to determine cognitive recovery from concussion based on comparing post-injury scores to baseline scores. This model is based on the premise that ImPACT baseline test scores are a valid and reliable measure of optimal cognitive function at baseline. Growing evidence suggests that this premise may not be accurate and that a large contributor to invalid and unreliable baseline test scores may be the protocol and environment in which baseline tests are administered. This study examined the effects of a standardized environment and administration protocol on the reliability and performance validity of athletes' baseline test scores on ImPACT by comparing scores obtained in two different group-testing settings. Three hundred sixty-one Division 1 cohort-matched collegiate athletes' baseline data were assessed using a variety of indicators of potential performance invalidity; internal reliability was also examined. Thirty-one to thirty-nine percent of the baseline cases had at least one indicator of low performance validity, but there were no significant differences in validity indicators based on the environment in which the testing was conducted. Internal consistency reliability scores were in the acceptable to good range, with no significant differences between administration conditions. These results suggest that athletes may be reliably performing at levels lower than their best effort would produce. © The Author 2017. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  15. Test Series 2. 2: Detailed Test Plan

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Not Available

    Test Series 2.2 comprises the third sub-series of tests to be scheduled as a part of Test Series 2, the second stage of the combustion research program to be carried out at the Grimethorpe Experimental Pressurized Fluidized Bed Combustion Facility. Test Series 2.1, the first sub-series of tests, was completed in February 1983, and the first half of the second sub-series, Test Series 2.3, in October 1983. Test Series 2.2 is to consist of 350 data gathering hours, which it is hoped to complete within 560 coal burning hours. This document provides a brief description of the Facility and modifications which have been made following the completion of Test Series 2.1. No further modifications were made following the completion of the first half of Test Series 2.3. The operating requirements are specified. The tests will be performed using a UK coal (Kiveton Park), and a UK limestone (Middleton) both nominated by the FRG. Nine objectives are proposed which are to be fulfilled by thirteen test conditions. Six part load tests are included, as defined by Kraftwerk Union AG. The cascade is expected to be on line for each test condition and total cascade exposure is expected to be in excess of 450 hours. Details of sampling and special measurements are given. A test plan schedule envisages the test series being completed within a two month calendar period. Finally, a number of contingency strategies are proposed.

  16. K(3)EDTA Vacuum Tubes Validation for Routine Hematological Testing.

    PubMed

    Lima-Oliveira, Gabriel; Lippi, Giuseppe; Salvagno, Gian Luca; Montagnana, Martina; Poli, Giovanni; Solero, Giovanni Pietro; Picheth, Geraldo; Guidi, Gian Cesare

    2012-01-01

    Background and Objective. Some in vitro diagnostic devices (e.g., blood collection vacuum tubes and syringes for blood analyses) are not validated before the quality laboratory managers decide to start using them or to change the brand. Frequently, the laboratory or hospital managers select the vacuum tubes for blood collection based on cost considerations or on relevance of a brand. The aim of this study was to validate two dry K(3)EDTA vacuum tubes of different brands for routine hematological testing. Methods. Blood specimens from 100 volunteers in two different K(3)EDTA vacuum tubes were collected by a single, expert phlebotomist. The routine hematological testing was done on Advia 2120i hematology system. The significance of the differences between samples was assessed by paired Student's t-test after checking for normality. The level of statistical significance was set at P < 0.05. Results and Conclusions. The tubes of different brands that were evaluated can represent a clinically relevant source of variation only for mean platelet volume (MPV) and platelet distribution width (PDW). Basically, our validation will permit the laboratory or hospital managers to select validated vacuum tubes of a given brand according to their technical or economic reasons for routine hematological tests.
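
    As a rough illustration of the paired comparison described in the record above, the following sketch computes the paired Student's t statistic for the same specimens measured in two tube brands. It is not the authors' analysis: the function name and the measurements are hypothetical, the normality check mentioned in the abstract is omitted, and the resulting statistic would still have to be compared against a t distribution with n - 1 degrees of freedom to obtain a P value.

```python
from math import sqrt
from statistics import mean, stdev


def paired_t_statistic(tube_a, tube_b):
    """Return (t, degrees_of_freedom) for paired measurements.

    t = mean(d) / (sd(d) / sqrt(n)), where d are the per-subject differences.
    """
    if len(tube_a) != len(tube_b):
        raise ValueError("paired samples must have the same length")
    diffs = [a - b for a, b in zip(tube_a, tube_b)]
    n = len(diffs)
    standard_error = stdev(diffs) / sqrt(n)
    return mean(diffs) / standard_error, n - 1


# Hypothetical analyte values for four volunteers, one value per tube brand.
brand_a = [5.0, 6.0, 7.0, 9.0]
brand_b = [4.0, 6.0, 5.0, 8.0]

t_stat, dof = paired_t_statistic(brand_a, brand_b)
```

    Pairing by volunteer removes between-subject variability from the comparison, which is why a paired test suits a design in which each volunteer contributes one specimen per tube brand.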

  17. Chamber B Thermal/Vacuum Chamber: User Test Planning Guide

    NASA Technical Reports Server (NTRS)

    Montz, Mike E.

    2012-01-01

    Test process, milestones and inputs are unknowns to first-time users of Chamber B. The User Test Planning Guide aids in establishing expectations for both NASA and non-NASA facility customers. The potential audience for this guide includes both internal and commercial spaceflight hardware/software developers. It is intended to assist their test engineering personnel in test planning and execution. Material covered includes a roadmap of the test process, roles and responsibilities of facility and user, major milestones, facility capabilities, and inputs required by the facility. Samples of deliverables, test article interfaces, and inputs necessary to define test scope, cost, and schedule are included as an appendix to the guide.

  18. Commentary on "Validating the Interpretations and Uses of Test Scores"

    ERIC Educational Resources Information Center

    Brennan, Robert L.

    2013-01-01

    Kane's paper "Validating the Interpretations and Uses of Test Scores" is the most complete and clearest discussion yet available of the argument-based approach to validation. At its most basic level, validation as formulated by Kane is fundamentally a simply-stated two-step enterprise: (1) specify the claims inherent in a particular interpretation…

  19. A Multi-Year Plan for Research, Development, and Prototype Testing of Standard Modular Hydropower Technology

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Smith, Brennan T.; Welch, Tim; Witt, Adam M.

    The Multi-Year Plan for Research, Development, and Prototype Testing of Standard Modular Hydropower Technology (MYRP) presents a strategy for specifying, designing, testing, and demonstrating the efficacy of standard modular hydropower (SMH) as an environmentally compatible and cost-optimized renewable electricity generation technology. The MYRP provides the context, background, and vision for testing the SMH hypothesis: if standardization, modularity, and preservation of stream functionality become essential and fully realized features of hydropower technology, project design, and regulatory processes, they will enable previously unrealized levels of new project development with increased acceptance, reduced costs, increased predictability of outcomes, and increased value to stakeholders. To achieve success in this effort, the MYRP outlines a framework of stakeholder-validated criteria, models, design tools, testing facilities, and assessment protocols that will facilitate the development of next-generation hydropower technologies.

  20. Electromagnetic Interference/Compatibility (EMI/EMC) Control Test and Measurement Facility: User Test Planning Guide

    NASA Technical Reports Server (NTRS)

    Scully, Robert C.

    2011-01-01

    Test process, milestones and inputs are unknowns to first-time users of the EMI/EMC Test Facility. The User Test Planning Guide aids in establishing expectations for both NASA and non-NASA facility customers. The potential audience for this guide includes both internal and commercial spaceflight hardware/software developers. It is intended to assist their test engineering personnel in test planning and execution. Material covered includes a roadmap of the test process, roles and responsibilities of facility and user, major milestones, facility capabilities, and inputs required by the facility. Samples of deliverables, test article interfaces, and inputs necessary to define test scope, cost, and schedule are included as an appendix to the guide.

  1. Testing-Based Compiler Validation for Synchronous Languages

    NASA Technical Reports Server (NTRS)

    Garoche, Pierre-Loic; Howar, Falk; Kahsai, Temesghen; Thirioux, Xavier

    2014-01-01

    In this paper we present a novel lightweight approach to validate compilers for synchronous languages. Instead of verifying a compiler for all input programs or providing a fixed suite of regression tests, we extend the compiler to generate a test-suite with high behavioral coverage and geared towards discovery of faults for every compiled artifact. We have implemented and evaluated our approach using a compiler from Lustre to C.

  2. Validation of the Information/Communications Technology Literacy Test

    DTIC Science & Technology

    2016-10-01

    nested set. Table 11 presents the results of incremental validity analyses for job knowledge/performance criteria by MOS. Figure 7 presents much...Systems Operator-Analyst (25B) and Nodal Network Systems Operator-Maintainer (25N) MOS. This report documents technical procedures and results of the...research effort. Results suggest that the ICTL test has potential as a valid and highly efficient predictor of valued outcomes in Signal school MOS. Not

  3. Validation of Metagenomic Next-Generation Sequencing Tests for Universal Pathogen Detection.

    PubMed

    Schlaberg, Robert; Chiu, Charles Y; Miller, Steve; Procop, Gary W; Weinstock, George

    2017-06-01

    Metagenomic sequencing can be used for detection of any pathogens using unbiased, shotgun next-generation sequencing (NGS), without the need for sequence-specific amplification. Proof-of-concept has been demonstrated in infectious disease outbreaks of unknown causes and in patients with suspected infections but negative results for conventional tests. Metagenomic NGS tests hold great promise to improve infectious disease diagnostics, especially in immunocompromised and critically ill patients. To discuss challenges and provide example solutions for validating metagenomic pathogen detection tests in clinical laboratories. A summary of current regulatory requirements, largely based on prior guidance for NGS testing in constitutional genetics and oncology, is provided. Examples from 2 separate validation studies are provided for steps from assay design, and validation of wet bench and bioinformatics protocols, to quality control and assurance. Although laboratory and data analysis workflows are still complex, metagenomic NGS tests for infectious diseases are increasingly being validated in clinical laboratories. Many parallels exist to NGS tests in other fields. Nevertheless, specimen preparation, rapidly evolving data analysis algorithms, and incomplete reference sequence databases are idiosyncratic to the field of microbiology and often overlooked.

  4. Development and Validation of a Test for Bulimia.

    ERIC Educational Resources Information Center

    Smith, Marcia C.; Thelen, Mark H.

    1984-01-01

    Developed the Bulimia Test (BULIT) based on responses of clinically identified females (N=18) and normal female college students (N=119) to preliminary test items. Results showed that the BULIT provided an objective, reliable, and valid measure by which to identify individuals with symptoms of bulimia. (Instrument is appended.) (LLL)

  5. Physics Goals for the Planned Next Linear Collider Engineering Test Facility

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Raubenheimer, Tor O

    2001-10-02

    The Next Linear Collider (NLC) Collaboration is planning to construct an Engineering Test Facility (ETF) at Fermilab. As presently envisioned, the ETF would comprise a fundamental unit of the NLC main linac to include X-band klystrons and modulators, a delay-line power-distribution system (DLDS), and NLC accelerating structures that serve as loads. The principal purpose of the ETF is to validate stable operation of the power-distribution system, first without beam, then with a beam having the NLC pulse structure. This paper concerns the possibility of configuring and using the ETF to accelerate beam with an NLC pulse structure, as well as of doing experiments to measure beam-induced wakefields in the rf structures and their influence back on the beam.

  6. Physics goals for the planned next linear collider engineering test facility

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Courtlandt L Bohn et al.

    2001-06-26

    The Next Linear Collider (NLC) Collaboration is planning to construct an Engineering Test Facility (ETF) at Fermilab. As presently envisioned, the ETF would comprise a fundamental unit of the NLC main linac to include X-band klystrons and modulators, a delay-line power-distribution system (DLDS), and NLC accelerating structures that serve as loads. The principal purpose of the ETF is to validate stable operation of the power-distribution system, first without beam, then with a beam having the NLC pulse structure. This paper concerns the possibility of configuring and using the ETF to accelerate beam with an NLC pulse structure, as well as of doing experiments to measure beam-induced wakefields in the rf structures and their influence back on the beam.

  7. Physics goals for the planned next linear collider engineering test facility.

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Bohn, C.; Michelotti, L.; Ostiguy, J.-F.

    2001-07-17

    The Next Linear Collider (NLC) Collaboration is planning to construct an Engineering Test Facility (ETF) at Fermilab. As presently envisioned, the ETF would comprise a fundamental unit of the NLC main linac to include X-band klystrons and modulators, a delay-line power-distribution system (DLDS), and NLC accelerating structures that serve as loads. The principal purpose of the ETF is to validate stable operation of the power-distribution system, first without beam, then with a beam having the NLC pulse structure. This paper concerns the possibility of configuring and using the ETF to accelerate beam with an NLC pulse structure, as well as of doing experiments to measure beam-induced wakefields in the rf structures and their influence back on the beam.

  8. Automated Vision Test Development and Validation

    DTIC Science & Technology

    2016-11-01

    Deputy Chief, Aerosp Med Consultation Div Chair, Aerospace Medicine Department This report is published in the interest of...produce software for desktop displays; and to evaluate features such as user interfaces, threshold algorithms, validity of results, and screening...cost of performing full threshold testing on over 30% of normal subjects, which is quite time consuming. This effort was accomplished using desktop

  9. Validation of the Vanderbilt Holistic Face Processing Test.

    PubMed

    Wang, Chao-Chih; Ross, David A; Gauthier, Isabel; Richler, Jennifer J

    2016-01-01

    The Vanderbilt Holistic Face Processing Test (VHPT-F) is a new measure of holistic face processing with better psychometric properties relative to prior measures developed for group studies (Richler et al., 2014). In fields where psychologists study individual differences, validation studies are commonplace and the concurrent validity of a new measure is established by comparing it to an older measure with established validity. We follow this approach and test whether the VHPT-F measures the same construct as the composite task, which is a group-based measure at the center of the large literature on holistic face processing. In Experiment 1, we found a significant correlation between holistic processing measured in the VHPT-F and the composite task. Although this correlation was small, it was comparable to the correlation between holistic processing measured in the composite task with the same faces, but different target parts (top or bottom), which represents a reasonable upper limit for correlations between the composite task and another measure of holistic processing. These results confirm the validity of the VHPT-F by demonstrating shared variance with another measure of holistic processing based on the same operational definition. These results were replicated in Experiment 2, but only when the demographic profile of our sample matched that of Experiment 1.

  10. Validation of the Vanderbilt Holistic Face Processing Test

    PubMed Central

    Wang, Chao-Chih; Ross, David A.; Gauthier, Isabel; Richler, Jennifer J.

    2016-01-01

    The Vanderbilt Holistic Face Processing Test (VHPT-F) is a new measure of holistic face processing with better psychometric properties relative to prior measures developed for group studies (Richler et al., 2014). In fields where psychologists study individual differences, validation studies are commonplace and the concurrent validity of a new measure is established by comparing it to an older measure with established validity. We follow this approach and test whether the VHPT-F measures the same construct as the composite task, which is a group-based measure at the center of the large literature on holistic face processing. In Experiment 1, we found a significant correlation between holistic processing measured in the VHPT-F and the composite task. Although this correlation was small, it was comparable to the correlation between holistic processing measured in the composite task with the same faces, but different target parts (top or bottom), which represents a reasonable upper limit for correlations between the composite task and another measure of holistic processing. These results confirm the validity of the VHPT-F by demonstrating shared variance with another measure of holistic processing based on the same operational definition. These results were replicated in Experiment 2, but only when the demographic profile of our sample matched that of Experiment 1. PMID:27933014

  11. Reliability and validity of the closed kinetic chain upper extremity stability test.

    PubMed

    Lee, Dong-Rour; Kim, Laurentius Jongsoon

    2015-04-01

    [Purpose] The purpose of this study was to examine the reliability and validity of the Closed Kinetic Chain Upper Extremity Stability (CKCUES) test. [Subjects and Methods] A sample of 40 subjects (20 males, 20 females) with and without pain in the upper limbs was recruited. The subjects were tested twice, three days apart, to assess the reliability of the CKCUES test. The CKCUES test was performed four times, and the average was calculated using the data of the last 3 tests. To test the validity of the CKCUES test, peak torque of internal/external shoulder rotation was measured using an isokinetic dynamometer, maximum grip strength was measured using a hand dynamometer, and their Pearson correlation coefficients with the average values of the CKCUES test were calculated. [Results] The reliability of the CKCUES test was very high (ICC=0.97). The correlations between the CKCUES test and maximum grip strength (r=0.78-0.79) and the peak torque of internal/external shoulder rotation (r=0.87-0.94) were high, indicating its validity. [Conclusion] The reliability and validity of the CKCUES test were high. The CKCUES test is expected to be used for clinical tests of upper limb stability at low cost.
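
    The scoring and correlation steps described in this abstract can be sketched as follows. This is a minimal illustration, not the authors' analysis code: the function names `ckcues_score` and `pearson_r` are hypothetical, and the sketch assumes each subject's four trial counts are available as a list.

```python
from math import sqrt


def ckcues_score(trials):
    """Average the last three of four trials; the first trial is discarded."""
    if len(trials) != 4:
        raise ValueError("expected exactly four trials")
    return sum(trials[1:]) / 3


def pearson_r(x, y):
    """Pearson correlation coefficient between two equal-length samples."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two samples of equal length >= 2")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Sums of cross-products and squared deviations from the means.
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)
```

    In use, `ckcues_score` would be applied per subject, and `pearson_r` would then compare the resulting scores with a criterion measure such as grip strength.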

  12. Validity Tests of the Adolescent Domain Screening Inventory (ADSI) with Older Adolescents

    ERIC Educational Resources Information Center

    Corrigan, Matthew J.; Forte, James; Bulgaris, Sarah

    2017-01-01

    The purpose of this replication study is to test the validity of the Adolescent Domain Screening Inventory (ADSI) on an older adolescent population. This cross-sectional study used a convenience sample to preliminarily test the validity of the ADSI. Concurrent validity correlations ranged from a high of 0.924 to a low of 0.760. The known…

  13. Joint Test Protocol for Validation of Alternative Low-Emission Surface Preparation/Depainting Technologies for Structural Steel

    NASA Technical Reports Server (NTRS)

    Lewis, Pattie

    2005-01-01

    Headquarters National Aeronautics and Space Administration (NASA) chartered the Acquisition Pollution Prevention (AP2) Office to coordinate agency activities affecting pollution prevention issues identified during system and component acquisition and sustainment processes. The primary objectives of the AP2 Office are to: (1) Reduce or eliminate the use of hazardous materials (HazMats) or hazardous processes at manufacturing, remanufacturing, and sustainment locations. (2) Avoid duplication of effort in actions required to reduce or eliminate HazMats through joint center cooperation and technology sharing. This project will identify, evaluate and approve alternative surface preparation technologies for use at NASA and Air Force Space Command (AFSPC) installations. Materials and processes will be evaluated with the goal of selecting those processes that will improve corrosion protection at critical systems, facilitate easier maintenance activity, extend maintenance cycles, eliminate flight hardware contamination and reduce the amount of hazardous waste generated. This Joint Test Protocol (JTP) contains the critical requirements and tests necessary to qualify alternative Low-Emission Surface Preparation/Depainting Technologies for Structural Steel Applications. These tests were derived from engineering, performance, and operational impact (supportability) requirements defined by a consensus of NASA and Air Force Space Command (AFSPC) participants. The Field Test Plan (FTP), entitled Joint Test Protocol for Validation of Alternative Low Emission Surface Preparation/Depainting Technologies for Structural Steel, prepared by ITB, defines the field evaluation and testing requirements for validating alternative surface preparation/depainting technologies and supplements the JTP.

  14. Validity Theory: Reform Policies, Accountability Testing, and Consequences

    ERIC Educational Resources Information Center

    Chalhoub-Deville, Micheline

    2016-01-01

    Educational policies such as Race to the Top in the USA affirm a central role for testing systems in government-driven reform efforts. Such reform policies are often referred to as the global education reform movement (GERM). Changes observed with the GERM style of testing demand socially engaged validity theories that include consequential…

  15. Validation of Global EO Biophysical Products at JECAM Test Site in Ukraine

    NASA Astrophysics Data System (ADS)

    Skakun, Sergii; Kussul, Nataliia; Kravchenko, Oleksiy; Basarab, Ruslan; Ostapenko, Vadym; Yailymov, Bohdan; Shelestov, Andrii; Kolotii, Andrii; Mironov, Andrii

    acquired with a NIKON D70 camera. The images acquired during the field campaign are processed with the CAN-EYE software to derive LAI, FAPAR and FCOVER. The in situ biophysical values were used for producing LAI, FCOVER and FAPAR maps from optical satellite images, and provide cross-validation, and validation of global remote sensing products. The following satellite data were used: SPOT-4, RapidEye and Landsat-8. Inter-comparison of the derived products is performed. The paper presents an insight on the general methodology used within JECAM test site, the results achieved so far and challenges, and future planned activities. 1. Gallego, F.J., Kussul, N., Skakun, S., Kravchenko, O., Shelestov, A., Kussul, O. “Efficiency assessment of using satellite data for crop area estimation in Ukraine,” International Journal of Applied Earth Observation and Geoinformation, vol. 29, pp. 22-30, 2014. 2. Kogan, F., Kussul, N., Adamenko, T., Skakun, S., Kravchenko, O., Kryvobok, O., Shelestov, A., Kolotii, A., Kussul, O., Lavrenyuk, A., “Winter wheat yield forecasting in Ukraine based on Earth observation, meteorological data and biophysical models,” International Journal of Applied Earth Observation and Geoinformation, vol. 23, pp. 192-203, 2013.

  16. Validity of Integrity Tests for Predicting Drug and Alcohol Abuse

    DTIC Science & Technology

    1993-08-31

    Winkler and Sheridan (1989) found that employees who entered employee assistance programs for treating drug addiction were more likely to be absent...August 31, 1993 Final 4. TITLE AND SUBTITLE 5. FUNDING NUMBERS Validity of Integrity Tests for Predicting Drug and Alcohol Abuse C No. N00014-92-J...words) This research used psychometric meta-analysis (Hunter & Schmidt, 1990b) to examine the validity of integrity tests for predicting drug and

  17. Validation of a clinical critical thinking skills test in nursing.

    PubMed

    Shin, Sujin; Jung, Dukyoo; Kim, Sungeun

    2015-01-27

    The purpose of this study was to develop a revised version of the clinical critical thinking skills test (CCTS) and to subsequently validate its performance. This study is a secondary analysis of the CCTS. Data were obtained from a convenience sample of 284 college students in June 2011. Thirty items were analyzed using item response theory and test reliability was assessed. Test-retest reliability was measured using the results of 20 nursing college and graduate school students in July 2013. The content validity of the revised items was analyzed by calculating the degree of agreement between instrument developer intention in item development and the judgments of six experts. To analyze response process validity, qualitative data related to the response processes of nine nursing college students obtained through cognitive interviews were analyzed. Out of the initial 30 items, 11 items were excluded after the analysis of difficulty and discrimination parameters. When the 19 items of the revised version of the CCTS were analyzed, levels of item difficulty were found to be relatively low and levels of discrimination were found to be appropriate or high. The degree of agreement between item developer intention and expert judgments equaled or exceeded 50%. From the above results, evidence of the response process validity was demonstrated, indicating that subjects responded as intended by the test developer. The revised 19-item CCTS was found to have sufficient reliability and validity and will therefore represent a more convenient measurement of critical thinking ability.

  18. Development and Validity Testing of an Arthritis Self-Management Assessment Tool.

    PubMed

    Oh, HyunSoo; Han, SunYoung; Kim, SooHyun; Seo, WhaSook

    Because of the chronic, progressive nature of arthritis and the substantial effects it has on quality of life, patients may benefit from self-management. However, no valid, reliable self-management assessment tool has been devised for patients with arthritis. This study was conducted to develop a comprehensive self-management assessment tool for patients with arthritis, that is, the Arthritis Self-Management Assessment Tool (ASMAT). To develop a list of qualified items corresponding to the conceptual definitions and attributes of arthritis self-management, a measurement model was established on the basis of theoretical and empirical foundations. Content validity testing was conducted to evaluate whether listed items were suitable for assessing arthritis self-management. Construct validity and reliability of the ASMAT were tested. Construct validity was examined using confirmatory factor analysis and nomological validity. The 32-item ASMAT was developed with a sample composed of patients in a clinic in South Korea. Content validity testing validated the 32 items, which comprised medical (10 items), behavioral (13 items), and psychoemotional (9 items) management subscales. Construct validity testing of the ASMAT showed that the 32 items properly corresponded with conceptual constructs of arthritis self-management, and were suitable for assessing self-management ability in patients with arthritis. Reliability was also well supported. The ASMAT devised in the present study may aid the evaluation of patient self-management ability and the effectiveness of self-management interventions. The authors believe the developed tool may also aid the identification of problems associated with the adoption of self-management practice, and thus improve symptom management, independence, and quality of life of patients with arthritis.

  19. LADO as a Language Test: Issues of Validity

    ERIC Educational Resources Information Center

    McNamara, Tim; Van Den Hazelkamp, Carolien; Verrips, Maaike

    2016-01-01

    This article brings together the theoretical field of language testing and the practical field of language analysis for the determination of the origin of asylum seekers. It considers what it would mean to think of language analysis as a form of language test, subject to the same validity constraints, and proposes a research agenda.

  20. Two-Speed Gearbox Dynamic Simulation Predictions and Test Validation

    NASA Technical Reports Server (NTRS)

    Lewicki, David G.; DeSmidt, Hans; Smith, Edward C.; Bauman, Steven W.

    2010-01-01

    Dynamic simulations and experimental validation tests were performed on a two-stage, two-speed gearbox as part of the drive system research activities of the NASA Fundamental Aeronautics Subsonics Rotary Wing Project. The gearbox was driven by two electromagnetic motors and had two electromagnetic, multi-disk clutches to control output speed. A dynamic model of the system was created which included a direct current electric motor with proportional-integral-derivative (PID) speed control, a two-speed gearbox with dual electromagnetically actuated clutches, and an eddy current dynamometer. A six degree-of-freedom model of the gearbox accounted for the system torsional dynamics and included gear, clutch, shaft, and load inertias as well as shaft flexibilities and a dry clutch stick-slip friction model. Experimental validation tests were performed on the gearbox in the NASA Glenn gear noise test facility. Gearbox output speed and torque as well as drive motor speed and current were compared to those from the analytical predictions. The experiments correlate very well with the predictions, thus validating the dynamic simulation methodologies.

  1. K3EDTA Vacuum Tubes Validation for Routine Hematological Testing

    PubMed Central

    Lima-Oliveira, Gabriel; Lippi, Giuseppe; Salvagno, Gian Luca; Montagnana, Martina; Poli, Giovanni; Solero, Giovanni Pietro; Picheth, Geraldo; Guidi, Gian Cesare

    2012-01-01

    Background and Objective. Some in vitro diagnostic devices (e.g., blood collection vacuum tubes and syringes for blood analyses) are not validated before laboratory quality managers decide to start using them or to change the brand. Frequently, laboratory or hospital managers select the vacuum tubes for blood collection based on cost considerations or on the relevance of a brand. The aim of this study was to validate two dry K3EDTA vacuum tubes of different brands for routine hematological testing. Methods. Blood specimens from 100 volunteers were collected in two different K3EDTA vacuum tubes by a single, expert phlebotomist. The routine hematological testing was done on an Advia 2120i hematology system. The significance of the differences between samples was assessed by paired Student's t-test after checking for normality. The level of statistical significance was set at P < 0.05. Results and Conclusions. The tubes of different brands evaluated may represent a clinically relevant source of variation only for mean platelet volume (MPV) and platelet distribution width (PDW). Our validation will permit laboratory or hospital managers to select among the validated brands of vacuum tubes according to their technical or economic reasons for routine hematological tests. PMID:22888448
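
    The paired comparison named in the Methods can be illustrated with a short sketch of the paired Student's t statistic. This is a generic textbook calculation under the assumption that each volunteer contributes one value per tube brand; the function name `paired_t_statistic` is hypothetical and the study's own software is not stated in the abstract. The sketch returns only the statistic and its degrees of freedom; obtaining a P value additionally requires the t distribution (for example via `scipy.stats.ttest_rel`).

```python
from math import sqrt
from statistics import mean, stdev


def paired_t_statistic(tube_a, tube_b):
    """Return (t, degrees_of_freedom) for paired measurements.

    tube_a[i] and tube_b[i] must come from the same volunteer.
    """
    if len(tube_a) != len(tube_b) or len(tube_a) < 2:
        raise ValueError("need at least two paired measurements")
    diffs = [a - b for a, b in zip(tube_a, tube_b)]
    n = len(diffs)
    sd = stdev(diffs)  # sample standard deviation of the differences
    if sd == 0:
        raise ValueError("all differences are identical; t is undefined")
    # t = mean difference divided by its standard error
    return mean(diffs) / (sd / sqrt(n)), n - 1
```

    A normality check on the differences, as the abstract mentions, would precede this step in practice.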

  2. Embedded performance validity tests within the Hopkins Verbal Learning Test - Revised and the Brief Visuospatial Memory Test - Revised.

    PubMed

    Sawyer, R John; Testa, S Marc; Dux, Moira

    2017-01-01

    Various research studies and neuropsychology practice organizations have reiterated the importance of developing embedded performance validity tests (PVTs) to detect potentially invalid neurocognitive test data. This study investigated whether measures within the Hopkins Verbal Learning Test - Revised (HVLT-R) and the Brief Visuospatial Memory Test - Revised (BVMT-R) could accurately classify individuals who fail two or more PVTs during routine clinical assessment. The present sample of 109 United States military veterans (Mean age = 52.4, SD = 13.3) consisted entirely of clinically referred patients who received a battery of neuropsychological tests. Based on performance validity findings, veterans were assigned to valid (n = 86) or invalid (n = 23) groups. Of the 109 patients in the overall sample, 77 were administered the HVLT-R and 75 were administered the BVMT-R, which were examined for classification accuracy. The HVLT-R Recognition Discrimination Index and the BVMT-R Retention Percentage showed good to adequate discrimination with an area under the curve of .78 and .70, respectively. The HVLT-R Recognition Discrimination Index showed sensitivity of .53 with specificity of .93. The BVMT-R Retention Percentage demonstrated sensitivity of .31 with specificity of .92. When used in conjunction with other PVTs, these new embedded PVTs may be effective in the detection of invalid test data, although they are not intended for use in patients with dementia.
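
    For readers unfamiliar with the classification metrics reported here, the sketch below shows how sensitivity and specificity follow from a 2x2 table. It assumes the "invalid" group is treated as the positive class; the function name and the counts in the usage comment are hypothetical and are not the study's data.

```python
def sensitivity_specificity(true_pos, false_neg, true_neg, false_pos):
    """Compute (sensitivity, specificity) from confusion-matrix counts.

    Positive class = invalid performance flagged by the embedded measure.
    """
    if true_pos + false_neg == 0 or true_neg + false_pos == 0:
        raise ValueError("each group must contain at least one case")
    sensitivity = true_pos / (true_pos + false_neg)  # invalid cases detected
    specificity = true_neg / (true_neg + false_pos)  # valid cases passed
    return sensitivity, specificity


# Hypothetical counts for illustration only:
# 5 of 10 invalid cases flagged, 18 of 20 valid cases passed.
print(sensitivity_specificity(5, 5, 18, 2))
```

    A cutoff chosen for high specificity, as in the abstract, keeps false positives rare at the cost of missing some invalid performances.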

  3. The Challenge of Grounding Planning in Simulation with an Interactive Model Development Environment

    NASA Technical Reports Server (NTRS)

    Clement, Bradley J.; Frank, Jeremy D.; Chachere, John M.; Smith, Tristan B.; Swanson, Keith J.

    2011-01-01

    A principal obstacle to fielding automated planning systems is the difficulty of modeling. Physical systems are modeled conventionally based on specification documents and the modeler's understanding of the system. Thus, the model is developed in a way that is disconnected from the system's actual behavior and is vulnerable to manual error. Another obstacle to fielding planners is testing and validation. For a space mission, generated plans must be validated often by translating them into command sequences that are run in a simulation testbed. Testing in this way is complex and onerous because of the large number of possible plans and states of the spacecraft. Though, if used as a source of domain knowledge, the simulator can ease validation. This paper poses a challenge: to ground planning models in the system physics represented by simulation. A proposed, interactive model development environment illustrates the integration of planning and simulation to meet the challenge. This integration reveals research paths for automated model construction and validation.

  4. The test-retest reliability and criterion validity of a high-intensity, netball-specific circuit test: The Net-Test.

    PubMed

    Mungovan, Sean F; Peralta, Paula J; Gass, Gregory C; Scanlan, Aaron T

    2018-04-12

    To examine the test-retest reliability and criterion validity of a high-intensity, netball-specific fitness test. Repeated measures, within-subject design. Eighteen female netball players competing in an international competition completed a trial of the Net-Test, which consists of 14 timed netball-specific movements. Players also completed a series of netball-relevant criterion fitness tests. Ten players completed an additional Net-Test trial one week later to assess test-retest reliability using intraclass correlation coefficient (ICC), typical error of measurement (TEM), and coefficient of variation (CV). The typical error of estimate expressed as CV and Pearson correlations were calculated between each criterion test and Net-Test performance to assess criterion validity. Five movements during the Net-Test displayed moderate ICC (0.84-0.90) and two movements displayed high ICC (0.91-0.93). Seven movements and heart rate taken during the Net-Test held low CV (<5%) with values ranging from 1.7 to 9.5% across measures. Total time (41.63±2.05 s) during the Net-Test possessed low CV and significant (p<0.05) correlations with 10 m sprint time (1.98±0.12 s; CV=4.4%, r=0.72), 20 m sprint time (3.38±0.19 s; CV=3.9%, r=0.79), 505 Change-of-Direction time (2.47±0.08 s; CV=2.0%, r=0.80), and maximum oxygen uptake (46.59±2.58 mL·kg⁻¹·min⁻¹; CV=4.5%, r=-0.66). The Net-Test possesses acceptable reliability for the assessment of netball fitness. Further, the high criterion validity for the Net-Test suggests a range of important netball-specific fitness elements are assessed in combination. Copyright © 2018 Sports Medicine Australia. Published by Elsevier Ltd. All rights reserved.

  5. Validity and reliability of the Hawaii anaerobic run test.

    PubMed

    Kimura, Iris F; Stickley, Christopher D; Lentz, Melissa A; Wages, Jennifer J; Yanagi, Kazuhiko; Hetzler, Ronald K

    2014-05-01

    This study examined the reliability and validity of the Hawaii anaerobic run test (HART) by comparing anaerobic capacity measures obtained to those during the Wingate Anaerobic Test (WAnT). Ninety-six healthy physically active volunteers (age, 22.0 ± 2.8 years; height, 163.9 ± 9.5 cm; body mass, 70.6 ± 14.7 kg; body fat %, 19.29 ± 5.39%) participated in this study. Each participant performed 2 anaerobic capacity tests: the WAnT and the HART by random assignment on separate days. The reliability of the HART was calculated from 2 separate trials of the test and then determined through intraclass correlation coefficients (ICCs). Blood samples were collected, and lactate was analyzed both pretest and posttest for each of the 2 exercise modes. Heart rate and rate of perceived exertion were also measured pre- and post-exercise. Hawaii anaerobic run test peak and mean momentum were calculated as body mass times highest or average split velocity, respectively. Intraclass correlation coefficients between trials of the HART for peak and mean momentum were 0.98 and 0.99, respectively (SEM = 18.8 and 25.7, respectively). Validity of the HART was established through comparison of momentum on the HART with power on the WAnT. High correlations were found between peak power and peak momentum (r = 0.88), as well as mean power and mean momentum (r = 0.94). The HART was considered to be a reliable test of anaerobic power. The HART was also determined to be a valid test of anaerobic power when compared with the WAnT. When testing healthy college-aged individuals, the HART offers an easy and inexpensive alternative maximal effort anaerobic power test to other established tests.
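
    The momentum measures defined in this abstract (body mass times highest or average split velocity) can be sketched as below. The function name is hypothetical, and the sketch assumes split velocities have already been derived in m/s from the timed splits; the abstract does not give split distances, so none are assumed here.

```python
from statistics import mean


def hart_momentum(body_mass_kg, split_velocities_m_s):
    """Return (peak_momentum, mean_momentum) in kg*m/s.

    Peak momentum uses the fastest split; mean momentum uses the
    average velocity across all splits.
    """
    if not split_velocities_m_s:
        raise ValueError("at least one split velocity is required")
    peak = body_mass_kg * max(split_velocities_m_s)
    average = body_mass_kg * mean(split_velocities_m_s)
    return peak, average
```

    For example, a 70 kg participant with split velocities of 6.0, 5.0, and 4.0 m/s would have a peak momentum of 420 kg·m/s and a mean momentum of 350 kg·m/s.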

  6. The General Mission Analysis Tool (GMAT) System Test Plan

    NASA Technical Reports Server (NTRS)

    Conway, Darrel J.; Hughes, Steven P.

    2007-01-01

    This document serves as the System Test Approach for the GMAT Project. Preparation for system testing consists of three major stages: 1) The Test Approach sets the scope of system testing, the overall strategy to be adopted, the activities to be completed, the general resources required and the methods and processes to be used to test the release. It also details the activities, dependencies and effort required to conduct the System Test. 2) Test Planning details the activities, dependencies and effort required to conduct the System Test. 3) Test Cases documents the tests to be applied, the data to be processed, the automated testing coverage and the expected results. This document covers the first two of these items, and establishes the framework used for the GMAT test case development. The test cases themselves exist as separate components, and are managed outside of and concurrently with this System Test Plan.

  7. A broad scope knowledge based model for optimization of VMAT in esophageal cancer: validation and assessment of plan quality among different treatment centers.

    PubMed

    Fogliata, Antonella; Nicolini, Giorgia; Clivio, Alessandro; Vanetti, Eugenio; Laksar, Sarbani; Tozzi, Angelo; Scorsetti, Marta; Cozzi, Luca

    2015-10-31

    To evaluate the performance of a broad scope model-based optimisation process for volumetric modulated arc therapy applied to esophageal cancer. A set of 70 previously treated patients in two different institutions was selected to train a model for the prediction of dose-volume constraints. The model was built with a broad-scope purpose, aiming to be effective for different dose prescriptions and tumour localisations. It was validated on three groups of patients from the same institution and from another clinic not providing patients for the training phase. Comparison of the automated plans was done against reference cases given by the clinically accepted plans. Quantitative improvements (statistically significant for the majority of the analysed dose-volume parameters) were observed between the benchmark and the test plans. Of 624 dose-volume objectives assessed for plan evaluation, in 21 cases (3.3 %) the reference plans failed to respect the constraints while the model-based plans succeeded. Only in 3 cases (<0.5 %) did the reference plans pass the criteria while the model-based plans failed. In 5.3 % of the cases both groups of plans failed and in the remaining cases both passed the tests. Plans were optimised using a broad scope knowledge-based model to determine the dose-volume constraints. The results showed dosimetric improvements when compared to the benchmark data. In particular, the plans optimised for patients from the third centre, which did not participate in the training, were of superior quality. The data suggest that the new engine is reliable and could encourage its application to clinical practice.

  8. Supersonic Retropropulsion Test 1853 in NASA LaRC Unitary Plan Wind Tunnel Test Section 2

    NASA Technical Reports Server (NTRS)

    Berry, Scott A.; Rhode, Matthew N.

    2014-01-01

    A supersonic retropropulsion experiment was conducted in the Langley Research Center Unitary Plan Wind Tunnel Test Section 2 at Mach numbers of 2.4, 3.5, and 4.6. Intended as a code validation effort, this study used pretest computations to size and refine the model such that tunnel blockage and internal flow separations were minimized. A 5-in diameter 70-degree sphere-cone forebody, which can accommodate up to four 4:1 area ratio nozzles, followed by a 9.55-inch-long cylindrical aft body was selected for this test after computational maturation. The primary measurements for this experiment were high spatial-density surface pressures. In addition, high speed schlieren video and internal pressures and temperatures were acquired. The test included parametric variations in the number of nozzles utilized, thrust coefficients (roughly 0 to 4), and angles of attack (-8 to 20 degrees). The run matrix was developed to also allow quantification of various sources of experimental uncertainty, such as random errors due to run-to-run variations and systematic errors due to flowfield or model misalignments. To accommodate the uncertainty assessment, many runs and replicates were conducted with the model at various locations within the tunnel and with model roll angles of 0, 60, 120, and 180 degrees. This test report provides operational details of the experiment, contains a review of trends, and provides all schlieren and pressure results within appendices.

  9. Test-Retest Reliability and Predictive Validity of the Implicit Association Test in Children

    ERIC Educational Resources Information Center

    Rae, James R.; Olson, Kristina R.

    2018-01-01

    The Implicit Association Test (IAT) is increasingly used in developmental research despite minimal evidence of whether children's IAT scores are reliable across time or predictive of behavior. When test-retest reliability and predictive validity have been assessed, the results have been mixed, and because these studies have differed on many…

  10. Pump CFD code validation tests

    NASA Technical Reports Server (NTRS)

    Brozowski, L. A.

    1993-01-01

    Pump CFD code validation tests were accomplished by obtaining nonintrusive flow characteristic data at key locations in generic current liquid rocket engine turbopump configurations. Data were obtained with a laser two-focus (L2F) velocimeter at scaled design flow. Three components were surveyed: a 1970's-designed impeller, a 1990's-designed impeller, and a four-bladed unshrouded inducer. Two-dimensional velocities were measured upstream and downstream of the two impellers. Three-dimensional velocities were measured upstream, downstream, and within the blade row of the unshrouded inducer.

  11. Cross-Validation of the Computerized Adaptive Screening Test (CAST).

    ERIC Educational Resources Information Center

    Pliske, Rebecca M.; And Others

    The Computerized Adaptive Screening Test (CAST) was developed to provide an estimate at recruiting stations of prospects' Armed Forces Qualification Test (AFQT) scores. The CAST was designed to replace the paper-and-pencil Enlistment Screening Test (EST). The initial validation study of CAST indicated that CAST predicts AFQT at least as accurately…

  12. The validity of upper-limb neurodynamic tests for detecting peripheral neuropathic pain.

    PubMed

    Nee, Robert J; Jull, Gwendolen A; Vicenzino, Bill; Coppieters, Michel W

    2012-05-01

    The validity of upper-limb neurodynamic tests (ULNTs) for detecting peripheral neuropathic pain (PNP) was assessed by reviewing the evidence on plausibility, the definition of a positive test, reliability, and concurrent validity. Evidence was identified by a structured search for peer-reviewed articles published in English before May 2011. The quality of concurrent validity studies was assessed with the Quality Assessment of Diagnostic Accuracy Studies tool, where appropriate. Biomechanical and experimental pain data support the plausibility of ULNTs. Evidence suggests that a positive ULNT should at least partially reproduce the patient's symptoms and that structural differentiation should change these symptoms. Data indicate that this definition of a positive ULNT is reliable when used clinically. Limited evidence suggests that the median nerve test, but not the radial nerve test, helps determine whether a patient has cervical radiculopathy. The median nerve test does not help diagnose carpal tunnel syndrome. These findings should be interpreted cautiously, because diagnostic accuracy might have been distorted by the investigators' definitions of a positive ULNT. Furthermore, patients with PNP who presented with increased nerve mechanosensitivity rather than conduction loss might have been incorrectly classified by electrophysiological reference standards as not having PNP. The only evidence for concurrent validity of the ulnar nerve test was a case study on cubital tunnel syndrome. We recommend that researchers develop more comprehensive reference standards for PNP to accurately assess the concurrent validity of ULNTs and continue investigating the predictive validity of ULNTs for prognosis or treatment response.

  13. Translation, Cultural Adaptation and Validation of the Simple Shoulder Test to Spanish

    PubMed Central

    Arcuri, Francisco; Barclay, Fernando; Nacul, Ivan

    2015-01-01

    Background: The validation of widely used scales facilitates the comparison across international patient samples. Objective: The objective was to translate, culturally adapt and validate the Simple Shoulder Test into Argentinian Spanish. Methods: The Simple Shoulder Test was translated from English into Argentinian Spanish by two independent translators, translated back into English and evaluated for accuracy by an expert committee to correct the possible discrepancies. It was then administered to 50 patients with different shoulder conditions. Psychometric properties were analyzed, including internal consistency, measured with Cronbach's alpha, and test-retest reliability at 15 days, measured with the intraclass correlation coefficient. Results: The internal consistency was an alpha of 0.808, evaluated as good. The test-retest reliability index as measured by intraclass correlation coefficient (ICC) was 0.835, evaluated as excellent. Conclusion: The Simple Shoulder Test translation and its cultural adaptation to Argentinian Spanish demonstrated adequate internal reliability and validity, ultimately allowing for its use in the comparison with international patient samples.

  14. Validation of a deformable MRI to CT registration algorithm employing same day planning MRI for surrogate analysis.

    PubMed

    Padgett, Kyle R; Stoyanova, Radka; Pirozzi, Sara; Johnson, Perry; Piper, Jon; Dogan, Nesrin; Pollack, Alan

    2018-03-01

    Validating deformable multimodality image registrations is challenging due to intrinsic differences in signal characteristics and their spatial intensity distributions. Evaluating multimodality registrations using these spatial intensity distributions is also complicated by the fact that these metrics are often employed in the registration optimization process. This work evaluates rigid and deformable image registrations of the prostate in between diagnostic-MRI and radiation treatment planning-CT by utilizing a planning-MRI after fiducial marker placement as a surrogate. The surrogate allows for the direct quantitative analysis that can be difficult in the multimodality domain. For thirteen prostate patients, T2 images were acquired at two different time points, the first several weeks prior to planning (diagnostic-MRI) and the second on the same day as the planning-CT (planning-MRI). The diagnostic-MRI was deformed to the planning-CT utilizing a commercially available algorithm which synthesizes a deformable image registration (DIR) algorithm from local rigid registrations. The planning-MRI provided an independent surrogate for the planning-CT for assessing registration accuracy using image similarity metrics, including Pearson correlation and normalized mutual information (NMI). A local analysis was performed by looking only within the prostate, proximal seminal vesicles, penile bulb, and combined areas. The planning-MRI provided an excellent surrogate for the planning-CT with residual error in fiducial alignment between the two datasets being submillimeter, 0.78 mm. DIR was superior to the rigid registration in 11 of 13 cases demonstrating a 27.37% improvement in NMI (P < 0.009) within a regional area surrounding the prostate and associated critical organs. Pearson correlations showed similar results, demonstrating a 13.02% improvement (P < 0.013). By utilizing the planning-MRI as a surrogate for the planning-CT, an independent evaluation of registration

  15. 13. Photographic copy of site plan displaying Test Stand 'C' ...

    Library of Congress Historic Buildings Survey, Historic Engineering Record, Historic Landscapes Survey

    13. Photographic copy of site plan displaying Test Stand 'C' (4217/E-18), Test Stand 'D' (4223/E-24), and Control and Recording Center (4221/E-22) with ancillary structures, and connecting roads and services. California Institute of Technology, Jet Propulsion Laboratory, Facilities Engineering and Construction Office 'Repairs to Test Stand 'C,' Edwards Test Station, Legend & Site Plan M-1,' drawing no. ESP/115, August 14, 1987. - Jet Propulsion Laboratory Edwards Facility, Test Stand C, Edwards Air Force Base, Boron, Kern County, CA

  16. Physical performance tests after stroke: reliability and validity.

    PubMed

    Maeda, A; Yuasa, T; Nakamura, K; Higuchi, S; Motohashi, Y

    2000-01-01

    To evaluate the reliability and validity of the modified physical performance tests for stroke survivors who live in a community. The subjects included 40 stroke survivors and 40 apparently healthy independent elderly persons. The physical performance tests for the stroke survivors comprised two physical capacity evaluation tasks that represented physical abilities necessary to perform the main activities of daily living, e.g., standing-up ability (time needed to stand up from bed rest) and walking ability (time needed to walk 10 m). Regarding the reliability of tests, significant correlations were confirmed between test and retest of physical performance tests with both short and long intervals in individuals after stroke. Regarding the validity of tests, the authors studied the significant correlations between the maximum isometric strength of the quadriceps muscle and the time needed to walk 10 m, centimeters reached while sitting and reaching, and the time needed to stand up from bed rest. The authors confirmed that there were significant correlations between the instrumental activity of daily living and the time needed to stand up from bed rest, along with the time needed to walk 10 m for the stroke survivors. These physical performance tests are useful guides for evaluating a level of activity of daily living and physical frailty of stroke survivors living in a community.

  17. Validation of the Narrowing Beam Walking Test in Lower Limb Prosthesis Users.

    PubMed

    Sawers, Andrew; Hafner, Brian

    2018-04-11

    To evaluate the content, construct, and discriminant validity of the Narrowing Beam Walking Test (NBWT), a performance-based balance test for lower limb prosthesis users. Cross-sectional study. Research laboratory and prosthetics clinic. Unilateral transtibial and transfemoral prosthesis users (N=40). Not applicable. Content validity was examined by quantifying the percentage of participants receiving maximum or minimum scores (ie, ceiling and floor effects). Convergent construct validity was examined using correlations between participants' NBWT scores and scores or times on existing clinical balance tests regularly administered to lower limb prosthesis users. Known-groups construct validity was examined by comparing NBWT scores between groups of participants with different fall histories, amputation levels, amputation etiologies, and functional levels. Discriminant validity was evaluated by analyzing the area under each test's receiver operating characteristic (ROC) curve. No minimum or maximum scores were recorded on the NBWT. NBWT scores demonstrated strong correlations (ρ=.70‒.85) with scores/times on performance-based balance tests (timed Up and Go test, Four Square Step Test, and Berg Balance Scale) and a moderate correlation (ρ=.49) with the self-report Activities-specific Balance Confidence scale. NBWT performance was significantly lower among participants with a history of falls (P=.003), transfemoral amputation (P=.011), and a lower mobility level (P<.001). The NBWT also had the largest area under the ROC curve (.81) and was the only test to exhibit an area that was statistically significantly >.50 (ie, chance). The results provide strong evidence of content, construct, and discriminant validity for the NBWT as a performance-based test of balance ability. The evidence supports its use to assess balance impairments and fall risk in unilateral transtibial and transfemoral prosthesis users. Copyright © 2018 American Congress of Rehabilitation Medicine
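
    The area under the ROC curve reported above can be read as the probability that a randomly chosen member of one group scores higher than a randomly chosen member of the other, with .50 meaning chance. The following is a minimal sketch of that pairwise interpretation, not the authors' analysis; the function name and the scores are hypothetical and are not data from the study.

```python
def roc_auc(pos_scores, neg_scores):
    """AUC as the share of (positive, negative) pairs ranked correctly.

    Ties count as half a correct pair. Both arguments are hypothetical
    lists of test scores for the two groups being discriminated.
    """
    wins = 0.0
    for p in pos_scores:
        for n in neg_scores:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos_scores) * len(neg_scores))


# Illustrative (made-up) scores: perfectly separated groups give 1.0,
# identical groups give 0.5 (chance).
print(roc_auc([3, 4], [1, 2]))
print(roc_auc([1, 2], [1, 2]))
```

    A test whose area is not statistically distinguishable from .50 does no better than chance at separating the two groups, which is the comparison the abstract describes.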

  18. Validation of a clinical critical thinking skills test in nursing

    PubMed Central

    2015-01-01

    Purpose: The purpose of this study was to develop a revised version of the clinical critical thinking skills test (CCTS) and to subsequently validate its performance. Methods: This study is a secondary analysis of the CCTS. Data were obtained from a convenience sample of 284 college students in June 2011. Thirty items were analyzed using item response theory and test reliability was assessed. Test-retest reliability was measured using the results of 20 nursing college and graduate school students in July 2013. The content validity of the revised items was analyzed by calculating the degree of agreement between instrument developer intention in item development and the judgments of six experts. To analyze response process validity, qualitative data related to the response processes of nine nursing college students obtained through cognitive interviews were analyzed. Results: Out of the initial 30 items, 11 items were excluded after the analysis of difficulty and discrimination parameters. When the 19 items of the revised version of the CCTS were analyzed, levels of item difficulty were found to be relatively low and levels of discrimination were found to be appropriate or high. The degree of agreement between item developer intention and expert judgments equaled or exceeded 50%. Conclusion: From the above results, evidence of response process validity was demonstrated, indicating that subjects responded as intended by the test developer. The revised 19-item CCTS was found to have sufficient reliability and validity and therefore represents a more convenient measurement of critical thinking ability. PMID:25622716

  19. Halogen occultation experiment integrated test plan

    NASA Technical Reports Server (NTRS)

    Mauldin, L. E., III; Butterfield, A. J.

    1986-01-01

    The test program plan is presented for the Halogen Occultation Experiment (HALOE) instrument, which is being developed in-house at the Langley Research Center for the Upper Atmosphere Research Satellite (UARS). This comprehensive test program was developed to demonstrate that the HALOE instrument meets its performance requirements and maintains integrity through UARS flight environments. Each component, subsystem, and system level test is described in sufficient detail to allow development of the necessary test setups and test procedures. Additionally, the management system for implementing this test program is given. The HALOE instrument is a gas correlation radiometer that measures the vertical distribution of eight upper atmospheric constituents: O3, HCl, HF, NO, CH4, H2O, NO2, and CO2.

  20. Validation of Alternative In Vitro Methods to Animal Testing: Concepts, Challenges, Processes and Tools.

    PubMed

    Griesinger, Claudius; Desprez, Bertrand; Coecke, Sandra; Casey, Warren; Zuang, Valérie

    This chapter explores the concepts, processes, tools and challenges relating to the validation of alternative methods for toxicity and safety testing. In general terms, validation is the process of assessing the appropriateness and usefulness of a tool for its intended purpose. Validation is routinely used in various contexts in science, technology, the manufacturing and services sectors. It serves to assess the fitness-for-purpose of devices, systems, software up to entire methodologies. In the area of toxicity testing, validation plays an indispensable role: "alternative approaches" are increasingly replacing animal models as predictive tools and it needs to be demonstrated that these novel methods are fit for purpose. Alternative approaches include in vitro test methods, non-testing approaches such as predictive computer models up to entire testing and assessment strategies composed of method suites, data sources and decision-aiding tools. Data generated with alternative approaches are ultimately used for decision-making on public health and the protection of the environment. It is therefore essential that the underlying methods and methodologies are thoroughly characterised, assessed and transparently documented through validation studies involving impartial actors. Importantly, validation serves as a filter to ensure that only test methods able to produce data that help to address legislative requirements (e.g. EU's REACH legislation) are accepted as official testing tools and, owing to the globalisation of markets, recognised on international level (e.g. through inclusion in OECD test guidelines). Since validation creates a credible and transparent evidence base on test methods, it provides a quality stamp, supporting companies developing and marketing alternative methods and creating considerable business opportunities. Validation of alternative methods is conducted through scientific studies assessing two key hypotheses, reliability and relevance of the

  1. Implementation and Initial Validation of the APS English Test [and] The APS English-Writing Test at Golden West College: Evidence for Predictive Validity.

    ERIC Educational Resources Information Center

    Isonio, Steven

    In May 1991, Golden West College (California) conducted a validation study of the English portion of the Assessment and Placement Services for Community Colleges (APS), followed by a predictive validity study in July 1991. The initial study was designed to aid in the implementation of the new test at GWC by comparing data on APS use at other…

  2. Construct Validity of Physical Fitness Tests

    DTIC Science & Technology

    2011-02-03

    Medicine and Science in Sports and Exercise , 21, 319-324. *Fleishman, E. A. (1964). The structure and measurement of physical fitness. Englewood Cliffs...Quarterly for Exercise and Sport, 64, 256-273. *McCloy, E. (1935). Factor analysis methods in the measurement of physical abilities. Research Quarterly...Research Quarterly, 34, 525. Physical Fitness Test Validity 23 Powers, S. K., & Howley, E. T. (1990). Exercise physiology: Theory and application to

  3. 30 CFR 282.23 - Testing Plan.

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    ... Resources BUREAU OF OCEAN ENERGY MANAGEMENT, REGULATION, AND ENFORCEMENT, DEPARTMENT OF THE INTERIOR... lessee needs more information to develop a detailed Mining Plan than is obtainable under an approved... techniques or technology or mining equipment, or to determine environmental effects by a pilot test mining...

  4. Optical dosimetry probes to validate Monte Carlo and empirical-method-based NIR dose planning in the brain.

    PubMed

    Verleker, Akshay Prabhu; Shaffer, Michael; Fang, Qianqian; Choi, Mi-Ran; Clare, Susan; Stantz, Keith M

    2016-12-01

    A three-dimensional photon dosimetry in tissues is critical in designing optical therapeutic protocols to trigger light-activated drug release. The objective of this study is to investigate the feasibility of a Monte Carlo-based optical therapy planning software by developing dosimetry tools to characterize and cross-validate the local photon fluence in brain tissue, as part of a long-term strategy to quantify the effects of photoactivated drug release in brain tumors. An existing GPU-based 3D Monte Carlo (MC) code was modified to simulate near-infrared photon transport with differing laser beam profiles within phantoms of skull bone (B), white matter (WM), and gray matter (GM). A novel titanium-based optical dosimetry probe with isotropic acceptance was used to validate the local photon fluence, and an empirical model of photon transport was developed to significantly decrease execution time for clinical application. Comparisons between the MC and the dosimetry probe measurements were on an average 11.27%, 13.25%, and 11.81% along the illumination beam axis, and 9.4%, 12.06%, 8.91% perpendicular to the beam axis for WM, GM, and B phantoms, respectively. For a heterogeneous head phantom, the measured % errors were 17.71% and 18.04% along and perpendicular to beam axis. The empirical algorithm was validated by probe measurements and matched the MC results (R20.99), with average % error of 10.1%, 45.2%, and 22.1% relative to probe measurements, and 22.6%, 35.8%, and 21.9% relative to the MC, for WM, GM, and B phantoms, respectively. The simulation time for the empirical model was 6 s versus 8 h for the GPU-based Monte Carlo for a head phantom simulation. These tools provide the capability to develop and optimize treatment plans for optimal release of pharmaceuticals in the treatment of cancer. Future work will test and validate these novel delivery and release mechanisms in vivo.

  5. Development of an Agility Test for Badminton Players and Assessment of Its Validity and Test-Retest Reliability.

    PubMed

    Loureiro, Luiz de França Bahia; de Freitas, Paulo Barbosa

    2016-04-01

    Badminton requires open and fast actions toward the shuttlecock, but there is no specific agility test for badminton players with specific movements. To develop an agility test that simultaneously assesses perception and motor capacity and examine the test's concurrent and construct validity and its test-retest reliability. The Badcamp agility test consists of running as fast as possible to 6 targets placed on the corners and middle points of a rectangular area (5.6 × 4.2 m) from the start position located in the center of it, following visual stimuli presented in a luminous panel. The authors recruited 43 badminton players (17-32 y old) to evaluate concurrent (with shuttle-run agility test--SRAT) and construct validity and test-retest reliability. Results revealed that Badcamp presents concurrent and construct validity, as its performance is strongly related to SRAT (ρ = 0.83, P < .001), with performance of experts being better than nonexpert players (P < .01). In addition, Badcamp is reliable, as no difference (P = .07) and a high intraclass correlation (ICC = .93) were found in the performance of the players on 2 different occasions. The findings indicate that Badcamp is an effective, valid, and reliable tool to measure agility, allowing coaches and athletic trainers to evaluate players' athletic condition and training effectiveness and possibly detect talented individuals in this sport.

  6. 49 CFR 40.89 - What is validity testing, and are laboratories required to conduct it?

    Code of Federal Regulations, 2013 CFR

    2013-10-01

    ... PROCEDURES FOR TRANSPORTATION WORKPLACE DRUG AND ALCOHOL TESTING PROGRAMS Drug Testing Laboratories § 40.89 What is validity testing, and are laboratories required to conduct it? (a) Specimen validity testing is... 49 Transportation 1 2013-10-01 2013-10-01 false What is validity testing, and are laboratories...

  7. 49 CFR 40.89 - What is validity testing, and are laboratories required to conduct it?

    Code of Federal Regulations, 2011 CFR

    2011-10-01

    ... PROCEDURES FOR TRANSPORTATION WORKPLACE DRUG AND ALCOHOL TESTING PROGRAMS Drug Testing Laboratories § 40.89 What is validity testing, and are laboratories required to conduct it? (a) Specimen validity testing is... 49 Transportation 1 2011-10-01 2011-10-01 false What is validity testing, and are laboratories...

  8. 49 CFR 40.89 - What is validity testing, and are laboratories required to conduct it?

    Code of Federal Regulations, 2010 CFR

    2010-10-01

    ... PROCEDURES FOR TRANSPORTATION WORKPLACE DRUG AND ALCOHOL TESTING PROGRAMS Drug Testing Laboratories § 40.89 What is validity testing, and are laboratories required to conduct it? (a) Specimen validity testing is... 49 Transportation 1 2010-10-01 2010-10-01 false What is validity testing, and are laboratories...

  9. 49 CFR 40.89 - What is validity testing, and are laboratories required to conduct it?

    Code of Federal Regulations, 2012 CFR

    2012-10-01

    ... PROCEDURES FOR TRANSPORTATION WORKPLACE DRUG AND ALCOHOL TESTING PROGRAMS Drug Testing Laboratories § 40.89 What is validity testing, and are laboratories required to conduct it? (a) Specimen validity testing is... 49 Transportation 1 2012-10-01 2012-10-01 false What is validity testing, and are laboratories...

  10. 49 CFR 40.89 - What is validity testing, and are laboratories required to conduct it?

    Code of Federal Regulations, 2014 CFR

    2014-10-01

    ... PROCEDURES FOR TRANSPORTATION WORKPLACE DRUG AND ALCOHOL TESTING PROGRAMS Drug Testing Laboratories § 40.89 What is validity testing, and are laboratories required to conduct it? (a) Specimen validity testing is... 49 Transportation 1 2014-10-01 2014-10-01 false What is validity testing, and are laboratories...

  11. NASA's Plan for SDLS Testing

    NASA Technical Reports Server (NTRS)

    Bailey, Brandon

    2015-01-01

    The Space Data Link Security (SDLS) Protocol is a Consultative Committee for Space Data Systems (CCSDS) standard which extends the known Data Link protocols to secure data being sent over a space link by providing confidentiality and integrity services. This plan outlines the approach taken by the National Aeronautics and Space Administration (NASA) in testing the SDLS protocol using a prototype based on an existing NASA missions simulator.

  12. The influence of various test plans on mission reliability. [for Shuttle Spacelab payloads]

    NASA Technical Reports Server (NTRS)

    Stahle, C. V.; Gongloff, H. R.; Young, J. P.; Keegan, W. B.

    1977-01-01

    Methods have been developed for the evaluation of cost effective vibroacoustic test plans for Shuttle Spacelab payloads. The shock and vibration environments of components have been statistically represented, and statistical decision theory has been used to evaluate the cost effectiveness of five basic test plans with structural test options for two of the plans. Component, subassembly, and payload testing have been performed for each plan along with calculations of optimum test levels and expected costs. The tests have been ranked according to both minimizing expected project costs and vibroacoustic reliability. It was found that optimum costs may vary up to $6 million with the lowest plan eliminating component testing and maintaining flight vibration reliability via subassembly tests at high acoustic levels.

  13. Validity of the Butcher Treatment Planning Inventory as a Measure of Negative Treatment Attitudes

    ERIC Educational Resources Information Center

    Hatchett, Gregory T.

    2007-01-01

    This study evaluated the validity of the Butcher Treatment Planning Inventory (BTPI) as a measure of negative expectations and attitudes toward counseling. Undergraduate students completed the BTPI, the Attitudes Toward Seeking Professional Psychological Help Scale-Abbreviated Version, and the Expectations About Counseling-Brief Form during one…

  14. 78 FR 65583 - Capital Planning and Stress Testing

    Federal Register 2010, 2011, 2012, 2013, 2014

    2013-11-01

    ... comment. SUMMARY: NCUA proposes to conduct annual stress tests of federally insured credit unions (FICUs... Protection Act (the Dodd-Frank Act), requiring their supervised institutions to conduct annual stress tests... the credit union in its capital plans. Credit unions must also test the impact of interest rate shocks...

  15. Shuttle/Agena study. Volume 2, part 3: Preliminary test plans

    NASA Technical Reports Server (NTRS)

    1972-01-01

    Proposed testing for the Agena tug program is based upon best estimates of shuttle and Agena tug requirements and upon the Agena configuration currently envisioned to meet these requirements. The proposed tests are presented in development, qualification, system, and launch base test plans. These plans are based upon generalized requirements and assumed situations. The limitations of this study precluded all but minimal consideration of related shuttle orbiter and shuttle ground systems. The test plans include provisions for all testing from major component to systems level, identified as necessary to aid in confirmation of the modified Agena configuration for the space tug; considerations that crew safety requirements and new environmental conditions from shuttle interface effects do impose some new Agena testing requirements; considerations that many existing Agena flight-qualified components will be utilized and qualification testing will be minimal; testing not only for the Agena tug but also for new or modified items of handling or servicing equipment for supporting the Agena factory-to-launch sequence; and the assembly of required testing into a sequence-ordered series of events.

  16. Validation of the Arabic Version of the Internet Gaming Disorder-20 Test.

    PubMed

    Hawi, Nazir S; Samaha, Maya

    2017-04-01

    In recent years, researchers have been trying to shed light on gaming addiction and its association with different psychiatric disorders and psychological determinants. The latest edition of the American Psychiatric Association's Diagnostic and Statistical Manual of Mental Disorders, Fifth Edition (DSM-5) included Internet Gaming Disorder (IGD) in its Section 3 as a condition for further empirical study and proposed nine criteria for the diagnosis of IGD. The 20-item Internet Gaming Disorder (IGD-20) Test was developed as a valid and reliable tool to assess gaming addiction based on the nine criteria set by the DSM-5. The aim of this study is to validate an Arabic version of the IGD-20 Test. The Arabic version of IGD-20 will not only help in identifying Arabic-speaking pathological gamers but also stimulate cross-cultural studies that could contribute to an area in need of more research for insight and treatment. After a process of translation and back-translation and with the participation of a sizable sample of Arabic-speaking adolescents, the present study conducted a psychometric validation of the IGD-20 Test. Our confirmatory factor analysis showed the validity of the Arabic version of the IGD-20 Test. The one-factor model of the Arabic IGD-20 Test had very good psychometric properties, and it fitted the sample data extremely well. In addition, correlation analysis between the IGD-20 Test and the daily duration of weekday and weekend gameplay revealed significant positive relationships that warranted a criterion-related validation. Thus, the Arabic version of the IGD-20 Test is a valid and reliable measure of IGD among Arabic-speaking populations.

  17. Test plan for performance testing of the Eaton AC-3 electric vehicle

    NASA Astrophysics Data System (ADS)

    Crumley, R.; Heiselmann, H. W.

    1985-04-01

    An alternating current (ac) propulsion system for an electric vehicle was developed and tested. The test bed vehicle is a modified 1981 Mercury Lynx. The test plan was prepared specifically for the third modification to this test bed and identified as the Eaton AC-3. The scope of the testing done on the Eaton AC-3 includes coastdown and dynamometer tests but does not include environmental, on-road, or track testing. Coastdown testing is performed in accordance with SAE J-1263 (SAE Recommended Practice for Road Load Measurement and Dynamometer Simulation Using Coastdown Techniques).

  18. A Human Proximity Operations System test case validation approach

    NASA Astrophysics Data System (ADS)

    Huber, Justin; Straub, Jeremy

    A Human Proximity Operations System (HPOS) poses numerous risks in a real-world environment. These risks range from mundane tasks such as avoiding walls and fixed obstacles to the critical need to keep people and processes safe in the context of the HPOS's situation-specific decision making. Validating the performance of an HPOS, which must operate in a real-world environment, is an ill-posed problem due to the complexity that is introduced by erratic (non-computer) actors. In order to prove the HPOS's usefulness, test cases must be generated to simulate possible actions of these actors, so the HPOS can be shown to be able to perform safely in environments where it will be operated. The HPOS must demonstrate its ability to be as safe as a human, across a wide range of foreseeable circumstances. This paper evaluates the use of test cases to validate HPOS performance and utility. It considers an HPOS's safe performance in the context of a common human activity, moving through a crowded corridor, and extrapolates (based on this) to the suitability of using test cases for AI validation in other areas of prospective application.

  19. Validity and Reliability Testing of an e-learning Questionnaire for Chemistry Instruction

    NASA Astrophysics Data System (ADS)

    Guspatni, G.; Kurniawati, Y.

    2018-04-01

    The aim of this paper is to examine the validity and reliability of a questionnaire used to evaluate e-learning implementation in chemistry instruction. 48 questionnaires were filled in by students who had studied chemistry through an e-learning system. The questionnaire consisted of 20 indicators evaluating students’ perception of using e-learning. Parametric testing was done as the data were assumed to follow a normal distribution. Item validity of the questionnaire was examined through item-total correlation using Pearson’s formula, while its reliability was assessed with Cronbach’s alpha formula. Moreover, convergent validity was assessed to see whether the indicators building a factor had theoretically the same underlying construct. The validity testing revealed 19 valid indicators, while the reliability testing revealed a Cronbach’s alpha value of .886. The factor analysis showed that the questionnaire consisted of five factors, and each of them had indicators building the same construct. This article shows the importance of factor analysis for obtaining a construct-valid questionnaire before it is used as a research instrument.
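
    The two statistics named in this abstract can be illustrated with a short sketch. It computes item-total Pearson correlations and Cronbach's alpha for a small response matrix. The data, the function names, and the use of the uncorrected item-total correlation (each item is correlated with a total that includes it) are assumptions made for illustration; none of them come from the study.

```python
from statistics import mean, variance


def pearson(x, y):
    """Pearson product-moment correlation between two equal-length sequences."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5


def item_total_correlations(rows):
    """Correlate each item (column) with the respondents' total scores."""
    totals = [sum(r) for r in rows]
    return [pearson(list(col), totals) for col in zip(*rows)]


def cronbach_alpha(rows):
    """Cronbach's alpha: k/(k-1) * (1 - sum of item variances / variance of totals)."""
    k = len(rows[0])
    item_var = sum(variance(col) for col in zip(*rows))
    total_var = variance([sum(r) for r in rows])
    return k / (k - 1) * (1 - item_var / total_var)


# Hypothetical responses: one row per respondent, one column per indicator.
responses = [
    [4, 5, 4, 2],
    [3, 4, 3, 4],
    [5, 5, 4, 1],
    [2, 3, 2, 3],
    [4, 4, 5, 2],
]

print([round(r, 3) for r in item_total_correlations(responses)])
print(round(cronbach_alpha(responses), 3))
```

    In this made-up data the fourth indicator runs against the other three, so it would be the natural candidate for removal under an item-total criterion; the cut-off used to declare an indicator valid is a study-specific choice that the abstract does not state.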

  20. Establishing the Test-Retest Reliability & Concurrent Validity for the Repeat Ice Skating Test (RIST) in Adolescent Male Ice Hockey Players

    ERIC Educational Resources Information Center

    Power, Allan; Faught, Brent E.; Przysucha, Eryk; McPherson, Moira; Montelpare, William

    2012-01-01

    In this study the authors examine the test-retest reliability and concurrent validity of the Repeat Ice Skating Test (RIST). This was an on-ice field anaerobic test that measured average peak power and was validated with 3 anaerobic lab tests: (a) vertical jump, (b) the Margaria-Kalamen stair test, and (c) the Wingate Anaerobic Test. The…

  1. Construct validity of the Health Science Reasoning Test.

    PubMed

    Huhn, Karen; Black, Lisa; Jensen, Gail M; Deutsch, Judith E

    2011-01-01

    The aim of this study was to evaluate the construct validity of the Health Science Reasoning Test (HSRT) by determining if the test could discriminate between expert and novice physical therapists' critical-thinking skills. Experts identified from a random list of certified clinical specialists and students in the first year of their physical therapy education from two physical therapy programs completed the HSRT. Experts (n = 73) had a higher total HSRT score (mean 24.06, SD 3.92) than the novices (n = 79) (mean 22.49, SD 3.2), with the difference being statistically significant t (148) = 2.67, p = 0.008. The HSRT total score discriminated between expert and novice critical-thinking skills, therefore establishing construct validity. To our knowledge, this is the first study to compare expert and novice performance on a standardized test. The opportunity to have a tool that provides evidence of students' critical thinking skills could be helpful for educators and students. The test results could aid in identifying areas of students' strengths and weaknesses, thereby enabling targeted remediation to improve critical thinking skills, which are key factors in clinical reasoning, a necessary skill for effective physical therapy practice.

  2. Validity of an Interactive Functional Reach Test.

    PubMed

    Galen, Sujay S; Pardo, Vicky; Wyatt, Douglas; Diamond, Andrew; Brodith, Victor; Pavlov, Alex

    2015-08-01

    Videogaming platforms such as the Microsoft (Redmond, WA) Kinect® are increasingly being used in rehabilitation to improve balance performance and mobility. These gaming platforms do not have built-in clinical measures that offer clinically meaningful data. We have now developed software that will enable the Kinect sensor to assess a patient's balance using an interactive functional reach test (I-FRT). The aim of the study was to test the concurrent validity of the I-FRT and to establish the feasibility of implementing the I-FRT in a clinical setting. The concurrent validity of the I-FRT was tested among 20 healthy adults (mean age, 25.8±3.4 years; 14 women). The Functional Reach Test (FRT) was measured simultaneously by both the Kinect sensor using the I-FRT software and the Optotrak Certus® 3D motion-capture system (Northern Digital Inc., Waterloo, ON, Canada). The feasibility of implementing the I-FRT in a clinical setting was assessed by performing the I-FRT in 10 participants with mild balance impairments recruited from the outpatient physical therapy clinic (mean age, 55.8±13.5 years; four women) and obtaining their feedback using a NASA Task Load Index (NASA-TLX) questionnaire. There was moderate to good agreement between FRT measures made by the two measurement systems. The greatest agreement between the two measurement systems was found with the Kinect sensor placed at a distance of 2.5 m [intraclass correlation coefficient (2,k)=0.786; P<0.001] from the participant. Participants with mild balance impairments whose balance was assessed using the I-FRT software scored their experience favorably by assigning lower scores for the Frustration, Mental Demand, and Temporal Demand subscales on the NASA-TLX questionnaire. FRT measures made using the Kinect sensor I-FRT software provide a valid clinical measure that can be used with the gaming platforms.

  3. Beyond Faith and Face Validity: The Multitrait-Multimethod Matrix and the Convergent and Discriminant Validity of Oral Proficiency Tests.

    ERIC Educational Resources Information Center

    Stevenson, Douglas K.

    Recently there has been a renewed international interest in direct oral proficiency measures such as the oral interview. There has also been a growing awareness among some language testing specialists that all proficiency tests must be subjected to construct validation. It seems that the high face validity of oral interviews tends to cloud and…

  4. Reliability and validity of the revised Gibson Test of Cognitive Skills, a computer-based test battery for assessing cognition across the lifespan.

    PubMed

    Moore, Amy Lawson; Miller, Terissa M

    2018-01-01

    The purpose of the current study is to evaluate the validity and reliability of the revised Gibson Test of Cognitive Skills, a computer-based battery of tests measuring short-term memory, long-term memory, processing speed, logic and reasoning, visual processing, as well as auditory processing and word attack skills. This study included 2,737 participants aged 5-85 years. A series of studies was conducted to examine the validity and reliability using the test performance of the entire norming group and several subgroups. The evaluation of the technical properties of the test battery included content validation by subject matter experts, item analysis and coefficient alpha, test-retest reliability, split-half reliability, and analysis of concurrent validity with the Woodcock Johnson III Tests of Cognitive Abilities and Tests of Achievement. Results indicated strong sources of evidence of validity and reliability for the test, including internal consistency reliability coefficients ranging from 0.87 to 0.98, test-retest reliability coefficients ranging from 0.69 to 0.91, split-half reliability coefficients ranging from 0.87 to 0.91, and concurrent validity coefficients ranging from 0.53 to 0.93. The Gibson Test of Cognitive Skills-2 is a reliable and valid tool for assessing cognition in the general population across the lifespan.

  5. Reliability and validity of an audio signal modified shuttle walk test.

    PubMed

    Singla, Rupak; Rai, Richa; Faye, Abhishek Anil; Jain, Anil Kumar; Chowdhury, Ranadip; Bandyopadhyay, Debdutta

    2017-01-01

    The audio signal in the conventionally accepted protocol of the shuttle walk test (SWT) is not well understood by patients, and modification of the audio signal may improve the performance of the test. The aim of this study is to assess the validity and reliability of an audio signal modified SWT, called the Singla-Richa modified SWT (SWTSR), in healthy normal adults. In the SWTSR, the audio signal was modified with the addition of reverse counting to it. A total of 54 healthy normal adults underwent the conventional SWT (CSWT) once and the SWTSR twice on the same day. The validity was assessed by comparing outcomes of the SWTSR to outcomes of the CSWT using the Pearson correlation coefficient and Bland-Altman plot. Test-retest reliability of the SWTSR was assessed using the intraclass correlation coefficient (ICC). The acceptability of the modified test in comparison to the conventional test was assessed using a Likert scale. The distance walked (mean ± standard deviation) in the CSWT and SWTSR test was 853.33 ± 217.33 m and 857.22 ± 219.56 m, respectively (Pearson correlation coefficient: 0.98; P < 0.001), indicating the SWTSR to be a valid test. The SWTSR was found to be a reliable test with an ICC of 0.98 (95% confidence interval: 0.97-0.99). The acceptability of the SWTSR was significantly higher than that of the CSWT. The SWTSR with modified audio signal with reverse counting is a reliable as well as a valid test when compared with the CSWT in healthy normal adults. It is better understood by subjects compared to the CSWT.
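
    A Bland-Altman comparison of the kind mentioned in this abstract reduces to the mean of the paired differences (the bias) and the limits of agreement around it. The sketch below assumes the common convention of bias ± 1.96 standard deviations of the differences; the walking distances and the function name are hypothetical and are not data from the study.

```python
from statistics import mean, stdev


def bland_altman(method_a, method_b):
    """Return (bias, lower limit, upper limit) for paired measurements.

    Limits of agreement are taken as bias +/- 1.96 * SD of the paired
    differences, which is the usual convention (an assumption here).
    """
    diffs = [a - b for a, b in zip(method_a, method_b)]
    bias = mean(diffs)
    sd = stdev(diffs)
    return bias, bias - 1.96 * sd, bias + 1.96 * sd


# Hypothetical distances walked, in metres, for five subjects.
cswt = [850, 900, 620, 1010, 780]
swtsr = [860, 890, 630, 1020, 790]

bias, lower, upper = bland_altman(swtsr, cswt)
print(f"bias = {bias:.1f} m, limits of agreement = [{lower:.1f}, {upper:.1f}] m")
```

    A high Pearson correlation alone does not show that two tests agree, since one could read consistently higher than the other; the bias and limits of agreement address that point directly.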

  6. Content validity and reliability of test of gross motor development in Chilean children

    PubMed Central

    Cano-Cappellacci, Marcelo; Leyton, Fernanda Aleitte; Carreño, Joshua Durán

    2016-01-01

    OBJECTIVE: To validate a Spanish version of the Test of Gross Motor Development (TGMD-2) for the Chilean population. METHODS: Descriptive, transversal, non-experimental validity and reliability study. Four translators, three experts and 92 Chilean children, aged five to 10 years and attending a primary school in Santiago, Chile, participated. The Committee of Experts carried out translation, back-translation and revision processes to determine the translinguistic equivalence and content validity of the test, using the content validity index, in 2013. In addition, a pilot implementation was carried out to determine the reliability of the test in Spanish, using the intraclass correlation coefficient and the Bland-Altman method. We evaluated whether the results presented significant differences when the bat was replaced with a racket, using the t-test. RESULTS: We obtained a content validity index higher than 0.80 for language clarity and relevance of the TGMD-2 for children. There were significant differences in the object control subtest when comparing the results with bat and racket. The intraclass correlation coefficient for inter-rater, intra-rater and test-retest reliability was greater than 0.80 in all cases. CONCLUSIONS: The TGMD-2 has appropriate content validity to be applied in the Chilean population. The reliability of this test is within the appropriate parameters and its use could be recommended in this population after the establishment of normative data, setting a further precedent for the validation in other Latin American countries. PMID:26815160

  7. Definition study of a Variable Cycle Experimental Engine (VCEE) and associated test program and test plan

    NASA Technical Reports Server (NTRS)

    Allan, R. D.

    1978-01-01

    The Definition Study of a Variable Cycle Experimental Engine (VCEE) and Associated Test Program and Test Plan was initiated to identify the most cost effective program for a follow-on to the AST Test Bed Program. The VCEE Study defined various subscale VCEs based on different available core engine components, and a full scale VCEE utilizing current technology. The cycles were selected, preliminary design was accomplished, and program plans and engineering costs were developed for several program options. In addition to the VCEE program plans and options, a limited effort was applied to identifying programs that could logically be accomplished on the AST Test Bed Program VCE to extend the usefulness of this test hardware. Component programs were provided that could be accomplished prior to the start of a VCEE program.

  8. Reliability and Validity of the Standing Heel-Rise Test

    ERIC Educational Resources Information Center

    Yocum, Allison; McCoy, Sarah Westcott; Bjornson, Kristie F.; Mullens, Pamela; Burton, Gay Naganuma

    2010-01-01

    A standardized protocol for a pediatric heel-rise test was developed and reliability and validity are reported. Fifty-seven children developing typically (CDT) and 34 children with plantar flexion weakness performed three tests: unilateral heel rise, vertical jump, and force measurement using handheld dynamometry. Intraclass correlation…

  9. Cross-Cultural Validation of TEMAS, a Minority Projective Test.

    ERIC Educational Resources Information Center

    Costantino, Giuseppe; And Others

    The theoretical framework and cross-cultural validation of Tell-Me-A-Story (TEMAS), a projective test developed to measure personality development in ethnic minority children, is presented. The TEMAS test consists of 23 chromatic pictures which incorporate the following characteristics: (1) representation of antithetical concepts which the…

  10. Clinical Functional Capacity Testing in Patients With Facioscapulohumeral Muscular Dystrophy: Construct Validity and Interrater Reliability of Antigravity Tests.

    PubMed

    Rijken, Noortje H; van Engelen, Baziel G; Weerdesteyn, Vivian; Geurts, Alexander C

    2015-12-01

    To evaluate the construct validity and interrater reliability of 4 simple antigravity tests in a small group of patients with facioscapulohumeral muscular dystrophy (FSHD). Case-control study. University medical center. Patients with various severity levels of FSHD (n=9) and healthy control subjects (n=10) were included (N=19). Not applicable. A 4-point ordinal scale was designed to grade performance on the following 4 antigravity tests: sit to stance, stance to sit, step up, and step down. In addition, the 6-minute walk test, 10-m walking test, Berg Balance Scale, and timed Up and Go test were administered as conventional tests. Construct validity was determined by linear regression analysis using the Clinical Severity Score (CSS) as the dependent variable. Interrater agreement was tested using a κ analysis. Patients with FSHD performed worse on all 4 antigravity tests compared with the controls. Stronger correlations were found within than between test categories (antigravity vs conventional). The antigravity tests revealed the highest explained variance with regard to the CSS (R²=.86, P=.014). Interrater agreement was generally good. The results of this exploratory study support the construct validity and interrater reliability of the proposed antigravity tests for the assessment of functional capacity in patients with FSHD taking into account the use of compensatory strategies. Future research should further validate these results in a larger sample of patients with FSHD.

  11. Validation of Milliflex® Quantum for Bioburden Testing of Pharmaceutical Products.

    PubMed

    Gordon, Oliver; Goverde, Marcel; Staerk, Alexandra; Roesti, David

    2017-01-01

    This article reports the validation strategy used to demonstrate that the Milliflex® Quantum yielded non-inferior results to the traditional bioburden method. It was validated according to USP <1223>, European Pharmacopoeia 5.1.6, and Parenteral Drug Association Technical Report No. 33 and comprised the validation parameters robustness, ruggedness, repeatability, specificity, limit of detection and quantification, accuracy, precision, linearity, range, and equivalence in routine operation. For the validation, a combination of pharmacopeial ATCC strains as well as a broad selection of in-house isolates were used. In-house isolates were used in stressed state. Results were statistically evaluated regarding the pharmacopeial acceptance criterion of ≥70% recovery compared to the traditional method. Post-hoc test power calculations verified the appropriateness of the used sample size to detect such a difference. Furthermore, equivalence tests verified non-inferiority of the rapid method as compared to the traditional method. In conclusion, the rapid bioburden on basis of the Milliflex® Quantum was successfully validated as alternative method to the traditional bioburden test. LAY ABSTRACT: Pharmaceutical drug products must fulfill specified quality criteria regarding their microbial content in order to ensure patient safety. Drugs that are delivered into the body via injection, infusion, or implantation must be sterile (i.e., devoid of living microorganisms). Bioburden testing measures the levels of microbes present in the bulk solution of a drug before sterilization, and thus it provides important information for manufacturing a safe product. In general, bioburden testing has to be performed using the methods described in the pharmacopoeias (membrane filtration or plate count). These methods are well established and validated regarding their effectiveness; however, the incubation time required to visually identify microbial colonies is long. Thus, alternative

  12. Non-Nuclear Validation Test Results of a Closed Brayton Cycle Test-Loop

    NASA Astrophysics Data System (ADS)

    Wright, Steven A.

    2007-01-01

    Both NASA and DOE have programs that are investigating advanced power conversion cycles for planetary surface power on the moon or Mars, or for next generation nuclear power plants on earth. Although open Brayton cycles are in use for many applications (combined cycle power plants, aircraft engines), only a few closed Brayton cycles have been tested. Experience with closed Brayton cycles coupled to nuclear reactors is even more limited and current projections of Brayton cycle performance are based on analytic models. This report describes and compares experimental results with model predictions from a series of non-nuclear tests using a small scale closed loop Brayton cycle available at Sandia National Laboratories. A substantial amount of testing has been performed, and the information is being used to help validate models. In this report we summarize the results from three kinds of tests. These tests include: 1) test results that are useful for validating the characteristic flow curves of the turbomachinery for various gases ranging from ideal gases (Ar or Ar/He) to non-ideal gases such as CO2, 2) test results that represent shut down transients and decay heat removal capability of Brayton loops after reactor shut down, and 3) tests that map a range of operating power versus shaft speed curve and turbine inlet temperature that are useful for predicting stable operating conditions during both normal and off-normal operating behavior. These tests reveal significant interactions between the reactor and balance of plant. Specifically these results predict limited speed up behavior of the turbomachinery caused by loss of load, the conditions for stable operation, and for direct cooled reactors, the tests reveal that the coast down behavior during loss of power events can extend for hours provided the ultimate heat sink remains available.

  13. High Burnup Dry Storage Cask Research and Development Project, Final Test Plan

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    None

    2014-02-27

    EPRI is leading a project team to develop and implement the first five years of a Test Plan to collect data from a SNF dry storage system containing high burnup fuel. The Test Plan defined in this document outlines the data to be collected, and the storage system design, procedures, and licensing necessary to implement the Test Plan. The main goals of the proposed test are to provide confirmatory data for models, future SNF dry storage cask design, and to support license renewals and new licenses for ISFSIs. To provide data that is most relevant to high burnup fuel in dry storage, the design of the test storage system must mimic real conditions that high burnup SNF experiences during all stages of dry storage: loading, cask drying, inert gas backfilling, and transfer to the ISFSI for multi-year storage. Along with other optional modeling, SETs, and SSTs, the data collected in this Test Plan can be used to evaluate the integrity of dry storage systems and the high burnup fuel contained therein over many decades. It should be noted that the Test Plan described in this document discusses essential activities that go beyond the first five years of Test Plan implementation. The first five years of the Test Plan include activities up through loading the cask, initiating the data collection, and beginning the long-term storage period at the ISFSI. The Test Plan encompasses the overall project that includes activities that may not be completed until 15 or more years from now, including continued data collection, shipment of the Research Project Cask to a Fuel Examination Facility, opening the cask at the Fuel Examination Facility, and examining the high burnup fuel after the initial storage period.

  14. Development and Test Plans for the MSR EEV

    NASA Technical Reports Server (NTRS)

    Dillman, Robert; Laub, Bernard; Kellas, Sotiris; Schoenenberger, Mark

    2005-01-01

    The goal of the proposed Mars Sample Return mission is to bring samples from the surface of Mars back to Earth for thorough examination and analysis. The Earth Entry Vehicle is the passive entry body designed to protect the sample container from entry heating and deceleration loads during descent through the Earth's atmosphere to a recoverable location on the surface. This paper summarizes the entry vehicle design and outlines the subsystem development and testing currently planned in preparation for an entry vehicle flight test in 2010 and mission launch in 2013. Planned efforts are discussed for the areas of the thermal protection system, vehicle trajectory, aerodynamics and aerothermodynamics, impact energy absorption, structure and mechanisms, and the entry vehicle flight test.

  15. L1 Adaptive Control Law for Flexible Space Launch Vehicle and Proposed Plan for Flight Test Validation

    NASA Technical Reports Server (NTRS)

    Kharisov, Evgeny; Gregory, Irene M.; Cao, Chengyu; Hovakimyan, Naira

    2008-01-01

    This paper explores application of the L1 adaptive control architecture to a generic flexible Crew Launch Vehicle (CLV). Adaptive control has the potential to improve performance and enhance safety of space vehicles that often operate in very unforgiving and occasionally highly uncertain environments. NASA's development of the next generation space launch vehicles presents an opportunity for adaptive control to contribute to improved performance of this statically unstable vehicle with low damping and low bending frequency flexible dynamics. In this paper, we consider the L1 adaptive output feedback controller to control the low frequency structural modes and propose steps to validate the adaptive controller performance utilizing one of the experimental test flights for the CLV Ares-I Program.

  16. 46 CFR 162.060-24 - Test Plan requirements.

    Code of Federal Regulations, 2014 CFR

    2014-10-01

    ... (including the test facility's standard operating procedures for achieving such conditions). (9) Sampling.... Test Plans must include an examination of all the manufacturer's stated requirements and procedures for... potential environmental, health, and safety issues; unusual operating requirements; and any issues related...

  17. 46 CFR 162.060-24 - Test Plan requirements.

    Code of Federal Regulations, 2013 CFR

    2013-10-01

    ... (including the test facility's standard operating procedures for achieving such conditions). (9) Sampling.... Test Plans must include an examination of all the manufacturer's stated requirements and procedures for... potential environmental, health, and safety issues; unusual operating requirements; and any issues related...

  18. 46 CFR 162.060-24 - Test Plan requirements.

    Code of Federal Regulations, 2012 CFR

    2012-10-01

    ... (including the test facility's standard operating procedures for achieving such conditions). (9) Sampling.... Test Plans must include an examination of all the manufacturer's stated requirements and procedures for... potential environmental, health, and safety issues; unusual operating requirements; and any issues related...

  19. Targeting Low Career Confidence Using the Career Planning Confidence Scale

    ERIC Educational Resources Information Center

    McAuliffe, Garrett; Jurgens, Jill C.; Pickering, Worth; Calliotte, James; Macera, Anthony; Zerwas, Steven

    2006-01-01

    The authors describe the development and validation of a test of career planning confidence that makes possible the targeting of specific problem issues in employment counseling. The scale, developed using a rational process and the authors' experience with clients, was tested for criterion-related validity against 2 other measures. The scale…

  20. Transit bus stop pedestrian warning application : acceptance test plan : final report.

    DOT National Transportation Integrated Search

    2016-10-14

    This document is the Acceptance Test Plan for the Transit Bus Stop Pedestrian Warning (TSPW) application. This report describes the test and demonstration plan to verify that the application meets its functional and performance requirements.

  1. Parametric evaluation of the cost effectiveness of Shuttle payload vibroacoustic test plans

    NASA Technical Reports Server (NTRS)

    Stahle, C. V.; Gongloff, H. R.; Keegan, W. B.; Young, J. P.

    1978-01-01

    Consideration is given to alternate vibroacoustic test plans for sortie and free flyer Shuttle payloads. Statistical decision models for nine test plans provide a viable method of evaluating the cost effectiveness of alternate vibroacoustic test plans and the associated test levels. The methodology is a major step toward the development of a useful tool for the quantitative tailoring of vibroacoustic test programs to sortie and free flyer payloads. A broader application of the methodology is now possible by the use of the OCTAVE computer code.

  2. Concurrent validity and clinical usefulness of several individually administered tests of children's social-emotional cognition.

    PubMed

    McKown, Clark

    2007-03-01

    In this study, the validity of 5 tests of children's social-emotional cognition, defined as their encoding, memory, and interpretation of social information, was tested. Participants were 126 clinic-referred children between the ages of 5 and 17. All 5 tests were evaluated in terms of their (a) concurrent validity, (b) incremental validity, and (c) clinical usefulness in predicting social functioning. Tests included measures of nonverbal sensitivity, social language, and social problem solving. Criterion measures included parent and teacher report of social functioning. Analyses support the concurrent validity of all measures, and the incremental validity and clinical usefulness of tests of pragmatic language and problem solving.

  3. Secondary Waste Cast Stone Waste Form Qualification Testing Plan

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Westsik, Joseph H.; Serne, R. Jeffrey

    2012-09-26

    The Hanford Tank Waste Treatment and Immobilization Plant (WTP) is being constructed to treat the 56 million gallons of radioactive waste stored in 177 underground tanks at the Hanford Site. The WTP includes a pretreatment facility to separate the wastes into high-level waste (HLW) and low-activity waste (LAW) fractions for vitrification and disposal. The LAW will be converted to glass for final disposal at the Integrated Disposal Facility (IDF). Cast Stone, a cementitious waste form, has been selected for solidification of this secondary waste stream after treatment in the ETF. The secondary-waste Cast Stone waste form must be acceptable for disposal in the IDF. This secondary waste Cast Stone waste form qualification testing plan outlines the testing of the waste form and immobilization process to demonstrate that the Cast Stone waste form can comply with the disposal requirements. Specifications for the secondary-waste Cast Stone waste form have not been established. For this testing plan, Cast Stone specifications are derived from specifications for the immobilized LAW glass in the WTP contract, the waste acceptance criteria for the IDF, and the waste acceptance criteria in the IDF Permit issued by the State of Washington. This testing plan outlines the testing needed to demonstrate that the waste form can comply with these waste form specifications and acceptance criteria. The testing program must also demonstrate that the immobilization process can be controlled to consistently provide an acceptable waste form product. This testing plan also outlines the testing needed to provide the technical basis for understanding the long-term performance of the waste form in the disposal environment. These waste form performance data are needed to support performance assessment analyses of the long-term environmental impact of the secondary-waste Cast Stone waste form in the IDF.

  4. Reliability and validity of two isometric squat tests.

    PubMed

    Blazevich, Anthony J; Gill, Nicholas; Newton, Robert U

    2002-05-01

    The purpose of the present study was first to examine the reliability of isometric squat (IS) and isometric forward hack squat (IFHS) tests to determine if repeated measures on the same subjects yielded reliable results. The second purpose was to examine the relation between isometric and dynamic measures of strength to assess validity. Fourteen male subjects performed maximal IS and IFHS tests on 2 occasions and 1 repetition maximum (1-RM) free-weight squat and forward hack squat (FHS) tests on 1 occasion. The 2 tests were found to be highly reliable (intraclass correlation coefficient [ICC](IS) = 0.97 and ICC(IFHS) = 1.00). There was a strong relation between average IS and 1-RM squat performance, and between IFHS and 1-RM FHS performance (r(squat) = 0.77, r(FHS) = 0.76; p < 0.01), but a weak relation between squat and FHS test performances (r < 0.55). There was also no difference between observed 1-RM values and those predicted by our regression equations. Errors in predicting 1-RM performance were in the order of 8.5% (standard error of the estimate [SEE] = 13.8 kg) and 7.3% (SEE = 19.4 kg) for IS and IFHS respectively. Correlations between isometric and 1-RM tests were not of sufficient size to indicate high validity of the isometric tests. Together the results suggest that IS and IFHS tests could detect small differences in multijoint isometric strength between subjects, or performance changes over time, and that the scores in the isometric tests are well related to 1-RM performance. However, there was a small error when predicting 1-RM performance from isometric performance, and these tests have not been shown to discriminate between small changes in dynamic strength. The weak relation between squat and FHS test performance can be attributed to differences in the movement patterns of the tests.

  5. Successful MPPF Pneumatics Verification and Validation Testing

    NASA Image and Video Library

    2017-03-28

    Engineers and technicians completed verification and validation testing of several pneumatic systems inside and outside the Multi-Payload Processing Facility (MPPF) at NASA's Kennedy Space Center in Florida. In view is the service platform for Orion spacecraft processing. The MPPF will be used for offline processing and fueling of the Orion spacecraft and service module stack before launch. Orion also will be de-serviced in the MPPF after a mission. The Ground Systems Development and Operations Program (GSDO) is overseeing upgrades to the facility. The Engineering Directorate led the recent pneumatic tests.

  6. Urine specimen validity test for drug abuse testing in workplace and court settings.

    PubMed

    Lin, Shin-Yu; Lee, Hei-Hwa; Lee, Jong-Feng; Chen, Bai-Hsiun

    2018-01-01

    In recent decades, urine drug testing in the workplace has become common in many countries in the world. There have been several studies concerning the use of the urine specimen validity test (SVT) for drug abuse testing administered in the workplace. However, very little data exists concerning the urine SVT on drug abuse tests from court specimens, including dilute, substituted, adulterated, and invalid tests. We investigated 21,696 submitted urine drug test samples for SVT from workplace and court settings in southern Taiwan over 5 years. All immunoassay screen-positive urine specimen drug tests were confirmed by gas chromatography/mass spectrometry. We found that the mean 5-year prevalence of tampering (dilute, substituted, or invalid tests) in urine specimens from the workplace and court settings were 1.09% and 3.81%, respectively. The mean 5-year percentage of dilute, substituted, and invalid urine specimens from the workplace were 89.2%, 6.8%, and 4.1%, respectively. The mean 5-year percentage of dilute, substituted, and invalid urine specimens from the court were 94.8%, 1.4%, and 3.8%, respectively. No adulterated cases were found among the workplace or court samples. The most common drug identified from the workplace specimens was amphetamine, followed by opiates. The most common drug identified from the court specimens was ketamine, followed by amphetamine. We suggest that all urine specimens taken for drug testing from both the workplace and court settings need to be tested for validity. Copyright © 2017. Published by Elsevier B.V.

  7. Single Event Effects (SEE) Testing: Practical Approach to Test Plans

    NASA Technical Reports Server (NTRS)

    LaBel, Kenneth A.; Pellish, Jonathan Allen; Berg, Melanie D.

    2014-01-01

    While standards and guidelines for performing SEE testing have existed for several decades, guidance for developing SEE test plans has not been as easy to find. In this presentation, the variety of areas that need to be considered ranging from resource issues (funds, personnel, schedule) to extremely technical challenges (particle interaction and circuit application), shall be discussed. Note: we consider the approach outlined here as a "living" document: Mission-specific constraints and new technology related issues always need to be taken into account.

  8. Applying Independent Verification and Validation to Automatic Test Equipment

    NASA Technical Reports Server (NTRS)

    Calhoun, Cynthia C.

    1997-01-01

    This paper describes a general overview of applying Independent Verification and Validation (IV&V) to Automatic Test Equipment (ATE). The overview is not inclusive of all IV&V activities that can occur or of all development and maintenance items that can be validated and verified, during the IV&V process. A sampling of possible IV&V activities that can occur within each phase of the ATE life cycle are described.

  9. The Michigan Alcoholism Screening Test (MAST): A Statistical Validation Analysis

    ERIC Educational Resources Information Center

    Laux, John M.; Newman, Isadore; Brown, Russ

    2004-01-01

    This study extends the Michigan Alcoholism Screening Test (MAST; M. L. Selzer, 1971) literature base by examining 4 issues related to the validity of the MAST scores. Specifically, the authors examine the validity of the MAST scores in light of the presence of impression management, participant demographic variables, and item endorsement…

  10. Validation of Linguistic and Communicative Oral Language Tests for Spanish-English Bilingual Programs.

    ERIC Educational Resources Information Center

    Politzer, Robert L.; And Others

    1983-01-01

    The development, administration, and scoring of a communicative test and its validation with tests of linguistic and sociolinguistic competence in English and Spanish are reported. Correlation with measures of home language use and school achievement are also presented, and issues of test validation for bilingual programs are discussed. (MSE)

  11. Decision Models for Determining the Optimal Life Test Sampling Plans

    NASA Astrophysics Data System (ADS)

    Nechval, Nicholas A.; Nechval, Konstantin N.; Purgailis, Maris; Berzins, Gundars; Strelchonok, Vladimir F.

    2010-11-01

    Life test sampling plan is a technique, which consists of sampling, inspection, and decision making in determining the acceptance or rejection of a batch of products by experiments for examining the continuous usage time of the products. In life testing studies, the lifetime is usually assumed to be distributed as either a one-parameter exponential distribution, or a two-parameter Weibull distribution with the assumption that the shape parameter is known. Such oversimplified assumptions can facilitate the follow-up analyses, but may overlook the fact that the lifetime distribution can significantly affect the estimation of the failure rate of a product. Moreover, sampling costs, inspection costs, warranty costs, and rejection costs are all essential, and ought to be considered in choosing an appropriate sampling plan. The choice of an appropriate life test sampling plan is a crucial decision problem because a good plan not only can help producers save testing time, and reduce testing cost; but it also can positively affect the image of the product, and thus attract more consumers to buy it. This paper develops the frequentist (non-Bayesian) decision models for determining the optimal life test sampling plans with an aim of cost minimization by identifying the appropriate number of product failures in a sample that should be used as a threshold in judging the rejection of a batch. The two-parameter exponential and Weibull distributions with two unknown parameters are assumed to be appropriate for modelling the lifetime of a product. A practical numerical application is employed to demonstrate the proposed approach.
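
    The decision rule described above (reject a batch when the number of failures in a sample exceeds a threshold) can be illustrated with a minimal sketch. It is not the authors' model: it assumes a one-parameter exponential lifetime, a fixed test duration, and a simple two-scenario cost trade-off. The function names, the "good" and "bad" mean lifetimes, and the cost parameters are all hypothetical.

```python
import math


def acceptance_probability(n, c, test_time, mean_life):
    """Probability of accepting a batch when n units are tested for
    test_time and the batch is accepted if at most c units fail.
    Assumes exponentially distributed lifetimes (an illustrative choice)."""
    p_fail = 1.0 - math.exp(-test_time / mean_life)
    return sum(
        math.comb(n, k) * p_fail**k * (1.0 - p_fail) ** (n - k)
        for k in range(c + 1)
    )


def best_threshold(n, test_time, good_life, bad_life,
                   cost_reject_good, cost_accept_bad):
    """Pick the failure threshold c that minimises a hypothetical cost:
    the cost of rejecting a good batch plus the cost of accepting a bad one."""
    def cost(c):
        reject_good = 1.0 - acceptance_probability(n, c, test_time, good_life)
        accept_bad = acceptance_probability(n, c, test_time, bad_life)
        return cost_reject_good * reject_good + cost_accept_bad * accept_bad

    return min(range(n + 1), key=cost)
```

    A call such as best_threshold(20, 100.0, 1000.0, 200.0, 1.0, 5.0) returns the threshold that balances the two error costs under these assumptions; the paper's own models also account for sampling, inspection, and warranty costs.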

  12. Analytical validation of a psychiatric pharmacogenomic test.

    PubMed

    Jablonski, Michael R; King, Nina; Wang, Yongbao; Winner, Joel G; Watterson, Lucas R; Gunselman, Sandra; Dechairo, Bryan M

    2018-05-01

    The aim of this study was to validate the analytical performance of a combinatorial pharmacogenomics test designed to aid in the appropriate medication selection for neuropsychiatric conditions. Genomic DNA was isolated from buccal swabs. Twelve genes (65 variants/alleles) associated with psychotropic medication metabolism, side effects, and mechanisms of actions were evaluated by bead array, MALDI-TOF mass spectrometry, and/or capillary electrophoresis methods (GeneSight Psychotropic, Assurex Health, Inc.). The combinatorial pharmacogenomics test has a dynamic range of 2.5-20 ng/μl of input genomic DNA, with comparable performance for all assays included in the test. Both the precision and accuracy of the test were >99.9%, with individual gene components between 99.4 and 100%. This study demonstrates that the combinatorial pharmacogenomics test is robust and reproducible, making it suitable for clinical use.

  13. VEGGIE and the VEG-01 Hardware Validation Test

    NASA Technical Reports Server (NTRS)

    Massa, Gioia

    2015-01-01

    This is a presentation to NASA HQ for a lunch-and-learn detailing the Veggie testing and results. Space Life and Physical Sciences plans to record this presentation and make it available for public display.

  14. Translation, adaptation and validation the contents of the Diabetes Medical Management Plan for the Brazilian context

    PubMed Central

    Torres, Heloísa de Carvalho; Chaves, Fernanda Figueredo; da Silva, Daniel Dutra Romualdo; Bosco, Adriana Aparecida; Gabriel, Beatriz Diniz; Reis, Ilka Afonso; Rodrigues, Júlia Santos Nunes; Pagano, Adriana Silvina

    2016-01-01

    ABSTRACT Objective: to translate, adapt and validate the contents of the Diabetes Medical Management Plan for the Brazilian context. This protocol was developed by the American Diabetes Association and guides the procedure of educators for the care of children and adolescents with diabetes in schools. Method: this methodological study was conducted in four stages: initial translation, synthesis of initial translation, back translation and content validation by an expert committee, composed of 94 specialists (29 applied linguists and 65 health professionals), for evaluation of the translated version through an online questionnaire. The concordance level of the judges was calculated based on the Content Validity Index. Data were exported into the R program for statistical analysis. Results: the evaluation of the instrument showed good concordance between the judges of the Health and Applied Linguistics areas, with a mean content validity index of 0.9 and 0.89, respectively, and slight variability of the index between groups (difference of less than 0.01). The items in the translated version, evaluated as unsatisfactory by the judges, were reformulated based on the considerations of the professionals of each group. Conclusion: a Brazilian version of Diabetes Medical Management Plan was constructed, called the Plano de Manejo do Diabetes na Escola. PMID:27508911
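
    As an illustration of the Content Validity Index mentioned above, the sketch below uses one common definition: the share of judges who rate an item 3 or 4 on a 4-point relevance scale. The abstract does not state the scale or cut-off the study used (and the study ran its analysis in R), so the scale, the cut-off, and the function names are assumptions.

```python
def item_cvi(ratings, relevant_min=3):
    """Item-level CVI: fraction of judges rating the item at or above
    relevant_min (assumed 4-point scale, ratings of 3 or 4 count as relevant)."""
    return sum(r >= relevant_min for r in ratings) / len(ratings)


def mean_cvi(items):
    """Average of the item-level CVIs across all items of the instrument."""
    return sum(item_cvi(ratings) for ratings in items) / len(items)
```

    For example, an item rated [4, 4, 3, 2] by four judges has an item-level CVI of 0.75 under this definition.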

  15. Acoustic-Structure Interaction in Rocket Engines: Validation Testing

    NASA Technical Reports Server (NTRS)

    Davis, R. Benjamin; Joji, Scott S.; Parks, Russel A.; Brown, Andrew M.

    2009-01-01

    While analyzing a rocket engine component, it is often necessary to account for any effects that adjacent fluids (e.g., liquid fuels or oxidizers) might have on the structural dynamics of the component. To better characterize the fully coupled fluid-structure system responses, an analytical approach that models the system as a coupled expansion of rigid wall acoustic modes and in vacuo structural modes has been proposed. The present work seeks to experimentally validate this approach. To experimentally observe well-coupled system modes, the test article and fluid cavities are designed such that the uncoupled structural frequencies are comparable to the uncoupled acoustic frequencies. The test measures the natural frequencies, mode shapes, and forced response of cylindrical test articles in contact with fluid-filled cylindrical and/or annular cavities. The test article is excited with a stinger and the fluid-loaded response is acquired using a laser-doppler vibrometer. The experimentally determined fluid-loaded natural frequencies are compared directly to the results of the analytical model. Due to the geometric configuration of the test article, the analytical model is found to be valid for natural modes with circumferential wave numbers greater than four. In the case of these modes, the natural frequencies predicted by the analytical model demonstrate excellent agreement with the experimentally determined natural frequencies.

  16. Measurement of Dietary Restraint: Validity Tests of Four Questionnaires

    PubMed Central

    Williamson, Donald A.; Martin, Corby K.; York-Crowe, Emily; Anton, Stephen D.; Redman, Leanne M.; Han, Hongmei; Ravussin, Eric

    2007-01-01

    This study tested the validity of four measures of dietary restraint: Dutch Eating Behavior Questionnaire, Eating Inventory (EI), Revised Restraint Scale (RS), and the Current Dieting Questionnaire. Dietary restraint has been implicated as a determinant of overeating and binge eating. Conflicting findings have been attributed to different methods for measuring dietary restraint. The validity of four self-report measures of dietary restraint and dieting behavior was tested using: 1) factor analysis, 2) changes in dietary restraint in a randomized controlled trial of different methods to achieve calorie restriction, and 3) correlation of changes in dietary restraint with an objective measure of energy balance, calculated from the changes in fat mass and fat-free mass over a six-month dietary intervention. Scores from all four questionnaires, measured at baseline, formed a dietary restraint factor, but the RS also loaded on a binge eating factor. Based on change scores, the EI Restraint scale was the only measure that correlated significantly with energy balance expressed as a percentage of energy required for weight maintenance. These findings suggest that, of the four questionnaires tested, the EI Restraint scale was the most valid measure of the intent to diet and actual caloric restriction. PMID:17101191

  17. Intratester Reliability and Construct Validity of a Hip Abductor Eccentric Strength Test.

    PubMed

    Brindle, Richard A; Ebaugh, David; Milner, Clare E

    2018-06-06

    Side-lying hip abductor strength tests are commonly used to evaluate muscle strength. In a "break" test, the tester applies sufficient force to lower the limb to the table while the patient resists. The peak force is postulated to occur while the leg is lowering, thus representing the participant's eccentric muscle strength. However, it is unclear whether peak force occurs before or after the leg begins to lower. To determine intrarater reliability and construct validity of a hip abductor eccentric strength test. Intrarater reliability and construct validity study. Twenty healthy adults (26 [6] y; 1.66 [0.06] m; 62.2 [8.0] kg) made 2 visits to the laboratory at least 1 week apart. During the hip abductor eccentric strength test, a handheld dynamometer recorded peak force and time to peak force, and limb position was recorded via a motion capture system. Intrarater reliability was determined using intraclass correlation, SEM, and minimal detectable difference. Construct validity was assessed by determining if peak force occurred after the start of the lowering phase using a 1-sample t test. The hip abductor eccentric strength test had substantial intrarater reliability (intraclass correlation (3,3)  = .88; 95% confidence interval, .65-.95), SEM of 0.9 %BWh, and a minimal detectable difference of 2.5 %BWh. Construct validity was established as peak force occurred 2.1 (0.6) seconds (range: 0.7-3.7 s) after the start of the lowering phase of the test (P ≤ .001). The hip abductor eccentric strength test is a valid and reliable measure of eccentric muscle strength. This test may be used clinically to assess changes in eccentric muscle strength over time.
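
    The relation between the SEM and the minimal detectable difference reported above can be sketched with the commonly used formulas SEM = SD * sqrt(1 - ICC) and MDD = 1.96 * sqrt(2) * SEM (95% confidence level). The abstract does not say which formulas the authors applied, so this is an assumption, and the function names are hypothetical.

```python
import math


def sem_from_icc(sd, icc):
    """Standard error of measurement from the between-subject SD and the ICC."""
    return sd * math.sqrt(1.0 - icc)


def minimal_detectable_difference(sem, z=1.96):
    """Smallest change exceeding measurement error at the confidence level
    implied by z (1.96 corresponds to 95%), for a difference of two scores."""
    return z * math.sqrt(2.0) * sem
```

    With the reported SEM of 0.9 %BWh, this formula gives about 2.5 %BWh, which is consistent with the minimal detectable difference stated in the abstract.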

  18. Detailed Test Plan Redundant Sensor Strapdown IMU Evaluation Program

    NASA Technical Reports Server (NTRS)

    Hartwell, T.; Miyatake, Y.; Wedekind, D. E.

    1971-01-01

    The test plan for a redundant sensor strapdown inertial measuring unit evaluation program is presented. The subjects discussed are: (1) test philosophy and limitations, (2) test sequence, (3) equipment specifications, (4) general operating procedures, (5) calibration procedures, (6) alignment test phase, and (7) navigation test phase. The data and analysis requirements are analyzed.

  19. Predictive validity of the Biomedical Admissions Test: an evaluation and case study.

    PubMed

    McManus, I C; Ferguson, Eamonn; Wakeford, Richard; Powis, David; James, David

    2011-01-01

    There has been an increase in the use of pre-admission selection tests for medicine. Such tests need to show good psychometric properties. Here, we use a paper by Emery and Bell [2009. The predictive validity of the Biomedical Admissions Test for pre-clinical examination performance. Med Educ 43:557-564] as a case study to evaluate and comment on the reporting of psychometric data in the field of medical student selection (and the comments apply to many papers in the field). We highlight pitfalls when reliability data are not presented, how simple zero-order associations can lead to inaccurate conclusions about the predictive validity of a test, and how biases need to be explored and reported. We show with BMAT that it is the knowledge part of the test which does all the predictive work. We show that without evidence of incremental validity it is difficult to assess the value of any selection tests for medicine.

  20. The Predictive Validity of the Metropolitan Readiness Tests, 1976 Edition.

    ERIC Educational Resources Information Center

    Nagle, Richard J.

    1979-01-01

    A sample of 176 first-grade children was tested on the Metropolitan Readiness Tests, 1976 Edition (MRT), during the initial month of school and was retested eight months later on the Stanford Achievement Test. Results demonstrated substantial validity of the MRT for predicting first-grade achievement. (Author/CTM)

  1. Test Plan Procedure for Experiment T-003

    DOT National Transportation Integrated Search

    1971-05-19

    This document defines the type, sequence, and procedural details required to perform each test on the T-003 experiment aerosol analyzer, its subsystems and components. This plan utilizes the flexibility allowed for instruments in criticality category...

  2. Work plan for cone penetrometer comparison testing.

    DOT National Transportation Integrated Search

    2011-01-01

    The work plan and experimental design are developed around aiding engineers and geologists within the : Wisconsin Department of Transportation to understand the mechanisms controlling cone penetration test : results so that they can decide when the t...

  3. Validation testing of a soil macronutrient sensing system

    USDA-ARS's Scientific Manuscript database

    Rapid on-site measurements of soil macronutrients (i.e., nitrogen, phosphorus, and potassium) are needed for site-specific crop management, where fertilizer nutrient application rates are adjusted spatially based on local requirements. This study reports on validation testing of a previously develop...

  4. Validation of biological activity testing procedure of recombinant human interleukin-7.

    PubMed

    Lutsenko, T N; Kovalenko, M V; Galkin, O Yu

    2017-01-01

    Validation procedure for method of monitoring the biological activity of recombinant human interleukin-7 has been developed and conducted according to the requirements of national and international recommendations. This method is based on the ability of recombinant human interleukin-7 to induce proliferation of T lymphocytes. It has been shown that to control the biological activity of recombinant human interleukin-7 peripheral blood mononuclear cells (PBMCs) derived from blood or cell lines can be used. Validation characteristics that should be determined depend on the method, type of product or object test/measurement and biological test systems used in research. The validation procedure for the method of control of biological activity of recombinant human interleukin-7 in peripheral blood mononuclear cells showed satisfactory results on all parameters tested such as specificity, accuracy, precision and linearity.

  5. Federal COBOL Compiler Testing Service Compiler Validation Request Information.

    DTIC Science & Technology

    1977-05-09

    background of the Federal COBOL Compiler Testing Service which was set up by a memorandum of agreement between the National Bureau of Standards and the...Federal Standard, and the requirement of COBOL compiler validation in the procurement process. It also contains a list of all software products...produced by the software Development Division in support of the FCCTS as well as the Validation Summary Reports produced as a result of discharging the

  6. Advanced On-the-Job Training System: Master Test Plan

    DTIC Science & Technology

    1990-05-01

    synonymous with program evaluation and consists of a plan to evaluate AOTS with regard to assessment of the four critical issues of system compliance...acceptance, performance and suitability. Within the MTP, these critical issues are assessed at subcomponent, component, and subsystem levels. 14. SUBJECT...Master Test Plan is synonymous with program evaluation and consists of a plan to evaluate AOTS with regard to assessment of the four critical issues

  7. 14 CFR 437.25 - Flight test plan.

    Code of Federal Regulations, 2011 CFR

    2011-01-01

    ... 14 Aeronautics and Space 4 2011-01-01 2011-01-01 false Flight test plan. 437.25 Section 437.25 Aeronautics and Space COMMERCIAL SPACE TRANSPORTATION, FEDERAL AVIATION ADMINISTRATION, DEPARTMENT OF... reusable suborbital rocket. Operational Safety Documentation ...

  8. 14 CFR 437.25 - Flight test plan.

    Code of Federal Regulations, 2010 CFR

    2010-01-01

    ... 14 Aeronautics and Space 4 2010-01-01 2010-01-01 false Flight test plan. 437.25 Section 437.25 Aeronautics and Space COMMERCIAL SPACE TRANSPORTATION, FEDERAL AVIATION ADMINISTRATION, DEPARTMENT OF... reusable suborbital rocket. Operational Safety Documentation ...

  9. Validity of the Eating Attitudes Test and the Eating Disorders Inventory in Bulimia Nervosa.

    ERIC Educational Resources Information Center

    Gross, Janet; And Others

    1986-01-01

    Assessed criterion and concurrent validity of the Eating Attitudes Test and the Eating Disorder Inventory in 82 women with bulimia nervosa. Both tests demonstrated criterion validity by discriminating bulimia nervosa subjects from normals. Only weak support was found for concurrent validity within bulimia subjects. Recommends combination of…

  10. Impact on Participation and Autonomy: Test of Validity and Reliability for Older Persons.

    PubMed

    Hammar, Isabelle Ottenvall; Ekelund, Christina; Wilhelmson, Katarina; Eklund, Kajsa

    2014-11-06

    In research and healthcare it is important to measure older persons' self-determination in order to improve their possibilities to decide for themselves in daily life. The questionnaire Impact on Participation and Autonomy (IPA) assesses self-determination, but is not constructed for older persons. The aim of this study was to examine the validity and reliability of the IPA-S questionnaire for persons aged 70 years and older. The study was performed in two steps; first a validity test of the Swedish version of the questionnaire, IPA-S, followed by a reliability test-retest of an adjusted version. The validity was tested with focus groups and individual interviews on persons aged 77-88 years, and the reliability on persons aged 70-99 years. The validity test result showed that IPA-S is valid for older persons but it was too extensive and the phrasing of the items needed adjustments. The reliability test-retest on the adjusted questionnaire, IPA- Older persons (IPA-O), showed that 15 of 22 items had high agreement. IPA-O can be used to measure older persons' self-determination in their care and rehabilitation.

  11. Validity and the Consequences of Test Interpretation and Use

    ERIC Educational Resources Information Center

    Hubley, Anita M.; Zumbo, Bruno D.

    2011-01-01

    The vast majority of measures have, at their core, a purpose of personal and social change. If test developers and users want measures to have personal and social consequences and impact, then it is critical to consider the consequences and side effects of measurement in the validation process itself. The consequential basis of test interpretation…

  12. An entropy-based nonparametric test for the validation of surrogate endpoints.

    PubMed

    Miao, Xiaopeng; Wang, Yong-Cheng; Gangopadhyay, Ashis

    2012-06-30

    We present a nonparametric test to validate surrogate endpoints based on measure of divergence and random permutation. This test is a proposal to directly verify the Prentice statistical definition of surrogacy. The test does not impose distributional assumptions on the endpoints, and it is robust to model misspecification. Our simulation study shows that the proposed nonparametric test outperforms the practical test of the Prentice criterion in terms of both robustness of size and power. We also evaluate the performance of three leading methods that attempt to quantify the effect of surrogate endpoints. The proposed method is applied to validate magnetic resonance imaging lesions as the surrogate endpoint for clinical relapses in a multiple sclerosis trial. Copyright © 2012 John Wiley & Sons, Ltd.
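
    The random-permutation idea behind the test above can be illustrated with a generic two-sample permutation test. This sketch deliberately swaps in a simpler statistic, the absolute difference in group means, in place of the authors' divergence-based statistic, and it does not implement the Prentice criterion. The function name and defaults are hypothetical.

```python
import random


def permutation_p_value(x, y, n_permutations=2000, seed=0):
    """Two-sided permutation p-value for the absolute difference in means.
    Group labels are shuffled at random; no distributional form is assumed."""
    rng = random.Random(seed)
    observed = abs(sum(x) / len(x) - sum(y) / len(y))
    pooled = list(x) + list(y)
    hits = 0
    for _ in range(n_permutations):
        rng.shuffle(pooled)
        px, py = pooled[:len(x)], pooled[len(x):]
        if abs(sum(px) / len(px) - sum(py) / len(py)) >= observed:
            hits += 1
    # Add-one correction keeps the estimate strictly positive.
    return (hits + 1) / (n_permutations + 1)
```

    Because the reference distribution is built from the data themselves, the procedure needs no parametric model for the endpoints, which is the property the abstract emphasises.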

  13. Validation of the German version of the Ford Insomnia Response to Stress Test.

    PubMed

    Dieck, Arne; Helbig, Susanne; Drake, Christopher L; Backhaus, Jutta

    2018-06-01

    The purpose of this study was to assess the psychometric properties of a German version of the Ford Insomnia Response to Stress Test with groups with and without sleep problems. Three studies were analysed. Data set 1 was based on an initial screening for a sleep training program (n = 393), data set 2 was based on a study to test the test-retest reliability of the Ford Insomnia Response to Stress Test (n = 284) and data set 3 was based on a study to examine the influence of competitive sport on sleep (n = 37). Data sets 1 and 2 were used to test internal consistency, factor structure, convergent validity, discriminant validity and test-retest reliability of the Ford Insomnia Response to Stress Test. Content validity was tested using data set 3. Cronbach's alpha of the Ford Insomnia Response to Stress Test was good (α = 0.80) and test-retest reliability was satisfactory (r = 0.72). Overall, the one-factor model showed the best fit. Furthermore, significant positive correlations between the Ford Insomnia Response to Stress Test and impaired sleep quality, depression and stress reactivity were in line with the expectations regarding the convergent validity. Subjects with sleep problems had significantly higher scores in the Ford Insomnia Response to Stress Test than subjects without sleep problems (P < 0.01). Competitive athletes with higher scores in the Ford Insomnia Response to Stress Test had significantly lower sleep quality (P = 0.01), demonstrating that vulnerability for stress-induced sleep disturbances accompanies poorer sleep quality in stressful episodes. The findings show that the German version of the Ford Insomnia Response to Stress Test is a reliable and valid questionnaire to assess the vulnerability to stress-induced sleep disturbances. © 2017 European Sleep Research Society.
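
    For readers unfamiliar with the internal-consistency statistic reported above, Cronbach's alpha can be computed from a respondents-by-items score table as sketched below. This is the textbook formula, not the authors' analysis code; the function name and input layout are assumptions.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondents, each a list of item scores.
    alpha = k/(k-1) * (1 - sum of item variances / variance of total scores)."""
    k = len(scores[0])
    item_variances = [variance([row[i] for row in scores]) for i in range(k)]
    total_variance = variance([sum(row) for row in scores])
    return k / (k - 1) * (1.0 - sum(item_variances) / total_variance)
```

    Alpha approaches 1 when the items vary together across respondents and falls toward 0 when they vary independently.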

  14. The influence of validity criteria on Immediate Post-Concussion Assessment and Cognitive Testing (ImPACT) test-retest reliability among high school athletes.

    PubMed

    Brett, Benjamin L; Solomon, Gary S

    2017-04-01

    Research findings to date on the stability of Immediate Post-Concussion Assessment and Cognitive Testing (ImPACT) Composite scores have been inconsistent, requiring further investigation. The use of test validity criteria across these studies also has been inconsistent. Using multiple measures of stability, we examined test-retest reliability of repeated ImPACT baseline assessments in high school athletes across various validity criteria reported in previous studies. A total of 1146 high school athletes completed baseline cognitive testing using the online ImPACT test battery at two time periods of approximately two-year intervals. No participant sustained a concussion between assessments. Five forms of validity criteria used in previous test-retest studies were applied to the data, and differences in reliability were compared. Intraclass correlation coefficients (ICCs) ranged in composite scores from .47 (95% confidence interval, CI [.38, .54]) to .83 (95% CI [.81, .85]) and showed little change across a two-year interval for all five sets of validity criteria. Regression based methods (RBMs) examining the test-retest stability demonstrated a lack of significant change in composite scores across the two-year interval for all forms of validity criteria, with no cases falling outside the expected range of 90% confidence intervals. The application of more stringent validity criteria does not alter test-retest reliability, nor does it account for some of the variation observed across previously performed studies. As such, use of the ImPACT manual validity criteria should be utilized in the determination of test validity and in the individualized approach to concussion management. Potential future efforts to improve test-retest reliability are discussed.

  15. Validity of a novel computerized screening test system for mild cognitive impairment.

    PubMed

    Park, Jin-Hyuck; Jung, Minye; Kim, Jongbae; Park, Hae Yean; Kim, Jung-Ran; Park, Ji-Hyuk

    2018-06-20

    ABSTRACT Background: The mobile screening test system for screening mild cognitive impairment (mSTS-MCI) was developed for clinical use. However, the clinical usefulness of mSTS-MCI to detect elderly with MCI from those who are cognitively healthy has yet to be validated. Moreover, the comparability between this system and traditional screening tests for MCI has not been evaluated. The purpose of this study was to examine the validity and reliability of the mSTS-MCI and confirm the cut-off scores to detect MCI. The data were collected from 107 healthy elderly people and 74 elderly people with MCI. Concurrent validity was examined using the Korean version of Montreal Cognitive Assessment (MoCA-K) as a gold standard test, and test-retest reliability was investigated using 30 of the study participants at four-week intervals. The sensitivity, specificity, positive predictive value, and negative predictive value (NPV) were confirmed through Receiver Operating Characteristic (ROC) analysis, and the cut-off scores for elderly people with MCI were identified. Concurrent validity showed statistically significant correlations between the mSTS-MCI and MoCA-K and test-retest reliability indicated high correlation. As a result of screening predictability, the mSTS-MCI had a higher NPV than the MoCA-K. The mSTS-MCI was identified as a system with a high degree of validity and reliability. In addition, the mSTS-MCI showed high screening predictability, indicating it can be used in the clinical field as a screening test system for mild cognitive impairment.
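
    The screening measures named above follow directly from a 2x2 table of test result against reference diagnosis at a chosen cut-off. The sketch below shows the standard definitions; the function name is hypothetical, and any counts passed to it are illustrative, since the abstract reports no cell counts.

```python
def screening_metrics(tp, fp, tn, fn):
    """Sensitivity, specificity, PPV and NPV from confusion-matrix counts
    (tp/fn: impaired cases flagged/missed; tn/fp: healthy cases cleared/flagged)."""
    return {
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
        "ppv": tp / (tp + fp),
        "npv": tn / (tn + fn),
    }
```

    With made-up counts such as screening_metrics(tp=40, fp=5, tn=45, fn=10), sensitivity is 0.8 and specificity is 0.9. Unlike those two measures, PPV and NPV depend on the proportion of impaired cases in the sample.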

  16. Validity of Selected Lab and Field Tests of Physical Working Capacity.

    ERIC Educational Resources Information Center

    Burke, Edmund J.

    The validity of selected lab and field tests of physical working capacity was investigated. Forty-four male college students were administered a series of lab and field tests of physical working capacity. Lab tests include a test of maximum oxygen uptake, the PWC 170 test, the Harvard Step Test, the Progressive Pulse Ratio Test, Margaria Test of…

  17. Calibration and Validation Plan for the L2A Processor and Products of the SENTINEL-2 Mission

    NASA Astrophysics Data System (ADS)

    Main-Knorn, M.; Pflug, B.; Debaecker, V.; Louis, J.

    2015-04-01

    The Copernicus programme is a European initiative for the implementation of information services based on observation data received from Earth Observation (EO) satellites and ground based information. In the frame of this programme, ESA is developing the Sentinel-2 optical imaging mission that will deliver optical data products designed to feed downstream services mainly related to land monitoring, emergency management and security. To ensure the highest quality of service, ESA sets up the Sentinel-2 Mission Performance Centre (MPC) in charge of the overall performance monitoring of the Sentinel-2 mission. TPZ F and DLR have teamed up in order to provide the best added-value support to the MPC for calibration and validation of the Level-2A processor (Sen2Cor) and products. This paper gives an overview over the planned L2A calibration and validation activities. Level-2A processing is applied to Top-Of-Atmosphere (TOA) Level-1C ortho-image reflectance products. Level-2A main output is the Bottom-Of-Atmosphere (BOA) corrected reflectance product. Additional outputs are an Aerosol Optical Thickness (AOT) map, a Water Vapour (WV) map and a Scene Classification (SC) map with Quality Indicators for cloud and snow probabilities. Level-2A BOA, AOT and WV outputs are calibrated and validated using ground-based data of automatic operating stations and data of in-situ campaigns. Scene classification is validated by the visual inspection of test datasets and cross-sensor comparison, supplemented by meteorological data, if available. Contributions of external in-situ campaigns would enlarge the reference dataset and enable extended validation exercise. Therefore, we are highly interested in and welcome external contributors.

  18. FIELD IMPLEMENTATION PLAN FOR A WILLISTON BASIN BRINE EXTRACTION AND STORAGE TEST

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Hamling, John; Klapperich, Ryan; Stepan, Daniel

    2016-03-31

    The Energy & Environmental Research Center (EERC) successfully completed all technical work of Phase I, including development of a field implementation plan (FIP) for a brine extraction and storage test (BEST) in the North Dakota portion of the Williston Basin. This implementation plan was commissioned by the U.S. Department of Energy (DOE) National Energy Technology Laboratory (NETL) as a proxy for managing formation pressure plumes and measuring/monitoring the movement of differential pressure and CO2 plumes in the subsurface for future saline CO2 storage projects. BEST comprises the demonstration and validation of active reservoir management (ARM) strategies and extracted brine treatment technologies. Two prospective commercial brine injection sites were evaluated for BEST to satisfy DOE’s goals. Ultimately, an active saltwater disposal (SWD) site, Johnsons Corner, was selected because it possesses an ideal combination of key factors making it uniquely suited to host BEST. This site is located in western North Dakota and operated by Nuverra Environmental Solutions (Nuverra), a national leader in brine handling, treatment, and injection. An integrated management approach was used to incorporate local and regional geologic characterization activities with geologic and simulation models, inform a monitoring, verification, and accounting (MVA) plan, and to conduct a risk assessment. This approach was used to design a FIP for an ARM schema and an extracted brine treatment technology test bed facility. The FIP leverages an existing pressure plume generated by two commercial SWD wells. These wells, in conjunction with a new brine extraction well, will be used to conduct the ARM schema. Results of these tests will be quantified based on their impact on the performance of the existing SWD wells and the surrounding reservoir system. Extracted brine will be injected into an underlying deep saline formation through a new injection well. The locations of proposed

  19. Successful MPPF Pneumatics Verification and Validation Testing

    NASA Image and Video Library

    2017-03-28

    Engineers and technicians completed verification and validation testing of several pneumatic systems inside and outside the Multi-Payload Processing Facility (MPPF) at NASA's Kennedy Space Center in Florida. In view is the top level of the service platform for Orion spacecraft processing. The MPPF will be used for offline processing and fueling of the Orion spacecraft and service module stack before launch. Orion also will be de-serviced in the MPPF after a mission. The Ground Systems Development and Operations Program (GSDO) is overseeing upgrades to the facility. The Engineering Directorate led the recent pneumatic tests.

  20. Successful MPPF Pneumatics Verification and Validation Testing

    NASA Image and Video Library

    2017-03-28

    Engineers and technicians completed verification and validation testing of several pneumatic systems inside and outside the Multi-Payload Processing Facility (MPPF) at NASA's Kennedy Space Center in Florida. In view is the service platform for Orion spacecraft processing. To the left are several pneumatic panels. The MPPF will be used for offline processing and fueling of the Orion spacecraft and service module stack before launch. Orion also will be de-serviced in the MPPF after a mission. The Ground Systems Development and Operations Program (GSDO) is overseeing upgrades to the facility. The Engineering Directorate led the recent pneumatic tests.

  1. Validating an artificial intelligence human proximity operations system with test cases

    NASA Astrophysics Data System (ADS)

    Huber, Justin; Straub, Jeremy

    2013-05-01

    An artificial intelligence-controlled robot (AICR) operating in close proximity to humans poses a risk to these humans. Validating the performance of an AICR is an ill-posed problem, due to the complexity introduced by the erratic (noncomputer) actors. In order to prove the AICR's usefulness, test cases must be generated to simulate the actions of these actors. This paper discusses the AICR's performance validation in the context of a common human activity, moving through a crowded corridor, using test cases created by an AI use case producer. This test is a two-dimensional simplification relevant to autonomous UAV navigation in the national airspace.

  2. Test, Control and Monitor System maintenance plan

    NASA Technical Reports Server (NTRS)

    Buehler, David P.; Lougheed, M. J.

    1993-01-01

    The maintenance requirements for Test, Control, and Monitor System (TCMS) and the method for satisfying these requirements prior to First Need Date (FND) of the last TCMS set are described. The method for satisfying maintenance requirements following FND of the last TCMS set will be addressed by a revision to this plan. This maintenance plan serves as the basic planning document for maintenance of this equipment by the NASA Payloads Directorate (CM) and the Payload Ground Operations Contractor (PGOC) at KSC. The terms TCMS Operations and Maintenance (O&M), Payloads Logistics, TCMS Sustaining Engineering, Payload Communications, and Integrated Network Services refer to the appropriate NASA and PGOC organization. For the duration of their contract, the Core Electronic Contractor (CEC) will provide a Set Support Team (SST). One of the primary purposes of this team is to help NASA and PGOC operate and maintain TCMS. It is assumed that SST is an integral part of TCMS O&M. The purpose of this plan is to describe the maintenance concept for TCMS hardware and system software in order to facilitate activation, transition planning, and continuing operation. When software maintenance is mentioned in this plan, it refers to maintenance of TCMS system software.

  3. A Perspective on Development Flight Instrumentation and Flight Test Analysis Plans for Ares I-X

    NASA Technical Reports Server (NTRS)

    Huebner, Lawrence D.; Richards, James S.; Brunty, Joseph A.; Smith, R. Marshall; Trombetta, Dominic R.

    2009-01-01

    NASA's Constellation Program will take a significant step toward completion of the Ares I crew launch vehicle with the flight test of Ares I-X and completion of the Ares I-X post-flight evaluation. The Ares I-X flight test vehicle is an ascent development flight test that will acquire flight data early enough to impact the design and development of the Ares I. As the primary customer for flight data from the Ares I-X mission, Ares I has been the major driver in the definition of the Development Flight Instrumentation (DFI). This paper focuses on the DFI development process and the plans for post-flight evaluation of the resulting data to impact the Ares I design. Efforts for determining the DFI for Ares I-X began in the fall of 2005, and significant effort to refine and implement the Ares I-X DFI has been expended since that time. This paper will present a perspective in the development and implementation of the DFI. Emphasis will be placed on the process by which the list was established and changes were made to that list due to imposed constraints. The paper will also discuss the plans for the analysis of the DFI data following the flight and a summary of flight evaluation tasks to be performed in support of tools and models validation for design and development.

  4. Validation of the Seating and Mobility Script Concordance Test

    ERIC Educational Resources Information Center

    Cohen, Laura J.; Fitzgerald, Shirley G.; Lane, Suzanne; Boninger, Michael L.; Minkel, Jean; McCue, Michael

    2009-01-01

    The purpose of this study was to develop the scoring system for the Seating and Mobility Script Concordance Test (SMSCT), obtain and appraise internal and external structure evidence, and assess the validity of the SMSCT. The SMSCT purpose is to provide a method for testing knowledge of seating and mobility prescription. A sample of 106 therapists…

  5. Validation in Support of Internationally Harmonised OECD Test Guidelines for Assessing the Safety of Chemicals.

    PubMed

    Gourmelon, Anne; Delrue, Nathalie

    Ten years have elapsed since the OECD published the Guidance document on the validation and international regulatory acceptance of test methods for hazard assessment. Much experience has been gained since then in validation centres, in countries and at the OECD on a variety of test methods that were subjected to validation studies. This chapter reviews validation principles and highlights common features that appear to be important for further regulatory acceptance across studies. Existing OECD-agreed validation principles will most likely generally remain relevant and applicable to address challenges associated with the validation of future test methods. Some adaptations may be needed to take into account the level of technique introduced in test systems, but demonstration of relevance and reliability will continue to play a central role as a pre-requisite for regulatory acceptance. Demonstration of relevance will become more challenging for test methods that form part of a set of predictive tools and methods, and that do not stand alone. OECD is keen on ensuring that while these concepts evolve, countries can continue to rely on valid methods and harmonised approaches for efficient testing and assessment of chemicals.

  6. Validity and test-retest reliability of the six-spot step test in persons after stroke.

    PubMed

    Arvidsson Lindvall, Mialinn; Anderzén-Carlsson, Agneta; Appelros, Peter; Forsberg, Anette

    2018-06-06

    After stroke, asymmetric weight distribution is common, with decreased balance control in standing and walking. The six-spot step test (SSST) includes a 5-m walk during which one leg shoves wooden blocks out of circles marked on the floor, thus assessing the ability to take load on each leg. The aim of the present study was to investigate the convergent and discriminant validity and test-retest reliability of the SSST in persons with stroke. Eighty-one participants were included. A cross-sectional study was performed, in which the SSST was conducted twice, 3-7 days apart. Validity was investigated using measures of dynamic balance and walking. Reliability was assessed using the intraclass correlation coefficient, standard error of the measurement (SEM), and smallest real difference (SRD). The convergent validity was strong to moderate, and the test-retest reliability was good. The SEM% was 14.7%, and the SRD% was 40.8%, based on the mean of four walks, shoving twice with the paretic and twice with the non-paretic leg. Values of random measurement error were high, which limits the use of the SSST for follow-up evaluations, but the SSST can be a complementary measure of gait and balance.
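
    The relation between the two error statistics reported above can be illustrated with a short sketch. The abstract does not state which formulas the authors used; the sketch assumes the commonly used definitions SEM = SD * sqrt(1 - ICC) and SRD = 1.96 * sqrt(2) * SEM, and the function names are hypothetical.

```python
import math


def sem_from_icc(sd: float, icc: float) -> float:
    """Standard error of measurement from the between-subject SD and the ICC.

    Assumed definition (not stated in the abstract): SEM = SD * sqrt(1 - ICC).
    """
    return sd * math.sqrt(1.0 - icc)


def srd_from_sem(sem: float, z: float = 1.96) -> float:
    """Smallest real difference for two measurements at 95% confidence.

    Assumed definition (not stated in the abstract): SRD = z * sqrt(2) * SEM.
    """
    return z * math.sqrt(2.0) * sem


# SEM% as reported in the abstract; the same relation applies to percentages.
sem_percent = 14.7
srd_percent = srd_from_sem(sem_percent)
print(f"SRD% under the assumed formula: {srd_percent:.1f}")
```

    Under these assumptions the computed SRD% falls within rounding distance of the reported 40.8%, which suggests, but does not confirm, that the reported values follow this convention.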

  7. [Comparison of the Wechsler Memory Scale-III and the Spain-Complutense Verbal Learning Test in acquired brain injury: construct validity and ecological validity].

    PubMed

    Luna-Lario, P; Pena, J; Ojeda, N

    2017-04-16

    To perform an in-depth examination of the construct validity and the ecological validity of the Wechsler Memory Scale-III (WMS-III) and the Spain-Complutense Verbal Learning Test (TAVEC). The sample consists of 106 adults with acquired brain injury who were treated in the Area of Neuropsychology and Neuropsychiatry of the Complejo Hospitalario de Navarra and displayed memory deficit as the main sequela, measured by means of specific memory tests. The construct validity is determined by examining the tasks required in each test over the basic theoretical models, comparing the performance according to the parameters offered by the tests, contrasting the severity indices of each test and analysing their convergence. The external validity is explored through the correlation between the tests and by using regression models. According to the results obtained, both the WMS-III and the TAVEC have construct validity. The TAVEC is more sensitive and captures not only the deficits in mnemonic consolidation, but also in the executive functions involved in memory. The working memory index of the WMS-III is useful for predicting the return to work at two years after the acquired brain injury, but none of the instruments anticipates the disability and dependence at least six months after the injury. We reflect upon the construct validity of the tests and their insufficient capacity to predict functionality when the sequelae become chronic.

  8. Reliability and factorial validity of flexibility tests for team sports.

    PubMed

    Sporis, Goran; Vucetic, Vlatko; Jovanovic, Mario; Jukic, Igor; Omrcen, Darija

    2011-04-01

    The main goal of this method paper was to evaluate the reliability and factorial validity of flexibility tests used in soccer, and to conduct a cross-validation study on 2 other team sports using handball and basketball players. The second aim was to compare the validity of the different tests and evaluate the flexibility of soccer players; the third was to determine the positional differences between attackers, defenders, and midfielders in all flexibility tests. One hundred and fifty (n = 150) elite male junior soccer players, members of the First Croatian Junior League Teams, volunteered to participate in the study; 60 (n = 60) handball and 60 (n = 60) basketball players, also members of the First Croatian Junior League Teams, were tested for the purpose of cross-validation. The SAR and V-SAR had the greatest AVR and ICC. The within-subjects variation ranged between 0.3 and 3.8%. The lowest value of CV was found between the LSPL and LSPR. Low to moderate statistically significant correlation coefficients were found among all the measured flexibility tests. It was observed that the greatest correlations existed between the SAR and V-SAR (r = 0.65) and between the LLSR and LLSL (r = 0.56). Statistically significant correlations were also observed between the BLPL and BLPR (r = 0.62). The principal components factor analysis of 9 flexibility tests resulted in the extraction of 3 significant components. The results of this study have the following implications for the assessment of flexibility in soccer: (a) all flexibility tests used in this study have acceptable between- and within-subjects reliability and can be used to estimate the flexibility of soccer players; (b) the LSPL and LSPR tests are the most reliable and valid flexibility tests for the estimation of flexibility of professional soccer players.

  9. Development and Validation of a Food-Associated Olfactory Test (FAOT).

    PubMed

    Denzer-Lippmann, Melanie Yvonne; Beauchamp, Jonathan; Freiherr, Jessica; Thuerauf, Norbert; Kornhuber, Johannes; Buettner, Andrea

    2017-01-01

    Olfactory tests are an important tool in human nutritional research for studying food preferences, yet comprehensive tests dedicated solely to food odors are currently lacking. Therefore, within this study, an innovative food-associated olfactory test (FAOT) system was developed. The FAOT comprises 16 odorant pens that contain representative food odors relating to different macronutrient classes. The test underwent a sensory validation based on identification rate, intensity, hedonic value, and food association scores. The accuracy of the test was further compared to the accuracy of the established Sniffin' Sticks identification test. The identification rates and intensities of this new FAOT were found to be comparable to the Sniffin' Sticks olfactory identification test. The odorant pens were also assessed chemo-analytically and were found to be chemically stable for at least 24 weeks. Overall, this new identification test for use in assessing olfaction in a food-associated context is valid both in terms of its use in sensory perception studies and its chemical stability. The FAOT is particularly suited to examinations of the sense of smell regarding food odors. © The Author 2016. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  10. Launch vehicle test and checkout plan. - Volume 2: Saturn 1B launch vehicle Skylab R (rescue) and AS-208 flow plan and listings

    NASA Technical Reports Server (NTRS)

    1973-01-01

    The launch operations test and checkout plan is a planning document that establishes all launch site checkout activity, including the individual tests and sequence of testing required to fulfill the development center and KSC test and checkout requirements. This volume contains the launch vehicle test and checkout plan encompassing S-1B, S-4B, IU stage, and ground support equipment tests. The plan is based upon AS-208 flow utilizing a manned spacecraft, LUT 1, and launch pad 39B facilities.

  11. Validation for the Tropical Rainfall Measuring Mission: Lessons Learned and Future Plans

    NASA Technical Reports Server (NTRS)

    Wolff, David B.; Amitai, E.; Marks, D. A.; Silberstein, D.; Lawrence, R. J.

    2005-01-01

    The Tropical Rainfall Measuring Mission (TRMM) was launched in November 1997 and is a highly regarded and successful mission. A major component of the TRMM program was its Ground Validation (GV) program. Through dedicated research and hard work by many groups, both the GV and satellite-retrieved rain estimates have shown a convergence at key GV sites, lending credibility to the global TRMM estimates. To be sure, there are some regional differences between the various satellite estimates themselves, which still need to be addressed; however, it can be said with some certainty that TRMM has provided a high-quality, long-term climatological data set for researchers that provides errors on the order of 10-20%, rather than pre-TRMM era error estimates on the order of 50-100%. The TRMM GV program's main operational task is to provide rainfall products for four sites: Darwin, Australia (DARW); Houston, Texas (HSTN); Kwajalein, Republic of the Marshall Islands (KWAJ); and Melbourne, Florida (MELB). A comparison between TRMM Ground Validation (Version 5) and Satellite (Version 6) rain intensity estimates is presented. The gridded satellite product (3G68) will be compared to GV Level II rain-intensity and -type maps (2A53 and 2A54, respectively). The 3G68 product represents a 0.5 deg x 0.5 deg data grid providing estimates of rain intensities from the TRMM Precipitation Radar (PR), Microwave Imager (TMI) and Combined (COM) algorithms. The comparisons will be sub-setted according to geographical type (land, coast and ocean). The convergence of the GV and satellite estimates bodes well for expectations for the proposed Global Precipitation Measurement (GPM) program and this study and others are being leveraged towards planning GV goals for GPM. A discussion of lessons learned and future plans for TRMM GV in planning for GPM will also be provided.

  12. Contemporary Test Validity in Theory and Practice: A Primer for Discipline-Based Education Researchers

    PubMed Central

    Reeves, Todd D.; Marbach-Ad, Gili

    2016-01-01

    Most discipline-based education researchers (DBERs) were formally trained in the methods of scientific disciplines such as biology, chemistry, and physics, rather than social science disciplines such as psychology and education. As a result, DBERs may have never taken specific courses in the social science research methodology—either quantitative or qualitative—on which their scholarship often relies so heavily. One particular aspect of (quantitative) social science research that differs markedly from disciplines such as biology and chemistry is the instrumentation used to quantify phenomena. In response, this Research Methods essay offers a contemporary social science perspective on test validity and the validation process. The instructional piece explores the concepts of test validity, the validation process, validity evidence, and key threats to validity. The essay also includes an in-depth example of a validity argument and validation approach for a test of student argument analysis. In addition to DBERs, this essay should benefit practitioners (e.g., lab directors, faculty members) in the development, evaluation, and/or selection of instruments for their work assessing students or evaluating pedagogical innovations. PMID:26903498

  13. Meeting report: Validation of toxicogenomics-based test systems: ECVAM-ICCVAM/NICEATM considerations for regulatory use.

    PubMed

    Corvi, Raffaella; Ahr, Hans-Jürgen; Albertini, Silvio; Blakey, David H; Clerici, Libero; Coecke, Sandra; Douglas, George R; Gribaldo, Laura; Groten, John P; Haase, Bernd; Hamernik, Karen; Hartung, Thomas; Inoue, Tohru; Indans, Ian; Maurici, Daniela; Orphanides, George; Rembges, Diana; Sansone, Susanna-Assunta; Snape, Jason R; Toda, Eisaku; Tong, Weida; van Delft, Joost H; Weis, Brenda; Schechtman, Leonard M

    2006-03-01

    This is the report of the first workshop "Validation of Toxicogenomics-Based Test Systems" held 11-12 December 2003 in Ispra, Italy. The workshop was hosted by the European Centre for the Validation of Alternative Methods (ECVAM) and organized jointly by ECVAM, the U.S. Interagency Coordinating Committee on the Validation of Alternative Methods (ICCVAM), and the National Toxicology Program (NTP) Interagency Center for the Evaluation of Alternative Toxicological Methods (NICEATM). The primary aim of the workshop was for participants to discuss and define principles applicable to the validation of toxicogenomics platforms as well as validation of specific toxicologic test methods that incorporate toxicogenomics technologies. The workshop was viewed as an opportunity for initiating a dialogue between technologic experts, regulators, and the principal validation bodies and for identifying those factors to which the validation process would be applicable. It was felt that to do so now, as the technology is evolving and associated challenges are identified, would be a basis for the future validation of the technology when it reaches the appropriate stage. Because of the complexity of the issue, different aspects of the validation of toxicogenomics-based test methods were covered. The three focus areas include a) biologic validation of toxicogenomics-based test methods for regulatory decision making, b) technical and bioinformatics aspects related to validation, and c) validation issues as they relate to regulatory acceptance and use of toxicogenomics-based test methods. In this report we summarize the discussions and describe in detail the recommendations for future direction and priorities.

  14. Meeting Report: Validation of Toxicogenomics-Based Test Systems: ECVAM–ICCVAM/NICEATM Considerations for Regulatory Use

    PubMed Central

    Corvi, Raffaella; Ahr, Hans-Jürgen; Albertini, Silvio; Blakey, David H.; Clerici, Libero; Coecke, Sandra; Douglas, George R.; Gribaldo, Laura; Groten, John P.; Haase, Bernd; Hamernik, Karen; Hartung, Thomas; Inoue, Tohru; Indans, Ian; Maurici, Daniela; Orphanides, George; Rembges, Diana; Sansone, Susanna-Assunta; Snape, Jason R.; Toda, Eisaku; Tong, Weida; van Delft, Joost H.; Weis, Brenda; Schechtman, Leonard M.

    2006-01-01

    This is the report of the first workshop “Validation of Toxicogenomics-Based Test Systems” held 11–12 December 2003 in Ispra, Italy. The workshop was hosted by the European Centre for the Validation of Alternative Methods (ECVAM) and organized jointly by ECVAM, the U.S. Interagency Coordinating Committee on the Validation of Alternative Methods (ICCVAM), and the National Toxicology Program (NTP) Interagency Center for the Evaluation of Alternative Toxicological Methods (NICEATM). The primary aim of the workshop was for participants to discuss and define principles applicable to the validation of toxicogenomics platforms as well as validation of specific toxicologic test methods that incorporate toxicogenomics technologies. The workshop was viewed as an opportunity for initiating a dialogue between technologic experts, regulators, and the principal validation bodies and for identifying those factors to which the validation process would be applicable. It was felt that to do so now, as the technology is evolving and associated challenges are identified, would be a basis for the future validation of the technology when it reaches the appropriate stage. Because of the complexity of the issue, different aspects of the validation of toxicogenomics-based test methods were covered. The three focus areas include a) biologic validation of toxicogenomics-based test methods for regulatory decision making, b) technical and bioinformatics aspects related to validation, and c) validation issues as they relate to regulatory acceptance and use of toxicogenomics-based test methods. In this report we summarize the discussions and describe in detail the recommendations for future direction and priorities. PMID:16507466

  15. General Vehicle Test Plan (GVTP) for Urban Rail Transit Cars

    DOT National Transportation Integrated Search

    1977-09-01

    The General Vehicle Test Plan provides a system for general vehicle testing and for documenting and utilizing data and information in the testing of urban rail transit cars. Test procedures are defined for nine categories: (1) Performance; (2) Power ...

  16. NASA Countermeasures Evaluation and Validation Project

    NASA Technical Reports Server (NTRS)

    Lundquist, Charlie M.; Paloski, William H. (Technical Monitor)

    2000-01-01

    To support its ISS and exploration class mission objectives, NASA has developed a Countermeasure Evaluation and Validation Project (CEVP). The goal of this project is to evaluate and validate the optimal complement of countermeasures required to maintain astronaut health, safety, and functional ability during and after short- and long-duration space flight missions. The CEVP is the final element of the process in which ideas and concepts emerging from basic research evolve into operational countermeasures. The CEVP is accomplishing these objectives by conducting operational/clinical research to evaluate and validate countermeasures to mitigate these maladaptive responses. Evaluation is accomplished by testing in space flight analog facilities, and validation is accomplished by space flight testing. Both will utilize a standardized complement of integrated physiological and psychological tests, termed the Integrated Testing Regimen (ITR) to examine candidate countermeasure efficacy and intersystem effects. The CEVP emphasis is currently placed on validating the initial complement of ISS countermeasures targeting bone, muscle, and aerobic fitness; followed by countermeasures for neurological, psychological, immunological, nutrition and metabolism, and radiation risks associated with space flight. This presentation will review the processes, plans, and procedures that will enable CEVP to play a vital role in transitioning promising research results into operational countermeasures necessary to maintain crew health and performance during long duration space flight.

  17. Validation and Verification (V and V) Testing on Midscale Flame Resistant (FR) Test Method

    DTIC Science & Technology

    2016-12-16

    Method for Evaluation of Flame Resistant Clothing for Protection against Fire Simulations Using an Instrumented Manikin. Validation and...complement (not replace) the capabilities of the ASTM F1930 Standard Test Method for Evaluation of Flame Resistant Clothing for Protection against Fire ...Engineering Center (NSRDEC) to complement the ASTM F1930 Standard Test Method for Evaluation of Flame Resistant Clothing for Protection against Fire

  18. Validity of the American Sign Language Discrimination Test

    ERIC Educational Resources Information Center

    Bochner, Joseph H.; Samar, Vincent J.; Hauser, Peter C.; Garrison, Wayne M.; Searls, J. Matt; Sanders, Cynthia A.

    2016-01-01

    American Sign Language (ASL) is one of the most commonly taught languages in North America. Yet, few assessment instruments for ASL proficiency have been developed, none of which have adequately demonstrated validity. We propose that the American Sign Language Discrimination Test (ASL-DT), a recently developed measure of learners' ability to…

  19. Functional performance testing of the hip in athletes: a systematic review for reliability and validity.

    PubMed

    Kivlan, Benjamin R; Martin, Robroy L

    2012-08-01

    The purpose of this study was to systematically review the literature for functional performance tests with evidence of reliability and validity that could be used for a young, athletic population with hip dysfunction. A search of the PubMed and SPORTDiscus databases was performed to identify movement, balance, hop/jump, or agility functional performance tests from the current peer-reviewed literature used to assess function of the hip in young, athletic subjects. The single-leg stance, deep squat, single-leg squat, and star excursion balance tests (SEBT) demonstrated evidence of validity and normative data for score interpretation. The single-leg stance test and SEBT have evidence of validity with association to hip abductor function. The deep squat test demonstrated evidence as a functional performance test for evaluating femoroacetabular impingement. Hop/jump tests and agility tests have no reported evidence of reliability or validity in a population of subjects with hip pathology. Use of functional performance tests in the assessment of hip dysfunction has not been well established in the current literature. Diminished squat depth and provocation of pain during the single-leg balance test have been associated with patients diagnosed with FAI and gluteal tendinopathy, respectively. The SEBT and single-leg squat tests provided evidence of convergent validity through an analysis of kinematics and muscle function in normal subjects. Reliability of functional performance tests has not been established in patients with hip dysfunction. Further study is needed to establish reliability and validity of functional performance tests that can be used in a young, athletic population with hip dysfunction. 2b (Systematic Review of Literature).

  20. Testing for Factorial Invariance in the Context of Construct Validation

    ERIC Educational Resources Information Center

    Dimitrov, Dimiter M.

    2010-01-01

    This article describes the logic and procedures behind testing for factorial invariance across groups in the context of construct validation. The procedures include testing for configural, measurement, and structural invariance in the framework of multiple-group confirmatory factor analysis (CFA). The "forward" (sequential constraint imposition)…

  1. The Water Framework Directive: The Challenges of Testing and Validation of Guidance Documents

    NASA Astrophysics Data System (ADS)

    Barth, F.; Bidoglio, G.; Murray, C. N.; Zaldivar, J.; Bouraoui, F.

    process. · Activity 1: Information sharing · Activity 2: Develop guidance on technical issues · Activity 3: Information and data management · Activity 4: Application, testing and validation. The first three priorities have a more horizontal character. They are the key activities for developing a common understanding of the implementation of the Water Framework Directive. All these horizontal activities need to be integrated and made operational in the River Basin Management Plans. Activity 4 (Application, Testing and Validation) significantly contributes to this integration role by making these activities operational in the River Basin Management Plans. The integration step is crucial for the effective implementation of the WFD. The objective of Activity 4 is to ensure coherence amongst the different guidance documents and their cross applicability by testing the guidance documents in selected pilot river basins. To achieve these objectives, a Network of pilot river basins and associated coastal zones (where applicable) will be identified, in close co-operation with WGs in Key Action 2, that are considered to represent a range of problems and conditions characteristic of those to be found in the application of the different guidelines. The Network of identified sites will be used for testing and cross-validation of proposed WG guidelines. The Joint Research Centre is acting as the technical secretariat for the Scientific Coordination Committee, which is responsible for Activity 4. The purpose of the present paper is to describe the approach, methodology and timetable for integrated testing of guidance documents.

  2. Testing of the Trim Tab Parametric Model in NASA Langley's Unitary Plan Wind Tunnel

    NASA Technical Reports Server (NTRS)

    Murphy, Kelly J.; Watkins, Anthony N.; Korzun, Ashley M.; Edquist, Karl T.

    2013-01-01

    In support of NASA's Entry, Descent, and Landing technology development efforts, testing of Langley's Trim Tab Parametric Models was conducted in Test Section 2 of NASA Langley's Unitary Plan Wind Tunnel. The objectives of these tests were to generate quantitative aerodynamic data and qualitative surface pressure data for experimental and computational validation and aerodynamic database development. Six component force-and-moment data were measured on 38 unique, blunt body trim tab configurations at Mach numbers of 2.5, 3.5, and 4.5, angles of attack from -4deg to +20deg, and angles of sideslip from 0deg to +8deg. Configuration parameters investigated in this study were forebody shape, tab area, tab cant angle, and tab aspect ratio. Pressure Sensitive Paint was used to provide qualitative surface pressure mapping for a subset of these flow and configuration variables. Over the range of parameters tested, the effects of varying tab area and tab cant angle were found to be much more significant than varying tab aspect ratio relative to key aerodynamic performance requirements. Qualitative surface pressure data supported the integrated aerodynamic data and provided information to aid in future analyses of localized phenomena for trim tab configurations.

  3. Pilot-scale treatability test plan for the 200-BP-5 operable unit

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Not Available

    This document presents the treatability test plan for pilot-scale pump and treat testing at the 200-BP-5 Operable Unit. This treatability test plan has been prepared in response to an agreement between the U.S. Department of Energy (DOE), the U.S. Environmental Protection Agency (EPA), and the State of Washington Department of Ecology (Ecology), as documented in Hanford Federal Facility Agreement and Consent Order (Tri-Party Agreement, Ecology et al. 1989a) Change Control Form M-13-93-03 (Ecology et al. 1994) and a recent 200 NPL Agreement Change Control Form (Appendix A). The agreement also requires that, following completion of the activities described in this test plan, a 200-BP-5 Operable Unit Interim Remedial Measure (IRM) Proposed Plan be developed for use in preparing an Interim Action Record of Decision (ROD). The IRM Proposed Plan will be supported by the results of this treatability test plan, as well as by other 200-BP-5 Operable Unit activities (e.g., development of a qualitative risk assessment). Once issued, the Interim Action ROD will specify the interim action(s) for groundwater contamination at the 200-BP-5 Operable Unit. The treatability test approach is to conduct a pilot-scale pump and treat test for each of the two contaminant plumes associated with the 200-BP-5 Operable Unit. Primary contaminants of concern are ⁹⁹Tc and ⁶⁰Co for groundwater affected by past discharges to the 216-BY Cribs, and ⁹⁰Sr, ²³⁹/²⁴⁰Pu, and Cs for groundwater affected by past discharges to the 216-B-5 Reverse Well. The purpose of the pilot-scale treatability testing presented in this test plan is to provide the data basis for preparing an IRM Proposed Plan. To achieve this objective, treatability testing must: Assess the performance of groundwater pumping with respect to the ability to extract a significant amount of the primary contaminant mass present in the two contaminant plumes.

  4. Vadose zone transport field study: Detailed test plan for simulated leak tests

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    AL Ward; GW Gee

    2000-06-23

    to: identify mechanisms controlling transport processes in soils typical of the hydrogeologic conditions of Hanford's waste disposal sites; reduce uncertainty in conceptual models; develop a detailed and accurate database of hydraulic and transport parameters for validation of three-dimensional numerical models; identify and evaluate advanced, cost-effective characterization methods with the potential to assess changing conditions in the vadose zone, particularly as surrogates of currently undetectable high-risk contaminants. This plan provides details for conducting field tests during FY 2000 to accomplish these objectives. Details of additional testing during FY 2001 and FY 2002 will be developed as part of the work planning process implemented by the Integration Project.

  5. The Construct Validation of Tests of Communicative Competence.

    ERIC Educational Resources Information Center

    Palmer, Adrian S., Ed.; And Others

    This collection, including the proceedings of a colloquium at TESOL 1979, includes the following papers: (1) "Classification of Oral Proficiency Tests," by H. Madsen and R. Jones; (2) "A Theoretical Framework for Communicative Competence," by M. Canale and M. Swain; (3) "Beyond Faith and Face Validity: The Multitrait-Multimethod Matrix and the…

  6. The validity and reliability of a dynamic neuromuscular stabilization-heel sliding test for core stability.

    PubMed

    Cha, Young Joo; Lee, Jae Jin; Kim, Do Hyun; You, Joshua Sung H

    2017-10-23

    Core stabilization plays an important role in the regulation of postural stability. To overcome shortcomings associated with pain and severe core instability during conventional core stabilization tests, we recently developed the dynamic neuromuscular stabilization-based heel sliding (DNS-HS) test. The purpose of this study was to establish the criterion validity and test-retest reliability of the novel DNS-HS test. Twenty young adults with core instability completed both the bilateral straight leg lowering test (BSLLT) and DNS-HS test for the criterion validity study and repeated the DNS-HS test for the test-retest reliability study. Criterion validity was determined by comparing hip joint angle data that were obtained from BSLLT and DNS-HS measures. The test-retest reliability was determined by comparing hip joint angle data. Criterion validity was (ICC2,3) = 0.700 (p< 0.05), suggesting a good relationship between the two core stability measures. Test-retest reliability was (ICC3,3) = 0.953 (p< 0.05), indicating excellent consistency between the repeated DNS-HS measurements. Criterion validity data demonstrated a good relationship between the gold standard BSLLT and DNS-HS core stability measures. Test-retest reliability data suggests that DNS-HS core stability was a reliable test for core stability. Clinically, the DNS-HS test is useful to objectively quantify core instability and allow early detection and evaluation.

  7. Validation of a hydride generation atomic absorption spectrometry methodology for determination of mercury in fish designed for application in the Brazilian national residue control plan.

    PubMed

    Damin, Isabel C F; Santo, Maria A E; Hennigen, Rosmari; Vargas, Denise M

    2013-01-01

    In the present study, a method for the determination of mercury (Hg) in fish was validated according to ISO/IEC 17025, INMETRO (Brazil), and more recent European recommendations (Commission Decision 2007/333/EC and 2002/657/EC) for implementation in the Brazilian National Residue Control Plan (NRCP) in routine applications. The parameters evaluated in the validation were investigated in detail. The limits of detection and quantification were 2.36 and 7.88 μg kg⁻¹ of Hg, respectively, while recovery ranged from 90% to 96%. The coefficient of variation for repeatability was 4.06-8.94%. Furthermore, a comparison using an external proficiency testing scheme was carried out. The results of the validated method for the determination of mercury in fish by hydride generation atomic absorption spectrometry were considered suitable for implementation in routine analysis.

  8. 78 FR 20695 - Walk-Through Metal Detectors and Hand-Held Metal Detectors Test Method Validation

    Federal Register 2010, 2011, 2012, 2013, 2014

    2013-04-05

    ... Detectors and Hand-Held Metal Detectors Test Method Validation AGENCY: National Institute of Justice, DOJ... ensure that the test methods in the standards are properly documented, NIJ is requesting proposals (including price quotes) for test method validation efforts from testing laboratories. NIJ is also seeking...

  9. Using the Integrated Vehicle Health Management Research Test and Integration Plan Wiki to Identify Synergistic Test Opportunities

    NASA Technical Reports Server (NTRS)

    Koelfgen, Syri J.; Faber, James J.

    2010-01-01

    The National Aeronautics and Space Administration (NASA) and the aviation industry have recognized a need for developing a method to identify and combine resources to carry out research and testing more efficiently. The Integrated Vehicle Health Management (IVHM) Research Test and Integration Plan (RTIP) Wiki is a tool that is used to visualize, plan, and accomplish collaborative research and testing. Synergistic test opportunities are developed using the RTIP Wiki, and include potential common resource testing that combines assets and personnel from NASA, industry, academia, and other government agencies. A research scenario is linked to the appropriate IVHM milestones and resources detailed in the wiki, reviewed by the research team members, and integrated into a collaborative test strategy. The scenario is then implemented by creating a test plan when appropriate and the research is performed. The benefits of performing collaborative research and testing are achieving higher Technology Readiness Level (TRL) test opportunities with little or no additional cost, improved quality of research, and increased communication among researchers. In addition to a description of the method of creating these joint research scenarios, examples of the successful development and implementation of cooperative research using the IVHM RTIP Wiki are given.

  10. Front-end Electronics for Unattended Measurement (FEUM). Prototype Test Plan

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Conrad, Ryan C.; Morris, Scott J.; Smith, Leon E.

    2015-09-16

    The IAEA has requested that PNNL perform an initial set of tests on front-end electronics for unattended measurement (FEUM) prototypes. The FEUM prototype test plan details the tests to be performed, the criteria for evaluation, and the procedures used to execute the tests.

  11. Validity of the modified back-saver sit-and-reach test: a comparison with other protocols.

    PubMed

    Hui, S S; Yuen, P Y

    2000-09-01

    Studies have shown that the classical sit-and-reach (CSR) test, the modified sit-and-reach (MSR), and the newly developed back-saver sit-and-reach (BS) test have poor criterion-related validity in estimating low-back flexibility but yielded moderate criterion-related validity in hamstring flexibility. The V sit-and-reach (VSR) test was found to be practical but the validity has not been established. The purpose of this study was to propose a modified back-saver sit-and-reach (MBS) test, which incorporated all advantages of the various protocols, and to compare the criterion-related validity and reliability of all these tests. 158 college students (F = 96, and M = 62; age = 20.77 +/- 2.51) performed CSR, VSR, BS (left and right leg), and MBS (left and right leg) tests in a randomized order. Scores from each test were then correlated with the criterion measures. For all sit-and-reach tests, intraclass reliability (single trial) was very high (r = 0.89-0.98). MBS yielded significant and highest r with low-back and hamstring criterion for men (r = 0.47-0.67) and women (r = 0.23-0.54). The low-back and right hamstring validity of MBS for men were significantly (P < 0.01) higher than those from BS and CSR, whereas no differences in criterion-related validity were found between the MBS and other protocols in women. The ratings of perceived comfort among the sit-and-reach protocols were significantly different (P < 0.001) from each other. The MBS was rated the most comfortable test compared with the other protocols. The MBS test is not only a reliable test for hamstring and low-back flexibility, it is also more practical, with improved validity for hamstring and low-back flexibility in men compared with previous protocols.

  12. Development, construct validity and test-retest reliability of a field-based wheelchair mobility performance test for wheelchair basketball.

    PubMed

    de Witte, Annemarie M H; Hoozemans, Marco J M; Berger, Monique A M; van der Slikke, Rienk M A; van der Woude, Lucas H V; Veeger, Dirkjan H E J

    2018-01-01

    The aim of this study was to develop and describe a wheelchair mobility performance test in wheelchair basketball and to assess its construct validity and reliability. To mimic mobility performance of wheelchair basketball matches in a standardised manner, a test was designed based on observation of wheelchair basketball matches and expert judgement. Forty-six players performed the test to determine its validity and 23 players performed the test twice for reliability. Independent-samples t-tests were used to assess whether the times needed to complete the test were different for classifications, playing standards and sex. Intraclass correlation coefficients (ICC) were calculated to quantify reliability of performance times. Males performed better than females (P < 0.001, effect size [ES] = -1.26) and international men performed better than national men (P < 0.001, ES = -1.62). Performance time of low (≤2.5) and high (≥3.0) classification players was borderline not significant with a moderate ES (P = 0.06, ES = 0.58). The reliability was excellent for overall performance time (ICC = 0.95). These results show that the test can be used as a standardised mobility performance test to validly and reliably assess the capacity in mobility performance of elite wheelchair basketball athletes. Furthermore, the described methodology of development is recommended for use in other sports to develop sport-specific tests.

  13. Screening for cognitive impairment in older individuals. Validation study of a computer-based test.

    PubMed

    Green, R C; Green, J; Harrison, J M; Kutner, M H

    1994-08-01

    This study examined the validity of a computer-based cognitive test that was recently designed to screen the elderly for cognitive impairment. Criterion-related validity was examined by comparing test scores of impaired patients and normal control subjects. Construct-related validity was computed through correlations between computer-based subtests and related conventional neuropsychological subtests. University center for memory disorders. Fifty-two patients with mild cognitive impairment by strict clinical criteria and 50 unimpaired, age- and education-matched control subjects. Control subjects were rigorously screened by neurological, neuropsychological, imaging, and electrophysiological criteria to identify and exclude individuals with occult abnormalities. Using a cut-off total score of 126, this computer-based instrument had a sensitivity of 0.83 and a specificity of 0.96. Using a prevalence estimate of 10%, predictive values, positive and negative, were 0.70 and 0.96, respectively. Computer-based subtests correlated significantly with conventional neuropsychological tests measuring similar cognitive domains. Thirteen (17.8%) of 73 volunteers with normal medical histories were excluded from the control group, with unsuspected abnormalities on standard neuropsychological tests, electroencephalograms, or magnetic resonance imaging scans. Computer-based testing is a valid screening methodology for the detection of mild cognitive impairment in the elderly, although this particular test has important limitations. Broader applications of computer-based testing will require extensive population-based validation. Future studies should recognize that normal control subjects without a history of disease who are typically used in validation studies may have a high incidence of unsuspected abnormalities on neurodiagnostic studies.
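
    The predictive values quoted in the abstract above follow from sensitivity, specificity and an assumed prevalence via Bayes' rule. The sketch below is illustrative only and is not taken from the study: the function name `predictive_values` is hypothetical, and the inputs are the rounded figures from the abstract (sensitivity 0.83, specificity 0.96, prevalence 10%). With these rounded inputs the positive predictive value comes out close to the reported 0.70; the negative predictive value computed this way need not match the reported figure exactly.

```python
def predictive_values(sensitivity, specificity, prevalence):
    """Return (PPV, NPV) for a screening test via Bayes' rule.

    All arguments are proportions in [0, 1]. This is a generic
    textbook calculation, not the authors' own analysis code.
    """
    true_pos = sensitivity * prevalence
    false_neg = (1 - sensitivity) * prevalence
    true_neg = specificity * (1 - prevalence)
    false_pos = (1 - specificity) * (1 - prevalence)
    ppv = true_pos / (true_pos + false_pos)
    npv = true_neg / (true_neg + false_neg)
    return ppv, npv


# Rounded inputs as quoted in the abstract (an assumption of this sketch).
ppv, npv = predictive_values(sensitivity=0.83, specificity=0.96, prevalence=0.10)
print(f"PPV = {ppv:.2f}, NPV = {npv:.2f}")
```

    Because the positive predictive value depends strongly on prevalence, rerunning the function with a different `prevalence` shows how the same test would behave in a population with more or less impairment.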

  14. Integrated test plan for directional boring

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Volk, B.W.

    This integrated test plan describes the field testing of the DITCH WITCH Directional Boring System. DITCH WITCH is a registered trademark of The Charles Machine Works, Inc., Perry, Oklahoma. The test is being conducted as a coordinated effort between Charles Machine Works (CMW), Sandia National Laboratories (SNL), and the Westinghouse Hanford Company (WHC). Funding for the WHC portion of the project is through the Volatile Organic Compound-Arid Integrated Demonstration (VOC-Arid ID). The purpose of the test is to evaluate the performance of the directional boring system for possible future use on environmental restoration projects at Hanford and other Department of Energy (DOE) sites. The test will be conducted near the 200 Areas Fire Station located between the 200 East and 200 West Area of the Hanford Site. The directional boring system will be used to drill and complete (with fiberglass casing) two horizontal boreholes. A third borehole will be drilled to test sampling equipment but will not be completed with casing.

  15. FUNCTIONAL PERFORMANCE TESTING OF THE HIP IN ATHLETES: A SYSTEMATIC REVIEW FOR RELIABILITY AND VALIDITY

    PubMed Central

    Martin, RobRoy L.

    2012-01-01

    Purpose/Background: The purpose of this study was to systematically review the literature for functional performance tests with evidence of reliability and validity that could be used for a young, athletic population with hip dysfunction. Methods: A search of the PubMed and SPORTDiscus databases was performed to identify movement, balance, hop/jump, or agility functional performance tests from the current peer-reviewed literature used to assess function of the hip in young, athletic subjects. Results: The single-leg stance, deep squat, single-leg squat, and star excursion balance tests (SEBT) demonstrated evidence of validity and normative data for score interpretation. The single-leg stance test and SEBT have evidence of validity with association to hip abductor function. The deep squat test demonstrated evidence as a functional performance test for evaluating femoroacetabular impingement. Hop/Jump tests and agility tests have no reported evidence of reliability or validity in a population of subjects with hip pathology. Conclusions: Use of functional performance tests in the assessment of hip dysfunction has not been well established in the current literature. Diminished squat depth and provocation of pain during the single-leg balance test have been associated with patients diagnosed with FAI and gluteal tendinopathy, respectively. The SEBT and single-leg squat tests provided evidence of convergent validity through an analysis of kinematics and muscle function in normal subjects. Reliability of functional performance tests has not been established in patients with hip dysfunction. Further study is needed to establish reliability and validity of functional performance tests that can be used in a young, athletic population with hip dysfunction. Level of Evidence: 2b (Systematic Review of Literature) PMID:22893860

  16. SU-F-BRF-10: Deformable MRI to CT Validation Employing Same Day Planning MRI for Surrogate Analysis

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Padgett, K; Stoyanova, R; Johnson, P

    Purpose: To compare rigid and deformable registrations of the prostate in the multi-modality setting (diagnostic-MRI to planning-CT) by utilizing a planning-MRI as a surrogate. The surrogate allows for the direct quantitative analysis which can be difficult in the multi-modality domain where intensity mapping differs. Methods: For ten subjects, T2 fast-spin-echo images were acquired at two different time points, the first several weeks prior to planning (diagnostic-MRI) and the second on the same day in which the planning CT was collected (planning-MRI). Significant effort in patient positioning and bowel/bladder preparation was undertaken to minimize distortion of the prostate in all datasets. The diagnostic-MRI was deformed to the planning-CT utilizing a commercially available deformable registration algorithm synthesized from local registrations. The deformed MRI was then rigidly aligned to the planning MRI which was used as the surrogate for the planning-CT. Agreement between the two MRI datasets was scored using intensity based metrics including Pearson correlation and normalized mutual information, NMI. A local analysis was performed by looking only within the prostate, proximal seminal vesicles, penile bulb and combined areas. A similar method was used to assess a rigid registration between the diagnostic-MRI and planning-CT. Results: Utilizing the NMI, the deformable registrations were superior to the rigid registrations in 9 of 10 cases demonstrating a 15.94% improvement (p-value < 0.001) within the combined area. The Pearson correlation showed similar results with the deformable registration superior in the same number of cases and demonstrating a 6.97% improvement (p-value <0.011). Conclusion: Validating deformable multi-modality registrations using spatial intensity based metrics is difficult due to the inherent differences in intensity mapping. This population provides an ideal testing ground for MRI to CT deformable registrations by obviating

  17. Experimental validation of a new heterogeneous mechanical test design

    NASA Astrophysics Data System (ADS)

    Aquino, J.; Campos, A. Andrade; Souto, N.; Thuillier, S.

    2018-05-01

    Standard material parameter identification strategies generally use an extensive number of classical tests for collecting the required experimental data. However, a great effort has been made recently by the scientific and industrial communities to support this experimental database on heterogeneous tests. These tests can provide richer information on the material behavior, allowing the identification of a more complete set of material parameters. This is a result of the recent development of full-field measurement techniques, like digital image correlation (DIC), that can capture the heterogeneous deformation fields on the specimen surface during the test. Recently, new specimen geometries were designed to enhance the richness of the strain field and capture supplementary strain states. The butterfly specimen is an example of these new geometries, designed through a numerical optimization procedure using an indicator capable of evaluating the heterogeneity and richness of the strain information. However, no experimental validation has yet been performed. The aim of this work is to experimentally validate the heterogeneous butterfly mechanical test in the parameter identification framework. To this end, the DIC technique and a Finite Element Model Update inverse strategy are used together for the parameter identification of a DC04 steel, as well as the calculation of the indicator. The experimental tests are carried out in a universal testing machine with the ARAMIS measuring system to provide the strain states on the specimen surface. The identification strategy is accomplished with the data obtained from the experimental tests and the results are compared to a reference numerical solution.

  18. Defense of Tests Prevents Objective Consideration of Validity and Fairness

    ERIC Educational Resources Information Center

    Helms, Janet E.

    2009-01-01

    In defending tests of cognitive abilities, knowledge, or skills (CAKS) from the skepticism of their "family members, friends, and neighbors" and aiding psychologists forced to defend tests from "myth and hearsay" in their own skeptical social networks (p. 215), Sackett, Borneman, and Connelly focused on evaluating validity coefficients, racial or…

  19. Exploring the Reliability and Validity of the Social-Moral Awareness Test

    ERIC Educational Resources Information Center

    Livesey, Alexandra; Dodd, Karen; Pote, Helen; Marlow, Elizabeth

    2012-01-01

    Background: The aim of the study was to explore the validity of the social-moral awareness test (SMAT) a measure designed for assessing socio-moral rule knowledge and reasoning in people with learning disabilities. Comparisons between Theory of Mind and socio-moral reasoning allowed the exploration of construct validity of the tool. Factor…

  20. Test and Evaluation for Enhanced Security: A Quantitative Method to Incorporate Expert Knowledge into Test Planning Decisions.

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Rizzo, Davinia; Blackburn, Mark

    Complex systems are comprised of technical, social, political and environmental factors as well as the programmatic factors of cost, schedule and risk. Testing these systems for enhanced security requires expert knowledge in many different fields. It is important to test these systems to ensure effectiveness, but testing is limited due to cost, schedule, safety, feasibility and a myriad of other reasons. Without an effective decision framework for Test and Evaluation (T&E) planning that can take into consideration technical as well as programmatic factors and leverage expert knowledge, security in complex systems may not be assessed effectively. Therefore, this paper covers the identification of the current T&E planning problem and an approach to include the full variety of factors and leverage expert knowledge in T&E planning through the use of Bayesian Networks (BN).

  1. Phase 1 Validation Testing and Simulation for the WEC-Sim Open Source Code

    NASA Astrophysics Data System (ADS)

    Ruehl, K.; Michelen, C.; Gunawan, B.; Bosma, B.; Simmons, A.; Lomonaco, P.

    2015-12-01

    WEC-Sim is an open source code to model wave energy converters performance in operational waves, developed by Sandia and NREL and funded by the US DOE. The code is a time-domain modeling tool developed in MATLAB/SIMULINK using the multibody dynamics solver SimMechanics, and solves the WEC's governing equations of motion using the Cummins time-domain impulse response formulation in 6 degrees of freedom. The WEC-Sim code has undergone verification through code-to-code comparisons; however validation of the code has been limited to publicly available experimental data sets. While these data sets provide preliminary code validation, the experimental tests were not explicitly designed for code validation, and as a result are limited in their ability to validate the full functionality of the WEC-Sim code. Therefore, dedicated physical model tests for WEC-Sim validation have been performed. This presentation provides an overview of the WEC-Sim validation experimental wave tank tests performed at the Oregon State University's Directional Wave Basin at Hinsdale Wave Research Laboratory. Phase 1 of experimental testing was focused on device characterization and completed in Fall 2015. Phase 2 is focused on WEC performance and scheduled for Winter 2015/2016. These experimental tests were designed explicitly to validate the performance of WEC-Sim code, and its new feature additions. Upon completion, the WEC-Sim validation data set will be made publicly available to the wave energy community. For the physical model test, a controllable model of a floating wave energy converter has been designed and constructed. The instrumentation includes state-of-the-art devices to measure pressure fields, motions in 6 DOF, multi-axial load cells, torque transducers, position transducers, and encoders. The model also incorporates a fully programmable Power-Take-Off system which can be used to generate or absorb wave energy. Numerical simulations of the experiments using WEC-Sim will be

  2. The Theory of Planned Behavior (TPB) and Pre-Service Teachers' Technology Acceptance: A Validation Study Using Structural Equation Modeling

    ERIC Educational Resources Information Center

    Teo, Timothy; Tan, Lynde

    2012-01-01

    This study applies the theory of planned behavior (TPB), a theory that is commonly used in commercial settings, to the educational context to explain pre-service teachers' technology acceptance. It is also interested in examining its validity when used for this purpose. It has found evidence that the TPB is a valid model to explain pre-service…

  3. A FIELD VALIDATION OF TWO SEDIMENT-AMPHIPOD TOXICITY TESTS

    EPA Science Inventory

    A field validation study of two sediment-amphipod toxicity tests was conducted using sediment samples collected subtidally in the vicinity of a polycyclic aromatic hydrocarbon (PAH)-contaminated Superfund site in Elliott Bay, WA, USA. Sediment samples were collected at 30 stati...

  4. Validity, Reliability, and Sensitivity of a Volleyball Intermittent Endurance Test.

    PubMed

    Rodríguez-Marroyo, Jose A; Medina-Carrillo, Javier; García-López, Juan; Morante, Juan C; Villa, José G; Foster, Carl

    2017-03-01

    To analyze the concurrent and construct validity of a volleyball intermittent endurance test (VIET). The VIET's test-retest reliability and sensitivity to assess seasonal changes was also studied. During the preseason, 71 volleyball players of different competitive levels took part in this study. All performed the VIET and a graded treadmill test with gas-exchange measurement (GXT). Thirty-one of the players performed an additional VIET to analyze the test-retest reliability. To test the VIET's sensitivity, 28 players repeated the VIET and GXT at the end of their season. Significant (P < .001) relationships between VIET distance and maximal oxygen uptake (r = .74) and GXT maximal speed (r = .78) were observed. There were no significant differences between the VIET performance test and retest (1542.1 ± 338.1 vs 1567.1 ± 358.2 m). Significant (P < .001) relationships and intraclass correlation coefficient (ICC) were found (r = .95, ICC = .96) for VIET performance. VIET performance increased significantly (P < .001) with player performance level and was sensitive to fitness changes across the season (1458.8 ± 343.5 vs 1581.1 ± 334.0 m, P < .01). The VIET may be considered a valid, reliable, and sensitive test to assess the aerobic endurance in volleyball players.

  5. Actual curriculum development practices instrument: Testing for factorial validity

    NASA Astrophysics Data System (ADS)

    Foi, Liew Yon; Bakar, Kamariah Abu; Hamzah, Mohd Sahandri Gani; Alwi, Nor Hayati

    2014-09-01

    The Actual Curriculum Development Practices Instrument (ACDP-I) was developed and the factorial validity of the ACDP-I was tested (n = 107) using exploratory factor analysis procedures in the earlier work of [1]. Although the ACDP-I appears to be a content- and construct-valid instrument with very high internal reliability for use in Malaysia, accumulated evidence is still needed to provide a sound scientific basis for the proposed score interpretations. Therefore, the present study addresses this concern by using confirmatory factor analysis to further confirm the theoretical structure of the variable Actual Curriculum Development Practices (ACDP) and enrich the psychometric properties of the ACDP-I. The results of this study have practical implications for both researchers and educators whose concerns focus on teachers' classroom practices and the instrument development and validation process.

  6. Validity and Acceptance of Color Vision Testing on Smartphones.

    PubMed

    Ozgur, Omar K; Emborgo, Trisha S; Vieyra, Mark B; Huselid, Rebecca F; Banik, Rudrani

    2018-03-01

    Ishihara color plates (ICP) are the most commonly used color vision test (CVT) worldwide. With the advent of new technologies, attempts have been made to streamline the process of CVT. As hardware and software evolve, smartphone-based testing modalities may aid ophthalmologists in performing more efficient ophthalmic examinations. We assess the validity of smartphone color vision testing (CVT) by comparing results using the Eye Handbook (EHB) CVT application with standard Ishihara color plates (ICP). Prospective case-control study of subjects 18 years and older with visual acuity of 20/100 or better at 14 inches. The study group included patients with any ocular pathology. The color vision deficient (CVD) group was patients who failed more than 2 plates. The control group had no known ocular pathology. CVT was performed with both ICP and EHB under standardized background illuminance. Eleven plates were tested with each modality. Validity of EHB CVT and acceptance of EHB CVT were analyzed. Statistical analyses were performed using Bland-Altman plot with limits of agreement (LOA) at the 95th percentile of differences in score, independent samples t tests with 95% confidence interval (CI), and Pearson χ² tests. The Bland-Altman plot showed agreement between correct number of plates in EHB and ICP for the study subjects (bias, -0.25; LOA, -1.92 to 1.42). Agreement was also observed between the correct number of plates in EHB and ICP for the controls (bias, -0.01; LOA, -0.61 to 0.59) and CVD (bias, -0.50; LOA, -4.64 to 3.64) subjects. The sensitivity of EHB was 0.92 (95% CI 0.76-1.07) and the specificity of EHB was 1.00 (95% CI 1.00-1.00). Fifty-nine percent preferred EHB, 12% preferred ICP, and 29% had no preference. In healthy controls and patients with ocular pathology, there was an agreement of CVT results comparing EHB with ICP. Overall, the majority preferred EHB to ICP. These findings demonstrate that further testing is required to understand and improve the
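
    The bias and limits of agreement quoted in the abstract above are the usual outputs of a Bland-Altman analysis. As a minimal sketch, assuming the conventional definition (bias is the mean of the paired differences, limits are bias ± 1.96 standard deviations), the calculation looks like the following. The function name `bland_altman` and the score lists are hypothetical and are not study data; the authors' exact procedure may differ.

```python
from statistics import mean, stdev


def bland_altman(method_a, method_b):
    """Return (bias, lower_limit, upper_limit) for paired measurements.

    Assumes the conventional limits of agreement, bias +/- 1.96 * SD of
    the paired differences (sample standard deviation).
    """
    diffs = [a - b for a, b in zip(method_a, method_b)]
    bias = mean(diffs)
    spread = 1.96 * stdev(diffs)
    return bias, bias - spread, bias + spread


# Hypothetical numbers of plates read correctly (out of 11) per subject.
ehb_scores = [11, 10, 11, 9, 11, 8, 11, 11]
icp_scores = [11, 11, 11, 9, 10, 9, 11, 11]

bias, lower, upper = bland_altman(ehb_scores, icp_scores)
print(f"bias = {bias:.3f}, LOA = {lower:.2f} to {upper:.2f}")
```

    A bias near zero with narrow limits suggests the two methods can be used interchangeably; wide limits, as reported for the CVD group, indicate larger disagreement for individual subjects even when the average difference is small.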

  7. Test of Creative Imagination: Validity and Reliability Study

    ERIC Educational Resources Information Center

    Gundogan, Aysun; Ari, Meziyet; Gonen, Mubeccel

    2013-01-01

    The purpose of this study was to investigate validity and reliability of the test of creative imagination. This study was conducted with the participation of 1000 children aged 9-14 who were studying in six primary schools in the city center of Denizli Province, chosen by cluster ratio sampling. In the study, it was revealed that the…

  8. Development and Validation of Economics Achievement Test for Secondary Schools

    ERIC Educational Resources Information Center

    Eleje, Lydia Ijeoma; Abanobi, Chidiebere Christopher; Obasi, Emma

    2017-01-01

    Economics achievement test (EAT) for assessing senior secondary two (SS2) achievement in economics was developed and validated in the study. Five research questions guided the study. Twenty and 100 mid-senior secondary (SS2) economics students were used for the pilot testing and reliability check respectively. A sample of 250 students randomly…

  9. Improved Test Planning and Analysis Through the Use of Advanced Statistical Methods

    NASA Technical Reports Server (NTRS)

    Green, Lawrence L.; Maxwell, Katherine A.; Glass, David E.; Vaughn, Wallace L.; Barger, Weston; Cook, Mylan

    2016-01-01

    The goal of this work is, through computational simulations, to provide statistically-based evidence to convince the testing community that a distributed testing approach is superior to a clustered testing approach for most situations. For clustered testing, numerous, repeated test points are acquired at a limited number of test conditions. For distributed testing, only one or a few test points are requested at many different conditions. The statistical techniques of Analysis of Variance (ANOVA), Design of Experiments (DOE) and Response Surface Methods (RSM) are applied to enable distributed test planning, data analysis and test augmentation. The D-Optimal class of DOE is used to plan an optimally efficient single- and multi-factor test. The resulting simulated test data are analyzed via ANOVA and a parametric model is constructed using RSM. Finally, ANOVA can be used to plan a second round of testing to augment the existing data set with new data points. The use of these techniques is demonstrated through several illustrative examples. To date, many thousands of comparisons have been performed and the results strongly support the conclusion that the distributed testing approach outperforms the clustered testing approach.

  10. Why Lessons Learned from the Past Require Haertel's Expanded Scope for Test Validation

    ERIC Educational Resources Information Center

    Shepard, Lorrie A.

    2013-01-01

    In his article, Haertel (this issue) asks a fundamental question about how use of a test is expected to cause improvements in the educational system and in learning. He also considers how test validity should be investigated and argues for a more expansive view of validity that does not stop with scoring or generalization (the more technical and…

  11. Validation of a Theory of Planned Behavior-Based Questionnaire to Examine Factors Associated With Milk Expression.

    PubMed

    Bai, Yeon K; Dinour, Lauren M

    2017-11-01

    A proper assessment of multidimensional needs for breastfeeding mothers in various settings is crucial to facilitate and support breastfeeding and its exclusivity. The theory of planned behavior (TPB) has been used frequently to measure factors associated with breastfeeding. Full utility of the TPB requires accurate measurement of theory constructs. Research aim: This study aimed to develop and confirm the psychometric properties of an instrument, Milk Expression on Campus, based on the TPB and to establish the reliability and validity of the instrument. In spring 2015, 218 breastfeeding (current or in the recent past) employees and students at one university campus in northern New Jersey completed the online questionnaire containing demography and theory-based items. Internal consistency (α) and split-half reliability (r) tests and factor analyses established and confirmed the reliability and construct validity of this instrument. Milk Expression on Campus showed strong and significant reliabilities as a full scale (α = .78, r = .74, p < .001) and theory construct subscales. Validity was confirmed as psychometric properties corresponded to the factors extracted from the scale. Four factors extracted from the direct construct subscales accounted for 79.49% of the total variability. Four distinct factors from the indirect construct subscales accounted for 73.68% of the total variability. Milk Expression on Campus can serve as a model TPB-based instrument to examine factors associated with women's milk expression behavior. The utility of this instrument extends to designing effective promotion programs to foster breastfeeding and milk expression behaviors in diverse settings.
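
The internal-consistency statistic (α) cited in the record above is commonly computed as Cronbach's alpha. The following is a minimal sketch of the standard formula, alpha = k/(k-1) * (1 - sum of item variances / variance of total scores); the function name and the response matrix are hypothetical and are not taken from the study.

```python
from statistics import variance


def cronbach_alpha(rows):
    """Cronbach's alpha for a respondents-by-items matrix of scores.

    rows: list of per-respondent lists, one score per item.
    Uses sample variances for both the items and the total scores.
    """
    k = len(rows[0])
    if k < 2:
        raise ValueError("need at least two items")
    columns = list(zip(*rows))  # one tuple of scores per item
    item_var_sum = sum(variance(col) for col in columns)
    total_var = variance([sum(row) for row in rows])
    return k / (k - 1) * (1 - item_var_sum / total_var)


# Hypothetical 5-point ratings: five respondents, four items.
responses = [
    [4, 5, 4, 4],
    [3, 3, 2, 3],
    [5, 5, 4, 5],
    [2, 3, 2, 2],
    [4, 4, 3, 4],
]
alpha = cronbach_alpha(responses)
print(f"alpha = {alpha:.2f}")
```

Items that rise and fall together across respondents push alpha toward 1; unrelated items push it toward 0.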

  12. Voices from Test-Takers: Further Evidence for Language Assessment Validation and Use

    ERIC Educational Resources Information Center

    Cheng, Liying; DeLuca, Christopher

    2011-01-01

    Test-takers' interpretations of validity as related to test constructs and test use have been widely debated in large-scale language assessment. This study contributes further evidence to this debate by examining 59 test-takers' perspectives in writing large-scale English language tests. Participants wrote about their test-taking experiences in…

  13. The Role of Structural Models in the Solar Sail Flight Validation Process

    NASA Technical Reports Server (NTRS)

    Johnston, John D.

    2004-01-01

    NASA is currently soliciting proposals via the New Millennium Program ST-9 opportunity for a potential Solar Sail Flight Validation (SSFV) experiment to develop and operate in space a deployable solar sail that can be steered and provides measurable acceleration. The approach planned for this experiment is to test and validate models and processes for solar sail design, fabrication, deployment, and flight. These models and processes would then be used to design, fabricate, and operate scalable solar sails for future space science missions. There are six validation objectives planned for the ST-9 SSFV experiment: 1) Validate solar sail design tools and fabrication methods; 2) Validate controlled deployment; 3) Validate in-space structural characteristics (focus of poster); 4) Validate solar sail attitude control; 5) Validate solar sail thrust performance; 6) Characterize the sail's electromagnetic interaction with the space environment. This poster presents a top-level assessment of the role of structural models in the validation process for in-space structural characteristics.

  14. Hazardous material transportation safety and security field operational test final detailed test plans : executive summary

    DOT National Transportation Integrated Search

    2003-09-16

    The objective of this Hazardous Material (HazMat) Transportation Safety and Security Field Operational Test (FOT) Final Detailed Test Plans evaluation is to measure the impact of technology solutions on the safety, security, and operational efficienc...

  15. Assessing cultural validity in standardized tests in stem education

    NASA Astrophysics Data System (ADS)

    Gassant, Lunes

    This quantitative ex post facto study examined how race and gender, as elements of culture, influence the development of common misconceptions among STEM students. Primary data came from a standardized test: the Digital Logic Concept Inventory (DLCI) developed by Drs. Geoffrey L. Herman, Michael C. Louis, and Craig Zilles from the University of Illinois at Urbana-Champaign. The sample consisted of a cohort of 82 STEM students recruited from three universities in Northern Louisiana. Microsoft Excel and the Statistical Package for the Social Sciences (SPSS) were used for data computation. Two key concepts, several subconcepts, and 19 misconceptions were tested through 11 items in the DLCI. Statistical analyses based on both the Classical Test Theory (Spearman, 1904) and the Item Response Theory (Lord, 1952) yielded similar results: some misconceptions in the DLCI can reliably be predicted by the Race or the Gender of the test taker. The research is significant because it has shown that some misconceptions in a STEM discipline attracted students with similar ethnic backgrounds differently, thus leading to the existence of some cultural bias in the standardized test. Therefore the study encourages further research in cultural validity in standardized tests. With culturally valid tests, it will be possible to increase the effectiveness of targeted teaching and learning strategies for STEM students from diverse ethnic backgrounds. To some extent, this dissertation has contributed to a better understanding of the gap between high enrollment rates and low graduation rates among African American students and also among other minority students in STEM disciplines.

  16. Translation and validation of the Malay version of the Stroke Knowledge Test.

    PubMed

    Sowtali, Siti Noorkhairina; Yusoff, Dariah Mohd; Harith, Sakinah; Mohamed, Monniaty

    2016-04-01

    To date, there is a lack of published studies on assessment tools to evaluate the effectiveness of stroke education programs. This study developed and validated the Malay language version of the Stroke Knowledge Test research instrument. This study involved translation, validity, and reliability phases. The instrument underwent backward and forward translation of the English version into the Malay language. Nine experts reviewed the content for consistency, clarity, difficulty, and suitability for inclusion. Perceived usefulness and utilization were obtained from experts' opinions. Later, face validity assessment was conducted with 10 stroke patients to determine appropriateness of sentences and grammar used. A pilot study was conducted with 41 stroke patients to determine the item analysis and reliability of the translated instrument using the Kuder-Richardson 20 or Cronbach's alpha. The final Malay version Stroke Knowledge Test included 20 items with good content coverage, acceptable item properties, and positive expert review ratings. Psychometric investigations suggest that the Malay version Stroke Knowledge Test had moderate reliability with Kuder-Richardson 20 or Cronbach's alpha of 0.58. Improvement is required for Stroke Knowledge Test items with unacceptable difficulty indices. Overall, the average rating of perceived usefulness and perceived utility of the instruments were both 72.7%, suggesting that reviewers were likely to use the instruments in their facilities. Malay version Stroke Knowledge Test was a valid and reliable tool to assess educational needs and to evaluate stroke knowledge among participants of group-based stroke education programs in Malaysia.
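
For right/wrong (dichotomous) items such as those in a knowledge test, the Kuder-Richardson 20 coefficient named in the record above can be sketched as follows. This assumes the textbook formula, KR-20 = k/(k-1) * (1 - sum(p*q) / variance of total scores), with population variances throughout; the function name and the answer matrix are hypothetical, not data from the study.

```python
from statistics import pvariance


def kr20(rows):
    """Kuder-Richardson 20 for a respondents-by-items matrix of 0/1 scores.

    p is the proportion answering an item correctly and q = 1 - p.
    Population variance is used for the total scores so that it matches
    the p*q item variances.
    """
    n = len(rows)
    k = len(rows[0])
    if k < 2:
        raise ValueError("need at least two items")
    pq_sum = 0.0
    for col in zip(*rows):
        p = sum(col) / n
        pq_sum += p * (1 - p)
    total_var = pvariance([sum(row) for row in rows])
    if total_var == 0:
        raise ValueError("total scores have zero variance")
    return k / (k - 1) * (1 - pq_sum / total_var)


# Hypothetical answers: six respondents, five items (1 = correct).
answers = [
    [1, 1, 1, 0, 1],
    [1, 0, 1, 0, 0],
    [1, 1, 1, 1, 1],
    [0, 0, 1, 0, 0],
    [1, 1, 0, 1, 1],
    [0, 0, 0, 0, 1],
]
print(f"KR-20 = {kr20(answers):.2f}")
```

With 0/1 items, KR-20 coincides with Cronbach's alpha computed under the same variance convention, which is presumably why the abstract reports the two interchangeably.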

  17. Victoria Symptom Validity Test performance in children and adolescents with neurological disorders.

    PubMed

    Brooks, Brian L

    2012-12-01

    It is becoming increasingly important to study, use, and promote the utility of measures that are designed to detect non-compliance with testing (i.e., poor effort, symptom non-validity, response bias) as part of neuropsychological assessments with children and adolescents. Several measures have evidence for use in pediatrics, but there is a paucity of published support for the Victoria Symptom Validity Test (VSVT) in this population. The purpose of this study was to examine the performance on the VSVT in a sample of pediatric patients with known neurological disorders. The sample consisted of 100 consecutively referred children and adolescents between the ages of 6 and 19 years (mean = 14.0, SD = 3.1) with various neurological diagnoses. On the VSVT total items, 95% of the sample had performance in the "valid" range, with 5% being deemed "questionable" and 0% deemed "invalid". On easy items, 97% were "valid", 2% were "questionable", and 1% was "invalid." For difficult items, 84% were "valid," 16% were "questionable," and 0% was "invalid." For those patients given two effort measures (i.e., VSVT and Test of Memory Malingering; n = 65), none was identified as having poor test-taking compliance on both measures. VSVT scores were significantly correlated with age, intelligence, processing speed, and functional ratings of daily abilities (attention, executive functioning, and adaptive functioning), but not objective performance on the measure of sustained attention, verbal memory, or visual memory. The VSVT has potential to be used in neuropsychological assessments with pediatric patients.

  18. Validating the Astronomy Diagnostics Test for Undergraduate Non-Science Majors

    NASA Astrophysics Data System (ADS)

    Slater, T. F.; Hufnagel, B.; Adams, J. P.

    1999-05-01

    The Astronomy Diagnostics Test (ADT) is a standard diagnostic test for undergraduate non-science majors taking introductory astronomy. Serving to compare the effectiveness of various instructional interventions, the ADT has been developed and field-tested over the last year by a multi-institutional team, known as the Collaboration for Astronomy Education Research (CAER). The team includes Jeff Adams, Rebecca Lindell Adrian, Christine Brick, Gina Brissenden, Grace Deming, Beth Hufnagel, Tim Slater, and Michael Zeilik, among others. The need for a nationally normed, valid, and reliable assessment instrument in astronomy has been articulated in a wide variety of forums. This need results from the simultaneous occurrence of several important phenomena over the last decade including: the inclusion of astronomy concepts in national science education standards; documentation of widespread astronomical misconceptions; the influence of the Force Concept Inventory guiding reform in physics; and the call for university faculty to document improvements in instruction. In a triangulated effort to validate the ADT for widespread use, the researchers used a three-phase strategy. In this context, "validity" means that the ADT measures what it purports to measure. In other words, do students give the correct answer for the scientifically correct reasons or, alternatively, do students give the correct answer even though they have misunderstandings about the phenomena being tested? These three phases were: (1) conduct statistical item-analysis on each test question for a large and diverse student population (n=2000 from 21 institutions); (2) conduct 60 clinical student interviews using the test questions as the script; and (3) conduct an inductive analysis of 30 student supplied written responses to ADT questions posed without the multiple-choices provided. The ADT and its supporting comparative database are available at URL: http://solar.physics.montana.edu/aae/adt/. This research

  19. EPRI/DOE High Burnup Fuel Sister Pin Test Plan Simplification and Visualization

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Saltzstein, Sylvia J.; Sorenson, Ken B.; Hanson, Brady

    The EPRI/DOE High Burnup Confirmatory Data Project (herein called the "Demo") is a multi-year, multi-entity confirmation demonstration test with the purpose of providing quantitative and qualitative data to show how high-burnup fuel ages in dry storage over a ten-year period. The Demo involves obtaining 32 assemblies of high-burnup PWR fuel of four common cladding alloys from the North Anna Nuclear Power Plant, drying them according to standard plant procedures, and then storing them in an NRC-licensed TN-32B cask on the North Anna dry storage pad for ten years. After the ten-year storage time, the cask will be opened and the rods will be examined for signs of aging. Twenty-five rods from assemblies of similar claddings, in-reactor placement, and burnup histories (herein called "sister rods") have been shipped from the North Anna Nuclear Power Plant and are currently being nondestructively tested at Oak Ridge National Laboratory. After the non-destructive testing has been completed for each of the twenty-five rods, destructive analysis will be performed at ORNL, PNNL, and ANL to obtain mechanical data. Opinions gathered from the expert interviews, ORNL and PNNL Sister Rod Test Plans, and numerous meetings have resulted in the Simplified Test Plan described in this document. Some of the opinions and discussions leading to the simplified test plan are included here. Detailed descriptions and background are in the ORNL and PNNL plans in the appendices. After the testing described in this simplified test plan has been completed, the community will review all the collected data and determine if additional testing is needed.

  20. Validation of Sherouk's Critical Thinking Test (SH-CTT)

    ERIC Educational Resources Information Center

    Kadhm, Sherouk J.

    2017-01-01

    This study aimed to examine the psychometric properties (reliability and validity) of the Arabic version of Sherouk's Critical Thinking Test. This test has four parts, each of which provides a story that is divided into an introduction and a scene; each story is then followed by a list of sensitive questions featuring two response options…

  1. Psychometric Arabic Sino-Nasal Outcome Test-22: validation and translation in chronic rhinosinusitis patients.

    PubMed

    Alanazy, Fatma; Dousary, Surayie Al; Albosaily, Ahmed; Aldriweesh, Turki; Alsaleh, Saad; Aldrees, Turki

    2018-01-01

    The Sino-Nasal Outcome Test (SNOT)-22 has multiple items that reflect how nasal disease affects quality of life. Currently, no validated Arabic version of the SNOT-22 is available. To develop an Arabic-validated version of SNOT-22. Prospective. Tertiary care center. This single-center validation study was conducted between 2015 and 2017 at King Abdul-Aziz University Hospital, Riyadh, Saudi Arabia. The SNOT-22 English version was translated into Arabic by the forward and backward method. The test and retest reliability, internal consistency, responsiveness to surgical treatment, discriminant validity, sensitivity and specificity all were tested. Validated Arabic version of the SNOT-22. Of 265 individuals, 171 were healthy volunteers and 94 were chronic rhinosinusitis patients. The Arabic version showed high internal consistency (Cronbach's alpha of 0.94), and the ability to differentiate between diseased and healthy volunteers (P < .001). The translated versions demonstrated the ability to detect the change scores significantly in response to intervention (P < .001). This is the first validated Arabic version of SNOT-22. The instrument can be used among the Arabic population. No subjects from other Arab countries.

  2. The ad-libitum alcohol 'taste test': secondary analyses of potential confounds and construct validity.

    PubMed

    Jones, Andrew; Button, Emily; Rose, Abigail K; Robinson, Eric; Christiansen, Paul; Di Lemma, Lisa; Field, Matt

    2016-03-01

    Motivation to drink alcohol can be measured in the laboratory using an ad-libitum 'taste test', in which participants rate the taste of alcoholic drinks whilst their intake is covertly monitored. Little is known about the construct validity of this paradigm. The objective of this study was to investigate variables that may compromise the validity of this paradigm and its construct validity. We re-analysed data from 12 studies from our laboratory that incorporated an ad-libitum taste test. We considered time of day and participants' awareness of the purpose of the taste test as potential confounding variables. We examined whether gender, typical alcohol consumption, subjective craving, scores on the Alcohol Use Disorders Identification Test and perceived pleasantness of the drinks predicted ad-libitum consumption (construct validity). We included 762 participants (462 female). Participant awareness and time of day were not related to ad-libitum alcohol consumption. Males drank significantly more alcohol than females (p < 0.001), and individual differences in typical alcohol consumption (p = 0.04), craving (p < 0.001) and perceived pleasantness of the drinks (p = 0.04) were all significant predictors of ad-libitum consumption. We found little evidence that time of day or participant awareness influenced alcohol consumption. The construct validity of the taste test was supported by relationships between ad-libitum consumption and typical alcohol consumption, craving and pleasantness ratings of the drinks. The ad-libitum taste test is a valid method for the assessment of alcohol intake in the laboratory.

  3. Ecological validity of the five digit test and the oral trails test.

    PubMed

    Paiva, Gabrielle Chequer de Castro; Fialho, Mariana Braga; Costa, Danielle de Souza; Paula, Jonas Jardim de

    2016-01-01

    Tests evaluating the attentional-executive system are widely used in clinical practice. However, proximity of an objective cognitive test with real-world situations (ecological validity) is not frequently investigated. The present study evaluated the association between measures of the Five Digit Test (FDT) and the Oral Trails Test (OTT) with self-reported cognitive failures in everyday life as measured by the Cognitive Failures Questionnaire (CFQ). Brazilian adults aged 18 to 65 years voluntarily performed the FDT and OTT tests and reported the frequency of cognitive failures in their everyday life through the CFQ. After controlling for the age effect, the measures of controlled attentional processes were associated with cognitive failures, yet the cognitive flexibility of both FDT and OTT accounted for the majority of variance in most aspects of the CFQ factors. The FDT and the OTT measures were predictive of real-world problems such as cognitive failures in everyday activities/situations.

  4. Issues in cross-cultural validity: example from the adaptation, reliability, and validity testing of a Turkish version of the Stanford Health Assessment Questionnaire.

    PubMed

    Küçükdeveci, Ayse A; Sahin, Hülya; Ataman, Sebnem; Griffiths, Bridget; Tennant, Alan

    2004-02-15

    Guidelines have been established for cross-cultural adaptation of outcome measures. However, invariance across cultures must also be demonstrated through analysis of Differential Item Functioning (DIF). This is tested in the context of a Turkish adaptation of the Health Assessment Questionnaire (HAQ). Internal construct validity of the adapted HAQ is assessed by Rasch analysis; reliability, by internal consistency and the intraclass correlation coefficient; external construct validity, by association with impairments and American College of Rheumatology functional stages. Cross-cultural validity is tested through DIF by comparison with data from the UK version of the HAQ. The adapted version of the HAQ demonstrated good internal construct validity through fit of the data to the Rasch model (mean item fit 0.205; SD 0.998). Reliability was excellent (alpha = 0.97) and external construct validity was confirmed by expected associations. DIF for culture was found in only 1 item. Cross-cultural validity was found to be sufficient for use in international studies between the UK and Turkey. Future adaptation of instruments should include analysis of DIF at the field testing stage in the adaptation process.

  5. Use of the color trails test as an embedded measure of performance validity.

    PubMed

    Henry, George K; Algina, James

    2013-01-01

    One hundred personal injury litigants and disability claimants referred for a forensic neuropsychological evaluation were administered both portions of the Color Trails Test (CTT) as part of a more comprehensive battery of standardized tests. Subjects who failed two or more free-standing tests of cognitive performance validity formed the Failed Performance Validity (FPV) group, while subjects who passed all free-standing performance validity measures were assigned to the Passed Performance Validity (PPV) group. A cutscore of ≥45 seconds to complete Color Trails 1 (CT1) was associated with a classification accuracy of 78%, good sensitivity (66%) and high specificity (90%), while a cutscore of ≥84 seconds to complete Color Trails 2 (CT2) was associated with a classification accuracy of 82%, good sensitivity (74%) and high specificity (90%). A CT1 cutscore of ≥58 seconds, and a CT2 cutscore ≥100 seconds was associated with 100% positive predictive power at base rates from 20 to 50%.
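
The relationship between the sensitivity, specificity, classification accuracy and positive predictive power figures in the record above can be checked with a short sketch. It assumes the two groups were of equal size (a base rate of 0.5 in the sample), which the abstract does not state but which reproduces the reported accuracies; the function names are hypothetical.

```python
def classification_accuracy(sensitivity, specificity, base_rate):
    """Overall proportion classified correctly at a given base rate."""
    return sensitivity * base_rate + specificity * (1 - base_rate)


def positive_predictive_value(sensitivity, specificity, base_rate):
    """Probability that a positive (failed) result is a true positive."""
    true_pos = sensitivity * base_rate
    false_pos = (1 - specificity) * (1 - base_rate)
    return true_pos / (true_pos + false_pos)


# Reported cutscores: CT1 >= 45 s (sens 0.66, spec 0.90),
# CT2 >= 84 s (sens 0.74, spec 0.90). Base rate 0.5 is an assumption.
ct1_accuracy = classification_accuracy(0.66, 0.90, 0.5)
ct2_accuracy = classification_accuracy(0.74, 0.90, 0.5)
print(f"CT1 accuracy: {ct1_accuracy:.2f}, CT2 accuracy: {ct2_accuracy:.2f}")

# Positive predictive value for CT1 across the base rates mentioned.
for rate in (0.2, 0.3, 0.4, 0.5):
    ppv = positive_predictive_value(0.66, 0.90, rate)
    print(f"base rate {rate:.1f}: PPV = {ppv:.2f}")
```

Positive predictive value reaches 100% at any nonzero base rate only when specificity is 1.0, that is, when the cutscore produces no false positives; this is consistent with the stricter cutscores reported at the end of the abstract.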

  6. The Space Station Photovoltaic Panels Plasma Interaction Test Program: Test plan and results

    NASA Technical Reports Server (NTRS)

    Nahra, Henry K.; Felder, Marian C.; Sater, Bernard L.; Staskus, John V.

    1989-01-01

    The Plasma Interaction Test performed on two space station solar array panels is addressed. This includes a discussion of the test requirements, test plan, experimental set-up, and test results. It was found that parasitic current collection was insignificant (0.3 percent of the solar array delivered power). The measured arcing threshold ranged from -210 to -457 V with respect to the plasma potential. Furthermore, the dynamic response of the panels showed the panel time constant to range between 1 and 5 microsec, and the panel capacitance to be between .01 and .02 microF.

  7. The Space Station photovoltaic panels plasma interaction test program - Test plan and results

    NASA Technical Reports Server (NTRS)

    Nahra, Henry K.; Felder, Marian C.; Sater, Bernard L.; Staskus, John V.

    1990-01-01

    The Plasma Interaction Test performed on two space station solar array panels is addressed. This includes a discussion of the test requirements, test plan, experimental set-up, and test results. It was found that parasitic current collection was insignificant (0.3 percent of the solar array delivered power). The measured arcing threshold ranged from -210 to -457 V with respect to the plasma potential. Furthermore, the dynamic response of the panels showed the panel time constant to range between 1 and 5 microsec, and the panel capacitance to be between .01 and .02 microF.

  8. Criterion Related Validity of Karate Specific Aerobic Test (KSAT).

    PubMed

    Chaabene, Helmi; Hachana, Younes; Franchini, Emerson; Tabben, Montassar; Mkaouer, Bessem; Negra, Yassine; Hammami, Mehrez; Chamari, Karim

    2015-09-01

    Karate is one of the most popular combat sports in the world. Physical fitness assessment on a regular basis is important for monitoring the effectiveness of the training program and the readiness of karatekas to compete. The aim of this research was to examine the criterion-related validity of the karate specific aerobic test (KSAT) as an indicator of aerobic level of karate practitioners. Cardiorespiratory responses, aerobic performance level through both treadmill laboratory test and YoYo intermittent recovery test level 1 (YoYoIRTL1) as well as time to exhaustion in the KSAT test (TE'KSAT) were determined in a total of fifteen healthy international karatekas (i.e. karate practitioners) (means ± SD: age: 22.2 ± 4.3 years; height: 176.4 ± 7.5 cm; body mass: 70.3 ± 9.7 kg and body fat: 13.2 ± 6%). Peak heart rate obtained from KSAT represented ~99% of maximal heart rate registered during the treadmill test showing that KSAT imposes high physiological demands. There was no significant correlation between KSAT's TE and relative (mL/min kg) treadmill maximal oxygen uptake (r = 0.14; P = 0.69; [small]). On the other hand, there was a significant relationship between KSAT's TE and the velocity associated with VO2max (vVO2max) (r = 0.67; P = 0.03; [large]) as well as the velocity at VO2 corresponding to the second ventilatory threshold (vVO2 VAT) (r = 0.64; P = 0.04; [large]). Moreover, a significant relationship was found between TE's KSAT and both the total distance covered and parameters of intermittent endurance measured through YoYoIRTL1. The KSAT has not proved to have indirect criterion-related validity as no significant correlations have been found between TE's KSAT and treadmill VO2max. Nevertheless, as correlated to other aerobic fitness variables, KSAT can be considered as an indicator of karate specific endurance. The establishment of the criterion-related validity of the KSAT requires further investigation.

  9. Implementation and Initial Validation of the MDTP Tests at Golden West College.

    ERIC Educational Resources Information Center

    Isonio, Steven

    In 1992, a study was conducted at Golden West College (California) to determine the predictive validity of the Math Diagnostic Testing Project (MDTP) tests. A total of 1,137 students were tested in-class; 601 took the Algebra Readiness test, 376 took the Elementary Algebra test, and 160 took the Intermediate Algebra test. Two correlation…

  10. Validation of Cardiovascular Parameters during NASA's Functional Task Test

    NASA Technical Reports Server (NTRS)

    Arzeno, N. M.; Stenger, M. B.; Bloomberg, J. J.; Platts, S. H.

    2009-01-01

    Microgravity exposure causes physiological deconditioning and impairs crewmember task performance. The Functional Task Test (FTT) is designed to correlate these physiological changes to performance in a series of operationally-relevant tasks. One of these, the Recovery from Fall/Stand Test (RFST), tests both the ability to recover from a prone position and cardiovascular responses to orthostasis. PURPOSE: Three minutes were chosen for the duration of this test, yet it is unknown if this is long enough to induce cardiovascular responses similar to the operational 5 min stand test. The purpose of this study was to determine the validity and reliability of heart rate variability (HRV) analysis of a 3 min stand and to examine the effect of spaceflight on these measures. METHODS: To determine the validity of using 3 vs. 5 min of standing to assess HRV, ECG was collected from 7 healthy subjects who participated in a 6 min RFST. Mean R-R interval (RR) and spectral HRV were measured in minutes 0-3 and 0-5 following the heart rate transient due to standing. Significant differences between the segments were determined by a paired t-test. To determine the reliability of the 3-min stand test, 13 healthy subjects completed 3 trials of the FTT on separate days, including the RFST with a 3 min stand. Analysis of variance (ANOVA) was performed on the HRV measures. One crewmember completed the FTT before a 14-day mission, on landing day (R+0) and one (R+1) day after returning to Earth. RESULTS VALIDITY: HRV measures reflecting autonomic activity were not significantly different during the 0-3 and 0-5 min segments. RELIABILITY: The average coefficient of variation for RR, systolic (SBP) and diastolic blood pressures during the RFST were less than 8% for the 3 sessions. ANOVA results yielded a greater inter-subject variability (p<0.006) than inter-session variability (p>0.05) for HRV in the RFST. SPACEFLIGHT: Lower RR and higher SBP were observed on R+0 in rest and stand. On R+1

  11. Vertical jumping tests in volleyball: reliability, validity, and playing-position specifics.

    PubMed

    Sattler, Tine; Sekulic, Damir; Hadzic, Vedran; Uljevic, Ognjen; Dervisevic, Edvin

    2012-06-01

    Vertical jumping is known to be important in volleyball, and jumping performance tests are frequently studied for their reliability and validity. However, most studies concerning jumping in volleyball have dealt with standard rather than sport-specific jumping procedures and tests. The aims of this study, therefore, were (a) to determine the reliability and factorial validity of 2 volleyball-specific jumping tests, the block jump (BJ) test and the attack jump (AJ) test, relative to 2 frequently used and systematically validated jumping tests, the countermovement jump test and the squat jump test and (b) to establish volleyball position-specific differences in the jumping tests and simple anthropometric indices (body height [BH], body weight, and body mass index [BMI]). The BJ was performed from a defensive volleyball position, with the hands positioned in front of the chest. During an AJ, the players used a 2- to 3-step approach and performed a drop jump with an arm swing followed by a quick vertical jump. A total of 95 high-level volleyball players (all men) participated in this study. The reliability of the jumping tests ranged from 0.97 to 0.99 for Cronbach's alpha coefficients, from 0.93 to 0.97 for interitem correlation coefficients and from 2.1 to 2.8 for coefficients of variation. The highest reliability was found for the specific jumping tests. The factor analysis extracted one significant component, and all of the tests were highly intercorrelated. The analysis of variance with post hoc analysis showed significant differences between 5 playing positions in some of the jumping tests. In general, receivers had a greater jumping capacity, followed by libero players. The differences in jumping capacities should be emphasized vis-a-vis differences in the anthropometric measures of players, where middle hitters had higher BH and body weight, followed by opposite hitters and receivers, with no differences in the BMI between positions.

  12. Perspectives on Validation of High-Throughput Assays Supporting 21st Century Toxicity Testing

    PubMed Central

    Judson, Richard; Kavlock, Robert; Martin, Matt; Reif, David; Houck, Keith; Knudsen, Thomas; Richard, Ann; Tice, Raymond R.; Whelan, Maurice; Xia, Menghang; Huang, Ruili; Austin, Christopher; Daston, George; Hartung, Thomas; Fowle, John R.; Wooge, William; Tong, Weida; Dix, David

    2014-01-01

    In vitro, high-throughput screening (HTS) assays are seeing increasing use in toxicity testing. HTS assays can simultaneously test many chemicals, but have seen limited use in the regulatory arena, in part because of the need to undergo rigorous, time-consuming formal validation. Here we discuss streamlining the validation process, specifically for prioritization applications in which HTS assays are used to identify a high-concern subset of a collection of chemicals. The high-concern chemicals could then be tested sooner rather than later in standard guideline bioassays. The streamlined validation process would continue to ensure the reliability and relevance of assays for this application. We discuss the following practical guidelines: (1) follow current validation practice to the extent possible and practical; (2) make increased use of reference compounds to better demonstrate assay reliability and relevance; (3) deemphasize the need for cross-laboratory testing; and (4) implement a web-based, transparent and expedited peer review process. PMID:23338806

  13. Validity and reliability analysis of the planned behavior theory scale related to the testicular self-examination in a Turkish context.

    PubMed

    Iyigun, Emine; Tastan, Sevinc; Ayhan, Hatice; Kose, Gulsah; Acikel, Cengizhan

    2016-06-01

    This study aimed to determine the validity and reliability levels of the Planned Behavior Theory Scale as related to a testicular self-examination. The study was carried out in a health-profession higher-education school in Ankara, Turkey, from April to June 2012. The study participants comprised 215 male students. Study data were collected by using a questionnaire, a planned behavior theory scale related to testicular self-examination, and Champion's Health Belief Model Scale (CHBMS). The sub-dimensions of the planned behavior theory scale, namely those of intention, attitude, subjective norms and self-efficacy, were found to have Cronbach's alpha values of between 0.81 and 0.89. Exploratory factor analysis showed that items of the scale had five factors that accounted for 75% of the variance. Of these, the sub-dimension of intention was found to have the highest level of contribution. A significant correlation was found between the sub-dimensions of the testicular self-examination planned behavior theory scale and those of CHBMS (p < 0.05). The findings suggest that the Turkish version of the testicular self-examination Planned Behavior Theory Scale is a valid and reliable measurement for Turkish society.

  14. Test Plan for SSR Antenna Rotation Rate Stabilization

    DOT National Transportation Integrated Search

    1981-10-01

    A comprehensive test plan is presented to evaluate the impact of wind and ice loading on the rotation rate stability of a Secondary Surveillance Radar (SSR) antenna used for air traffic control surveillance. Antenna rotation rate variations may intro...

  15. Testing and Validating Machine Learning Classifiers by Metamorphic Testing

    PubMed Central

    Xie, Xiaoyuan; Ho, Joshua W. K.; Murphy, Christian; Kaiser, Gail; Xu, Baowen; Chen, Tsong Yueh

    2011-01-01

    Machine Learning algorithms have provided core functionality to many application domains, such as bioinformatics and computational linguistics. However, it is difficult to detect faults in such applications because often there is no “test oracle” to verify the correctness of the computed outputs. To help address software quality, in this paper we present a technique for testing the implementations of machine learning classification algorithms which support such applications. Our approach is based on the technique “metamorphic testing”, which has been shown to be effective in alleviating the oracle problem. Also presented are a case study on a real-world machine learning application framework, and a discussion of how programmers implementing machine learning algorithms can avoid the common pitfalls discovered in our study. We also conduct mutation analysis and cross-validation, which reveal that our method has high effectiveness in killing mutants, and that observing the expected cross-validation result alone is not sufficiently effective to detect faults in a supervised classification program. The effectiveness of metamorphic testing is further confirmed by the detection of real faults in a popular open-source classification program. PMID:21532969
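
    The idea of metamorphic testing described in this abstract can be illustrated with a small sketch. It is not the authors' implementation: the toy k-nearest-neighbour classifier, the three relations checked, and all names (`knn_predict`, `check_metamorphic_relations`, `TRAIN`, `LABELS`, `QUERIES`) are assumptions chosen for illustration. Each relation changes the input in a way that should leave the predictions unchanged, so no test oracle for the "correct" label is needed.

```python
import random
from collections import Counter


def knn_predict(train, labels, query, k=3):
    """Classify `query` by majority vote among its k nearest training points."""
    # Sort by (squared distance, label) so distance ties are resolved
    # independently of the order in which training samples were supplied.
    order = sorted(
        range(len(train)),
        key=lambda i: (sum((a - b) ** 2 for a, b in zip(train[i], query)), labels[i]),
    )
    votes = Counter(labels[i] for i in order[:k])
    # Break vote ties deterministically by label.
    return min(votes, key=lambda lab: (-votes[lab], lab))


def check_metamorphic_relations(train, labels, queries, k=3, seed=0):
    """Return {relation name: True if predictions were unchanged}."""
    rng = random.Random(seed)
    baseline = [knn_predict(train, labels, q, k) for q in queries]

    # MR1: shuffling the training samples must not change any prediction.
    idx = list(range(len(train)))
    rng.shuffle(idx)
    shuffled = [
        knn_predict([train[i] for i in idx], [labels[i] for i in idx], q, k)
        for q in queries
    ]

    # MR2: the same affine map applied to every feature of the training
    # points and the query preserves the distance ranking.
    def affine(point, scale=2, shift=5):
        return tuple(scale * x + shift for x in point)

    transformed = [
        knn_predict([affine(p) for p in train], labels, affine(q), k)
        for q in queries
    ]

    # MR3: reordering the attributes consistently leaves distances unchanged.
    def reverse(point):
        return tuple(reversed(point))

    reordered = [
        knn_predict([reverse(p) for p in train], labels, reverse(q), k)
        for q in queries
    ]

    return {
        "permute_samples": shuffled == baseline,
        "affine_transform": transformed == baseline,
        "permute_attributes": reordered == baseline,
    }


# Hypothetical integer-valued data, so the arithmetic is exact.
TRAIN = [(0, 0), (1, 0), (0, 1), (5, 5), (6, 5), (5, 6)]
LABELS = ["a", "a", "a", "b", "b", "b"]
QUERIES = [(1, 1), (5, 4), (3, 3)]

if __name__ == "__main__":
    print(check_metamorphic_relations(TRAIN, LABELS, QUERIES))
```

    A relation that reports False points to a fault in the implementation (or to a relation that does not actually hold for the algorithm), even though the true label of each query is never consulted.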

  16. Development and Validity Testing of the Worksite Health Index: An Assessment Tool to Help and Improve Korean Employees' Health-Related Outcome.

    PubMed

    Yun, Young Ho; Sim, Jin Ah; Lim, Ye Jin; Lim, Cheol Il; Kang, Sung-Choon; Kang, Joon-Ho; Park, Jun Dong; Noh, Dong Young

    2016-06-01

    The objective of this study was to develop the Worksite Health Index (WHI) and validate its psychometric properties. The development of the WHI questionnaire included item generation, item construction, and field testing. To assess the instrument's reliability and validity, we recruited 30 different Korean worksites. We developed the WHI questionnaire of 136 items categorized into five domains, namely Governance and Infrastructure, Need Assessment and Planning, Health Prevention and Promotion Program, Occupational Safety, and Monitoring and Feedback. All WHI domains demonstrated a high reliability with good internal consistency. The total WHI scores differentiated worksite groups effectively according to firm size. Each domain was associated significantly with employees' health status, absence, and financial outcome. The WHI can assess comprehensive worksite health programs. This tool is publicly available for addressing the growing need for worksite health programs.

  17. Reliability and criterion-related validity testing (construct) of the Endotracheal Suction Assessment Tool (ESAT©).

    PubMed

    Davies, Kylie; Bulsara, Max K; Ramelet, Anne-Sylvie; Monterosso, Leanne

    2018-05-01

    To establish criterion-related construct validity and test-retest reliability for the Endotracheal Suction Assessment Tool© (ESAT©). Endotracheal tube suction performed in children can significantly affect clinical stability. Previously identified clinical indicators for endotracheal tube suction were used as criteria when designing the ESAT©. Content validity was reported previously. The final stages of psychometric testing are presented. Observational testing was used to measure construct validity and determine whether the ESAT© could guide "inexperienced" paediatric intensive care nurses' decision-making regarding endotracheal tube suction. Test-retest reliability of the ESAT© was performed at two time points. The researchers and paediatric intensive care nurse "experts" developed 10 hypothetical clinical scenarios with predetermined endotracheal tube suction outcomes. "Experienced" (n = 12) and "inexperienced" (n = 14) paediatric intensive care nurses were presented with the scenarios and the ESAT© guiding decision-making about whether to perform endotracheal tube suction for each scenario. Outcomes were compared with those predetermined by the "experts" (n = 9). Test-retest reliability of the ESAT© was measured at two consecutive time points (4 weeks apart) with "experienced" and "inexperienced" paediatric intensive care nurses using the same scenarios and tool to guide decision-making. No differences were observed between endotracheal tube suction decisions made by "experts" (n = 9), "inexperienced" (n = 14) and "experienced" (n = 12) nurses confirming the tool's construct validity. No differences were observed between groups for endotracheal tube suction decisions at T1 and T2. Criterion-related construct validity and test-retest reliability of the ESAT© were demonstrated. Further testing is recommended to confirm reliability in the clinical setting with the "inexperienced" nurse to guide decision-making related to endotracheal tube

  18. Underground Test Area Subproject Project Management Plan, Revision 1

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    NONE

    1998-06-03

    This Project Management Plan (PMP) describes the manner in which the US Department of Energy Nevada Operations Office (DOE/NV) will manage the Underground Test Area (UGTA) Subproject at the Nevada Test Site (NTS). It provides the basic guidance for implementation and the organizational structure for meeting the UGTA objectives.

  19. Validating a UAV artificial intelligence control system using an autonomous test case generator

    NASA Astrophysics Data System (ADS)

    Straub, Jeremy; Huber, Justin

    2013-05-01

    The validation of safety-critical applications, such as autonomous UAV operations in an environment which may include human actors, is an ill-posed problem. To build confidence in the autonomous control technology, numerous scenarios must be considered. This paper expands upon previous work, related to autonomous testing of robotic control algorithms in a two-dimensional plane, to evaluate the suitability of similar techniques for validating artificial intelligence control in three dimensions, where a minimum level of airspeed must be maintained. The results of human-conducted testing are compared to this automated testing in terms of error detection, speed and testing cost.

  20. Ultrasonic linear array validation via concrete test blocks

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Hoegh, Kyle; Khazanovich, Lev; Ferraro, Chris

    2015-03-31

    Oak Ridge National Laboratory (ORNL) comparatively evaluated the ability of a number of NDE techniques to generate an image of the volume of 6.5′ X 5.0′ X 10″ concrete specimens fabricated at the Florida Department of Transportation (FDOT) NDE Validation Facility in Gainesville, Florida. These test blocks were fabricated to test the ability of various NDE methods to characterize various placements and sizes of rebar as well as simulated cracking and non-consolidation flaws. The first version of the ultrasonic linear array device, MIRA [version 1], was one of 7 different pieces of NDE equipment used to characterize the specimens. This paper deals with the ability of this equipment to determine subsurface characterizations such as reinforcing steel relative size, concrete thickness, irregularities, and inclusions using Kirchhoff-based migration techniques. The ability of individual synthetic aperture focusing technique (SAFT) B-scan cross sections resulting from self-contained scans is compared with various processing, analysis, and interpretation methods using the various features fabricated in the specimens for validation. The performance is detailed, especially with respect to the limitations and implications for evaluation of thicker, more heavily reinforced concrete structures.

  1. Testing of the Crew Exploration Vehicle in NASA Langley's Unitary Plan Wind Tunnel

    NASA Technical Reports Server (NTRS)

    Murphy, Kelly J.; Borg, Stephen E.; Watkins, Anthony N.; Cole, Daniel R.; Schwartz, Richard J.

    2007-01-01

    As part of a strategic, multi-facility test program, subscale testing of NASA's Crew Exploration Vehicle was conducted in both legs of NASA Langley's Unitary Plan Wind Tunnel. The objectives of these tests were to generate aerodynamic and surface pressure data over a range of supersonic Mach numbers and reentry angles of attack for experimental and computational validation and aerodynamic database development. To provide initial information on boundary layer transition at supersonic test conditions, transition studies were conducted using temperature sensitive paint and infrared thermography optical techniques. To support implementation of these optical diagnostics in the Unitary Wind Tunnel, the experiment was first modeled using the Virtual Diagnostics Interface software. For reentry orientations of 140 to 170 degrees (heat shield forward), windward surface flow was entirely laminar for freestream unit Reynolds numbers equal to or less than 3 million per foot. Optical techniques showed qualitative evidence of forced transition on the windward heat shield with application of both distributed grit and discrete trip dots. Longitudinal static force and moment data showed the largest differences with Mach number and angle of attack variations. Differences associated with Reynolds number variation and/or laminar versus turbulent flow on the heat shield were very small. Static surface pressure data supported the aforementioned trends with Mach number, Reynolds number, and angle of attack.

  2. Atmospheric Reentry Materials and Structures Evaluation Facility (ARMSEF). User Test Planning Guide

    NASA Technical Reports Server (NTRS)

    2011-01-01

    Test process, milestones and inputs are unknowns to first-time users of the ARMSEF. The User Test Planning Guide aids in establishing expectations for both NASA and non-NASA facility customers. The potential audience for this guide includes both internal and commercial spaceflight hardware/software developers. It is intended to assist their test engineering personnel in test planning and execution. Material covered includes a roadmap of the test process, roles and responsibilities of facility and user, major milestones, facility capabilities, and inputs required by the facility. Samples of deliverables, test article interfaces, and inputs necessary to define test scope, cost, and schedule are included as an appendix to the guide.

  3. Recommendations for elaboration, transcultural adaptation and validation process of tests in Speech, Hearing and Language Pathology.

    PubMed

    Pernambuco, Leandro; Espelt, Albert; Magalhães, Hipólito Virgílio; Lima, Kenio Costa de

    2017-06-08

    To present a guide with recommendations for translation, adaptation, elaboration and process of validation of tests in Speech and Language Pathology. The recommendations were based on international guidelines with a focus on the elaboration, translation, cross-cultural adaptation and validation process of tests. The recommendations were grouped into two Charts, one of them with procedures for translation and transcultural adaptation and the other for obtaining evidence of validity, reliability and measures of accuracy of the tests. A guide with norms for the organization and systematization of the process of elaboration, translation, cross-cultural adaptation and validation process of tests in Speech and Language Pathology was created.

  4. The Advantages of Using Planned Comparisons over Post Hoc Tests.

    ERIC Educational Resources Information Center

    Kuehne, Carolyn C.

    There are advantages to using a priori or planned comparisons rather than omnibus multivariate analysis of variance (MANOVA) tests followed by post hoc or a posteriori testing. A small heuristic data set is used to illustrate these advantages. An omnibus MANOVA test was performed on the data followed by a post hoc test (discriminant analysis). A…

  5. Development of Modal Test Techniques for Validation of a Solar Sail Design

    NASA Technical Reports Server (NTRS)

    Gaspar, James L.; Mann, Troy; Behun, Vaughn; Wilkie, W. Keats; Pappa, Richard

    2004-01-01

    This paper focuses on the development of modal test techniques for validation of a solar sail gossamer space structure design. The major focus is on validating and comparing the capabilities of various excitation techniques for modal testing solar sail components. One triangular shaped quadrant of a solar sail membrane was tested in a 1 Torr vacuum environment using various excitation techniques, including magnetic excitation and surface-bonded piezoelectric patch actuators. Results from modal tests performed on the sail using piezoelectric patches at different positions are discussed. The excitation methods were evaluated for their applicability to in-vacuum ground testing and to the development of on-orbit flight test techniques. The solar sail membrane was tested in the horizontal configuration at various tension levels to assess the variation in frequency with tension in a vacuum environment. A segment of a solar sail mast prototype was also tested in ambient atmospheric conditions using various excitation techniques, and these methods are also assessed for their ground test capabilities and on-orbit flight testing.

  6. Test, Control and Monitor System (TCMS) operations plan

    NASA Technical Reports Server (NTRS)

    Macfarlane, C. K.; Conroy, M. P.

    1993-01-01

    The purpose is to provide a clear understanding of the Test, Control and Monitor System (TCMS) operating environment and to describe the method of operations for TCMS. TCMS is a complex and sophisticated checkout system focused on support of the Space Station Freedom Program (SSFP) and related activities. An understanding of the TCMS operating environment is provided and operational responsibilities are defined. NASA and the Payload Ground Operations Contractor (PGOC) will use it as a guide to manage the operation of the TCMS computer systems and associated networks and workstations. All TCMS operational functions are examined. Other plans and detailed operating procedures relating to an individual operational function are referenced within this plan. This plan augments existing Technical Support Management Directives (TSMD's), Standard Practices, and other management documentation which will be followed where applicable.

  7. Proposal and validation of a clinical trunk control test in individuals with spinal cord injury.

    PubMed

    Quinzaños, J; Villa, A R; Flores, A A; Pérez, R

    2014-06-01

    One of the problems that arise in spinal cord injury (SCI) is alteration in trunk control. Despite the need for standardized scales, these do not exist for evaluating trunk control in SCI. To propose and validate a trunk control test in individuals with SCI. National Institute of Rehabilitation, Mexico. The test was developed and later evaluated for reliability and criterion, content, and construct validity. We carried out 531 tests on 177 patients and found high inter- and intra-rater reliability. In terms of criterion validity, analysis of variance demonstrated a statistically significant difference in the test score of patients with adequate or inadequate trunk control according to the assessment of a group of experts. A receiver operating characteristic curve was plotted for optimizing the instrument's cutoff point, which was determined at 13 points, with a sensitivity of 98% and a specificity of 92.2%. With regard to construct validity, the correlation between the proposed test and the spinal cord independence measure (SCIM) was 0.873 (P=0.001) and that with the evolution time was 0.437 (P=0.001). For testing the hypothesis with qualitative variables, the Kruskal-Wallis test was performed, which resulted in a statistically significant difference between the scores in the proposed scale of each group defined by these variables. It was proven experimentally that the proposed trunk control test is valid and reliable. Furthermore, the test can be used for all patients with SCI regardless of the type and level of injury.
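
    The abstract does not say which criterion was used to pick the cutoff from the ROC curve. One common choice, shown here purely as an assumption, is to maximise Youden's index (sensitivity + specificity - 1). The scores and expert ratings below are invented for illustration and are not the study's data; the function names are hypothetical.

```python
def sensitivity_specificity(scores, adequate, cutoff):
    """Treat score >= cutoff as a call of 'adequate trunk control'.

    `adequate` holds the reference (expert) rating for each score.
    """
    tp = sum(1 for s, a in zip(scores, adequate) if s >= cutoff and a)
    fn = sum(1 for s, a in zip(scores, adequate) if s < cutoff and a)
    tn = sum(1 for s, a in zip(scores, adequate) if s < cutoff and not a)
    fp = sum(1 for s, a in zip(scores, adequate) if s >= cutoff and not a)
    return tp / (tp + fn), tn / (tn + fp)


def best_cutoff(scores, adequate):
    """Return the observed score that maximises Youden's index."""
    def youden(cutoff):
        sens, spec = sensitivity_specificity(scores, adequate, cutoff)
        return sens + spec - 1

    return max(sorted(set(scores)), key=youden)


# Invented example data: test scores and the experts' reference rating.
SCORES = [5, 8, 10, 12, 12, 14, 15, 17, 19, 21]
ADEQUATE = [False, False, False, False, False, True, True, True, True, True]

if __name__ == "__main__":
    cutoff = best_cutoff(SCORES, ADEQUATE)
    print(cutoff, sensitivity_specificity(SCORES, ADEQUATE, cutoff))
```

    Other criteria (for example, the point closest to the top-left corner of the ROC plot, or a cutoff weighted by the cost of each error type) can select a different threshold from the same curve.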

  8. 7. PHOTOCOPY, PLANS, ELEVATIONS, AND SECTION DRAWING FOR MISSILE TEST ...

    Library of Congress Historic Buildings Survey, Historic Engineering Record, Historic Landscapes Survey

    7. PHOTOCOPY, PLANS, ELEVATIONS, AND SECTION DRAWING FOR MISSILE TEST AND ASSEMBLY BUILDING. - NIKE Missile Base SL-40, Missile Test & Assembly Building, South end of launch area, northeast of Generator Building No. 3, Hecker, Monroe County, IL

  9. Performance validity testing in neuropsychology: a clinical guide, critical review, and update on a rapidly evolving literature.

    PubMed

    Lippa, Sara M

    2018-04-01

    Over the past two decades, there has been much research on measures of response bias and myriad measures have been validated in a variety of clinical and research samples. This critical review aims to guide clinicians through the use of performance validity tests (PVTs) from test selection and administration through test interpretation and feedback. Recommended cutoffs and relevant test operating characteristics are presented. Other important issues to consider during test selection, administration, interpretation, and feedback are discussed including order effects, coaching, impact on test data, and methods to combine measures and improve predictive power. When interpreting performance validity measures, neuropsychologists must use particular caution in cases of dementia, low intelligence, English as a second language/minority cultures, or low education. PVTs provide valuable information regarding response bias and, under the right circumstances, can provide excellent evidence of response bias. Only after consideration of the entire clinical picture, including validity test performance, can concrete determinations regarding the validity of test data be made.

  10. Integration of oncologic margins in three-dimensional virtual planning for head and neck surgery, including a validation of the software pathway.

    PubMed

    Kraeima, Joep; Schepers, Rutger H; van Ooijen, Peter M A; Steenbakkers, Roel J H M; Roodenburg, Jan L N; Witjes, Max J H

    2015-10-01

    Three-dimensional (3D) virtual planning of reconstructive surgery, after resection, is a frequently used method for improving accuracy and predictability. However, when applied to malignant cases, the planning of the oncologic resection margins is difficult due to visualisation of tumours in the current 3D planning. Embedding tumour delineation on a magnetic resonance image, similar to the routinely performed radiotherapeutic contouring of tumours, is expected to provide better margin planning. A new software pathway was developed for embedding tumour delineation on magnetic resonance imaging (MRI) within the 3D virtual surgical planning. The software pathway was validated by the use of five bovine cadavers implanted with phantom tumour objects. MRI and computed tomography (CT) images were fused and the tumour was delineated using radiation oncology software. This data was converted to the 3D virtual planning software by means of a conversion algorithm. Tumour volumes and localization were determined in both software stages for comparison analysis. The approach was applied to three clinical cases. A conversion algorithm was developed to translate the tumour delineation data to the 3D virtual plan environment. The average difference in volume of the tumours was 1.7%. This study reports a validated software pathway, providing multi-modality image fusion for 3D virtual surgical planning. Copyright © 2015 European Association for Cranio-Maxillo-Facial Surgery. Published by Elsevier Ltd. All rights reserved.

  11. Reliability and Validity of the Inline Skating Skill Test.

    PubMed

    Radman, Ivan; Ruzic, Lana; Padovan, Viktoria; Cigrovski, Vjekoslav; Podnar, Hrvoje

    2016-09-01

    This study aimed to examine the reliability and validity of the inline skating skill test. Based on previous skating experience forty-two skaters (26 female and 16 male) were randomized into two groups (competitive level vs. recreational level). They performed the test four times, with a recovery time of 45 minutes between sessions. Prior to testing, the participants rated their skating skill using a scale from 1 to 10. The protocol included performance time measurement through a course, combining different skating techniques. Trivial changes in performance time between the repeated sessions were determined in both competitive females/males and recreational females/males (-1.7% [95% CI: -5.8-2.6%] - 2.2% [95% CI: 0.0-4.5%]). In all four subgroups, the skill test had a low mean within-individual variation (1.6% [95% CI: 1.2-2.4%] - 2.7% [95% CI: 2.1-4.0%]) and high mean inter-session correlation (ICC = 0.97 [95% CI: 0.92-0.99] - 0.99 [95% CI: 0.98-1.00]). The comparison of detected typical errors and smallest worthwhile changes (calculated as standard deviations × 0.2) revealed that the skill test was able to track changes in skaters' performances. Competitive-level skaters needed shorter time (24.4-26.4%, all p < 0.01) to complete the test in comparison to recreational-level skaters. Moreover, moderate correlation (ρ = 0.80-0.82; all p < 0.01) was observed between the participant's self-rating and achieved performance times. In conclusion, the proposed test is a reliable and valid method to evaluate inline skating skills in amateur competitive and recreational level skaters. Further studies are needed to evaluate the reproducibility of this skill test in different populations including elite inline skaters.
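
    The comparison of typical error with smallest worthwhile change mentioned in this abstract can be sketched as follows. The 0.2 multiplier comes from the abstract; computing typical error as the standard deviation of between-session differences divided by the square root of 2 is a common convention assumed here, not something the abstract states. The trial times and function names are hypothetical.

```python
import math
from statistics import stdev


def typical_error(session1, session2):
    """Typical error: SD of the between-session differences divided by sqrt(2)."""
    diffs = [b - a for a, b in zip(session1, session2)]
    return stdev(diffs) / math.sqrt(2)


def smallest_worthwhile_change(values, factor=0.2):
    """Smallest worthwhile change: `factor` times the between-subject SD."""
    return factor * stdev(values)


# Invented performance times (seconds) for four skaters over two sessions.
SESSION_1 = [30.0, 32.0, 35.0, 40.0]
SESSION_2 = [30.5, 31.5, 35.5, 39.5]

if __name__ == "__main__":
    te = typical_error(SESSION_1, SESSION_2)
    swc = smallest_worthwhile_change(SESSION_1)
    # A typical error below the smallest worthwhile change suggests the test
    # can detect changes that matter in practice.
    print(round(te, 3), round(swc, 3), te < swc)
```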

  12. MIDURA (Minefield Detection Using Reconnaissance Assets) 1982-1983 Experimental Test Plan.

    DTIC Science & Technology

    1982-04-01

    3.2.4.2 Subjective Validation at the Salem ONG 27 3.2.4.3 Objective Validity at Fort Huachuca 28 4. TEST FLIGHTS AT ARRAYS IIa, IIb, IIIa AND IIIb...subjective validation at the Salem ONG; (3) objective validation at Fort Huachuca. 3.2.4.1 Subjective Image Interpretation at ERIM The initial phase...The ERIM II’s will provide for each image estimate of PD, Pc and PFA on a 0.00 to 1.00 scale. P is defined as the subjective probability estimate that

  13. Converting Hangar High Expansion Foam Systems to Prevent Cockpit Damage: Full-Scale Validation Tests

    DTIC Science & Technology

    2017-09-01

    AFCEC-CO-TY-TR-2018-0001 CONVERTING HANGAR HIGH EXPANSION FOAM SYSTEMS TO PREVENT COCKPIT DAMAGE: FULL-SCALE VALIDATION TESTS Gerard G...REPORT NUMBER(S) 12. DISTRIBUTION/ AVAILABILITY STATEMENT 13. SUPPLEMENTARY NOTES 14. ABSTRACT 15. SUBJECT TERMS 16. SECURITY CLASSIFICATION OF: a. REPORT b...09-2017 Final Test Report May 2017 Converting Hangar High Expansion Foam Systems to Prevent Cockpit Damage: Full-Scale Validation Tests N00173-15-D

  14. The bogus taste test: Validity as a measure of laboratory food intake.

    PubMed

    Robinson, Eric; Haynes, Ashleigh; Hardman, Charlotte A; Kemps, Eva; Higgs, Suzanne; Jones, Andrew

    2017-09-01

    Because overconsumption of food contributes to ill health, understanding what affects how much people eat is of importance. The 'bogus' taste test is a measure widely used in eating behaviour research to identify factors that may have a causal effect on food intake. However, there has been no examination of the validity of the bogus taste test as a measure of food intake. We conducted a participant level analysis of 31 published laboratory studies that used the taste test to measure food intake. We assessed whether the taste test was sensitive to experimental manipulations hypothesized to increase or decrease food intake. We examined construct validity by testing whether participant sex, hunger and liking of taste test food were associated with the amount of food consumed in the taste test. In addition, we also examined whether BMI (body mass index), trait measures of dietary restraint and over-eating in response to palatable food cues were associated with food consumption. Results indicated that the taste test was sensitive to experimental manipulations hypothesized to increase or decrease food intake. Factors that were reliably associated with increased consumption during the taste test were being male, having higher baseline hunger, liking of the taste test food and a greater tendency to overeat in response to palatable food cues, whereas trait dietary restraint and BMI were not. These results indicate that the bogus taste test is likely to be a valid measure of food intake and can be used to identify factors that have a causal effect on food intake. Copyright © 2017 The Authors. Published by Elsevier Ltd. All rights reserved.

  15. 49 CFR 232.505 - Pre-revenue service acceptance testing plan.

    Code of Federal Regulations, 2013 CFR

    2013-10-01

    ... acceptance tests; (3) Correct any safety deficiencies identified by FRA in the design of the equipment or in... principal test objectives shall be to demonstrate that the equipment meets the safety design and performance... 49 Transportation 4 2013-10-01 2013-10-01 false Pre-revenue service acceptance testing plan. 232...

  16. 49 CFR 232.505 - Pre-revenue service acceptance testing plan.

    Code of Federal Regulations, 2012 CFR

    2012-10-01

    ... acceptance tests; (3) Correct any safety deficiencies identified by FRA in the design of the equipment or in... principal test objectives shall be to demonstrate that the equipment meets the safety design and performance... 49 Transportation 4 2012-10-01 2012-10-01 false Pre-revenue service acceptance testing plan. 232...

  17. 49 CFR 232.505 - Pre-revenue service acceptance testing plan.

    Code of Federal Regulations, 2014 CFR

    2014-10-01

    ... acceptance tests; (3) Correct any safety deficiencies identified by FRA in the design of the equipment or in... principal test objectives shall be to demonstrate that the equipment meets the safety design and performance... 49 Transportation 4 2014-10-01 2014-10-01 false Pre-revenue service acceptance testing plan. 232...

  18. 49 CFR 232.505 - Pre-revenue service acceptance testing plan.

    Code of Federal Regulations, 2011 CFR

    2011-10-01

    ... acceptance tests; (3) Correct any safety deficiencies identified by FRA in the design of the equipment or in... principal test objectives shall be to demonstrate that the equipment meets the safety design and performance... 49 Transportation 4 2011-10-01 2011-10-01 false Pre-revenue service acceptance testing plan. 232...

  19. Reliability and criterion-related validity of a new repeated agility test

    PubMed Central

    Makni, E; Jemni, M; Elloumi, M; Chamari, K; Nabli, MA; Padulo, J; Moalla, W

    2016-01-01

    The study aimed to assess the reliability and the criterion-related validity of a new repeated sprint T-test (RSTT) that includes intense multidirectional intermittent efforts. The RSTT consisted of 7 maximal repeated executions of the agility T-test with 25 s of passive recovery rest in between. Forty-five team sports players performed two RSTTs separated by 3 days to assess the reliability of best time (BT) and total time (TT) of the RSTT. The intra-class correlation coefficient analysis revealed a high relative reliability between test and retest for BT and TT (>0.90). The standard error of measurement (<0.50) showed that the RSTT has a good absolute reliability. The minimal detectable change values for BT and TT related to the RSTT were 0.09 s and 0.58 s, respectively. To check the criterion-related validity of the RSTT, players performed a repeated linear sprint (RLS) and a repeated sprint with changes of direction (RSCD). Significant correlations between the BT and TT of the RLS, RSCD and RSTT were observed (p<0.001). The RSTT is, therefore, a reliable and valid measure of the intermittent repeated sprint agility performance. As this ability is required in all team sports, it is suggested that team sports coaches, fitness coaches and sports scientists consider this test in their training follow-up. PMID:27274109
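
    The abstract reports a standard error of measurement (SEM) and minimal detectable change (MDC) without giving formulas. A widely used pair of definitions, assumed here rather than taken from the paper, is SEM = SD * sqrt(1 - ICC) and MDC95 = 1.96 * sqrt(2) * SEM. The input values in the example are illustrative only.

```python
import math


def standard_error_of_measurement(sd, icc):
    """SEM from the between-subject SD and the intra-class correlation."""
    return sd * math.sqrt(1.0 - icc)


def minimal_detectable_change(sem, z=1.96):
    """MDC at the confidence level implied by z (1.96 for 95%).

    The sqrt(2) factor accounts for measurement error in both the test
    and the retest.
    """
    return z * math.sqrt(2.0) * sem


if __name__ == "__main__":
    # Illustrative inputs, not values from the study.
    sem = standard_error_of_measurement(sd=1.0, icc=0.91)
    print(round(sem, 3), round(minimal_detectable_change(sem), 3))
```

    A change in an athlete's score smaller than the MDC cannot be distinguished from measurement noise at the chosen confidence level.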

  20. The Thinking-about-Derivative Test for Undergraduate Students: Development and Validation

    ERIC Educational Resources Information Center

    Aydin, Utkun; Ubuz, Behiye

    2015-01-01

    Two studies were conducted for the development and validation of a multidimensional test to assess undergraduate students' mathematical thinking about derivative. The first study involved two phases: question generation and refinement of the Thinking-about-Derivative Test (TDT). The second study included four phases as follows: test…

  1. Performance validation of the ANSER control laws for the F-18 HARV

    NASA Technical Reports Server (NTRS)

    Messina, Michael D.

    1995-01-01

    The ANSER control laws were implemented in Ada by NASA Dryden for flight test on the High Alpha Research Vehicle (HARV). The Ada implementation was tested in the hardware-in-the-loop (HIL) simulation, and results were compared to those obtained with the NASA Langley batch Fortran implementation of the control laws, which is considered the 'truth model.' This report documents the performance validation test results between these implementations. This report contains the ANSER performance validation test plan, HIL versus batch time-history comparisons, simulation scripts used to generate checkcases, and detailed analysis of discrepancies discovered during testing.

  2. Performance validation of the ANSER Control Laws for the F-18 HARV

    NASA Technical Reports Server (NTRS)

    Messina, Michael D.

    1995-01-01

    The ANSER control laws were implemented in Ada by NASA Dryden for flight test on the High Alpha Research Vehicle (HARV). The Ada implementation was tested in the hardware-in-the-loop (HIL) simulation, and results were compared to those obtained with the NASA Langley batch Fortran implementation of the control laws, which is considered the 'truth model'. This report documents the performance validation test results between these implementations. This report contains the ANSER performance validation test plan, HIL versus batch time-history comparisons, simulation scripts used to generate checkcases, and detailed analysis of discrepancies discovered during testing.

  3. Development and validation of a new instrument for testing functional health literacy in Japanese adults.

    PubMed

    Nakagami, Katsuyuki; Yamauchi, Toyoaki; Noguchi, Hiroyuki; Maeda, Tohru; Nakagami, Tomoko

    2014-06-01

    This study aimed to develop a reliable and valid measure of functional health literacy in a Japanese clinical setting. Test development consisted of three phases: generation of an item pool, consultation with experts to assess content validity, and comparison with external criteria (the Japanese Health Knowledge Test) to assess criterion validity. A trial version of the test was administered to 535 Japanese outpatients. Internal consistency reliability, calculated by Cronbach's alpha, was 0.81, and concurrent validity was moderate. Receiver Operating Characteristics and Item Response Theory were used to classify patients as having adequate, marginal, or inadequate functional health literacy. Both inadequate and marginal functional health literacy were associated with older age, lower income, lower educational attainment, and poor health knowledge. The time required to complete the test was 10-15 min. This test should enable health workers to better identify patients with inadequate health literacy. © 2013 Wiley Publishing Asia Pty Ltd.
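
    The internal consistency figure above is a Cronbach's alpha. A minimal sketch of the standard formula, alpha = k/(k-1) * (1 - sum of item variances / variance of total scores), using only the Python standard library; the function name and the toy score matrix are hypothetical and are not data from the study:

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a respondents-by-items score matrix.

    scores: list of rows, one row per respondent, each row holding
    that respondent's score on each of the k items (k >= 2).
    """
    k = len(scores[0])
    # Transpose so that each tuple holds all responses to one item.
    items = list(zip(*scores))
    item_var_sum = sum(variance(col) for col in items)
    total_var = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - item_var_sum / total_var)


if __name__ == "__main__":
    # Toy data: three respondents, two items (illustrative only).
    toy = [[1, 2], [2, 1], [3, 3]]
    print(round(cronbach_alpha(toy), 3))
```

    The sketch uses sample variances (n - 1 denominator) throughout; the ratio is unchanged if population variances are used consistently instead.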

  4. Testing the Predictive Validity of the Hendrich II Fall Risk Model.

    PubMed

    Jung, Hyesil; Park, Hyeoun-Ae

    2018-03-01

    Cumulative data on patient fall risk have been compiled in electronic medical records systems, and it is possible to test the validity of fall-risk assessment tools using these data between the times of admission and occurrence of a fall. The Hendrich II Fall Risk Model scores assessed at three time points during hospital stays were extracted and used for testing the predictive validity: (a) upon admission, (b) at the time of the maximum fall-risk score between admission and falling or discharge, and (c) immediately before falling or discharge. Predictive validity was examined using seven predictive indicators. In addition, logistic regression analysis was used to identify factors that significantly affect the occurrence of a fall. Among the different time points, the maximum fall-risk score assessed between admission and falling or discharge showed the best predictive performance. Confusion or disorientation and having a poor ability to rise from a sitting position were significant risk factors for a fall.

  5. Integrated Test and Evaluation Flight Test 3 Flight Test Plan

    NASA Technical Reports Server (NTRS)

    Marston, Michael Lawrence

    2015-01-01

    The desire and ability to fly Unmanned Aircraft Systems (UAS) in the National Airspace System (NAS) is of increasing urgency. The application of unmanned aircraft to perform national security, defense, scientific, and emergency management is driving the critical need for less restrictive access by UAS to the NAS. UAS represent a new capability that will provide a variety of services in the government (public) and commercial (civil) aviation sectors. The growth of this potential industry has not yet been realized due to the lack of a common understanding of what is required to safely operate UAS in the NAS. NASA's UAS Integration into the NAS Project is conducting research in the areas of Separation Assurance/Sense and Avoid Interoperability, Human Systems Integration (HSI), and Communication to support reducing the barriers of UAS access to the NAS. This research is broken into two research themes, namely UAS Integration and Test Infrastructure. UAS Integration focuses on airspace integration procedures and performance standards to enable UAS integration in the air transportation system, covering Sense and Avoid (SAA) performance standards, command and control performance standards, and human systems integration. The focus of Test Infrastructure is to enable development and validation of airspace integration procedures and performance standards, including the integrated test and evaluation. In support of the integrated test and evaluation efforts, the Project will develop an adaptable, scalable, and schedulable relevant test environment capable of evaluating concepts and technologies for unmanned aircraft systems to safely operate in the NAS. To accomplish this task, the Project will conduct a series of Human-in-the-Loop and Flight Test activities that integrate key concepts, technologies and/or procedures in a relevant air traffic environment. Each of the integrated events will build on the technical achievements, fidelity and complexity of the previous tests and

  6. EBMUD Drought Planning Put to the Test in 2014

    NASA Astrophysics Data System (ADS)

    Bray, B. S.

    2014-12-01

    The East Bay Municipal Utility District faced challenges in managing limited supplies to reliably serve its customers during the unprecedented 2014 drought. The District's successful drought planning required a multi-faceted plan to preserve a reliable water supply, now and into the future. Planning has included investments in recycled water projects, passive and active customer conservation programs, and pursuit of alternative water supply options. EBMUD's drought planning efforts were tested in 2014, when California experienced one of the driest years on record and the 2nd driest year in the Mokelumne Watershed, the source of 90% of the District's water supply. This presentation will highlight the effectiveness of drought planning in three areas: (1) implementing 10% water conservation as of July 2014, (2) securing nearly 20 TAF of supplemental water supply conveyed through the Freeport Regional Water Project, and (3) operating EBMUD's Mokelumne River Project to meet fishery flow and water quality objectives.

  7. Reliability and Validity Testing of the Physical Resilience Measure

    ERIC Educational Resources Information Center

    Resnick, Barbara; Galik, Elizabeth; Dorsey, Susan; Scheve, Ann; Gutkin, Susan

    2011-01-01

    Objective: The purpose of this study was to test reliability and validity of the Physical Resilience Scale. Methods: A single-group repeated measure design was used and 130 older adults from three different housing sites participated. Participants completed the Physical Resilience Scale, Hardy-Gill Resilience Scale, 14-item Resilience Scale,…

  8. Continual planning and scheduling for managing patient tests in hospital laboratories.

    PubMed

    Marinagi, C C; Spyropoulos, C D; Papatheodorou, C; Kokkotos, S

    2000-10-01

    Hospital laboratories perform examination tests upon patients, in order to assist medical diagnosis or therapy progress. Planning and scheduling patient requests for examination tests is a complicated problem because it concerns both minimization of patient stay in hospital and maximization of laboratory resources utilization. In the present paper, we propose an integrated patient-wise planning and scheduling system which supports the dynamic and continual nature of the problem. The proposed combination of multiagent and blackboard architecture allows the dynamic creation of agents that share a set of knowledge sources and a knowledge base to service patient test requests.

  9. Contemporary Test Validity in Theory and Practice: A Primer for Discipline-Based Education Researchers.

    PubMed

    Reeves, Todd D; Marbach-Ad, Gili

    2016-01-01

    Most discipline-based education researchers (DBERs) were formally trained in the methods of scientific disciplines such as biology, chemistry, and physics, rather than social science disciplines such as psychology and education. As a result, DBERs may have never taken specific courses in the social science research methodology--either quantitative or qualitative--on which their scholarship often relies so heavily. One particular aspect of (quantitative) social science research that differs markedly from disciplines such as biology and chemistry is the instrumentation used to quantify phenomena. In response, this Research Methods essay offers a contemporary social science perspective on test validity and the validation process. The instructional piece explores the concepts of test validity, the validation process, validity evidence, and key threats to validity. The essay also includes an in-depth example of a validity argument and validation approach for a test of student argument analysis. In addition to DBERs, this essay should benefit practitioners (e.g., lab directors, faculty members) in the development, evaluation, and/or selection of instruments for their work assessing students or evaluating pedagogical innovations. © 2016 T. D. Reeves and G. Marbach-Ad. CBE—Life Sciences Education © 2016 The American Society for Cell Biology. This article is distributed by The American Society for Cell Biology under license from the author(s). It is available to the public under an Attribution–Noncommercial–Share Alike 3.0 Unported Creative Commons License (http://creativecommons.org/licenses/by-nc-sa/3.0).

  10. Validation of a Video-based Game-Understanding Test Procedure in Badminton.

    ERIC Educational Resources Information Center

    Blomqvist, Minna T.; Luhtanen, Pekka; Laakso, Lauri; Keskinen, Esko

    2000-01-01

    Reports the development and validation of video-based game-understanding tests in badminton for elementary and secondary students. The tests included different sequences that simulated actual game situations. Players had to solve tactical problems by selecting appropriate solutions and arguments for their decisions. Results suggest that the test…

  11. Validity Inferences under High-Stakes Conditions: A Response from Language Testing

    ERIC Educational Resources Information Center

    Hill, Kathryn; McNamara, Tim

    2015-01-01

    Those who work in second- and foreign-language testing often find Koretz's concern for validity inferences under high-stakes (VIHS) conditions both welcome and familiar. While the focus of the article is more narrowly on the potential for two instructional responses to test-based accountability, "reallocation" and "coaching,"…

  12. WEC-SIM Phase 1 Validation Testing -- Numerical Modeling of Experiments: Preprint

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Ruehl, Kelley; Michelen, Carlos; Bosma, Bret

    2016-08-01

    The Wave Energy Converter Simulator (WEC-Sim) is an open-source code jointly developed by Sandia National Laboratories and the National Renewable Energy Laboratory. It is used to model wave energy converters subjected to operational and extreme waves. In order for the WEC-Sim code to be beneficial to the wave energy community, code verification and physical model validation are necessary. This paper describes numerical modeling of the wave tank testing for the 1:33-scale experimental testing of the floating oscillating surge wave energy converter. The comparison between WEC-Sim and the Phase 1 experimental data set serves as code validation. This paper is a follow-up to the WEC-Sim paper on experimental testing, and describes the WEC-Sim numerical simulations for the floating oscillating surge wave energy converter.

  13. Validation of laboratory-scale recycling test method of paper PSA label products

    Treesearch

    Carl Houtman; Karen Scallon; Richard Oldack

    2008-01-01

    Starting with test methods and a specification developed by the U.S. Postal Service (USPS) Environmentally Benign Pressure Sensitive Adhesive Postage Stamp Program, a laboratory-scale test method and a specification were developed and validated for pressure-sensitive adhesive labels. By comparing results from this new test method and pilot-scale tests, which have been...

  14. CAPTIONALS: A computer aided testing environment for the verification and validation of communication protocols

    NASA Technical Reports Server (NTRS)

    Feng, C.; Sun, X.; Shen, Y. N.; Lombardi, Fabrizio

    1992-01-01

    This paper covers the verification and protocol validation for distributed computer and communication systems using a computer aided testing approach. Validation and verification make up the so-called process of conformance testing. Protocol applications which pass conformance testing are then checked to see whether they can operate together. This is referred to as interoperability testing. A new comprehensive approach to protocol testing is presented which addresses: (1) modeling for inter-layer representation for compatibility between conformance and interoperability testing; (2) computational improvement to current testing methods by using the proposed model inclusive of formulation of new qualitative and quantitative measures and time-dependent behavior; (3) analysis and evaluation of protocol behavior for interactive testing without extensive simulation.

  15. Integration and Test Flight Validation Plans for the Pulsed Plasma Thruster Experiment on EO-1

    NASA Technical Reports Server (NTRS)

    Zakrzwski, Charles; Benson, Scott; Sanneman, Paul; Hoskins, Andy; Bauer, Frank H. (Technical Monitor)

    2002-01-01

    The Pulsed Plasma Thruster (PPT) Experiment on the Earth Observing One (EO-1) spacecraft has been designed to demonstrate the capability of a new generation PPT to perform spacecraft attitude control. The PPT is a small, self-contained pulsed electromagnetic propulsion system capable of delivering high specific impulse (900-1200 s), very small impulse bits (10-1000 uN-s) at low average power (less than 1 to 100 W). Teflon fuel is ablated and slightly ionized by means of a capacitative discharge. The discharge also generates electromagnetic fields that accelerate the plasma by means of the Lorentz Force. EO-1 has a single PPT that can produce thrust in either the positive or negative pitch direction. The flight validation has been designed to demonstrate the ability of the PPT to provide precision pointing accuracy, response and stability, and to confirm benign plume and EMI effects. This paper will document the success of the flight validation.

  16. Planning or something else? Examining neuropsychological predictors of Zoo Map performance.

    PubMed

    Oosterman, Joukje M; Wijers, Marijn; Kessels, Roy P C

    2013-01-01

    The Zoo Map Test of the Behavioral Assessment of the Dysexecutive Syndrome battery is often applied to measure planning ability as part of executive function. Successful performance on this test is, however, dependent on various cognitive functions, and deficient Zoo Map performance does therefore not necessarily imply selectively disrupted planning abilities. To address this important issue, we examined whether planning is still the most important predictor of Zoo Map performance in a heterogeneous sample of neurologic and psychiatric outpatients (N = 71). In addition to the Zoo Map Test, the patients completed other neuropsychological tests of planning, inhibition, processing speed, and episodic memory. Planning was the strongest predictor of the total raw score and inappropriate places visited, and no additional contribution of other cognitive scores was found. One exception to this was the total time, which was associated with processing speed. Overall, our findings indicate that the Zoo Map Test is a valid indicator of planning ability in a heterogeneous patient sample.

  17. The Validity of Value-Added Estimates from Low-Stakes Testing Contexts: The Impact of Change in Test-Taking Motivation and Test Consequences

    ERIC Educational Resources Information Center

    Finney, Sara J.; Sundre, Donna L.; Swain, Matthew S.; Williams, Laura M.

    2016-01-01

    Accountability mandates often prompt assessment of student learning gains (e.g., value-added estimates) via achievement tests. The validity of these estimates has been questioned when performance on tests is low stakes for students. To assess the effects of motivation on value-added estimates, we assigned students to one of three test consequence…

  18. Clinical Validation of 4-Dimensional Computed Tomography Ventilation With Pulmonary Function Test Data

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Brennan, Douglas; Schubert, Leah; Diot, Quentin

    Purpose: A new form of functional imaging has been proposed in the form of 4-dimensional computed tomography (4DCT) ventilation. Because 4DCTs are acquired as part of routine care for lung cancer patients, calculating ventilation maps from 4DCTs provides spatial lung function information without added dosimetric or monetary cost to the patient. Before 4DCT-ventilation is implemented it needs to be clinically validated. Pulmonary function tests (PFTs) provide a clinically established way of evaluating lung function. The purpose of our work was to perform a clinical validation by comparing 4DCT-ventilation metrics with PFT data. Methods and Materials: Ninety-eight lung cancer patients with pretreatment 4DCT and PFT data were included in the study. Pulmonary function test metrics used to diagnose obstructive lung disease were recorded: forced expiratory volume in 1 second (FEV1) and FEV1/forced vital capacity. Four-dimensional CT data sets and spatial registration were used to compute 4DCT-ventilation images using a density change–based and a Jacobian-based model. The ventilation maps were reduced to single metrics intended to reflect the degree of ventilation obstruction. Specifically, we computed the coefficient of variation (SD/mean), ventilation V20 (volume of lung ≤20% ventilation), and correlated the ventilation metrics with PFT data. Regression analysis was used to determine whether 4DCT ventilation data could predict for normal versus abnormal lung function using PFT thresholds. Results: Correlation coefficients comparing 4DCT-ventilation with PFT data ranged from 0.63 to 0.72, with the best agreement between FEV1 and coefficient of variation. Four-dimensional CT ventilation metrics were able to significantly delineate between clinically normal versus abnormal PFT results. Conclusions: Validation of 4DCT ventilation with clinically relevant metrics is essential. We demonstrate good global agreement between PFTs and 4DCT-ventilation, indicating

  19. Rigging Test Bed Development for Validation of Multi-Stage Decelerator Extractions

    NASA Technical Reports Server (NTRS)

    Kenig, Sivan J.; Gallon, John C.; Adams, Douglas S.; Rivellini, Tommaso P.

    2013-01-01

    The Low Density Supersonic Decelerator project is developing new decelerator systems for Mars entry which would include testing with a Supersonic Flight Dynamics Test Vehicle. One of the decelerator systems being developed is a large supersonic ringsail parachute. Due to the configuration of the vehicle it is not possible to deploy the parachute with a mortar, which would be the preferred method for a spacecraft in a supersonic flow. Alternatively, a multi-stage extraction process using a ballute as a pilot is being developed for the test vehicle. The Rigging Test Bed is a test venue being constructed to perform verification and validation of this extraction process. The test bed consists of a long pneumatic piston device capable of providing a constant force simulating the ballute drag force during the extraction events. The extraction tests will take place both inside a high-bay for frequent tests of individual extraction stages and outdoors using a mobile hydraulic crane for complete deployment tests from initial pack pull out to canopy extraction. These tests will measure line tensions and use photogrammetry to track motion of the elements involved. The resulting data will be used to verify packing and rigging, as well as to validate models and identify potential failure modes in order to finalize the design of the extraction system.

  20. Validity of a basketball-specific complex test in female professional players.

    PubMed

    Schwesig, René; Hermassi, Souhail; Lauenroth, Andreas; Laudner, Kevin; Koke, Alexander; Bartels, Thomas; Delank, Stefan; Schulze, Stephan

    2018-06-01

    The purpose of this study was to assess the validity of a new basketball-specific complex test (BBCT) based on the ascertained match performance. Fourteen female professional basketball players (ages: 23.4 ± 1.8 years) performed the BBCT and a treadmill test (TT) at the beginning of pre-season training. Lactate, heart rate (HR), time, shooting precision and number of errors were measured during the four test sequences of the BBCT (short distance sprinting with direction changes, with and without a ball; fast break; lay-up parcours; sprint endurance test). In addition, lactate threshold (LT) and HR were assessed at selected times throughout the TT and the BBCT and over 6 (TT) or 10 (BBCT) minutes after the tests. The match performance score (mps) was calculated on specific parameters (e.g. points) collected during all matches during the subsequent season (22 matches). The mps served as the "gold standard" within the validation process for the BBCT and the TT. TT parameters demonstrated an explained variance (EV) between 0 % (HR recovery) and 11 % (running speed at 6 mmol/l LT). The EV from the BBCT was higher and ranged from 0 % (HR recovery 6 minutes after end of exercise) to 28 % (sprint endurance test after 8 of 10 sprints). Ten out of 21 BBCT parameters (48 %) and 2 out of 5 TT parameters (40 %) demonstrated an EV higher than 10 %. Average EV for all parameters was 12 % (BBCT) and 6 % (TT), respectively. The BBCT had a higher validity than the TT for predicting match performance. These findings suggest that coaches and scientists should consider using the BBCT testing protocol to estimate the match performance abilities of elite female players. © Georg Thieme Verlag KG Stuttgart · New York.
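
    The abstract does not say how explained variance (EV) was computed. A minimal sketch, assuming EV is the squared Pearson correlation between one test parameter and the match performance score; the function name and the toy numbers are hypothetical and are not data from the study:

```python
def explained_variance(x, y):
    """Squared Pearson correlation (r^2) between two equal-length samples.

    Assumption: EV is taken as r^2 of a simple linear relationship
    between a test parameter (x) and the criterion score (y).
    """
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return (sxy * sxy) / (sxx * syy)


if __name__ == "__main__":
    # Toy values only: a test parameter and a criterion score for three players.
    parameter = [1.0, 2.0, 3.0]
    criterion = [1.0, 3.0, 2.0]
    print(f"EV = {explained_variance(parameter, criterion):.0%}")
```

    Under this reading, an EV of 28 % corresponds to a correlation of roughly 0.53 in absolute value between the parameter and match performance.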