Sample records for observational study validating

  1. Assessing validity of observational intervention studies - the Benchmarking Controlled Trials.

    PubMed

    Malmivaara, Antti

    2016-09-01

    Benchmarking Controlled Trial (BCT) is a concept which covers all observational studies aiming to assess impact of interventions or health care system features to patients and populations. To create and pilot test a checklist for appraising methodological validity of a BCT. The checklist was created by extracting the most essential elements from the comprehensive set of criteria in the previous paper on BCTs. Also checklists and scientific papers on observational studies and respective systematic reviews were utilized. Ten BCTs published in the Lancet and in the New England Journal of Medicine were used to assess feasibility of the created checklist. The appraised studies seem to have several methodological limitations, some of which could be avoided in planning, conducting and reporting phases of the studies. The checklist can be used for planning, conducting, reporting, reviewing, and critical reading of observational intervention studies. However, the piloted checklist should be validated in further studies. Key messages Benchmarking Controlled Trial (BCT) is a concept which covers all observational studies aiming to assess impact of interventions or health care system features to patients and populations. This paper presents a checklist for appraising methodological validity of BCTs and pilot-tests the checklist with ten BCTs published in leading medical journals. The appraised studies seem to have several methodological limitations, some of which could be avoided in planning, conducting and reporting phases of the studies. The checklist can be used for planning, conducting, reporting, reviewing, and critical reading of observational intervention studies.

  2. Development of Creative Behavior Observation Form: A Study on Validity and Reliability

    ERIC Educational Resources Information Center

    Dere, Zeynep; Ömeroglu, Esra

    2018-01-01

    In this study, the Creative Behavior Observation Form was developed to assess the creativity of children. To examine the reliability and validity of the form, a study group of 257 children aged 5-6 was selected using stratified sampling. Content Validity Index (CVI) and…

  3. Assessing validity of observational intervention studies – the Benchmarking Controlled Trials

    PubMed Central

    Malmivaara, Antti

    2016-01-01

    Abstract Background: Benchmarking Controlled Trial (BCT) is a concept which covers all observational studies aiming to assess impact of interventions or health care system features to patients and populations. Aims: To create and pilot test a checklist for appraising methodological validity of a BCT. Methods: The checklist was created by extracting the most essential elements from the comprehensive set of criteria in the previous paper on BCTs. Also checklists and scientific papers on observational studies and respective systematic reviews were utilized. Ten BCTs published in the Lancet and in the New England Journal of Medicine were used to assess feasibility of the created checklist. Results: The appraised studies seem to have several methodological limitations, some of which could be avoided in planning, conducting and reporting phases of the studies. Conclusions: The checklist can be used for planning, conducting, reporting, reviewing, and critical reading of observational intervention studies. However, the piloted checklist should be validated in further studies. Key messages: Benchmarking Controlled Trial (BCT) is a concept which covers all observational studies aiming to assess impact of interventions or health care system features to patients and populations. This paper presents a checklist for appraising methodological validity of BCTs and pilot-tests the checklist with ten BCTs published in leading medical journals. The appraised studies seem to have several methodological limitations, some of which could be avoided in planning, conducting and reporting phases of the studies. The checklist can be used for planning, conducting, reporting, reviewing, and critical reading of observational intervention studies. PMID:27238631

  4. Validation of the NOSCA - nurses' observation scale of cognitive abilities.

    PubMed

    Persoon, Anke; Schoonhoven, Lisette; Melis, Rene J F; van Achterberg, Theo; Kessels, Roy P C; Rikkert, Marcel G M Olde

    2012-11-01

    To examine the psychometric properties of the Nurses' Observation Scale for Cognitive Abilities. Nurses' Observation Scale for Cognitive Abilities is a behavioural rating scale comprising eight subscales that represent different cognitive domains. It is based on observations during contact between nurse and patient. Observational study. A total of 50 patients from two geriatric wards in acute care hospitals participated in this study. Reliability was examined via internal consistency and inter-rater reliability. Construct validity of the Nurses' Observation Scale for Cognitive Abilities and its subscales were explored by means of convergent and divergent validity and post hoc analyses for group differences. Cronbach's αs of the total Nurses' Observation Scale for Cognitive Abilities and its subscales were 0·98 and 0·66-0·93, respectively. The item-total correlations were satisfactory (overall > 0·4). The intra-class coefficients were good (37 of 39 items > 0·4). The convergent validity of the Nurses' Observation Scale for Cognitive Abilities against cognitive ratings (MMSE, NOSGER) and severity of dementia (Clinical Dementia Rating) demonstrated satisfactory correlations (0·59-0·70, p < 0·01), except for IQCODE (0·30, p > 0·05). The divergent validity of the Nurses' Observation Scale for Cognitive Abilities against depressive symptoms was low (0·12, p > 0·05). The construct validity of the Nurses' Observation Scale for Cognitive Abilities subscales against 13 specific neuropsychological tests showed correlations varying from poor to fair (0·18-0·74; 10 of 13 correlations p < 0·05). Validity and reliability of the total Nurses' Observation Scale for Cognitive Abilities are excellent. The correlations between the Nurses' Observation Scale for Cognitive Abilities subscales and standard neuropsychological tests were moderate. More conclusive results may be found if the Nurses' Observation Scale for Cognitive Abilities subscales were to be validated
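
    The internal-consistency statistic quoted in this record (Cronbach's α) can be illustrated with a minimal sketch. It uses the standard formula α = k/(k-1) × (1 - sum of item variances / variance of total scores). The function name and the toy ratings are hypothetical and are not data from the NOSCA study.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a table of ratings.

    scores: list of rows, one row per respondent, each row holding
    the k item scores for that respondent (hypothetical layout).
    """
    n_items = len(scores[0])
    if n_items < 2 or len(scores) < 2:
        raise ValueError("need at least 2 items and 2 respondents")
    if any(len(row) != n_items for row in scores):
        raise ValueError("every respondent must rate every item")

    # Sample variance of each item across respondents.
    item_variances = [
        variance([row[i] for row in scores]) for i in range(n_items)
    ]
    # Sample variance of each respondent's total score.
    total_variance = variance([sum(row) for row in scores])
    if total_variance == 0:
        raise ValueError("total scores have zero variance")

    return n_items / (n_items - 1) * (1 - sum(item_variances) / total_variance)


# Toy data (made up): 4 respondents rated on 3 items.
toy_ratings = [
    [2, 3, 3],
    [4, 4, 5],
    [1, 2, 1],
    [5, 4, 5],
]
alpha = cronbach_alpha(toy_ratings)
```

    Values close to 1 indicate that the items vary together; the sketch says nothing about whether a given value is acceptable for a particular scale.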

  5. External Validation of the HERNIAscore: An Observational Study.

    PubMed

    Cherla, Deepa V; Moses, Maya L; Mueck, Krislynn M; Hannon, Craig; Ko, Tien C; Kao, Lillian S; Liang, Mike K

    2017-09-01

    The HERNIAscore is a ventral incisional hernia (VIH) risk assessment tool that uses only preoperative variables and predictable intraoperative variables. The aim of this study was to validate and modify, if needed, the HERNIAscore in an external dataset. This was a retrospective observational study of all patients undergoing resection for gastrointestinal malignancy from 2011 through 2015 at a safety-net hospital. The primary end point was clinical postoperative VIH. Patients were stratified into low-risk, medium-risk, and high-risk groups based on HERNIAscore. A revised HERNIAscore was calculated with the addition of earlier abdominal operation as a categorical variable. Cox regression of incisional hernia with stratification by risk class was performed. Incidence rates of clinical VIH formation within each risk class were also calculated. Two hundred and forty-seven patients were enrolled. On Cox regression, in addition to the 3 variables of the HERNIAscore (BMI, COPD, and incision length), earlier abdominal operation was also predictive of VIH. The revised HERNIAscore demonstrated improved predictive accuracy for clinical VIH. Although the original HERNIAscore effectively stratified the risk of an incisional radiographic VIH developing, the revised HERNIAscore provided a statistically significant stratification for both clinical and radiographic VIHs in this patient cohort. We have externally validated and improved the HERNIAscore. The revised HERNIAscore uses BMI, incision length, COPD, and earlier abdominal operation to predict risk of postoperative incisional hernia. Future research should assess methods to prevent incisional hernias in moderate-to-high risk patients. Copyright © 2017 American College of Surgeons. Published by Elsevier Inc. All rights reserved.

  6. Preliminary Checklist for Reporting Observational Studies in Sports Areas: Content Validity.

    PubMed

    Chacón-Moscoso, Salvador; Sanduvete-Chaves, Susana; Anguera, M Teresa; Losada, José L; Portell, Mariona; Lozano-Lozano, José A

    2018-01-01

    Observational studies are based on systematic observation, understood as an organized recording and quantification of behavior in its natural context. Applied to the specific area of sports, observational studies present advantages when comparing studies based on other designs, such as the flexibility for adapting to different contexts and the possibility of using non-standardized instruments as well as a high degree of development in specific software and data analysis. Although the importance and usefulness of sports-related observational studies have been widely shown, there is no checklist to report these studies. Consequently, authors do not have a guide to follow in order to include all of the important elements in an observational study in sports areas, and reviewers do not have a reference tool for assessing this type of work. To resolve these issues, this article aims to develop a checklist to measure the quality of sports-related observational studies based on a content validity study. The participants were 22 judges with at least 3 years of experience in observational studies, sports areas, and methodology. They evaluated a list of 60 items systematically selected and classified into 12 dimensions. They were asked to score four aspects of each item on 5-point Likert scales to measure the following dimensions: representativeness, relevance, utility, and feasibility. The judges also had an open-format section for comments. The Osterlind index was calculated for each item and for each of the four aspects. Items were considered appropriate when obtaining a score of at least 0.5 in the four assessed aspects. After considering these inclusion criteria and all of the open-format comments, the resultant checklist consisted of 54 items grouped into the same initial 12 dimensions. Finally, we highlight the strengths of this work. 
    We also present its main limitation: the need to apply the resultant checklist to obtain data and, thus, increase quality indicators of

  7. Preliminary Checklist for Reporting Observational Studies in Sports Areas: Content Validity

    PubMed Central

    Chacón-Moscoso, Salvador; Sanduvete-Chaves, Susana; Anguera, M. Teresa; Losada, José L.; Portell, Mariona; Lozano-Lozano, José A.

    2018-01-01

    Observational studies are based on systematic observation, understood as an organized recording and quantification of behavior in its natural context. Applied to the specific area of sports, observational studies present advantages when comparing studies based on other designs, such as the flexibility for adapting to different contexts and the possibility of using non-standardized instruments as well as a high degree of development in specific software and data analysis. Although the importance and usefulness of sports-related observational studies have been widely shown, there is no checklist to report these studies. Consequently, authors do not have a guide to follow in order to include all of the important elements in an observational study in sports areas, and reviewers do not have a reference tool for assessing this type of work. To resolve these issues, this article aims to develop a checklist to measure the quality of sports-related observational studies based on a content validity study. The participants were 22 judges with at least 3 years of experience in observational studies, sports areas, and methodology. They evaluated a list of 60 items systematically selected and classified into 12 dimensions. They were asked to score four aspects of each item on 5-point Likert scales to measure the following dimensions: representativeness, relevance, utility, and feasibility. The judges also had an open-format section for comments. The Osterlind index was calculated for each item and for each of the four aspects. Items were considered appropriate when obtaining a score of at least 0.5 in the four assessed aspects. After considering these inclusion criteria and all of the open-format comments, the resultant checklist consisted of 54 items grouped into the same initial 12 dimensions. Finally, we highlight the strengths of this work. 
    We also present its main limitation: the need to apply the resultant checklist to obtain data and, thus, increase quality indicators of

  8. Validating an Environmental Education Field Day Observation Tool

    ERIC Educational Resources Information Center

    Carlson, Stephan P.; Heimlich, Joe E.; Storksdieck, Martin

    2011-01-01

    Environmental Field Days (EFD) are held throughout the country and provide a unique opportunity to involve students in real world science. A study to assess the validity of an observation tool for EFD programs was conducted at the Metro Water Festival with fifth grade students. Items from the observation tool were mapped to students' evaluation…

  9. Validation in the Absence of Observed Events

    DOE PAGES

    Lathrop, John; Ezell, Barry

    2015-07-22

    Here our paper addresses the problem of validating models in the absence of observed events, in the area of Weapons of Mass Destruction terrorism risk assessment. We address that problem with a broadened definition of “Validation,” based on “backing up” to the reason why modelers and decision makers seek validation, and from that basis re-define validation as testing how well the model can advise decision makers in terrorism risk management decisions. We develop that into two conditions: Validation must be based on cues available in the observable world; and it must focus on what can be done to affect that observable world, i.e. risk management. That in turn leads to two foci: 1.) the risk generating process, 2.) best use of available data. Based on our experience with nine WMD terrorism risk assessment models, we then describe three best use of available data pitfalls: SME confidence bias, lack of SME cross-referencing, and problematic initiation rates. Those two foci and three pitfalls provide a basis from which we define validation in this context in terms of four tests -- Does the model: … capture initiation? … capture the sequence of events by which attack scenarios unfold? … consider unanticipated scenarios? … consider alternative causal chains? Finally, we corroborate our approach against three key validation tests from the DOD literature: Is the model a correct representation of the simuland? To what degree are the model results comparable to the real world? Over what range of inputs are the model results useful?

  10. Validation in the Absence of Observed Events.

    PubMed

    Lathrop, John; Ezell, Barry

    2016-04-01

    This article addresses the problem of validating models in the absence of observed events, in the area of weapons of mass destruction terrorism risk assessment. We address that problem with a broadened definition of "validation," based on stepping "up" a level to considering the reason why decisionmakers seek validation, and from that basis redefine validation as testing how well the model can advise decisionmakers in terrorism risk management decisions. We develop that into two conditions: validation must be based on cues available in the observable world; and it must focus on what can be done to affect that observable world, i.e., risk management. That leads to two foci: (1) the real-world risk generating process, and (2) best use of available data. Based on our experience with nine WMD terrorism risk assessment models, we then describe three best use of available data pitfalls: SME confidence bias, lack of SME cross-referencing, and problematic initiation rates. Those two foci and three pitfalls provide a basis from which we define validation in this context in terms of four tests--Does the model: … capture initiation? … capture the sequence of events by which attack scenarios unfold? … consider unanticipated scenarios? … consider alternative causal chains? Finally, we corroborate our approach against three validation tests from the DOD literature: Is the model a correct representation of the process to be simulated? To what degree are the model results comparable to the real world? Over what range of inputs are the model results useful? © 2015 Society for Risk Analysis.

  11. Validating an Observation Protocol to Measure Special Education Teacher Effectiveness

    ERIC Educational Resources Information Center

    Johnson, Evelyn S.; Semmelroth, Carrie L.

    2015-01-01

    This study used Kane's (2013) Interpretation/Use Argument (IUA) to measure validity on the Recognizing Effective Special Education Teachers (RESET) observation tool. The RESET observation tool is designed to evaluate special education teacher effectiveness using evidence-based instructional practices as the basis for evaluation. In alignment with…

  12. Validity and inter-observer reliability of subjective hand-arm vibration assessments.

    PubMed

    Coenen, Pieter; Formanoy, Margriet; Douwes, Marjolein; Bosch, Tim; de Kraker, Heleen

    2014-07-01

    Exposure to mechanical vibrations at work (e.g., due to handling powered tools) is a potential occupational risk as it may cause upper extremity complaints. However, reliable and valid assessment methods for vibration exposure at work are lacking. Measuring hand-arm vibration objectively is often difficult and expensive, while the commonly used information provided by manufacturers lacks detail. Therefore, a subjective hand-arm vibration assessment method was tested on validity and inter-observer reliability. In an experimental protocol, sixteen tasks handling powered tools were executed by two workers. Hand-arm vibration was assessed subjectively by 16 observers according to the proposed subjective assessment method. As a gold standard reference, hand-arm vibration was measured objectively using a vibration measurement device. Weighted κ's were calculated to assess validity; intra-class-correlation coefficients (ICCs) were calculated to assess inter-observer reliability. Inter-observer reliability of the subjective assessments, depicting the agreement among observers, can be expressed by an ICC of 0.708 (0.511-0.873). The validity of the subjective assessments as compared to the gold-standard reference can be expressed by a weighted κ of 0.535 (0.285-0.785). In addition, the percentage of exact agreement of the subjective assessment compared to the objective measurement was relatively low (i.e., 52% of all tasks). This study shows that subjectively assessed hand-arm vibrations are fairly reliable among observers and moderately valid. This assessment method is a first attempt to use subjective risk assessments of hand-arm vibration. Although this assessment method can benefit from some future improvement, it can be of use in future studies and in field-based ergonomic assessments. Copyright © 2014 Elsevier Ltd and The Ergonomics Society. All rights reserved.
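
    The two agreement measures named in this record, weighted κ and percentage of exact agreement, can be sketched as follows. The abstract does not state which weighting scheme was used, so linear disagreement weights are an assumption here; the function names and the toy ratings are hypothetical and are not data from the study.

```python
def weighted_kappa(rater_a, rater_b, n_categories):
    """Linear-weighted Cohen's kappa for two raters.

    Ratings are integers 0 .. n_categories-1 on an ordinal scale.
    Linear weights are an assumption of this sketch.
    """
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("rating lists must be non-empty and equal length")
    if n_categories < 2:
        raise ValueError("need at least 2 categories")

    n = len(rater_a)
    k = n_categories
    # Observed joint proportions of (rating by A, rating by B).
    observed = [[0.0] * k for _ in range(k)]
    for a, b in zip(rater_a, rater_b):
        observed[a][b] += 1.0 / n

    row = [sum(observed[i]) for i in range(k)]
    col = [sum(observed[i][j] for i in range(k)) for j in range(k)]

    # Disagreement weight grows linearly with the category distance.
    def weight(i, j):
        return abs(i - j) / (k - 1)

    observed_disagreement = sum(
        weight(i, j) * observed[i][j] for i in range(k) for j in range(k)
    )
    expected_disagreement = sum(
        weight(i, j) * row[i] * col[j] for i in range(k) for j in range(k)
    )
    if expected_disagreement == 0:
        raise ValueError("kappa is undefined when all ratings are identical")
    return 1.0 - observed_disagreement / expected_disagreement


def exact_agreement(rater_a, rater_b):
    """Fraction of cases in which both ratings are identical."""
    return sum(a == b for a, b in zip(rater_a, rater_b)) / len(rater_a)


# Toy data (made up): subjective class vs. measured class for 8 tasks,
# on a 3-level ordinal scale (0 = low, 1 = medium, 2 = high).
subjective = [0, 1, 2, 1, 0, 2, 1, 1]
measured = [0, 1, 1, 1, 0, 2, 2, 0]
kappa = weighted_kappa(subjective, measured, n_categories=3)
agreement = exact_agreement(subjective, measured)
```

    Unlike exact agreement, the weighted statistic gives partial credit to near misses on the ordinal scale and corrects for agreement expected by chance.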

  13. Assessing the Relationship Between Observed Teaching Practice and Reading Growth in First Grade English Learners: A Validation Study

    ERIC Educational Resources Information Center

    Baker, Scott K.; Gersten, Russell; Haager, Diane; Dingle, Mary; Goldenberg, Claude

    2005-01-01

    Validation of a classroom observation measure for use with English Learners (ELs) in Grade 1 is the focus of this study. Fourteen teachers were observed during reading and language arts instruction with an instrument used to generate overall ratings of instructional quality on a number of dimensions. In these classrooms, the reading performance of…

  14. Reliability and Validity of the Dyadic Observed Communication Scale (DOCS).

    PubMed

    Hadley, Wendy; Stewart, Angela; Hunter, Heather L; Affleck, Katelyn; Donenberg, Geri; Diclemente, Ralph; Brown, Larry K

    2013-02-01

    We evaluated the reliability and validity of the Dyadic Observed Communication Scale (DOCS) coding scheme, which was developed to capture a range of communication components between parents and adolescents. Adolescents and their caregivers were recruited from mental health facilities for participation in a large, multi-site family-based HIV prevention intervention study. Seventy-one dyads were randomly selected from the larger study sample and coded using the DOCS at baseline. Preliminary validity and reliability of the DOCS was examined using various methods, such as comparing results to self-report measures and examining interrater reliability. Results suggest that the DOCS is a reliable and valid measure of observed communication among parent-adolescent dyads that captures both verbal and nonverbal communication behaviors that are typical intervention targets. The DOCS is a viable coding scheme for use by researchers and clinicians examining parent-adolescent communication. Coders can be trained to reliably capture individual and dyadic components of communication for parents and adolescents and this complex information can be obtained relatively quickly.

  15. An Argument Approach to Observation Protocol Validity

    ERIC Educational Resources Information Center

    Bell, Courtney A.; Gitomer, Drew H.; McCaffrey, Daniel F.; Hamre, Bridget K.; Pianta, Robert C.; Qi, Yi

    2012-01-01

    This article develops a validity argument approach for use on observation protocols currently used to assess teacher quality for high-stakes personnel and professional development decisions. After defining the teaching quality domain, we articulate an interpretive argument for observation protocols. To illustrate the types of evidence that might…

  16. DNA Fingerprinting Validates Seed Dispersal Curves from Observational Studies in the Neotropical Legume Parkia

    PubMed Central

    Heymann, Eckhard W.; Lüttmann, Kathrin; Michalczyk, Inga M.; Saboya, Pedro Pablo Pinedo; Ziegenhagen, Birgit; Bialozyt, Ronald

    2012-01-01

    Background: Determining the distances over which seeds are dispersed is a crucial component for examining spatial patterns of seed dispersal and their consequences for plant reproductive success and population structure. However, following the fate of individual seeds after removal from the source tree till deposition at a distant place is generally extremely difficult. Here we provide a comparison of observationally and genetically determined seed dispersal distances and dispersal curves in a Neotropical animal-plant system. Methodology/Principal Findings: In a field study on the dispersal of seeds of three Parkia (Fabaceae) species by two Neotropical primate species, Saguinus fuscicollis and Saguinus mystax, in Peruvian Amazonia, we observationally determined dispersal distances. These dispersal distances were then validated through DNA fingerprinting, by matching DNA from the maternally derived seed coat to DNA from potential source trees. We found that dispersal distances are strongly right-skewed, and that distributions obtained through observational and genetic methods and fitted distributions do not differ significantly from each other. Conclusions/Significance: Our study showed that seed dispersal distances can be reliably estimated through observational methods when a strict criterion for inclusion of seeds is observed. Furthermore, dispersal distances produced by the two primate species indicated that these primates fulfil one of the criteria for efficient seed dispersers. Finally, our study demonstrated that DNA extraction methods so far employed for temperate plant species can be successfully used for hard-seeded tropical plants. PMID:22514748

  17. A Turkish Version of the Critical-Care Pain Observation Tool: Reliability and Validity Assessment.

    PubMed

    Aktaş, Yeşim Yaman; Karabulut, Neziha

    2017-08-01

    The study aim was to evaluate the validity and reliability of the Critical-Care Pain Observation Tool in critically ill patients. A repeated measures design was used for the study. A convenience sample of 66 patients who had undergone open-heart surgery in the cardiovascular surgery intensive care unit in Ordu, Turkey, was recruited for the study. The patients were evaluated by using the Critical-Care Pain Observation Tool at rest, during a nociceptive procedure (suctioning), and 20 minutes after the procedure while they were conscious and intubated after surgery. The Turkish version of the Critical-Care Pain Observation Tool has shown statistically acceptable levels of validity and reliability. Inter-rater reliability was supported by moderate-to-high-weighted κ coefficients (weighted κ coefficient = 0.55 to 1.00). For concurrent validity, significant associations were found between the scores on the Critical-Care Pain Observation Tool and the Behavioral Pain Scale scores. Discriminant validity was also supported by higher scores during suctioning (a nociceptive procedure) versus non-nociceptive procedures. The internal consistency of the Critical-Care Pain Observation Tool was 0.72 during a nociceptive procedure and 0.71 during a non-nociceptive procedure. The validity and reliability of the Turkish version of the Critical-Care Pain Observation Tool was determined to be acceptable for pain assessment in critical care, especially for patients who cannot communicate verbally. Copyright © 2016 American Society of PeriAnesthesia Nurses. Published by Elsevier Inc. All rights reserved.

  18. OCO-2 Observation and Validation Overview: Observations Data Modes and Target Observations, Taken During the First 15 Months of Operations

    NASA Astrophysics Data System (ADS)

    Osterman, G. B.; Fisher, B.; Wunch, D.; Eldering, A.; Wennberg, P. O.; Roehl, C. M.; Naylor, B. J.; Lee, R.; Pollock, R.; Gunson, M. R.

    2015-12-01

    The OCO-2 instrument was successfully launched on July 2, 2014 from Vandenberg Air Force Base in California. The instrument reached its observational orbit about three weeks later. The spacecraft is at the head of the A-train satellites and began collecting operational data on Sept 5, 2014. OCO-2 makes measurements in three modes: nadir, glint and target. Target observations are designed to provide large amounts of data in a small area near a ground validation site. The instruments of the Total Carbon Column Observing Network (TCCON) provide the ground validation data for the OCO-2 XCO2 observations and comparisons to TCCON form the basis of the OCO-2 validation plan. There are now 27 locations at which OCO-2 can perform target observations and TCCON sites make up 23 of those possible target locations. For its first year in orbit, OCO-2 operated in nadir mode for 16 days and then in glint mode for 16 days. Each 16-day cycle spans 233 orbits. On July 1, 2015, OCO-2 changed to an observational mode of alternating nadir and glint measurements on an orbit-by-orbit basis. By December 2015, this operational mode may be modified such that orbits that measure only over ocean will always be observed in glint mode. In this presentation we will provide information on the observations made by OCO-2 during its first 15 months of operations. We will show maps of the OCO-2 ground tracks and XCO2 data, calendars illustrating the observational schedule and statistics on the target observations taken. We will provide more information on what is involved in making target observations and how it affects the standard operational data acquisition patterns. Changes to the standard observational patterns of OCO-2 and to the list of locations for target observations will be discussed as well.
    We will provide an overview of some of the validation related analysis being done using nadir and glint mode OCO-2 data in addition to an overview on validation analyses that do not directly utilize TCCON

  19. Validation of an Instructional Observation Instrument for Teaching English as a Foreign Language in Spain

    ERIC Educational Resources Information Center

    Gomez-Garcia, Maria

    2011-01-01

    The design and validation of a classroom observation instrument to provide formative feedback for teachers of EFL in Spain is the overarching purpose of this study. This study proposes that a valid and reliable classroom observation instrument, based on effective practice in teaching EFL, can be developed and used in Spain to enable teachers to…

  20. Using face validity to recognize empirical community observations.

    PubMed

    Gaber, John; Gaber, Sharon L

    2010-05-01

    There is a growing interest among international planning scholars to explore community participation in the plan making process from a qualitative research approach. In this paper the research assessment tool "face validity" is discussed as one way to help planners decipher when the community is sharing empirically grounded observations that can advance the applicability of the plan making process. Face validity provides a common sense assessment of research conclusions. It allows the assessor to look at an entire research project and ask: "on the face of things, does this research make sense?" Planners who listen to citizen comments with an ear for face validity observations hold open the opportunity for government to learn empirically from the community whether they "got it right" and, if not, to chart out a course on how they can get it right. Copyright 2009 Elsevier Ltd. All rights reserved.

  1. Validity of the Autism Spectrum Disorder Observation for Children (ASD-OC)

    ERIC Educational Resources Information Center

    Neal, Daniene; Matson, Johnny L.; Hattier, Megan A.

    2014-01-01

    The Autism Spectrum Disorder Observation for Children (ASD-OC) is a 45-item observation scale used to assess autistic symptomatology. The reliability of this measure has been established in previous research; therefore, the purpose of this study is to evaluate its validity among a sample of children (1-15 years). The large correlation between the…

  2. TU-FG-209-11: Validation of a Channelized Hotelling Observer to Optimize Chest Radiography Image Processing for Nodule Detection: A Human Observer Study

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Sanchez, A; Little, K; Chung, J

    Purpose: To validate the use of a Channelized Hotelling Observer (CHO) model for guiding image processing parameter selection and enable improved nodule detection in digital chest radiography. Methods: In a previous study, an anthropomorphic chest phantom was imaged with and without PMMA simulated nodules using a GE Discovery XR656 digital radiography system. The impact of image processing parameters was then explored using a CHO with 10 Laguerre-Gauss channels. In this work, we validate the CHO’s trend in nodule detectability as a function of two processing parameters by conducting a signal-known-exactly, multi-reader-multi-case (MRMC) ROC observer study. Five naive readers scored confidence of nodule visualization in 384 images with 50% nodule prevalence. The image backgrounds were regions-of-interest extracted from 6 normal patient scans, and the digitally inserted simulated nodules were obtained from phantom data in previous work. Each patient image was processed with both a near-optimal and a worst-case parameter combination, as determined by the CHO for nodule detection. The same 192 ROIs were used for each image processing method, with 32 randomly selected lung ROIs per patient image. Finally, the MRMC data was analyzed using the freely available iMRMC software of Gallas et al. Results: The image processing parameters which were optimized for the CHO led to a statistically significant improvement (p=0.049) in human observer AUC from 0.78 to 0.86, relative to the image processing implementation which produced the lowest CHO performance. Conclusion: Differences in user-selectable image processing methods on a commercially available digital radiography system were shown to have a marked impact on performance of human observers in the task of lung nodule detection. Further, the effect of processing on humans was similar to the effect on CHO performance. Future work will expand this study to include a wider range of detection/classification tasks and
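
    The observer AUC reported in this record can be illustrated for a single reader with a minimal sketch. It uses the Mann-Whitney form of the empirical AUC: the probability that a nodule-present image receives a higher confidence score than a nodule-absent image, with ties counted as one half. This is not the multi-reader analysis performed by the iMRMC software; the function name and the toy scores are hypothetical.

```python
def empirical_auc(present_scores, absent_scores):
    """Empirical ROC AUC for one reader (Mann-Whitney form).

    present_scores: confidence ratings for signal-present cases.
    absent_scores: confidence ratings for signal-absent cases.
    """
    if not present_scores or not absent_scores:
        raise ValueError("both score lists must be non-empty")

    wins = 0.0
    for p in present_scores:
        for a in absent_scores:
            if p > a:
                wins += 1.0   # correctly ordered pair
            elif p == a:
                wins += 0.5   # tie counts as half
    return wins / (len(present_scores) * len(absent_scores))


# Toy data (made up): one reader's ratings under two processing settings.
auc_setting_a = empirical_auc([3, 4, 5, 4], [1, 2, 3, 2])
auc_setting_b = empirical_auc([2, 4, 3, 4], [1, 3, 3, 2])
```

    Comparing the two values for the same reader and cases mirrors, in a very reduced form, the comparison of processing settings described in the abstract; assessing statistical significance across readers and cases requires a dedicated MRMC method.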

  3. The Effect of Observation Length and Presentation Order on the Reliability and Validity of an Observational Measure of Teaching Quality

    ERIC Educational Resources Information Center

    Mashburn, Andrew J.; Meyer, J. Patrick; Allen, Joseph P.; Pianta, Robert C.

    2014-01-01

    Observational methods are increasingly being used in classrooms to evaluate the quality of teaching. Operational procedures for observing teachers are somewhat arbitrary in existing measures and vary across different instruments. To study the effect of different observation procedures on score reliability and validity, we conducted an experimental…

  4. Initial Steps in Creating a Developmentally Valid Tool for Observing/Assessing Rope Jumping

    ERIC Educational Resources Information Center

    Roberton, Mary Ann; Thompson, Gregory; Langendorfer, Stephen J.

    2017-01-01

    Background: Valid motor development sequences show the various behaviors that children display as they progress toward competence in specific motor skills. Teachers can use these sequences to observe informally or formally assess their students. While longitudinal study is ultimately required to validate developmental sequences, there are earlier,…

  5. Effect of individual shades on reliability and validity of observers in colour matching.

    PubMed

    Lagouvardos, P E; Diamanti, H; Polyzois, G

    2004-06-01

    The effect of individual shades in shade guides on the reliability and validity of measurements in a colour matching process is very important. Observers' agreement on shades and the sensitivity/specificity of shades can give an estimate of each shade's effect on observer reliability and validity. In the present study, a group of 16 students matched 15 shades of a Kulzer guide and 10 human incisors to Kulzer and/or Vita shade tabs in 4 different tests. The results showed that shades I, B10, C40, A35 and A10 had the highest reliability and validity values. In conclusion, a) the matching process with shades of different materials was not accurate enough, b) some shades produce a more reliable and valid match than others and c) teeth are matched with relative difficulty.

  6. Is Ultrasound a Valid and Reliable Imaging Modality for Airway Evaluation?: An Observational Computed Tomographic Validation Study Using Submandibular Scanning of the Mouth and Oropharynx.

    PubMed

    Abdallah, Faraj W; Yu, Eugene; Cholvisudhi, Phantila; Niazi, Ahtsham U; Chin, Ki J; Abbas, Sherif; Chan, Vincent W

    2017-01-01

    Ultrasound (US) imaging of the airway may be useful in predicting difficulty of airway management (DAM); but its use is limited by lack of proof of its validity and reliability. We sought to validate US imaging of the airway by comparison to CT-scan, and to assess its inter- and intra-observer reliability. We used submandibular sonographic imaging of the mouth and oropharynx to examine how well the ratio of tongue thickness to oral cavity height correlates with the ratio of tongue volume to oral cavity volume, an established tomographic measure of DAM. A cohort of 34 patients undergoing CT-scan was recruited. Study standardized assessments included CT-measured ratios of tongue volume to oropharyngeal cavity volume; tongue thickness to oral cavity height; and US-measured ratio of tongue thickness to oral cavity height. Two sonographers independently performed US imaging of the airway before and after CT-scan. Our findings indicate that the US-measured ratio of tongue thickness to oral cavity height highly correlates with the CT-measured ratio of tongue volume to oral cavity volume. US measurements also demonstrated strong inter- and intra-observer reliability. This study suggests that US is a valid and reliable tool for imaging the oral and oropharyngeal parts of the airway, as well as for measuring the volumetric relationship between the tongue and oral cavity, and may therefore be a useful predictor of DAM. © 2016 by the American Institute of Ultrasound in Medicine.

  7. A methodology to estimate representativeness of LAI station observation for validation: a case study with Chinese Ecosystem Research Network (CERN) in situ data

    NASA Astrophysics Data System (ADS)

    Xu, Baodong; Li, Jing; Liu, Qinhuo; Zeng, Yelu; Yin, Gaofei

    2014-11-01

    Leaf Area Index (LAI) is a key vegetation biophysical variable. To use remote sensing LAI products effectively in various disciplines, it is critical to understand their accuracy. The common method for validating LAI products is first to establish an empirical relationship between field data and high-resolution imagery to derive LAI maps, and then to aggregate the high-resolution LAI maps to match moderate-resolution LAI products. This method is suited only to small regions, and its measurement frequency is limited. Therefore, continuously observed LAI datasets from ground station networks are important for the validation of multi-temporal LAI products. However, because of the scale mismatch between the point observation at the ground station and the pixel observation, a direct comparison introduces scale error. It is therefore necessary to evaluate the representativeness of ground station measurements at the pixel scale of the products for a reasonable validation. In this paper, a case study with Chinese Ecosystem Research Network (CERN) in situ data is used to introduce a methodology for estimating the representativeness of LAI station observations for validating LAI products. We first analyzed the indicators used to evaluate observation representativeness, and then graded the station measurement data. Finally, the LAI measurement data that can represent the pixel scale were used to validate the MODIS, GLASS and GEOV1 LAI products. The results show that the best agreement is reached between GLASS and GEOV1, while the lowest uncertainty is achieved by GEOV1, followed by GLASS and MODIS. We conclude that ground station measurement data can validate multi-temporal LAI products objectively based on the evaluation indicators of station observation representativeness, which can also improve the reliability of the validation of remote sensing products.

  8. An observation tool for instructor and student behaviors to measure in-class learner engagement: a validation study

    PubMed Central

    Alimoglu, Mustafa K.; Sarac, Didar B.; Alparslan, Derya; Karakas, Ayse A.; Altintas, Levent

    2014-01-01

    Background Efforts are made to enhance in-class learner engagement because it stimulates and enhances learning. However, it is not easy to quantify learner engagement. This study aimed to develop and validate an observation tool for instructor and student behaviors to determine and compare in-class learner engagement levels in four different class types delivered by the same instructor. Methods Observer pairs observed instructor and student behaviors during lectures in large class (LLC, n=2) with third-year medical students, lectures in small class (LSC, n=6) and case-based teaching sessions (CBT, n=4) with fifth-year students, and problem-based learning (PBL) sessions (~7 hours) with second-year students. The observation tool was a revised form of STROBE, an instrument for recording behaviors of an instructor and four randomly selected students as snapshots for 5-min cycles. Instructor and student behaviors were scored 1–5 on this tool named ‘in-class engagement measure (IEM)’. The IEM scores were parallel to the degree of each behavior's contribution to active student engagement, so higher scores were associated with more in-class learner engagement. Additionally, the number of questions asked by the instructor and students was recorded. A total of 203 5-min observations were performed (LLC 20, LSC 85, CBT 50, and PBL 48). Results Interobserver agreement on instructor and student behaviors was 93.7% (κ=0.87) and 80.6% (κ=0.71), respectively. Higher median IEM scores were found in student-centered and problem-oriented methods such as CBT and PBL. A moderate correlation was found between instructor and student behaviors (r=0.689). Conclusions This study provides some evidence for validity of the IEM scores as a measure of student engagement in different class types. PMID:25308966
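The record above reports interobserver agreement both as a raw percentage and as Cohen's kappa. As a minimal sketch of how those two figures relate, the Python below computes both for two raters scoring the same snapshots. The function names and any ratings passed to them are hypothetical illustrations, not the study's code or data, and the abstract does not say which kappa variant was used; unweighted Cohen's kappa is assumed here.

```python
from collections import Counter


def percent_agreement(rater_a, rater_b):
    """Fraction of items on which the two raters gave the same score."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("both raters must score the same non-empty set of items")
    return sum(x == y for x, y in zip(rater_a, rater_b)) / len(rater_a)


def cohen_kappa(rater_a, rater_b):
    """Unweighted Cohen's kappa: observed agreement corrected for chance agreement."""
    n = len(rater_a)
    p_observed = percent_agreement(rater_a, rater_b)
    counts_a, counts_b = Counter(rater_a), Counter(rater_b)
    # Chance agreement: probability that both raters pick the same category
    # if each rated independently according to their own marginal frequencies.
    categories = set(counts_a) | set(counts_b)
    p_expected = sum(counts_a[c] * counts_b[c] for c in categories) / (n * n)
    if p_expected == 1.0:
        raise ValueError("kappa is undefined when both raters use a single category")
    return (p_observed - p_expected) / (1.0 - p_expected)
```

Kappa is lower than raw agreement whenever some agreement would be expected by chance alone, which is why both figures are usually reported together.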

  9. Reliability and validity of the Pragmatics Observational Measure (POM): a new observational measure of pragmatic language for children.

    PubMed

    Cordier, Reinie; Munro, Natalie; Wilkes-Gillan, Sarah; Speyer, Renée; Pearce, Wendy M

    2014-07-01

    There is a need for a reliable and valid assessment of childhood pragmatic language skills during peer-peer interactions. This study aimed to evaluate the psychometric properties of a newly developed pragmatic assessment, the Pragmatic Observational Measure (POM). The psychometric properties of the POM were investigated from observational data of two studies - study 1 involved 342 children aged 5-11 years (108 children with ADHD; 108 typically developing playmates; 126 children in the control group), and study 2 involved 9 children with ADHD who attended a 7-week play-based intervention. The psychometric properties of the POM were determined based on the COnsensus-based Standards for the selection of health status Measurement INstruments (COSMIN) taxonomy of psychometric properties and definitions for health-related outcomes; the Pragmatic Protocol was used as the reference tool against which the POM was evaluated. The POM demonstrated sound psychometric properties in all the reliability, validity and interpretability criteria against which it was assessed. The findings showed that the POM is a reliable and valid measure of pragmatic language skills of children with ADHD between the age of 5 and 11 years and has clinical utility in identifying children with pragmatic language difficulty. Copyright © 2014 The Authors. Published by Elsevier Ltd. All rights reserved.

  10. Validity of covering-up sun-protection habits: Association of observations and self-report

    PubMed Central

    O'Riordan, David L.; Nehl, Eric; Gies, Peter; Bundy, Lucja; Burgess, Kristen; Davis, Erica; Glanz, Karen

    2013-01-01

    Background Few studies have reported the accuracy of measures used to assess sun-protection practices. Valid measures are critical to the internal validity and use of skin cancer control research. Objectives We sought to validate self-reported covering-up practices of pool-goers. Methods A total of 162 lifeguards and 201 parent/child pairs from 16 pools in 4 metropolitan regions in the United States completed a survey and a 4-day sun-habits diary. Observations of sun-protective behaviors were conducted on two occasions. Results Agreement between observations and diaries ranged from slight to substantial, with most values in the fair to moderate range. Highest agreement was observed for parent hat use (κ = 0.58–0.70). There was no systematic pattern of over- or under-reporting among the 3 study groups. Limitations Potential reactivity and a relatively affluent sample are limitations. Conclusion There was little over-reporting and no systematic bias, which increases confidence in reliance on verbal reports of these behaviors in surveys and intervention research. PMID:19278750

  11. Observational Assessment of Preschool Disruptive Behavior, Part II: validity of the Disruptive Behavior Diagnostic Observation Schedule (DB-DOS).

    PubMed

    Wakschlag, Lauren S; Briggs-Gowan, Margaret J; Hill, Carri; Danis, Barbara; Leventhal, Bennett L; Keenan, Kate; Egger, Helen L; Cicchetti, Domenic; Burns, James; Carter, Alice S

    2008-06-01

    To examine the validity of the Disruptive Behavior Diagnostic Observation Schedule (DB-DOS), a new observational method for assessing preschool disruptive behavior. A total of 327 behaviorally heterogeneous preschoolers from low-income environments comprised the validation sample. Parent and teacher reports were used to identify children with clinically significant disruptive behavior. The DB-DOS assessed observed disruptive behavior in two domains, problems in Behavioral Regulation and Anger Modulation, across three interactional contexts: Examiner Engaged, Examiner Busy, and Parent. Convergent and divergent validity of the DB-DOS were tested in relation to parent and teacher reports and independently observed behavior. Clinical validity was tested in terms of criterion and incremental validity of the DB-DOS for discriminating disruptive behavior status and impairment, concurrently and longitudinally. DB-DOS scores were significantly associated with reported and independently observed behavior in a theoretically meaningful fashion. Scores from both DB-DOS domains and each of the three DB-DOS contexts contributed uniquely to discrimination of disruptive behavior status, concurrently and predictively. Observed behavior on the DB-DOS also contributed incrementally to prediction of impairment over time, beyond variance explained by meeting DSM-IV disruptive behavior disorder symptom criteria based on parent/teacher report. The multidomain, multicontext approach of the DB-DOS is a valid method for direct assessment of preschool disruptive behavior. This approach shows promise for enhancing accurate identification of clinically significant disruptive behavior in young children and for characterizing subtypes in a manner that can directly inform etiological and intervention research.

  12. Validity of the modified RULA for computer workers and reliability of one observation compared to six.

    PubMed

    Levanon, Yafa; Lerman, Yehuda; Gefen, Amit; Ratzon, Navah Z

    2014-01-01

    Awkward body posture while typing is associated with musculoskeletal disorders (MSDs). Valid rapid assessment of computer workers' body posture is essential for the prevention of MSD among this large population. This study aimed to examine the validity of the modified rapid upper limb assessment (mRULA) which adjusted the rapid upper limb assessment (RULA) for computer workers. Moreover, this study examines whether one observation during a working day is sufficient or more observations are needed. A total of 29 right-handed computer workers were recruited. RULA and mRULA were conducted. The observations were then repeated six times at one-hour intervals. A significant moderate correlation (r = 0.6 and r = 0.7 for mouse and keyboard, respectively) was found between the assessments. No significant differences were found between one observation and six observations per working day. The mRULA was found to be valid for the assessment of computer workers, and one observation was sufficient to assess the work-related risk factor.

  13. Instructional Interactions of Kindergarten Mathematics Classrooms: Validating a Direct Observation Instrument

    ERIC Educational Resources Information Center

    Doabler, Christian; Smolkowski, Keith; Fien, Hank; Kosty, Derek B.; Cary, Mari Strand

    2010-01-01

    In this paper, the authors report research focused directly on the validation of the Coding of Academic Teacher-Student interactions (CATS) direct observation instrument. They use classroom information gathered by the CATS instrument to better understand the potential mediating variables hypothesized to influence student achievement. Their study's…

  14. Observed Parenting Behavior with Teens: Measurement Invariance and Predictive Validity Across Race

    PubMed Central

    Skinner, Martie L.; MacKenzie, Elizabeth P.; Haggerty, Kevin P.; Hill, Karl G.; Roberson, Kendra C.

    2011-01-01

    Previous reports supporting measurement equality between European American and African American families have often focused on self-reported risk factors or observed parent behavior with young children. This study examines equality of measurement of observer ratings of parenting behavior with adolescents during structured tasks; mean levels of observed parenting; and predictive validity of teen self-reports of antisocial behaviors and beliefs using a sample of 163 African American and 168 European American families. Multiple-group confirmatory factor analyses supported measurement invariance across ethnic groups for 4 measures of observed parenting behavior: prosocial rewards, psychological costs, antisocial rewards, and problem solving. Some mean-level differences were found: African American parents exhibited lower levels of prosocial rewards, higher levels of psychological costs, and lower problem solving when compared to European Americans. No significant mean difference was found in rewards for antisocial behavior. Multigroup structural equation models suggested comparable relationships across race (predictive validity) between parenting constructs and youth antisocial constructs (i.e., drug initiation, positive drug attitudes, antisocial attitudes, problem behaviors) in all but one of the tested relationships. This study adds to existing evidence that family-based interventions targeting parenting behaviors can be generalized to African American families. PMID:21787057

  15. Validation of Clinical Observations of Mastication in Persons with ALS.

    PubMed

    Simione, Meg; Wilson, Erin M; Yunusova, Yana; Green, Jordan R

    2016-06-01

    Amyotrophic lateral sclerosis (ALS) is a progressive neurological disease that can result in difficulties with mastication leading to malnutrition, choking or aspiration, and reduced quality of life. When evaluating mastication, clinicians primarily observe spatial and temporal aspects of jaw motion. The reliability and validity of clinical observations for detecting jaw movement abnormalities are unknown. The purpose of this study is to determine the reliability and validity of clinician-based ratings of chewing performance in neuro-typical controls and persons with varying degrees of chewing impairments due to ALS. Adults chewed a solid food consistency while full-face video was recorded along with jaw kinematic data using a 3D optical motion capture system. Five experienced speech-language pathologists watched the videos and rated the spatial and temporal aspects of chewing performance. The jaw kinematic data served as the gold-standard for validating the clinicians' ratings. Results showed that the clinician-based rating of temporal aspects of chewing performance had strong inter-rater reliability and correlated well with comparable kinematic measures. In contrast, the reliability of rating the spatial and spatiotemporal aspects of chewing (i.e., range of motion of the jaw, consistency of the chewing pattern) was mixed. Specifically, ratings of range of motion were at best only moderately reliable. Ratings of chewing movement consistency were reliable but only weakly correlated with comparable measures of jaw kinematics. These findings suggest that clinician ratings of temporal aspects of chewing are appropriate for clinical use, whereas ratings of the spatial and spatiotemporal aspects of chewing may not be reliable or valid.

  16. Concurrent Validity of the Classroom Strategies Scale for Elementary School--Observer Form

    ERIC Educational Resources Information Center

    Reddy, Linda A.; Fabiano, Gregory A.; Dudek, Christopher M.

    2013-01-01

    The present study is an initial investigation of the concurrent validity of a new assessment, the Classroom Strategies Scale (CSS version 2.0) for Elementary School--Observer Form. The CSS assesses teachers' use of instructional and behavioral management strategies. In the present study, the CSS is compared to the Classroom Assessment Scoring…

  17. Validity and feasibility of the EMG direct observation tool (EMG-DOT).

    PubMed

    Leep Hunderfund, Andrea N; Rubin, Devon I; Laughlin, Ruple S; Sorenson, Eric J; Watson, James C; Jones, Lyell K; Juul, Dorthea; Park, Yoon Soo

    2016-04-26

    To develop a new workplace-based EMG direct observation tool (EMG-DOT) and gather validity evidence supporting its use for assessing electrodiagnostic skills among postgraduate medical trainees. The EMG-DOT was developed by experts using an iterative process. Validity evidence from content, response process, internal structure, relations to other variables, and consequences of testing was collected during the 2013-2014 academic year. Of 3,412 studies performed by trainees during the study period, 299 (9%) were assessed using the EMG-DOT. Of these, 203 (68%) involved a physician rater and 96 (32%) involved a technician rater. The 14-item EMG-DOT had excellent internal-consistency reliability (Cronbach α 0.94). Correlations between individual items and criterion-referenced global ratings of performance ranged from 0.36 to 0.72 (all p < 0.001). Mean total scores increased from 70% to 80% over 4 months of the EMG rotation (p < 0.001) despite a corresponding significant increase in case complexity (0.21-0.74 on a 3-point rating scale; p < 0.001). Trainees reported that the observational assessment exercise improved their knowledge or skills in 82% of encounters (188/230) and that feedback generated by the EMG-DOT improved the quality of care provided to patients in 58% (133/230). Trainees were "satisfied" or "very satisfied" with the observational assessment exercise in 96% of encounters (234/243). This study provides validity evidence supporting the use of EMG-DOT scores to assess electrodiagnostic skills of residents and fellows. The EMG-DOT can be used to inform milestone-based assessments of trainee performance in neurology, child neurology, physical medicine and rehabilitation, neuromuscular, and clinical neurophysiology training programs. © 2016 American Academy of Neurology.
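The record above summarizes internal-consistency reliability with Cronbach's α. As a rough illustration of what that statistic measures, the sketch below computes α from a table of item scores using the standard formula α = k/(k−1) · (1 − Σ item variances / variance of total scores). The function name and the row/column layout (one row per assessment, one column per item) are assumptions made for this example; nothing here reproduces the study's data or analysis.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of rows, each row holding one score per item."""
    if len(scores) < 2:
        raise ValueError("need at least two rows (assessments)")
    k = len(scores[0])
    if k < 2 or any(len(row) != k for row in scores):
        raise ValueError("every row needs the same number of items (at least two)")
    # Sample variance of each item (column) across all rows.
    item_variance_sum = sum(variance(column) for column in zip(*scores))
    # Sample variance of the per-row total score.
    total_variance = variance([sum(row) for row in scores])
    if total_variance == 0:
        raise ValueError("alpha is undefined when all total scores are identical")
    return k / (k - 1) * (1 - item_variance_sum / total_variance)
```

Values close to 1 indicate that the items vary together across assessments, which is the sense in which the 14 items are described as internally consistent.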

  18. The brief negative symptom scale: validation of the German translation and convergent validity with self-rated anhedonia and observer-rated apathy.

    PubMed

    Bischof, Martin; Obermann, Caitriona; Hartmann, Matthias N; Hager, Oliver M; Kirschner, Matthias; Kluge, Agne; Strauss, Gregory P; Kaiser, Stefan

    2016-11-22

    Negative symptoms are considered core symptoms of schizophrenia. The Brief Negative Symptom Scale (BNSS) was developed to measure this symptomatic dimension according to a current consensus definition. The present study examined the psychometric properties of the German version of the BNSS. To expand former findings on convergent validity, we employed the Temporal Experience Pleasure Scale (TEPS), a hedonic self-report that distinguishes between consummatory and anticipatory pleasure. Additionally, we addressed convergent validity with observer-rated assessment of apathy with the Apathy Evaluation Scale (AES), which was completed by the patient's primary nurse. Data were collected from 75 in- and outpatients from the Psychiatric Hospital, University Zurich diagnosed with either schizophrenia or schizoaffective disorder. We assessed convergent and discriminant validity, internal consistency and inter-rater reliability. We largely replicated the findings of the original version showing good psychometric properties of the BNSS. In addition, the primary nurse's evaluation correlated moderately with interview-based clinician rating. BNSS anhedonia items showed good convergent validity with the TEPS. Overall, the German BNSS shows good psychometric properties comparable to the original English version. Convergent validity extends beyond interview-based assessments of negative symptoms to self-rated anhedonia and observer-rated apathy.

  19. Parental modelling of eating behaviours: observational validation of the Parental Modelling of Eating Behaviours scale (PARM).

    PubMed

    Palfreyman, Zoe; Haycraft, Emma; Meyer, Caroline

    2015-03-01

    Parents are important role models for their children's eating behaviours. This study aimed to further validate the recently developed Parental Modelling of Eating Behaviours Scale (PARM) by examining the relationships between maternal self-reports on the PARM with the modelling practices exhibited by these mothers during three family mealtime observations. Relationships between observed maternal modelling and maternal reports of children's eating behaviours were also explored. Seventeen mothers with children aged between 2 and 6 years were video recorded at home on three separate occasions whilst eating a meal with their child. Mothers also completed the PARM, the Children's Eating Behaviour Questionnaire and provided demographic information about themselves and their child. Findings provided validation for all three PARM subscales, which were positively associated with their observed counterparts on the observational coding scheme (PARM-O). The results also indicate that habituation to observations did not change the feeding behaviours displayed by mothers. In addition, observed maternal modelling was significantly related to children's food responsiveness (i.e., their interest in and desire for foods), enjoyment of food, and food fussiness. This study makes three important contributions to the literature. It provides construct validation for the PARM measure and provides further observational support for maternal modelling being related to lower levels of food fussiness and higher levels of food enjoyment in their children. These findings also suggest that maternal feeding behaviours remain consistent across repeated observations of family mealtimes, providing validation for previous research which has used single observations. Copyright © 2014 Elsevier Ltd. All rights reserved.

  20. The definition of radiological signs in gastric ulcer and assessment of their validity by inter-observer variation study.

    PubMed

    Schulman, A; Simpkins, K C

    1975-07-01

    The initial aim was to program a computer with information on the frequency of radiological signs in benign and malignant gastric ulcers in order to obtain a percentage probability of benignancy or malignancy in succeeding ulcers in clinical practice. However, only four of the many signs described in gastric ulcer were confirmed to be of validity (i.e. reliable existence) by an inter-observer variation study using two observers and the films from 69 barium meal examinations. These were projection or non-projection of the in-profile ulcer, presence or absence of adjacent mucosal folds, good or poor definition of the in-face ulcer's edge, and extension of radiating folds to the in-face ulcer's edge. A few more remained unassessed due to insufficient numbers of relevant cases. It is concluded that: as defined in the literature the majority of radiological signs in this field are of uncertain existence; and the four that were found to be valid do not fully describe the important appearances that may be seen in benign and malignant ulcers and would be inadequate to differentiate them to a sufficiently high degree of probability.

  1. Validation: Codes to compare simulation data to various observations

    NASA Astrophysics Data System (ADS)

    Cohn, J. D.

    2017-02-01

    Validation provides codes to compare several observations to simulated data with stellar mass and star formation rate, simulated data stellar mass function with observed stellar mass function from PRIMUS or SDSS-GALEX in several redshift bins from 0.01-1.0, and simulated data B band luminosity function with observed stellar mass function, and to create plots for various attributes, including stellar mass functions, and stellar mass to halo mass. These codes can model predictions (in some cases alongside observational data) to test other mock catalogs.

  2. Inter-calibration and validation of observations from SAPHIR and ATMS instruments

    NASA Astrophysics Data System (ADS)

    Moradi, I.; Ferraro, R. R.

    2015-12-01

    We present the results of evaluating observations from microwave instruments aboard the Suomi National Polar-orbiting Partnership (NPP, ATMS instrument) and Megha-Tropiques (SAPHIR instrument) satellites. The study includes inter-comparison and inter-calibration of observations of similar channels from the two instruments, evaluation of the satellite data using high-quality radiosonde data from Atmospheric Radiation Measurement Program and GPS Radio Occultation Observations from COSMIC mission, as well as geolocation error correction. The results of this study are valuable for generating climate data records from these instruments as well as for extending current climate data records from similar instruments such as AMSU-B and MHS to the ATMS and SAPHIR instruments. Reference: Moradi et al., Intercalibration and Validation of Observations From ATMS and SAPHIR Microwave Sounders. IEEE Transactions on Geoscience and Remote Sensing. 01/2015; DOI: 10.1109/TGRS.2015.2427165

  3. Developing and Validating a New Classroom Climate Observation Assessment Tool

    PubMed Central

    Leff, Stephen S.; Thomas, Duane E.; Shapiro, Edward S.; Paskewich, Brooke; Wilson, Kim; Necowitz-Hoffman, Beth; Jawad, Abbas F.

    2011-01-01

    The climate of school classrooms, shaped by a combination of teacher practices and peer processes, is an important determinant for children’s psychosocial functioning and is a primary factor affecting bullying and victimization. Given that there are relatively few theoretically-grounded and validated assessment tools designed to measure the social climate of classrooms, our research team developed an observation tool through participatory action research (PAR). This article details how the assessment tool was designed and preliminarily validated in 18 third-, fourth-, and fifth-grade classrooms in a large urban public school district. The goals of this study are to illustrate the feasibility of a PAR paradigm in measurement development, ascertain the psychometric properties of the assessment tool, and determine associations with different indices of classroom levels of relational and physical aggression. PMID:21643447

  4. Developing and Validating a New Classroom Climate Observation Assessment Tool.

    PubMed

    Leff, Stephen S; Thomas, Duane E; Shapiro, Edward S; Paskewich, Brooke; Wilson, Kim; Necowitz-Hoffman, Beth; Jawad, Abbas F

    2011-01-01

    The climate of school classrooms, shaped by a combination of teacher practices and peer processes, is an important determinant for children's psychosocial functioning and is a primary factor affecting bullying and victimization. Given that there are relatively few theoretically-grounded and validated assessment tools designed to measure the social climate of classrooms, our research team developed an observation tool through participatory action research (PAR). This article details how the assessment tool was designed and preliminarily validated in 18 third-, fourth-, and fifth-grade classrooms in a large urban public school district. The goals of this study are to illustrate the feasibility of a PAR paradigm in measurement development, ascertain the psychometric properties of the assessment tool, and determine associations with different indices of classroom levels of relational and physical aggression.

  5. Validation of an image-based technique to assess the perceptual quality of clinical chest radiographs with an observer study

    NASA Astrophysics Data System (ADS)

    Lin, Yuan; Choudhury, Kingshuk R.; McAdams, H. Page; Foos, David H.; Samei, Ehsan

    2014-03-01

    We previously proposed a novel image-based quality assessment technique to assess the perceptual quality of clinical chest radiographs. In this paper, an observer study was designed and conducted to systematically validate this technique. Ten metrics were involved in the observer study, i.e., lung grey level, lung detail, lung noise, rib-lung contrast, rib sharpness, mediastinum detail, mediastinum noise, mediastinum alignment, subdiaphragm-lung contrast, and subdiaphragm area. For each metric, three tasks were successively presented to the observers. In each task, six ROI images were randomly presented in a row and observers were asked to rank the images only based on a designated quality and disregard the other qualities. A range slider on the top of the images was used for observers to indicate the acceptable range based on the corresponding perceptual attribute. Five board-certified radiologists from Duke participated in this observer study on a DICOM calibrated diagnostic display workstation and under low ambient lighting conditions. The observer data were analyzed in terms of the correlations between the observer ranking orders and the algorithmic ranking orders. Based on the collected acceptable ranges, quality consistency ranges were statistically derived. The observer study showed that, for each metric, the averaged ranking orders of the participating observers were strongly correlated with the algorithmic orders. For the lung grey level, the observer ranking orders completely accorded with the algorithmic ranking orders. The quality consistency ranges derived from this observer study were close to those derived from our previous study. The observer study indicates that the proposed image-based quality assessment technique provides a robust reflection of the perceptual image quality of the clinical chest radiographs. The derived quality consistency ranges can be used to automatically predict the acceptability of a clinical chest radiograph.
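The record above compares observer ranking orders with algorithmic ranking orders for sets of six ROI images. The abstract does not name the correlation statistic, so the sketch below assumes Spearman's rank correlation for tie-free rankings, ρ = 1 − 6 Σ d² / (n(n² − 1)), purely as an illustration. The function name is hypothetical, and each ranking is assumed to be a permutation of 1..n.

```python
def spearman_rho(rank_a, rank_b):
    """Spearman rank correlation between two tie-free rankings of the same items."""
    n = len(rank_a)
    if n != len(rank_b) or n < 2:
        raise ValueError("rankings must have the same length (at least two items)")
    expected = list(range(1, n + 1))
    if sorted(rank_a) != expected or sorted(rank_b) != expected:
        raise ValueError("each ranking must be a permutation of 1..n (no ties)")
    # Sum of squared differences between the two ranks assigned to each item.
    d_squared = sum((a - b) ** 2 for a, b in zip(rank_a, rank_b))
    return 1 - 6 * d_squared / (n * (n * n - 1))
```

Under this assumption, an observer ranking that "completely accorded" with the algorithmic ranking corresponds to ρ = 1, and a fully reversed ranking to ρ = −1.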

  6. Improved Conceptual Models Methodology (ICoMM) for Validation of Non-Observable Systems

    DTIC Science & Technology

    2015-12-01

    distribution is unlimited IMPROVED CONCEPTUAL MODELS METHODOLOGY (ICoMM) FOR VALIDATION OF NON-OBSERVABLE SYSTEMS by Sang M. Sok December 2015...REPORT TYPE AND DATES COVERED Dissertation 4. TITLE AND SUBTITLE IMPROVED CONCEPTUAL MODELS METHODOLOGY (ICoMM) FOR VALIDATION OF NON-OBSERVABLE...importance of the CoM. The improved conceptual model methodology (ICoMM) is developed in support of improving the structure of the CoM for both face and

  7. Observations on CFD Verification and Validation from the AIAA Drag Prediction Workshops

    NASA Technical Reports Server (NTRS)

    Morrison, Joseph H.; Kleb, Bil; Vassberg, John C.

    2014-01-01

    The authors provide observations from the AIAA Drag Prediction Workshops that have spanned over a decade and from a recent validation experiment at NASA Langley. These workshops provide an assessment of the predictive capability of forces and moments, focused on drag, for transonic transports. It is very difficult to manage the consistency of results in a workshop setting to perform verification and validation at the scientific level, but it may be sufficient to assess it at the level of practice. Observations thus far: 1) due to simplifications in the workshop test cases, wind tunnel data are not necessarily the “correct” results that CFD should match, 2) an average of core CFD data are not necessarily a better estimate of the true solution as it is merely an average of other solutions and has many coupled sources of variation, 3) outlier solutions should be investigated and understood, and 4) the DPW series does not have the systematic build up and definition on both the computational and experimental side that is required for detailed verification and validation. Several observations regarding the importance of the grid, effects of physical modeling, benefits of open forums, and guidance for validation experiments are discussed. The increased variation in results when predicting regions of flow separation and increased variation due to interaction effects, e.g., fuselage and horizontal tail, point out the need for validation data sets for these important flow phenomena. Experiences with a recent validation experiment at NASA Langley are included to provide guidance on validation experiments.

  8. Cooperate to Validate: OBSERVAL-NET Experts' Report on Validation of Non-Formal and Informal Learning (VNIL) 2013

    ERIC Educational Resources Information Center

    Weber Guisan, Saskia; Voit, Janine; Lengauer, Sonja; Proinger, Eva; Duvekot, Ruud; Aagaard, Kirsten

    2014-01-01

    The present publication is one of the outcomes of the OBSERVAL-NET project (follow-up of the OBSERVAL project). The main aim of OBSERVAL-NET was to set up a stakeholder-centric network of organisations supporting the validation of non-formal and informal learning in Europe based on the formation of national working groups in the 8 participating…

  9. Cooperate to Validate. Observal-Net Experts' Report on Validation of Non-Formal and Informal Learning (VNIL) 2013

    ERIC Educational Resources Information Center

    Weber Guisan, Saskia; Voit, Janine; Lengauer, Sonja; Proinger, Eva; Duvekot, Ruud; Aagaard, Kirsten

    2014-01-01

    The present publication is one of the outcomes of the OBSERVAL-NET project (follow-up of the OBSERVAL project). The main aim of OBSERVAL-NET was to set up a stakeholder-centric network of organisations supporting the validation of non-formal and informal learning in Europe based on the formation of national working groups in the 8 participating…

  10. Construction and Validation of an Observational Scale of Neighborhood Characteristics

    ERIC Educational Resources Information Center

    McDonell, James R.; Waters, Tracy J.

    2011-01-01

    This paper reports the development and validation of the Neighborhood Observation Scale, a 41 item measure of neighborhood physical appearance, social appearance, safety, and amenities. Three independent ratings were collected on each of 244 neighborhoods in 132 census block groups in five South Carolina counties, for a total of 732 observations.…

  11. Converting Soil Moisture Observations to Effective Values for Improved Validation of Remotely Sensed Soil Moisture

    NASA Technical Reports Server (NTRS)

    Laymon, Charles A.; Crosson, William L.; Limaye, Ashutosh; Manu, Andrew; Archer, Frank

    2005-01-01

    We compare soil moisture retrieved with an inverse algorithm with observations of mean moisture in the 0-6 cm soil layer. A significant discrepancy is noted between the retrieved and observed moisture. Using emitting depth functions as weighting functions to convert the observed mean moisture to observed effective moisture removes nearly one-half of the discrepancy noted. This result has important implications in remote sensing validation studies.
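
    The conversion from a mean to an effective moisture value can be pictured as a depth-weighted average. The abstract does not give the form of the emitting depth functions, so the sketch below assumes an exponentially decaying weight with a hypothetical e-folding depth; the layer depths and moisture values are illustrative only.

```python
import math


def effective_moisture(depths_cm, moisture, e_folding_cm):
    """Depth-weighted mean moisture.

    Assumption: the weight decays exponentially with depth. This shape is
    chosen for illustration and is not taken from the abstract.
    """
    weights = [math.exp(-z / e_folding_cm) for z in depths_cm]
    total = sum(weights)
    return sum(w * m for w, m in zip(weights, moisture)) / total


# Hypothetical profile: layer mid-depths (cm) and volumetric moisture.
depths = [1.0, 3.0, 5.0]
profile = [0.10, 0.20, 0.30]  # drier near the surface

plain_mean = sum(profile) / len(profile)
effective = effective_moisture(depths, profile, e_folding_cm=2.0)
print(plain_mean, effective)
```

    With a profile that is drier near the surface, the weighted value falls below the plain layer mean, which is the kind of shift that would change the apparent discrepancy between retrieved and observed moisture.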

  12. Earth as an Extrasolar Planet: Earth Model Validation Using EPOXI Earth Observations

    NASA Technical Reports Server (NTRS)

    Robinson, Tyler D.; Meadows, Victoria S.; Crisp, David; Deming, Drake; A'Hearn, Michael F.; Charbonneau, David; Livengood, Timothy A.; Seager, Sara; Barry, Richard; Hearty, Thomas

    2011-01-01

    The EPOXI Discovery Mission of Opportunity reused the Deep Impact flyby spacecraft to obtain spatially and temporally resolved visible photometric and moderate resolution near-infrared (NIR) spectroscopic observations of Earth. These remote observations provide a rigorous validation of whole-disk Earth model simulations used to better understand remotely detectable extrasolar planet characteristics. We have used these data to upgrade, correct, and validate the NASA Astrobiology Institute's Virtual Planetary Laboratory three-dimensional line-by-line, multiple-scattering spectral Earth model (Tinetti et al., 2006a,b). This comprehensive model now includes specular reflectance from the ocean and explicitly includes atmospheric effects such as Rayleigh scattering, gas absorption, and temperature structure. We have used this model to generate spatially and temporally resolved synthetic spectra and images of Earth for the dates of EPOXI observation. Model parameters were varied to yield an optimum fit to the data. We found that a minimum spatial resolution of approximately 100 pixels on the visible disk, and four categories of water clouds, which were defined using observed cloud positions and optical thicknesses, were needed to yield acceptable fits. The validated model provides a simultaneous fit to the Earth's lightcurve, absolute brightness, and spectral data, with a root-mean-square error of typically less than 3% for the multiwavelength lightcurves, and residuals of approximately 10% for the absolute brightness throughout the visible and NIR spectral range. We extend our validation into the mid-infrared by comparing the model to high spectral resolution observations of Earth from the Atmospheric Infrared Sounder, obtaining a fit with residuals of approximately 7%, and temperature errors of less than 1 K in the atmospheric window. For the purpose of understanding the observable characteristics of the distant Earth at arbitrary viewing geometry and observing cadence, our validated

  13. Earth as an Extrasolar Planet: Earth Model Validation Using EPOXI Earth Observations

    NASA Astrophysics Data System (ADS)

    Robinson, Tyler D.; Meadows, Victoria S.; Crisp, David; Deming, Drake; A'Hearn, Michael F.; Charbonneau, David; Livengood, Timothy A.; Seager, Sara; Barry, Richard K.; Hearty, Thomas; Hewagama, Tilak; Lisse, Carey M.; McFadden, Lucy A.; Wellnitz, Dennis D.

    2011-06-01

    The EPOXI Discovery Mission of Opportunity reused the Deep Impact flyby spacecraft to obtain spatially and temporally resolved visible photometric and moderate resolution near-infrared (NIR) spectroscopic observations of Earth. These remote observations provide a rigorous validation of whole-disk Earth model simulations used to better understand remotely detectable extrasolar planet characteristics. We have used these data to upgrade, correct, and validate the NASA Astrobiology Institute's Virtual Planetary Laboratory three-dimensional line-by-line, multiple-scattering spectral Earth model. This comprehensive model now includes specular reflectance from the ocean and explicitly includes atmospheric effects such as Rayleigh scattering, gas absorption, and temperature structure. We have used this model to generate spatially and temporally resolved synthetic spectra and images of Earth for the dates of EPOXI observation. Model parameters were varied to yield an optimum fit to the data. We found that a minimum spatial resolution of ∼100 pixels on the visible disk, and four categories of water clouds, which were defined by using observed cloud positions and optical thicknesses, were needed to yield acceptable fits. The validated model provides a simultaneous fit to Earth's lightcurve, absolute brightness, and spectral data, with a root-mean-square (RMS) error of typically less than 3% for the multiwavelength lightcurves and residuals of ∼10% for the absolute brightness throughout the visible and NIR spectral range. We have extended our validation into the mid-infrared by comparing the model to high spectral resolution observations of Earth from the Atmospheric Infrared Sounder, obtaining a fit with residuals of ∼7% and brightness temperature errors of less than 1 K in the atmospheric window. For the purpose of understanding the observable characteristics of the distant Earth at arbitrary viewing geometry and observing cadence, our validated forward model can be

  14. Earth as an extrasolar planet: Earth model validation using EPOXI earth observations.

    PubMed

    Robinson, Tyler D; Meadows, Victoria S; Crisp, David; Deming, Drake; A'hearn, Michael F; Charbonneau, David; Livengood, Timothy A; Seager, Sara; Barry, Richard K; Hearty, Thomas; Hewagama, Tilak; Lisse, Carey M; McFadden, Lucy A; Wellnitz, Dennis D

    2011-06-01

    The EPOXI Discovery Mission of Opportunity reused the Deep Impact flyby spacecraft to obtain spatially and temporally resolved visible photometric and moderate resolution near-infrared (NIR) spectroscopic observations of Earth. These remote observations provide a rigorous validation of whole-disk Earth model simulations used to better understand remotely detectable extrasolar planet characteristics. We have used these data to upgrade, correct, and validate the NASA Astrobiology Institute's Virtual Planetary Laboratory three-dimensional line-by-line, multiple-scattering spectral Earth model. This comprehensive model now includes specular reflectance from the ocean and explicitly includes atmospheric effects such as Rayleigh scattering, gas absorption, and temperature structure. We have used this model to generate spatially and temporally resolved synthetic spectra and images of Earth for the dates of EPOXI observation. Model parameters were varied to yield an optimum fit to the data. We found that a minimum spatial resolution of ∼100 pixels on the visible disk, and four categories of water clouds, which were defined by using observed cloud positions and optical thicknesses, were needed to yield acceptable fits. The validated model provides a simultaneous fit to Earth's lightcurve, absolute brightness, and spectral data, with a root-mean-square (RMS) error of typically less than 3% for the multiwavelength lightcurves and residuals of ∼10% for the absolute brightness throughout the visible and NIR spectral range. We have extended our validation into the mid-infrared by comparing the model to high spectral resolution observations of Earth from the Atmospheric Infrared Sounder, obtaining a fit with residuals of ∼7% and brightness temperature errors of less than 1 K in the atmospheric window. For the purpose of understanding the observable characteristics of the distant Earth at arbitrary viewing geometry and observing cadence, our validated forward model can be

  15. Earth as an Extrasolar Planet: Earth Model Validation Using EPOXI Earth Observations

    PubMed Central

    Meadows, Victoria S.; Crisp, David; Deming, Drake; A'Hearn, Michael F.; Charbonneau, David; Livengood, Timothy A.; Seager, Sara; Barry, Richard K.; Hearty, Thomas; Hewagama, Tilak; Lisse, Carey M.; McFadden, Lucy A.; Wellnitz, Dennis D.

    2011-01-01

    Abstract The EPOXI Discovery Mission of Opportunity reused the Deep Impact flyby spacecraft to obtain spatially and temporally resolved visible photometric and moderate resolution near-infrared (NIR) spectroscopic observations of Earth. These remote observations provide a rigorous validation of whole-disk Earth model simulations used to better understand remotely detectable extrasolar planet characteristics. We have used these data to upgrade, correct, and validate the NASA Astrobiology Institute's Virtual Planetary Laboratory three-dimensional line-by-line, multiple-scattering spectral Earth model. This comprehensive model now includes specular reflectance from the ocean and explicitly includes atmospheric effects such as Rayleigh scattering, gas absorption, and temperature structure. We have used this model to generate spatially and temporally resolved synthetic spectra and images of Earth for the dates of EPOXI observation. Model parameters were varied to yield an optimum fit to the data. We found that a minimum spatial resolution of ∼100 pixels on the visible disk, and four categories of water clouds, which were defined by using observed cloud positions and optical thicknesses, were needed to yield acceptable fits. The validated model provides a simultaneous fit to Earth's lightcurve, absolute brightness, and spectral data, with a root-mean-square (RMS) error of typically less than 3% for the multiwavelength lightcurves and residuals of ∼10% for the absolute brightness throughout the visible and NIR spectral range. We have extended our validation into the mid-infrared by comparing the model to high spectral resolution observations of Earth from the Atmospheric Infrared Sounder, obtaining a fit with residuals of ∼7% and brightness temperature errors of less than 1 K in the atmospheric window. For the purpose of understanding the observable characteristics of the distant Earth at arbitrary viewing geometry and observing cadence, our validated forward

  16. Assessing Attachment Security With the Attachment Q Sort: Meta-Analytic Evidence for the Validity of the Observer AQS

    ERIC Educational Resources Information Center

    van IJzendoorn, Marinus H.; Vereijken, Carolus M.J.L.; Bakermans-Kranenburg, Marian J.; Riksen-Walraven, Marianne J.

    2004-01-01

    The reliability and validity of the Attachment Q Sort (AQS; Waters & Deane, 1985) was tested in a series of meta-analyses on 139 studies with 13,835 children. The observer AQS security score showed convergent validity with Strange Situation procedure (SSP) security (r = .31) and excellent predictive validity with sensitivity measures (r = .39). Its…

  17. Children's Physical Activity While Gardening: Development of a Valid and Reliable Direct Observation Tool.

    PubMed

    Myers, Beth M; Wells, Nancy M

    2015-04-01

    Gardens are a promising intervention to promote physical activity (PA) and foster health. However, because of the unique characteristics of gardening, no extant tool can capture PA, postures, and motions that take place in a garden. The Physical Activity Research and Assessment tool for Garden Observation (PARAGON) was developed to assess children's PA levels, tasks, postures, and motions, associations, and interactions while gardening. PARAGON uses momentary time sampling in which a trained observer watches a focal child for 15 seconds and then records behavior for 15 seconds. Sixty-five children (38 girls, 27 boys) at 4 elementary schools in New York State were observed over 8 days. During the observation, children simultaneously wore Actigraph GT3X+ accelerometers. The overall interrater reliability was 88% agreement, and Ebel was .97. Percent agreement values for activity level (93%), garden tasks (93%), motions (80%), associations (95%), and interactions (91%) also met acceptable criteria. Validity was established by previously validated PA codes and by expected convergent validity with accelerometry. PARAGON is a valid and reliable observation tool for assessing children's PA in the context of gardening.
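
    The percent agreement figures quoted in this abstract are the share of observation intervals on which two raters recorded the same code. A minimal sketch follows; the function name and the activity codes are hypothetical and do not come from the PARAGON tool itself.

```python
def percent_agreement(codes_a, codes_b):
    """Percentage of intervals on which two observers recorded the same code."""
    if len(codes_a) != len(codes_b) or not codes_a:
        raise ValueError("need two non-empty code lists of equal length")
    matches = sum(1 for a, b in zip(codes_a, codes_b) if a == b)
    return 100.0 * matches / len(codes_a)


# Hypothetical activity-level codes from two observers over five intervals.
observer_1 = ["stand", "walk", "kneel", "walk", "stand"]
observer_2 = ["stand", "walk", "kneel", "stand", "stand"]

agreement = percent_agreement(observer_1, observer_2)
print(agreement)  # 4 of 5 intervals match
```

    Percent agreement does not correct for agreement expected by chance, which is one reason reliability studies usually report a second coefficient alongside it.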

  18. Assessing anger regulation in middle childhood: development and validation of a behavioral observation measure.

    PubMed

    Rohlf, Helena L; Krahé, Barbara

    2015-01-01

    An observational measure of anger regulation in middle childhood was developed that facilitated the in situ assessment of five maladaptive regulation strategies in response to an anger-eliciting task. 599 children aged 6-10 years (M = 8.12, SD = 0.92) participated in the study. Construct validity of the measure was examined through correlations with parent- and self-reports of anger regulation and anger reactivity. Criterion validity was established through links with teacher-rated aggression and social rejection measured by parent-, teacher-, and self-reports. The observational measure correlated significantly with parent- and self-reports of anger reactivity, whereas it was unrelated to parent- and self-reports of anger regulation. It also made a unique contribution to predicting aggression and social rejection.

  19. Assessing anger regulation in middle childhood: development and validation of a behavioral observation measure

    PubMed Central

    Rohlf, Helena L.; Krahé, Barbara

    2015-01-01

    An observational measure of anger regulation in middle childhood was developed that facilitated the in situ assessment of five maladaptive regulation strategies in response to an anger-eliciting task. 599 children aged 6–10 years (M = 8.12, SD = 0.92) participated in the study. Construct validity of the measure was examined through correlations with parent- and self-reports of anger regulation and anger reactivity. Criterion validity was established through links with teacher-rated aggression and social rejection measured by parent-, teacher-, and self-reports. The observational measure correlated significantly with parent- and self-reports of anger reactivity, whereas it was unrelated to parent- and self-reports of anger regulation. It also made a unique contribution to predicting aggression and social rejection. PMID:25964767

  20. Validation of SMAP Root Zone Soil Moisture Estimates with Improved Cosmic-Ray Neutron Probe Observations

    NASA Astrophysics Data System (ADS)

    Babaeian, E.; Tuller, M.; Sadeghi, M.; Franz, T.; Jones, S. B.

    2017-12-01

    Soil Moisture Active Passive (SMAP) soil moisture products are commonly validated based on point-scale reference measurements, despite the exorbitant spatial scale disparity. The difference between the measurement depth of point-scale sensors and the penetration depth of SMAP further complicates evaluation efforts. Cosmic-ray neutron probes (CRNP) with an approximately 500-m radius footprint provide an appealing alternative for SMAP validation. This study is focused on the validation of SMAP level-4 root zone soil moisture products with 9-km spatial resolution based on CRNP observations at twenty U.S. reference sites with climatic conditions ranging from semiarid to humid. The CRNP measurements are often biased by additional hydrogen sources such as surface water, atmospheric vapor, or mineral lattice water, which sometimes yield unrealistic moisture values in excess of the soil water storage capacity. These effects were removed during CRNP data analysis. Comparison of SMAP data with corrected CRNP observations revealed a very high correlation for most of the investigated sites, which opens new avenues for validation of current and future satellite soil moisture products.

  1. Gravity Waves Generated by Convection: A New Idealized Model Tool and Direct Validation with Satellite Observations

    NASA Astrophysics Data System (ADS)

    Alexander, M. Joan; Stephan, Claudia

    2015-04-01

    In climate models, gravity waves remain too poorly resolved to be directly modelled. Instead, simplified parameterizations are used to include gravity wave effects on model winds. A few climate models link some of the parameterized waves to convective sources, providing a mechanism for feedback between changes in convection and gravity wave-driven changes in circulation in the tropics and above high-latitude storms. These convective wave parameterizations are based on limited case studies with cloud-resolving models, but they are poorly constrained by observational validation, and tuning parameters have large uncertainties. Our new work distills results from complex, full-physics cloud-resolving model studies to essential variables for gravity wave generation. We use the Weather Research Forecast (WRF) model to study relationships between precipitation, latent heating/cooling and other cloud properties to the spectrum of gravity wave momentum flux above midlatitude storm systems. Results show the gravity wave spectrum is surprisingly insensitive to the representation of microphysics in WRF. This is good news for use of these models for gravity wave parameterization development since microphysical properties are a key uncertainty. We further use the full-physics cloud-resolving model as a tool to directly link observed precipitation variability to gravity wave generation. We show that waves in an idealized model forced with radar-observed precipitation can quantitatively reproduce instantaneous satellite-observed features of the gravity wave field above storms, which is a powerful validation of our understanding of waves generated by convection. The idealized model directly links observations of surface precipitation to observed waves in the stratosphere, and the simplicity of the model permits deep/large-area domains for studies of wave-mean flow interactions. This unique validated model tool permits quantitative studies of gravity wave driving of regional

  2. Satellite Based Soil Moisture Product Validation Using NOAA-CREST Ground and L-Band Observations

    NASA Astrophysics Data System (ADS)

    Norouzi, H.; Campo, C.; Temimi, M.; Lakhankar, T.; Khanbilvardi, R.

    2015-12-01

    Soil moisture content is among the most important physical parameters in hydrology, climate, and environmental studies. Many microwave-based satellite observations have been utilized to estimate this parameter. The Advanced Microwave Scanning Radiometer 2 (AMSR2) is one of many remote sensors that collect daily information on land surface soil moisture. However, many factors such as ancillary data and vegetation scattering can affect the signal and the estimation. Therefore, this information needs to be validated against some "ground-truth" observations. NOAA - Cooperative Remote Sensing and Technology (CREST) center at the City University of New York has a site located at Millbrook, NY with several in situ soil moisture probes and an L-Band radiometer similar to the Soil Moisture Active Passive (SMAP) one. This site is among the SMAP Cal/Val sites. Soil moisture information was measured at seven different locations from 2012 to 2015. Hydra probes are used to measure at six of these locations. This study utilizes the observations from in situ data and the L-Band radiometer close to the ground (at 3 meters height) to validate and to compare soil moisture estimates from AMSR2. Analysis of the measurements and AMSR2 indicated a weak correlation with the hydra probes and a moderate correlation with the Cosmic-ray Soil Moisture Observing System (COSMOS) probes. Several differences, including the difference between pixel size and point measurements, can cause these discrepancies. Some interpolation techniques are used to expand point measurements from 6 locations to the AMSR2 footprint. Finally, the effect of penetration depth in the microwave signal and inconsistencies with other ancillary data such as skin temperature are investigated to provide a better understanding in the analysis. The results show that the retrieval algorithm of AMSR2 is appropriate under certain circumstances. This validation algorithm and a similar study will be conducted for the SMAP mission. Keywords: Remote Sensing, Soil

  3. See Me, Feel Me. Using Physiology to Validate Behavioural Observations of Emotions of People with Severe or Profound Intellectual Disability

    ERIC Educational Resources Information Center

    Vos, P.; De Cock, P.; Petry, K.; Van Den Noortgate, W.; Maes, B.

    2013-01-01

    Background: Behavioural observations are the most frequently used source of information about emotions of people with severe or profound intellectual disabilities but have not yet been validated against other measures of emotion. In this study we wanted to validate the behavioural observations of emotions using respiration (rib cage contribution,…

  4. Using wound care algorithms: a content validation study.

    PubMed

    Beitz, J M; van Rijswijk, L

    1999-09-01

    Valid and reliable heuristic devices facilitating optimal wound care are lacking. The objectives of this study were to establish content validation data for a set of wound care algorithms, to identify their associated strengths and weaknesses, and to gain insight into the wound care decision-making process. Forty-four registered nurse wound care experts were surveyed and interviewed at national and regional educational meetings. Using a cross-sectional study design and an 83-item, 4-point Likert-type scale, this purposive sample was asked to quantify the degree of validity of the algorithms' decisions and components. Participants' comments were tape-recorded, transcribed, and themes were derived. On a scale of 1 to 4, the mean score of the entire instrument was 3.47 (SD +/- 0.87), the instrument's Content Validity Index was 0.86, and the individual Content Validity Index of 34 of 44 participants was > 0.8. Item scores were lower for those related to packing deep wounds (P < .001). No other significant differences were observed. Qualitative data analysis revealed themes of difficulty associated with wound assessment and care issues, that is, the absence of valid and reliable definitions. The wound care algorithms studied proved valid. However, the lack of valid and reliable wound assessment and care definitions hinders optimal use of these instruments. Further research documenting their clinical use is warranted. Research-based practice recommendations should direct the development of future valid and reliable algorithms designed to help nurses provide optimal wound care.
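
    A Content Validity Index on a 4-point scale is commonly computed as the proportion of experts who rate an item 3 or 4, averaged over items for the whole instrument. The abstract does not state which variant the authors used, so the sketch below assumes that common definition; the ratings are hypothetical.

```python
def item_cvi(ratings, relevant_from=3):
    """Share of experts rating an item at or above the relevance threshold.

    Assumption: ratings of 3 or 4 on a 4-point scale count as 'valid'.
    """
    relevant = sum(1 for r in ratings if r >= relevant_from)
    return relevant / len(ratings)


def scale_cvi(ratings_by_item):
    """Average of the item-level indices across all items."""
    return sum(item_cvi(r) for r in ratings_by_item) / len(ratings_by_item)


# Hypothetical ratings: three items, each rated by five experts.
ratings_by_item = [
    [4, 4, 3, 4, 3],  # all five experts rate 3 or 4
    [4, 3, 2, 4, 3],  # four of five
    [2, 3, 4, 1, 4],  # three of five
]

per_item = [item_cvi(r) for r in ratings_by_item]
overall = scale_cvi(ratings_by_item)
print(per_item, overall)
```

    Under this definition, an index above 0.8 means that more than four in five expert judgments rated the content as valid.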

  5. The Anaclitic-Introjective Depression Assessment: Development and preliminary validity of an observer-rated measure.

    PubMed

    Rost, Felicitas; Luyten, Patrick; Fonagy, Peter

    2018-03-01

    The two-configurations model developed by Blatt and colleagues offers a comprehensive conceptual and empirical framework for understanding depression. This model suggests that depressed patients struggle, at different developmental levels, with issues related to dependency (anaclitic issues) or self-definition (introjective issues), or a combination of both. This paper reports three studies on the development and preliminary validation of the Anaclitic-Introjective Depression Assessment, an observer-rated assessment tool of impairments in relatedness and self-definition in clinical depression based on the item pool of the Shedler-Westen Assessment Procedure. Study 1 describes the development of the measure using expert consensus rating and Q-methodology. Studies 2 and 3 report the assessment of its psychometric properties, preliminary reliability, and validity in a sample of 128 patients diagnosed with treatment-resistant depression. Four naturally occurring clusters of depressed patients were identified using Q-factor analysis, which, overall, showed meaningful and theoretically expected relationships with anaclitic/introjective prototypes as formulated by experts, as well as with clinical, social, occupational, global, and relational functioning. Taken together, findings reported in this paper provide preliminary evidence for the reliability and validity of the Anaclitic-Introjective Depression Assessment, an observer-rated measure that allows the detection of important nuanced differentiations between and within anaclitic and introjective depression. Copyright © 2017 John Wiley & Sons, Ltd.

  6. Incremental Validity of Test Session and Classroom Observations in a Multimethod Assessment of Attention Deficit/Hyperactivity Disorder

    ERIC Educational Resources Information Center

    McConaughy, Stephanie H.; Harder, Valerie S.; Antshel, Kevin M.; Gordon, Michael; Eiraldi, Ricardo; Dumenci, Levent

    2010-01-01

    This study tested the incremental validity of behavioral observations, over and above parent and teacher reports, for assessing symptoms of Attention Deficit/Hyperactivity Disorder (ADHD) in children ages 6 to 12, using the Test Observation Form (TOF) and Direct Observation Form (DOF) from the Achenbach System of Empirically Based Assessment. The…

  7. Validation and Inter-comparison Against Observations of GODAE Ocean View Ocean Prediction Systems

    NASA Astrophysics Data System (ADS)

    Xu, J.; Davidson, F. J. M.; Smith, G. C.; Lu, Y.; Hernandez, F.; Regnier, C.; Drevillon, M.; Ryan, A.; Martin, M.; Spindler, T. D.; Brassington, G. B.; Oke, P. R.

    2016-02-01

    For weather forecasts, validation of forecast performance is done at the end user level as well as by the meteorological forecast centers. In the development of Ocean Prediction Capacity, the same level of care for ocean forecast performance and validation is needed. Herein we present results from a validation against observations of 6 Global Ocean Forecast Systems under the GODAE OceanView International Collaboration Network. These systems include the Global Ocean Ice Forecast System (GIOPS) developed by the Government of Canada, two systems PSY3 and PSY4 from the French Mercator-Ocean Ocean Forecasting Group, the FOAM system from the UK Met Office, HYCOM-RTOFS from NOAA/NCEP/NWA of USA, and the Australian Bluelink-OceanMAPS system from the CSIRO, the Australian Meteorological Bureau and the Australian Navy. The observation data used in the comparison are sea surface temperature, sub-surface temperature, sub-surface salinity, sea level anomaly, and sea ice total concentration data. Results of the inter-comparison demonstrate forecast performance limits, strengths and weaknesses of each of the six systems. This work establishes validation protocols and routines by which all new prediction systems developed under the CONCEPTS Collaborative Network will be benchmarked prior to approval for operations. This includes anticipated delivery of CONCEPTS regional prediction systems over the next two years including a pan Canadian 1/12th degree resolution ice ocean prediction system and limited area 1/36th degree resolution prediction systems. The validation approach of comparing forecasts to observations at the time and location of the observation is called Class 4 metrics. It has been adopted by major international ocean prediction centers, and will be recommended to JCOMM-WMO as a routine validation approach for operational oceanography worldwide.
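
    The idea of comparing forecasts to observations at the time and location of each observation can be sketched with two summary statistics. The abstract does not list the statistics used, so bias and root-mean-square error are assumed here; the values are hypothetical and are taken to be already matched to the observation points.

```python
import math


def bias_and_rmse(forecast, observed):
    """Mean error and root-mean-square error over matched pairs.

    Each forecast value is assumed to have been interpolated to the time
    and location of the corresponding observation beforehand.
    """
    if len(forecast) != len(observed) or not forecast:
        raise ValueError("need two non-empty series of equal length")
    errors = [f - o for f, o in zip(forecast, observed)]
    bias = sum(errors) / len(errors)
    rmse = math.sqrt(sum(e * e for e in errors) / len(errors))
    return bias, rmse


# Hypothetical sea surface temperatures (degrees C) at three observation points.
forecast_sst = [10.0, 12.0, 14.0]
observed_sst = [11.0, 12.0, 12.0]

bias, rmse = bias_and_rmse(forecast_sst, observed_sst)
print(bias, rmse)
```

    Running the same function over the matched pairs of each forecast system would give directly comparable numbers, which is the point of an inter-comparison against a shared observation set.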

  8. Validating Pseudo-dynamic Source Models against Observed Ground Motion Data at the SCEC Broadband Platform, Ver 16.5

    NASA Astrophysics Data System (ADS)

    Song, S. G.

    2016-12-01

    Simulation-based ground motion prediction approaches have several benefits over empirical ground motion prediction equations (GMPEs). For instance, full 3-component waveforms can be produced and site-specific hazard analysis is also possible. However, it is important to validate them against observed ground motion data to confirm their efficiency and validity before practical uses. There have been community efforts for these purposes, which are supported by the Broadband Platform (BBP) project at the Southern California Earthquake Center (SCEC). In the simulation-based ground motion prediction approaches, it is a critical element to prepare a possible range of scenario rupture models. I developed a pseudo-dynamic source model for Mw 6.5-7.0 by analyzing a number of dynamic rupture models, based on 1-point and 2-point statistics of earthquake source parameters (Song et al. 2014; Song 2016). In this study, the developed pseudo-dynamic source models were tested against observed ground motion data at the SCEC BBP, Ver 16.5. The validation was performed at two stages. At the first stage, simulated ground motions were validated against observed ground motion data for past events such as the 1992 Landers and 1994 Northridge, California, earthquakes. At the second stage, they were validated against the latest version of empirical GMPEs, i.e., NGA-West2. The validation results show that the simulated ground motions produce ground motion intensities compatible with observed ground motion data at both stages. The compatibility of the pseudo-dynamic source models with the omega-square spectral decay and the standard deviation of the simulated ground motion intensities are also discussed in the study

  9. The Environmental Reward Observation Scale (EROS): development, validity, and reliability.

    PubMed

    Armento, Maria E A; Hopko, Derek R

    2007-06-01

    Researchers acknowledge a strong association between the frequency and duration of environmental reward and affective mood states, particularly in relation to the etiology, assessment, and treatment of depression. Given behavioral theories that outline environmental reward as a strong mediator of affect and the unavailability of an efficient, reliable, and valid self-report measure of environmental reward, we developed the Environmental Reward Observation Scale (EROS) and examined its psychometric properties. In Experiment 1, exploratory factor analysis supported a unidimensional 10-item measure with strong internal consistency and test-retest reliability. When administered to a replication sample, confirmatory factor analysis suggested an excellent fit to the 1-factor model and convergent/discriminant validity data supported the construct validity of the EROS. In Experiment 2, further support for the convergent validity of the EROS was obtained via moderate correlations with the Pleasant Events Schedule (PES; MacPhillamy & Lewinsohn, 1976). In Experiment 3, hierarchical regression supported the ecological validity of the EROS toward predicting daily diary reports of time spent in highly rewarding behaviors and activities. Above and beyond variance accounted for by depressive symptoms (BDI), the EROS was associated with significant incremental variance in accounting for time spent in both low and high reward behaviors. The EROS may represent a brief, reliable and valid measure of environmental reward that may improve the psychological assessment of negative mood states such as clinical depression.

  10. Students as Ground Observers for Satellite Cloud Retrieval Validation

    NASA Technical Reports Server (NTRS)

    Chambers, Lin H.; Costulis, P. Kay; Young, David F.; Rogerson, Tina M.

    2004-01-01

    The Students' Cloud Observations On-Line (S'COOL) Project was initiated in 1997 to obtain student observations of clouds coinciding with the overpass of the Clouds and the Earth's Radiant Energy System (CERES) instruments on NASA's Earth Observing System satellites. Over the past seven years we have accumulated more than 9,000 cases worldwide where student observations are available within 15 minutes of a CERES observation. This paper reports on comparisons between the student and satellite data as one facet of the validation of the CERES cloud retrievals. Available comparisons include cloud cover, cloud height, cloud layering, and cloud visual opacity. The large volume of comparisons allows some assessment of the impact of surface cover, such as snow and ice, reported by the students. The S'COOL observation database, accessible via the Internet at http://scool.larc.nasa.gov, contains over 32,000 student observations and is growing by over 700 observations each month. Some of these observations may be useful for assessment of other satellite cloud products. In particular, some observing sites have been making hourly observations of clouds during the school day to learn about the diurnal cycle of cloudiness.

  11. Dental neglect and adverse birth outcomes: a validation and observational study.

    PubMed

    Acharya, S; Pentapati, K C; Bhat, P V

    2013-05-01

    The objectives of this study were to validate the Indian translation of the Dental Neglect Scale (DNS) among a sample of parturient Indian women and to investigate dental neglect as a possible risk indicator in adverse birth outcomes. Three hundred and sixteen parturient women were administered the DNS and the Modified Dental Beliefs Scale (MDBS) and were also clinically examined for oral health status. Information regarding socio-economic status, weeks of gestation and birth weight was also collected. A gestation period of less than 37 weeks was considered as preterm and a birth weight of less than 2500 gm as 'low birth weight'. The Indian version of the DNS was found to be reliable (Cronbach's Alpha = 0.72) and valid for assessing dental neglect among the women. Factor analysis of the DNS revealed a two-factor structure accounting for 56% variance. Dental neglect was higher among those with poorer oral health status, lower socio-economic and educational status. Multinomial logistic regression showed high dental neglect and negative dental beliefs and not poor oral health, as significant risk indicators for occurrence of adverse birth outcomes. The finding of an association of adverse birth outcomes with dental neglect and beliefs, but not with poor oral health could be due to the influence of other more important general factors which had a direct bearing on birth outcomes. There is a need for further research to assess the role of behavioural factors like dental neglect as risk indicators for adverse birth outcomes. © 2012 John Wiley & Sons A/S.

  12. Rediscovery rate estimation for assessing the validation of significant findings in high-throughput studies.

    PubMed

    Ganna, Andrea; Lee, Donghwan; Ingelsson, Erik; Pawitan, Yudi

    2015-07-01

    It is common and advised practice in biomedical research to validate experimental or observational findings in a population different from the one where the findings were initially assessed. This practice increases the generalizability of the results and decreases the likelihood of reporting false-positive findings. Validation becomes critical when dealing with high-throughput experiments, where the large number of tests increases the chance to observe false-positive results. In this article, we review common approaches to determine statistical thresholds for validation and describe the factors influencing the proportion of significant findings from a 'training' sample that are replicated in a 'validation' sample. We refer to this proportion as rediscovery rate (RDR). In high-throughput studies, the RDR is a function of false-positive rate and power in both the training and validation samples. We illustrate the application of the RDR using simulated data and real data examples from metabolomics experiments. We further describe an online tool to calculate the RDR using t-statistics. We foresee two main applications. First, if the validation study has not yet been collected, the RDR can be used to decide the optimal combination between the proportion of findings taken to validation and the size of the validation study. Secondly, if a validation study has already been done, the RDR estimated using the training data can be compared with the observed RDR from the validation data; hence, the success of the validation study can be assessed. © The Author 2014. Published by Oxford University Press. For Permissions, please email: journals.permissions@oup.com.
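
    The statement that the RDR depends on the false-positive rate and power in both samples can be made concrete with a simplified Python sketch. This is not the estimator from the article or its online tool: it assumes independent training and validation samples, a single common power for all true effects, and it ignores the direction of effects. The function name and all parameter values are hypothetical.

```python
def expected_rdr(pi0, alpha_train, power_train, alpha_valid, power_valid):
    """Expected share of training-significant findings that are also
    significant in validation, under the simplifying assumptions above.

    pi0 is the assumed proportion of tested features with no true effect.
    """
    # Probability that a feature is significant in the training sample
    sig_train = pi0 * alpha_train + (1.0 - pi0) * power_train
    # Probability that a feature is significant in both samples
    sig_both = (pi0 * alpha_train * alpha_valid
                + (1.0 - pi0) * power_train * power_valid)
    return sig_both / sig_train

# Hypothetical scenario: 90% nulls, 5% thresholds, 80% power in both samples
rdr = expected_rdr(pi0=0.9, alpha_train=0.05, power_train=0.8,
                   alpha_valid=0.05, power_valid=0.8)

# A stricter training threshold with the same power raises the expected RDR
rdr_strict = expected_rdr(pi0=0.9, alpha_train=0.001, power_train=0.8,
                          alpha_valid=0.05, power_valid=0.8)
```

    In this toy scenario only about half of the training findings are expected to replicate, even with 80% power in both samples, because false positives from the large pool of null features dilute the set taken forward to validation.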

  13. Validation of MODIS Aerosol Observations Over the Netherlands With GLOBE Student Observations: Lessons Learned

    NASA Astrophysics Data System (ADS)

    de Vroom, J.; Boersma, K. F.

    2006-12-01

    We have established a network of secondary schools in the Netherlands (www.knmi.nl/globe) with students routinely measuring aerosol optical thickness (AOT) at two wavelengths with hand-held Sun photometers. Students have performed more than 400 measurements between January 2002 and October 2005 over more than 12 locations within the Netherlands as a contribution to Global Learning and Observations to Benefit the Environment (GLOBE). Results from a theoretical error analysis indicate that GLOBE measurements achieve a precision better than 0.02 AOT for both channels. Comparisons with professional instruments generally give high correlations and low scatter and bias. From these tests, we conclude that student data are scientifically valid and may be used to validate MODIS AOT retrievals over the Netherlands. A manuscript on this study has been accepted by AGU's Journal of Geophysical Research. In this presentation, we will address the pros and cons of setting up a student-based network. Issues such as effective training, the importance of regular school visits, and the need for an intermediate partner will be discussed. As stated in the outlook of our manuscript: routine has it that involved parties are often short of time, and that incidental school visits are not only hard to organize, but also often abandoned. This is regrettable, as some schools, after a promising start, fail to continue their measurement record. In summary, school visits are essential to maintaining a project such as the one described in this study and helping it prosper, and should be performed as often as possible.

  14. Comparing Parent-Child Interactions in the Clinic and at Home: An Exploration of the Validity of Clinical Behavior Observations Using Sequential Analysis

    ERIC Educational Resources Information Center

    Shriver, Mark D.; Frerichs, Lynae J.; Williams, Melissa; Lancaster, Blake M.

    2013-01-01

    Direct observation is often considered the "gold standard" for assessing the function, frequency, and intensity of problem behavior. Currently, the literature investigating the construct validity of direct observation conducted in the clinic setting reveals conflicting results. Previous studies on the construct validity of clinic-based…

  15. The GRACE Checklist: A Validated Assessment Tool for High Quality Observational Studies of Comparative Effectiveness.

    PubMed

    Dreyer, Nancy A; Bryant, Allison; Velentgas, Priscilla

    2016-10-01

    a predictor of quality in all 4 trees. When a composite outcome of the 3 quality measures was used, the GRACE Checklist showed high sensitivity and specificity (71.43% and 80.95%, respectively). The GRACE Checklist stands out from other consensus-driven and expert guidance documents because of its extensive validation efforts. This most recent work shows that the checklist has strong sensitivity and specificity, increasing its utility as a screening tool to identify high-quality observational comparative effectiveness research worthy of in-depth review and applicability for decision support. No outside funding supported this research. All authors are full-time employees of Quintiles, which provides research and consulting services to the biopharmaceutical industry. The authors have no other disclosures to report. Two of the 3 CART trees were presented at the International Society of Pharmacoepidemiology in 2015 ("Article Citations per Year" and "Journal Impact Factor"). The original validation study was published in the March 2014 issue of the Journal of Managed Care & Specialty Pharmacy. The checklist questions and scoring were included using a table that was originally published by this journal in 2014. Study concept and design were primarily contributed by Dreyer and Velentgas, along with Bryant. Bryant took the lead in data collection and analysis, along with Dreyer and Velentgas, and data interpretation was performed by Dreyer, Velentgas, and Bryant. The manuscript was written and revised primarily by Dreyer, along with Bryant and Velentgas.

  16. Reliability and criterion validity of an observation protocol for working technique assessments in cash register work.

    PubMed

    Palm, Peter; Josephson, Malin; Mathiassen, Svend Erik; Kjellberg, Katarina

    2016-06-01

    We evaluated the intra- and inter-observer reliability and criterion validity of an observation protocol, developed in an iterative process involving practicing ergonomists, for assessment of working technique during cash register work for the purpose of preventing upper extremity symptoms. Two ergonomists independently assessed 17 15-min videos of cash register work on two occasions each, as a basis for examining reliability. Criterion validity was assessed by comparing these assessments with meticulous video-based analyses by researchers. Intra-observer reliability was acceptable (i.e. proportional agreement >0.7 and kappa >0.4) for 10/10 questions. Inter-observer reliability was acceptable for only 3/10 questions. An acceptable inter-observer reliability combined with an acceptable criterion validity was obtained only for one working technique aspect, 'Quality of movements'. Thus, major elements of the cashiers' working technique could not be assessed with an acceptable accuracy from short periods of observations by one observer, such as often desired by practitioners. Practitioner Summary: We examined an observation protocol for assessing working technique in cash register work. It was feasible in use, but inter-observer reliability and criterion validity were generally not acceptable when working technique aspects were assessed from short periods of work. We recommend the protocol to be used for educational purposes only.
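
    The two reliability statistics named in this abstract, proportional agreement and kappa, can be sketched in a few lines of Python. The ratings below are invented for illustration and are not data from the study; the function names are hypothetical. The acceptability rule simply encodes the thresholds quoted in the abstract (proportional agreement >0.7 and kappa >0.4), and kappa is computed as the standard Cohen's kappa for two raters, which is an assumption about the variant the authors used.

```python
from collections import Counter

def proportional_agreement(a, b):
    """Share of items on which the two sets of ratings agree."""
    return sum(x == y for x, y in zip(a, b)) / len(a)

def cohen_kappa(a, b):
    """Cohen's kappa: agreement corrected for chance agreement."""
    n = len(a)
    p_observed = proportional_agreement(a, b)
    count_a, count_b = Counter(a), Counter(b)
    # Chance agreement from the product of the two raters' marginal rates
    p_expected = sum((count_a[c] / n) * (count_b[c] / n)
                     for c in set(a) | set(b))
    return (p_observed - p_expected) / (1.0 - p_expected)

def acceptable(a, b):
    """Thresholds quoted in the abstract: agreement > 0.7 and kappa > 0.4."""
    return proportional_agreement(a, b) > 0.7 and cohen_kappa(a, b) > 0.4

# Hypothetical yes/no answers to one protocol question for ten videos
observer_1 = [1, 1, 1, 0, 0, 0, 1, 0, 1, 1]
observer_2 = [1, 1, 0, 0, 0, 1, 1, 0, 1, 1]

po = proportional_agreement(observer_1, observer_2)
kappa = cohen_kappa(observer_1, observer_2)
ok = acceptable(observer_1, observer_2)
```

    The same functions apply to intra-observer reliability by passing one observer's ratings from two occasions instead of two observers' ratings from one occasion.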

  17. Validation of Radiometric Standards for the Laboratory Calibration of Reflected-Solar Earth Observing Satellite Instruments

    NASA Technical Reports Server (NTRS)

    Butler, James J.; Johnson, B. Carol; Rice, Joseph P.; Brown, Steven W.; Barnes, Robert A.

    2007-01-01

    Historically, the traceability of the laboratory calibration of Earth-observing satellite instruments to a primary radiometric reference scale (SI units) is the responsibility of each instrument builder. For the NASA Earth Observing System (EOS), a program has been developed using laboratory transfer radiometers, each with its own traceability to the primary radiance scale of a national metrology laboratory, to independently validate the radiances assigned to the laboratory sources of the instrument builders. The EOS Project Science Office also developed a validation program for the measurement of onboard diffuse reflecting plaques, which are also used as radiometric standards for Earth-observing satellite instruments. Summarized results of these validation campaigns, with an emphasis on the current state-of-the-art uncertainties in laboratory radiometric standards, will be presented. Future mission uncertainty requirements, and possible enhancements to the EOS validation program to ensure that those uncertainties can be met, will be presented.

  18. Validity of an observation method for assessing pain behavior in individuals with multiple sclerosis.

    PubMed

    Cook, Karon F; Roddey, Toni S; Bamer, Alyssa M; Amtmann, Dagmar; Keefe, Francis J

    2013-09-01

    Pain is a common and complex experience for individuals who live with multiple sclerosis (MS) and it interferes with physical, psychological, and social function. A valid and reliable tool for quantifying observed pain behaviors in MS is critical to understand how pain behaviors contribute to pain-related disability in this clinical population. To evaluate the reliability and validity of a pain behavioral observation protocol in individuals who have MS. Community-dwelling volunteers with MS (N=30), back pain (N=5), or arthritis (N=8) were recruited based on clinician referrals, advertisements, fliers, web postings, and participation in previous research. Participants completed the measures of pain severity, pain interference, and self-reported pain behaviors and were videotaped doing typical activities (e.g., walking and sitting). Two coders independently recorded frequencies of pain behaviors by category (e.g., guarding and bracing) and interrater reliability statistics were calculated. Naïve observers reviewed videotapes of individuals with MS and rated their pain. The Spearman's correlations were calculated between pain behavior frequencies and self-reported pain and pain ratings by naïve observers. Interrater reliability estimates indicated the reliability of pain codes in the MS sample. Kappa coefficients ranged from moderate (sighing=0.40) to substantial agreements (guarding=0.83). These values were comparable with those obtained in the combined back pain and arthritis sample. Concurrent validity was supported by correlations with self-reported pain (0.46-0.53) and with self-reports of pain behaviors (0.58). Construct validity was supported by a finding of 0.87 correlation between total pain behaviors observed by coders and mean pain ratings by naïve observers. Results support the use of the pain behavior observation protocol for assessing pain behaviors of individuals with MS. Valid assessments of pain behaviors of individuals with MS could lead to

  19. Validity of an Observation Method for Assessing Pain Behavior in Individuals With Multiple Sclerosis

    PubMed Central

    Cook, Karon F.; Roddey, Toni S.; Bamer, Alyssa M.; Amtmann, Dagmar; Keefe, Francis J

    2012-01-01

    Context Pain is a common and complex experience for individuals who live with multiple sclerosis (MS) that interferes with physical, psychological and social function. A valid and reliable tool for quantifying observed pain behaviors in MS is critical to understanding how pain behaviors contribute to pain-related disability in this clinical population. Objectives To evaluate the reliability and validity of a pain behavioral observation protocol in individuals who have MS. Methods Community-dwelling volunteers with multiple sclerosis (N=30), back pain (N=5), or arthritis (N=8) were recruited based on clinician referrals, advertisements, fliers, web postings, and participation in previous research. Participants completed measures of pain severity, pain interference, and self-reported pain behaviors and were videotaped doing typical activities (e.g., walking, sitting). Two coders independently recorded frequencies of pain behaviors by category (e.g., guarding, bracing) and inter-rater reliability statistics were calculated. Naïve observers reviewed videotapes of individuals with MS and rated their pain. Spearman correlations were calculated between pain behavior frequencies and self-reported pain and pain ratings by naïve observers. Results Inter-rater reliability estimates indicated the reliability of pain codes in the MS sample. Kappa coefficients ranged from moderate agreement (sighing = 0.40) to substantial agreement (guarding = 0.83). These values were comparable to those obtained in the combined back pain and arthritis sample. Concurrent validity was supported by correlations with self-reported pain (0.46-0.53) and with self-reports of pain behaviors (0.58). Construct validity was supported by finding of 0.87 correlation between total pain behaviors observed by coders and mean pain ratings by naïve observers. Conclusion Results support use of the pain behavior observation protocol for assessing pain behaviors of individuals with MS. Valid assessments of pain

  20. The reliability, validity, and feasibility of physical activity measurement in adults with traumatic brain injury: an observational study.

    PubMed

    Hassett, Leanne; Moseley, Anne; Harmer, Alison; van der Ploeg, Hidde P

    2015-01-01

    To determine the reliability and validity of the Physical Activity Scale for Individuals with a Physical Disability (PASIPD) in adults with severe traumatic brain injury (TBI) and estimate the proportion of the sample participants who fail to meet the World Health Organization guidelines for physical activity. A single-center observational study recruited a convenience sample of 30 community-based ambulant adults with severe TBI. Participants completed the PASIPD on 2 occasions, 1 week apart, and wore an accelerometer (ActiGraph GT3X; ActiGraph LLC, Pensacola, Florida) for the 7 days between these 2 assessments. The PASIPD test-retest reliability was substantial (intraclass correlation coefficient = 0.85; 95% confidence interval, 0.70-0.92), and the correlation with the accelerometer ranged from too low to be meaningful (R = 0.09) to moderate (R = 0.57). From device-based measurement of physical activity, 56% of participants failed to meet the World Health Organization physical activity guidelines. The PASIPD is a reliable measure of the type of physical activity people with severe TBI participate in, but it is not a valid measure of the amount of moderate to vigorous physical activity in which they engage. Accelerometers should be used to quantify moderate to vigorous physical activity in people with TBI.

  1. Coarse Scale In Situ Albedo Observations over Heterogeneous Land Surfaces and Validation Strategy

    NASA Astrophysics Data System (ADS)

    Xiao, Q.; Wu, X.; Wen, J.; BAI, J., Sr.

    2017-12-01

    To evaluate and improve the quality of coarse-pixel land surface albedo products, validation with ground measurements of albedo is crucial over the spatially and temporally heterogeneous land surface. The performance of albedo validation depends on the quality of ground-based albedo measurements at a corresponding coarse-pixel scale, which can be conceptualized as the "truth" value of albedo at coarse-pixel scale. Wireless sensor network (WSN) technology makes continuous observation at the large-pixel scale possible. Taking the albedo products as an example, this paper is dedicated to the validation of coarse-scale albedo products over heterogeneous surfaces based on WSN-observed data, with the aim of narrowing down the uncertainty of results caused by the spatial scaling mismatch between satellite and ground measurements over heterogeneous surfaces. The reference value of albedo at coarse-pixel scale can be obtained through an upscaling transform function based on all of the observations for that pixel. In the future, we will further improve existing methods and develop new ones that are better able to account for the spatio-temporal characteristics of surface albedo. Additionally, how to use the widely distributed single-site measurements over heterogeneous surfaces is also a question to be answered. Keywords: Remote sensing; Albedo; Validation; Wireless sensor network (WSN); Upscaling; Heterogeneous land surface; Albedo truth at coarse-pixel scale

  2. Development and Construct Validity of the Classroom Strategies Scale-Observer Form

    ERIC Educational Resources Information Center

    Reddy, Linda A.; Fabiano, Gregory; Dudek, Christopher M.; Hsu, Louis

    2013-01-01

    Research on progress monitoring has almost exclusively focused on student behavior and not on teacher practices. This article presents the development and validation of a new teacher observational assessment (Classroom Strategies Scale) of classroom instructional and behavioral management practices. The theoretical underpinnings and empirical…

  3. GOSAT validation out standing in the field: A case study of satellite validation using the SSEC Portable Atmospheric Research Center (SPARC)

    NASA Astrophysics Data System (ADS)

    Wagner, T. J.; Borg, L. A.; Feltz, M.; Gero, P. J.; Knuteson, R. O.; Olson, E.

    2016-12-01

    The Space Science and Engineering Center (SSEC) at the University of Wisconsin-Madison has developed the SSEC Portable Atmospheric Research Center (SPARC), a mobile 11 m trailer that houses numerous in situ and ground-based remote sensing instruments. Available instrumentation includes the Atmospheric Emitted Radiance Interferometer (AERI), a hyperspectral infrared radiometer from which trace gas concentrations and profiles of temperature and water vapor can be retrieved; the High Spectral Resolution Lidar (HSRL), a multichannel lidar capable of directly retrieving profiles of optical depth and backscatter depolarization; and a Doppler lidar wind profiler. The remote instrumentation suite is complemented by surface meteorology observations and a radiosonde ground station. Collectively, these instruments enable SPARC to participate in a wide variety of field studies, including meteorological field experiments and ground-based satellite calibration and validation studies. In August 2016, SPARC traveled to the Chequamegon National Forest in northern Wisconsin for a two week long deployment alongside the WLEF-TV tower. This 447 m tower houses long-term observations of thermodynamic and atmospheric composition at multiple heights, enabling studies of phenomena like atmospheric/land surface interactions and carbon uptake. During this deployment, SPARC launched radiosondes coincident with clear-sky overpasses of the Greenhouse gases Observing SATellite (GOSAT). Thermodynamic profiles from the radiosondes and AERI combined with the trace gas observations from the tower were used to validate the GOSAT observations of carbon dioxide and methane. The on-site presence of SPARC allowed for better characterization of the environment and greater observational certainty than was possible with the tower alone. Examples from this particular validation study as well as a discussion of how SPARC can contribute to other satellite calibration and validation investigations will be

  4. Airborne Observations and Satellite Validation: INTEX-A Experience and INTEX-B Plans

    NASA Technical Reports Server (NTRS)

    Crawford, James H.; Singh, Hanwant B.; Brune, William H.; Jacob, Daniel J.

    2005-01-01

    Intercontinental Chemical Transport Experiment (INTEX; http://cloudl.arc.nasa.gov) is an ongoing two-phase integrated atmospheric field experiment being performed over North America (NA). Its first phase (INTEX-A) was performed in the summer of 2004 and the second phase (INTEX-B) is planned for the early spring of 2006. The main goal of INTEX-NA is to understand the transport and transformation of gases and aerosols on transcontinental/intercontinental scales and to assess their impact on air quality and climate. Central to achieving this goal is the need to relate space-based observations with those from airborne and surface platforms. During INTEX-A, NASA's DC-8 was joined by some dozen other aircraft from a large number of European and North American partners to focus on the outflow of pollution from NA to the Atlantic. Several instances of Asian pollution over NA were also encountered. INTEX-A flight planning relied extensively on satellite observations, and in turn satellite validation (Terra, Aqua, and Envisat) was given high priority. Over 20 validation profiles were successfully carried out. DC-8 sampling of smoke from Alaskan fires and formaldehyde over forested regions, and simultaneous satellite observations of these, provided excellent opportunities for the interplay of these platforms. The planning for INTEX-B is currently underway, and a vast majority of "standard" and "research" products to be retrieved from Aura instruments will be measured during INTEX-B throughout the troposphere. INTEX-B will focus on the inflow of pollution from Asia to North America and validation of satellite observations with emphasis on Aura. Several national and international partners are expected to coordinate activities with INTEX-B, and we expect its scope to expand in the coming months. An important new development involves partnership with an NSF-sponsored campaign called MIRAGE (Megacity Impacts on Regional and Global Environments- Mexico City Pollution Outflow Field

  5. Validity of the Associated Symptom Criteria for Generalized Anxiety Disorder: Observations From the Singapore Mental Health Study.

    PubMed

    Lee, Siau Pheng; Ong, Clarissa; Vaingankar, Janhavi Ajit; Chong, Siow Ann; Subramaniam, Mythily

    2017-05-01

    Previous findings on the diagnostic validity and reliability of generalized anxiety disorder (GAD)-associated symptom criteria suggest a need for further evaluation. The current study examined convergent validity and specificity of GAD-associated symptoms in a representative Singapore community sample. The Singapore Mental Health Study was a cross-sectional epidemiological survey conducted among 6166 Singapore residents aged 18 and older. The Composite International Diagnostic Interview version 3.0 was used to diagnose mental disorders. Associated symptoms in the GAD criteria and autonomic hyperactivity symptoms showed convergent validity with a GAD diagnosis. However, associated symptoms of GAD were also linked to major depressive disorder (MDD), bipolar disorder, and obsessive-compulsive disorder, suggesting lack of adequate specificity. The inability of the diagnostic criteria to differentiate GAD from symptoms of other conditions highlights the need to better define its associated symptoms criteria. The relationship of overlapping symptoms between GAD and MDD is also discussed.

  6. The Validation of a Classroom Observation Instrument Based on the Construct of Teacher Adaptive Practice

    ERIC Educational Resources Information Center

    Loughland, Tony; Vlies, Penny

    2016-01-01

    Teacher adaptability is a key disposition for teachers that has been linked to outcomes of interests to schools. The aim of this study was to examine how the broader disposition of teacher adaptability might be observable as classroom-based adaptive practices using an argument-based approach to validation. The findings from the initial phase of…

  7. Validity of a practitioner-administered observational tool to measure physical activity, nutrition, and screen time in school-age programs.

    PubMed

    Lee, Rebekka M; Emmons, Karen M; Okechukwu, Cassandra A; Barrett, Jessica L; Kenney, Erica L; Cradock, Angie L; Giles, Catherine M; deBlois, Madeleine E; Gortmaker, Steven L

    2014-11-28

    Nutrition and physical activity interventions have been effective in creating environmental changes in afterschool programs. However, accurate assessment can be time-consuming and expensive as initiatives are scaled up for optimal population impact. This study aims to determine the criterion validity of a simple, low-cost, practitioner-administered observational measure of afterschool physical activity, nutrition, and screen time practices and child behaviors. Directors from 35 programs in three cities completed the Out-of-School Nutrition and Physical Activity Observational Practice Assessment Tool (OSNAP-OPAT) on five days. Trained observers recorded snacks served and obtained accelerometer data each day during the same week. Observations of physical activity participation and snack consumption were conducted on two days. Correlations were calculated to validate weekly average estimates from OSNAP-OPAT compared to criterion measures. Weekly criterion averages are based on 175 meals served, snack consumption of 528 children, and physical activity levels of 356 children. OSNAP-OPAT validly assessed serving water (r = 0.73), fruits and vegetables (r = 0.84), juice >4oz (r = 0.56), and grains (r = 0.60) at snack; sugary drinks (r = 0.70) and foods (r = 0.68) from outside the program; and children's water consumption (r = 0.56) (all p <0.05). Reports of physical activity time offered were correlated with accelerometer estimates (minutes of moderate and vigorous physical activity r = 0.59, p = 0.02; vigorous physical activity r = 0.63, p = 0.01). The reported proportion of children participating in moderate and vigorous physical activity was correlated with observations (r = 0.48, p = 0.03), as were reports of computer (r = 0.85) and TV/movie (r = 0.68) time compared to direct observations (both p < 0.01). OSNAP-OPAT can assist researchers and practitioners in validly assessing nutrition and physical

  8. The validity of the 4-Skills Scan: A double validation study.

    PubMed

    van Kernebeek, W G; de Kroon, M L A; Savelsbergh, G J P; Toussaint, H M

    2018-06-01

    Adequate gross motor skills are an essential aspect of a child's healthy development. Where physical education (PE) is part of the primary school curriculum, a strong curriculum-based emphasis on evaluation and support of motor skill development in PE is apparent. Monitoring motor development is then a task for the PE teacher. In order to fulfil this task, teachers need adequate tools. The 4-Skills Scan is a quick and easily manageable gross motor skill instrument; however, its validity has never been assessed. Therefore, the purpose of this study is to assess the construct and concurrent validity of both 4-Skills Scans (version 2007 and version 2015). A total of 212 primary school children (6-12 years old) were asked to participate in both versions of the 4-Skills Scan. For assessing construct validity, children covered an obstacle course with video recordings for observation by an expert panel. For concurrent validity, a comparison was made with the MABC-2, by calculating Pearson correlations. Multivariable linear regression analyses were performed to determine the contribution of each subscale to the construct of gross motor skills, according to the MABC-2 and the expert panel. Correlations between the 4-Skills Scans and expert valuations were moderate, with coefficients of .47 (version 2007) and .46 (version 2015). Correlations between the 4-Skills Scans and the MABC-2 (gross) were moderate (.56) for version 2007 and high (.64) for version 2015. It is concluded that both versions of the 4-Skills Scan are satisfactorily valid instruments for assessing gross motor skills during PE lessons. This article is protected by copyright. All rights reserved.

  9. Validation of NH3 satellite observations by ground-based FTIR measurements

    NASA Astrophysics Data System (ADS)

    Dammers, Enrico; Palm, Mathias; Van Damme, Martin; Shephard, Mark; Cady-Pereira, Karen; Capps, Shannon; Clarisse, Lieven; Coheur, Pierre; Erisman, Jan Willem

    2016-04-01

    Global emissions of reactive nitrogen have been increasing to an unprecedented level due to human activities and are estimated to be a factor of four larger than pre-industrial levels. Concentration levels of NOx are declining, but ammonia (NH3) levels are increasing around the globe. While NH3 at its current concentrations poses significant threats to the environment and human health, relatively little is known about the total budget and global distribution. Surface observations are sparse and mainly available for north-western Europe, the United States and China and are limited by the high costs and poor temporal and spatial resolution. Since the lifetime of atmospheric NH3 is short, on the order of hours to a few days, due to efficient deposition and fast conversion to particulate matter, the existing surface measurements are not sufficient to estimate global concentrations. Advanced space-based IR-sounders such as the Tropospheric Emission Spectrometer (TES), the Infrared Atmospheric Sounding Interferometer (IASI), and the Cross-track Infrared Sounder (CrIS) enable global observations of atmospheric NH3 that help overcome some of the limitations of surface observations. However, the satellite NH3 retrievals are complex and require extensive validation. To date, only a few dedicated satellite NH3 validation campaigns have been performed, with limited spatial, vertical or temporal coverage. Recently a retrieval methodology was developed for ground-based Fourier Transform Infrared Spectroscopy (FTIR) instruments to obtain vertical concentration profiles of NH3. Here we show the applicability of retrieved columns from nine globally distributed stations with a range of NH3 pollution levels to validate satellite NH3 products.

  10. Validation of Observations Obtained with a Liquid Mirror Telescope by Comparison with Sloan Digital Sky Survey Observations

    NASA Astrophysics Data System (ADS)

    Borra, E. F.

    2015-06-01

    The results of a search for peculiar astronomical objects using very low resolution spectra obtained with the NASA Orbital Debris Observatory (NODO) 3 m diameter liquid mirror telescope (LMT) are compared with results of spectra obtained with the Sloan Digital Sky Survey (SDSS). The main purpose of this comparison is to verify whether observations taken with this novel type of telescope are reliable. This comparison is important because LMTs are an inexpensive novel type of telescope that is very useful for astronomical surveys, particularly surveys in the time domain, and validation of the data taken with an LMT by comparison with data from a classical telescope will validate their reliability. We start from a published data analysis that classified as peculiar only 206 of the 18,000 astronomical objects observed with the NODO LMT. A total of 29 of these 206 objects were found in the SDSS. The reliability of the NODO data can be seen through the results of the detailed analysis that, in practice, incorrectly identified less than 0.3% of the 18,000 spectra as peculiar objects, most likely because they are variable stars. We conclude that the LMT gave reliable observations, comparable to those that would have been obtained with a telescope using a glass mirror.

  11. Validity Evidence in Scale Development: The Application of Cross Validation and Classification-Sequencing Validation

    ERIC Educational Resources Information Center

    Acar, Tülin

    2014-01-01

    In literature, it has been observed that many enhanced criteria are limited by factor analysis techniques. Besides examinations of statistical structure and/or psychological structure, such validity studies as cross validation and classification-sequencing studies should be performed frequently. The purpose of this study is to examine cross…

  12. Assessment of Interobserver Reliability in Nutrition Studies that Use Direct Observation of School Meals

    PubMed Central

    BAGLIO, MICHELLE L.; BAXTER, SUZANNE DOMEL; GUINN, CAROLINE H.; THOMPSON, WILLIAM O.; SHAFFER, NICOLE M.; FRYE, FRANCESCA H. A.

    2005-01-01

    This article (a) provides a general review of interobserver reliability (IOR) and (b) describes our method for assessing IOR for items and amounts consumed during school meals for a series of studies regarding the accuracy of fourth-grade children's dietary recalls validated with direct observation of school meals. A widely used validation method for dietary assessment is direct observation of meals. Although many studies utilize several people to conduct direct observations, few published studies indicate whether IOR was assessed. Assessment of IOR is necessary to determine that the information collected does not depend on who conducted the observation. Two strengths of our method for assessing IOR are that IOR was assessed regularly throughout the data collection period and that IOR was assessed for foods at the item and amount level instead of at the nutrient level. Adequate agreement among observers is essential to the reasoning behind using observation as a validation tool. Readers are encouraged to question the results of studies that fail to mention and/or to include the results for assessment of IOR when multiple people have conducted observations. PMID:15354155

  13. Using Lunar Observations to Validate In-Flight Calibrations of Clouds and Earth Radiant Energy System Instruments

    NASA Technical Reports Server (NTRS)

    Daniels, Janet L.; Smith, G. Louis; Priestley, Kory J.; Thomas, Susan

    2014-01-01

    The validation of in-orbit instrument performance requires stability in both instrument and calibration source. This paper describes a method of validation using lunar observations scanning near full moon by the Clouds and Earth Radiant Energy System (CERES) instruments. Unlike internal calibrations, the Moon offers an external source whose signal variance is predictable and non-degrading. From 2006 to present, in-orbit observations have become standardized and compiled for the Flight Models-1 and -2 aboard the Terra satellite, for Flight Models-3 and -4 aboard the Aqua satellite, and beginning 2012, for Flight Model-5 aboard Suomi-NPP. Instrument performance parameters which can be gleaned are detector gain, pointing accuracy and static detector point response function validation. Lunar observations are used to examine the stability of all three detectors on each of these instruments from 2006 to present. This validation method has yielded results showing trends per CERES data channel of 1.2% per decade or less.

  14. Measuring physical activity in preschoolers: Reliability and validity of The System for Observing Fitness Instruction Time for Preschoolers (SOFIT-P)

    PubMed Central

    Sharma, Shreela; Chuang, Ru-Jye; Skala, Katherine; Atteberry, Heather

    2012-01-01

    The purpose of this study is to describe the initial feasibility, reliability, and validity of an instrument to measure physical activity in preschoolers using direct observation. The System for Observing Fitness Instruction Time for Preschoolers was developed and tested among 3- to 6-year-old children over fall 2008 for feasibility and reliability (Phase I, n=67) and in fall 2009 for concurrent validity (Phase II, n=27). Phase I showed that preschoolers spent >75% of their active time at preschool in light physical activity. The mean inter-observer agreement scores were ≥.75 for physical activity level and type. Correlation coefficients, measuring construct validity between the lesson context and physical activity types with the activity levels, were moderately strong. Phase II showed moderately strong correlations ranging from .50 to .54 between the System for Observing Fitness Instruction Time for Preschoolers and Actigraph accelerometers for physical activity levels. The System for Observing Fitness Instruction Time for Preschoolers shows promising initial results as a new method for measuring physical activity among preschoolers. PMID:22485071

  15. Validating Components of Teacher Effectiveness: A Random Assignment Study of Value-Added, Observation, and Survey Scores

    ERIC Educational Resources Information Center

    Bacher-Hicks, Andrew; Chin, Mark; Kane, Thomas J.; Staiger, Douglas O.

    2015-01-01

    Policy changes from the past decade have resulted in a growing interest in identifying effective teachers and their characteristics. This study is the third study to use data from a randomized experiment to test the validity of measures of teacher effectiveness. The authors collected effectiveness measures across three school years from three…

  16. Southern Africa Validation of NASA's Earth Observing System (SAVE EOS)

    NASA Technical Reports Server (NTRS)

    Privette, Jeffrey L.

    2000-01-01

    Southern Africa Validation of EOS (SAVE) is a 4-year, multidisciplinary effort to validate operational and experimental products from Terra, the flagship satellite of NASA's Earth Observing System (EOS). At test sites from Zambia to South Africa, we are measuring soil, vegetation and atmospheric parameters over a range of ecosystems for comparison with products from Terra, Landsat 7, AVHRR and SeaWiFS. The data are also employed to parameterize and improve vegetation process models. Fixed-point and mobile "transect" sampling are used to collect the ground data. These are extrapolated over larger areas with fine-resolution multispectral imagery. We describe the sites, infrastructure, and measurement strategies developed under SAVE, as well as initial results from our participation in the first Intensive Field Campaign of SAFARI 2000. We also describe SAVE's role in the Kalahari Transect Campaign (February/March 2000) in Zambia and Botswana.

  17. Multi-Sensor Observations of Earthquake Related Atmospheric Signals over Major Geohazard Validation Sites

    NASA Technical Reports Server (NTRS)

    Ouzounov, D.; Pulinets, S.; Davindenko, D.; Hattori, K.; Kafatos, M.; Taylor, P.

    2012-01-01

    We are conducting a scientific validation study involving multi-sensor observations in our investigation of phenomena preceding major earthquakes. Our approach is based on a systematic analysis of several atmospheric and environmental parameters that we found to be associated with earthquakes, namely: thermal infrared radiation, outgoing long-wavelength radiation, ionospheric electron density, and atmospheric temperature and humidity. For the first time, we applied this approach to selected GEOSS sites prone to earthquakes or volcanoes. This provides a new opportunity to cross-validate our results with the dense networks of in-situ and space measurements. We investigated two different seismic aspects. First, we examined sites with recent large earthquakes, viz. Tohoku-oki (M9, 2011, Japan) and the Emilia region (M5.9, 2012, N. Italy). Our retrospective analysis of satellite data has shown the presence of anomalies in the atmosphere. Second, we did a retrospective analysis to check the re-occurrence of similar anomalous behavior in the atmosphere/ionosphere over three regions with distinct geological settings and high seismicity: Taiwan, Japan and Kamchatka, which include 40 major earthquakes (M>5.9) for the period of 2005-2009. We found anomalous behavior before all of these events with no false negatives; false positives were less than 10%. Our initial results suggest that multi-instrument space-borne and ground observations show a systematic appearance of atmospheric anomalies near the epicentral area that could be explained by a coupling between the observed physical parameters and earthquake preparation processes.

  18. Predictive validity of the classroom strategies scale-observer form on statewide testing scores: an initial investigation.

    PubMed

    Reddy, Linda A; Fabiano, Gregory A; Dudek, Christopher M; Hsu, Louis

    2013-12-01

    The present study examined the validity of a teacher observation measure, the Classroom Strategies Scale-Observer Form (CSS), as a predictor of student performance on statewide tests of mathematics and English language arts. The CSS is a teacher practice observational measure that assesses evidence-based instructional and behavioral management practices in elementary school. A series of two-level hierarchical generalized linear models were fitted to data of a sample of 662 third- through fifth-grade students to assess whether CSS Part 2 Instructional Strategy and Behavioral Management Strategy scale discrepancy scores (i.e., ∑ |recommended frequency − frequency ratings|) predicted statewide mathematics and English language arts proficiency scores when percentage of minority students in schools was controlled. Results indicated that the Instructional Strategy scale discrepancy scores significantly predicted mathematics and English language arts proficiency scores: Relatively larger discrepancies on observer ratings of what teachers did versus what should have been done were associated with lower proficiency scores. Results offer initial evidence of the predictive validity of the CSS Part 2 Instructional Strategy discrepancy scores on student academic outcomes. PsycINFO Database Record (c) 2013 APA, all rights reserved.

  19. Objectifying Content Validity: Conducting a Content Validity Study in Social Work Research.

    ERIC Educational Resources Information Center

    Rubio, Doris McGartland; Berg-Weger, Marla; Tebb, Susan S.; Lee, E. Suzanne; Rauch, Shannon

    2003-01-01

    The purpose of this article is to demonstrate how to conduct a content validity study. Instructions on how to calculate a content validity index, factorial validity index, and an interrater reliability index and guide for interpreting these indices are included. Implications regarding the value of conducting a content validity study for…

  20. Multi-parameter Observations and Validation of Pre-earthquake Atmospheric Signals

    NASA Astrophysics Data System (ADS)

    Ouzounov, D.; Pulinets, S. A.; Hattori, K.; Mogi, T.; Kafatos, M.

    2014-12-01

    We are presenting the latest developments in multi-sensor observations of short-term pre-earthquake phenomena preceding major earthquakes. We are exploring the potential of pre-seismic atmospheric and ionospheric signals to alert for large earthquakes. To achieve this, we have started validating anomalous ionospheric/atmospheric signals in retrospective and prospective modes. The integrated satellite and terrestrial framework (ISTF) is our method for validation and is based on a joint analysis of several physical and environmental parameters (satellite thermal infrared radiation (OLR), electron concentration in the ionosphere (GPS/TEC), VHF-band radio waves, radon/ion activities, air temperature and seismicity patterns) that were found to be associated with earthquakes. The science rationale for multidisciplinary analysis is based on the concept of Lithosphere-Atmosphere-Ionosphere Coupling (LAIC) [Pulinets and Ouzounov, 2011], which explains the synergy of different geospace processes and anomalous variations, usually named short-term pre-earthquake anomalies. Our validation process consists of two steps: (1) A continuous retrospective analysis performed over two different regions with high seismicity, Taiwan and Japan, for 2003-2009. The retrospective tests (100+ major earthquakes, M>5.9, Taiwan and Japan) show OLR anomalous behavior before all of these events with no false negatives. The false alarm ratio is less than 25%. (2) Prospective testing using multiple parameters with potential for M5.5+ events. The initial testing shows a systematic appearance of atmospheric anomalies in advance (days) of the M5.5+ events for Taiwan and Japan (Honshu and Hokkaido areas). Our initial prospective results suggest that our approach shows a systematic appearance of atmospheric anomalies one to several days prior to the largest earthquakes. That feature could be further studied and tested for advancing the multi-sensor detection of pre-earthquake atmospheric signals.

  1. Development and Validation of a Risk Scale for Emergence Agitation After General Anesthesia in Children: A Prospective Observational Study.

    PubMed

    Hino, Maai; Mihara, Takahiro; Miyazaki, Saeko; Hijikata, Toshiyuki; Miwa, Takaaki; Goto, Takahisa; Ka, Koui

    2017-08-01

    Emergence agitation (EA) is a common complication in children after general anesthesia. The goal of this 2-phase study was (1) to develop a predictive model (EA risk scale) for the incidence of EA in children receiving sevoflurane anesthesia by performing a retrospective analysis of data from our previous study (phase 1) and (2) to determine the validity of the EA risk scale in a prospective observational cohort study (phase 2). Using data collected from 120 patients in our previous study, logistic regression analysis was used to predict the incidence of EA in phase 1. The optimal combination of the predictors was determined by a stepwise selection procedure using Akaike information criterion. The β-coefficient for the selected predictors was calculated, and scores for predictors determined. The predictive ability of the EA risk scale was assessed by a receiver operating characteristic (ROC) curve, and the area under the ROC curve (c-index) was calculated with a 95% confidence interval (CI). In phase 2, the validity of the EA risk scale was confirmed using another data set of 100 patients (who underwent minor surgery under general anesthesia). The ROC curve, the c-index, the best cutoff point, and the sensitivity and specificity at the point were calculated. In addition, we calculated the gray zone, which ranges between the two points where sensitivity and specificity, respectively, become 90%. In phase 1, the final model of the multivariable logistic regression analysis included the following 4 predictors: age (logarithm odds ratios [OR], -0.38; 95% CI, -0.81 to 0.00), Pediatric Anesthesia Behavior score (logarithm OR, 0.65; 95% CI, -0.09 to 1.40), anesthesia time (logarithm OR, 0.60; 95% CI, -0.18 to 1.19), and operative procedure (logarithm OR, 2.53; 95% CI, 1.30-3.75 for strabismus surgery and logarithm OR, 2.71; 95% CI, 0.99-4.45 for tonsillectomy). The EA risk scale included these 4 predictors and ranged from 1 to 23 points. In phase 2, the incidence of EA

  2. Validation of an observation tool to assess physical activity-promoting physical education lessons in high schools: SOFIT+.

    PubMed

    Fairclough, Stuart J; Weaver, R Glenn; Johnson, Siobhan; Rawlinson, Jack

    2018-05-01

    SOFIT+ is an observation tool to measure teacher practices related to moderate-to-vigorous physical activity (MVPA) promotion during physical education (PE). The objective of the study was to examine the validity of SOFIT+ during high school PE lessons. This cross-sectional, observational study tested the construct validity of SOFIT+ in boys' and girls' high school PE lessons. Twenty-one PE lessons were video-recorded and retrospectively coded using SOFIT+. Students wore hip-mounted accelerometers during lessons as an objective measure of MVPA. Multinomial logistic regression was used to estimate the likelihood of students engaging in MVPA during different teacher practices represented by observed individual codes and a combined SOFIT+ index-score. Fourteen individual SOFIT+ variables demonstrated a statistically significant relationship with girls' and boys' MVPA. Observed lesson segments identified as high MVPA-promoting were related to an increased likelihood of girls engaging in 5-10 s (OR=2.86 [95% CI 2.41-3.40]), 15-25 s (OR=7.41 [95% CI 6.05-9.06]), and 30-40 s (OR=22.70 [95% CI 16.97-30.37]) of MVPA. For boys, observed high MVPA-promoting segments were related to an increased likelihood of engaging in 5-10 s (OR=1.71 [95% CI 1.45-2.01]), 15-25 s (OR=2.69 [95% CI 2.31-3.13]) and 30-40 s (OR=4.26 [95% CI 3.44-5.29]) of MVPA. Teacher practices during high school PE lessons are significantly related to students' participation in MVPA. SOFIT+ is a valid and reliable tool to examine relationships between PE teacher practices and student MVPA during PE. Copyright © 2017 Sports Medicine Australia. Published by Elsevier Ltd. All rights reserved.

  3. Validation of Mode-S Meteorological Routine Air Report aircraft observations

    NASA Astrophysics Data System (ADS)

    Strajnar, B.

    2012-12-01

    The success of mesoscale data assimilation depends on the availability of three-dimensional observations with high spatial and temporal resolution. This paper describes an example of such observations, available through Mode-S air traffic control system composed of ground radar and transponders on board the aircraft. The meteorological information is provided by interrogation of a dedicated meteorological data register, called Meteorological Routine Air Report (MRAR). MRAR provides direct measurements of temperature and wind, but is only returned by a small fraction of aircraft. The quality of Mode-S MRAR data, collected at the Ljubljana Airport, Slovenia, is assessed by its comparison with AMDAR and high-resolution radiosonde data sets, which enable high- and low-level validation, respectively. The need for temporal smoothing of raw Mode-S MRAR data is also studied. The standard deviation of differences between smoothed Mode-S MRAR and AMDAR is 0.35°C for temperature, 0.8 m/s for wind speed and below 10 degrees for wind direction. The differences with respect to radiosondes are larger, with standard deviations of approximately 1.7°C, 3 m/s and 25 degrees for temperature, wind speed and wind direction, respectively. It is concluded that both wind and temperature observations from Mode-S MRAR are accurate and therefore potentially very useful for data assimilation in numerical weather prediction models.
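
    The agreement figures quoted in this abstract are standard deviations of differences between collocated measurements from two observing systems. The following is a minimal sketch of that summary statistic, assuming already-matched pairs of values; the function name `difference_stats` and the temperatures are hypothetical, and the abstract does not say whether a sample or a population standard deviation was used (the sample form is assumed here).

```python
import statistics


def difference_stats(test_values, reference_values):
    """Return (mean difference, sample standard deviation of differences)."""
    if len(test_values) != len(reference_values) or len(test_values) < 2:
        raise ValueError("need two matched sequences with at least 2 pairs")
    diffs = [t - r for t, r in zip(test_values, reference_values)]
    # The mean difference estimates the bias; the standard deviation
    # describes the scatter of the test system about the reference.
    return statistics.mean(diffs), statistics.stdev(diffs)


# Hypothetical collocated temperatures (degrees C) from two systems.
aircraft_temps = [10.2, 11.1, 9.8, 10.5]
reference_temps = [10.0, 11.0, 10.0, 10.0]
bias, spread = difference_stats(aircraft_temps, reference_temps)
```

    This sketch suits quantities such as temperature and wind speed. Wind direction is an angle, so differences would first have to be wrapped into the range -180 to 180 degrees, which is not handled here.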

  4. Development and Validation of a Multidisciplinary Mobile Care System for Patients With Advanced Gastrointestinal Cancer: Interventional Observation Study.

    PubMed

    Soh, Ji Yeong; Cha, Won Chul; Chang, Dong Kyung; Hwang, Ji Hye; Kim, Kihyung; Rha, Miyong; Kwon, Hee

    2018-05-07

    Mobile health apps have emerged as supportive tools in the management of advanced cancers. However, only a few apps have self-monitoring features, and they are not standardized and validated. This study aimed to develop and validate a multidisciplinary mobile care system with self-monitoring features that can be useful for patients with advanced gastrointestinal cancer. The development of the multidisciplinary mobile health management system was divided into 3 steps. First, the service scope was set up, and the measurement tools were standardized. Second, the service flow of the mobile care system was organized. Third, the mobile app (Life Manager) was developed. The app was developed to achieve 3 major clinical goals: support for quality of life, nutrition, and rehabilitation. Three main functional themes were developed to achieve clinical goals: a to-do list, health education, and in-app chat. Thirteen clinically oriented measures were included: the modified Patient-Reported Outcomes version of the Common Terminology Criteria for Adverse Events questionnaire, Scored Patient-Generated Subjective Global Assessment (PG-SGA), distress, European Organization for Research and Treatment of Cancer Quality of Life Questionnaire, International Physical Activity Questionnaire-Short Form, Low anterior resection syndrome score, satisfaction rate, etc. To validate the system, a prospective observational study was conducted. Patients with gastric cancer or colon cancer undergoing chemotherapy were recruited. We followed the subjects for 12 weeks, and selected clinical measures were taken online and offline. After the development process, a multidisciplinary app, the Life Manager, was launched. For evaluation, 203 patients were recruited for the study, of whom 101 (49.8%) had gastric cancer, and 102 (50.2%) were receiving palliative care. Most patients were in their fifties (35.5%), and 128 (63.1%) were male. Overall, 176 subjects (86.7%) completed the study. 
Among subjects who

  5. Validation of a Brief Questionnaire Against Direct Observation to Assess Adolescents' School Lunchtime Beverage Consumption.

    PubMed

    Grummon, Anna H; Hampton, Karla E; Hecht, Amelie; Oliva, Ariana; McCulloch, Charles E; Brindis, Claire D; Patel, Anisha I

    Beverage consumption is an important determinant of youth health outcomes. Beverage interventions often occur in schools, yet no brief validated questionnaires exist to assess whether these efforts improve in-school beverage consumption. This study validated a brief questionnaire to assess beverage consumption during school lunch. Researchers observed middle school students' (n = 25) beverage consumption during school lunchtime using a standardized tool. After lunch, students completed questionnaires regarding their lunchtime beverage consumption. Kappa statistics compared self-reported with observed beverage consumption across 15 beverage categories. Eight beverages showed at least fair agreement (kappa [κ] > 0.20) for both type and amount consumed, with most showing substantial agreement (κ > 0.60). One beverage had high raw agreement but κ < 0.20. Six beverages had too few ratings to compute κ's. This brief questionnaire was useful for assessing school lunchtime consumption of many beverages and provides a low-cost tool for evaluating school-based beverage interventions. Copyright © 2017 Society for Nutrition Education and Behavior. Published by Elsevier Inc. All rights reserved.
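
    The kappa statistic used in this abstract corrects raw agreement for the agreement expected by chance. The following is a minimal sketch of Cohen's kappa for two sets of categorical ratings, not the authors' code: the function name `cohens_kappa` and the example data are hypothetical, and the abstract does not state which kappa variant was applied to the amount-consumed ratings.

```python
from collections import Counter


def cohens_kappa(ratings_a, ratings_b):
    """Return Cohen's kappa for two equal-length sequences of category labels."""
    if len(ratings_a) != len(ratings_b) or not ratings_a:
        raise ValueError("need two non-empty sequences of equal length")
    n = len(ratings_a)
    # Observed agreement: share of items given the same label by both sources.
    observed = sum(a == b for a, b in zip(ratings_a, ratings_b)) / n
    # Chance agreement: product of the marginal proportions, summed over labels.
    counts_a = Counter(ratings_a)
    counts_b = Counter(ratings_b)
    labels = counts_a.keys() | counts_b.keys()
    expected = sum(counts_a[k] * counts_b[k] for k in labels) / (n * n)
    if expected == 1:
        raise ValueError("kappa is undefined when chance agreement equals 1")
    return (observed - expected) / (1 - expected)


# Hypothetical data: 1 = beverage consumed, 0 = not consumed, for ten students.
self_report = [0, 0, 0, 0, 0, 0, 0, 0, 0, 1]
observation = [0, 0, 0, 0, 0, 0, 0, 0, 1, 0]
kappa_rare = cohens_kappa(self_report, observation)
```

    In this made-up example the two sources agree on 8 of 10 students, yet kappa is slightly negative (about -0.11) because nearly everyone falls in the same category, so chance agreement is already 0.82. That is one way a rarely consumed beverage can show high raw agreement together with a low kappa.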

  6. The development and validity of the Salford Gait Tool: an observation-based clinical gait assessment tool.

    PubMed

    Toro, Brigitte; Nester, Christopher J; Farren, Pauline C

    2007-03-01

    To develop the construct, content, and criterion validity of the Salford Gait Tool (SF-GT) and to evaluate agreement between gait observations using the SF-GT and kinematic gait data. Tool development and comparative evaluation. University in the United Kingdom. For designing construct and content validity, convenience samples of 10 children with hemiplegic, diplegic, and quadriplegic cerebral palsy (CP) and 152 physical therapy students and 4 physical therapists were recruited. For developing criterion validity, kinematic gait data of 13 gait clusters containing 56 children with hemiplegic, diplegic, and quadriplegic CP and 11 neurologically intact children was used. For clinical evaluation, a convenience sample of 23 pediatric physical therapists participated. We developed a sagittal plane observational gait assessment tool through a series of design, test, and redesign iterations. The tool's grading system was calibrated using kinematic gait data of 13 gait clusters and was evaluated by comparing the agreement of gait observations using the SF-GT with kinematic gait data. Criterion standard kinematic gait data. There was 58% mean agreement based on grading categories and 80% mean agreement based on degree estimations evaluated with the least significant difference method. The new SF-GT has good concurrent criterion validity.

  7. Validation of TOMS Aerosol Products using AERONET Observations

    NASA Technical Reports Server (NTRS)

    Bhartia, P. K.; Torres, O.; Sinyuk, A.; Holben, B.

    2002-01-01

    The Total Ozone Mapping Spectrometer (TOMS) aerosol algorithm uses measurements of radiances at two near UV channels in the range 331-380 nm to derive aerosol optical depth and single scattering albedo. Because of the low near UV surface albedo of all terrestrial surfaces (between 0.02 and 0.08), the TOMS algorithm has the capability of retrieving aerosol properties over the oceans and the continents. The Aerosol Robotic Network (AERONET) routinely derives spectral aerosol optical depth and single scattering albedo at a large number of sites around the globe. We have performed comparisons of both aerosol optical depth and single scattering albedo derived from TOMS and AERONET. In general, the TOMS aerosol products agree well with the ground-based observations. Results of this validation will be discussed.

  8. The development and validation of The Inquiry Science Observation Coding Sheet.

    PubMed

    Brandon, P R; Taum, A K H; Young, D B; Pottenger, F M

    2008-08-01

    Evaluation reports increasingly document the degree of program implementation, particularly the extent to which programs adhere to prescribed steps and procedures. Many reports are cursory, however, and few, if any, fully portray the long and winding path taken when developing evaluation instruments, particularly observation instruments. In this article, we describe the development of an observational method for evaluating the degree to which K-12 inquiry science programs are implemented, including the many steps and decisions that occurred during the development, and present evidence for the reliability and validity of the data that we collected with the instrument. The article introduces a method for measuring the adherence of inquiry science implementation and gives evaluators a full picture of what they might expect when developing observation instruments for assessing the degree of program implementation.

  9. Validation of SCIAMACHY and TOMS UV Radiances Using Ground and Space Observations

    NASA Technical Reports Server (NTRS)

    Hilsenrath, E.; Bhartia, P. K.; Bojkov, B. R.; Kowalewski, M.; Labow, G.; Ahmad, Z.

    2004-01-01

    Verification of a stratospheric ozone recovery remains a high priority for environmental research and policy definition. Models predict an ozone recovery at a much lower rate than the measured depletion rate observed to date. Therefore improved precision of the satellite and ground ozone observing systems is required over the long term to verify its recovery. We show that validation of satellite radiances from space and from the ground can be a very effective means for correcting long term drifts of backscatter type satellite measurements and can be used to cross calibrate all BUV instruments in orbit (TOMS, SBUV/2, GOME, SCIAMACHY, OMI, GOME-2, OMPS). This method bypasses the retrieval algorithms used for both satellite and ground based measurements that are normally used to validate and correct the satellite data. Radiance comparisons employ forward models and are inherently more accurate than inverse (retrieval) algorithms. This approach however requires well calibrated instruments and an accurate radiative transfer model that accounts for aerosols. TOMS and SCIAMACHY calibrations are checked to demonstrate this method and to demonstrate applicability for long term trends.

  10. PSI-Center Validation Studies

    NASA Astrophysics Data System (ADS)

    Nelson, B. A.; Akcay, C.; Glasser, A. H.; Hansen, C. J.; Jarboe, T. R.; Marklin, G. J.; Milroy, R. D.; Morgan, K. D.; Norgaard, P. C.; Shumlak, U.; Sutherland, D. A.; Victor, B. S.; Sovinec, C. R.; O'Bryan, J. B.; Held, E. D.; Ji, J.-Y.; Lukin, V. S.

    2014-10-01

    The Plasma Science and Innovation Center (PSI-Center - http://www.psicenter.org) supports collaborating validation platform experiments with 3D extended MHD simulations using the NIMROD, HiFi, and PSI-TET codes. Collaborators include the Bellan Plasma Group (Caltech), CTH (Auburn U), HBT-EP (Columbia), HIT-SI (U Wash-UW), LTX (PPPL), MAST (Culham), Pegasus (U Wisc-Madison), SSX (Swarthmore College), TCSU (UW), and ZaP/ZaP-HD (UW). The PSI-Center is exploring application of validation metrics between experimental data and simulation results. Biorthogonal decomposition (BOD) is used to compare experiments with simulations. BOD separates data sets into spatial and temporal structures, giving greater weight to dominant structures. Several BOD metrics are being formulated with the goal of quantitative validation. Results from these simulation and validation studies, as well as an overview of the PSI-Center status, will be presented.

  11. Ethical validity of palliative sedation therapy: a multicenter, prospective, observational study conducted on specialized palliative care units in Japan.

    PubMed

    Morita, Tatsuya; Chinone, Yoshikazu; Ikenaga, Masayuki; Miyoshi, Makoto; Nakaho, Toshimichi; Nishitateno, Kenji; Sakonji, Mitsuaki; Shima, Yasuo; Suenaga, Kazuyuki; Takigawa, Chizuko; Kohara, Hiroyuki; Tani, Kazuhiko; Kawamura, Yasuo; Matsubara, Tatsuhiro; Watanabe, Akihiko; Yagi, Yasuo; Sasaki, Toru; Higuchi, Akiko; Kimura, Hideyuki; Abo, Hirofumi; Ozawa, Taketoshi; Kizawa, Yoshiyuki; Uchitomi, Yosuke

    2005-10-01

    Although palliative sedation therapy is often required in terminally ill cancer patients to achieve acceptable symptom relief, empirical data supporting the ethical validity of this approach are lacking. The primary aim of this study was to systematically investigate whether empirical evidence supports the ethical validity of sedation. This was a multicenter, prospective, observational study, which was conducted by 21 specialized palliative care units in Japan. One-hundred two consecutive adult cancer patients who received continuous deep sedation were enrolled. Continuous deep sedation was defined as the continuous use of sedative medications to relieve intolerable and refractory distress by achieving almost or complete unconsciousness until death. Prior to the study, we conceptualized the ethical validity of sedation from the viewpoints of physicians' intent, proportionality, and autonomy. Sedation was performed mainly with midazolam and phenobarbital. The initial doses of midazolam and phenobarbital were 1.5 mg/hour and 20 mg/hour, respectively. Main administration routes were continuous subcutaneous infusion and continuous intravenous infusion, and no rapid intravenous injection was reported. Of 59 patients who received artificial hydration or could intake adequate fluids/foods orally before sedation, 63% received artificial hydration therapy after sedation, and in the remaining patients, artificial hydration was withheld or withdrawn due to fluid retention symptoms and/or patient wishes. Of 66 patients who were able to verbally express themselves, 95% explicitly stated that symptoms were intolerable. The etiologies of the symptoms requiring sedation were primarily related to the progression of the underlying malignancy, such as cancer cachexia and organ failure, and standard palliative treatments had failed: steroids in 68% of patients with fatigue, opioids in 95% of patients with dyspnea, antisecretion medications in 75% of patients with bronchial secretion

  12. Patients' and observers' perceptions of involvement differ. Validation study on inter-relating measures for shared decision making.

    PubMed

    Kasper, Jürgen; Heesen, Christoph; Köpke, Sascha; Fulcher, Gary; Geiger, Friedemann

    2011-01-01

    Patient involvement into medical decisions as conceived in the shared decision making method (SDM) is essential in evidence based medicine. However, it is not conclusively evident how best to define, realize and evaluate involvement to enable patients making informed choices. We aimed at investigating the ability of four measures to indicate patient involvement. While use and reporting of these instruments might imply wide overlap regarding the addressed constructs this assumption seems questionable with respect to the diversity of the perspectives from which the assessments are administered. The study investigated a nested cohort (N = 79) of a randomized trial evaluating a patient decision aid on immunotherapy for multiple sclerosis. Convergent validities were calculated between observer ratings of videotaped physician-patient consultations (OPTION) and patients' perceptions of the communication (Shared Decision Making Questionnaire, Control Preference Scale & Decisional Conflict Scale). OPTION reliability was high to excellent. Communication performance was low according to OPTION and high according to the three patient administered measures. No correlations were found between observer and patient judges, neither for means nor for single items. Patient report measures showed some moderate correlations. Existing SDM measures do not refer to a single construct. A gold standard is missing to decide whether any of these measures has the potential to indicate patient involvement. Pronounced heterogeneity of the underpinning constructs implies difficulties regarding the interpretation of existing evidence on the efficacy of SDM. Consideration of communication theory and basic definitions of SDM would recommend an inter-subjective focus of measurement. Controlled-Trials.com ISRCTN25267500.

  13. Evolution of Precipitation Structure During the November DYNAMO MJO Event: Cloud-Resolving Model Intercomparison and Cross Validation Using Radar Observations

    NASA Astrophysics Data System (ADS)

    Li, Xiaowen; Janiga, Matthew A.; Wang, Shuguang; Tao, Wei-Kuo; Rowe, Angela; Xu, Weixin; Liu, Chuntao; Matsui, Toshihisa; Zhang, Chidong

    2018-04-01

    Evolution of precipitation structures are simulated and compared with radar observations for the November Madden-Julian Oscillation (MJO) event during the DYNAmics of the MJO (DYNAMO) field campaign. Three ground-based, ship-borne, and spaceborne precipitation radars and three cloud-resolving models (CRMs) driven by observed large-scale forcing are used to study precipitation structures at different locations over the central equatorial Indian Ocean. Convective strength is represented by 0-dBZ echo-top heights, and convective organization by contiguous 17-dBZ areas. The multi-radar and multi-model framework allows for more stringent model validations. The emphasis is on testing models' ability to simulate subtle differences observed at different radar sites when the MJO event passed through. The results show that CRMs forced by site-specific large-scale forcing can reproduce not only common features in cloud populations but also subtle variations observed by different radars. The comparisons also revealed common deficiencies in CRM simulations where they underestimate radar echo-top heights for the strongest convection within large, organized precipitation features. Cross validations with multiple radars and models also enable quantitative comparisons in CRM sensitivity studies using different large-scale forcing, microphysical schemes and parameters, resolutions, and domain sizes. In terms of radar echo-top height temporal variations, many model sensitivity tests have better correlations than radar/model comparisons, indicating robustness in model performance on this aspect. It is further shown that well-validated model simulations could be used to constrain uncertainties in observed echo-top heights when the low-resolution surveillance scanning strategy is used.

  14. Validation studies and proficiency testing.

    PubMed

    Anklam, Elke; Heinze, Petra; Kay, Simon; Van den Eede, Guy; Popping, Bert

    2002-01-01

    Genetically modified organisms (GMOs) entered the European food market in 1996. Current legislation demands the labeling of food products if they contain >1% GMO, as assessed for each ingredient of the product. To create confidence in the testing methods and to complement enforcement requirements, there is an urgent need for internationally validated methods, which could serve as reference methods. To date, several methods have been submitted to validation trials at an international level; approaches now exist that can be used in different circumstances and for different food matrixes. Moreover, the requirement for the formal validation of methods is clearly accepted; several national and international bodies are active in organizing studies. Further validation studies, especially on the quantitative polymerase chain reaction methods, need to be performed to cover the rising demand for new extraction methods and other background matrixes, as well as for novel GMO constructs.

  15. 40 CFR 761.395 - A validation study.

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    ... 40 Protection of Environment 31 2011-07-01 2011-07-01 false A validation study. 761.395 Section... PROHIBITIONS Comparison Study for Validating a New Performance-Based Decontamination Solvent Under § 761.79(d)(4) § 761.395 A validation study. (a) Decontaminate the following prepared sample surfaces using the...

  16. 40 CFR 761.395 - A validation study.

    Code of Federal Regulations, 2010 CFR

    2010-07-01

    ... 40 Protection of Environment 30 2010-07-01 2010-07-01 false A validation study. 761.395 Section... PROHIBITIONS Comparison Study for Validating a New Performance-Based Decontamination Solvent Under § 761.79(d)(4) § 761.395 A validation study. (a) Decontaminate the following prepared sample surfaces using the...

  17. [Validity and clinical utility of an observation grid (PACSLAC-F) to evaluate pain in seniors with dementia living in long-term care].

    PubMed

    Aubin, Michèle; Verreault, René; Savoie, Maryse; LeMay, Sylvie; Hadjistavropoulos, Thomas; Fillion, Lise; Beaulieu, Marie; Viens, Chantal; Bergeron, Rénald; Vézina, Lucie; Misson, Lucie; Fuchs-Lacelle, Shannon

    2008-01-01

    This study presents the validation of the French Canadian version (PACSLAC-F) of the Pain Assessment Checklist for Seniors with Limited Ability to Communicate (PACSLAC). Unlike the published validation of the English version of the PACSLAC, which was validated retrospectively, the French version was validated prospectively. The PACSLAC-F was completed by nurses working in long-term care facilities after observing 86 seniors, with severe cognitive impairment, in calm, painful or distressing but non-painful situations. The test-retest and inter-observer reliability, the internal consistency, and the discriminant validity were found to be satisfactory. To evaluate the convergent validity with the DOLOPLUS-2 and the clinical relevance of the PACSLAC, it was also completed by nurses during their work shift, with 26 additional patients, for three days per week during a period of four weeks. These results encourage us to test the PACSLAC in a comprehensive program of pain management targeting this population.

  18. 40 CFR 761.395 - A validation study.

    Code of Federal Regulations, 2013 CFR

    2013-07-01

    ... 40 Protection of Environment 32 2013-07-01 2013-07-01 false A validation study. 761.395 Section...)(4) § 761.395 A validation study. (a) Decontaminate the following prepared sample surfaces using the... must be 10 µg/100 cm2, then the validation study failed and the solvent may not be used for...

  19. 40 CFR 761.395 - A validation study.

    Code of Federal Regulations, 2014 CFR

    2014-07-01

    ... 40 Protection of Environment 31 2014-07-01 2014-07-01 false A validation study. 761.395 Section...)(4) § 761.395 A validation study. (a) Decontaminate the following prepared sample surfaces using the... must be 10 µg/100 cm2, then the validation study failed and the solvent may not be used for...

  20. 40 CFR 761.395 - A validation study.

    Code of Federal Regulations, 2012 CFR

    2012-07-01

    ... 40 Protection of Environment 32 2012-07-01 2012-07-01 false A validation study. 761.395 Section...)(4) § 761.395 A validation study. (a) Decontaminate the following prepared sample surfaces using the... must be 10 µg/100 cm2, then the validation study failed and the solvent may not be used for...

  1. Reliability and Validity of Observational Risk Screening in Evaluating Dynamic Knee Valgus

    PubMed Central

    Ekegren, Christina L.; Miller, William C.; Celebrini, Richard G.; Eng, Janice J.; MacIntyre, Donna L.

    2012-01-01

    Study Design Nonexperimental methodological study. Objectives To determine the interrater and intrarater reliability and validity of using observational risk screening guidelines to evaluate dynamic knee valgus. Background A deficiency in the neuromuscular control of the hip has been identified as a key risk factor for non-contact anterior cruciate ligament (ACL) injury in post pubescent females. This deficiency can manifest itself as a valgus knee alignment during tasks involving hip and knee flexion. There are currently no scientifically tested methods to screen for dynamic knee valgus in the clinic or on the field. Methods Three physiotherapists used observational risk screening guidelines to rate 40 adolescent female soccer players according to their risk of ACL injury. The rating was based on the amount of dynamic knee valgus observed on a drop jump landing. Ratings were evaluated for intrarater and interrater agreement using kappa coefficients. Sensitivity and specificity of ratings were evaluated by comparing observational ratings with measurements obtained using 3-dimensional (3D) motion analysis. Results Kappa coefficients for intrarater and interrater agreement ranged from 0.75 to 0.85, indicating that ratings were reasonably consistent over time and between physiotherapists. Sensitivity values were inadequate, ranging from 67–87%. This indicated that raters failed to detect up to a third of “truly high risk” individuals. Specificity values ranged from 60–72% which was considered adequate for the purposes of the screen. Conclusion Observational risk screening is a practical and cost-effective method of screening for ACL injury risk. Rater agreement and specificity were acceptable for this method but sensitivity was not. To detect a greater proportion of individuals at risk of ACL injury, coaches and clinicians should ensure that they include additional tests for other high risk characteristics in their screening protocols. PMID:19721212
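
    The agreement and accuracy statistics named in the abstract above (kappa coefficients, sensitivity, specificity) can be illustrated with a minimal sketch. The function names and the toy ratings are hypothetical and are not taken from the study; the kappa shown is the unweighted two-rater (Cohen) form, which is assumed here because the abstract does not say which variant was used.

```python
from collections import Counter


def cohen_kappa(ratings_a, ratings_b):
    """Unweighted Cohen's kappa for two raters scoring the same subjects."""
    if len(ratings_a) != len(ratings_b) or not ratings_a:
        raise ValueError("ratings must be non-empty and of equal length")
    n = len(ratings_a)
    # Proportion of subjects on which the two raters agree.
    observed = sum(a == b for a, b in zip(ratings_a, ratings_b)) / n
    # Agreement expected by chance, from each rater's marginal frequencies.
    count_a, count_b = Counter(ratings_a), Counter(ratings_b)
    expected = sum(count_a[c] * count_b[c] for c in count_a) / (n * n)
    if expected == 1.0:
        raise ValueError("kappa is undefined when chance agreement is 1")
    return (observed - expected) / (1.0 - expected)


def sensitivity_specificity(screen_positive, truly_positive):
    """Sensitivity and specificity of a screen against a reference standard."""
    pairs = list(zip(screen_positive, truly_positive))
    tp = sum(1 for s, t in pairs if s and t)
    fn = sum(1 for s, t in pairs if not s and t)
    tn = sum(1 for s, t in pairs if not s and not t)
    fp = sum(1 for s, t in pairs if s and not t)
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("need at least one positive and one negative case")
    return tp / (tp + fn), tn / (tn + fp)


# Hypothetical toy data: 1 = rated "high risk", 0 = rated "low risk".
rater_a = [1, 1, 1, 0, 0, 0, 1, 0, 1, 0]
rater_b = [1, 1, 0, 0, 0, 0, 1, 0, 1, 1]
reference = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]  # stand-in for a 3D motion analysis result

kappa = cohen_kappa(rater_a, rater_b)
sens, spec = sensitivity_specificity(rater_a, reference)
print(f"kappa={kappa:.2f} sensitivity={sens:.2f} specificity={spec:.2f}")
```

    A sensitivity of 0.75 in this toy case means that one in four truly high-risk subjects would be missed by the screen, which is the kind of shortfall the abstract describes.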

  2. The Role of Anchor Stations in the Validation of Earth Observation Satellite Data and Products. The Valencia and the Alacant Anchor Stations

    NASA Astrophysics Data System (ADS)

    Lopez-Baeza, Ernesto; Geraldo Ferreira, A.; Saleh-Contell, Kauzar

    Space technology facilitates humanity and science with a global revolutionary view of the Earth through the acquisition of Earth Observation satellite data. Satellites capture information over different spatial and temporal scales and assist in understanding natural climate processes and in detecting and explaining climate change. Accurate Earth Observation data is needed to describe climate processes by improving the parameterisations of different climate elements. Algorithms to produce geophysical parameters from raw satellite observations should go through selection processes or participate in inter-comparison programmes to ensure performance reliability. Geophysical parameter datasets, obtained from satellite observations, should pass a quality control before they are accepted in global databases for impact, diagnostic or sensitivity studies. Calibration and Validation, or simply "Cal/Val", is the activity that endeavours to ensure that remote sensing products are highly consistent and reproducible. This is an evolving scientific activity that is becoming increasingly important as more long-term studies on global change are undertaken, and new satellite missions are launched. Calibration is the process of quantitatively defining the system responses to known, controlled signal inputs. Validation refers to the process of assessing, by independent means, the quality of the data products derived from the system outputs. These definitions are generally accepted and most often used in the remote sensing context to refer specifically and respectively to sensor radiometric calibration and geophysical parameter validation. Anchor Stations are carefully selected locations at which instruments measure quantities that are needed to run, calibrate or validate models and algorithms. These are needed to quantitatively evaluate satellite data and convert it into geophysical information. The instruments collect measurements of basic quantities over a long timescale

  3. 40 CFR 761.392 - Preparing validation study samples.

    Code of Federal Regulations, 2010 CFR

    2010-07-01

    ... 40 Protection of Environment 30 2010-07-01 2010-07-01 false Preparing validation study samples..., AND USE PROHIBITIONS Comparison Study for Validating a New Performance-Based Decontamination Solvent Under § 761.79(d)(4) § 761.392 Preparing validation study samples. (a)(1) To validate a procedure to...

  4. 40 CFR 761.392 - Preparing validation study samples.

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    ... 40 Protection of Environment 31 2011-07-01 2011-07-01 false Preparing validation study samples..., AND USE PROHIBITIONS Comparison Study for Validating a New Performance-Based Decontamination Solvent Under § 761.79(d)(4) § 761.392 Preparing validation study samples. (a)(1) To validate a procedure to...

  5. 29 CFR 1607.5 - General standards for validity studies.

    Code of Federal Regulations, 2010 CFR

    2010-07-01

    ... 29 Labor 4 2010-07-01 2010-07-01 false General standards for validity studies. 1607.5 Section 1607... studies. A. Acceptable types of validity studies. For the purposes of satisfying these guidelines, users may rely upon criterion-related validity studies, content validity studies or construct validity...

  6. GOSAT TIR radiometric validation toward simultaneous GHG column and profile observation

    NASA Astrophysics Data System (ADS)

    Kataoka, F.; Knuteson, R. O.; Kuze, A.; Shiomi, K.; Suto, H.; Saitoh, N.

    2015-12-01

    The Greenhouse gases Observing SATellite (GOSAT) was launched in January 2009 and has continued its operation for more than six years. The thermal and near infrared sensor for carbon observation Fourier-Transform Spectrometer (TANSO-FTS) onboard GOSAT measures greenhouse gases (GHG), such as CO2 and CH4, with wide and high resolution spectra from shortwave infrared (SWIR) to thermal infrared (TIR). This instrument has the advantage of being able to measure simultaneously the same field of view in different spectral ranges. The combination of column GHG from the SWIR band and vertical profile GHG from the TIR band provides a better understanding of GHG and its distribution, especially in the troposphere. This work describes the radiometric validation and sensitivity analysis of TANSO-FTS TIR spectra, especially the CO2, atmospheric window and CH4 channels, with forward calculation. In this evaluation, we used accurate in-situ datasets: the HIPPO (HIAPER Pole-to-Pole Observation) airplane observation data and the GOSAT vicarious calibration and validation campaign data from Railroad Valley, NV. The HIPPO aircraft campaign took an accurate atmospheric vertical profile dataset (T, RH, O3, CO2, CH4, N2O, CO) approximately pole-to-pole from the surface to the tropopause over the ocean. We used these datasets for forward calculation and made a spectral correction model with respect to wavenumber and internal calibration blackbody temperature. The GOSAT vicarious calibration campaign has been conducted every year since 2009 near the summer solstice in Railroad Valley, a high-temperature desert site. In this campaign, we have measured temperature and humidity by a radiosonde and CO2, CH4 and O3 profiles by the AJAX airplane at the time of the GOSAT overpass. Sometimes, the GHG profiles over Railroad Valley show air mass advection in the mid-troposphere depending on the upper wind. These advections bring different concentrations of GHG into the lower and upper troposphere. Using these cases, we made

  7. Fires and Smoke Observed from the Earth Observing System MODIS Instrument: Products, Validation, and Operational Use

    NASA Technical Reports Server (NTRS)

    Kaufman, Y. J.; Ichoku, C.; Giglio, L.; Korontzi, S.; Chu, D. A.; Hao, W. M.; Justice, C. O.; Lau, William K. M. (Technical Monitor)

    2001-01-01

    The MODIS sensor, launched on NASA's Terra satellite at the end of 1999, was designed with 36 spectral channels for a wide array of land, ocean, and atmospheric investigations. MODIS has a unique ability to observe fires, smoke, and burn scars globally. Its main fire detection channels saturate at high brightness temperatures: 500 K at 4 microns and 400 K at 11 microns, which can only be attained in rare circumstances at the 1 km fire detection spatial resolution. Thus, unlike other polar orbiting satellite sensors with similar thermal and spatial resolutions, but much lower saturation temperatures (e.g. AVHRR and ATSR), MODIS can distinguish between low intensity ground surface fires and high intensity crown forest fires. Smoke column concentration over land is for the first time being derived from the MODIS solar channels, extending from 0.41 microns to 2.1 microns. The smoke product has been provisionally validated both globally and regionally over southern Africa and central and south America. Burn scars are observed from MODIS even in the presence of smoke, using the 1.2 to 2.1 micron channels. MODIS burned area information is used to estimate pyrogenic emissions. A wide range of these fire and related products and validation are demonstrated for the wild fires that occurred in northwestern United States in the summer of 2000. The MODIS rapid response system and direct broadcast capability is being developed to enable users to obtain and generate data in near real time. It is expected that health and land management organizations will use these systems for monitoring the occurrence of fires and the dispersion of smoke within two to six hours after data acquisition.

  8. A Test of Model Validation from Observed Temperature Trends

    NASA Astrophysics Data System (ADS)

    Singer, S. F.

    2006-12-01

    How much of current warming is due to natural causes and how much is manmade? This requires a comparison of the patterns of observed warming with the best available models that incorporate both anthropogenic (greenhouse gases and aerosols) as well as natural climate forcings (solar and volcanic). Fortunately, we have the just published U.S.-Climate Change Science Program (CCSP) report (www.climatescience.gov/Library/sap/sap1-1/finalreport/default.htm), based on best current information. As seen in Fig. 1.3F of the report, modeled surface temperature trends change little with latitude, except for a stronger warming in the Arctic. The observations, however, show a strong surface warming in the northern hemisphere but not in the southern hemisphere (see Fig. 3.5C and 3.6D). The Antarctic is found to be cooling and Arctic temperatures, while currently rising, were higher in the 1930s than today. Although the Executive Summary of the CCSP report claims "clear evidence" for anthropogenic warming, based on comparing tropospheric and surface temperature trends, the report itself does not confirm this. Greenhouse models indicate that the tropics should provide the most sensitive location for their validation; trends there should increase by 200-300 percent with altitude, peaking at around 10 kilometers. The observations, however, show the opposite: flat or even decreasing tropospheric trend values (see Fig. 3.7 and also Fig. 5.7E). This disparity is demonstrated most strikingly in Fig. 5.4G, which shows the difference between surface and troposphere trends for a collection of models (displayed as a histogram) and for balloon and satellite data. [The disparities are less apparent in the Summary, which displays model results in terms of "range" rather than as histograms.] There may be several possible reasons for the disparity: Instrumental and other effects that exaggerate or otherwise distort observed temperature trends. Or, more likely: Shortcomings in models that result

  9. Performing a Content Validation Study.

    ERIC Educational Resources Information Center

    Spool, Mark D.

    Content validity is concerned with three components: (1) the job content; (2) the test content, and (3) the strength of the relationship between the two. A content validation study, to be considered adequate and defensible should include at least the following four procedures: (1) A thorough and accurate job analysis (to define the job content);…

  10. Discriminant validity study of Achilles enthesis ultrasound.

    PubMed

    Expósito Molinero, María Rosa; de Miguel Mendieta, Eugenio

    2016-01-01

    We want to know whether the ultrasound examination of the Achilles tendon in spondyloarthritis differs from that in other rheumatic diseases. We studied 97 patients divided into five groups: rheumatoid arthritis, spondyloarthritis, gout, chondrocalcinosis and osteoarthritis, exploring six elementary lesions in the 194 Achilles entheses examined. In our study, the total Achilles ultrasonographic index was significantly higher in spondyloarthritis. The elementary lesion that discriminated worst between spondyloarthritis and the other pathologies was calcification. This study aims to demonstrate the discriminant validity of Achilles enthesitis observed by ultrasound in spondyloarthritis compared with other rheumatic diseases that may also have ultrasound abnormalities at the enthesis level. Copyright © 2015 Elsevier España, S.L.U. and Sociedad Española de Reumatología y Colegio Mexicano de Reumatología. All rights reserved.

  11. Improved Diagnostic Validity of the ADOS Revised Algorithms: A Replication Study in an Independent Sample

    ERIC Educational Resources Information Center

    Oosterling, Iris; Roos, Sascha; de Bildt, Annelies; Rommelse, Nanda; de Jonge, Maretha; Visser, Janne; Lappenschaar, Martijn; Swinkels, Sophie; van der Gaag, Rutger Jan; Buitelaar, Jan

    2010-01-01

    Recently, Gotham et al. (2007) proposed revised algorithms for the Autism Diagnostic Observation Schedule (ADOS) with improved diagnostic validity. The aim of the current study was to replicate predictive validity, factor structure, and correlations with age and verbal and nonverbal IQ of the ADOS revised algorithms for Modules 1 and 2…

  12. "Lies, damned lies ..." and observational studies in comparative effectiveness research.

    PubMed

    Albert, Richard K

    2013-06-01

    A new federal initiative has allocated $1.1 billion to comparative effectiveness research, and many have emphasized the importance of including observational studies in this effort. The rationale for using observational studies to assess comparative effectiveness is based on concerns that randomized controlled trials (RCTs) are not "real world" because they enroll homogeneous patient populations, measure study outcomes that are not important to patients, use protocols that are overly complex, are conducted in specialized centers, and use study treatments that are not consistent with usual care, and that RCTs are not always feasible because of a lack of equipoise, the need to assess delayed endpoints, and concerns that they take years to complete and are expensive. This essay questions the validity of each of these proposed limitations, summarizes concerns raised about the accuracy of results generated by observational studies, provides some examples of discrepancies between results of observational studies and RCTs that pertain to pulmonary and critical care, and suggests that using observational studies for comparative effectiveness research may increase rather than decrease the cost of health care and may harm patients.

  13. A global validation of ERA-Interim integrated water vapor estimates using ground-based GNSS observations

    NASA Astrophysics Data System (ADS)

    Ahmed, F.; Dousa, J.; Hunegnaw, A.; Teferle, F. N.; Bingley, R.

    2017-12-01

    Integrated water vapor (IWV) derived from climate reanalysis models, such as the European Centre for Medium-range Weather Forecasts (ECMWF) ReAnalysis-Interim (ERA-Interim), is widely used in many atmospheric applications. Therefore, it is of interest to assess the quality of this reanalysis product using available observations. Observations from Global Navigation Satellite Systems (GNSS) are, as of now, available for a period of over 2 decades and their global availability makes it possible to validate the IWV obtained from climate reanalysis models in different geographical and climatic regions. In this study, primarily, three 5-year long homogeneously reprocessed GNSS-derived IWV datasets containing over 400 globally distributed ground-based GNSS stations have been used to validate the IWV estimates obtained from the ERA-Interim climate reanalysis model in 25 different climate zones. The IWV from ERA-Interim has been obtained by vertically integrating the specific humidity at all model levels above the locations of GNSS stations. It has been studied how the difference between the ERA-Interim IWV and the GNSS-derived IWV varies with respect to the different climate zones as well as with respect to the difference in the model orography and latitude. The results show a dependence of the ability of ERA-Interim to model the IWV on difference in climate types and latitude. This dependence, however, is dictated by the concentration of water vapor in different climate zones and at different latitudes. Furthermore, as a secondary focus of this study, the weighted mean atmospheric temperature (Tm) obtained from ERA-Interim has been compared to its equivalent obtained using two widely used approximations globally.
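
    The vertical integration mentioned in the abstract above can be sketched as follows. This is only an illustration under stated assumptions: it uses the standard hydrostatic form IWV = (1/g) ∫ q dp with a trapezoidal rule over pressure levels, and the function name, the constant-g simplification and the sample profile are hypothetical rather than taken from the study or from the ERA-Interim processing chain.

```python
G0 = 9.80665  # standard gravity in m/s^2, assumed constant with height


def integrated_water_vapor(pressure_pa, specific_humidity):
    """Approximate IWV (kg/m^2) as (1/g) * integral of q dp.

    pressure_pa: pressures at the model levels in Pa, monotonic in either direction.
    specific_humidity: specific humidity q (kg/kg) at the same levels.
    """
    if len(pressure_pa) != len(specific_humidity) or len(pressure_pa) < 2:
        raise ValueError("need at least two levels of matching length")
    total = 0.0
    for i in range(len(pressure_pa) - 1):
        # Thickness of the layer in pressure units, independent of level order.
        dp = abs(pressure_pa[i + 1] - pressure_pa[i])
        # Trapezoidal rule: mean specific humidity of the layer.
        mean_q = 0.5 * (specific_humidity[i] + specific_humidity[i + 1])
        total += mean_q * dp
    return total / G0


# Hypothetical profile above a station, from the surface upward.
levels_pa = [100000.0, 85000.0, 70000.0, 50000.0, 30000.0]
q_kg_per_kg = [0.012, 0.008, 0.004, 0.001, 0.0001]
print(f"IWV = {integrated_water_vapor(levels_pa, q_kg_per_kg):.1f} kg/m^2")
```

    Starting the integration at the model level nearest the station height, rather than at the model surface, is one way the difference between model orography and station altitude could enter such a comparison.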

  14. Research Vessel Meteorological and Oceanographic Systems Support Satellite and Model Validation Studies

    NASA Astrophysics Data System (ADS)

    Smith, S. R.; Lopez, N.; Bourassa, M. A.; Rolph, J.; Briggs, K.

    2012-12-01

    The research vessel data center at the Florida State University routinely acquires, quality controls, and distributes underway surface meteorological and oceanographic observations from vessels. The activities of the center are coordinated by the Shipboard Automated Meteorological and Oceanographic System (SAMOS) initiative in partnership with the Rolling Deck to Repository (R2R) project. The data center evaluates the quality of the observations, collects essential metadata, provides data quality feedback to vessel operators, and ensures the long-term data preservation at the National Oceanographic Data Center. A description of the SAMOS data stewardship protocols will be provided, including dynamic web tools that ensure users can select the highest quality observations from over 30 vessels presently recruited to the SAMOS initiative. Research vessels provide underway observations at high-temporal frequency (1 min. sampling interval) that include navigational (position, course, heading, and speed), meteorological (air temperature, humidity, wind, surface pressure, radiation, rainfall), and oceanographic (surface sea temperature and salinity) samples. Recruited vessels collect a high concentration of data within the U.S. continental shelf and also frequently operate well outside routine shipping lanes, capturing observations in extreme ocean environments (Southern Ocean, Arctic, South Atlantic and Pacific). The unique quality and sampling locations of research vessel observations and their independence from many models and products (RV data are rarely distributed via normal marine weather reports) make them ideal for validation studies. We will present comparisons between research vessel observations and model estimates of the sea surface temperature and salinity in the Gulf of Mexico. The analysis reveals an underestimation of the freshwater input to the Gulf from rivers, resulting in an overestimation of near coastal salinity in the model. Additional comparisons

  15. Active-comparator design and new-user design in observational studies

    PubMed Central

    Yoshida, Kazuki; Solomon, Daniel H.; Kim, Seoyoung C.

    2015-01-01

    SUMMARY Over the past decade, an increasing number of observational studies have examined the effectiveness or safety of rheumatoid arthritis treatments. However, unlike randomized controlled trials (RCTs), observational studies of drug effects face methodological challenges including confounding by indication. Two design principles, active comparator design and new user design, can help mitigate such challenges in observational studies. To improve validity of study findings, observational studies should be designed in such a way that makes them more closely approximate RCTs. The active comparator design compares the drug of interest to another commonly used agent for the same indication, rather than a ‘non-user’ group. This principle helps select treatment groups similar in treatment indications (both measured and unmeasured characteristics). The new user design includes a cohort of patients from the time of treatment initiation, so that it can assess patients’ pretreatment characteristics and capture all events occurring anytime during follow-up. PMID:25800216

  16. Improvement and validation of trace gas retrieval from ACAM aircraft observation

    NASA Astrophysics Data System (ADS)

    Liu, C.; Liu, X.; Kowalewski, M. G.; Janz, S. J.; Gonzalez Abad, G.; Pickering, K. E.; Chance, K.; Lamsal, L. N.

    2014-12-01

    The ACAM (Airborne Compact Atmospheric Mapper) instrument, flown on board the NASA UC-12 aircraft during the DISCOVER-AQ (Deriving Information on Surface Conditions from Column and Vertically Resolved Observations Relevant to Air Quality) campaigns, was designed to provide remote sensing observations of tropospheric and boundary layer pollutants and help understand some of the most important pollutants that directly affect the health of the population. In this study, slant column densities (SCDs) of trace gases (O3, NO2, HCHO) are retrieved from ACAM measurements during the Baltimore-Washington D.C. 2011 campaign by the Basic Optical Absorption Spectroscopy (BOAS) trace gas fitting algorithm using a nonlinear least-squares (NLLS) inversion technique, and are then converted to vertical column densities (VCDs) using the Air Mass Factors (AMFs) calculated with the VLIDORT (Vector Linearized Discrete Ordinate Radiative Transfer) model and CMAQ (Community Multi-scale Air Quality) model simulations of trace gas profiles. For surface treatment in the AMF, we use the high-resolution MODIS climatological BRDF (Bidirectional Reflectance Distribution Function) product at 470 nm for NO2, and high-resolution surface albedo derived by combining MODIS and OMI albedo databases for HCHO and O3. We validate ACAM results with coincident ground-based PANDORA, aircraft (P3B) spiral and satellite (OMI) measurements and find generally good agreement, especially for NO2 and O3
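
    The slant-to-vertical conversion mentioned in this abstract is commonly written as VCD = SCD / AMF. The snippet below is a minimal sketch of that relation only; the function name and the numeric values are hypothetical illustrations and are not taken from the ACAM retrieval.

```python
def slant_to_vertical(scd, amf):
    """Convert a slant column density to a vertical column density.

    Applies the common relation VCD = SCD / AMF. Both inputs are
    assumed to be scalars; the AMF is dimensionless.
    """
    if amf <= 0:
        raise ValueError("air mass factor must be positive")
    return scd / amf


# Hypothetical values for illustration (not from the study)
scd_no2 = 1.2e16  # molecules / cm^2
amf_no2 = 2.0     # dimensionless
vcd_no2 = slant_to_vertical(scd_no2, amf_no2)
print(f"{vcd_no2:.2e} molecules/cm^2")
```

    In a real retrieval the AMF varies with viewing geometry, surface reflectance and the assumed trace gas profile, which is why the abstract describes computing it with a radiative transfer model.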

  17. Validation of ionospheric electron density profiles inferred from GPS occultation observations of the GPS/MET experiment

    NASA Astrophysics Data System (ADS)

    Kawakami, Todd Mori

    this study is the validation of the electron density profiles inferred from GPS occultation observations using the Abel transform.

  18. Observational studies: a valuable source for data on the true value of RA therapies.

    PubMed

    van Vollenhoven, Ronald F; Severens, Johan L

    2011-03-01

    The validity of observational studies is sometimes questioned because of the limitations of non-randomly assigned controls, various biases such as channeling bias, confounding by indication, and other pitfalls. Yet, (post-marketing) observational data can provide important information regarding not only drug safety but also the effectiveness and appropriate use of agents in the real world, outside of clinical trials. Observational studies also provide data regarding the wider value of these agents in terms of, for example, reducing the need for surgical procedures, reducing absenteeism and increasing productivity. Importantly, data from some observational registry studies have subsequently been confirmed by clinical trials, supporting the overall validity of the registry-based approach. Observational studies also allow measures such as health assessment questionnaire scores, disease activity scores, and glucocorticoid use over time to be monitored for longer periods. Furthermore, observational data in real, less strictly selected patients without the constraints of formal study populations may produce findings not observed in clinical trials but that warrant further investigation in a controlled trial environment. For example, recent data from the Stockholm tumor necrosis factor follow-up registry in Sweden showed increases in the time people worked after initiation of biologics that, surprisingly, continued into the fourth and fifth years of treatment--a finding not observed with standardized outcomes. Observational studies are truly an underappreciated and valuable source of data on the real value of anti-rheumatic therapies, and these data are essential for making sound decisions regarding coverage and reimbursement.

  19. Validation sampling can reduce bias in healthcare database studies: an illustration using influenza vaccination effectiveness

    PubMed Central

    Nelson, Jennifer C.; Marsh, Tracey; Lumley, Thomas; Larson, Eric B.; Jackson, Lisa A.; Jackson, Michael

    2014-01-01

    Objective Estimates of treatment effectiveness in epidemiologic studies using large observational health care databases may be biased due to inaccurate or incomplete information on important confounders. Study methods that collect and incorporate more comprehensive confounder data on a validation cohort may reduce confounding bias. Study Design and Setting We applied two such methods, imputation and reweighting, to Group Health administrative data (full sample) supplemented by more detailed confounder data from the Adult Changes in Thought study (validation sample). We used influenza vaccination effectiveness (with an unexposed comparator group) as an example and evaluated each method’s ability to reduce bias using the control time period prior to influenza circulation. Results Both methods reduced, but did not completely eliminate, the bias compared with traditional effectiveness estimates that do not utilize the validation sample confounders. Conclusion Although these results support the use of validation sampling methods to improve the accuracy of comparative effectiveness findings from healthcare database studies, they also illustrate that the success of such methods depends on many factors, including the ability to measure important confounders in a representative and large enough validation sample, the comparability of the full sample and validation sample, and the accuracy with which data can be imputed or reweighted using the additional validation sample information. PMID:23849144

  20. Practical Aspects of Designing and Conducting Validation Studies Involving Multi-study Trials.

    PubMed

    Coecke, Sandra; Bernasconi, Camilla; Bowe, Gerard; Bostroem, Ann-Charlotte; Burton, Julien; Cole, Thomas; Fortaner, Salvador; Gouliarmou, Varvara; Gray, Andrew; Griesinger, Claudius; Louhimies, Susanna; Gyves, Emilio Mendoza-de; Joossens, Elisabeth; Prinz, Maurits-Jan; Milcamps, Anne; Parissis, Nicholaos; Wilk-Zasadna, Iwona; Barroso, João; Desprez, Bertrand; Langezaal, Ingrid; Liska, Roman; Morath, Siegfried; Reina, Vittorio; Zorzoli, Chiara; Zuang, Valérie

    This chapter focuses on practical aspects of conducting prospective in vitro validation studies, and in particular, by laboratories that are members of the European Union Network of Laboratories for the Validation of Alternative Methods (EU-NETVAL) that is coordinated by the EU Reference Laboratory for Alternatives to Animal Testing (EURL ECVAM). Prospective validation studies involving EU-NETVAL, comprising a multi-study trial involving several laboratories or "test facilities", typically consist of two main steps: (1) the design of the validation study by EURL ECVAM and (2) the execution of the multi-study trial by a number of qualified laboratories within EU-NETVAL, coordinated and supported by EURL ECVAM. The approach adopted in the conduct of these validation studies adheres to the principles described in the OECD Guidance Document on the Validation and International Acceptance of new or updated test methods for Hazard Assessment No. 34 (OECD 2005). The context and scope of conducting prospective in vitro validation studies is dealt with in Chap. 4. Here we focus mainly on the processes followed to carry out a prospective validation of in vitro methods involving different laboratories with the ultimate aim of generating a dataset that can support a decision in relation to the possible development of an international test guideline (e.g. by the OECD) or the establishment of performance standards.

  1. Validity Studies of the Filial Anxiety Scale.

    ERIC Educational Resources Information Center

    Murray, Paul D.; And Others

    1996-01-01

    Factor analytic and construct validity studies were conducted to explore the validity of Cicirelli's 13-item Filial Anxiety Scale (FAS). The State-Trait Anxiety Inventory and the Marlowe-Crowne Social Desirability Scale were a part of the investigation. Results offer support for the validity of the FAS subscales and the FAS' usefulness as an…

  2. External Validation of Risk Prediction Scores for Invasive Candidiasis in a Medical/Surgical Intensive Care Unit: An Observational Study

    PubMed Central

    Ahmed, Armin; Baronia, Arvind Kumar; Azim, Afzal; Marak, Rungmei S. K.; Yadav, Reema; Sharma, Preeti; Gurjar, Mohan; Poddar, Banani; Singh, Ratender Kumar

    2017-01-01

    Background: The aim of this study was to conduct external validation of risk prediction scores for invasive candidiasis. Methods: We conducted a prospective observational study in a 12-bedded adult medical/surgical Intensive Care Unit (ICU) to evaluate Candida score >3, colonization index (CI) >0.5, corrected CI >0.4 (CCI), and Ostrosky's clinical prediction rule (CPR). Patients' characteristics and risk factors for invasive candidiasis were noted. Patients were divided into two groups; invasive candidiasis and no-invasive candidiasis. Results: Of 198 patients, 17 developed invasive candidiasis. Discriminatory power (area under receiver operator curve [AUROC]) for Candida score, CI, CCI, and CPR were 0.66, 0.67, 0.63, and 0.62, respectively. A large number of patients in the no-invasive candidiasis group (114 out of 181) were exposed to antifungal agents during their stay in ICU. Subgroup analysis was carried out after excluding such patients from no-invasive candidiasis group. AUROC of Candida score, CI, CCI, and CPR were 0.7, 0.7, 0.65, and 0.72, respectively, and positive predictive values (PPVs) were in the range of 25%–47%, along with negative predictive values (NPVs) in the range of 84%–96% in the subgroup analysis. Conclusion: Currently available risk prediction scores have good NPV but poor PPV. They are useful for selecting patients who are not likely to benefit from antifungal therapy. PMID:28904481
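
    The pattern reported here (good NPV, poor PPV) is typical when the outcome is uncommon. The sketch below computes the predictive values from a 2x2 table; the function name and the cell counts are hypothetical and were chosen only to match the group sizes in the abstract (17 cases, 181 non-cases), not the study's actual classification results.

```python
def predictive_values(tp, fp, fn, tn):
    """Return PPV, NPV, sensitivity and specificity from 2x2 counts."""
    return {
        "ppv": tp / (tp + fp),           # P(disease | test positive)
        "npv": tn / (tn + fn),           # P(no disease | test negative)
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
    }


# Hypothetical counts: 17 cases and 181 non-cases in total
result = predictive_values(tp=12, fp=60, fn=5, tn=121)
for name, value in result.items():
    print(f"{name}: {value:.2f}")
```

    With these illustrative counts the NPV is high while the PPV stays low, because false positives among the many non-cases outnumber the true positives.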

  3. Simulators' validation study: Problem solution logic

    NASA Technical Reports Server (NTRS)

    Schoultz, M. B.

    1974-01-01

    A study was conducted to validate the ground based simulators used for aircraft environment in ride-quality research. The logic to the approach for solving this problem is developed. The overall problem solution flow chart is presented. The factors which could influence the human response to the environment on board the aircraft are analyzed. The mathematical models used in the study are explained. The steps which were followed in conducting the validation tests are outlined.

  4. Value-Added and Observational Measures Used in the Teacher Evaluation Process: A Validation Study

    ERIC Educational Resources Information Center

    Guerere, Claudia

    2013-01-01

    Scores from value-added models (VAMs), as used for educational accountability, represent the educational effect teachers have on their students. The use of these scores in teacher evaluations for high-stakes decision making is new for the State of Florida. Validity evidence that supports or questions the use of these scores is critically needed.…

  5. Sensitivity of regression calibration to non-perfect validation data with application to the Norwegian Women and Cancer Study.

    PubMed

    Buonaccorsi, John P; Dalen, Ingvild; Laake, Petter; Hjartåker, Anette; Engeset, Dagrun; Thoresen, Magne

    2015-04-15

    Measurement error occurs when we observe error-prone surrogates, rather than true values. It is common in observational studies and especially so in epidemiology, in nutritional epidemiology in particular. Correcting for measurement error has become common, and regression calibration is the most popular way to account for measurement error in continuous covariates. We consider its use in the context where there are validation data, which are used to calibrate the true values given the observed covariates. We allow for the case that the true value itself may not be observed in the validation data, but instead, a so-called reference measure is observed. The regression calibration method relies on certain assumptions. This paper examines possible biases in regression calibration estimators when some of these assumptions are violated. More specifically, we allow for the fact that (i) the reference measure may not necessarily be an 'alloyed gold standard' (i.e., unbiased) for the true value; (ii) there may be correlated random subject effects contributing to the surrogate and reference measures in the validation data; and (iii) the calibration model itself may not be the same in the validation study as in the main study; that is, it is not transportable. We expand on previous work to provide a general result, which characterizes potential bias in the regression calibration estimators as a result of any combination of the violations aforementioned. We then illustrate some of the general results with data from the Norwegian Women and Cancer Study. Copyright © 2015 John Wiley & Sons, Ltd.
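
    For readers unfamiliar with regression calibration, the following is a minimal simulated sketch of the basic method in its idealized form. It assumes the true covariate is observed without error in the validation subset and that the calibration model is transportable, which are exactly the assumptions the paper relaxes. All variable names, sample sizes and parameter values are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)

n_main, n_val = 5000, 500   # hypothetical sample sizes
beta_true = 2.0             # hypothetical true slope of Y on X

x = rng.normal(0.0, 1.0, n_main)                  # true covariate
w = x + rng.normal(0.0, 1.0, n_main)              # error-prone surrogate
y = beta_true * x + rng.normal(0.0, 1.0, n_main)  # outcome

# Validation subset: the true covariate is assumed observed here.
val = slice(0, n_val)

# Step 1: calibration model E[X | W], fitted in the validation data.
# np.polyfit returns coefficients from highest degree to lowest.
gamma1, gamma0 = np.polyfit(w[val], x[val], 1)

# Step 2: replace the surrogate by its calibrated prediction.
x_hat = gamma0 + gamma1 * w

# Step 3: fit the outcome model on the calibrated covariate.
beta_naive = np.polyfit(w, y, 1)[0]    # attenuated by measurement error
beta_rc = np.polyfit(x_hat, y, 1)[0]   # regression calibration estimate

print(f"naive slope:      {beta_naive:.2f}")
print(f"calibrated slope: {beta_rc:.2f}")
```

    In this idealized setup the naive slope is attenuated toward zero and the calibrated slope is closer to the true value. The simulation does not reproduce the biases analysed in the paper, which arise when the reference measure is itself biased or the calibration model differs between studies.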

  6. Community Participation in the Development and Validation of a School Violence Observation Instrument.

    PubMed

    Medina, Nilda; Fernández, Gisely; Cruz, Tania; Jordán, Natalia; Trenche, Maryanes

    2016-01-01

    School violence is a worldwide public health issue with negative effects on education. Official statistics and reports do not include daily occurrences of violent behavior that may precede severe incidents. This project aimed to engage school community members in the development, validation, and implementation of an observation instrument to identify characteristics of school violence in two Puerto Rican schools. The role of school community members in all phases of the research is described. The input of community partners contributed to enrich the process by providing insight into the problem studied and a more informed framework for interpreting results. Taking into account distinctive features of each particular school made results meaningful to the school community and fostered a sense of empowerment of community members as they recognized their knowledge is essential to the solution of their problems.

  7. The Effect of Different Cultural Lenses on Reliability and Validity in Observational Data: The Example of Chinese Immigrant Parent-Toddler Dinner Interactions

    ERIC Educational Resources Information Center

    Wang, Yan Z.; Wiley, Angela R.; Zhou, Xiaobin

    2007-01-01

    This study used a mixed methodology to investigate reliability, validity, and analysis level with Chinese immigrant observational data. European-American and Chinese coders quantitatively rated 755 minutes of Chinese immigrant parent-toddler dinner interactions on parental sensitivity, intrusiveness, detachment, negative affect, positive affect,…

  8. NOAA activities in support of in situ validation observations for satellite ocean color products and related ocean science research

    NASA Astrophysics Data System (ADS)

    Lance, V. P.; DiGiacomo, P. M.; Ondrusek, M.; Stengel, E.; Soracco, M.; Wang, M.

    2016-02-01

    The NOAA/STAR ocean color program is focused on "end-to-end" production of high quality satellite ocean color products. In situ validation of satellite data is essential to produce the high quality, "fit for purpose" ocean color products that support users and applications in all NOAA line offices, as well as external (both applied and research) users. The first NOAA/OMAO (Office of Marine and Aviation Operations) sponsored research cruise dedicated to VIIRS SNPP validation was completed aboard the NOAA Ship Nancy Foster in November 2014. The goals and objectives of the 2014 cruise are highlighted in the recently published NOAA/NESDIS Technical Report. A second dedicated validation cruise is planned for December 2015 and will have been completed by the time of this meeting. The goals and objectives of the 2015 cruise will be discussed in the presentation. Participants and observations made will be reported. The NOAA Ocean Color Calibration/Validation (Cal/Val) team also works collaboratively with other programs. A recent collaboration with the NOAA Ocean Acidification program on the East Coast Ocean Acidification (ECOA) cruise during June-July 2015, where biogeochemical and optical measurements were made together, allows for the leveraging of in situ observations for satellite validation and for their use in the development of future ocean acidification satellite products. Datasets from these cruises will be formally archived at NOAA and Digital Object Identifier (DOI) numbers will be assigned. In addition, the NOAA Coast/OceanWatch Program is working to establish a searchable database. The beta version will begin with cruise data and additional in situ calibration/validation related data collected by the NOAA Ocean Color Cal/Val team members. A more comprehensive searchable NOAA database, with contributions from other NOAA ocean observation platforms and cruise collaborations, is envisioned. Progress on these activities will be reported.

  9. Toddler physical activity study: laboratory and community studies to evaluate accelerometer validity and correlates.

    PubMed

    Hager, Erin R; Gormley, Candice E; Latta, Laura W; Treuth, Margarita S; Caulfield, Laura E; Black, Maureen M

    2016-09-06

    Toddlerhood is an important age for physical activity (PA) promotion to prevent obesity and support a physically active lifestyle throughout childhood. Accurate assessment of PA is needed to determine trends/correlates of PA, time spent in sedentary, light, or moderate-vigorous PA (MVPA), and the effectiveness of PA promotion programs. Due to the limited availability of objective measures that have been validated and evaluated for feasibility in community studies, it is unclear which subgroups of toddlers are at the highest risk for inactivity. Using Actical ankle accelerometry, the objectives of this study are to develop valid thresholds, examine feasibility, and examine demographic/anthropometric PA correlates of MVPA among toddlers from low-income families. Two studies were conducted with toddlers (12-36 months). Laboratory Study (n = 24)- Two Actical accelerometers were placed on the ankle. PA was observed using the Child Activity Rating Scale (CARS, prescribed activities). Analyses included device equivalence reliability (correlation: activity counts of two Acticals), criterion-related validity (correlation: activity counts and CARS ratings), and sensitivity/specificity for thresholds. Community Study (n = 277, low-income mother-toddler dyads recruited)- An Actical was worn on the ankle for > 7 days (goal >5, 24-h days). Height/weight was measured. Mothers reported demographics. Analyses included frequencies (feasibility) and stepwise multiple linear regression (sMLR). Laboratory Study- Acticals demonstrated reliability (r = 0.980) and validity (r = 0.75). Thresholds demonstrated sensitivity (86 %) and specificity (88 %). Community Study- 86 % wore accelerometer, 69 % had valid data (mean = 5.2 days). Primary reasons for missing/invalid data: refusal (14 %) and wear-time ≤2 days (11 %). The MVPA threshold (>2200 cpm) yielded 54 min/day. In sMLR, MVPA was associated with age (older > younger, β = 32.8, p < 0
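
    Applying a count threshold such as the >2200 cpm MVPA cut-point reported above amounts to counting the minutes whose activity count exceeds it. The sketch below illustrates that step only; the function name and the per-minute counts are hypothetical, and the study's actual data processing may differ.

```python
def mvpa_minutes(counts_per_minute, threshold=2200):
    """Count minutes whose activity count strictly exceeds the threshold."""
    return sum(1 for c in counts_per_minute if c > threshold)


# Hypothetical per-minute activity counts for a short wear period
sample = [150, 2300, 2200, 4100, 900, 2201]
print(mvpa_minutes(sample))  # a count of exactly 2200 is not MVPA
```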

  10. 29 CFR 1607.7 - Use of other validity studies.

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    ... 29 Labor 4 2011-07-01 2011-07-01 false Use of other validity studies. 1607.7 Section 1607.7 Labor... EMPLOYEE SELECTION PROCEDURES (1978) General Principles § 1607.7 Use of other validity studies. A. Validity studies not conducted by the user. Users may, under certain circumstances, support the use of selection...

  11. Is Structured Observation a Valid Technique to Measure Handwashing Behavior? Use of Acceleration Sensors Embedded in Soap to Assess Reactivity to Structured Observation

    PubMed Central

    Ram, Pavani K.; Halder, Amal K.; Granger, Stewart P.; Jones, Therese; Hall, Peter; Hitchcock, David; Wright, Richard; Nygren, Benjamin; Islam, M. Sirajul; Molyneaux, John W.; Luby, Stephen P.

    2010-01-01

    Structured observation is often used to evaluate handwashing behavior. We assessed reactivity to structured observation in rural Bangladesh by distributing soap containing acceleration sensors and performing structured observation 4 days later. Sensors recorded the number of times soap was moved. In 45 participating households, the median number of sensor soap movements during the 5-hour time block on pre-observation days was 3.7 (range 0.3–10.6). During the structured observation, the median number of sensor soap movements was 5.0 (range 0–18.0), a 35% increase, P = 0.0004. Compared with the same 5-hour time block on pre-observation days, the number of sensor soap movements increased during structured observation by ≥ 20% in 62% of households, and by ≥ 100% in 22% of households. The increase in sensor soap movements during structured observation, compared with pre-observation days, indicates substantial reactivity to the presence of the observer. These findings call into question the validity of structured observation for measurement of handwashing behavior. PMID:21036840
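
    The 35% increase quoted in this abstract follows directly from the two reported medians. A short check, using only figures stated in the abstract:

```python
pre_observation_median = 3.7  # soap movements, pre-observation days
observation_median = 5.0      # soap movements, during structured observation

# Relative increase of the observation-day median over the baseline median
percent_increase = (observation_median - pre_observation_median) / pre_observation_median * 100
print(round(percent_increase))  # 35
```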

  12. Is structured observation a valid technique to measure handwashing behavior? Use of acceleration sensors embedded in soap to assess reactivity to structured observation.

    PubMed

    Ram, Pavani K; Halder, Amal K; Granger, Stewart P; Jones, Therese; Hall, Peter; Hitchcock, David; Wright, Richard; Nygren, Benjamin; Islam, M Sirajul; Molyneaux, John W; Luby, Stephen P

    2010-11-01

    Structured observation is often used to evaluate handwashing behavior. We assessed reactivity to structured observation in rural Bangladesh by distributing soap containing acceleration sensors and performing structured observation 4 days later. Sensors recorded the number of times soap was moved. In 45 participating households, the median number of sensor soap movements during the 5-hour time block on pre-observation days was 3.7 (range 0.3-10.6). During the structured observation, the median number of sensor soap movements was 5.0 (range 0-18.0), a 35% increase, P = 0.0004. Compared with the same 5-hour time block on pre-observation days, the number of sensor soap movements increased during structured observation by ≥ 20% in 62% of households, and by ≥ 100% in 22% of households. The increase in sensor soap movements during structured observation, compared with pre-observation days, indicates substantial reactivity to the presence of the observer. These findings call into question the validity of structured observation for measurement of handwashing behavior.

  13. Validation sampling can reduce bias in health care database studies: an illustration using influenza vaccination effectiveness.

    PubMed

    Nelson, Jennifer Clark; Marsh, Tracey; Lumley, Thomas; Larson, Eric B; Jackson, Lisa A; Jackson, Michael L

    2013-08-01

    Estimates of treatment effectiveness in epidemiologic studies using large observational health care databases may be biased owing to inaccurate or incomplete information on important confounders. Study methods that collect and incorporate more comprehensive confounder data on a validation cohort may reduce confounding bias. We applied two such methods, namely imputation and reweighting, to Group Health administrative data (full sample) supplemented by more detailed confounder data from the Adult Changes in Thought study (validation sample). We used influenza vaccination effectiveness (with an unexposed comparator group) as an example and evaluated each method's ability to reduce bias using the control time period before influenza circulation. Both methods reduced, but did not completely eliminate, the bias compared with traditional effectiveness estimates that do not use the validation sample confounders. Although these results support the use of validation sampling methods to improve the accuracy of comparative effectiveness findings from health care database studies, they also illustrate that the success of such methods depends on many factors, including the ability to measure important confounders in a representative and large enough validation sample, the comparability of the full sample and validation sample, and the accuracy with which the data can be imputed or reweighted using the additional validation sample information. Copyright © 2013 Elsevier Inc. All rights reserved.

  14. Quality standards for real-world research. Focus on observational database studies of comparative effectiveness.

    PubMed

    Roche, Nicolas; Reddel, Helen; Martin, Richard; Brusselle, Guy; Papi, Alberto; Thomas, Mike; Postma, Dirjke; Thomas, Vicky; Rand, Cynthia; Chisholm, Alison; Price, David

    2014-02-01

    Real-world research can use observational or clinical trial designs, in both cases putting emphasis on high external validity, to complement the classical efficacy randomized controlled trials (RCTs) with high internal validity. Real-world research is made necessary by the variety of factors that can play an important role in modulating effectiveness in real life but are often tightly controlled in RCTs, such as comorbidities and concomitant treatments, adherence, inhalation technique, access to care, strength of doctor-caregiver communication, and socio-economic and other organizational factors. Real-world studies belong to two main categories: pragmatic trials and observational studies, which can be prospective or retrospective. Focusing on comparative database observational studies, the process aimed at ensuring high-quality research can be divided into three parts: preparation of research, analyses and reporting, and discussion of results. Key points include a priori planning of data collection and analyses, identification of appropriate database(s), proper outcomes definition, study registration with commitment to publish, bias minimization through matching and adjustment processes accounting for potential confounders, and sensitivity analyses testing the robustness of results. When these conditions are met, observational database studies can reach a sufficient level of evidence to help create guidelines (i.e., clinical and regulatory decision-making).

  15. Issues of reporting in observational studies in veterinary medicine.

    PubMed

    Sargeant, Jan M; O'Connor, Annette M

    2014-02-15

    Observational studies are common in veterinary medicine; the results may be used to inform decision-making, future research, or as inputs to systematic reviews or risk assessment. To be of use, the results must be published, all of the outcomes that were assessed must be included in the publication, and the research (methods and results) must be reported in sufficient detail that the reader can evaluate the internal and external validity. In human healthcare, concerns about the completeness of reporting - and evidence that poor reporting is associated with study results - have led to the creation of reporting guidelines; these include the STROBE statement for observational studies. There is evidence from a limited body of research that there also are reporting inadequacies in veterinary observational studies. There are differences between human and veterinary observational studies that might be relevant to recommendations for reporting. Such differences include: the use of observational studies in animal populations for simultaneously estimating disease frequency and risk-factor identification; the distinction between the animal owners who consent to participate and the animals that are the study subjects; and the complexity of organizational levels inherent in animal research (in particular, for studies in livestock species). In veterinary medicine, it is common to have clustering within outcomes (due to animal grouping) and clustering of predictor variables. We argue that there is a compelling need for the scientific community involved in veterinary observational studies to use the STROBE statement, use an amended version of STROBE, or to develop and use reporting guidelines that are specific to veterinary medicine to improve reporting of these studies. Copyright © 2013 Elsevier B.V. All rights reserved.

  16. Documentation and Validation of the Goddard Earth Observing System (GEOS) Data Assimilation System, Version 4

    NASA Technical Reports Server (NTRS)

    Suarez, Max J. (Editor); daSilva, Arlindo; Dee, Dick; Bloom, Stephen; Bosilovich, Michael; Pawson, Steven; Schubert, Siegfried; Wu, Man-Li; Sienkiewicz, Meta; Stajner, Ivanka

    2005-01-01

    This document describes the structure and validation of a frozen version of the Goddard Earth Observing System Data Assimilation System (GEOS DAS): GEOS-4.0.3. Significant features of GEOS-4 include: version 3 of the Community Climate Model (CCM3) with the addition of a finite volume dynamical core; version two of the Community Land Model (CLM2); the Physical-space Statistical Analysis System (PSAS); and an interactive retrieval system (iRET) for assimilating TOVS radiance data. Upon completion of the GEOS-4 validation in December 2003, GEOS-4 became operational on 15 January 2004. Products from GEOS-4 have been used in supporting field campaigns and for reprocessing several years of data for CERES.

  17. Community participation in the development and validation of a school violence observation instrument

    PubMed Central

    Medina, N.; Fernández, G.; Cruz, T.; Jordán, N.; Trenche, M.

    2015-01-01

    Background School violence is a worldwide public health issue with negative effects on education. Official statistics and reports do not include daily occurrences of violent behavior that may precede severe incidents. Objectives This project aimed to engage school community members in the development, validation and implementation of an observation instrument to identify characteristics of school violence. Methods The role of members of each participating school community in all phases of the research is described. Results (or Lessons Learned) The input of community members contributed to enrich the process by providing insight into the problem studied and a more informed framework for interpreting results. Conclusions Taking into account distinctive features of each particular school made results meaningful to the school community and fostered a sense of empowerment of community members as they recognized their knowledge is essential to the solution of their problems. PMID:27346771

  18. Unresolved versus resolved: testing the validity of young simple stellar population models with VLT/MUSE observations of NGC 3603

    NASA Astrophysics Data System (ADS)

    Kuncarayakti, H.; Galbany, L.; Anderson, J. P.; Krühler, T.; Hamuy, M.

    2016-09-01

    Context. Stellar populations are the building blocks of galaxies, including the Milky Way. Most, if not all, extragalactic studies are entangled with the use of stellar population models given the unresolved nature of their observation. Extragalactic systems contain multiple stellar populations with complex star formation histories. However, studies of these systems are mainly based upon the principles of simple stellar populations (SSP). Hence, it is critical to examine the validity of SSP models. Aims: This work aims to empirically test the validity of SSP models. This is done by comparing SSP models against observations of spatially resolved young stellar population in the determination of its physical properties, that is, age and metallicity. Methods: Integral field spectroscopy of a young stellar cluster in the Milky Way, NGC 3603, was used to study the properties of the cluster as both a resolved and unresolved stellar population. The unresolved stellar population was analysed using the Hα equivalent width as an age indicator and the ratio of strong emission lines to infer metallicity. In addition, spectral energy distribution (SED) fitting using STARLIGHT was used to infer these properties from the integrated spectrum. Independently, the resolved stellar population was analysed using the colour-magnitude diagram (CMD) to determine age and metallicity. As the SSP model represents the unresolved stellar population, the derived age and metallicity were tested to determine whether they agree with those derived from resolved stars. Results: The age and metallicity estimate of NGC 3603 derived from integrated spectroscopy are confirmed to be within the range of those derived from the CMD of the resolved stellar population, including other estimates found in the literature. The result from this pilot study supports the reliability of SSP models for studying unresolved young stellar populations. Based on observations collected at the European Organisation

  19. Study to validate the Non-Interference Performance Assessment (NIPA) technique

    NASA Technical Reports Server (NTRS)

    Seeman, J. S.; Murphy, G. L.

    1973-01-01

    The NIPA (Non-Interference Performance Assessment) technique involves direct observation of group verbal activities by trained observers who rate the emotional content (affect) of each verbal interaction as either positive, negative, or neutral. During the test, in which four men were confined for 90 consecutive days, feasibility of the NIPA technique was demonstrated and observer reliability was verified. However, the validity of the test was not proved because an independent criterion measure of morale for the confined crew was lacking. There were indications, however, that NIPA measures were tracking changes in crew morale. At approximately the two-thirds point (Days 60 to 70), morale apparently fell dramatically for a period of about ten days, and simultaneously NIPA measure of positive verbalization decreased in number. A need was indicated for a separate study to apply the NIPA technique under experimental conditions and using a clearly defined criterion measure against which the ability of NIPA observations to truly measure morale changes could be determined.

  20. The Modified-Classroom Observation Schedule to Measure Intentional Communication (M-COSMIC): Evaluation of Reliability and Validity

    ERIC Educational Resources Information Center

    Clifford, Sally; Hudry, Kristelle; Brown, Laura; Pasco, Greg; Charman, Tony

    2010-01-01

    The Modified-Classroom Observation Schedule to Measure Intentional Communication (M-COSMIC) was developed as an ecologically valid measure of social-communication behaviour, delineating forms, functions, and intended partners of children's spontaneous communication acts. Forty-one children with autism spectrum disorder (ASD) aged 48-73 months were…

  1. Modeling and validation of photometric characteristics of space targets oriented to space-based observation.

    PubMed

    Wang, Hongyuan; Zhang, Wei; Dong, Aotuo

    2012-11-10

    A modeling and validation method of photometric characteristics of the space target was presented in order to track and identify different satellites effectively. The background radiation characteristics models of the target were built based on blackbody radiation theory. The geometry characteristics of the target were illustrated by the surface equations based on its body coordinate system. The material characteristics of the target surface were described by a bidirectional reflectance distribution function model, which considers the character of surface Gauss statistics and microscale self-shadow and is obtained by measurement and modeling in advance. The contributing surfaces of the target to observation system were determined by coordinate transformation according to the relative position of the space-based target, the background radiation sources, and the observation platform. Then a mathematical model on photometric characteristics of the space target was built by summing reflection components of all the surfaces. Photometric characteristics simulation of the space-based target was achieved according to its given geometrical dimensions, physical parameters, and orbital parameters. Experimental validation was made based on the scale model of the satellite. The calculated results fit well with the measured results, which indicates the modeling method of photometric characteristics of the space target is correct.

  2. STRengthening Analytical Thinking for Observational Studies: the STRATOS initiative

    PubMed Central

    Sauerbrei, Willi; Abrahamowicz, Michal; Altman, Douglas G; le Cessie, Saskia; Carpenter, James

    2014-01-01

    The validity and practical utility of observational medical research depends critically on good study design, excellent data quality, appropriate statistical methods and accurate interpretation of results. Statistical methodology has seen substantial development in recent times. Unfortunately, many of these methodological developments are ignored in practice. Consequently, design and analysis of observational studies often exhibit serious weaknesses. The lack of guidance on vital practical issues discourages many applied researchers from using more sophisticated and possibly more appropriate methods when analyzing observational studies. Furthermore, many analyses are conducted by researchers with a relatively weak statistical background and limited experience in using statistical methodology and software. Consequently, even ‘standard’ analyses reported in the medical literature are often flawed, casting doubt on their results and conclusions. An efficient way to help researchers to keep up with recent methodological developments is to develop guidance documents that are spread to the research community at large. These observations led to the initiation of the strengthening analytical thinking for observational studies (STRATOS) initiative, a large collaboration of experts in many different areas of biostatistical research. The objective of STRATOS is to provide accessible and accurate guidance in the design and analysis of observational studies. The guidance is intended for applied statisticians and other data analysts with varying levels of statistical education, experience and interests. In this article, we introduce the STRATOS initiative and its main aims, present the need for guidance documents and outline the planned approach and progress so far. We encourage other biostatisticians to become involved. PMID:25074480

  3. STRengthening analytical thinking for observational studies: the STRATOS initiative.

    PubMed

    Sauerbrei, Willi; Abrahamowicz, Michal; Altman, Douglas G; le Cessie, Saskia; Carpenter, James

    2014-12-30

    The validity and practical utility of observational medical research depends critically on good study design, excellent data quality, appropriate statistical methods and accurate interpretation of results. Statistical methodology has seen substantial development in recent times. Unfortunately, many of these methodological developments are ignored in practice. Consequently, design and analysis of observational studies often exhibit serious weaknesses. The lack of guidance on vital practical issues discourages many applied researchers from using more sophisticated and possibly more appropriate methods when analyzing observational studies. Furthermore, many analyses are conducted by researchers with a relatively weak statistical background and limited experience in using statistical methodology and software. Consequently, even 'standard' analyses reported in the medical literature are often flawed, casting doubt on their results and conclusions. An efficient way to help researchers to keep up with recent methodological developments is to develop guidance documents that are spread to the research community at large. These observations led to the initiation of the strengthening analytical thinking for observational studies (STRATOS) initiative, a large collaboration of experts in many different areas of biostatistical research. The objective of STRATOS is to provide accessible and accurate guidance in the design and analysis of observational studies. The guidance is intended for applied statisticians and other data analysts with varying levels of statistical education, experience and interests. In this article, we introduce the STRATOS initiative and its main aims, present the need for guidance documents and outline the planned approach and progress so far. We encourage other biostatisticians to become involved. © 2014 The Authors. Statistics in Medicine published by John Wiley & Sons, Ltd.

  4. CANFOR Portuguese version: validation study.

    PubMed

    Talina, Miguel; Thomas, Stuart; Cardoso, Ana; Aguiar, Pedro; Caldas de Almeida, Jose M; Xavier, Miguel

    2013-05-30

    The increase in prisoner population is a troublesome reality in several regions of the world. Along with this growth there is increasing evidence that prisoners have a higher proportion of mental illnesses and suicide than the general population. In order to implement strategies that address criminal recidivism and the health and social status of prisoners, particularly mentally disordered offenders, it is necessary to assess their care needs from a comprehensive but individual perspective. This assessment must include potentially harmful areas like comorbid personality disorder, substance misuse and offending behaviours. The Camberwell Assessment of Need - Forensic Version (CANFOR) has proved to be a reliable tool designed to accomplish such aims. The present study aimed to validate the CANFOR Portuguese version. The translation, adaptation to the Portuguese context, back-translation and revision followed the usual procedures. The sample comprised all detainees receiving psychiatric care in four forensic facilities over a one-year period. A total of 143 subjects, and their respective case managers, were selected. The forensic facilities were chosen by convenience: one prison hospital psychiatric ward (n=68; 47.6%), one male (n=24; 16.8%) and one female (n=22; 15.4%) psychiatric clinic and one civil security ward (n=29; 20.3%), all located near Lisbon. Basic descriptive statistics and weighted kappa coefficients were calculated for the inter-rater and the test-retest reliability studies. The convergent validity was evaluated using the Global Assessment of Functioning and the Brief Psychiatric Rating Scale scores. The majority of the participants were male and single, with short school attendance, and accused of a crime involving violence against persons. The most frequent diagnosis was major depression (56.1%) and almost half presented positive suicide risk. The reliability study showed average weighted kappa coefficients of 0.884 and 0.445 for inter-rater and test
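
    For readers unfamiliar with the statistic named in this record, the following is a minimal sketch of a weighted kappa for two raters. It assumes linear disagreement weights and ordered rating categories; the abstract does not say which weighting scheme the authors used, and the function name `weighted_kappa` and the example ratings are hypothetical.

```python
from collections import Counter


def weighted_kappa(rater_a, rater_b, categories):
    """Linear-weighted Cohen's kappa for two raters (illustrative sketch).

    `categories` lists the ordered rating levels; the linear weighting
    is an assumption, not a detail taken from the abstract.
    """
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("both raters must rate the same non-empty set of items")
    k = len(categories)
    if k < 2:
        raise ValueError("at least two categories are required")
    index = {c: i for i, c in enumerate(categories)}
    n = len(rater_a)
    # Joint and marginal counts of the two raters' category choices.
    observed = Counter((index[a], index[b]) for a, b in zip(rater_a, rater_b))
    row = Counter(index[a] for a in rater_a)
    col = Counter(index[b] for b in rater_b)
    disagree_obs = 0.0
    disagree_exp = 0.0
    for i in range(k):
        for j in range(k):
            weight = abs(i - j) / (k - 1)  # 0 on the diagonal, 1 at the extremes
            disagree_obs += weight * observed[(i, j)] / n
            disagree_exp += weight * row[i] * col[j] / (n * n)
    if disagree_exp == 0:
        raise ValueError("kappa is undefined when no disagreement is expected")
    return 1.0 - disagree_obs / disagree_exp


# Hypothetical ratings on a three-level scale (0, 1, 2).
print(weighted_kappa([0, 1, 2, 2, 1, 0], [0, 1, 2, 1, 1, 0], [0, 1, 2]))
```

    A value of 1 indicates perfect agreement and 0 indicates agreement no better than chance.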

  5. TURKISH VERSION QUALITY OF LIFE IN ESSENTIAL TREMOR QUESTIONNAIRE (QUEST): VALIDITY AND RELIABILITY STUDY.

    PubMed

    Güler, Sibel; Turan, F Nesrin

    2015-09-30

    Our aim was to translate the Quality of Life in Essential Tremor Questionnaire (QUEST) advanced by Troster (2005) and to analyse the validity and reliability of this questionnaire. Two hundred twelve consecutive patients with essential tremor (ET) and forty-three control subjects were included in the study. Permission for the translation and validation of the QUEST scale was obtained. The translation was performed according to the guidelines provided by the publisher. After the translation, the final version of the scale was administered to both groups to determine its reliability and validity. The QUEST Physical, Psychosocial, communication, Hobbies/leisure and Work/finance scores were 0.967, 0.968, 0.933, 0.964 and 0.925, respectively. There were good correlations between each of the QUEST scores that were indicative of good internal consistency. Additionally, we observed that all of the QUEST scores were most strongly related to the right and left arms (p=0.0001). However, we observed that all of the QUEST scores were weakly related to the voice, head and right leg (p=0.0001). These findings support the notion that the Turkish version of the Quality of Life in Essential Tremor (QUEST) questionnaire is a valid and reliable tool for the assessment of the quality of life of patients with ET.

  6. Validation of Satellite-Based Objective Overshooting Cloud-Top Detection Methods Using CloudSat Cloud Profiling Radar Observations

    NASA Technical Reports Server (NTRS)

    Bedka, Kristopher M.; Dworak, Richard; Brunner, Jason; Feltz, Wayne

    2012-01-01

    Two satellite infrared-based overshooting convective cloud-top (OT) detection methods have recently been described in the literature: 1) the 11-µm infrared window channel texture (IRW texture) method, which uses IRW channel brightness temperature (BT) spatial gradients and thresholds, and 2) the water vapor minus IRW BT difference (WV-IRW BTD). While both methods show good performance in published case study examples, it is important to quantitatively validate these methods relative to overshooting top events across the globe. Unfortunately, no overshooting top database currently exists that could be used in such a study. This study examines National Aeronautics and Space Administration CloudSat Cloud Profiling Radar data to develop an OT detection validation database that is used to evaluate the IRW-texture and WV-IRW BTD OT detection methods. CloudSat data were manually examined over a 1.5-yr period to identify cases in which the cloud top penetrates above the tropopause height defined by a numerical weather prediction model and the surrounding cirrus anvil cloud top, producing 111 confirmed overshooting top events. When applied to Moderate Resolution Imaging Spectroradiometer (MODIS)-based Geostationary Operational Environmental Satellite-R Series (GOES-R) Advanced Baseline Imager proxy data, the IRW-texture (WV-IRW BTD) method offered a 76% (96%) probability of OT detection (POD) and 16% (81%) false-alarm ratio. Case study examples show that WV-IRW BTD > 0 K identifies much of the deep convective cloud top, while the IRW-texture method focuses only on regions with a spatial scale near that of commonly observed OTs. The POD decreases by 20% when IRW-texture is applied to current geostationary imager data, highlighting the importance of imager spatial resolution for observing and detecting OT regions.
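
    The two scores quoted in this record can be illustrated with a short sketch. It uses the standard forecast-verification definitions (POD = hits / (hits + misses); false-alarm ratio = false alarms / (hits + false alarms)), which are assumed here rather than taken from the abstract. The function name `pod_and_far` and the counts are hypothetical and are not the study's data.

```python
def pod_and_far(hits, misses, false_alarms):
    """Return (probability of detection, false-alarm ratio).

    hits         : reference events that the method detected
    misses       : reference events that the method did not detect
    false_alarms : detections with no matching reference event
    """
    if hits + misses == 0:
        raise ValueError("POD is undefined without any reference events")
    if hits + false_alarms == 0:
        raise ValueError("FAR is undefined without any detections")
    pod = hits / (hits + misses)
    far = false_alarms / (hits + false_alarms)
    return pod, far


# Hypothetical counts, chosen only to show the arithmetic.
pod, far = pod_and_far(hits=80, misses=20, false_alarms=20)
print(f"POD = {pod:.2f}, FAR = {far:.2f}")
```

    A method can score a high POD and a high false-alarm ratio at the same time, which is the pattern reported above for WV-IRW BTD.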

  7. Critical validation studies of neurofeedback.

    PubMed

    Gruzelier, John; Egner, Tobias

    2005-01-01

    The field of neurofeedback training has proceeded largely without validation. In this article the authors review studies directed at validating sensory motor rhythm, beta and alpha-theta protocols for improving attention, memory, and music performance in healthy participants. Importantly, benefits were demonstrable with cognitive and neurophysiologic measures that were predicted on the basis of regression models of learning to enhance sensory motor rhythm and beta activity. The first evidence of operant control over the alpha-theta ratio is provided, together with remarkable improvements in artistic aspects of music performance equivalent to two class grades in conservatory students. These are initial steps in providing a much needed scientific basis to neurofeedback.

  8. A new framework of statistical inferences based on the valid joint sampling distribution of the observed counts in an incomplete contingency table.

    PubMed

    Tian, Guo-Liang; Li, Hui-Qiong

    2017-08-01

    Some existing confidence interval methods and hypothesis testing methods in the analysis of a contingency table with incomplete observations in both margins entirely depend on an underlying assumption that the sampling distribution of the observed counts is a product of independent multinomial/binomial distributions for complete and incomplete counts. However, it can be shown that this independency assumption is incorrect and can result in unreliable conclusions because of the under-estimation of the uncertainty. Therefore, the first objective of this paper is to derive the valid joint sampling distribution of the observed counts in a contingency table with incomplete observations in both margins. The second objective is to provide a new framework for analyzing incomplete contingency tables based on the derived joint sampling distribution of the observed counts by developing a Fisher scoring algorithm to calculate maximum likelihood estimates of parameters of interest, the bootstrap confidence interval methods, and the bootstrap testing hypothesis methods. We compare the differences between the valid sampling distribution and the sampling distribution under the independency assumption. Simulation studies showed that average/expected confidence-interval widths of parameters based on the sampling distribution under the independency assumption are shorter than those based on the new sampling distribution, yielding unrealistic results. A real data set is analyzed to illustrate the application of the new sampling distribution for incomplete contingency tables and the analysis results again confirm the conclusions obtained from the simulation studies.

  9. Assessing reliability and validity measures in managed care studies.

    PubMed

    Montoya, Isaac D

    2003-01-01

    To review the reliability and validity literature and develop an understanding of these concepts as applied to managed care studies. Reliability is a test of how well an instrument measures the same input at varying times and under varying conditions. Validity is a test of how accurately an instrument measures what one believes is being measured. A review of reliability and validity instructional material was conducted. Studies of managed care practices and programs abound. However, many of these studies utilize measurement instruments that were developed for other purposes or for a population other than the one being sampled. In other cases, instruments have been developed without any testing of the instrument's performance. The lack of reliability and validity information may limit the value of these studies. This is particularly true when data are collected for one purpose and used for another. The usefulness of certain studies without reliability and validity measures is questionable, especially in cases where the literature contradicts itself

  10. 40 CFR 761.392 - Preparing validation study samples.

    Code of Federal Regulations, 2014 CFR

    2014-07-01

    ... establish a surface concentration to be included in the standard operating procedure. The surface levels of... Under § 761.79(d)(4) § 761.392 Preparing validation study samples. (a)(1) To validate a procedure to... surfaces must be ≥20 µg/100 cm2. (2) To validate a procedure to decontaminate a specified surface...

  11. 40 CFR 761.392 - Preparing validation study samples.

    Code of Federal Regulations, 2012 CFR

    2012-07-01

    ... establish a surface concentration to be included in the standard operating procedure. The surface levels of... Under § 761.79(d)(4) § 761.392 Preparing validation study samples. (a)(1) To validate a procedure to... surfaces must be ≥20 µg/100 cm2. (2) To validate a procedure to decontaminate a specified surface...

  12. 40 CFR 761.392 - Preparing validation study samples.

    Code of Federal Regulations, 2013 CFR

    2013-07-01

    ... establish a surface concentration to be included in the standard operating procedure. The surface levels of... Under § 761.79(d)(4) § 761.392 Preparing validation study samples. (a)(1) To validate a procedure to... surfaces must be ≥20 µg/100 cm2. (2) To validate a procedure to decontaminate a specified surface...

  13. Statistical validation of earthquake related observations

    NASA Astrophysics Data System (ADS)

    Kossobokov, V. G.

    2011-12-01

    The confirmed fractal nature of earthquakes and their distribution in space and time implies that many traditional estimations of seismic hazard (from term-less to short-term ones) are usually based on erroneous assumptions of easily tractable or, conversely, delicately-designed models. The widespread practice of deceptive modeling considered as a "reasonable proxy" of the natural seismic process leads to seismic hazard assessment of unknown quality, whose errors propagate non-linearly into inflicted estimates of risk and, eventually, into unexpected societal losses of unacceptable level. The studies aimed at forecast/prediction of earthquakes must include validation in retrospective tests (at least) and, eventually, in prospective tests. In the absence of such control a suggested "precursor/signal" remains a "candidate", whose link to the target seismic event is a model assumption. Predicting in advance is the only decisive test of forecast/predictions and, therefore, the score-card of any "established precursor/signal" represented by the empirical probabilities of alarms and failures-to-predict achieved in prospective testing must prove statistical significance, rejecting the null-hypothesis of random coincidental occurrence in advance of target earthquakes. We again suggest the so-called "Seismic Roulette" null-hypothesis as the most adequate undisturbed random alternative accounting for the empirical spatial distribution of earthquakes: (i) Consider a roulette wheel with as many sectors as the number of earthquake locations from a sample catalog representing seismic locus, a sector per each location and (ii) make your bet according to prediction (i.e., determine which locations are inside the area of alarm, and put one chip in each of the corresponding sectors); (iii) Nature turns the wheel; (iv) accumulate statistics of wins and losses along with the number of chips spent. If a precursor in charge of prediction exposes an imperfection of Seismic Roulette then, having in mind
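
    One way to read steps (i) to (iv) is as a binomial significance test, sketched below. This is an interpretation under stated assumptions, not the author's code: each target earthquake is treated as an independent spin that lands in an alarm sector with probability equal to the fraction of sectors holding a chip. The function names and the numbers in the usage lines are hypothetical.

```python
from math import comb


def alarm_fraction(n_sectors, n_chips):
    """Fraction of roulette sectors (catalog locations) covered by the alarm."""
    if n_sectors <= 0 or not 0 <= n_chips <= n_sectors:
        raise ValueError("need n_sectors > 0 and 0 <= n_chips <= n_sectors")
    return n_chips / n_sectors


def roulette_p_value(n_targets, n_hits, fraction):
    """Chance probability of at least `n_hits` wins in `n_targets` spins.

    Assumes independent spins with a constant win probability `fraction`,
    i.e. a binomial model of the random null hypothesis.
    """
    if not 0.0 <= fraction <= 1.0:
        raise ValueError("fraction must lie in [0, 1]")
    if not 0 <= n_hits <= n_targets:
        raise ValueError("need 0 <= n_hits <= n_targets")
    return sum(
        comb(n_targets, k) * fraction**k * (1.0 - fraction) ** (n_targets - k)
        for k in range(n_hits, n_targets + 1)
    )


# Hypothetical score-card: chips on 300 of 1000 sectors, 8 of 10 targets predicted.
p = roulette_p_value(n_targets=10, n_hits=8, fraction=alarm_fraction(1000, 300))
print(f"chance probability of doing at least this well: {p:.5f}")
```

    A small value means the score-card is unlikely under random guessing that respects the empirical spatial distribution of earthquakes.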

  14. 41 CFR 60-3.7 - Use of other validity studies.

    Code of Federal Regulations, 2010 CFR

    2010-07-01

    ... studies. 60-3.7 Section 60-3.7 Public Contracts and Property Management Other Provisions Relating to... of other validity studies. A. Validity studies not conducted by the user. Users may, under certain circumstances, support the use of selection procedures by validity studies conducted by other users or conducted...

  15. 41 CFR 60-3.7 - Use of other validity studies.

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    ... studies. 60-3.7 Section 60-3.7 Public Contracts and Property Management Other Provisions Relating to... of other validity studies. A. Validity studies not conducted by the user. Users may, under certain circumstances, support the use of selection procedures by validity studies conducted by other users or conducted...

  16. First validation of satellite microwave liquid water path with ship-based observations in marine low clouds

    NASA Astrophysics Data System (ADS)

    Painemal, D.; Cadeddu, M. P.; Greenwald, T. J.; Minnis, P.

    2015-12-01

    We present the first validation study of satellite microwave liquid water path, from four operational sensors, against in-situ observations from a ship-borne three-channel microwave radiometer collected over the northeast Pacific during May-August of 2013, along a ship transect length of 40° (33.7°N, 118.2°W - 21.3°N, 157.8°W). The satellite sensors analyzed here are: the Tropical Rainfall Measuring Mission (TRMM) Microwave Imager (TMI), Special Sensor Microwave Imager/Sounder (SSMIS) on the Defense Meteorological Satellite Program F16 and F17 satellites, and the Advanced Microwave Scanning Radiometer (AMSR-2) on board the Global Change Observation Mission - Water (GCOM-W1). Satellite retrievals show an overall correlation with hourly-averaged in-situ observations of 0.86 and a positive bias of 10.0 g m⁻², which decreases to 1.0 g m⁻² and a correlation that increases to 0.91 when selecting overcast scenes. The satellite bias for broken scenes remains below 22.2 g m⁻², although the removal of clear-sky in-situ samples yields an unbiased relationship. Satellites produce a diurnal cycle with amplitudes (35-47 g m⁻²) consistent with ship-based observations. Longitudinal biases remain below 17.4 g m⁻², and they are negligible in overcast scenes and when clear-sky samples are removed from the in-situ hourly average. Our study indicates that satellite microwave retrievals are a reliable dataset for climate studies in marine warm low clouds. The implications for satellite visible/infrared retrievals will also be discussed.
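
    The bias and correlation figures in this record are the usual summary statistics for matched satellite and in-situ samples. A minimal sketch follows, assuming the bias is the mean satellite value minus the mean ship value and the correlation is Pearson's r; the abstract does not spell out either definition. The function name `bias_and_correlation` and the sample values are hypothetical.

```python
from math import sqrt


def bias_and_correlation(satellite, ship):
    """Mean bias (satellite minus ship) and Pearson correlation of matched pairs."""
    n = len(satellite)
    if n != len(ship) or n < 2:
        raise ValueError("need at least two matched pairs")
    mean_sat = sum(satellite) / n
    mean_ship = sum(ship) / n
    cov = sum((s - mean_sat) * (g - mean_ship) for s, g in zip(satellite, ship))
    var_sat = sum((s - mean_sat) ** 2 for s in satellite)
    var_ship = sum((g - mean_ship) ** 2 for g in ship)
    if var_sat == 0 or var_ship == 0:
        raise ValueError("correlation is undefined for a constant series")
    return mean_sat - mean_ship, cov / sqrt(var_sat * var_ship)


# Hypothetical hourly liquid water path values in g m^-2 (not the study's data).
bias, r = bias_and_correlation([60.0, 110.0, 160.0], [50.0, 100.0, 150.0])
print(f"bias = {bias:.1f} g m^-2, r = {r:.2f}")
```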

  17. Observations of Tropospheric Carbon Monoxide From the Atmospheric InfraRed Sounder (AIRS): An Alternative Retrieval Scheme and Its Validation.

    NASA Astrophysics Data System (ADS)

    Douglass, D. H.; Kalnay, E.; Li, H.; Cai, M.

    2005-05-01

    Carbon monoxide (CO) is present in the troposphere as a product of fossil fuel combustion, biomass burning and the oxidation of volatile hydrocarbons. It is the principal sink of the hydroxyl radical (OH), thereby affecting the concentrations of greenhouse gases such as CH4 and O3. In addition, CO has a lifetime of 1-3 months, making it a good tracer for studying the long range transport of pollution. Satellite observations present a valuable tool in the investigation of tropospheric CO. The Atmospheric InfraRed Sounder (AIRS), onboard the Aqua satellite, is sensitive to tropospheric CO in a number of its 2378 channels. This sensitivity to CO, combined with the daily global coverage provided by AIRS, makes AIRS a potentially useful instrument for observing CO sources and transport. A maximum a posteriori (MAP) retrieval scheme (Rodgers 2000) has been developed for AIRS, to provide CO profiles from near-surface altitudes to around 150 hPa. An extensive validation data set, consisting of over 50 in-situ aircraft CO profiles, has been constructed. This data set combines CO data from a number of independent aircraft campaigns. Results from this validation study and comparisons with the AIRS level 2 CO product will be presented. Rodgers, C. D. (2000), Inverse Methods for Atmospheric Sounding : Theory and Practice, World Scientific, Singapore.

  18. Workflow interruptions and mental workload in hospital pediatricians: an observational study.

    PubMed

    Weigl, Matthias; Müller, Andreas; Angerer, Peter; Hoffmann, Florian

    2014-09-24

    Pediatricians' workload is increasingly thought to affect pediatricians' quality of work life and patient safety. Workflow interruptions are a frequent stressor in clinical work, impeding clinicians' attention and contributing to clinical malpractice. We aimed to investigate prospective associations of workflow interruptions with multiple dimensions of mental workload in pediatricians during clinical day shifts. In an Academic Children's Hospital a prospective study of 28 full shift observations was conducted among pediatricians providing ward coverage. The prevalence of workflow interruptions was based on expert observation using a validated observation instrument. Concurrently, Pediatricians' workload ratings were assessed with three workload dimensions of the well-validated NASA-Task Load Index: mental demands, effort, and frustration. Observed pediatricians were, on average, disrupted 4.7 times per hour. Most frequent were interruptions by colleagues (30.2%), nursing staff (29.7%), and by telephone/beeper calls (16.3%). Interruption measures were correlated with two workload outcomes of interest: frequent workflow interruptions were related to less cognitive demands, but frequent interruptions were associated with increased frustration. With regard to single sources, interruptions by colleagues showed the strongest associations to workload. The findings provide insights into specific pathways between different types of interruptions and pediatricians' mental workload. These findings suggest further research and yield a number of work and organization re-design suggestions for pediatric care.

  19. Anxiety in early Parkinson's disease: Validation of the Italian observer-rated version of the Parkinson Anxiety Scale (OR-PAS).

    PubMed

    Santangelo, Gabriella; Falco, Fabrizia; D'Iorio, Alfonsina; Cuoco, Sofia; Raimo, Simona; Amboni, Marianna; Pellecchia, Maria Teresa; Longo, Katia; Vitale, Carmine; Barone, Paolo

    2016-08-15

    Anxiety disorders are common in Parkinson's Disease (PD) and their identification is relevant even at early stages. The Parkinson Anxiety Scale (PAS) evaluates anxiety in PD; it was used only in the original validation study in PD patients mainly at 2-3 stages of Hoehn & Yahr system (H&Y). The study aimed to investigate psychometric properties of observer-rated version of the PAS (OR-PAS), prevalence rate of anxiety and its features, compared with diagnostic criteria in early PD patients. A sample of 101 PD patients with H&Y:1-2 underwent the OR-PAS. To assess convergent and divergent validity, PD patients underwent Beck Anxiety Inventory, and scales assessing depression, apathy, anhedonia and cognition. To diagnose anxiety disorders, Mini International Neuropsychiatric Inventory was used as gold standard. A "receiver operating characteristics" curve was obtained; positive and negative predictive values were calculated for different cut-off points of the OR-PAS and its subscales. There was no missing data, no floor and ceiling effects; mean score was 12.2±10.1; Cronbach's alpha was 0.899. The OR-PAS showed good convergent and divergent validity. Maximum discrimination was obtained with a cut-off score of 8.5. The anxiety occurred in 59 patients (58.4%). The OR-PAS is a reliable and valid screening instrument for assessing anxiety in patients at early PD. Anxiety was found in 58.4% of PD patients, demonstrating that anxiety occurs even at early stages. Copyright © 2016 Elsevier B.V. All rights reserved.
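
    Cronbach's alpha, reported in this record as 0.899, is an internal-consistency coefficient. The sketch below uses the textbook formula alpha = k/(k-1) * (1 - sum of item variances / variance of total scores). The function name `cronbach_alpha` and the scores are hypothetical and unrelated to the OR-PAS data.

```python
from statistics import variance


def cronbach_alpha(item_scores):
    """Cronbach's alpha from a list of respondents, each a list of item scores."""
    if len(item_scores) < 2:
        raise ValueError("need at least two respondents")
    n_items = len(item_scores[0])
    if n_items < 2 or any(len(row) != n_items for row in item_scores):
        raise ValueError("every respondent needs the same number (>= 2) of items")
    # Sample variance of each item across respondents, then of the total scores.
    item_variance_sum = sum(variance(column) for column in zip(*item_scores))
    total_variance = variance([sum(row) for row in item_scores])
    if total_variance == 0:
        raise ValueError("alpha is undefined when all total scores are equal")
    return n_items / (n_items - 1) * (1 - item_variance_sum / total_variance)


# Hypothetical scores: four respondents answering three items.
print(round(cronbach_alpha([[2, 3, 3], [1, 1, 2], [4, 4, 5], [3, 2, 3]]), 3))
```

    Values close to 1 indicate that the items vary together across respondents.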

  20. 41 CFR 60-3.14 - Technical standards for validity studies.

    Code of Federal Regulations, 2010 CFR

    2010-07-01

    ... likely to affect validity differences; or that these factors are included in the design of the study and... construct validity is both an extensive and arduous effort involving a series of research studies, which... validity studies. 60-3.14 Section 60-3.14 Public Contracts and Property Management Other Provisions...

  1. 41 CFR 60-3.14 - Technical standards for validity studies.

    Code of Federal Regulations, 2013 CFR

    2013-07-01

    ... likely to affect validity differences; or that these factors are included in the design of the study and... construct validity is both an extensive and arduous effort involving a series of research studies, which... validity studies. 60-3.14 Section 60-3.14 Public Contracts and Property Management Other Provisions...

  2. 41 CFR 60-3.14 - Technical standards for validity studies.

    Code of Federal Regulations, 2014 CFR

    2014-07-01

    ... likely to affect validity differences; or that these factors are included in the design of the study and... construct validity is both an extensive and arduous effort involving a series of research studies, which... validity studies. 60-3.14 Section 60-3.14 Public Contracts and Property Management Other Provisions...

  3. 41 CFR 60-3.14 - Technical standards for validity studies.

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    ... likely to affect validity differences; or that these factors are included in the design of the study and... construct validity is both an extensive and arduous effort involving a series of research studies, which... validity studies. 60-3.14 Section 60-3.14 Public Contracts and Property Management Other Provisions...

  4. 41 CFR 60-3.14 - Technical standards for validity studies.

    Code of Federal Regulations, 2012 CFR

    2012-07-01

    ... likely to affect validity differences; or that these factors are included in the design of the study and... construct validity is both an extensive and arduous effort involving a series of research studies, which... validity studies. 60-3.14 Section 60-3.14 Public Contracts and Property Management Other Provisions...

  5. Derivation and Validation of Two Decision Instruments for Selective Chest CT in Blunt Trauma: A Multicenter Prospective Observational Study (NEXUS Chest CT)

    PubMed Central

    Rodriguez, Robert M.; Langdorf, Mark I.; Nishijima, Daniel; Baumann, Brigitte M.; Hendey, Gregory W.; Medak, Anthony J.; Raja, Ali S.; Allen, Isabel E.; Mower, William R.

    2015-01-01

    Background: Unnecessary diagnostic imaging leads to higher costs, longer emergency department stays, and increased patient exposure to ionizing radiation. We sought to prospectively derive and validate two decision instruments (DIs) for selective chest computed tomography (CT) in adult blunt trauma patients. Methods and Findings: From September 2011 to May 2014, we prospectively enrolled blunt trauma patients over 14 y of age presenting to eight US, urban level 1 trauma centers in this observational study. During the derivation phase, physicians recorded the presence or absence of 14 clinical criteria before viewing chest imaging results. We determined injury outcomes by CT radiology readings and categorized injuries as major or minor according to an expert-panel-derived clinical classification scheme. We then employed recursive partitioning to derive two DIs: Chest CT-All maximized sensitivity for all injuries, and Chest CT-Major maximized sensitivity for only major thoracic injuries (while increasing specificity). In the validation phase, we employed similar methodology to prospectively test the performance of both DIs. We enrolled 11,477 patients: 6,002 patients in the derivation phase and 5,475 patients in the validation phase. The derived Chest CT-All DI consisted of (1) abnormal chest X-ray, (2) rapid deceleration mechanism, (3) distracting injury, (4) chest wall tenderness, (5) sternal tenderness, (6) thoracic spine tenderness, and (7) scapular tenderness. The Chest CT-Major DI had the same criteria without rapid deceleration mechanism. In the validation phase, Chest CT-All had a sensitivity of 99.2% (95% CI 95.4%–100%), a specificity of 20.8% (95% CI 19.2%–22.4%), and a negative predictive value (NPV) of 99.8% (95% CI 98.9%–100%) for major injury, and a sensitivity of 95.4% (95% CI 93.6%–96.9%), a specificity of 25.5% (95% CI 23.5%–27.5%), and a NPV of 93.9% (95% CI 91.5%–95.8%) for either major or minor injury. Chest CT-Major had a sensitivity

  6. Derivation and validation of two decision instruments for selective chest CT in blunt trauma: a multicenter prospective observational study (NEXUS Chest CT).

    PubMed

    Rodriguez, Robert M; Langdorf, Mark I; Nishijima, Daniel; Baumann, Brigitte M; Hendey, Gregory W; Medak, Anthony J; Raja, Ali S; Allen, Isabel E; Mower, William R

    2015-10-01

    Unnecessary diagnostic imaging leads to higher costs, longer emergency department stays, and increased patient exposure to ionizing radiation. We sought to prospectively derive and validate two decision instruments (DIs) for selective chest computed tomography (CT) in adult blunt trauma patients. From September 2011 to May 2014, we prospectively enrolled blunt trauma patients over 14 y of age presenting to eight US, urban level 1 trauma centers in this observational study. During the derivation phase, physicians recorded the presence or absence of 14 clinical criteria before viewing chest imaging results. We determined injury outcomes by CT radiology readings and categorized injuries as major or minor according to an expert-panel-derived clinical classification scheme. We then employed recursive partitioning to derive two DIs: Chest CT-All maximized sensitivity for all injuries, and Chest CT-Major maximized sensitivity for only major thoracic injuries (while increasing specificity). In the validation phase, we employed similar methodology to prospectively test the performance of both DIs. We enrolled 11,477 patients: 6,002 in the derivation phase and 5,475 in the validation phase. The derived Chest CT-All DI consisted of (1) abnormal chest X-ray, (2) rapid deceleration mechanism, (3) distracting injury, (4) chest wall tenderness, (5) sternal tenderness, (6) thoracic spine tenderness, and (7) scapular tenderness. The Chest CT-Major DI had the same criteria without rapid deceleration mechanism. In the validation phase, Chest CT-All had a sensitivity of 99.2% (95% CI 95.4%-100%), a specificity of 20.8% (95% CI 19.2%-22.4%), and a negative predictive value (NPV) of 99.8% (95% CI 98.9%-100%) for major injury, and a sensitivity of 95.4% (95% CI 93.6%-96.9%), a specificity of 25.5% (95% CI 23.5%-27.5%), and a NPV of 93.9% (95% CI 91.5%-95.8%) for either major or minor injury. Chest CT-Major had a sensitivity of 99.2% (95% CI 95.4%-100%), a specificity of

  7. Observational study to calculate addictive risk to opioids: a validation study of a predictive algorithm to evaluate opioid use disorder.

    PubMed

    Brenton, Ashley; Richeimer, Steven; Sharma, Maneesh; Lee, Chee; Kantorovich, Svetlana; Blanchard, John; Meshkin, Brian

    2017-01-01

    Opioid abuse in chronic pain patients is a major public health issue, with rapidly increasing addiction rates and deaths from unintentional overdose more than quadrupling since 1999. This study seeks to determine the predictability of aberrant behavior to opioids using a comprehensive scoring algorithm incorporating phenotypic risk factors and neuroscience-associated single-nucleotide polymorphisms (SNPs). The Proove Opioid Risk (POR) algorithm determines the predictability of aberrant behavior to opioids using a comprehensive scoring algorithm incorporating phenotypic risk factors and neuroscience-associated SNPs. In a validation study with 258 subjects with diagnosed opioid use disorder (OUD) and 650 controls who reported using opioids, the POR successfully categorized patients at high and moderate risks of opioid misuse or abuse with 95.7% sensitivity. Regardless of changes in the prevalence of opioid misuse or abuse, the sensitivity of POR remained >95%. The POR correctly stratifies patients into low-, moderate-, and high-risk categories to appropriately identify patients at need for additional guidance, monitoring, or treatment changes.
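
The abstract's remark that sensitivity stays above 95% regardless of prevalence reflects a general property: sensitivity and specificity are conditioned on true disease status, whereas predictive values shift with prevalence. The sketch below illustrates this with Bayes' rule. The 95.7% sensitivity comes from the abstract; the specificity value and the prevalence grid are hypothetical, chosen only for illustration, and `predictive_values` is not a name from the study.

```python
def predictive_values(sensitivity, specificity, prevalence):
    """Return (PPV, NPV) for a binary test via Bayes' rule."""
    true_pos = sensitivity * prevalence
    false_neg = (1 - sensitivity) * prevalence
    true_neg = specificity * (1 - prevalence)
    false_pos = (1 - specificity) * (1 - prevalence)
    ppv = true_pos / (true_pos + false_pos)
    npv = true_neg / (true_neg + false_neg)
    return ppv, npv


SENSITIVITY = 0.957  # reported in the abstract
SPECIFICITY = 0.90   # hypothetical; the abstract does not report specificity

for prevalence in (0.05, 0.20, 0.50):  # hypothetical prevalence values
    ppv, npv = predictive_values(SENSITIVITY, SPECIFICITY, prevalence)
    print(f"prevalence={prevalence:.2f}  PPV={ppv:.3f}  NPV={npv:.3f}")
```

Sensitivity is an input here and never changes; only the predictive values move as prevalence changes.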

  8. Offshore Radiation Observations for Climate Research at the CERES Ocean Validation Experiment

    NASA Technical Reports Server (NTRS)

    Rutledge, Charles K.; Schuster, Gregory L.; Charlock, Thomas P.; Denn, Frederick M.; Smith, William L., Jr.; Fabbri, Bryan E.; Madigan, James J., Jr.; Knapp, Robert J.

    2006-01-01

    When radiometers on a satellite are pointed towards the planet with the goal of understanding a phenomenon quantitatively, rather than just creating a pleasing image, the task at hand is often problematic. The signal at the detector can be affected by scattering, absorption, and emission; and these can be due to atmospheric constituents (gases, clouds, and aerosols), the earth's surface, and subsurface features. When targeting surface phenomena, the remote sensing algorithm needs to account for the radiation associated with the atmospheric constituents. Likewise, one needs to correct for the radiation leaving the surface, when atmospheric phenomena are of interest. Rigorous validation of such remote sensing products is a real challenge. In visible and near infrared wavelengths, untangling the jumble of effects on atmospheric radiation is best accomplished over dark surfaces with fairly uniform reflective properties (spatial homogeneity) in the satellite instrument's field of view (FOV). The ocean's surface meets these criteria; land surfaces, which are brighter, more spatially inhomogeneous, and more changeable with time, generally do not. NASA's Clouds and the Earth's Radiant Energy System (CERES) project has used this backdrop to establish a radiation monitoring site in Virginia's coastal Atlantic Ocean. The project, called the CERES Ocean Validation Experiment (COVE), is located on a rigid ocean platform allowing the accurate measurement of radiation parameters that require precise leveling and pointing unavailable from ships or buoys. The COVE site is an optimal location for verifying radiative transfer models and remote sensing algorithms used in climate research; because of the platform's small size, there are no island wake effects; and suites of sensors can be simultaneously trained both on the sky and directly on the ocean itself. This paper describes the site, the types of measurements made, multiple years of atmospheric and ocean surface radiation observations, and

  9. Life beyond MSE and R2 — improving validation of predictive models with observations

    NASA Astrophysics Data System (ADS)

    Papritz, Andreas; Nussbaum, Madlene

    2017-04-01

    Machine learning and statistical predictive methods are evaluated by the closeness of predictions to observations of a test dataset. Common criteria for rating predictive methods are bias and mean square error (MSE), characterizing systematic and random prediction errors. Many studies also report R2-values, but their meaning is not always clear (correlation between observations and predictions or MSE skill score; Wilks, 2011). The same criteria are also used for choosing tuning parameters of predictive procedures by cross-validation and bagging (e.g. Hastie et al., 2009). For evident reasons, atmospheric sciences have developed a rich box of tools for forecast verification. Specific criteria have been proposed for evaluating deterministic and probabilistic predictions of binary, multinomial, ordinal and continuous responses (see reviews by Wilks, 2011, Jolliffe and Stephenson, 2012 and Gneiting et al., 2007). It appears that these techniques are not very well-known in the geosciences community interested in machine learning. In our presentation we review techniques that offer more insight into proximity of data and predictions than bias, MSE and R2 alone. We mention here only examples: (i) Graphing observations vs. predictions is usually more appropriate than the reverse (Piñeiro et al., 2008). (ii) The decomposition of the Brier score (= MSE for probabilistic predictions of binary yes/no data) into reliability and resolution reveals (conditional) bias and capability of discriminating yes/no observations by the predictions. We illustrate the approaches by applications from digital soil mapping studies. Gneiting, T., Balabdaoui, F., and Raftery, A. E. (2007). Probabilistic forecasts, calibration and sharpness. Journal of the Royal Statistical Society Series B, 69, 243-268. Hastie, T., Tibshirani, R., and Friedman, J. (2009). The Elements of Statistical Learning; Data Mining, Inference and Prediction. Springer, New York, second edition. Jolliffe, I. T. and
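
The Brier score decomposition mentioned in point (ii) can be sketched as follows. This is a minimal illustration of the standard Murphy decomposition (Brier score = reliability - resolution + uncertainty), grouping cases by distinct forecast probability; it is not the authors' code. The function name and the toy forecasts and outcomes are hypothetical.

```python
from collections import defaultdict


def brier_decomposition(forecasts, outcomes):
    """Return (brier, reliability, resolution, uncertainty).

    forecasts: predicted probabilities of "yes"; outcomes: 0/1 observations.
    Cases are grouped by distinct forecast value, so the identity
    brier = reliability - resolution + uncertainty holds exactly.
    """
    n = len(forecasts)
    base_rate = sum(outcomes) / n

    groups = defaultdict(list)
    for forecast, outcome in zip(forecasts, outcomes):
        groups[forecast].append(outcome)

    reliability = 0.0  # squared gap between forecast and observed frequency
    resolution = 0.0   # squared gap between observed frequency and base rate
    for forecast, observed in groups.items():
        frequency = sum(observed) / len(observed)
        reliability += len(observed) * (forecast - frequency) ** 2
        resolution += len(observed) * (frequency - base_rate) ** 2
    reliability /= n
    resolution /= n

    uncertainty = base_rate * (1 - base_rate)
    brier = sum((f - o) ** 2 for f, o in zip(forecasts, outcomes)) / n
    return brier, reliability, resolution, uncertainty


# Toy data (hypothetical): ten probabilistic yes/no predictions.
forecasts = [0.1, 0.1, 0.1, 0.1, 0.5, 0.5, 0.9, 0.9, 0.9, 0.9]
outcomes = [0, 0, 0, 1, 0, 1, 1, 1, 1, 0]
brier, rel, res, unc = brier_decomposition(forecasts, outcomes)
print(f"Brier={brier:.3f} reliability={rel:.3f} resolution={res:.3f} uncertainty={unc:.3f}")
```

A small reliability term indicates little conditional bias, and a large resolution term indicates that the predictions separate yes from no cases well. With continuous forecasts one would bin the probabilities first, in which case the identity holds only approximately.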

  10. A validation study of a stochastic model of human interaction

    NASA Astrophysics Data System (ADS)

    Burchfield, Mitchel Talmadge

    The purpose of this dissertation is to validate a stochastic model of human interactions which is part of a developmentalism paradigm. Incorporating elements of ancient and contemporary philosophy and science, developmentalism defines human development as a progression of increasing competence and utilizes compatible theories of developmental psychology, cognitive psychology, educational psychology, social psychology, curriculum development, neurology, psychophysics, and physics. To validate a stochastic model of human interactions, the study addressed four research questions: (a) Does attitude vary over time? (b) What are the distributional assumptions underlying attitudes? (c) Does the stochastic model, $-N\int_{-\infty}^{\infty}\varphi(\chi,\tau)\,\Psi(\tau)\,d\tau$, have utility for the study of attitudinal distributions and dynamics? (d) Are the Maxwell-Boltzmann, Fermi-Dirac, and Bose-Einstein theories applicable to human groups? Approximately 25,000 attitude observations were made using the Semantic Differential Scale. Positions of individuals varied over time and the logistic model predicted observed distributions with correlations between 0.98 and 1.0, with estimated standard errors significantly less than the magnitudes of the parameters. The results bring into question the applicability of Fisherian research designs (Fisher, 1922, 1928, 1938) for behavioral research based on the apparent failure of two fundamental assumptions: the noninteractive nature of the objects being studied and normal distribution of attributes. The findings indicate that individual belief structures are representable in terms of a psychological space which has the same or similar properties as physical space. The psychological space not only has dimension, but individuals interact by force equations similar to those described in theoretical physics models. Nonlinear regression techniques were used to estimate Fermi-Dirac parameters from the data. The model explained a high degree

  11. Examining construct validity of a new naturalistic observational assessment of hand skills for preschool- and school-age children.

    PubMed

    Chien, Chi-Wen; Brown, Ted; McDonald, Rachael

    2012-04-01

    The Assessment of Children's Hand Skills is a new assessment that utilises a naturalistic observational method to capture children's real-life hand skill performance when engaged at various types of daily activities in everyday living contexts. The Assessment of Children's Hand Skills is designed for use with 2- to 12-year-old children with a range of disabilities or health conditions. The study aimed to investigate construct validity of the Assessment of Children's Hand Skills in Australian children. Rasch analysis was used to examine internal construct validity of the Assessment of Children's Hand Skills in a mixed sample of 53 children with disabilities (including autism spectrum disorder, developmental/genetic disorders and physical disabilities) and 85 typically developing children. External construct validity was examined by correlating with three questionnaires evaluating daily living skills and hand skills. Rasch goodness-of-fit analysis suggested that all 22 activity items and 19 of 20 hand skill items in the Assessment of Children's Hand Skills measured a single construct. The Assessment of Children's Hand Skills items were placed in a clinically meaningful hierarchy from easy to hard, and the difficulty range of the items also matched the majority of children with disabilities and typically developing preschool-aged children. Moderate to high correlations (0.59 ≤ Spearman's ρ coefficients ≤ 0.89, P < 0.01) were found with the assessments of daily living and fine motor skills. This study provided preliminary evidence supporting the construct validity of the Assessment of Children's Hand Skills for its clinical application in assessing children's real-life hand skill performance in Australian contexts. © 2012 The Authors Australian Occupational Therapy Journal © 2012 Occupational Therapy Australia.

  12. Initial validation of the prekindergarten Classroom Observation Tool and goal setting system for data-based coaching.

    PubMed

    Crawford, April D; Zucker, Tricia A; Williams, Jeffrey M; Bhavsar, Vibhuti; Landry, Susan H

    2013-12-01

    Although coaching is a popular approach for enhancing the quality of Tier 1 instruction, limited research has addressed observational measures specifically designed to focus coaching on evidence-based practices. This study explains the development of the prekindergarten (pre-k) Classroom Observation Tool (COT) designed for use in a data-based coaching model. We examined psychometric characteristics of the COT and explored how coaches and teachers used the COT goal-setting system. The study included 193 coaches working with 3,909 pre-k teachers in a statewide professional development program. Classrooms served 3 and 4 year olds (n = 56,390) enrolled mostly in Title I, Head Start, and other need-based pre-k programs. Coaches used the COT during a 2-hr observation at the beginning of the academic year. Teachers collected progress-monitoring data on children's language, literacy, and math outcomes three times during the year. Results indicated a theoretically supported eight-factor structure of the COT across language, literacy, and math instructional domains. Overall interrater reliability among coaches was good (.75). Although correlations with an established teacher observation measure were small, significant positive relations between COT scores and children's literacy outcomes indicate promising predictive validity. Patterns of goal-setting behaviors indicate teachers and coaches set an average of 43.17 goals during the academic year, and coaches reported that 80.62% of goals were met. Both coaches and teachers reported the COT was a helpful measure for enhancing quality of Tier 1 instruction. Limitations of the current study and implications for research and data-based coaching efforts are discussed. PsycINFO Database Record (c) 2013 APA, all rights reserved.

  13. 41 CFR 60-3.5 - General standards for validity studies.

    Code of Federal Regulations, 2010 CFR

    2010-07-01

    ... should avoid making employment decisions on the basis of measures of knowledges, skills, or abilities... General standards for validity studies. A. Acceptable types of validity studies. For the purposes of... of these guidelines, section 14 of this part. New strategies for showing the validity of selection...

  14. Observation of early childhood physical aggression: a psychometric study of the system for coding early physical aggression.

    PubMed

    Mesman, Judi; Alink, Lenneke R A; van Zeijl, Jantien; Stolk, Mirjam N; Bakermans-Kranenburg, Marian J; van Ijzendoorn, Marinus H; Juffer, Femmie; Koot, Hans M

    2008-01-01

    We investigated the reliability and (convergent and discriminant) validity of an observational measure of physical aggression in toddlers and preschoolers, originally developed by Keenan and Shaw [1994]. The observation instrument is based on a developmental definition of aggression. Physical aggression was observed twice in a laboratory setting, the first time when children were 1-3 years old, and again 1 year later. Observed physical aggression was significantly related to concurrent mother-rated physical aggression for 2- to 4-year-olds, but not to maternal ratings of nonaggressive externalizing problems, indicating the measure's discriminant validity. However, we did not find significant 1-year stability of observed physical aggression in any of the age groups, whereas mother-rated physical aggression was significantly stable for all ages. The observational measure shows promise, but may have assessed state rather than trait aggression in our study. Copyright 2008 Wiley-Liss, Inc.

  15. Challenges in translating endpoints from trials to observational cohort studies in oncology

    PubMed Central

    Ording, Anne Gulbech; Cronin-Fenton, Deirdre; Ehrenstein, Vera; Lash, Timothy L; Acquavella, John; Rørth, Mikael; Sørensen, Henrik Toft

    2016-01-01

    Clinical trials are considered the gold standard for examining drug efficacy and for approval of new drugs. Medical databases and population surveillance registries are valuable resources for post-approval observational research, which are increasingly used in studies of benefits and risk of new cancer drugs. Here, we address the challenges in translating endpoints from oncology trials to observational studies. Registry-based cohort studies can investigate real-world safety issues – including previously unrecognized concerns – by examining rare endpoints or multiple endpoints at once. In contrast to clinical trials, observational cohort studies typically do not exclude real-world patients from clinical practice, such as old and frail patients with comorbidity. The observational cohort study complements the clinical trial by examining the effectiveness of interventions applied in clinical practice and by providing evidence on long-term clinical outcomes, which are often not feasible to study in a clinical trial. Various endpoints can be included in clinical trials, such as hard endpoints, soft endpoints, surrogate endpoints, and patient-reported endpoints. Each endpoint has its strengths and limitations for use in research studies. Endpoints used in oncology trials are often not applicable in observational cohort studies which are limited by the setting of standard clinical practice and by non-standardized endpoint determination. Observational studies can be more helpful in moving research forward if they restrict focus to appropriate and valid endpoints. PMID:27354827

  16. Beware of external validation! - A Comparative Study of Several Validation Techniques used in QSAR Modelling.

    PubMed

    Majumdar, Subhabrata; Basak, Subhash C

    2018-04-26

    Proper validation is an important aspect of QSAR modelling. External validation is one of the widely used validation methods in QSAR where the model is built on a subset of the data and validated on the rest of the samples. However, its effectiveness for datasets with a small number of samples but large number of predictors remains suspect. Calculating hundreds or thousands of molecular descriptors using currently available software has become the norm in QSAR research, owing to computational advances in the past few decades. Thus, for n chemical compounds and p descriptors calculated for each molecule, the typical chemometric dataset today has high value of p but small n (i.e. n < p). Motivated by the evidence of inadequacies of external validation in estimating the true predictive capability of a statistical model in recent literature, this paper performs an extensive and comparative study of this method with several other validation techniques. We compared four validation methods: leave-one-out, K-fold, external and multi-split validation, using statistical models built using the LASSO regression, which simultaneously performs variable selection and modelling. We used 300 simulated datasets and one real dataset of 95 congeneric amine mutagens for this evaluation. External validation metrics have high variation among different random splits of the data, hence are not recommended for predictive QSAR models. LOO has the overall best performance among all validation methods applied in our scenario. Results from external validation are too unstable for the datasets we analyzed. Based on our findings, we recommend using the LOO procedure for validating QSAR predictive models built on high-dimensional small-sample data. Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.org.
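
The contrast between a single external split and leave-one-out (LOO) validation can be sketched on simulated data. This is only an illustration under stated assumptions: it uses ordinary least squares with one predictor in place of the LASSO models of the study, the data are simulated with an arbitrary seed, and all function names are hypothetical.

```python
import random
import statistics


def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    mean_x, mean_y = statistics.fmean(xs), statistics.fmean(ys)
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def mse(model, xs, ys):
    """Mean squared prediction error of a fitted line on (xs, ys)."""
    slope, intercept = model
    return statistics.fmean([(y - (slope * x + intercept)) ** 2 for x, y in zip(xs, ys)])


def loo_mse(xs, ys):
    """Leave-one-out estimate: refit n times, each time predicting the held-out point."""
    errors = []
    for i in range(len(xs)):
        model = fit_line(xs[:i] + xs[i + 1:], ys[:i] + ys[i + 1:])
        errors.append(mse(model, [xs[i]], [ys[i]]))
    return statistics.fmean(errors)


def external_mse(xs, ys, test_fraction, rng):
    """External validation: one random train/test split."""
    indices = list(range(len(xs)))
    rng.shuffle(indices)
    n_test = max(1, int(len(xs) * test_fraction))
    test, train = indices[:n_test], indices[n_test:]
    model = fit_line([xs[i] for i in train], [ys[i] for i in train])
    return mse(model, [xs[i] for i in test], [ys[i] for i in test])


rng = random.Random(0)  # arbitrary seed for reproducibility
xs = [rng.uniform(0, 10) for _ in range(30)]          # small simulated sample
ys = [2.0 * x + 1.0 + rng.gauss(0, 1.0) for x in xs]  # linear signal plus noise

loo = loo_mse(xs, ys)
splits = [external_mse(xs, ys, 0.2, rng) for _ in range(200)]
print(f"LOO MSE: {loo:.3f}")
print(f"External MSE over 200 random splits: min={min(splits):.3f} "
      f"max={max(splits):.3f} sd={statistics.stdev(splits):.3f}")
```

LOO gives one deterministic estimate for a given dataset, whereas the external estimate depends on which samples happen to land in the test set; the spread across splits is the instability the abstract describes.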

  17. Automated characterization of perceptual quality of clinical chest radiographs: Validation and calibration to observer preference

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Samei, Ehsan, E-mail: samei@duke.edu; Lin, Yuan; Choudhury, Kingshuk R.

    Purpose: The authors previously proposed an image-based technique [Y. Lin et al. Med. Phys. 39, 7019–7031 (2012)] to assess the perceptual quality of clinical chest radiographs. In this study, an observer study was designed and conducted to validate the output of the program against rankings by expert radiologists and to establish the ranges of the output values that reflect the acceptable image appearance so the program output can be used for image quality optimization and tracking. Methods: Using an IRB-approved protocol, 2500 clinical chest radiographs (PA/AP) were collected from our clinical operation. The images were processed through our perceptual quality assessment program to measure their appearance in terms of ten metrics of perceptual image quality: lung gray level, lung detail, lung noise, rib–lung contrast, rib sharpness, mediastinum detail, mediastinum noise, mediastinum alignment, subdiaphragm–lung contrast, and subdiaphragm area. From the results, for each targeted appearance attribute/metric, 18 images were selected such that the images presented a relatively constant appearance with respect to all metrics except the targeted one. The images were then incorporated into a graphical user interface, which displayed them in three panels of six in a random order. Using a DICOM calibrated diagnostic display workstation and under low ambient lighting conditions, each of five participating attending chest radiologists was tasked to spatially order the images based only on the targeted appearance attribute regardless of the other qualities. Once ordered, the observer also indicated the range of image appearances that he/she considered clinically acceptable. The observer data were analyzed in terms of the correlations between the observer and algorithmic rankings and interobserver variability. An observer-averaged acceptable image appearance was also statistically derived for each quality attribute based on the collected individual acceptable

  18. Convergent validity of the Arab Teens Lifestyle Study (ATLS) physical activity questionnaire.

    PubMed

    Al-Hazzaa, Hazzaa M; Al-Sobayel, Hana I; Musaiger, Abdulrahman O

    2011-09-01

    The Arab Teens Lifestyle Study (ATLS) is a multicenter project for assessing the lifestyle habits of Arab adolescents. This study reports on the convergent validity of the physical activity questionnaire used in ATLS against an electronic pedometer. Participants were 39 males and 36 females randomly selected from secondary schools, with a mean age of 16.1 ± 1.1 years. The ATLS self-reported questionnaire was validated against the electronic pedometer for three consecutive weekdays. Mean step counts were 6,866 ± 3,854 steps/day with no significant gender difference observed. Questionnaire results showed no significant gender differences in time spent on total or moderate-intensity activities. However, males spent significantly more time than females on vigorous-intensity activity. The correlation of step counts with total time spent on all activities by the questionnaire was 0.369. The relationship of step counts was stronger with vigorous-intensity (r = 0.338) than with moderate-intensity activity (r = 0.265). Pedometer step counts showed higher correlations with time spent on walking (r = 0.350) and jogging (r = 0.383) than with the time spent on other activities. Active participants, based on pedometer assessment, were also most active by the questionnaire. It appears that the ATLS questionnaire is a valid instrument for assessing habitual physical activity among Arab adolescents.

  19. Validity of the Child Observation Record: An Investigation of the Relationship between Cor Dimensions and Social-Emotional and Cognitive Outcomes for Head Start Children

    ERIC Educational Resources Information Center

    Sekino, Yumiko; Fantuzzo, John

    2005-01-01

    The study examined the validity of the Child Observation Record (COR). Participants were 242 children, a stratified, random sample of a large, urban Head Start program. Teachers trained to collect COR data provided assessments on the Cognitive, Social Engagement, and Coordinated Movement dimensions of the COR. Outcome data included cognitive and…

  20. The development and validation of the Dormitory Observation Report: a behavioral rating instrument for juvenile delinquents in residential care.

    PubMed

    Veneziano, Louis; Veneziano, Carol

    2002-09-01

    In order to provide an objective measure of problematic behavioral patterns among juvenile delinquents in residential facilities, the Dormitory Observation Report (DOR) was developed. The DOR assesses 11 dimensions of problematic behavioral patterns (e.g., physical assaultiveness, manipulativeness), as well as three dimensions of desirable behavioral patterns expected in an institutional setting (e.g., independent functioning, personal hygiene, care of surroundings). Empirical studies regarding the reliability and validity of the DOR are reported, and the results are discussed in terms of the theoretical and practical implications of this instrument. Copyright 2002 Wiley Periodicals, Inc.

  1. Observational study to calculate addictive risk to opioids: a validation study of a predictive algorithm to evaluate opioid use disorder

    PubMed Central

    Brenton, Ashley; Richeimer, Steven; Sharma, Maneesh; Lee, Chee; Kantorovich, Svetlana; Blanchard, John; Meshkin, Brian

    2017-01-01

    Background Opioid abuse in chronic pain patients is a major public health issue, with rapidly increasing addiction rates and deaths from unintentional overdose more than quadrupling since 1999. Purpose This study seeks to determine the predictability of aberrant behavior to opioids using a comprehensive scoring algorithm incorporating phenotypic risk factors and neuroscience-associated single-nucleotide polymorphisms (SNPs). Patients and methods The Proove Opioid Risk (POR) algorithm determines the predictability of aberrant behavior to opioids using a comprehensive scoring algorithm incorporating phenotypic risk factors and neuroscience-associated SNPs. In a validation study with 258 subjects with diagnosed opioid use disorder (OUD) and 650 controls who reported using opioids, the POR successfully categorized patients at high and moderate risks of opioid misuse or abuse with 95.7% sensitivity. Regardless of changes in the prevalence of opioid misuse or abuse, the sensitivity of POR remained >95%. Conclusion The POR correctly stratifies patients into low-, moderate-, and high-risk categories to appropriately identify patients at need for additional guidance, monitoring, or treatment changes. PMID:28572737

  2. 29 CFR 1607.7 - Use of other validity studies.

    Code of Federal Regulations, 2012 CFR

    2012-07-01

    ... EMPLOYEE SELECTION PROCEDURES (1978) General Principles § 1607.7 Use of other validity studies. A. Validity studies not conducted by the user. Users may, under certain circumstances, support the use of selection... described in test manuals. While publishers of selection procedures have a professional obligation to...

  3. 29 CFR 1607.7 - Use of other validity studies.

    Code of Federal Regulations, 2014 CFR

    2014-07-01

    ... EMPLOYEE SELECTION PROCEDURES (1978) General Principles § 1607.7 Use of other validity studies. A. Validity studies not conducted by the user. Users may, under certain circumstances, support the use of selection... described in test manuals. While publishers of selection procedures have a professional obligation to...

  4. 29 CFR 1607.7 - Use of other validity studies.

    Code of Federal Regulations, 2013 CFR

    2013-07-01

    ... EMPLOYEE SELECTION PROCEDURES (1978) General Principles § 1607.7 Use of other validity studies. A. Validity studies not conducted by the user. Users may, under certain circumstances, support the use of selection... described in test manuals. While publishers of selection procedures have a professional obligation to...

  5. Measuring Long-Distance Romantic Relationships: A Validity Study

    ERIC Educational Resources Information Center

    Pistole, M. Carole; Roberts, Amber

    2011-01-01

    This study investigated aspects of construct validity for the scores of a new long-distance romantic relationship measure. A single-factor structure of the long-distance romantic relationship index emerged, with convergent and discriminant evidence of external validity, high internal consistency reliability, and applied utility of the scores.…

  6. The Self-Consciousness Scale: A Discriminant Validity Study

    ERIC Educational Resources Information Center

    Carver, Charles S.; Glass, David C.

    1976-01-01

    A validity study is conducted of the Self-Consciousness Scale components with male undergraduates. The components, Private and Public Self-Consciousness and Social Anxiety, did not correlate with any other measures used to establish their validity and thus seem to be independent of other measures tested. (Author/DEP)

  7. Validation of the Spanish version of the Hip Outcome Score: a multicenter study.

    PubMed

    Seijas, Roberto; Sallent, Andrea; Ruiz-Ibán, Miguel Angel; Ares, Oscar; Marín-Peña, Oliver; Cuéllar, Ricardo; Muriel, Alfonso

    2014-05-13

    The Hip Outcome Score (HOS) is a self-reported questionnaire evaluating the outcomes of treatment interventions for hip pathologies, divided into 19 items on activities of daily life (ADL) and 9 sports items. The aim of the present study is to translate and validate HOS into Spanish. A prospective and multicenter study with 100 patients undergoing hip arthroscopy was performed between June 2012 and January 2013. Cross-cultural adaptation was used to translate HOS into Spanish. Patients completed the questionnaire before and after surgery. Feasibility, reliability, internal consistency, construct validity (correlation with Western Ontario and McMaster Universities Osteoarthritis Index), ceiling and floor effects and sensitivity to change were assessed for the present study. Mean age was 45.05 years old. 36 women and 64 men were included. Feasibility: 13% had at least one missing item within the ADL subscale and 17% within the sport subscale. Reliability: the translated version of HOS was highly reproducible with intraclass correlation coefficient of 0.95 for ADL and 0.94 for the sports subscale. Internal consistency was confirmed with Cronbach's alpha >0.90 in both subscales. Construct validity showed statistically significant correlation with WOMAC. Ceiling effect was observed in 6% and 12% for the ADL and sports subscales, respectively. Floor effect was found in 3% and 37% for the ADL and sports subscales, respectively. Large sensitivity to change was shown in both subscales. The translated version of HOS into Spanish has been shown to be feasible, reliable and sensitive to change for patients undergoing hip arthroscopy. This validated translation of HOS allows for comparisons between studies involving either Spanish- or English-speaking patients. Prognostic study, Level I.
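
Internal consistency as reported above is commonly summarized with Cronbach's alpha, which compares the sum of the item variances with the variance of the total score. A minimal sketch follows; the function name and the small response matrix are hypothetical and unrelated to the HOS data.

```python
import statistics


def cronbach_alpha(items):
    """Cronbach's alpha for a list of items, each a list of respondent scores.

    alpha = k / (k - 1) * (1 - sum(item variances) / variance(total scores))
    """
    k = len(items)
    item_variances = [statistics.variance(item) for item in items]
    totals = [sum(scores) for scores in zip(*items)]  # total score per respondent
    return k / (k - 1) * (1 - sum(item_variances) / statistics.variance(totals))


# Hypothetical responses: 3 items answered by 5 respondents on a 0-4 scale.
items = [
    [4, 3, 2, 4, 1],
    [4, 2, 2, 3, 1],
    [3, 3, 1, 4, 0],
]
print(f"Cronbach's alpha = {cronbach_alpha(items):.3f}")
```

Values above 0.90, as in the abstract, indicate that the items of a subscale vary together closely.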

  8. All sky imaging observations in visible and infrared waveband for validation of satellite cloud and aerosol products

    NASA Astrophysics Data System (ADS)

    Lu, Daren; Huo, Juan; Zhang, W.; Liu, J.

    A series of satellite sensors in visible and infrared wavelengths have been successfully operated on board a number of research satellites, e.g. NOAA/AVHRR, the MODIS onboard Terra and Aqua, etc. A number of cloud and aerosol products have been produced and released in recent years. However, the validation of the product quality and accuracy is still a challenge to the atmospheric remote sensing community. In this paper, we suggest a ground based validation scheme for satellite-derived cloud and aerosol products by using combined visible and thermal infrared all sky imaging observations as well as surface meteorological observations. In the scheme, a visible digital camera with a fish-eye lens is used to continuously monitor the all sky with the view angle greater than 180 deg. The digital camera system is calibrated for both its geometry and radiance (broad blue, green, and red band) so that a retrieval method can be used to detect the clear and cloudy sky spatial distribution and their temporal variations. A calibrated scanning thermal infrared thermometer is used to monitor the all sky brightness temperature distribution. An algorithm is developed to detect the clear and cloudy sky as well as cloud base height by using sky brightness distribution and surface temperature and humidity as input. These composite retrievals of clear and cloudy sky distribution can be used to validate the satellite retrievals in the sense of real-simultaneous comparison and statistics, respectively. What will be presented in this talk includes the results of the field observations and comparisons completed in Beijing (40 deg N, 116.5 deg E) in year 2003 and 2004. This work is supported by NSFC grant No. 4002700, and MOST grant No 2001CCA02200

  9. Validation of new psychosocial factors questionnaires: a Colombian national study.

    PubMed

    Villalobos, Gloria H; Vargas, Angélica M; Rondón, Martin A; Felknor, Sarah A

    2013-01-01

    The study of workers' health problems possibly associated with stressful conditions requires valid and reliable tools for monitoring risk factors. The present study validates two questionnaires to assess psychosocial risk factors for stress-related illnesses within a sample of Colombian workers. The validation process was based on a representative sample survey of 2,360 Colombian employees, aged 18-70 years. Worker response rate was 90%; 46% of the responders were women. Internal consistency was calculated, construct validity was tested with factor analysis and concurrent validity was tested with Spearman correlations. The questionnaires demonstrated adequate reliability (0.88-0.95). Factor analysis confirmed the dimensions proposed in the measurement model. Concurrent validity resulted in significant correlations with stress and health symptoms. "Work and Non-work Psychosocial Factors Questionnaires" were found to be valid and reliable for the assessment of workers' psychosocial factors, and they provide information for research and intervention. Copyright © 2012 Wiley Periodicals, Inc.

  10. Precipitation climatology over India: validation with observations and reanalysis datasets and spatial trends

    NASA Astrophysics Data System (ADS)

    Kishore, P.; Jyothi, S.; Basha, Ghouse; Rao, S. V. B.; Rajeevan, M.; Velicogna, Isabella; Sutterley, Tyler C.

    2016-01-01

    Changing rainfall patterns have a significant effect on water resources and agricultural output in many countries, especially in a country like India, where the economy depends on rain-fed agriculture. Rainfall over India has large spatial as well as temporal variability. To understand the variability in rainfall, spatio-temporal analyses of rainfall were carried out using 107 years (1901-2007) of daily gridded India Meteorological Department (IMD) rainfall datasets. Further, the IMD precipitation data are validated against different observational and reanalysis datasets for the period from 1989 to 2007. The Global Precipitation Climatology Project data show features similar to those of IMD with a high degree of agreement, whereas the Asian Precipitation-Highly-Resolved Observational Data Integration Towards Evaluation data show similar features but with large differences, especially over the northwest, the west coast, and the western Himalayas. Spatially, large deviations are observed in the interior peninsula during the monsoon season with the National Aeronautics and Space Administration Modern-Era Retrospective Analysis for Research and Applications (NASA-MERRA), during the pre-monsoon season with the Japanese 25-year Reanalysis (JRA-25), and during the post-monsoon season with the Climate Forecast System Reanalysis (CFSR) datasets. Among the reanalysis datasets, the European Centre for Medium-Range Weather Forecasts Interim Re-Analysis (ERA-Interim) shows good agreement, followed by CFSR, NASA-MERRA, and JRA-25. Further, for the first time, with high-resolution and long-term IMD data, the spatial distribution of trends is estimated using a robust regression analysis technique on the annual and seasonal rainfall data for different regions of India. Significant positive and negative trends are noticed in the whole time series of data during the monsoon season. The northeast and west coast of the Indian region show significant positive trends and negative trends over western Himalayas and

  11. Helping Students Evaluate the Validity of a Research Study.

    ERIC Educational Resources Information Center

    Morgan, George A.; Gliner, Jeffrey A.

    Students often have difficulty in evaluating the validity of a study. A conceptually and linguistically meaningful framework for evaluating research studies is proposed that is based on the discussion of internal and external validity of T. D. Cook and D. T. Campbell (1979). The proposal includes six key dimensions, three related to internal…

  12. The ICI classification for calcaneal injuries: a validation study.

    PubMed

    Frima, Herman; Eshuis, Rienk; Mulder, Paul; Leenen, Luke

    2012-06-01

    The integral classification of injuries (ICI) by Zwipp et al. has been developed as a classification system for injuries of the bones, joints, cartilage and ligaments of the foot. It follows the principles of the comprehensive classification of fractures by Müller et al. The ICI was developed for 'everyday use' and scientific purposes. Our aim was to perform a validation study of this classification system applied to calcaneal injuries. A panel of five experienced trauma and orthopaedic surgeons evaluated the ICI score in 20 calcaneal injuries. After 2 months, a second classification was performed in a different order. Inter- and intra-observer variability were evaluated by kappa statistics. Panel members were not able to evaluate capsule and ligamental injuries based on X-ray and computed tomography (CT) films. Two injuries were excluded for logistical reasons. The inter-observer agreement based on 18 injuries of bone and joints was slight; kappa 0.14 (90% confidence interval (CI): 0.05-0.22). The intra-observer agreement was fair; kappa 0.31 (90% CI: 0.22-0.41). Overall, the panel rated the system as very complicated and not practical. The ICI is a complicated classification system with slight to fair inter- and intra-observer variabilities. It might not be a practical classification system for calcaneal injuries in 'everyday use' or for scientific purposes. Copyright © 2011 Elsevier Ltd. All rights reserved.
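    As a rough illustration of the kappa statistics mentioned in this record, the sketch below computes Cohen's kappa for two raters and maps a kappa value onto the Landis and Koch descriptive bands, under which 0.14 is "slight" and 0.31 is "fair", matching the wording of the abstract. This is an assumption-laden simplification: the record does not say which kappa variant was used for its five-member panel (a multi-rater statistic such as Fleiss' kappa would be a natural choice), and the function names and ratings here are hypothetical.

```python
from collections import Counter


def cohen_kappa(rater_a, rater_b):
    """Cohen's kappa for two raters classifying the same cases.

    Illustrative two-rater version; undefined (division by zero) when
    chance agreement equals 1.
    """
    n = len(rater_a)
    p_observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    counts_a, counts_b = Counter(rater_a), Counter(rater_b)
    categories = counts_a.keys() | counts_b.keys()
    p_chance = sum(counts_a[c] * counts_b[c] for c in categories) / n ** 2
    return (p_observed - p_chance) / (1 - p_chance)


def landis_koch(kappa):
    """Descriptive label for a kappa value (Landis and Koch bands)."""
    if kappa < 0:
        return "poor"
    bands = [(0.20, "slight"), (0.40, "fair"),
             (0.60, "moderate"), (0.80, "substantial")]
    for upper, label in bands:
        if kappa <= upper:
            return label
    return "almost perfect"


# Hypothetical classifications of four injuries by two raters.
first = ["A", "A", "B", "B"]
second = ["A", "B", "B", "A"]
print(cohen_kappa(first, second), landis_koch(0.14), landis_koch(0.31))
```

    In the hypothetical example the raters agree on half the cases, which is exactly what chance predicts for these marginals, so kappa is 0.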

  13. JaCVAM-organized international validation study of the in vivo rodent alkaline comet assay for the detection of genotoxic carcinogens: I. Summary of pre-validation study results.

    PubMed

    Uno, Yoshifumi; Kojima, Hajime; Omori, Takashi; Corvi, Raffaella; Honma, Masamistu; Schechtman, Leonard M; Tice, Raymond R; Burlinson, Brian; Escobar, Patricia A; Kraynak, Andrew R; Nakagawa, Yuzuki; Nakajima, Madoka; Pant, Kamala; Asano, Norihide; Lovell, David; Morita, Takeshi; Ohno, Yasuo; Hayashi, Makoto

    2015-07-01

    The in vivo rodent alkaline comet assay (comet assay) is used internationally to investigate the in vivo genotoxic potential of test chemicals. This assay, however, has not previously been formally validated. The Japanese Center for the Validation of Alternative Methods (JaCVAM), with the cooperation of the U.S. NTP Interagency Center for the Evaluation of Alternative Toxicological Methods (NICEATM)/the Interagency Coordinating Committee on the Validation of Alternative Methods (ICCVAM), the European Centre for the Validation of Alternative Methods (ECVAM), and the Japanese Environmental Mutagen Society/Mammalian Mutagenesis Study Group (JEMS/MMS), organized an international validation study to evaluate the reliability and relevance of the assay for identifying genotoxic carcinogens, using liver and stomach as target organs. The ultimate goal of this validation effort was to establish an Organisation for Economic Co-operation and Development (OECD) test guideline. The purpose of the pre-validation studies (i.e., Phase 1 through 3), conducted in four or five laboratories with extensive comet assay experience, was to optimize the protocol to be used during the definitive validation study. Copyright © 2015 Elsevier B.V. All rights reserved.

  14. Standards and guidelines for observational studies: quality is in the eye of the beholder.

    PubMed

    Morton, Sally C; Costlow, Monica R; Graff, Jennifer S; Dubois, Robert W

    2016-03-01

    Patient care decisions demand high-quality research. To assist those decisions, numerous observational studies are being performed. Are the standards and guidelines to assess observational studies consistent and actionable? What policy considerations should be considered to ensure decision makers can determine if an observational study is of high-quality and valid to inform treatment decisions? Based on a literature review and input from six experts, we compared and contrasted nine standards/guidelines using 23 methodological elements involved in observational studies (e.g., study protocol, data analysis, and so forth). Fourteen elements (61%) were addressed by at least seven standards/guidelines; 12 of these elements disagreed in the approach. Nine elements (39%) were addressed by six or fewer standards/guidelines. Ten elements (43%) were not actionable in at least one standard/guideline that addressed the element. The lack of observational study standard/guideline agreement may contribute to variation in study conduct; disparities in what is considered credible research; and ultimately, what evidence is adopted. A common set of agreed on standards/guidelines for conducting observational studies will benefit funders, researchers, journal editors, and decision makers. Copyright © 2016 The Authors. Published by Elsevier Inc. All rights reserved.

  15. Development and Validation of an Observation System for Analyzing Teaching Roles.

    ERIC Educational Resources Information Center

    Southwell, Reba K.; Webb, Jeaninne N.

    The construction and validation of a theoretically based sign system for the analysis of teaching roles in childhood education is described. A theoretical and empirical approach to validation were developed. In the first, the general concept of teacher role was identified as a viable construct for investigating characteristic patterns of classroom…

  16. Absorption in Sport: A Cross-Validation Study

    PubMed Central

    Koehn, Stefan; Stavrou, Nektarios A. M.; Cogley, Jeremy; Morris, Tony; Mosek, Erez; Watt, Anthony P.

    2017-01-01

    Absorption has been identified as readiness for experiences of deep involvement in the task. Conceptually, absorption is a key psychological construct, incorporating experiential, cognitive, and motivational components. However, no operationalization of the construct has been provided to facilitate research in this area. The purpose of this research was the development and examination of the psychometric properties of a sport-specific measure of absorption that evolved from the use of the modified Tellegen Absorption Scale (MODTAS; Jamieson, 2005) in mainstream psychology. The study aimed to provide evidence of the psychometric properties, reliability, and validity of the Measure of Absorption in Sport Contexts (MASCs). The psychometric examination included a calibration sample from Scotland and a cross-validation sample from Australia using a cross-sectional design. The item pool was developed based on existing items from the modified Tellegen Absorption Scale (Jamieson, 2005). The MODTAS items were reworded and translated into a sport context. The Scottish sample consisted of 292 participants and the Australian sample of 314 participants. Congeneric model testing and confirmatory factor analysis for both samples and multi-group invariance testing across samples were used. In the cross-validation sample the MASC subscales showed acceptable internal consistency and construct reliability (≥0.70). Excellent fit indices were found for the final 18-item, six-factor measure in the cross-validation sample, χ²(120) = 197.486, p < 0.001; CFI = 0.957; TLI = 0.945; RMSEA = 0.045; SRMR = 0.044. Multi-group invariance testing revealed no differences in item meaning, except for two items. The MASC and the Dispositional Flow Scale-2 showed moderate-to-strong positive correlations in both samples, r = 0.38, p < 0.001 and r = 0.42, p < 0.001, supporting the external validity of the MASC. This article provides initial evidence in support of the psychometric properties
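    The reported RMSEA can be cross-checked from the other figures in this record. The sketch below uses the common single-group point estimate RMSEA = sqrt(max(chi-square - df, 0) / (df * (N - 1))) with the reported chi-square of 197.486 on 120 degrees of freedom. It assumes N = 314 (the Australian cross-validation sample) and the N - 1 denominator; the abstract does not state which software convention the authors used, and the function name `rmsea` is hypothetical.

```python
import math


def rmsea(chi2, df, n):
    """Point estimate of RMSEA from a model chi-square (single-group form).

    Uses the N - 1 denominator; some software uses N instead, which changes
    the result only slightly at this sample size.
    """
    return math.sqrt(max(chi2 - df, 0.0) / (df * (n - 1)))


# Values quoted in the abstract; N = 314 is an assumption (cross-validation sample).
estimate = rmsea(197.486, 120, 314)
print(round(estimate, 3))  # about 0.045
```

    Under these assumptions the estimate rounds to the 0.045 reported in the abstract.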

  17. Observations with the ROWS instrument during the Grand Banks calibration/validation experiments

    NASA Technical Reports Server (NTRS)

    Vandemark, D.; Chapron, B.

    1994-01-01

    As part of a global program to validate the ocean surface sensors on board ERS-1, a joint experiment on the Grand Banks of Newfoundland was carried out in Nov. 1991. The principal objective was to provide a field validation of ERS-1 Synthetic Aperture Radar (SAR) measurement of ocean surface structure. The NASA-P3 aircraft measurements made during this experiment provide independent measurements of the ocean surface along the validation swath. The Radar Ocean Wave Spectrometer (ROWS) is a radar sensor designed to measure direction of the long wave components using spectral analysis of the tilt induced radar backscatter modulation. This technique greatly differs from SAR and thus, provides a unique set of measurements for use in evaluating SAR performance. Also, an altimeter channel in the ROWS gives simultaneous information on the surface wave height and radar mean square slope parameter. The sets of geophysical parameters (wind speed, significant wave height, directional spectrum) are used to study the SAR's ability to accurately measure ocean gravity waves. The known distortion imposed on the true directional spectrum by the SAR imaging mechanism is discussed in light of the direct comparisons between ERS-1 SAR, airborne Canadian Center for Remote Sensing (CCRS) SAR, and ROWS spectra and the use of the nonlinear ocean SAR transform.

  18. Reanalysis of the Harvard Six Cities Study, part I: validation and replication.

    PubMed

    Krewski, D; Burnett, R T; Goldberg, M; Hoover, K; Siemiatycki, J; Abrahamowicz, M; White, W

    2005-01-01

    Because the results of the Harvard Six Cities Study played a critical role in the establishment of the current U.S. ambient air quality objective for fine particles (PM(2.5)), the U.S. Environmental Protection Agency, industry, and nongovernmental organizations called for an independent reanalysis of this study to validate the original findings reported by Dockery and colleagues in the New England Journal of Medicine (vol. 329, pp. 1753-1759) in 1993. Validation of the original findings was accomplished by a detailed statistical audit and replication of original results. With the exception of occupational exposure to dust (14 discrepancies of 249 questionnaires located for evaluation) and fumes (15/249), date of death (2/250), and cause of death (2/250), the audit identified no discrepancies between the original questionnaires and death certificates in the audit sample and the analytic file used by the original investigators. The data quality audit identified a computer programming problem that had resulted in early censorship in 5 of the 6 cities, which resulted in the loss of approximately 1% of the reported person-years of follow-up; the reanalysis team updated the Six Cities cohort to include the missing person-years of observation, resulting in the addition of 928 person-years of observation and 14 deaths. The reanalysis team was able to reproduce virtually all of the original numerical results, including the 26% increase in all-cause mortality in the most polluted city (Steubenville, OH) as compared to the least polluted city (Portage, WI). The audit and validation of the Harvard Six Cities Study conducted by the reanalysis team generally confirmed the quality of the data and the numerical results reported by the original investigators. The discrepancies noted during the audit were not of epidemiologic importance, and did not substantively alter the original risk estimates associated with particulate air pollution, nor the main conclusions reached by the

  19. Assessing the validity and intra-observer agreement of the MIDAM-LTC; an instrument measuring factors that influence personal dignity in long-term care facilities

    PubMed Central

    2014-01-01

    Background: Patients who are cared for in long-term care facilities are vulnerable to losing personal dignity. An instrument measuring factors that influence dignity can be used to better target dignity-conserving care to an individual patient, but no such instrument is yet available for the long-term care setting. The aim of this study was to create the Measurement Instrument for Dignity AMsterdam - for Long-Term Care facilities (MIDAM-LTC) and to assess its validity and intra-observer agreement. Methods: Thirteen items specific to the LTC setting were added to the earlier developed, more general MIDAM. The MIDAM-LTC consisted of 39 symptoms or experiences for which presence as well as influence on dignity were asked, and a single item score for overall personal dignity. Questionnaires containing the MIDAM-LTC were administered face-to-face at two moments (with a 1-week interval) to 95 nursing home residents residing on general medical wards of six nursing homes in the Netherlands. Constructs related to dignity (WHO Well-Being Five Index, quality of life and physical health status) were also measured. Ten residents answered the questions while thinking aloud. Content validity, construct validity and intra-observer agreement were examined. Results: Nine of the 39 items barely exerted influence on dignity. Eight of them could be omitted from the MIDAM-LTC, because the thinking aloud method revealed sensible explanations for their small influence on dignity. Residents reported that they missed no important items. Hypotheses to support construct validity, about the strength of correlations between personal dignity on the one hand and well-being, quality of life or physical health status on the other, were confirmed. On average, 83% of the scores given for each item’s influence on dignity were practically consistent over 1 week, and more than 80% of the residents gave consistent scores for the single item score for overall dignity. Conclusion: The MIDAM-LTC has good

  20. Convergent Validity of the Arab Teens Lifestyle Study (ATLS) Physical Activity Questionnaire

    PubMed Central

    Al-Hazzaa, Hazzaa M.; Al-Sobayel, Hana I.; Musaiger, Abdulrahman O.

    2011-01-01

    The Arab Teens Lifestyle Study (ATLS) is a multicenter project for assessing the lifestyle habits of Arab adolescents. This study reports on the convergent validity of the physical activity questionnaire used in ATLS against an electronic pedometer. Participants were 39 males and 36 females randomly selected from secondary schools, with a mean age of 16.1 ± 1.1 years. The ATLS self-reported questionnaire was validated against the electronic pedometer for three consecutive weekdays. Mean step counts were 6,866 ± 3,854 steps/day, with no significant gender difference observed. Questionnaire results showed no significant gender differences in time spent on total or moderate-intensity activities. However, males spent significantly more time than females on vigorous-intensity activity. The correlation of step counts with total time spent on all activities according to the questionnaire was 0.369. The correlation of step counts was higher with vigorous-intensity activity (r = 0.338) than with moderate-intensity activity (r = 0.265). Pedometer step counts showed higher correlations with time spent on walking (r = 0.350) and jogging (r = 0.383) than with the time spent on other activities. Participants who were active based on pedometer assessment were also the most active according to the questionnaire. It appears that the ATLS questionnaire is a valid instrument for assessing habitual physical activity among Arab adolescents. PMID:22016718

  1. Verification, Validation and Sensitivity Studies in Computational Biomechanics

    PubMed Central

    Anderson, Andrew E.; Ellis, Benjamin J.; Weiss, Jeffrey A.

    2012-01-01

    Computational techniques and software for the analysis of problems in mechanics have naturally moved from their origins in the traditional engineering disciplines to the study of cell, tissue and organ biomechanics. Increasingly complex models have been developed to describe and predict the mechanical behavior of such biological systems. While the availability of advanced computational tools has led to exciting research advances in the field, the utility of these models is often the subject of criticism due to inadequate model verification and validation. The objective of this review is to present the concepts of verification, validation and sensitivity studies with regard to the construction, analysis and interpretation of models in computational biomechanics. Specific examples from the field are discussed. It is hoped that this review will serve as a guide to the use of verification and validation principles in the field of computational biomechanics, thereby improving the peer acceptance of studies that use computational modeling techniques. PMID:17558646

  2. Validation of measures from the smartphone sway balance application: a pilot study.

    PubMed

    Patterson, Jeremy A; Amick, Ryan Z; Thummar, Tarunkumar; Rogers, Michael E

    2014-04-01

    A number of different balance assessment techniques are currently available and widely used. These include both subjective and objective assessments. The ability to provide quantitative measures of balance and posture is the benefit of objective tools; however, these instruments are not generally utilized outside of research laboratory settings due to cost, complexity of operation, size, duration of assessment, and general practicality. The purpose of this pilot study was to assess the value and validity of using software developed to access the iPod and iPhone accelerometer output and translate it into a measurement of human balance. Thirty healthy college-aged individuals (13 male, 17 female; age = 26.1 ± 8.5 years) volunteered. Participants performed a static Athlete's Single Leg Test protocol for 10 sec on a Biodex Balance System SD while concurrently utilizing a mobile device with balance software. Anterior/posterior stability was recorded using both devices, described as the displacement in degrees from level, and was termed the "balance score." There were no significant differences between the two reported balance scores (p = 0.818). Mean balance score on the balance platform was 1.41 ± 0.90, as compared to 1.38 ± 0.72 using the mobile device. There is a need for a valid, convenient, and cost-effective tool to objectively measure balance. Results of this study are promising, as balance scores derived from the smartphone accelerometers were consistent with balance scores obtained from a previously validated balance system. However, further investigation is necessary, as this version of the mobile software only assessed balance in the anterior/posterior direction. Additionally, further testing is necessary on healthy populations as well as those with impairment of the motor control system. Level 2b (Observational study of validity).

  3. Threats to validity of nonrandomized studies of postdiagnosis exposures on cancer recurrence and survival.

    PubMed

    Chubak, Jessica; Boudreau, Denise M; Wirtz, Heidi S; McKnight, Barbara; Weiss, Noel S

    2013-10-02

    Studies of the effects of exposures after cancer diagnosis on cancer recurrence and survival can provide important information to the growing group of cancer survivors. Observational studies that address this issue generally fall into one of two categories: 1) those using health plan automated data that contain "continuous" information on exposures, such as studies that use pharmacy records; and 2) survey or interview studies that collect information directly from patients once or periodically postdiagnosis. Reverse causation, confounding, selection bias, and information bias are common in observational studies of cancer outcomes in relation to exposures after cancer diagnosis. We describe these biases, focusing on sources of bias specific to these types of studies, and we discuss approaches for reducing them. Attention to known challenges in epidemiologic research is critical for the validity of studies of postdiagnosis exposures and cancer outcomes.

  4. Threats to Validity of Nonrandomized Studies of Postdiagnosis Exposures on Cancer Recurrence and Survival

    PubMed Central

    2013-01-01

    Studies of the effects of exposures after cancer diagnosis on cancer recurrence and survival can provide important information to the growing group of cancer survivors. Observational studies that address this issue generally fall into one of two categories: 1) those using health plan automated data that contain “continuous” information on exposures, such as studies that use pharmacy records; and 2) survey or interview studies that collect information directly from patients once or periodically postdiagnosis. Reverse causation, confounding, selection bias, and information bias are common in observational studies of cancer outcomes in relation to exposures after cancer diagnosis. We describe these biases, focusing on sources of bias specific to these types of studies, and we discuss approaches for reducing them. Attention to known challenges in epidemiologic research is critical for the validity of studies of postdiagnosis exposures and cancer outcomes. PMID:23940288

  5. [French validation study of the levels of emotional awareness scale].

    PubMed

    Bydlowski, S; Corcos, M; Paterniti, S; Guilbaud, O; Jeammet, P; Consoli, S M

    2002-01-01

    According to a thesis that cognitions influence the structuring of internal reality, emotional awareness, i.e. the capacity to represent one's own emotional experience and that of others, is a cognitive process that undergoes maturation. In defining this concept, Lane and Schwartz present a cognitive-developmental model of the processes of symbolization in five stages, accounting for the differences in levels of emotional awareness observed between individuals. The organization of these cognitive processes would thus be structured in well-differentiated stages, in which the development of the emotions is inseparable from the development of the ego and of the relation to others. These authors focus on the capacity to consciously represent emotional experience and consider that the verbal representations used to describe the content of what is experienced are a good reflection of the structural organization of emotional awareness. They therefore developed an assessment instrument, the Levels of Emotional Awareness Scale (LEAS), which measures the capacity to describe one's own emotional experience and the experience one attributes to others in an emotional situation. The scoring system of this scale is based on analysis of the verbal content of the answers provided, in direct reference to the authors' theory of the levels of differentiation and integration of emotional experience. It is therefore an empirical measure centered specifically on the structural organization of emotional experience. The various validation studies of this instrument show that it has solid psychometric properties. This work presents the validation of the French version of Lane and Schwartz's LEAS. Validity and reliability were studied in a group of 121 healthy subjects. This setting is part of a larger clinical evaluation, also including a collection of socio-demographic and clinical data, and other instruments of self

  6. California Diploma Project Technical Report III: Validity Study--Validity Study of the Health Sciences and Medical Technology Standards

    ERIC Educational Resources Information Center

    McGaughy, Charis; Bryck, Rick; de Gonzalez, Alicia

    2012-01-01

    This study is a validity study of the recently revised version of the Health Science Standards. The purpose of this study is to understand how the Health Science Standards relate to college and career readiness, as represented by survey ratings submitted by entry-level college instructors of health science courses and industry representatives. For…

  7. Observational Studies: Cohort and Case-Control Studies

    PubMed Central

    Song, Jae W.; Chung, Kevin C.

    2010-01-01

    Observational studies are an important category of study designs. To address some investigative questions in plastic surgery, randomized controlled trials are not always indicated or ethical to conduct. Instead, observational studies may be the next best method to address these types of questions. Well-designed observational studies have been shown to provide results similar to randomized controlled trials, challenging the belief that observational studies are second-rate. Cohort studies and case-control studies are two primary types of observational studies that aid in evaluating associations between diseases and exposures. In this review article, we describe these study designs, methodological issues, and provide examples from the plastic surgery literature. PMID:20697313

  8. Addressing Participant Validity in a Small Internet Health Survey (The Restore Study): Protocol and Recommendations for Survey Response Validation

    PubMed Central

    Dewitt, James; Capistrant, Benjamin; Kohli, Nidhi; Mitteldorf, Darryl; Merengwa, Enyinnaya; West, William

    2018-01-01

    Background: While deduplication and cross-validation protocols have been recommended for large Web-based studies, protocols for survey response validation of smaller studies have not been published. Objective: This paper reports the challenges of survey validation inherent in small Web-based health survey research. Methods: The subject population was North American, gay and bisexual, prostate cancer survivors, who represent an under-researched, hidden, difficult-to-recruit, minority-within-a-minority population. In 2015-2016, advertising on a large Web-based cancer survivor support network, using email and social media, yielded 478 completed surveys. Results: Our manual deduplication and cross-validation protocol identified 289 survey submissions (289/478, 60.4%) as likely spam, most stemming from advertising on social media. The basic components of this deduplication and validation protocol are detailed. An unexpected challenge encountered was invalid survey responses evolving across the study period. This necessitated that the static detection protocol be augmented with a dynamic one. Conclusions: Five recommendations for validation of Web-based samples, especially with smaller difficult-to-recruit populations, are detailed. PMID:29691203

  9. Development and validation of a clinical prediction rule to identify suspected breast cancer: a prospective cohort study.

    PubMed

    Galvin, Rose; Joyce, Doireann; Downey, Eithne; Boland, Fiona; Fahey, Tom; Hill, Arnold K

    2014-10-03

    The number of primary care referrals of women with breast symptoms to symptomatic breast units (SBUs) has increased exponentially in the past decade in Ireland. The aim of this study is to develop and validate a clinical prediction rule (CPR) to identify women with breast cancer so that a more evidence-based approach to referral from primary care to these SBUs can be developed. We analysed routine data from a prospective cohort of consecutive women reviewed at an SBU with breast symptoms. The dataset was split into a derivation and a validation cohort. Regression analysis was used to derive a CPR from the patient's history and clinical findings. Validation of the CPR consisted of estimating the number of breast cancers predicted to occur compared with the actual number of observed breast cancers across deciles of risk. A total of 6,590 patients were included in the derivation study and 4.9% were diagnosed with breast cancer. Independent clinical predictors for breast cancer were: increasing age by year (adjusted odds ratio 1.08, 95% CI 1.07-1.09); presence of a lump (5.63, 95% CI 4.2-7.56); nipple change (2.77, 95% CI 1.68-4.58) and nipple discharge (2.09, 95% CI 1.1-3.97). Validation of the rule (n = 911) demonstrated that the probability of breast cancer was higher with an increasing number of these independent variables. The Hosmer-Lemeshow goodness-of-fit test showed no overall significant difference between the expected and the observed numbers of breast cancers (Hosmer-Lemeshow χ² = 6.74, p = 0.56). This study derived and validated a CPR for breast cancer in women attending an Irish national SBU. We found that increasing age, presence of a lump, nipple discharge and nipple change are all associated with increased risk of breast cancer. Further validation of the rule is necessary, as well as an assessment of its impact on referral practice.
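    The calibration check described in this record (expected versus observed cancers across deciles of risk) can be sketched as follows. The helper `hosmer_lemeshow` computes the usual statistic, the sum over risk groups of (O - E)^2 / (E * (1 - E/n)), and `chi2_sf_even_df` gives the upper-tail chi-square probability in closed form for an even number of degrees of freedom. Both names are hypothetical, and the choice of 8 degrees of freedom (10 deciles minus 2) is an assumption based on the conventional form of the test; the abstract does not state it.

```python
import math


def hosmer_lemeshow(groups):
    """Hosmer-Lemeshow statistic from (n, observed_events, expected_events) tuples."""
    return sum((o - e) ** 2 / (e * (1 - e / n)) for n, o, e in groups)


def chi2_sf_even_df(x, df):
    """Upper-tail probability of a chi-square variable; valid for even df only.

    For df = 2k the survival function is exp(-x/2) * sum_{i<k} (x/2)^i / i!.
    """
    if df <= 0 or df % 2:
        raise ValueError("closed form requires a positive, even df")
    half = x / 2.0
    term = total = 1.0
    for i in range(1, df // 2):
        term *= half / i
        total += term
    return math.exp(-half) * total


# Statistic quoted in the abstract; df = 8 is an assumption (10 deciles - 2).
p_value = chi2_sf_even_df(6.74, 8)
print(round(p_value, 3))  # about 0.565
```

    Under the assumed 8 degrees of freedom, the tail probability is close to the p-value of 0.56 quoted in the abstract, that is, no evidence of miscalibration.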

  10. Measuring Nutrition Literacy in Spanish-Speaking Latinos: An Exploratory Validation Study.

    PubMed

    Gibbs, Heather D; Camargo, Juliana M T B; Owens, Sarah; Gajewski, Byron; Cupertino, Ana Paula

    2017-11-21

    Nutrition is important for preventing and treating chronic diseases highly prevalent among Latinos, yet no tool exists for measuring nutrition literacy among Spanish speakers. This study aimed to adapt the validated Nutrition Literacy Assessment Instrument for Spanish-speaking Latinos. This study was developed in two phases: adaptation and validity testing. Adaptation included translation, expert item content review, and interviews with Spanish speakers. For validity testing, 51 participants completed the Short Assessment of Health Literacy-Spanish (SAHL-S), the Nutrition Literacy Assessment Instrument in Spanish (NLit-S), and socio-demographic questionnaire. Validity and reliability statistics were analyzed. Content validity was confirmed with a Scale Content Validity Index of 0.96. Validity testing demonstrated NLit-S scores were strongly correlated with SAHL-S scores (r = 0.52, p < 0.001). Entire reliability was substantial at 0.994 (CI 0.992-0.996) and internal consistency was excellent (Cronbach's α = 0.92). The NLit-S demonstrates validity and reliability for measuring nutrition literacy among Spanish-speakers.
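
    Internal consistency figures such as the Cronbach's α quoted in this abstract come from a simple variance ratio. The sketch below assumes item scores are stored as one row per respondent and one column per item, and uses sample variances. The function name `cronbach_alpha` is hypothetical, and nothing here reproduces the study's data.

```python
from statistics import variance


def cronbach_alpha(rows):
    """Cronbach's alpha for a table of item scores.

    rows -- list of respondents, each a list of k item scores
    """
    if len(rows) < 2:
        raise ValueError("need at least two respondents")
    k = len(rows[0])
    if k < 2:
        raise ValueError("need at least two items")

    # Sample variance of each item across respondents.
    item_variances = [variance([row[i] for row in rows]) for i in range(k)]
    # Sample variance of each respondent's total score.
    total_variance = variance([sum(row) for row in rows])
    if total_variance == 0:
        raise ValueError("total scores have zero variance")

    return k / (k - 1) * (1 - sum(item_variances) / total_variance)
```

    Items that move together push the total-score variance well above the sum of the item variances, which drives α toward 1.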

  11. Validation of VIIRS Land Surface Phenology using Field Observations, PhenoCam Imagery, and Landsat data

    NASA Astrophysics Data System (ADS)

    Zhang, X.; Jayavelu, S.; Wang, J.; Henebry, G. M.; Gray, J. M.; Friedl, M. A.; Liu, Y.; Schaaf, C.; Shuai, A.

    2016-12-01

    A large number of land surface phenology (LSP) products have been produced from various detection algorithms applied to coarse resolution satellite datasets across regional to global scales. However, validation of the resulting LSP products is very challenging because in-situ observations at comparable spatiotemporal scales are generally not available. This research focuses on efforts to evaluate and validate the global 500 m LSP product produced from Visible Infrared Imaging Radiometer Suite (VIIRS) NBAR time series for 2013 and 2014. Specifically, we used three different datasets to evaluate six VIIRS LSP metrics: greenup onset, mid-point of greenup phase, maturity onset, senescence onset, mid-point of senescence phase, and dormancy onset. First, we obtained field observations from the USA National Phenology Network (USA-NPN), which has gathered extensive phenological data on individual species. Although it is inappropriate to compare these data directly with the LSP footprints, this large and spatially distributed dataset allows us to evaluate the overall quality of VIIRS LSP results. Second, we gathered PhenoCam imagery from 164 sites, which was used to extract the daily green chromatic coordinate (GCC) and vegetation contrast index (VCI) values. Utilizing these PhenoCam time series, the phenological events were quantified using a hybrid piecewise logistic model for each site. Third, we detected the phenological timing at the landscape scale (30 m) from surface reflectance simulated by fusing MODIS data and Landsat 8 OLI observations in an agricultural area (in the central USA) and from overlap zones of OLI scenes in semiarid areas (California and Tibetan Plateau). The phenological timing from these three datasets was compared with VIIRS LSP data. Preliminary results show that the VIIRS LSP are generally comparable with phenological data from the USA-NPN, PhenoCam, and Landsat data, with differences arising in specific phenological events and land cover types.

  12. Development and construct validity of the Classroom Strategies Scale-Observer Form.

    PubMed

    Reddy, Linda A; Fabiano, Gregory; Dudek, Christopher M; Hsu, Louis

    2013-12-01

    Research on progress monitoring has almost exclusively focused on student behavior and not on teacher practices. This article presents the development and validation of a new teacher observational assessment (Classroom Strategies Scale) of classroom instructional and behavioral management practices. The theoretical underpinnings and empirical basis for the instructional and behavioral management scales are presented. The Classroom Strategies Scale (CSS) evidenced overall good reliability estimates including internal consistency, interrater reliability, test-retest reliability, and freedom from item bias on important teacher demographics (age, educational degree, years of teaching experience). Confirmatory factor analyses (CFAs) of CSS data from 317 classrooms were carried out to assess the level of empirical support for (a) a 4 first-order factor theory concerning teachers' instructional practices, and (b) a 4 first-order factor theory concerning teachers' behavior management practice. Several fit indices indicated acceptable fit of the (a) and (b) CFA models to the data, as well as acceptable fit of less parsimonious alternative CFA models that included 1 or 2 second-order factors. Information-theory-based indices generally suggested that the (a) and (b) CFA models fit better than some more parsimonious alternative CFA models that included constraints on relations of first-order factors. Overall, CFA first-order and higher order factor results support the CSS-Observer Total, Composite, and subscales. Suggestions for future measurement development efforts are outlined. PsycINFO Database Record (c) 2013 APA, all rights reserved.

  13. Development, initial reliability and validity testing of an observational tool for assessing technical skills of operating room nurses.

    PubMed

    Sevdalis, Nick; Undre, Shabnam; Henry, Janet; Sydney, Elaine; Koutantji, Mary; Darzi, Ara; Vincent, Charles A

    2009-09-01

    The recent emergence of the Systems Approach to the safety and quality of surgical care has triggered individual and team skills training modules for surgeons and anaesthetists and relevant observational assessment tools have been developed. To develop an observational tool that captures operating room (OR) nurses' technical skill and can be used for assessment and training. The Imperial College Assessment of Technical Skills for Nurses (ICATS-N) assesses (i) gowning and gloving, (ii) setting up instrumentation, (iii) draping, and (iv) maintaining sterility. Three to five observable behaviours have been identified for each skill and are rated on 1-6 scales. Feasibility and aspects of reliability and validity were assessed in 20 simulation-based crisis management training modules for trainee nurses and doctors, carried out in a Simulated Operating Room. The tool was feasible to use in the context of simulation-based training. Satisfactory reliability (Cronbach alpha) was obtained across trainers' and trainees' scores (analysed jointly and separately). Moreover, trainer nurse's ratings of the four skills correlated positively, thus indicating adequate content validity. Trainer's and trainees' ratings did not correlate. Assessment of OR nurses' technical skill is becoming a training priority. The present evidence suggests that the ICATS-N could be considered for use as an assessment/training tool for junior OR nurses.

  14. Concurrent Validity Between Live and Home Video Observations Using the Alberta Infant Motor Scale.

    PubMed

    Boonzaaijer, Marike; van Dam, Ellen; van Haastert, Ingrid C; Nuysink, Jacqueline

    2017-04-01

    Serial assessment of gross motor development of infants at risk is an established procedure in neonatal follow-up clinics. Assessments based on home video recordings could be a relevant addition. In 48 infants (1.5-19 months), the concurrent validity of 2 applications was examined using the Alberta Infant Motor Scale: (1) a home video made by parents and (2) simultaneous observation on-site by a pediatric physical therapist. Parents' experiences were explored using a questionnaire. The intraclass correlation coefficient agreement between live and home video assessment was 0.99, with a standard error of measurement of 1.41 items. Intra- and interrater reliability: intraclass correlation coefficients were more than 0.99. According to 94% of the parents, recording their infant's movement repertoire was easy to perform. Assessing the Alberta Infant Motor Scale based on home video recordings is comparable to assessment by live observation. The video method is a promising application that can be used with low burden for parents and infants.

  15. Reliability and validity of the symptoms of major depressive illness.

    PubMed

    Mazure, C; Nelson, J C; Price, L H

    1986-05-01

    In two consecutive studies, we examined the interrater reliability and then the concurrent validity of interview ratings for individual symptoms of major depressive illness. The concurrent validity of symptoms was determined by assessing the degree to which symptoms observed or reported during an interview were observed in daily behavior. Results indicated that most signs and symptoms of major depression and melancholia can be reliably rated by clinicians during a semistructured interview. Ratings of observable symptoms (signs) assessed during the interview were valid indicators of dysfunction observed in daily behavior. Several but not all ratings based on patient report of symptoms were at variance with observation. These discordant patient-reported symptoms may have value as subjective reports but were not accurate descriptions of observed dysfunction.

  16. Cyber Victim and Bullying Scale: A Study of Validity and Reliability

    ERIC Educational Resources Information Center

    Cetin, Bayram; Yaman, Erkan; Peker, Adem

    2011-01-01

    The purpose of this study is to develop a reliable and valid scale that measures cyber victimization and bullying behaviors of high school students. The research group consisted of 404 students (250 male, 154 female) in Sakarya in the 2009-2010 academic year. In the study sample, the mean age is 16.68. Content validity and face validity of the scale was…

  17. The Validation of Macro and Micro Observations of Parent–Child Dynamics Using the Relationship Affect Coding System in Early Childhood

    PubMed Central

    Mun, Chung Jung; Tein, Jenn-Yun; Kim, Hanjoe; Shaw, Daniel S.; Gardner, Frances; Wilson, Melvin N.; Peterson, Jenene

    2018-01-01

    This study examined the validity of micro social observations and macro ratings of parent–child interaction in early to middle childhood. Seven hundred and thirty-one families representing multiple ethnic groups were recruited and screened as at risk in the context of Women, Infant, and Children (WIC) Nutritional Supplement service settings. Families were randomly assigned to the Family Checkup (FCU) intervention or the control condition at age 2 and videotaped in structured interactions in the home at ages 2, 3, 4, and 5. Parent–child interaction videotapes were microcoded using the Relationship Affect Coding System (RACS) that captures the duration of two mutual dyadic states: positive engagement and coercion. Macro ratings of parenting skills were collected after coding the videotapes to assess parent use of positive behavior support and limit setting skills (or lack thereof). Confirmatory factor analyses revealed that the measurement model of macro ratings of limit setting and positive behavior support was not supported by the data, and thus, were excluded from further analyses. However, there was moderate stability in the families’ micro social dynamics across early childhood and it showed significant improvements as a function of random assignment to the FCU. Moreover, parent–child dynamics were predictive of chronic behavior problems as rated by parents in middle childhood, but not emotional problems. We conclude with a discussion of the validity of the RACS and on methodological advantages of micro social coding over the statistical limitations of macro rating observations. Future directions are discussed for observation research in prevention science. PMID:27620623

  18. The Validation of Macro and Micro Observations of Parent-Child Dynamics Using the Relationship Affect Coding System in Early Childhood.

    PubMed

    Dishion, Thomas J; Mun, Chung Jung; Tein, Jenn-Yun; Kim, Hanjoe; Shaw, Daniel S; Gardner, Frances; Wilson, Melvin N; Peterson, Jenene

    2017-04-01

    This study examined the validity of micro social observations and macro ratings of parent-child interaction in early to middle childhood. Seven hundred and thirty-one families representing multiple ethnic groups were recruited and screened as at risk in the context of Women, Infant, and Children (WIC) Nutritional Supplement service settings. Families were randomly assigned to the Family Checkup (FCU) intervention or the control condition at age 2 and videotaped in structured interactions in the home at ages 2, 3, 4, and 5. Parent-child interaction videotapes were micro-coded using the Relationship Affect Coding System (RACS) that captures the duration of two mutual dyadic states: positive engagement and coercion. Macro ratings of parenting skills were collected after coding the videotapes to assess parent use of positive behavior support and limit setting skills (or lack thereof). Confirmatory factor analyses revealed that the measurement model of macro ratings of limit setting and positive behavior support was not supported by the data, and thus, were excluded from further analyses. However, there was moderate stability in the families' micro social dynamics across early childhood and it showed significant improvements as a function of random assignment to the FCU. Moreover, parent-child dynamics were predictive of chronic behavior problems as rated by parents in middle childhood, but not emotional problems. We conclude with a discussion of the validity of the RACS and on methodological advantages of micro social coding over the statistical limitations of macro rating observations. Future directions are discussed for observation research in prevention science.

  19. LIVVkit 2: An extensible land ice verification and validation toolkit for comparing observations and models

    NASA Astrophysics Data System (ADS)

    Kennedy, J. H.; Bennett, A. R.; Evans, K. J.; Fyke, J. G.; Vargo, L.; Price, S. F.; Hoffman, M. J.

    2016-12-01

    Accurate representation of ice sheets and glaciers is essential for robust predictions of arctic climate within Earth System models. Verification and Validation (V&V) is a set of techniques used to quantify the correctness and accuracy of a model, which builds developer/modeler confidence and can be used to enhance the credibility of the model. Fundamentally, V&V is a continuous process because each model change requires a new round of V&V testing. The Community Ice Sheet Model (CISM) development community is actively developing LIVVkit, the Land Ice Verification and Validation toolkit, which is designed to integrate easily into an ice-sheet model's development workflow (on both personal and high-performance computers) to provide continuous V&V testing. LIVVkit is a robust and extensible Python package for V&V, which has components for both software V&V (construction and use) and model V&V (mathematics and physics). The model verification component is used, for example, to verify model results against community intercomparisons such as ISMIP-HOM. The model validation component is used, for example, to generate a series of diagnostic plots showing the differences between model results and observations for variables such as thickness, surface elevation, basal topography, surface velocity, surface mass balance, etc. Because many different ice-sheet models are under active development, new validation datasets are becoming available, and new methods of analysing these models are actively being researched, LIVVkit includes a framework that lets ice-sheet modelers easily extend the model V&V analyses. This allows modelers and developers to develop evaluations of parameters, implement changes, and quickly see how those changes affect the ice-sheet model and Earth system model (when coupled). Furthermore, LIVVkit outputs a portable hierarchical website allowing evaluations to be easily shared, published, and analysed throughout the arctic and Earth system communities.

  20. The Chelsea critical care physical assessment tool (CPAx): validation of an innovative new tool to measure physical morbidity in the general adult critical care population; an observational proof-of-concept pilot study.

    PubMed

    Corner, E J; Wood, H; Englebretsen, C; Thomas, A; Grant, R L; Nikoletou, D; Soni, N

    2013-03-01

    To develop a scoring system to measure physical morbidity in critical care - the Chelsea Critical Care Physical Assessment Tool (CPAx). The development process was iterative involving content validity indices (CVI), a focus group and an observational study of 33 patients to test construct validity against the Medical Research Council score for muscle strength, peak cough flow, Australian Therapy Outcome Measures score, Glasgow Coma Scale score, Bloomsbury sedation score, Sequential Organ Failure Assessment score, Short Form 36 (SF-36) score, days of mechanical ventilation and inter-rater reliability. Trauma and general critical care patients from two London teaching hospitals. Users of the CPAx felt that it possessed content validity, giving a final CVI of 1.00 (P<0.05). Construct validation data showed moderate to strong significant correlations between the CPAx score and all secondary measures, apart from the mental component of the SF-36 which demonstrated weak correlation with the CPAx score (r=0.024, P=0.720). Reliability testing showed internal consistency of α=0.798 and inter-rater reliability of κ=0.988 (95% confidence interval 0.791 to 1.000) between five raters. This pilot work supports proof of concept of the CPAx as a measure of physical morbidity in the critical care population, and is a cogent argument for further investigation of the scoring system. Copyright © 2012 Chartered Society of Physiotherapy. Published by Elsevier Ltd. All rights reserved.

  1. AMSR2 Soil Moisture Product Validation

    NASA Technical Reports Server (NTRS)

    Bindlish, R.; Jackson, T.; Cosh, M.; Koike, T.; Fuiji, X.; de Jeu, R.; Chan, S.; Asanuma, J.; Berg, A.; Bosch, D.

    2017-01-01

    The Advanced Microwave Scanning Radiometer 2 (AMSR2) is part of the Global Change Observation Mission-Water (GCOM-W) mission. AMSR2 fills the void left by the loss of the Advanced Microwave Scanning Radiometer Earth Observing System (AMSR-E) after almost 10 years. Both missions provide brightness temperature observations that are used to retrieve soil moisture. Merging AMSR-E and AMSR2 will help build a consistent long-term dataset. Before tackling the integration of AMSR-E and AMSR2, it is necessary to conduct a thorough validation and assessment of the AMSR2 soil moisture products. This study focuses on validation of the AMSR2 soil moisture products by comparison with in situ reference data from a set of core validation sites. Three products that rely on different algorithms were evaluated: the JAXA Soil Moisture Algorithm (JAXA), the Land Parameter Retrieval Model (LPRM), and the Single Channel Algorithm (SCA). Results indicate that overall the SCA has the best performance based upon the metrics considered.
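
    The abstract does not name the metrics used to compare retrievals with in situ data. As an illustration only, the sketch below computes four statistics commonly reported when a retrieved series is compared against a reference series: bias, RMSE, unbiased RMSE, and the Pearson correlation. The function name `validation_metrics` and the choice of metrics are assumptions, not details taken from the study.

```python
import math


def validation_metrics(retrieved, reference):
    """Compare a retrieved series against matching reference observations."""
    if len(retrieved) != len(reference) or len(retrieved) < 2:
        raise ValueError("need two series of equal length (at least 2 points)")
    n = len(retrieved)

    # Point-by-point differences (retrieval minus reference).
    diffs = [r - o for r, o in zip(retrieved, reference)]
    bias = sum(diffs) / n
    rmse = math.sqrt(sum(d * d for d in diffs) / n)
    # Unbiased RMSE removes the mean offset; clamp tiny negative rounding error.
    ubrmse = math.sqrt(max(rmse ** 2 - bias ** 2, 0.0))

    mean_r = sum(retrieved) / n
    mean_o = sum(reference) / n
    cov = sum((r - mean_r) * (o - mean_o) for r, o in zip(retrieved, reference))
    var_r = sum((r - mean_r) ** 2 for r in retrieved)
    var_o = sum((o - mean_o) ** 2 for o in reference)
    if var_r == 0 or var_o == 0:
        raise ValueError("correlation is undefined for a constant series")
    correlation = cov / math.sqrt(var_r * var_o)

    return {"bias": bias, "rmse": rmse, "ubrmse": ubrmse, "r": correlation}
```

    Running the same function on each product against the same site data gives a like-for-like basis for ranking algorithms.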

  2. Addressing Participant Validity in a Small Internet Health Survey (The Restore Study): Protocol and Recommendations for Survey Response Validation.

    PubMed

    Dewitt, James; Capistrant, Benjamin; Kohli, Nidhi; Rosser, B R Simon; Mitteldorf, Darryl; Merengwa, Enyinnaya; West, William

    2018-04-24

    While deduplication and cross-validation protocols have been recommended for large Web-based studies, protocols for survey response validation of smaller studies have not been published. This paper reports the challenges of survey validation inherent in small Web-based health survey research. The subject population was North American, gay and bisexual, prostate cancer survivors, who represent an under-researched, hidden, difficult-to-recruit, minority-within-a-minority population. In 2015-2016, advertising on a large Web-based cancer survivor support network, using email and social media, yielded 478 completed surveys. Our manual deduplication and cross-validation protocol identified 289 survey submissions (289/478, 60.4%) as likely spam, most stemming from advertising on social media. The basic components of this deduplication and validation protocol are detailed. An unexpected challenge encountered was invalid survey responses evolving across the study period. This necessitated the static detection protocol be augmented with a dynamic one. Five recommendations for validation of Web-based samples, especially with smaller difficult-to-recruit populations, are detailed. ©James Dewitt, Benjamin Capistrant, Nidhi Kohli, B R Simon Rosser, Darryl Mitteldorf, Enyinnaya Merengwa, William West. Originally published in JMIR Research Protocols (http://www.researchprotocols.org), 24.04.2018.

  3. Observational and Modeling Studies of Clouds and the Hydrological Cycle

    NASA Technical Reports Server (NTRS)

    Somerville, Richard C. J.

    1997-01-01

    Our approach involved validating parameterizations directly against measurements from field programs, and using this validation to tune existing parameterizations and to guide the development of new ones. We have used a single-column model (SCM) to make the link between observations and parameterizations of clouds, including explicit cloud microphysics (e.g., prognostic cloud liquid water used to determine cloud radiative properties). Surface and satellite radiation measurements were used to provide an initial evaluation of the performance of the different parameterizations. The results of this evaluation were then used to develop improved cloud and cloud-radiation schemes, which were tested in GCM experiments.

  4. ADAPTIVE MATCHING IN RANDOMIZED TRIALS AND OBSERVATIONAL STUDIES

    PubMed Central

    van der Laan, Mark J.; Balzer, Laura B.; Petersen, Maya L.

    2014-01-01

    SUMMARY In many randomized and observational studies the allocation of treatment among a sample of n independent and identically distributed units is a function of the covariates of all sampled units. As a result, the treatment labels among the units are possibly dependent, complicating estimation and posing challenges for statistical inference. For example, cluster randomized trials frequently sample communities from some target population, construct matched pairs of communities from those included in the sample based on some metric of similarity in baseline community characteristics, and then randomly allocate a treatment and a control intervention within each matched pair. In this case, the observed data can neither be represented as the realization of n independent random variables, nor, contrary to current practice, as the realization of n/2 independent random variables (treating the matched pair as the independent sampling unit). In this paper we study estimation of the average causal effect of a treatment under experimental designs in which treatment allocation potentially depends on the pre-intervention covariates of all units included in the sample. We define efficient targeted minimum loss based estimators for this general design, present a theorem that establishes the desired asymptotic normality of these estimators and allows for asymptotically valid statistical inference, and discuss implementation of these estimators. We further investigate the relative asymptotic efficiency of this design compared with a design in which unit-specific treatment assignment depends only on the units’ covariates. Our findings have practical implications for the optimal design and analysis of pair matched cluster randomized trials, as well as for observational studies in which treatment decisions may depend on characteristics of the entire sample. PMID:25097298

  5. Creating and validating GIS measures of urban design for health research.

    PubMed

    Purciel, Marnie; Neckerman, Kathryn M; Lovasi, Gina S; Quinn, James W; Weiss, Christopher; Bader, Michael D M; Ewing, Reid; Rundle, Andrew

    2009-12-01

    Studies relating urban design to health have been impeded by the unfeasibility of conducting field observations across large areas and the lack of validated objective measures of urban design. This study describes measures for five dimensions of urban design - imageability, enclosure, human scale, transparency, and complexity - created using public geographic information systems (GIS) data from the US Census and city and state government. GIS measures were validated for a sample of 588 New York City block faces using a well-documented field observation protocol. Correlations between GIS and observed measures ranged from 0.28 to 0.89. Results show valid urban design measures can be constructed from digital sources.

  6. Body Dysmorphic Symptoms Scale for patients seeking esthetic surgery: cross-cultural validation study.

    PubMed

    Ramos, Tatiana Dalpasquale; Brito, Maria José Azevedo de; Piccolo, Mônica Sarto; Rosella, Maria Fernanda Normanha da Silva Martins; Sabino, Miguel; Ferreira, Lydia Masako

    2016-07-21

    Rhinoplasty is one of the most sought-after esthetic operations among individuals with body dysmorphic disorder. The aim of this study was to cross-culturally adapt and validate the Body Dysmorphic Symptoms Scale. Cross-cultural validation study conducted in a plastic surgery outpatient clinic of a public university hospital. Between February 2014 and March 2015, 80 consecutive patients of both sexes seeking rhinoplasty were selected. Thirty of them participated in the phase of cultural adaptation of the instrument. Reproducibility was tested on 20 patients and construct validity was assessed on 50 patients, with correlation against the Yale-Brown Obsessive Compulsive Scale for Body Dysmorphic Disorder. The Brazilian version of the instrument showed Cronbach's alpha of 0.805 and excellent inter-rater reproducibility (intraclass correlation coefficient, ICC = 0.873; P < 0.001) and intra-rater reproducibility (ICC = 0.939; P < 0.001). Significant differences in total scores were found between patients with and without symptoms (P < 0.001). A strong correlation (r = 0.841; P < 0.001) was observed between the Yale-Brown Obsessive Compulsive Scale for Body Dysmorphic Disorder and the Body Dysmorphic Symptoms Scale. The area under the receiver operating characteristic curve was 0.981, thus showing good accuracy for discriminating between presence and absence of symptoms of body dysmorphic disorder. Forty-six percent of the patients had body dysmorphic symptoms and 54% had moderate to severe appearance-related obsessive-compulsive symptoms. The Brazilian version of the Body Dysmorphic Symptoms Scale is a reproducible instrument that presents face, content and construct validity.

  7. Body Dysmorphic Symptoms Scale for patients seeking esthetic surgery: cross-cultural validation study.

    PubMed

    Ramos, Tatiana Dalpasquale; Brito, Maria José Azevedo de; Piccolo, Mônica Sarto; Rosella, Maria Fernanda Normanha da Silva Martins; Sabino, Miguel; Ferreira, Lydia Masako

    2016-01-01

    Rhinoplasty is one of the most sought-after esthetic operations among individuals with body dysmorphic disorder. The aim of this study was to cross-culturally adapt and validate the Body Dysmorphic Symptoms Scale. Cross-cultural validation study conducted in a plastic surgery outpatient clinic of a public university hospital. Between February 2014 and March 2015, 80 consecutive patients of both sexes seeking rhinoplasty were selected. Thirty of them participated in the phase of cultural adaptation of the instrument. Reproducibility was tested on 20 patients and construct validity was assessed on 50 patients, with correlation against the Yale-Brown Obsessive Compulsive Scale for Body Dysmorphic Disorder. The Brazilian version of the instrument showed Cronbach's alpha of 0.805 and excellent inter-rater reproducibility (intraclass correlation coefficient, ICC = 0.873; P < 0.001) and intra-rater reproducibility (ICC = 0.939; P < 0.001). Significant differences in total scores were found between patients with and without symptoms (P < 0.001). A strong correlation (r = 0.841; P < 0.001) was observed between the Yale-Brown Obsessive Compulsive Scale for Body Dysmorphic Disorder and the Body Dysmorphic Symptoms Scale. The area under the receiver operating characteristic curve was 0.981, thus showing good accuracy for discriminating between presence and absence of symptoms of body dysmorphic disorder. Forty-six percent of the patients had body dysmorphic symptoms and 54% had moderate to severe appearance-related obsessive-compulsive symptoms. The Brazilian version of the Body Dysmorphic Symptoms Scale is a reproducible instrument that presents face, content and construct validity.

  8. The UK Biobank sample handling and storage validation studies.

    PubMed

    Peakman, Tim C; Elliott, Paul

    2008-04-01

    UK Biobank is a large prospective study in the United Kingdom to investigate the role of genetic factors, environmental exposures and lifestyle in the causes of major diseases of late and middle age. It involves the collection of blood and urine from 500 000 individuals aged between 40 and 69 years. How the samples are collected, processed and stored will have a major impact on the future scientific usefulness of the UK Biobank resource. A series of validation studies was recommended to test the robustness of the draft sample handling and storage protocol. Samples of blood and urine were collected from 40 healthy volunteers and either processed immediately according to the protocol or maintained at specified temperatures (4 degrees C for all tubes with the exception of vacutainers containing acid citrate dextrose that were maintained at 18 degrees C) for 12, 24 or 36 h prior to processing. A further sample was maintained for 24 h at 4 degrees C, processed and the aliquots frozen at -80 degrees C for 20 days and then thawed under controlled conditions. The stability of the samples was compared for the different times in a wide variety of assays. The samples maintained at 4 degrees C were stable for at least 24 h after collection for a wide range of assays. Small but significant changes were observed in metabonomic studies in samples maintained at 4 degrees C for 36 h. There was no degradation of the samples for a range of biochemical assays after short-term freezing and thawing under controlled conditions. Whole blood maintained at 18 degrees C for 24 h in vacutainers containing acid citrate dextrose is suitable for viral immortalization techniques. The validation studies reported in this supplement provide justification for the sample handling and storage procedures adopted in the UK Biobank project.

  9. The use of multiple imputation method for the validation of 24-h food recalls by part-time observation of dietary intake in school.

    PubMed

    Kupek, Emil; de Assis, Maria Alice A

    2016-09-01

    External validation of food recall over 24 h in schoolchildren is often restricted to eating events in schools and is based on direct observation as the reference method. The aim of this study was to estimate the dietary intake out of school, and consequently the bias in such a research design based on only part-time validated food recall, using multiple imputation (MI) conditioned on the information on child age, sex, BMI, family income, parental education and the school attended. The previous-day, web-based questionnaire WebCAAFE, structured as six meals/snacks and thirty-two foods/beverages, was answered by a sample of 7-11-year-old Brazilian schoolchildren (n 602) from five public schools. Food/beverage intake recalled by children was compared with the records provided by trained observers during school meals. Sensitivity analysis was performed with artificial data emulating those recalled by children on WebCAAFE in order to evaluate the impact of both differential and non-differential bias. Estimated bias was within ±30 % interval for 84·4 % of the thirty-two foods/beverages evaluated in WebCAAFE, and half of the latter reached statistical significance (P<0·05). Rarely (<3 %) consumed dietary items were often under-reported (fish/seafood, vegetable soup, cheese bread, French fries), whereas some of those most frequently reported (meat, bread/biscuits, fruits) showed large overestimation. Compared with the analysis restricted to fully validated data, MI reduced differential bias in sensitivity analysis but the bias still remained large in most cases. MI provided a suitable statistical framework for part-time validation design of dietary intake over six daily eating events.

  10. JaCVAM-organized international validation study of the in vivo rodent alkaline comet assay for detection of genotoxic carcinogens: II. Summary of definitive validation study results.

    PubMed

    Uno, Yoshifumi; Kojima, Hajime; Omori, Takashi; Corvi, Raffaella; Honma, Masamitsu; Schechtman, Leonard M; Tice, Raymond R; Beevers, Carol; De Boeck, Marlies; Burlinson, Brian; Hobbs, Cheryl A; Kitamoto, Sachiko; Kraynak, Andrew R; McNamee, James; Nakagawa, Yuzuki; Pant, Kamala; Plappert-Helbig, Ulla; Priestley, Catherine; Takasawa, Hironao; Wada, Kunio; Wirnitzer, Uta; Asano, Norihide; Escobar, Patricia A; Lovell, David; Morita, Takeshi; Nakajima, Madoka; Ohno, Yasuo; Hayashi, Makoto

    2015-07-01

    The in vivo rodent alkaline comet assay (comet assay) is used internationally to investigate the in vivo genotoxic potential of test chemicals. This assay, however, has not previously been formally validated. The Japanese Center for the Validation of Alternative Methods (JaCVAM), with the cooperation of the U.S. NTP Interagency Center for the Evaluation of Alternative Toxicological Methods (NICEATM)/the Interagency Coordinating Committee on the Validation of Alternative Methods (ICCVAM), the European Centre for the Validation of Alternative Methods (ECVAM), and the Japanese Environmental Mutagen Society/Mammalian Mutagenesis Study Group (JEMS/MMS), organized an international validation study to evaluate the reliability and relevance of the assay for identifying genotoxic carcinogens, using liver and stomach as target organs. The ultimate goal of this exercise was to establish an Organisation for Economic Co-operation and Development (OECD) test guideline. The study protocol was optimized in the pre-validation studies, and then the definitive (4th phase) validation study was conducted in two steps. In the 1st step, assay reproducibility was confirmed among laboratories using four coded reference chemicals and the positive control ethyl methanesulfonate. In the 2nd step, the predictive capability was investigated using 40 coded chemicals with known genotoxic and carcinogenic activity (i.e., genotoxic carcinogens, genotoxic non-carcinogens, non-genotoxic carcinogens, and non-genotoxic non-carcinogens). Based on the results obtained, the in vivo comet assay is concluded to be highly capable of identifying genotoxic chemicals and therefore can serve as a reliable predictor of rodent carcinogenicity. Copyright © 2015 Elsevier B.V. All rights reserved.

  11. Construct Validation Theory Applied to the Study of Personality Dysfunction

    PubMed Central

    Zapolski, Tamika C. B.; Guller, Leila; Smith, Gregory T.

    2013-01-01

    The authors review theory validation and construct validation principles as related to the study of personality dysfunction. Historically, personality disorders have been understood to be syndromes of heterogeneous symptoms. The authors argue that the syndrome approach to description results in diagnoses of unclear meaning and constrained validity. The alternative approach of describing personality dysfunction in terms of homogeneous dimensions of functioning avoids the problems of the syndromal approach and has been shown to provide more valid description and diagnosis. The authors further argue that description based on homogeneous dimensions of personality function/dysfunction is more useful, because it provides direct connections to validated treatments. PMID:22321263

  12. Reliable Digit Span: A Systematic Review and Cross-Validation Study

    ERIC Educational Resources Information Center

    Schroeder, Ryan W.; Twumasi-Ankrah, Philip; Baade, Lyle E.; Marshall, Paul S.

    2012-01-01

    Reliable Digit Span (RDS) is a heavily researched symptom validity test with a recent literature review yielding more than 20 studies ranging in dates from 1994 to 2011. Unfortunately, limitations within some of the research minimize clinical generalizability. This systematic review and cross-validation study was conducted to address these…

  13. Vaginal birth after caesarean section prediction models: a UK comparative observational study.

    PubMed

    Mone, Fionnuala; Harrity, Conor; Mackie, Adam; Segurado, Ricardo; Toner, Brenda; McCormick, Timothy R; Currie, Aoife; McAuliffe, Fionnuala M

    2015-10-01

    Primarily, to assess the performance of three statistical models in predicting successful vaginal birth in patients attempting a trial of labour after one previous lower segment caesarean section (TOLAC). The statistically most reliable models were subsequently subjected to validation testing in a local antenatal population. A retrospective observational study was performed with study data collected from the Northern Ireland Maternity Service Database (NIMATs). The study population included all women that underwent a TOLAC (n=385) from 2010 to 2012 in a regional UK obstetric unit. Area under the curve (AUC) and correlation analyses were performed. Of the three prediction models evaluated, AUC calculations for the Smith et al., Grobman et al. and Troyer and Parisi Models were 0.74, 0.72 and 0.65, respectively. Using the Smith et al. model, 52% of women had a low risk of caesarean section (CS) (predicted VBAC >72%) and 20% had a high risk of CS (predicted VBAC <60%), of whom 20% and 63%, respectively, had delivery by CS. The fit between observed and predicted outcome in this study cohort was greatest using the Smith et al. and Grobman et al. models (Chi-square test, p=0.228 and 0.904), validating both within the population. The Smith et al. and Grobman et al. models could potentially be utilized within the UK to provide women with an informed choice when deciding on mode of delivery after a previous CS. Crown Copyright © 2015. Published by Elsevier Ireland Ltd. All rights reserved.
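
    The AUC values quoted in this record summarize how well each model ranks women who had a vaginal birth above those who did not. A minimal Python sketch of that calculation follows, using the rank (Mann-Whitney) formulation. The function name and the six predicted probabilities and outcomes are hypothetical illustrations, not data from the study, and the abstract does not say how the authors computed AUC.

```python
def auc_from_scores(scores, outcomes):
    """Area under the ROC curve via pairwise comparison of scores.

    scores   -- predicted probabilities (higher = more likely positive)
    outcomes -- observed labels, 1 for positive and 0 for negative
    """
    pos = [s for s, y in zip(scores, outcomes) if y == 1]
    neg = [s for s, y in zip(scores, outcomes) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative outcome")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0   # positive case ranked above negative case
            elif p == n:
                wins += 0.5   # ties count as half
    return wins / (len(pos) * len(neg))


# Hypothetical predicted VBAC probabilities and observed outcomes (1 = VBAC).
predicted = [0.85, 0.78, 0.66, 0.72, 0.55, 0.40]
observed = [1, 1, 1, 0, 1, 0]
auc = auc_from_scores(predicted, observed)
print(f"AUC = {auc:.2f}")
```

    An AUC of 0.5 corresponds to ranking no better than chance and 1.0 to perfect ranking, which is the scale on which the reported 0.74, 0.72 and 0.65 should be read.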

  14. Ambulance smartphone tool for field triage of ruptured aortic aneurysms (FILTR): study protocol for a prospective observational validation of diagnostic accuracy.

    PubMed

    Lewis, Thomas L; Fothergill, Rachael T; Karthikesalingam, Alan

    2016-10-24

    Rupture of an abdominal aortic aneurysm (rAAA) carries a considerable mortality rate and is often fatal. rAAA can be treated through open or endovascular surgical intervention and it is possible that more rapid access to definitive intervention might be a key aspect of improving mortality for rAAA. Diagnosis is not always straightforward with up to 42% of rAAA initially misdiagnosed, introducing potentially harmful delay. There is a need for an effective clinical decision support tool for accurate prehospital diagnosis and triage to enable transfer to an appropriate centre. Prospective multicentre observational study assessing the diagnostic accuracy of a prehospital smartphone triage tool for detection of rAAA. The study will be conducted across London in conjunction with London Ambulance Service (LAS). A logistic score predicting the risk of rAAA by assessing ten key parameters was developed and retrospectively validated through logistic regression analysis of ambulance records and Hospital Episode Statistics data for 2200 patients from 2005 to 2010. The triage tool is integrated into a secure mobile app for major smartphone platforms. Key parameters collected from the app will be retrospectively matched with final hospital discharge diagnosis for each patient encounter. The primary outcome is to assess the sensitivity, specificity and positive predictive value of the rAAA triage tool logistic score in prospective use as a mobile app for prehospital ambulance clinicians. Data collection started in November 2014 and the study will recruit a minimum of 1150 non-consecutive patients over a time period of 2 years. Full ethical approval has been gained for this study. The results of this study will be disseminated in peer-reviewed publications, and international/national presentations. CPMS 16459; pre-results. Published by the BMJ Publishing Group Limited.
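
    The primary outcome measures named in this protocol (sensitivity, specificity and positive predictive value) all derive from a 2x2 table of tool result against final diagnosis. A short Python sketch under stated assumptions follows. The function name and the counts are invented for illustration, since the record is a protocol and reports no results.

```python
def diagnostic_accuracy(tp, fp, fn, tn):
    """Sensitivity, specificity and PPV from 2x2 confusion-matrix counts.

    tp -- tool positive, condition present
    fp -- tool positive, condition absent
    fn -- tool negative, condition present
    tn -- tool negative, condition absent
    """
    return {
        "sensitivity": tp / (tp + fn),  # share of true cases the tool flags
        "specificity": tn / (tn + fp),  # share of non-cases the tool clears
        "ppv": tp / (tp + fp),          # share of flagged cases that are true
    }


# Hypothetical counts for 1150 patient encounters (not study data).
metrics = diagnostic_accuracy(tp=40, fp=60, fn=10, tn=1040)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

    With a rare condition, the positive predictive value can stay low even when sensitivity and specificity are both high, which is one reason a protocol would report all three.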

  15. Direct Observation of Clinical Skills Feedback Scale: Development and Validity Evidence.

    PubMed

    Halman, Samantha; Dudek, Nancy; Wood, Timothy; Pugh, Debra; Touchie, Claire; McAleer, Sean; Humphrey-Murto, Susan

    2016-01-01

    Construct: This article describes the development and validity evidence behind a new rating scale to assess feedback quality in the clinical workplace. Competency-based medical education has mandated a shift to learner-centeredness, authentic observation, and frequent formative assessments with a focus on the delivery of effective feedback. Because feedback has been shown to be of variable quality and effectiveness, an assessment of feedback quality in the workplace is important to ensure we are providing trainees with optimal learning opportunities. The purposes of this project were to develop a rating scale for the quality of verbal feedback in the workplace (the Direct Observation of Clinical Skills Feedback Scale [DOCS-FBS]) and to gather validity evidence for its use. Two panels of experts (local and national) took part in a nominal group technique to identify features of high-quality feedback. Through multiple iterations and review, 9 features were developed into the DOCS-FBS. Four rater types (residents n = 21, medical students n = 8, faculty n = 12, and educators n = 12) used the DOCS-FBS to rate videotaped feedback encounters of variable quality. The psychometric properties of the scale were determined using a generalizability analysis. Participants also completed a survey to gather data on a 5-point Likert scale to inform the ease of use, clarity, knowledge acquisition, and acceptability of the scale. Mean video ratings ranged from 1.38 to 2.96 out of 3 and followed the intended pattern suggesting that the tool allowed raters to distinguish between examples of higher and lower quality feedback. There were no significant differences between rater types (range = 2.36-2.49), suggesting that all groups of raters used the tool in the same way. The generalizability coefficients for the scale ranged from 0.97 to 0.99. Item-total correlations were all above 0.80, suggesting some redundancy in items. Participants found the scale easy to use (M = 4.31/5) and clear

  16. External validity of post-stroke interventional gait rehabilitation studies.

    PubMed

    Kafri, Michal; Dickstein, Ruth

    2017-01-01

    Gait rehabilitation is a major component of stroke rehabilitation, and is supported by extensive research. The objective of this review was to examine the external validity of intervention studies aimed at improving gait in individuals post-stroke. To that end, two aspects of these studies were assessed: subjects' exclusion criteria and the ecological validity of the intervention, as manifested by the intervention's technological complexity and delivery setting. Additionally, we examined whether the target population as inferred from the titles/abstracts is broader than the population actually represented by the reported samples. We systematically searched PubMed for intervention studies to improve gait post-stroke, working backwards from the beginning of 2014. Exclusion criteria, the technological complexity of the intervention (defined as either elaborate or simple), setting, and description of the target population in the titles/abstracts were recorded. Fifty-two studies were reviewed. The samples were exclusive, with recurrent stroke, co-morbidities, cognitive status, walking level, and residency being major reasons for exclusion. In one half of the studies, the intervention was elaborate. Descriptions of participants in the title/abstract in almost one half of the studies included only the diagnosis (stroke or comparable terms) and its stage (acute, subacute, and chronic). The external validity of a substantial number of intervention studies about rehabilitation of gait post-stroke appears to be limited by exclusivity of the samples as well as by deficiencies in ecological validity of the interventions. These limitations are not accurately reflected in the titles or abstracts of the studies.

  17. [Turkish validity and reliability study of fear of pain questionnaire-III].

    PubMed

    Ünver, Seher; Turan, Fatma Nesrin

    2018-01-01

    This study aimed to develop a Turkish version of the Fear of Pain Questionnaire-III developed by McNeil and Rainwater (1998) and examine its validity and reliability indicators. The study was conducted with 459 university students studying in the nursing department. The Turkish translation of the scale was conducted by language experts and the original scale owner. Expert opinions were taken for language validity, and Lawshe's content validity ratio formula was used to calculate the content validity. Exploratory factor analysis was used to assess the construct validity. The factors were rotated using the Varimax rotation (orthogonal) method. For reliability indicators of the questionnaire, the internal consistency coefficient and test-retest reliability were utilized. Exploratory factor analyses using the three-factor model (explaining 50.5% of the total variance) revealed that the item factor loadings were above the limit value of 0.30, which indicated that the questionnaire had good construct validity. The Cronbach's alpha value for the total questionnaire was 0.938, and the test-retest value was 0.846 for the total scale. The Turkish version of the Fear of Pain Questionnaire-III had sufficiently high reliability and validity to be used as a tool in evaluating the fear of pain among the young Turkish population.
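
    The internal consistency coefficient reported in this record is Cronbach's alpha, which compares the sum of the individual item variances with the variance of the total score. A minimal Python sketch of the standard formula follows. The function name and the three-item, five-respondent data set are hypothetical, and the study's own software and item data are not given in the abstract.

```python
from statistics import variance  # sample variance (n - 1 denominator)


def cronbach_alpha(items):
    """Cronbach's alpha for a list of items.

    items -- list of lists; items[j][i] is respondent i's score on item j.
    alpha = k / (k - 1) * (1 - sum(item variances) / variance(total scores))
    """
    k = len(items)
    if k < 2:
        raise ValueError("need at least two items")
    n = len(items[0])
    totals = [sum(item[i] for item in items) for i in range(n)]
    item_variance_sum = sum(variance(item) for item in items)
    return (k / (k - 1)) * (1 - item_variance_sum / variance(totals))


# Hypothetical scores: three items answered by five respondents.
scores = [
    [2, 3, 4, 4, 5],
    [1, 3, 3, 4, 5],
    [2, 2, 4, 5, 5],
]
alpha = cronbach_alpha(scores)
print(f"Cronbach's alpha = {alpha:.3f}")
```

    Values near 1 indicate that the items vary together, which is the sense in which the reported 0.938 is described as high reliability.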

  18. RELIABILITY AND VALIDITY OF SUBJECTIVE ASSESSMENT OF LUMBAR LORDOSIS IN CONVENTIONAL RADIOGRAPHY.

    PubMed

    Ruhinda, E; Byanyima, R K; Mugerwa, H

    2014-10-01

    Reliability and validity studies of different lumbar curvature analysis and measurement techniques have been documented; however, there is limited literature on the reliability and validity of subjective visual analysis. Radiological assessment of lumbar lordotic curve aids in early diagnosis of conditions even before neurologic changes set in. To ascertain the level of reliability and validity of subjective assessment of lumbar lordosis in conventional radiography. A blinded, repeated-measures diagnostic test was carried out on lumbar spine x-ray radiographs. Radiology Department at Joint Clinical Research Centre (JCRC), Mengo-Kampala-Uganda. Seventy (70) lateral lumbar x-ray films were used for this study and were obtained from the archive of JCRC radiology department at Butikiro house, Mengo-Kampala. Poor observer agreement, both inter- and intra-observer, with kappa values of 0.16 was found. Inter-observer agreement was poorer than intra-observer agreement. Kappa values significantly rose when the lumbar lordosis was clustered into four categories without grading each abnormality. The results confirm that subjective assessment of lumbar lordosis has low reliability and validity. Film quality has limited influence on the observer reliability. This study further shows that fewer scale categories of lordosis abnormalities produce better observer reliability.
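
    The kappa statistic cited in this record corrects raw observer agreement for the agreement expected by chance. The abstract does not say which kappa variant was used, so the Python sketch below assumes the two-rater Cohen's kappa. The function name and the ratings are hypothetical.

```python
from collections import Counter


def cohens_kappa(rater_a, rater_b):
    """Cohen's kappa for two raters classifying the same cases.

    kappa = (observed agreement - chance agreement) / (1 - chance agreement)
    """
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("both raters must rate the same non-empty set of cases")
    n = len(rater_a)
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    count_a = Counter(rater_a)
    count_b = Counter(rater_b)
    categories = set(count_a) | set(count_b)
    # Chance agreement from each rater's marginal category frequencies.
    expected = sum(count_a[c] * count_b[c] for c in categories) / (n * n)
    if expected == 1.0:
        return 1.0  # both raters used a single identical category throughout
    return (observed - expected) / (1 - expected)


# Hypothetical lordosis gradings of eight films by two observers.
observer_1 = ["normal", "hypo", "hyper", "normal", "normal", "hypo", "hyper", "normal"]
observer_2 = ["normal", "normal", "hyper", "hypo", "normal", "hypo", "normal", "normal"]
kappa = cohens_kappa(observer_1, observer_2)
print(f"kappa = {kappa:.2f}")
```

    Merging categories changes both the observed and the chance agreement, which is consistent with the record's finding that kappa rose when fewer categories were used.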

  19. Validation of motion correction techniques for liver CT perfusion studies

    PubMed Central

    Chandler, A; Wei, W; Anderson, E F; Herron, D H; Ye, Z; Ng, C S

    2012-01-01

    Objectives: Motion in images potentially compromises the evaluation of temporally acquired CT perfusion (CTp) data; image registration should mitigate this, but first requires validation. Our objective was to compare the relative performance of manual, rigid and non-rigid registration techniques to correct anatomical misalignment in acquired liver CTp data sets. Methods: 17 data sets in patients with liver tumours who had undergone a CTp protocol were evaluated. Each data set consisted of a cine acquisition during a breath-hold (Phase 1), followed by six further sets of cine scans (each containing 11 images) acquired during free breathing (Phase 2). Phase 2 images were registered to a reference image from Phase 1 cine using two semi-automated intensity-based registration techniques (rigid and non-rigid) and a manual technique (the only option available in the relevant vendor CTp software). The performance of each technique to align liver anatomy was assessed by four observers, independently and blindly, on two separate occasions, using a semi-quantitative visual validation study (employing a six-point score). The registration techniques were statistically compared using an ordinal probit regression model. Results: 306 registrations (2448 observer scores) were evaluated. The three registration techniques were significantly different from each other (p=0.03). On pairwise comparison, the semi-automated techniques were significantly superior to the manual technique, with non-rigid significantly superior to rigid (p<0.0001), which in turn was significantly superior to manual registration (p=0.04). Conclusion: Semi-automated registration techniques achieved superior alignment of liver anatomy compared with the manual technique. We hope this will translate into more reliable CTp analyses. PMID:22374283

  20. Poor replication validity of biomedical association studies reported by newspapers

    PubMed Central

    Dumas-Mallet, Estelle; Smith, Andy; Boraud, Thomas; Gonon, François

    2017-01-01

    Objective: To investigate the replication validity of biomedical association studies covered by newspapers. Methods: We used a database of 4723 primary studies included in 306 meta-analysis articles. These studies associated a risk factor with a disease in three biomedical domains, psychiatry, neurology and four somatic diseases. They were classified into a lifestyle category (e.g. smoking) and a non-lifestyle category (e.g. genetic risk). Using the database Dow Jones Factiva, we investigated the newspaper coverage of each study. Their replication validity was assessed using a comparison with their corresponding meta-analyses. Results: Among the 5029 articles of our database, 156 primary studies (of which 63 were lifestyle studies) and 5 meta-analysis articles were reported in 1561 newspaper articles. The percentage of covered studies and the number of newspaper articles per study strongly increased with the impact factor of the journal that published each scientific study. Newspapers almost equally covered initial (5/39 12.8%) and subsequent (58/600 9.7%) lifestyle studies. In contrast, initial non-lifestyle studies were covered more often (48/366 13.1%) than subsequent ones (45/3718 1.2%). Newspapers never covered initial studies reporting null findings and rarely reported subsequent null observations. Only 48.7% of the 156 studies reported by newspapers were confirmed by the corresponding meta-analyses. Initial non-lifestyle studies were less often confirmed (16/48) than subsequent ones (29/45) and than lifestyle studies (31/63). Psychiatric studies covered by newspapers were less often confirmed (10/38) than the neurological (26/41) or somatic (40/77) ones. This is correlated to an even larger coverage of initial studies in psychiatry. Whereas 234 newspaper articles covered the 35 initial studies that were later disconfirmed, only four press articles covered a subsequent null finding and mentioned the refutation of an initial claim. Conclusion: Journalists

  1. Poor replication validity of biomedical association studies reported by newspapers.

    PubMed

    Dumas-Mallet, Estelle; Smith, Andy; Boraud, Thomas; Gonon, François

    2017-01-01

    To investigate the replication validity of biomedical association studies covered by newspapers. We used a database of 4723 primary studies included in 306 meta-analysis articles. These studies associated a risk factor with a disease in three biomedical domains, psychiatry, neurology and four somatic diseases. They were classified into a lifestyle category (e.g. smoking) and a non-lifestyle category (e.g. genetic risk). Using the database Dow Jones Factiva, we investigated the newspaper coverage of each study. Their replication validity was assessed using a comparison with their corresponding meta-analyses. Among the 5029 articles of our database, 156 primary studies (of which 63 were lifestyle studies) and 5 meta-analysis articles were reported in 1561 newspaper articles. The percentage of covered studies and the number of newspaper articles per study strongly increased with the impact factor of the journal that published each scientific study. Newspapers almost equally covered initial (5/39 12.8%) and subsequent (58/600 9.7%) lifestyle studies. In contrast, initial non-lifestyle studies were covered more often (48/366 13.1%) than subsequent ones (45/3718 1.2%). Newspapers never covered initial studies reporting null findings and rarely reported subsequent null observations. Only 48.7% of the 156 studies reported by newspapers were confirmed by the corresponding meta-analyses. Initial non-lifestyle studies were less often confirmed (16/48) than subsequent ones (29/45) and than lifestyle studies (31/63). Psychiatric studies covered by newspapers were less often confirmed (10/38) than the neurological (26/41) or somatic (40/77) ones. This is correlated to an even larger coverage of initial studies in psychiatry. Whereas 234 newspaper articles covered the 35 initial studies that were later disconfirmed, only four press articles covered a subsequent null finding and mentioned the refutation of an initial claim. Journalists preferentially cover initial findings

  2. Creating and validating GIS measures of urban design for health research

    PubMed Central

    Purciel, Marnie; Neckerman, Kathryn M.; Lovasi, Gina S.; Quinn, James W.; Weiss, Christopher; Bader, Michael D.M.; Ewing, Reid; Rundle, Andrew

    2012-01-01

    Studies relating urban design to health have been impeded by the unfeasibility of conducting field observations across large areas and the lack of validated objective measures of urban design. This study describes measures for five dimensions of urban design – imageability, enclosure, human scale, transparency, and complexity – created using public geographic information systems (GIS) data from the US Census and city and state government. GIS measures were validated for a sample of 588 New York City block faces using a well-documented field observation protocol. Correlations between GIS and observed measures ranged from 0.28 to 0.89. Results show valid urban design measures can be constructed from digital sources. PMID:22956856

  3. Dietary Screener in the 2009 CHIS: Validation

    Cancer.gov

    In the Eating at America's Table Study and the Observing Protein and Energy Nutrition Study, Risk Factors Branch staff assessed the validity of created aggregate variables from the 2009 CHIS Dietary Screener.

  4. Measuring leprosy-related stigma - a pilot study to validate a toolkit of instruments.

    PubMed

    Rensen, Carin; Bandyopadhyay, Sudhakar; Gopal, Pala K; Van Brakel, Wim H

    2011-01-01

    Stigma negatively affects the quality of life of leprosy-affected people. Instruments are needed to assess levels of stigma and to monitor and evaluate stigma reduction interventions. We conducted a validation study of such instruments in Tamil Nadu and West Bengal, India. Four instruments were tested in a 'Community Based Rehabilitation' (CBR) setting: the Participation Scale, the Internalised Stigma of Mental Illness (ISMI) scale adapted for leprosy-affected persons, the Explanatory Model Interview Catalogue (EMIC) for leprosy-affected and non-affected persons, and the General Self-Efficacy (GSE) Scale. We evaluated the following components of validity: construct validity, internal consistency, test-retest reproducibility and reliability to distinguish between groups. Construct validity was tested by correlating instrument scores and by triangulating quantitative and qualitative findings. Reliability was evaluated by comparing levels of stigma among people affected by leprosy and community controls, and among affected people living in CBR project areas and those in non-CBR areas. For the Participation, ISMI and EMIC scores, significant differences were observed between those affected by leprosy and those not affected (p = 0.0001), and between affected persons in the CBR and Control group (p < 0.05). The internal consistency of the instruments measured with Cronbach's α ranged from 0.83 to 0.96 and was very good for all instruments. Test-retest reproducibility coefficients were 0.80 for the Participation score, 0.70 for the EMIC score, 0.62 for the ISMI score and 0.50 for the GSE score. The construct validity of all instruments was confirmed. The Participation and EMIC Scales met all validity criteria, but test-retest reproducibility of the ISMI and GSE Scales needs further evaluation with a shorter test-retest interval and longer training and additional adaptations for the latter.

  5. Optical Tracking Data Validation and Orbit Estimation for Sparse Observations of Satellites by the OWL-Net.

    PubMed

    Choi, Jin; Jo, Jung Hyun; Yim, Hong-Suh; Choi, Eun-Jung; Cho, Sungki; Park, Jang-Hyun

    2018-06-07

    An Optical Wide-field patroL-Network (OWL-Net) has been developed for maintaining Korean low Earth orbit (LEO) satellites' orbital ephemeris. The OWL-Net consists of five optical tracking stations. Brightness signals of reflected sunlight of the targets were detected by a charge-coupled device (CCD). A chopper system was adopted for fast astrometric data sampling, maximum 50 Hz, within a short observation time. The astrometric accuracy of the optical observation data was validated with precise orbital ephemeris such as Consolidated Prediction File (CPF) data and precise orbit determination result with onboard Global Positioning System (GPS) data from the target satellite. In the optical observation simulation of the OWL-Net for 2017, an average observation span for a single arc of 11 LEO observation targets was about 5 min, while an average optical observation separation time was 5 h. We estimated the position and velocity with an atmospheric drag coefficient of LEO observation targets using a sequential-batch orbit estimation technique after multi-arc batch orbit estimation. Post-fit residuals for the multi-arc batch orbit estimation and sequential-batch orbit estimation were analyzed for the optical measurements and reference orbit (CPF and GPS data). The post-fit residuals with respect to the reference orbit show in-track errors of a few tens of meters for both the multi-arc batch and sequential-batch orbit estimation results.

  6. Design Characteristics Influence Performance of Clinical Prediction Rules in Validation: A Meta-Epidemiological Study.

    PubMed

    Ban, Jong-Wook; Emparanza, José Ignacio; Urreta, Iratxe; Burls, Amanda

    2016-01-01

    Many new clinical prediction rules are derived and validated, but the design and reporting quality of clinical prediction research has been less than optimal. We aimed to assess whether design characteristics of validation studies were associated with the overestimation of clinical prediction rules' performance. We also aimed to evaluate whether validation studies clearly reported important methodological characteristics. Electronic databases were searched for systematic reviews of clinical prediction rule studies published between 2006 and 2010. Data were extracted from the eligible validation studies included in the systematic reviews. A meta-analytic meta-epidemiological approach was used to assess the influence of design characteristics on predictive performance. From each validation study, it was assessed whether 7 design and 7 reporting characteristics were properly described. A total of 287 validation studies of clinical prediction rules were collected from 15 systematic reviews (31 meta-analyses). Validation studies using case-control design produced a summary diagnostic odds ratio (DOR) 2.2 times (95% CI: 1.2-4.3) larger than validation studies using cohort design and unclear design. When differential verification was used, the summary DOR was overestimated by twofold (95% CI: 1.2-3.1) compared to complete, partial and unclear verification. The summary relative diagnostic odds ratio (RDOR) of validation studies with inadequate sample size was 1.9 (95% CI: 1.2-3.1) compared to studies with adequate sample size. Study site, reliability, and clinical prediction rule were adequately described in 10.1%, 9.4%, and 7.0% of validation studies respectively. Validation studies with design shortcomings may overestimate the performance of clinical prediction rules. The quality of reporting among studies validating clinical prediction rules needs to be improved.
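
    The diagnostic odds ratio (DOR) used as the performance measure in this record is the odds of a positive result among those with the condition divided by the odds of a positive result among those without it. A ratio of two DORs then compares one group of studies with another. The Python sketch below shows the single-study calculation with a log-scale confidence interval. It is an illustration only: the function names and counts are hypothetical, and it does not reproduce the meta-analytic model the authors used to pool studies.

```python
import math


def diagnostic_odds_ratio(tp, fp, fn, tn, z=1.96):
    """DOR with an approximate 95% confidence interval from a 2x2 table.

    DOR = (tp * tn) / (fp * fn); the interval is built on the log scale
    using SE(log DOR) = sqrt(1/tp + 1/fp + 1/fn + 1/tn).
    All four cells must be non-zero.
    """
    if min(tp, fp, fn, tn) <= 0:
        raise ValueError("all four cells must be positive")
    dor = (tp * tn) / (fp * fn)
    se_log = math.sqrt(1 / tp + 1 / fp + 1 / fn + 1 / tn)
    lower = math.exp(math.log(dor) - z * se_log)
    upper = math.exp(math.log(dor) + z * se_log)
    return dor, lower, upper


def relative_dor(dor_a, dor_b):
    """Ratio of two DORs, e.g. one study design against another."""
    return dor_a / dor_b


# Hypothetical validation study (not data from the review).
dor, lower, upper = diagnostic_odds_ratio(tp=80, fp=20, fn=20, tn=80)
print(f"DOR = {dor:.1f} (95% CI {lower:.1f} to {upper:.1f})")
```

    A relative DOR above 1 for a design feature means studies with that feature reported better apparent performance, which is how the figures of 2.2 and 1.9 in the record are interpreted.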

  7. Validation of GC and HPLC systems for residue studies

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Williams, M.

    1995-12-01

    For residue studies, GC and HPLC system performance must be validated prior to and during use. One excellent measure of system performance is the standard curve and associated chromatograms used to construct that curve. The standard curve is a model of system response to an analyte over a specific time period, and is prima facie evidence of system performance beginning at the auto sampler and proceeding through the injector, column, detector, electronics, data-capture device, and printer/plotter. This tool measures the performance of the entire chromatographic system; its power negates most of the benefits associated with costly and time-consuming validation of individual system components. Other measures of instrument and method validation will be discussed, including quality control charts and experimental designs for method validation.
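
    The standard curve described in this record models detector response as a function of analyte concentration. A minimal Python sketch follows, assuming a straight-line response fitted by ordinary least squares. The abstract does not specify the curve form, and the function names, units and peak areas are hypothetical.

```python
def fit_standard_curve(concentrations, responses):
    """Least-squares line through (concentration, response) standards.

    Returns (slope, intercept, r_squared).
    """
    n = len(concentrations)
    if n != len(responses) or n < 3:
        raise ValueError("need at least three paired standards")
    mean_x = sum(concentrations) / n
    mean_y = sum(responses) / n
    sxx = sum((x - mean_x) ** 2 for x in concentrations)
    sxy = sum((x - mean_x) * (y - mean_y)
              for x, y in zip(concentrations, responses))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    ss_res = sum((y - (slope * x + intercept)) ** 2
                 for x, y in zip(concentrations, responses))
    ss_tot = sum((y - mean_y) ** 2 for y in responses)
    return slope, intercept, 1.0 - ss_res / ss_tot


def concentration_from_response(response, slope, intercept):
    """Invert the fitted line to estimate an unknown sample's concentration."""
    return (response - intercept) / slope


# Hypothetical standards (ng/mL) and detector peak areas.
standards = [0.5, 1.0, 2.0, 4.0, 8.0]
peak_areas = [52.0, 101.0, 205.0, 398.0, 803.0]
slope, intercept, r_squared = fit_standard_curve(standards, peak_areas)
print(f"slope={slope:.2f}, intercept={intercept:.2f}, r^2={r_squared:.4f}")
```

    Tracking the slope, intercept and goodness of fit from run to run is one way such a curve can serve as a check on the whole chromatographic system rather than on individual components.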

  8. Studying the neurobiology of human social interaction: Making the case for ecological validity.

    PubMed

    Hogenelst, Koen; Schoevers, Robert A; aan het Rot, Marije

    2015-01-01

    With this commentary we make the case for an increased focus on the ecological validity of the measures used to assess aspects of human social functioning. Impairments in social functioning are seen in many types of psychopathology, negatively affecting the lives of psychiatric patients and those around them. Yet the neurobiology underlying abnormal social interaction remains unclear. As an example of human social neuroscience research with relevance to biological psychiatry and clinical psychopharmacology, this commentary discusses published experimental studies involving manipulation of the human brain serotonin system that included assessments of social behavior. To date, these studies have mostly been laboratory-based and included computer tasks, observations by others, or single-administration self-report measures. Most laboratory measures used so far inform about the role of serotonin in aspects of social interaction, but the relevance for real-life interaction is often unclear. Few studies have used naturalistic assessments in real life. We suggest several laboratory methods with high ecological validity as well as ecological momentary assessment, which involves intensive repeated measures in naturalistic settings. In sum, this commentary intends to stimulate experimental research on the neurobiology of human social interaction as it occurs in real life.

  9. Improving medical record retrieval for validation studies in Medicare data.

    PubMed

    Wright, Nicole C; Delzell, Elizabeth S; Smith, Wilson K; Xue, Fei; Auroa, Tarun; Curtis, Jeffrey R

    2017-04-01

    The purpose of the study is to describe medical record retrieval for a study validating claims-based algorithms used to identify seven adverse events of special interest (AESI) in a Medicare population. We analyzed 2010-2011 Medicare claims of women with postmenopausal osteoporosis and men ≥65 years of age in the Medicare 5% national sample. The final cohorts included beneficiaries covered continuously for 12+ months by Medicare parts A, B, and D and not enrolled in Medicare Advantage before starting follow-up. We identified beneficiaries using each AESI algorithm and randomly selected 400 women and 100 men with each AESI for medical record retrieval. The Centers for Medicare and Medicaid Services provided beneficiary contact information, and we requested medical records directly from providers, without patient contact. We selected 3331 beneficiaries (women: 2272; men: 559) for whom we requested 3625 medical records. Overall, we received 1738 [47.9% (95%CI 46.3%, 49.6%)] of the requested medical records. We observed small differences in the characteristics of the total population with AESIs compared with those randomly selected for retrieval; however, no differences were seen between those selected and those retrieved. We retrieved 54.7% of records requested from hospitals compared with 26.3% of records requested from physician offices (p < 0.001). Retrieval did not differ by sex or vital status of the beneficiaries. Our national medical record validation study of claims-based algorithms produced a modest retrieval rate. The medical record procedures outlined in this paper could have led to the improved retrieval from our previous medical record retrieval study. Copyright © 2016 John Wiley & Sons, Ltd.
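
    The overall retrieval rate in this record, 1738 of 3625 requested records, can be checked with a few lines of Python. The sketch assumes a normal-approximation (Wald) interval. The abstract does not state which interval method was used, although this one does reproduce the reported figures after rounding. The function name is an illustrative choice.

```python
import math


def wald_interval(successes, n, z=1.96):
    """Proportion with a normal-approximation (Wald) confidence interval."""
    p = successes / n
    half_width = z * math.sqrt(p * (1 - p) / n)
    return p, p - half_width, p + half_width


# Counts taken from the abstract: 1738 records received of 3625 requested.
rate, lower, upper = wald_interval(1738, 3625)
print(f"{rate:.1%} (95% CI {lower:.1%}, {upper:.1%})")
```

    For proportions near 0 or 1, or for small samples, a Wilson or exact interval is usually preferred over the Wald interval, but with a rate near 50% and several thousand records the methods agree closely.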

  10. Concurrent Validity Between Live and Home Video Observations Using the Alberta Infant Motor Scale

    PubMed Central

    van Dam, Ellen; van Haastert, Ingrid C.; Nuysink, Jacqueline

    2017-01-01

    Purpose: Serial assessment of gross motor development of infants at risk is an established procedure in neonatal follow-up clinics. Assessments based on home video recordings could be a relevant addition. Methods: In 48 infants (1.5-19 months), the concurrent validity of 2 applications was examined using the Alberta Infant Motor Scale: (1) a home video made by parents and (2) simultaneous observation on-site by a pediatric physical therapist. Parents' experiences were explored using a questionnaire. Results: The intraclass correlation coefficient agreement between live and home video assessment was 0.99, with a standard error of measurement of 1.41 items. Intra- and interrater reliability: intraclass correlation coefficients were more than 0.99. According to 94% of the parents, recording their infant's movement repertoire was easy to perform. Conclusion: Assessing the Alberta Infant Motor Scale based on home video recordings is comparable to assessment by live observation. The video method is a promising application that can be used with low burden for parents and infants. PMID:28350771

  11. Middle Childhood Attachment Strategies: validation of an observational measure.

    PubMed

    Brumariu, Laura E; Giuseppone, Kathryn R; Kerns, Kathryn A; Van de Walle, Magali; Bureau, Jean-François; Bosmans, Guy; Lyons-Ruth, Karlen

    2018-02-05

    The purpose of this study was to assess behavioral manifestations of attachment in middle childhood, and to evaluate their relations with key theoretical correlates. The sample consisted of 87 children (aged 10-12 years) and their mothers. Dyads participated in an 8-min videotaped discussion of a conflict in their relationships, later scored with the Middle Childhood Attachment Strategies Coding System (MCAS) for key features of all child attachment patterns described in previous literature (secure, ambivalent, avoidant, disorganized-disoriented, caregiving/role-confused, hostile/punitive). To assess validity, relations among MCAS dimensions and other measures of attachment, parenting, and psychological adjustment were evaluated. Results provide preliminary evidence for the psychometric properties of the MCAS in that its behaviorally assessed patterns were associated with theoretically relevant constructs, including maternal warmth/acceptance and psychological control, and children's social competence, depression, and behavioral problems. The MCAS opens new grounds for expanding our understanding of attachment and its outcomes in middle childhood.

  12. 40 CFR 152.93 - Citation of a previously submitted valid study.

    Code of Federal Regulations, 2010 CFR

    2010-07-01

    ... Data Submitters' Rights § 152.93 Citation of a previously submitted valid study. An applicant may demonstrate compliance for a data requirement by citing a valid study previously submitted to the Agency. The... the original data submitter, the applicant may cite the study only in accordance with paragraphs (b...

  13. The statistical validity of nursing home survey findings.

    PubMed

    Woolley, Douglas C

    2011-11-01

    The Medicare nursing home survey is a high-stakes process whose findings greatly affect nursing homes, their current and potential residents, and the communities they serve. Therefore, survey findings must achieve high validity. This study looked at the validity of one key assessment made during a nursing home survey: the observation of the rate of errors in administration of medications to residents (med-pass). Statistical analysis of the case under study and of alternative hypothetical cases. A skilled nursing home affiliated with a local medical school. The nursing home administrators and the medical director. Observational study. The probability that state nursing home surveyors make a Type I or Type II error in observing med-pass error rates, based on the current case and on a series of postulated med-pass error rates. In the common situation such as our case, where med-pass errors occur at slightly above a 5% rate after 50 observations, and therefore trigger a citation, the chance that the true rate remains above 5% after a large number of observations is just above 50%. If the true med-pass error rate were as high as 10%, and the survey team wished to achieve 75% accuracy in determining that a citation was appropriate, they would have to make more than 200 med-pass observations. In the more common situation where med pass errors are closer to 5%, the team would have to observe more than 2000 med-passes to achieve even a modest 75% accuracy in their determinations. In settings where error rates are low, large numbers of observations of an activity must be made to reach acceptable validity of estimates for the true rates of errors. In observing key nursing home functions with current methodology, the State Medicare nursing home survey process does not adhere to well-known principles of valid error determination. Alternate approaches in survey methodology are discussed. Copyright © 2011 American Medical Directors Association. Published by Elsevier Inc. All
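
    The argument above turns on how often a finite set of med-pass observations lands above the 5% citation threshold for a given true error rate. The following is a minimal sketch under a simple binomial model, which is an assumption; the abstract does not describe the authors' exact calculation, and the function name and the example rates are hypothetical.

```python
from math import exp, lgamma, log

def prob_observed_rate_exceeds(n_obs, true_rate, threshold_pct=5):
    """P(observed error rate > threshold_pct %) for Binomial(n_obs, true_rate).

    Assumption: independent observations with a constant true error rate
    strictly between 0 and 1. Terms are computed in log space so that large
    n_obs does not overflow.
    """
    # Smallest error count whose observed rate is strictly above the threshold.
    k_min = (n_obs * threshold_pct) // 100 + 1
    total = 0.0
    for k in range(k_min, n_obs + 1):
        log_term = (
            lgamma(n_obs + 1) - lgamma(k + 1) - lgamma(n_obs - k + 1)
            + k * log(true_rate) + (n_obs - k) * log(1 - true_rate)
        )
        total += exp(log_term)
    return total

# Hypothetical true rates on either side of the 5% threshold.
for n_obs in (50, 200, 2000):
    for true_rate in (0.04, 0.06, 0.10):
        p = prob_observed_rate_exceeds(n_obs, true_rate)
        print(f"n={n_obs:5d} true rate={true_rate:.0%} P(observed > 5%)={p:.3f}")
```

    With a true rate near the threshold, the probability of a citation stays close to a coin flip until the number of observations becomes large, which is the qualitative point the abstract makes.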

  14. Turkish Version of Kolcaba's Immobilization Comfort Questionnaire: A Validity and Reliability Study.

    PubMed

    Tosun, Betül; Aslan, Özlem; Tunay, Servet; Akyüz, Aygül; Özkan, Hüseyin; Bek, Doğan; Açıksöz, Semra

    2015-12-01

    The purpose of this study was to determine the validity and reliability of the Turkish version of the Immobilization Comfort Questionnaire (ICQ). The sample used in this methodological study consisted of 121 patients undergoing lower extremity arthroscopy in a training and research hospital. The validity study of the questionnaire assessed language validity, structural validity and criterion validity. Structural validity was evaluated via exploratory factor analysis. Criterion validity was evaluated by assessing the correlation between the visual analog scale (VAS) scores (i.e., the comfort and pain VAS scores) and the ICQ scores using Spearman's correlation test. The Kaiser-Meyer-Olkin coefficient and Bartlett's test of sphericity were used to determine the suitability of the data for factor analysis. Internal consistency was evaluated to determine reliability. The data were analyzed with SPSS version 15.00 for Windows. Descriptive statistics were presented as frequencies, percentages, means and standard deviations. A p value ≤ .05 was considered statistically significant. A moderate positive correlation was found between the ICQ scores and the VAS comfort scores; a moderate negative correlation was found between the ICQ and the VAS pain measures in the criterion validity analysis. Cronbach α values of .75 and .82 were found for the first and second measurements, respectively. The findings of this study reveal that the ICQ is a valid and reliable tool for assessing the comfort of patients in Turkey who are immobilized because of lower extremity orthopedic problems. Copyright © 2015. Published by Elsevier B.V.
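
    Internal consistency in the abstract above is reported as Cronbach's α. As a reminder of what that statistic computes, here is a minimal sketch of the standard formula; the scores are made-up illustration data, not values from the study, and the function name is hypothetical.

```python
from statistics import variance

def cronbach_alpha(items):
    """Cronbach's alpha for a list of per-item score lists.

    items[i][j] is respondent j's score on item i. Uses the standard formula
    alpha = k/(k-1) * (1 - sum(item variances) / variance(total scores)).
    """
    k = len(items)
    totals = [sum(scores) for scores in zip(*items)]  # per-respondent totals
    item_var_sum = sum(variance(scores) for scores in items)
    return k / (k - 1) * (1 - item_var_sum / variance(totals))

# Hypothetical scores: 3 questionnaire items answered by 5 respondents.
example_items = [
    [4, 3, 5, 2, 4],
    [5, 3, 4, 2, 5],
    [4, 2, 5, 3, 4],
]
print(f"alpha = {cronbach_alpha(example_items):.2f}")
```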

  15. Benchmarking Controlled Trial--a novel concept covering all observational effectiveness studies.

    PubMed

    Malmivaara, Antti

    2015-06-01

    The Benchmarking Controlled Trial (BCT) is a novel concept which covers all observational studies aiming to assess effectiveness. BCTs provide evidence of the comparative effectiveness between health service providers, and of effectiveness due to particular features of the health and social care systems. BCTs complement randomized controlled trials (RCTs) as the sources of evidence on effectiveness. This paper presents a definition of the BCT; compares the position of BCTs in assessing effectiveness with that of RCTs; presents a checklist for assessing methodological validity of a BCT; and pilot-tests the checklist with BCTs published recently in the leading medical journals.

  16. Benchmarking Controlled Trial—a novel concept covering all observational effectiveness studies

    PubMed Central

    Malmivaara, Antti

    2015-01-01

    The Benchmarking Controlled Trial (BCT) is a novel concept which covers all observational studies aiming to assess effectiveness. BCTs provide evidence of the comparative effectiveness between health service providers, and of effectiveness due to particular features of the health and social care systems. BCTs complement randomized controlled trials (RCTs) as the sources of evidence on effectiveness. This paper presents a definition of the BCT; compares the position of BCTs in assessing effectiveness with that of RCTs; presents a checklist for assessing methodological validity of a BCT; and pilot-tests the checklist with BCTs published recently in the leading medical journals. PMID:25965700

  17. An algorithm to generate input data from meteorological and space shuttle observations to validate a CH4-CO model

    NASA Technical Reports Server (NTRS)

    Peters, L. K.; Yamanis, J.

    1981-01-01

    Objective procedures to analyze data from meteorological and space shuttle observations to validate a three dimensional model were investigated. The transport and chemistry of carbon monoxide and methane in the troposphere were studied. Four aspects were examined: (1) detailed evaluation of the variational calculus procedure, with the equation of continuity as a strong constraint, for adjustment of global tropospheric wind fields; (2) reduction of the National Meteorological Center (NMC) data tapes for data input to the OSTA-1/MAPS Experiment; (3) interpolation of the NMC Data for input to the CH4-CO model; and (4) temporal and spatial interpolation procedures of the CO measurements from the OSTA-1/MAPS Experiment to generate usable contours of the data.

  18. Balancing the Evidence: How to Reconcile the Results of Observational Studies vs. Randomized Clinical Trials in Dialysis.

    PubMed

    Shen, Jenny I; Lum, Erik L; Chang, Tara I

    2016-09-01

    Because large randomized clinical trials (RCTs) in dialysis have been relatively scarce, evidence-based dialysis care has depended heavily on the results of observational studies. However, when results from RCTs appear to contradict the findings of observational studies, nephrologists are left to wonder which type of study they should believe. In this editorial, we explore the key differences between observational studies and RCTs in the context of such seemingly conflicting studies in dialysis. Confounding is the major limitation of observational studies, whereas low statistical power and problems with external validity are more likely to limit the findings of RCTs. Differences in the specification of the population, exposure, and outcomes can also contribute to different results among RCTs and observational studies. Rigorous methods are required regardless of what type of study is conducted, and readers should not automatically assume that one type of study design is superior to the other. Ultimately, dialysis care requires both well-designed, well-conducted observational studies and RCTs to move the field forward. © 2016 Wiley Periodicals, Inc.

  19. Balancing the Evidence: How to Reconcile the Results of Observational Studies vs. Randomized Clinical Trials in Dialysis

    PubMed Central

    Shen, Jenny I.; Lum, Erik L.; Chang, Tara I.

    2016-01-01

    Because large randomized clinical trials (RCTs) in dialysis have been relatively scarce, evidence-based dialysis care has depended heavily on the results of observational studies. However, when results from RCTs appear to contradict the findings of observational studies, nephrologists are left to wonder which type of study they should believe. In this editorial we explore the key differences between observational studies and RCTs in the context of such seemingly conflicting studies in dialysis. Confounding is the major limitation of observational studies, while low statistical power and problems with external validity are more likely to limit the findings of RCTs. Differences in the specification of the population, exposure, and outcomes can also contribute to different results among RCTs and observational studies. Rigorous methods are required regardless of what type of study is conducted, and readers should not automatically assume that one type of study design is superior to the other. Ultimately, dialysis care requires both well-designed, well-conducted observational studies and RCTs to move the field forward. PMID:27207819

  20. Design Characteristics Influence Performance of Clinical Prediction Rules in Validation: A Meta-Epidemiological Study

    PubMed Central

    Ban, Jong-Wook; Emparanza, José Ignacio; Urreta, Iratxe; Burls, Amanda

    2016-01-01

    Background: Many new clinical prediction rules are derived and validated. But the design and reporting quality of clinical prediction research has been less than optimal. We aimed to assess whether design characteristics of validation studies were associated with the overestimation of clinical prediction rules’ performance. We also aimed to evaluate whether validation studies clearly reported important methodological characteristics. Methods: Electronic databases were searched for systematic reviews of clinical prediction rule studies published between 2006 and 2010. Data were extracted from the eligible validation studies included in the systematic reviews. A meta-analytic meta-epidemiological approach was used to assess the influence of design characteristics on predictive performance. From each validation study, it was assessed whether 7 design and 7 reporting characteristics were properly described. Results: A total of 287 validation studies of clinical prediction rules were collected from 15 systematic reviews (31 meta-analyses). Validation studies using case-control design produced a summary diagnostic odds ratio (DOR) 2.2 times (95% CI: 1.2–4.3) larger than validation studies using cohort design and unclear design. When differential verification was used, the summary DOR was overestimated by twofold (95% CI: 1.2–3.1) compared to complete, partial and unclear verification. The summary RDOR of validation studies with inadequate sample size was 1.9 (95% CI: 1.2–3.1) compared to studies with adequate sample size. Study site, reliability, and clinical prediction rule were adequately described in 10.1%, 9.4%, and 7.0% of validation studies respectively. Conclusion: Validation studies with design shortcomings may overestimate the performance of clinical prediction rules. The quality of reporting among studies validating clinical prediction rules needs to be improved. PMID:26730980
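
    For readers unfamiliar with the effect measures in the abstract above, the sketch below shows how a diagnostic odds ratio (DOR) is computed from a 2x2 table and how a relative DOR compares two groups of studies. The counts are hypothetical and the function names are illustrative; the paper's actual meta-analytic model is not reproduced here.

```python
def diagnostic_odds_ratio(tp, fp, fn, tn):
    """DOR = (TP * TN) / (FP * FN) from a 2x2 diagnostic accuracy table."""
    return (tp * tn) / (fp * fn)

def relative_dor(dor_a, dor_b):
    """Relative DOR: how many times larger one DOR is than another."""
    return dor_a / dor_b

# Hypothetical counts for two validation studies of the same rule.
dor_case_control = diagnostic_odds_ratio(tp=90, fp=20, fn=10, tn=80)
dor_cohort = diagnostic_odds_ratio(tp=80, fp=25, fn=20, tn=75)
print(dor_case_control, dor_cohort, relative_dor(dor_case_control, dor_cohort))
```

    A relative DOR above 1 means the first design reports better apparent performance than the second, which is the sense in which the abstract speaks of overestimation.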

  1. The Diaper Change Play: Validation of a New Observational Assessment Tool for Early Triadic Family Interactions in the First Month Postpartum.

    PubMed

    Rime, Jérôme; Tissot, Hervé; Favez, Nicolas; Watson, Michael; Stadlmayr, Werner

    2018-01-01

    The quality of family relations, observed during mother-father-infant triadic interactions, has been shown to be an important contributor to child social and affective development, beyond the quality of dyadic mother-child, father-child, and marital relationships. Triadic interactions have been well described in families with 3-month-olds and older children using the Lausanne Trilogue Play (LTP). Little is known about the development of mother-father-baby interactions in the very 1st weeks postpartum, mostly because no specific observational setting or particular instrument had been designed to cover this age yet. To fill this gap, we adapted the LTP to create a new observational setting, namely the Diaper Change Play (DCP). Interactions are assessed using the Family Alliance Assessment Scales for DCP (FAAS-DCP). We present the validation of the DCP and its coding system, the FAAS-DCP. The three validation studies presented here (44 mother-father-child-triads) involve a sample of parents with 3-week-old infants recruited in two maternity wards (n = 32 and n = 12) in Switzerland. Infants from both sites were all healthy according to their APGAR scores, weight at birth, and scores on the NICU Network Neurobehavioral Scale (NNNS), which was additionally conducted on the twelve infants recruited in one of the maternity wards. Results showed that the "FAAS-DCP" coding system has good psychometric properties, with a good internal consistency and a satisfying reliability among the three independent raters. Finally, the "FAAS-DCP" scores on the interactive dimensions are comparable to the similar dimensions in the FAAS-LTP. The results showed that there is no statistically significant difference on scores between the "FAAS-DCP" and the "FAAS," which is consistent with previous studies underlying stability in triadic interaction patterns from pregnancy to 18 months. These first results indicated that the DCP is a promising observational setting, able to assess the

  2. The Diaper Change Play: Validation of a New Observational Assessment Tool for Early Triadic Family Interactions in the First Month Postpartum

    PubMed Central

    Rime, Jérôme; Tissot, Hervé; Favez, Nicolas; Watson, Michael; Stadlmayr, Werner

    2018-01-01

    The quality of family relations, observed during mother–father–infant triadic interactions, has been shown to be an important contributor to child social and affective development, beyond the quality of dyadic mother–child, father–child, and marital relationships. Triadic interactions have been well described in families with 3-month-olds and older children using the Lausanne Trilogue Play (LTP). Little is known about the development of mother–father–baby interactions in the very 1st weeks postpartum, mostly because no specific observational setting or particular instrument had been designed to cover this age yet. To fill this gap, we adapted the LTP to create a new observational setting, namely the Diaper Change Play (DCP). Interactions are assessed using the Family Alliance Assessment Scales for DCP (FAAS-DCP). We present the validation of the DCP and its coding system, the FAAS-DCP. The three validation studies presented here (44 mother–father–child–triads) involve a sample of parents with 3-week-old infants recruited in two maternity wards (n = 32 and n = 12) in Switzerland. Infants from both sites were all healthy according to their APGAR scores, weight at birth, and scores on the NICU Network Neurobehavioral Scale (NNNS), which was additionally conducted on the twelve infants recruited in one of the maternity wards. Results showed that the “FAAS-DCP” coding system has good psychometric properties, with a good internal consistency and a satisfying reliability among the three independent raters. Finally, the “FAAS-DCP” scores on the interactive dimensions are comparable to the similar dimensions in the FAAS-LTP. The results showed that there is no statistically significant difference on scores between the “FAAS-DCP” and the “FAAS,” which is consistent with previous studies underlying stability in triadic interaction patterns from pregnancy to 18 months. These first results indicated that the DCP is a promising observational

  3. Reproducibility and validity of the Shanghai Women's Health Study physical activity questionnaire.

    PubMed

    Matthews, Charles E; Shu, Xiao-Ou; Yang, Gong; Jin, Fan; Ainsworth, Barbara E; Liu, Dake; Gao, Yu-Tang; Zheng, Wei

    2003-12-01

    In this investigation, the authors evaluated the reproducibility and validity of the Shanghai Women's Health Study (SWHS) physical activity questionnaire (PAQ), which was administered in a cohort study of approximately 75,000 Chinese women aged 40-70 years. Reproducibility (2-year test-retest) was evaluated using kappa statistics and intraclass correlation coefficients (ICCs). Validity was evaluated by comparing Spearman correlations (r) for the SWHS PAQ with two criterion measures administered over a period of 12 months: four 7-day physical activity logs and up to 28 7-day PAQs. Women were recruited from the SWHS cohort (n = 200). Results indicated that the reproducibility of adolescent and adult exercise participation (kappa = 0.85 and kappa = 0.64, respectively) and years of adolescent exercise and adult exercise energy expenditure (ICC = 0.83 and ICC = 0.70, respectively) was reasonable. Reproducibility values for adult lifestyle activities were lower (ICC = 0.14-0.54). Significant correlations between the PAQ and criterion measures of adult exercise were observed for the first PAQ administration (physical activity log, r = 0.50; 7-day PAQ, r = 0.62) and the second PAQ administration (physical activity log, r = 0.74; 7-day PAQ, r = 0.80). Significant correlations between PAQ lifestyle activities and the 7-day PAQ were also noted (r = 0.33-0.88). These data indicate that the SWHS PAQ is a reproducible and valid measure of exercise behaviors and that it demonstrates utility in stratifying women by levels of important lifestyle activities (e.g., housework, walking, cycling).
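
    Reproducibility of categorical items in the abstract above is summarized with kappa statistics. The sketch below shows the usual unweighted Cohen's kappa for two administrations of a yes/no item; the responses are invented for illustration, and whether the authors used a weighted or unweighted variant is not stated in the abstract.

```python
from collections import Counter

def cohens_kappa(first, second):
    """Unweighted Cohen's kappa for two paired lists of categorical ratings."""
    n = len(first)
    observed = sum(a == b for a, b in zip(first, second)) / n
    counts_first, counts_second = Counter(first), Counter(second)
    # Chance agreement: product of marginal proportions, summed over categories.
    expected = sum(
        counts_first[c] * counts_second[c] for c in counts_first
    ) / n**2
    return (observed - expected) / (1 - expected)

# Hypothetical test-retest answers to "exercised regularly as an adult?"
test = ["yes", "yes", "no", "no", "yes", "no", "no", "yes"]
retest = ["yes", "no", "no", "no", "yes", "no", "yes", "yes"]
print(f"kappa = {cohens_kappa(test, retest):.2f}")
```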

  4. Using Lunar Observations to Validate Pointing Accuracy and Geolocation, Detector Sensitivity Stability and Static Point Response of the CERES Instruments

    NASA Technical Reports Server (NTRS)

    Daniels, Janet L.; Smith, G. Louis; Priestley, Kory J.; Thomas, Susan

    2014-01-01

    Validation of in-orbit instrument performance is a function of stability in both instrument and calibration source. This paper describes a method using lunar observations scanning near full moon by the Clouds and Earth Radiant Energy System (CERES) instruments. The Moon offers an external source whose signal variance is predictable and non-degrading. From 2006 to present, these in-orbit observations have become standardized and compiled for the Flight Models -1 and -2 aboard the Terra satellite, for Flight Models -3 and -4 aboard the Aqua satellite, and beginning 2012, for Flight Model-5 aboard Suomi-NPP. Instrument performance measurements studied are detector sensitivity stability, pointing accuracy and static detector point response function. This validation method also shows trends per CERES data channel of 0.8% per decade or less for Flight Models 1-4. Using instrument gimbal data and computed lunar position, the pointing error of each detector telescope, the accuracy and consistency of the alignment between the detectors can be determined. The maximum pointing error was 0.2 Deg. in azimuth and 0.17 Deg. in elevation which corresponds to an error in geolocation near nadir of 2.09 km. With the exception of one detector, all instruments were found to have consistent detector alignment from 2006 to present. All alignment error was within 0.1 Deg. with most detector telescopes showing a consistent alignment offset of less than 0.02 Deg.
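
    The link between an angular pointing error and a ground geolocation error near nadir can be illustrated with simple trigonometry. This sketch assumes a flat-ground approximation directly below the satellite and an orbital altitude of 705 km (the nominal altitude of Terra and Aqua); neither assumption is stated in the abstract, and the function name is illustrative.

```python
import math

def nadir_ground_offset_km(pointing_error_deg, altitude_km):
    """Ground displacement near nadir caused by a small pointing error.

    Assumption: flat ground directly below the satellite, so the offset is
    altitude * tan(angle). Earth curvature is ignored.
    """
    return altitude_km * math.tan(math.radians(pointing_error_deg))

# 0.17 Deg. elevation error from the abstract; 705 km altitude is an assumption.
offset = nadir_ground_offset_km(0.17, 705.0)
print(f"{offset:.2f} km")
```

    Under these assumptions the 0.17 Deg. elevation error maps to about 2.09 km on the ground, in line with the figure quoted in the abstract.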

  5. PLCO Ovarian Phase III Validation Study — EDRN Public Portal

    Cancer.gov

    Our preliminary data indicate that the performance of CA 125 as a screening test for ovarian cancer can be improved upon by additional biomarkers. With completion of one additional validation step, we will be ready to test the performance of a consensus marker panel in a phase III validation study. Given the original aims of the PLCO trial, we believe that the PLCO represents an ideal longitudinal cohort offering specimens for phase III validation of ovarian cancer biomarkers.

  6. Validating a Fidelity Scale to Understand Intervention Effects in Classroom-Based Studies

    ERIC Educational Resources Information Center

    Buckley, Pamela; Moore, Brooke; Boardman, Alison G.; Arya, Diana J.; Maul, Andrew

    2017-01-01

    K-12 intervention studies often include fidelity of implementation (FOI) as a mediating variable, though most do not report the validity of fidelity measures. This article discusses the critical need for validated FOI scales. To illustrate our point, we describe the development and validation of the Implementation Validity Checklist (IVC-R), an…

  7. Dimensions of Intuition: First-Round Validation Studies

    ERIC Educational Resources Information Center

    Vrugtman, Rosanne

    2009-01-01

    This study utilized confirmatory factor analysis (CFA), canonical correlation analysis (CCA), regression analysis (RA), and correlation analysis (CA) for first-round validation of the researcher's Dimensions of Intuition (DOI) instrument. The DOI examined 25 personal characteristics and situations purportedly predictive of intuition. Data was…

  8. How Mathematicians Determine if an Argument Is a Valid Proof

    ERIC Educational Resources Information Center

    Weber, Keith

    2008-01-01

    The purpose of this article is to investigate the mathematical practice of proof validation--that is, the act of determining whether an argument constitutes a valid proof. The results of a study with 8 mathematicians are reported. The mathematicians were observed as they read purported mathematical proofs and made judgments about their validity;…

  9. The Asthma Mobile Health Study, a large-scale clinical observational study using ResearchKit.

    PubMed

    Chan, Yu-Feng Yvonne; Wang, Pei; Rogers, Linda; Tignor, Nicole; Zweig, Micol; Hershman, Steven G; Genes, Nicholas; Scott, Erick R; Krock, Eric; Badgeley, Marcus; Edgar, Ron; Violante, Samantha; Wright, Rosalind; Powell, Charles A; Dudley, Joel T; Schadt, Eric E

    2017-04-01

    The feasibility of using mobile health applications to conduct observational clinical studies requires rigorous validation. Here, we report initial findings from the Asthma Mobile Health Study, a research study, including recruitment, consent, and enrollment, conducted entirely remotely by smartphone. We achieved secure bidirectional data flow between investigators and 7,593 participants from across the United States, including many with severe asthma. Our platform enabled prospective collection of longitudinal, multidimensional data (e.g., surveys, devices, geolocation, and air quality) in a subset of users over the 6-month study period. Consistent trending and correlation of interrelated variables support the quality of data obtained via this method. We detected increased reporting of asthma symptoms in regions affected by heat, pollen, and wildfires. Potential challenges with this technology include selection bias, low retention rates, reporting bias, and data security. These issues require attention to realize the full potential of mobile platforms in research and patient care.

  10. The Asthma Mobile Health Study, a large-scale clinical observational study using ResearchKit

    PubMed Central

    Chan, Yu-Feng Yvonne; Wang, Pei; Rogers, Linda; Tignor, Nicole; Zweig, Micol; Hershman, Steven G; Genes, Nicholas; Scott, Erick R; Krock, Eric; Badgeley, Marcus; Edgar, Ron; Violante, Samantha; Wright, Rosalind; Powell, Charles A; Dudley, Joel T; Schadt, Eric E

    2017-01-01

    The feasibility of using mobile health applications to conduct observational clinical studies requires rigorous validation. Here, we report initial findings from the Asthma Mobile Health Study, a research study, including recruitment, consent, and enrollment, conducted entirely remotely by smartphone. We achieved secure bidirectional data flow between investigators and 7,593 participants from across the United States, including many with severe asthma. Our platform enabled prospective collection of longitudinal, multidimensional data (e.g., surveys, devices, geolocation, and air quality) in a subset of users over the 6-month study period. Consistent trending and correlation of interrelated variables support the quality of data obtained via this method. We detected increased reporting of asthma symptoms in regions affected by heat, pollen, and wildfires. Potential challenges with this technology include selection bias, low retention rates, reporting bias, and data security. These issues require attention to realize the full potential of mobile platforms in research and patient care. PMID:28288104

  11. Validation of CALIPSO Lidar Observations Using Data From the NASA Langley Airborne High Spectral Resolution Lidar

    NASA Technical Reports Server (NTRS)

    Hostetler, Chris; Hair, Johnathan; Liu, Zhaoyan; Ferrare, Rich; Harper, David; Cook, Anthony; Vaughan, Mark; Trepte, Chip; Winker, David

    2006-01-01

    This poster focuses on preliminary comparisons of data from the Cloud-Aerosol Lidar with Orthogonal Polarization (CALIOP) instrument on the Cloud-Aerosol Lidar and Infrared Pathfinder Satellite Observations (CALIPSO) spacecraft with data acquired by the NASA Langley Airborne High Spectral Resolution Lidar (HSRL). A series of 20 aircraft validation flights was conducted from 14 June through 27 September 2006, under both day and night lighting conditions and a variety of aerosol and cloud conditions. This poster presents comparisons of CALIOP measurements of attenuated backscatter at 532 and 1064 nm and depolarization at 532 nm with near coincident measurements from the Airborne HSRL as a preliminary assessment of CALIOP calibration accuracy. Note that the CALIOP data presented here are the pre-release version. These data have known artifacts in calibration which have been corrected in the December 8 CALIPSO data release which was not available at the time the comparisons were conducted for this poster. The HSRL data are also preliminary. No artifacts are known to exist; however, refinements in calibration and algorithms are likely to be implemented before validation comparisons are made final.

  12. APPROACHES TO ASSESSING THE VALIDITY OF A FUNCTIONAL OBSERVATIONAL BATTERY

    EPA Science Inventory

    With the growing importance of neurobehavioral assessments at the preliminary stage of chemical testing, it is critical that the screening procedures utilized be valid indicators of neurobehavioral dysfunction in addition to being sensitive, specific, and reliable. Efforts in this...

  13. A Primer on Observational Measurement.

    PubMed

    Girard, Jeffrey M; Cohn, Jeffrey F

    2016-08-01

    Observational measurement plays an integral role in a variety of scientific endeavors within biology, psychology, sociology, education, medicine, and marketing. The current article provides an interdisciplinary primer on observational measurement; in particular, it highlights recent advances in observational methodology and the challenges that accompany such growth. First, we detail the various types of instrument that can be used to standardize measurements across observers. Second, we argue for the importance of validity in observational measurement and provide several approaches to validation based on contemporary validity theory. Third, we outline the challenges currently faced by observational researchers pertaining to measurement drift, observer reactivity, reliability analysis, and time/expense. Fourth, we describe recent advances in computer-assisted measurement, fully automated measurement, and statistical data analysis. Finally, we identify several key directions for future observational research to explore.

  14. Verification and Validation of NASA-Supported Enhancements to the Near Real Time Harmful Algal Blooms Observing System (HABSOS)

    NASA Technical Reports Server (NTRS)

    Spruce, Joseph P.; Hall, Callie; McPherson, Terry; Spiering, Bruce; Brown, Richard; Estep, Lee; Lunde, Bruce; Guest, DeNeice; Navard, Andy; Pagnutti, Mary

    2006-01-01

    This report discusses verification and validation (V&V) assessment of Moderate Resolution Imaging Spectroradiometer (MODIS) ocean data products contributed by the Naval Research Laboratory (NRL) and Applied Coherent Technologies (ACT) Corporation to the National Oceanic and Atmospheric Administration's (NOAA) Near Real Time (NRT) Harmful Algal Blooms Observing System (HABSOS). HABSOS is a maturing decision support tool (DST) used by NOAA and its partners involved with coastal and public health management.

  15. Postcraniometric sex and ancestry estimation in South Africa: a validation study.

    PubMed

    Liebenberg, Leandi; Krüger, Gabriele C; L'Abbé, Ericka N; Stull, Kyra E

    2018-05-24

    With the acceptance of the Daubert criteria as the standards for best practice in forensic anthropological research, more emphasis is being placed on the validation of published methods. Methods, both traditional and novel, need to be validated, adjusted, and refined for optimal performance within forensic anthropological analyses. Recently, a custom postcranial database of modern South Africans was created for use in Fordisc 3.1. Classification accuracies of up to 85% for ancestry estimation and 98% for sex estimation were achieved using a multivariate approach. To measure the external validity and report more realistic performance statistics, an independent sample was tested. The postcrania from 180 black, white, and colored South Africans were measured and classified using the custom postcranial database. A decrease in accuracy was observed for both ancestry estimation (79%) and sex estimation (95%) of the validation sample. When incorporating both sex and ancestry simultaneously, the method achieved 70% accuracy, and 79% accuracy when sex-specific ancestry analyses were run. Classification matrices revealed that postcrania were more likely to misclassify as a result of ancestry rather than sex. While both sex and ancestry influence the size of an individual, sex differences are more marked in the postcranial skeleton and are therefore easier to identify. The external validity of the postcranial database was verified and therefore shown to be a useful tool for forensic casework in South Africa. While the classification rates were slightly lower than the original method, this is expected when a method is generalized.

  16. Test of Creative Imagination: Validity and Reliability Study

    ERIC Educational Resources Information Center

    Gundogan, Aysun; Ari, Meziyet; Gonen, Mubeccel

    2013-01-01

    The purpose of this study was to investigate the validity and reliability of the test of creative imagination. This study was conducted with the participation of 1000 children, aged 9-14, who were studying in six primary schools in the city center of Denizli Province and were chosen by cluster ratio sampling. In the study, it was revealed that the…

  17. Aircraft Wake Vortex Spacing System (AVOSS) Performance Update and Validation Study

    NASA Technical Reports Server (NTRS)

    Rutishauser, David K.; O'Connor, Cornelius J.

    2001-01-01

    An analysis has been performed on data generated from the two most recent field deployments of the Aircraft Wake VOrtex Spacing System (AVOSS). The AVOSS provides reduced aircraft spacing criteria for wake vortex avoidance as compared to the FAA spacing applied under Instrument Flight Rules (IFR). Several field deployments culminating in a system demonstration at Dallas Fort Worth (DFW) International Airport in the summer of 2000 were successful in showing a sound operational concept and the system's potential to provide a significant benefit to airport operations. For DFW, a predicted average throughput increase of 6% was observed. This increase implies 6 or 7 more aircraft on the ground in a one-hour period for DFW operations. Several studies of performance correlations to system configuration options, design options, and system inputs are also reported. The studies focus on the validation performance of the system.

  18. Use of the Environment and Policy Evaluation and Observation as a Self-Report Instrument (EPAO-SR) to measure nutrition and physical activity environments in child care settings: validity and reliability evidence.

    PubMed

    Ward, Dianne S; Mazzucca, Stephanie; McWilliams, Christina; Hales, Derek

    2015-09-26

    Early care and education (ECE) centers are important settings influencing young children's diet and physical activity (PA) behaviors. To better understand their impact on diet and PA behaviors as well as to evaluate public health programs aimed at ECE settings, we developed and tested the Environment and Policy Assessment and Observation - Self-Report (EPAO-SR), a self-administered version of the previously validated, researcher-administered EPAO. Development of the EPAO-SR instrument included modification of items from the EPAO, community advisory group and expert review, and cognitive interviews with center directors and classroom teachers. Reliability and validity data were collected across 4 days in 3-5 year old classrooms in 50 ECE centers in North Carolina. Center teachers and directors completed relevant portions of the EPAO-SR on multiple days according to a standardized protocol, and trained data collectors completed the EPAO for 4 days in the centers. Reliability and validity statistics calculated included percent agreement, kappa, correlation coefficients, coefficients of variation, deviations, mean differences, and intraclass correlation coefficients (ICC), depending on the response option of the item. Data demonstrated a range of reliability and validity evidence for the EPAO-SR instrument. Reporting from directors and classroom teachers was consistent and similar to the observational data. Items that produced strongest reliability and validity estimates included beverages served, outside time, and physical activity equipment, while items such as whole grains served and amount of teacher-led PA had lower reliability (observation and self-report) and validity estimates. To overcome lower reliability and validity estimates, some items need administration on multiple days. This study demonstrated appropriate reliability and validity evidence for use of the EPAO-SR in the field. The self-administered EPAO-SR is an advancement of the measurement of ECE

  19. AIRS Retrieval Validation During the EAQUATE

    NASA Technical Reports Server (NTRS)

    Zhou, Daniel K.; Smith, William L.; Cuomo, Vincenzo; Taylor, Jonathan P.; Barnet, Christopher D.; DiGirolamo, Paolo; Pappalardo, Gelsomina; Larar, Allen M.; Liu, Xu; Newman, Stuart M.

    2006-01-01

    Atmospheric and surface thermodynamic parameters retrieved with advanced hyperspectral remote sensors of Earth observing satellites are critical for weather prediction and scientific research. The retrieval algorithms and retrieved parameters from satellite sounders must be validated to demonstrate the capability and accuracy of both observation and data processing systems. The European AQUA Thermodynamic Experiment (EAQUATE) was conducted mainly for validation of the Atmospheric InfraRed Sounder (AIRS) on the AQUA satellite, but also for assessment of validation systems of both ground-based and aircraft-based instruments which will be used for other satellite systems such as the Infrared Atmospheric Sounding Interferometer (IASI) on the European MetOp satellite, the Cross-track Infrared Sounder (CrIS) from the NPOESS Preparatory Project and the following NPOESS series of satellites. Detailed inter-comparisons were conducted and presented using different retrieval methodologies: measurements from airborne ultraspectral Fourier transform spectrometers, aircraft in-situ instruments, dedicated dropsondes and radiosondes, and ground based Raman Lidar, as well as from the European Center for Medium range Weather Forecasting (ECMWF) modeled thermal structures. The results of this study not only illustrate the quality of the measurements and retrieval products but also demonstrate the capability of these validation systems which are put in place to validate current and future hyperspectral sounding instruments and their scientific products.

  20. Observational studies in systematic [corrected] reviews of comparative effectiveness: AHRQ and the Effective Health Care Program.

    PubMed

    Norris, Susan L; Atkins, David; Bruening, Wendy; Fox, Steven; Johnson, Eric; Kane, Robert; Morton, Sally C; Oremus, Mark; Ospina, Maria; Randhawa, Gurvaneet; Schoelles, Karen; Shekelle, Paul; Viswanathan, Meera

    2011-11-01

    Systematic reviewers disagree about the ability of observational studies to answer questions about the benefits or intended effects of pharmacotherapeutic, device, or procedural interventions. This study provides a framework for decision making on the inclusion of observational studies to assess benefits and intended effects in comparative effectiveness reviews (CERs). The conceptual model and recommendations were developed using a consensus process by members of the methods workgroup of the Effective Health Care Program of the Agency for Healthcare Research and Quality. In considering whether to use observational studies in CERs for addressing beneficial effects, reviewers should answer two questions: (1) Are there gaps in the evidence from randomized controlled trials (RCTs)? (2) Will observational studies provide valid and useful information? The latter question involves the following: (a) refocusing the study questions on gaps in the evidence from RCTs, (b) assessing the risk of bias of the body of evidence of observational studies, and (c) assessing whether available observational studies address the gap review questions. Because it is unusual to find sufficient evidence from RCTs to answer all key questions concerning benefit or the balance of benefits and harms, comparative effectiveness reviewers should routinely assess the appropriateness of inclusion of observational studies for questions of benefit. Furthermore, reviewers should explicitly state the rationale for inclusion or exclusion of observational studies when conducting CERs. Copyright © 2011 Elsevier Inc. All rights reserved.

  1. A reliability and validity study of the Palliative Performance Scale

    PubMed Central

    Ho, Francis; Lau, Francis; Downing, Michael G; Lesperance, Mary

    2008-01-01

    Background: The Palliative Performance Scale (PPS) was first introduced in 1996 as a new tool for measurement of performance status in palliative care. PPS has been used in many countries and has been translated into other languages. Methods: This study evaluated the reliability and validity of PPS. A web-based, case scenarios study with a test-retest format was used to determine reliability. Fifty-three participants were recruited and randomly divided into two groups, each evaluating 11 cases at two time points. The validity study was based on the content validation of 15 palliative care experts conducted over telephone interviews, with discussion on five themes: PPS as clinical assessment tool, the usefulness of PPS, PPS scores affecting decision making, the problems in using PPS, and the adequacy of PPS instruction. Results: The intraclass correlation coefficients for absolute agreement were 0.959 and 0.964 for Group 1, at Time-1 and Time-2; 0.951 and 0.931 for Group 2, at Time-1 and Time-2 respectively. Results showed that the participants were consistent in their scoring over the two times, with a mean Cohen's kappa of 0.67 for Group 1 and 0.71 for Group 2. In the validity study, all experts agreed that PPS is a valuable clinical assessment tool in palliative care. Many of them have already incorporated PPS as part of their practice standard. Conclusion: The results of the reliability study demonstrated that PPS is a reliable tool. The validity study found that most experts did not feel a need to further modify PPS, and only two experts requested that some performance status measures be defined more clearly. Areas of PPS use include prognostication, disease monitoring, care planning, hospital resource allocation, clinical teaching and research. PPS is also a good communication tool between palliative care workers. PMID:18680590
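
    The abstract above reports test-retest consistency as Cohen's kappa. As a rough illustration of that statistic only, the following Python sketch computes kappa for one pair of rating lists. The function name and the PPS values are hypothetical, and the abstract does not say how the per-rater kappas were computed or averaged, so this should not be read as the authors' procedure.

```python
from collections import Counter


def cohens_kappa(ratings_a, ratings_b):
    """Cohen's kappa for two equal-length sequences of categorical ratings."""
    if len(ratings_a) != len(ratings_b) or not ratings_a:
        raise ValueError("need two non-empty rating lists of equal length")
    n = len(ratings_a)
    # Observed agreement: share of cases that received identical ratings.
    p_observed = sum(a == b for a, b in zip(ratings_a, ratings_b)) / n
    # Chance agreement, estimated from each list's marginal frequencies.
    freq_a = Counter(ratings_a)
    freq_b = Counter(ratings_b)
    p_expected = sum(freq_a[c] * freq_b[c] for c in freq_a) / (n * n)
    if p_expected == 1.0:
        return 1.0  # both lists use a single identical category throughout
    return (p_observed - p_expected) / (1.0 - p_expected)


# Hypothetical PPS levels (in %) given to the same 11 cases at two time points.
time_1 = [30, 40, 40, 50, 60, 60, 70, 30, 50, 40, 70]
time_2 = [30, 40, 50, 50, 60, 60, 70, 30, 50, 40, 60]
kappa = cohens_kappa(time_1, time_2)
```

    Kappa corrects raw percent agreement for the agreement expected by chance, which is why it is lower than the share of identically rated cases.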

  2. Study of the validity of a job-exposure matrix for the job strain model factors: an update and a study of changes over time.

    PubMed

    Niedhammer, Isabelle; Milner, Allison; LaMontagne, Anthony D; Chastang, Jean-François

    2018-03-08

    The objectives of the study were to construct a job-exposure matrix (JEM) for psychosocial work factors of the job strain model, to evaluate its validity, and to compare the results over time. The study was based on national representative data of the French working population with samples of 46,962 employees (2010 SUMER survey) and 24,486 employees (2003 SUMER survey). Psychosocial work factors included the job strain model factors (Job Content Questionnaire): psychological demands, decision latitude, social support, job strain and iso-strain. Job title was defined by three variables: occupation and economic activity coded using standard classifications, and company size. A JEM was constructed using a segmentation method (Classification and Regression Tree-CART) and cross-validation. The best quality JEM was found using occupation and company size for social support. For decision latitude and psychological demands, there was not much difference using occupation and company size with or without economic activity. The validity of the JEM estimates was higher for decision latitude, job strain and iso-strain, and lower for social support and psychological demands. Differential changes over time were observed for psychosocial work factors according to occupation, economic activity and company size. This study demonstrated that company size in addition to occupation may improve the validity of JEMs for psychosocial work factors. These matrices may be time-dependent and may need to be updated over time. More research is needed to assess the validity of JEMs given that these matrices may be able to provide exposure assessments to study a range of health outcomes.

  3. VALUE - A Framework to Validate Downscaling Approaches for Climate Change Studies

    NASA Astrophysics Data System (ADS)

    Maraun, Douglas; Widmann, Martin; Gutiérrez, José M.; Kotlarski, Sven; Chandler, Richard E.; Hertig, Elke; Wibig, Joanna; Huth, Radan; Wilcke, Renate A. I.

    2015-04-01

    VALUE is an open European network to validate and compare downscaling methods for climate change research. VALUE aims to foster collaboration and knowledge exchange between climatologists, impact modellers, statisticians, and stakeholders to establish an interdisciplinary downscaling community. A key deliverable of VALUE is the development of a systematic validation framework to enable the assessment and comparison of both dynamical and statistical downscaling methods. Here, we present the key ingredients of this framework. VALUE's main approach to validation is user-focused: starting from a specific user problem, a validation tree guides the selection of relevant validation indices and performance measures. Several experiments have been designed to isolate specific points in the downscaling procedure where problems may occur: what is the isolated downscaling skill? How do statistical and dynamical methods compare? How do methods perform at different spatial scales? Do methods fail in representing regional climate change? How is the overall representation of regional climate, including errors inherited from global climate models? The framework will be the basis for a comprehensive community-open downscaling intercomparison study, but is intended also to provide general guidance for other validation studies.

  4. VALUE: A framework to validate downscaling approaches for climate change studies

    NASA Astrophysics Data System (ADS)

    Maraun, Douglas; Widmann, Martin; Gutiérrez, José M.; Kotlarski, Sven; Chandler, Richard E.; Hertig, Elke; Wibig, Joanna; Huth, Radan; Wilcke, Renate A. I.

    2015-01-01

    VALUE is an open European network to validate and compare downscaling methods for climate change research. VALUE aims to foster collaboration and knowledge exchange between climatologists, impact modellers, statisticians, and stakeholders to establish an interdisciplinary downscaling community. A key deliverable of VALUE is the development of a systematic validation framework to enable the assessment and comparison of both dynamical and statistical downscaling methods. In this paper, we present the key ingredients of this framework. VALUE's main approach to validation is user-focused: starting from a specific user problem, a validation tree guides the selection of relevant validation indices and performance measures. Several experiments have been designed to isolate specific points in the downscaling procedure where problems may occur: what is the isolated downscaling skill? How do statistical and dynamical methods compare? How do methods perform at different spatial scales? Do methods fail in representing regional climate change? How is the overall representation of regional climate, including errors inherited from global climate models? The framework will be the basis for a comprehensive community-open downscaling intercomparison study, but is intended also to provide general guidance for other validation studies.

  5. Validation of the FALL3D ash dispersion model using observations of the 2010 Eyjafjallajökull volcanic ash clouds

    NASA Astrophysics Data System (ADS)

    Folch, A.; Costa, A.; Basart, S.

    2012-03-01

    During April-May 2010 volcanic ash clouds from the Icelandic Eyjafjallajökull volcano reached Europe causing an unprecedented disruption of the EUR/NAT region airspace. Civil aviation authorities banned all flight operations because of the threat posed by volcanic ash to modern turbine aircraft. New quantitative airborne ash mass concentration thresholds, still under discussion, were adopted for discerning regions contaminated by ash. This has implications for ash dispersal models routinely used to forecast the evolution of ash clouds. In this new context, quantitative model validation and assessment of the accuracies of current state-of-the-art models is of paramount importance. The passage of volcanic ash clouds over central Europe, a territory hosting a dense network of meteorological and air quality observatories, generated a quantity of observations unusual for volcanic clouds. From the ground, the cloud was observed by aerosol lidars, lidar ceilometers, sun photometers, other remote-sensing instruments and in-situ collectors. From the air, sondes and multiple aircraft measurements also took extremely valuable in-situ and remote-sensing measurements. These measurements constitute an excellent database for model validation. Here we validate the FALL3D ash dispersal model by comparing model results with ground and airplane-based measurements obtained during the initial 14-23 April 2010 Eyjafjallajökull explosive phase. We run the model at high spatial resolution using as input hourly-averaged observed heights of the eruption column and the total grain size distribution reconstructed from field observations. Model results are then compared against remote ground-based and in-situ aircraft-based measurements, including lidar ceilometers from the German Meteorological Service, aerosol lidars and sun photometers from EARLINET and AERONET networks, and flight missions of the German DLR Falcon aircraft. We find good quantitative agreement, with an error similar to

  6. Numerical studies and metric development for validation of magnetohydrodynamic models on the HIT-SI experiment

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Hansen, C.; Victor, B.

    We present application of three scalar metrics derived from the Biorthogonal Decomposition (BD) technique to evaluate the level of agreement between macroscopic plasma dynamics in different data sets. BD decomposes large data sets, as produced by distributed diagnostic arrays, into principal mode structures without assumptions on spatial or temporal structure. These metrics have been applied to validation of the Hall-MHD model using experimental data from the Helicity Injected Torus with Steady Inductive helicity injection experiment. Each metric provides a measure of correlation between mode structures extracted from experimental data and simulations for an array of 192 surface-mounted magnetic probes. Numerical validation studies have been performed using the NIMROD code, where the injectors are modeled as boundary conditions on the flux conserver, and the PSI-TET code, where the entire plasma volume is treated. Initial results from a comprehensive validation study of high performance operation with different injector frequencies are presented, illustrating application of the BD method. Using a simplified (constant, uniform density and temperature) Hall-MHD model, simulation results agree with experimental observation for two of the three defined metrics when the injectors are driven with a frequency of 14.5 kHz.

  7. Can We Study Autonomous Driving Comfort in Moving-Base Driving Simulators? A Validation Study.

    PubMed

    Bellem, Hanna; Klüver, Malte; Schrauf, Michael; Schöner, Hans-Peter; Hecht, Heiko; Krems, Josef F

    2017-05-01

    To lay the basis for studying autonomous driving comfort using driving simulators, we assessed the behavioral validity of two moving-base simulator configurations by contrasting them with a test-track setting. With increasing level of automation, driving comfort becomes increasingly important. Simulators provide a safe environment to study perceived comfort in autonomous driving. To date, however, no studies have been conducted in relation to comfort in autonomous driving to determine the extent to which results from simulator studies can be transferred to on-road driving conditions. Participants (N = 72) experienced six differently parameterized lane-change and deceleration maneuvers and subsequently rated the comfort of each scenario. One group of participants experienced the maneuvers on a test-track setting, whereas two other groups experienced them in one of two moving-base simulator configurations. We could demonstrate relative and absolute validity for one of the two simulator configurations. Subsequent analyses revealed that the validity of the simulator highly depends on the parameterization of the motion system. Moving-base simulation can be a useful research tool to study driving comfort in autonomous vehicles. However, our results point to a preference for subunity scaling factors for both lateral and longitudinal motion cues, which might be explained by an underestimation of speed in virtual environments. In line with previous studies, we recommend lateral- and longitudinal-motion scaling factors of approximately 50% to 60% in order to obtain valid results for both active and passive driving tasks.

  8. Validating GEOV3 LAI, FAPAR and vegetation cover estimates derived from PROBA-V observations at 333m over Europe

    NASA Astrophysics Data System (ADS)

    Camacho, Fernando; Sánchez, Jorge; Lacaze, Roselyne; Weiss, Marie; Baret, Frédéric; Verger, Aleixandre; Smets, Bruno; Latorre, Consuelo

    2016-04-01

    The Copernicus Global Land Service (http://land.copernicus.eu/global/) is delivering surface biophysical products derived from satellite observations at global scale. Fifteen years of LAI, FAPAR, and vegetation cover (FCOVER) products among other indicators have been generated from SPOT/VGT observations at 1 km spatial resolution (named GEOV1, GEOV2). The continuity of the service since the end of SPOT/VGT mission (May, 2014) is achieved thanks to PROBA-V, which offers observations at a finer spatial resolution (1/3 km). In the context of the FP7 ImagineS project (http://fp7-imagines.eu/), a new algorithm (Weiss et al., this conference), adapted to PROBA-V spectral and spatial characteristics, was designed to provide vegetation products (named GEOV3) as consistent as possible with GEOV1 and GEOV2 whilst providing near real-time estimates required by some users. It is based on neural network techniques completed with a data filtering and smoothing process. The near real-time estimates are improved through a consolidation period of six dekads during which observations are accumulated every new dekad. The validation of these products is mandatory to provide associated uncertainties for efficient use of this source of information. This work presents an early validation over Europe of the GEOV3 LAI, FAPAR and vegetation cover (FCOVER) products derived from PROBA-V observations at 333 m and 10-day frequency during the year 2014. The validation has been conducted in agreement with the CEOS LPV best practices for global LAI products. Several performance criteria were investigated for the GEOV3 modes (near real-time, and successive consolidated estimates) including completeness, spatial and temporal consistency, precision and accuracy. The spatial and temporal consistency was evaluated using as reference PROBA-V GEOV1 and MODC5 1 km similar products using a network of 153 validation sites over Europe (EUVAL). The accuracy was assessed with concomitant data collected

  9. Fun and Games: The Validity of Games for the Study of Conflict

    ERIC Educational Resources Information Center

    Schlenker, Barry R.; Bonoma, Thomas V.

    1978-01-01

    Examines claimed advantages and criticisms of the use of games in the study of social conflict, differentiating the advantages and criticisms into questions of internal validity, external validity, and ecological validity. Available from: Sage Publications, Inc., 275 South Beverly Drive, Beverly Hills, California 90212. (JG)

  10. Simultaneous Observation of Hybrid States for Cyber-Physical Systems: A Case Study of Electric Vehicle Powertrain.

    PubMed

    Lv, Chen; Liu, Yahui; Hu, Xiaosong; Guo, Hongyan; Cao, Dongpu; Wang, Fei-Yue

    2017-08-22

    As a typical cyber-physical system (CPS), the electrified vehicle has become a hot research topic due to its high efficiency and low emissions. In order to develop advanced electric powertrains, accurate estimations of the unmeasurable hybrid states, including discrete backlash nonlinearity and continuous half-shaft torque, are of great importance. In this paper, a novel estimation algorithm for simultaneously identifying the backlash position and half-shaft torque of an electric powertrain is proposed using a hybrid system approach. System models, including the electric powertrain and vehicle dynamics models, are established considering the drivetrain backlash and flexibility, and are also calibrated and validated using vehicle road testing data. Based on the developed system models, the powertrain behavior is represented using hybrid automata according to the piecewise affine property of the backlash dynamics. A hybrid-state observer, which is comprised of a discrete-state observer and a continuous-state observer, is designed for the simultaneous estimation of the backlash position and half-shaft torque. In order to guarantee the stability and reachability, the convergence property of the proposed observer is investigated. The proposed observer is validated under highly dynamical transitions of vehicle states. The validation results demonstrate the feasibility and effectiveness of the proposed hybrid-state observer.

  11. Filling the observational void: Scientific value and quantitative validation of hydrometeorological data from a community-based monitoring programme

    NASA Astrophysics Data System (ADS)

    Walker, David; Forsythe, Nathan; Parkin, Geoff; Gowing, John

    2016-07-01

    This study shows how community-based hydrometeorological monitoring programmes can provide reliable high-quality measurements comparable to formal observations. Time series of daily rainfall, river stage and groundwater levels obtained by a local community in Dangila woreda, northwest Ethiopia, have passed accepted quality control standards and have been statistically validated against formal sources. In a region of low-density and declining formal hydrometeorological monitoring networks, a situation shared by much of the developing world, community-based monitoring can fill the observational void providing improved spatial and temporal characterisation of rainfall, river flow and groundwater levels. Such time series data are invaluable in water resource assessment and management, particularly where, as shown here, gridded rainfall datasets provide gross under or over estimations of rainfall and where groundwater level data are non-existent. Discussions with the local community during workshops held at the setup of the monitoring programme and since have demonstrated that the community have become engaged in the project and have benefited from a greater hydrological knowledge and sense of ownership of their resources. This increased understanding and empowerment is at the relevant scale required for effective community-based participatory management of shallow groundwater and river catchments.

  12. Sensor data validation and reconstruction. Phase 1: System architecture study

    NASA Technical Reports Server (NTRS)

    1991-01-01

    The sensor validation and data reconstruction task reviewed relevant literature and selected applicable validation and reconstruction techniques for further study; analyzed the selected techniques and emphasized those which could be used for both validation and reconstruction; analyzed Space Shuttle Main Engine (SSME) hot fire test data to determine statistical and physical relationships between various parameters; developed statistical and empirical correlations between parameters to perform validation and reconstruction tasks, using a computer aided engineering (CAE) package; and conceptually designed an expert system based knowledge fusion tool, which allows the user to relate diverse types of information when validating sensor data. The host hardware for the system is intended to be a Sun SPARCstation, but could be any RISC workstation with a UNIX operating system and a windowing/graphics system such as Motif or Dataviews. The information fusion tool is intended to be developed using the NEXPERT Object expert system shell, and the C programming language.

  13. Validation of CERES-MODIS Arctic cloud properties using CloudSat/CALIPSO and ARM NSA observations

    NASA Astrophysics Data System (ADS)

    Giannecchini, K.; Dong, X.; Xi, B.; Minnis, P.; Kato, S.

    2011-12-01

    The traditional passive satellite studies of cloud properties in the Arctic are often affected by the complex surface features present across the region. Nominal visual and thermal contrast exists between Arctic clouds and the snow- and ice-covered surfaces beneath them, which can lead to difficulties in satellite retrievals of cloud properties. However, the addition of active sensors to the A-Train constellation of satellites has increased the availability of validation sources for cloud properties derived from passive sensors in the data-sparse high-latitude regions. In this study, Arctic cloud fraction and cloud heights derived from the NASA CERES team (CERES-MODIS) have been compared with CloudSat/CALIPSO and DOE ARM NSA radar-lidar observations over Barrow, AK, for the two-year period from 2007 to 2008. An Arctic-wide comparison of cloud fraction and height between CERES-MODIS and CloudSat/CALIPSO was then conducted for the same time period. The CERES-MODIS cloud properties, which include cloud fraction and cloud effective heights, were retrieved using the 4-channel VISST (Visible Infrared Solar-Infrared Split-window Technique) [Minnis et al., 1995]. CloudSat/CALIPSO cloud fraction and cloud-base and -top heights were from version RelB1 data products determined by both the 94 GHz radar onboard CloudSat and the lidar on CALIPSO with a vertical resolution of 30 m below 8.2 km and 60 m above. To match the surface and satellite observations/retrievals, the ARM surface observations were averaged into 3-hour intervals centered at the time of the satellite overpass, while satellite observations were averaged within a 3°x3° grid box centered on the Barrow site. The preliminary results have shown that all observed CFs have peaks during April-May and September-October, and dips during winter months (January-February) and summer months (June-July) during the study period of 2007-2008. ARM radar-lidar and CloudSat/CALIPSO show generally good agreement in CF (0.79 vs. 0

  14. Development and preliminary validation of an Observation List for detecting mental disorders and social Problems in the elderly in primary and home care (OLP).

    PubMed

    Tak, Erwin C P M; van Hespen, Ariëtte T H; Verhaak, Peter F M; Eekhof, Just; Hopman-Rock, Marijke

    2016-07-01

    Even though the prevalence of mental disorders and social problems is high among elderly patients, it is difficult to detect these in a primary (home) care setting. The goal was the development and preliminary validation of a short observation list to detect six problem areas: anxiety, depression, cognition, suspicion, loneliness, and somatisation. A draft list of indicators identified from a short review of the literature and the opinions of 22 experts was evaluated by general practitioners (GPs) and home care organisations for feasibility. It was then used by GPs and home care personnel to observe patients, who also completed validated tests for psychological disorders (General Health Questionnaire 12 item version (GHQ-12)), depression (Geriatric Depression Scale 15-item version (GDS-15)), anxiety and suspicion (Symptom Checklist-90 (SCL-90)), loneliness (University of California, Los Angeles (UCLA)), somatisation (Illness Attitude Scale (IAS)), and cognition (Mini-Mental State Examination (MMSE)). GPs and home care personnel observed 180 patients (mean age 78.4 years; 66% female) and evaluated the draft list during a regular visit. Cronbach's α was 0.87 for the draft list and ≥0.80 for the draft problem areas (loneliness and suspicion excepted). Principal component analysis identified six components (cognition, depression + loneliness, somatisation, anxiety + suspicion, depression (other signs), and an ambiguous component). Convergent validity was shown for the indicators list as a whole (using the GHQ-12), and the subscales of depression, anxiety, loneliness, cognition, and somatisation. Using pre-set agreed criteria, the list was reduced to 14 final indicators divided over five problem areas. The Observation List for mental disorders and social Problems (OLP) proved to be preliminarily valid, reliable, and feasible for use in primary and home care settings. Copyright © 2015 John Wiley & Sons, Ltd.
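
    The abstract above reports Cronbach's α as its internal-consistency measure. As a minimal sketch of how that coefficient is commonly computed, assuming the standard formula α = k/(k-1) · (1 - Σ item variances / variance of total scores): the function name and the scores below are hypothetical illustrations, not data or code from the study.

```python
from statistics import variance


def cronbach_alpha(item_scores):
    """Cronbach's alpha for a list of items.

    item_scores holds one list per indicator; each inner list has one
    score per observed patient, in the same patient order.
    """
    k = len(item_scores)
    if k < 2:
        raise ValueError("need at least two items")
    # Total score per patient, summed across all indicators.
    totals = [sum(scores) for scores in zip(*item_scores)]
    total_variance = variance(totals)
    if total_variance == 0:
        raise ValueError("total scores have zero variance")
    item_variance_sum = sum(variance(scores) for scores in item_scores)
    return (k / (k - 1)) * (1 - item_variance_sum / total_variance)


# Hypothetical scores: 3 indicators rated for 4 patients.
example_items = [
    [0, 1, 1, 2],
    [0, 1, 2, 2],
    [1, 1, 2, 3],
]
print(round(cronbach_alpha(example_items), 3))
```

    Sample variances (n - 1 denominator) are used for both the items and the totals; whether the study used the same convention is not stated in the abstract.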

  15. Using Web-Based Questionnaires and Obstetric Records to Assess General Health Characteristics Among Pregnant Women: A Validation Study

    PubMed Central

    Schouten, Naomi PE; Merkus, Peter JFM; Verhaak, Chris M; Roeleveld, Nel; Roukema, Jolt

    2015-01-01

    Background Self-reported medical history information is included in many studies. However, data on the validity of Web-based questionnaires assessing medical history are scarce. If proven to be valid, Web-based questionnaires may provide researchers with an efficient means to collect data on this parameter in large populations. Objective The aim of this study was to assess the validity of a Web-based questionnaire on chronic medical conditions, allergies, and blood pressure readings against obstetric records and data from general practitioners. Methods Self-reported questionnaire data were compared with obstetric records for 519 pregnant women participating in the Dutch PRegnancy and Infant DEvelopment (PRIDE) Study from July 2011 through November 2012. These women completed Web-based questionnaires around their first prenatal care visit and in gestational weeks 17 and 34. We calculated kappa statistics (κ) and the observed proportions of positive and negative agreement between the baseline questionnaire and obstetric records for chronic conditions and allergies. In case of inconsistencies between these 2 data sources, medical records from the woman’s general practitioner were consulted as the reference standard. For systolic and diastolic blood pressure, intraclass correlation coefficients (ICCs) were calculated for multiple data points. Results Agreement between the baseline questionnaire and the obstetric record was substantial (κ=.61) for any chronic condition and moderate for any allergy (κ=.51). For specific conditions, we found high observed proportions of negative agreement (range 0.88-1.00) and on average moderate observed proportions of positive agreement with a wide range (range 0.19-0.90). Using the reference standard, the sensitivity of the Web-based questionnaire for chronic conditions and allergies was comparable to or even better than the sensitivity of the obstetric records, in particular for migraine (0.90 vs 0.40, P=.02), asthma (0.86 vs 0

  16. Using Web-Based Questionnaires and Obstetric Records to Assess General Health Characteristics Among Pregnant Women: A Validation Study.

    PubMed

    van Gelder, Marleen M H J; Schouten, Naomi P E; Merkus, Peter J F M; Verhaak, Chris M; Roeleveld, Nel; Roukema, Jolt

    2015-06-16

    Self-reported medical history information is included in many studies. However, data on the validity of Web-based questionnaires assessing medical history are scarce. If proven to be valid, Web-based questionnaires may provide researchers with an efficient means to collect data on this parameter in large populations. The aim of this study was to assess the validity of a Web-based questionnaire on chronic medical conditions, allergies, and blood pressure readings against obstetric records and data from general practitioners. Self-reported questionnaire data were compared with obstetric records for 519 pregnant women participating in the Dutch PRegnancy and Infant DEvelopment (PRIDE) Study from July 2011 through November 2012. These women completed Web-based questionnaires around their first prenatal care visit and in gestational weeks 17 and 34. We calculated kappa statistics (κ) and the observed proportions of positive and negative agreement between the baseline questionnaire and obstetric records for chronic conditions and allergies. In case of inconsistencies between these 2 data sources, medical records from the woman's general practitioner were consulted as the reference standard. For systolic and diastolic blood pressure, intraclass correlation coefficients (ICCs) were calculated for multiple data points. Agreement between the baseline questionnaire and the obstetric record was substantial (κ=.61) for any chronic condition and moderate for any allergy (κ=.51). For specific conditions, we found high observed proportions of negative agreement (range 0.88-1.00) and on average moderate observed proportions of positive agreement with a wide range (range 0.19-0.90). Using the reference standard, the sensitivity of the Web-based questionnaire for chronic conditions and allergies was comparable to or even better than the sensitivity of the obstetric records, in particular for migraine (0.90 vs 0.40, P=.02), asthma (0.86 vs 0.61, P=.04), inhalation allergies (0
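
    The abstract above reports observed proportions of positive and negative agreement between two data sources. A minimal sketch of those two quantities, assuming the usual 2x2 definitions (positive agreement = 2a / (2a + b + c), negative agreement = 2d / (2d + b + c)); the function name and the counts are hypothetical and do not come from the PRIDE Study.

```python
def agreement_proportions(both_yes, only_first, only_second, both_no):
    """Observed proportions of positive and negative agreement.

    both_yes    (a): condition reported in both sources
    only_first  (b): reported in the first source only
    only_second (c): reported in the second source only
    both_no     (d): reported in neither source
    """
    discordant = only_first + only_second
    positive_denominator = 2 * both_yes + discordant
    negative_denominator = 2 * both_no + discordant
    if positive_denominator == 0 or negative_denominator == 0:
        raise ValueError("agreement is undefined for this table")
    positive = 2 * both_yes / positive_denominator
    negative = 2 * both_no / negative_denominator
    return positive, negative


# Hypothetical counts for one condition: questionnaire vs. obstetric record.
positive, negative = agreement_proportions(
    both_yes=10, only_first=5, only_second=5, both_no=80
)
print(f"positive agreement {positive:.2f}, negative agreement {negative:.2f}")
```

    Reporting the two proportions separately is useful for rare conditions, where a large "neither source" cell can make overall agreement look high even when the positive reports rarely coincide.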

  17. The Individualized Classroom Assessment Scoring System (inCLASS): Preliminary Reliability and Validity of a System for Observing Preschoolers’ Competence in Classroom Interactions

    PubMed Central

    Downer, Jason T.; Booren, Leslie M.; Lima, Olivia K.; Luckner, Amy E.; Pianta, Robert C.

    2012-01-01

    This paper introduces the Individualized Classroom Assessment Scoring System (inCLASS), an observation tool that targets children’s interactions in preschool classrooms with teachers, peers, and tasks. In particular, initial evidence is reported of the extent to which the inCLASS meets the following psychometric criteria: inter-rater reliability, normal distributions and adequate range, construct validity, and criterion-related validity. These initial findings suggest that the inCLASS has the potential to provide an authentic, contextualized assessment of young children’s classroom behaviors. Future directions for research with the inCLASS are discussed. PMID:23175598

  18. Application of time transfer functions to Gaia's global astrometry. Validation on DPAC simulated Gaia-like observations

    NASA Astrophysics Data System (ADS)

    Bertone, Stefano; Vecchiato, Alberto; Bucciarelli, Beatrice; Crosta, Mariateresa; Lattanzi, Mario G.; Bianchi, Luca; Angonin, Marie-Christine; Le Poncin-Lafitte, Christophe

    2017-12-01

    Context. A key objective of the ESA Gaia satellite is the realization of a quasi-inertial reference frame at visual wavelengths by means of global astrometric techniques. This requires accurate mathematical and numerical modeling of relativistic light propagation, as well as double-blind-like procedures for the internal validation of the results, before they are released to the scientific community at large. Aims: We aim to specialize the time transfer functions (TTF) formalism to the case of the Gaia observer and prove its applicability to the task of global sphere reconstruction (GSR), in anticipation of its inclusion in the GSR system, already featuring the Relativistic Astrometric MODel (RAMOD) suite, as an additional semi-external validation of the forthcoming Gaia baseline astrometric solutions. Methods: We extended the current GSR framework and software infrastructure (GSR2) to include TTF relativistic observation equations compatible with Gaia's operations. We used simulated data generated by the Gaia Data Processing and Analysis Consortium (DPAC) to obtain different least-squares estimations of the full (five-parameter) stellar spheres and gauge results. These were compared to analogous solutions obtained with the current RAMOD model in GSR2 (RAMOD@GSR2) and to the catalog generated with the Gaia RElativistic Model (GREM), the model baselined for Gaia and used to generate the DPAC synthetic data. Results: Linearized least-squares TTF solutions are based on spheres of about 132 000 primary stars uniformly distributed on the sky and simulated observations spanning the entire 5 yr range of Gaia's nominal operational lifetime. The statistical properties of the results compare well with those of GREM. Finally, comparisons to RAMOD@GSR2 solutions confirmed the known lower accuracy of that model and allowed us to establish firm limits on the quality of the linearization point outside of which an iteration for non-linearity is required for its proper convergence

  19. Validation Study on Alos Prism Dsm Mosaic and Aster Gdem 2

    NASA Astrophysics Data System (ADS)

    Tadono, T.; Takaku, J.; Shimada, M.

    2012-07-01

    This study aims to evaluate height accuracy of two datasets obtained by spaceborne optical instruments of a digital elevation data for a large-scale area. The digital surface model (DSM) was generated by the Panchromatic Remote-sensing Instrument for Stereo Mapping (PRISM) onboard the Advanced Land Observing Satellite (ALOS, nicknamed 'Daichi'), and the global digital elevation model (DEM) version 2 (GDEM-2) was derived from the Advanced Spaceborne Thermal Emission and Reflection Radiometer (ASTER) onboard NASA's TERRA satellite. The test site of this study was the entire country of Bhutan, which is located on the southern slopes of the eastern Himalayas. Bhutan is not a large country, covering about 330 km from east to west, and 170 km from north to south; however, it has large height variation from 200 m to more than 7,000 m. This therefore makes it very interesting for validating digital topographic information in terms of national scale generation as well as wide height range. Regarding the reference data, field surveys were conducted in 2010 and 2011, and collected ground control points by a global positioning system were used for evaluating precise height accuracies in point scale as check points (CPs), with a 3 arc-sec DEM created by the Shuttle Radar Topography Mission (SRTM-3) used to validate the wide region. The results confirmed a root mean square error of 8.1 m for PRISM DSM and 29.4 m for GDEM-2 by CPs.
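
    The accuracy figures in the abstract above are root mean square errors of model heights against check points. A minimal sketch of that calculation follows; the function name and the height values are hypothetical and are not taken from the Bhutan survey.

```python
import math


def rmse(model_heights, reference_heights):
    """Root mean square error between model heights and check-point heights (same units)."""
    if not model_heights or len(model_heights) != len(reference_heights):
        raise ValueError("inputs must be non-empty and of equal length")
    squared_errors = [
        (model - reference) ** 2
        for model, reference in zip(model_heights, reference_heights)
    ]
    return math.sqrt(sum(squared_errors) / len(squared_errors))


# Hypothetical heights in metres at three check points.
dsm_heights = [2503.0, 1210.0, 3344.0]
check_point_heights = [2500.0, 1214.0, 3344.0]
print(f"RMSE = {rmse(dsm_heights, check_point_heights):.2f} m")
```

    Because errors are squared before averaging, a few large outliers (for example in steep terrain) dominate the result, which is one reason RMSE is often reported alongside bias and standard deviation.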

  20. The Jackson Career Explorer: Two Further Validity Studies

    ERIC Educational Resources Information Center

    Schermer, Julie Aitken

    2012-01-01

    The present report consists of two further validity studies using the Jackson Career Explorer (JCE), a short form and continuous version of the Jackson Vocational Interest Survey, measuring 34 interests. The first study examined the relationships between the JCE and five personality factors, from a sample of 528 individuals. The correlations found…

  1. Sources of Intrusions in Children’s Dietary Recalls from a Validation Study of Order Prompts

    PubMed Central

    Baxter, Suzanne Domel; Hardin, James W.; Royer, Julie A.; Smith, Albert F.; Guinn, Caroline H.

    2008-01-01

    Validation-study data and foodservice production records were analyzed to test hypotheses concerning sources of intrusions (reports of uneaten items) in the school-meal parts of children’s dietary recalls. Each child was observed eating school meals on two days, and interviewed the morning after each observation day; one interview used forward-order (morning-to-evening) and one used reverse-order (evening-to-morning) prompts. Lunch intrusions were likelier to have been available in the foodservice environment at lunch as day before the interview came closer, and on days before than after the interview. Temporal dating errors are contributing sources of intrusions in the school-lunch parts of children’s recalls. PMID:18987088

  2. 29 CFR 1607.14 - Technical standards for validity studies.

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    ... in the design of the study and their effects identified. (5) Statistical relationships. The degree of...; or such factors should be included in the design of the study and their effects identified. (f... arduous effort involving a series of research studies, which include criterion related validity studies...

  3. 29 CFR 1607.14 - Technical standards for validity studies.

    Code of Federal Regulations, 2013 CFR

    2013-07-01

    ... in the design of the study and their effects identified. (5) Statistical relationships. The degree of...; or such factors should be included in the design of the study and their effects identified. (f... arduous effort involving a series of research studies, which include criterion related validity studies...

  4. 29 CFR 1607.14 - Technical standards for validity studies.

    Code of Federal Regulations, 2014 CFR

    2014-07-01

    ... in the design of the study and their effects identified. (5) Statistical relationships. The degree of...; or such factors should be included in the design of the study and their effects identified. (f... arduous effort involving a series of research studies, which include criterion related validity studies...

  5. 29 CFR 1607.14 - Technical standards for validity studies.

    Code of Federal Regulations, 2012 CFR

    2012-07-01

    ... in the design of the study and their effects identified. (5) Statistical relationships. The degree of...; or such factors should be included in the design of the study and their effects identified. (f... arduous effort involving a series of research studies, which include criterion related validity studies...

  6. 29 CFR 1607.14 - Technical standards for validity studies.

    Code of Federal Regulations, 2010 CFR

    2010-07-01

    ... in the design of the study and their effects identified. (5) Statistical relationships. The degree of...; or such factors should be included in the design of the study and their effects identified. (f... arduous effort involving a series of research studies, which include criterion related validity studies...

  7. The FLIR ONE thermal imager for the assessment of burn wounds: Reliability and validity study.

    PubMed

    Jaspers, M E H; Carrière, M E; Meij-de Vries, A; Klaessens, J H G M; van Zuijlen, P P M

    2017-11-01

    Objective measurement tools may be of great value to provide early and reliable burn wound assessment. Thermal imaging is an easy, accessible and objective technique, which measures skin temperature as an indicator of tissue perfusion. These thermal images might be helpful in the assessment of burn wounds. However, before implementation of a novel measurement tool into clinical practice is considered, it is appropriate to test its clinimetric properties (i.e. reliability and validity). The objective of this study was to assess the reliability and validity of the recently introduced FLIR ONE thermal imager. Two observers obtained thermal images of burn wounds in adult patients at day 1-3, 4-7 and 8-10 after burn. Subsequently, temperature differences between the burn wound and healthy skin (ΔT) were calculated on an iPad mini containing the FLIR Tools app. To assess reliability, ΔT values of both observers were compared by calculating the intraclass correlation coefficient (ICC) and measurement error parameters. To assess validity, the ΔT values of the first observer were compared to the registered healing time of the burn wounds, which was specified into three categories: (I) ≤14 days, (II) 15-21 days and (III) >21 days. The ability of the FLIR ONE to discriminate between healing ≤21 days and >21 days was evaluated by means of a receiver operating characteristic curve and an optimal ΔT cut-off value. Reliability: ICCs were 0.99 for each time point, indicating excellent reliability up to 10 days after burn. The standard error of measurement varied between 0.17 and 0.22°C. The area under the curve was calculated at 0.69 (95% CI 0.54-0.84). A cut-off value of -1.15°C shows a moderate discrimination between burn wound healing ≤21 days and >21 days (46% sensitivity; 82% specificity). Our results show that the FLIR ONE thermal imager is highly reliable, but the moderate validity calls for additional research. However, the FLIR ONE is pre-eminently feasible

  8. Validating the usability of an interactive Earth Observation based web service for landslide investigation

    NASA Astrophysics Data System (ADS)

    Albrecht, Florian; Weinke, Elisabeth; Eisank, Clemens; Vecchiotti, Filippo; Hölbling, Daniel; Friedl, Barbara; Kociu, Arben

    2017-04-01

    Regional authorities and infrastructure maintainers in almost all mountainous regions of the Earth need detailed and up-to-date landslide inventories for hazard and risk management. Landslide inventories usually are compiled through ground surveys and manual image interpretation following landslide triggering events. We developed a web service that uses Earth Observation (EO) data to support the mapping and monitoring tasks for improving the collection of landslide information. The planned validation of the EO-based web service does not only cover the analysis of the achievable landslide information quality but also the usability and user friendliness of the user interface. The underlying validation criteria are based on the user requirements and the defined tasks and aims in the work description of the FFG project Land@Slide (EO-based landslide mapping: from methodological developments to automated web-based information delivery). The service will be validated in collaboration with stakeholders, decision makers and experts. Users are requested to test the web service functionality and give feedback with a web-based questionnaire by following the subsequently described workflow. The users will operate the web-service via the responsive user interface and can extract landslide information from EO data. They compare it to reference data for quality assessment, for monitoring changes and for assessing landslide-affected infrastructure. An overview page lets the user explore a list of example projects with resulting landslide maps and mapping workflow descriptions. The example projects include mapped landslides in several test areas in Austria and Northern Italy. Landslides were extracted from high resolution (HR) and very high resolution (VHR) satellite imagery, such as Landsat, Sentinel-2, SPOT-5, WorldView-2/3 or Pléiades. The user can create his/her own project by selecting available satellite imagery or by uploading new data. Subsequently, a new landslide

  9. High spatial resolution satellite observations for validation of MODIS land products: IKONOS observations acquired under the NASA scientific data purchase.

    Treesearch

    Jeffrey T. Morisette; Jaime E. Nickeson; Paul Davis; Yujie Wang; Yuhong Tian; Curtis E. Woodcock; Nikolay Shabanov; Matthew Hansen; Warren B. Cohen; Doug R. Oetter; Robert E. Kennedy

    2003-01-01

    Phase II of the Scientific Data Purchase (SDP) has provided NASA investigators access to data from four different satellite and airborne data sources. The Moderate Resolution Imaging Spectrometer (MODIS) land discipline team (MODLAND) sought to utilize these data in support of land product validation activities with a focus on the EOS Land Validation Core Sites. These...

  10. Evaluation of spectroscopic databases through radiative transfer simulations compared to observations. Application to the validation of GEISA 2015 with IASI and TCCON

    NASA Astrophysics Data System (ADS)

    Armante, Raymond; Scott, Noelle; Crevoisier, Cyril; Capelle, Virginie; Crepeau, Laurent; Jacquinet, Nicole; Chédin, Alain

    2016-09-01

    The quality of spectroscopic parameters that serve as input to forward radiative transfer models is essential to fully exploit remote sensing of Earth's atmosphere. However, the process of updating spectroscopic databases in order to provide the users with a database that ensures an optimal characterization of spectral properties of molecular absorption for radiative transfer modeling is challenging. The evaluation of the databases' content and the underlying choices made by the managing team is thus a crucial step. Here, we introduce an original and powerful approach for evaluating spectroscopic parameters: the Spectroscopic Parameters And Radiative Transfer Evaluation (SPARTE) chain. The SPARTE chain relies on the comparison between forward radiative transfer simulations made by the 4A radiative transfer model and observations of spectra made from various observations collocated over several thousands of well-characterized atmospheric situations. Averaging the resulting 'calculated-observed spectral' residuals minimizes the random errors coming from both the radiometric noise of the instruments and the imperfect description of the atmospheric state. The SPARTE chain can be used to evaluate any spectroscopic database, from the visible to the microwave, using any type of remote sensing observations (ground-based, airborne or space-borne). We show that the comparison of the shape of the residuals enables: (i) identifying incorrect line parameters (line position, intensity, width, pressure shift, etc.), even for molecules for which interferences between the lines have to be taken into account; (ii) proposing revised values, in cooperation with contributing teams; and (iii) validating the final updated parameters. In particular, we show that the simultaneous availability of two databases such as GEISA and HITRAN helps identify remaining issues in each database. The SPARTE chain has been applied here to the validation of the update of GEISA-2015 in 2 spectral regions

  11. A newly developed tool for classifying study designs in systematic reviews of interventions and exposures showed substantial reliability and validity.

    PubMed

    Seo, Hyun-Ju; Kim, Soo Young; Lee, Yoon Jae; Jang, Bo-Hyoung; Park, Ji-Eun; Sheen, Seung-Soo; Hahn, Seo Kyung

    2016-02-01

    To develop a study Design Algorithm for Medical Literature on Intervention (DAMI) and test its interrater reliability, construct validity, and ease of use. We developed and then revised the DAMI to include detailed instructions. To test the DAMI's reliability, we used a purposive sample of 134 primary, mainly nonrandomized studies. We then compared the study designs as classified by the original authors and through the DAMI. Unweighted kappa statistics were computed to test interrater reliability and construct validity based on the level of agreement between the original and DAMI classifications. Assessment time was also recorded to evaluate ease of use. The DAMI includes 13 study designs, including experimental and observational studies of interventions and exposure. Both the interrater reliability (unweighted kappa = 0.67; 95% CI [0.64-0.75]) and construct validity (unweighted kappa = 0.63, 95% CI [0.52-0.67]) were substantial. Mean classification time using the DAMI was 4.08 ± 2.44 minutes (range, 0.51-10.92). The DAMI showed substantial interrater reliability and construct validity. Furthermore, given its ease of use, it could be used to accurately classify medical literature for systematic reviews of interventions while minimizing disagreement between authors of such reviews. Copyright © 2016 Elsevier Inc. All rights reserved.
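
    The abstract above relies on unweighted kappa to quantify agreement between two classifications. A minimal sketch of Cohen's unweighted kappa for two raters follows, assuming the standard definition (observed agreement corrected for the agreement expected by chance from each rater's marginal frequencies); the function name and the labels are hypothetical, not the DAMI data.

```python
from collections import Counter


def cohens_kappa(ratings_a, ratings_b):
    """Unweighted Cohen's kappa for two raters classifying the same items."""
    n = len(ratings_a)
    if n == 0 or n != len(ratings_b):
        raise ValueError("ratings must be non-empty and of equal length")
    # Proportion of items on which the two raters chose the same category.
    observed = sum(a == b for a, b in zip(ratings_a, ratings_b)) / n
    # Chance agreement from the product of the raters' marginal proportions.
    counts_a = Counter(ratings_a)
    counts_b = Counter(ratings_b)
    expected = sum(counts_a[c] * counts_b[c] for c in counts_a) / (n * n)
    if expected == 1.0:
        raise ValueError("kappa is undefined when chance agreement is 1")
    return (observed - expected) / (1 - expected)


# Hypothetical study-design labels from the original authors and from a second classification.
author_labels = ["cohort", "cohort", "rct", "rct"]
tool_labels = ["cohort", "rct", "rct", "rct"]
print(cohens_kappa(author_labels, tool_labels))
```

    Unweighted kappa treats every disagreement as equally serious, which suits nominal categories such as study designs; ordered categories would usually call for a weighted variant.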

  12. Teacher Evaluation Project. The Beginning Teacher Program, Intellectual Skills Development, Validity Studies of the Evaluation System, Special Instrument Development. Report for 1984-1985.

    ERIC Educational Resources Information Center

    Florida Coalition for the Development of a Performance Measurement System, Tallahassee.

    Reports, summaries, and recommendations are presented on the following research studies: (1) Beginning Teacher Studies; (2) Instructional Skills for Teaching Higher Order Thinking; (3) Development of the Conferential Observation Instrument; (4) Predictive Validity Studies Conducted to Test the Relationship Between Teacher Performance as Measured…

  13. Direct observation of students during clerkship rotations: a multiyear descriptive study.

    PubMed

    Howley, Lisa D; Wilson, William G

    2004-03-01

    To determine how often students report that they are observed while performing physical examinations and taking histories during clerkship rotations. From 1999-2001, 397 students at the University of Virginia School of Medicine were asked at the end of their third year to report the number of times they had been observed by a resident or faculty member while taking histories and performing physical examinations on six rotations. Three hundred and forty-five students (87%) returned the survey instrument; of these, 322 (81%) returned instruments with complete information. On average, the majority reported that they had never been observed by a faculty member while taking a history (51%), performing a focused physical examination (54%), or a complete physical examination (81%). The majority (60%) reported that they had never been observed by a resident while performing a complete physical examination. Faculty observations occurred most frequently during the four-week family medicine rotation and least frequently during the 12-week surgery rotation. The length of the clerkship rotation was inversely related to the number of reported observations, χ²(5, n = 295) = 127.85, p < .000. Although alternative assessments of clinical skills are becoming more common in medical education, faculty ratings based on direct observation are still prominent. The data in this study reflect that these observations may actually be occurring quite infrequently, if at all. Decreasing the evaluative weight of faculty and resident ratings during the clerkship rotation may be necessary. Otherwise, efforts should be made to increase the validity of these ratings.

  14. Randomized controlled trials and real-world observational studies in evaluating cardiovascular safety of inhaled bronchodilator therapy in COPD.

    PubMed

    Kardos, Peter; Worsley, Sally; Singh, Dave; Román-Rodríguez, Miguel; Newby, David E; Müllerová, Hana

    2016-01-01

    Long-acting muscarinic antagonist (LAMA) or long-acting β2-agonist (LABA) bronchodilators and their combination are recommended for the maintenance treatment of chronic obstructive pulmonary disease (COPD). Although the efficacy of LAMAs and LABAs has been well established through randomized controlled trials (RCTs), questions remain regarding their cardiovascular (CV) safety. Furthermore, while the safety of LAMA and LABA monotherapy has been extensively studied, data are lacking for LAMA/LABA combination therapy, and the majority of the studies that have reported on the CV safety of LAMA/LABA combination therapy were not specifically designed to assess this. Evaluation of CV safety for COPD treatments is important because many patients with COPD have underlying CV comorbidities. However, severe CV and other comorbidities are often exclusion criteria for RCTs, contributing to a lack in external validity and generalizability. Real-world observational studies are another important tool to evaluate the effectiveness and safety of COPD therapies in a broader population of patients and can improve upon the external validity limitations of RCTs. We examine what is already known regarding the CV and cerebrovascular safety of LAMA/LABA combination therapy from RCTs and real-world observational studies, and explore the advantages and limitations of data derived from each study type. We also describe an ongoing prospective, observational, comparative post-authorization safety study of a LAMA/LABA combination therapy (umeclidinium/vilanterol) and LAMA monotherapy (umeclidinium) versus tiotropium, with a focus on the relative merits of the study design.

  15. Evaluation of a Web-Based Food Record for Children Using Direct Unobtrusive Lunch Observations: A Validation Study

    PubMed Central

    Astrup, Helene; Kåsin, Britt Marlene; Andersen, Lene Frost

    2015-01-01

    Background High-quality, Web-based dietary assessment tools for children are needed to reduce cost and improve user-friendliness when studying children’s dietary practices. Objective To evaluate the first Web-based dietary assessment tool for children in Norway, the Web-based Food Record (WebFR), by comparing children’s true school lunch intake with recordings in the WebFR, using direct unobtrusive observation as the reference method. Methods A total of 117 children, 8-9 years, from Bærum, Norway, were recruited from September to December 2013. Children completed 4 days of recordings in the WebFR, with parental assistance, and were observed during school lunch in the same period by 3 observers. Interobserver reliability assessments were satisfactory. Match, omission, and intrusion rates were calculated to assess the quality of the recordings in the WebFR for different food categories, and for all foods combined. Logistic regression analyses were used to investigate whether body mass index (BMI), parental educational level, parental ethnicity or family structure were associated with having a “low match rate” (≤70%). Results Bread and milk were recorded with less bias than spreads, fruits, and vegetables. Mean (SD) for match, omission, and intrusion rates for all foods combined were 73% (27%), 27% (27%), and 19% (26%), respectively. Match rates were statistically significantly associated with parental educational level (low education 52% [32%] versus high 77% [24%], P=.008) and parental ethnicity (non-Norwegian 57% [28%] versus others 75% [26%], P=.04). Only parental ethnicity remained statistically significant in the logistic regression model, showing an adjusted odds ratio of 6.9 and a 95% confidence interval between 1.3 and 36.4. Conclusions Compared with other similar studies, our results indicate that the WebFR is in line with, or better than most of other similar tools, yet enhancements could further improve the WebFR. PMID:26680744
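
    The abstract above summarizes recording quality with match, omission, and intrusion rates. A minimal sketch follows, assuming the definitions commonly used in dietary validation work: match rate = matches / (matches + omissions), omission rate = omissions / (matches + omissions), and intrusion rate = intrusions / (matches + intrusions). The abstract does not spell these formulas out, so they are an assumption here, and the function name and food items are hypothetical.

```python
def recording_rates(observed_items, recorded_items):
    """Match, omission, and intrusion rates (as fractions) for one meal.

    observed_items: foods the observer saw the child eat
    recorded_items: foods entered in the food record
    """
    observed = set(observed_items)
    recorded = set(recorded_items)
    matches = len(observed & recorded)      # eaten and recorded
    omissions = len(observed - recorded)    # eaten but not recorded
    intrusions = len(recorded - observed)   # recorded but not eaten
    if matches + omissions == 0 or matches + intrusions == 0:
        raise ValueError("rates are undefined without observed and recorded items")
    return {
        "match": matches / (matches + omissions),
        "omission": omissions / (matches + omissions),
        "intrusion": intrusions / (matches + intrusions),
    }


# Hypothetical lunch: the child ate bread, milk, and an apple,
# but the record lists bread, milk, and yoghurt.
rates = recording_rates(["bread", "milk", "apple"], ["bread", "milk", "yoghurt"])
print({name: round(value, 2) for name, value in rates.items()})
```

    Under these definitions the match and omission rates sum to 1, which is consistent with the 73% and 27% reported in the abstract; the sketch ignores portion sizes and treats each food as a single item.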

  16. Academic and Nonacademic Validating Agents on Latinas Mathematics and Science Self Concept A Quantitative Study Utilizing the High School Longitudinal Study of 2009

    NASA Astrophysics Data System (ADS)

    Garza, Jennifer M.

    The purpose of this study is to inform and further the discussion of academic (i.e. teachers and school counselors) and non-academic (i.e. parents, family, friends, etc.) validating agents on Latina students' mathematics and science self-concepts. This study found a relationship between Latina students' interactions with academic and non-academic validating agents and their math and science self-concept at the K-12 level. Through the review of the literature the researcher addresses identifiable factors and strategies that inform the field of education in the areas of validation theory, family characteristics, and access to STEM fields for Latina students. The researcher used an established instrument designed, administered, and validated through the National Center for Education Statistics (NCES). For purposes of this study, a categorical subset of participants who self-identified as being a Latina student was used. As a result, the total subset number in this study was N=1,882. To determine if academic and non-academic validating agents had an observable statistically significant relationship with Latina students' math and science self-concept, a series of one-way ANOVAs were calculated to compare differences in students' math and science self-concept based on academic and non-academic validating agents for the weighted sample of Latinas for the HSLS:09 survey. A path analysis was also employed to assess the factors involved in Latina students' math and science self-concepts. The findings are consistent with previous research involving the influence that academic and non-academic validating agents have on the math and science self-concept of Latina students. The results indicated that students who had teachers that believed in the students, regardless of family background, socioeconomic status or home environment influences, had higher math and science self-concepts than those who did not. Similarly, it was found that students who had counselors that set high

  17. Survival after postoperative morbidity: a longitudinal observational cohort study.

    PubMed

    Moonesinghe, S R; Harris, S; Mythen, M G; Rowan, K M; Haddad, F S; Emberton, M; Grocott, M P W

    2014-12-01

    Previous studies have suggested that there may be long-term harm associated with postoperative complications. Uncertainty exists however, because of the need for risk adjustment and inconsistent definitions of postoperative morbidity. We did a longitudinal observational cohort study of patients undergoing major surgery. Case-mix adjustment was applied and morbidity was recorded using a validated outcome measure. Cox proportional hazards modelling using time-dependent covariates was used to measure the independent relationship between prolonged postoperative morbidity and longer term survival. Data were analysed for 1362 patients. The median length of stay was 9 days and the median follow-up time was 6.5 yr. Independent of perioperative risk, postoperative neurological morbidity (prevalence 2.9%) was associated with a relative hazard for long-term mortality of 2.00 [P=0.001; 95% confidence interval (CI) 1.32-3.04]. Prolonged postoperative morbidity (prevalence 15.6%) conferred a relative hazard for death in the first 12 months after surgery of 3.51 (P<0.001; 95% CI 2.28-5.42) and for the next 2 yr of 2.44 (P<0.001; 95% CI 1.62-3.65), returning to baseline thereafter. Prolonged morbidity after surgery is associated with a risk of premature death for a longer duration than perhaps is commonly thought; however, this risk falls with time. We suggest that prolonged postoperative morbidity measured in this way may be a valid indicator of the quality of surgical healthcare. Our findings reinforce the importance of research and quality improvement initiatives aimed at reducing the duration and severity of postoperative complications. © The Author 2014. Published by Oxford University Press on behalf of the British Journal of Anaesthesia.

  18. A simplified approach to the pooled analysis of calibration of clinical prediction rules for systematic reviews of validation studies

    PubMed Central

    Dimitrov, Borislav D; Motterlini, Nicola; Fahey, Tom

    2015-01-01

    Objective: Estimating calibration performance of clinical prediction rules (CPRs) in systematic reviews of validation studies is not possible when predicted values are neither published nor accessible or sufficient or no individual participant or patient data are available. Our aims were to describe a simplified approach for outcomes prediction and calibration assessment and evaluate its functionality and validity. Study design and methods: Methodological study of systematic reviews of validation studies of CPRs: a) ABCD2 rule for prediction of 7 day stroke; and b) CRB-65 rule for prediction of 30 day mortality. Predicted outcomes in a sample validation study were computed by CPR distribution patterns (“derivation model”). As confirmation, a logistic regression model (with derivation study coefficients) was applied to CPR-based dummy variables in the validation study. Meta-analysis of validation studies provided pooled estimates of “predicted:observed” risk ratios (RRs), 95% confidence intervals (CIs), and indexes of heterogeneity (I²) on forest plots (fixed and random effects models), with and without adjustment of intercepts. The above approach was also applied to the CRB-65 rule. Results: Our simplified method, applied to ABCD2 rule in three risk strata (low, 0–3; intermediate, 4–5; high, 6–7 points), indicated that predictions are identical to those computed by univariate, CPR-based logistic regression model. Discrimination was good (c-statistics = 0.61–0.82); however, calibration in some studies was low. In such cases with miscalibration, the under-prediction (RRs = 0.73–0.91, 95% CIs 0.41–1.48) could be further corrected by intercept adjustment to account for incidence differences. An improvement of both heterogeneities and P-values (Hosmer-Lemeshow goodness-of-fit test) was observed. Better calibration and improved pooled RRs (0.90–1.06), with narrower 95% CIs (0.57–1.41) were achieved. Conclusion: Our results have an immediate clinical

  19. An observational examination of the literature in diagnostic anatomic pathology.

    PubMed

    Foucar, Elliott; Wick, Mark R

    2005-05-01

    Original research published in the medical literature confronts the reader with three very basic and closely linked questions--are the authors' conclusions true in the contextual setting in which the work was performed (internally valid); if so, are the conclusions also applicable in other practice settings (externally valid); and, if the conclusions of the study are bona fide, do they represent an important contribution to medical practice or are they true-but-insignificant? Most publications attempt to convince readers that the researchers' conclusions are both internally valid and important, and occasionally papers also directly address external validity. Developing standardized methods to facilitate the prospective determination of research importance would be useful to both journals and their readers, but has proven difficult. In contrast, the evidence-based medicine (EBM) movement has had more success with understanding and codifying factors thought to promote research validity. Of the many variables that can influence research validity, research design is the one that has received the most attention. The present paper reviews the contributions of EBM to understanding research validity, looking for areas where EBM's body of knowledge is applicable to the anatomic pathology (AP) literature. As part of this project, the authors performed a pilot observational analysis of a representative sample of the current pertinent literature on diagnostic tissue pathology. The results of that review showed that most of the latter publications employ one of the four categories of "observational" research design that have been delineated by the EBM movement, and that the most common of these observational designs is a "cross-sectional" comparison. Pathologists do not presently use the "experimental" research designs so admired by advocates of EBM. Slightly > 50% of AP observational studies employed statistical evaluations to support their final conclusions. Comparison of the

  20. The global status of freshwater fish age validation studies and a prioritization framework for future research

    USGS Publications Warehouse

    Pope, Kevin L.; Hamel, Martin J.; Pegg, Mark A.; Spurgeon, Jonathan J.

    2016-01-01

    Age information derived from calcified structures is commonly used to estimate recruitment, growth, and mortality for fish populations. Validation of daily or annual marks on age structures is often assumed, presumably due to a lack of general knowledge concerning the status of age validation studies. Therefore, the current status of freshwater fish age validation studies was summarized to show where additional effort is needed, and increase the accessibility of validation studies to researchers. In total, 1351 original peer-reviewed articles were reviewed from freshwater systems that studied age in fish. Periodicity and age validation studies were found for 88 freshwater species comprising 21 fish families. The number of age validation studies has increased over the last 30 years following previous calls for more research; however, few species have validated structures spanning all life stages. In addition, few fishes of conservation concern have validated ageing structures. A prioritization framework, using a combination of eight characteristics, is offered to direct future age validation studies and close the validation information gap. Additional study, using the offered prioritization framework, and increased availability of published studies that incorporate uncertainty when presenting research results dealing with age information are needed.

  1. GLM Validation Studies in Colorado

    NASA Astrophysics Data System (ADS)

    Rutledge, S. A.; Reimel, K.; Fuchs, B.; Xu, W.

    2017-12-01

    On 8 May 2017 the Geostationary Lightning Mapper (GLM) calibration/validation field campaign completed a mission over the domain of the Colorado Lightning Mapping Array (LMA). This "gold mine day" produced a mixture of normal polarity and anomalous storms of varying intensity. A case study analysis has been completed for a portion of three individual storms from this day. By utilizing a cell tracking algorithm and lightning flash attribution program, individual lightning flashes detected by the GLM, LMA, the National Lightning Detection Network (NLDN), and Earth Networks Total Lightning Network (ENTLN) are attributed to individual storm cells. The focus of this analysis is the detection efficiency of GLM. We will discuss how the GLM detection efficiency changes as a result of storm morphology and lightning flash characteristics. Lightning flash size, flash height, and the amount of ice present between the lightning flash altitude and the top of the cloud all appear to play a role in how well GLM detects lightning flashes. Since GLM shares the same concept as its predecessor TRMM LIS (optically-based lightning detection), the evaluation of TRMM LIS against LMA network-detected lightning provides insights into the GLM detection efficiency. We have collected observations by LIS and LMA coincident in time and space during 2008-2014. The sample includes 400 LIS overpasses with both LIS and LMA detecting flashes within 150 km radius of the center of the LMA array during the 120 second LIS observing time period (analysis presently confined to the Alabama LMA network). The overall LIS detection efficiency (DE, defined as the ratio of flash rates between LIS and LMA) is 0.45, with higher DE for lower flash rate cases. LIS showed a DE of nearly 100% for cases with flash rates < 10 fl/min, but had a DE of only 20-30% for high flash rates within intense storms (> 300 fl/min). We further separated the dataset into day and night, and found that the night-time DE (0.6) increased
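
    The detection efficiency quoted in this abstract is defined there as the ratio of LIS to LMA flash rates. A minimal sketch of that ratio follows; the function name and the overpass numbers are hypothetical illustrations, not values or code from the study.

```python
def detection_efficiency(lis_flash_rate, lma_flash_rate):
    """Ratio of the LIS flash rate to the LMA (reference) flash rate.

    Both rates are assumed to be in flashes per minute over the same
    overpass window; the LMA rate is treated as the reference.
    """
    if lma_flash_rate <= 0:
        raise ValueError("reference (LMA) flash rate must be positive")
    return lis_flash_rate / lma_flash_rate


# Hypothetical overpasses as (LIS fl/min, LMA fl/min); illustrative only.
overpasses = [(9.0, 10.0), (45.0, 100.0), (80.0, 320.0)]
for lis, lma in overpasses:
    de = detection_efficiency(lis, lma)
    print(f"LMA rate {lma:6.1f} fl/min -> DE = {de:.2f}")
```

    In these made-up numbers the ratio falls as the reference flash rate rises, which is the qualitative pattern the abstract reports.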

  2. Construct validity of the individual work performance questionnaire.

    PubMed

    Koopmans, Linda; Bernaards, Claire M; Hildebrandt, Vincent H; de Vet, Henrica C W; van der Beek, Allard J

    2014-03-01

    To examine the construct validity of the Individual Work Performance Questionnaire (IWPQ). A total of 1424 Dutch workers from three occupational sectors (blue, pink, and white collar) participated in the study. First, IWPQ scores were correlated with related constructs (convergent validity). Second, differences between known groups were tested (discriminative validity). First, IWPQ scores correlated weakly to moderately with absolute and relative presenteeism, and work engagement. Second, significant differences in IWPQ scores were observed for workers differing in job satisfaction, and workers differing in health. Overall, the results indicate acceptable construct validity of the IWPQ. Researchers are provided with a reliable and valid instrument to measure individual work performance comprehensively and generically, among workers from different occupational sectors, with and without health problems.

  3. Development and validation study of the Smartphone Overuse Screening Questionnaire.

    PubMed

    Lee, Han-Kyeong; Kim, Ji-Hae; Fava, Maurizio; Mischoulon, David; Park, Jae-Hyun; Shim, Eun-Jung; Lee, Eun-Ho; Lee, Ji Hyeon; Jeon, Hong Jin

    2017-11-01

    The aim of this study was to develop a screening questionnaire that could distinguish individuals at high risk of smartphone overuse from casual users. The reliability, validity, and diagnostic ability of the Smartphone Overuse Screening Questionnaire (SOS-Q) were evaluated. Preliminary items were assessed by 50 addiction experts on-line, and 28 questions were selected. A total of 158 subjects recruited from six community centers for internet addiction participated in this study. The SOS-Q, Young's internet addiction scale, Korean scale for internet addiction, and Smartphone Scale for Smartphone Addiction (S-Scale) were used to assess the concurrent validity. Construct validity was supported by a six-factor model using an exploratory factor analysis. The internal consistency and the item-total correlations were favorable (α = 0.95, r = 0.35-0.81). The test-retest reliability was moderate (r = 0.70). The SOS-Q showed superior concurrent validity with the highest correlation between the S-Scale (r = 0.76). Receiver operating characteristic curve analysis revealed an area under the curve of 0.877. A cut-off point of 49 effectively categorized addiction high-risk group with a sensitivity of 0.81 and specificity of 0.86. Overall, the current study supports the use of SOS-Q as both a primary and supplementary measurement tool in a variety of settings. Copyright © 2017 Elsevier B.V. All rights reserved.
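
    The sensitivity and specificity reported at the cut-off point can be computed from a list of questionnaire scores and reference labels. The sketch below assumes that a score at or above the cut-off is classified as high risk (the abstract does not state the direction); the scores and labels are hypothetical.

```python
def sensitivity_specificity(scores, is_high_risk, cutoff):
    """Return (sensitivity, specificity) for the rule `score >= cutoff`.

    `is_high_risk` holds the reference classification for each score.
    The `>=` direction is an assumption made for this illustration.
    """
    tp = fn = tn = fp = 0
    for score, positive in zip(scores, is_high_risk):
        flagged = score >= cutoff
        if positive and flagged:
            tp += 1
        elif positive:
            fn += 1
        elif flagged:
            fp += 1
        else:
            tn += 1
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("need at least one positive and one negative case")
    return tp / (tp + fn), tn / (tn + fp)


# Hypothetical scores and reference labels; not data from the study.
scores = [30, 42, 49, 55, 60, 47, 51, 70]
labels = [False, False, True, True, True, False, False, True]
sens, spec = sensitivity_specificity(scores, labels, cutoff=49)
print(f"sensitivity = {sens:.2f}, specificity = {spec:.2f}")
```

    Sweeping the cut-off over all observed scores and plotting sensitivity against 1 - specificity would trace the ROC curve mentioned in the abstract.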

  4. Predictive Validity Study of the APS Writing and Reading Tests [and] Validating Placement Rules for the APS Writing Test.

    ERIC Educational Resources Information Center

    College of the Canyons, Valencia, CA. Office of Institutional Development.

    California's College of the Canyons has used the College Board Assessment and Placement Services (APS) test to assess students' abilities in basic and college English since spring 1993. These two reports summarize data from a May 1994 study of the predictive validity of the APS writing and reading tests and a June 1994 effort to validate the cut…

  5. Many participants in inpatient rehabilitation can quantify their exercise dosage accurately: an observational study.

    PubMed

    Scrivener, Katharine; Sherrington, Catherine; Schurr, Karl; Treacy, Daniel

    2011-01-01

    Are inpatients undergoing rehabilitation who appear able to count exercises able to quantify accurately the amount of exercise they undertake? Observational study. Inpatients in an aged care rehabilitation unit and a neurological rehabilitation unit, who appeared able to count their exercises during a 1-2 min observation by their treating physiotherapist. Participants were observed for 30 min by an external observer while they exercised in the physiotherapy gymnasium. Both the participants and the observer counted exercise repetitions with a hand-held tally counter and the two tallies were compared. Of the 60 people admitted for aged care rehabilitation during the study period, 49 (82%) were judged by their treating therapist to be able to count their own exercise repetitions accurately. Of the 30 people admitted for neurological rehabilitation during the study period, 20 (67%) were judged by their treating therapist to be able to count their repetitions accurately. Of the 69 people judged to be accurate, 40 underwent observation while exercising. There was excellent agreement between these participants' counts of their exercise repetitions and the observers' counts, ICC (3,1) of 0.99 (95% CI 0.98 to 0.99). Eleven participants (28%) were in complete agreement with the observer. A further 19 participants (48%) varied from the observer by less than 10%. Therapists were able to identify a group of rehabilitation participants who were accurate in counting their exercise repetitions. Counting of exercise repetitions by therapist-selected patients is a valid means of quantifying exercise dosage during inpatient rehabilitation. Copyright © 2011 Australian Physiotherapy Association. Published by .. All rights reserved.

  6. A Validation Study of Student Differentiation between Computing Disciplines

    ERIC Educational Resources Information Center

    Battig, Michael; Shariq, Muhammad

    2011-01-01

    Using a previously published study of how students differentiate between computing disciplines, this study attempts to validate the original research and add additional hypotheses regarding the type of institution that the student resides. Using the identical survey instrument from the original study, students in smaller colleges and in different…

  7. Strengthening the reliability and credibility of observational epidemiology studies by creating an Observational Studies Register.

    PubMed

    Swaen, Gerard M H; Carmichael, Neil; Doe, John

    2011-05-01

    To evaluate the need for the creation of a system in which observational epidemiology studies are registered; an Observational Studies Register (OSR). The current scientific process for observational epidemiology studies is described. Next, a parallel is made with the clinical trials area, where the creation of clinical trial registers has greatly restored and improved their credibility and reliability. Next, the advantages and disadvantages of an OSR are compared. The advantages of an OSR outweigh its disadvantages. The creation of an OSR, similar to the existing Clinical Trials Registers, will improve the assessment of publication bias and will provide an opportunity to compare the original study protocol with the results reported in the publication. Reliability, credibility, and transparency of observational epidemiology studies are strengthened by the creation of an OSR. We propose a structured, collaborative, and coordinated approach for observational epidemiology studies that can provide solutions for existing weaknesses and will strengthen credibility and reliability, similar to the approach currently used in clinical trials, where Clinical Trials Registers have played a key role in strengthening their scientific value. Copyright © 2011 Elsevier Inc. All rights reserved.

  8. Measuring striving for understanding and learning value of geometry: a validity study

    NASA Astrophysics Data System (ADS)

    Ubuz, Behiye; Aydınyer, Yurdagül

    2017-11-01

    The current study aimed to construct a questionnaire that measures students' personality traits related to striving for understanding and learning value of geometry and then examine its psychometric properties. Through the use of multiple methods on two independent samples of 402 and 521 middle school students, two studies were performed to address this issue to provide support for its validity. In Study 1, exploratory factor analysis indicated the two-factor model. In Study 2, confirmatory factor analysis indicated the better fit of two-factor model compared to one or three-factor model. Convergent and discriminant validity evidence provided insight into the distinctiveness of the two factors. Subgroup validity evidence revealed gender differences for striving for understanding geometry trait favouring girls and grade level differences for learning value of geometry trait favouring the sixth- and seventh-grade students. Predictive validity evidence demonstrated that the striving for understanding geometry trait but not learning value of geometry trait was significantly correlated with prior mathematics achievement. In both studies, each factor and the entire questionnaire showed satisfactory reliability. In conclusion, the questionnaire was psychometrically sound.

  9. Clinical audit project in undergraduate medical education curriculum: an assessment validation study

    PubMed Central

    Steketee, Carole; Mak, Donna

    2016-01-01

    Objectives: To evaluate the merit of the Clinical Audit Project (CAP) in an assessment program for undergraduate medical education using a systematic assessment validation framework. Methods: A cross-sectional assessment validation study at one medical school in Western Australia, with retrospective qualitative analysis of the design, development, implementation and outcomes of the CAP, and quantitative analysis of assessment data from four cohorts of medical students (2011–2014). Results: The CAP is fit for purpose with clear external and internal alignment to expected medical graduate outcomes. Substantive validity in students’ and examiners’ response processes is ensured through relevant methodological and cognitive processes. Multiple validity features are built-in to the design, planning and implementation process of the CAP. There is evidence of high internal consistency reliability of CAP scores (Cronbach’s alpha > 0.8) and inter-examiner consistency reliability (intra-class correlation > 0.7). Aggregation of CAP scores is psychometrically sound, with high internal consistency indicating one common underlying construct. Significant but moderate correlations between CAP scores and scores from other assessment modalities indicate validity of extrapolation and alignment between the CAP and the overall target outcomes of medical graduates. Standard setting, score equating and fair decision rules justify consequential validity of CAP scores interpretation and use. Conclusions: This study provides evidence demonstrating that the CAP is a meaningful and valid component in the assessment program. This systematic framework of validation can be adopted for all levels of assessment in medical education, from individual assessment modality, to the validation of an assessment program as a whole. PMID: 27716612

  10. Clinical audit project in undergraduate medical education curriculum: an assessment validation study.

    PubMed

    Tor, Elina; Steketee, Carole; Mak, Donna

    2016-09-24

    To evaluate the merit of the Clinical Audit Project (CAP) in an assessment program for undergraduate medical education using a systematic assessment validation framework. A cross-sectional assessment validation study at one medical school in Western Australia, with retrospective qualitative analysis of the design, development, implementation and outcomes of the CAP, and quantitative analysis of assessment data from four cohorts of medical students (2011–2014). The CAP is fit for purpose with clear external and internal alignment to expected medical graduate outcomes. Substantive validity in students' and examiners' response processes is ensured through relevant methodological and cognitive processes. Multiple validity features are built-in to the design, planning and implementation process of the CAP. There is evidence of high internal consistency reliability of CAP scores (Cronbach's alpha > 0.8) and inter-examiner consistency reliability (intra-class correlation > 0.7). Aggregation of CAP scores is psychometrically sound, with high internal consistency indicating one common underlying construct. Significant but moderate correlations between CAP scores and scores from other assessment modalities indicate validity of extrapolation and alignment between the CAP and the overall target outcomes of medical graduates. Standard setting, score equating and fair decision rules justify consequential validity of CAP scores interpretation and use. This study provides evidence demonstrating that the CAP is a meaningful and valid component in the assessment program. This systematic framework of validation can be adopted for all levels of assessment in medical education, from individual assessment modality, to the validation of an assessment program as a whole.
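
    The internal consistency statistic cited in this abstract, Cronbach's alpha, follows the standard formula alpha = k/(k-1) * (1 - sum of item variances / variance of total scores). The sketch below implements that textbook formula on hypothetical item scores; it is not the authors' analysis code, and the data are invented for illustration.

```python
import statistics


def cronbach_alpha(items):
    """Cronbach's alpha for `items`, a list of k per-item score lists.

    Each inner list holds one item's scores for the same respondents,
    in the same order. Sample variances (n - 1 denominator) are used.
    """
    k = len(items)
    if k < 2:
        raise ValueError("need at least two items")
    n = len(items[0])
    if any(len(item) != n for item in items):
        raise ValueError("all items need the same number of respondents")
    totals = [sum(scores) for scores in zip(*items)]
    total_variance = statistics.variance(totals)
    if total_variance == 0:
        raise ValueError("total scores have zero variance")
    item_variance_sum = sum(statistics.variance(item) for item in items)
    return (k / (k - 1)) * (1 - item_variance_sum / total_variance)


# Hypothetical scores for three assessment components and five students.
items = [
    [6, 7, 5, 8, 9],
    [5, 7, 6, 8, 9],
    [6, 8, 5, 7, 9],
]
print(f"alpha = {cronbach_alpha(items):.3f}")
```

    Items that move together across respondents push alpha toward 1, which is why a high value is read as evidence of one common underlying construct.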

  11. Development of a novel observational measure for anxiety in young children: The Anxiety Dimensional Observation Scale

    PubMed Central

    Mian, Nicholas D.; Carter, Alice S.; Pine, Daniel S.; Wakschlag, Lauren S.; Briggs-Gowan, Margaret J.

    2015-01-01

    Background: Identifying anxiety disorders in preschool-age children represents an important clinical challenge. Observation is essential to clinical assessment and can help differentiate normative variation from clinically significant anxiety. Yet, most anxiety assessment methods for young children rely on parent-reports. The goal of this article is to present and preliminarily test the reliability and validity of a novel observational paradigm for assessing a range of fearful and anxious behaviors in young children, the Anxiety Dimensional Observation Schedule (Anx-DOS). Methods: A diverse sample of 403 children, aged 3 to 6 years, and their mothers was studied. Reliability and validity in relation to parent reports (Preschool Age Psychiatric Assessment) and known risk factors, including indicators of behavioral inhibition (latency to touch novel objects) and attention bias to threat (in the dot-probe task) were investigated. Results: The Anx-DOS demonstrated good inter-rater reliability and internal consistency. Evidence for convergent validity was demonstrated relative to mother-reported separation anxiety, social anxiety, phobic avoidance, trauma symptoms, and past service use. Finally, fearfulness was associated with observed latency and attention bias toward threat. Conclusions: Findings support the Anx-DOS as a method for capturing early manifestations of fearfulness and anxiety in young children. Multimethod assessments incorporating standardized methods for assessing discrete, observable manifestations of anxiety may be beneficial for early identification and clinical intervention efforts. PMID: 25773515

  12. Development and validation of a notational system to study the offensive process in football.

    PubMed

    Sarmento, Hugo; Anguera, Teresa; Campaniço, Jorge; Leitão, José

    2010-01-01

    The most striking change within football development is the application of science to its problems and in particular the use of increasingly sophisticated technology that, supported by scientific data, allows us to establish a "code of reading" the reality of the game. Therefore, this study describes the process of the development and validation of an ad hoc system of categorization, which allows the different methods of offensive game in football and the interaction to be analyzed. Therefore, through an exploratory phase of the study, we identified 10 vertebrate criteria and the respective behaviors observed for each of these criteria. We heard a panel of five experts with the purpose of a content validation. The resulting instrument is characterized by a combination of field formats and systems of categories. The reliability of the instrument was calculated by the intraobserver agreement, and values above 0.95 for all criteria were achieved. Two FC Barcelona games were coded and analyzed, which allowed the detection of various T-patterns. The results show that the instrument serves the purpose for which it was developed and can provide important information for the understanding of game interaction in football.

  13. Turkish Adaptation of the Mentorship Effectiveness Scale: A Validity and Reliability Study

    ERIC Educational Resources Information Center

    Yirci, Ramazan; Karakose, Turgut; Uygun, Harun; Ozdemir, Tuncay Yavuz

    2016-01-01

    The purpose of this study is to adapt the Mentoring Relationship Effectiveness Scale to Turkish, and to conduct validity and reliability tests regarding the scale. The study group consisted of 156 university science students receiving graduate education. Construct validity and factor structure of the scale was analyzed first through exploratory…

  14. Effect of telomere length on survival in idiopathic pulmonary fibrosis: an observational study with independent validation

    PubMed Central

    Stuart, Bridget D.; Lee, Joyce S.; Kozlitina, Julia; Noth, Imre; Devine, Megan S.; Glazer, Craig S.; Torres, Fernando; Kaza, Vaidehi; Girod, Carlos E.; Jones, Kirk D.; Elicker, Brett M.; Ma, Shwu-Fan; Vij, Rekha; Collard, Harold R.; Wolters, Paul J.; Garcia, Christine Kim

    2014-01-01

    Background: Short telomere lengths are found in a subset of idiopathic pulmonary fibrosis (IPF) patients, but their clinical significance is unknown. The aim of this study was to investigate whether patients with various blood leukocyte telomere lengths had different overall survival. Methods: Telomere lengths were measured in 370 genomic DNA samples isolated from peripheral blood collected from patients with interstitial lung disease (149 with IPF) at the time of their initial evaluation. Associations of telomere length with transplant-free survival were determined. Findings were validated in two independent IPF cohorts. Findings: Patients with IPF had shorter telomere lengths than controls, but similar telomere lengths when compared to patients with other interstitial lung disease diagnoses after adjusting for age, male sex and ethnicity. Telomere length was independently associated with transplant-free survival time for patients with IPF (HR 0·22 [0·08–0·63], P-value = 0·0048), but not for patients with interstitial lung disease diagnoses other than IPF (HR 0·73 [0·16–3·41], P-value = 0·69). The association between telomere length and IPF survival was independent of age, male sex, forced vital capacity or diffusing capacity of carbon monoxide, and was replicated in two independent IPF cohorts (HR 0·11 [0·03–0·39], P-value = 0·00066; HR 0·25 [0·07–0·87], P-value = 0·029). Addition of telomere length to clinical prediction models improved the integrative discrimination index, especially for IPF cohorts with milder disease. Interpretation: These findings suggest that shorter leukocyte telomere lengths are associated with worse survival in IPF. Additional studies will be needed to determine clinically relevant thresholds for telomere length and how this biomarker may influence future risk stratification of IPF patients. Furthermore, this study offers mechanistic insight as disease progression in certain IPF patients may be related to aberrant

  15. Validation of APACHE II scoring system at 24 hours after admission as a prognostic tool in urosepsis: A prospective observational study.

    PubMed

    VijayGanapathy, Sundaramoorthy; Karthikeyan, Vilvapathy Senguttuvan; Sreenivas, Jayaram; Mallya, Ashwin; Keshavamurthy, Ramaiah

    2017-11-01

    Urosepsis implies clinically evident severe infection of urinary tract with features of systemic inflammatory response syndrome (SIRS). We validate the role of a single Acute Physiology and Chronic Health Evaluation II (APACHE II) score at 24 hours after admission in predicting mortality in urosepsis. A prospective observational study was done in 178 patients admitted with urosepsis in the Department of Urology, in a tertiary care institute from January 2015 to August 2016. Patients >18 years diagnosed as urosepsis using SIRS criteria with positive urine or blood culture for bacteria were included. At 24 hours after admission to intensive care unit, APACHE II score was calculated using 12 physiological variables, age and chronic health. Mean±standard deviation (SD) APACHE II score was 26.03±7.03. It was 24.31±6.48 in survivors and 32.39±5.09 in those expired (p<0.001). Among patients undergoing surgery, mean±SD score was higher (30.74±4.85) than among survivors (24.30±6.54) (p<0.001). Receiver operating characteristic (ROC) analysis revealed area under curve (AUC) of 0.825 with cutoff 25.5 being 94.7% sensitive and 56.4% specific to predict mortality. Mean±SD score in those undergoing surgery was 25.22±6.70 and was lesser than those who did not undergo surgery (28.44±7.49) (p=0.007). ROC analysis revealed AUC of 0.760 with cutoff 25.5 being 94.7% sensitive and 45.6% specific to predict mortality even after surgery. A single APACHE II score assessed at 24 hours after admission was able to predict morbidity, mortality, need for surgical intervention, length of hospitalization, treatment success and outcome in urosepsis patients.

  16. Validity and relative validity of a novel digital approach for 24-h dietary recall in athletes.

    PubMed

    Baker, Lindsay B; Heaton, Lisa E; Stein, Kimberly W; Nuccio, Ryan P; Jeukendrup, Asker E

    2014-04-30

    We developed a digital dietary analysis tool for athletes (DATA) using a modified 24-h recall method and an integrated, customized nutrient database. The purpose of this study was to assess DATA's validity and relative validity by measuring its agreement with registered dietitians' (RDs) direct observations (OBSERVATION) and 24-h dietary recall interviews using the USDA 5-step multiple-pass method (INTERVIEW), respectively. Fifty-six athletes (14-20 y) completed DATA and INTERVIEW in randomized counter-balanced order. OBSERVATION (n = 26) consisted of RDs recording participants' food/drink intake in a 24-h period and was completed the day prior to DATA and INTERVIEW. Agreement among methods was estimated using a repeated measures t-test and Bland-Altman analysis. The paired differences (with 95% confidence intervals) between DATA and OBSERVATION were not significant for carbohydrate (10.1%, -1.2-22.7%) and protein (14.1%, -3.2-34.5%) but were significant for energy (14.4%, 1.2-29.3%). There were no differences between DATA and INTERVIEW for energy (-1.1%, -9.1-7.7%), carbohydrate (0.2%, -7.1-8.0%) or protein (-2.7%, -11.3-6.7%). Bland-Altman analysis indicated significant positive correlations between absolute values of the differences and the means for OBSERVATION vs. DATA (r = 0.40 and r = 0.47 for energy and carbohydrate, respectively) and INTERVIEW vs. DATA (r = 0.52, r = 0.29, and r = 0.61 for energy, carbohydrate, and protein, respectively). There were also wide 95% limits of agreement (LOA) for most method comparisons. The mean bias ratio (with 95% LOA) for OBSERVATION vs. DATA was 0.874 (0.551-1.385) for energy, 0.906 (0.522-1.575) for carbohydrate, and 0.895 (0.395-2.031) for protein. The mean bias ratio (with 95% LOA) for INTERVIEW vs. DATA was 1.016 (0.538-1.919) for energy, 0.995 (0.563-1.757) for carbohydrate, and 1.031 (0.514-2.068) for protein. DATA has good relative validity for group-level comparisons in athletes. However, there are large variations
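
The abstract above reports a mean bias ratio with 95% limits of agreement on a ratio scale. One common way to obtain such figures is a Bland-Altman analysis on log-transformed values; the abstract does not state the exact procedure, so the following is only a minimal sketch under that assumption. The function name and the intake values are hypothetical and are not taken from the study.

```python
import math
import statistics


def ratio_limits_of_agreement(method_a, method_b):
    """Return (mean bias ratio, lower LOA, upper LOA) for paired positive values.

    Assumption: agreement is assessed on the log scale and back-transformed,
    so the result is expressed as a ratio method_a / method_b.
    """
    log_diffs = [math.log(a) - math.log(b) for a, b in zip(method_a, method_b)]
    mean_diff = statistics.mean(log_diffs)
    sd_diff = statistics.stdev(log_diffs)  # sample standard deviation
    lower = math.exp(mean_diff - 1.96 * sd_diff)
    upper = math.exp(mean_diff + 1.96 * sd_diff)
    return math.exp(mean_diff), lower, upper


# Hypothetical 24-h energy intakes (kcal) for five participants.
observation = [2100.0, 2500.0, 1800.0, 3000.0, 2750.0]
data_tool = [2300.0, 2600.0, 2200.0, 3100.0, 3300.0]

bias, lower, upper = ratio_limits_of_agreement(observation, data_tool)
print(f"mean bias ratio {bias:.3f} (95% LOA {lower:.3f}-{upper:.3f})")
```

A ratio below 1 here would mean the first method tends to give lower values than the second; wide limits indicate poor agreement for individuals even when the mean bias is small.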

  17. Validation of self-reported anthropometrics in the Adventist Health Study 2

    PubMed Central

    2011-01-01

    Background: Relying on self-reported anthropometric data is often the only feasible way of studying large populations. In this context, there are no studies assessing the validity of anthropometrics in a mostly vegetarian population. The objective of this study was to evaluate the validity of self-reported anthropometrics in the Adventist Health Study 2 (AHS-2). Methods: We selected a representative sample of 911 participants of AHS-2, a cohort of over 96,000 adult Adventists in the USA and Canada. Then we compared their measured weight and height with those self-reported at baseline. We calculated the validity of the anthropometrics as continuous variables, and as categorical variables for the definition of obesity. Results: On average, participants underestimated their weight by 0.20 kg, and overestimated their height by 1.57 cm resulting in underestimation of body mass index (BMI) by 0.61 kg/m². The agreement between self-reported and measured BMI (as a continuous variable), as estimated by intraclass correlation coefficient, was 0.97. The sensitivity of self-reported BMI to detect obesity was 0.81, the specificity 0.97, the predictive positive value 0.93, the predictive negative value 0.92, and the Kappa index 0.81. The percentage of absolute agreement for each category of BMI (normoweight, overweight, and obese) was 83.4%. After multivariate analyses, predictors of differences between self-reported and measured BMI were obesity, soy consumption and the type of dietary pattern. Conclusions: Self-reported anthropometric data showed high validity in a representative subsample of the AHS-2 being valid enough to be used in epidemiological studies, although it can lead to some underestimation of obesity. PMID:21466678

  18. Validation of self-reported anthropometrics in the Adventist Health Study 2.

    PubMed

    Bes-Rastrollo, Maira; Sabaté, Joan; Jaceldo-Siegl, Karen; Fraser, Gary E

    2011-04-05

    Relying on self-reported anthropometric data is often the only feasible way of studying large populations. In this context, there are no studies assessing the validity of anthropometrics in a mostly vegetarian population. The objective of this study was to evaluate the validity of self-reported anthropometrics in the Adventist Health Study 2 (AHS-2). We selected a representative sample of 911 participants of AHS-2, a cohort of over 96,000 adult Adventists in the USA and Canada. Then we compared their measured weight and height with those self-reported at baseline. We calculated the validity of the anthropometrics as continuous variables, and as categorical variables for the definition of obesity. On average, participants underestimated their weight by 0.20 kg, and overestimated their height by 1.57 cm resulting in underestimation of body mass index (BMI) by 0.61 kg/m². The agreement between self-reported and measured BMI (as a continuous variable), as estimated by intraclass correlation coefficient, was 0.97. The sensitivity of self-reported BMI to detect obesity was 0.81, the specificity 0.97, the predictive positive value 0.93, the predictive negative value 0.92, and the Kappa index 0.81. The percentage of absolute agreement for each category of BMI (normoweight, overweight, and obese) was 83.4%. After multivariate analyses, predictors of differences between self-reported and measured BMI were obesity, soy consumption and the type of dietary pattern. Self-reported anthropometric data showed high validity in a representative subsample of the AHS-2 being valid enough to be used in epidemiological studies, although it can lead to some underestimation of obesity.
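
The sensitivity, specificity, predictive values, and kappa quoted in the abstract above all derive from a 2x2 table of self-reported versus measured obesity status. As an illustration only, the sketch below computes them from cell counts; the function name and the counts are hypothetical and are not the study's data.

```python
def agreement_metrics(tp, fp, fn, tn):
    """Compute agreement statistics from a 2x2 table.

    tp: classified obese by both self-report and measurement
    fp: obese by self-report only
    fn: obese by measurement only
    tn: not obese by either
    """
    n = tp + fp + fn + tn
    observed = (tp + tn) / n
    # Chance agreement expected from the row and column marginals.
    expected = ((tp + fp) * (tp + fn) + (fn + tn) * (fp + tn)) / (n * n)
    return {
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
        "ppv": tp / (tp + fp),
        "npv": tn / (tn + fn),
        "kappa": (observed - expected) / (1 - expected),
    }


# Hypothetical counts, chosen only to make the arithmetic easy to follow.
metrics = agreement_metrics(tp=40, fp=5, fn=10, tn=45)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

Sensitivity below specificity, as in the abstract, corresponds to more false negatives than false positives, which is consistent with self-report underestimating obesity.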

  19. Validation of the use of the Critical-Care Pain Observation Tool (CPOT) with brain surgery patients in the neurosurgical intensive care unit.

    PubMed

    Echegaray-Benites, Christine; Kapoustina, Oxana; Gélinas, Céline

    2014-10-01

    Many critically ill patients are unable to self-report their pain. In such situations, the use of valid behavioral pain scales is recommended. To validate the use of the Critical-Care Pain Observation Tool (CPOT) with brain surgery adults in the neurosurgical intensive care unit. Repeated-measure within subject prospective design. Forty-three elective brain surgery patients of a Canadian university hospital participated. Participants were video recorded and scored with the CPOT before, during and after a non-nociceptive (non-invasive blood pressure using cuff inflation) and a nociceptive (turning) procedure for a total of six assessments. Self-reports of pain were also obtained. Discriminant validation was supported with higher mean CPOT scores during the nociceptive procedure compared with the non-nociceptive one. More participants reported higher pain intensity during turning compared with cuff inflation. Criterion validation was supported with a moderate positive correlation between self-reports of pain intensity and CPOT scores during turning. Interrater and intrarater reliability of CPOT scores through the viewing of participants' videos by two trained raters was supported with high Intraclass Correlation Coefficients. The CPOT appears to be valid for the detection of pain in elective brain surgery patients in the neurosurgical intensive care unit. Copyright © 2014. Published by Elsevier Ltd.

  20. A conceptual framework for evaluating data suitability for observational studies.

    PubMed

    Shang, Ning; Weng, Chunhua; Hripcsak, George

    2017-09-08

    To contribute a conceptual framework for evaluating data suitability to satisfy the research needs of observational studies. Suitability considerations were derived from a systematic literature review on researchers' common data needs in observational studies and a scoping review on frequent clinical database design considerations, and were harmonized to construct a suitability conceptual framework using a bottom-up approach. The relationships among the suitability categories are explored from the perspective of 4 facets of data: intrinsic, contextual, representational, and accessible. A web-based national survey of domain experts was conducted to validate the framework. Data suitability for observational studies hinges on the following key categories: Explicitness of Policy and Data Governance, Relevance, Availability of Descriptive Metadata and Provenance Documentation, Usability, and Quality. We describe 16 measures and 33 sub-measures. The survey uncovered the relevance of all categories, with a 5-point Likert importance score of 3.9 ± 1.0 for Explicitness of Policy and Data Governance, 4.1 ± 1.0 for Relevance, 3.9 ± 0.9 for Availability of Descriptive Metadata and Provenance Documentation, 4.2 ± 1.0 for Usability, and 4.0 ± 0.9 for Quality. The suitability framework evaluates a clinical data source's fitness for research use. Its construction reflects both researchers' points of view and data custodians' design features. The feedback from domain experts rated Usability, Relevance, and Quality categories as the most important considerations. © The Author 2017. Published by Oxford University Press on behalf of the American Medical Informatics Association.

  1. Functional outcomes in ICU – what should we be using? – an observational study.

    PubMed

    Parry, Selina M; Denehy, Linda; Beach, Lisa J; Berney, Sue; Williamson, Hannah C; Granger, Catherine L

    2015-03-29

    With growing awareness of the importance of rehabilitation, new measures are being developed specifically for use in the intensive care unit (ICU). There are currently 26 measures reported to assess function in ICU survivors. The Physical Function in Intensive care Test scored (PFIT-s) has established clinimetric properties. It is unknown how other functional measures perform in comparison to the PFIT-s or which functional measure may be the most clinically applicable for use within the ICU. The aims of this study were to determine (1) the criterion validity of the Functional Status Score for the ICU (FSS-ICU), ICU Mobility Scale (IMS) and Short Physical Performance Battery (SPPB) against the PFIT-s; (2) the construct validity of these tests against muscle strength; (3) predictive utility of these tests to predict discharge to home; and (4) the clinical applicability. This was a nested study within an ongoing controlled study and an observational study. Sixty-six individuals were assessed at awakening and ICU discharge. Measures included: PFIT-s, FSS-ICU, IMS and SPPB. Bivariate relationships (Spearman's rank correlation coefficient) and predictive validity (logistic regression) were determined. Responsiveness (effect sizes); floor and ceiling effects; and minimal important differences were calculated. Mean ± SD PFIT-s at awakening was 4.7 ± 2.3 out of 10. On awakening a large positive relationship existed between PFIT-s and the other functional measures: FSS-ICU (rho = 0.87, p < 0.005), IMS (rho = 0.81, p < 0.005) and SPPB (rho = 0.70, p < 0.005). The PFIT-s had excellent construct validity (rho = 0.8, p < 0.005) and FSS-ICU (rho = 0.69, p < 0.005) and IMS (rho = 0.57, p < 0.005) had moderate construct validity with muscle strength. The PFIT-s and FSS-ICU had small floor/ceiling effects <11% at awakening and ICU discharge. The SPPB had a large floor effect at awakening (78%) and ICU discharge (56%). All

  2. Prospective study of recovery from copperhead snake envenomation: an observational study.

    PubMed

    Lavonas, Eric J; Gerardo, Charles J

    2015-05-15

    Although much is known about signs, symptoms, and management in the acute phase of crotaline snake envenomation, little is known about signs, symptoms, function, and quality of life during the recovery phase. The purpose of this observational pilot investigation is to evaluate the utility of several clinical outcome instruments in the setting of copperhead snakebite, and to characterize the clinical course of recovery. This is a multi-center prospective, open-label, observational study of patients envenomated by copperhead snakes. We administered the Disabilities of the Arm, Shoulder, and Hand (DASH), Lower Extremity Functional Scale (LEFS), Patient-Specific Functional Scale (PSFS), Work Productivity and Ability Impairment: Special Health Problem (WPAI: SHP), Patients' Global Impression of Change (PGIC), Patient's Global Assessment of Recovery (PGAR), and SF-36 instruments, obtained numeric pain rating scales, and measured grip strength, walking speed, and swelling prior to hospital discharge and 3, 7, 14, 21, and 28 days after envenomation. 20 subjects were enrolled; none were lost to follow-up. Most (80%) had moderate severity swelling, and most (75%) received antivenom. Across the broad range of measures, abnormalities of pain, swelling, impairments of physical and role function, and quality of life persisted for 7-14 days in most subjects. Validated self-reported outcome measures, such as the DASH, LEFS, PSFS, PGIC, SF-36, and the daily activities impairment portion of the WPAI: SHP were more responsive than measurements of swelling or walking speed. Data quality issues limited the utility of the work impairment portion of the WPAI: SHP. Residual signs, symptoms, and impairment in some subjects lasted through the 28-day study period. The study design precluded any assessment of the effectiveness of antivenom. Signs, symptoms, impaired function, and decreased quality of life typically last 7 - 14 days after copperhead envenomation. Several tools appear

  3. Validity and reliability of the Paprosky acetabular defect classification.

    PubMed

    Yu, Raymond; Hofstaetter, Jochen G; Sullivan, Thomas; Costi, Kerry; Howie, Donald W; Solomon, Lucian B

    2013-07-01

    The Paprosky acetabular defect classification is widely used but has not been appropriately validated. Reliability of the Paprosky system has not been evaluated in combination with standardized techniques of measurement and scoring. This study evaluated the reliability, teachability, and validity of the Paprosky acetabular defect classification. Preoperative radiographs from a random sample of 83 patients undergoing 85 acetabular revisions were classified by four observers, and their classifications were compared with quantitative intraoperative measurements. Teachability of the classification scheme was tested by dividing the four observers into two groups. The observers in Group 1 underwent three teaching sessions; those in Group 2 underwent one session and the influence of teaching on the accuracy of their classifications was ascertained. Radiographic evaluation showed statistically significant relationships with intraoperative measurements of anterior, medial, and superior acetabular defect sizes. Interobserver reliability improved substantially after teaching and did not improve without it. The weighted kappa coefficient went from 0.56 at Occasion 1 to 0.79 after three teaching sessions in Group 1 observers, and from 0.49 to 0.65 after one teaching session in Group 2 observers. The Paprosky system is valid and shows good reliability when combined with standardized definitions of radiographic landmarks and a structured analysis. Level II, diagnostic study.

  4. Bayesian data analysis in observational comparative effectiveness research: rationale and examples.

    PubMed

    Olson, William H; Crivera, Concetta; Ma, Yi-Wen; Panish, Jessica; Mao, Lian; Lynch, Scott M

    2013-11-01

    Many comparative effectiveness research and patient-centered outcomes research studies will need to be observational for one or both of two reasons: first, randomized trials are expensive and time-consuming; and second, only observational studies can answer some research questions. It is generally recognized that there is a need to increase the scientific validity and efficiency of observational studies. Bayesian methods for the design and analysis of observational studies are scientifically valid and offer many advantages over frequentist methods, including, importantly, the ability to conduct comparative effectiveness research/patient-centered outcomes research more efficiently. Bayesian data analysis is being introduced into outcomes studies that we are conducting. Our purpose here is to describe our view of some of the advantages of Bayesian methods for observational studies and to illustrate both realized and potential advantages by describing studies we are conducting in which various Bayesian methods have been or could be implemented.

  5. Validity and reliability of the Japanese version of the Newest Vital Sign: a preliminary study.

    PubMed

    Kogure, Takamichi; Sumitani, Masahiko; Suka, Machi; Ishikawa, Hirono; Odajima, Takeshi; Igarashi, Ataru; Kusama, Makiko; Okamoto, Masako; Sugimori, Hiroki; Kawahara, Kazuo

    2014-01-01

    Health literacy (HL) refers to the ability to obtain, process, and understand basic health information and services, and is thus needed to make appropriate health decisions. The Newest Vital Sign (NVS) is comprised of 6 questions about an ice cream nutrition label and assesses HL numeracy skills. We developed a Japanese version of the NVS (NVS-J) and evaluated the validity and reliability of the NVS-J in patients with chronic pain. The translation of the original NVS into Japanese was achieved as per the published guidelines. An observational study was subsequently performed to evaluate the validity and reliability of the NVS-J in 43 Japanese patients suffering from chronic pain. Factor analysis with promax rotation, using the Kaiser criterion (eigenvalues ≥1.0), and a scree plot revealed that the main component of the NVS-J consists of three determinative factors, and each factor consists of two NVS-J items. The criterion-related validity of the total NVS-J score was significantly correlated with the total score of Ishikawa et al.'s self-rated HL Questionnaire, the clinical global assessment of comprehensive HL level, cognitive function, and the Brinkman index. In addition, Cronbach's coefficient for the total score of the NVS-J was adequate (alpha = 0.72). This study demonstrated that the NVS-J has good validity and reliability. Further, the NVS-J consists of three determinative factors: "basic numeracy ability," "complex numeracy ability," and "serious-minded ability." These three HL abilities comprise a 3-step hierarchical structure. Adequate HL should be promoted in chronic pain patients to enable coping, improve functioning, and increase activities of daily living (ADLs) and quality of life (QOL).
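
The internal-consistency figure reported above (alpha = 0.72) is Cronbach's alpha. As a hedged illustration of how that coefficient is computed from item-level responses, the sketch below uses the standard formula with sample variances; the function name and the six-item binary responses are hypothetical and do not come from the NVS-J study.

```python
import statistics


def cronbach_alpha(responses):
    """Cronbach's alpha for a list of respondents, each a list of item scores.

    alpha = k / (k - 1) * (1 - sum(item variances) / variance of total scores)
    """
    k = len(responses[0])
    item_vars = [
        statistics.variance([row[i] for row in responses]) for i in range(k)
    ]
    total_var = statistics.variance([sum(row) for row in responses])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


# Hypothetical responses: 6 items scored 0 (incorrect) or 1 (correct).
responses = [
    [1, 1, 1, 1, 0, 1],
    [1, 0, 1, 0, 0, 0],
    [0, 0, 0, 0, 0, 0],
    [1, 1, 1, 1, 1, 1],
    [1, 1, 0, 1, 0, 0],
    [0, 1, 0, 0, 0, 0],
]
print(f"alpha = {cronbach_alpha(responses):.2f}")
```

Alpha rises as items covary more strongly relative to their individual variances; it reaches 1 only when all items rank respondents identically.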

  6. Validation of Atmosphere/Ionosphere Signals Associated with Major Earthquakes by Multi-Instrument Space-Borne and Ground Observations

    NASA Technical Reports Server (NTRS)

    Ouzounov, Dimitar; Pulinets, Sergey; Hattori, Katsumi; Parrot, Michel; Liu, J. Y.; Yang, T. F.; Arellano-Baeza, Alonso; Kafatos, M.; Taylor, Patrick

    2012-01-01

    The latest catastrophic earthquake in Japan (March 2011) has renewed interest in the important question of the existence of pre-earthquake anomalous signals related to strong earthquakes. Recent studies have shown that there were precursory atmospheric/ionospheric signals observed in space associated with major earthquakes. The critical question, still widely debated in the scientific community, is whether such ionospheric/atmospheric signals systematically precede large earthquakes. To address this problem we have started to investigate anomalous ionospheric / atmospheric signals occurring prior to large earthquakes. We are studying the Earth's atmospheric electromagnetic environment by developing a multisensor model for monitoring the signals related to active tectonic faulting and earthquake processes. The integrated satellite and terrestrial framework (ISTF) is our method for validation and is based on a joint analysis of several physical and environmental parameters (thermal infrared radiation, electron concentration in the ionosphere, lineament analysis, radon/ion activities, air temperature and seismicity) that were found to be associated with earthquakes. A physical link between these parameters and earthquake processes has been provided by the recent version of Lithosphere-Atmosphere-Ionosphere Coupling (LAIC) model. Our experimental measurements have supported the new theoretical estimates of LAIC hypothesis for an increase in the surface latent heat flux, integrated variability of outgoing long wave radiation (OLR) and anomalous variations of the total electron content (TEC) registered over the epicenters. Some of the major earthquakes are accompanied by an intensification of gas migration to the surface, thermodynamic and hydrodynamic processes of transformation of latent heat into thermal energy and with vertical transport of charged aerosols in the lower atmosphere. These processes lead to the generation of external electric currents in specific

  7. Validation of Model Output versus ADCP Observations on the PR Insular Shelf, Part 2: Are all Sites the Same?

    NASA Astrophysics Data System (ADS)

    Ramos Valle, A.

    2016-02-01

    We have previously compared the output from three oceanographic models against observed data from an ADCP at a common grid point location on the zonally oriented, southwestern Puerto Rico shelf that extends into the northern Caribbean Sea. The three models were: 1) AMSEAS (NCOM), 2) Regional ROMS and 3) a higher resolution version of ROMS nested within the Regional ROMS. These models faced great difficulty in accurately depicting the bathymetry of the ocean in the PR-USVI archipelago which is characterized by small islands, narrow insular shelves, steep slopes and deep water beyond. The resulting validations of the three models versus the ADCP at the selected location were poor. However, the insight we gained into the behavior of the models during the validation process suggested that models might do a better job at simulating currents across the inter-island straits that connect the Atlantic Ocean with the Caribbean Sea than along the insular Caribbean or Atlantic coastlines. We therefore focused our attention on expanding our previous research by performing a similar analysis using the ROMS model against ADCP observations in the Mona Passage, west of PR. This new ADCP location exhibits bathymetric features that are smoother, less complex, and better represented in the Regional ROMS model while flows at the site are stronger than at the previous ADCP site at La Parguera. Statistical time-series analyses are performed on model and ADCP flow velocity time series to quantify the model's skill. Results indicate that ROMS does a much better job at simulating ocean currents at the Mona Passage site than at La Parguera. Dynamical and numerical differences that might explain the spatially varying model skill are considered. In summary: model skill validation sites around PR are not all the same.

  8. Assessment of Developing Intensity Duration Frequency Curves using Satellite Observations (Case Study)

    NASA Astrophysics Data System (ADS)

    Ombadi, Mohammed; Nguyen, Phu; Sorooshian, Soroosh

    2017-12-01

    Intensity Duration Frequency (IDF) curves are essential for the resilient design of infrastructures. Since their earlier development, IDF relationships have been derived using precipitation records from rainfall gauge stations. However, with the recent advancement in satellite observation of precipitation which provides near global coverage and high spatiotemporal resolution, it is worthy of attention to investigate the validity of utilizing the relatively short record length of satellite rainfall to generate robust IDF relationships. These satellite-based IDF can address the paucity of such information in the developing countries. Few studies have used satellite precipitation data in IDF development but mainly focused on merging satellite and gauge precipitation. In this study, however, IDF have been derived solely from satellite observations using PERSIANN-CDR (Precipitation Estimation from Remotely Sensed Information Using Artificial Neural Networks-Climate Data Record). The unique PERSIANN-CDR attributes of high spatial resolution (0.25°×0.25°), daily temporal resolution and a record dating back to 1983 allow for the investigation at fine resolution. The results are compared over most of the contiguous United States against NOAA Atlas 14. The impact of using different methods of sampling, distribution estimators and regionalization in the resulting relationships is investigated. Main challenges to estimate robust and accurate IDF from satellite observations are also highlighted.

  9. The GRACE checklist for rating the quality of observational studies of comparative effectiveness: a tale of hope and caution.

    PubMed

    Dreyer, Nancy A; Velentgas, Priscilla; Westrich, Kimberly; Dubois, Robert

    2014-03-01

    While there is growing demand for information about comparative effectiveness (CE), there is substantial debate about whether and when observational studies have sufficient quality to support decision making. To develop and test an item checklist that can be used to qualify those observational CE studies sufficiently rigorous in design and execution to contribute meaningfully to the evidence base for decision support. An 11-item checklist about data and methods (the GRACE checklist) was developed through literature review and consultation with experts from professional societies, payer groups, the private sector, and academia. Since no single gold standard exists for validation, checklist item responses were compared with 3 different types of external quality ratings (N=88 articles). The articles compared treatment effectiveness and/or safety of drugs, medical devices, and medical procedures. We validated checklist item responses 3 ways against external quality ratings, using published articles of observational CE or safety studies: (a) Systematic Review-quality assessment from a published systematic review; (b) Single Expert Review-quality assessment made according to the solicited "expert opinion" of a senior researcher; and (c) Concordant Expert Review-quality assessments from 2 experts for which there was concordance. Volunteers (N=113) from 5 continents completed 280 article assessments using the checklist. Positive and negative predictive values (PPV, NPV, respectively) of individual items were estimated to compare testers' assessments with those of experts. Taken as a whole, the scale had better NPV than PPV, for both data and methods. The most consistent predictor of quality relates to the validity of the primary outcomes measurement for the study purpose. Other consistent markers of quality relate to using concurrent comparators, minimizing the effects of bias by prudent choice of covariates, and using sensitivity analysis to test robustness of results

  10. An Experimental Study of Characteristic Combustion-Driven Flow for CFD Validation

    NASA Technical Reports Server (NTRS)

    Santoro, Robert J.

    1997-01-01

    A series of uni-element rocket injector studies were completed to provide benchmark quality data needed to validate computational fluid dynamic models. A shear coaxial injector geometry was selected as the primary injector for study using gaseous hydrogen/oxygen and gaseous hydrogen/liquid oxygen propellants. Emphasis was placed on the use of nonintrusive diagnostic techniques to characterize the flowfields inside an optically-accessible rocket chamber. Measurements of the velocity and species fields were obtained using laser velocimetry and Raman spectroscopy, respectively. Qualitative flame shape information was also obtained using laser-induced fluorescence excited from OH radicals and laser light scattering studies of aluminum oxide particle seeded combusting flows. The gaseous hydrogen/liquid oxygen propellant studies for the shear coaxial injector focused on breakup mechanisms associated with the liquid oxygen jet under subcritical pressure conditions. Laser sheet illumination techniques were used to visualize the core region of the jet and a Phase Doppler Particle Analyzer was utilized for drop velocity, size and size distribution characterization. The results of these studies indicated that the shear coaxial geometry configuration was a relatively poor injector in terms of mixing. The oxygen core was observed to extend well downstream of the injector and a significant fraction of the mixing occurred in the near nozzle region where measurements were not possible to obtain. Detailed velocity and species measurements were obtained to allow CFD model validation and this set of benchmark data represents the most comprehensive data set available to date. As an extension of the investigation, a series of gas/gas injector studies were conducted in support of the X-33 Reusable Launch Vehicle program. A Gas/Gas Injector Technology team was formed consisting of the Marshall Space Flight Center, the NASA Lewis Research Center, Rocketdyne and Penn State. Injector

  11. An Experimental Study of Characteristic Combustion-Driven Flow for CFD Validation

    NASA Technical Reports Server (NTRS)

    Santoro, Robert J.

    1997-01-01

    A series of uni-element rocket injector studies were completed to provide benchmark quality data needed to validate computational fluid dynamic models. A shear coaxial injector geometry was selected as the primary injector for study using gaseous hydrogen/oxygen and gaseous hydrogen/liquid oxygen propellants. Emphasis was placed on the use of non-intrusive diagnostic techniques to characterize the flowfields inside an optically-accessible rocket chamber. Measurements of the velocity and species fields were obtained using laser velocimetry and Raman spectroscopy, respectively. Qualitative flame shape information was also obtained using laser-induced fluorescence excited from OH radicals and laser light scattering studies of aluminum oxide particle seeded combusting flows. The gaseous hydrogen/liquid oxygen propellant studies for the shear coaxial injector focused on breakup mechanisms associated with the liquid oxygen jet under sub-critical pressure conditions. Laser sheet illumination techniques were used to visualize the core region of the jet and a Phase Doppler Particle Analyzer was utilized for drop velocity, size and size distribution characterization. The results of these studies indicated that the shear coaxial geometry configuration was a relatively poor injector in terms of mixing. The oxygen core was observed to extend well downstream of the injector and a significant fraction of the mixing occurred in the near nozzle region where measurements were not possible to obtain. Detailed velocity and species measurements were obtained to allow CFD model validation and this set of benchmark data represents the most comprehensive data set available to date. As an extension of the investigation, a series of gas/gas injector studies were conducted in support of the X-33 Reusable Launch Vehicle program. A Gas/Gas Injector Technology team was formed consisting of the Marshall Space Flight Center, the NASA Lewis Research Center, Rocketdyne and Penn State. Injector

  12. Validating an Elicited Imitation Task as a Measure of Implicit Knowledge: Comparisons with Other Validation Studies

    ERIC Educational Resources Information Center

    Spada, Nina; Shiu, Julie Li-Ju; Tomita, Yasuyo

    2015-01-01

    This study builds on research investigating the construct validity of elicited imitation (EI) as a measure of implicit second language (L2) grammatical knowledge. It differs from previous studies in that the EI task focuses on a single grammatical feature and time on task is strictly controlled. Seventy-three EFL learners and 20 native English…

  13. The Staff Observation Aggression Scale - Revised (SOAS-R) - adjustment and validation for emergency primary health care.

    PubMed

    Morken, Tone; Baste, Valborg; Johnsen, Grethe E; Rypdal, Knut; Palmstierna, Tom; Johansen, Ingrid Hjulstad

    2018-05-08

    Many emergency primary health care workers experience aggressive behaviour from patients or visitors. Simple incident-reporting procedures exist for inpatient psychiatric care, but a similarly simple incident report for other health care settings is lacking. The aim was to adjust a pre-existing form for reporting aggressive incidents in a psychiatric inpatient setting to the emergency primary health care settings. We also wanted to assess the validity of the severity scores in emergency primary health care. The Staff Observation Aggression Scale - Revised (SOAS-R) was adjusted to create a pilot version of the Staff Observation Aggression Scale - Revised Emergency (SOAS-RE). A Visual Analogue Scale (VAS) was added to the form to judge the severity of the incident. Data for validation of the pilot version of SOAS-RE were collected from ten casualty clinics in Norway over 12 months. Variance analysis was used to test gender and age differences. Linear regression analysis was performed to evaluate the relative impact that each of the five SOAS-RE columns had on the VAS score. The association between SOAS-RE severity score and VAS severity score was calculated by the Pearson correlation coefficient. The SOAS-R was adjusted to emergency primary health care, refined and called The Staff Observation Aggression Scale - Revised Emergency (SOAS-RE). A total of 350 SOAS-RE forms were collected from the casualty clinics, but due to missing data, 291 forms were included in the analysis. SOAS-RE scores ranged from 1 to 22. The mean total severity score of SOAS-RE was 10.0 (standard deviation (SD) = 4.1) and the mean VAS score was 45.4 (SD = 26.7). We found a significant correlation of 0.45 between the SOAS-RE total severity scores and the VAS severity ratings. The linear regression analysis showed that individually each of the categories, which described the incident, had a low impact on the VAS score. The SOAS-RE seems to be a useful instrument for research, incident-recording and management

  14. Assessing Meritorious Teacher Performance: A Differential Validity Study.

    ERIC Educational Resources Information Center

    Ellett, Chad D; Capie, William

    The Teacher Assessment and Development System (TADS) - Meritorious Teacher Program (MTP) FORM instrument is used in the Dade County Public Schools, Miami, Florida, to evaluate teachers. Its validity for decisions concerning merit pay for master teachers was examined in this study. Specifically, its ability to discriminate between high performing…

  15. Observing Parent Behavior: Reconciling Theoretical Concepts with Empirical Reality.

    ERIC Educational Resources Information Center

    Ge, Xiaojia

    Using data from the Iowa Youth and Families Project, this longitudinal study investigated the predictive validity of different dimensions of observed parent behavior on adolescent externalizing (aggression, hostility) and internalizing (depression, anxiety) problems over a 2-year period. In addition, the study examined how observer ratings…

  16. Global precipitation measurements for validating climate models

    NASA Astrophysics Data System (ADS)

    Tapiador, F. J.; Navarro, A.; Levizzani, V.; García-Ortega, E.; Huffman, G. J.; Kidd, C.; Kucera, P. A.; Kummerow, C. D.; Masunaga, H.; Petersen, W. A.; Roca, R.; Sánchez, J.-L.; Tao, W.-K.; Turk, F. J.

    2017-11-01

    The advent of global precipitation data sets with increasing temporal span has made it possible to use them for validating climate models. In order to fulfill the requirement of global coverage, existing products integrate satellite-derived retrievals from many sensors with direct ground observations (gauges, disdrometers, radars), which are used as reference for the satellites. While the resulting product can be deemed the best-available source of quality validation data, awareness of the limitations of such data sets is important to avoid drawing wrong or unsubstantiated conclusions when assessing climate model abilities. This paper provides guidance on the use of precipitation data sets for climate research, including model validation and verification for improving physical parameterizations. The strengths and limitations of the data sets for climate modeling applications are presented, and a protocol for quality assurance of both observational databases and models is discussed. The paper helps elaborate on the recent IPCC AR5 acknowledgment of large observational uncertainties in precipitation observations for climate model validation.

  17. Study on the Validity and Reliability of Melbourne Decision Making Scale in Turkey

    ERIC Educational Resources Information Center

    Çolakkadioglu, Oguzhan; Deniz, M. Engin

    2015-01-01

    This study analyzes the validity and reliability of the Melbourne Decision Making Questionnaire (MDMQ). The sample consisted of 650 university students. The structural validity of the MDMQ, as well as correlations among its sub-scales, measure-bound validity, internal consistency, item total correlations and test-retest reliability coefficients…

  18. Human Rights Attitude Scale: A Validity and Reliability Study

    ERIC Educational Resources Information Center

    Ercan, Recep; Yaman, Tugba; Demir, Selcuk Besir

    2015-01-01

    The objective of this study is to develop a valid and reliable attitude scale having quality psychometric features that can measure secondary school students' attitudes towards human rights. The study group comprises 710 6th, 7th and 8th grade students who study at 4 secondary schools in the centre of Sivas. The study group…

  19. 29 CFR 1607.5 - General standards for validity studies.

    Code of Federal Regulations, 2014 CFR

    2014-07-01

    ... experience on the job. J. Interim use of selection procedures. Users may continue the use of a selection... studies. A. Acceptable types of validity studies. For the purposes of satisfying these guidelines, users... which has an adverse impact and which selection procedure has an adverse impact, each user should...

  20. 29 CFR 1607.5 - General standards for validity studies.

    Code of Federal Regulations, 2013 CFR

    2013-07-01

    ... experience on the job. J. Interim use of selection procedures. Users may continue the use of a selection... studies. A. Acceptable types of validity studies. For the purposes of satisfying these guidelines, users... which has an adverse impact and which selection procedure has an adverse impact, each user should...

  1. 29 CFR 1607.5 - General standards for validity studies.

    Code of Federal Regulations, 2012 CFR

    2012-07-01

    ... experience on the job. J. Interim use of selection procedures. Users may continue the use of a selection... studies. A. Acceptable types of validity studies. For the purposes of satisfying these guidelines, users... which has an adverse impact and which selection procedure has an adverse impact, each user should...

  2. Self-reported eating rate is associated with weight status in a Dutch population: a validation study and a cross-sectional study.

    PubMed

    van den Boer, Janet H W; Kranendonk, Jentina; van de Wiel, Anne; Feskens, Edith J M; Geelen, Anouk; Mars, Monica

    2017-09-08

    Observational studies performed in Asian populations suggest that eating rate is related to BMI. This paper investigates the association between self-reported eating rate (SRER) and body mass index (BMI) in a Dutch population, after having validated SRER against actual eating rate. Two studies were performed; a validation and a cross-sectional study. In the validation study SRER (i.e., 'slow', 'average', or 'fast') was obtained from 57 participants (men/women = 16/41, age: mean ± SD = 22.6 ± 2.8 yrs., BMI: mean ± SD = 22.1 ± 2.8 kg/m²) and in these participants actual eating rate was measured for three food products. Using analysis of variance the association between SRER and actual eating rate was studied. The association between SRER and BMI was investigated in cross-sectional data from the NQplus cohort (i.e., 1473 Dutch adults; men/women = 741/732, age: mean ± SD = 54.6 ± 11.7 yrs., BMI: mean ± SD = 25.9 ± 4.0 kg/m²) using (multiple) linear regression analysis. In the validation study actual eating rate increased proportionally with SRER (for all three food products P < 0.01). In the cross-sectional study SRER was positively associated with BMI in both men and women (P = 0.03 and P < 0.001, respectively). Self-reported fast-eating women had a 1.13 kg/m² (95% CI 0.43, 1.84) higher BMI compared to average-speed-eating women, after adjusting for confounders. This was not the case in men; self-reported fast-eating men had a 0.29 kg/m² (95% CI -0.22, 0.80) higher BMI compared to average-speed-eating men, after adjusting for confounders. These studies show that self-reported eating rate reflects actual eating rate on a group-level, and that a high self-reported eating rate is associated with a higher BMI in this Dutch population.

  3. Using the Autism Diagnostic Interview-Revised and the Autism Diagnostic Observation Schedule with Young Children with Developmental Delay: Evaluating Diagnostic Validity

    ERIC Educational Resources Information Center

    Gray, Kylie M.; Tonge, Bruce J.; Sweeney, Deborah J.

    2008-01-01

    Few studies have focused on the validity of the ADI-R and ADOS in the assessment of preschool children with developmental delay. This study aimed to evaluate the diagnostic validity of the ADI-R and the ADOS in young children. Two-hundred and nine children aged 20-55 months participated in the study, 120 of whom received a diagnosis of autism.…

  4. Understanding Foreign Language Learning Strategies: A Validation Study

    ERIC Educational Resources Information Center

    Tragant, Elsa; Thompson, Marilyn S.; Victori, Mia

    2013-01-01

    The present work aims to contribute to our understanding of the underlying dimensions of language learning strategies in foreign language contexts. The study analyzes alternative factor structures underlying a recently developed instrument (Tragant and Victori, 2012) and it includes the age factor in the examination of its construct validity. The…

  5. Teachers' Engagement at Work: An International Validation Study

    ERIC Educational Resources Information Center

    Klassen, Robert M.; Aldhafri, Said; Mansfield, Caroline F.; Purwanto, Edy; Siu, Angela F. Y.; Wong, Marina W.; Woods-McConney, Amanda

    2012-01-01

    This study explored the validity of the Utrecht Work Engagement Scale in a sample of 853 practicing teachers from Australia, Canada, China (Hong Kong), Indonesia, and Oman. The authors used multigroup confirmatory factor analysis to test the factor structure and measurement invariance across settings, after which they examined the relationships…

  6. A Validation Study of the Existential Anxiety Scale.

    ERIC Educational Resources Information Center

    Hullett, Michael A.

    Logotherapy is a meaning-centered psychotherapy which focuses on both the meaning of human existence and the personal search for meaning. If the will to search for meaning is frustrated, "existential frustration" may result. This study validates the Existential Anxiety Scale (EAS) developed by Good and Good (1974). Basic principles of…

  7. Validation Study of a Gatekeeping Attitude Index for Social Work Education

    ERIC Educational Resources Information Center

    Tam, Dora M. Y.; Coleman, Heather

    2011-01-01

    This article reports on a study designed to validate the Gatekeeping Attitude Index, a 14-item Likert scaling index. The authors collected data from a convenience sample of social work field instructors (N = 188) with a response rate of 74.0%. Construct validation by exploratory factor analysis identified a 2-factor solution on the index after…

  8. 40 CFR 152.92 - Submission of a new valid study.

    Code of Federal Regulations, 2010 CFR

    2010-07-01

    ... Submitters' Rights § 152.92 Submission of a new valid study. An applicant may demonstrate compliance for a... study previously submitted to the Agency should not be resubmitted but should be cited in accordance...

  9. Using non-specialist observers in 4AFC human observer studies

    NASA Astrophysics Data System (ADS)

    Elangovan, Premkumar; Mackenzie, Alistair; Dance, David R.; Young, Kenneth C.; Wells, Kevin

    2017-03-01

    Virtual clinical trials (VCTs) are an emergent approach for rapid evaluation and comparison of various breast imaging technologies and techniques using computer-based modeling tools. Increasingly 4AFC (Four alternative forced choice) virtual clinical trials are used to compare detection performances of different breast imaging modalities. Most prior studies have used physicists and/or radiologists and physicists interchangeably. However, large scale use of statistically significant 4AFC observer studies is challenged by the individual time commitment and cost of such observers, often drawn from a limited local pool of specialists. This work aims to investigate whether non-specialist observers can be used to supplement such studies. A team of five specialist observers (medical physicists) and five non-specialists participated in a 4AFC study containing simulated 2D-mammography and DBT (digital breast tomosynthesis) images, produced using the OPTIMAM toolbox for VCTs. The images contained 4mm irregular solid masses and 4mm spherical targets at a range of contrast levels embedded in a realistic breast phantom background. There was no statistically significant difference between the detection performance of medical physicists and non-specialists (p>0.05). However, non-specialists took longer to complete the study than their physicist counterparts, which was statistically significant (p<0.05). Overall, the results from both observer groups indicate that DBT has a lower detectable threshold contrast than 2D-mammography for both masses and spheres, and both groups found spheres easier to detect than irregular solid masses.

  10. Self-Disclosure Between Friends: A Validity Study

    ERIC Educational Resources Information Center

    Panyard, Christine Marie

    1973-01-01

    Subjects reported that they had disclosed approximately the same amount of information as they had received. The consensual validation of the amount of personal information exchanged between friends suggested that the Self-Disclosure Questionnaire is a valid measure of self-disclosure to a specific target person. (Author)

  11. A Pilot Study of the Validity of Self-reported Ultraviolet Radiation Exposure and Sun Protection Practices Among Lifeguards, Parents and Children

    PubMed Central

    O’Riordan, David L.; Glanz, Karen; Gies, Peter; Elliott, Tom

    2013-01-01

    Outdoor recreation settings, such as swimming pools, provide a promising venue to assess UVR exposure and sun protection practices among individuals who are minimally clothed and exposed to potentially high levels of UVR. Most studies assessing sun exposure/protection practices rely on self-reported data, which are subject to bias. The aim of this study was to establish the feasibility of conducting a multimethod study to examine the validity of self-reported measures within a swimming pool setting. Data were collected from 27 lifeguards, children and parents in Hawaii. Each participant filled out a survey and a 4 day sun habits diary. On two occasions, researchers assessed observable sun protection behaviors (wearing hats, shirts, sunglasses), swabbed the skin to detect the presence of sunscreen, and subjects wore polysulphone dosimeters to measure UVR exposure. Overall, observed sun protection behaviors were more highly correlated with diary reports than with survey reports. While lifeguards and children reported spending comparable amounts of time in the sun, dosimeter measures showed that lifeguards received twice as much UVR exposure. This study demonstrated the feasibility of implementing a multimethod validity study within a broader population of swimming pools. PMID:18179624

  12. Development and validation of self-reported line drawings for assessment of knee malalignment and foot rotation: a cross-sectional comparative study

    PubMed Central

    2010-01-01

    Background For large scale epidemiological studies clinical assessments and radiographs can be impractical and expensive to apply to more than just a sample of the population examined. The study objectives were to develop and validate two novel instruments for self-reported knee malalignment and foot rotation suitable for use in questionnaire studies of knee pain and osteoarthritis. Methods Two sets of line drawings were developed using similar methodology. Each instrument consisted of an explanatory question followed by a set of drawings showing straight alignment, then two each at 7.5° angulation and 15° angulation in the varus/valgus (knee) and inward/outward (foot) directions. Forty one participants undertaking a community study completed the instruments on two occasions. Participants were assessed once by a blinded expert clinical observer with demonstrated excellent reproducibility. Validity was assessed by sensitivity, specificity and likelihood ratio (LR) using the observer as the reference standard. Reliability was assessed using weighted kappa (κ). Knee malalignment was measured on 400 knee radiographs. General linear model was used to assess for the presence of a linear increase in knee alignment angle (measured medially) from self-reported severe varus to mild varus, straight, mild valgus and severe valgus deformity. Results Observer reproducibility (κ) was 0.89 and 0.81 for the knee malalignment and foot rotation instruments respectively. Self-reported participant reproducibility was also good for the knee (κ 0.73) and foot (κ 0.87) instruments. Validity was excellent for the knee malalignment instrument, with a sensitivity of 0.74 (95%CI 0.54, 0.93) and specificity of 0.97 (95%CI 0.94, 1.00). Similarly the foot rotation instrument was also found to have high sensitivity (0.92, 95%CI 0.83, 1.01) and specificity (0.96, 95%CI 0.93, 1.00). The knee alignment angle increased progressively from self reported severe varus to mild varus, straight, mild

  13. Applicability of Monte Carlo cross validation technique for model development and validation using generalised least squares regression

    NASA Astrophysics Data System (ADS)

    Haddad, Khaled; Rahman, Ataur; A Zaman, Mohammad; Shrestha, Surendra

    2013-03-01

    Summary: In regional hydrologic regression analysis, model selection and validation are regarded as important steps. Here, the model selection is usually based on some measurements of goodness-of-fit between the model prediction and observed data. In Regional Flood Frequency Analysis (RFFA), leave-one-out (LOO) validation or a fixed percentage leave out validation (e.g., 10%) is commonly adopted to assess the predictive ability of regression-based prediction equations. This paper develops a Monte Carlo Cross Validation (MCCV) technique (which has widely been adopted in Chemometrics and Econometrics) in RFFA using Generalised Least Squares Regression (GLSR) and compares it with the most commonly adopted LOO validation approach. The study uses simulated and regional flood data from the state of New South Wales in Australia. It is found that when developing hydrologic regression models, application of the MCCV is likely to result in a more parsimonious model than the LOO. It has also been found that the MCCV can provide a more realistic estimate of a model's predictive ability when compared with the LOO.
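
    The contrast between LOO and MCCV described in this abstract can be illustrated with a minimal sketch. It is not the authors' procedure: ordinary least squares on one predictor stands in for generalised least squares, the data are synthetic, and the function names, the 30% test fraction, and the 200 random splits are all assumptions chosen for illustration.

```python
import random


def fit_line(xs, ys):
    """Ordinary least squares fit of y = a + b*x (an assumed stand-in for GLSR)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return mean_y - slope * mean_x, slope


def mse(model, xs, ys):
    """Mean squared prediction error of a fitted (intercept, slope) pair."""
    intercept, slope = model
    return sum((y - (intercept + slope * x)) ** 2 for x, y in zip(xs, ys)) / len(xs)


def loo_cv(xs, ys):
    """Leave-one-out: each observation is held out exactly once."""
    errors = []
    for i in range(len(xs)):
        model = fit_line(xs[:i] + xs[i + 1:], ys[:i] + ys[i + 1:])
        errors.append(mse(model, [xs[i]], [ys[i]]))
    return sum(errors) / len(errors)


def mc_cv(xs, ys, n_splits=200, test_fraction=0.3, seed=0):
    """Monte Carlo CV: repeated random train/test splits, errors averaged."""
    rng = random.Random(seed)
    n = len(xs)
    n_test = max(1, round(test_fraction * n))
    errors = []
    for _ in range(n_splits):
        idx = list(range(n))
        rng.shuffle(idx)
        test, train = idx[:n_test], idx[n_test:]
        model = fit_line([xs[i] for i in train], [ys[i] for i in train])
        errors.append(mse(model, [xs[i] for i in test], [ys[i] for i in test]))
    return sum(errors) / len(errors)


# Synthetic data (hypothetical): y = 2 + 0.5*x plus unit-variance noise.
data_rng = random.Random(42)
xs = [float(i) for i in range(30)]
ys = [2.0 + 0.5 * x + data_rng.gauss(0, 1) for x in xs]
print("LOO error: ", loo_cv(xs, ys))
print("MCCV error:", mc_cv(xs, ys))
```

    Because MCCV holds out a larger share of the data on each split, each model is fitted on fewer observations than under LOO, which is one reason the two estimates of predictive error can differ.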

  14. A Validity Study of the Self-Esteem Inventory.

    ERIC Educational Resources Information Center

    Landis, H. John

    Results of this validation study of a slightly modified version of the Coopersmith Self-Esteem Inventory substantiate its use with seventh graders to assess Goal I (concerning self-understanding and appreciation of self-worth) of the Educational Quality Assessment Program in Pennsylvania. Appendixes include the definition and rationale for Goal I,…

  15. Stratospheric Assimilation of Chemical Tracer Observations Using a Kalman Filter. Pt. 2; Chi-Square Validated Results and Analysis of Variance and Correlation Dynamics

    NASA Technical Reports Server (NTRS)

    Menard, Richard; Chang, Lang-Ping

    1998-01-01

    A Kalman filter system designed for the assimilation of limb-sounding observations of stratospheric chemical tracers, which has four tunable covariance parameters, was developed in Part I (Menard et al. 1998). The assimilation results of CH4 observations from the Cryogenic Limb Array Etalon Sounder instrument (CLAES) and the Halogen Observation Experiment instrument (HALOE) on board of the Upper Atmosphere Research Satellite are described in this paper. A robust χ² criterion, which provides a statistical validation of the forecast and observational error covariances, was used to estimate the tunable variance parameters of the system. In particular, an estimate of the model error variance was obtained. The effect of model error on the forecast error variance became critical after only three days of assimilation of CLAES observations, although it took 14 days of forecast to double the initial error variance. We further found that the model error due to numerical discretization as arising in the standard Kalman filter algorithm, is comparable in size to the physical model error due to wind and transport modeling errors together. Separate assimilations of CLAES and HALOE observations were compared to validate the state estimate away from the observed locations. A wave-breaking event that took place several thousands of kilometers away from the HALOE observation locations was well captured by the Kalman filter due to highly anisotropic forecast error correlations. The forecast error correlation in the assimilation of the CLAES observations was found to have a structure similar to that in pure forecast mode except for smaller length scales. Finally, we have conducted an analysis of the variance and correlation dynamics to determine their relative importance in chemical tracer assimilation problems. Results show that the optimality of a tracer assimilation system depends, for the most part, on having flow-dependent error correlation rather than on evolving the

  16. 41 CFR 60-3.7 - Use of other validity studies.

    Code of Federal Regulations, 2014 CFR

    2014-07-01

    ... study was conducted perform substantially the same major work behaviors, as shown by appropriate job analyses both on the job or group of jobs on which the validity study was performed and on the job for...

  17. 41 CFR 60-3.7 - Use of other validity studies.

    Code of Federal Regulations, 2013 CFR

    2013-07-01

    ... study was conducted perform substantially the same major work behaviors, as shown by appropriate job analyses both on the job or group of jobs on which the validity study was performed and on the job for...

  18. Use of Daily Phone Diary to study religiosity and mood: Convergent validity

    PubMed Central

    Szczesniak, Rhonda D.; Zou, Yuanshu; Dimitriou, Sophia M.; Quittner, Alexandra L.; Grossoehme, Daniel H.

    2017-01-01

    Studies of religious/spiritual behavior frequently rely on self-reported questionnaire data, which is susceptible to bias. The Daily Phone Diary (DPD) was developed to minimize bias in reporting activities and behavior across a 24-hour period. A cross-sectional study of 126 parents of children with cystic fibrosis was used to establish the validity of the DPD to study religious/spiritual behaviors. Longitudinal models were used to determine the odds of improved mood during religious/spiritual activities. Convergent validity was found. Participants had increased odds of improved mood during religious/spiritual activities compared to non-religious/spiritual activities. Associations with gender and religious affiliations were found. The DPD is a valid tool for studying religious/spiritual activities and opens novel avenues for chaplaincy research and the development of chaplaincy interventions incorporating these findings. PMID:27869567

  19. Cluster designs to assess the prevalence of acute malnutrition by lot quality assurance sampling: a validation study by computer simulation

    PubMed Central

    Olives, Casey; Pagano, Marcello; Deitchler, Megan; Hedt, Bethany L; Egge, Kari; Valadez, Joseph J

    2009-01-01

    Traditional lot quality assurance sampling (LQAS) methods require simple random sampling to guarantee valid results. However, cluster sampling has been proposed to reduce the number of random starting points. This study uses simulations to examine the classification error of two such designs, a 67×3 (67 clusters of three observations) and a 33×6 (33 clusters of six observations) sampling scheme to assess the prevalence of global acute malnutrition (GAM). Further, we explore the use of a 67×3 sequential sampling scheme for LQAS classification of GAM prevalence. Results indicate that, for independent clusters with moderate intracluster correlation for the GAM outcome, the three sampling designs maintain approximate validity for LQAS analysis. Sequential sampling can substantially reduce the average sample size that is required for data collection. The presence of intercluster correlation can impact dramatically the classification error that is associated with LQAS analysis. PMID:20011037
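
    The kind of simulation this abstract describes can be sketched as follows. This is a minimal illustration, not the authors' code: cluster-level prevalences are assumed to follow a beta distribution parameterised by an intracluster correlation, clusters are assumed independent, and the decision rule of 30 cases and the 10% and 20% prevalence thresholds are hypothetical values picked only to make the example concrete.

```python
import random


def simulate_cases(prevalence, n_clusters, cluster_size, icc, rng):
    """Count cases in one cluster sample.

    Each cluster gets its own prevalence drawn from a beta distribution whose
    mean is `prevalence` and whose spread is set by the intracluster
    correlation `icc` (an assumed beta-binomial model).
    """
    if icc <= 0:
        cluster_prevalences = [prevalence] * n_clusters
    else:
        scale = (1 - icc) / icc
        alpha = prevalence * scale
        beta = (1 - prevalence) * scale
        cluster_prevalences = [rng.betavariate(alpha, beta) for _ in range(n_clusters)]
    cases = 0
    for p in cluster_prevalences:
        for _ in range(cluster_size):
            if rng.random() < p:
                cases += 1
    return cases


def prob_classified_high(prevalence, n_clusters, cluster_size, icc,
                         decision_rule, n_sims=1000, seed=1):
    """Fraction of simulated surveys with more cases than the decision rule."""
    rng = random.Random(seed)
    high = 0
    for _ in range(n_sims):
        if simulate_cases(prevalence, n_clusters, cluster_size, icc, rng) > decision_rule:
            high += 1
    return high / n_sims


# Hypothetical setup: classify prevalence as "high" when more than 30 cases are found.
for n_clusters, cluster_size in [(67, 3), (33, 6)]:
    false_high = prob_classified_high(0.10, n_clusters, cluster_size, 0.1, 30)
    false_low = 1 - prob_classified_high(0.20, n_clusters, cluster_size, 0.1, 30)
    print(f"{n_clusters}x{cluster_size}: P(high | 10%) = {false_high:.3f}, "
          f"P(low | 20%) = {false_low:.3f}")
```

    Under these assumptions, raising `icc` or using fewer, larger clusters widens the spread of the case count, which is the mechanism by which clustering can inflate classification error.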

  20. Cluster designs to assess the prevalence of acute malnutrition by lot quality assurance sampling: a validation study by computer simulation.

    PubMed

    Olives, Casey; Pagano, Marcello; Deitchler, Megan; Hedt, Bethany L; Egge, Kari; Valadez, Joseph J

    2009-04-01

    Traditional lot quality assurance sampling (LQAS) methods require simple random sampling to guarantee valid results. However, cluster sampling has been proposed to reduce the number of random starting points. This study uses simulations to examine the classification error of two such designs, a 67x3 (67 clusters of three observations) and a 33x6 (33 clusters of six observations) sampling scheme to assess the prevalence of global acute malnutrition (GAM). Further, we explore the use of a 67x3 sequential sampling scheme for LQAS classification of GAM prevalence. Results indicate that, for independent clusters with moderate intracluster correlation for the GAM outcome, the three sampling designs maintain approximate validity for LQAS analysis. Sequential sampling can substantially reduce the average sample size that is required for data collection. The presence of intercluster correlation can impact dramatically the classification error that is associated with LQAS analysis.

  1. Developing the Irrational Beliefs in Mathematics Scale (IBIMS): A Validity and Reliability Study

    ERIC Educational Resources Information Center

    Kaya, Deniz

    2017-01-01

    The purpose of this study is to develop a valid and reliable scale intended to determine the irrational beliefs of students in mathematics. The study was conducted with a study group consisting of 700 students in the 2015-2016 academic year. Expert opinions were received for the content and face validity of the scale, and the Exploratory Factor…

  2. The CPT Reading Comprehension Test: A Validity Study.

    ERIC Educational Resources Information Center

    Napoli, Anthony R.; Raymond, Lanette A.; Coffey, Cheryl A.; Bosco, Diane M.

    1998-01-01

    Describes a study done at Suffolk County Community College (New York) that assessed the validity of the College Board's Computerized Placement Test in Reading Comprehension (CPT-R) by comparing test results of 1,154 freshmen with the results of the Degree of Power Reading Test. Results confirmed the CPT-R's reliability in identifying basic…

  3. Reflective Thinking Scale: A Validity and Reliability Study

    ERIC Educational Resources Information Center

    Basol, Gulsah; Evin Gencel, Ilke

    2013-01-01

    The purpose of this study was to adapt the Reflective Thinking Scale to Turkish and investigate its validity and reliability in a sample of Turkish university students. The Reflective Thinking Scale (RTS) is a 5-point Likert scale (ranging from 1, corresponding to Agree Completely, through 3, Neutral, to 5, Not Agree Completely) intended to measure reflective…

  4. Brazilian Portuguese version of the Revised Fibromyalgia Impact Questionnaire (FIQR-Br): cross-cultural validation, reliability, and construct and structural validation.

    PubMed

    Lupi, Jaqueline Basilio; Carvalho de Abreu, Daniela Cristina; Ferreira, Mariana Candido; Oliveira, Renê Donizeti Ribeiro de; Chaves, Thais Cristina

    2017-08-01

    This study aimed to culturally adapt and validate the Revised Fibromyalgia Impact Questionnaire (FIQR) to Brazilian Portuguese, by the use of analysis of internal consistency, reliability, and construct and structural validity. A total of 100 female patients with fibromyalgia participated in the validation process of the Brazilian Portuguese version of the FIQR (FIQR-Br). The intraclass correlation coefficient (ICC) was used for statistical analysis of reliability (test-retest), Cronbach's alpha for internal consistency, Pearson's rank correlation for construct validity, and confirmatory factor analysis (CFA) for structural validity. Excellent levels of reliability were verified, with ICC greater than 0.75 for all questions and domains of the FIQR-Br. For internal consistency, alpha values greater than 0.70 for the items and domains of the questionnaire were observed. Moderate (0.40 < r < 0.70) and strong (r > 0.70) correlations were observed for the scores of domains and total score between the FIQR-Br and FIQ-Br. The structure of the three domains of the FIQR-Br was confirmed by CFA. The results of this study suggest that the FIQR-Br is a reliable and valid instrument for assessing fibromyalgia-related impact, and support its use in clinical settings and research. The structure of the three domains of the FIQR-Br was also confirmed. Implications for Rehabilitation Fibromyalgia is a chronic musculoskeletal disorder characterized by widespread and diffuse pain, fatigue, sleep disturbances, and depression. The disease significantly impairs patients' quality of life and can be highly disabling. To be used in multicenter research efforts, the Revised Fibromyalgia Impact Questionnaire (FIQR) must be cross-culturally validated and psychometrically tested. This paper will make available a new version of the FIQR-Br since another version already exists, but there are concerns about its measurement properties. The availability of an instrument adapted to

  5. A Validation Study of the Impression Replica Technique.

    PubMed

    Segerström, Sofia; Wiking-Lima de Faria, Johanna; Braian, Michael; Ameri, Arman; Ahlgren, Camilla

    2018-04-17

    To validate the well-known and often-used impression replica technique for measuring fit between a preparation and a crown in vitro. The validation consisted of three steps. First, a measuring instrument was validated to elucidate its accuracy. Second, a specimen consisting of male and female counterparts was created and validated by the measuring instrument. Calculations were made for the exact values of three gaps between the male and female. Finally, impression replicas were produced of the specimen gaps and sectioned into four pieces. The replicas were then measured with the use of a light microscope. The values received from measuring the specimen were then compared with the values received from the impression replicas, and the technique was thereby validated. The impression replica technique overvalued all measured gaps. Depending on location of the three measuring sites, the difference between the specimen and the impression replicas varied from 47 to 130 μm. The impression replica technique overestimates gaps within the range of 2% to 11%. The validation of the replica technique enables the method to be used as a reference when testing other methods for evaluating fit in dentistry. © 2018 by the American College of Prosthodontists.

  6. Predictability analysis and validation of a low-dimensional model - an application to the dynamics of cereal crops observed from satellite

    NASA Astrophysics Data System (ADS)

    Mangiarotti, Sylvain; Drapeau, Laurent

    2013-04-01

    The global modeling approach aims to obtain parsimonious models of observed dynamics from few or single time series (Letellier et al. 2009). Specific algorithms were developed and validated for this purpose (Mangiarotti et al. 2012a). This approach was applied to the dynamics of cereal crops in semi-arid region using the vegetation index derived from satellite data as a proxy of the dynamics. A low-dimensional autonomous model could be obtained. The corresponding attractor is characteristic of weakly dissipative chaos and exhibits a toroidal-like structure. At present, only few theoretical cases of such chaos are known, and none was obtained from real world observations. Under smooth conditions, a robust validation of three-dimensional chaotic models can be usually performed based on the topological approach (Gilmore 1998). Such approach becomes more difficult for weakly dissipative systems, and almost impossible under noisy observational conditions. For this reason, another validation approach is developed which consists in comparing the forecasting skill of the model to other forecasts for which no dynamical model is required. A data assimilation process is associated to the model to estimate the model's skill; several schemes are tested (simple re-initialization, Extended and Ensemble Kalman Filters and Back and Forth Nudging). Forecasts without model are performed based on the search of analogous states in the phase space (Mangiarotti et al. 2012b). The comparison reveals the quality of the model's forecasts at short to moderate horizons and contributes to validate the model. These results suggest that the dynamics of cereal crops can be reasonably approximated by low-dimensional chaotic models, and also bring out powerful arguments for chaos. Chaotic models have often been used as benchmark to test data assimilation schemes; the present work shows that such tests may not only have a theoretical interest, but also almost direct applicative potential. 
Moreover
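
    The model-free forecasts mentioned in the record above rely on searching for analogous states in phase space. The following is only a minimal sketch of that general idea, not the authors' algorithm: it assumes a scalar time series, a simple delay embedding, and a single nearest analog. The function names and the `dim` and `horizon` parameters are hypothetical.

```python
import math


def delay_embed(series, dim):
    """Return delay vectors (x[i], ..., x[i+dim-1]) built from a scalar series."""
    return [tuple(series[i:i + dim]) for i in range(len(series) - dim + 1)]


def analog_forecast(series, dim=3, horizon=1):
    """Forecast `horizon` steps ahead using the closest past analog of the current state.

    Assumption: the future of the nearest past state is used as the forecast.
    """
    states = delay_embed(series, dim)
    current = states[-1]
    best_index, best_dist = None, math.inf
    # Exclude the current state itself and any state whose future is not yet observed.
    for i, state in enumerate(states[:-1]):
        last = i + dim - 1  # index of the most recent sample in this state
        if last + horizon >= len(series):
            continue
        d = math.dist(state, current)
        if d < best_dist:
            best_index, best_dist = i, d
    if best_index is None:
        raise ValueError("series too short for this dim/horizon")
    return series[best_index + dim - 1 + horizon]
```

    On a strictly periodic series the nearest analog is an exact earlier copy of the current state, so the forecast reproduces the cycle; on noisy data a practical scheme would typically average over several analogs.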

  7. Validity and relative validity of a novel digital approach for 24-h dietary recall in athletes

    PubMed Central

    2014-01-01

    Background We developed a digital dietary analysis tool for athletes (DATA) using a modified 24-h recall method and an integrated, customized nutrient database. The purpose of this study was to assess DATA’s validity and relative validity by measuring its agreement with registered dietitians’ (RDs) direct observations (OBSERVATION) and 24-h dietary recall interviews using the USDA 5-step multiple-pass method (INTERVIEW), respectively. Methods Fifty-six athletes (14–20 y) completed DATA and INTERVIEW in randomized counter-balanced order. OBSERVATION (n = 26) consisted of RDs recording participants’ food/drink intake in a 24-h period and was completed the day prior to DATA and INTERVIEW. Agreement among methods was estimated using a repeated measures t-test and Bland-Altman analysis. Results The paired differences (with 95% confidence intervals) between DATA and OBSERVATION were not significant for carbohydrate (10.1%, -1.2–22.7%) and protein (14.1%, -3.2–34.5%) but were significant for energy (14.4%, 1.2–29.3%). There were no differences between DATA and INTERVIEW for energy (-1.1%, -9.1–7.7%), carbohydrate (0.2%, -7.1–8.0%) or protein (-2.7%, -11.3–6.7%). Bland-Altman analysis indicated significant positive correlations between absolute values of the differences and the means for OBSERVATION vs. DATA (r = 0.40 and r = 0.47 for energy and carbohydrate, respectively) and INTERVIEW vs. DATA (r = 0.52, r = 0.29, and r = 0.61 for energy, carbohydrate, and protein, respectively). There were also wide 95% limits of agreement (LOA) for most method comparisons. The mean bias ratio (with 95% LOA) for OBSERVATION vs. DATA was 0.874 (0.551-1.385) for energy, 0.906 (0.522-1.575) for carbohydrate, and 0.895 (0.395-2.031) for protein. The mean bias ratio (with 95% LOA) for INTERVIEW vs. DATA was 1.016 (0.538-1.919) for energy, 0.995 (0.563-1.757) for carbohydrate, and 1.031 (0.514-2.068) for protein. Conclusion DATA has good relative
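
    The mean bias ratios and 95% limits of agreement quoted in the record above are the usual outputs of a Bland-Altman analysis on log-transformed data. A minimal sketch of that calculation follows. It assumes paired positive measurements and a ratio of method A over method B; the record does not state which method is the numerator, and the function name and inputs are hypothetical.

```python
import math
import statistics


def ratio_limits_of_agreement(method_a, method_b, z=1.96):
    """Return (mean bias ratio, lower LOA, upper LOA) for paired positive measurements.

    Works on log(a / b), then back-transforms so the results are ratios.
    """
    logs = [math.log(a / b) for a, b in zip(method_a, method_b)]
    mean_log = statistics.mean(logs)
    sd_log = statistics.stdev(logs)  # sample standard deviation of the log ratios
    bias = math.exp(mean_log)
    lower = math.exp(mean_log - z * sd_log)
    upper = math.exp(mean_log + z * sd_log)
    return bias, lower, upper
```

    A bias ratio near 1 with wide limits, as reported for most comparisons in the record, would indicate good agreement on average but poor agreement for individual participants.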

  8. Apparent and internal validity of a Monte Carlo-Markov model for cardiovascular disease in a cohort follow-up study.

    PubMed

    Nijhuis, Rogier L; Stijnen, Theo; Peeters, Anna; Witteman, Jacqueline C M; Hofman, Albert; Hunink, M G Myriam

    2006-01-01

    To determine the apparent and internal validity of the Rotterdam Ischemic heart disease & Stroke Computer (RISC) model, a Monte Carlo-Markov model, designed to evaluate the impact of cardiovascular disease (CVD) risk factors and their modification on life expectancy (LE) and cardiovascular disease-free LE (DFLE) in a general population (hereinafter, these will be referred to together as (DF)LE). The model is based on data from the Rotterdam Study, a cohort follow-up study of 6871 subjects aged 55 years and older who visited the research center for risk factor assessment at baseline (1990-1993) and completed a follow-up visit 7 years later (original cohort). The transition probabilities and risk factor trends used in the RISC model were based on data from 3501 subjects (the study cohort). To validate the RISC model, the number of simulated CVD events during 7 years' follow-up were compared with the observed number of events in the study cohort and the original cohort, respectively, and simulated (DF)LEs were compared with the (DF)LEs calculated from multistate life tables. Both in the study cohort and in the original cohort, the simulated distribution of CVD events was consistent with the observed number of events (CVD deaths: 7.1% v. 6.6% and 7.4% v. 7.6%, respectively; non-CVD deaths: 11.2% v. 11.5% and 12.9% v. 13.0%, respectively). The distribution of (DF)LEs estimated with the RISC model consistently encompassed the (DF)LEs calculated with multistate life tables. The simulated events and (DF)LE estimates from the RISC model are consistent with observed data from a cohort follow-up study.
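
    The record above validates a Monte Carlo-Markov model by comparing simulated event counts with observed ones. The sketch below illustrates only the general technique of individual-level Markov simulation; it is not the RISC model. The three states, the constant annual transition probabilities, and all names are hypothetical assumptions.

```python
import random


def simulate_cohort(n_subjects, years, p_cvd_death, p_other_death, seed=0):
    """Simulate each subject through yearly cycles; return the proportion ending in each state."""
    rng = random.Random(seed)
    counts = {"alive": 0, "cvd_death": 0, "other_death": 0}
    for _ in range(n_subjects):
        state = "alive"
        for _ in range(years):
            u = rng.random()
            if u < p_cvd_death:
                state = "cvd_death"
                break
            if u < p_cvd_death + p_other_death:
                state = "other_death"
                break
        counts[state] += 1
    return {name: count / n_subjects for name, count in counts.items()}


def expected_cvd_death(years, p_cvd_death, p_other_death):
    """Analytic cumulative CVD death probability under the same constant-probability chain."""
    stay_alive = 1.0 - p_cvd_death - p_other_death
    return sum(p_cvd_death * stay_alive ** t for t in range(years))
```

    Comparing the simulated proportion with the analytic value plays the role of an internal check here; the study itself compares simulated events with observed cohort events and with multistate life tables.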

  9. Toward Supersonic Retropropulsion CFD Validation

    NASA Technical Reports Server (NTRS)

    Kleb, Bil; Schauerhamer, D. Guy; Trumble, Kerry; Sozer, Emre; Barnhardt, Michael; Carlson, Jan-Renee; Edquist, Karl

    2011-01-01

    This paper begins the process of verifying and validating computational fluid dynamics (CFD) codes for supersonic retropropulsive flows. Four CFD codes (DPLR, FUN3D, OVERFLOW, and US3D) are used to perform various numerical and physical modeling studies toward the goal of comparing predictions with a wind tunnel experiment specifically designed to support CFD validation. Numerical studies run the gamut in rigor from code-to-code comparisons to observed order-of-accuracy tests. Results indicate that, for this complex flowfield involving time-dependent shocks and vortex shedding, design order of accuracy is not clearly evident. Also explored is the extent of physical modeling necessary to predict the salient flowfield features found in high-speed Schlieren images and surface pressure measurements taken during the validation experiment. Physical modeling studies include geometric items such as wind tunnel wall and sting mount interference, as well as turbulence modeling that ranges from a RANS (Reynolds-Averaged Navier-Stokes) 2-equation model to DES (Detached Eddy Simulation) models. These studies indicate that tunnel wall interference is minimal for the cases investigated; model mounting hardware effects are confined to the aft end of the model; and sparse grid resolution and turbulence modeling can damp or entirely dissipate the unsteadiness of this self-excited flow.

  10. [Validation study of the Depressive Experience Questionnaire].

    PubMed

    Atger, F; Frasson, G; Loas, G; Guibourgé, S; Corcos, M; Perez Diaz, F; Speranza, M; Venisse, J-L; Lang, F; Stephan, Ph; Bizouard, P; Flament, M; Jeammet, Ph

    2003-01-01

    sample (500 female and 160 male undergraduates). Principal component analysis within sex performed on the answers to DEQ confirmed his assumption in identifying two principal depressive dimensions. The first factor involved items that are primarily externally directed and refer to a disturbance of interpersonal relationships (anaclitism); the second factor consists of items that are more internally directed and reflect concerns about self-identity (self-criticism). A third factor emerged, assessing the good functioning of subject and confidence in his resources and capacities (efficacy). Scales derived from these factors have high internal consistency and substantial test-retest reliability. The solutions for men and women were highly congruent. Factor structure has been replicated in several nonclinical and clinical samples, supporting considerable evidence to the construct validity of the DEQ Dependency and Self-criticism scales. An adolescent form of DEQ (DEQ-A) has successively been developed. Factor analysis revealed three factors that were highly congruent in female and male students and with the three factors of the original DEQ. The reliability, internal consistency and validity of DEQ-A indicate that the DEQ-A closely parallels the DEQ, especially in the articulation of Dependency and Self-criticism as two factors in depression. These formulations and clinical observations about the importance of differentiating a depression focused on issues of self-criticism from issues of dependency are consistent with the formulations of others theorists which, from very different theoretical perspectives, posit 2 types of depression, one in which either perceived loss or rejection in social relationships is central and the other in which perceived failure in achievement, guilt or lack of control serves as the precipitant of depression. 
These 2 types of experiences have been characterized as dominant other and dominant goal, as anxiously attached and compulsively self

  11. Case Study Observational Research: A Framework for Conducting Case Study Research Where Observation Data Are the Focus.

    PubMed

    Morgan, Sonya J; Pullon, Susan R H; Macdonald, Lindsay M; McKinlay, Eileen M; Gray, Ben V

    2017-06-01

    Case study research is a comprehensive method that incorporates multiple sources of data to provide detailed accounts of complex research phenomena in real-life contexts. However, current models of case study research do not particularly distinguish the unique contribution observation data can make. Observation methods have the potential to reach beyond other methods that rely largely or solely on self-report. This article describes the distinctive characteristics of case study observational research, a modified form of Yin's 2014 model of case study research the authors used in a study exploring interprofessional collaboration in primary care. In this approach, observation data are positioned as the central component of the research design. Case study observational research offers a promising approach for researchers in a wide range of health care settings seeking more complete understandings of complex topics, where contextual influences are of primary concern. Future research is needed to refine and evaluate the approach.

  12. An Italian multicentre validation study of the coma recovery scale-revised.

    PubMed

    Estraneo, A; Moretta, P; De Tanti, A; Gatta, G; Giacino, J T; Trojano, L

    2015-10-01

    Rate of misdiagnosis of disorders of consciousness (DoC) can be reduced by employing validated clinical diagnostic tools, such as the Coma Recovery Scale-Revised (CRS-R). An Italian version of the CRS-R has been recently developed, but its applicability across different clinical settings, and its concurrent validity and diagnostic sensitivity have not been estimated yet. To perform a multicentre validation study of the Italian version of the Coma Recovery Scale-Revised (CRS-R). Analysis of inter-rater reliability, concurrent validity and diagnostic sensitivity of the scale. One Intensive Care Unit, 8 Post-acute rehabilitation centres and 2 Long-term facilities. Twenty-seven professionals (physicians, N.=11; psychologists, N.=5; physiotherapists, N.=3; speech therapists, N.=6; nurses, N.=2) from 11 Italian Centres. CRS-R and Disability Rating Scale (DRS) applied to 122 patients with clinical diagnosis of Vegetative State (VS) or Minimally Conscious State (MCS). CRS-R has good-to-excellent inter-rater reliability for all subscales, particularly for the communication subscale. The Italian version of the CRS-R showed a high sensitivity and specificity in detecting MCS with reference to clinical consensus diagnosis. The CRS-R showed good concurrent validity with the Disability Rating Scale, which had very low specificity with reference to clinical consensus diagnosis. The Italian version of the CRS-R is a valid scale for use from the sub-acute to chronic stages of DoC. It can be administered reliably by all members of the rehabilitation team with different specialties, levels of experience and settings. The present study promotes use of the Italian version of the CRS-R to improve diagnosis of DoC patients, and plan tailored rehabilitation treatment.

  13. A model of scientific attitudes assessment by observation in physics learning based scientific approach: case study of dynamic fluid topic in high school

    NASA Astrophysics Data System (ADS)

    Yusliana Ekawati, Elvin

    2017-01-01

    This study aimed to produce a model of scientific attitude assessment by observation for physics learning based on the scientific approach (case study of the dynamic fluid topic in high school). Instrument development in this study adapted the Plomp model; the procedure includes initial investigation, design, construction, testing, evaluation and revision. The test was conducted in Surakarta, and the data obtained were analyzed using Aiken's formula to determine the content validity of the instrument, Cronbach’s alpha to determine the reliability of the instrument, and confirmatory factor analysis with the LISREL 8.50 program to assess construct validity. The results of this research were conceptual models, instruments and guidelines on scientific attitudes assessment by observation. The construct assessment instruments include components of curiosity, objectivity, suspended judgment, open-mindedness, honesty and perseverance. The instruments met the construct validity criterion (factor loadings > 0.3). The reliability of the model is quite good, with an alpha value of 0.899 (> 0.7). The test showed that the theoretical model is supported by the empirical data, namely p-value 0.315 (≥ 0.05), RMSEA 0.027 (≤ 0.08)
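
    The reliability figure in the record above is a Cronbach's alpha. A minimal sketch of the standard formula, alpha = k/(k-1) * (1 - sum of item variances / variance of total scores), is given below. The layout of `scores` (one row per respondent, one column per item) and the function name are assumptions for illustration, not taken from the study.

```python
import statistics


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondent rows, each a list of item scores."""
    k = len(scores[0])  # number of items
    item_variances = [statistics.variance(column) for column in zip(*scores)]
    total_variance = statistics.variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - sum(item_variances) / total_variance)
```

    When every item ranks respondents identically, the total-score variance is dominated by the shared component and alpha approaches 1; values above about 0.7 are the conventional threshold the record refers to.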

  14. Development of a quality assessment tool for systematic reviews of observational studies (QATSO) of HIV prevalence in men having sex with men and associated risk behaviours

    PubMed Central

    Wong, William CW; Cheung, Catherine SK; Hart, Graham J

    2008-01-01

    Background Systematic reviews based on the critical appraisal of observational and analytic studies on HIV prevalence and risk factors for HIV transmission among men having sex with men are very useful for health care decisions and planning. Such appraisal is particularly difficult, however, as the quality assessment tools available for use with observational and analytic studies are poorly established. Methods We reviewed the existing quality assessment tools for systematic reviews of observational studies and developed a concise quality assessment checklist to help standardise decisions regarding the quality of studies, with careful consideration of issues such as external and internal validity. Results A pilot version of the checklist was developed based on epidemiological principles, reviews of study designs, and existing checklists for the assessment of observational studies. The Quality Assessment Tool for Systematic Reviews of Observational Studies (QATSO) Score consists of five items: External validity (1 item), reporting (2 items), bias (1 item) and confounding factors (1 item). Expert opinions were sought and it was tested on manuscripts that fulfil the inclusion criteria of a systematic review. Like all assessment scales, QATSO may oversimplify and generalise information yet it is inclusive, simple and practical to use, and allows comparability between papers. Conclusion A specific tool that allows researchers to appraise and guide study quality of observational studies is developed and can be modified for similar studies in the future. PMID:19014686

  15. The Potential Utility of Urinary Biomarkers for Risk Prediction in Combat Casualties: A Prospective Observational Cohort Study

    DTIC Science & Technology

    2015-06-16

    are associated with poor outcomes, including death and the need for renal replacement therapy. Methods : We conducted a prospective, observational study...penalty for failing to comply with a collection of information if it does not display a currently valid OMB control number. 1. REPORT DATE 16 JUN 2015...2. REPORT TYPE N/A 3. DATES COVERED - 4. TITLE AND SUBTITLE The Potential Utility of Urinary Biomarkers for Risk Prediction in Combat

  16. A Validation Study of the Adolescent Dissociative Experiences Scale

    ERIC Educational Resources Information Center

    Keck Seeley, Susan. M.; Perosa, Sandra, L.; Perosa, Linda, M.

    2004-01-01

    Objective: The purpose of this study was to further the validation process of the Adolescent Dissociative Experiences Scale (A-DES). In this study, a 6-item Likert response format with descriptors was used when responding to the A-DES rather than the 11-item response format used in the original A-DES. Method: The internal reliability and construct…

  17. Basic School Skills Inventory-3: Validity and Reliability Study

    ERIC Educational Resources Information Center

    Yildiz, F. Ülkü; Çagdas, Aysel; Kayili, Gökhan

    2017-01-01

    The purpose of this study is to perform the validity-reliability analysis of the three subtests of Basic School Skills Inventory 3--Mathematics, Classroom Behavior and Daily Life skills--and do its adaptation for four to six year-old Turkish children. The sample of the study included 595 four to six year-old Turkish children attending public and…

  18. The Spanish version of the Emotional Labour Scale (ELS): a validation study.

    PubMed

    Picardo, Juan M; López-Fernández, Consuelo; Hervás, María José Abellán

    2013-10-01

    To validate the Spanish version of the Emotional Labour Scale (ELS), an instrument widely used to understand how professionals working with people face emotional labor in their daily job. An observational, cross-sectional and multicenter survey was used. Nursing students and their clinical tutors (n=211) completed the self-reported ELS when the clinical practice period was over. First order and second order Confirmatory Factor Analyses (CFA) were estimated in order to test the factor structure of the scale. The results of the CFA confirm a factor structure of the scale with six first order factors (duration, frequency, intensity, variety, surface acting and deep acting) and two larger second order factors named Demands (duration, frequency, intensity and variety) and Acting (surface acting and deep acting) establishing the validity of the Spanish version of the ELS. Copyright © 2012 Elsevier Ltd. All rights reserved.

  19. 40 CFR 761.386 - Required experimental conditions for the validation study and subsequent use during decontamination.

    Code of Federal Regulations, 2014 CFR

    2014-07-01

    ... the validation study and subsequent use during decontamination. 761.386 Section 761.386 Protection of... experimental conditions for the validation study and subsequent use during decontamination. The following experimental conditions apply for any solvent: (a) Temperature and pressure. Conduct the validation study and...

  20. 40 CFR 761.386 - Required experimental conditions for the validation study and subsequent use during decontamination.

    Code of Federal Regulations, 2012 CFR

    2012-07-01

    ... the validation study and subsequent use during decontamination. 761.386 Section 761.386 Protection of... experimental conditions for the validation study and subsequent use during decontamination. The following experimental conditions apply for any solvent: (a) Temperature and pressure. Conduct the validation study and...

  1. 40 CFR 761.386 - Required experimental conditions for the validation study and subsequent use during decontamination.

    Code of Federal Regulations, 2013 CFR

    2013-07-01

    ... the validation study and subsequent use during decontamination. 761.386 Section 761.386 Protection of... experimental conditions for the validation study and subsequent use during decontamination. The following experimental conditions apply for any solvent: (a) Temperature and pressure. Conduct the validation study and...

  2. Risk of bias and confounding of observational studies of Zika virus infection: A scoping review of research protocols

    PubMed Central

    Haby, Michelle M.; Martínez-Vega, Ruth; Pinzón-Flores, Carlos E.; Smith, Emma; Pinart, Mariona; Broutet, Nathalie; Becerra-Posada, Francisco; Aldighieri, Sylvain; Van Kerkhove, Maria D.

    2017-01-01

    Introduction Given the severity and impact of the current Zika virus (ZIKV) outbreak in the Americas, numerous countries have rushed to develop research studies to assess ZIKV and its potential health consequences. In an effort to ensure that studies are comprehensive, both internally and externally valid, and with reliable results, the World Health Organization, the Pan American Health Organization, Institut Pasteur, the networks of Fiocruz, the Consortia for the Standardization of Influenza Seroepidemiology (CONSISE) and the International Severe Acute Respiratory and Emerging Infection Consortium (ISARIC) have generated six standardized clinical and epidemiological research protocols and questionnaires to address key public health questions on ZIKV. Methods We conducted a systematic search of ongoing study protocols related to ZIKV research. We analyzed the content of protocols of 32 cohort studies and 13 case control studies for systematic bias that could produce erroneous results. Additionally we aimed to characterize the risks of bias and confounding in observational studies related to ZIKV and to propose ways to minimize them, including the use of six newly standardized research protocols. Results Observational studies of ZIKV face an array of challenges, including measurement of exposure and outcomes (microcephaly and Guillain-Barré Syndrome). Potential confounders need to be measured where known and controlled for in the analysis. Selection bias due to non-random selection is a significant issue, particularly in the case-control design, and losses to follow-up is equally important for the cohort design. Conclusion Observational research seeking to answer key questions on the ZIKV should consider these restrictions and take precautions to minimize bias in an effort to provide reliable and valid results. Utilization of the standardized research protocols developed by the WHO, PAHO, Institut Pasteur, and CONSISE will harmonize the key methodological aspects

  3. Risk of bias and confounding of observational studies of Zika virus infection: A scoping review of research protocols.

    PubMed

    Reveiz, Ludovic; Haby, Michelle M; Martínez-Vega, Ruth; Pinzón-Flores, Carlos E; Elias, Vanessa; Smith, Emma; Pinart, Mariona; Broutet, Nathalie; Becerra-Posada, Francisco; Aldighieri, Sylvain; Van Kerkhove, Maria D

    2017-01-01

    Given the severity and impact of the current Zika virus (ZIKV) outbreak in the Americas, numerous countries have rushed to develop research studies to assess ZIKV and its potential health consequences. In an effort to ensure that studies are comprehensive, both internally and externally valid, and with reliable results, the World Health Organization, the Pan American Health Organization, Institut Pasteur, the networks of Fiocruz, the Consortia for the Standardization of Influenza Seroepidemiology (CONSISE) and the International Severe Acute Respiratory and Emerging Infection Consortium (ISARIC) have generated six standardized clinical and epidemiological research protocols and questionnaires to address key public health questions on ZIKV. We conducted a systematic search of ongoing study protocols related to ZIKV research. We analyzed the content of protocols of 32 cohort studies and 13 case control studies for systematic bias that could produce erroneous results. Additionally we aimed to characterize the risks of bias and confounding in observational studies related to ZIKV and to propose ways to minimize them, including the use of six newly standardized research protocols. Observational studies of ZIKV face an array of challenges, including measurement of exposure and outcomes (microcephaly and Guillain-Barré Syndrome). Potential confounders need to be measured where known and controlled for in the analysis. Selection bias due to non-random selection is a significant issue, particularly in the case-control design, and losses to follow-up is equally important for the cohort design. Observational research seeking to answer key questions on the ZIKV should consider these restrictions and take precautions to minimize bias in an effort to provide reliable and valid results. 
Utilization of the standardized research protocols developed by the WHO, PAHO, Institut Pasteur, and CONSISE will harmonize the key methodological aspects of each study design to minimize bias at

  4. Impulsivity-hyperactivity and subtypes of aggression in early childhood: an observational and short-term longitudinal study.

    PubMed

    Ostrov, Jamie M; Godleski, Stephanie A

    2009-08-01

    This short-term longitudinal study (N = 112) was conducted to explore the concurrent and prospective associations between teacher-reported impulsive-hyperactive behavior and observed relational and physical aggression during early childhood (M = 45.54 months old, SD = 9.07). Multiple informants and methods including observational methods (i.e., 160 min per child) were used to assess aggression and impulsivity-hyperactivity. All measures were found to be valid and reliable. Prospective hierarchical regression analyses revealed that impulsivity-hyperactivity was associated with increases in observed physical aggression across time, controlling for initial relational aggression and gender. These findings add to the growing developmental psychopathology literature that suggests that distinguishing between subtypes of aggression during early childhood may be important for understanding the course of impulsivity-hyperactivity in young children. Implications for practice are discussed.

  5. Validation of intermediate end points in cancer research.

    PubMed

    Schatzkin, A; Freedman, L S; Schiffman, M H; Dawsey, S M

    1990-11-21

    Investigations using intermediate end points as cancer surrogates are quicker, smaller, and less expensive than studies that use malignancy as the end point. We present a strategy for determining whether a given biomarker is a valid intermediate end point between an exposure and incidence of cancer. Candidate intermediate end points may be selected from case series, ecologic studies, and animal experiments. Prospective cohort and sometimes case-control studies may be used to quantify the intermediate end point-cancer association. The most appropriate measure of this association is the attributable proportion. The intermediate end point is a valid cancer surrogate if the attributable proportion is close to 1.0, but not if it is close to 0. Usually, the attributable proportion is close to neither 1.0 nor 0; in this case, valid surrogacy requires that the intermediate end point mediate an established exposure-cancer relation. This would in turn imply that the exposure effect would vanish if adjusted for the intermediate end point. We discuss the relative advantages of intervention and observational studies for the validation of intermediate end points. This validation strategy also may be applied to intermediate end points for adverse reproductive outcomes and chronic diseases other than cancer.
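
    The record above proposes the attributable proportion as the measure of the association between an intermediate end point and cancer. One common formulation is Miettinen's case-based formula, AP = p_c * (RR - 1) / RR, where p_c is the proportion of cases positive for the marker and RR is the relative risk. Whether this matches the paper's exact notation is an assumption, and the function name and example values are hypothetical.

```python
def attributable_proportion(marker_prevalence_in_cases, relative_risk):
    """Case-based attributable proportion: p_c * (RR - 1) / RR (Miettinen's formula)."""
    if relative_risk <= 0:
        raise ValueError("relative_risk must be positive")
    return marker_prevalence_in_cases * (relative_risk - 1.0) / relative_risk


# Hypothetical example: 80% of cases carry the marker and the relative risk is 5.
example_ap = attributable_proportion(0.8, 5.0)
```

    Under this formula the value approaches 1.0 only when nearly all cases are marker-positive and the relative risk is very large, which is consistent with the record's remark that it is usually close to neither 1.0 nor 0.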

  6. Forest-Observation-System.net - towards a global in-situ data repository for biomass datasets validation

    NASA Astrophysics Data System (ADS)

    Shchepashchenko, D.; Chave, J.; Phillips, O. L.; Davies, S. J.; Lewis, S. L.; Perger, C.; Dresel, C.; Fritz, S.; Scipal, K.

    2017-12-01

    Forest monitoring is high on the scientific and political agenda. Global measurements of forest height, biomass and how they change with time are urgently needed as essential climate and ecosystem variables. The Forest Observation System - FOS (http://forest-observation-system.net/) is an international cooperation to establish a global in-situ forest biomass database to support earth observation and to encourage investment in relevant field-based observations and science. FOS aims to link the Remote Sensing (RS) community with ecologists who measure forest biomass and estimate biodiversity in the field for a common benefit. The benefit of FOS for the RS community is partnering with the most established teams and networks that manage permanent forest plots globally, to overcome data sharing issues and introduce a standard biomass data flow from tree level measurement to the plot level aggregation served in the most suitable form for the RS community. Ecologists benefit from the FOS with improved access to global biomass information, data standards, gap identification and potential improved funding opportunities to address the known gaps and deficiencies in the data. FOS collaborates closely with the Center for Tropical Forest Science -CTFS-ForestGEO, the ForestPlots.net (incl. RAINFOR, AfriTRON and T-FORCES), AusCover, Tropical managed Forests Observatory and the IIASA network. FOS is an open initiative with other networks and teams most welcome to join. The online database provides open access for both metadata (e.g. who conducted the measurements, where and which parameters) and actual data for a subset of plots where the authors have granted access. A minimum set of database values includes: principal investigator and institution, plot coordinates, number of trees, forest type and tree species composition, wood density, canopy height and above ground biomass of trees. Plot size is 0.25 ha or larger.
The database will be essential for validating and calibrating

  7. Prediction of liver disease in patients whose liver function tests have been checked in primary care: model development and validation using population-based observational cohorts.

    PubMed

    McLernon, David J; Donnan, Peter T; Sullivan, Frank M; Roderick, Paul; Rosenberg, William M; Ryder, Steve D; Dillon, John F

    2014-06-02

    To derive and validate a clinical prediction model to estimate the risk of liver disease diagnosis following liver function tests (LFTs) and to convert the model to a simplified scoring tool for use in primary care. Population-based observational cohort study of patients in Tayside Scotland identified as having their LFTs performed in primary care and followed for 2 years. Biochemistry data were linked to secondary care, prescriptions and mortality data to ascertain baseline characteristics of the derivation cohort. A separate validation cohort was obtained from 19 general practices across the rest of Scotland to externally validate the final model. Primary care, Tayside, Scotland. Derivation cohort: LFT results from 310 511 patients. After exclusions (including: patients under 16 years, patients having initial LFTs measured in secondary care, bilirubin >35 μmol/L, liver complications within 6 weeks and history of a liver condition), the derivation cohort contained 95 977 patients with no clinically apparent liver condition. Validation cohort: after exclusions, this cohort contained 11 653 patients. Diagnosis of a liver condition within 2 years. From the derivation cohort (n=95 977), 481 (0.5%) were diagnosed with a liver disease. The model showed good discrimination (C-statistic=0.78). Given the low prevalence of liver disease, the negative predictive values were high. Positive predictive values were low but rose to 20-30% for high-risk patients. This study successfully developed and validated a clinical prediction model and subsequent scoring tool, the Algorithm for Liver Function Investigations (ALFI), which can predict liver disease risk in patients with no clinically obvious liver disease who had their initial LFTs taken in primary care. ALFI can help general practitioners focus referral on a small subset of patients with higher predicted risk while continuing to address modifiable liver disease risk factors in those at lower risk. Published
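
    The record above notes that, at a prevalence of about 0.5%, negative predictive values were high and positive predictive values low. The sketch below shows why via Bayes' rule. Only the 0.5% prevalence comes from the record; the sensitivity and specificity values are hypothetical, as is the function name.

```python
def predictive_values(sensitivity, specificity, prevalence):
    """Return (PPV, NPV) for a binary test given its sensitivity, specificity and disease prevalence."""
    true_pos = sensitivity * prevalence
    false_pos = (1.0 - specificity) * (1.0 - prevalence)
    true_neg = specificity * (1.0 - prevalence)
    false_neg = (1.0 - sensitivity) * prevalence
    ppv = true_pos / (true_pos + false_pos)
    npv = true_neg / (true_neg + false_neg)
    return ppv, npv


# Prevalence 0.005 is from the record; sensitivity 0.70 and specificity 0.80 are assumed.
ppv, npv = predictive_values(sensitivity=0.70, specificity=0.80, prevalence=0.005)
```

    With so few true cases, false positives from the large disease-free group swamp the true positives, so the PPV stays low unless the score threshold is raised to isolate a small high-risk subset.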

  8. Multidimensional measures validated for home health needs of older persons: A systematic review.

    PubMed

    de Rossi Figueiredo, Daniela; Paes, Lucilene Gama; Warmling, Alessandra Martins; Erdmann, Alacoque Lorenzini; de Mello, Ana Lúcia Schaefer Ferreira

    2018-01-01

    To conduct a systematic review of the literature on valid and reliable multidimensional instruments to assess home health needs of older persons. Systematic review. Electronic databases, PubMed/Medline, Web of Science, Scopus, Cumulative Index to Nursing and Allied Health Literature, Scientific Electronic Library Online and the Latin American and Caribbean Health Sciences Information. All English, Portuguese and Spanish literature which included studies of reliability and validity of instruments that assessed at least two dimensions: physical, psychological, social support and functional independence, self-rated health behaviors and contextual environment and if such instruments proposed interventions after evaluation and/or monitoring changes over a period of time. Older persons aged 60 years or older. Of the 2397 studies identified, 32 were considered eligible. Two-thirds of the instruments proposed the physical, psychological, social support and functional independence dimensions. Inter-observer and intra-observer reliability and internal consistency values were 0.7 or above. More than two-thirds of the studies included validity (n=26) and more than one validity was tested in 15% (n=4) of these. Only 7% (n=2) proposed interventions after evaluation and/or monitoring changes over a period of time. Although the multidimensional assessment was performed, and the reliability values of the reviewed studies were satisfactory, different validity tests were not present in several studies. A gap at the instrument conception was observed related to interventions after evaluation and/or monitoring changes over a period of time. Further studies with this purpose are necessary for home health needs of the older persons. Copyright © 2017 Elsevier Ltd. All rights reserved.

  9. The impact of registration accuracy on imaging validation study design: A novel statistical power calculation.

    PubMed

    Gibson, Eli; Fenster, Aaron; Ward, Aaron D

    2013-10-01

    Novel imaging modalities are pushing the boundaries of what is possible in medical imaging, but their signal properties are not always well understood. The evaluation of these novel imaging modalities is critical to achieving their research and clinical potential. Image registration of novel modalities to accepted reference standard modalities is an important part of characterizing the modalities and elucidating the effect of underlying focal disease on the imaging signal. The strengths of the conclusions drawn from these analyses are limited by statistical power. Based on the observation that in this context, statistical power depends in part on uncertainty arising from registration error, we derive a power calculation formula relating registration error, number of subjects, and the minimum detectable difference between normal and pathologic regions on imaging, for an imaging validation study design that accommodates signal correlations within image regions. Monte Carlo simulations were used to evaluate the derived models and test the strength of their assumptions, showing that the model yielded predictions of the power, the number of subjects, and the minimum detectable difference of simulated experiments accurate to within a maximum error of 1% when the assumptions of the derivation were met, and characterizing sensitivities of the model to violations of the assumptions. The use of these formulae is illustrated through a calculation of the number of subjects required for a case study, modeled closely after a prostate cancer imaging validation study currently taking place at our institution. The power calculation formulae address three central questions in the design of imaging validation studies: (1) What is the maximum acceptable registration error? (2) How many subjects are needed? (3) What is the minimum detectable difference between normal and pathologic image regions? Copyright © 2013 Elsevier B.V. All rights reserved.

  10. Clinical Validation of Heart Rate Apps: Mixed-Methods Evaluation Study

    PubMed Central

    Stans, Jelle; Mortelmans, Christophe; Van Haelst, Ruth; Van Schelvergem, Gertjan; Pelckmans, Caroline; Smeets, Christophe JP; Lanssens, Dorien; De Cannière, Hélène; Storms, Valerie; Thijs, Inge M; Vaes, Bert; Vandervoort, Pieter M

    2017-01-01

    Background Photoplethysmography (PPG) is a proven way to measure heart rate (HR). This technology is already available in smartphones, which allows measuring HR only by using the smartphone. Given the widespread availability of smartphones, this creates a scalable way to enable mobile HR monitoring. An essential precondition is that these technologies are as reliable and accurate as the current clinical (gold) standards. At this moment, there is no consensus on a gold standard method for the validation of HR apps. This results in different validation processes that do not always reflect the veracious outcome of comparison. Objective The aim of this paper was to investigate and describe the necessary elements in validating and comparing HR apps versus standard technology. Methods The FibriCheck (Qompium) app was used in two separate prospective nonrandomized studies. In the first study, the HR of the FibriCheck app was consecutively compared with 2 different Food and Drug Administration (FDA)-cleared HR devices: the Nonin oximeter and the AliveCor Mobile ECG. In the second study, a next step in validation was performed by comparing the beat-to-beat intervals of the FibriCheck app to a synchronized ECG recording. Results In the first study, the HR (BPM, beats per minute) of 88 random subjects consecutively measured with the 3 devices showed a correlation coefficient of .834 between FibriCheck and Nonin, .88 between FibriCheck and AliveCor, and .897 between Nonin and AliveCor. A one-way analysis of variance (ANOVA; P=.61) was executed to test the hypothesis that there were no significant differences between the HRs as measured by the 3 devices. In the second study, 20,298 (ms) R-R intervals (RRI)–peak-to-peak intervals (PPI) from 229 subjects were analyzed. This resulted in a positive correlation (rs=.993, root mean square deviation [RMSE]=23.04 ms, and normalized root mean square error [NRMSE]=0.012) between the PPI from FibriCheck and the RRI from the wearable

  11. Current Concerns in Validity Theory.

    ERIC Educational Resources Information Center

    Kane, Michael

    Validity is concerned with the clarification and justification of the intended interpretations and uses of observed scores. It has not been easy to formulate a general methodology, or set of principles, for validation, but progress has been made, especially as the field has moved from relatively limited criterion-related models to sophisticated…

  12. The relation between child feeding problems as measured by parental report and mealtime behavior observation: A pilot study.

    PubMed

    van Dijk, Marijn; Bruinsma, Eke; Hauser, M Paulina

    2016-04-01

    Because feeding problems have clear negative consequences for both child and caretakers, early diagnosis and intervention are important. Parent-report questionnaires can contribute to early identification, because they are efficient and typically offer a 'holistic' perspective of the child's eating in different contexts. In this pilot study, we aim to explore the concurrent validity of a short screening instrument (the SEP, which is the Dutch MCH-FS) in one of its target populations (a group of premature children) by comparing the total score with the observed behavior of the child and caretaker during a regular home meal. 28 toddlers (aged 9-18 months) and their caretakers participated in the study. Video-observations of the meals were coded for categories of eating behavior and parent-child interaction. The results show that the total SEP-score correlates with food refusal, feeding efficiency, and self-feeding, but not with negative affect and parental instructions. This confirms that the SEP has a certain degree of concurrent validity in the sense that its total score is associated with specific 'benchmark' feeding behaviors: food refusal, feeding efficiency and autonomy. Future studies with larger samples are needed to generalize the findings from this pilot to a broader context. Copyright © 2016 Elsevier Ltd. All rights reserved.

  13. The Chinese version of the Outcome Expectations for Exercise scale: validation study.

    PubMed

    Lee, Ling-Ling; Chiu, Yu-Yun; Ho, Chin-Chih; Wu, Shu-Chen; Watson, Roger

    2011-06-01

    The English nine-item Outcome Expectations for Exercise (OEE) scale has been tested and found to be reliable and valid for use in various settings, particularly among older people, with good internal consistency. Data on the use of the OEE scale among older Chinese people living in the community and how cultural differences might affect the administration of the OEE scale are limited. To test the validity and reliability of the Chinese version of the Outcome Expectations for Exercise scale among older people. A cross-sectional validation study was designed to test the Chinese version of the OEE scale (OEE-C). Reliability was examined by testing both the internal consistency for the overall scale and the squared multiple correlation coefficient for the single item measure. The validity of the scale was tested on the basis of both a traditional psychometric test and a confirmatory factor analysis using structural equation modelling. The Mokken Scaling Procedure (MSP) was used to investigate if there were any hierarchical, cumulative sets of items in the measure. The OEE-C scale was tested in a group of older people in Taiwan (n=108, mean age=77.1). There was acceptable internal consistency (alpha=.85) and model fit in the scale. Evidence of the validity of the measure was demonstrated by the tests for criterion-related validity and construct validity. There was a statistically significant correlation between exercise outcome expectations and exercise self-efficacy (r=.34, p<.01). An analysis of the Mokken Scaling Procedure found that nine items of the scale were all retained in the analysis and the resulting scale was reliable and statistically significant (p=.0008). The results obtained in the present study provided acceptable levels of reliability and validity evidence for the Chinese Outcome Expectations for Exercise scale when used with older people in Taiwan. Future testing of the OEE-C scale needs to be carried out

  14. Learning about Teachers' Literacy Instruction from Classroom Observations

    ERIC Educational Resources Information Center

    Kelcey, Ben; Carlisle, Joanne F.

    2013-01-01

    The purpose of this study is to contribute to efforts to improve methods for gathering and analyzing data from classroom observations in early literacy. The methodological approach addresses current problems of reliability and validity of classroom observations by taking into account differences in teachers' uses of instructional actions (e.g.,…

  15. Brazilian validation of the Alberta Infant Motor Scale.

    PubMed

    Valentini, Nadia Cristina; Saccani, Raquel

    2012-03-01

    The Alberta Infant Motor Scale (AIMS) is a well-known motor assessment tool used to identify potential delays in infants' motor development. Although Brazilian researchers and practitioners have used the AIMS in laboratories and clinical settings, its translation to Portuguese and validation for the Brazilian population is yet to be investigated. This study aimed to translate and validate all AIMS items with respect to internal consistency and content, criterion, and construct validity. A cross-sectional and longitudinal design was used. A cross-cultural translation was used to generate a Brazilian-Portuguese version of the AIMS. In addition, a validation process was conducted involving 22 professionals and 766 Brazilian infants (aged 0-18 months). The results demonstrated language clarity and internal consistency for the motor criteria (motor development score, α=.90; prone, α=.85; supine, α=.92; sitting, α=.84; and standing, α=.86). The analysis also revealed high discriminative power to identify typical and atypical development (motor development score, P<.001; percentile, P=.04; classification criterion, χ(2)=6.03; P=.05). Temporal stability (P=.07) (rho=.85, P<.001) was observed, and predictive power (P<.001) was limited to the group of infants aged from 3 months to 9 months. Limited predictive validity was observed, which may have been due to the restricted time that the groups were followed longitudinally. In sum, the translated version of AIMS presented adequate validity and reliability.

  16. Airglow studies using observations made with the GLO instrument on the Space Shuttle

    NASA Astrophysics Data System (ADS)

    Alfaro Suzan, Ana Luisa

    2009-12-01

    Our understanding of Earth's upper atmosphere has advanced tremendously over the last few decades due to our enhanced capacity for making remote observations from space. Space-based observations of Earth's daytime and nighttime airglow emissions are very good examples of such enhancements to our knowledge. The terrestrial nighttime airglow, or nightglow, is barely discernible to the naked eye as viewed from Earth's surface. However, it is clearly visible from space - as most astronauts have been amazed to report. The nightglow consists of emissions of ultraviolet, visible and near-infrared radiation from electronically excited oxygen molecules and atoms and vibrationally excited OH molecules. It mostly emanates from a 10 km thick layer located about 100 km above Earth's surface. Various photochemical models have been proposed to explain the production of the emitting species. In this study some unique observations of Earth's nightglow made with the GLO instrument on NASA's Space Shuttle are analyzed to assess the proposed excitation models. Previous analyses of these observations by Broadfoot and Gardner (2001), performed using a 1-D inversion technique, have indicated significant spatial structures and have raised serious questions about the proposed nightglow excitation models. However, the observation of such strong spatial structures calls into serious question the appropriateness of the adopted 1-D inversion technique and, therefore, the validity of the conclusions. In this study a more rigorous 2-D tomographic inversion technique is developed and applied to the available GLO data to determine if some of the apparent discrepancies can be explained by the limitations of the previously applied 1-D inversion approach. The results of this study still reveal some potentially serious inadequacies in the proposed photochemical models. However, alternative explanations for the discrepancies between the GLO observations and the model expectations are suggested. These

  17. Content Validation and Evaluation of an Endovascular Teamwork Assessment Tool.

    PubMed

    Hull, L; Bicknell, C; Patel, K; Vyas, R; Van Herzeele, I; Sevdalis, N; Rudarakanchana, N

    2016-07-01

    To modify, content validate, and evaluate a teamwork assessment tool for use in endovascular surgery. A multistage, multimethod study was conducted. Stage 1 included expert review and modification of the existing Observational Teamwork Assessment for Surgery (OTAS) tool. Stage 2 included identification of additional exemplar behaviours contributing to effective teamwork and enhanced patient safety in endovascular surgery (using real-time observation, focus groups, and semistructured interviews of multidisciplinary teams). Stage 3 included content validation of exemplar behaviours using expert consensus according to established psychometric recommendations and evaluation of structure, content, feasibility, and usability of the Endovascular Observational Teamwork Assessment Tool (Endo-OTAS) by an expert multidisciplinary panel. Stage 4 included final team expert review of exemplars. OTAS core team behaviours were maintained (communication, coordination, cooperation, leadership, team monitoring). Of the 114 OTAS behavioural exemplars, 19 were modified, four removed, and 39 additional endovascular-specific behaviours identified. Content validation of these 153 exemplar behaviours showed that 113/153 (73.9%) reached the predetermined Item-Content Validity Index rating for teamwork and/or patient safety. After expert team review, 140/153 (91.5%) exemplars were deemed to warrant inclusion in the tool. More than 90% of the expert panel agreed that Endo-OTAS is an appropriate teamwork assessment tool with observable behaviours. Some concerns were noted about the time required to conduct observations and provide performance feedback. Endo-OTAS is a novel teamwork assessment tool, with evidence for content validity and relevance to endovascular teams. Endo-OTAS enables systematic objective assessment of the quality of team performance during endovascular procedures. Copyright © 2016. Published by Elsevier Ltd.
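
    The abstract above reports how many exemplar behaviours reached a predetermined Item-Content Validity Index (I-CVI) but does not give the rating scale or the cutoff. As a rough sketch only, the following assumes the common convention that the I-CVI is the share of experts rating an item 3 or 4 on a 4-point relevance scale, and uses a hypothetical cutoff of 0.78. The item names, ratings, and function names are invented for illustration and are not taken from the study.

```python
def item_cvi(ratings, relevant_from=3):
    """Share of experts whose rating counts as 'relevant' (assumed: 3 or 4 of 4)."""
    if not ratings:
        raise ValueError("an item needs at least one expert rating")
    return sum(1 for r in ratings if r >= relevant_from) / len(ratings)


def items_meeting_cutoff(items, cutoff):
    """Return the item names whose I-CVI reaches the cutoff, and their share."""
    kept = [name for name, ratings in items.items() if item_cvi(ratings) >= cutoff]
    return kept, len(kept) / len(items)


# Invented ratings from a hypothetical panel of eight experts.
panel = {
    "confirms wire position aloud":  [4, 4, 3, 4, 3, 4, 4, 3],  # I-CVI 8/8
    "announces contrast injection":  [4, 3, 2, 4, 3, 4, 3, 4],  # I-CVI 7/8
    "briefs team on radiation dose": [2, 3, 2, 4, 1, 3, 2, 3],  # I-CVI 4/8
}
HYPOTHETICAL_CUTOFF = 0.78  # assumption; the abstract does not state the value

kept, share = items_meeting_cutoff(panel, HYPOTHETICAL_CUTOFF)

# The same arithmetic reproduces the percentages quoted in the abstract.
reported_icvi_share = round(100 * 113 / 153, 1)      # 113 of 153 exemplars
reported_inclusion_share = round(100 * 140 / 153, 1)  # 140 of 153 exemplars
```

    With these invented ratings, two of the three items pass the assumed cutoff; a study would substitute its own scale, panel, and threshold.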

  18. Validation of Student and Parent Reported Data on the Basic Grant Application Form: Pre-Award Validation Analysis Study. Revised Final Report.

    ERIC Educational Resources Information Center

    Applied Management Sciences, Inc., Silver Spring, MD.

    The 1978-1979 pre-award institution validation process for the Basic Educational Opportunity Grant (BEOG) program was studied, based on applicant and grant recipient files as of the end of February 1979. The objective was to assess the impact of the validation process on the proper award of BEOGs, and to determine whether the criteria for…

  19. External validation of preexisting first trimester preeclampsia prediction models.

    PubMed

    Allen, Rebecca E; Zamora, Javier; Arroyo-Manzano, David; Velauthar, Luxmilar; Allotey, John; Thangaratinam, Shakila; Aquilina, Joseph

    2017-10-01

    To validate the increasing number of prognostic models being developed for preeclampsia using our own prospective study. A systematic review of literature that assessed biomarkers, uterine artery Doppler and maternal characteristics in the first trimester for the prediction of preeclampsia was performed and models selected based on predefined criteria. Validation was performed by applying the regression coefficients that were published in the different derivation studies to our cohort. We assessed the models' discrimination ability and calibration. Twenty models were identified for validation. The discrimination ability observed in derivation studies (area under the curve, AUC) ranged from 0.70 to 0.96. When these models were applied to the validation cohort, the AUCs varied considerably, ranging from 0.504 to 0.833. Comparing the AUCs obtained in the derivation studies with those in the validation cohort, we found statistically significant differences in several studies. There currently isn't a definitive prediction model with adequate ability to discriminate for preeclampsia, which performs as well when applied to a different population and can differentiate well between the highest and lowest risk groups within the tested population. The pre-existing large number of models limits the value of further model development and future research should be focussed on further attempts to validate existing models and assessing whether implementation of these improves patient care. Crown Copyright © 2017. Published by Elsevier B.V. All rights reserved.
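
    The validation step described above, applying published regression coefficients to a new cohort and measuring discrimination, can be sketched as follows. This is a minimal illustration, not the authors' procedure: it assumes a logistic model, and the intercept, coefficients, predictors, and six-person cohort are all invented. The AUC is computed as the probability that a randomly chosen case receives a higher predicted risk than a randomly chosen non-case.

```python
import math


def predicted_risk(intercept, coefs, x):
    """Risk from a logistic model using fixed, previously published coefficients."""
    linear_predictor = intercept + sum(b * v for b, v in zip(coefs, x))
    return 1.0 / (1.0 + math.exp(-linear_predictor))


def auc(scores, outcomes):
    """Area under the ROC curve via pairwise comparison (ties count one half)."""
    cases = [s for s, y in zip(scores, outcomes) if y == 1]
    non_cases = [s for s, y in zip(scores, outcomes) if y == 0]
    if not cases or not non_cases:
        raise ValueError("need at least one case and one non-case")
    wins = 0.0
    for c in cases:
        for n in non_cases:
            if c > n:
                wins += 1.0
            elif c == n:
                wins += 0.5
    return wins / (len(cases) * len(non_cases))


# Hypothetical derivation-study model: invented numbers, for illustration only.
INTERCEPT = -3.0
COEFS = [0.04, 0.8]  # assumed predictors: mean arterial pressure (mmHg), prior preeclampsia (0/1)

# Hypothetical validation cohort: (predictor values, observed outcome).
cohort = [
    ([80.0, 0], 0), ([95.0, 0], 0), ([90.0, 1], 1),
    ([85.0, 0], 0), ([100.0, 1], 1), ([88.0, 0], 1),
]

# The coefficients are applied as published; nothing is refitted to the new cohort.
risks = [predicted_risk(INTERCEPT, COEFS, x) for x, _ in cohort]
outcomes = [y for _, y in cohort]
validation_auc = auc(risks, outcomes)
```

    Because no parameter is re-estimated, the resulting AUC reflects how the original model transfers to the new population, which is the comparison the abstract reports. Calibration, also assessed in the study, would need a separate check.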

  20. Global validation of a process-based model on vegetation gross primary production using eddy covariance observations.

    PubMed

    Liu, Dan; Cai, Wenwen; Xia, Jiangzhou; Dong, Wenjie; Zhou, Guangsheng; Chen, Yang; Zhang, Haicheng; Yuan, Wenping

    2014-01-01

    Gross Primary Production (GPP) is the largest flux in the global carbon cycle. However, large uncertainties in current global estimations persist. In this study, we examined the performance of a process-based model (Integrated BIosphere Simulator, IBIS) at 62 eddy covariance sites around the world. Our results indicated that the IBIS model explained 60% of the observed variation in daily GPP at all validation sites. Comparison with a satellite-based vegetation model (Eddy Covariance-Light Use Efficiency, EC-LUE) revealed that the IBIS simulations yielded GPP results comparable to those of the EC-LUE model. Global mean GPP estimated by the IBIS model was 107.50±1.37 Pg C year(-1) (mean value ± standard deviation) across the vegetated area for the period 2000-2006, consistent with the results of the EC-LUE model (109.39±1.48 Pg C year(-1)). To evaluate the uncertainty introduced by the parameter Vcmax, which represents the maximum photosynthetic capacity, we inverted Vcmax using Markov Chain Monte Carlo (MCMC) procedures. Using the inverted Vcmax values, the simulated global GPP increased by 16.5 Pg C year(-1), indicating that the IBIS model is sensitive to Vcmax and that large uncertainty exists in model parameterization.

  1. Validation of Satellite Retrieved Land Surface Variables

    NASA Technical Reports Server (NTRS)

    Lakshmi, Venkataraman; Susskind, Joel

    1999-01-01

    The effective use of satellite observations of the land surface is limited by the lack of high spatial resolution ground data sets for validation of satellite products. Recent large scale field experiments include FIFE, HAPEX-Sahel and BOREAS which provide us with data sets that have large spatial coverage and long time coverage. It is the objective of this paper to characterize the difference between the satellite estimates and the ground observations. This study and others along similar lines will help us in utilization of satellite retrieved data in large scale modeling studies.

  2. Sources of Self-Efficacy in Mathematics: A Validation Study

    ERIC Educational Resources Information Center

    Usher, Ellen L.; Pajares, Frank

    2009-01-01

    The purpose of this study was to develop and validate items with which to assess A. Bandura's (1997) theorized sources of self-efficacy among middle school mathematics students. Results from Phase 1 (N=1111) were used to develop and refine items for subsequent use. In Phase 2 of the study (N=824), a 39-item, four-factor exploratory model fit best.…

  3. A validation study of the Keyboard Personal Computer Style instrument (K-PeCS) for use with children.

    PubMed

    Green, Dido; Meroz, Anat; Margalit, Adi Edit; Ratzon, Navah Z

    2012-11-01

    This study examines a potential instrument for measurement of typing postures of children. This paper describes inter-rater, test-retest reliability and concurrent validity of the Keyboard Personal Computer Style instrument (K-PeCS), an observational measurement of postures and movements during keyboarding, for use with children. Two trained raters independently rated videos of 24 children (aged 7-10 years). Six children returned one week later for identifying test-retest reliability. Concurrent validity was assessed by comparing ratings obtained using the K-PeCS to scores from a 3D motion analysis system. Inter-rater reliability was moderate to high for 12 out of 16 items (Kappa: 0.46 to 1.00; correlation coefficients: 0.77-0.95) and test-retest reliability varied across items (Kappa: 0.25 to 0.67; correlation coefficients: r = 0.20 to r = 0.95). Concurrent validity compared favourably across arm pathlength, wrist extension and ulnar deviation. In light of the limitations of other tools, the K-PeCS offers a fairly affordable, reliable and valid instrument to address the gap for measurement of typing styles of children, despite the shortcomings of some items. However, further research is required to refine the instrument for use in evaluating typing among children. Copyright © 2012 Elsevier Ltd and The Ergonomics Society. All rights reserved.

  4. Research Measures for Dyscalculia: A Validity and Reliability Study.

    ERIC Educational Resources Information Center

    Geiman, R. M.

    1986-01-01

    This study sought to evaluate a measure of dyscalculia to determine its validity and reliability. It also tested use of the instrument with seventh graders and ascertained whether errors attributed to dyscalculia were also present in an average sample of seventh graders. Results varied. (MNS)

  5. Chemotherapy effectiveness and mortality prediction in surgically treated osteosarcoma dogs: A validation study.

    PubMed

    Schmidt, A F; Nielen, M; Withrow, S J; Selmic, L E; Burton, J H; Klungel, O H; Groenwold, R H H; Kirpensteijn, J

    2016-03-01

    Canine osteosarcoma is the most common bone cancer, and an important cause of mortality and morbidity, in large purebred dogs. Previously we constructed two multivariable models to predict a dog's 5-month or 1-year mortality risk after surgical treatment for osteosarcoma. According to the 5-month model, dogs with a relatively low risk of 5-month mortality benefited most from additional chemotherapy treatment. In the present study, we externally validated these results using an independent cohort study of 794 dogs. External performance of our prediction models showed some disagreement between observed and predicted risk, mean difference: -0.11 (95% confidence interval [95% CI]-0.29; 0.08) for 5-month risk and 0.25 (95%CI 0.10; 0.40) for 1-year mortality risk. After updating the intercept, agreement improved: -0.0004 (95%CI-0.16; 0.16) and -0.002 (95%CI-0.15; 0.15). The chemotherapy by predicted mortality risk interaction (P-value=0.01) showed that the chemotherapy compared to no chemotherapy effectiveness was modified by 5-month mortality risk: dogs with a relatively lower risk of mortality benefited most from additional chemotherapy. Chemotherapy effectiveness on 1-year mortality was not significantly modified by predicted risk (P-value=0.28). In conclusion, this external validation study confirmed that our multivariable risk prediction models can predict a patient's mortality risk and that dogs with a relatively lower risk of 5-month mortality seem to benefit most from chemotherapy. Copyright © 2016 Elsevier B.V. All rights reserved.

  6. Validating the Center for Epidemiological Studies Depression Scale for Children in Rwanda

    ERIC Educational Resources Information Center

    Betancourt, Theresa; Scorza, Pamela; Meyers-Ohki, Sarah; Mushashi, Christina; Kayiteshonga, Yvonne; Binagwaho, Agnes; Stulac, Sara; Beardslee, William R.

    2012-01-01

    Objective: We assessed the validity of the Center for Epidemiological Studies Depression Scale for Children (CES-DC) as a screen for depression in Rwandan children and adolescents. Although the CES-DC is widely used for depression screening in high-income countries, its validity in low-income and culturally diverse settings, including sub-Saharan…

  7. Evaluation of reporting quality for observational studies using routinely collected health data in pharmacovigilance.

    PubMed

    Nie, Xiaolu; Zhang, Ying; Wu, Zehao; Jia, Lulu; Wang, Xiaoling; Langan, Sinéad M; Benchimol, Eric I; Peng, Xiaoxia

    2018-06-01

    To appraise the reporting quality of studies concerning linezolid-related thrombocytopenia against the REporting of studies Conducted using Observational Routinely-collected health Data (RECORD) statement. Medline, Embase, the Cochrane Library and ClinicalTrials.gov were searched for observational studies concerning linezolid-related thrombocytopenia using routinely collected health data from 2000 to 2017. Two reviewers screened potentially eligible articles and extracted data independently. Finally, reporting quality assessment was performed by two senior researchers using the RECORD statement. Of 25 included studies, 11 (44.0%) mentioned the type of data in the title and/or abstract. Of the 38 items derived from the RECORD statement, the median number of items reported in the included studies was 22 (interquartile range (IQR) 18 to 27). Inadequate reporting was found in the following aspects: validation studies of the codes or algorithms, study size estimation, quantitative variables, subgroup statistical methods, missing data, follow-up/matching or sampling strategy, sensitivity analysis and cleaning methods, funding and role of funders, and accessibility of protocol and raw data. This study provides evidence that the reporting quality of post-marketing safety evaluation studies conducted using routinely collected health data was often insufficient. Future stakeholders are encouraged to endorse the RECORD guidelines in pharmacovigilance.

  8. Is the Scale for Measuring Motivational Interviewing Skills a valid and reliable instrument for measuring the primary care professionals motivational skills?: EVEM study protocol.

    PubMed

    Pérula, Luis Á; Campiñez, Manuel; Bosch, Josep M; Barragán Brun, Nieves; Arboniés, Juan C; Bóveda Fontán, Julia; Martín Alvarez, Remedios; Prados, Jose A; Martín-Rioboó, Enrique; Massons, Josep; Criado, Margarita; Fernández, José Á; Parras, Juan M; Ruiz-Moral, Roger; Novo, Jesús M

    2012-11-22

    Lifestyle is one of the main determinants of people's health. It is essential to find the most effective prevention strategies to be used to encourage behavioral changes in patients. Many theories are available that explain change or adherence to specific health behaviors in subjects. In this respect, Motivational Interviewing has increasingly gained relevance. Few well-validated instruments are available for measuring doctors' communication skills, and more specifically Motivational Interviewing. The hypothesis of this study is that the Scale for Measuring Motivational Interviewing Skills (EVEM questionnaire) is a valid and reliable instrument for measuring primary care professionals' skills in achieving behavior change in patients. To test the hypothesis we have designed a prospective, observational, multi-center study to validate a measuring instrument. -Scope: Thirty-two primary care centers in Spain. -Sampling and Size: a) face and consensual validity: A group composed of 15 experts in Motivational Interviewing. b) Assessment of the psychometric properties of the scale; 50 physician-patient encounters will be videoed; a total of 162 interviews will be conducted with six standardized patients, and another 200 interviews will be conducted with 50 real patients (n=362). Four physicians will be specially trained to assess 30 interviews randomly selected to test the scale reproducibility. -Measurements to test the hypothesis: a) Face validity: development of a draft questionnaire based on a theoretical model, by using Delphi-type methodology with experts. b) Scale psychometric properties: intraobservers will evaluate video recorded interviews: content-scalability validity (Exploratory Factor Analysis), internal consistency (Cronbach alpha), intra-/inter-observer reliability (Kappa index, intraclass correlation coefficient, Bland & Altman methodology), generalizability, construct validity and sensitivity to change (Pearson product-moment correlation

  9. Is the Scale for Measuring Motivational Interviewing Skills a valid and reliable instrument for measuring the primary care professionals motivational skills?: EVEM study protocol

    PubMed Central

    2012-01-01

    Background Lifestyle is one of the main determinants of people’s health. It is essential to find the most effective prevention strategies to be used to encourage behavioral changes in patients. Many theories are available that explain change or adherence to specific health behaviors in subjects. In this respect, Motivational Interviewing has increasingly gained relevance. Few well-validated instruments are available for measuring doctors’ communication skills, and more specifically Motivational Interviewing. Methods/Design The hypothesis of this study is that the Scale for Measuring Motivational Interviewing Skills (EVEM questionnaire) is a valid and reliable instrument for measuring primary care professionals’ skills in achieving behavior change in patients. To test the hypothesis we have designed a prospective, observational, multi-center study to validate a measuring instrument. -Scope: Thirty-two primary care centers in Spain. -Sampling and Size: a) face and consensual validity: A group composed of 15 experts in Motivational Interviewing. b) Assessment of the psychometric properties of the scale; 50 physician-patient encounters will be videoed; a total of 162 interviews will be conducted with six standardized patients, and another 200 interviews will be conducted with 50 real patients (n=362). Four physicians will be specially trained to assess 30 interviews randomly selected to test the scale reproducibility. -Measurements to test the hypothesis: a) Face validity: development of a draft questionnaire based on a theoretical model, by using Delphi-type methodology with experts. b) Scale psychometric properties: intraobservers will evaluate video recorded interviews: content-scalability validity (Exploratory Factor Analysis), internal consistency (Cronbach alpha), intra-/inter-observer reliability (Kappa index, intraclass correlation coefficient, Bland & Altman methodology), generalizability, construct validity and sensitivity to change

  10. Terminal illness and the increased mortality risk of conventional antipsychotics in observational studies: a systematic review.

    PubMed

    Luijendijk, Hendrika J; de Bruin, Niels C; Hulshof, Tessa A; Koolman, Xander

    2016-02-01

    Numerous large observational studies have shown an increased risk of mortality in elderly users of conventional antipsychotics. Health authorities have warned against use of these drugs. However, terminal illness is a potentially strong confounder of the observational findings. So, the objective of this study was to systematically assess whether terminal illness may have biased the observational association between conventional antipsychotics and risk of mortality in elderly patients. Studies were searched in PubMed, CINAHL, Embase, the references of selected studies and articles referring to selected studies (Web of Science). Inclusion criteria were (i) observational studies that estimated (ii) the risk of all-cause mortality in (iii) new elderly users of (iv) conventional antipsychotics compared with atypical antipsychotics or no use. Two investigators assessed the characteristics of the exposure and reference groups, main results, measured confounders and methods used to adjust for unmeasured confounders. We identified 21 studies. All studies were based on administrative medical and pharmaceutical databases. Sicker and older patients received conventional antipsychotics more often than new antipsychotics. The risk of dying was especially high in the first month of use, and when haloperidol was administered per injection or in high doses. Terminal illness was not measured in any study. Instrumental variables that were used were also confounded by terminal illness. We conclude that terminal illness has not been adjusted for in observational studies that reported an increased mortality risk in elderly users of conventional antipsychotics. As the validity of the evidence is questionable, so is the warning based on it. Copyright © 2015 John Wiley & Sons, Ltd.

  11. Detiding Tsunami Currents to Validate Velocities in Numerical Simulation Codes using Observations Near Hawaii from the 2011 Tohoku Tsunami

    NASA Astrophysics Data System (ADS)

    Adams, L. M.; LeVeque, R. J.

    2015-12-01

    The ability to measure, predict, and compute tsunami flow velocities is of importance in risk assessment and hazard mitigation. Until recently, few direct measurements of tsunami velocities existed to compare with model results. During the 11 March 2011 Tohoku Tsunami, 328 current meters were in place around the Hawaiian Islands, USA, that captured time series of water velocity in 18 locations, in both harbors and deep channels, at a series of depths. Arcos and LeVeque [1] compared these records against numerical simulations performed using the GeoClaw numerical tsunami model, which is based on the depth-averaged shallow water equations. They confirmed that GeoClaw can accurately predict velocities at nearshore locations, and that tsunami current velocity is more spatially variable than wave form or height and potentially more sensitive for model validation. We present a new approach to detiding this sensitive current data. This approach can be used separately on data at each depth of a current gauge. When averaged across depths, the GeoClaw results in [1] are validated. Without averaging, the results should be useful to researchers wishing to validate their 3D codes. These results can be downloaded from the project website below. The approach decomposes the pre-tsunami component of the data into three parts: a tidal component, a fast component (noise), and a slow component (not matched by the harmonic analysis). Each part is extended to the time when the tsunami is present and then subtracted from the current data to give the "tsunami current" that can be compared with 2D or 3D codes that do not model currents in the pre-tsunami regime. [1] "Validating Velocities in the GeoClaw Tsunami Model using Observations Near Hawaii from the 2011 Tohoku Tsunami", M.E.M. Arcos and Randall J. LeVeque, arXiv:1410.2884v1 [physics.geo-ph], 10 Oct. 2014. Project website: http://faculty.washington.edu/lma3/research.html

  12. External Validation Study of First Trimester Obstetric Prediction Models (Expect Study I): Research Protocol and Population Characteristics.

    PubMed

    Meertens, Linda Jacqueline Elisabeth; Scheepers, Hubertina Cj; De Vries, Raymond G; Dirksen, Carmen D; Korstjens, Irene; Mulder, Antonius Lm; Nieuwenhuijze, Marianne J; Nijhuis, Jan G; Spaanderman, Marc Ea; Smits, Luc Jm

    2017-10-26

    A number of first-trimester prediction models addressing important obstetric outcomes have been published. However, most models have not been externally validated. External validation is essential before implementing a prediction model in clinical practice. The objective of this paper is to describe the design of a study to externally validate existing first trimester obstetric prediction models, based upon maternal characteristics and standard measurements (eg, blood pressure), for the risk of pre-eclampsia (PE), gestational diabetes mellitus (GDM), spontaneous preterm birth (PTB), small-for-gestational-age (SGA) infants, and large-for-gestational-age (LGA) infants among Dutch pregnant women (Expect Study I). The results of a pilot study on the feasibility and acceptability of the recruitment process and the comprehensibility of the Pregnancy Questionnaire 1 are also reported. A multicenter prospective cohort study was performed in The Netherlands between July 1, 2013 and December 31, 2015. First trimester obstetric prediction models were systematically selected from the literature. Predictor variables were measured by the Web-based Pregnancy Questionnaire 1 and pregnancy outcomes were established using the Postpartum Questionnaire 1 and medical records. Information about maternal health-related quality of life, costs, and satisfaction with Dutch obstetric care was collected from a subsample of women. A pilot study was carried out before the official start of inclusion. External validity of the models will be evaluated by assessing discrimination and calibration. Based on the pilot study, minor improvements were made to the recruitment process and online Pregnancy Questionnaire 1. The validation cohort consists of 2614 women. Data analysis of the external validation study is in progress. This study will offer insight into the generalizability of existing, non-invasive first trimester prediction models for various obstetric outcomes in a Dutch obstetric population.

  13. Development and validation of the Salzburg COPD-screening questionnaire (SCSQ): a questionnaire development and validation study.

    PubMed

    Weiss, Gertraud; Steinacher, Ina; Lamprecht, Bernd; Kaiser, Bernhard; Mikes, Romana; Sator, Lea; Hartl, Sylvia; Wagner, Helga; Studnicka, M

    2017-01-26

    Chronic obstructive pulmonary disease prevalence rates are still high. However, the majority of subjects are not diagnosed. Strategies have to be implemented to overcome the problem of under-diagnosis. Questionnaires could be used to pre-select subjects for spirometry and thereby help reduce under-diagnosis. We report a brief, simple, self-administrable and validated chronic obstructive pulmonary disease questionnaire to increase the pre-test probability for chronic obstructive pulmonary disease diagnosis in subjects undergoing confirmatory spirometry. In 2005, we completed the Austrian Burden of Obstructive Lung Disease-study in 1258 subjects aged >40 years. Post-bronchodilator spirometry was performed, and non-reversible airflow limitation was defined by an FEV1/FVC ratio below the lower limit of normal. Questions from the Salzburg chronic obstructive pulmonary disease screening-questionnaire were selected using a logistic regression model, and risk scores were based on regression-coefficients. A training sub-sample (n = 800) was used to create the score, and a test sub-sample (n = 458) was used to test it. In 2008, an external validation study was done, using the same protocol in 775 patients from primary care. The Salzburg chronic obstructive pulmonary disease screening questionnaire was composed of items related to "breathing problems", "wheeze", "cough", "limitation of physical activity", and "smoking". At the >=2 points cut-off of the Salzburg chronic obstructive pulmonary disease screening questionnaire, sensitivity was 69.1% [95%CI: 56.6%; 79.5%], specificity 60.0% [95%CI: 54.9%; 64.9%], the positive predictive value 23.2% [95%CI: 17.7%; 29.7%] and the negative predictive value 91.8% [95%CI: 87.5%; 95.7%] to detect post bronchodilator airflow limitation. The external validation study in primary care confirmed these findings. The Salzburg chronic obstructive pulmonary disease screening questionnaire was derived from the highly standardized Burden of
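
    The four accuracy figures quoted in this abstract all follow from a single 2x2 table of questionnaire result against spirometry result. The sketch below shows that arithmetic. The counts are hypothetical: they were chosen only so that they sum to 458 and reproduce the reported percentages, and they are not taken from the paper, which does not state them in the abstract.

```python
def screening_metrics(tp, fn, fp, tn):
    """Return sensitivity, specificity, PPV and NPV as fractions.

    tp/fn: diseased subjects scoring at/above vs below the cut-off
    fp/tn: non-diseased subjects scoring at/above vs below the cut-off
    """
    return {
        "sensitivity": tp / (tp + fn),  # diseased correctly flagged
        "specificity": tn / (tn + fp),  # non-diseased correctly cleared
        "ppv": tp / (tp + fp),          # flagged subjects who are diseased
        "npv": tn / (tn + fn),          # cleared subjects who are healthy
    }


# Hypothetical counts (an assumption for illustration, NOT from the paper).
metrics = screening_metrics(tp=47, fn=21, fp=156, tn=234)

for name, value in metrics.items():
    print(f"{name}: {value:.1%}")
```

    With a low disease prevalence, a moderate specificity is enough to pull the positive predictive value down sharply while the negative predictive value stays high, which is the pattern the abstract reports.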

  14. Community validation of the IDEA study cognitive screen in rural Tanzania.

    PubMed

    Gray, William K; Paddick, Stella Maria; Collingwood, Cecilia; Kisoli, Aloyce; Mbowe, Godfrey; Mkenda, Sarah; Lissu, Carolyn; Rogathi, Jane; Kissima, John; Walker, Richard W; Mushi, Declare; Chaote, Paul; Ogunniyi, Adesola; Dotchin, Catherine L

    2016-11-01

    The dementia diagnosis gap in sub-Saharan Africa (SSA) is large, partly because of difficulties in screening for cognitive impairment in the community. As part of the Identification and Intervention for Dementia in Elderly Africans (IDEA) study, we aimed to validate the IDEA cognitive screen in a community-based sample in rural Tanzania. Study participants were recruited from people who attended screening days held in villages within the rural Hai district of Tanzania. Criterion validity was assessed against the gold standard clinical dementia diagnosis using DSM-IV criteria. Construct validity was assessed against age, education, sex, grip strength and instrumental activities of daily living (IADLs). Internal consistency and floor and ceiling effects were also examined. During community screening, the IDEA cognitive screen had high criterion validity, with an area under the receiver operating characteristic curve of 0.855 (95% CI 0.794 to 0.915). Higher scores on the screen were significantly correlated with lower age, male sex, having attended school, better grip strength and improved performance in activities of daily living. Factor analysis revealed a single factor with an eigenvalue greater than one, although internal consistency was only moderate (Cronbach's alpha = 0.534). The IDEA cognitive screen had high criterion and construct validity and is suitable for use as a cognitive screening instrument in a community setting in SSA. Only moderate internal consistency may partly reflect the multi-domain nature of dementia as diagnosed clinically. Copyright © 2016 John Wiley & Sons, Ltd.

  15. Observing Atmospheric Formaldehyde (HCHO) from Space: Validation and Intercomparison of Six Retrievals from Four Satellites (OMI, GOME2A, GOME2B, OMPS) with SEAC4RS Aircraft Observations over the Southeast US

    NASA Technical Reports Server (NTRS)

    Zhu, Lei; Jacob, Daniel J.; Kim, Patrick S.; Fisher, Jenny A.; Yu, Karen; Travis, Katherine R.; Mickley, Loretta J.; Yantosca, Robert M.; Sulprizio, Melissa P.; De Smedt, Isabelle

    2016-01-01

    Formaldehyde (HCHO) column data from satellites are widely used as a proxy for emissions of volatile organic compounds (VOCs), but validation of the data has been extremely limited. Here we use highly accurate HCHO aircraft observations from the NASA SEAC4RS (Studies of Emissions, Atmospheric Composition, Clouds and Climate Coupling by Regional Surveys) campaign over the southeast US in August-September 2013 to validate and intercompare six retrievals of HCHO columns from four different satellite instruments (OMI (Ozone Monitoring Instrument), GOME (Global Ozone Monitoring Experiment) 2A, GOME (Global Ozone Monitoring Experiment) 2B and OMPS (Ozone Mapping and Profiler Suite)) and three different research groups. The GEOS (Goddard Earth Observing System)-Chem chemical transport model is used as a common intercomparison platform. All retrievals feature a HCHO maximum over Arkansas and Louisiana, consistent with the aircraft observations and reflecting high emissions of biogenic isoprene. The retrievals are also interconsistent in their spatial variability over the southeast US (r = 0.4 to 0.8 on a 0.5° x 0.5° grid) and in their day-to-day variability (r = 0.5 to 0.8). However, all retrievals are biased low in the mean by 20 to 51 percent, which would lead to corresponding bias in estimates of isoprene emissions from the satellite data. The smallest bias is for OMI-BIRA (Ozone Monitoring Instrument - Belgian Institute for Space Aeronomy), which has high corrected slant columns relative to the other retrievals and low scattering weights in its air mass factor (AMF) calculation. OMI-BIRA has systematic error in its assumed vertical HCHO shape profiles for the AMF calculation, and correcting this would eliminate its bias relative to the SEAC4RS data. Our results support the use of satellite HCHO data as a quantitative proxy for isoprene emission after correction of the low mean bias. There is no evident pattern in the bias, suggesting that

  16. Quality Rating and Improvement System (QRIS) Validation Study Designs. CEELO FastFacts

    ERIC Educational Resources Information Center

    Schilder, D.

    2013-01-01

    In this "Fast Facts," a state has received Race to the Top Early Learning Challenge funds and is seeking information to inform the design of the Quality Rating and Improvement System (QRIS) validation study. The Center on Enhancing Early Learning Outcomes (CEELO) responds that according to Resnick (2012), validation of a QRIS is an…

  17. Validation study of the Japanese version of the Obsessive-Compulsive Drinking Scale.

    PubMed

    Tatsuzawa, Yasutaka; Yoshimasu, Haruo; Moriyama, Yasushi; Furusawa, Teruyuki; Yoshino, Aihide

    2002-02-01

    The Obsessive-Compulsive Drinking Scale (OCDS) is a self-rating questionnaire that measures cognitive and behavioral aspects of craving for alcohol. The OCDS consists of two subscales: the obsessive thoughts of drinking subscale (OS) and the compulsive drinking subscale (CS). This study aims to validate the Japanese version of the OCDS. First, internal consistency and discriminant validity were evaluated. Second, a prospective longitudinal 3-month outcome study of 67 patients with alcohol dependence who participated in a relapse prevention program was designed to assess the concurrent and predictive validity of the OCDS. The OCDS demonstrated high internal consistency. The OS had high discriminant validity, while the CS did not. Twenty-three patients (34.3%) dropped out of treatment. These patients had significantly higher OS scores than those who completed the program. At 3 months, the relapse group had significantly higher OCDS scores than the no relapse group. Also, the OCDS score was higher in subjects who had early-onset alcohol dependence than in those with late-onset dependence. The OCDS is useful for evaluating the cognitive aspects of craving and predicts dropout and relapse.

  18. Validation of TES ammonia observations at the single pixel scale in the San Joaquin Valley during DISCOVER-AQ

    NASA Astrophysics Data System (ADS)

    Sun, Kang; Cady-Pereira, Karen; Miller, David J.; Tao, Lei; Zondlo, Mark A.; Nowak, John B.; Neuman, J. A.; Mikoviny, Tomas; Müller, Markus; Wisthaler, Armin; Scarino, Amy J.; Hostetler, Chris A.

    2015-05-01

    Ammonia measurements from a vehicle-based, mobile open-path sensor and those from aircraft were compared with Tropospheric Emission Spectrometer (TES) NH3 columns at the pixel scale during the NASA Deriving Information on Surface conditions from Column and Vertically Resolved Observations Relevant to Air Quality field experiment. Spatial and temporal mismatches were reduced by having the mobile laboratory sample in the same areas as the TES footprints. To examine how large heterogeneities in the NH3 surface mixing ratios may affect validation, a detailed spatial survey was performed within a single TES footprint around the overpass time. The TES total NH3 column above a single footprint showed excellent agreement with the in situ total column constructed from surface measurements with a difference of 2% (within the combined measurement uncertainties). The comparison was then extended to a TES transect of nine footprints where aircraft data (5-80 ppbv) were available in a narrow spatiotemporal window (<10 km, <1 h). The TES total NH3 columns above the nine footprints agreed to within 6% of the in situ total columns derived from the aircraft-based measurements. Finally, to examine how TES captures surface spatial gradients at the interpixel scale, ground-based, mobile measurements were performed directly underneath a TES transect, covering nine footprints within ±1.5 h of the overpass. The TES total columns were strongly correlated (R2 = 0.82) with the median NH3 mixing ratios measured at the surface. These results provide the first in situ validation of the TES total NH3 column product, and the methodology is applicable to other satellite observations of short-lived species at the pixel scale.

  19. Clinical photographic observation of plantar corns and callus associated with a nominal scale classification and inter-observer reliability study in a student population.

    PubMed

    Tollafield, David R

    2017-01-01

    The management of plantar corns and callus has a low cost-benefit with reduced prioritisation in healthcare. The distinction between types of keratin lesions that form corns and callus has attracted limited interest. Observation is imperative to improving diagnostic predictions and a number of studies point to some confusion as to how best to achieve this. The use of photographic observation has been proposed to improve our understanding of intractable keratin lesions. Students from a podiatry school reviewed photographs where plantar keratin lesions were divided into four nominal groups: light callus (Grade 1), heavy defined callus (Grade 2), concentric keratin plugs (Grade 3) and callus with deeper density changes under the forefoot (Grade 4). A group of 'experts' assigned from qualified podiatrists validated the observer rated responses by the students. Cohen's weighted statistic (k) was used to measure inter-observer reliability. First year students (unskilled) performed less well when viewing photographs (k = 0.33) compared to third year students (semi-skilled, k = 0.62). The experts performed better than students (k = 0.88), providing consistency with wound care models in other studies. Improved clinical annotation of clinical features, supported by classification of keratin-based lesions, combined with patient outcome tools, could improve the scientific rationale to prioritise patient care. Problems associated with photographic assessment involve trying to differentiate similar lesions without the benefit of direct palpation. Direct observation of callus with and without debridement requires further investigation alongside the model proposed in this paper.
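
    The abstract reports agreement as Cohen's weighted statistic without saying how disagreements were weighted. The sketch below is one common formulation, assuming linear disagreement weights over the ordered grades 1 to 4; the weighting scheme, the function name and the example ratings are all assumptions for illustration and are not taken from the study.

```python
from collections import Counter


def weighted_kappa(ratings_a, ratings_b, categories):
    """Cohen's kappa with linear disagreement weights |i - j| / (k - 1).

    ratings_a, ratings_b: equal-length sequences of grades from two raters.
    categories: the ordered list of possible grades.
    """
    if len(ratings_a) != len(ratings_b):
        raise ValueError("both raters must grade the same items")
    index = {c: i for i, c in enumerate(categories)}
    k = len(categories)
    n = len(ratings_a)

    observed = Counter(zip(ratings_a, ratings_b))  # joint counts
    margin_a = Counter(ratings_a)                  # rater A totals
    margin_b = Counter(ratings_b)                  # rater B totals

    disagree_obs = 0.0  # weighted observed disagreement
    disagree_exp = 0.0  # weighted disagreement expected by chance
    for a in categories:
        for b in categories:
            w = abs(index[a] - index[b]) / (k - 1)
            disagree_obs += w * observed[(a, b)] / n
            disagree_exp += w * (margin_a[a] / n) * (margin_b[b] / n)

    # Undefined (division by zero) if both raters use one single grade.
    return 1.0 - disagree_obs / disagree_exp


# Hypothetical ratings of four photographs by a student and an expert.
student = [1, 2, 3, 3]
expert = [1, 2, 3, 4]
kappa = weighted_kappa(expert, student, categories=[1, 2, 3, 4])
print(f"weighted kappa = {kappa:.3f}")
```

    With linear weights, confusing Grade 3 with Grade 4 is penalised a third as much as confusing Grade 1 with Grade 4. For a strictly nominal scale, an unweighted kappa (every disagreement weighted 1) may be the more appropriate choice.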

  20. Validity and reliability of the VOAA-DDD to assess spontaneous hand use with a video observation tool in children with spastic unilateral cerebral palsy.

    PubMed

    Aarts, Pauline B M; Jongerius, Peter H; Geerdink, Yvonne A; Geurts, Alexander C

    2009-11-25

    In 2003 new computer software, the VOAA (Video Observations Aarts and Aarts), was designed to score and evaluate two important aspects of spontaneous upper limb use, i.e. overall duration and frequency of specific behaviours. The aim of this study was to investigate the test-retest, interrater and intrarater reliability and the construct validity of a new module, the VOAA-DDD, to determine developmental disregard in children with spastic unilateral cerebral palsy (CP). A test-retest design with three raters for reliability and a two-group design for construct validity were used. Subjects were a total of 20 children with spastic unilateral CP equally divided in two age groups (2.5-5 and 5-8 years), and 56 healthy children of the same age groups. Overall duration and frequency of specific behaviours of the affected arm and hand were assessed during a task demanding ('stringing beads') and a task stimulating ('decorating a muffin') the use of both hands. Reliability was estimated by intraclass correlation coefficients (ICCs). Construct validity was assessed by comparing children with CP to healthy children. All ICCs exceeded 0.87. In contrast with healthy children, children with CP used their affected hand less during the 'muffin' task compared to the 'beads' task. Of the children with CP, 90% in the age group of 2.5-5 years and 50% in the age group of 5-8 years showed values exceeding the extreme values of healthy controls, respectively, indicating developmental disregard. The VOAA-DDD is a reliable and valid instrument to assess spontaneous use of the affected arm and hand in order to determine developmental disregard in children with spastic unilateral CP.

  1. A new framework to enhance the interpretation of external validation studies of clinical prediction models.

    PubMed

    Debray, Thomas P A; Vergouwe, Yvonne; Koffijberg, Hendrik; Nieboer, Daan; Steyerberg, Ewout W; Moons, Karel G M

    2015-03-01

    It is widely acknowledged that the performance of diagnostic and prognostic prediction models should be assessed in external validation studies with independent data from "different but related" samples as compared with that of the development sample. We developed a framework of methodological steps and statistical methods for analyzing and enhancing the interpretation of results from external validation studies of prediction models. We propose to quantify the degree of relatedness between development and validation samples on a scale ranging from reproducibility to transportability by evaluating their corresponding case-mix differences. We subsequently assess the models' performance in the validation sample and interpret the performance in view of the case-mix differences. Finally, we may adjust the model to the validation setting. We illustrate this three-step framework with a prediction model for diagnosing deep venous thrombosis using three validation samples with varying case mix. While one external validation sample merely assessed the model's reproducibility, two other samples rather assessed model transportability. The performance in all validation samples was adequate, and the model did not require extensive updating to correct for miscalibration or poor fit to the validation settings. The proposed framework enhances the interpretation of findings at external validation of prediction models. Copyright © 2015 The Authors. Published by Elsevier Inc. All rights reserved.

  2. The birth satisfaction scale: Turkish adaptation, validation and reliability study

    PubMed Central

    Cetin, Fatma Cosar; Sezer, Ayse; Merih, Yeliz Dogan

    2015-01-01

    OBJECTIVE: The objective of this study is to investigate the validity and the reliability of the Birth Satisfaction Scale (BSS) and to adapt it into the Turkish language. This scale is used for measuring maternal satisfaction with birth in order to evaluate women’s birth perceptions. METHODS: The study included 150 women who attended an inpatient postpartum clinic. The participants filled in an information form and the BSS questionnaire forms. The properties of the scale were tested by conducting reliability and validation analyses. RESULTS: BSS entails 30 Likert-type questions. It was developed by Hollins Martin and Fleming. Total scale scores ranged between 30–150 points. Higher scores on the scale indicate greater birth satisfaction. Three overarching themes were identified in the scale: service provision (home assessment, birth environment, support, relationships with health care professionals); personal attributes (ability to cope during labour, feeling in control, childbirth preparation, relationship with baby); and stress experienced during labour (distress, obstetric injuries, receiving sufficient medical care, obstetric intervention, pain, prolonged labour and baby’s health). Cronbach’s alpha coefficient was 0.62. CONCLUSION: According to the present study, BSS entails 30 Likert-type questions and evaluates women’s birth perceptions. The Turkish version of BSS has been proven to be a valid and a reliable scale. PMID:28058355

  3. Do physicians clean their hands? Insights from a covert observational study.

    PubMed

    Kovacs-Litman, Adam; Wong, Kimberly; Shojania, Kaveh G; Callery, Sandra; Vearncombe, Mary; Leis, Jerome A

    2016-12-01

    Physicians are notorious for poor hand hygiene (HH) compliance. We wondered if lower performance by physicians compared with other health professionals might reflect differences in the Hawthorne effect. We introduced covert HH observers to see if performance differences between physicians and nurses decreased and to gain further insights into physician HH behaviors. Following training and validation with a hospital HH auditor, 2 students covertly measured HH during clinical rotations. Students rotated off clinical services every week to increase exposure to different providers and minimize risk of exposing the covert observation. We compared covertly measured HH compliance with data from overt observation by hospital auditors during the same time period. Covert observation produced much lower HH compliance than recorded by hospital auditors during the same time period: 50.0% (799/1597) versus 83.7% (2769/3309) (P < 0.0002). The difference in physician compliance between hospital auditors and covert observers was 19.0% (73.2% vs 54.2%); for nurses this difference was much higher at 40.7% (85.8% vs 45.1%) (P < 0.0001). Physician trainees showed markedly better compliance when attending staff cleaned their hands compared with encounters when attending did not (79.5% vs 18.9%; P < 0.0002). Our study suggests that traditional HH audits not only overstate HH performance overall, but can lead to inaccurate inferences about performance by professional groupings due to relative differences in the Hawthorne effect. We suggest that future improvement efforts will rely on more accurate HH monitoring systems and strong attending physician leadership to set an example for trainees. Journal of Hospital Medicine 2015;11:862-864. © 2015 Society of Hospital Medicine. © 2016 Society of Hospital Medicine.
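
    The compliance percentages and the physician and nurse gaps quoted in this abstract can be checked directly from the counts it gives. The short sketch below reproduces that arithmetic; the function and variable names are illustrative only.

```python
def compliance(clean_events, opportunities):
    """Hand hygiene compliance as a percentage of observed opportunities."""
    return 100.0 * clean_events / opportunities


# Counts quoted in the abstract.
covert = compliance(799, 1597)   # covert student observers
overt = compliance(2769, 3309)   # overt hospital auditors

# Gap between overt and covert compliance, by professional group,
# using the percentages quoted in the abstract.
physician_gap = 73.2 - 54.2
nurse_gap = 85.8 - 45.1

print(f"covert: {covert:.1f}%  overt: {overt:.1f}%")
print(f"physician gap: {physician_gap:.1f} points")
print(f"nurse gap: {nurse_gap:.1f} points")
```

    The larger gap for nurses is what leads the authors to argue that overt audits distort comparisons between professional groups, not just the overall level.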

  4. Ways of learning: Observational studies versus experiments

    USGS Publications Warehouse

    Shaffer, T.L.; Johnson, D.H.

    2008-01-01

    Manipulative experimentation that features random assignment of treatments, replication, and controls is an effective way to determine causal relationships. Wildlife ecologists, however, often must take a more passive approach to investigating causality. Their observational studies lack one or more of the 3 cornerstones of experimentation: controls, randomization, and replication. Although an observational study can be analyzed similarly to an experiment, one is less certain that the presumed treatment actually caused the observed response. Because the investigator does not actively manipulate the system, the chance that something other than the treatment caused the observed results is increased. We reviewed observational studies and contrasted them with experiments and, to a lesser extent, sample surveys. We identified features that distinguish each method of learning and illustrate or discuss some complications that may arise when analyzing results of observational studies. Findings from observational studies are prone to bias. Investigators can reduce the chance of reaching erroneous conclusions by formulating a priori hypotheses that can be pursued multiple ways and by evaluating the sensitivity of study conclusions to biases of various magnitudes. In the end, however, professional judgment that considers all available evidence is necessary to render a decision regarding causality based on observational studies.

  5. Five-Factor Screener in the 2005 National Health Interview Survey Cancer Control Supplement: Validation Results

    Cancer.gov

    Risk Factor Assessment Branch staff have assessed indirectly the validity of parts of the Five-Factor Screener in two studies: NCI's Observing Protein and Energy (OPEN) Study and the Eating at America's Table Study (EATS). In both studies, multiple 24-hour recalls in conjunction with a measurement error model were used to assess validity.

  6. Accuracy of clinical observations of push-off during gait after stroke.

    PubMed

    McGinley, Jennifer L; Morris, Meg E; Greenwood, Ken M; Goldie, Patricia A; Olney, Sandra J

    2006-06-01

    To determine the accuracy (criterion-related validity) of real-time clinical observations of push-off in gait after stroke. Criterion-related validity study of gait observations. Rehabilitation hospital in Australia. Eleven participants with stroke and 8 treating physical therapists. Not applicable. Pearson product-moment correlation between physical therapists' observations of push-off during gait and criterion measures of peak ankle power generation from a 3-dimensional motion analysis system. A high correlation was obtained between the observational ratings and the measurements of peak ankle power generation (Pearson r = .98). The standard error of estimation of ankle power generation was .32 W/kg. Physical therapists can make accurate real-time clinical observations of push-off during gait following stroke.

  7. Effects of Coaching on the Validity of the SAT: A Simulation Study.

    ERIC Educational Resources Information Center

    Baydar, Nazli

    The effects of student coaching in preparation for the College Board Scholastic Aptitude Test (SAT) on the predictive validity of this test for freshman year performance were studied using data on 1985 freshman year students from four colleges. After the validity of the SAT was estimated for each school, a given proportion of students was picked,…

  8. Seat belt use among rear passengers: validity of self-reported versus observational measures

    PubMed Central

    Zambon, Francesco; Fedeli, Ugo; Marchesan, Maria; Schievano, Elena; Ferro, Antonio; Spolaore, Paolo

    2008-01-01

    Background: The effects of seat belt laws and public education campaigns on seat belt use are assessed on the basis of observational or self-reported data on seat belt use. Previous studies focusing on front seat occupants have shown that self-reports indicate a greater seat belt usage than observational findings. Whether this over-reporting in self reports applies to rear seat belt usage, and to what extent, have yet to be investigated. We aimed to evaluate the over-reporting factor for rear seat passengers and whether this varies by gender and under different compulsory seat belt use conditions. Methods: The study was conducted in the Veneto Region, an area in the North-East of Italy with a population of 4.7 million. The prevalence of seat belt use among rear seat passengers was determined by means of a cross-sectional self-report survey and an observational study. Both investigations were performed in two time periods: in 2003, when rear seat belt use was not enforced by primary legislation, and in 2005, after rear seat belt use had become compulsory (June 2003). Overall, 8138 observations and 7902 interviews were recorded. Gender differences in the prevalence of rear seat belt use were examined using the chi-square test. The over-reporting factor, defined as the ratio of the self-reported to the observed prevalence of rear seat belt use, was calculated by gender before and after the rear seat belt legislation came into effect. Results: Among rear seat passengers, self-reported rates were always higher than the observational findings, with an overall over-reporting factor of 1.4. We registered no statistically significant changes over time in the over-reporting factor, nor any major differences between genders. Conclusion: Self-reported seat belt usage by rear passengers represents an efficient alternative to observational studies for tracking changes in actual behavior, although the reported figures need to be adjusted using an appropriate over-reporting factor in
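
    The over-reporting factor defined in this abstract is a simple ratio, and the suggested adjustment is its inverse. The sketch below illustrates both. The two prevalence values are hypothetical, chosen only to give the reported overall factor of 1.4; the abstract does not state the underlying prevalences.

```python
def over_reporting_factor(self_reported, observed):
    """Ratio of self-reported to observed prevalence of seat belt use."""
    if observed <= 0:
        raise ValueError("observed prevalence must be positive")
    return self_reported / observed


def adjust_self_report(self_reported, factor):
    """Estimate observed prevalence from a self-reported figure."""
    return self_reported / factor


# Hypothetical prevalences (assumptions for illustration, not study data).
factor = over_reporting_factor(self_reported=0.28, observed=0.20)
estimate = adjust_self_report(0.28, factor)

print(f"over-reporting factor: {factor:.1f}")
print(f"adjusted prevalence: {estimate:.2f}")
```

    Because the study found the factor stable over time and across genders, a single divisor of this kind could in principle be applied to later self-report surveys in the same population.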

  9. Synthesis and validation of novel cholesterol-based fluorescent lipids designed to observe the cellular trafficking of cationic liposomes.

    PubMed

    Kim, Bieong-Kil; Seu, Young-Bae; Choi, Jong-Soo; Park, Jong-Won; Doh, Kyung-Oh

    2015-09-15

    Cholesterol-based fluorescent lipids with an ether linker were synthesized using NBD (Chol-E-NBD) or Rhodamine B (Chol-E-Rh), and their usefulness as fluorescent probes for tracing cholesterol-based liposomes was validated. The fluorescent intensities of liposomes containing these modified lipids were measured and observed under a microscope. Neither compound interfered with the expression of GFP plasmid, and live cell images were obtained without interference. Changes in the fluorescent intensity of liposomes containing Chol-E-NBD were followed by flow cytometry for up to 24 h. These fluorescent lipids could be useful probes for trafficking of cationic liposome-mediated gene delivery. Copyright © 2015 Elsevier Ltd. All rights reserved.

  10. Developing the Coach Analysis and Intervention System (CAIS): establishing validity and reliability of a computerised systematic observation instrument.

    PubMed

    Cushion, Christopher; Harvey, Stephen; Muir, Bob; Nelson, Lee

    2012-01-01

    We outline the evolution of a computerised systematic observation tool and describe the process for establishing the validity and reliability of this new instrument. The Coach Analysis and Intervention System (CAIS) has 23 primary behaviours related to physical behaviour, feedback/reinforcement, instruction, verbal/non-verbal, questioning and management. The instrument also analyses secondary coach behaviour related to performance states, recipient, timing, content and questioning/silence. The CAIS is a multi-dimensional and multi-level mechanism able to provide detailed and contextualised data about specific coaching behaviours occurring in complex and nuanced coaching interventions and environments that can be applied to both practice sessions and competition.

  11. The Behavior Pain Assessment Tool for critically ill adults: a validation study in 28 countries.

    PubMed

    Gélinas, Céline; Puntillo, Kathleen A; Levin, Pavel; Azoulay, Elie

    2017-05-01

    Many critically ill adults are unable to communicate their pain through self-report. The study purpose was to validate the use of the 8-item Behavior Pain Assessment Tool (BPAT) in patients hospitalized in 192 intensive care units from 28 countries. A total of 4812 procedures in 3851 patients were included in data analysis. Patients were assessed with the BPAT before and during procedures by 2 different raters (mostly nurses and physicians). Those who were able to self-report were asked to rate their pain intensity and pain distress on 0 to 10 numeric rating scales. Interrater reliability of behavioral observations was supported by moderate (0.43-0.60) to excellent (>0.60) kappa coefficients. Mixed effects multilevel logistic regression models showed that most behaviors were more likely to be present during the procedure than before and in less sedated patients, demonstrating discriminant validation of the tool use. Regarding criterion validation, moderate positive correlations were found during procedures between the mean BPAT scores and the mean pain intensity (r = 0.54) and pain distress (r = 0.49) scores (P < 0.001). Regression models showed that all behaviors were significant predictors of pain intensity and pain distress, accounting for 35% and 29% of their total variance, respectively. A BPAT cut-point score >3.5 could classify patients with or without severe levels (≥8) of pain intensity and distress with sensitivity and specificity findings ranging from 61.8% to 75.1%. The BPAT was found to be reliable and valid. Its feasibility for use in practice and the effect of its clinical implementation on patient pain and intensive care unit outcomes need further research.
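
    The cut-point classification reported above can be illustrated with a minimal sketch. It assumes that a score strictly above the cut-point is read as predicting severe pain; the function name and the sample data are hypothetical and are not taken from the study.

```python
def sensitivity_specificity(scores, severe, cut_point):
    """Return (sensitivity, specificity) when score > cut_point predicts severe pain.

    scores: behavioural scores, one per patient (hypothetical values).
    severe: booleans, True when the self-reported rating counts as severe.
    """
    tp = fn = tn = fp = 0
    for score, is_severe in zip(scores, severe):
        predicted_severe = score > cut_point
        if is_severe and predicted_severe:
            tp += 1
        elif is_severe and not predicted_severe:
            fn += 1
        elif not is_severe and not predicted_severe:
            tn += 1
        else:
            fp += 1
    # Sensitivity: share of truly severe cases flagged by the cut-point.
    # Specificity: share of non-severe cases left unflagged.
    return tp / (tp + fn), tn / (tn + fp)


# Illustrative data only.
example_scores = [1, 2, 4, 5, 6, 3, 7, 2]
example_severe = [False, False, True, False, True, True, True, False]
sens, spec = sensitivity_specificity(example_scores, example_severe, cut_point=3.5)
print(f"sensitivity={sens:.2f}, specificity={spec:.2f}")
```

    Moving the cut-point trades one measure against the other, which is why a single threshold is usually reported together with both values.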

  12. A content validity study of signs, symptoms and diseases/health problems expressed in LIBRAS.

    PubMed

    Aragão, Jamilly da Silva; de França, Inacia Sátiro Xavier; Coura, Alexsandro Silva; de Sousa, Francisco Stélio; Batista, Joana D'arc Lyra; Magalhães, Isabella Medeiros de Oliveira

    2015-01-01

    To validate the content of signs, symptoms and diseases/health problems expressed in LIBRAS for people with deafness. Method: Methodological development study, which involved 36 people with deafness and three LIBRAS specialists. The study was conducted in three stages: investigation of the signs, symptoms and diseases/health problems, referred to by people with deafness, reported in a questionnaire; video recordings of how people with deafness express, through LIBRAS, the signs, symptoms and diseases/health problems; and validation of the contents of the recordings of the expressions by LIBRAS specialists. Data were processed in a spreadsheet and analyzed using univariate tables, with absolute frequencies and percentages. The validation results were analyzed using the Content Validity Index (CVI). 33 expressions in LIBRAS, of signs, symptoms and diseases/health problems were evaluated, and 28 expressions obtained a satisfactory CVI (1.00). The signs, symptoms and diseases/health problems expressed in LIBRAS presented validity, in the study region, for health professionals, especially nurses, for use in the clinical anamnesis of the nursing consultation for people with deafness.
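
    A Content Validity Index of the kind mentioned above is often computed per item as the share of experts who rate the item as relevant. The sketch below assumes a 4-point relevance scale on which ratings of 3 or 4 count as relevant; that convention, the function names, and the ratings are assumptions for illustration, since the abstract does not state the rating scale used.

```python
def item_cvi(ratings, relevant_min=3):
    """Item-level CVI: proportion of experts rating the item at or above relevant_min."""
    return sum(r >= relevant_min for r in ratings) / len(ratings)


def scale_cvi_average(items, relevant_min=3):
    """Scale-level CVI taken as the mean of the item-level CVIs."""
    return sum(item_cvi(r, relevant_min) for r in items) / len(items)


# Hypothetical ratings from three experts for two expressions.
ratings_by_item = [
    [4, 4, 3],  # all three experts rate the item as relevant
    [4, 2, 3],  # one expert rates the item below the threshold
]
for ratings in ratings_by_item:
    print(round(item_cvi(ratings), 2))
print(round(scale_cvi_average(ratings_by_item), 2))
```

    With only three experts, an item reaches a CVI of 1.00 only when every expert rates it as relevant.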

  13. Ride qualities criteria validation/pilot performance study: Flight test results

    NASA Technical Reports Server (NTRS)

    Nardi, L. U.; Kawana, H. Y.; Greek, D. C.

    1979-01-01

    Pilot performance during a terrain following flight was studied for ride quality criteria validation. Data from manual and automatic terrain following operations conducted during low level penetrations were analyzed to determine the effect of ride qualities on crew performance. The conditions analyzed included varying levels of turbulence, terrain roughness, and mission duration with a ride smoothing system on and off. Limited validation of the B-1 ride quality criteria and some of the first order interactions between ride qualities and pilot/vehicle performance are highlighted. An earlier B-1 flight simulation program correlated well with the flight test results.

  14. Iowa Hydrologic and Environmental Validation Site: A Proposal to the Community

    NASA Astrophysics Data System (ADS)

    Bradley, A. A.; Ciach, G. J.; Eichinger, W. N.; Hornbuckle, K. C.; Illman, W.; Krajewski, W. F.; Kruger, A.; Patel, V. C.; Weirich, F. H.; Zhang, Y.

    2002-05-01

    We present a proposal to the hydrologic research community to establish a validation site in eastern Iowa. Many hydrological and meteorological variables observed using remote sensing techniques or predicted using numerical simulation models require validation. Validation, understood as quantification of the uncertainty, is difficult and often even impossible using operationally available in-situ observations. Specialized high-density networks of sensors with well-established error characteristics are required to serve as reference. We propose to establish a well-instrumented site for validation of several hydrometeorological and environmental variables near Iowa City, Iowa. We foresee this site as a national resource of detailed information collected in partnership with federal, state, and local agencies but independent of their routine mission oriented operations. The data would be distributed in real-time via the Internet to the research community nationwide to support model validation and development studies. In the presentation we justify the need for such sites, we make the case for setting a prototype site in Iowa, and we present preliminary considerations for the site's design and the data distribution system.

  15. Collocation mismatch uncertainties in satellite aerosol retrieval validation

    NASA Astrophysics Data System (ADS)

    Virtanen, Timo H.; Kolmonen, Pekka; Sogacheva, Larisa; Rodríguez, Edith; Saponaro, Giulia; de Leeuw, Gerrit

    2018-02-01

    Satellite-based aerosol products are routinely validated against ground-based reference data, usually obtained from sun photometer networks such as AERONET (AEROsol RObotic NETwork). In a typical validation exercise a spatial sample of the instantaneous satellite data is compared against a temporal sample of the point-like ground-based data. The observations do not correspond to exactly the same column of the atmosphere at the same time, and the representativeness of the reference data depends on the spatiotemporal variability of the aerosol properties in the samples. The associated uncertainty is known as the collocation mismatch uncertainty (CMU). The validation results depend on the sampling parameters. While small samples involve less variability, they are more sensitive to the inevitable noise in the measurement data. In this paper we study systematically the effect of the sampling parameters in the validation of AATSR (Advanced Along-Track Scanning Radiometer) aerosol optical depth (AOD) product against AERONET data and the associated collocation mismatch uncertainty. To this end, we study the spatial AOD variability in the satellite data, compare it against the corresponding values obtained from densely located AERONET sites, and assess the possible reasons for observed differences. We find that the spatial AOD variability in the satellite data is approximately 2 times larger than in the ground-based data, and the spatial variability correlates only weakly with that of AERONET for short distances. We interpreted that only half of the variability in the satellite data is due to the natural variability in the AOD, and the rest is noise due to retrieval errors. However, for larger distances (~0.5°) the correlation is improved as the noise is averaged out, and the day-to-day changes in regional AOD variability are well captured. Furthermore, we assess the usefulness of the spatial variability of the satellite AOD data as an estimate of CMU by comparing the

  16. Validation of general job satisfaction in the Korean Labor and Income Panel Study.

    PubMed

    Park, Shin Goo; Hwang, Sang Hee

    2017-01-01

    The purpose of this study is to assess the validity and reliability of general job satisfaction (JS) in the Korean Labor and Income Panel Study (KLIPS). We used the data from the 17th wave (2014) of the nationwide KLIPS, which selected a representative panel sample of Korean households and individuals aged 15 or older residing in urban areas. We included in this study 7679 employed subjects (4529 males and 3150 females). The general JS instrument consisted of five items rated on a scale from 1 (strongly disagree) to 5 (strongly agree). The general JS reliability was assessed using the corrected item-total correlation and Cronbach's alpha coefficient. The validity of general JS was assessed using confirmatory factor analysis (CFA) and Pearson's correlation. The corrected item-total correlations ranged from 0.736 to 0.837. Therefore, no items were removed. Cronbach's alpha for general JS was 0.925, indicating excellent internal consistency. The CFA of the general JS model showed a good fit. Pearson's correlation coefficients for convergent validity showed moderate or strong correlations. The results obtained in our study confirm the validity and reliability of general JS.
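
    The two reliability statistics named above can be sketched as follows. This is a minimal illustration using the textbook formulas for Cronbach's alpha and the corrected item-total correlation (the correlation of each item with the sum of the remaining items); the function names and response data are hypothetical and are not drawn from KLIPS.

```python
from statistics import mean, variance


def cronbach_alpha(rows):
    """Cronbach's alpha for a list of respondents, each a list of item scores."""
    k = len(rows[0])
    item_vars = [variance([row[j] for row in rows]) for j in range(k)]
    total_var = variance([sum(row) for row in rows])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sum((a - mx) ** 2 for a in x) ** 0.5
    sy = sum((b - my) ** 2 for b in y) ** 0.5
    return cov / (sx * sy)


def corrected_item_total(rows, j):
    """Correlation of item j with the sum of all other items."""
    item = [row[j] for row in rows]
    rest = [sum(row) - row[j] for row in rows]
    return pearson(item, rest)


# Hypothetical responses: four respondents, five items on a 1-5 scale.
responses = [
    [5, 4, 5, 4, 5],
    [3, 3, 4, 3, 3],
    [2, 2, 1, 2, 2],
    [4, 4, 4, 5, 4],
]
print(round(cronbach_alpha(responses), 3))
print([round(corrected_item_total(responses, j), 3) for j in range(5)])
```

    Items with a low corrected item-total correlation are the usual candidates for removal, which is the decision the abstract refers to when it notes that no items were removed.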

  17. Clinical Validation of Heart Rate Apps: Mixed-Methods Evaluation Study.

    PubMed

    Vandenberk, Thijs; Stans, Jelle; Mortelmans, Christophe; Van Haelst, Ruth; Van Schelvergem, Gertjan; Pelckmans, Caroline; Smeets, Christophe Jp; Lanssens, Dorien; De Cannière, Hélène; Storms, Valerie; Thijs, Inge M; Vaes, Bert; Vandervoort, Pieter M

    2017-08-25

    Photoplethysmography (PPG) is a proven way to measure heart rate (HR). This technology is already available in smartphones, which allows measuring HR only by using the smartphone. Given the widespread availability of smartphones, this creates a scalable way to enable mobile HR monitoring. An essential precondition is that these technologies are as reliable and accurate as the current clinical (gold) standards. At this moment, there is no consensus on a gold standard method for the validation of HR apps. This results in different validation processes that do not always reflect the veracious outcome of comparison. The aim of this paper was to investigate and describe the necessary elements in validating and comparing HR apps versus standard technology. The FibriCheck (Qompium) app was used in two separate prospective nonrandomized studies. In the first study, the HR of the FibriCheck app was consecutively compared with 2 different Food and Drug Administration (FDA)-cleared HR devices: the Nonin oximeter and the AliveCor Mobile ECG. In the second study, a next step in validation was performed by comparing the beat-to-beat intervals of the FibriCheck app to a synchronized ECG recording. In the first study, the HR (BPM, beats per minute) of 88 random subjects consecutively measured with the 3 devices showed a correlation coefficient of .834 between FibriCheck and Nonin, .88 between FibriCheck and AliveCor, and .897 between Nonin and AliveCor. A one-way analysis of variance (ANOVA; P=.61) was executed to test the hypothesis that there were no significant differences between the HRs as measured by the 3 devices. In the second study, 20,298 (ms) R-R intervals (RRI)-peak-to-peak intervals (PPI) from 229 subjects were analyzed. This resulted in a positive correlation (rs=.993, root mean square deviation [RMSE]=23.04 ms, and normalized root mean square error [NRMSE]=0.012) between the PPI from FibriCheck and the RRI from the wearable ECG. 
There was no significant difference

  18. An exploratory study into the effect of time-restricted internet access on face-validity, construct validity and reliability of postgraduate knowledge progress testing

    PubMed Central

    2013-01-01

    Background Yearly formative knowledge testing (also known as progress testing) was shown to have a limited construct-validity and reliability in postgraduate medical education. One way to improve construct-validity and reliability is to improve the authenticity of a test. As easily accessible internet has become inseparably linked to daily clinical practice, we hypothesized that allowing internet access for a limited amount of time during the progress test would improve the perception of authenticity (face-validity) of the test, which would in turn improve the construct-validity and reliability of postgraduate progress testing. Methods Postgraduate trainees taking the yearly knowledge progress test were asked to participate in a study where they could access the internet for 30 minutes at the end of a traditional pen and paper test. Before and after the test they were asked to complete a short questionnaire regarding the face-validity of the test. Results Mean test scores increased significantly for all training years. Trainees indicated that the face-validity of the test improved with internet access and that they would like to continue to have internet access during future testing. Internet access did not improve the construct-validity or reliability of the test. Conclusion Improving the face-validity of postgraduate progress testing, by adding the possibility to search the internet for a limited amount of time, positively influences test performance and face-validity. However, it did not change the reliability or the construct-validity of the test. PMID:24195696

  19. Sample size determination for disease prevalence studies with partially validated data.

    PubMed

    Qiu, Shi-Fang; Poon, Wai-Yin; Tang, Man-Lai

    2016-02-01

    Disease prevalence is an important topic in medical research, and its study is based on data that are obtained by classifying subjects according to whether a disease has been contracted. Classification can be conducted with high-cost gold standard tests or low-cost screening tests, but the latter are subject to the misclassification of subjects. As a compromise between the two, many research studies use partially validated datasets in which all data points are classified by fallible tests, and some of the data points are validated in the sense that they are also classified by the completely accurate gold-standard test. In this article, we investigate the determination of sample sizes for disease prevalence studies with partially validated data. We use two approaches. The first is to find sample sizes that can achieve a pre-specified power of a statistical test at a chosen significance level, and the second is to find sample sizes that can control the width of a confidence interval with a pre-specified confidence level. Empirical studies have been conducted to demonstrate the performance of various testing procedures with the proposed sample sizes. The applicability of the proposed methods is illustrated by a real-data example. © The Author(s) 2012.

  20. [Validation of the German version of the Oxford Elbow Score : A cross-sectional study].

    PubMed

    Marquardt, J; Schöttker-Königer, T; Schäfer, A

    2016-08-01

    Elbow complaints are complex problems leading to severe consequences for affected people and the healthcare system. The German version of the Oxford Elbow Score (OES) is the first German-language instrument that specifically measures elbow complaints from the patient's perspective and changes of their health status. The aim of this study is the validation of the German version of the OES. In this context the internal consistency and the construct validity were investigated. 59 patients with elbow complaints completed the German version of the OES, the DASH and the SF-36 in a cross-sectional study. The internal consistency was calculated with Cronbach's alpha coefficients. Spearman's correlation coefficients were used to confirm construct validity. Cronbach's alpha for pain, function and psychological subscales was 0.88, 0.81 and 0.90, respectively. The whole questionnaire presents a Cronbach's alpha value of 0.93. Convergent construct validity was confirmed with correlation coefficients of -0.84, -0.77 and -0.82 compared to DASH and values ranging from 0.41 to 0.80 compared with the physical domains of the SF-36. The divergent construct validity presented values ranging from 0.07 to 0.20 with the SF-36 domains of "general health perception" and "mental health". The German OES is an internally consistent instrument with good convergent and divergent construct validity. Other aspects of the validity, the reliability and the responsiveness should be confirmed through further studies.

  1. Assessing South China (Guangzhou) High School Students' Views on Nature of Science: A Validation Study

    NASA Astrophysics Data System (ADS)

    Deng, Feng; Chai, Ching Sing; Tsai, Chin-Chung; Lin, Tzung-Jin

    2014-04-01

    Research on students' views on nature of science (VNOS) in Asian countries such as China is notably lacking. This study aimed to develop and validate an instrument to measure South China high school students' VNOS. Based on the previously acquired qualitative data, the instrument included seven VNOS dimensions which reflect the crucial aspects of NOS indicated by the literature and/or the dominating ideology in China (i.e., Marxism). A sample (N = 604) was randomly divided into two groups used for exploratory analyses and confirmatory analyses. The results indicated that the instrument expressed satisfactory reliability and validity and the seven NOS dimensions could be explained by a higher-order dimension. That is, the data of this study supported the multi-dimensional framework that treats VNOS as comprising several more-or-less correlated dimensions. Two distinct dimensions, namely "Accumulative-Empirical Source" and "Pragmatic Justification" which have not been explicitly specified in the past literature, were found. In addition, the Chinese high school students generally held a constructivist/relativist-oriented view of all seven dimensions. Differences in gender and grade level were hardly observed in any dimension of the instrument. The findings are further discussed through a socio-cultural lens to enrich the current understanding of VNOS.

  2. Bem Sex Role Inventory Validation in the International Mobility in Aging Study.

    PubMed

    Ahmed, Tamer; Vafaei, Afshin; Belanger, Emmanuelle; Phillips, Susan P; Zunzunegui, Maria-Victoria

    2016-09-01

    This study investigated the measurement structure of the Bem Sex Role Inventory (BSRI) with different factor analysis methods. Most previous studies on validity applied exploratory factor analysis (EFA) to examine the BSRI. We aimed to assess the psychometric properties and construct validity of the 12-item short-form BSRI in a sample administered to 1,995 older adults from wave 1 of the International Mobility in Aging Study (IMIAS). We used Cronbach's alpha to assess internal consistency reliability and confirmatory factor analysis (CFA) to assess psychometric properties. EFA revealed a three-factor model, further confirmed by CFA and compared with the original two-factor structure model. Results revealed that a two-factor solution (instrumentality-expressiveness) has satisfactory construct validity and superior fit to data compared to the three-factor solution. The two-factor solution confirms expected gender differences in older adults. The 12-item BSRI provides a brief, psychometrically sound, and reliable instrument in international samples of older adults.

  3. Pressure ulcer prevention algorithm content validation: a mixed-methods, quantitative study.

    PubMed

    van Rijswijk, Lia; Beitz, Janice M

    2015-04-01

    Translating pressure ulcer prevention (PUP) evidence-based recommendations into practice remains challenging for a variety of reasons, including the perceived quality, validity, and usability of the research or the guideline itself. Following the development and face validation testing of an evidence-based PUP algorithm, additional stakeholder input and testing were needed. Using convenience sampling methods, wound care experts attending a national wound care conference and a regional wound ostomy continence nursing (WOCN) conference and/or graduates of a WOCN program were invited to participate in an Internal Review Board-approved, mixed-methods quantitative survey with qualitative components to examine algorithm content validity. After participants provided written informed consent, demographic variables were collected and participants were asked to comment on and rate the relevance and appropriateness of each of the 26 algorithm decision points/steps using standard content validation study procedures. All responses were anonymous. Descriptive summary statistics, mean relevance/appropriateness scores, and the content validity index (CVI) were calculated. Qualitative comments were transcribed and thematically analyzed. Of the 553 wound care experts invited, 79 (average age 52.9 years, SD 10.1; range 23-73) consented to participate and completed the study (a response rate of 14%). Most (67, 85%) were female, registered (49, 62%) or advanced practice (12, 15%) nurses, and had > 10 years of health care experience (88, 92%). Other health disciplines included medical doctors, physical therapists, nurse practitioners, and certified nurse specialists. Almost all had received formal wound care education (75, 95%). On a Likert-type scale of 1 (not relevant/appropriate) to 4 (very relevant and appropriate), the average score for the entire algorithm/all decision points (N = 1,912) was 3.72 with an overall CVI of 0.94 (out of 1). 
The only decision point/step recommendation

  4. An integrated bioanalytical method development and validation approach: case studies.

    PubMed

    Xue, Y-J; Melo, Brian; Vallejo, Martha; Zhao, Yuwen; Tang, Lina; Chen, Yuan-Shek; Keller, Karin M

    2012-10-01

    We proposed an integrated bioanalytical method development and validation approach: (1) method screening based on analyte's physicochemical properties and metabolism information to determine the most appropriate extraction/analysis conditions; (2) preliminary stability evaluation using both quality control and incurred samples to establish sample collection, storage and processing conditions; (3) mock validation to examine method accuracy and precision and incurred sample reproducibility; and (4) method validation to confirm the results obtained during method development. This integrated approach was applied to the determination of compound I in rat plasma and compound II in rat and dog plasma. The effectiveness of the approach was demonstrated by the superior quality of three method validations: (1) a zero run failure rate; (2) >93% of quality control results within 10% of nominal values; and (3) 99% incurred sample within 9.2% of the original values. In addition, rat and dog plasma methods for compound II were successfully applied to analyze more than 900 plasma samples obtained from Investigational New Drug (IND) toxicology studies in rats and dogs with near perfect results: (1) a zero run failure rate; (2) excellent accuracy and precision for standards and quality controls; and (3) 98% incurred samples within 15% of the original values. Copyright © 2011 John Wiley & Sons, Ltd.

  5. Do placebo based validation standards mimic real batch products behaviour? Case studies.

    PubMed

    Bouabidi, A; Talbi, M; Bouklouze, A; El Karbane, M; Bourichi, H; El Guezzar, M; Ziemons, E; Hubert, Ph; Rozet, E

    2011-06-01

    Analytical methods validation is a mandatory step to evaluate the ability of developed methods to provide accurate results for their routine application. Validation usually involves validation standards or quality control samples that are prepared in placebo or reconstituted matrix made of a mixture of all the ingredients composing the drug product except the active substance or the analyte under investigation. However, one of the main concerns that can be raised with this approach is that it may lack an important source of variability that comes from the manufacturing process. The question that remains at the end of the validation step is about the transferability of the quantitative performance from validation standards to real authentic drug product samples. In this work, this topic is investigated through three case studies. Three analytical methods were validated using the commonly spiked placebo validation standards at several concentration levels as well as using samples coming from authentic batch samples (tablets and syrups). The results showed that, depending on the type of response function used as calibration curve, there were various degrees of differences in the accuracy of the results obtained with the two types of samples. Nonetheless the use of spiked placebo validation standards was shown to mimic relatively well the quantitative behaviour of the analytical methods with authentic batch samples. Adding these authentic batch samples into the validation design may help the analyst to select and confirm the most fit for purpose calibration curve and thus increase the accuracy and reliability of the results generated by the method in routine application. Copyright © 2011 Elsevier B.V. All rights reserved.

  6. External validation and comparison of three prediction tools for risk of osteoporotic fractures using data from population based electronic health records: retrospective cohort study

    PubMed Central

    Cohen-Stavi, Chandra; Leventer-Roberts, Maya; Balicer, Ran D

    2017-01-01

    Objective To directly compare the performance and externally validate the three most studied prediction tools for osteoporotic fractures—QFracture, FRAX, and Garvan—using data from electronic health records. Design Retrospective cohort study. Setting Payer provider healthcare organisation in Israel. Participants 1 054 815 members aged 50 to 90 years for comparison between tools and cohorts of different age ranges, corresponding to those in each tools’ development study, for tool specific external validation. Main outcome measure First diagnosis of a major osteoporotic fracture (for QFracture and FRAX tools) and hip fractures (for all three tools) recorded in electronic health records from 2010 to 2014. Observed fracture rates were compared to probabilities predicted retrospectively as of 2010. Results The observed five year hip fracture rate was 2.7% and the rate for major osteoporotic fractures was 7.7%. The areas under the receiver operating curve (AUC) for hip fracture prediction were 82.7% for QFracture, 81.5% for FRAX, and 77.8% for Garvan. For major osteoporotic fractures, AUCs were 71.2% for QFracture and 71.4% for FRAX. All the tools underestimated the fracture risk, but the average observed to predicted ratios and the calibration slopes of FRAX were closest to 1. Tool specific validation analyses yielded hip fracture prediction AUCs of 88.0% for QFracture (among those aged 30-100 years), 81.5% for FRAX (50-90 years), and 71.2% for Garvan (60-95 years). Conclusions Both QFracture and FRAX had high discriminatory power for hip fracture prediction, with QFracture performing slightly better. This performance gap was more pronounced in previous studies, likely because of broader age inclusion criteria for QFracture validations. The simpler FRAX performed almost as well as QFracture for hip fracture prediction, and may have advantages if some of the input data required for QFracture are not available. However, both tools require calibration

  7. Development and testing of the cancer multidisciplinary team meeting observational tool (MDT-MOT)

    PubMed Central

    Harris, Jenny; Taylor, Cath; Sevdalis, Nick; Jalil, Rozh; Green, James S.A.

    2016-01-01

    Abstract Objective To develop a tool for independent observational assessment of cancer multidisciplinary team meetings (MDMs), and test criterion validity, inter-rater reliability/agreement and describe performance. Design Clinicians and experts in teamwork used a mixed-methods approach to develop and refine the tool. Study 1 observers rated pre-determined optimal/sub-optimal MDM film excerpts and Study 2 observers independently rated video-recordings of 10 MDMs. Setting Study 2 included 10 cancer MDMs in England. Participants Testing was undertaken by 13 health service staff and a clinical and non-clinical observer. Intervention None. Main Outcome Measures Tool development, validity, reliability/agreement and variability in MDT performance. Results Study 1: Observers were able to discriminate between optimal and sub-optimal MDM performance (P ≤ 0.05). Study 2: Inter-rater reliability was good for 3/10 domains. Percentage of absolute agreement was high (≥80%) for 4/10 domains and percentage agreement within 1 point was high for 9/10 domains. Four MDTs performed well (scored 3+ in at least 8/10 domains), 5 MDTs performed well in 6–7 domains and 1 MDT performed well in only 4 domains. Leadership and chairing of the meeting, the organization and administration of the meeting, and clinical decision-making processes all varied significantly between MDMs (P ≤ 0.01). Conclusions MDT-MOT demonstrated good criterion validity. Agreement between clinical and non-clinical observers (within one point on the scale) was high but this was inconsistent with reliability coefficients and warrants further investigation. If further validated MDT-MOT might provide a useful mechanism for the routine assessment of MDMs by the local workforce to drive improvements in MDT performance. PMID:27084499
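
    The two agreement measures reported above (absolute agreement and agreement within one point) can be sketched as below. The function name, the tolerance parameter, and the ratings are hypothetical; the abstract does not describe how the percentages were computed beyond naming them.

```python
def percent_agreement(rater_a, rater_b, tolerance=0):
    """Percentage of paired ratings that differ by at most `tolerance` points."""
    if len(rater_a) != len(rater_b):
        raise ValueError("both raters must score the same observations")
    matches = sum(abs(a - b) <= tolerance for a, b in zip(rater_a, rater_b))
    return 100.0 * matches / len(rater_a)


# Hypothetical scores from two observers for one domain across five meetings.
clinical = [3, 4, 2, 5, 3]
non_clinical = [3, 3, 2, 3, 4]
print(percent_agreement(clinical, non_clinical))               # exact agreement
print(percent_agreement(clinical, non_clinical, tolerance=1))  # within one point
```

    Percentage agreement does not correct for agreement expected by chance, which is one reason it can look high while a reliability coefficient for the same ratings stays modest.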

  8. Validation of psychoanalytic theories: towards a conceptualization of references.

    PubMed

    Zachrisson, Anders; Zachrisson, Henrik Daae

    2005-10-01

    The authors discuss criteria for the validation of psychoanalytic theories and develop a heuristic and normative model of the references needed for this. Their core question in this paper is: can psychoanalytic theories be validated exclusively from within psychoanalytic theory (internal validation), or are references to sources of knowledge other than psychoanalysis also necessary (external validation)? They discuss aspects of the classic truth criteria correspondence and coherence, both from the point of view of contemporary psychoanalysis and of contemporary philosophy of science. The authors present arguments for both external and internal validation. Internal validation has to deal with the problems of subjectivity of observations and circularity of reasoning, external validation with the problem of relevance. They recommend a critical attitude towards psychoanalytic theories, which, by carefully scrutinizing weak points and invalidating observations in the theories, reduces the risk of wishful thinking. The authors conclude by sketching a heuristic model of validation. This model combines correspondence and coherence with internal and external validation into a four-leaf model for references for the process of validating psychoanalytic theories.

  9. Assessment study of insight ARTHRO VR (®) arthroscopy virtual training simulator: face, content, and construct validities.

    PubMed

    Bayona, Sofía; Fernández-Arroyo, José Manuel; Martín, Isaac; Bayona, Pilar

    2008-09-01

    The aims of this study were to test the face, content, and construct validities of a virtual-reality haptic arthroscopy simulator and to validate four assessment hypotheses. The participants in our study were 94 arthroscopists attending an international conference on arthroscopy. The interviewed surgeons had been performing arthroscopies for a mean of 8.71 years (σ = 6.94 years). We explained the operation, functionality, instructions for use, and the exercises provided by the simulator. They performed a trial exercise and then an exercise in which performance was recorded. After using it, the arthroscopists answered a questionnaire. The simulator was classified as one of the best training methods (over phantoms), and obtained a mark of 7.10 out of 10 as an evaluation tool. The simulator was considered more useful for inexperienced surgeons than for surgeons with experience (mean difference 1.88 out of 10, P value < 0.001). The participants valued the simulator at 8.24 as a tool for learning skills, its fidelity at 7.41, the quality of the platform at 7.54, and the content of the exercises at 7.09. It obtained a global score of 7.82. Of the subjects, 30.8% said they would practise with the simulator more than 6 h per week. Of the surgeons, 89.4% affirmed that they would recommend the simulator to their colleagues. The data gathered support the first three hypotheses, as well as face and content validities. Results show statistically significant differences between experts and novices, thus supporting the construct validity, but studies with a larger sample must be carried out to verify this. We propose concrete solutions and an equation to calculate economy of movement. Analogously, we analyze competence measurements and propose an equation to provide a single measurement that contains them all and that, according to the surgeons' criteria, is as reliable as the judgment of experts observing the performance of an apprentice.

  10. Validation of recent geopotential models in Tierra Del Fuego

    NASA Astrophysics Data System (ADS)

    Gomez, Maria Eugenia; Perdomo, Raul; Del Cogliano, Daniel

    2017-10-01

    This work presents a validation study of global geopotential models (GGM) in the region of Fagnano Lake, located in the southern Andes. This is an excellent area for this type of validation because it is surrounded by the Andes Mountains, and there is no terrestrial gravity or GNSS/levelling data. However, there are mean lake level (MLL) observations, and its surface is assumed to be almost equipotential. Furthermore, in this article, we propose improved geoid solutions through the Residual Terrain Modelling (RTM) approach. Using a global geopotential model, the results achieved allow us to conclude that it is possible to use this technique to extend an existing geoid model to those regions that lack any information (neither gravimetric nor GNSS/levelling observations). As GGMs have evolved, our results have improved progressively. While the validation of EGM2008 with MLL data shows a standard deviation of 35 cm, GOCO05C shows a deviation of 13 cm, similar to the results obtained on land.
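    The validation statistic quoted above can be illustrated with a minimal sketch. It assumes the comparison is made between ellipsoidal heights of the lake surface and geoid heights from the model at the same points; because the lake surface is taken as nearly equipotential, the differences should be close to constant, and their spread measures the misfit. The function name, the use of the sample standard deviation, and the numbers are all hypothetical and are not taken from the paper.

```python
import statistics


def validation_std(lake_ellipsoidal_heights, model_geoid_heights):
    """Sample standard deviation of (lake height - model geoid height).

    Assumption: both sequences are in metres and refer to the same points.
    On an equipotential lake surface the differences would be constant,
    so a smaller spread indicates a better-fitting geopotential model.
    """
    residuals = [h - n for h, n in zip(lake_ellipsoidal_heights, model_geoid_heights)]
    return statistics.stdev(residuals)


# Hypothetical values, for illustration only.
lake_heights = [10.0, 10.1, 9.9]
geoid_heights = [10.0, 10.0, 10.0]
spread = validation_std(lake_heights, geoid_heights)
print(f"standard deviation of differences: {spread:.3f} m")
```

    The mean of the differences is removed implicitly by the standard deviation, so a constant offset between the lake level and the geoid does not affect the result.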

  11. NCAR Earth Observing Laboratory - An End-to-End Observational Science Enterprise

    NASA Astrophysics Data System (ADS)

    Rockwell, A.; Baeuerle, B.; Grubišić, V.; Hock, T. F.; Lee, W. C.; Ranson, J.; Stith, J. L.; Stossmeister, G.

    2017-12-01

    Researchers who want to understand and describe the Earth System require high-quality observations of the atmosphere, ocean, and biosphere. Making these observations not only requires capable research platforms and state-of-the-art instrumentation but also benefits from comprehensive in-field project management and data services. NCAR's Earth Observing Laboratory (EOL) is an end-to-end observational science enterprise that provides leadership in observational research to scientists from universities, U.S. government agencies, and NCAR. Deployment: EOL manages the majority of the NSF Lower Atmosphere Observing Facilities, which includes research aircraft, radars, lidars, profilers, and surface and sounding systems. This suite is designed to address a wide range of Earth system science - from microscale to climate process studies and from the planet's surface into the Upper Troposphere/Lower Stratosphere. EOL offers scientific, technical, operational, and logistics support to small and large field campaigns across the globe. Development: By working closely with the scientific community, EOL's engineering and scientific staff actively develop the next generation of observing facilities, staying abreast of emerging trends, technologies, and applications in order to improve our measurement capabilities. Through our Design and Fabrication Services, we also offer high-level engineering and technical expertise, mechanical design, and fabrication to the atmospheric research community. Data Services: EOL's platforms and instruments collect unique datasets that must be validated, archived, and made available to the research community. EOL's Data Management and Services deliver high-quality datasets and metadata in ways that are transparent, secure, and easily accessible. We are committed to the highest standard of data stewardship from collection to validation to archival. Discovery: EOL promotes curiosity about Earth science, and fosters advanced understanding of the

  12. Observer perceptions of pain in children with cognitive impairments: vignette development and validation.

    PubMed

    Genik, Lara M; McMurtry, C Meghan; Breau, Lynn M

    2015-01-01

    Develop vignettes depicting different pain types in verbal and nonverbal children with cognitive impairments that could help examine pain assessment and management decisions of secondary caregivers, and conduct initial convergent and divergent validity analyses. For six vignettes, 76 undergraduate students (38 females, mean age = 19.55) rated (0-10): pain intensity, difficulty rating pain intensity, need for medical attention and need for other attention (e.g., physical comfort). Ratings significantly varied by pain source (e.g., headache was rated more painful than injections). Verbal ability did not impact ratings. Vignettes could serve as an alternative method to study pain decisions by caregivers of children with cognitive impairments when ethical barriers limit more naturalistic research.

  13. The Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) Statement: guidelines for reporting observational studies.

    PubMed

    von Elm, Erik; Altman, Douglas G; Egger, Matthias; Pocock, Stuart J; Gøtzsche, Peter C; Vandenbroucke, Jan P

    2014-12-01

    Much biomedical research is observational. The reporting of such research is often inadequate, which hampers the assessment of its strengths and weaknesses and of a study's generalisability. The Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) Initiative developed recommendations on what should be included in an accurate and complete report of an observational study. We defined the scope of the recommendations to cover three main study designs: cohort, case-control, and cross-sectional studies. We convened a 2-day workshop in September 2004, with methodologists, researchers, and journal editors to draft a checklist of items. This list was subsequently revised during several meetings of the coordinating group and in e-mail discussions with the larger group of STROBE contributors, taking into account empirical evidence and methodological considerations. The workshop and the subsequent iterative process of consultation and revision resulted in a checklist of 22 items (the STROBE Statement) that relate to the title, abstract, introduction, methods, results, and discussion sections of articles. 18 items are common to all three study designs and four are specific for cohort, case-control, or cross-sectional studies. A detailed Explanation and Elaboration document is published separately and is freely available on the Web sites of PLoS Medicine, Annals of Internal Medicine, and Epidemiology. We hope that the STROBE Statement will contribute to improving the quality of reporting of observational studies. Copyright © 2014 The Authors. Published by Elsevier Ltd. All rights reserved.

  14. Accuracy of Fourth-Graders' Dietary Recalls of School Breakfast and School Lunch Validated with Observations: In-Person versus Telephone Interviews

    PubMed Central

    THOMPSON, WILLIAM O.; LITAKER, MARK S.; GUINN, CAROLINE H.; FRYE, FRANCESCA H. A.; BAGLIO, MICHELLE L.; SHAFFER, NICOLE M.

    2005-01-01

    Objective: To investigate the accuracy of children's dietary recalls of school breakfast and school lunch validated with observations and obtained during in-person versus telephone interviews. Design: Each child was observed eating school breakfast and school lunch and was interviewed that evening about that day's intake. Setting: Ten elementary schools. Participants: A sample of fourth-graders was randomly selected within race (black, white) and gender strata, observed, and interviewed in person (n = 33) or by telephone (n = 36). Main Outcomes Measured: Rates for omissions (items observed but not reported) and intrusions (items reported but not observed) were calculated to determine accuracy for reporting items. A measure of total inaccuracy was calculated to determine inaccuracy for reporting items and amounts combined. Analysis: Analysis of variance; chi-square. Results: Interview type (in person, telephone) did not significantly affect recall accuracy. For omission rate, intrusion rate, and total inaccuracy, means were 34%, 19%, and 4.6 servings for in person recalls and 32%, 16%, and 4.3 servings for telephone recalls of school breakfast and school lunch. Conclusions and Implications: The accuracy of children's recalls of school breakfast and school lunch is not significantly different whether obtained in person or by telephone. Whether interviewed in person or by telephone, children reported only 67% of items observed; furthermore, 17% of items reported were not observed. PMID:12773283
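    The omission and intrusion rates defined above can be illustrated with a minimal sketch. It follows the abstract's wording: omissions are expressed as a share of the items observed, and intrusions as a share of the items reported. Matching items by exact name, the function name, and the sample meal are assumptions made for illustration; the study's actual coding rules and its total-inaccuracy measure (which also weighs amounts) are not reproduced here.

```python
from collections import Counter


def recall_accuracy(observed, reported):
    """Return (omission_rate, intrusion_rate) as percentages.

    omission  = item observed but not reported
    intrusion = item reported but not observed
    match     = item both observed and reported
    Assumption: items are matched by exact name, counting duplicates.
    """
    obs, rep = Counter(observed), Counter(reported)
    matches = sum((obs & rep).values())
    omissions = sum((obs - rep).values())
    intrusions = sum((rep - obs).values())

    n_observed = omissions + matches    # every item the child was seen eating
    n_reported = intrusions + matches   # every item the child reported
    omission_rate = 100.0 * omissions / n_observed if n_observed else 0.0
    intrusion_rate = 100.0 * intrusions / n_reported if n_reported else 0.0
    return omission_rate, intrusion_rate


# Hypothetical meal, for illustration only.
observed_items = ["milk", "cereal", "banana"]
reported_items = ["milk", "cereal", "juice"]
om, intr = recall_accuracy(observed_items, reported_items)
print(f"omission rate: {om:.1f}%, intrusion rate: {intr:.1f}%")
```

    In this hypothetical meal the child omits the banana and intrudes the juice, so one of three observed items is omitted and one of three reported items is an intrusion.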

  15. Validation of Ocean Color Remote Sensing Reflectance Using Autonomous Floats

    NASA Technical Reports Server (NTRS)

    Gerbi, Gregory P.; Boss, Emanuel; Werdell, P. Jeremy; Proctor, Christopher W.; Haentjens, Nils; Lewis, Marlon R.; Brown, Keith; Sorrentino, Diego; Zaneveld, J. Ronald V.; Barnard, Andrew H.

    2016-01-01

    The use of autonomous profiling floats for observational estimates of radiometric quantities in the ocean is explored, and the use of this platform for validation of satellite-based estimates of remote sensing reflectance in the ocean is examined. This effort includes comparing quantities estimated from float and satellite data at nominal wavelengths of 412, 443, 488, and 555 nm, and examining sources and magnitudes of uncertainty in the float estimates. This study had 65 occurrences of coincident high-quality observations from floats and MODIS Aqua and 15 occurrences of coincident high-quality observations from floats and Visible Infrared Imaging Radiometer Suite (VIIRS). The float estimates of remote sensing reflectance are similar to the satellite estimates, with disagreement of a few percent in most wavelengths. The variability of the float-satellite comparisons is similar to the variability of in situ-satellite comparisons using a validation dataset from the Marine Optical Buoy (MOBY). This, combined with the agreement of float-based and satellite-based quantities, suggests that floats are likely a good platform for validation of satellite-based estimates of remote sensing reflectance.

  16. Global Precipitation Measurement (GPM) Ground Validation (GV) Science Implementation Plan

    NASA Technical Reports Server (NTRS)

    Petersen, Walter A.; Hou, Arthur Y.

    2008-01-01

    For pre-launch algorithm development and post-launch product evaluation, Global Precipitation Measurement (GPM) Ground Validation (GV) goes beyond direct comparisons of surface rain rates between ground and satellite measurements to provide the means for improving retrieval algorithms and model applications. Three approaches to GPM GV include direct statistical validation (at the surface), precipitation physics validation (in a vertical column), and integrated science validation (4-dimensional). These three approaches support five themes: core satellite error characterization; constellation satellites validation; development of physical models of snow, cloud water, and mixed phase; development of cloud-resolving model (CRM) and land-surface models to bridge observations and algorithms; and development of coupled CRM-land surface modeling for basin-scale water budget studies and natural hazard prediction. This presentation describes the implementation of these approaches.

  17. Validation study and routine control monitoring of moist heat sterilization procedures.

    PubMed

    Shintani, Hideharu

    2012-06-01

    The proposed approach to validation of steam sterilization in autoclaves follows the basic life cycle concepts applicable to all validation programs: understand the function of the sterilization process, develop and understand the cycles to carry out the process, and define a suitable test or series of tests to confirm that the function of the process is suitably ensured by the structure provided. Sterilization of product and of components and parts that come in direct contact with sterilized product is the most critical of pharmaceutical processes. Consequently, this process requires a most rigorous and detailed approach to validation. An understanding of the process requires a basic understanding of microbial death, the parameters that facilitate that death, the accepted definition of sterility, and the relationship between the definition and sterilization parameters. Autoclaves and support systems need to be designed, installed, and qualified in a manner that ensures their continued reliability. Lastly, the test program must be complete and definitive. In this paper, in addition to the validation study, the documentation of IQ, OQ and PQ is described concretely.

  18. 3-D microphysical model studies of Arctic denitrification: comparison with observations

    NASA Astrophysics Data System (ADS)

    Davies, S.; Mann, G. W.; Carslaw, K. S.; Chipperfield, M. P.; Kettleborough, J. A.; Santee, M. L.; Oelhaf, H.; Wetzel, G.; Sasano, Y.; Sugita, T.

    2005-11-01

    Simulations of Arctic denitrification using a 3-D chemistry-microphysics transport model are compared with observations for the winters 1994/95, 1996/97 and 1999/2000. The model of Denitrification by Lagrangian Particle Sedimentation (DLAPSE) couples the full chemical scheme of the 3-D chemical transport model, SLIMCAT, with a nitric acid trihydrate (NAT) growth and sedimentation scheme. We use observations from the Microwave Limb Sounder (MLS) and Improved Limb Atmospheric Sounder (ILAS) satellite instruments, the balloon-borne Michelsen Interferometer for Passive Atmospheric Sounding (MIPAS-B), and the in situ NOy instrument on-board the ER-2. As well as directly comparing model results with observations, we also assess the extent to which these observations are able to validate the modelling approach taken. For instance, in 1999/2000 the model captures the temporal development of denitrification observed by the ER-2 from late January into March. However, in this winter the vortex was already highly denitrified by late January so the observations do not provide a strong constraint on the modelled rate of denitrification. The model also reproduces the MLS observations of denitrification in early February 2000. In 1996/97 the model captures the timing and magnitude of denitrification as observed by ILAS, although the lack of observations north of ~67° N in the beginning of February make it difficult to constrain the actual timing of onset. The comparison for this winter does not support previous conclusions that denitrification must be caused by an ice-mediated process. In 1994/95 the model notably underestimates the magnitude of denitrification observed during a single balloon flight of the MIPAS-B instrument. Agreement between model and MLS HNO3 at 68 hPa in mid-February 1995 is significantly better. Sensitivity tests show that a 1.5 K overall decrease in vortex temperatures, or a factor 4 increase in assumed NAT nucleation rates, produce the best statistical

  19. 3-D microphysical model studies of Arctic denitrification: comparison with observations

    NASA Astrophysics Data System (ADS)

    Davies, S.; Mann, G. W.; Carslaw, K. S.; Chipperfield, M. P.; Kettleborough, J. A.; Santee, M. L.; Oelhaf, H.; Wetzel, G.; Sasano, Y.; Sugita, T.

    2005-01-01

    Simulations of Arctic denitrification using a 3-D chemistry-microphysics transport model are compared with observations for the winters 1994/1995, 1996/1997 and 1999/2000. The model of Denitrification by Lagrangian Particle Sedimentation (DLAPSE) couples the full chemical scheme of the 3-D chemical transport model, SLIMCAT, with a nitric acid trihydrate (NAT) growth and sedimentation scheme. We use observations from the Microwave Limb Sounder (MLS) and Improved Limb Atmospheric Sounder (ILAS) satellite instruments, the balloon-borne Michelsen Interferometer for Passive Atmospheric Sounding (MIPAS-B), and the in situ NOy instrument on-board the ER-2. As well as directly comparing model results with observations, we also assess the extent to which these observations are able to validate the modelling approach taken. For instance, in 1999/2000 the model captures the temporal development of denitrification observed by the ER-2 from late January into March. However, in this winter the vortex was already highly denitrified by late January so the observations do not provide a strong constraint on the modelled rate of denitrification. The model also reproduces the MLS observations of denitrification in early February 2000. In 1996/1997 the model captures the timing and magnitude of denitrification as observed by ILAS, although the lack of observations north of ~67° N make it difficult to constrain the actual timing of onset. The comparison for this winter does not support previous conclusions that denitrification must be caused by an ice-mediated process. In 1994/1995 the model notably underestimates the magnitude of denitrification observed during a single balloon flight of the MIPAS-B instrument. Agreement between model and MLS HNO3 at 68 hPa in mid-February 1995 was significantly better. Sensitivity tests show that a 1.5 K overall decrease in vortex temperatures or a factor 4 increase in assumed NAT nucleation rates produce the best statistical fit to MLS observations

  20. Turkish Metalinguistic Awareness Scale: A Validity and Reliability Study

    ERIC Educational Resources Information Center

    Varisoglu, Behice

    2018-01-01

    The aim of this study is to develop a useful, valid and reliable measurement tool that will help teacher candidates determine their Turkish metalinguistic awareness. During the development of the scale, a pool of items was created by scanning the relevant literature and examining other awareness scales. The materials prepared were re-examined…