Pope, Kevin L.; Hamel, Martin J.; Pegg, Mark A.; Spurgeon, Jonathan J.
2016-01-01
Age information derived from calcified structures is commonly used to estimate recruitment, growth, and mortality for fish populations. Validation of daily or annual marks on age structures is often assumed, presumably due to a lack of general knowledge concerning the status of age validation studies. Therefore, the current status of freshwater fish age validation studies was summarized to show where additional effort is needed, and increase the accessibility of validation studies to researchers. In total, 1351 original peer-reviewed articles were reviewed from freshwater systems that studied age in fish. Periodicity and age validation studies were found for 88 freshwater species comprising 21 fish families. The number of age validation studies has increased over the last 30 years following previous calls for more research; however, few species have validated structures spanning all life stages. In addition, few fishes of conservation concern have validated ageing structures. A prioritization framework, using a combination of eight characteristics, is offered to direct future age validation studies and close the validation information gap. Additional study, using the offered prioritization framework, and increased availability of published studies that incorporate uncertainty when presenting research results dealing with age information are needed.
PLCO Ovarian Phase III Validation Study — EDRN Public Portal
Our preliminary data indicate that the performance of CA 125 as a screening test for ovarian cancer can be improved upon by additional biomarkers. With completion of one additional validation step, we will be ready to test the performance of a consensus marker panel in a phase III validation study. Given the original aims of the PLCO trial, we believe that the PLCO represents an ideal longitudinal cohort offering specimens for phase III validation of ovarian cancer biomarkers.
Ó Ciardha, Caoilte; Attard-Johnson, Janice; Bindemann, Markus
2018-04-01
Latency-based measures of sexual interest require additional evidence of validity, as do newer pupil dilation approaches. A total of 102 community men completed six latency-based measures of sexual interest. Pupillary responses were recorded during three of these tasks and in an additional task where no participant response was required. For adult stimuli, there was a high degree of intercorrelation between measures, suggesting that tasks may be measuring the same underlying construct (convergent validity). In addition to being correlated with one another, measures also predicted participants' self-reported sexual interest, demonstrating concurrent validity (i.e., the ability of a task to predict a more validated, simultaneously recorded, measure). Latency-based and pupillometric approaches also showed preliminary evidence of concurrent validity in predicting both self-reported interest in child molestation and viewing pornographic material containing children. Taken together, the study findings build on the evidence base for the validity of latency-based and pupillometric measures of sexual interest.
Ruuska, Salla; Hämäläinen, Wilhelmiina; Kajava, Sari; Mughal, Mikaela; Matilainen, Pekka; Mononen, Jaakko
2018-03-01
The aim of the present study was to evaluate empirically confusion matrices in device validation. We compared the confusion matrix method to linear regression and error indices in the validation of a device measuring feeding behaviour of dairy cattle. In addition, we studied how to extract additional information on classification errors with confusion probabilities. The data consisted of 12 h behaviour measurements from five dairy cows; feeding and other behaviour were detected simultaneously with a device and from video recordings. The resulting 216 000 pairs of classifications were used to construct confusion matrices and calculate performance measures. In addition, hourly durations of each behaviour were calculated and the accuracy of measurements was evaluated with linear regression and error indices. All three validation methods agreed when the behaviour was detected very accurately or inaccurately. Otherwise, in the intermediate cases, the confusion matrix method and error indices produced relatively concordant results, but the linear regression method often disagreed with them. Our study supports the use of confusion matrix analysis in validation since it is robust to any data distribution and type of relationship, it makes a stringent evaluation of validity, and it offers extra information on the type and sources of errors. Copyright © 2018 Elsevier B.V. All rights reserved.
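The confusion matrix approach described in this abstract can be sketched in a few lines. This is a minimal illustration, not the authors' analysis code: the function names and the toy labels are hypothetical, and the video recording is assumed to be the reference against which the device is scored. (As an aside, 5 cows × 12 h × 3600 s gives 216 000, which is consistent with one paired classification per second, although the abstract does not state the sampling interval.)

```python
from collections import Counter


def confusion_counts(device, video):
    """Return (TP, FP, FN, TN), treating the video label as the reference."""
    pairs = Counter(zip(device, video))
    tp = pairs[(True, True)]    # device and video both say "feeding"
    fp = pairs[(True, False)]   # device says "feeding", video says "other"
    fn = pairs[(False, True)]   # device says "other", video says "feeding"
    tn = pairs[(False, False)]  # device and video both say "other"
    return tp, fp, fn, tn


def performance(tp, fp, fn, tn):
    """Common performance measures derived from the four cell counts."""
    total = tp + fp + fn + tn
    return {
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
        "precision": tp / (tp + fp),
        "accuracy": (tp + tn) / total,
    }


# Hypothetical toy data: True = feeding, False = other behaviour.
device = [True, True, False, False, True, False, True, False]
video = [True, False, False, False, True, True, True, False]

counts = confusion_counts(device, video)
metrics = performance(*counts)
for name, value in metrics.items():
    print(f"{name}: {value:.2f}")
```

Keeping the four cell counts separate, rather than reporting accuracy alone, is what lets the off-diagonal cells show which kind of error the device makes.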
Eating Disorder Diagnostic Scale: Additional Evidence of Reliability and Validity
ERIC Educational Resources Information Center
Stice, Eric; Fisher, Melissa; Martinez, Erin
2004-01-01
The authors conducted 4 studies investigating the reliability and validity of the Eating Disorder Diagnostic Scale (EDDS; E. Stice, C. F. Telch, & S. L. Rizvi, 2000), a brief self-report measure for diagnosing anorexia nervosa, bulimia nervosa, and binge eating disorder. Study 1 found that the EDDS showed criterion validity with interview-based…
The, Bertram; Reininga, Inge H F; El Moumni, Mostafa; Eygendaal, Denise
2013-10-01
The modern standard of evaluating treatment results includes the use of rating systems. Elbow-specific rating systems are frequently used in studies aiming at elbow-specific pathology. However, proper validation studies seem to be relatively sparse. In addition, these scoring systems might not always be used for appropriate populations of interest. Both of these issues might give rise to invalid conclusions being reported in the literature. Our aim was to investigate the extent to which the available elbow-specific outcome measurement tools have been validated and the quality of the validation itself. We also aimed to provide characteristics of the populations used for validation of these scales to enable clinicians to use them appropriately. A literature search identified 17 studies of 12 different elbow-specific scoring systems. These were assessed for validity, reliability, and responsiveness characteristics. The quality of these assessments was rated according to the Consensus Based Standards for the Selection of Health Measurement Instruments (COSMIN) checklist criteria, a standardized and validated tool developed specifically for this purpose. Currently, the only elbow-specific rating system that is validated using high-quality methodology is the Oxford Elbow Score, a patient-administered outcome measure tool that has been validated on heterogeneous study populations. Other rating systems still have to be proven in the future to be as good as the Oxford Elbow Score for clinical or research purposes. Additional validation studies are needed. Copyright © 2013 Journal of Shoulder and Elbow Surgery Board of Trustees. Published by Mosby, Inc. All rights reserved.
A Validation Study of Student Differentiation between Computing Disciplines
ERIC Educational Resources Information Center
Battig, Michael; Shariq, Muhammad
2011-01-01
Using a previously published study of how students differentiate between computing disciplines, this study attempts to validate the original research and adds hypotheses regarding the type of institution the student attends. Using the identical survey instrument from the original study, students in smaller colleges and in different…
The Reliability and Validity of Big Five Inventory Scores with African American College Students
ERIC Educational Resources Information Center
Worrell, Frank C.; Cross, William E., Jr.
2004-01-01
This article describes a study that examined the reliability and validity of scores on the Big Five Inventory (BFI; O. P. John, E. M. Donahue, & R. L. Kentle, 1991) in a sample of 336 African American college students. Results from the study indicated moderate reliability and structural validity for BFI scores. Additionally, BFI subscales had few…
ERIC Educational Resources Information Center
Salta, Katerina; Koulougliotis, Dionysios
2015-01-01
In educational research, the availability of a validated version of an original instrument in a different language offers the possibility for valid measurements obtained within the specific educational context and in addition it provides the opportunity for valid cross-cultural comparisons. The present study aimed to adapt the Science Motivation…
Validation of Reverse-Engineered and Additive-Manufactured Microsurgical Instrument Prototype.
Singh, Ramandeep; Suri, Ashish; Anand, Sneh; Baby, Britty
2016-12-01
With advancements in imaging techniques, neurosurgical procedures are becoming highly precise and minimally invasive, thus demanding development of new ergonomically aesthetic instruments. Conventionally, neurosurgical instruments are manufactured using subtractive manufacturing methods. Such a process is complex, time-consuming, and impractical for prototype development and validation of new designs. Therefore, an alternative design process has been used, utilizing blue light scanning, computer-aided design, and additive manufacturing by direct metal laser sintering (DMLS) for microsurgical instrument prototype development. Deviations of the DMLS-fabricated instrument were studied by superimposing scan data of the fabricated instrument on the computer-aided design model. Content and concurrent validity of the fabricated prototypes were assessed by a group of 15 neurosurgeons who performed sciatic nerve anastomosis in small laboratory animals. Comparative scoring was obtained for the control and study instruments. A t test was applied to the individual parameters, and P values for force (P < .0001) and surface roughness (P < .01) were found to be statistically significant. These 2 parameters were further analyzed using objective measures. Results indicate that additive manufacturing by DMLS provides an effective method for prototype development. However, direct application of these additive-manufactured instruments in the operating room requires further validation. © The Author(s) 2016.
Investigating Attitudes toward Physical Education: Validation across Two Instruments
ERIC Educational Resources Information Center
Donovan, Corinne Baron; Mercier, Kevin; Phillips, Sharon R.
2015-01-01
The Centers for Disease Control have suggested that physical education plays a role in promoting healthy lifestyles. Prior research suggests a link between attitudes toward physical education and physical activity outside school. The current study provides additional evidence of construct validity through a validation across two instruments…
Validating the Watson Glaser Critical Thinking Appraisal
ERIC Educational Resources Information Center
Hassan, Karma El; Madhum, Ghida
2007-01-01
This study validated the Watson Glaser Critical Thinking Appraisal (WGCTA) on a sample of 273 private university students in Lebanon. For that purpose, evidence for construct validation was investigated through identifying the test's factor structure and subscale total correlations, in addition to differences in scores by gender, different levels,…
ERIC Educational Resources Information Center
Lane, Kathleen Lynne; Oakes, Wendy P.; Ennis, Robin Parks; Cox, Meredith Lucille; Schatschneider, Christopher; Lambert, Warren
2013-01-01
This study reports findings from a validation study of the Student Risk Screening Scale for use with 9th- through 12th-grade students (N = 1854) attending a rural fringe school. Results indicated high internal consistency, test-retest stability, and inter-rater reliability. Predictive validity was established across two academic years, with Spring…
The Fruit & Vegetable Screener in the 2000 California Health Interview Survey: Validation Results
In this study, multiple 24-hour recalls in conjunction with a measurement error model were used to assess validity. The screeners used in the EATS included additional foods and reported portion sizes.
Perfectionism in Intimate Relationships: The Dyadic Almost Perfect Scale
ERIC Educational Resources Information Center
Shea, Andrew J.; Slaney, Robert B.; Rice, Kenneth G.
2006-01-01
The authors describe 2 studies on the development of the Dyadic Almost Perfect Scale (A. J. Shea & R. B. Slaney, 1999). In Study 1 (N = 389), confirmatory analyses validated the factor structure of the measure, and additional results supported the reliability and construct validity of the subscale scores. In Study 2 (N = 280), data again…
Workplace status: The development and validation of a scale.
Djurdjevic, Emilija; Stoverink, Adam C; Klotz, Anthony C; Koopman, Joel; da Motta Veiga, Serge P; Yam, Kai Chi; Chiang, Jack Ting-Ju
2017-07-01
Research suggests that employee status, and various status proxies, relate to a number of meaningful outcomes in the workplace. The advancement of the study of status in organizational settings has, however, been stymied by the lack of a validated workplace status measure. The purpose of this manuscript, therefore, is to develop and validate a measure of workplace status based on a theoretically grounded definition of status in organizations. Subject-matter experts were used to examine the content validity of the measure. Then, 2 separate samples were employed to assess the psychometric properties (i.e., factor structure, reliability, convergent and discriminant validity) and nomological network of a 5-item, self-report Workplace Status Scale (WSS). To allow for methodological flexibility, an additional 3 samples were used to extend the WSS to coworker reports of a focal employee's status, provide additional evidence for the validity and reliability of the WSS, and to demonstrate consensus among coworker ratings. Together, these studies provide evidence of the psychometric soundness of the WSS for assessing employee status using either self-reports or other-source reports. The implications of the development of the WSS for the study of status in organizations are discussed, and suggestions for future research using the new measure are offered. (PsycINFO Database Record (c) 2017 APA, all rights reserved).
Youngjohn, James R; Wershba, Rebecca; Stevenson, Matthew; Sturgeon, John; Thomas, Michael L
2011-04-01
The MMPI-2 Restructured Form (MMPI-2-RF; Ben-Porath & Tellegen, 2008) is replacing the MMPI-2 as the most widely used personality test in neuropsychological assessment, but additional validation studies are needed. Our study examines MMPI-2-RF Validity scales and the newly created Somatic/Cognitive scales in a recently reported sample of 82 traumatic brain injury (TBI) litigants who either passed or failed effort tests (Thomas & Youngjohn, 2009). The restructured Validity scales FBS-r (restructured symptom validity), F-r (restructured infrequent responses), and the newly created Fs (infrequent somatic responses) were not significant predictors of TBI severity. FBS-r was significantly related to passing or failing effort tests, and Fs and F-r showed non-significant trends in the same direction. Elevations on the Somatic/Cognitive scales profile (MLS-malaise, GIC-gastrointestinal complaints, HPC-head pain complaints, NUC-neurological complaints, and COG-cognitive complaints) were significant predictors of effort test failure. Additionally, HPC had the anticipated paradoxical inverse relationship with head injury severity. The Somatic/Cognitive scales as a group were better predictors of effort test failure than the RF Validity scales, which was an unexpected finding. MLS arose as the single best predictor of effort test failure of all RF Validity and Somatic/Cognitive scales. Item overlap analysis revealed that all MLS items are included in the original MMPI-2 Hy scale, making MLS essentially a subscale of Hy. This study validates the MMPI-2-RF as an effective tool for use in neuropsychological assessment of TBI litigants.
Hickey, Graeme L; Blackstone, Eugene H
2016-08-01
Clinical risk-prediction models serve an important role in healthcare. They are used for clinical decision-making and measuring the performance of healthcare providers. To establish confidence in a model, external model validation is imperative. When designing such an external model validation study, thought must be given to patient selection, risk factor and outcome definitions, missing data, and the transparent reporting of the analysis. In addition, there are a number of statistical methods available for external model validation. Execution of a rigorous external validation study rests in proper study design, application of suitable statistical methods, and transparent reporting. Copyright © 2016 The American Association for Thoracic Surgery. Published by Elsevier Inc. All rights reserved.
Development and evaluation of an automated fall risk assessment system.
Lee, Ju Young; Jin, Yinji; Piao, Jinshi; Lee, Sun-Mi
2016-04-01
Fall risk assessment is the first step toward prevention, and a risk assessment tool with high validity should be used. This study aimed to develop and validate an automated fall risk assessment system (Auto-FallRAS) to assess fall risks based on electronic medical records (EMRs) without additional data collected or entered by nurses. This study was conducted in a 1335-bed university hospital in Seoul, South Korea. The Auto-FallRAS was developed using 4211 fall-related clinical data extracted from EMRs. Participants included fall patients and non-fall patients (868 and 3472 for the development study; 752 and 3008 for the validation study; and 58 and 232 for validation after clinical application, respectively). The system was evaluated for predictive validity and concurrent validity. The final 10 predictors were included in the logistic regression model for the risk-scoring algorithm. The results of the Auto-FallRAS were shown as high/moderate/low risk on the EMR screen. The predictive validity analyzed after clinical application of the Auto-FallRAS was as follows: sensitivity = 0.95, NPV = 0.97 and Youden index = 0.44. The validity of the Morse Fall Scale assessed by nurses was as follows: sensitivity = 0.68, NPV = 0.88 and Youden index = 0.28. This study found that the Auto-FallRAS results were better than were the nurses' predictions. The advantage of the Auto-FallRAS is that it automatically analyzes information and shows patients' fall risk assessment results without requiring additional time from nurses. © The Author 2016. Published by Oxford University Press in association with the International Society for Quality in Health Care; all rights reserved.
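The Youden index reported above is defined as sensitivity + specificity − 1, so the specificity that the abstract leaves out can be backed out from the two reported figures. The sketch below does only that arithmetic; the function names are hypothetical, and because the reported values are rounded to two decimals, the implied specificities are approximate.

```python
def youden_index(sensitivity, specificity):
    """Youden's J statistic: sensitivity + specificity - 1."""
    return sensitivity + specificity - 1.0


def implied_specificity(sensitivity, youden):
    """Rearrange J = sensitivity + specificity - 1 to recover specificity."""
    return youden - sensitivity + 1.0


# Values as reported in the abstract (rounded to two decimals there).
auto_spec = implied_specificity(sensitivity=0.95, youden=0.44)
morse_spec = implied_specificity(sensitivity=0.68, youden=0.28)

print(f"Auto-FallRAS implied specificity: {auto_spec:.2f}")
print(f"Morse Fall Scale implied specificity: {morse_spec:.2f}")
```

Read this way, the higher Youden index of the Auto-FallRAS comes from its much higher sensitivity, while its implied specificity is somewhat lower than that of the Morse Fall Scale.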
de Boer, Pieter T; Frederix, Geert W J; Feenstra, Talitha L; Vemer, Pepijn
2016-09-01
Transparent reporting of validation efforts of health economic models gives stakeholders better insight into the credibility of model outcomes. In this study we reviewed recently published studies on seasonal influenza and early breast cancer in order to gain insight into the reporting of model validation efforts in the overall health economic literature. A literature search was performed in Pubmed and Embase to retrieve health economic modelling studies published between 2008 and 2014. Reporting on model validation was evaluated by checking for the word validation, and by using AdViSHE (Assessment of the Validation Status of Health Economic decision models), a tool containing a structured list of relevant items for validation. Additionally, we contacted corresponding authors to ask whether more validation efforts were performed other than those reported in the manuscripts. A total of 53 studies on seasonal influenza and 41 studies on early breast cancer were included in our review. The word validation was used in 16 studies (30 %) on seasonal influenza and 23 studies (56 %) on early breast cancer; however, in a minority of studies, this referred to a model validation technique. Fifty-seven percent of seasonal influenza studies and 71 % of early breast cancer studies reported one or more validation techniques. Cross-validation of study outcomes was found most often. A limited number of studies reported on model validation efforts, although good examples were identified. Author comments indicated that more validation techniques were performed than those reported in the manuscripts. Although validation is deemed important by many researchers, this is not reflected in the reporting habits of health economic modelling studies. Systematic reporting of validation efforts would be desirable to further enhance decision makers' confidence in health economic models and their outcomes.
Validation studies of the DOE-2 Building Energy Simulation Program. Final Report
DOE Office of Scientific and Technical Information (OSTI.GOV)
Sullivan, R.; Winkelmann, F.
1998-06-01
This report documents many of the validation studies (Table 1) of the DOE-2 building energy analysis simulation program that have taken place since 1981. Results for several versions of the program are presented with the most recent study conducted in 1996 on version DOE-2.1E and the most distant study conducted in 1981 on version DOE-1.3. This work is part of an effort related to continued development of DOE-2, particularly in its use as a simulation engine for new specialized versions of the program such as the recently released RESFEN 3.1. RESFEN 3.1 is a program specifically dealing with analyzing the energy performance of windows in residential buildings. The intent in providing the results of these validation studies is to give potential users of the program a high degree of confidence in the calculated results. Validation studies in which calculated simulation data is compared to measured data have been conducted throughout the development of the DOE-2 program. Discrepancies discovered during the course of such work have resulted in improvements in the simulation algorithms. Table 2 provides a listing of additions and modifications that have been made to various versions of the program since version DOE-2.1A. One of the most significant recent changes in the program occurred with version DOE-2.1E. An improved algorithm for calculating the outside surface film coefficient was implemented. In addition, integration of the WINDOW 4 program was accomplished, resulting in improved ability in analyzing window energy performance. Validation and verification of a program as sophisticated as DOE-2 must necessarily be limited because of the approximations inherent in the program. For example, the most accurate model of the heat transfer processes in a building would include a three-dimensional analysis.
To justify such detailed algorithmic procedures would correspondingly require detailed information describing the building and/or HVAC system and energy plant parameters. Until building simulation programs can get this data directly from CAD programs, such detail would negate the usefulness of the program for the practicing engineers and architects who currently use the program. In addition, the validation studies discussed herein indicate that such detail is really unnecessary. The comparison of calculated and measured quantities has resulted in a satisfactory level of confidence that is sufficient for continued use of the DOE-2 program. However, additional validation is warranted, particularly at the component level, to further improve the program.
ERIC Educational Resources Information Center
Lindwall, Magnus; Palmeira, Antonio
2009-01-01
The present study investigated the factorial validity and factorial invariance of the 21-item Exercise Dependence Scale-Revised using 162 Swedish and 269 Portuguese exercisers. In addition, the prevalence of exercise dependence symptoms and links to exercise behavior, gender, and age in the two samples was also studied. Confirmatory factor…
ERIC Educational Resources Information Center
Mangione, Katherine Anna
2010-01-01
This study sought to determine reliability and validity for a two-tiered, multiple-choice instrument designed to identify alternative conceptions in earth science. Additionally, this study sought to identify alternative conceptions in earth science held by preservice teachers, to investigate relationships between self-reported confidence scores and…
Measuring the Quality of Life of University Students. Research Monograph Series. Volume 1.
ERIC Educational Resources Information Center
Roberts, Lance W.; Clifton, Rodney A.
This study sought to develop a valid set of scales in the cognitive and affective domains for measuring the quality of life of university students. In addition the study attempted to illustrate the usefulness of Thomas Piazza's procedures for constructing valid scales in educational research. Piazza's method involves a multi-step construction of…
Fitzsimmons-Craft, Ellen E; Bardone-Cone, Anna M
2014-01-01
This study examined the one-year temporal stability and the predictive and incremental validity of the Body, Eating, and Exercise Comparison Measure (BEECOM) in a sample of 237 college women who completed study measures at two time points about one year apart. One-year temporal stability was high for the BEECOM total and subscale (i.e., Body, Eating, and Exercise Comparison Orientation) scores. Additionally, the BEECOM exhibited predictive validity in that it accounted for variance in body dissatisfaction and eating disorder symptomatology one year later. These findings held even after controlling for body mass index and existing measures of social comparison orientation. However, results regarding the incremental validity of the BEECOM, or its ability to predict change in these constructs over time, were more mixed. Overall, this study demonstrated additional psychometric properties of the BEECOM among college women, further establishing the usefulness of this measure for more comprehensively assessing eating disorder-related social comparison. Copyright © 2013 Elsevier Ltd. All rights reserved.
An empirical assessment of validation practices for molecular classifiers
Castaldi, Peter J.; Dahabreh, Issa J.
2011-01-01
Proposed molecular classifiers may be overfit to idiosyncrasies of noisy genomic and proteomic data. Cross-validation methods are often used to obtain estimates of classification accuracy, but both simulations and case studies suggest that, when inappropriate methods are used, bias may ensue. Bias can be bypassed and generalizability can be tested by external (independent) validation. We evaluated 35 studies that have reported on external validation of a molecular classifier. We extracted information on study design and methodological features, and compared the performance of molecular classifiers in internal cross-validation versus external validation for 28 studies where both had been performed. We demonstrate that the majority of studies pursued cross-validation practices that are likely to overestimate classifier performance. Most studies were markedly underpowered to detect a 20% decrease in sensitivity or specificity between internal cross-validation and external validation [median power was 36% (IQR, 21–61%) and 29% (IQR, 15–65%), respectively]. The median reported classification performance for sensitivity and specificity was 94% and 98%, respectively, in cross-validation and 88% and 81% for independent validation. The relative diagnostic odds ratio was 3.26 (95% CI 2.04–5.21) for cross-validation versus independent validation. Finally, we reviewed all studies (n = 758) which cited those in our study sample, and identified only one instance of additional subsequent independent validation of these classifiers. In conclusion, these results document that many cross-validation practices employed in the literature are potentially biased and genuine progress in this field will require adoption of routine external validation of molecular classifiers, preferably in much larger studies than in current practice. PMID:21300697
Dugosh, Karen Leggett; Festinger, David S.; Croft, Jason R.; Marlowe, Douglas B.
2011-01-01
Despite many efforts aimed to ensure that research participation is autonomous and not coerced, there exists no reliable and valid measure of perceived coercion for the doubly vulnerable population of substance-abusing offenders. The current study describes the development and initial validation of an instrument measuring perceived coercion to participate in research among substance-abusing offenders. The results indicated that a substantial number of individuals report feeling coerced to participate in the study. In addition, the instrument has adequate levels of internal consistency, a one-dimensional factor structure, and evidence of discriminative validity. This study provides initial support for the instrument’s validity and clinical utility. PMID:20235867
Probability of Detection (POD) as a statistical model for the validation of qualitative methods.
Wehling, Paul; LaBudde, Robert A; Brunelle, Sharon L; Nelson, Maria T
2011-01-01
A statistical model is presented for use in validation of qualitative methods. This model, termed Probability of Detection (POD), harmonizes the statistical concepts and parameters between quantitative and qualitative method validation. POD characterizes method response with respect to concentration as a continuous variable. The POD model provides a tool for graphical representation of response curves for qualitative methods. In addition, the model allows comparisons between candidate and reference methods, and provides calculations of repeatability, reproducibility, and laboratory effects from collaborative study data. Single laboratory study and collaborative study examples are given.
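As a rough illustration of the idea, POD at a given concentration can be estimated as the fraction of replicates that return a positive result, and a candidate method can be compared with a reference method through the difference of the two estimates at each concentration. The sketch below assumes that simple proportion estimate; the function names, concentrations, and counts are hypothetical and are not taken from the paper, and confidence intervals are left out.

```python
def pod(positives, replicates):
    """Estimated probability of detection: positive results / replicates tested."""
    if replicates <= 0:
        raise ValueError("replicates must be a positive integer")
    return positives / replicates


def dpod(pod_candidate, pod_reference):
    """Difference in POD between candidate and reference methods."""
    return pod_candidate - pod_reference


# Hypothetical data: concentration -> (positive results, replicates tested).
candidate = {0.0: (0, 12), 0.5: (7, 12), 2.0: (12, 12)}
reference = {0.0: (0, 12), 0.5: (4, 12), 2.0: (12, 12)}

# One dPOD value per concentration traces out the comparison curve.
curve = {
    conc: dpod(pod(*candidate[conc]), pod(*reference[conc]))
    for conc in sorted(candidate)
}
for conc, diff in curve.items():
    print(f"concentration {conc}: dPOD = {diff:+.2f}")
```

Plotting the POD estimates against concentration gives the kind of response curve the abstract refers to; the methods differ most at intermediate concentrations, where detection is neither certain nor impossible.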
Incremental Validity of the New MCAT.
ERIC Educational Resources Information Center
Friedman, Charles P.; Bakewell, William E., Jr.
1980-01-01
The ability of the new Medical College Admission Test (MCAT) to predict performance of first-year medical students at the University of North Carolina was studied. Its incremental validity, determined by computing the additional variance in performance explainable by the MCAT after the effects of other admissions variables were taken into account,…
The goal of this EDRN set-asides proposal is to carry out pre-validation studies on sarcosine as a metabolomic biomarker of prostate cancer in urine. Not only does sarcosine have potential as a marker for the early detection of prostate cancer in post-DRE urine specimens, but because its highest levels are in metastatic disease, it might also have utility in predicting aggressiveness of clinically localized disease. We will also use these funds to determine if we can add other metabolites to sarcosine in order to develop a multiplex metabolomic biomarker panel in prostate cancer. In addition to sarcosine, we have 10-12 candidate metabolomic biomarkers that could be developed (as seen from our preliminary global metabolite studies).
Gilman, T Lee; Shaheen, Razan; Nylocks, K Maria; Halachoff, Danielle; Chapman, Jessica; Flynn, Jessica J; Matt, Lindsey M; Coifman, Karin G
2017-12-01
Emotions are highly influential to many psychological processes. Indeed, research employing emotional stimuli is rapidly escalating across the field of psychology. However, challenges remain regarding discrete evocation of frequently co-elicited emotions such as amusement and happiness, or anger and disgust. Further, as much contemporary work in emotion employs college students, we sought to additionally evaluate the efficacy of film clips to discretely elicit these more challenging emotions in a young adult population using an online medium. The internet is an important tool for investigating responses to emotional stimuli, but validations of emotionally evocative film clips across laboratory and web-based settings are limited in the literature. An additional obstacle is identifying stimuli amidst the numerous film clip validation studies. During our investigation, we recognized the lack of a categorical database to facilitate rapid identification of useful film clips for individual researchers' unique investigations. Consequently, here we also sought to produce the first compilation of such stimuli into an accessible and comprehensive catalog. We based our catalog upon prior work as well as our own, and identified 24 articles and 295 film clips from four decades of research. We present information on the validation of these clips in addition to our own research validating six clips using online administration settings. The results of our search in the literature and our own study are presented in tables designed to facilitate and improve a selection of highly valid film stimuli for future research.
Singh, Jay P; Desmarais, Sarah L; Van Dorn, Richard A
2013-01-01
The objective of the present review was to examine how predictive validity is analyzed and reported in studies of instruments used to assess violence risk. We reviewed 47 predictive validity studies published between 1990 and 2011 of 25 instruments that were included in two recent systematic reviews. Although all studies reported receiver operating characteristic curve analyses and the area under the curve (AUC) performance indicator, this methodology was defined inconsistently and findings often were misinterpreted. In addition, there was between-study variation in benchmarks used to determine whether AUCs were small, moderate, or large in magnitude. Though virtually all of the included instruments were designed to produce categorical estimates of risk - through the use of either actuarial risk bins or structured professional judgments - only a minority of studies calculated performance indicators for these categorical estimates. In addition to AUCs, other performance indicators, such as correlation coefficients, were reported in 60% of studies, but were infrequently defined or interpreted. An investigation of sources of heterogeneity did not reveal significant variation in reporting practices as a function of risk assessment approach (actuarial vs. structured professional judgment), study authorship, geographic location, type of journal (general vs. specialized audience), sample size, or year of publication. Findings suggest a need for standardization of predictive validity reporting to improve comparison across studies and instruments. Copyright © 2013 John Wiley & Sons, Ltd.
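Because the review above notes that the AUC was defined inconsistently across studies, a minimal sketch of one common definition may help: the AUC is the probability that a randomly chosen case with the outcome receives a higher risk score than a randomly chosen case without it, with ties counted as one half. The function name and the scores below are hypothetical illustrations, not data from the reviewed studies.

```python
def auc(scores_pos, scores_neg):
    """Probability that a positive case outscores a negative case (ties count 0.5)."""
    wins = 0.0
    for p in scores_pos:
        for n in scores_neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(scores_pos) * len(scores_neg))


# Hypothetical risk scores for people who did and did not go on to be violent.
violent = [0.9, 0.7, 0.6, 0.4]
not_violent = [0.8, 0.5, 0.3, 0.2, 0.1]

# 16 of the 20 possible pairs are ordered correctly, so the AUC is 0.8.
print(auc(violent, not_violent))
```

Note that this quantity is computed from continuous scores; it says nothing by itself about the accuracy of categorical risk bins, which need their own performance indicators.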
Validation of the Social Appearance Anxiety Scale: factor, convergent, and divergent validity.
Levinson, Cheri A; Rodebaugh, Thomas L
2011-09-01
The Social Appearance Anxiety Scale (SAAS) was created to assess fear of overall appearance evaluation. Initial psychometric work indicated that the measure had a single-factor structure and exhibited excellent internal consistency, test-retest reliability, and convergent validity. In the current study, the authors further examined the factor, convergent, and divergent validity of the SAAS in two samples of undergraduates. In Study 1 (N = 323), the authors tested the factor structure, convergent, and divergent validity of the SAAS with measures of the Big Five personality traits, negative affect, fear of negative evaluation, and social interaction anxiety. In Study 2 (N = 118), participants completed a body evaluation that included measurements of height, weight, and body fat content. The SAAS exhibited excellent convergent and divergent validity with self-report measures (i.e., self-esteem, trait anxiety, ethnic identity, and sympathy), predicted state anxiety experienced during the body evaluation, and predicted body fat content. In both studies, results confirmed a single-factor structure as the best fit to the data. These results lend additional support for the use of the SAAS as a valid measure of social appearance anxiety.
Factor structure and validation of the Attentional Control Scale.
Judah, Matt R; Grant, DeMond M; Mills, Adam C; Lechner, William V
2014-04-01
The Attentional Control Scale (ACS; Derryberry & Reed, 2002) has been used to assess executive control over attention in numerous studies, but no published data have examined the factor structure of the English version. The current studies addressed this need and tested the predictive and convergent validity of the ACS subscales. In Study 1, exploratory factor analysis yielded a two-factor model with Focusing and Shifting subscales. In Study 2, confirmatory factor analysis supported this model and suggested superior fit compared to the factor structure of the Icelandic version (Ólafsson et al., 2011). Study 3 examined correlations between the ACS subscales and measures of working memory, anxiety, and cognitive control. Study 4 examined correlations between the subscales and reaction times on a mixed-antisaccade task, revealing positive correlations for antisaccade performance and prosaccade latency with Focusing scores and between switch trial performance and Shifting scores. Additionally, the findings partially supported unique relationships between Focusing and trait anxiety and between Shifting and depression that have been noted in recent research. Although the results generally support the validity of the ACS, additional research using performance-based tasks is needed.
ERIC Educational Resources Information Center
Sinharay, Sandip; Feng, Ying; Saldivia, Luis; Powers, Donald E.; Ginuta, Anthony; Simpson, Annabelle; Weng, Vincent
2008-01-01
The validity of TOEIC Bridge™ scores as a measure of English language skill was examined from the standpoint of a unified concept of test validity. In this study, more than 6,000 test takers in 3 Latin American countries (Chile, Colombia, and Ecuador) took 1 form of the TOEIC Bridge test, and their scores were compared to additional information…
ERIC Educational Resources Information Center
Carlson, Thomas Stone; McGeorge, Christi R.; Toomey, Russell B.
2013-01-01
This study established the validity and factor structure of the Affirmative Training Inventory (ATI; T. S. Carlson, C. R. McGeorge & M. Rock, unpublished) as a measure of lesbian, gay, and bisexual (LGB) affirmative clinical training. Additionally, this study examined the latent associations among the subscales of the ATI and the Sexual…
Validation analysis of probabilistic models of dietary exposure to food additives.
Gilsenan, M B; Thompson, R L; Lambe, J; Gibney, M J
2003-10-01
The validity of a range of simple conceptual models designed specifically for the estimation of food additive intakes using probabilistic analysis was assessed. Modelled intake estimates that fell below traditional conservative point estimates of intake and above 'true' additive intakes (calculated from a reference database at brand level) were considered to be in a valid region. Models were developed for 10 food additives by combining food intake data, the probability of an additive being present in a food group and additive concentration data. Food intake and additive concentration data were entered as raw data or as a lognormal distribution, and the probability of an additive being present was entered based on the per cent brands or the per cent eating occasions within a food group that contained an additive. Since the three model components assumed two possible modes of input, the validity of eight (2^3) model combinations was assessed. All model inputs were derived from the reference database. An iterative approach was employed in which the validity of individual model components was assessed first, followed by validation of full conceptual models. While the distribution of intake estimates from models fell below conservative intakes, which assume that the additive is present at maximum permitted levels (MPLs) in all foods in which it is permitted, intake estimates were not consistently above 'true' intakes. These analyses indicate the need for more complex models for the estimation of food additive intakes using probabilistic analysis. Such models should incorporate information on market share and/or brand loyalty.
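The eight model combinations described above can be sketched as a small Monte Carlo simulation. This is only an illustration under stated assumptions: the input values, the 97.5th-percentile summary, the sample size, and all function names are hypothetical and are not taken from the study; only the three components and their two input modes each come from the abstract.

```python
import itertools
import math
import random
from statistics import mean, stdev


def raw_sampler(data, rng):
    """Input mode 1: resample the raw observations."""
    return lambda: rng.choice(data)


def lognormal_sampler(data, rng):
    """Input mode 2: draw from a lognormal distribution fitted to the observations."""
    logs = [math.log(v) for v in data]
    mu, sigma = mean(logs), stdev(logs)
    return lambda: rng.lognormvariate(mu, sigma)


def high_percentile_intake(food, conc, p_present, rng, n=5000, q=0.975):
    """Simulate intake = food intake x concentration x presence; return the q-th quantile."""
    draws = sorted(food() * conc() * (rng.random() < p_present) for _ in range(n))
    return draws[int(q * n)]


rng = random.Random(0)
food_g_per_day = [50, 80, 120, 200, 260, 310]   # hypothetical food intakes
conc_mg_per_kg = [20, 35, 60, 90, 150]          # hypothetical additive concentrations
presence = {"pct_brands": 0.40, "pct_eating_occasions": 0.25}  # hypothetical probabilities
samplers = {"raw": raw_sampler, "lognormal": lognormal_sampler}

results = {}
for f_mode, c_mode, p_mode in itertools.product(samplers, samplers, presence):
    food = samplers[f_mode](food_g_per_day, rng)
    conc = samplers[c_mode](conc_mg_per_kg, rng)
    # g/day x mg/kg / 1000 gives mg/day
    results[(f_mode, c_mode, p_mode)] = (
        high_percentile_intake(food, conc, presence[p_mode], rng) / 1000
    )

for combo, intake in results.items():
    print(combo, round(intake, 2))
```

In the study's terms, each of these eight estimates would then be checked against the conservative point estimate (upper bound) and the 'true' intake from the reference database (lower bound); those two reference values are not simulated here.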
Nelson, Jennifer C.; Marsh, Tracey; Lumley, Thomas; Larson, Eric B.; Jackson, Lisa A.; Jackson, Michael
2014-01-01
Objective: Estimates of treatment effectiveness in epidemiologic studies using large observational health care databases may be biased due to inaccurate or incomplete information on important confounders. Study methods that collect and incorporate more comprehensive confounder data on a validation cohort may reduce confounding bias. Study Design and Setting: We applied two such methods, imputation and reweighting, to Group Health administrative data (full sample) supplemented by more detailed confounder data from the Adult Changes in Thought study (validation sample). We used influenza vaccination effectiveness (with an unexposed comparator group) as an example and evaluated each method’s ability to reduce bias using the control time period prior to influenza circulation. Results: Both methods reduced, but did not completely eliminate, the bias compared with traditional effectiveness estimates that do not utilize the validation sample confounders. Conclusion: Although these results support the use of validation sampling methods to improve the accuracy of comparative effectiveness findings from healthcare database studies, they also illustrate that the success of such methods depends on many factors, including the ability to measure important confounders in a representative and large enough validation sample, the comparability of the full sample and validation sample, and the accuracy with which data can be imputed or reweighted using the additional validation sample information. PMID: 23849144
Cultural Validation of the Maslach Burnout Inventory for Korean Students
ERIC Educational Resources Information Center
Shin, Hyojung; Puig, Ana; Lee, Jayoung; Lee, Ji Hee; Lee, Sang Min
2011-01-01
The purpose of this study was to examine the factorial validity of the MBI-SS in Korean students. Specifically, we investigated whether the original three-factor structure of the MBI-SS was appropriate for use with Korean students. In addition, by running multi-group structural equation model analyses with factorial invariance tests simultaneously…
Additional Evidence of Convergent Validity between SRSS-IE and SSiS-PSG Scores
ERIC Educational Resources Information Center
Lane, Kathleen Lynne; Oakes, Wendy Peia; Ennis, Robin Parks; Royer, David James
2015-01-01
We report findings of a validity study comparing two screening tools: the Student Risk Screening Scale-Internalizing and Externalizing (SRSS-IE) and the Social Skills Improvement System-Performance Screening Guide (SSiS-PSG; Elliott & Gresham, 2007). Participants were 1,680 kindergarten through sixth-grade elementary students from three…
Design and validation of instruments to measure knowledge.
Elliott, T E; Regal, R R; Elliott, B A; Renier, C M
2001-01-01
Measuring health care providers' learning after they have participated in educational interventions that use experimental designs requires valid, reliable, and practical instruments. A literature review was conducted. In addition, experience gained from designing and validating instruments for measuring the effect of an educational intervention informed this process. The eight main steps for designing, validating, and testing the reliability of instruments for measuring learning outcomes are presented. The key considerations and rationale for this process are discussed. Methods for critiquing and adapting existent instruments and creating new ones are offered. This study may help other investigators in developing valid, reliable, and practical instruments for measuring the outcomes of educational activities.
Development and validation of a measure of food choice values.
Lyerly, Jordan E; Reeve, Charlie L
2015-06-01
Food choice values (FCVs) are factors that individuals consider when deciding which foods to purchase and/or consume. Given the potentially important implications for health, it is critical for researchers to have access to a validated measure of FCV. Though there is an existing measure of FCV, this measure was developed 20 years ago and recent research suggests additional FCVs exist that are not included in this measure. A series of four studies was conducted to develop a new expanded measure of FCV. An eight-factor model of FCV was supported and confirmed. In aggregate, results from the four studies indicate that the measure is content valid, and has internally consistent scales that also demonstrated acceptable temporal stability and convergent validity. In addition, the eight scales of the measure were independent of social desirability, met criteria for measurement invariance across income groups, and predicted dietary intake. The development of this new measure of FCV may be useful for researchers examining FCVs in the future, as well as for use in intervention and prevention efforts targeting dietary choices. Copyright © 2015 Elsevier Ltd. All rights reserved.
NASA Astrophysics Data System (ADS)
Zimmermann, Judith; von Davier, Alina A.; Buhmann, Joachim M.; Heinimann, Hans R.
2018-01-01
Graduate admission has become a critical process in tertiary education, whereby selecting valid admissions instruments is key. This study assessed the validity of Graduate Record Examination (GRE) General Test scores for admission to Master's programmes at a technical university in Europe. We investigated the indicative value of GRE scores for the Master's programme grade point average (GGPA) with and without the addition of the undergraduate GPA (UGPA) and the TOEFL score, and of GRE scores for study completion and Master's thesis performance. GRE scores explained 20% of the variation in the GGPA, while an additional 7% was explained by the TOEFL score and 3% by the UGPA. Contrary to common belief, the GRE quantitative reasoning score showed only little explanatory power. GRE scores were also weakly related to study progress but not to thesis performance. Nevertheless, GRE and TOEFL scores were found to be sensible admissions instruments. Rigorous methodology was used to obtain highly reliable results.
Amendola, Fernanda; Alvarenga, Márcia Regina Martins; Latorre, Maria do Rosário Dias de Oliveira; Oliveira, Maria Amélia de Campos
2014-02-01
This exploratory, descriptive, cross-sectional, and quantitative study aimed to develop and validate an index of family vulnerability to disability and dependence (FVI-DD). This study was adapted from the Family Development Index, with the addition of social and health indicators of disability and dependence. The instrument was applied to 248 families in the city of Sao Paulo, followed by exploratory factor analysis. Factor validation was performed using the concurrent and discriminant validity of the Lawton scale and Katz Index. The descriptive level adopted for the study was p < 0.05. The final vulnerability index comprised 50 questions classified into seven factors contemplating social and health dimensions, and this index exhibited good internal consistency (Cronbach's alpha = 0.82). FVI-DD was validated using both the Lawton scale and Katz Index. We conclude that FVI-DD can accurately and reliably assess family vulnerability to disability and dependence.
Overby, Nina Cecilie; Johannesen, Elisabeth; Jensen, Grete; Skjaevesland, Anne-Kirsti; Haugen, Margaretha
2014-01-01
The assessment of food intake is challenging and prone to errors; it is therefore important to consider the reliability and validity of the assessment methods. The aim of this study was to analyze the reproducibility and validity of a developed food-frequency questionnaire (FFQ) for use among adolescents. In total, 58 students (aged 13-14) from four different schools in the southern part of Norway participated in the reproducibility study by filling out the FFQ twice, 4 weeks apart. In addition, 93 students participated in the relative validity study where the FFQ was compared to 2×24-hour dietary recalls, while 92 students participated in the absolute validity study where the intakes of fatty acids and vitamin D from the FFQ were compared to fatty acids and 25-hydroxy-vitamin D3 in whole blood. The median Spearman correlation coefficient for all nutrients in the test-retest reliability study was 0.57. The median Spearman correlation for all nutrients in the relative validity study was 0.26, while the correlation coefficients were low in the absolute validity study, with n-3 fatty acid coefficients ranging from 0.05 to 0.25, and absent for vitamin D (r=0.000). The test-retest reproducibility was considered good, the relative validity was considered poor to good, and the absolute validity was considered poor. However, the results are comparable to other studies among adolescents.
Paiva, Carlos Eduardo; Carneseca, Estela Cristina; Barroso, Eliane Marçon; de Camargos, Mayara Goulart; Alfano, Ana Camila Callado; Rugno, Fernanda Capella; Paiva, Bianca Sakamoto Ribeiro
2014-08-01
The European Organization for Research and Treatment of Cancer Core Quality of Life Questionnaire (EORTC QLQ-C30) is considered a valid instrument for use in Brazil. However, the previous Brazilian validation study included only 30 lung cancer patients and only measured test-retest reliability. The aim of this study was to evaluate the psychometric properties of the EORTC QLQ-C30 in a sample of cancer patients at different educational levels who completed the instrument administered by an interviewer. Data from six prospective studies conducted by the same group of researchers were combined in this study (N = 986). Reliability was assessed using Cronbach's alpha coefficient, all values of which were >0.7, with the exception of cognitive functioning, social functioning, and nausea and vomiting (α = 0.57, α = 0.69, and α = 0.68, respectively). In multi-trait scaling analysis, convergent and divergent validity were considered adequate (validity indices were 91.6 and 97.4%). In general, moderate to strong correlations were found between the subscales of the EORTC QLQ-C30 and its respective dimensions from the WHOQOL-bref, the hospital anxiety and depression scale, and the Edmonton Symptom Assessment System (ESAS) instruments. In addition, the EORTC QLQ-C30 was able to differentiate groups of patients with distinct performance statuses and types of treatment (known-group validation). Statistical analyses were also performed on educational status, yielding similar results. Detailed psychometric property data using the EORTC QLQ-C30 in Brazil are added by this study. In addition, we demonstrated that this instrument is in general reliable and valid regardless of the patient educational level.
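Since the abstract above reports reliability as Cronbach's alpha against a 0.7 benchmark, a short sketch of the standard formula may be useful: alpha = k/(k-1) × (1 − sum of item variances / variance of the total score), where k is the number of items. The function name and the response data are hypothetical and are not taken from the study.

```python
from statistics import variance


def cronbach_alpha(responses):
    """responses: one row per respondent, one column per item of the same subscale."""
    k = len(responses[0])
    item_vars = [variance(col) for col in zip(*responses)]
    total_var = variance([sum(row) for row in responses])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


# Hypothetical answers of five respondents to a three-item subscale (1-4 scale).
answers = [
    [1, 2, 1],
    [2, 2, 3],
    [3, 3, 3],
    [4, 3, 4],
    [4, 4, 4],
]
print(round(cronbach_alpha(answers), 2))
```

Alpha rises with the number of items as well as with their intercorrelation, so short subscales tend to produce lower values, all else being equal.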
Cognitive Bias in the Verification and Validation of Space Flight Systems
NASA Technical Reports Server (NTRS)
Larson, Steve
2012-01-01
Cognitive bias is generally recognized as playing a significant role in virtually all domains of human decision making. Insight into this role is informally built into many of the system engineering practices employed in the aerospace industry. The review process, for example, typically has features that help to counteract the effect of bias. This paper presents a discussion of how commonly recognized biases may affect the verification and validation process. Verifying and validating a system is arguably more challenging than development, both technically and cognitively. Whereas there may be a relatively limited number of options available for the design of a particular aspect of a system, there is a virtually unlimited number of potential verification scenarios that may be explored. The probability of any particular scenario occurring in operations is typically very difficult to estimate, which increases reliance on judgment that may be affected by bias. Implementing a verification activity often presents technical challenges that, if they can be overcome at all, often result in a departure from actual flight conditions (e.g., 1-g testing, simulation, time compression, artificial fault injection) that may raise additional questions about the meaningfulness of the results, and create opportunities for the introduction of additional biases. In addition to mitigating the biases it can introduce directly, the verification and validation process must also overcome the cumulative effect of biases introduced during all previous stages of development. A variety of cognitive biases will be described, with research results for illustration. 
A handful of case studies will be presented that show how cognitive bias may have affected the verification and validation process on recent JPL flight projects, identify areas of strength and weakness, and identify potential changes or additions to commonly used techniques that could provide a more robust verification and validation of future systems.
ERIC Educational Resources Information Center
Zembat, Rengin; Turasli, Nalan Kuru; Güven, Gülçin; Sezer, Türker; Aksin, Ezgi; Yilmaz, Elif; Bayindir, Dilan
2016-01-01
The aim of this study is to investigate the reliability and validity of the DeMoulin Self-Concept Developmental Scale for 36-72 month old children. In addition, it has been attempted to examine the effects of age and gender variables on the self-concept of children. The study is in survey method. The sample consists of 810 children who attend…
Moreira, Paulo A S; Oliveira, João Tiago; Dias, Paulo; Vaz, Filipa Machado; Torres-Oliveira, Isabel
2014-08-04
Students' perceptions about school success promotion strategies are of great importance for schools, as they are an indicator of how students perceive the school success promotion strategies. The objective of this study was to develop and analyze the validity evidence for the Students' Perceptions of School Success Promoting Strategies Inventory (SPSI), which assesses both individual students' perceptions of their school success promoting strategies, and dimensions of school quality. A structure of 7 related factors was found, which showed good adjustment indices in two additional different samples, suggesting that this is a well-fitting multi-group model (p < .001). All scales presented good reliability values. Schools with good academic results registered higher values in Career development, Active learning, Proximity, Educational Technologies and Extra-curricular activities (p < .05). The SPSI proved adequate for measuring within-school (students within schools) dimensions of school success. In addition, there is preliminary evidence for its adequacy for measuring school success promotion dimensions between schools for 4 dimensions. This study supports the validity evidence for the SPSI (evidence based on test content, on internal structure, on relations to other variables and on consequences of testing). Future studies should test for within- and between-level variance in a larger sample of schools.
Nelson, Jennifer Clark; Marsh, Tracey; Lumley, Thomas; Larson, Eric B; Jackson, Lisa A; Jackson, Michael L
2013-08-01
Estimates of treatment effectiveness in epidemiologic studies using large observational health care databases may be biased owing to inaccurate or incomplete information on important confounders. Study methods that collect and incorporate more comprehensive confounder data on a validation cohort may reduce confounding bias. We applied two such methods, namely imputation and reweighting, to Group Health administrative data (full sample) supplemented by more detailed confounder data from the Adult Changes in Thought study (validation sample). We used influenza vaccination effectiveness (with an unexposed comparator group) as an example and evaluated each method's ability to reduce bias using the control time period before influenza circulation. Both methods reduced, but did not completely eliminate, the bias compared with traditional effectiveness estimates that do not use the validation sample confounders. Although these results support the use of validation sampling methods to improve the accuracy of comparative effectiveness findings from health care database studies, they also illustrate that the success of such methods depends on many factors, including the ability to measure important confounders in a representative and large enough validation sample, the comparability of the full sample and validation sample, and the accuracy with which the data can be imputed or reweighted using the additional validation sample information. Copyright © 2013 Elsevier Inc. All rights reserved.
Bischof, Martin; Obermann, Caitriona; Hartmann, Matthias N; Hager, Oliver M; Kirschner, Matthias; Kluge, Agne; Strauss, Gregory P; Kaiser, Stefan
2016-11-22
Negative symptoms are considered core symptoms of schizophrenia. The Brief Negative Symptom Scale (BNSS) was developed to measure this symptomatic dimension according to a current consensus definition. The present study examined the psychometric properties of the German version of the BNSS. To expand former findings on convergent validity, we employed the Temporal Experience Pleasure Scale (TEPS), a hedonic self-report that distinguishes between consummatory and anticipatory pleasure. Additionally, we addressed convergent validity with observer-rated assessment of apathy with the Apathy Evaluation Scale (AES), which was completed by the patient's primary nurse. Data were collected from 75 in- and outpatients from the Psychiatric Hospital, University Zurich diagnosed with either schizophrenia or schizoaffective disorder. We assessed convergent and discriminant validity, internal consistency and inter-rater reliability. We largely replicated the findings of the original version showing good psychometric properties of the BNSS. In addition, the primary nurse's evaluation correlated moderately with interview-based clinician rating. BNSS anhedonia items showed good convergent validity with the TEPS. Overall, the German BNSS shows good psychometric properties comparable to the original English version. Convergent validity extends beyond interview-based assessments of negative symptoms to self-rated anhedonia and observer-rated apathy.
ERIC Educational Resources Information Center
McIntosh, Kent; Campbell, Amy L.; Carter, Deborah Russell; Zumbo, Bruno D.
2009-01-01
Office discipline referrals (ODRs) are commonly used by school teams implementing schoolwide positive behavior support to indicate individual student need for additional behavior support. However, little is known about the technical adequacy of ODRs when used in this manner. In this study, the authors assessed (a) the concurrent validity of number…
Measuring the Effect of Tourism Services on Travelers' Quality of Life: Further Validation.
ERIC Educational Resources Information Center
Neal, Janet D.; Sirgy, M. Joseph; Uysal, Muzaffer
2004-01-01
This replication and extension study provided additional validational support of the original tourism services satisfaction measure in relation to QOL-related measures. Neal, Sirgy and Uysal (1999) developed a model and a measure to capture the effect of tourism services on travelers' quality of life (QOL). They hypothesized that travelers' overall life…
DOT National Transportation Integrated Search
2017-02-08
The study re-evaluates distress prediction models using the Mechanistic-Empirical Pavement Design Guide (MEPDG) and expands the sensitivity analysis to a wide range of pavement structures and soils. In addition, an extensive validation analysis of th...
ERIC Educational Resources Information Center
Kramer, Gene A.; Johnston, JoElle
1997-01-01
A study examined the relationship between Optometry Admission Test scores and pre-optometry or undergraduate grade point average (GPA) with first and second year performance in optometry schools. The test's predictive validity was limited but significant, and comparable to those reported for other admission tests. In addition, the scores…
Empirical validation of an agent-based model of wood markets in Switzerland
Hilty, Lorenz M.; Lemm, Renato; Thees, Oliver
2018-01-01
We present an agent-based model of wood markets and show our efforts to validate this model using empirical data from different sources, including interviews, workshops, experiments, and official statistics. Our own surveys closed gaps where data were not available. Our approach to model validation used a variety of techniques, including the replication of historical production amounts, prices, and survey results, as well as a historical case study of a large sawmill entering the market and becoming insolvent only a few years later. Validating the model using this case provided additional insights, showing how the model can be used to simulate scenarios of resource availability and resource allocation. We conclude that the outcome of the rigorous validation qualifies the model to simulate scenarios concerning resource availability and allocation in our study region. PMID: 29351300
Harrison, Peter M C; Collins, Tom; Müllensiefen, Daniel
2017-06-15
Modern psychometric theory provides many useful tools for ability testing, such as item response theory, computerised adaptive testing, and automatic item generation. However, these techniques have yet to be integrated into mainstream psychological practice. This is unfortunate, because modern psychometric techniques can bring many benefits, including sophisticated reliability measures, improved construct validity, avoidance of exposure effects, and improved efficiency. In the present research we therefore use these techniques to develop a new test of a well-studied psychological capacity: melodic discrimination, the ability to detect differences between melodies. We calibrate and validate this test in a series of studies. Studies 1 and 2 respectively calibrate and validate an initial test version, while Studies 3 and 4 calibrate and validate an updated test version incorporating additional easy items. The results support the new test's viability, with evidence for strong reliability and construct validity. We discuss how these modern psychometric techniques may also be profitably applied to other areas of music psychology and psychological science in general.
Tavoli, Azadeh; Melyani, Mahdiyeh; Bakhtiari, Maryam; Ghaedi, Gholam Hossein; Montazeri, Ali
2009-07-09
The Brief Fear of Negative Evaluation Scale (BFNE) is a commonly used instrument to measure social anxiety. This study aimed to translate and to test the reliability and validity of the BFNE in Iran. The English language version of the BFNE was translated into Persian (Iranian language) and was used in this study. The questionnaire was administered to a consecutive sample of 235 students with (n = 33, clinical group) and without social phobia (n = 202, non-clinical group). In addition to the BFNE, two standard instruments were used to measure social phobia severity: the Social Phobia Inventory (SPIN), and the Social Interaction Anxiety Scale (SIAS). All participants completed a brief background information questionnaire, the SPIN, the SIAS and the BFNE scales. Statistical analysis was performed to test the reliability and validity of the BFNE. In all 235 students were studied (111 male and 124 female). The mean age for non-clinical group was 22.2 (SD = 2.1) years and for clinical sample it was 22.4 (SD = 1.8) years. Cronbach's alpha coefficient (to test reliability) was acceptable for both non-clinical and clinical samples (alpha = 0.90 and 0.82 respectively). In addition, 3-week test-retest reliability was performed in non-clinical sample and the intraclass correlation coefficient (ICC) was quite high (ICC = 0.71). Validity as performed using convergent and discriminant validity showed satisfactory results. The questionnaire correlated well with established measures of social phobia such as the SPIN (r = 0.43, p < 0.001) and the SIAS (r = 0.54, p < 0.001). Also the BFNE discriminated well between men and women with and without social phobia in the expected direction. Factor analysis supported a two-factor solution corresponding to positive and reverse-worded items. This validation study of the Iranian version of BFNE proved that it is an acceptable, reliable and valid measure of social phobia. 
However, because the scale showed a two-factor structure, which does not conform to the theoretical basis for the BFNE, we suggest using the BFNE-II when it becomes available in Iran. The validation study of the BFNE-II is in progress.
Hybrid Soft Soil Tire Model (HSSTM). Part 1: Tire Material and Structure Modeling
2015-04-28
commercially available vehicle simulation packages. Model parameters are obtained using a validated finite element tire model, modal analysis, and other...design of experiment matrix. This data, in addition to modal analysis data were used to validate the tire model. Furthermore, to study the validity... [equation (78) not recoverable] The applied forces to the rim center consist of the axle forces and suspension forces: [equation not recoverable]
Classification based upon gene expression data: bias and precision of error rates.
Wood, Ian A; Visscher, Peter M; Mengersen, Kerrie L
2007-06-01
Gene expression data offer a large number of potentially useful predictors for the classification of tissue samples into classes, such as diseased and non-diseased. The predictive error rate of classifiers can be estimated using methods such as cross-validation. We have investigated issues of interpretation and potential bias in the reporting of error rate estimates. The issues considered here are optimization and selection biases, sampling effects, measures of misclassification rate, baseline error rates, two-level external cross-validation and a novel proposal for detection of bias using the permutation mean. Reporting an optimal estimated error rate incurs an optimization bias. Downward bias of 3-5% was found in an existing study of classification based on gene expression data and may be endemic in similar studies. Using a simulated non-informative dataset and two example datasets from existing studies, we show how bias can be detected through the use of label permutations and avoided using two-level external cross-validation. Some studies avoid optimization bias by using single-level cross-validation and a test set, but error rates can be more accurately estimated via two-level cross-validation. In addition to estimating the simple overall error rate, we recommend reporting class error rates plus where possible the conditional risk incorporating prior class probabilities and a misclassification cost matrix. We also describe baseline error rates derived from three trivial classifiers which ignore the predictors. R code which implements two-level external cross-validation with the PAMR package, experiment code, dataset details and additional figures are freely available for non-commercial use from http://www.maths.qut.edu.au/profiles/wood/permr.jsp
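The reporting recommendations in this abstract (class error rates, a conditional risk that combines prior class probabilities with a misclassification cost matrix, and a baseline from a trivial classifier) can be sketched as follows. This is an illustration under stated assumptions, not the authors' R code: the function names, the counts, the priors and the costs are all hypothetical, and only one possible trivial classifier (always predicting the largest class) is shown.

```python
def class_error_rates(confusion):
    """Per-class error rate from counts laid out as confusion[true][predicted]."""
    return [1 - row[i] / sum(row) for i, row in enumerate(confusion)]


def conditional_risk(confusion, priors, costs):
    """Expected cost: sum_i prior_i * sum_j P(pred = j | true = i) * costs[i][j]."""
    risk = 0.0
    for i, row in enumerate(confusion):
        n_i = sum(row)
        for j, count in enumerate(row):
            risk += priors[i] * (count / n_i) * costs[i][j]
    return risk


def majority_baseline_error(confusion):
    """Error rate of a trivial classifier that always predicts the largest class."""
    sizes = [sum(row) for row in confusion]
    return 1 - max(sizes) / sum(sizes)


# Hypothetical two-class example (NOT data from the paper).
confusion = [[40, 10],   # true class 0: 40 classified correctly, 10 wrongly
             [5, 45]]    # true class 1: 5 classified wrongly, 45 correctly
priors = [0.9, 0.1]      # assumed prior class probabilities
costs = [[0, 1],         # assumed cost of calling a class-0 sample class 1
         [5, 0]]         # assumed cost of calling a class-1 sample class 0

print(class_error_rates(confusion))
print(conditional_risk(confusion, priors, costs))
print(majority_baseline_error(confusion))
```

Comparing the estimated error rate with the baseline shows how much the predictors add over ignoring them entirely.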
Øverby, Nina Cecilie; Johannesen, Elisabeth; Jensen, Grete; Skjaevesland, Anne-Kirsti; Haugen, Margaretha
2014-01-01
Background: The assessment of food intake is challenging and prone to errors; it is therefore important to consider the reliability and validity of the assessment methods. Objective: The aim of this study was to analyze the reproducibility and validity of a developed food-frequency questionnaire (FFQ) for use among adolescents. Design: In total, 58 students (aged 13–14) from four different schools in the southern part of Norway participated in the reproducibility study of filling out the FFQ 4 weeks apart. In addition, 93 students participated in the relative validity study where the FFQ was compared to 2×24-hour dietary recalls, while 92 students participated in the absolute validity study where the intakes of fatty acids and vitamin D from the FFQ were compared to fatty acids and 25-hydroxy-vitamin D3 in whole blood. Results: The median Spearman correlation coefficient for all nutrients in the test–retest reliability study was 0.57. The median Spearman correlation for all nutrients in the relative validity study was 0.26, while the correlation coefficients were low in the absolute validity study with n-3 fatty acid coefficients ranging from 0.05 to 0.25, and absent for vitamin D (r=0.000). Conclusion: The test–retest reproducibility was considered good, the relative validity was considered poor to good, and the absolute validity was considered poor. However, the results are comparable to other studies among adolescents. PMID:25371661
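The Spearman coefficients quoted in this record are rank correlations. A minimal sketch of the computation is given below, assuming average ranks for ties; the function names and the intake values are hypothetical and do not come from the study.

```python
def average_ranks(values):
    """Rank values from 1..n, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    pos = 0
    while pos < len(order):
        end = pos
        # Extend the window over a run of tied values.
        while end + 1 < len(order) and values[order[end + 1]] == values[order[pos]]:
            end += 1
        mean_rank = (pos + end) / 2 + 1
        for k in range(pos, end + 1):
            ranks[order[k]] = mean_rank
        pos = end + 1
    return ranks


def pearson(x, y):
    """Pearson product-moment correlation of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sum((a - mx) ** 2 for a in x) ** 0.5
    sy = sum((b - my) ** 2 for b in y) ** 0.5
    return cov / (sx * sy)


def spearman(x, y):
    """Spearman correlation: Pearson correlation of the rank-transformed data."""
    return pearson(average_ranks(x), average_ranks(y))


# Hypothetical intakes of one nutrient for five students (NOT study data).
ffq = [12, 30, 25, 8, 40]       # from the questionnaire
recall = [15, 22, 28, 10, 35]   # from the 24-hour recalls
rho = spearman(ffq, recall)
print(f"rho = {rho:.2f}")
```

In a validation study of this kind, one such coefficient would be computed per nutrient and the median across nutrients reported.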
The scoring of arousal in sleep: reliability, validity, and alternatives.
Bonnet, Michael H; Doghramji, Karl; Roehrs, Timothy; Stepanski, Edward J; Sheldon, Stephen H; Walters, Arthur S; Wise, Merrill; Chesson, Andrew L
2007-03-15
The reliability and validity of EEG arousals and other types of arousal are reviewed. Brief arousals during sleep had been observed for many years, but the evolution of sleep medicine in the 1980s directed new attention to these events. Early studies at that time in animals and humans linked brief EEG arousals and associated fragmentation of sleep to daytime sleepiness and degraded performance. Increasing interest in scoring of EEG arousals led the ASDA to publish a scoring manual in 1992. The current review summarizes numerous studies that have examined scoring reliability for these EEG arousals. Validity of EEG arousals was explored by review of studies that empirically varied arousals and found deficits similar to those found after total sleep deprivation depending upon the rate and extent of sleep fragmentation. Additional data from patients with clinical sleep disorders prior to and after effective treatment has also shown a continuing relationship between reduction in pathology-related arousals and improved sleep and daytime function. Finally, many suggestions have been made to refine arousal scoring to include additional elements (e.g., CAP), change the time frame, or focus on other physiological responses such as heart rate or blood pressure changes. Evidence to support the reliability and validity of these measures is presented. It was concluded that the scoring of EEG arousals has added much to our understanding of the sleep process but that significant work on the neurophysiology of arousal needs to be done. Additional refinement of arousal scoring will provide improved insight into sleep pathology and recovery.
Wild, Diane; Furtado, Tamzin; Angalakuditi, Mallik
2012-01-01
Background: The Child Behavior Checklist (CBCL) is a caregiver rating scale for assessing the behavioral profile of children. It was developed in the US, and has been extensively translated and used in a large number of studies internationally. Objective: The objective of this study was to translate the CBCL into six languages using a rigorous translation methodology, placing particular emphasis on cultural adaptation and ensuring that the measure has content validity with carers of children with epilepsy. Methods: A rigorous translation and cultural adaptation methodology was used. This is a process which includes two forward translations, reconciliation, two back-translations, and cognitive debriefing interviews with five carers of children with epilepsy in each country. In addition, a series of open-ended questions were asked of the carers in order to provide evidence of content validity. Results: A number of cultural adaptations were made during the translation process. This included adaptations to the examples of sports and hobbies. An addition of “milk delivery” was made to the job examples in the Malayalam translation. In addition, two sexual problem items were removed from the Hebrew translation for Israel. Conclusion: An additional six translations of the CBCL are now available for use in multinational studies. These translations have evidence of content validity for use with parents of children with epilepsy and have been appropriately culturally adapted so that they are acceptable for use in the target countries. The study highlights the importance of a rigorous translation process and the process of cultural adaptation. PMID:22715318
ERIC Educational Resources Information Center
Lane, Kathleen Lynne; Oakes, Wendy P.; Harris, Pamela J.; Menzies, Holly Mariah; Cox, Meredith; Lambert, Warren
2012-01-01
We report findings of an exploratory validation study of a revised instrument: the Student Risk Screening Scale-Internalizing and Externalizing (SRSS-IE). The SRSS-IE was modified to include seven additional items reflecting characteristics of internalizing behaviors, with proposed items generated from the current literature base, review of…
ERIC Educational Resources Information Center
Merrell, Kenneth W.; Felver-Gant, Josh C.; Tom, Karalyn M.
2011-01-01
Based on the premises that strength-based assessment of children and adolescents is an important emerging area, and that additional tools for this purpose are needed, this study details development and validation efforts on a new strength-based assessment: the Social-Emotional Assets and Resilience Scale, parent form (SEARS-P). Following careful…
ERIC Educational Resources Information Center
Wu, Pei-Chen; Huang, Tsai-Wei
2010-01-01
This study applied the mixed Rasch model to investigate person heterogeneity of the Beck Depression Inventory-II-Chinese version (BDI-II-C) and its effects on dimensionality and construct validity. Person heterogeneity was reflected by two latent classes that differ qualitatively. Additionally, person heterogeneity adversely affected the…
ERIC Educational Resources Information Center
Flight, Ingrid H.; Wilson, Carlene J.; McGillivray, Jane; Myers, Ronald E.
2010-01-01
We investigated whether the five-factor structure of the Preventive Health Model for colorectal cancer screening, developed in the United States, has validity in Australia. We also tested extending the model with the addition of the factor Self-Efficacy to Screen using Fecal Occult Blood Test (SESFOBT). Randomly selected men and women aged between…
75 FR 25763 - Addition to the List of Validated End-Users: Advanced Micro Devices China, Inc.
Federal Register 2010, 2011, 2012, 2013, 2014
2010-05-10
.... Additional Validated End-User in the PRC and Its Respective ``Eligible Items (By ECCN)'' and ``Eligible... to the ``development'' of products under ECCN 4A003). This authorization was made based on an... Country Validated end-user Eligible items (by ECCN) Eligible destination China (People's Republic of...
Tsuno, Kanami; Kawakami, Norito; Shimazu, Akihito; Shimada, Kyoko; Inoue, Akiomi; Leiter, Michael P
2017-05-25
Although incivility is a common interpersonal mistreatment and associated with poor mental health, there are few studies about it in Asian countries. The aim of this study was to develop the Japanese version of the modified Work Incivility Scale (J-MWIS), investigate its reliability and validity, and reveal the prevalence of incivility among Japanese employees in comparison with data on Canadian employees. A total of 2,191 Japanese and 1,071 Canadian employees were surveyed, using either the J-MWIS or MWIS. Japanese employees additionally answered questions on civility, worksite social support, workplace bullying, psychological distress, intention to leave, and work engagement to investigate construct validity. At least one form of workplace incivility was experienced by both Japanese (52.3%) and Canadian (86.0%) employees in the previous month. Internal consistency reliability of the J-MWIS was acceptable (α=0.71-0.81), and correlation analyses also confirmed its construct validity as expected. Workplace incivility was associated with lower workgroup civility, lower supervisor and coworker support, higher workplace bullying, higher psychological distress, higher intention to leave, and lower work engagement. Confirmatory factor analyses showed that the original three-factor model (supervisor incivility, coworker incivility, and instigated incivility) fitted moderately in both Japan and Canada data, though the privacy/overfamiliarity factor was additionally extracted from exploratory factor analysis for the J-MWIS. The results of this study suggested that the J-MWIS has moderate internal consistency reliability and good construct validity.
Tsuno, Kanami; Kawakami, Norito; Shimazu, Akihito; Shimada, Kyoko; Inoue, Akiomi; Leiter, Michael P.
2017-01-01
Objectives: Although incivility is a common interpersonal mistreatment and associated with poor mental health, there are few studies about it in Asian countries. The aim of this study was to develop the Japanese version of the modified Work Incivility Scale (J-MWIS), investigate its reliability and validity, and reveal the prevalence of incivility among Japanese employees in comparison with data on Canadian employees. Methods: A total of 2,191 Japanese and 1,071 Canadian employees were surveyed, using either the J-MWIS or MWIS. Japanese employees additionally answered questions on civility, worksite social support, workplace bullying, psychological distress, intention to leave, and work engagement to investigate construct validity. Results: At least one form of workplace incivility was experienced by both Japanese (52.3%) and Canadian (86.0%) employees in the previous month. Internal consistency reliability of the J-MWIS was acceptable (α=0.71-0.81), and correlation analyses also confirmed its construct validity as expected. Workplace incivility was associated with lower workgroup civility, lower supervisor and coworker support, higher workplace bullying, higher psychological distress, higher intention to leave, and lower work engagement. Confirmatory factor analyses showed that the original three-factor model (supervisor incivility, coworker incivility, and instigated incivility) fitted moderately in both Japan and Canada data, though the privacy/overfamiliarity factor was additionally extracted from exploratory factor analysis for the J-MWIS. Conclusions: The results of this study suggested that the J-MWIS has moderate internal consistency reliability and good construct validity. PMID:28302927
Johnson, Sheena Joanne; Guediri, Sara M; Kilkenny, Caroline; Clough, Peter J
2011-12-01
This study developed and validated a virtual reality (VR) simulator for use by interventional radiologists. Research in the area of skill acquisition reports practice as essential to become a task expert. Studies on simulation show skills learned in VR can be successfully transferred to a real-world task. Recently, with improvements in technology, VR simulators have been developed to allow complex medical procedures to be practiced without risking the patient. Three studies are reported. In Study 1, 35 consultant interventional radiologists took part in a cognitive task analysis to empirically establish the key competencies of the Seldinger procedure. In Study 2, 62 participants performed one simulated procedure, and their performance was compared by expertise. In Study 3, the transferability of simulator training to a real-world procedure was assessed with 14 trainees. Study 1 produced 23 key competencies that were implemented as performance measures in the simulator. Study 2 showed the simulator had both face and construct validity, although some issues were identified. Study 3 showed the group that had undergone simulator training received significantly higher mean performance ratings on a subsequent patient procedure. The findings of this study support the centrality of validation in the successful design of simulators and show the utility of simulators as a training device. The studies show the key elements of a validation program for a simulator. In addition to task analysis and face and construct validities, the authors highlight the importance of transfer of training in validation studies.
Steele, John C; Clark, Hadleigh J; Hong, Catherine H L; Jurge, Sabine; Muthukrishnan, Arvind; Kerr, A Ross; Wray, David; Prescott-Clements, Linda; Felix, David H; Sollecito, Thomas P
2015-08-01
To explore international consensus for the validation of clinical competencies for advanced training in Oral Medicine. An electronic survey of clinical competencies was designed. The survey was sent to and completed by identified international stakeholders during a 10-week period. To be validated, an individual competency had to achieve 90% or greater consensus to keep it in its current format. Stakeholders from 31 countries responded. High consensus agreement was achieved with 93 of 101 (92%) competencies exceeding the benchmark for agreement. Only 8 warranted further attention and were reviewed by a focus group. No additional competencies were suggested. This is the first international validated study of clinical competencies for advanced training in Oral Medicine. These validated clinical competencies could provide a model for countries developing an advanced training curriculum for Oral Medicine and also inform review of existing curricula. Copyright © 2015 Elsevier Inc. All rights reserved.
Designing and validation of a yoga-based intervention for schizophrenia.
Govindaraj, Ramajayam; Varambally, Shivarama; Sharma, Manjunath; Gangadhar, Bangalore Nanjundaiah
2016-06-01
Schizophrenia is a chronic mental illness which causes significant distress and dysfunction. Yoga has been found to be effective as an add-on therapy in schizophrenia. Modules of yoga used in previous studies were based on individual researcher's experience. This study aimed to develop and validate a specific generic yoga-based intervention module for patients with schizophrenia. The study was conducted at NIMHANS Integrated Centre for Yoga (NICY). A yoga module was designed based on traditional and contemporary yoga literature as well as published studies. The yoga module along with three case vignettes of adult patients with schizophrenia was sent to 10 yoga experts for their validation. Experts (n = 10) gave their opinion on the usefulness of a yoga module for patients with schizophrenia with some modifications. In total, 87% (13 of 15 items) of the items in the initial module were retained, with modification in the remainder as suggested by the experts. A specific yoga-based module for schizophrenia was designed and validated by experts. Further studies are needed to confirm efficacy and clinical utility of the module. Additional clinical validation is suggested.
Forney, K Jean; Bodell, Lindsay P; Haedt-Matt, Alissa A; Keel, Pamela K
2016-07-01
Of the two primary features of binge eating, loss of control (LOC) eating is well validated while the role of eating episode size is less clear. Given the ICD-11 proposal to eliminate episode size from the binge-eating definition, the present study examined the incremental validity of the size criterion, controlling for LOC. Interview and questionnaire data come from four studies of 243 women with bulimia nervosa (n = 141) or purging disorder (n = 102). Hierarchical linear regression tested if the largest reported episode size, coded in kilocalories, explained additional variance in eating disorder features, psychopathology, personality traits, and impairment, holding constant LOC eating frequency, age, and body mass index (BMI). Analyses also tested if episode size moderated the association between LOC eating and these variables. Holding LOC constant, episode size explained significant variance in disinhibition, trait anxiety, and eating disorder-related impairment. Episode size moderated the association of LOC eating with purging frequency and depressive symptoms, such that in the presence of larger eating episodes, LOC eating was more closely associated with these features. Neither episode size nor its interaction with LOC explained additional variance in BMI, hunger, restraint, shape concerns, state anxiety, negative urgency, or global functioning. Taken together, results support the incremental validity of the size criterion, in addition to and in combination with LOC eating, for defining binge-eating episodes in purging syndromes. Future research should examine the predictive validity of episode size in both purging and nonpurging eating disorders (e.g., binge eating disorder) to inform nosological schemes. © 2016 Wiley Periodicals, Inc. (Int J Eat Disord 2016; 49:651-662).
Weiss, Maureen R; Bolter, Nicole D; Kipp, Lindsay E
2014-09-01
A signature characteristic of positive youth development (PYD) programs is the opportunity to develop life skills, such as social, behavioral, and moral competencies, that can be generalized to domains beyond the immediate activity. Although context-specific instruments are available to assess developmental outcomes, a measure of life skills transfer would enable evaluation of PYD programs in successfully teaching skills that youth report using in other domains. The purpose of our studies was to develop and validate a measure of perceived life skills transfer, based on data collected with The First Tee, a physical activity-based PYD program. In 3 studies, we conducted a series of steps to provide content and construct validity and internal consistency reliability for the Life Skills Transfer Survey (LSTS), a measure of perceived life skills transfer. Study 1 provided content validity for the LSTS that included 8 life skills and 50 items. Study 2 revealed construct validity (structural validity) through a confirmatory factor analysis and convergent validity by correlating scores on the LSTS with scores on an assessment tool that measures a related construct. Study 3 offered additional construct validity by reassessing youth 1 year later and showing that scores during both time periods were invariant in factor pattern, loadings, and variances and covariances. Studies 2 and 3 demonstrated internal consistency reliability of the LSTS. Results from 3 studies provide evidence of content and construct validity and internal consistency reliability for the LSTS, which can be used in evaluation research with youth development programs.
Preliminary validation of the Yale Food Addiction Scale.
Gearhardt, Ashley N; Corbin, William R; Brownell, Kelly D
2009-04-01
Previous research has found similarities between addiction to psychoactive substances and excessive food consumption. Further exploration is needed to evaluate the concept of "food addiction," as there is currently a lack of psychometrically validated measurement tools in this area. The current study represents a preliminary exploration of the Yale Food Addiction Scale (YFAS), designed to identify those exhibiting signs of addiction towards certain types of foods (e.g., high fat and high sugar). Survey data were collected from 353 respondents from a stratified random sample of young adults. In addition to the YFAS, the survey assessed eating pathology, alcohol consumption and other health behaviors. The YFAS exhibited adequate internal reliability, and showed good convergent validity with measures of similar constructs and good discriminant validity relative to related but dissimilar constructs. Additionally, the YFAS predicted binge-eating behavior above and beyond existing measures of eating pathology, demonstrating incremental validity. The YFAS is a sound tool for identifying eating patterns that are similar to behaviors seen in classic areas of addiction. Further evaluation of the scale is needed, especially due to a low response rate of 24.5% and a non-clinical sample, but confirmation of the reliability and validity of the scale has the potential to facilitate empirical research on the concept of "food addiction".
Papageorgiou, Charalabos; Rabavilas, Andreas D; Stachtea, Xanthy; Giannakakis, Giorgos A; Kyprianou, Miltiades; Papadimitriou, George N; Stefanis, Costas N
2012-04-01
The objective of this study was to investigate the link between the Eysenck Personality Questionnaire (EPQ) scores and depressive symptomatology with reasoning performance induced by a task including valid and invalid Aristotelian syllogisms. The EPQ and the Zung Depressive Scale (ZDS) were completed by 48 healthy subjects (27 male, 21 female) aged 33.5 ± 9.0 years. Additionally, the subjects engaged in two reasoning tasks (valid vs. invalid syllogisms). Analysis showed that the judgment of invalid syllogisms is a more difficult task than that of valid syllogisms (65.1% vs. 74.6% of correct judgments respectively, p < 0.01). In both conditions, the subjects' degree of confidence is significantly higher when they make a correct judgment than when they make an incorrect judgment (83.8 ± 11.2 vs. 75.3 ± 17.3, p < 0.01). Subjects with extraversion as measured by EPQ and high sexual desire as rated by the relative ZDS subscale are more prone to make incorrect judgments in the valid syllogisms, while, at the same time, they are more confident in their responses. The effects of extraversion/introversion and sexual desire on the outcome measures of the valid condition are not commutative but additive. These findings indicate that extraversion/introversion and sexual desire variations may have a detrimental effect on reasoning performance.
Chandrasekar, Vaishnavi; Janes, Dustin W; Saylor, David M; Hood, Alan; Bajaj, Akhil; Duncan, Timothy V; Zheng, Jiwen; Isayeva, Irada S; Forrey, Christopher; Casey, Brendan J
2018-01-01
A novel approach for rapid risk assessment of targeted leachables in medical device polymers is proposed and validated. Risk evaluation involves understanding the potential of these additives to migrate out of the polymer, and comparing their exposure to a toxicological threshold value. In this study, we propose that a simple diffusive transport model can be used to provide conservative exposure estimates for phase separated color additives in device polymers. This model has been illustrated using a representative phthalocyanine color additive (manganese phthalocyanine, MnPC) and polymer (PEBAX 2533) system. Sorption experiments of MnPC into PEBAX were conducted in order to experimentally determine the diffusion coefficient, D = (1.6 ± 0.5) × 10⁻¹¹ cm²/s, and matrix solubility limit, C_s = 0.089 wt.%, and model predicted exposure values were validated by extraction experiments. Exposure values for the color additive were compared to a toxicological threshold for a sample risk assessment. Results from this study indicate that a diffusion model-based approach to predict exposure has considerable potential for use as a rapid, screening-level tool to assess the risk of color additives and other small molecule additives in medical device polymers.
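To give a feel for the magnitudes involved, here is a rough sketch of one textbook diffusive release estimate using the two parameters reported above. The abstract does not specify the authors' model, so everything else is an assumption: Fickian diffusion out of a semi-infinite slab, dissolved additive initially at the solubility limit, a perfect sink at the surface, and a placeholder polymer density of 1.0 g/cm³. Under those assumptions the mass released per unit area is 2·C·√(D·t/π).

```python
import math

D_CM2_PER_S = 1.6e-11       # diffusion coefficient reported in the abstract
CS_MASS_FRACTION = 0.089e-2  # solubility limit, 0.089 wt.% expressed as a fraction


def released_mass_per_area(t_seconds, density_g_per_cm3):
    """Mass released per unit surface area (g/cm^2) after t_seconds.

    Assumed model (not necessarily the authors'): semi-infinite slab,
    uniform initial dissolved concentration at the solubility limit,
    zero concentration at the surface: M/A = 2 * C * sqrt(D * t / pi).
    """
    c_dissolved = CS_MASS_FRACTION * density_g_per_cm3  # g additive per cm^3
    return 2.0 * c_dissolved * math.sqrt(D_CM2_PER_S * t_seconds / math.pi)


one_day = 24 * 3600
# The density value is a placeholder, not a property taken from the abstract.
m_day = released_mass_per_area(one_day, density_g_per_cm3=1.0)
print(f"released after one day: {m_day:.2e} g/cm^2")
```

Multiplying the result by the exposed device surface area would give a release estimate that could then be compared with a toxicological threshold, which is the kind of screening comparison the abstract describes.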
Design, development, testing and validation of a Photonics Virtual Laboratory for the study of LEDs
NASA Astrophysics Data System (ADS)
Naranjo, Francisco L.; Martínez, Guadalupe; Pérez, Ángel L.; Pardo, Pedro J.
2014-07-01
This work presents the design, development, testing and validation of a Photonics Virtual Laboratory, highlighting the study of LEDs. The study was conducted from a conceptual, experimental and didactic standpoint, using e-learning and m-learning platforms. Specifically, teaching tools that help students achieve meaningful learning have been developed. The scientific aspect, such as the study of LEDs, has been brought together with techniques for generating and transferring knowledge through the selection, hierarchization and structuring of information using concept maps. To validate the didactic materials developed, procedures with various assessment tools were used for the collection and processing of data, applied in the context of an experimental design. Additionally, a statistical analysis was performed to determine the validity of the materials developed. The assessment was designed to validate the contributions of the new materials over the traditional method of teaching, and to quantify the learning achieved by students, in order to draw conclusions that serve as a reference for their application in teaching and learning processes, and to comprehensively validate the work carried out.
Muse, Kate; McManus, Freda; Rakovshik, Sarah; Thwaites, Richard
2017-05-01
This article outlines the development and psychometric evaluation of the Assessment of Core CBT Skills (ACCS) rating scale. The ACCS aims to provide a novel assessment framework to deliver formative and summative feedback regarding therapists' performance within observed cognitive-behavioral treatment sessions, and for therapists to rate and reflect on their own performance. Findings from 3 studies are outlined: (a) a feedback study (n = 66) examining content validity, face validity and usability; (b) a focus group (n = 9) evaluating usability and utility; and (c) an evaluation of the psychometric properties of the ACCS in real world cognitive behavioral therapy (CBT) training and routine clinical practice contexts. Results suggest that the ACCS has good face validity, content validity, and usability and provides a user-friendly tool that is useful for promoting self-reflection and providing formative feedback. Scores on both the self and assessor-rated versions of the ACCS demonstrate good internal consistency, interrater reliability, and discriminant validity. In addition, ACCS scores were found to be correlated with, but distinct from, the Revised Cognitive Therapy Scale (CTS-R) and were comparable to CTS-R scores in terms of internal consistency and discriminant validity. In addition, the ACCS may have advantages over the CTS-R in terms of interrater reliability of scores. The studies also provided insight into areas for refinement and a number of modifications were undertaken to improve the scale. In summary, the ACCS is an appropriate and useful measure of CBT competence that can be used to promote self-reflection and provide therapists with formative and summative feedback. (PsycINFO Database Record (c) 2017 APA, all rights reserved).
Cern, Ahuva; Barenholz, Yechezkel; Tropsha, Alexander; Goldblum, Amiram
2014-01-10
Previously we have developed and statistically validated Quantitative Structure Property Relationship (QSPR) models that correlate drugs' structural, physical and chemical properties as well as experimental conditions with the relative efficiency of remote loading of drugs into liposomes (Cern et al., J. Control. Release 160 (2012) 147-157). Herein, these models have been used to virtually screen a large drug database to identify novel candidate molecules for liposomal drug delivery. Computational hits were considered for experimental validation based on their predicted remote loading efficiency as well as additional considerations such as availability, recommended dose and relevance to the disease. Three compounds were selected for experimental testing which were confirmed to be correctly classified by our previously reported QSPR models developed with Iterative Stochastic Elimination (ISE) and k-Nearest Neighbors (kNN) approaches. In addition, 10 new molecules with known liposome remote loading efficiency that were not used by us in QSPR model development were identified in the published literature and employed as an additional model validation set. The external accuracy of the models was found to be as high as 82% or 92%, depending on the model. This study presents the first successful application of QSPR models for the computer-model-driven design of liposomal drugs. © 2013.
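The abstract reports external accuracy for classifiers that include a k-Nearest Neighbors (kNN) model. As a generic illustration of that idea (not the authors' QSPR models, descriptors, or data), the sketch below classifies hypothetical two-descriptor "molecules" by majority vote among the k closest training points and scores the result on a separate external set.

```python
import math
from collections import Counter


def knn_predict(train, query, k=3):
    """Majority label among the k training points closest to query.

    train is a list of (feature_vector, label) pairs.
    """
    nearest = sorted(train, key=lambda item: math.dist(item[0], query))[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]


def external_accuracy(train, external, k=3):
    """Fraction of external-set items whose predicted label matches the known one."""
    hits = sum(knn_predict(train, x, k) == y for x, y in external)
    return hits / len(external)


# Hypothetical descriptor values and loading classes (NOT data from the paper).
train = [
    ((0.10, 0.20), "low"), ((0.20, 0.10), "low"), ((0.15, 0.30), "low"),
    ((0.90, 0.80), "high"), ((0.80, 0.90), "high"), ((0.85, 0.70), "high"),
]
external = [
    ((0.20, 0.20), "low"), ((0.80, 0.80), "high"), ((0.30, 0.10), "low"),
    ((0.70, 0.90), "high"), ((0.10, 0.10), "high"),  # last item is misclassified
]
acc = external_accuracy(train, external)
print(f"external accuracy = {acc:.0%}")
```

The key point is that the external set plays no part in building the model, so its accuracy is an estimate of performance on molecules the model has not seen.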
Cern, Ahuva; Barenholz, Yechezkel; Tropsha, Alexander; Goldblum, Amiram
2014-01-01
Previously we have developed and statistically validated Quantitative Structure Property Relationship (QSPR) models that correlate drugs’ structural, physical and chemical properties as well as experimental conditions with the relative efficiency of remote loading of drugs into liposomes (Cern et al., Journal of Controlled Release, 160 (2012) 147–157). Herein, these models have been used to virtually screen a large drug database to identify novel candidate molecules for liposomal drug delivery. Computational hits were considered for experimental validation based on their predicted remote loading efficiency as well as additional considerations such as availability, recommended dose and relevance to the disease. Three compounds were selected for experimental testing which were confirmed to be correctly classified by our previously reported QSPR models developed with Iterative Stochastic Elimination (ISE) and k-nearest neighbors (kNN) approaches. In addition, 10 new molecules with known liposome remote loading efficiency that were not used in QSPR model development were identified in the published literature and employed as an additional model validation set. The external accuracy of the models was found to be as high as 82% or 92%, depending on the model. This study presents the first successful application of QSPR models for the computer-model-driven design of liposomal drugs. PMID:24184343
Extended version of the "Sniffin' Sticks" identification test: test-retest reliability and validity.
Sorokowska, A; Albrecht, E; Haehner, A; Hummel, T
2015-03-30
The extended, 32-item version of the Sniffin' Sticks identification test was developed in order to create a precise tool enabling repeated, longitudinal testing of individual olfactory subfunctions. Odors of the previous test version had to be changed for technical reasons, and the odor identification test needed re-investigation in terms of reliability, validity, and normative values. In our study we investigated olfactory abilities of a group of 100 patients with olfactory dysfunction and 100 controls. We reconfirmed the high test-retest reliability of the extended version of the Sniffin' Sticks identification test and high correlations between the new and the original part of this tool. In addition, we confirmed the validity of the test as it discriminated clearly between controls and patients with olfactory loss. The additional set of 16 odor identification sticks can either be included in the current olfactory test, thus creating a more detailed diagnostic tool, or be used separately, enabling olfactory function to be followed over time. Additionally, the normative values presented in our paper might provide useful guidelines for interpretation of the extended identification test results. The revised version of the Sniffin' Sticks 32-item odor identification test is a reliable and valid tool for the assessment of olfactory function. Copyright © 2015 Elsevier B.V. All rights reserved.
ERIC Educational Resources Information Center
Lane, Kathleen Lynne; Oakes, Wendy Peia; Carter, Erik W.; Lambert, Warren E.; Jenkins, Abbie B.
2013-01-01
We reported findings of an exploratory validation study of a revised universal screening instrument: the Student Risk Screening Scale--Internalizing and Externalizing (SRSS-IE) for use with middle school students. Tested initially for use with elementary-age students, the SRSS-IE was adapted to include seven additional items reflecting…
ERIC Educational Resources Information Center
Lane, Kathleen Lynne; Menzies, Holly M.; Oakes, Wendy P.; Lambert, Warren; Cox, Meredith; Hankins, Katy
2012-01-01
We report findings of two studies, one conducted in a rural school district (N = 982) and a second conducted in an urban district (N = 1,079), offering additional evidence of the reliability and validity of a revised instrument, the Student Risk Screening Scale-Internalizing and Externalizing (SRSS-IE), to accurately detect internalizing and…
ERIC Educational Resources Information Center
Muyskens, Paul; Betts, Joseph; Lau, Matthew Y.; Marston, Doug
2009-01-01
The inclusion of English Language Learners as a subgroup in the No Child Left Behind legislation has lent additional importance to the need for valid and efficient measures of reading for students whose first language is not English. This study examines the use of Curriculum-Based Measurement (CBM) reading fluency as a predictor of later reading…
ERIC Educational Resources Information Center
Shriberg, Lawrence D.; Strand, Edythe A.; Fourakis, Marios; Jakielski, Kathy J.; Hall, Sheryl D.; Karlsson, Heather B.; Mabie, Heather L.; McSweeny, Jane L.; Tilkens, Christie M.; Wilson, David L.
2017-01-01
Purpose: The purpose of this 2nd article in this supplement is to report validity support findings for the Pause Marker (PM), a proposed single-sign diagnostic marker of childhood apraxia of speech (CAS). Method: PM scores and additional perceptual and acoustic measures were obtained from 296 participants in cohorts with idiopathic and…
Prognostic Biomarkers Used for Localised Prostate Cancer Management: A Systematic Review.
Lamy, Pierre-Jean; Allory, Yves; Gauchez, Anne-Sophie; Asselain, Bernard; Beuzeboc, Philippe; de Cremoux, Patricia; Fontugne, Jacqueline; Georges, Agnès; Hennequin, Christophe; Lehmann-Che, Jacqueline; Massard, Christophe; Millet, Ingrid; Murez, Thibaut; Schlageter, Marie-Hélène; Rouvière, Olivier; Kassab-Chahmi, Diana; Rozet, François; Descotes, Jean-Luc; Rébillard, Xavier
2017-03-07
Prostate cancer stratification is based on tumour size, pretreatment PSA level, and Gleason score, but it remains imperfect. Current research focuses on the discovery and validation of novel prognostic biomarkers to improve the identification of patients at risk of aggressive cancer or of tumour relapse. This systematic review by the Intergroupe Coopérateur Francophone de Recherche en Onco-urologie (ICFuro) analysed new evidence on the analytical validity and clinical validity and utility of six prognostic biomarkers (PHI, 4Kscore, MiPS, GPS, Prolaris, Decipher). All available data for the six biomarkers published between January 2002 and April 2015 were systematically searched and reviewed. The main endpoints were aggressive prostate cancer prediction, additional value compared to classical prognostic parameters, and clinical benefit for patients with localised prostate cancer. The preanalytical and analytical validations were heterogeneous for all tests and often not adequate for the molecular signatures. Each biomarker was studied for specific indications (candidates for a first or second biopsy, and potential candidates for active surveillance, radical prostatectomy, or adjuvant treatment) for which the level of evidence (LOE) was variable. PHI and 4Kscore were the biomarkers with the highest LOE for discriminating aggressive and indolent tumours in different indications. Blood biomarkers (PHI and 4Kscore) have the highest LOE for the prediction of more aggressive prostate cancer and could help clinicians to manage patients with localised prostate cancer. The other biomarkers show a potential prognostic value; however, they should be evaluated in additional studies to confirm their clinical validity. We reviewed studies assessing the value of six prognostic biomarkers for prostate cancer. 
On the basis of the available evidence, some biomarkers could help in discriminating between aggressive and non-aggressive tumours with an additional value compared to the prognostic parameters currently used by clinicians. Copyright © 2017 European Association of Urology. Published by Elsevier B.V. All rights reserved.
Moreno-Murcia, Juan A; Martínez-Galindo, Celestina; Moreno-Pérez, Víctor; Marcos, Pablo J.; Borges, Fernanda
2012-01-01
This study aimed to cross-validate the psychometric properties of the Basic Psychological Needs in Exercise Scale (BPNES; Vlachopoulos and Michailidou, 2006) in a Spanish context. Two studies were conducted. Confirmatory factor analysis results confirmed the hypothesized three-factor solution. In addition, we documented evidence of reliability, analysed as internal consistency and temporal stability. Future studies should analyse the scale's validity and reliability with different populations and check their experimental effect. Key points: The Basic Psychological Needs in Exercise Scale (BPNES) is valid and reliable for measuring basic psychological needs in healthy physical exercise in the Spanish context. The factor structure of three correlated factors has shown minimal invariance across gender. PMID:24149130
Van de Weijer-Bergsma, Eva; Kroesbergen, Evelyn H; Prast, Emilie J; Van Luit, Johannes E H
2015-09-01
Working memory is an important predictor of academic performance, and of math performance in particular. Most working memory tasks depend on one-to-one administration by a testing assistant, which makes the use of such tasks in large-scale studies time-consuming and costly. Therefore, an online, self-reliant visual-spatial working memory task (the Lion game) was developed for primary school children (6-12 years of age). In two studies, the validity and reliability of the Lion game were investigated. The results from Study 1 (n = 442) indicated satisfactory six-week test-retest reliability, excellent internal consistency, and good concurrent and predictive validity. The results from Study 2 (n = 5,059) confirmed the results on the internal consistency and predictive validity of the Lion game. In addition, multilevel analysis revealed that classroom membership influenced Lion game scores. We concluded that the Lion game is a valid and reliable instrument for the online computerized and self-reliant measurement of visual-spatial working memory (i.e., updating).
ERIC Educational Resources Information Center
Deliyianni, Eleni; Gagatsis, Athanasios; Elia, Iliada; Panaoura, Areti
2016-01-01
The aim of this study was to propose and validate a structural model in fraction and decimal number addition, which is founded primarily on a synthesis of major theoretical approaches in the field of representations in Mathematics and also on previous research on the learning of fractions and decimals. The study was conducted among 1,701 primary…
Liao, Katherine P; Ananthakrishnan, Ashwin N; Kumar, Vishesh; Xia, Zongqi; Cagan, Andrew; Gainer, Vivian S; Goryachev, Sergey; Chen, Pei; Savova, Guergana K; Agniel, Denis; Churchill, Susanne; Lee, Jaeyoung; Murphy, Shawn N; Plenge, Robert M; Szolovits, Peter; Kohane, Isaac; Shaw, Stanley Y; Karlson, Elizabeth W; Cai, Tianxi
2015-01-01
Typically, algorithms to classify phenotypes using electronic medical record (EMR) data were developed to perform well in a specific patient population. There is increasing interest in analyses which can allow study of a specific outcome across different diseases. Such a study in the EMR would require an algorithm that can be applied across different patient populations. Our objectives were: (1) to develop an algorithm that would enable the study of coronary artery disease (CAD) across diverse patient populations; (2) to study the impact of adding narrative data extracted using natural language processing (NLP) in the algorithm. Additionally, we demonstrate how to implement the CAD algorithm to compare risk across 3 chronic diseases in a preliminary study. We studied 3 established EMR-based patient cohorts: diabetes mellitus (DM, n = 65,099), inflammatory bowel disease (IBD, n = 10,974), and rheumatoid arthritis (RA, n = 4,453) from two large academic centers. We developed a CAD algorithm using NLP in addition to structured data (e.g. ICD9 codes) in the RA cohort and validated it in the DM and IBD cohorts. The CAD algorithm using NLP in addition to structured data achieved specificity >95% with a positive predictive value (PPV) 90% in the training (RA) and validation sets (IBD and DM). The addition of NLP data improved the sensitivity for all cohorts, classifying an additional 17% of CAD subjects in IBD and 10% in DM while maintaining PPV of 90%. The algorithm classified 16,488 DM (26.1%), 457 IBD (4.2%), and 245 RA (5.0%) with CAD. In a cross-sectional analysis, CAD risk was 63% lower in RA and 68% lower in IBD compared to DM (p<0.0001) after adjusting for traditional cardiovascular risk factors. We developed and validated a CAD algorithm that performed well across diverse patient populations. The addition of NLP into the CAD algorithm improved the sensitivity of the algorithm, particularly in cohorts where the prevalence of CAD was low. 
Preliminary data suggest that CAD risk was significantly lower in RA and IBD compared to DM.
Liao, Katherine P.; Ananthakrishnan, Ashwin N.; Kumar, Vishesh; Xia, Zongqi; Cagan, Andrew; Gainer, Vivian S.; Goryachev, Sergey; Chen, Pei; Savova, Guergana K.; Agniel, Denis; Churchill, Susanne; Lee, Jaeyoung; Murphy, Shawn N.; Plenge, Robert M.; Szolovits, Peter; Kohane, Isaac; Shaw, Stanley Y.; Karlson, Elizabeth W.; Cai, Tianxi
2015-01-01
Background: Typically, algorithms to classify phenotypes using electronic medical record (EMR) data were developed to perform well in a specific patient population. There is increasing interest in analyses which can allow study of a specific outcome across different diseases. Such a study in the EMR would require an algorithm that can be applied across different patient populations. Our objectives were: (1) to develop an algorithm that would enable the study of coronary artery disease (CAD) across diverse patient populations; (2) to study the impact of adding narrative data extracted using natural language processing (NLP) in the algorithm. Additionally, we demonstrate how to implement the CAD algorithm to compare risk across 3 chronic diseases in a preliminary study. Methods and Results: We studied 3 established EMR-based patient cohorts: diabetes mellitus (DM, n = 65,099), inflammatory bowel disease (IBD, n = 10,974), and rheumatoid arthritis (RA, n = 4,453) from two large academic centers. We developed a CAD algorithm using NLP in addition to structured data (e.g. ICD9 codes) in the RA cohort and validated it in the DM and IBD cohorts. The CAD algorithm using NLP in addition to structured data achieved specificity >95% with a positive predictive value (PPV) 90% in the training (RA) and validation sets (IBD and DM). The addition of NLP data improved the sensitivity for all cohorts, classifying an additional 17% of CAD subjects in IBD and 10% in DM while maintaining PPV of 90%. The algorithm classified 16,488 DM (26.1%), 457 IBD (4.2%), and 245 RA (5.0%) with CAD. In a cross-sectional analysis, CAD risk was 63% lower in RA and 68% lower in IBD compared to DM (p<0.0001) after adjusting for traditional cardiovascular risk factors. Conclusions: We developed and validated a CAD algorithm that performed well across diverse patient populations. 
The addition of NLP into the CAD algorithm improved the sensitivity of the algorithm, particularly in cohorts where the prevalence of CAD was low. Preliminary data suggest that CAD risk was significantly lower in RA and IBD compared to DM. PMID:26301417
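The sensitivity, specificity, and positive predictive value reported in the record above all derive from a confusion matrix of algorithm output against chart-reviewed truth. The following is a minimal sketch of those standard definitions; the function name and the counts are hypothetical and are not data from the study.

```python
def classification_metrics(tp, fp, tn, fn):
    """Return sensitivity, specificity, and PPV from confusion-matrix counts.

    tp/fp/tn/fn are true positives, false positives, true negatives,
    and false negatives of a classifier judged against a gold standard.
    """
    if tp + fn == 0 or tn + fp == 0 or tp + fp == 0:
        raise ValueError("each denominator needs at least one case")
    return {
        "sensitivity": tp / (tp + fn),  # share of true cases that were detected
        "specificity": tn / (tn + fp),  # share of non-cases correctly excluded
        "ppv": tp / (tp + fp),          # share of flagged cases that are true cases
    }


# Hypothetical counts for illustration only; not taken from the abstract.
metrics = classification_metrics(tp=90, fp=10, tn=880, fn=20)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

Holding PPV fixed while sensitivity rises, as the abstract describes for the addition of NLP data, corresponds to finding more true cases without a proportional rise in false positives.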
DOT National Transportation Integrated Search
1979-04-01
The study assesses the impact of the consumer misconceptions, the lack of a seat belt standard, corporate attitudes towards safety, and the role of safety advocates. In addition, the study assesses the validity of the oft-stated hypothesis "safety do...
Disentangling Wording and Substantive Factors in the Spiritual Well-Being Scale.
Murray, Aja L; Johnson, Wendy; Gow, Alan J; Deary, Ian J
2015-05-01
We evaluated the extent to which the Spiritual Well-Being Scale (SWBS) may help to meet the need for multidimensional, psychometrically sophisticated measures of spiritual and religious traits. Although the various forms of validity of the scale have, for the most part, been supported by psychometric studies, conflicting evidence surrounding its dimensionality has called into question its structural validity. Specifically, numerous authors have suggested that a more appropriate factor structure for the SWBS includes further substantive factors in addition to the 2 factors that the scale was originally intended to measure. In the current study, we attempted to resolve these debates using a combination of exploratory and confirmatory factor analysis-based investigations in the Lothian Birth Cohort 1921 study. Our analyses suggested that the additional factors proposed in previous studies may not have reflected substantive constructs but rather common variance due to methodological factors.
Kirsch, Monika; Mitchell, Sandra A; Dobbels, Fabienne; Stussi, Georg; Basch, Ethan; Halter, Jorg P; De Geest, Sabina
2015-02-01
The aim of this sequential mixed methods study was to develop a PRO-CTCAE (Patient-Reported Outcomes version of the Common Terminology Criteria for Adverse Events)-based measure of the symptom experience of late effects in German-speaking long-term survivors of allogeneic stem cell transplantation (SCT), and to examine its content validity. The US National Cancer Institute's PRO-CTCAE item library was translated into German and linguistically validated. PRO-CTCAE symptoms prevalent in ≥50% of survivors (n = 15) and recognized as important by SCT experts (n = 9) were identified. Additional concepts relevant to the symptom experience and its consequences were elicited. Content validity of the PROVIVO (Patient-Reported Outcomes of long-term survivors after allogeneic SCT) instrument was assessed through an additional round of cognitive debriefing in 15 patients, and item and scale content validity indices by 9 experts. PROVIVO comprises 49 items capturing the experience of physical, emotional and cognitive symptoms. To improve the instrument's utility for clinical decision-making, questions soliciting limitations in activities of daily living, frequent infections, and overall well-being were added. Cognitive debriefings demonstrated that items were well understood and relevant to the SCT survivor experience. Scale Content Validity Index (CVI) (0.94) and item CVI (median = 1; range 0.75-1) were very high. Qualitative and quantitative data provide preliminary evidence supporting the content validity of PROVIVO and identify a PRO-CTCAE item bundle for use in SCT survivors. A study to evaluate the measurement properties of PROVIVO and to examine its capacity to improve survivorship care planning is underway. Copyright © 2014 Elsevier Ltd. All rights reserved.
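The item and scale content validity indices mentioned in the record above can be illustrated with a short sketch. It assumes the commonly used convention in which experts rate each item on a 4-point relevance scale, ratings of 3 or 4 count as relevant, and the scale CVI is the average of the item CVIs; the abstract does not state which convention the authors applied, and the ratings below are hypothetical.

```python
def item_cvi(ratings, relevant_threshold=3):
    """Proportion of experts rating the item at or above the threshold."""
    return sum(1 for r in ratings if r >= relevant_threshold) / len(ratings)


def scale_cvi_average(rating_matrix, relevant_threshold=3):
    """Average of the item CVIs (the 'S-CVI/Ave' convention, assumed here).

    rating_matrix holds one list of expert ratings per item.
    Returns (scale_cvi, list_of_item_cvis).
    """
    item_values = [item_cvi(item, relevant_threshold) for item in rating_matrix]
    return sum(item_values) / len(item_values), item_values


# Hypothetical ratings: 3 items, each rated by 4 experts on a 1-4 scale.
expert_ratings = [
    [4, 4, 3, 4],  # every expert finds the item relevant
    [4, 3, 2, 4],  # one expert rates the item below the threshold
    [3, 4, 4, 3],
]
scale_cvi, item_cvis = scale_cvi_average(expert_ratings)
print(item_cvis, round(scale_cvi, 3))
```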
49 CFR 1522.111 - Reconsideration of disapproval of an application.
Code of Federal Regulations, 2010 CFR
2010-10-01
... TRANSPORTATION TSA-APPROVED VALIDATION FIRMS AND VALIDATORS TSA-Approved Validation Firms and Validators for the... additional information from the applicant prior to rendering a decision. This disposition is a final agency...
Tavoli, Azadeh; Melyani, Mahdiyeh; Bakhtiari, Maryam; Ghaedi, Gholam Hossein; Montazeri, Ali
2009-01-01
Background: The Brief Fear of Negative Evaluation Scale (BFNE) is a commonly used instrument to measure social anxiety. This study aimed to translate and to test the reliability and validity of the BFNE in Iran. Methods: The English language version of the BFNE was translated into Persian (Iranian language) and was used in this study. The questionnaire was administered to a consecutive sample of 235 students with (n = 33, clinical group) and without social phobia (n = 202, non-clinical group). In addition to the BFNE, two standard instruments were used to measure social phobia severity: the Social Phobia Inventory (SPIN), and the Social Interaction Anxiety Scale (SIAS). All participants completed a brief background information questionnaire, the SPIN, the SIAS and the BFNE scales. Statistical analysis was performed to test the reliability and validity of the BFNE. Results: In all, 235 students were studied (111 male and 124 female). The mean age for the non-clinical group was 22.2 (SD = 2.1) years and for the clinical sample it was 22.4 (SD = 1.8) years. Cronbach's alpha coefficient (to test reliability) was acceptable for both non-clinical and clinical samples (α = 0.90 and 0.82 respectively). In addition, 3-week test-retest reliability was assessed in the non-clinical sample and the intraclass correlation coefficient (ICC) was quite high (ICC = 0.71). Validity, assessed as convergent and discriminant validity, was satisfactory. The questionnaire correlated well with established measures of social phobia such as the SPIN (r = 0.43, p < 0.001) and the SIAS (r = 0.54, p < 0.001). The BFNE also discriminated well between men and women with and without social phobia in the expected direction. Factor analysis supported a two-factor solution corresponding to positive and reverse-worded items. Conclusion: This validation study of the Iranian version of the BFNE showed that it is an acceptable, reliable and valid measure of social phobia. 
However, since the scale showed a two-factor structure and this does not conform to the theoretical basis for the BFNE, we suggest the use of the BFNE-II when it becomes available in Iran. The validation study of the BFNE-II is in progress. PMID:19589161
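Cronbach's alpha, the internal-consistency statistic reported in the record above, is computed from the item variances and the variance of the summed score: alpha = k/(k-1) × (1 − Σ item variances / variance of totals), where k is the number of items. A minimal sketch of that formula follows; the response matrix is hypothetical and is not BFNE data.

```python
from statistics import variance


def cronbach_alpha(responses):
    """Cronbach's alpha for a respondents-by-items matrix of scores.

    responses: list of rows, one per respondent, each a list of item scores.
    Uses sample variances (n - 1 denominator) throughout.
    """
    n_items = len(responses[0])
    if n_items < 2:
        raise ValueError("alpha needs at least two items")
    # Variance of each item across respondents (column-wise).
    item_variances = [variance([row[j] for row in responses]) for j in range(n_items)]
    # Variance of each respondent's total score.
    total_variance = variance([sum(row) for row in responses])
    return (n_items / (n_items - 1)) * (1 - sum(item_variances) / total_variance)


# Hypothetical scores: 5 respondents answering 3 items.
scores = [
    [3, 4, 3],
    [2, 2, 3],
    [4, 5, 4],
    [1, 2, 2],
    [3, 3, 4],
]
alpha = cronbach_alpha(scores)
print(round(alpha, 3))
```

Values closer to 1 indicate that the items vary together; the thresholds used to call a value "acceptable" are conventions rather than part of the formula.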
Rizvi, Sakina J; Quilty, Lena C; Sproule, Beth A; Cyriac, Anna; Michael Bagby, R; Kennedy, Sidney H
2015-09-30
Anhedonia, a core symptom of Major Depressive Disorder (MDD), is predictive of antidepressant non-response. In contrast to the definition of anhedonia as a "loss of pleasure", neuropsychological studies provide evidence for multiple facets of hedonic function. The aim of the current study was to develop and validate the Dimensional Anhedonia Rating Scale (DARS), a dynamic scale that measures desire, motivation, effort and consummatory pleasure across hedonic domains. Following item selection procedures and reliability testing using data from community participants (N=229) (Study 1), the 17-item scale was validated in an online study with community participants (N=150) (Study 2). The DARS was also validated in unipolar or bipolar depressed patients (n=52) and controls (n=50) (Study 3). Principal components analysis of the 17-item DARS revealed a 4-component structure mapping onto the domains of anhedonia: hobbies, food/drink, social activities, and sensory experience. Reliability of the DARS subscales was high across studies (Cronbach's α=0.75-0.92). The DARS also demonstrated good convergent and divergent validity. Hierarchical regression analysis revealed the DARS showed additional utility over the Snaith-Hamilton Pleasure Scale (SHAPS) in predicting reward function and distinguishing MDD subgroups. These studies provide support for the reliability and validity of the DARS. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
Maier, Jürgen; Hampe, J Felix; Jahn, Nico
2016-01-01
Real-time response (RTR) measurement is an important technique for analyzing human processing of electronic media stimuli. Although it has been demonstrated that RTR data are reliable and internally valid, some argue that they lack external validity. The reason for this is that RTR measurement is restricted to a laboratory environment due to its technical requirements. This paper introduces a smartphone app that 1) captures real-time responses using the dial technique and 2) provides a solution for one of the most important problems in RTR measurement, the (automatic) synchronization of RTR data. In addition, it explores the reliability and validity of mobile RTR measurement by comparing the real-time reactions of two samples of young and well-educated voters to the 2013 German televised debate. Whereas the first sample participated in a classical laboratory study, the second sample was equipped with our mobile RTR system and watched the debate at home. Results indicate that the mobile RTR system yields similar results to the lab-based RTR measurement, providing evidence that laboratory studies using RTR are externally valid. In particular, the argument that the artificial reception situation creates artificial results has to be questioned. In addition, we conclude that RTR measurement outside the lab is possible. Hence, mobile RTR opens the door for large-scale studies to better understand the processing and impact of electronic media content.
Maier, Jürgen; Hampe, J. Felix; Jahn, Nico
2016-01-01
Real-time response (RTR) measurement is an important technique for analyzing human processing of electronic media stimuli. Although it has been demonstrated that RTR data are reliable and internally valid, some argue that they lack external validity. The reason for this is that RTR measurement is restricted to a laboratory environment due to its technical requirements. This paper introduces a smartphone app that 1) captures real-time responses using the dial technique and 2) provides a solution for one of the most important problems in RTR measurement, the (automatic) synchronization of RTR data. In addition, it explores the reliability and validity of mobile RTR measurement by comparing the real-time reactions of two samples of young and well-educated voters to the 2013 German televised debate. Whereas the first sample participated in a classical laboratory study, the second sample was equipped with our mobile RTR system and watched the debate at home. Results indicate that the mobile RTR system yields similar results to the lab-based RTR measurement, providing evidence that laboratory studies using RTR are externally valid. In particular, the argument that the artificial reception situation creates artificial results has to be questioned. In addition, we conclude that RTR measurement outside the lab is possible. Hence, mobile RTR opens the door for large-scale studies to better understand the processing and impact of electronic media content. PMID:27274577
Anxiety measures validated in perinatal populations: a systematic review.
Meades, Rose; Ayers, Susan
2011-09-01
Research and screening of anxiety in the perinatal period is hampered by a lack of psychometric data on self-report anxiety measures used in perinatal populations. This paper aimed to review self-report measures that have been validated with perinatal women. A systematic search was carried out of four electronic databases. Additional papers were obtained through searching identified articles. Thirty studies were identified that reported validation of an anxiety measure with perinatal women. Most commonly validated self-report measures were the General Health Questionnaire (GHQ), State-Trait Anxiety Inventory (STAI), and Hospital Anxiety and Depression Scales (HADS). Of the 30 studies included, 11 used a clinical interview to provide criterion validity. Remaining studies reported one or more other forms of validity (factorial, discriminant, concurrent and predictive) or reliability. The STAI shows criterion, discriminant and predictive validity and may be most useful for research purposes as a specific measure of anxiety. The Kessler 10 (K-10) may be the best short screening measure due to its ability to differentiate anxiety disorders. The Depression Anxiety Stress Scales 21 (DASS-21) measures multiple types of distress, shows appropriate content, and remains to be validated against clinical interview in perinatal populations. Nineteen studies did not report sensitivity or specificity data. The early stages of research into perinatal anxiety, the multitude of measures in use, and methodological differences restrict comparison of measures across studies. There is a need for further validation of self-report measures of anxiety in the perinatal period to enable accurate screening and detection of anxiety symptoms and disorders. Copyright © 2010 Elsevier B.V. All rights reserved.
Landscape scale estimation of soil carbon stock using 3D modelling.
Veronesi, F; Corstanje, R; Mayr, T
2014-07-15
Soil C is the largest pool of carbon in the terrestrial biosphere, and yet the processes of C accumulation, transformation and loss are poorly accounted for. This is due, in part, to the fact that soil C is not uniformly distributed through the soil depth profile and most current landscape-level predictions of C do not adequately account for the vertical distribution of soil C. In this study, we apply a method based on simple soil-specific depth functions to map the soil C stock in three dimensions at landscape scale. We used soil C and bulk density data from the Soil Survey for England and Wales to map an area in the West Midlands region of approximately 13,948 km². We applied a method which describes the variation through the soil profile and interpolates this across the landscape using well established soil drivers such as relief, land cover and geology. The results indicate that this mapping method can effectively reproduce the observed variation in the soil profile samples. The mapping results were validated using cross-validation and an independent validation. The cross-validation resulted in an R² of 36% for soil C and 44% for bulk density (BULKD). These results are generally in line with previous validated studies. In addition, an independent validation was undertaken, comparing the predictions against the National Soil Inventory (NSI) dataset. The majority of the residuals of this validation are within ± 5% of soil C. This indicates a high level of accuracy in replicating topsoil values. In addition, the results were compared to a previous study estimating the carbon stock of the UK. We discuss the implications of our results within the context of soil C loss factors such as erosion and the impact on regional C process models. Copyright © 2014 Elsevier B.V. All rights reserved.
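A cross-validated R², as reported in the record above, is computed from predictions made only for observations held out of model fitting. The sketch below shows the general idea with k-fold cross-validation; a one-predictor least-squares line stands in for the authors' depth-function and interpolation model, which the abstract does not specify in enough detail to reproduce, and the data are hypothetical.

```python
def fit_line(xs, ys):
    """Ordinary least-squares slope and intercept for one predictor."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def cross_validated_r2(xs, ys, k=5):
    """R² computed from out-of-fold predictions in k-fold cross-validation."""
    n = len(xs)
    predictions = [0.0] * n
    for fold in range(k):
        # Observation i belongs to the fold given by i modulo k.
        train = [i for i in range(n) if i % k != fold]
        held_out = [i for i in range(n) if i % k == fold]
        slope, intercept = fit_line([xs[i] for i in train], [ys[i] for i in train])
        for i in held_out:
            predictions[i] = intercept + slope * xs[i]
    mean_y = sum(ys) / n
    ss_res = sum((y - p) ** 2 for y, p in zip(ys, predictions))
    ss_tot = sum((y - mean_y) ** 2 for y in ys)
    return 1 - ss_res / ss_tot


# Hypothetical data: an exact linear relation and a perturbed copy of it.
x_values = list(range(10))
y_exact = [2 * x + 1 for x in x_values]
y_noisy = [y + (1 if x % 2 else -1) for x, y in zip(x_values, y_exact)]
r2_exact = cross_validated_r2(x_values, y_exact)
r2_noisy = cross_validated_r2(x_values, y_noisy)
print(r2_exact, r2_noisy)
```

Because every prediction comes from a model that never saw the observation being predicted, this R² is typically lower than the in-sample value and can even be negative for a poor model.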
Validity and Reliability of Accelerometers in Patients With COPD: A SYSTEMATIC REVIEW.
Gore, Shweta; Blackwood, Jennifer; Guyette, Mary; Alsalaheen, Bara
2018-05-01
Reduced physical activity is associated with poor prognosis in chronic obstructive pulmonary disease (COPD). Accelerometers have greatly improved quantification of physical activity by providing information on step counts, body positions, energy expenditure, and magnitude of force. The purpose of this systematic review was to compare the validity and reliability of accelerometers used in patients with COPD. An electronic database search of MEDLINE and CINAHL was performed. Study quality was assessed with the Strengthening the Reporting of Observational Studies in Epidemiology checklist while methodological quality was assessed using the modified Quality Appraisal Tool for Reliability Studies. The search yielded 5392 studies; 25 met inclusion criteria. The SenseWear Pro armband reported high criterion validity under controlled conditions (r = 0.75-0.93) and high reliability (ICC = 0.84-0.86) for step counts. The DynaPort MiniMod demonstrated highest concurrent validity for step count using both video and manual methods. Validity of the SenseWear Pro armband varied between studies especially in free-living conditions, slower walking speeds, and with addition of weights during gait. A high degree of variability was found in the outcomes used and statistical analyses performed between studies, indicating a need for further studies to measure reliability and validity of accelerometers in COPD. The SenseWear Pro armband is the most commonly used accelerometer in COPD, but measurement properties are limited by gait speed variability and assistive device use. DynaPort MiniMod and Stepwatch accelerometers demonstrated high validity in patients with COPD but lack reliability data.
Convergent, discriminant, and criterion validity of DSM-5 traits.
Yalch, Matthew M; Hopwood, Christopher J
2016-10-01
Section III of the Diagnostic and Statistical Manual of Mental Disorders (5th ed.; DSM-5; American Psychiatric Association, 2013) contains a system for diagnosing personality disorder based in part on assessing 25 maladaptive traits. Initial research suggests that this aspect of the system improves the validity and clinical utility of the Section II Model. The Computer Adaptive Test of Personality Disorder (CAT-PD; Simms et al., 2011) contains many traits similar to those of the DSM-5, as well as several additional traits seemingly not covered in the DSM-5. In this study we evaluate the convergent and discriminant validity between the DSM-5 traits, as assessed by the Personality Inventory for DSM-5 (PID-5; Krueger et al., 2012), and CAT-PD in an undergraduate sample, and test whether traits included in the CAT-PD but not the DSM-5 provide incremental validity in association with clinically relevant criterion variables. Results supported the convergent and discriminant validity of the PID-5 and CAT-PD scales in their assessment of 23 out of 25 DSM-5 traits. DSM-5 traits were consistently associated with 11 criterion variables, despite our having intentionally selected clinically relevant criterion constructs not directly assessed by DSM-5 traits. However, the additional CAT-PD traits provided incremental information above and beyond the DSM-5 traits for all criterion variables examined. These findings support the validity of pathological trait models in general and the DSM-5 and CAT-PD models in particular, while also suggesting that the CAT-PD may include additional traits for consideration in future iterations of the DSM-5 system. (PsycINFO Database Record (c) 2016 APA, all rights reserved).
Chan, Wallace Chi Ho; Chan, Christopher L F; Suen, Margaret
2013-11-01
Family caregivers may often experience caregiving stress and burden. To systematically assess this issue, medical social workers may need to use a brief and valid measurement in their practice. In the Hong Kong Chinese context, one additional challenge is to examine whether a measurement developed in the West is valid for Hong Kong Chinese caregivers. Thus, medical social workers in Hong Kong initiated this research study to validate the Chinese version of the Modified Caregiver Strain Index (C-M-CSI). A total of 223 Chinese caregivers of patients with various chronic illnesses were recruited for this validation study. C-M-CSI demonstrated good reliability (Cronbach's alpha coefficient = .91), concurrent validity with the Chinese version of the Caregiver Burden Inventory, and discriminant validity with the Chinese version of the Meaning in Life Questionnaire. Factor analysis yielded a single factor as the original M-CSI, which explained 49 percent of variance. Construct validity was shown by differentiating spousal and nonspousal caregivers, as well as caregivers of patients with and without behavioral problems. C-M-CSI is recommended as a brief and valid measurement that can be used by medical social workers in assessing the caregiving strain of Chinese caregivers of patients in Hong Kong.
Whitney, Kriscinda A; Davis, Jeremy J; Shepard, Polly H; Herman, Steven M
2008-01-01
The present study represents a replication and extension of the original Response Bias Scale (RBS) validation study. In addition to examining the relationship between the Test of Memory Malingering (TOMM), RBS, and several other well-researched Minnesota Multiphasic Personality Inventory 2 (MMPI-2) validity scales (i.e., F, Fb, Fp, and the Fake Bad Scale), the present study also included the recently developed Infrequency Post-Traumatic Stress Disorder Scale and the Henry-Heilbronner Index (HHI) of the MMPI-2. Findings from this retrospective data analysis (N=46) demonstrated the superiority of the RBS, and to a certain extent the HHI, over other MMPI-2 validity scales in predicting TOMM failure within the outpatient Veterans Affairs population. Results of the current study confirm the clinical utility of the RBS and suggest that, particularly if the MMPI-2 is an existing part of the neuropsychological assessment, examination of RBS scores is an efficient means of detecting negative response bias.
Crary, Michael A.; Carnaby, Giselle D.; Sia, Isaac
2017-01-01
Background: The aim of this study was to compare spontaneous swallow frequency analysis (SFA) with clinical screening protocols for identification of dysphagia in acute stroke. Methods: In all, 62 patients with acute stroke were evaluated for spontaneous swallow frequency rates using a validated acoustic analysis technique. Independent of SFA, these same patients received a routine nurse-administered clinical dysphagia screening as part of standard stroke care. Both screening tools were compared against a validated clinical assessment of dysphagia for acute stroke. In addition, psychometric properties of SFA were compared against published, validated clinical screening protocols. Results: Spontaneous SFA differentiates patients with versus without dysphagia after acute stroke. Using a previously identified cut point based on swallows per minute, spontaneous SFA demonstrated superior ability to identify dysphagia cases compared with a nurse-administered clinical screening tool. In addition, spontaneous SFA demonstrated equal or superior psychometric properties to 4 validated, published clinical dysphagia screening tools. Conclusions: Spontaneous SFA has high potential to identify dysphagia in acute stroke with psychometric properties equal or superior to clinical screening protocols. PMID:25088166
Crary, Michael A; Carnaby, Giselle D; Sia, Isaac
2014-09-01
The aim of this study was to compare spontaneous swallow frequency analysis (SFA) with clinical screening protocols for identification of dysphagia in acute stroke. In all, 62 patients with acute stroke were evaluated for spontaneous swallow frequency rates using a validated acoustic analysis technique. Independent of SFA, these same patients received a routine nurse-administered clinical dysphagia screening as part of standard stroke care. Both screening tools were compared against a validated clinical assessment of dysphagia for acute stroke. In addition, psychometric properties of SFA were compared against published, validated clinical screening protocols. Spontaneous SFA differentiates patients with versus without dysphagia after acute stroke. Using a previously identified cut point based on swallows per minute, spontaneous SFA demonstrated superior ability to identify dysphagia cases compared with a nurse-administered clinical screening tool. In addition, spontaneous SFA demonstrated equal or superior psychometric properties to 4 validated, published clinical dysphagia screening tools. Spontaneous SFA has high potential to identify dysphagia in acute stroke with psychometric properties equal or superior to clinical screening protocols. Copyright © 2014 National Stroke Association. Published by Elsevier Inc. All rights reserved.
Tsuchiyagaito, Aki; Horiuchi, Satoshi; Igarashi, Toko; Kawanori, Yoshiya; Hirano, Yoshiyuki; Yabe, Hirooki; Nakagawa, Akiko
2017-01-01
The Hoarding Rating Scale-Self-Report (HRS-SR) is a five-item scale that assesses the symptoms of hoarding. These symptoms include excessive acquisition, difficulty in discarding, and excessive clutter that causes distress. We conducted three studies to examine the factor structure, reliability, and validity of the Japanese version of the HRS-SR (HRS-SR-J). Study 1 examined its reliability; 193 college students and 320 adolescents and adults completed the HRS-SR-J and, of the college students, 32 took it again 2 weeks later. Study 2 aimed to confirm that its scores in a sample of 210 adolescents and adults are independent of social desirability. Study 3 aimed to validate the HRS-SR-J in the aspects of convergent and discriminant validity in a sample of 550 adults. The HRS-SR-J showed good internal consistency and 2-week test-retest reliability. Based on the nonsignificant correlations between the HRS-SR-J and social desirability, the HRS-SR-J was not strongly affected by social desirability. In addition, it also had a good convergent validity with the Japanese version of the Saving Inventory-Revised (SI-R-J) and the hoarding subscale of the Obsessive-Compulsive Inventory, while having a significantly weaker correlation with the five subscales of the Obsessive-Compulsive Inventory, except for the hoarding subscale. In addition, the strength of the correlation between the HRS-SR-J and the Japanese version of the Patient Health Questionnaire-9 and that between the HRS-SR-J and the Generalized Anxiety Disorder-7 were significantly weaker than the correlation between the HRS-SR-J and the SI-R-J. These results demonstrate that the HRS-SR-J has good convergent and discriminant validity. The HRS-SR-J is a notable self-report scale for examining the severity of hoarding symptoms.
Perez, Miguel A; Sudweeks, Jeremy D; Sears, Edie; Antin, Jonathan; Lee, Suzanne; Hankey, Jonathan M; Dingus, Thomas A
2017-06-01
Understanding causal factors for traffic safety-critical events (e.g., crashes and near-crashes) is an important step in reducing their frequency and severity. Naturalistic driving data offers unparalleled insight into these factors, but requires identification of situations where crashes are present within large volumes of data. Sensitivity and specificity of these identification approaches are key to minimizing the resources required to validate candidate crash events. This investigation used data from the Second Strategic Highway Research Program Naturalistic Driving Study (SHRP 2 NDS) and the Canada Naturalistic Driving Study (CNDS) to develop and validate different kinematic thresholds that can be used to detect crash events. Results indicate that the sensitivity of many of these approaches can be quite low, but can be improved by selecting particular threshold levels based on detection performance. Additional improvements in these approaches are possible, and may involve leveraging combinations of different detection approaches, including advanced statistical techniques and artificial intelligence approaches, additional parameter modifications, and automation of validation processes. Copyright © 2017 Elsevier Ltd. All rights reserved.
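The trade-off described above, where a threshold's level determines how many true crashes are caught and how many non-crash events must be reviewed, can be illustrated with a small sketch. It assumes a single kinematic signal (peak deceleration in g) and a simple "flag if at or above threshold" rule; the function name, signal choice, and all values are hypothetical and do not come from the SHRP 2 NDS or CNDS data.

```python
def detection_performance(peaks, is_crash, threshold):
    """Return (sensitivity, specificity) of a simple threshold detector.

    An event is flagged as a candidate crash when its peak deceleration
    (in g) is greater than or equal to the threshold.
    """
    tp = fp = tn = fn = 0
    for peak, crash in zip(peaks, is_crash):
        flagged = peak >= threshold
        if flagged and crash:
            tp += 1
        elif flagged:
            fp += 1
        elif crash:
            fn += 1
        else:
            tn += 1
    return tp / (tp + fn), tn / (tn + fp)


# Hypothetical events (illustration only).
peaks = [0.3, 0.5, 0.7, 0.9, 1.2, 1.5]
is_crash = [False, False, True, False, True, True]

for threshold in (0.6, 1.0):
    sens, spec = detection_performance(peaks, is_crash, threshold)
    print(f"threshold={threshold} g  sensitivity={sens:.2f}  specificity={spec:.2f}")
```

In this toy data, lowering the threshold catches every crash but flags a non-crash event for review, while raising it removes the false alarm at the cost of missing one crash.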
Validation of biomarkers of food intake-critical assessment of candidate biomarkers.
Dragsted, L O; Gao, Q; Scalbert, A; Vergères, G; Kolehmainen, M; Manach, C; Brennan, L; Afman, L A; Wishart, D S; Andres Lacueva, C; Garcia-Aloy, M; Verhagen, H; Feskens, E J M; Praticò, G
2018-01-01
Biomarkers of food intake (BFIs) are a promising tool for limiting misclassification in nutrition research where more subjective dietary assessment instruments are used. They may also be used to assess compliance to dietary guidelines or to a dietary intervention. Biomarkers therefore hold promise for direct and objective measurement of food intake. However, the number of comprehensively validated biomarkers of food intake is limited to just a few. Many new candidate biomarkers emerge from metabolic profiling studies and from advances in food chemistry. Furthermore, candidate food intake biomarkers may also be identified based on extensive literature reviews such as described in the guidelines for Biomarker of Food Intake Reviews (BFIRev). To systematically and critically assess the validity of candidate biomarkers of food intake, it is necessary to outline and streamline an optimal and reproducible validation process. A consensus-based procedure was used to provide and evaluate a set of the most important criteria for systematic validation of BFIs. As a result, a validation procedure was developed including eight criteria: plausibility, dose-response, time-response, robustness, reliability, stability, analytical performance, and inter-laboratory reproducibility. The validation has a dual purpose: (1) to estimate the current level of validation of candidate biomarkers of food intake based on an objective and systematic approach and (2) to pinpoint which additional studies are needed to provide full validation of each candidate biomarker of food intake. This position paper on biomarker of food intake validation outlines the second step of the BFIRev procedure but may also be used as such for validation of new candidate biomarkers identified, e.g., in food metabolomic studies.
Durand, Guillaume
2018-05-03
Although highly debated, the notion of the existence of an adaptive side to psychopathy is supported by some researchers. Currently, 2 instruments assessing psychopathic traits include an adaptive component, which might not cover the full spectrum of adaptive psychopathic traits. The Durand Adaptive Psychopathic Traits Questionnaire (DAPTQ; Durand, 2017) is a 41-item self-reported instrument assessing adaptive traits known to correlate with the psychopathic personality. In this study, I investigated in 2 samples (N = 263 and N = 262) the incremental validity of the DAPTQ over the Psychopathic Personality Inventory-Short Form (PPI-SF) and the Triarchic Psychopathy Measure (TriPM) using multiple criterion measures. Results showed that the DAPTQ significantly increased the predictive validity over the PPI-SF on 5 factors of the HEXACO. Additionally, the DAPTQ provided incremental validity over both the PPI-SF and the TriPM on measures of communication adaptability, perceived stress, and trait anxiety. Overall, these results support the validity of the DAPTQ in community samples. Directions for future studies to further validate the DAPTQ are discussed.
Validation study and routine control monitoring of moist heat sterilization procedures.
Shintani, Hideharu
2012-06-01
The proposed approach to validation of steam sterilization in autoclaves follows the basic life cycle concepts applicable to all validation programs: understand the function of the sterilization process, develop and understand the cycles that carry out the process, and define a suitable test or series of tests to confirm that the function of the process is suitably ensured by the structure provided. Sterilization of product, and of components and parts that come in direct contact with sterilized product, is the most critical of pharmaceutical processes. Consequently, this process requires the most rigorous and detailed approach to validation. An understanding of the process requires a basic understanding of microbial death, the parameters that facilitate that death, the accepted definition of sterility, and the relationship between the definition and sterilization parameters. Autoclaves and support systems need to be designed, installed, and qualified in a manner that ensures their continued reliability. Lastly, the test program must be complete and definitive. In addition to the validation study, this paper describes in concrete terms the documentation of installation qualification (IQ), operational qualification (OQ), and performance qualification (PQ).
Nakano, Hideki; Kodama, Takayuki; Ukai, Kazumasa; Kawahara, Satoru; Horikawa, Shiori; Murata, Shin
2018-01-01
In this study, we aimed to (1) translate the English version of the Kinesthetic and Visual Imagery Questionnaire (KVIQ), which assesses motor imagery ability, into Japanese, and (2) investigate the reliability and validity of the Japanese KVIQ. We enrolled 28 healthy adults in this study. We used Cronbach’s alpha coefficients to assess reliability reflected by the internal consistency. Additionally, we assessed validity reflected by the criterion-related validity between the Japanese KVIQ and the Japanese version of the Movement Imagery Questionnaire-Revised (MIQ-R) with Spearman’s rank correlation coefficients. The Cronbach’s alpha coefficients for the KVIQ-20 were 0.88 (Visual) and 0.91 (Kinesthetic), which indicates high reliability. There was a significant positive correlation between the Japanese KVIQ-20 (Total) and the Japanese MIQ-R (Total) (r = 0.86, p < 0.01). Our results suggest that the Japanese KVIQ is an assessment that is a reliable and valid index of motor imagery ability. PMID:29724042
Nakano, Hideki; Kodama, Takayuki; Ukai, Kazumasa; Kawahara, Satoru; Horikawa, Shiori; Murata, Shin
2018-05-02
In this study, we aimed to (1) translate the English version of the Kinesthetic and Visual Imagery Questionnaire (KVIQ), which assesses motor imagery ability, into Japanese, and (2) investigate the reliability and validity of the Japanese KVIQ. We enrolled 28 healthy adults in this study. We used Cronbach’s alpha coefficients to assess reliability reflected by the internal consistency. Additionally, we assessed validity reflected by the criterion-related validity between the Japanese KVIQ and the Japanese version of the Movement Imagery Questionnaire-Revised (MIQ-R) with Spearman’s rank correlation coefficients. The Cronbach’s alpha coefficients for the KVIQ-20 were 0.88 (Visual) and 0.91 (Kinesthetic), which indicates high reliability. There was a significant positive correlation between the Japanese KVIQ-20 (Total) and the Japanese MIQ-R (Total) (r = 0.86, p < 0.01). Our results suggest that the Japanese KVIQ is an assessment that is a reliable and valid index of motor imagery ability.
Physics Metacognition Inventory Part II: Confirmatory factor analysis and Rasch analysis
NASA Astrophysics Data System (ADS)
Taasoobshirazi, Gita; Bailey, MarLynn; Farley, John
2015-11-01
The Physics Metacognition Inventory was developed to measure physics students' metacognition for problem solving. In one of our earlier studies, an exploratory factor analysis provided evidence of preliminary construct validity, revealing six components of students' metacognition when solving physics problems including knowledge of cognition, planning, monitoring, evaluation, debugging, and information management. The college students' scores on the inventory were found to be reliable and related to students' physics motivation and physics grade. However, the results of the exploratory factor analysis indicated that the questionnaire could be revised to improve its construct validity. The goal of this study was to revise the questionnaire and establish its construct validity through a confirmatory factor analysis. In addition, a Rasch analysis was applied to the data to better understand the psychometric properties of the inventory and to further evaluate the construct validity. Results indicated that the final, revised inventory is a valid, reliable, and efficient tool for assessing student metacognition for physics problem solving.
Development and Validation of a Smartphone Addiction Scale (SAS)
Kwon, Min; Lee, Joon-Yeop; Won, Wang-Youn; Park, Jae-Woo; Min, Jung-Ah; Hahn, Changtae; Gu, Xinyu; Choi, Ji-Hye; Kim, Dai-Jin
2013-01-01
Objective: The aim of this study was to develop a self-diagnostic scale that could distinguish smartphone addicts based on the Korean self-diagnostic program for Internet addiction (K-scale) and the smartphone's own features. In addition, the reliability and validity of the smartphone addiction scale (SAS) were demonstrated. Methods: A total of 197 participants were selected from Nov. 2011 to Jan. 2012 to complete a set of questionnaires, including SAS, K-scale, modified Kimberly Young Internet addiction test (Y-scale), visual analogue scale (VAS), and substance dependence and abuse diagnosis of DSM-IV. There were 64 males and 133 females, with ages ranging from 18 to 53 years (M = 26.06; SD = 5.96). Factor analysis, internal-consistency test, t-test, ANOVA, and correlation analysis were conducted to verify the reliability and validity of SAS. Results: Based on the factor analysis results, the subscale “disturbance of reality testing” was removed, and six factors were left. The internal consistency and concurrent validity of SAS were verified (Cronbach's alpha = 0.967). SAS and its subscales were significantly correlated with K-scale and Y-scale. The VAS of each factor also showed a significant correlation with each subscale. In addition, differences were found in the job (p<0.05), education (p<0.05), and self-reported smartphone addiction scores (p<0.001) in SAS. Conclusions: This study developed the first scale of the smartphone addiction aspect of the diagnostic manual. This scale was proven to be relatively reliable and valid. PMID:23468893
Development and validation of a smartphone addiction scale (SAS).
Kwon, Min; Lee, Joon-Yeop; Won, Wang-Youn; Park, Jae-Woo; Min, Jung-Ah; Hahn, Changtae; Gu, Xinyu; Choi, Ji-Hye; Kim, Dai-Jin
2013-01-01
The aim of this study was to develop a self-diagnostic scale that could distinguish smartphone addicts based on the Korean self-diagnostic program for Internet addiction (K-scale) and the smartphone's own features. In addition, the reliability and validity of the smartphone addiction scale (SAS) were demonstrated. A total of 197 participants were selected from Nov. 2011 to Jan. 2012 to complete a set of questionnaires, including SAS, K-scale, modified Kimberly Young Internet addiction test (Y-scale), visual analogue scale (VAS), and substance dependence and abuse diagnosis of DSM-IV. There were 64 males and 133 females, with ages ranging from 18 to 53 years (M = 26.06; SD = 5.96). Factor analysis, internal-consistency test, t-test, ANOVA, and correlation analysis were conducted to verify the reliability and validity of SAS. Based on the factor analysis results, the subscale "disturbance of reality testing" was removed, and six factors were left. The internal consistency and concurrent validity of SAS were verified (Cronbach's alpha = 0.967). SAS and its subscales were significantly correlated with K-scale and Y-scale. The VAS of each factor also showed a significant correlation with each subscale. In addition, differences were found in the job (p<0.05), education (p<0.05), and self-reported smartphone addiction scores (p<0.001) in SAS. This study developed the first scale of the smartphone addiction aspect of the diagnostic manual. This scale was proven to be relatively reliable and valid.
Morasco, Benjamin J; Gfeller, Jeffrey D; Elder, Katherine A
2007-06-01
In this psychometric study, we compared the recently developed Validity Scales from the Revised NEO Personality Inventory (NEO PI-R; Costa & McCrae, 1992b) with the MMPI-2 (Butcher, Dahlstrom, Graham, Tellegen, & Kaemmer, 1989) Validity Scales. We collected data from clients (n = 74) who completed comprehensive psychological evaluations at a university-based outpatient mental health clinic. Correlations between the Validity Scales of the NEO-PI-R and MMPI-2 were significant and in the expected directions. The relationships provide support for convergent and discriminant validity of the NEO-PI-R Validity Scales. The percent agreement of invalid responding on the two measures was high, although the diagnostic agreement was modest (kappa = .22-.33). Finally, clients who responded in an invalid manner on the NEO-PI-R Validity Scales produced significantly different clinical profiles on the NEO-PI-R and MMPI-2 than clients with valid protocols. These results provide additional support for the clinical utility of the NEO-PI-R Validity Scales as indicators of response bias.
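High percent agreement alongside a modest kappa, as reported above, typically arises when one classification (here, a valid protocol) dominates, so that much of the raw agreement is expected by chance. The sketch below computes both quantities for a 2x2 agreement table using the standard Cohen's kappa formula. The function name and the counts are hypothetical and are not the study's data.

```python
def agreement_and_kappa(both_invalid, a_only, b_only, both_valid):
    """Return (percent agreement, Cohen's kappa) for two binary classifiers.

    both_invalid: cases both measures flag as invalid
    a_only / b_only: cases flagged as invalid by only measure A / only B
    both_valid: cases both measures classify as valid
    """
    n = both_invalid + a_only + b_only + both_valid
    p_observed = (both_invalid + both_valid) / n
    a_rate = (both_invalid + a_only) / n  # share flagged invalid by A
    b_rate = (both_invalid + b_only) / n  # share flagged invalid by B
    p_expected = a_rate * b_rate + (1 - a_rate) * (1 - b_rate)
    kappa = (p_observed - p_expected) / (1 - p_expected)
    return p_observed, kappa


# Hypothetical counts (illustration only): invalid protocols are rare.
agreement, kappa = agreement_and_kappa(both_invalid=5, a_only=5, b_only=5, both_valid=85)
print(f"agreement={agreement:.2f}  kappa={kappa:.2f}")
```

With these made-up counts the two measures agree on 90% of cases, yet chance alone would produce 82% agreement, so kappa is far lower than the raw agreement suggests.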
Display format, highlight validity, and highlight method: Their effects on search performance
NASA Technical Reports Server (NTRS)
Donner, Kimberly A.; Mckay, Tim D.; Obrien, Kevin M.; Rudisill, Marianne
1991-01-01
Display format and highlight validity were shown to affect visual display search performance; however, these studies were conducted on small, artificial displays of alphanumeric stimuli. A study manipulating these variables was conducted using realistic, complex Space Shuttle information displays. A 2x2x3 within-subjects analysis of variance found that search times were faster for items in reformatted displays than for current displays. Responses to valid applications of highlight were significantly faster than responses to non or invalidly highlighted applications. The significant format by highlight validity interaction showed that there was little difference in response time to both current and reformatted displays when the highlight validity was applied; however, under the non or invalid highlight conditions, search times were faster with reformatted displays. A separate within-subject analysis of variance of display format, highlight validity, and several highlight methods did not reveal a main effect of highlight method. In addition, observed display search times were compared to search time predicted by Tullis' Display Analysis Program. Benefits of highlighting and reformatting displays to enhance search and the necessity to consider highlight validity and format characteristics in tandem for predicting search performance are discussed.
Boerboom, T B B; Dolmans, D H J M; Jaarsma, A D C; Muijtjens, A M M; Van Beukelen, P; Scherpbier, A J J A
2011-01-01
Feedback to aid teachers in improving their teaching requires validated evaluation instruments. When implementing an evaluation instrument in a different context, it is important to collect validity evidence from multiple sources. We examined the validity and reliability of the Maastricht Clinical Teaching Questionnaire (MCTQ) as an instrument to evaluate individual clinical teachers during short clinical rotations in veterinary education. We examined four sources of validity evidence: (1) Content was examined based on theory of effective learning. (2) Response process was explored in a pilot study. (3) Internal structure was assessed by confirmatory factor analysis using 1086 student evaluations and reliability was examined utilizing generalizability analysis. (4) Relations with other relevant variables were examined by comparing factor scores with other outcomes. Content validity was supported by theory underlying the cognitive apprenticeship model on which the instrument is based. The pilot study resulted in an additional question about supervision time. A five-factor model showed a good fit with the data. Acceptable reliability was achievable with 10-12 questionnaires per teacher. Correlations between the factors and overall teacher judgement were strong. The MCTQ appears to be a valid and reliable instrument to evaluate clinical teachers' performance during short rotations.
Jin, Yinji; Jin, Taixian; Lee, Sun-Mi
Pressure injury risk assessment is the first step toward preventing pressure injuries, but traditional assessment tools are time-consuming, resulting in work overload and fatigue for nurses. The objectives of the study were to build an automated pressure injury risk assessment system (Auto-PIRAS) that can assess pressure injury risk using data, without requiring nurses to collect or input additional data, and to evaluate the validity of this assessment tool. A retrospective case-control study and a system development study were conducted in a 1,355-bed university hospital in Seoul, South Korea. A total of 1,305 pressure injury patients and 5,220 nonpressure injury patients participated for the development of a risk scoring algorithm: 687 and 2,748 for the validation of the algorithm and 237 and 994 for validation after clinical implementation, respectively. A total of 4,211 pressure injury-related clinical variables were extracted from the electronic health record (EHR) systems to develop a risk scoring algorithm, which was validated and incorporated into the EHR. That program was further evaluated for predictive and concurrent validity. Auto-PIRAS, incorporated into the EHR system, assigned a risk assessment score of high, moderate, or low and displayed this on the Kardex nursing record screen. Risk scores were updated nightly according to 10 predetermined risk factors. The predictive validity measures of the algorithm validation stage were as follows: sensitivity = .87, specificity = .90, positive predictive value = .68, negative predictive value = .97, Youden index = .77, and the area under the receiver operating characteristic curve = .95. The predictive validity measures of the Braden Scale were as follows: sensitivity = .77, specificity = .93, positive predictive value = .72, negative predictive value = .95, Youden index = .70, and the area under the receiver operating characteristic curve = .85. 
The kappa of the Auto-PIRAS and Braden Scale risk classification result was .73. The predictive performance of the Auto-PIRAS was similar to Braden Scale assessments conducted by nurses. Auto-PIRAS is expected to be used as a system that assesses pressure injury risk automatically without additional data collection by nurses.
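The Youden indices reported above can be checked directly from the stated sensitivities and specificities, since the Youden index is defined as J = sensitivity + specificity - 1. The short sketch below does that arithmetic; the function name is illustrative, and the inputs are the rounded values quoted in the abstract.

```python
def youden_index(sensitivity, specificity):
    """Youden's J statistic: sensitivity + specificity - 1."""
    return sensitivity + specificity - 1


# Values as reported in the abstract (already rounded to two decimals).
auto_piras_j = youden_index(0.87, 0.90)
braden_j = youden_index(0.77, 0.93)

print(f"Auto-PIRAS J = {auto_piras_j:.2f}")
print(f"Braden Scale J = {braden_j:.2f}")
```

Both results agree with the reported Youden indices of .77 and .70.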
Dutta, Sayon; Long, William J; Brown, David F M; Reisner, Andrew T
2013-08-01
As use of radiology studies increases, there is a concurrent increase in incidental findings (eg, lung nodules) for which the radiologist issues recommendations for additional imaging for follow-up. Busy emergency physicians may be challenged to carefully communicate recommendations for additional imaging not relevant to the patient's primary evaluation. The emergence of electronic health records and natural language processing algorithms may help address this quality gap. We seek to describe recommendations for additional imaging from our institution and develop and validate an automated natural language processing algorithm to reliably identify recommendations for additional imaging. We developed a natural language processing algorithm to detect recommendations for additional imaging, using 3 iterative cycles of training and validation. The third cycle used 3,235 radiology reports (1,600 for algorithm training and 1,635 for validation) of discharged emergency department (ED) patients from which we determined the incidence of discharge-relevant recommendations for additional imaging and the frequency of appropriate discharge documentation. The test characteristics of the 3 natural language processing algorithm iterations were compared, using blinded chart review as the criterion standard. Discharge-relevant recommendations for additional imaging were found in 4.5% (95% confidence interval [CI] 3.5% to 5.5%) of ED radiology reports, but 51% (95% CI 43% to 59%) of discharge instructions failed to note those findings. The final natural language processing algorithm had 89% (95% CI 82% to 94%) sensitivity and 98% (95% CI 97% to 98%) specificity for detecting recommendations for additional imaging. For discharge-relevant recommendations for additional imaging, sensitivity improved to 97% (95% CI 89% to 100%). 
Recommendations for additional imaging are common, and failure to document relevant recommendations for additional imaging in ED discharge instructions occurs frequently. The natural language processing algorithm's performance improved with each iteration and offers a promising error-prevention tool. Copyright © 2013 American College of Emergency Physicians. Published by Mosby, Inc. All rights reserved.
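The confidence interval quoted for the incidence of discharge-relevant recommendations (4.5%, 95% CI 3.5% to 5.5%) can be approximated with a normal-approximation (Wald) interval. This is a sketch under two assumptions that the abstract does not state: that the denominator is the 1,635 validation reports, and that a Wald interval was used. The function name is illustrative.

```python
import math


def wald_interval(p, n, z=1.96):
    """Normal-approximation confidence interval for a proportion.

    p: observed proportion, n: sample size, z: critical value (1.96 for 95%).
    """
    half_width = z * math.sqrt(p * (1 - p) / n)
    return p - half_width, p + half_width


# Assumed denominator: the 1,635 validation reports (not stated in the abstract).
low, high = wald_interval(0.045, 1635)
print(f"{low:.3f} to {high:.3f}")
```

Under these assumptions the interval is consistent with the reported 3.5% to 5.5%. For proportions near 0 or 1, or for small samples, a Wilson or exact interval is usually preferred to the Wald interval.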
Aubin, Michèle; Verreault, René; Savoie, Maryse; LeMay, Sylvie; Hadjistavropoulos, Thomas; Fillion, Lise; Beaulieu, Marie; Viens, Chantal; Bergeron, Rénald; Vézina, Lucie; Misson, Lucie; Fuchs-Lacelle, Shannon
2008-01-01
This study presents the validation of the French Canadian version (PACSLAC-F) of the Pain Assessment Checklist for Seniors with Limited Ability to Communicate (PACSLAC). Unlike the published validation of the English version of the PACSLAC, which was validated retrospectively, the French version was validated prospectively. The PACSLAC-F was completed by nurses working in long-term care facilities after observing 86 seniors, with severe cognitive impairment, in calm, painful or distressing but non-painful situations. The test-retest and inter-observer reliability, the internal consistency, and the discriminant validity were found to be satisfactory. To evaluate the convergent validity with the DOLOPLUS-2 and the clinical relevance of the PACSLAC, it was also completed by nurses during their work shift, with 26 additional patients, for three days per week during a period of four weeks. These results encourage us to test the PACSLAC in a comprehensive program of pain management targeting this population.
49 CFR 1522.127 - Assessment report.
Code of Federal Regulations, 2010 CFR
2010-10-01
..., DEPARTMENT OF HOMELAND SECURITY SECURITY RULES FOR ALL MODES OF TRANSPORTATION TSA-APPROVED VALIDATION FIRMS AND VALIDATORS TSA-Approved Validation Firms and Validators for the Certified Cargo Screening Program...) The assessment report must include the following information, in addition to any other information...
Muhamad, Zailani; Ramli, Ayiesah; Amat, Salleh
2015-05-01
The aim of this study was to determine the content validity, internal consistency, test-retest reliability and inter-rater reliability of the Clinical Competency Evaluation Instrument (CCEVI) in assessing the clinical performance of physiotherapy students. This study was carried out between June and September 2013 at Universiti Kebangsaan Malaysia (UKM), Kuala Lumpur, Malaysia. A panel of 10 experts was identified to establish content validity by evaluating and rating each of the items used in the CCEVI with regard to their relevance in measuring students' clinical competency. A total of 50 UKM undergraduate physiotherapy students were assessed throughout their clinical placement to determine the construct validity of these items. The instrument's reliability was determined through a cross-sectional study involving a clinical performance assessment of 14 final-year undergraduate physiotherapy students. The content validity index of the entire CCEVI was 0.91, while the proportion of agreement on the content validity indices ranged from 0.83-1.00. The CCEVI construct validity was established with factor loading of ≥0.6, while internal consistency (Cronbach's alpha) overall was 0.97. Test-retest reliability of the CCEVI was confirmed with a Pearson's correlation range of 0.91-0.97 and an intraclass correlation coefficient range of 0.95-0.98. Inter-rater reliability of the CCEVI domains ranged from 0.59 to 0.97 on initial and subsequent assessments. This pilot study confirmed the content validity of the CCEVI. It showed high internal consistency, thereby providing evidence that the CCEVI has moderate to excellent inter-rater reliability. However, additional refinement in the wording of the CCEVI items, particularly in the domains of safety and documentation, is recommended to further improve the validity and reliability of the instrument.
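A content validity index of the kind reported above is commonly computed from expert relevance ratings. The sketch below assumes one widely used convention that the abstract does not confirm for this study: each expert rates each item on a 4-point relevance scale, an item-level index is the proportion of experts rating it 3 or 4, and the scale-level index is the average of the item-level values. The function names and ratings are hypothetical.

```python
def item_cvi(ratings, relevant_from=3):
    """Item-level CVI: share of experts rating the item as relevant.

    Assumes a 4-point relevance scale where ratings of 3 or 4 count as
    relevant (a common convention, not confirmed by the abstract).
    """
    return sum(r >= relevant_from for r in ratings) / len(ratings)


def scale_cvi(ratings_by_item):
    """Scale-level CVI as the average of the item-level CVIs."""
    cvis = [item_cvi(ratings) for ratings in ratings_by_item]
    return sum(cvis) / len(cvis)


# Hypothetical ratings from a 10-expert panel for three items.
ratings_by_item = [
    [4, 4, 4, 4, 3, 4, 4, 3, 4, 4],  # all experts rate it relevant
    [4, 4, 3, 3, 4, 4, 3, 4, 2, 4],  # one expert rates it not relevant
    [3, 4, 2, 4, 4, 3, 2, 4, 4, 3],  # two experts rate it not relevant
]
for ratings in ratings_by_item:
    print(item_cvi(ratings))
print(round(scale_cvi(ratings_by_item), 2))
```

With a 10-member panel, each dissenting expert lowers an item's index by 0.1, which is why item-level values from small panels move in coarse steps.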
Ku, David Tawei; Shen, Chun-Yi
2009-01-01
The Felder-Soloman Index of Learning Styles (ILS) has been a popular instrument for measuring learning styles of college students for the past two decades. Even though several researchers have translated the ILS into Chinese for their own studies, a Chinese version has not been standardized and evaluated, nor has anyone reported on its reliability and validity. Based on data collected from 2,748 students at a large private university in Taiwan, this study investigates the reliability and validity of the Chinese version of the ILS. In addition, through factor analysis and structural equation modeling (SEM) analysis, problematic test items are identified for further modification. Results show that the reliability of each scale of the ILS has a pattern similar to that of previous studies. The study therefore investigates the identified problematic elements and discusses two key points: (1) the language and translation problems and (2) precision and design. In addition, results of the significant interaction effects of analysis of variance (ANOVA) for active/reflective and sensing/intuitive scales indicate the effect of college differences depends on the levels between genders. Moreover, in general, female students are significantly more intuitive and global and less visual than male students. Other detailed analyses of academic disciplines and gender on the ILS are also reported.
Dammerer, D; Liebensteiner, M C; Kujala, U M; Emmanuel, K; Kopf, S; Dirisamer, F; Giesinger, J M
2018-04-01
The Kujala score is the most frequently used questionnaire for patellofemoral disorders like pain, instability or osteoarthritis. Unfortunately, we are not aware of a validated German version of the Kujala score. The aim of our study was the translation and linguistic validation of the Kujala score in German-speaking patients with patella instability and the assessment of its measurement characteristics. The German Kujala score was developed in several steps of translation. In addition to healthy controls, the Kujala German was assessed in consecutive patients undergoing reconstruction of the medial patellofemoral ligament for recurrent patellar dislocations. Preoperatively and at 6 and 12 months postoperatively, the patients completed the Kujala German score, the KOOS, the Lysholm score, a VAS for pain, and the SF-12v2. In addition, the Kujala German score was retested preoperatively after a 1-week interval. We found high reliability in terms of internal consistency for the Kujala score (Cronbach's alpha = 0.87). Convergent validity with the KOOS (symptom r = 0.65, pain r = 0.78, ADL r = 0.74, sports/recreation r = 0.84, quality of life r = 0.70), the Lysholm score (r = 0.88) and the SF-12 physical component summary score (r = 0.79) and VAS pain (r = -0.71) was also very high. Discriminant validity in terms of correlation with the SF-12 mental component summary score was satisfactory (r = 0.14). In conclusion, the German version of the Kujala score proved to be a reliable and valid instrument in the setting of a typical patellofemoral disease treated with a standard patellofemoral procedure.
NASA Astrophysics Data System (ADS)
Risnawati; Khairinnisa, S.; Darwis, A. H.
2018-01-01
The purpose of this study was to develop a CORE model-based worksheet with recitation tasks that was valid and practical and could facilitate students’ communication skills in a Linear Algebra course. This study was conducted in the mathematics education department of one public university in Riau, Indonesia. Participants of the study were media and subject matter experts as validators as well as students from the mathematics education department. The objects of this study were the students’ worksheet and students’ mathematical communication skills. The results of the study showed that: (1) based on validation by the experts, the developed students’ worksheet was valid and could be applied for students in Linear Algebra courses; (2) based on the group trial, the practicality percentage was 92.14% in the small group and 90.19% in the large group, so the worksheet was very practical and could attract students to learn; and (3) based on the post test, the average percentage of ideals was 87.83%. In addition, the results showed that the students’ worksheet was able to facilitate students’ mathematical communication skills in the Linear Algebra course.
The preliminary analysis of the reliability and validity of the Chinese Edition of the CSBS DP.
Lin, Chu-Sui; Chang, Shu-Hui; Cheng, Shu-Fen; Chao, Pen-Chiang; Chiu, Chun-Hao
2015-03-01
This study marked a preliminary attempt to standardize the Chinese Edition of the Communication and Symbolic Behavior Scales Developmental Profile (Wetherby & Prizant, 2002; CSBS DP) to assist in the early identification of young children with special needs in Taiwan. The study was conducted among 171 infants and toddlers aged 1-2. It also included a follow-up study one year after the initial test. Three domestically developed standardized child development inventories were used to measure the concurrent validity and predictive validity. The Chinese Edition of the CSBS DP demonstrated overall good test-retest and inter-rater reliability. It also showed good concurrent and predictive validity. The current study yields preliminary evidence that the Chinese Edition of the CSBS DP could be a valuable assessment tool worthy of wider distribution. Future research should employ random sampling to establish a true national norm. Additionally, the follow-up study needs to include atypical groups and to expand to children aged 6-12 months to strengthen the applicability of the instrument in Taiwan. Copyright © 2014 Elsevier Ltd. All rights reserved.
Hasanpour, Neda; Attarbashi Moghadam, Behrouz; Sami, Ramin; Tavakol, Kamran
2016-08-01
The clinical COPD questionnaire (CCQ) has been developed to measure the health status of COPD patients. The aim of this study was to translate the CCQ into the Persian language and assess the validity and reliability of the translated version. We used a forward-backward procedure to translate the questionnaire. In a cross-sectional study, 100 COPD patients and 50 healthy subjects over 40 years old were selected to assess the reliability and construct validity of the instrument. Face and content validity were used to evaluate the questionnaire's validity. Validity was examined in a population of patients with COPD, using the Persian validated version of the St George's Respiratory Questionnaire (PSGRQ). In order to assess the questionnaire's reliability, the intraclass correlation coefficient (ICC) and Cronbach's alpha were calculated. Test-retest reliability was tested by re-administering the Persian version of the CCQ (PCCQ) after 1 week. The test-retest data demonstrate that the PCCQ has excellent reliability (ICCs for all 3 domains were higher than 0.9). Internal consistency was found by Cronbach's alpha to be 0.96, 0.94, 0.97, and 0.98 for the symptom, mental state, functional state and total scores respectively. In addition, the correlation between the components of PCCQ and PSGRQ showed satisfactory construct validity. Analyzing the data from healthy subjects and patients showed that the PCCQ has acceptable discriminant validity. In general, the PCCQ had satisfactory reliability and validity for assessing health-related quality of life status of Iranian COPD patients.
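Several abstracts in this listing report internal consistency as Cronbach's alpha. For orientation, a minimal sketch of the standard formula, alpha = k/(k-1) * (1 - sum of item variances / variance of total scores), is given below. The function name and the response data are hypothetical and unrelated to any of the studies listed.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondents, each a list of item scores.

    Uses sample variances (n - 1 denominator) for items and for total scores.
    """
    n_items = len(scores[0])
    if n_items < 2:
        raise ValueError("Cronbach's alpha needs at least two items")
    # Variance of each item across respondents
    item_vars = [variance([row[i] for row in scores]) for i in range(n_items)]
    # Variance of each respondent's summed score
    total_var = variance([sum(row) for row in scores])
    return (n_items / (n_items - 1)) * (1 - sum(item_vars) / total_var)


# Hypothetical responses: 5 respondents answering 3 items.
demo_scores = [
    [2, 3, 3],
    [4, 4, 5],
    [1, 2, 2],
    [5, 5, 4],
    [3, 3, 4],
]
demo_alpha = cronbach_alpha(demo_scores)
```

Alpha rises when items covary strongly relative to their individual variances, which is why it is read as a measure of internal consistency rather than of test-retest stability.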
Agreeing on Validity Arguments
ERIC Educational Resources Information Center
Sireci, Stephen G.
2013-01-01
Kane (this issue) presents a comprehensive review of validity theory and reminds us that the focus of validation is on test score interpretations and use. In reacting to his article, I support the argument-based approach to validity and all of the major points regarding validation made by Dr. Kane. In addition, I call for a simpler, three-step…
Pontes, Halley M.; Király, Orsolya; Demetrovics, Zsolt; Griffiths, Mark D.
2014-01-01
Background: Over the last decade, there has been growing concern about ‘gaming addiction’ and its widely documented detrimental impacts on a minority of individuals that play excessively. The latest (fifth) edition of the American Psychiatric Association's Diagnostic and Statistical Manual of Mental Disorders (DSM-5) included nine criteria for the potential diagnosis of Internet Gaming Disorder (IGD) and noted that it was a condition that warranted further empirical study. Aim: The main aim of this study was to develop a valid and reliable standardised psychometrically robust tool in addition to providing empirically supported cut-off points. Methods: A sample of 1003 gamers (85.2% males; mean age 26 years) from 57 different countries were recruited via online gaming forums. Validity was assessed by confirmatory factor analysis (CFA), criterion-related validity, and concurrent validity. Latent profile analysis was also carried out to distinguish disordered gamers from non-disordered gamers. Sensitivity and specificity analyses were performed to determine an empirical cut-off for the test. Results: The CFA confirmed the viability of the IGD-20 Test with a six-factor structure (salience, mood modification, tolerance, withdrawal, conflict and relapse) for the assessment of IGD according to the nine criteria from DSM-5. The IGD-20 Test proved to be valid and reliable. According to the latent profile analysis, 5.3% of the total participants were classed as disordered gamers. Additionally, an optimal empirical cut-off of 71 points (out of 100) seemed to be adequate according to the sensitivity and specificity analyses carried out. Conclusions: The present findings support the viability of the IGD-20 Test as an adequate standardised psychometrically robust tool for assessing internet gaming disorder. Consequently, the new instrument represents the first step towards unification and consensus in the field of gaming studies. PMID:25313515
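The abstract above derives a cut-off from sensitivity and specificity analyses without stating the selection rule. One common rule, used here purely as an assumption, is to pick the score that maximises Youden's J (sensitivity + specificity - 1). The sketch below uses hypothetical scores and case labels; the function names are illustrative and do not reproduce the study's procedure.

```python
def sens_spec(scores, is_case, cutoff):
    """Sensitivity and specificity when scores >= cutoff are flagged positive."""
    tp = sum(1 for s, c in zip(scores, is_case) if c and s >= cutoff)
    fn = sum(1 for s, c in zip(scores, is_case) if c and s < cutoff)
    tn = sum(1 for s, c in zip(scores, is_case) if not c and s < cutoff)
    fp = sum(1 for s, c in zip(scores, is_case) if not c and s >= cutoff)
    return tp / (tp + fn), tn / (tn + fp)


def best_cutoff(scores, is_case):
    """Observed score that maximises Youden's J (assumed selection rule)."""
    candidates = sorted(set(scores))
    return max(candidates, key=lambda c: sum(sens_spec(scores, is_case, c)) - 1)


# Hypothetical test scores and reference classifications (True = case).
demo_scores = [30, 45, 50, 62, 70, 72, 80, 90]
demo_is_case = [False, False, False, False, False, True, True, True]
demo_cutoff = best_cutoff(demo_scores, demo_is_case)
```

Other rules (for example, fixing a minimum sensitivity first) would give different cut-offs on the same data, so the rule itself should be reported alongside the chosen value.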
Li, Zhaofu; Liu, Hongyu; Luo, Chuan; Li, Yan; Li, Hengpeng; Pan, Jianjun; Jiang, Xiaosan; Zhou, Quansuo; Xiong, Zhengqin
2015-05-01
The Hydrological Simulation Program-Fortran (HSPF), which is a hydrological and water-quality computer model that was developed by the United States Environmental Protection Agency, was employed to simulate runoff and nutrient export from a typical small watershed in a hilly eastern monsoon region of China. First, a parameter sensitivity analysis was performed to assess how changes in the model parameters affect runoff and nutrient export. Next, the model was calibrated and validated using measured runoff and nutrient concentration data. The Nash-Sutcliffe efficiency (E_NS) values of the yearly runoff were 0.87 and 0.69 for the calibration and validation periods, respectively. For storm runoff events, the E_NS values were 0.93 for the calibration period and 0.47 for the validation period. Antecedent precipitation and soil moisture conditions can affect the simulation accuracy of storm event flow. The E_NS values for the total nitrogen (TN) export were 0.58 for the calibration period and 0.51 for the validation period. In addition, the correlation coefficients between the observed and simulated TN concentrations were 0.84 for the calibration period and 0.74 for the validation period. For phosphorus export, the E_NS values were 0.89 for the calibration period and 0.88 for the validation period. In addition, the correlation coefficients between the observed and simulated orthophosphate concentrations were 0.96 and 0.94 for the calibration and validation periods, respectively. The nutrient simulation results are generally satisfactory even though the parameter-lumped HSPF model cannot represent the effects of the spatial pattern of land cover on nutrient export. The model parameters obtained in this study could serve as reference values for applying the model to similar regions. In addition, HSPF can properly describe the characteristics of water quantity and quality processes in this area.
After adjustment, calibration, and validation of the parameters, the HSPF model is suitable for hydrological and water-quality simulations in watershed planning and management and for designing best management practices.
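The Nash-Sutcliffe efficiency quoted in the abstract above compares the squared simulation errors with the variance of the observations around their mean: 1 minus the ratio of the two sums of squares. A minimal sketch follows; the function name and the sample series are hypothetical and are not data from the study.

```python
def nash_sutcliffe(observed, simulated):
    """Nash-Sutcliffe efficiency: 1 - sum((obs - sim)^2) / sum((obs - mean_obs)^2)."""
    if len(observed) != len(simulated):
        raise ValueError("observed and simulated series must have equal length")
    mean_obs = sum(observed) / len(observed)
    # Squared error of the simulation against the observations
    error_ss = sum((o - s) ** 2 for o, s in zip(observed, simulated))
    # Spread of the observations around their own mean
    spread_ss = sum((o - mean_obs) ** 2 for o in observed)
    return 1 - error_ss / spread_ss


# Hypothetical runoff series (arbitrary units).
demo_observed = [1.0, 2.0, 3.0, 4.0, 5.0]
demo_simulated = [1.1, 1.9, 3.2, 3.8, 5.0]
demo_nse = nash_sutcliffe(demo_observed, demo_simulated)
```

A value of 1 means a perfect match, and 0 means the simulation is no better than predicting the observed mean, which gives some context for the gap between calibration and validation values reported for storm events.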
Psychometric properties of the Chinese Internet Gaming Disorder Scale.
Sigerson, Leif; Li, Angel Y-L; Cheung, Mike W-L; Luk, Jeremy W; Cheng, Cecilia
2017-11-01
To develop a consensus on the definition and measurement of Internet gaming disorder (IGD), several recent studies have used the DSM-5's proposed criteria for IGD as the basis in scale construction. This study contributes to this emerging consensus by developing and validating a new Chinese Internet Gaming Disorder Scale (C-IGDS) based on the DSM-5 criteria. A representative sample of Hong Kong community adults (n=502, 50% men, mean age=37.1, age range=18-60) was recruited for a telephone survey with random digit dialing. Various statistical techniques were used to assess the psychometric properties of the C-IGDS. The C-IGDS had good reliability (Cronbach's α=0.91) and structural validity (CFA model fit: RMSEA=0.027, CFI=0.991, TLI=0.988) in our sample. Moderate to moderately strong correlations with depressive symptoms (r=0.617, p<0.001), social anxiety symptoms (r=0.366, p<0.001), and gaming hours (r=0.412, p<0.001) supported the criterion validity of the C-IGDS. In addition, the C-IGDS exhibited strict measurement invariance for sex and at least strong measurement invariance for age. In addition to providing the first Chinese scale for measuring IGD based on the DSM-5's proposed criteria, this study provides empirical support for the validity of these diagnostic criteria as the basis for a universal measure of IGD. Most important, this study is the first to reveal the criteria's measurement invariance, thereby indicating their suitability for use with diverse demographic groups. Copyright © 2017 Elsevier Ltd. All rights reserved.
Shape Optimization by Bayesian-Validated Computer-Simulation Surrogates
NASA Technical Reports Server (NTRS)
Patera, Anthony T.
1997-01-01
A nonparametric-validated, surrogate approach to optimization has been applied to the computational optimization of eddy-promoter heat exchangers and to the experimental optimization of a multielement airfoil. In addition to the baseline surrogate framework, a surrogate-Pareto framework has been applied to the two-criteria, eddy-promoter design problem. The Pareto analysis improves the predictability of the surrogate results, preserves generality, and provides a means to rapidly determine design trade-offs. Significant contributions have been made in the geometric description used for the eddy-promoter inclusions as well as to the surrogate framework itself. A level-set based, geometric description has been developed to define the shape of the eddy-promoter inclusions. The level-set technique allows for topology changes (from single-body, eddy-promoter configurations to two-body configurations) without requiring any additional logic. The continuity of the output responses for input variations that cross the boundary between topologies has been demonstrated. Input-output continuity is required for the straightforward application of surrogate techniques in which simplified, interpolative models are fitted through a construction set of data. The surrogate framework developed previously has been extended in a number of ways. First, the formulation for a general, two-output, two-performance metric problem is presented. Surrogates are constructed and validated for the outputs. The performance metrics can be functions of both outputs, as well as explicitly of the inputs, and serve to characterize the design preferences. By segregating the outputs and the performance metrics, an additional level of flexibility is provided to the designer. The validated outputs can be used in future design studies and the error estimates provided by the output validation step still apply, and require no additional appeals to the expensive analysis.
Second, a candidate-based a posteriori error analysis capability has been developed which provides probabilistic error estimates on the true performance for a design randomly selected near the surrogate-predicted optimal design.
Kavussanu, Maria; Stanger, Nicholas; Boardley, Ian D
2013-01-01
The purpose of this research was to provide further evidence for the construct validity (i.e., convergent, concurrent, and discriminant validity) of the Prosocial and Antisocial Behaviour in Sport Scale (PABSS), an instrument that has four subscales measuring prosocial and antisocial behaviour toward teammates and opponents. We also investigated test-retest reliability and stability of the PABSS. We conducted three studies using athletes from a variety of team sports. In Study 1, participants (N = 129) completed the PABSS and measures of physical and verbal aggression, hostility, anger, moral identity, and empathy; a sub-sample (n = 111) also completed the PABSS one week later. In Study 2, in addition to the PABSS, participants (N = 89) completed measures of competitive aggressiveness and anger, moral attitudes, moral disengagement, goal orientation, and anxiety. In Study 3, participants (N = 307) completed the PABSS and a measure of social goals. Across the three studies, the four subscales evidenced the hypothesised relationships with a number of variables. Correlations were large between the two antisocial behaviours and small between the two prosocial behaviours. Overall, the findings supported the convergent, concurrent, and discriminant validity of the scale, provided evidence for its test-retest reliability and stability, and suggest that the instrument is a valid and reliable measure of prosocial and antisocial behaviour in sport.
Kluemper, Donald H; McLarty, Benjamin D; Bing, Mark N
2015-01-01
It is widely established that the Big Five personality traits of conscientiousness, agreeableness, and emotional stability are antecedents to workplace deviance (Berry, Ones, & Sackett, 2007). However, these meta-analytic findings are based on self-reported personality traits. A recent meta-analysis by Oh, Wang, and Mount (2011) identified the value of acquaintance-reported personality in the prediction of job performance. The current investigation extends prior work by comparing the validities of self- and acquaintance-reported personality in the prediction of workplace deviance across 2 studies. We also hypothesized and tested an interactive, value-added integration of self- with acquaintance-reported personality using socioanalytic personality theory (R. T. Hogan, 1991). Both studies assessed self- and acquaintance-rated Big Five traits, along with supervisor-rated workplace deviance. However, the studies varied the measures of workplace deviance, and the 2nd study also included a self-rated workplace deviance criterion for additional comparison. Across both studies, the traits of conscientiousness and agreeableness were strong predictors of workplace deviance, and acquaintance-reported personality provided incremental validity beyond self-reports. Additionally, acquaintance-reported conscientiousness and agreeableness moderated the prediction of workplace deviance by interacting with the corresponding self-reported traits. Implications for personality theory and measurement are discussed along with applications for practice. (c) 2015 APA, all rights reserved.
Development and Validation of the Pediatric Diabetes Routines Questionnaire for Adolescents.
Pierce, Jessica S; Jordan, Sara S; Arnau, Randolph C
2018-04-06
This study describes the development and psychometric evaluation of an adolescent self-report version of the Pediatric Diabetes Routines Questionnaire (PDRQ:A), a measure of diabetes-specific routines for youth with type 1 diabetes, and further validation of the parent-version (PDRQ:P) in an adolescent sample. Participants included 120 parent-adolescent dyads (ages 12-17) and an additional 24 parents who completed measures of diabetes-specific adolescent routines, general adolescent routines, diabetes self-care, and family support of youth diabetes care. The PDRQ:P/A demonstrated good internal consistency, test-retest reliability, and parent-child agreement, and adequate validity coefficients. Confirmatory factor analysis supported a one-factor model. Promising results were obtained. The PDRQ:P/A is a clinically feasible parent- and self-report measure that can provide valuable information regarding how frequently adolescents engage in their diabetes management tasks in a consistent manner. Addition of an adolescent report format will enhance the utility of the measure for clinical and research use.
Harris, Joshua D; Erickson, Brandon J; Cvetanovich, Gregory L; Abrams, Geoffrey D; McCormick, Frank M; Gupta, Anil K; Verma, Nikhil N; Bach, Bernard R; Cole, Brian J
2014-02-01
Condition-specific questionnaires are important components in evaluation of outcomes of surgical interventions. No condition-specific study methodological quality questionnaire exists for evaluation of outcomes of articular cartilage surgery in the knee. To develop a reliable and valid knee articular cartilage-specific study methodological quality questionnaire. Cross-sectional study. A stepwise, a priori-designed framework was created for development of a novel questionnaire. Relevant items to the topic were identified and extracted from a recent systematic review of 194 investigations of knee articular cartilage surgery. In addition, relevant items from existing generic study methodological quality questionnaires were identified. Items for a preliminary questionnaire were generated. Redundant and irrelevant items were eliminated, and acceptable items modified. The instrument was pretested and items weighed. The instrument, the MARK score (Methodological quality of ARticular cartilage studies of the Knee), was tested for validity (criterion validity) and reliability (inter- and intraobserver). A 19-item, 3-domain MARK score was developed. The 100-point scale score demonstrated face validity (focus group of 8 orthopaedic surgeons) and criterion validity (strong correlation to Cochrane Quality Assessment score and Modified Coleman Methodology Score). Interobserver reliability for the overall score was good (intraclass correlation coefficient [ICC], 0.842), and for all individual items of the MARK score, acceptable to perfect (ICC, 0.70-1.000). Intraobserver reliability ICC assessed over a 3-week interval was strong for 2 reviewers (≥0.90). The MARK score is a valid and reliable knee articular cartilage condition-specific study methodological quality instrument. This condition-specific questionnaire may be used to evaluate the quality of studies reporting outcomes of articular cartilage surgery in the knee.
Shea, Beverley J; Grimshaw, Jeremy M; Wells, George A; Boers, Maarten; Andersson, Neil; Hamel, Candyce; Porter, Ashley C; Tugwell, Peter; Moher, David; Bouter, Lex M
2007-02-15
Our objective was to develop an instrument to assess the methodological quality of systematic reviews, building upon previous tools, empirical evidence and expert consensus. A 37-item assessment tool was formed by combining 1) the enhanced Overview Quality Assessment Questionnaire (OQAQ), 2) a checklist created by Sacks, and 3) three additional items recently judged to be of methodological importance. This tool was applied to 99 paper-based and 52 electronic systematic reviews. Exploratory factor analysis was used to identify underlying components. The results were considered by methodological experts using a nominal group technique aimed at item reduction and design of an assessment tool with face and content validity. The factor analysis identified 11 components. From each component, one item was selected by the nominal group. The resulting instrument was judged to have face and content validity. A measurement tool for the 'assessment of multiple systematic reviews' (AMSTAR) was developed. The tool consists of 11 items and has good face and content validity for measuring the methodological quality of systematic reviews. Additional studies are needed with a focus on the reproducibility and construct validity of AMSTAR, before strong recommendations can be made on its use.
Development and validation of the Vietnamese primary care assessment tool.
Hoa, Nguyen Thi; Tam, Nguyen Minh; Peersman, Wim; Derese, Anselme; Markuns, Jeffrey F
2018-01-01
To adapt the consumer version of the Primary Care Assessment Tool (PCAT) for Vietnam and determine its internal consistency and validity. A quantitative cross sectional study. 56 communes in 3 representative provinces of central Vietnam. Total of 3289 people who used health care services at health facility at least once over the past two years. The Vietnamese adult expanded consumer version of the PCAT (VN PCAT-AE) is an instrument for evaluation of primary care in Vietnam with 70 items comprising six scales representing four core primary care domains, and three additional scales representing three derivative domains. Sixteen other items from the original tool were not included in the final instrument, due to problems with missing values, floor or ceiling effects, and item-total correlations. All the retained scales have a Cronbach's alpha above 0.70 except for the subscale of Family Centeredness. The VN PCAT-AE demonstrates adequate internal consistency and validity to be used as an effective tool for measuring the quality of primary care in Vietnam from the consumer perspective. Additional work in the future to optimize valid measurement in all domains consistent with the original version of the tool may be helpful as the primary care system in Vietnam further develops.
Mötteli, S; Barbey, J; Keller, C; Bucher, T; Siegrist, M
2016-04-01
As a high-quality diet is associated with a lower risk for several diseases and all-cause mortality, current nutrition education tools provide people with information regarding how to build a healthy and a balanced meal. To assess this basic nutrition knowledge, the research aim was to develop and validate a brief scale to measure the Practical Knowledge about Balanced meals (PKB-7). A pool of 25 items was pretested with experts and laypeople before being tested on a random sample in Switzerland (n=517). For item selection, a Rasch model analysis was applied. The validity and reliability of the new scale were assessed by three additional studies including laypeople (n=597; n=145) and nutrition experts (n=59). The final scale consists of seven multiple-choice items, which met the assumptions of the Rasch model. The validity of the new scale was shown by several aspects: the Rasch model was replicated in a second study, and nutrition experts achieved significantly higher scores than laypeople (t(148)=20.27, P<0.001, d=1.78). In addition, the PKB-7 scale was correlated with other nutrition-related constructs and associated with reported vegetable consumption. Test-retest reliability (r=0.68, P<0.001) was acceptable. The PKB-7 scale is a reliable and a valid Rasch-based instrument in Swiss citizens aged between 18 and 80 years for measuring the practical knowledge about balanced meals based on current dietary guidelines. This brief and easy-to-use scale is intended for application in both research and practice.
Nigg, Claudio R; Motl, Robert W; Horwath, Caroline; Dishman, Rod K
2012-01-01
Objectives: Physical activity (PA) research applying the Transtheoretical Model (TTM) to examine group differences and/or change over time requires preliminary evidence of factorial validity and invariance. The current study examined the factorial validity and longitudinal invariance of TTM constructs recently revised for PA. Method: Participants from an ethnically diverse sample in Hawaii (N=700) completed questionnaires capturing each TTM construct. Results: Factorial validity was confirmed for each construct using confirmatory factor analysis with full-information maximum likelihood. Longitudinal invariance was evidenced across a shorter (3-month) and longer (6-month) time period via nested model comparisons. Conclusions: The questionnaires for each validated TTM construct are provided, and can now be generalized across similar subgroups and time points. Further validation of the provided measures is suggested in additional populations and across extended time points. PMID:22778669
NASA Astrophysics Data System (ADS)
Astutik, J.
2017-02-01
Food additives are materials that cannot be separated from the lives of students and the community. A preliminary questionnaire indicated a lack of kits supporting the teaching of food additives in some schools and communities. The objectives of this development research are (1) to develop an experiment kit (SAYOFU KIT) and supplementary books to improve student learning outcomes in the classroom and public awareness of food additives, (2) to describe the feasibility and potential effectiveness of the SAYOFU KIT developed, and (3) to analyze the practicality of the SAYOFU KIT and its benefits for students and the community. This development study uses the 4-D model of Thiagarajan et al. (1974), which proceeds through four stages: defining, designing, developing and disseminating, and involves the students and the community. The developed SAYOFU KIT includes an additives sample kit, a borax test kit, a curcumin test kit, a formaldehyde test kit, a modified heater for the identification of dyes, and dye test paper. The study was conducted at SMP Plus Hidayatul Mubtadiin and TKIT Al Uswah. The products were validated by experts and education practitioners. Qualitative data were processed using a descriptive method, and quantitative data using the N-gain. Expert validation of the SAYOFU KIT with supplementary books averaged 76.50% for the teacher’s book and 76.30% for the student’s book, both rated eligible. Validation by education practitioners averaged 96.81%, the small-group pilot 83.15%, and the field trials 82.89%, all rated very decent. Student questionnaire responses to the SAYOFU KIT and supplementary book averaged 87.6%, rated very worthy. The N-gain for cognitive achievement was 0.56, in the "enough" category. A public poll showed that 95% of respondents felt the SAYOFU KIT was beneficial for testing food. 
These results indicate that the SAYOFU KIT developed is feasible, practical, and useful for supporting inquiry learning and improving student learning outcomes as well as public awareness of food additives.
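The abstract above summarises learning improvement with an N-gain value. As a minimal sketch, assuming the usual normalized-gain definition (post minus pre, divided by the maximum possible score minus pre) and the conventional bands of low (below 0.3), medium (0.3 up to 0.7) and high (0.7 and above), the calculation can be written as follows. The function names and scores are hypothetical; the abstract does not state whether the gain was computed per student or from class averages.

```python
def normalized_gain(pre, post, max_score=100.0):
    """Normalized gain: achieved improvement as a share of the possible improvement."""
    if pre >= max_score:
        raise ValueError("pre-test score leaves no room for improvement")
    return (post - pre) / (max_score - pre)


def gain_category(gain):
    """Conventional bands (assumed): high >= 0.7, medium >= 0.3, otherwise low."""
    if gain >= 0.7:
        return "high"
    if gain >= 0.3:
        return "medium"
    return "low"


# Hypothetical class averages on a 100-point test.
demo_gain = normalized_gain(pre=40.0, post=70.0)
demo_band = gain_category(demo_gain)
```

Under these assumed bands, a gain in the middle range would correspond to the "enough" category mentioned in the abstract.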
Observing Parent Behavior: Reconciling Theoretical Concepts with Empirical Reality.
ERIC Educational Resources Information Center
Ge, Xiaojia
Using data from the Iowa Youth and Families Project, this longitudinal study investigated the predictive validity of different dimensions of observed parent behavior on adolescent externalizing (aggression, hostility) and internalizing (depression, anxiety) problems over a 2-year period. In addition, the study examined how observer ratings…
NASA Astrophysics Data System (ADS)
Liu, Jia; Liu, Longli; Xue, Yong; Dong, Jing; Hu, Yingcui; Hill, Richard; Guang, Jie; Li, Chi
2017-01-01
Workflow for remote sensing quantitative retrieval is the "bridge" between Grid services and Grid-enabled application of remote sensing quantitative retrieval. Workflow averts low-level implementation details of the Grid and hence enables users to focus on higher levels of application. The workflow for remote sensing quantitative retrieval plays an important role in remote sensing Grid and Cloud computing services, which can support the modelling, construction and implementation of large-scale complicated applications of remote sensing science. The validation of workflow is important in order to support the large-scale sophisticated scientific computation processes with enhanced performance and to minimize potential waste of time and resources. To research the semantic correctness of user-defined workflows, in this paper, we propose a workflow validation method based on tacit knowledge research in the remote sensing domain. We first discuss the remote sensing model and metadata. Through detailed analysis, we then discuss the method of extracting the domain tacit knowledge and expressing the knowledge with ontology. Additionally, we construct the domain ontology with Protégé. Through our experimental study, we verify the validity of this method in two ways, namely data source consistency error validation and parameters matching error validation.
Wang, Meng-Cheng; Gao, Yu; Deng, Jiaxin; Lai, Hongyu; Deng, Qiaowen; Armour, Cherie
2017-01-01
The current study assesses the factor structure and construct validity of the self-reported Inventory of Callous-Unemotional Traits (ICU) in 637 Chinese community adults (mean age = 25.98, SD = 5.79). A series of theoretical models proposed in previous studies were tested through confirmatory factor analyses. Results indicated that a shortened form that consists of 11 items (ICU-11) to assess callousness and uncaring factors has excellent overall fit. Additionally, correlations with a wide range of external variables demonstrated that this shortened form has similar construct validity compared to the original ICU. In conclusion, our findings suggest that the ICU-11 may be a promising self-report tool that could be a good substitute for the original form to assess callous-uncaring traits in adults.
Lilienfeld, S O; Andrews, B P
1996-06-01
Research on psychopathy has been hindered by persisting difficulties and controversies regarding its assessment. The primary goals of this set of studies were to (a) develop, and initiate the construct validation of, a self-report measure that assesses the major personality traits of psychopathy in noncriminal populations and (b) clarify the nature of these traits via an exploratory approach to test construction. This measure, the Psychopathic Personality Inventory (PPI), was developed by writing items to assess a large number of personality domains relevant to psychopathy and performing successive item-level factor analyses and revisions on three undergraduate samples. The PPI total score and its eight subscales were found to possess satisfactory internal consistency and test-retest reliability. In four studies with undergraduates, the PPI and its subscales exhibited a promising pattern of convergent and discriminant validity with self-report, psychiatric interview, observer rating, and family history data. In addition, the PPI total score demonstrated incremental validity relative to several commonly used self-report psychopathy-related measures. Future construct validation studies, unresolved conceptual issues regarding the assessment of psychopathy, and potential research uses of the PPI are outlined.
Nazary-Moghadam, Salman; Zeinalzadeh, Afsaneh; Salavati, Mahyar; Almasi, Simin; Negahban, Hossein
2017-01-01
The aim of the present study was to culturally adapt and evaluate the reliability and validity of the Health Assessment Questionnaire-Disability Index (HAQ-DI) in Iranian patients with rheumatoid arthritis (RA). The validation study included 234 patients with RA, and the reliability study included 86 participants. Test-retest relative reliability and internal consistency of the Persian version of the HAQ-DI (PHAQ-DI) were examined by intraclass correlation coefficient (ICC) and Cronbach's alpha, respectively. Additionally, HAQ-DI construct validity (Spearman's correlation) was examined using the Persian version of the Short-Form 36 Health Survey (SF-36) and activity and severity parameters. The PHAQ-DI total score showed excellent test-retest reliability (ICC = 0.98) and internal consistency (Cronbach's alpha = 0.95). Spearman's correlations between the total PHAQ-DI score and activity and severity parameters were above 0.55. Correlations between the PHAQ-DI and SF-36 Physical Health were higher than those with SF-36 Mental Health. The Persian version of the HAQ-DI is a reliable and valid culturally adapted instrument for measuring functional limitations in Iranian people with RA. Copyright © 2016 Elsevier Ltd. All rights reserved.
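Internal consistency figures such as the Cronbach's alpha reported above follow from a short formula, alpha = k/(k-1) * (1 - sum of item variances / variance of total scores). The sketch below is illustrative only: the function name and the toy response matrix are assumptions, not data or code from the study.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondents, each a list of k item scores.

    Hypothetical helper for illustration; not taken from the study.
    """
    k = len(scores[0])
    # Transpose so that each tuple holds one item's scores across respondents.
    items = list(zip(*scores))
    item_variance_sum = sum(variance(item) for item in items)
    total_variance = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - item_variance_sum / total_variance)


# Toy data (assumed): three respondents answering two items.
toy = [[1, 2], [2, 2], [3, 4]]
alpha = cronbach_alpha(toy)
```

When every item ranks respondents identically, the total-score variance is as large as it can be relative to the item variances and alpha reaches 1.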
Development of Airport Surface Required Navigation Performance (RNP)
NASA Technical Reports Server (NTRS)
Cassell, Rick; Smith, Alex; Hicok, Dan
1999-01-01
The U.S. and international aviation communities have adopted the Required Navigation Performance (RNP) process for defining aircraft performance when operating in the en-route, approach and landing phases of flight. RNP consists primarily of the following key parameters: accuracy, integrity, continuity, and availability. The processes and analytical techniques employed to define en-route, approach and landing RNP have been applied in the development of RNP for the airport surface. To validate the proposed RNP requirements, several methods were used. Operational and flight demonstration data were analyzed for conformance with proposed requirements, as were several aircraft flight simulation studies. The pilot failure risk component was analyzed through several hypothetical scenarios. Additional simulator studies are recommended to better quantify crew reactions to failures, as well as additional simulator and field testing to validate achieved accuracy performance. This research was performed in support of the NASA Low Visibility Landing and Surface Operations Programs.
Anestis, Joye C; Finn, Jacob A; Gottfried, Emily; Arbisi, Paul A; Joiner, Thomas E
2015-06-01
This study examined the utility of the Minnesota Multiphasic Personality Inventory-2 Restructured Form (MMPI-2-RF) Validity Scales in prediction of premature termination in a sample of 511 individuals seeking services from a university-based psychology clinic. Higher scores on True Response Inconsistency-Revised and Infrequent Psychopathology Responses increased the risk of premature termination, whereas higher scores on Adjustment Validity lowered the risk of premature termination. Additionally, when compared with individuals who did not prematurely terminate, individuals who prematurely terminated treatment had lower Global Assessment of Functioning scores at both intake and termination and made fewer improvements. Implications of these findings for the use of the MMPI-2-RF Validity Scales in promoting treatment compliance are discussed. © The Author(s) 2014.
Goldschmidt, Andrea B.
2017-01-01
Background: Binge eating is a marker of weight gain and obesity, and a hallmark feature of eating disorders. Yet, its component constructs, overeating and loss of control (LOC) while eating, are poorly understood and difficult to measure. Objective: To critically review the human literature concerning the validity of LOC and overeating across the age and weight spectrum. Data sources: English-language articles addressing the face, convergent, discriminant, and predictive validity of LOC and overeating were included. Results: LOC and overeating appear to have adequate face validity. Emerging evidence supports the convergent and predictive validity of the LOC construct, given its unique cross-sectional and prospective associations with numerous anthropometric, psychosocial, and eating behavior-related factors. Overeating may be best conceptualized as a marker of excess weight status. Limitations: Binge eating constructs, particularly in the context of subjectively large episodes, are challenging to measure reliably. Few studies addressed overeating in the absence of LOC, thereby limiting conclusions about the validity of the overeating construct independent of LOC. Additional studies addressing the discriminant validity of both constructs are warranted. Discussion: Suggestions for future weight-related research and for appropriately defining binge eating in the eating disorders diagnostic scheme are presented. PMID:28165655
Langer, Álvaro I; Ulloa, Valentina G; Aguilar-Parra, José M; Araya-Véliz, Claudio; Brito, Gonzalo
2016-03-31
Recent studies have associated positive emotions with several variables such as learning, coping strategies or assertive behaviour. The concept of gratitude has been specifically defined as a tendency to recognise and respond to people or situations with grateful emotion. Unfortunately, in Latin America no validated measures of gratitude in different populations are available. The aim of this study was to analyse the psychometric properties of the Gratitude Questionnaire (GQ-6) in two Chilean samples. Two studies were conducted: the first with 668 high school adolescents (390 women and 278 men, with ages ranging between 12 and 20, and a mean age of 15.54 ± 1.22) and the second with 331 adults (231 women and 100 men, with an average age of 37.59 ± 12.6). An analysis of the psychometric properties of the GQ-6 scale was performed to determine the validity and reliability of the instrument in Chilean adolescents and adults. Bivariate correlations, multiple regression analyses, exploratory factor analysis (EFA) and Monte Carlo simulations were carried out. Finally, a confirmatory factor analysis (CFA) was performed. A single-factor solution was found in both studies, with a 5-item version for the adolescents and a 6-item version for adults. This factorial solution was invariant across genders. Reliability of the GQ was adequate in both samples (using Cronbach's alpha coefficient). In addition, convergent and discriminant validity were assessed. Additionally, a negative correlation between the GQ-5 and depression in adolescents and a positive correlation between the GQ-6 and happiness in adults were found. The GQ is a suitable measure for evaluating a person's disposition toward gratitude in Chilean adolescents and adults. This instrument may contribute to the advancement of the study of positive emotions in Latin America.
Validation of the Thai version of Bergen Facebook addiction scale (Thai-BFAS).
Phanasathit, Muthita; Manwong, Mereerat; Hanprathet, Nitt; Khumsri, Jiraporn; Yingyeun, Rungmanee
2015-03-01
To determine the validity of the Thai version of the Bergen Facebook Addiction Scale (Thai-BFAS) using Facebook addiction screening in Thai high school students. The original BFAS was authorized for translation and validation in Thai. After content validity and usability were approved by three Thai psychiatrists, the Thai-BFAS was adjusted again by the authors and back translated by an English expert. This final version was investigated using the internal consistency method among 874 high school students in Bangkok, Chiang Mai, Ubon Ratchathani and Songkhla, and confirmatory factor analysis (CFA) was employed to prove that the six-component model could be representative of addiction behaviors. In addition, test-retest reliability was performed separately among 30 pilot high school students in Bangkok. The Thai-BFAS has six items, each scored on a 5-point scale, with a total score ranging from 0 to 24; the cutoff score for Facebook addiction is at least 12 points. The Cronbach's alpha coefficient was 0.91 (95% CI; 0.90, 0.92) and the inter-class correlation coefficient was 0.80 (95% CI; 0.49, 0.92). The CFA showed that the six items accurately represent the six-component model of addiction: salience, mood modification, tolerance, withdrawal, relapse and conflict. The Thai-BFAS is consistent as a screening test for Facebook addiction among high school students due to good reliability and validity. It also conforms well to the original version. The six items in the Thai-BFAS are a good representation of the addiction behaviors. Further studies should be undertaken on sensitivity and specificity when compared with other similar tests of addiction as well as in various additional populations and circumstances.
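The scoring rule described above (six items, total score from 0 to 24, screening cutoff of at least 12) can be written out as a small sketch. It assumes each item is scored 0 to 4, which is inferred from the stated total range rather than given in the abstract; the function names are hypothetical.

```python
def thai_bfas_total(item_scores):
    """Sum six item scores, each assumed to lie in 0..4 (inferred, not stated)."""
    if len(item_scores) != 6:
        raise ValueError("expected exactly six item scores")
    if any(score not in range(5) for score in item_scores):
        raise ValueError("each item score must be an integer from 0 to 4")
    return sum(item_scores)


def screens_positive(item_scores, cutoff=12):
    """Apply the reported cutoff: a total of at least 12 points screens positive."""
    return thai_bfas_total(item_scores) >= cutoff
```

For example, a respondent answering 2 on every item totals 12 and screens positive, while lowering any one answer to 1 gives 11 and does not.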
The reliability and validity of the Maryland Assessment of Recovery in Serious Mental Illness Scale.
Drapalski, Amy L; Medoff, Deborah; Dixon, Lisa; Bellack, Alan
2016-05-30
The current study aims to further evaluate the psychometric properties of the Maryland Assessment of Recovery in Serious Mental Illness (MARS), a relatively new instrument designed to assess personal recovery status in individuals with serious mental illness. Two hundred and fifty individuals with serious mental illness receiving outpatient mental health treatment completed a baseline assessment which included the MARS and measures to assess recovery-related constructs, clinical outcomes, and social and community functioning. The MARS demonstrated excellent internal consistency and test-retest reliability. Good construct validity was evidenced by strong positive relationships between the MARS and recovery-related constructs (e.g. hope, empowerment, self-efficacy, and personal agency) and a strong negative relationship with self-stigma. Divergent validity was demonstrated by weaker relationships with cognitive and social functioning. The confirmatory factor analysis did not confirm the unitary factor structure found in previous research. Given the equivocal result of the CFA, additional exploratory work is needed to determine if a more complex factor structure is present. This study provides additional support for the psychometric soundness of the MARS and, subsequently, its potential use as a measure of personal recovery status in people with serious mental illness. Published by Elsevier Ireland Ltd.
Bulfone, Giampiera; Fida, Roberta; Ghezzi, Valerio; Macale, Loreana; Sili, Alessandro; Alvaro, Rosaria; Palese, Alvisa
Student perceptions of self-efficacy (SE) prevent stress and burnout and improve engagement in nursing education, thus increasing learning outcomes. The study aims were to (1) validate a scale measuring nursing SE in psychomotor skills (NSE-PS), (2) describe changes in NSE-PS over time, and (3) explore NSE-PS correlations with burnout and engagement. A total of 1117 nursing students participated. A significant increase in the NSE-PS scores over the years has emerged; in addition, all NSE-PS dimensions were correlated negatively with burnout and positively with engagement.
Automation Hooks Architecture for Flexible Test Orchestration - Concept Development and Validation
NASA Technical Reports Server (NTRS)
Lansdowne, C. A.; Maclean, John R.; Winton, Chris; McCartney, Pat
2011-01-01
The Automation Hooks Architecture Trade Study for Flexible Test Orchestration sought a standardized data-driven alternative to conventional automated test programming interfaces. The study recommended composing the interface using multicast DNS (mDNS/SD) service discovery, Representational State Transfer (Restful) Web Services, and Automatic Test Markup Language (ATML). We describe additional efforts to rapidly mature the Automation Hooks Architecture candidate interface definition by validating it in a broad spectrum of applications. These activities have allowed us to further refine our concepts and provide observations directed toward objectives of economy, scalability, versatility, performance, severability, maintainability, scriptability and others.
John, Majnu; Lencz, Todd; Malhotra, Anil K; Correll, Christoph U; Zhang, Jian-Ping
2018-06-01
Meta-analysis of genetic association studies is being increasingly used to assess phenotypic differences between genotype groups. When the underlying genetic model is assumed to be dominant or recessive, assessing the phenotype differences based on summary statistics, reported for individual studies in a meta-analysis, is a valid strategy. However, when the genetic model is additive, a similar strategy based on summary statistics will lead to biased results. This fact about the additive model is one of the things that we establish in this paper, using simulations. The main goal of this paper is to present an alternate strategy for the additive model based on simulating data for the individual studies. We show that the alternate strategy is far superior to the strategy based on summary statistics.
Spanish validation of the Negative Symptom Assessment-16 (NSA-16) in patients with schizophrenia.
Garcia-Alvarez, Leticia; Garcia-Portilla, María Paz; Saiz, Pilar Alejandra; Fonseca-Pedrero, Eduardo; Bobes-Bascaran, María Teresa; Gomar, Jesús; Muñiz, José; Bobes, Julio
2018-04-05
Negative symptoms are prevalent in schizophrenia and associated with a poorer outcome. Validated newer psychometric instruments could contribute to better assessment and improved treatment of negative symptoms. The Negative Symptom Assessment-16 (NSA-16) has been shown to have strong psychometric properties, but there is a need for validation in non-English languages. This study aimed to examine the psychometric properties of a Spanish version of the NSA-16 (Sp-NSA-16). This was an observational, cross-sectional validation study in a sample of 123 outpatients with schizophrenia. The instruments used were the NSA-16, PANSS, HDRS, CGI-SCH and PSP. The results indicate appropriate psychometric properties, high internal consistency (Cronbach's alpha=0.86), convergent validity (PANSS negative scale, PANSS Marder Negative Factor and CGI-negative symptoms r values between 0.81 and 0.94) and divergent validity (PANSS positive scale and the HDRS r values between 0.10 and 0.34). In addition, the NSA-16 also exhibited discriminant validity (area under the ROC curve=0.97, 95% CI=0.94 to 1.00; 94.3% sensitivity and 83.3% specificity). The Sp-NSA-16 is reliable and valid for measuring negative symptoms in patients with schizophrenia. This provides Spanish clinicians with a new tool for clinical practice and research. However, it is necessary to provide further information about its inter-rater reliability. Copyright © 2018 SEP y SEPB. Publicado por Elsevier España, S.L.U. All rights reserved.
Hubert, C; Houari, S; Rozet, E; Lebrun, P; Hubert, Ph
2015-05-22
When using an analytical method, defining an analytical target profile (ATP) focused on quantitative performance represents a key input, and this will drive the method development process. In this context, two case studies were selected in order to demonstrate the potential of a quality-by-design (QbD) strategy when applied to two specific phases of the method lifecycle: the pre-validation study and the validation step. The first case study focused on the improvement of a liquid chromatography (LC) coupled to mass spectrometry (MS) stability-indicating method by the means of the QbD concept. The design of experiments (DoE) conducted during the optimization step (i.e. determination of the qualitative design space (DS)) was performed a posteriori. Additional experiments were performed in order to simultaneously conduct the pre-validation study to assist in defining the DoE to be conducted during the formal validation step. This predicted protocol was compared to the one used during the formal validation. A second case study based on the LC/MS-MS determination of glucosamine and galactosamine in human plasma was considered in order to illustrate an innovative strategy allowing the QbD methodology to be incorporated during the validation phase. An operational space, defined by the qualitative DS, was considered during the validation process rather than a specific set of working conditions as conventionally performed. Results of all the validation parameters conventionally studied were compared to those obtained with this innovative approach for glucosamine and galactosamine. Using this strategy, qualitative and quantitative information were obtained. Consequently, an analyst using this approach would be able to select with great confidence several working conditions within the operational space rather than a given condition for the routine use of the method. This innovative strategy combines both a learning process and a thorough assessment of the risk involved. 
Copyright © 2015 Elsevier B.V. All rights reserved.
2015-11-01
collected. We determined that the methodologies were valid and the data were reliable for our purposes. In addition, we interviewed DOD officials and...of contracts and cost of labor involved in preparing program documentation, to arrive at the estimates for savings. To validate the data used in the...studies to be reasonable, and the data were sufficiently reliable for our purposes. We interviewed officials from DOD and 5 of the 12 Test Program
Scaling a Single Attribute: A Methodological Study of Conservation
ERIC Educational Resources Information Center
Hofmann, Richard J.; Trepanier, Mary
1975-01-01
This study was designed to assess the acquisition of conservation of number on equal addition tasks through scalogram analysis to determine if this analysis defines a scale or continuum. Ten block tasks administered to 85 kindergarten children validated Piaget's theory that cognitive development is sequential and continuous. (Author/ED)
Measuring and Enhancing Creativity
ERIC Educational Resources Information Center
Mahboub, Kamyar C.; Portillo, Margaret B.; Liu, Yinhui; Chandraratna, Susantha
2004-01-01
The purpose of this study was to assess ways by which creativity may be enhanced in a design-oriented course. In order to demonstrate the validity of the approach, a statistically based study was employed. Additionally, the experiment was replicated in two design-oriented fields at the University of Kentucky. These fields were civil engineering…
The Windsor, Ontario Exposure Assessment Study evaluated the contribution of ambient air pollutants to personal and indoor exposures of adults and asthmatic children living in Windsor, Ontario, Canada. In addition, the role of personal, indoor, and outdoor air pollution exposures...
Garcia, Antonio F.; Acosta, Melina; Pirani, Saifa; Edwards, Daniel; Osman, Augustine
2017-01-01
We describe 2 studies designed to evaluate scores on the Multidimensional Shame-related Response Inventory-21 (MSRI-21), a recently developed instrument that measures affective and behavioral responses to shame. The inventory assesses shame-related responses in 3 categories: negative self-evaluation, fear of social consequences, and maladaptive behavior tendency. For Study 1, (N = 743) undergraduates completed the MSRI-21. Confirmatory factor analysis supported the validity of the MSRI-21 3-factor structure. Latent variable modeling of coefficient-α provided strong evidence for the internal consistency of scores on each scale. In Study 2, (N = 540) undergraduates completed the instrument along with 5 concurrent measures chosen for clinical significance. Achievement of factorial invariance supported the use of MSRI-21 scale scores to make valid mean comparisons across gender. In addition, MSRI-21 scale scores were associated as expected with scores on measures of self-harm, suicide, and other risk factors. Taken together, results of 2 studies support the internal consistency reliability, factorial validity, factorial invariance, and convergent validity of scores on the MSRI-21. Further work is needed to assess the temporal stability of the MSRI-21 scale scores, invariance across clinical status and other groupings, item-level measurement properties, and viability in highly symptomatic samples. PMID:28182490
Hashim, Hairul Anuar; Shaharuddin, Saidatin Sabiyah; Hamidan, Shazarina; Grove, J Robert
2017-02-01
This study examined psychometric properties of a Malaysian-language Sport Anxiety Scale-2 (SAS-2) in three separate studies. Study 1 examined the criterion validity and internal consistency of SAS-2 among 119 developmental hockey players. Measures of trait anxiety and mood states along with digit vigilance, choice reaction time, and depth perception tests were administered. Regression analysis revealed that somatic anxiety and concentration disruption were significantly associated with sustained attention. Worry was significantly associated with depth perception but not sustained attention. Pearson correlation coefficients also revealed significant relationships between SAS-2 subscales and negative mood state dimensions. Study 2 examined the convergent and discriminant validity of SAS-2 by correlating it with state anxiety measured by the CSAI-2R. Significant positive relationships were obtained between SAS-2 subscales and somatic and cognitive state anxiety. Conversely, state self-confidence was negatively related to SAS-2 subscales. In addition, significant differences were observed between men and women in somatic anxiety. Study 3 examined the factorial validity of the Malaysian SAS-2 using confirmatory factor analysis in a sample of 539 young athletes. Confirmatory factor analysis results provided strong support for the SAS-2 factor structure. Path loadings exceeding 0.5 indicated convergent validity among the subscales, and low to moderate subscale intercorrelations provided evidence of discriminant validity. Overall, the results supported the criterion and construct validity of this Malaysian-language SAS-2 instrument.
Reproducibility and validity of a semi-quantitative FFQ for trace elements.
Lee, Yujin; Park, Kyong
2016-09-01
The aim of this study was to test the reproducibility and validity of a self-administered FFQ for the Trace Element Study of Korean Adults in the Yeungnam area (SELEN). Study subjects were recruited from the SELEN cohort selected from rural and urban areas in Yeungnam, Korea. A semi-quantitative FFQ with 146 items was developed considering the dietary characteristics of cohorts in the study area. In a validation study, seventeen men and forty-eight women aged 38-62 years completed 3-d dietary records (DR) and two FFQ over a 3-month period. The validity was examined with the FFQ and DR, and the reproducibility was estimated using partial correlation coefficients, the Bland-Altman method and cross-classification. There were no significant differences between the mean intakes of selected nutrients as estimated from FFQ1, FFQ2 and DR. The median correlation coefficients for all nutrients were 0·47 and 0·56 in the reproducibility and validity tests, respectively. Bland-Altman's index and cross-classification showed acceptable agreement between FFQ1 and FFQ2 and between FFQ2 and DR. Ultimately, 78 % of the subjects were classified into the same and adjacent quartiles for most nutrients. In addition, the weighted κ value indicated that the two methods agreed fairly. In conclusion, this newly developed FFQ was a suitable dietary assessment method for the SELEN cohort study.
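Agreement between two dietary assessment methods is often summarised with Bland-Altman limits of agreement: the mean difference (bias) plus or minus 1.96 standard deviations of the paired differences. The sketch below shows that common form only; the abstract does not state exactly which Bland-Altman quantities were computed, and the function name and intake values are illustrative assumptions.

```python
from statistics import mean, stdev


def bland_altman_limits(method_a, method_b):
    """Return (bias, lower limit, upper limit) for paired measurements.

    Hypothetical helper: limits are bias +/- 1.96 * SD of the differences.
    """
    diffs = [a - b for a, b in zip(method_a, method_b)]
    bias = mean(diffs)
    sd = stdev(diffs)
    return bias, bias - 1.96 * sd, bias + 1.96 * sd


# Toy paired intakes (assumed values) from an FFQ and a dietary record.
ffq = [10, 12, 14, 16]
record = [9, 12, 13, 18]
bias, lower, upper = bland_altman_limits(ffq, record)
```

A bias near zero with narrow limits suggests the two methods can be used interchangeably for the nutrient in question; wide limits suggest they cannot, even when the correlation is high.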
Automatic sleep stage classification using two facial electrodes.
Virkkala, Jussi; Velin, Riitta; Himanen, Sari-Leena; Värri, Alpo; Müller, Kiti; Hasan, Joel
2008-01-01
Standard sleep stage classification is based on visual analysis of central EEG, EOG and EMG signals. Automatic analysis with a reduced number of sensors has been studied as an easy alternative to the standard. In this study, a single-channel electro-oculography (EOG) algorithm was developed for separation of wakefulness, SREM, light sleep (S1, S2) and slow wave sleep (S3, S4). The algorithm was developed and tested with 296 subjects. Additional validation was performed on 16 subjects using a low weight single-channel Alive Monitor. In the validation study, subjects attached the disposable EOG electrodes themselves at home. In separating the four stages total agreement (and Cohen's Kappa) in the training data set was 74% (0.59), in the testing data set 73% (0.59) and in the validation data set 74% (0.59). Self-applicable electro-oculography with only two facial electrodes was found to provide reasonable sleep stage information.
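Total agreement and Cohen's kappa, reported together above, can both be computed from two parallel label sequences: kappa corrects the observed agreement for the agreement expected by chance from the marginal label frequencies. The sketch below is a minimal illustration; the function name, stage labels and epoch sequences are assumptions, not data from the study.

```python
from collections import Counter


def agreement_and_kappa(reference, predicted):
    """Return (observed agreement, Cohen's kappa) for two label sequences."""
    n = len(reference)
    observed = sum(r == p for r, p in zip(reference, predicted)) / n
    ref_counts = Counter(reference)
    pred_counts = Counter(predicted)
    # Chance agreement: product of the marginal proportions, summed over labels.
    expected = sum(ref_counts[label] * pred_counts[label] for label in ref_counts) / (n * n)
    return observed, (observed - expected) / (1 - expected)


# Toy epochs (assumed): W = wake, R = REM, L = light sleep, S = slow wave sleep.
visual = ["W", "W", "R", "L", "L", "S", "S", "S"]
automatic = ["W", "L", "R", "L", "L", "S", "S", "L"]
po, kappa = agreement_and_kappa(visual, automatic)
```

Because kappa discounts chance agreement, it is always lower than the raw agreement whenever chance agreement is above zero, which is why an agreement of 74% can correspond to a kappa of 0.59.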
White-Heisel, Regina; Canfield, James P; Young-Hughes, Sadie
Perceiving imminent safe patient handling and movement (SPH&M) dangers may reduce musculoskeletal (MSK) injuries for nurses in the workplace. The purpose of this study is to develop and validate the 17-item Safe Patient Handling Perception Scale (SPHPS) as an evaluation instrument assessing perceptual risk of MSK injury based on SPH&M knowledge, practice, and resource accessibility in the workplace. Data were collected from a convenience sample (N = 117) of nursing employees at a Veterans Affairs Medical Center. Factor analysis identified three factors: knowledge, practice, and accessibility. The SPHPS demonstrated high levels of reliability, supported by acceptable alpha scores (SPHM knowledge [α = .866], SPHM practices [α = .901], and access to SPHM resources [α = .855]), in addition to relatively low standard error of measurement (SEM) scores. The study outcomes suggest that the SPHPS is a valid and reliable tool that can measure participants' perceived risk factors for MSK injuries.
Aloba, Olutayo; Olabisi, Oluseyi; Aloba, Tolulope
2016-01-01
The 10-item Connor-Davidson Resilience Scale (CD-RISC) has demonstrated satisfactory psychometric properties as a measure of resilience in all the previous studies conducted in developed countries. The objective of this study was to explore the psychometric characteristics of the 10-item CD-RISC among student nurses in southwestern Nigeria. This descriptive cross-sectional study involved a total of 449 student nurses who completed the 10-item CD-RISC in addition to measures of self-esteem, depression, religiosity, and psychological distress. The scale demonstrated adequate reliability (Cronbach's α = .81) and satisfactory validity, with significant correlations with the measures of self-esteem, depression, religiosity, and psychological distress. Factor analyses revealed that resilience was best explained by a two-factor construct. The scale is a valid measure of resilience among Nigerian student nurses. © The Author(s) 2016.
Tsai, Alexander C.; Scott, Jennifer A.; Hung, Kristin J.; Zhu, Jennifer Q.; Matthews, Lynn T.; Psaros, Christina; Tomlinson, Mark
2013-01-01
Background: A major barrier to improving perinatal mental health in Africa is the lack of locally validated tools for identifying probable cases of perinatal depression or for measuring changes in depression symptom severity. We systematically reviewed the evidence on the reliability and validity of instruments to assess perinatal depression in African settings. Methods and Findings: Of 1,027 records identified through searching 7 electronic databases, we reviewed 126 full-text reports. We included 25 unique studies, which were disseminated in 26 journal articles and 1 doctoral dissertation. These enrolled 12,544 women living in nine different North and sub-Saharan African countries. Only three studies (12%) used instruments developed specifically for use in a given cultural setting. Most studies provided evidence of criterion-related validity (20 [80%]) or reliability (15 [60%]), while fewer studies provided evidence of construct validity, content validity, or internal structure. The Edinburgh postnatal depression scale (EPDS), assessed in 16 studies (64%), was the most frequently used instrument in our sample. Ten studies estimated the internal consistency of the EPDS (median estimated coefficient alpha, 0.84; interquartile range, 0.71-0.87). For the 14 studies that estimated sensitivity and specificity for the EPDS, we constructed 2 x 2 tables for each cut-off score. Using a bivariate random-effects model, we estimated a pooled sensitivity of 0.94 (95% confidence interval [CI], 0.68-0.99) and a pooled specificity of 0.77 (95% CI, 0.59-0.88) at a cut-off score of ≥9, with higher cut-off scores yielding greater specificity at the cost of lower sensitivity. Conclusions: The EPDS can reliably and validly measure perinatal depression symptom severity or screen for probable postnatal depression in African countries, but more validation studies on other instruments are needed.
In addition, more qualitative research is needed to adequately characterize local understandings of perinatal depression-like syndromes in different African contexts. PMID:24340036
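The 2 x 2 tables mentioned above cross a screening result (score at or above the cut-off) with a reference diagnosis, and sensitivity and specificity follow directly from the four cells. The sketch below illustrates a single study at a single cut-off; it does not implement the bivariate random-effects pooling used in the review, and the function names and scores are illustrative assumptions.

```python
def two_by_two(scores, is_case, cutoff):
    """Count (tp, fp, fn, tn) when a score >= cutoff counts as screen-positive."""
    tp = fp = fn = tn = 0
    for score, case in zip(scores, is_case):
        positive = score >= cutoff
        if positive and case:
            tp += 1
        elif positive and not case:
            fp += 1
        elif case:
            fn += 1
        else:
            tn += 1
    return tp, fp, fn, tn


def sensitivity_specificity(tp, fp, fn, tn):
    """Sensitivity = tp / (tp + fn); specificity = tn / (tn + fp)."""
    return tp / (tp + fn), tn / (tn + fp)


# Toy data (assumed): screening scores and reference-standard case status.
scores = [12, 9, 8, 15, 3, 10, 7, 5]
is_case = [True, True, True, True, False, False, False, False]
at_9 = sensitivity_specificity(*two_by_two(scores, is_case, cutoff=9))
at_11 = sensitivity_specificity(*two_by_two(scores, is_case, cutoff=11))
```

In this toy data, raising the cut-off from 9 to 11 removes the one false positive but misses an additional true case, the same trade-off between specificity and sensitivity that the abstract describes.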
Validation of asthma recording in electronic health records: protocol for a systematic review.
Nissen, Francis; Quint, Jennifer K; Wilkinson, Samantha; Mullerova, Hana; Smeeth, Liam; Douglas, Ian J
2017-05-29
Asthma is a common, heterogeneous disease with significant morbidity and mortality worldwide. It can be difficult to define in epidemiological studies using electronic health records as the diagnosis is based on non-specific respiratory symptoms and spirometry, neither of which is routinely registered. Electronic health records can nonetheless be valuable to study the epidemiology, management, healthcare use and control of asthma. For health databases to be useful sources of information, asthma diagnoses should ideally be validated. The primary objectives are to provide an overview of the methods used to validate asthma diagnoses in electronic health records and summarise the results of the validation studies. EMBASE and MEDLINE will be systematically searched for appropriate search terms. The searches will cover all studies in these databases up to October 2016 with no start date and will yield studies that have validated algorithms or codes for the diagnosis of asthma in electronic health records. At least one test validation measure (sensitivity, specificity, positive predictive value, negative predictive value or other) is necessary for inclusion. In addition, we require the validated algorithms to be compared with an external gold standard, such as a manual review, a questionnaire or an independent second database. We will summarise key data including author, year of publication, country, time period, date, data source, population, case characteristics, clinical events, algorithms, gold standard and validation statistics in a uniform table. This study is a synthesis of previously published studies and, therefore, no ethical approval is required. The results will be submitted to a peer-reviewed journal for publication. Results from this systematic review can be used to study outcome research on asthma and can be used to identify case definitions for asthma. CRD42016041798.
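The four validation measures named in the protocol above all derive from a 2x2 cross-tabulation of the algorithm against the reference standard. The following is a minimal sketch of that arithmetic, not code from the review; the class name, function name, and counts are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ConfusionCounts:
    """Cross-tabulation of an EHR algorithm against a reference standard."""
    tp: int  # algorithm positive, reference positive
    fp: int  # algorithm positive, reference negative
    fn: int  # algorithm negative, reference positive
    tn: int  # algorithm negative, reference negative


def validation_measures(c: ConfusionCounts) -> dict[str, float]:
    """Return sensitivity, specificity, PPV and NPV as proportions.

    Assumes every denominator is non-zero; a real analysis would need
    to handle empty cells explicitly.
    """
    return {
        "sensitivity": c.tp / (c.tp + c.fn),
        "specificity": c.tn / (c.tn + c.fp),
        "ppv": c.tp / (c.tp + c.fp),
        "npv": c.tn / (c.tn + c.fn),
    }


# Hypothetical counts, invented purely for illustration.
example = ConfusionCounts(tp=80, fp=20, fn=10, tn=890)
measures = validation_measures(example)
```

Unlike sensitivity and specificity, PPV and NPV depend on how common asthma is in the validated sample, so values from different databases are not directly comparable.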
Pregnant & Lactating Populations Research - NCS Dietary Assessment Literature Review
Identifying and studying additional biomarkers of energy and nutrient intake will advance validation efforts and lead to a better understanding of the biases and sources of measurement error in dietary assessment instruments in pregnant or lactating populations.
AdViSHE: A Validation-Assessment Tool of Health-Economic Models for Decision Makers and Model Users.
Vemer, P; Corro Ramos, I; van Voorn, G A K; Al, M J; Feenstra, T L
2016-04-01
A trade-off exists between building confidence in health-economic (HE) decision models and the use of scarce resources. We aimed to create a practical tool providing model users with a structured view into the validation status of HE decision models, to address this trade-off. A Delphi panel was organized, and was completed by a workshop during an international conference. The proposed tool was constructed iteratively based on comments from, and the discussion amongst, panellists. During the Delphi process, comments were solicited on the importance and feasibility of possible validation techniques for modellers, their relevance for decision makers, and the overall structure and formulation in the tool. The panel consisted of 47 experts in HE modelling and HE decision making from various professional and international backgrounds. In addition, 50 discussants actively engaged in the discussion at the conference workshop and returned 19 questionnaires with additional comments. The final version consists of 13 items covering all relevant aspects of HE decision models: the conceptual model, the input data, the implemented software program, and the model outcomes. Assessment of the Validation Status of Health-Economic decision models (AdViSHE) is a validation-assessment tool in which model developers report in a systematic way both on validation efforts performed and on their outcomes. Subsequently, model users can establish whether confidence in the model is justified or whether additional validation efforts should be undertaken. In this way, AdViSHE enhances transparency of the validation status of HE models and supports efficient model validation.
Systematic Review of Childhood Sedentary Behavior Questionnaires: What do We Know and What is Next?
Hidding, Lisan M; Altenburg, Teatske M; Mokkink, Lidwine B; Terwee, Caroline B; Chinapaw, Mai J M
2017-04-01
Accurate measurement of child sedentary behavior is necessary for monitoring trends, examining health effects, and evaluating the effectiveness of interventions. We therefore aimed to summarize studies examining the measurement properties of self-report or proxy-report sedentary behavior questionnaires for children and adolescents under the age of 18 years. Additionally, we provided an overview of the characteristics of the evaluated questionnaires. We performed systematic literature searches in the EMBASE, PubMed, and SPORTDiscus electronic databases. Studies had to report on at least one measurement property of a questionnaire assessing sedentary behavior. Questionnaire data were extracted using a standardized checklist, i.e. the Quality Assessment of Physical Activity Questionnaire (QAPAQ) checklist, and the methodological quality of the included studies was rated using a standardized tool, i.e. the COnsensus-based Standards for the selection of health Measurement INstruments (COSMIN) checklist. Forty-six studies on 46 questionnaires met our inclusion criteria, of which 33 examined test-retest reliability, nine examined measurement error, two examined internal consistency, 22 examined construct validity, eight examined content validity, and two examined structural validity. The majority of the included studies were of fair or poor methodological quality. Of the studies with at least a fair methodological quality, six scored positive on test-retest reliability, and two scored positive on construct validity. None of the questionnaires included in this review were considered as both valid and reliable. High-quality studies on the most promising questionnaires are required, with more attention to the content validity of the questionnaires. PROSPERO registration number: CRD42016035963.
Pressure ulcer prevention algorithm content validation: a mixed-methods, quantitative study.
van Rijswijk, Lia; Beitz, Janice M
2015-04-01
Translating pressure ulcer prevention (PUP) evidence-based recommendations into practice remains challenging for a variety of reasons, including the perceived quality, validity, and usability of the research or the guideline itself. Following the development and face validation testing of an evidence-based PUP algorithm, additional stakeholder input and testing were needed. Using convenience sampling methods, wound care experts attending a national wound care conference and a regional wound ostomy continence nursing (WOCN) conference and/or graduates of a WOCN program were invited to participate in an Internal Review Board-approved, mixed-methods quantitative survey with qualitative components to examine algorithm content validity. After participants provided written informed consent, demographic variables were collected and participants were asked to comment on and rate the relevance and appropriateness of each of the 26 algorithm decision points/steps using standard content validation study procedures. All responses were anonymous. Descriptive summary statistics, mean relevance/appropriateness scores, and the content validity index (CVI) were calculated. Qualitative comments were transcribed and thematically analyzed. Of the 553 wound care experts invited, 79 (average age 52.9 years, SD 10.1; range 23-73) consented to participate and completed the study (a response rate of 14%). Most (67, 85%) were female, registered (49, 62%) or advanced practice (12, 15%) nurses, and had > 10 years of health care experience (88, 92%). Other health disciplines included medical doctors, physical therapists, nurse practitioners, and certified nurse specialists. Almost all had received formal wound care education (75, 95%). On a Likert-type scale of 1 (not relevant/appropriate) to 4 (very relevant and appropriate), the average score for the entire algorithm/all decision points (N = 1,912) was 3.72 with an overall CVI of 0.94 (out of 1). 
The only decision point/step recommendation with a CVI of ≤ 0.70 was the recommendation to provide medical-grade sheepskin for patients at high risk for friction/shear. Many positive and substantive suggestions for minor modifications including color, flow, and algorithm orientation were received. The high overall and individual item rating scores and CVI further support the validity and appropriateness of the PUP algorithm with the addition of the minor modifications. The generic recommendations facilitate individualization, and future research should focus on construct validation testing.
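The abstract does not spell out how the content validity index was computed. A common convention, assumed here, is to take the item-level CVI as the proportion of raters scoring an item 3 or 4 on the 4-point scale, and to average the item-level values for a scale-level figure. The sketch below follows that assumption; the function names and ratings are hypothetical and are not the study's data.

```python
def item_cvi(ratings: list[int]) -> float:
    """Proportion of raters scoring the item 3 or 4 on a 1-4 scale."""
    relevant = sum(1 for r in ratings if r >= 3)
    return relevant / len(ratings)


def scale_cvi_average(items: dict[str, list[int]]) -> float:
    """Mean of the item-level CVIs across all decision points."""
    return sum(item_cvi(r) for r in items.values()) / len(items)


# Hypothetical ratings from five raters for two made-up decision points.
ratings = {
    "step_a": [4, 4, 3, 4, 3],
    "step_b": [4, 2, 3, 1, 4],
}
cvi_a = item_cvi(ratings["step_a"])   # all five raters chose 3 or 4
cvi_b = item_cvi(ratings["step_b"])   # three of five raters chose 3 or 4
overall = scale_cvi_average(ratings)
```

Under this convention, an item such as the hypothetical `step_b` would fall below the 0.70 cut-off mentioned in the abstract and be flagged for revision.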
49 CFR 1522.115 - Renewal of TSA approval.
Code of Federal Regulations, 2010 CFR
2010-10-01
... operate as a validation firm. If the validation firm submits the information in the month before or after it is due, the validation firm is considered to have submitted the information in the month it is due... application. (b) Content. In addition to any other information required by TSA, the validation firm must...
Validating the Riverside Acculturation Stress Inventory with Asian Americans.
Miller, Matthew J; Kim, Jungeun; Benet-Martínez, Verónica
2011-06-01
An emerging body of empirical research highlights the impact of acculturative stress in the lives of culturally diverse populations. Therefore, to facilitate future research in this area, we conducted 3 studies to examine the psychometric properties of the Riverside Acculturation Stress Inventory (RASI; Benet-Martínez & Haritatos, 2005) and its 5 subscales in a total sample of 793 self-identified Asian American participants. The reliability and validity of RASI scores and the hypothesized 1-factor higher order model (with 1st-order factors Language Skills, Work Challenges, Intercultural Relations, Discrimination, and Cultural Isolation) of the RASI were examined in Study 1. The RASI higher order structure and score validity and reliability were examined across different generational groups in Study 2. The stability of RASI scores over a 3-week period was examined in Study 3. Overall, findings from these studies support the hypothesized structure of the RASI and indicate that this brief instrument provides reliable and valid acculturative stress scores. In addition, results suggest that RASI items are interpreted in an equivalent manner across different generations of Asian American individuals. Implications for research and assessment are discussed. 2011 APA, all rights reserved
Trams, trains, planes and automobiles: logistics of conducting a statewide audit of medical records.
Flood, Margaret; Pollock, Wendy; McDonald, Susan; Davey, Mary-Ann
2016-10-01
This paper reports on the logistics of conducting a validation study of a routinely collected dataset against medical records at hospitals to inform planning of similar studies. A stratified random sample of 15 hospitals and two homebirth practitioners was included. Site visits were arranged following consent. In addition to the validation of perinatal data, information was collected regarding logistics. Records at 14 metropolitan and rural hospitals up to 500 km from the research centre, and two homebirth practitioners, were audited. Obtaining consent to participate took between 5 days and 10 months. Auditors visited sites on 101 days, auditing 737 medical record pairs at 16 sites. Median audit time per record was 51.3 minutes; electronic records each took 36 minutes longer than paper. Travel time accounted for nearly one-quarter of audit time. Delays obtaining consents, long travel times and electronic records prolonged audit duration and expense. Employment of experts maximised use of available audit time. Conducting a validation study is a time-consuming and expensive exercise; however, confidence in the accuracy of public health data is vital. Validation studies are unquestionably important. Three alternative strategies have been proposed to make future studies viable. © 2016 Public Health Association of Australia.
Jansen, Marleen E; Rigter, T; Rodenburg, W; Fleur, T M C; Houwink, E J F; Weda, M; Cornel, Martina C
2017-01-01
Advances from pharmacogenetics (PGx) have not been implemented into health care to the expected extent. One gap addressed in this study is a lack of reporting on clinical validity and clinical utility of PGx-tests. A systematic review of current reporting in scientific literature was conducted on publications addressing PGx in the context of statins and muscle toxicity. Eighty-nine publications were included and information was selected on reported measures of effect, arguments, and accompanying conclusions. Most authors report associations to quantify the relationship between a genetic variation and an outcome, such as adverse drug responses. Conclusions on the implementation of a PGx-test are generally based on these associations, without explicit mention of other measures relevant to evaluate the test's clinical validity and clinical utility. To gain insight into the clinical impact and select useful tests, additional outcomes are needed to estimate the clinical validity and utility, such as cost-effectiveness.
Plappert-Helbig, Ulla; Junker-Walker, Ursula; Martus, Hans-Joerg
2015-07-01
As a part of the Japanese Center for the Validation of Alternative Methods (JaCVAM)-initiative international validation study of the in vivo rat alkaline comet assay (comet assay), we examined methyl methanesulfonate (MMS), 2,6-diaminotoluene, and 5-fluorouracil under coded test conditions. Rats were treated orally with the maximum tolerated dose (MTD) and two additional descending doses of the respective compounds. In the MMS-treated groups, liver and stomach showed significantly elevated DNA damage at each dose level and a significant dose-response relationship. 2,6-Diaminotoluene induced significantly elevated DNA damage in the liver at each dose and a statistically significant dose-response relationship, whereas no DNA damage was observed in the stomach. 5-Fluorouracil did not induce DNA damage in either liver or stomach. Copyright © 2015 Elsevier B.V. All rights reserved.
Development and validation of a Response Bias Scale (RBS) for the MMPI-2.
Gervais, Roger O; Ben-Porath, Yossef S; Wygant, Dustin B; Green, Paul
2007-06-01
This study describes the development of a Minnesota Multiphasic Personality Inventory (MMPI-2) scale designed to detect negative response bias in forensic neuropsychological or disability assessment settings. The Response Bias Scale (RBS) consists of 28 MMPI-2 items that discriminated between persons who passed or failed the Word Memory Test (WMT), Computerized Assessment of Response Bias (CARB), and/or Test of Memory Malingering (TOMM) in a sample of 1,212 nonhead-injury disability claimants. Incremental validity of the RBS was evaluated by comparing its ability to detect poor performance on four separate symptom validity tests with that of the F and F(P) scales and the Fake Bad Scale (FBS). The RBS consistently outperformed F, F(P), and FBS. Study results suggest that the RBS may be a useful addition to existing MMPI-2 validity scales and indices in detecting symptom complaints predominantly associated with cognitive response bias and overreporting in forensic neuropsychological and disability assessment settings.
Koritar, Priscila; Philippi, Sonia Tucunduva; Alvarenga, Marle dos Santos; Santos, Bernardo dos
2014-08-01
The scope of this study was to show the cross-cultural adaptation and validation of the Health and Taste Attitude Scale in Portuguese. The methodology included translation of the scale; evaluation of conceptual, operational and item-based equivalence by 14 experts and 51 female undergraduates; semantic equivalence and measurement assessment by 12 bilingual women by the paired t-test, the Pearson correlation coefficient and the intraclass correlation coefficient; internal consistency and test-retest reliability by Cronbach's alpha and intraclass correlation coefficient, respectively, after administration to 216 female undergraduates; assessment of discriminant and concurrent validity via the t-test and Spearman's correlation coefficient, respectively, in addition to Confirmatory Factor and Exploratory Factor Analysis. The scale was considered adequate and easily understood by the experts and university students and presented good internal consistency and reliability (α 0.86, ICC 0.84). The results show that the scale is valid and can be used in studies with women to better understand attitudes related to taste.
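Internal consistency in the study above is summarised with Cronbach's alpha. As a rough illustration of what that statistic measures, the sketch below applies the standard formula, alpha = k/(k-1) x (1 - sum of item variances / variance of total scores), to a small response matrix. The function name and the responses are hypothetical; they are not data from the study.

```python
from statistics import variance


def cronbach_alpha(scores: list[list[float]]) -> float:
    """Cronbach's alpha for scores[i][j] = respondent i's score on item j.

    Uses sample variances (n - 1 denominator) for items and totals.
    """
    k = len(scores[0])  # number of items
    item_vars = [variance([row[j] for row in scores]) for j in range(k)]
    total_var = variance([sum(row) for row in scores])
    return (k / (k - 1)) * (1 - sum(item_vars) / total_var)


# Hypothetical responses: four respondents, three items.
responses = [
    [2, 3, 3],
    [4, 4, 5],
    [1, 2, 2],
    [5, 5, 4],
]
alpha = cronbach_alpha(responses)
```

Alpha rises when items covary strongly relative to their individual variances, which is why it is read as evidence that the items measure a common construct.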
The Outcome and Assessment Information Set (OASIS): A Review of Validity and Reliability
O’Connor, Melissa; Davitt, Joan K.
2015-01-01
The Outcome and Assessment Information Set (OASIS) is the patient-specific, standardized assessment used in Medicare home health care to plan care, determine reimbursement, and measure quality. Since its inception in 1999, there has been debate over the reliability and validity of the OASIS as a research tool and outcome measure. A systematic literature review of English-language articles identified 12 studies published in the last 10 years examining the validity and reliability of the OASIS. Empirical findings indicate the validity and reliability of the OASIS range from low to moderate but vary depending on the item studied. Limitations in the existing research include: nonrepresentative samples; inconsistencies in methods used, items tested, measurement, and statistical procedures; and the changes to the OASIS itself over time. The inconsistencies suggest that these results are tentative at best; additional research is needed to confirm the value of the OASIS for measuring patient outcomes, research, and quality improvement. PMID:23216513
External validity of post-stroke interventional gait rehabilitation studies.
Kafri, Michal; Dickstein, Ruth
2017-01-01
Gait rehabilitation is a major component of stroke rehabilitation, and is supported by extensive research. The objective of this review was to examine the external validity of intervention studies aimed at improving gait in individuals post-stroke. To that end, two aspects of these studies were assessed: subjects' exclusion criteria and the ecological validity of the intervention, as manifested by the intervention's technological complexity and delivery setting. Additionally, we examined whether the target population as inferred from the titles/abstracts is broader than the population actually represented by the reported samples. We systematically searched PubMed for intervention studies to improve gait post-stroke, working backwards from the beginning of 2014. Exclusion criteria, the technological complexity of the intervention (defined as either elaborate or simple), setting, and description of the target population in the titles/abstracts were recorded. Fifty-two studies were reviewed. The samples were exclusive, with recurrent stroke, co-morbidities, cognitive status, walking level, and residency being major reasons for exclusion. In one half of the studies, the intervention was elaborate. Descriptions of participants in the title/abstract in almost one half of the studies included only the diagnosis (stroke or comparable terms) and its stage (acute, subacute, and chronic). The external validity of a substantial number of intervention studies about rehabilitation of gait post-stroke appears to be limited by exclusivity of the samples as well as by deficiencies in ecological validity of the interventions. These limitations are not accurately reflected in the titles or abstracts of the studies.
Sydora, Beate C; Fast, Hilary; Campbell, Sandy; Yuksel, Nese; Lewis, Jacqueline E; Ross, Sue
2016-09-01
The Menopause-Specific Quality of Life (MENQOL) questionnaire was developed as a validated research tool to measure condition-specific QOL in early postmenopausal women. We conducted a comprehensive scoping review to explore the extent of MENQOL's use in research and clinical practice to assess its value in providing effective, adequate, and comparable participant assessment information. Thirteen biomedical and clinical databases were systematically searched with "menqol" as a search term to find articles using MENQOL or its validated derivative MENQOL-Intervention as investigative or clinical tools from 1996 to November 2014 inclusive. Review articles, conference abstracts, proceedings, dissertations, and incomplete trials were excluded. Additional articles were collected from references within key articles. Three independent reviewers extracted data reflecting study design, intervention, sample characteristics, MENQOL questionnaire version, modifications and language, recall period, and analysis detail. Data analyses included categorization and descriptive statistics. The review included 220 eligible papers of various study designs, covering 39 countries worldwide and using MENQOL translated into more than 25 languages. A variety of modifications to the original questionnaire were identified, including omission or addition of items and alterations to the validated methodological analysis. No papers were found that described MENQOL's use in clinical practice. Our study found an extensive and steadily increasing use of MENQOL in clinical and epidemiological research over 18 years postpublication. Our results stress the importance of proper reporting and validation of translations and variations to ensure outcome comparison and transparency of MENQOL's use. The value of MENQOL in clinical practice remains unknown.
Chaabene, Helmi; Negra, Yassine; Bouguezzi, Raja; Capranica, Laura; Franchini, Emerson; Prieske, Olaf; Hbacha, Hamdi; Granacher, Urs
2018-01-01
The regular monitoring of physical fitness and sport-specific performance is important in elite sports to increase the likelihood of success in competition. This study aimed to systematically review and to critically appraise the methodological quality, validation data, and feasibility of the sport-specific performance assessment in Olympic combat sports like amateur boxing, fencing, judo, karate, taekwondo, and wrestling. A systematic search was conducted in the electronic databases PubMed, Google Scholar, and ScienceDirect up to October 2017. Studies in combat sports were included that reported validation data (e.g., reliability, validity, sensitivity) of sport-specific tests. Overall, 39 studies were eligible for inclusion in this review. The majority of studies (74%) contained sample sizes <30 subjects. Nearly one-third of the reviewed studies lacked a sufficient description (e.g., anthropometrics, age, expertise level) of the included participants. Seventy-two percent of studies did not sufficiently report inclusion/exclusion criteria of their participants. In 62% of the included studies, the description and/or inclusion of a familiarization session(s) was either incomplete or nonexistent. Sixty percent of studies did not report any details about the stability of testing conditions. Approximately half of the studies examined reliability measures of the included sport-specific tests (intraclass correlation coefficient [ICC] = 0.43-1.00). Content validity was addressed in all included studies, criterion validity (only the concurrent aspect of it) in approximately half of the studies with correlation coefficients ranging from r = -0.41 to 0.90. Construct validity was reported in 31% of the included studies and predictive validity in only one. Test sensitivity was addressed in 13% of the included studies. The majority of studies (64%) ignored and/or provided incomplete information on test feasibility and methodological limitations of the sport-specific test.
In 28% of the included studies, insufficient information or a complete lack of information was provided in the respective field of the test application. Several methodological gaps exist in studies that used sport-specific performance tests in Olympic combat sports. Additional research should adopt more rigorous validation procedures in the application and description of sport-specific performance tests in Olympic combat sports.
Updating the OMERACT Filter: Implications for imaging and soluble biomarkers
D’Agostino, Maria-Antonietta; Boers, Maarten; Kirwan, John; van der Heijde, Desirée; Østergaard, Mikkel; Schett, Georg; Landewé, Robert B.M.; Maksymowych, Walter P.; Naredo, Esperanza; Dougados, Maxime; Iagnocco, Annamaria; Bingham, Clifton O.; Brooks, Peter; Beaton, Dorcas; Gandjbakhch, Frederique; Gossec, Laure; Guillemin, Francis; Hewlett, Sarah; Kloppenburg, Margreet; March, Lyn; Mease, Philip J; Moller, Ingrid; Simon, Lee S; Singh, Jasvinder A; Strand, Vibeke; Wakefield, Richard J; Wells, George; Tugwell, Peter; Conaghan, Philip G
2014-01-01
Objective: The OMERACT Filter provides a framework for the validation of outcome measures for use in rheumatology clinical research. However, imaging and biochemical measures may face additional validation challenges due to their technical nature. The Imaging and Soluble Biomarker Session at OMERACT 11 aimed to provide a guide for the iterative development of an imaging or biochemical measurement instrument so it can be used in therapeutic assessment. Methods: A hierarchical structure was proposed, reflecting 3 dimensions needed for validating an imaging or biochemical measurement instrument: outcome domain(s), study setting and performance of the instrument. Movement along the axes in any dimension reflects increasing validation. For a given test instrument, the 3-axis structure assesses the extent to which the instrument is a validated measure for the chosen domain, whether it assesses a patient-centred or disease-centred variable, and whether its technical performance is adequate in the context of its application. Some currently used imaging and soluble biomarkers for rheumatoid arthritis, spondyloarthritis and knee osteoarthritis were then evaluated using the original OMERACT filter and the newly proposed structure. Break-out groups critically reviewed the extent to which the candidate biomarkers complied with the proposed step-wise approach, as a way of examining the utility of the proposed 3-dimensional structure. Results: Although there was a broad acceptance of the value of the proposed structure in general, some areas for improvement were suggested including clarification of criteria for achieving a certain level of validation and how to deal with extension of the structure to areas beyond clinical trials. Conclusion: General support was obtained for a proposed tri-axis structure to assess validation of imaging and soluble biomarkers; nevertheless, additional work is required to better evaluate its place within the OMERACT Filter 2.0. PMID:24584916
Updating the OMERACT filter: implications for imaging and soluble biomarkers.
D'Agostino, Maria-Antonietta; Boers, Maarten; Kirwan, John; van der Heijde, Désirée; Østergaard, Mikkel; Schett, Georg; Landewé, Robert B; Maksymowych, Walter P; Naredo, Esperanza; Dougados, Maxime; Iagnocco, Annamaria; Bingham, Clifton O; Brooks, Peter M; Beaton, Dorcas E; Gandjbakhch, Frederique; Gossec, Laure; Guillemin, Francis; Hewlett, Sarah E; Kloppenburg, Margreet; March, Lyn; Mease, Philip J; Moller, Ingrid; Simon, Lee S; Singh, Jasvinder A; Strand, Vibeke; Wakefield, Richard J; Wells, George A; Tugwell, Peter; Conaghan, Philip G
2014-05-01
The Outcome Measures in Rheumatology (OMERACT) Filter provides a framework for the validation of outcome measures for use in rheumatology clinical research. However, imaging and biochemical measures may face additional validation challenges because of their technical nature. The Imaging and Soluble Biomarker Session at OMERACT 11 aimed to provide a guide for the iterative development of an imaging or biochemical measurement instrument so it can be used in therapeutic assessment. A hierarchical structure was proposed, reflecting 3 dimensions needed for validating an imaging or biochemical measurement instrument: outcome domain(s), study setting, and performance of the instrument. Movement along the axes in any dimension reflects increasing validation. For a given test instrument, the 3-axis structure assesses the extent to which the instrument is a validated measure for the chosen domain, whether it assesses a patient-centered or disease-centered variable, and whether its technical performance is adequate in the context of its application. Some currently used imaging and soluble biomarkers for rheumatoid arthritis, spondyloarthritis, and knee osteoarthritis were then evaluated using the original OMERACT Filter and the newly proposed structure. Breakout groups critically reviewed the extent to which the candidate biomarkers complied with the proposed stepwise approach, as a way of examining the utility of the proposed 3-dimensional structure. Although there was a broad acceptance of the value of the proposed structure in general, some areas for improvement were suggested including clarification of criteria for achieving a certain level of validation and how to deal with extension of the structure to areas beyond clinical trials. General support was obtained for a proposed tri-axis structure to assess validation of imaging and soluble biomarkers; nevertheless, additional work is required to better evaluate its place within the OMERACT Filter 2.0.
Vincent, Ursula; Serano, Federica; von Holst, Christoph
2017-08-01
Carotenoids are used in animal nutrition mainly as sensory additives that favourably affect the colour of fish, birds and food of animal origin. Various analytical methods exist for their quantification in compound feed, reflecting the different physico-chemical characteristics of the carotenoid and the corresponding feed additives. They may be natural products or specific formulations containing the target carotenoids produced by chemical synthesis. In this study a multi-analyte method was developed that can be applied to the determination of all 10 carotenoids currently authorised within the European Union for compound feedingstuffs. The method functions regardless of whether the carotenoids have been added to the compound feed via natural products or specific formulations. It comprises three steps: (1) digestion of the feed sample with an enzyme; (2) pressurised liquid extraction; and (3) quantification of the analytes by reversed-phase HPLC coupled to a photodiode array detector in the visible range. The method was single-laboratory validated for poultry and fish feed covering a mass fraction range of the target analyte from 2.5 to 300 mg kg⁻¹. The following method performance characteristics were obtained: the recovery rate varied from 82% to 129% and precision expressed as the relative standard deviation of intermediate precision varied from 1.6% to 15%. Based on the acceptable performance obtained in the validation study, the multi-analyte method is considered fit for the intended purpose.
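The two performance characteristics reported above, recovery rate and relative standard deviation (RSD), can be illustrated with a simplified calculation. This sketch assumes a single set of replicate measurements at one spiked level; intermediate precision in a real single-laboratory validation is normally estimated across days or analysts, which is not shown here. The function names and values are hypothetical.

```python
from statistics import mean, stdev


def recovery_percent(measured: list[float], spiked: float) -> float:
    """Mean measured mass fraction as a percentage of the spiked amount."""
    return 100.0 * mean(measured) / spiked


def rsd_percent(measured: list[float]) -> float:
    """Relative standard deviation: sample SD as a percentage of the mean."""
    return 100.0 * stdev(measured) / mean(measured)


# Hypothetical replicate results (mg/kg) for a feed spiked at 10 mg/kg.
replicates = [9.0, 10.0, 11.0]
rec = recovery_percent(replicates, 10.0)
rsd = rsd_percent(replicates)
```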
Patterson, Fiona; Lopes, Safiatu; Harding, Stephen; Vaux, Emma; Berkin, Liz; Black, David
2017-02-01
The aim of this study was to follow up a sample of physicians who began core medical training (CMT) in 2009. This paper examines the long-term validity of CMT and GP selection methods in predicting performance in the Membership of Royal College of Physicians (MRCP(UK)) examinations. We performed a longitudinal study, examining the extent to which the GP and CMT selection methods (T1) predict performance in the MRCP(UK) examinations (T2). A total of 2,569 applicants from 2008-09 who completed CMT and GP selection methods were included in the study. Looking at MRCP(UK) part 1, part 2 written and PACES scores, both CMT and GP selection methods show evidence of predictive validity for the outcome variables, and hierarchical regressions show the GP methods add significant value to the CMT selection process. CMT selection methods predict performance in important outcomes and have good evidence of validity; the GP methods may have an additional role alongside the CMT selection methods. © Royal College of Physicians 2017. All rights reserved.
Yen, Po-Yin; Sousa, Karen H; Bakken, Suzanne
2014-01-01
Background: In a previous study, we developed the Health Information Technology Usability Evaluation Scale (Health-ITUES), which is designed to support customization at the item level. Such customization matches the specific tasks/expectations of a health IT system while retaining comparability at the construct level, and provides evidence of its factorial validity and internal consistency reliability through exploratory factor analysis. Objective: In this study, we advanced the development of Health-ITUES to examine its construct validity and predictive validity. Methods: The health IT system studied was a web-based communication system that supported nurse staffing and scheduling. Using Health-ITUES, we conducted a cross-sectional study to evaluate users’ perception toward the web-based communication system after system implementation. We examined Health-ITUES's construct validity through first and second order confirmatory factor analysis (CFA), and its predictive validity via structural equation modeling (SEM). Results: The sample comprised 541 staff nurses in two healthcare organizations. The CFA (n=165) showed that a general usability factor accounted for 78.1%, 93.4%, 51.0%, and 39.9% of the explained variance in ‘Quality of Work Life’, ‘Perceived Usefulness’, ‘Perceived Ease of Use’, and ‘User Control’, respectively. The SEM (n=541) supported the predictive validity of Health-ITUES, explaining 64% of the variance in intention for system use. Conclusions: The results of CFA and SEM provide additional evidence for the construct and predictive validity of Health-ITUES. The customizability of Health-ITUES has the potential to support comparisons at the construct level, while allowing variation at the item level. We also illustrate application of Health-ITUES across stages of system development. PMID:24567081
Implementation and application of an interactive user-friendly validation software for RADIANCE
NASA Astrophysics Data System (ADS)
Sundaram, Anand; Boonn, William W.; Kim, Woojin; Cook, Tessa S.
2012-02-01
RADIANCE extracts CT dose parameters from dose sheets using optical character recognition and stores the data in a relational database. To facilitate validation of RADIANCE's performance, a simple user interface was initially implemented and about 300 records were evaluated. Here, we extend this interface to achieve a wider variety of functions and perform a larger-scale validation. The validator uses some data from the RADIANCE database to prepopulate quality-testing fields, such as correspondence between calculated and reported total dose-length product. The interface also displays relevant parameters from the DICOM headers. A total of 5,098 dose sheets were used to test the performance accuracy of RADIANCE in dose data extraction. Several search criteria were implemented. All records were searchable by accession number, study date, or dose parameters beyond chosen thresholds. Validated records were searchable according to additional criteria from validation inputs. An error rate of 0.303% was demonstrated in the validation. Dose monitoring is increasingly important and RADIANCE provides an open-source solution with a high level of accuracy. The RADIANCE validator has been updated to enable users to test the integrity of their installation and verify that their dose monitoring is accurate and effective.
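The quality check mentioned above, correspondence between calculated and reported total dose-length product (DLP), might be sketched as follows. This is only an illustration: the record layout, the field names, and the 1% tolerance are assumptions, not RADIANCE's actual schema or code.

```python
import math
from dataclasses import dataclass


@dataclass
class DoseSheetRecord:
    """Hypothetical record layout; not the actual RADIANCE schema."""
    accession: str
    series_dlp: list[float]      # per-series DLP values, mGy*cm
    reported_total_dlp: float    # total DLP as printed on the dose sheet


def total_dlp_matches(record: DoseSheetRecord, rel_tol: float = 0.01) -> bool:
    """Return True if the summed per-series DLP agrees with the reported total.

    The 1% relative tolerance is an assumed value chosen for illustration.
    """
    calculated = sum(record.series_dlp)
    return math.isclose(calculated, record.reported_total_dlp, rel_tol=rel_tol)


def error_rate_percent(n_errors: int, n_records: int) -> float:
    """Share of records with an extraction error, expressed in percent."""
    if n_records <= 0:
        raise ValueError("n_records must be positive")
    return 100.0 * n_errors / n_records
```

A record that fails `total_dlp_matches` would be a natural candidate for manual review in a validator of this kind.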
Nascimento-Ferreira, Marcus Vinícius; De Moraes, Augusto César Ferreira; Toazza-Oliveira, Paulo Vinícius; Forjaz, Claudia L M; Aristizabal, Juan Carlos; Santaliesra-Pasías, Alba M; Lepera, Candela; Nascimento-Junior, Walter Viana; Skapino, Estela; Delgado, Carlos Alberto; Moreno, Luis Alberto; Carvalho, Heráclito Barbosa
2018-03-01
The objective of this article is to test the reliability and validity of a new physical activity (PA) questionnaire. Subsamples from the South American Youth/Child Cardiovascular and Environment Study (SAYCARE) were included to examine its reliability (children: n = 161; adolescents: n = 177) and validity (children: n = 82; adolescents: n = 60). The questionnaire consists of three dimensions of PA (leisure, active commuting, and school) performed during the last week. To assess its validity, the subjects wore accelerometers for at least 3 days and 8 h/d (at least one weekend day). The reliability was analyzed by correlation coefficients. In addition, Bland-Altman analysis and a multilevel regression were applied to estimate the measurement bias, limits of agreement, and influence of contextual variables. In children, the questionnaire showed consistent reliability (ρ = 0.56) and moderate validity (ρ = 0.46); contextual variables explained 43.0% of the variance, with a bias of -22.9 min/d. In adolescents, the reliability was higher (ρ = 0.76) and the validity was almost excellent (ρ = 0.88), with 66.7% of the variance explained by city level and a PA bias of 16.0 min/d. The SAYCARE PA questionnaire shows acceptable (in children) to strong (in adolescents) reliability and strong validity in the measurement of PA in the pediatric population from low- to middle-income countries. © 2018 The Obesity Society.
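The Bland-Altman quantities named in this abstract (measurement bias and limits of agreement) can be illustrated with a short sketch. It uses the conventional definition, bias ± 1.96 standard deviations of the paired differences; the abstract does not state which variant the authors used, and the function and argument names are hypothetical.

```python
from statistics import mean, stdev


def bland_altman(questionnaire, accelerometer):
    """Return (bias, lower limit, upper limit) for paired measurements.

    Both inputs are sequences of minutes per day for the same subjects,
    in the same order. Differences are questionnaire minus accelerometer.
    """
    if len(questionnaire) != len(accelerometer):
        raise ValueError("inputs must be paired and of equal length")
    diffs = [q - a for q, a in zip(questionnaire, accelerometer)]
    bias = mean(diffs)                 # mean difference between methods
    spread = 1.96 * stdev(diffs)       # half-width of the limits of agreement
    return bias, bias - spread, bias + spread
```

A negative bias under this sign convention means the questionnaire reports fewer minutes than the accelerometer on average.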
Güiza, Fabian; Depreitere, Bart; Piper, Ian; Citerio, Giuseppe; Jorens, Philippe G; Maas, Andrew; Schuhmann, Martin U; Lo, Tsz-Yan Milly; Donald, Rob; Jones, Patricia; Maier, Gottlieb; Van den Berghe, Greet; Meyfroidt, Geert
2017-03-01
A model for early detection of episodes of increased intracranial pressure in traumatic brain injury patients has been previously developed and validated based on retrospective adult patient data from the multicenter Brain-IT database. The purpose of the present study is to validate this early detection model in different cohorts of recently treated adult and pediatric traumatic brain injury patients. Prognostic modeling. Noninterventional, observational, retrospective study. The adult validation cohort comprised recent traumatic brain injury patients from San Gerardo Hospital in Monza (n = 50), Leuven University Hospital (n = 26), Antwerp University Hospital (n = 19), Tübingen University Hospital (n = 18), and Southern General Hospital in Glasgow (n = 8). The pediatric validation cohort comprised patients from neurosurgical and intensive care centers in Edinburgh and Newcastle (n = 79). None. The model's performance was evaluated with respect to discrimination, calibration, overall performance, and clinical usefulness. In the recent adult validation cohort, the model retained excellent performance as in the original study. In the pediatric validation cohort, the model retained good discrimination and a positive net benefit, albeit with a performance drop in the remaining criteria. The obtained external validation results confirm the robustness of the model to predict future increased intracranial pressure events 30 minutes in advance, in adult and pediatric traumatic brain injury patients. These results are a large step toward an early warning system for increased intracranial pressure that can be generally applied. Furthermore, the sparseness of this model that uses only two routinely monitored signals as inputs (intracranial pressure and mean arterial blood pressure) is an additional asset.
Further validation of the Internet-based Dementia Risk Assessment.
Brandt, Jason; Blehar, Justin; Anderson, Allan; Gross, Alden L
2014-01-01
Most approaches to the detection of presymptomatic or prodromal Alzheimer's disease require the costly collection and analysis of biological samples or neuroimaging measurements. The Dementia Risk Assessment (DRA) was developed to facilitate this detection by collecting self-report and proxy-report of dementia risk variables and episodic memory performance on a free Internet website. We now report two validation studies. In Study 1, 130 community-residing older adults seeking memory screening at senior health fairs were tested using the Mini-Cog, and were then observed while taking the DRA. They were compared to a demographically-matched subsample from our anonymous Internet sample. Participants seeking memory screening had more dementia risk factors and obtained lower scores on the DRA's recognition memory test (RMT) than their Internet controls. In addition, those who failed the Mini-Cog obtained much lower scores on the RMT than those who passed the Mini-Cog. In Study 2, 160 older adults seeking evaluation of cognitive difficulties took the DRA prior to diagnostic evaluations at outpatient dementia clinics. Patients who ultimately received the diagnosis of a dementia syndrome scored significantly lower on the RMT than those diagnosed with other conditions or deemed normal. Lower education, family history of dementia, presence of hypercholesterolemia and diabetes, and memory test score distinguished the dementia and no-dementia groups with around 82% accuracy. In addition, score on the RMT correlated highly with scores on other instruments widely used to detect cognitive decline. These findings support the concurrent validity of the DRA for detecting prevalent cognitive impairment. Prospective studies of cognitively normal persons who subsequently develop dementia will be necessary to establish its predictive validity.
Llamas-Ramos, Inés; Llamas-Ramos, Rocío; Buz, José; Cortés-Rodríguez, María; Martín-Nogueras, Ana María
2018-06-01
The Memorial Symptom Assessment Scale (MSAS) is a self-rating instrument for the assessment of symptom distress in cancer patients. The Spanish version of the MSAS has recently been validated. However, we lack evidence of the internal construct validity of the shorter versions (short form [MSAS-SF] and condensed form [CMSAS]). In addition, rigorous testing of these scales with modern psychometric methods is needed. The aim of this study was to evaluate the internal construct validity and reliability of the Spanish versions of the MSAS-SF and CMSAS in oncology outpatients using Rasch analysis. Data from a convenience sample of oncology outpatients receiving chemotherapy (n = 306; mean age 60 years; 63% women) at a university hospital were analyzed. The Rasch unidimensional measurement model was used to examine response category functioning, item hierarchy, targeting, unidimensionality, reliability, and differential item functioning by age, gender, and marital status. The response category structure of the symptom distress items was improved by collapsing two categories. The scales were adequately targeted to the study patients, showed overall Rasch model fit (mean Infit MnSq ranged from 0.98 to 1.05), met criteria for unidimensionality, and the reliability of scores was good (person reliability > 0.80), except for the CMSAS prevalence scale. Only four items showed differential item functioning. The present study demonstrated that the Spanish versions of the MSAS-SF and CMSAS have adequate psychometric properties to evaluate symptom distress in oncology outpatients. Additional studies of the CMSAS are recommended. Copyright © 2018 American Academy of Hospice and Palliative Medicine. Published by Elsevier Inc. All rights reserved.
Validation of a Latent Construct for Dementia in a Population-Wide Dataset from Singapore.
Peh, Chao Xu; Abdin, Edimansyah; Vaingankar, Janhavi A; Verma, Swapna; Chua, Boon Yiang; Sagayadevan, Vathsala; Seow, Esmond; Zhang, YunJue; Shahwan, Shazana; Ng, Li Ling; Prince, Martin; Chong, Siow Ann; Subramaniam, Mythily
2017-01-01
The latent variable δ has been proposed as a proxy for dementia. Previous validation studies have been conducted using convenience samples. It is currently unknown how δ performs in population-wide data. To validate δ in Singapore using population-wide epidemiological study data on persons aged 60 and above. δ was constructed using items from the Community Screening Instrument for Dementia (CSI'D) and World Health Organization Disability Assessment Schedule (WHODAS II). Confirmatory factor analysis (CFA) was conducted to examine δ model fit. Convergent validity was examined with the Clinical Dementia Rating scale (CDR) and GMS-AGECAT dementia. Divergent validity was examined with GMS-AGECAT depression. The δ model demonstrated fit to the data, χ2(df) = 249.71(55), p < 0.001, CFI = 0.990, TLI = 0.997, RMSEA = 0.037. Latent variable δ was significantly associated with CDR and GMS-AGECAT dementia (range: β= 0.32 to 0.63), and was not associated with GMS-AGECAT depression. Compared to unadjusted models, δ model fit was poor when adjusted for age, gender, ethnicity, and education. The study found some support for δ as a proxy for dementia in Singapore based on population data. Both convergent and divergent validity were established. In addition, the δ model structure appeared to be influenced by age, gender, ethnicity, and education covariates.
Adjusted linguistic validation and psychometric properties of the Colombian version of KIDSCREEN-52.
Jaimes-Valencia, Mary Luz; Perpiñá-Galvañ, Juana; Cabañero-Martínez, Maria José; Cabrero-García, Julio; Richart-Martínez, Miguel
2018-01-01
In health and clinical studies, health-related quality of life is often assessed using the well-established KIDSCREEN-52 questionnaires as well as the Vécu et Santé Perçue de l'Adolescent (VSP-A). The purpose of this study was twofold: to perform an adjusted linguistic validation of the Colombian version of the KIDSCREEN-52 and to assess its psychometric properties in children and adolescents. A total of 146 children and adolescents completed the KIDSCREEN-52; adolescents (n = 48) additionally completed the VSP-A. Psychometric analyses focused on the internal consistency as well as the convergent and discriminant validity of the KIDSCREEN-52 Colombian version. Syntactic and semantic modifications were made to 19 items in the adapted version of the KIDSCREEN-52. Cronbach's α ranged from .74 to .89 for eight dimensions, while α < .70 was obtained for self-perception and social acceptance. We found evidence of good convergent validity with the VSP-A dimensions. Regarding known-groups validity, children aged between 8 and 10 years, male, with a high socioeconomic level and no chronic health condition obtained higher scores compared to the other categories. The developed Colombian version of the KIDSCREEN-52 showed acceptable reliability and validity. This study provides a cultural adaptation of the Spanish version of the KIDSCREEN-52 for Colombian children and adolescents.
Schneider, Gary; Kachroo, Sumesh; Jones, Natalie; Crean, Sheila; Rotella, Philip; Avetisyan, Ruzan; Reynolds, Matthew W
2012-01-01
The Food and Drug Administration's Mini-Sentinel pilot program initially aims to conduct active surveillance to refine safety signals that emerge for marketed medical products. A key facet of this surveillance is to develop and understand the validity of algorithms for identifying health outcomes of interest from administrative and claims data. This article summarizes the process and findings of the algorithm review of anaphylaxis. PubMed and Iowa Drug Information Service searches were conducted to identify citations applicable to the anaphylaxis health outcome of interest. Level 1 abstract reviews and Level 2 full-text reviews were conducted to find articles using administrative and claims data to identify anaphylaxis and including validation estimates of the coding algorithms. Our search revealed limited literature focusing on anaphylaxis that provided administrative and claims data-based algorithms and validation estimates. Only four studies identified via literature searches provided validated algorithms; however, two additional studies were identified by Mini-Sentinel collaborators and were incorporated. The International Classification of Diseases, Ninth Revision, codes varied, as did the positive predictive value, depending on the cohort characteristics and the specific codes used to identify anaphylaxis. Research needs to be conducted on designing validation studies to test anaphylaxis algorithms and estimating their predictive power, sensitivity, and specificity. Copyright © 2012 John Wiley & Sons, Ltd.
Incremental Validity of the Trait Emotional Intelligence Questionnaire-Short Form (TEIQue-SF).
Siegling, A B; Vesely, Ashley K; Petrides, K V; Saklofske, Donald H
2015-01-01
This study examined the incremental validity of the adult short form of the Trait Emotional Intelligence Questionnaire (TEIQue-SF) in predicting 7 construct-relevant criteria beyond the variance explained by the Five-factor model and coping strategies. Additionally, the relative contributions of the questionnaire's 4 subscales were assessed. Two samples of Canadian university students completed the TEIQue-SF, along with measures of the Big Five, coping strategies (Sample 1 only), and emotion-laden criteria. The TEIQue-SF showed consistent incremental effects beyond the Big Five or the Big Five and coping strategies, predicting all 7 criteria examined across the 2 samples. Furthermore, 2 of the 4 TEIQue-SF subscales accounted for the measure's incremental validity. Although the findings provide good support for the validity and utility of the TEIQue-SF, directions for further research are emphasized.
Gao, Yu; Deng, Jiaxin; Lai, Hongyu; Deng, Qiaowen; Armour, Cherie
2017-01-01
The current study assesses the factor structure and construct validity of the self-reported Inventory of Callous–Unemotional Traits (ICU) in 637 Chinese community adults (mean age = 25.98, SD = 5.79). A series of theoretical models proposed in previous studies were tested through confirmatory factor analyses. Results indicated that a shortened form that consists of 11 items (ICU-11) to assess callousness and uncaring factors has excellent overall fit. Additionally, correlations with a wide range of external variables demonstrated that this shortened form has similar construct validity compared to the original ICU. In conclusion, our findings suggest that the ICU-11 may be a promising self-report tool that could be a good substitute for the original form to assess callous-uncaring traits in adults. PMID:29216240
NASA Technical Reports Server (NTRS)
Lee, Henry C.; Klopfer, Goetz
2011-01-01
This report documents how OVERFLOW, a computational fluid dynamics code, predicts plume impingement of underexpanded axisymmetric jets onto both perpendicular and inclined flat plates. The effects of the plume impinging on a range of plate inclinations varying from 90° to 30° are investigated and compared to the experimental results in References 1 and 2. The flow fields are extremely complex due to the interaction between the shock waves from the free jet and those deflected by the plate. Additionally, complex mixing effects create very intricate structures in the flow. The experimental data is very limited, so these validation studies will focus only on cold plume impingement on flat and inclined plates. This validation study will help quantify the error in the OVERFLOW simulation when applied to stage separation scenarios.
Content Validation and Evaluation of an Endovascular Teamwork Assessment Tool.
Hull, L; Bicknell, C; Patel, K; Vyas, R; Van Herzeele, I; Sevdalis, N; Rudarakanchana, N
2016-07-01
To modify, content validate, and evaluate a teamwork assessment tool for use in endovascular surgery. A multistage, multimethod study was conducted. Stage 1 included expert review and modification of the existing Observational Teamwork Assessment for Surgery (OTAS) tool. Stage 2 included identification of additional exemplar behaviours contributing to effective teamwork and enhanced patient safety in endovascular surgery (using real-time observation, focus groups, and semistructured interviews of multidisciplinary teams). Stage 3 included content validation of exemplar behaviours using expert consensus according to established psychometric recommendations and evaluation of structure, content, feasibility, and usability of the Endovascular Observational Teamwork Assessment Tool (Endo-OTAS) by an expert multidisciplinary panel. Stage 4 included final team expert review of exemplars. OTAS core team behaviours were maintained (communication, coordination, cooperation, leadership, team monitoring). Of the 114 OTAS behavioural exemplars, 19 were modified, four removed, and 39 additional endovascular-specific behaviours identified. Content validation of these 153 exemplar behaviours showed that 113/153 (73.9%) reached the predetermined Item-Content Validity Index rating for teamwork and/or patient safety. After expert team review, 140/153 (91.5%) exemplars were deemed to warrant inclusion in the tool. More than 90% of the expert panel agreed that Endo-OTAS is an appropriate teamwork assessment tool with observable behaviours. Some concerns were noted about the time required to conduct observations and provide performance feedback. Endo-OTAS is a novel teamwork assessment tool, with evidence for content validity and relevance to endovascular teams. Endo-OTAS enables systematic objective assessment of the quality of team performance during endovascular procedures. Copyright © 2016. Published by Elsevier Ltd.
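An Item-Content Validity Index (I-CVI) is conventionally the proportion of experts who rate an item as relevant. The sketch below assumes a 4-point relevance scale on which ratings of 3 or 4 count as relevant, and a cut-off of 0.78. Both are common conventions, not values taken from this abstract, which only mentions a "predetermined" rating.

```python
def item_cvi(ratings, relevant_min=3):
    """Proportion of experts rating the item at or above relevant_min.

    ratings: one integer per expert on an assumed 1-4 relevance scale.
    """
    if not ratings:
        raise ValueError("at least one expert rating is required")
    return sum(r >= relevant_min for r in ratings) / len(ratings)


def items_reaching_cutoff(all_ratings, cutoff=0.78):
    """Count items whose I-CVI meets the (assumed) cut-off."""
    return sum(item_cvi(r) >= cutoff for r in all_ratings)


def percent(part, whole):
    """Percentage rounded to one decimal, as reported in the abstract."""
    return round(100 * part / whole, 1)
```

For the counts reported above, `percent(113, 153)` and `percent(140, 153)` reproduce the stated 73.9% and 91.5%.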
Brunckhorst, Oliver; Shahid, Shahab; Aydin, Abdullatif; McIlhenny, Craig; Khan, Shahid; Raza, Syed Johar; Sahai, Arun; Brewin, James; Bello, Fernando; Kneebone, Roger; Khan, Muhammad Shamim; Dasgupta, Prokar; Ahmed, Kamran
2015-09-01
Current training modalities within ureteroscopy have been extensively validated and must now be integrated within a comprehensive curriculum. Additionally, non-technical skills often cause surgical error and little research has been conducted to combine this with technical skills teaching. This study therefore aimed to develop and validate a curriculum for semi-rigid ureteroscopy, integrating both technical and non-technical skills teaching within the programme. Delphi methodology was utilised for curriculum development and content validation, with a randomised trial then conducted (n = 32) for curriculum evaluation. The developed curriculum consisted of four modules; initially developing basic technical skills and subsequently integrating non-technical skills teaching. Sixteen participants underwent the simulation-based curriculum and were subsequently assessed, together with the control cohort (n = 16) within a full immersion environment. Both technical (Time to completion, OSATS and a task specific checklist) and non-technical (NOTSS) outcome measures were recorded with parametric and non-parametric analyses used depending on the distribution of our data as evaluated by a Shapiro-Wilk test. Improvements within the intervention cohort demonstrated educational value across all technical and non-technical parameters recorded, including time to completion (p < 0.01), OSATS scores (p < 0.001), task specific checklist scores (p = 0.011) and NOTSS scores (p < 0.001). Content validity, feasibility and acceptability were all demonstrated through curriculum development and post-study questionnaire results. The current developed curriculum demonstrates that integrating both technical and non-technical skills teaching is both educationally valuable and feasible. Additionally, the curriculum offers a validated simulation-based training modality within ureteroscopy and a framework for the development of other simulation-based programmes.
Whiteneck, Gale; Gassaway, Julie; Dijkers, Marcel P.; Heinemann, Allen W.; Kreider, Scott E. D.
2012-01-01
Background/objective To examine associations of patient characteristics along with treatment quantity delivered by seven clinical disciplines during inpatient spinal cord injury (SCI) rehabilitation with outcomes at rehabilitation discharge and 1-year post-injury. Methods Six inpatient SCI rehabilitation centers enrolled 1376 patients during the 5-year SCIRehab study. Clinicians delivering standard care documented details of treatment. Outcome data were derived from SCI Model Systems Form I and II and a project-specific interview conducted at approximately 1-year post-injury. Regression modeling was used to predict outcomes; models were cross-validated by examining relative shrinkage of the original model R2 using 75% of the dataset to the R2 for the same outcome using a validation subsample. Results Patient characteristics are strong predictors of outcome; treatment duration adds slightly more predictive power. More time in physical therapy was associated positively with motor Functional Independence Measure at discharge and the 1-year anniversary, CHART Physical Independence, Social Integration, and Mobility dimensions, and smaller likelihood of rehospitalization after discharge and reporting of pressure ulcer at the interview. More time in therapeutic recreation also had multiple similar positive associations. Time spent in other disciplines had fewer and mixed relationships. Seven models validated well, two validated moderately well, and four validated poorly. Conclusion Patient characteristics explain a large proportion of variation in multiple outcomes after inpatient rehabilitation. The total amount of treatment received during rehabilitation from each of seven disciplines explains little additional variance. The reasons for this, and for the phenomenon that more hours of service sometimes predict poorer outcome, need additional study. Note This is the first of nine articles in the SCIRehab series. PMID:23318033
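The cross-validation step described in the Methods, comparing the R2 of the development model with the R2 obtained in a validation subsample, can be sketched as below. The abstract does not give the exact formula, so the relative-shrinkage definition used here, (R2_dev - R2_val) / R2_dev, is an assumption, as are all function names. A single-predictor least-squares fit stands in for the study's regression models.

```python
from statistics import mean


def fit_line(x, y):
    """Ordinary least squares for one predictor; returns (slope, intercept)."""
    mx, my = mean(x), mean(y)
    sxx = sum((xi - mx) ** 2 for xi in x)
    slope = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y)) / sxx
    return slope, my - slope * mx


def r_squared(y, y_pred):
    """Coefficient of determination: 1 - SS_residual / SS_total."""
    my = mean(y)
    ss_tot = sum((yi - my) ** 2 for yi in y)
    ss_res = sum((yi - pi) ** 2 for yi, pi in zip(y, y_pred))
    return 1.0 - ss_res / ss_tot


def relative_shrinkage(r2_dev, r2_val):
    """Assumed definition: fractional loss of R2 from development to validation."""
    return (r2_dev - r2_val) / r2_dev
```

In use, the model would be fitted on the 75% development subsample, applied unchanged to the remaining records, and the two R2 values passed to `relative_shrinkage`; small values suggest the model validates well.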
Validation of the Virtual MET as an assessment tool for executive functions.
Rand, Debbie; Basha-Abu Rukan, Soraya; Weiss, Patrice L Tamar; Katz, Noomi
2009-08-01
The purpose of this study was to establish ecological validity and initial construct validity of a Virtual Multiple Errands Test (VMET) as an assessment tool for executive functions. It was implemented within the Virtual Mall (VMall), a novel functional video-capture virtual shopping environment. The main objectives were (1) to examine the relationships between the performance of three groups of participants in the Multiple Errands Test (MET) carried out in a real shopping mall and their performance in the VMET, (2) to assess the relationships between the MET and VMET of the post-stroke participants' level of executive functioning and independence in instrumental activities of daily living, and (3) to compare the performance of post-stroke participants to those of healthy young and older controls in both the MET and VMET. The study population included three groups: post-stroke participants (n = 9), healthy young participants (n = 20), and healthy older participants (n = 20). The VMET was able to differentiate between two age groups of healthy participants and between healthy and post-stroke participants, thus demonstrating that it is sensitive to brain injury and ageing and supports construct validity between known groups. In addition, significant correlations were found between the MET and the VMET for both the post-stroke participants and older healthy participants. This provides initial support for the ecological validity of the VMET as an assessment tool of executive functions. However, further psychometric data on temporal stability are needed, namely test-retest reliability and responsiveness, before it is ready for clinical application. Further research using the VMET as an assessment tool within the VMall with larger groups and in additional populations is also recommended.
CADASTER QSPR Models for Predictions of Melting and Boiling Points of Perfluorinated Chemicals.
Bhhatarai, Barun; Teetz, Wolfram; Liu, Tao; Öberg, Tomas; Jeliazkova, Nina; Kochev, Nikolay; Pukalov, Ognyan; Tetko, Igor V; Kovarich, Simona; Papa, Ester; Gramatica, Paola
2011-03-14
Quantitative structure property relationship (QSPR) studies on per- and polyfluorinated chemicals (PFCs) on melting point (MP) and boiling point (BP) are presented. The training and prediction chemicals used for developing and validating the models were selected from the Syracuse PhysProp database and the literature. The available experimental data sets were split in two different ways: a) random selection on response value, and b) structural similarity verified by self-organizing-map (SOM), in order to propose reliable predictive models, developed only on the training sets and externally verified on the prediction sets. Individual models based on linear and non-linear approaches, developed by different CADASTER partners on 0D-2D Dragon descriptors, E-state descriptors and fragment based descriptors, as well as a consensus model and their predictions, are presented. In addition, the predictive performance of the developed models was verified on a blind external validation set (EV-set) prepared using the PERFORCE database on 15 MP and 25 BP data points, respectively. This database contains only long chain perfluoro-alkylated chemicals, particularly monitored by regulatory agencies like US-EPA and EU-REACH. QSPR models with internal and external validation on two different external prediction/validation sets and a study of the applicability domain, highlighting the robustness and high accuracy of the models, are discussed. Finally, MPs for an additional 303 PFCs and BPs for 271 PFCs were predicted for which experimental measurements are unknown. Copyright © 2011 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Development of the multiple sclerosis (MS) early mobility impairment questionnaire (EMIQ).
Ziemssen, Tjalf; Phillips, Glenn; Shah, Ruchit; Mathias, Adam; Foley, Catherine; Coon, Cheryl; Sen, Rohini; Lee, Andrew; Agarwal, Sonalee
2016-10-01
The Early Mobility Impairment Questionnaire (EMIQ) was developed to facilitate early identification of mobility impairments in multiple sclerosis (MS) patients. We describe the initial development of the EMIQ with a focus on the psychometric evaluation of the questionnaire using classical and item response theory methods. The initial 20-item EMIQ was constructed by clinical specialists and qualitatively tested among people with MS and physicians via cognitive interviews. Data from an observational study was used to make additional updates to the instrument based on exploratory factor analysis (EFA) and item response theory (IRT) analysis, and psychometric analyses were performed to evaluate the reliability and validity of the final instrument's scores and screening properties (i.e., sensitivity and specificity). Based on qualitative interview analyses, a revised 15-item EMIQ was included in the observational study. EFA, IRT and item-to-item correlation analyses revealed redundant items which were removed leading to the final nine-item EMIQ. The nine-item EMIQ performed well with respect to: test-retest reliability (ICC = 0.858); internal consistency (α = 0.893); convergent validity; and known-groups methods for construct validity. A cut-point of 41 on the 0-to-100 scale resulted in sufficient sensitivity and specificity statistics for viably identifying patients with mobility impairment. The EMIQ is a content valid and psychometrically sound instrument for capturing MS patients' experience with mobility impairments in a clinical practice setting. Additional research is suggested to further confirm the EMIQ's screening properties over time.
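How sensitivity and specificity follow from a cut-point on a 0-to-100 score can be shown with a small sketch. It assumes that scores at or above the cut-point flag impairment; the abstract does not state the scoring direction, and the function and argument names are hypothetical.

```python
def screening_stats(scores, impaired, cut_point=41):
    """Return (sensitivity, specificity) for a score threshold.

    scores:   questionnaire scores on the 0-to-100 scale
    impaired: reference-standard labels (True = mobility impairment)
    Assumes a score >= cut_point counts as a positive screen.
    """
    tp = fn = tn = fp = 0
    for score, has_impairment in zip(scores, impaired):
        flagged = score >= cut_point
        if has_impairment and flagged:
            tp += 1          # impaired and correctly flagged
        elif has_impairment:
            fn += 1          # impaired but missed
        elif flagged:
            fp += 1          # not impaired but flagged
        else:
            tn += 1          # not impaired and correctly passed
    return tp / (tp + fn), tn / (tn + fp)
```

Sweeping `cut_point` over the score range and recomputing both values is the usual way a threshold such as 41 is compared with its alternatives.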
Tramm, Trine; Mohammed, Hayat; Myhre, Simen; Kyndi, Marianne; Alsner, Jan; Børresen-Dale, Anne-Lise; Sørlie, Therese; Frigessi, Arnoldo; Overgaard, Jens
2014-10-15
To identify genes predicting benefit of radiotherapy in patients with high-risk breast cancer treated with systemic therapy and randomized to receive or not receive postmastectomy radiotherapy (PMRT). The study was based on the Danish Breast Cancer Cooperative Group (DBCG82bc) cohort. Gene-expression analysis was performed in a training set of frozen tumor tissue from 191 patients. Genes were identified through the Lasso method with the endpoint being locoregional recurrence (LRR). A weighted gene-expression index (DBCG-RT profile) was calculated and transferred to quantitative real-time PCR (qRT-PCR) in corresponding formalin-fixed, paraffin-embedded (FFPE) samples, before validation in FFPE from 112 additional patients. Seven genes were identified, and the derived DBCG-RT profile divided the 191 patients into "high LRR risk" and "low LRR risk" groups. PMRT significantly reduced risk of LRR in "high LRR risk" patients, whereas "low LRR risk" patients showed no additional reduction in LRR rate. Technical transfer of the DBCG-RT profile to FFPE/qRT-PCR was successful, and the predictive impact was successfully validated in another 112 patients. A DBCG-RT gene profile was identified and validated, identifying patients with very low risk of LRR and no benefit from PMRT. The profile may provide a method to individualize treatment with PMRT. ©2014 American Association for Cancer Research.
Validation of the Minority Stress Scale Among Italian Gay and Bisexual Men.
Pala, Andrea Norcini; Dell'Amore, Francesca; Steca, Patrizia; Clinton, Lauren; Sandfort, Theodorus; Rael, Christine
2017-12-01
The experience of sexual orientation stigma (e.g., homophobic discrimination and physical aggression) generates minority stress, a chronic form of psychosocial stress. Minority stress has been shown to have a negative effect on gay and bisexual men's (GBM's) mental and physical health, increasing the rates of depression, suicidal ideation, and HIV risk behaviors. In conservative religious settings, such as Italy, sexual orientation stigma can be more frequently and/or more intensively experienced. However, minority stress among Italian GBM remains understudied. The aim of this study was to explore the dimensionality, internal reliability, and convergent validity of the Minority Stress Scale (MSS), a comprehensive instrument designed to assess the manifestations of sexual orientation stigma. The MSS consists of 50 items assessing (a) Structural Stigma, (b) Enacted Stigma, (c) Expectations of Discrimination, (d) Sexual Orientation Concealment, (e) Internalized Homophobia Toward Others, (f) Internalized Homophobia toward Oneself, and (g) Stigma Awareness. We recruited an online sample of 451 Italian GBM to take the MSS. We tested convergent validity using the Perceived Stress Questionnaire. Through exploratory factor analysis, we extracted the 7 theoretical factors and an additional 3-item factor assessing Expectations of Discrimination From Family Members. The MSS factors showed good internal reliability (ordinal α > .81) and good convergent validity. Our scale can be suitable for applications in research settings, psychosocial interventions, and, potentially, in clinical practice. Future studies will be conducted to further investigate the properties of the MSS, exploring the association with additional health-related measures (e.g., depressive symptoms and anxiety).
Validation of a Malay Version of the Smartphone Addiction Scale among Medical Students in Malaysia.
Ching, Siew Mooi; Yee, Anne; Ramachandran, Vasudevan; Sazlly Lim, Sazlyna Mohd; Wan Sulaiman, Wan Aliaa; Foo, Yoke Loong; Hoo, Fan Kee
2015-01-01
This study was initiated to determine the psychometric properties of the Smart Phone Addiction Scale (SAS) by translating this scale into the Malay language (SAS-M), the main language spoken in Malaysia, and validating the translation. This study can distinguish smart phone and internet addiction among multi-ethnic Malaysian medical students. In addition, the reliability and validity of the SAS were also demonstrated. A total of 228 participants were selected between August 2014 and September 2014 to complete a set of questionnaires, including the SAS and the modified Kimberly Young Internet addiction test (IAT) in the Malay language. There were 99 males and 129 females with ages ranging from 19 to 22 years old (21.7±1.1) included in this study. Descriptive and factor analyses, intra-class coefficients, t-tests and correlation analyses were conducted to verify the reliability and validity of the SAS. Bartlett's test of sphericity was significant (p < 0.01), and the Kaiser-Meyer-Olkin measure of sampling adequacy for the SAS-M was 0.92, indicating that factor analysis was appropriate. The internal consistency and concurrent validity of the SAS-M were verified (Cronbach's alpha = 0.94). All of the subscales of the SAS-M, except for positive anticipation, were significantly related to the Malay version of the IAT. This study developed the first smart phone addiction scale among medical students. This scale was shown to be reliable and valid in the Malay language.
Tang, Hsin-Yao; Beer, Lynn A; Tanyi, Janos L; Zhang, Rugang; Liu, Qin; Speicher, David W
2013-08-26
New serological biomarkers for early detection and clinical management of ovarian cancer are urgently needed, and many candidates have been reported. A major challenge frequently encountered when validating candidates in patients is establishing quantitative assays that distinguish between highly homologous proteins. The current study tested whether multiple members of two recently discovered ovarian cancer biomarker protein families, chloride intracellular channel (CLIC) proteins and tropomyosins (TPM), were detectable in ovarian cancer patient sera. A multiplexed, label-free multiple reaction monitoring (MRM) assay was established to target peptides specific to all detected CLIC and TPM family members, and their serum levels were quantitated for ovarian cancer patients and non-cancer controls. In addition to CLIC1 and TPM1, which were the proteins initially discovered in a xenograft mouse model, CLIC4, TPM2, TPM3, and TPM4 were present in ovarian cancer patient sera at significantly elevated levels compared with controls. Some of the additional biomarkers identified in this homolog-centric verification and validation approach may be superior to the previously identified biomarkers at discriminating between ovarian cancer and non-cancer patients. This demonstrates the importance of considering all potential protein homologs and using quantitative assays for cancer biomarker validation with well-defined isoform specificity. This manuscript addresses the importance of distinguishing between protein homologs and isoforms when identifying and validating cancer biomarkers in plasma or serum. Specifically, it describes the use of targeted in-depth LC-MS/MS analysis to determine the members of two protein families, chloride intracellular channel (CLIC) and tropomyosin (TPM) proteins that are detectable in sera of ovarian cancer patients. 
It then establishes a multiplexed isoform- and homology-specific MRM assay to quantify all observed gene products in these two protein families as well as many of the closely related tropomyosin isoforms. Using this assay, levels of all detected CLICs and TPMs were quantified in ovarian cancer patient and control subject sera. These results demonstrate that in addition to the previously known CLIC1, multiple tropomyosins and CLIC4 are promising new ovarian cancer biomarkers. Based on these initial validation studies, these new ovarian cancer biomarkers appear to be superior to most previously known ovarian cancer biomarkers. Copyright © 2013 Elsevier B.V. All rights reserved.
An Icelandic Version of McMasters Family Assessment Device (FAD)
ERIC Educational Resources Information Center
Juliusdottir, Gudlaug M.; Olafsdottir, Hrefna
2015-01-01
Purpose: An analysis of the psychometric properties of an Icelandic version of McMasters Family Assessment Device (FAD) was conducted in this study. Method: Two groups, clinical and nonclinical, comprising 529 parents answered the FAD. The study examined the internal reliability and discriminant validity of the instrument in addition to…
Federal Register 2010, 2011, 2012, 2013, 2014
2013-12-31
...-scale quantitative validation study of its integrated disclosures with 858 consumers, which concluded.../consumer-financial-protection-bureau-proposes-know-before-you-owe-mortgage-forms/ ; see also Blog post, U.S.... After additional study, the Bureau may propose rules on either or both topics. The Bureau also decided...
3-D and quasi-2-D discrete element modeling of grain commingling in a bucket elevator boot system
USDA-ARS's Scientific Manuscript database
Unwanted grain commingling impedes new quality-based grain handling systems and has proven to be an expensive and time consuming issue to study experimentally. Experimentally validated models may reduce the time and expense of studying grain commingling while providing additional insight into detail...
Iancheva, Dessislava; Trenova, Anastasiya G; Terziyski, Kiril; Kandilarova, Sevdalina; Mantarova, Stefka
2018-04-03
Paced Auditory Serial Addition Test (PASAT) is used for assessment of information processing speed, attention, and working memory, which are the most frequently affected cognitive domains in multiple sclerosis (MS) patients, and may be significantly affected by fatigue. However, the effect of fatigue and mood on the PASAT performance in MS patients translationally validated by fMRI has not been studied yet. The aim of this study is to investigate the translational validity of the PASAT, using fMRI during a paced visual serial addition test (PVSAT) paradigm in patients with relapsing remitting MS (RRMS) and to assess the impact of fatigue and mood on test performance. Fourteen patients with RRMS in remission and 14 healthy controls, matched by sex, age, and educational status, were enrolled in the study. The subjects underwent a standard neurological examination, neuropsychological evaluation with the PASAT 3', fMRI scanning with a PVSAT paradigm, and Beck Depression Inventory. All patients were assessed by the Modified Fatigue Impact Scale (MFIS). Paced Auditory Serial Addition Test score was lower in patients (41.4 ± 15.5 vs 51.6 ± 7.5, P = .035). A moderate negative correlation (r = -0.563, P = 0.036) was found between PASAT and MFIS scores. The fMRI scanning showed significant activations in several clusters that differed between patients and controls. The patient group presented wider cluster activation; Brodmann area (BA) 6-bilaterally; left BA7, 8, and 9; and right BA40, while controls presented with activations in left BA6 and BA44. Significant negative correlations between PASAT score and cortical activations in left BA23, right BA32, and left BA7 were observed in patients only.
Our results show that poorer performance on the PASAT is associated with higher activation in areas connected with working memory, attention, and emotional processes during the fMRI assessment with PVSAT paradigm, which provides evidence for the translational validity of the PASAT in patients with RRMS. © 2018 John Wiley & Sons, Ltd.
Sleep-Wake Evaluation from Whole-Night Non-Contact Audio Recordings of Breathing Sounds
Dafna, Eliran; Tarasiuk, Ariel; Zigel, Yaniv
2015-01-01
Study Objectives To develop and validate a novel non-contact system for whole-night sleep evaluation using breathing sounds analysis (BSA). Design Whole-night breathing sounds (using ambient microphone) and polysomnography (PSG) were simultaneously collected at a sleep laboratory (mean recording time 7.1 hours). A set of acoustic features quantifying breathing pattern was developed to distinguish between sleep and wake epochs (30 sec segments). Epochs (n = 59,108 design study and n = 68,560 validation study) were classified using AdaBoost classifier and validated epoch-by-epoch for sensitivity, specificity, positive and negative predictive values, accuracy, and Cohen's kappa. Sleep quality parameters were calculated based on the sleep/wake classifications and compared with PSG for validity. Setting University affiliated sleep-wake disorder center and biomedical signal processing laboratory. Patients One hundred and fifty patients (age 54.0±14.8 years, BMI 31.6±5.5 kg/m2, m/f 97/53) referred for PSG were prospectively and consecutively recruited. The system was trained (design study) on 80 subjects; validation study was blindly performed on the additional 70 subjects. Measurements and Results Epoch-by-epoch accuracy rate for the validation study was 83.3% with sensitivity of 92.2% (sleep as sleep), specificity of 56.6% (awake as awake), and Cohen's kappa of 0.508. Comparison of sleep quality parameters between BSA and PSG showed average errors in sleep latency, total sleep time, wake after sleep onset, and sleep efficiency of 16.6 min, 35.8 min, 29.6 min, and 8%, respectively. Conclusions This study provides evidence that sleep-wake activity and sleep quality parameters can be reliably estimated solely using breathing sound analysis. This study highlights the potential of this innovative approach to measure sleep in research and clinical circumstances. PMID:25710495
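The epoch-by-epoch agreement statistics named in this abstract (sensitivity, specificity, accuracy, Cohen's kappa) can be illustrated with a short sketch. This is not the authors' code; the function name `epoch_metrics`, the label strings, and the toy data are assumptions made purely for illustration, with "sleep" treated as the positive class as in the abstract.

```python
def epoch_metrics(reference, predicted, positive="sleep"):
    """Epoch-by-epoch agreement between reference labels (e.g. PSG)
    and predicted labels. Hypothetical helper, for illustration only."""
    if len(reference) != len(predicted) or not reference:
        raise ValueError("label sequences must be non-empty and equal in length")

    tp = fp = tn = fn = 0
    for ref, pred in zip(reference, predicted):
        if ref == positive and pred == positive:
            tp += 1          # sleep scored as sleep
        elif ref == positive:
            fn += 1          # sleep scored as wake
        elif pred == positive:
            fp += 1          # wake scored as sleep
        else:
            tn += 1          # wake scored as wake

    n = tp + fp + tn + fn
    accuracy = (tp + tn) / n
    sensitivity = tp / (tp + fn) if (tp + fn) else float("nan")
    specificity = tn / (tn + fp) if (tn + fp) else float("nan")

    # Cohen's kappa: observed agreement corrected for chance agreement,
    # where chance agreement comes from the marginal label frequencies.
    p_chance = ((tp + fn) * (tp + fp) + (tn + fp) * (tn + fn)) / n ** 2
    kappa = (accuracy - p_chance) / (1 - p_chance) if p_chance != 1 else float("nan")

    return {
        "accuracy": accuracy,
        "sensitivity": sensitivity,
        "specificity": specificity,
        "kappa": kappa,
    }


# Toy example with ten 30-second epochs (made-up labels).
reference = ["sleep"] * 8 + ["wake"] * 2
predicted = ["sleep"] * 7 + ["wake"] + ["wake", "sleep"]
print(epoch_metrics(reference, predicted))
```

Because sleep epochs usually far outnumber wake epochs in a whole-night recording, accuracy alone can look high even when wake detection is weak, which is why kappa and specificity are reported alongside it.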
Assessing motivation orientations in schizophrenia: Scale development and validation
Cooper, Shanna; Lavaysse, Lindsey M.; Gard, David E.
2014-01-01
Motivation deficits are common in several disorders including schizophrenia, and are an important factor in both functioning and treatment adherence. Self-Determination Theory (SDT), a leading macro-theory of motivation, has contributed a number of insights into how motivation is impaired in schizophrenia. Nonetheless, self-report measures of motivation appropriate for people with severe mental illness (including those that emphasize SDT) are generally lacking in the literature. To fill this gap, we adapted and abbreviated the well-validated General Causality Orientation Scale for use with people with schizophrenia and with other severe mental disorders (GCOS-clinical populations; GCOS-CP). In Study 1, we tested the similarity of our measure to the existing GCOS (using a college sample) and then validated this new measure in a schizophrenia and healthy control sample (Study 2). Results from Study 1 (N=360) indicated that the GCOS-CP was psychometrically similar to the original GCOS and provided good convergent and discriminant validity. In Study 2, the GCOS-CP was given to individuals with (N=44) and without schizophrenia (N=42). In line with both laboratory-based and observer-based research, people with schizophrenia showed lower motivational autonomy and higher impersonal/amotivated orientations. Additional applications of the GCOS-CP are discussed. PMID:25454115
Tang, Hsin-Yao; Beer, Lynn A.; Tanyi, Janos L.; Zhang, Rugang; Liu, Qin; Speicher, David W.
2013-01-01
New serological biomarkers for early detection and clinical management of ovarian cancer are urgently needed, and many candidates have been reported. A major challenge frequently encountered when validating candidates in patients is establishing quantitative assays that distinguish between highly homologous proteins. The current study tested whether multiple members of two recently discovered ovarian cancer biomarker protein families, chloride intracellular channel (CLIC) proteins and tropomyosins (TPM), were detectable in ovarian cancer patient sera. A multiplexed, label-free multiple reaction monitoring (MRM) assay was established to target peptides specific to all detected CLIC and TPM family members, and their serum levels were quantitated for ovarian cancer patients and non-cancer controls. In addition to CLIC1 and TPM1, which were the proteins initially discovered in a xenograft mouse model, CLIC4, TPM2, TPM3, and TPM4 were present in ovarian cancer patient sera at significantly elevated levels compared with controls. Some of the additional biomarkers identified in this homolog-centric verification and validation approach may be superior to the previously identified biomarkers at discriminating between ovarian cancer and non-cancer patients. This demonstrates the importance of considering all potential protein homologs and using quantitative assays for cancer biomarker validation with well-defined isoform specificity. PMID:23792823
Robustness and Uncertainty: Applications for Policy in Climate and Hydrological Modeling
NASA Astrophysics Data System (ADS)
Fields, A. L., III
2015-12-01
Policymakers must often decide how to proceed when presented with conflicting simulation data from hydrological, climatological, and geological models. While laboratory sciences often appeal to the reproducibility of results to argue for the validity of their conclusions, simulations cannot use this strategy for a number of pragmatic and methodological reasons. However, robustness of predictions and causal structures can serve the same function for simulations as reproducibility does for laboratory experiments and field observations in either adjudicating between conflicting results or showing that there is insufficient justification to externally validate the results. Additionally, an interpretation of the argument from robustness is presented that involves appealing to the convergence of many well-built and diverse models rather than the more common version which involves appealing to the probability that one of a set of models is likely to be true. This interpretation strengthens the case for taking robustness as an additional requirement for the validation of simulation results and ultimately supports the idea that computer simulations can provide information about the world that is just as trustworthy as data from more traditional laboratory studies and field observations. Understanding the importance of robust results for the validation of simulation data is especially important for policymakers making decisions on the basis of potentially conflicting models. Applications will span climate, hydrological, and hydroclimatological models.
Development and validation of the Vietnamese primary care assessment tool
2018-01-01
Objective To adapt the consumer version of the Primary Care Assessment Tool (PCAT) for Vietnam and determine its internal consistency and validity. Design A quantitative cross sectional study. Setting 56 communes in 3 representative provinces of central Vietnam. Participants A total of 3289 people who used health care services at a health facility at least once over the past two years. Results The Vietnamese adult expanded consumer version of the PCAT (VN PCAT-AE) is an instrument for evaluation of primary care in Vietnam with 70 items comprising six scales representing four core primary care domains, and three additional scales representing three derivative domains. Sixteen other items from the original tool were not included in the final instrument, due to problems with missing values, floor or ceiling effects, and item-total correlations. All the retained scales have a Cronbach’s alpha above 0.70 except for the subscale of Family Centeredness. Conclusions The VN PCAT-AE demonstrates adequate internal consistency and validity to be used as an effective tool for measuring the quality of primary care in Vietnam from the consumer perspective. Additional work in the future to optimize valid measurement in all domains consistent with the original version of the tool may be helpful as the primary care system in Vietnam further develops. PMID:29324851
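Several records in this listing report Cronbach's alpha as the measure of internal consistency, often against a 0.70 threshold. A minimal sketch of the standard formula, alpha = k/(k-1) * (1 - sum of item variances / variance of total scores), is given below. The function name `cronbach_alpha` and the toy responses are assumptions for illustration and are not taken from any of the studies.

```python
from statistics import variance


def cronbach_alpha(rows):
    """Cronbach's alpha for a respondents-by-items score table.

    rows: list of respondents, each a list of k item scores.
    Uses sample variances (n - 1 denominator) throughout.
    Hypothetical helper, for illustration only.
    """
    if len(rows) < 2:
        raise ValueError("need at least two respondents")
    k = len(rows[0])
    if k < 2 or any(len(r) != k for r in rows):
        raise ValueError("need at least two items and equal-length rows")

    # Variance of each item across respondents.
    item_variances = [variance([r[j] for r in rows]) for j in range(k)]
    # Variance of each respondent's total score.
    total_variance = variance([sum(r) for r in rows])
    if total_variance == 0:
        raise ValueError("total scores have zero variance; alpha is undefined")

    return k / (k - 1) * (1 - sum(item_variances) / total_variance)


# Toy data: three respondents, two items (made-up scores).
print(cronbach_alpha([[1, 2], [2, 1], [3, 3]]))
```

When every item ranks respondents identically, the sum of item variances is small relative to the total-score variance and alpha approaches 1; items that vary independently of each other push it toward 0.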
Luo, Wen; Medrek, Sarah; Misra, Jatin; Nohynek, Gerhard J
2007-02-01
The objective of this study was to construct and validate a quantitative structure-activity relationship model for skin absorption. Such models are valuable tools for screening and prioritization in safety and efficacy evaluation, and risk assessment of drugs and chemicals. A database of 340 chemicals with percutaneous absorption was assembled. Two models were derived from the training set consisting of 306 chemicals (90/10 random split). In addition to the experimental K(ow) values, over 300 2D and 3D atomic and molecular descriptors were analyzed using MDL's QsarIS computer program. Subsequently, the models were validated using both internal (leave-one-out) and external validation (test set) procedures. Using the stepwise regression analysis, three molecular descriptors were determined to have significant statistical correlation with K(p) (R2 = 0.8225): logK(ow), X0 (quantification of both molecular size and the degree of skeletal branching), and SsssCH (count of aromatic carbon groups). In conclusion, two models to estimate skin absorption were developed. When compared to other skin absorption QSAR models in the literature, our model incorporated more chemicals and explored a large number of descriptors. Additionally, our models are reasonably predictive and have met both internal and external statistical validations.
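The internal (leave-one-out) validation mentioned in this abstract can be sketched as follows. This is a deliberately simplified illustration, not the study's model: it assumes a single descriptor and ordinary least squares, whereas the abstract describes a three-descriptor stepwise regression. The names `fit_line` and `loo_q2` and the toy numbers are hypothetical.

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("descriptor values are all identical")
    slope = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / sxx
    return slope, mean_y - slope * mean_x


def loo_q2(xs, ys):
    """Leave-one-out cross-validated Q^2 = 1 - PRESS / SS_total.

    Each observation is predicted by a model fitted without it.
    SS_total here uses the mean of all observations, which is one
    common convention among several.
    """
    xs, ys = list(xs), list(ys)
    n = len(xs)
    if n < 3:
        raise ValueError("need at least three observations")
    press = 0.0
    for i in range(n):
        slope, intercept = fit_line(xs[:i] + xs[i + 1:], ys[:i] + ys[i + 1:])
        press += (ys[i] - (slope * xs[i] + intercept)) ** 2
    mean_y = sum(ys) / n
    ss_total = sum((y - mean_y) ** 2 for y in ys)
    return 1 - press / ss_total


# Toy data: descriptor values and responses (made-up numbers).
print(loo_q2([0, 1, 2, 3, 4], [1.0, 3.2, 4.9, 7.1, 9.0]))
```

Q^2 is never higher than the fitted R^2 on the same data for a least-squares model, so a large gap between the two is a common warning sign of overfitting. External validation on a held-out test set, as in the 90/10 split above, checks the same thing with data the model never saw.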
Zhang, Xin; Wu, Yuxia; Ren, Pengwei; Liu, Xueting; Kang, Deying
2015-10-30
To explore the relationship between the external validity and the internal validity of hypertension RCTs conducted in China. Comprehensive literature searches were performed in Medline, Embase, Cochrane Central Register of Controlled Trials (CCTR), CBMdisc (Chinese biomedical literature database), CNKI (China National Knowledge Infrastructure/China Academic Journals Full-text Database) and VIP (Chinese scientific journals database), and advanced search strategies were used to locate hypertension RCTs. The risk of bias in RCTs was assessed with a modified scale and the Jadad scale, respectively, and studies with grading scores of 3 or more were then included for the evaluation of external validity. A data extraction form including 4 domains and 25 items was used to explore the relationship between external validity and internal validity. Statistical analyses were performed using SPSS software, version 21.0 (SPSS, Chicago, IL). 226 hypertension RCTs were included in the final analysis. RCTs conducted in university affiliated hospitals (P < 0.001) or secondary/tertiary hospitals (P < 0.001) scored higher on internal validity. Multi-center studies (median = 4.0, IQR = 2.0) had higher internal validity scores than single-center studies (median = 3.0, IQR = 1.0) (P < 0.001). Funding-supported trials had better methodological quality (P < 0.001). In addition, the reporting of inclusion criteria also leads to better internal validity (P = 0.004). Multivariate regression indicated that sample size, industry funding, quality of life (QOL) taken as a measure, and a university affiliated hospital as the trial setting were statistically significant (P < 0.001, P < 0.001, P = 0.001, P = 0.006, respectively). Several components related to the external validity of RCTs are associated with internal validity, although the two do not stand in a simple relationship to each other.
Given the poor reporting, other possible links between the two variables need to be traced in future methodological research.
Moreira, Sérgio; Carreiras, Joana; Cooper, Cary; Smeed, Matthew; Reis, Maria de Fátima; Pereira Miguel, José
2018-01-01
Objective The main objective of this work was to translate the English version of ASSET (A Shortened Stress Evaluation Tool) into the Portuguese version and to validate its psychometric properties. Additionally, this work tested the convergent validity of the instrument. Methods The translation and retroversion were conducted by experts and submitted to the authors for approval. Within an observational, cross-sectional study, regarding mental health at the workplace, ASSET together with other scales was applied to a sample of 405 participants. The psychometric validity of the subscales was studied using confirmatory factor analysis. Results The factorial structure of ASSET is globally supported by the results, with the Perceptions of Your Job and Attitudes Towards your Organisation subscales requiring slight adjustments in the item structure and the Your Health subscales replicating the original structure. The convergent validity also supports the ASSET, showing that all subscales are significantly correlated with variables used to test convergence. Conclusions Globally, the results constitute an important contribution to ASSET and open the possibility of its usage among Portuguese-speaking countries. The results provide evidence for the validity of the instrument and, in particular, of the mental and physical health subscales. PMID:29440211
Clarke, Diana E; Van Reekum, Robert; Patel, Jigisha; Simard, Martine; Gomez, Everlyne; Streiner, David L
2007-01-01
This article examines the psychometric properties of the clinician version of the Apathy Evaluation Scale (AES-C) to determine its ability to characterize, quantify and differentiate apathy. Critical appraisals of the item-reduction processes, effectiveness of the administration, coding and scoring procedures, and the reliability and validity of the scale were carried out. For training, administration and rating of the AES-C, clearer guidelines, including a more standardized list of verbal and non-verbal apathetic cues, are needed. There is evidence of high internal consistency for the scale across studies. In addition, the original study reported good test-retest and inter-rater reliability coefficients. However, there is a lack of replication on these more stable and informative measures of reliability and as such they warrant further investigation. The research evidence confirms that the AES-C shows good discriminant, convergent and criterion validity. However, evidence of its predictive validity is limited. As this aspect of validity refers to the scale's ability to predict future outcomes, which is important for treatment and rehabilitation planning, further assessment of the predictive validity of the AES-C is needed. In conclusion, the AES-C is a reliable and valid measure for the characterization and quantification of apathy. Copyright (c) 2007 John Wiley & Sons, Ltd.
Howe, Tsu-Hsin; Chen, Hao-Ling; Lee, Candy Chieh; Chen, Ying-Dar; Wang, Tien-Ni
2017-10-01
Visual perceptual motor skills have been proposed as underlying causes of handwriting difficulties. However, there is no evaluation tool currently available to assess these skills comprehensively and to serve as a sensitive measure. The purpose of this study was to validate the Computerized Perceptual Motor Skills Assessment (CPMSA), a newly developed evaluation tool for children in early elementary grades. Its test-retest reliability, concurrent validity, discriminant validity, and responsiveness were examined in 43 typically developing children and 26 children with handwriting difficulty. The CPMSA demonstrated excellent reliability across all subtests with intra-class correlation coefficients (ICCs)≥0.80. Significant moderate correlations between the domains of the CPMSA and corresponding gold standards including Beery VMI, the TVPS-3, and the eye-hand coordination subtest of the DTVP-2 demonstrated good concurrent validity. In addition, the CPMSA showed evidence of discriminant validity in samples of children with and without handwriting difficulty. This article provides evidence in support of the CPMSA. The CPMSA is a reliable, valid, and promising measure of visual perceptual motor skills for children in early elementary grades. Directions for future study and improvements to the assessment are discussed. Copyright © 2017. Published by Elsevier Ltd.
Soto, Sandra C; Arredondo, Elva M; Horton, Lucy A; Ayala, Guadalupe X
2016-03-01
Research shows that Latino parenting practices influence children's dietary and weight outcomes. Most studies use parent-reported data, however data from children may provide additional insight into how parents influence their children's diet and weight outcomes. The Parenting Strategies for Eating and Activity Scale (PEAS) has been validated in Latino adults, but not in children. This study evaluated the factor structure and concurrent and predictive validity of a modified version of the PEAS (PEAS-Diet) among Latino children. Data were collected from 361 children ages 7-13 from Imperial County, California, enrolled in a randomized controlled trial to promote healthy eating. The PEAS-Diet included 25 candidate items targeting six parenting practices pertaining to children's eating behaviors: (a) monitoring; (b) disciplining; (c) control; (d) permissiveness; (e) reinforcing; and (f) limit-setting. Children were on average ten years old (±2), 50% boys, 93% self-identified as Latino, 81% were US-born, and 55% completed English versus Spanish-language interviews. Using varimax rotation on baseline data with the total sample, six items were removed due to factor loadings <.40 and/or cross-loading (>.32 on more than one component). Parallel analysis and interpretability suggested a 5-factor solution explaining 59.46% of the variance. The subscale "limit-setting" was removed from the scale. The final scale consisted of 19 items and 5 subscales. Internal consistency of the subscales ranged from α = .63-.82. Confirmatory factor analyses provided additional evidence for the 5-factor scale using data collected 4 and 6 months post-baseline among the control group (n = 164, n = 161, respectively). Concurrent validity with dietary intake was established for monitoring, control, permissiveness, and reinforcing subscales in the expected directions. Predictive validity was not established. 
Results indicated that with the reported changes, the interview-administered PEAS-Diet is valid among Latino children aged 7-13 years. Copyright © 2015 Elsevier Ltd. All rights reserved.
Flood damage estimation of companies: A comparison of Stage-Damage-Functions and Random Forests
NASA Astrophysics Data System (ADS)
Sieg, Tobias; Kreibich, Heidi; Vogel, Kristin; Merz, Bruno
2017-04-01
The development of appropriate flood damage models plays an important role not only for the damage assessment after an event but also to develop adaptation and risk mitigation strategies. So called Stage-Damage-Functions (SDFs) are often applied as a standard approach to estimate flood damage. These functions assign a certain damage to the water depth depending on the use or other characteristics of the exposed objects. Recent studies apply machine learning algorithms like Random Forests (RFs) to model flood damage. These algorithms usually consider more influencing variables and promise to depict a more detailed insight into the damage processes. In addition they provide an inherent validation scheme. Our study focuses on direct, tangible damage of single companies. The objective is to model and validate the flood damage suffered by single companies with SDFs and RFs. The data sets used are taken from two surveys conducted after the floods in the Elbe and Danube catchments in the years 2002 and 2013 in Germany. Damage to buildings (n = 430), equipment (n = 651) as well as goods and stock (n = 530) are taken into account. The model outputs are validated via a comparison with the actual flood damage acquired by the surveys and subsequently compared with each other. This study investigates the gain in model performance with the use of additional data and the advantages and disadvantages of the RFs compared to SDFs. RFs show an increase in model performance with an increasing amount of data records over a comparatively large range, while the model performance of the SDFs is already saturated for a small set of records. In addition, the RFs are able to identify damage influencing variables, which improves the understanding of damage processes. Hence, RFs can slightly improve flood damage predictions and provide additional insight into the underlying mechanisms compared to SDFs.
Development and validation of age-dependent FE human models of a mid-sized male thorax.
El-Jawahri, Raed E; Laituri, Tony R; Ruan, Jesse S; Rouhana, Stephen W; Barbat, Saeed D
2010-11-01
The increasing number of people over 65 years old (YO) is an important research topic in the area of impact biomechanics, and finite element (FE) modeling can provide valuable support for related research. There were three objectives of this study: (1) Estimation of the representative age of the previously-documented Ford Human Body Model (FHBM) -- an FE model which approximates the geometry and mass of a mid-sized male, (2) Development of FE models representing two additional ages, and (3) Validation of the resulting three models to the extent possible with respect to available physical tests. Specifically, the geometry of the model was compared to published data relating rib angles to age, and the mechanical properties of different simulated tissues were compared to a number of published aging functions. The FHBM was determined to represent a 53-59 YO mid-sized male. The aforementioned aging functions were used to develop FE models representing two additional ages: 35 and 75 YO. The rib model was validated against human rib specimens and whole rib tests, under different loading conditions, with and without modeled fracture. In addition, the resulting three age-dependent models were validated by simulating cadaveric tests of blunt and sled impacts. The responses of the models, in general, were within the cadaveric response corridors. When compared to peak responses from individual cadavers similar in size and age to the age-dependent models, some responses were within one standard deviation of the test data. All the other responses, but one, were within two standard deviations.
Validation of the Older Adult Social Evaluative Scale (OASES) as a measure of social anxiety.
Kok, Brian C; Ma, Vanessa K; Gould, Christine E
2018-03-21
Social anxiety disorder (SAD) (formerly called social phobia) is among the most common mental health diagnoses among older adults; however, the research on late-life social anxiety is scarce. A limited number of studies have examined the assessment and diagnosis of social anxiety disorder in this population, and there are few social anxiety measures that are validated for use with older adults. One such measure, the Older Adult Social Evaluative Scale (OASES), was designed for use with this population, but until now has lacked validation against a gold-standard diagnostic interview. Using a sample of 47 community-dwelling older adults (aged 60 years and over) with anxiety, the present study compared OASES performance to that of the Structured Clinical Interview for DSM-5 Disorders (SCID-5), as well as other measures of anxiety and depression. The OASES demonstrated convergent validity with other measures of anxiety, and demonstrated discriminant validity on other measures (e.g. depression, somatic symptoms). Receiver operating characteristic (ROC) analysis revealed that a cut-point of ≥76 optimized sensitivity and specificity compared to SCID-5 derived diagnoses of social anxiety disorder. This study is the first study to provide psychometric validation for the OASES and one of the first to administer the SCID-5 to an older adult sample. In addition to establishing a clinically significant cut-off, this study also describes the clinical utility of the OASES, which can be used to identify distressing situations, track anxiety severity, and monitor behavioral avoidance across a variety of social situations.
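The ROC-based choice of a cut-point described in this abstract can be illustrated with a small sketch. The abstract says only that the cut-point "optimized sensitivity and specificity"; the sketch below assumes Youden's J (sensitivity + specificity - 1) as the criterion, which is one common choice and may not be the one the authors used. The function name `best_cutpoint` and the toy scores are hypothetical.

```python
def best_cutpoint(scores, has_diagnosis):
    """Pick the cut-off that maximizes Youden's J = sensitivity + specificity - 1.

    A score >= cut-off counts as a positive screen. `has_diagnosis`
    holds the reference-standard result (e.g. a structured interview)
    as booleans. Returns (cutoff, sensitivity, specificity).
    Hypothetical helper, for illustration only.
    """
    positives = [s for s, d in zip(scores, has_diagnosis) if d]
    negatives = [s for s, d in zip(scores, has_diagnosis) if not d]
    if not positives or not negatives:
        raise ValueError("need at least one diagnosed and one undiagnosed case")

    best = None
    for cutoff in sorted(set(scores)):
        sensitivity = sum(s >= cutoff for s in positives) / len(positives)
        specificity = sum(s < cutoff for s in negatives) / len(negatives)
        j = sensitivity + specificity - 1
        # Strict '>' keeps the lowest cut-off when several tie on J.
        if best is None or j > best[0]:
            best = (j, cutoff, sensitivity, specificity)
    return best[1], best[2], best[3]


# Toy data: scale scores and interview-based diagnoses (made-up values).
scores = [50, 60, 70, 80, 90, 100]
diagnosed = [False, False, False, True, True, True]
print(best_cutpoint(scores, diagnosed))
```

Youden's J weights false positives and false negatives equally. A screening tool meant to rule out a disorder might instead favor sensitivity and accept a lower cut-off.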
Ghazanfari, Zeinab; Niknami, Shamsaddin; Ghofranipour, Fazlollah; Hajizadeh, Ebrahim; Montazeri, Ali
2010-11-09
Background: This study was carried out to develop a scale for assessing diabetic patients' perceptions about physical activity, the Physical Activity Questionnaire for Diabetic Patients (PAQ-DP), and to test its psychometric properties. Methods: An item pool was generated from the Theory of Planned Behavior literature. An expert panel then evaluated the items by assessing the content validity index and content validity ratio. Subsequently, exploratory factor analysis (EFA) was performed to identify the scale constructs. In addition, reliability analyses including internal consistency and test-retest analysis were carried out. Results: In all, a sample of 127 women with diabetes participated in the study. Twenty-two items were initially extracted from the literature. A six-factor solution (containing 19 items) emerged from the exploratory factor analysis, namely instrumental attitude, subjective norm, perceived behavioral control, affective attitude, self-identity, and intention, explaining 60.30% of the variance observed. Additional analyses indicated satisfactory results for internal consistency (Cronbach's alpha ranging from 0.54 to 0.8) and intraclass correlation coefficients (ranging from 0.40 to 0.92). Conclusions: The PAQ-DP is the first instrument that applies the Theory of Planned Behavior in its constructs. The findings indicated that the PAQ-DP is a reliable and valid measure for assessing physical activity perceptions; it is now available for use in future studies. PMID:21062466
Validation and Spatiotemporal Analysis of CERES Surface Net Radiation Product
Jia, Aolin; Jiang, Bo; Liang, Shunlin; ...
2016-01-23
The Clouds and the Earth’s Radiant Energy System (CERES) generates one of the few global satellite radiation products. The CERES ARM Validation Experiment (CAVE) has been providing long-term in situ observations for the validation of the CERES products. However, the number of these sites is low and their distribution is globally sparse, and particularly the surface net radiation product has not been rigorously validated yet. Therefore, additional validation efforts are highly required to determine the accuracy of the CERES radiation products. In this study, global land surface measurements were comprehensively collected for use in the validation of the CERES net radiation (Rn) product on a daily (340 sites) and a monthly (260 sites) basis, respectively. The validation results demonstrated that the CERES Rn product was, overall, highly accurate. The daily validations had a Mean Bias Error (MBE) of 3.43 W·m⁻², Root Mean Square Error (RMSE) of 33.56 W·m⁻², and R² of 0.79, and the monthly validations had an MBE of 3.40 W·m⁻², RMSE of 25.57 W·m⁻², and R² of 0.84. The accuracy was slightly lower for the high latitudes. Following the validation, the monthly CERES Rn product, from March 2000 to July 2014, was used for a further analysis. We analysed the global spatiotemporal variation of the Rn, which occurred during the measurement period. In addition, two hot spot regions, the southern Great Plains and south-central Africa, were then selected for use in determining the driving factors or attribution of the Rn variation. We determined that Rn over the southern Great Plains decreased by −0.33 W·m⁻² per year, which was mainly driven by changes in surface green vegetation and precipitation. In south-central Africa, Rn decreased at a rate of −0.63 W·m⁻² per year, the major driving factor of which was surface green vegetation.
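The three accuracy statistics quoted above can be sketched as follows. This is an illustration under stated assumptions, not the authors' processing code: the bias is taken as satellite estimate minus ground observation, R² is taken as the squared Pearson correlation, and the function name and toy values are hypothetical.

```python
import math


def validation_metrics(estimated, observed):
    """Return (MBE, RMSE, R^2) for paired estimates and observations.

    Assumptions: MBE is mean(estimated - observed) and R^2 is the squared
    Pearson correlation; the source abstract does not spell out either.
    """
    if len(estimated) != len(observed) or len(observed) < 2:
        raise ValueError("need two equal-length series with at least 2 pairs")
    n = len(observed)
    diffs = [e - o for e, o in zip(estimated, observed)]
    mbe = sum(diffs) / n
    rmse = math.sqrt(sum(d * d for d in diffs) / n)

    mean_e = sum(estimated) / n
    mean_o = sum(observed) / n
    cov = sum((e - mean_e) * (o - mean_o) for e, o in zip(estimated, observed))
    var_e = sum((e - mean_e) ** 2 for e in estimated)
    var_o = sum((o - mean_o) ** 2 for o in observed)
    r2 = (cov * cov) / (var_e * var_o)
    return mbe, rmse, r2


# Toy daily net radiation values in W/m^2 (hypothetical, not CERES data).
toy_estimated = [102.0, 98.0, 150.0, 60.0]
toy_observed = [100.0, 100.0, 145.0, 55.0]
mbe, rmse, r2 = validation_metrics(toy_estimated, toy_observed)
```

A positive MBE under this sign convention means the product overestimates on average, while RMSE also penalizes scatter, which is why a product can show a small bias and a much larger RMSE at the same time.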
Lievaart, Marien; Franken, Ingmar H A; Hovens, Johannes E
2016-03-01
The most commonly used instrument for measuring anger is the State-Trait Anger Expression Inventory-2 (STAXI-2; Spielberger, 1999). This study further examines the validity of the STAXI-2 and compares anger scores between several clinical and nonclinical samples. Reliability, concurrent, and construct validity were investigated in Dutch undergraduate students (N = 764), a general population sample (N = 1211), and psychiatric outpatients (N = 226). The results support the reliability and validity of the STAXI-2. Concurrent validity was strong, with meaningful correlations between the STAXI-2 scales and anger-related constructs in both clinical and nonclinical samples. Importantly, patients showed higher experience and expression of anger than the general population sample. Additionally, forensic outpatients with addiction problems reported higher Anger Expression-Out than general psychiatric outpatients. Our conclusion is that the STAXI-2 is a suitable instrument to measure both the experience and the expression of anger in both general and clinical populations. © 2016 Wiley Periodicals, Inc.
Assessing students' communication skills: validation of a global rating.
Scheffer, Simone; Muehlinghaus, Isabel; Froehmel, Annette; Ortwein, Heiderose
2008-12-01
Communication skills training is an accepted part of undergraduate medical programs nowadays. In addition to learning experiences its importance should be emphasised by performance-based assessment. As detailed checklists have been shown to be not well suited for the assessment of communication skills for different reasons, this study aimed to validate a global rating scale. A Canadian instrument was translated to German and adapted to assess students' communication skills during an end-of-semester-OSCE. Subjects were second and third year medical students at the reformed track of the Charité-Universitaetsmedizin Berlin. Different groups of raters were trained to assess students' communication skills using the global rating scale. Validity testing included concurrent validity and construct validity: Judgements of different groups of raters were compared to expert ratings as a defined gold standard. Furthermore, the amount of agreement between scores obtained with this global rating scale and a different instrument for assessing communication skills was determined. Results show that communication skills can be validly assessed by trained non-expert raters as well as standardised patients using this instrument.
Development and validation of an Arab family food security scale.
Sahyoun, Nadine R; Nord, Mark; Sassine, Anniebelle J; Seyfert, Karin; Hwalla, Nahla; Ghattas, Hala
2014-05-01
The objective of this article was to describe 1) the validation of 2 similar but not identical food security modules used to collect data from 2 vulnerable populations, southern Lebanon residents (n = 815) and Palestinian refugees in Lebanon (n = 2501), and 2) the development and validation of an Arab Family Food Security Scale (AFFSS). The surveys used a cluster-randomized sampling design. The 2 food security subscales underwent face and construct validity. In addition, both of these tools and the AFFSS underwent psychometric assessment for internal validity by using statistical methods based on Item Response Theory. The food security questions tested by focus groups were understood and accepted in all regions of Lebanon. The food security subscales and the AFFSS had acceptable levels of internal consistency. The psychometric assessment confirmed that the 7 items of the AFFSS had good internal validity and reasonable reliability with item in-fits from 0.73 to 1.16. Food insecurity was identified among 42% of southern Lebanese and 62% of Palestinian refugee households. The determinants and consequences of food security measured in this study provide additional support for the validity of the modules. Using multivariate logistic regression, the higher the mean monthly income per household member and the higher the educational attainment of the head of household, the lower the risk of food insecurity [ORs (95% CIs): 0.99 (0.98, 0.99) and 0.66 (0.54, 0.80), respectively]. There was a strong significant association between food insecurity and lower food expenditure and lower intake of all food categories except for legumes, which was significantly associated in the opposite direction (P < 0.001). The odds of borrowing money and accepting gifts/donations were significantly higher among moderately and severely food-insecure households (P < 0.000). The AFFSS has been validated within Lebanon and can potentially be extended to other Arab-speaking populations.
Validation of the Modified Fatigue Impact Scale in Parkinson's disease.
Schiehser, Dawn M; Ayers, Catherine R; Liu, Lin; Lessig, Stephanie; Song, David S; Filoteo, J Vincent
2013-03-01
Fatigue is a common symptom in Parkinson's disease (PD); however, a multidimensional scale that measures the impact of fatigue on functioning has yet to be validated in this population. The aim of this study was to examine the validity of the Modified Fatigue Impact Scale (MFIS), a self-report measure that assesses the effects of fatigue on physical, cognitive, and psychosocial functioning, in a sample of nondemented PD patients. PD patients (N = 100) completed the MFIS, the Positive and Negative Affect Schedule (PANAS-X), and several additional measures of psychosocial, cognitive, and motor functioning. A Principal Component Analysis (PCA) and item analysis using Cronbach's alpha were conducted to determine structural validity and internal consistency of the MFIS. Correlational analyses were performed between the MFIS and the PANAS-X fatigue subscale to evaluate convergent validity and between the MFIS and measures of depression, anxiety, apathy, and disease-related symptoms to determine divergent validity. The PCA identified two viable MFIS subscales: a cognitive subscale and a combination of the original scale's physical and psychosocial subscales as one factor. Item analysis revealed high internal consistency of all 21 items and the items within the two subscales. The MFIS had strong convergent validity with the PANAS-X fatigue subscale and adequate divergent validity with measures of disease stage, motor function, and cognition. Overall, this study demonstrates that the MFIS is a valid multidimensional measure that can be used to evaluate the impact of fatigue on cognitive and physical/social functioning in PD patients without dementia. Published by Elsevier Ltd.
Development and validation of a stock addiction inventory (SAI).
Youn, HyunChul; Choi, Jung-Seok; Kim, Dai-Jin; Choi, Sam-Wook
2016-01-01
Investing in financial markets is promoted and protected by the government as an essential economic activity, but it can turn into a gambling addiction problem. Until now, few scales have been widely used to identify gambling addicts in financial markets. This study aimed to develop a self-rating scale, the stock addiction inventory (SAI), to identify them, and to demonstrate its reliability and validity. A set of questionnaires, including the SAI, the South Oaks Gambling Screen (SOGS), and the DSM-5 diagnostic criteria for gambling disorder, was completed by 1005 participants. Factor analysis, internal consistency testing, t tests, analysis of variance, and partial correlation analysis were conducted to verify the reliability and validity of the SAI. The factor analysis yielded a final SAI consisting of two factors and nine items. The internal consistency and concurrent validity of the SAI were verified. The Cronbach's α for the total scale was 0.892, and the SAI and its factors were significantly correlated with the SOGS. This study developed a specific scale for financial market investments or trading; this scale proved to be reliable and valid. Our scale expands the understanding of gambling addiction in financial markets and provides a diagnostic reference.
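Several abstracts in this listing report Cronbach's α as their internal-consistency statistic. A minimal sketch of the standard formula, α = k/(k − 1) × (1 − Σ item variances / variance of total scores), using sample variances; the function name and the toy responses are hypothetical and are not data from any study listed here:

```python
from statistics import variance


def cronbach_alpha(responses):
    """Cronbach's alpha for a list of respondents, each a list of k item scores.

    Uses sample variances (n - 1 denominator) for both the items and the
    total scores, so the two cancel consistently in the ratio.
    """
    if len(responses) < 2:
        raise ValueError("need at least two respondents")
    k = len(responses[0])
    if k < 2 or any(len(row) != k for row in responses):
        raise ValueError("every respondent needs the same k >= 2 items")

    item_columns = list(zip(*responses))            # one tuple per item
    item_variance_sum = sum(variance(col) for col in item_columns)
    total_variance = variance([sum(row) for row in responses])
    return (k / (k - 1)) * (1 - item_variance_sum / total_variance)


# Five hypothetical respondents answering a three-item scale.
toy_responses = [
    [3, 4, 3],
    [2, 2, 3],
    [4, 5, 5],
    [1, 2, 1],
    [5, 4, 5],
]
alpha = cronbach_alpha(toy_responses)
```

Because α rises with the number of items as well as with inter-item correlation, values from scales of different lengths are not directly comparable.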
Development of the Teen Oral Health-Related Quality of Life Instrument.
Wright, Wanda G; Spiro, Avron; Jones, Judith A; Rich, Sharron E; Garcia, Raul I
2017-03-01
The aim of this study was to develop the Teen Oral Health-Related Quality of Life instrument (TOQOL) and demonstrate its validity and reliability in a diverse sample of 13-18-year-old adolescents. A total of 363 adolescents aged 13-18 years participated in this cross-sectional study. Oral screening examinations were conducted to collect oral health status data. Adolescents completed the TOQOL and a generic measure of health-related quality of life, the PedsQL. The psychometric properties of the TOQOL were evaluated in terms of face, content, convergent, concurrent and discriminant validity in addition to internal reliability. The 16-item TOQOL covers five domains: Physical functioning, Role functioning, Social functioning, Oral problems, and Emotional functioning. The total scale and subscales showed satisfactory reliability, with Cronbach's alpha ranging from 0.75 to 0.92. TOQOL scores showed significant associations with perceived oral health status and the PedsQL (convergent validity) and discriminated well between adolescents with caries and adolescents who were caries free (discriminant validity). The TOQOL is a valid and reliable oral health-related quality of life measurement that can be recommended for self-report in adolescents aged 13-18 years. © 2016 American Association of Public Health Dentistry.
Traditional Masculinity and Femininity: Validation of a New Scale Assessing Gender Roles.
Kachel, Sven; Steffens, Melanie C; Niedlich, Claudia
2016-01-01
Gender stereotype theory suggests that men are generally perceived as more masculine than women, whereas women are generally perceived as more feminine than men. Several scales have been developed to measure fundamental aspects of gender stereotypes (e.g., agency and communion, competence and warmth, or instrumentality and expressivity). Although omitted in later versions, Bem's original Sex Role Inventory included the items "masculine" and "feminine" in addition to more specific gender-stereotypical attributes. We argue that it is useful to be able to measure these two core concepts in a reliable, valid, and parsimonious way. We introduce a new and brief scale, the Traditional Masculinity-Femininity (TMF) scale, designed to assess central facets of self-ascribed masculinity-femininity. Studies 1-2 used known-groups approaches (participants differing in gender and sexual orientation) to validate the scale and provide evidence of its convergent validity. As expected, the TMF reliably measured a one-dimensional masculinity-femininity construct. Moreover, the TMF correlated moderately with other gender-related measures. Demonstrating incremental validity, the TMF predicted gender and sexual orientation better than established adjective-based measures. Furthermore, the TMF was connected to criterion characteristics, such as being judged as straight by laypersons for the whole sample, voice pitch characteristics for the female subsample, and contact with gay men for the male subsample, and it outperformed other gender-related scales. Taken together, as long as gender differences continue to exist, we suggest that the TMF provides a valuable methodological addition for research into gender stereotypes.
Collins, Anne; Ross, Janine
2017-01-01
We performed a systematic review to identify all original publications describing the asymmetric inheritance of cellular organelles in normal animal eukaryotic cells and to critique the validity and imprecision of the evidence. Searches were performed in Embase, MEDLINE and Pubmed up to November 2015. Screening of titles, abstracts and full papers was performed by two independent reviewers. Data extraction and validity assessment were performed by one reviewer and checked by a second reviewer. Study quality was assessed using the SYRCLE risk of bias tool for animal studies and by developing validity tools for the experimental model, organelle markers and imprecision. A narrative data synthesis was performed. We identified 31 studies (34 publications) of the asymmetric inheritance of organelles after mitotic or meiotic division. Studies of the asymmetric inheritance of centrosomes (n = 9), endosomes (n = 6), P granules (n = 4), the midbody (n = 3), mitochondria (n = 3), proteasomes (n = 2), spectrosomes (n = 2), cilia (n = 2) and endoplasmic reticulum (n = 2) were identified. Asymmetry was defined and quantified by variable methods. Assessment of the statistical reliability of the results indicated that only two studies (7%) were judged to have low concern, the majority of studies (77%) were 'unclear' and five (16%) were judged to have 'high concerns'; the main reason was a low number of technical repeats (<10). Assessment of model validity indicated that the majority of studies (61%) were judged to be valid, ten studies (32%) were unclear and two studies (7%) were judged to have 'high concerns'; both described 'stem cells' without providing experimental evidence to confirm this (pluripotency and self-renewal). Assessment of marker validity indicated that no studies had low concern; most studies (96.5%) were unclear, indicating that there were insufficient details to judge whether the markers were appropriate. One study had high concern for marker validity due to the contradictory results of two markers for the same organelle. For most studies the validity and imprecision of results could not be confirmed. In particular, data were limited due to a lack of reporting of interassay variability, sample size calculations, controls and functional validation of organelle markers. An evaluation of 16 systematic reviews containing cell assays found that only 50% reported adherence to PRISMA or ARRIVE reporting guidelines and 38% reported a formal risk of bias assessment. 44% of the reviews did not consider how relevant or valid the models were to the research question, 75% did not consider how valid the markers were, and 69% did not consider the impact of the statistical reliability of the results. Future systematic reviews in basic or preclinical research should ensure the rigorous reporting of the statistical reliability of the results in addition to the validity of the methods. Increased awareness of the importance of reporting guidelines and validation tools is needed in the scientific community. PMID:28562636
Montazeri, Ali; Vahdaninia, Mariam; Mousavi, Sayed Javad; Omidvari, Speideh
2009-01-01
Background: The 12-item Short Form Health Survey (SF-12), a shorter alternative to the SF-36, is widely used in health outcomes surveys. The aim of this study was to validate the SF-12 in Iran. Methods: A random sample of the general population aged 15 years and over living in Tehran, Iran completed the SF-12. Reliability was estimated using internal consistency, and validity was assessed using known-groups comparison and convergent validity. In addition, the factor structure of the questionnaire was extracted by performing both exploratory factor analysis (EFA) and confirmatory factor analysis (CFA). Results: In all, 5587 individuals were studied (2721 male and 2866 female). The mean age and formal education of the respondents were 35.1 (SD = 15.4) and 10.2 (SD = 4.4) years, respectively. The results showed satisfactory internal consistency for both summary measures, namely the Physical Component Summary (PCS) and the Mental Component Summary (MCS); Cronbach's α for PCS-12 and MCS-12 was 0.73 and 0.72, respectively. Known-groups comparison showed that the SF-12 discriminated well between men and women and between those who differed in age and educational status (P < 0.001). In addition, correlations between the SF-12 scales and single items showed that the physical functioning, role physical, bodily pain and general health subscales correlated more highly with the PCS-12 score, while the vitality, social functioning, role emotional and mental health subscales correlated more highly with the MCS-12 score, lending support to its good convergent validity. Finally, the principal component analysis indicated a two-factor structure (physical and mental health) that jointly accounted for 57.8% of the variance. The confirmatory factor analysis also indicated a good fit to the data for the two-latent structure (physical and mental health). Conclusion: In general, the findings suggest that the SF-12 is a reliable and valid measure of health-related quality of life among the Iranian population. However, further studies are needed to establish stronger psychometric properties for this alternative form of the SF-36 Health Survey in Iran. PMID:19758427
Zhang, Jun; Li, Xiaohai; Mueller, Michael; Wang, Yueju; Zong, Chenggong; Deng, Ning; Vondriska, Thomas M.; Liem, David A.; Yang, Jeong-In; Korge, Paavo; Honda, Henry; Weiss, James N.; Apweiler, Rolf; Ping, Peipei
2009-01-01
Mitochondria play essential roles in cardiac pathophysiology and the murine model has been extensively used to investigate cardiovascular diseases. In the present study, we characterized murine cardiac mitochondria using an LC/MS/MS approach. We extracted and purified cardiac mitochondria; validated their functionality to ensure the final preparation contains necessary components to sustain their normal function; and subjected these validated organelles to LC/MS/MS-based protein identification. A total of 940 distinct proteins were identified from murine cardiac mitochondria, among which 480 proteins were not previously identified by major proteomic profiling studies. The 940 proteins consist of functional clusters known to support oxidative phosphorylation, metabolism and biogenesis. In addition, there are several other clusters, including proteolysis, protein folding, and reduction/oxidation signaling, which ostensibly represent previously under-appreciated tasks of cardiac mitochondria. Moreover, many identified proteins were found to occupy other subcellular locations, including cytoplasm, ER, and Golgi, in addition to their presence in the mitochondria. These results provide a comprehensive picture of the murine cardiac mitochondrial proteome and underscore tissue- and species-specification. Moreover, the use of functionally intact mitochondria ensures that the proteomic observations in this organelle are relevant to its normal biology and facilitates decoding the interplay between mitochondria and other organelles. PMID:18348319
NASA Technical Reports Server (NTRS)
Pliutau, Denis; Prasad, Narasimha S
2013-01-01
Studies were performed to carry out semi-empirical validation of a new measurement approach we propose for determining molecular mixing ratios. The approach is based on relative measurements in bands of O2 and other molecules and as such may be best described as cross band relative absorption (CoBRA). The current validation studies rely upon well verified and established theoretical and experimental databases, satellite data assimilations and modeling codes such as HITRAN, the line-by-line radiative transfer model (LBLRTM), and the modern-era retrospective analysis for research and applications (MERRA). The approach holds promise for atmospheric mixing ratio measurements of CO2 and a variety of other molecules currently under investigation for several future satellite lidar missions. One of the advantages of the method is a significant reduction of the temperature sensitivity uncertainties, which is illustrated with application to the ASCENDS mission for the measurement of CO2 mixing ratios (XCO2). Additional advantages of the method include the possibility to closely match cross-band weighting function combinations, which is harder to achieve using conventional differential absorption techniques, and the potential for additional corrections for water vapor and other interferences without using the data from numerical weather prediction (NWP) models.
Yurek, Leo A; Havens, Donna S; Hays, Spencer; Hughes, Linda C
2015-10-01
Decisional involvement is widely recognized as an essential component of a professional nursing practice environment. In recent years, researchers have added to the conceptualization of nurses' role in decision-making to differentiate between the content and context of nursing practice. Yet, instruments that clearly distinguish between these two dimensions of practice are lacking. The purpose of this study was to examine the factorial validity of the Decisional Involvement Scale (DIS) as a measure of both the content and context of nursing practice. This secondary analysis was conducted using data from a longitudinal action research project to improve the quality of nursing practice and patient care in six hospitals (N = 1,034) in medically underserved counties of Pennsylvania. A cross-sectional analysis of baseline data from the parent study was used to compare the factor structure of two models (one nested within the other) using confirmatory factor analysis. Although a comparison of the two models indicated that the addition of second-order factors for the content and context of nursing practice improved model fit, neither model provided optimal fit to the data. Additional model-generating research is needed to develop the DIS as a valid measure of decisional involvement for both the content and context of nursing practice. © 2015 Wiley Periodicals, Inc.
Psychobiological responses to critically evaluated multitasking.
Wetherell, Mark A; Craw, Olivia; Smith, Kenny; Smith, Michael A
2017-12-01
In order to understand psychobiological responses to stress it is necessary to observe how people react to controlled stressors. A range of stressors exist for this purpose; however, laboratory stressors that are representative of real life situations provide more ecologically valid opportunities for assessing stress responding. The current study assessed psychobiological responses to an ecologically valid laboratory stressor involving multitasking and critical evaluation. The stressor elicited significant increases in psychological and cardiovascular stress reactivity; however, no cortisol reactivity was observed. Other socially evaluative laboratory stressors that lead to cortisol reactivity typically require a participant to perform tasks that involve verbal responses, whilst standing in front of evaluative others. The current protocol contained critical evaluation of cognitive performance; however, this was delivered from behind a seated participant. The salience of social evaluation may therefore be related to the response format of the task and the method of evaluation. That is, the current protocol did not involve the additional vulnerability associated with in person, face-to-face contact, and verbal delivery. Critical evaluation of multitasking provides an ecologically valid technique for inducing laboratory stress and provides an alternative tool for assessing psychological and cardiovascular reactivity. Future studies could additionally use this paradigm to investigate those components of social evaluation necessary for eliciting a cortisol response.
Chen, Y-W; HajGhanbari, B; Road, J D; Coxson, H O; Camp, P G; Reid, W D
2018-06-08
Pain is prevalent in chronic obstructive pulmonary disease (COPD) and the Brief Pain Inventory (BPI) appears to be a feasible questionnaire to assess this symptom. However, the reliability and validity of the BPI have not been determined in individuals with COPD. This study aimed to determine the internal consistency, test-retest reliability and validity (construct, convergent, divergent and discriminant) of the BPI in individuals with COPD. In order to examine the test-retest reliability, individuals with COPD were recruited from pulmonary rehabilitation programmes to complete the BPI twice 1 week apart. In order to investigate validity, de-identified data were retrieved from two previous studies, including forced expiratory volume in 1 s, age, sex and data from four questionnaires: the BPI, short-form McGill Pain Questionnaire (SF-MPQ), 36-Item Short Form Survey (SF-36) and Community Health Activities Model Program for Seniors (CHAMPS) questionnaire. In total, 123 participants were included in the analyses (eligible data were retrieved from 86 participants and an additional 37 participants were recruited). The BPI demonstrated excellent internal consistency and test-retest reliability. It also showed convergent validity with the SF-MPQ and divergent validity with the SF-36. The factor analysis yielded two factors of the BPI, which demonstrated that the two domains of the BPI measure the intended constructs. The BPI can also discriminate pain levels among COPD patients with varied levels of quality of life (SF-36) and physical activity (CHAMPS). The BPI is a reliable and valid pain questionnaire that can be used to evaluate pain in COPD. This study formally established the reliability and validity of the BPI in individuals with COPD, which had not previously been determined in this patient group. The results of this study provide strong evidence that assessment results from this pain questionnaire are reliable and valid. © 2018 European Pain Federation - EFIC®.
The psychometric validation of a 1-week recall period for the OAB-q.
Coyne, Karin S; Gelhorn, Heather; Thompson, Christine; Kopp, Zoe S; Guan, Zhonghong
2011-12-01
As shorter recall periods are sometimes preferable to longer recall periods, the objective of this study was to evaluate the psychometric characteristics and measurement properties of the 1-week recall version of the Overactive Bladder Questionnaire (OAB-q). Secondary analyses were performed on data for three 12-week clinical trials of fesoterodine. Patients completed the Patient Perception of Bladder Condition (PPBC), the Patient Perception of Urgency Scale (PPUS), and 3-day bladder diaries in addition to the OAB-q at baseline, 4 and 12 weeks. Analyses were conducted to evaluate the reliability, concurrent and discriminant validity and responsiveness of the OAB-q 1-week recall version. The patients in the three studies (Study 1: N=516, Study 2: N=441; Study 3: N=882) had a mean age of 59.6, 59.4, and 59.9 years, respectively; and most of the patients were female (77.1%, 88.9%, and 82.9%) and White (76.6%, 90.0%, and 88.0%). Patients had been diagnosed with OAB for a mean of 5.2, 8.3, and 9.1 years, respectively. Cronbach's alpha values were greater than 0.85 across all samples and subscales. Correlations between the 1-week recall version of the OAB-q and the PPBC, PPUS, and most of the bladder diary variables were moderate to strong. Discriminant validity of the OAB-q was good, with significant differences in mean OAB-q scores across all response categories of the PPUS. The OAB-q was highly responsive to changes in patients' conditions as indicated by moderate to large effect sizes. The OAB-q 1-week recall version has a similar factor structure to the 4-week recall version with each subscale model demonstrating acceptable fit. The 1-week recall version of the OAB-q appears to be reliable, valid, and responsive and is psychometrically equivalent to the 4-week recall version. The validation of the 1-week recall version offers researchers and clinicians an additional option for using the OAB-q.
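The internal-consistency figure reported above (Cronbach's alpha greater than 0.85) can be illustrated with a minimal sketch. The function name `cronbach_alpha` and the demo responses are hypothetical and are not taken from the study; the sketch assumes complete data (no missing item scores) and uses sample variances.

```python
from statistics import variance


def cronbach_alpha(responses):
    """Cronbach's alpha for a list of respondents, each a list of k item scores."""
    k = len(responses[0])
    if k < 2:
        raise ValueError("need at least two items")
    # Sample variance of each item across respondents.
    item_vars = [variance([r[i] for r in responses]) for i in range(k)]
    # Sample variance of the respondents' total scores.
    total_var = variance([sum(r) for r in responses])
    return (k / (k - 1)) * (1 - sum(item_vars) / total_var)


# Hypothetical data: 4 respondents x 3 items (illustration only).
demo = [[1, 2, 1], [2, 3, 3], [4, 4, 5], [5, 5, 4]]
alpha = cronbach_alpha(demo)
```

Alpha approaches 1 when the items rise and fall together across respondents, which is why it is read as a measure of how consistently the items of a subscale track one another.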
Definition and Demonstration of a Methodology for Validating Aircraft Trajectory Predictors
NASA Technical Reports Server (NTRS)
Vivona, Robert A.; Paglione, Mike M.; Cate, Karen T.; Enea, Gabriele
2010-01-01
This paper presents a new methodology for validating an aircraft trajectory predictor, inspired by the lessons learned from a number of field trials, flight tests and simulation experiments for the development of trajectory-predictor-based automation. The methodology introduces new techniques and a new multi-staged approach to reduce the effort in identifying and resolving validation failures, avoiding the potentially large costs associated with failures during a single-stage, pass/fail approach. As a case study, the validation effort performed by the Federal Aviation Administration for its En Route Automation Modernization (ERAM) system is analyzed to illustrate the real-world applicability of this methodology. During this validation effort, ERAM initially failed to achieve six of its eight requirements associated with trajectory prediction and conflict probe. The ERAM validation issues have since been addressed, but to illustrate how the methodology could have benefited the FAA effort, additional techniques are presented that could have been used to resolve some of these issues. Using data from the ERAM validation effort, it is demonstrated that these new techniques could have identified trajectory prediction error sources that contributed to several of the unmet ERAM requirements.
Erdodi, Laszlo A; Sagar, Sanya; Seke, Kristian; Zuccato, Brandon G; Schwartz, Eben S; Roth, Robert M
2018-06-01
This study was designed to develop performance validity indicators embedded within the Delis-Kaplan Executive Function Systems (D-KEFS) version of the Stroop task. Archival data from a mixed clinical sample of 132 patients (50% male; M Age = 43.4; M Education = 14.1) clinically referred for neuropsychological assessment were analyzed. Criterion measures included the Warrington Recognition Memory Test-Words and 2 composites based on several independent validity indicators. An age-corrected scaled score ≤6 on any of the 4 trials reliably differentiated psychometrically defined credible and noncredible response sets with high specificity (.87-.94) and variable sensitivity (.34-.71). An inverted Stroop effect was less sensitive (.14-.29), but comparably specific (.85-.90) to invalid performance. Aggregating the newly developed D-KEFS Stroop validity indicators further improved classification accuracy. Failing the validity cutoffs was unrelated to self-reported depression or anxiety. However, it was associated with elevated somatic symptom report. In addition to processing speed and executive function, the D-KEFS version of the Stroop task can function as a measure of performance validity. A multivariate approach to performance validity assessment is generally superior to univariate models. (PsycINFO Database Record (c) 2018 APA, all rights reserved).
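As a rough illustration of how sensitivity and specificity are obtained for a cutoff such as "scaled score ≤6", the sketch below flags each score against a cutoff and compares the flags with a criterion classification. The function name `cutoff_accuracy` and all data are hypothetical; the sketch assumes at least one credible and one noncredible case, and it does not reproduce the study's criterion measures.

```python
def cutoff_accuracy(scores, noncredible, cutoff):
    """Return (sensitivity, specificity) when scores <= cutoff are flagged.

    scores      -- list of scaled scores
    noncredible -- list of bools from a criterion measure (True = noncredible)
    """
    tp = fp = tn = fn = 0
    for score, invalid in zip(scores, noncredible):
        flagged = score <= cutoff
        if flagged and invalid:
            tp += 1  # correctly flagged noncredible case
        elif flagged and not invalid:
            fp += 1  # credible case wrongly flagged
        elif invalid:
            fn += 1  # noncredible case missed
        else:
            tn += 1  # credible case correctly passed
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    return sensitivity, specificity


# Hypothetical scores and criterion labels (illustration only).
demo_scores = [4, 6, 9, 10, 5, 8, 11, 12]
demo_labels = [True, True, True, False, False, False, False, False]
sens, spec = cutoff_accuracy(demo_scores, demo_labels, cutoff=6)
```

Lowering the cutoff generally trades sensitivity for specificity, which is consistent with the abstract's pattern of high specificity alongside variable sensitivity.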
Quantitative validation of carbon-fiber laminate low velocity impact simulations
English, Shawn A.; Briggs, Timothy M.; Nelson, Stacy M.
2015-09-26
Simulations of low velocity impact with a flat cylindrical indenter upon a carbon fiber fabric reinforced polymer laminate are rigorously validated. Comparison of the impact energy absorption between the model and experiment is used as the validation metric. Additionally, non-destructive evaluation, including ultrasonic scans and three-dimensional computed tomography, provides qualitative validation of the models. The simulations include delamination, matrix cracks and fiber breaks. An orthotropic damage and failure constitutive model, capable of predicting progressive damage and failure, is developed in conjunction and described. An ensemble of simulations incorporating model parameter uncertainties is used to predict a response distribution which is then compared to experimental output using appropriate statistical methods. Lastly, the model form errors are exposed and corrected for use in an additional blind validation analysis. The result is a quantifiable confidence in material characterization and model physics when simulating low velocity impact in structures of interest.
Song, Zhen-Tao; Zhu, Ming-Jun
2017-03-01
Fermentation of herb Polygonum hydropiper L. (PHL) and cassava pulp (CP) for feed additive production with simultaneous flavonoid dissolution was investigated, and a two-stage response surface methodology (RSM) based on Plackett-Burman factorial design (PB design) was used to optimize the flavonoid dissolution and protein content. Using the screening function of PB design, four different significant factors for the two response variables were acquired: factors A (CP) and B (PHL) for the flavonoid dissolution versus factors G (inoculum size) and H (fermentation time) for protein content. Then, two RSMs were used sequentially to improve the values of the two response variables separately. The mutual corroboration of the experimental results in the present study confirmed the validity of the associated experimental design. The validation experiment showed a flavonoid dissolution rate of 94.00%, and a protein content of 18.20%, gaining an increase in 21.20% and 199.10% over the control, respectively. The present study confirms the feasibility of feed additive production by Saccharomyces cerevisiae with CP and PHL and simultaneous optimization of flavonoid dissolution and protein content using a two-stage RSM. © 2016 International Union of Biochemistry and Molecular Biology, Inc.
Methodological review of the quality of reach out and read: does it "work"?
Yeager Pelatti, Christina; Pentimonti, Jill M; Justice, Laura M
2014-04-01
A considerable percentage of American children and adults fail to learn adequate literacy skills and read below a third grade level. Shared book reading is perhaps the single most important activity to prepare young children for success in reading. The primary objective of this manuscript was to critically review the methodological quality of Reach Out and Read (ROR), a clinically based literacy program/intervention that teaches parents strategies to incorporate while sharing books with children as a method of preventing reading difficulties and academic struggles. A PubMed search was conducted. Articles that met three criteria were considered. First, the study must be clinically based and include parent contact with a pediatrician. Second, parental counseling ("anticipatory guidance") about the importance of parent-child book reading must be included. Third, only experimental or quasi-experimental studies were included; no additional criteria were used. Published articles from any year and peer-reviewed journal were considered. Study quality was determined using a modified version of the Downs and Black (1998) checklist assessing four categories: (1) Reporting, (2) External Validity, (3) Internal Validity-Bias, and (4) Internal Validity-Confounding. We were also interested in whether quality differed based on study design, children's age, sample size, and study outcome. Eleven studies met the inclusion criteria. The overall quality of evidence was variable across all studies; Reporting and External Validity categories were relatively strong while methodological concerns were found in the area of internal validity. Quality scores differed on the four study characteristics. Implications related to clinical practice and future studies are discussed.
Development and Validation of Instruments to Measure Learning of Expert-Like Thinking
NASA Astrophysics Data System (ADS)
Adams, Wendy K.; Wieman, Carl E.
2011-06-01
This paper describes the process for creating and validating an assessment test that measures the effectiveness of instruction by probing how well that instruction causes students in a class to think like experts about specific areas of science. The design principles and process are laid out and it is shown how these align with professional standards that have been established for educational and psychological testing and the elements of assessment called for in a recent National Research Council study on assessment. The importance of student interviews for creating and validating the test is emphasized, and the appropriate interview procedures are presented. The relevance and use of standard psychometric statistical tests are discussed. Additionally, techniques for effective test administration are presented.
Time Investment and Time Management: An Analysis of Time Students Spend Working at Home for School
ERIC Educational Resources Information Center
Wagner, Petra; Schober, Barbara; Spiel, Christiane
2008-01-01
This paper deals with the time students spend working at home for school. In Study 1, we investigated amount and regulation of time. Study 2 serves to validate the results of Study 1 and, in addition, investigates the duration of the time units students used and their relation to scholastic success. In Study 1, the participants were 332 students…
Simonds, Elise C; Handel, Richard W; Archer, Robert P
2008-03-01
This study evaluated the incremental validity of scores from the Minnesota Multiphasic Personality Inventory-2 (MMPI-2) and the Symptom Checklist-90-Revised (SCL-90-R) in a sample of mental health inpatients originally published by Archer, Griffin, and Aiduk (1995). The incremental validity of scores from the SCL-90-R primary symptom dimensions and MMPI-2 Clinical, Content, and Restructured Clinical scales was assessed in a sample of 544 mental health inpatients using conceptually related items from the Brief Psychiatric Rating Scale (BPRS) as criteria. A series of hierarchical multiple regressions indicated that scores from the SCL-90-R primary symptom dimensions exhibited limited incremental validity (Mdn ΔR² = .01, range = 0-.01), whereas scores from MMPI-2 scales contributed additional information in the prediction of ratings on all but one BPRS item (Mdn ΔR² = .08, range = .04-.12).
Eco-tourism Education Effectiveness Indicator System Research for Macau SAR in Practice
NASA Astrophysics Data System (ADS)
Ge, Mei
2018-01-01
Education is one of the important focuses within the field of Ecotourism, and evaluating the effectiveness of Ecotourism education is a key topic within it. The results of this study can be useful for developing Ecotourism education and for improving it in the future. The Delphi method was used to establish a valid indicator system for evaluating Ecotourism education, followed by the analytic hierarchy process (AHP), for which 11 paired-comparison matrices were constructed. Weights of the evaluation indicators were then determined using Matlab 7.1. Data were obtained through sampling surveys, and the fuzzy comprehensive evaluation method was used to calculate the validity of Ecotourism education, with validity classified into different levels. As a result, an evaluation model of education effectiveness in Ecotourism was constructed. The Macau Special Administrative Region (Macau SAR) was chosen for the empirical stage. Data on the ecological knowledge, ecological cultural level, ecological consciousness, ecological ethics and ecological behaviors of tourists who had entered and exited Macau SAR were collected and analyzed in SPSS. Differences among these indicators and their impact were studied to determine the effect of Ecotourism education in this region. In addition, the education effectiveness results were compared among different population subgroups, and observations are given accordingly.
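The abstract does not state how indicator weights were derived from the paired-comparison matrices. A common AHP approach, assumed here, is to take the normalized principal eigenvector of each matrix. The sketch below approximates it by power iteration in plain Python rather than Matlab; the function name `ahp_weights` and the 3x3 matrix are hypothetical.

```python
def ahp_weights(matrix, iterations=100):
    """Approximate AHP priority weights by power iteration.

    matrix -- square reciprocal paired-comparison matrix, where
              matrix[i][j] is how strongly indicator i is preferred over j.
    """
    n = len(matrix)
    weights = [1.0 / n] * n
    for _ in range(iterations):
        # Multiply the matrix by the current weight vector.
        nxt = [sum(matrix[i][j] * weights[j] for j in range(n)) for i in range(n)]
        total = sum(nxt)
        # Renormalize so the weights sum to 1.
        weights = [v / total for v in nxt]
    return weights


# Hypothetical, perfectly consistent comparisons among three indicators.
demo_matrix = [
    [1.0, 2.0, 4.0],
    [0.5, 1.0, 2.0],
    [0.25, 0.5, 1.0],
]
demo_weights = ahp_weights(demo_matrix)
```

In practice a consistency check on each matrix usually accompanies the weights; that step is omitted from this sketch.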
Validation of the Arabic Version of the Internet Gaming Disorder-20 Test.
Hawi, Nazir S; Samaha, Maya
2017-04-01
In recent years, researchers have been trying to shed light on gaming addiction and its association with different psychiatric disorders and psychological determinants. The latest edition of the American Psychiatric Association's Diagnostic and Statistical Manual of Mental Disorders, Fifth Edition (DSM-5) included Internet Gaming Disorder (IGD) in its Section 3 as a condition for further empirical study and proposed nine criteria for the diagnosis of IGD. The 20-item Internet Gaming Disorder (IGD-20) Test was developed as a valid and reliable tool to assess gaming addiction based on the nine criteria set by the DSM-5. The aim of this study is to validate an Arabic version of the IGD-20 Test. The Arabic version of IGD-20 will not only help in identifying Arabic-speaking pathological gamers but also stimulate cross-cultural studies that could contribute to an area in need of more research for insight and treatment. After a process of translation and back-translation and with the participation of a sizable sample of Arabic-speaking adolescents, the present study conducted a psychometric validation of the IGD-20 Test. Our confirmatory factor analysis showed the validity of the Arabic version of the IGD-20 Test. The one-factor model of the Arabic IGD-20 Test had very good psychometric properties, and it fitted the sample data extremely well. In addition, correlation analysis between the IGD-20 Test and the daily duration of gameplay on weekdays and weekends revealed significant positive relationships that warranted a criterion-related validation. Thus, the Arabic version of the IGD-20 Test is a valid and reliable measure of IGD among Arabic-speaking populations.
Valcarcel, Alessandra M; Linn, Kristin A; Vandekar, Simon N; Satterthwaite, Theodore D; Muschelli, John; Calabresi, Peter A; Pham, Dzung L; Martin, Melissa Lynne; Shinohara, Russell T
2018-03-08
Magnetic resonance imaging (MRI) is crucial for in vivo detection and characterization of white matter lesions (WMLs) in multiple sclerosis. While WMLs have been studied for over two decades using MRI, automated segmentation remains challenging. Although the majority of statistical techniques for the automated segmentation of WMLs are based on single imaging modalities, recent advances have used multimodal techniques for identifying WMLs. Complementary modalities emphasize different tissue properties, which help identify interrelated features of lesions. Method for Inter-Modal Segmentation Analysis (MIMoSA), a fully automatic lesion segmentation algorithm that utilizes novel covariance features from intermodal coupling regression in addition to mean structure to model the probability that a lesion is contained in each voxel, is proposed. MIMoSA was validated by comparison with both expert manual and other automated segmentation methods in two datasets. The first included 98 subjects imaged at Johns Hopkins Hospital in which bootstrap cross-validation was used to compare the performance of MIMoSA against OASIS and LesionTOADS, two popular automatic segmentation approaches. For a secondary validation, publicly available data from a segmentation challenge were used for performance benchmarking. In the Johns Hopkins study, MIMoSA yielded an average Sørensen-Dice coefficient (DSC) of .57 and a partial AUC of .68 calculated with false positive rates up to 1%. This was superior to performance using OASIS and LesionTOADS. The proposed method also performed competitively in the segmentation challenge dataset. MIMoSA resulted in statistically significant improvements in lesion segmentation performance compared with LesionTOADS and OASIS, and performed competitively in an additional validation study. Copyright © 2018 by the American Society of Neuroimaging.
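The Sørensen-Dice coefficient used above to compare automated and manual segmentations can be sketched for two binary masks as twice the overlap divided by the combined mask size. The function name `dice_coefficient` and the flattened toy masks are hypothetical. Returning 1.0 for two empty masks is a convention assumed here, not something the abstract specifies.

```python
def dice_coefficient(mask_a, mask_b):
    """Sørensen-Dice coefficient of two equal-length binary masks (flattened voxels)."""
    if len(mask_a) != len(mask_b):
        raise ValueError("masks must have the same number of voxels")
    # Voxels labelled as lesion in both masks.
    overlap = sum(1 for a, b in zip(mask_a, mask_b) if a and b)
    # Total lesion voxels across the two masks.
    size = sum(1 for a in mask_a if a) + sum(1 for b in mask_b if b)
    if size == 0:
        return 1.0  # assumed convention: two empty masks agree perfectly
    return 2 * overlap / size


# Hypothetical flattened masks: 1 = lesion voxel, 0 = background.
manual_mask = [0, 1, 1, 1, 0, 0]
automatic_mask = [0, 1, 1, 0, 1, 0]
dsc = dice_coefficient(manual_mask, automatic_mask)
```

Because lesions typically occupy a small fraction of voxels, the coefficient ignores the large background region and is sensitive to small boundary disagreements.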
Cloke, Jonathan; Evans, Katharine; Crabtree, David; Hughes, Annette; Simpson, Helen; Holopainen, Jani; Wickstrand, Nina; Kauppinen, Mikko; Leon-Velarde, Carlos; Larson, Nathan; Dave, Keron
2014-01-01
The Thermo Scientific SureTect Listeria species Assay is a new real-time PCR assay for the detection of all species of Listeria in food and environmental samples. This validation study was conducted using the AOAC Research Institute (RI) Performance Tested Methods program to validate the SureTect Listeria species Assay in comparison to the reference method detailed in International Organization for Standardization 11290-1:1996 including amendment 1:2004 in a variety of foods plus plastic and stainless steel. The food matrixes validated were smoked salmon, processed cheese, fresh bagged spinach, cantaloupe, cooked prawns, cooked sliced turkey meat, cooked sliced ham, salami, pork frankfurters, and raw ground beef. All matrixes were tested by Thermo Fisher Scientific, Microbiology Division, Basingstoke, UK. In addition, three matrixes (pork frankfurters, fresh bagged spinach, and stainless steel surface samples) were analyzed independently as part of the AOAC-RI-controlled independent laboratory study by the University of Guelph, Canada. Using probability of detection statistical analysis, a significant difference in favour of the SureTect assay was demonstrated between the SureTect and reference method for high level spiked samples of pork frankfurters, smoked salmon, cooked prawns, stainless steel, and low-spiked samples of salami. For all other matrixes, no significant difference was seen between the two methods during the study. Inclusivity testing was conducted with 68 different isolates of Listeria species, all of which were detected by the SureTect Listeria species Assay. None of the 33 exclusivity isolates were detected by the SureTect Listeria species Assay. Ruggedness testing was conducted to evaluate the performance of the assay with specific method deviations outside of the recommended parameters open to variation, which demonstrated that the assay gave reliable performance. Accelerated stability testing was additionally conducted, validating the assay shelf life.
Evaluation of the Thermo Scientific™ SureTect™ Listeria species Assay.
Cloke, Jonathan; Evans, Katharine; Crabtree, David; Hughes, Annette; Simpson, Helen; Holopainen, Jani; Wickstrand, Nina; Kauppinen, Mikko
2014-03-01
The Thermo Scientific™ SureTect™ Listeria species Assay is a new real-time PCR assay for the detection of all species of Listeria in food and environmental samples. This validation study was conducted using the AOAC Research Institute (RI) Performance Tested Methods℠ program to validate the SureTect Listeria species Assay in comparison to the reference method detailed in International Organization for Standardization 11290-1:1996 including amendment 1:2004 in a variety of foods plus plastic and stainless steel. The food matrixes validated were smoked salmon, processed cheese, fresh bagged spinach, cantaloupe, cooked prawns, cooked sliced turkey meat, cooked sliced ham, salami, pork frankfurters, and raw ground beef. All matrixes were tested by Thermo Fisher Scientific, Microbiology Division, Basingstoke, UK. In addition, three matrixes (pork frankfurters, fresh bagged spinach, and stainless steel surface samples) were analyzed independently as part of the AOAC-RI-controlled independent laboratory study by the University of Guelph, Canada. Using probability of detection statistical analysis, a significant difference in favour of the SureTect assay was demonstrated between the SureTect and reference method for high level spiked samples of pork frankfurters, smoked salmon, cooked prawns, stainless steel, and low-spiked samples of salami. For all other matrixes, no significant difference was seen between the two methods during the study. Inclusivity testing was conducted with 68 different isolates of Listeria species, all of which were detected by the SureTect Listeria species Assay. None of the 33 exclusivity isolates were detected by the SureTect Listeria species Assay. Ruggedness testing was conducted to evaluate the performance of the assay with specific method deviations outside of the recommended parameters open to variation, which demonstrated that the assay gave reliable performance. Accelerated stability testing was additionally conducted, validating the assay shelf life.
Msaouel, Pavlos; Gralla, Richard J; Jones, Randy A; Hollen, Patricia J
2017-09-01
Evidence-based quality of life (QL) questionnaires require the identification of issues of importance to patients. The primary aim of this study was to inform providers on patient-expressed issues while enhancing the content validity of instruments assessing QL and patient-reported outcomes (PROs) in prostate cancer. The study provided additional psychometric properties for the new PRO and QL instrument, the Prostate Cancer Symptom Scale (PCSS). An anonymous web-based survey of 2128 patients with prostate cancer was conducted with patients rating 18 QL items on a five-point scale. Most respondents (74%) were aged 55-74 years, had early stage disease at diagnosis (81%) and were diagnosed within 2 years of the survey (81%). The top five-rated issues were: overall QL, ability to perform normal activities, maintaining independence, ability to sleep and not being a burden. These items were ranked as either 'very important' or 'important' by at least 88% of patients. None of the most highly ranked issues were symptoms. Instead, the highest ranked items were global issues reflecting the impact of symptoms on patients. In addition to the enhanced content validity findings, good reliability results and initial support for construct validity are reported for the PCSS. This is the largest survey providing patient-expressed background for content validity for QL and PRO measures. The findings of this study should aid development of newer practical questionnaires, such as the PCSS, which can be adapted to electronic platforms enhancing rapid and accurate PRO and QL evaluation. Published by the BMJ Publishing Group Limited.
Test Assembly Implications for Providing Reliable and Valid Subscores
ERIC Educational Resources Information Center
Lee, Minji K.; Sweeney, Kevin; Melican, Gerald J.
2017-01-01
This study investigates the relationships among factor correlations, inter-item correlations, and the reliability estimates of subscores, providing a guideline with respect to psychometric properties of useful subscores. In addition, it compares subscore estimation methods with respect to reliability and distinctness. The subscore estimation…
Energy Expenditure of Selected Household Activities during Pregnancy
ERIC Educational Resources Information Center
Chasan-Taber, Lisa; Freedson, Patty S.; Roberts, Dawn E.; Schmidt, Michael D.; Fragala, Maren S.
2007-01-01
Accurately measuring pregnancy physical activity is critical to assess the percentage of pregnant women meeting the American College of Obstetricians and Gynecologists (ACOG) guidelines. In addition, valid assessment of pregnancy physical activity is important for epidemiologic studies assessing the relationship between physical activity and…
Brunault, Paul; Ballon, Nicolas; Gaillard, Philippe; Réveillère, Christian; Courtois, Robert
2014-05-01
The concept of food addiction has recently been proposed by applying the Diagnostic and Statistical Manual of Mental Disorders, Fourth Edition, Text Revision, criteria for substance dependence to eating behaviour. Food addiction has received increased attention given that it may play a role in binge eating, eating disorders, and the recent increase in obesity prevalence. Currently, there is no psychometrically sound tool for assessing food addiction in French. Our study aimed to test the psychometric properties of a French version of the Yale Food Addiction Scale (YFAS) by establishing its factor structure and construct validity in a nonclinical population. A total of 553 participants were assessed for food addiction (French version of the YFAS) and binge eating behaviour (Bulimic Investigatory Test Edinburgh and Binge Eating Scale). We tested the scale's factor structure (factor analysis for dichotomous data based on tetrachoric correlation coefficients), internal consistency, and construct validity with measures of binge eating. Our results supported a 1-factor structure, which accounted for 54.1% of the variance. This tool had adequate reliability and high construct validity with measures of binge eating in this population, both in its diagnosis and symptom count version. A 2-factor structure explained an additional 9.1% of the variance, and could differentiate between patients with high, compared with low, levels of insight regarding addiction symptoms. In our study, we validated a psychometrically sound French version of the YFAS, both in its symptom count and diagnostic version. Future studies should validate this tool in clinical samples.
The 2014 Sandia Verification and Validation Challenge: Problem statement
Hu, Kenneth; Orient, George
2016-01-18
This paper presents a case study in utilizing information from experiments, models, and verification and validation (V&V) to support a decision. It consists of a simple system with data and models provided, plus a safety requirement to assess. The goal is to pose a problem that is flexible enough to allow challengers to demonstrate a variety of approaches, but constrained enough to focus attention on a theme. This was accomplished by providing a good deal of background information in addition to the data, models, and code, but directing the participants' activities with specific deliverables. In this challenge, the theme is how to gather and present evidence about the quality of model predictions, in order to support a decision. This case study formed the basis of the 2014 Sandia V&V Challenge Workshop and this resulting special edition of the ASME Journal of Verification, Validation, and Uncertainty Quantification.
Kim, Youngmee; Evangelista, Lorraine S; Phillips, Linda R; Pavlish, Carol; Kopple, Joel D
2010-01-01
Reported treatment adherence rates of patients with end stage renal disease (ESRD) have been extremely varied due to lack of reliable and valid measurement tools. This study was conducted to develop and test an instrument to measure treatment adherence to hemodialysis (HD) attendance, medications, fluid restrictions, and diet prescription among patients with ESRD. This article describes the methodological approach used to develop and test the psychometric properties (such as reliability and validity) of the 46-item ESRD-Adherence Questionnaire (ESRD-AQ) in a cohort of patients receiving maintenance HD at dialysis centers in Los Angeles County. The ESRD-AQ is the first self-report instrument to address all components of adherence behaviors of patients with ESRD. The findings support that the instrument is reliable and valid and is easy to administer. Future studies are needed in a larger sample to determine whether additional modifications are needed.
Dynamic modelling and experimental validation of three wheeled tilting vehicles
NASA Astrophysics Data System (ADS)
Amati, Nicola; Festini, Andrea; Pelizza, Luigi; Tonoli, Andrea
2011-06-01
The present paper describes the study of the stability in the straight running of a three-wheeled tilting vehicle for urban and sub-urban mobility. The analysis was carried out by developing a multibody model in the Matlab/Simulink SimMechanics environment. An Adams-Motorcycle model and an equivalent analytical model were developed for the cross-validation and for highlighting the similarities with the lateral dynamics of motorcycles. Field tests were carried out to validate the model and identify some critical parameters, such as the damping on the steering system. The stability analysis demonstrates that the lateral dynamic motions are characterised by vibration modes that are similar to those of a motorcycle. Additionally, it shows that the wobble mode is significantly affected by the castor trail, whereas it is only slightly affected by the dynamics of the front suspension. For the present case study, the frame compliance also has no influence on the weave and wobble.
McGeorge, Christi R; Carlson, Thomas S; Toomey, Russell B
2015-01-01
This study established the validity and factor structure of the Faculty Version of the Affirmative Training Inventory (ATI-F), which assesses faculty members' perceptions of the level of lesbian, gay, and bisexual (LGB) affirmative training that occurs in clinical programs. Additionally, this study examined the latent associations among the subscales of the ATI-F and three convergent validity items utilizing a sample of 117 faculty members from accredited family therapy programs. The findings provide empirical support for the relationship between including classroom content on LGB affirmative therapy and faculty members' beliefs about LGB individuals and relationships. Specifically, faculty members who report more positive beliefs about LGB clients appear to be more likely to include LGB affirmative therapy content in the courses they teach. © 2013 American Association for Marriage and Family Therapy.
Disc displacement without reduction: a retrospective study of a clinical diagnostic sign.
Giraudeau, Anne; Jeany, Marion; Ehrmann, Elodie; Déjou, Jacques; Ouni, Imed; Orthlieb, Jean-Daniel
2017-03-01
The purpose of this retrospective study is to evaluate a clinical diagnostic sign for disc displacement without reduction (DDWR), the absence of additional condylar translation during opening compared with protrusion. Thirty-eight electronic axiographic and magnetic resonance imaging (MRI) examinations of the TMJ were analyzed in order to compare the opening/protrusion ratio of condylar translation between non-painful DDWR and non-DDWR. According to the Mann-Whitney U test, the opening/protrusion ratio in non-painful DDWR differs significantly from non-DDWR (p < 0.0001). Among non-painful DDWR, there is no additional condylar translation during opening in comparison with protrusion, and this is probably also the case for DDWR without limited opening, which is a subtype that has not been validated by the Diagnostic Criteria for Temporomandibular Disorders (DC/TMD). Comparative condylar palpation can analyze this sign, and therefore, further comparative investigations between MRI and clinical examination are needed to validate the corresponding clinical test.
Development, linguistic and clinimetric validation of the WOMAC VA3.01 Bangla for Bangladesh Index.
Rabbani, M G; Haq, S A; Bellamy, N; Islam, M N; Choudhury, M R; Naheed, A; Ahmed, S; Shahin, A
2015-06-01
The aim of this study was to develop and to validate a Bengali version of the Western Ontario and McMaster Osteoarthritis (WOMAC) index in Bangladesh. The WOMAC was translated into the local language of Bangladesh (Bengali) and adapted in the local sociocultural context, following the standard guidelines by Beaton et al. Content validity of the preliminary Bengali version was assessed by using the index of content validity (ICV) and floor and ceiling effects. Patients were assessed at the Department of Rheumatology of Bangabandhu Sheikh Mujib Medical University and were diagnosed to have knee OA by American College of Rheumatology criteria and recruited according to the requirements of the validation study. Convergent and divergent validity were measured by comparing with Health Assessment Questionnaire (HAQ) and the Short Form-36 (SF-36), and internal consistency was assessed using Cronbach's alpha coefficient. The questionnaire was readministered to 40 patients within a week for assessing reliability by using intra-class correlation coefficient (ICC) and Spearman's rank correlation coefficient. In addition, factor analysis of Bengali WOMAC questionnaire was performed to examine the number of factors influencing a common set of items. A Bengali version was developed with changes in three items to suit local practices. The ICV of the content validity was 1 for all items. The Bengali WOMAC had similar construct validity when compared to the HAQ (ρ 0.74, n = 70) and SF-36 bodily pain and physical functioning. It had dissimilar construct validity to SF-36 mental health domain except WOMAC pain. Factor analysis revealed five factors with eigenvalues of more than 1.0. Cronbach's alpha and ICC exceeded 0.7 in all domains. In the test-retest reliability testing, Spearman's ρ for all items exceeded 0.4 (n = 40). 
This study has demonstrated that the Bengali version of WOMAC is a valid tool for assessing quality of life of patients with knee osteoarthritis in Bangladesh and is reliable.
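As an illustration of the internal-consistency statistic reported above, the following is a minimal sketch of Cronbach's alpha in plain Python. The function name `cronbach_alpha` and the response matrix are hypothetical and are not data from the study; the sketch assumes complete responses (no missing items) and uses sample variances.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondents, each a list of k item scores."""
    k = len(scores[0])
    if k < 2 or len(scores) < 2:
        raise ValueError("need at least 2 items and 2 respondents")
    # Sample variance of each item across respondents.
    item_vars = [variance([row[i] for row in scores]) for i in range(k)]
    # Sample variance of each respondent's total score.
    total_var = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


# Hypothetical responses (4 respondents x 3 items), for illustration only.
example = [[1, 2, 3], [2, 3, 4], [3, 4, 5], [4, 5, 6]]
alpha = cronbach_alpha(example)
```

In this toy matrix the items move in lockstep, so alpha reaches its upper bound of 1; real questionnaire data would typically give a lower value, and the abstract's threshold of 0.7 is the usual rule of thumb for acceptable consistency.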
Aliloo, Hassan; Pryce, Jennie E; González-Recio, Oscar; Cocks, Benjamin G; Hayes, Ben J
2015-07-22
It has been suggested that traits with low heritability, such as fertility, may have proportionately more genetic variation arising from non-additive effects than traits with higher heritability, such as milk yield. Here, we performed a large genome scan with 408,255 single nucleotide polymorphism (SNP) markers to identify chromosomal regions associated with additive, dominance and epistatic (pairwise additive × additive) variability in milk yield and a measure of fertility, calving interval, using records from a population of 7,055 Holstein cows. The results were subsequently validated in an independent set of 3,795 Jerseys. We identified genomic regions with validated additive effects on milk yield on Bos taurus autosomes (BTA) 5, 14 and 20, whereas SNPs with suggestive additive effects on fertility were observed on BTA 5, 9, 11, 18, 22, 27, 29 and the X chromosome. We also confirmed genome regions with suggestive dominance effects for milk yield (BTA 2, 3, 5, 26 and 27) and for fertility (BTA 1, 2, 3, 7, 23, 25 and 28). A number of significant epistatic effects for milk yield on BTA 14 were found across breeds. However on close inspection, these were likely to be associated with the mutation in the diacylglycerol O-acyltransferase 1 (DGAT1) gene, given that the associations were no longer significant when the additive effect of the DGAT1 mutation was included in the epistatic model. In general, we observed a low statistical power (high false discovery rates and small number of significant SNPs) for non-additive genetic effects compared with additive effects for both traits which could be an artefact of higher dependence on linkage disequilibrium between markers and causative mutations or smaller size of non-additive effects relative to additive effects. The results of our study suggest that individual non-additive effects make a small contribution to the genetic variation of milk yield and fertility. 
Although we found no individual mutation with large dominance effect for both traits under investigation, a contribution to genetic variance is still possible from a large number of small dominance effects, so methods that simultaneously incorporate genotypes across all loci are suggested to test the variance explained by dominance gene actions.
Assessing Predictive Validity of Pressure Ulcer Risk Scales- A Systematic Review and Meta-Analysis
PARK, Seong-Hi; LEE, Hea Shoon
2016-01-01
Background: The purpose of this study was to present a scientific reason for pressure ulcer risk scales: Cubbin & Jackson modified Braden, Norton, and Waterlow, as a nursing diagnosis tool by utilizing predictive validity of pressure sores. Methods: Articles published between 1966 and 2013 from periodicals indexed in the Ovid Medline, Embase, CINAHL, KoreaMed, NDSL, and other databases were selected using the key word “pressure ulcer”. QUADAS-II was applied for assessment for internal validity of the diagnostic studies. Selected studies were analyzed using meta-analysis with MetaDisc 1.4. Results: Seventeen diagnostic studies with high methodological quality, involving 5,185 patients, were included. In the results of the meta-analysis, sROC AUC of the Braden, Norton, and Waterlow scales was over 0.7, showing moderate predictive validity, but they have limited interpretation due to significant differences between studies. In addition, the Waterlow scale is insufficient as a screening tool owing to low sensitivity compared with other scales. Conclusion: The contemporary pressure ulcer risk scale is not suitable for uniform practice on patients under standardized criteria. Therefore, in order to provide more effective nursing care for bedsores, a new or modified pressure ulcer risk scale should be developed based on the strengths and weaknesses of existing tools. PMID:27114977
Niedhammer, Isabelle; Milner, Allison; LaMontagne, Anthony D; Chastang, Jean-François
2018-03-08
The objectives of the study were to construct a job-exposure matrix (JEM) for psychosocial work factors of the job strain model, to evaluate its validity, and to compare the results over time. The study was based on national representative data of the French working population with samples of 46,962 employees (2010 SUMER survey) and 24,486 employees (2003 SUMER survey). Psychosocial work factors included the job strain model factors (Job Content Questionnaire): psychological demands, decision latitude, social support, job strain and iso-strain. Job title was defined by three variables: occupation and economic activity coded using standard classifications, and company size. A JEM was constructed using a segmentation method (Classification and Regression Tree-CART) and cross-validation. The best quality JEM was found using occupation and company size for social support. For decision latitude and psychological demands, there was not much difference using occupation and company size with or without economic activity. The validity of the JEM estimates was higher for decision latitude, job strain and iso-strain, and lower for social support and psychological demands. Differential changes over time were observed for psychosocial work factors according to occupation, economic activity and company size. This study demonstrated that company size in addition to occupation may improve the validity of JEMs for psychosocial work factors. These matrices may be time-dependent and may need to be updated over time. More research is needed to assess the validity of JEMs given that these matrices may be able to provide exposure assessments to study a range of health outcomes.
A systematic review of the quality of homeopathic clinical trials
Jonas, Wayne B; Anderson, Rachel L; Crawford, Cindy C; Lyons, John S
2001-01-01
Background: While a number of reviews of homeopathic clinical trials have been done, all have used methods dependent on allopathic diagnostic classifications foreign to homeopathic practice. In addition, no review has used established and validated quality criteria allowing direct comparison of the allopathic and homeopathic literature. Methods: In a systematic review, we compared the quality of clinical-trial research in homeopathy to a sample of research on conventional therapies using a validated and system-neutral approach. All clinical trials on homeopathic treatments with parallel treatment groups published between 1945 and 1995 in English were selected. All were evaluated with an established set of 33 validity criteria previously validated on a broad range of health interventions across differing medical systems. Criteria covered statistical conclusion, internal, construct and external validity. Reliability of criteria application is greater than 0.95. Results: 59 studies met the inclusion criteria. Of these, 79% were from peer-reviewed journals, 29% used a placebo control, 51% used random assignment, and 86% failed to consider potentially confounding variables. The main validity problems were in measurement where 96% did not report the proportion of subjects screened, and 64% did not report attrition rate. 17% of subjects dropped out in studies where this was reported. There was practically no replication of or overlap in the conditions studied and most studies were relatively small and done at a single site. Compared to research on conventional therapies the overall quality of studies in homeopathy was worse and only slightly improved in more recent years. Conclusions: Clinical homeopathic research is clearly in its infancy with most studies using poor sampling and measurement techniques, few subjects, single sites and no replication. Many of these problems are correctable even within a "holistic" paradigm given sufficient research expertise, support and methods.
PMID:11801202
Leveraging Prior Calculus Study with Embedded Review
ERIC Educational Resources Information Center
Nikolov, Margaret C.; Withers, Wm. Douglas
2016-01-01
We propose a new course structure to address the needs of college students with previous calculus study but no course validations as an alternative to repeating the first year of calculus. Students are introduced directly to topics from Calculus III unpreceded by a formal review of topics from Calculus I or II, but with additional syllabus time…
Development of a Scale to Measure Academic Capital in High-Risk College Students
ERIC Educational Resources Information Center
Winkler, Christa; Sriram, Rishi
2015-01-01
This study presents a psychometric instrument that measures academic capital in college students. Academic capital is a set of social processes that aid students in acquiring the knowledge and support necessary to access and navigate higher education. This study establishes the validity and reliability of the Academic Capital Scale. In addition to…
ERIC Educational Resources Information Center
Jackson-Kersey, Rachel; Spray, Christopher
2013-01-01
The aim of this study was to assess the reliability and validity of the Amotivation Inventory in Physical Education (AI-PE). In addition, the study sought to identify the relationships between students' amotivation, physical self-concept, and teacher ratings of National Curriculum attainment levels in PE. Students ("N" = 510) from a…
ERIC Educational Resources Information Center
Markle, Ross; Olivera-Aguilar, Margarita; Jackson, Teresa; Noeth, Richard; Robbins, Steven
2013-01-01
The "SuccessNavigator"™ assessment is an online, 30 minute self-assessment of psychosocial and study skills designed for students entering postsecondary education. In addition to providing feedback in areas such as classroom and study behaviors, commitment to educational goals, management of academic stress, and connection to social…
14 CFR 60.13 - FSTD objective data requirements.
Code of Federal Regulations, 2014 CFR
2014-01-01
..., the data made available to the NSPM (the validation data package) must include the aircraft...) The validation data package may contain flight test data from a source in addition to or independent..., as described in the applicable QPS. (c) The validation data package may also contain predicted data...
14 CFR 60.13 - FSTD objective data requirements.
Code of Federal Regulations, 2011 CFR
2011-01-01
..., the data made available to the NSPM (the validation data package) must include the aircraft...) The validation data package may contain flight test data from a source in addition to or independent..., as described in the applicable QPS. (c) The validation data package may also contain predicted data...
14 CFR 60.13 - FSTD objective data requirements.
Code of Federal Regulations, 2010 CFR
2010-01-01
..., the data made available to the NSPM (the validation data package) must include the aircraft...) The validation data package may contain flight test data from a source in addition to or independent..., as described in the applicable QPS. (c) The validation data package may also contain predicted data...
14 CFR 60.13 - FSTD objective data requirements.
Code of Federal Regulations, 2012 CFR
2012-01-01
..., the data made available to the NSPM (the validation data package) must include the aircraft...) The validation data package may contain flight test data from a source in addition to or independent..., as described in the applicable QPS. (c) The validation data package may also contain predicted data...
14 CFR 60.13 - FSTD objective data requirements.
Code of Federal Regulations, 2013 CFR
2013-01-01
..., the data made available to the NSPM (the validation data package) must include the aircraft...) The validation data package may contain flight test data from a source in addition to or independent..., as described in the applicable QPS. (c) The validation data package may also contain predicted data...
The Play Experience Scale: development and validation of a measure of play.
Pavlas, Davin; Jentsch, Florian; Salas, Eduardo; Fiore, Stephen M; Sims, Valerie
2012-04-01
A measure of play experience in video games was developed through literature review and two empirical validation studies. Despite the considerable attention given to games in the behavioral sciences, play experience remains empirically underexamined. One reason for this gap is the absence of a scale that measures play experience. In Study 1, the initial Play Experience Scale (PES) was tested through an online validation that featured three different games (N = 203). In Study 2, a revised PES was assessed with a serious game in the laboratory (N = 77). Through principal component analysis of the Study 1 data, the initial 20-item PES was revised, resulting in the 16-item PES-16. Study 2 showed the PES-16 to be a robust instrument with the same patterns of correlations as in Study 1 via (a) internal consistency estimates, (b) correlations with established scales of motivation, (c) distributions of PES-16 scores in different game conditions, and (d) examination of the average variance extracted of the PES and the Intrinsic Motivation Scale. We suggest that the PES is appropriate for use in further validation studies. Additional examinations of the scale are required to determine its applicability to other contexts and its relationship with other constructs. The PES is potentially relevant to human factors undertakings involving video games, including basic research into play, games, and learning; prototype testing; and exploratory learning studies.
Palliative sedation: reliability and validity of sedation scales.
Arevalo, Jimmy J; Brinkkemper, Tijn; van der Heide, Agnes; Rietjens, Judith A; Ribbe, Miel; Deliens, Luc; Loer, Stephan A; Zuurmond, Wouter W A; Perez, Roberto S G M
2012-11-01
Observer-based sedation scales have been used to provide a measurable estimate of the comfort of nonalert patients in palliative sedation. However, their usefulness and appropriateness in this setting has not been demonstrated. To study the reliability and validity of observer-based sedation scales in palliative sedation. A prospective evaluation of 54 patients under intermittent or continuous sedation with four sedation scales was performed by 52 nurses. Included scales were the Minnesota Sedation Assessment Tool (MSAT), Richmond Agitation-Sedation Scale (RASS), Vancouver Interaction and Calmness Scale (VICS), and a sedation score proposed in the Guideline for Palliative Sedation of the Royal Dutch Medical Association (KNMG). Inter-rater reliability was tested with the intraclass correlation coefficient (ICC) and Cohen's kappa coefficient. Correlations between the scales using Spearman's rho tested concurrent validity. We also examined construct, discriminative, and evaluative validity. In addition, nurses completed a user-friendliness survey. Overall moderate to high inter-rater reliability was found for the VICS interaction subscale (ICC = 0.85), RASS (ICC = 0.73), and KNMG (ICC = 0.71). The largest correlation between scales was found for the RASS and KNMG (rho = 0.836). All scales showed discriminative and evaluative validity, except for the MSAT motor subscale and VICS calmness subscale. Finally, the RASS was less time consuming, clearer, and easier to use than the MSAT and VICS. The RASS and KNMG scales stand as the most reliable and valid among the evaluated scales. In addition, the RASS was less time consuming, clearer, and easier to use than the MSAT and VICS. Further research is needed to evaluate the impact of the scales on better symptom control and patient comfort. Copyright © 2012 U.S. Cancer Pain Relief Committee. Published by Elsevier Inc. All rights reserved.
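To make the inter-rater agreement statistic mentioned above concrete, here is a minimal sketch of unweighted Cohen's kappa for two raters in plain Python. The function name `cohens_kappa` and the ratings are hypothetical illustrations, not values from the study, and the sketch assumes nominal categories (ordinal scales are often analysed with a weighted kappa instead).

```python
from collections import Counter


def cohens_kappa(rater_a, rater_b):
    """Unweighted Cohen's kappa for two equal-length lists of category labels."""
    n = len(rater_a)
    if n == 0 or n != len(rater_b):
        raise ValueError("both raters must score the same non-empty set of cases")
    # Proportion of cases on which the raters agree.
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Agreement expected by chance from each rater's marginal frequencies.
    count_a, count_b = Counter(rater_a), Counter(rater_b)
    expected = sum(count_a[c] * count_b[c] for c in count_a) / (n * n)
    if expected == 1:
        raise ValueError("kappa is undefined when chance agreement is 1")
    return (observed - expected) / (1 - expected)


# Hypothetical sedation ratings from two nurses, for illustration only.
nurse_1 = ["deep", "deep", "light", "light"]
nurse_2 = ["deep", "deep", "light", "deep"]
kappa = cohens_kappa(nurse_1, nurse_2)
```

Here the raters agree on three of four cases (0.75) while chance agreement is 0.5, so kappa is 0.5; the correction for chance is what distinguishes kappa from raw percent agreement.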
Overview of Heat Addition and Efficiency Predictions for an Advanced Stirling Convertor
NASA Technical Reports Server (NTRS)
Wilson, Scott D.; Reid, Terry; Schifer, Nicholas; Briggs, Maxwell
2011-01-01
Past methods of predicting net heat input needed to be validated. The validation effort pursued several paths, including improving model inputs, using test hardware to provide validation data, and validating high-fidelity models. The validation test hardware provided direct measurement of net heat input for comparison with predicted values. The predicted value of net heat input was 1.7 percent less than the measured value, and initial calculations of measurement uncertainty were 2.1 percent (under review). Lessons learned during the validation effort were incorporated into the convertor modeling approach, which improved predictions of convertor efficiency.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Li, Xiaolin; Ye, Li; Wang, Xiaoxiang
2012-12-15
Several recent reports suggested that hydroxylated polybrominated diphenyl ethers (HO-PBDEs) may disturb thyroid hormone homeostasis. To illuminate the structural features for thyroid hormone activity of HO-PBDEs and the binding mode between HO-PBDEs and thyroid hormone receptor (TR), the hormone activity of a series of HO-PBDEs to thyroid receptors β was studied based on the combination of 3D-QSAR, molecular docking, and molecular dynamics (MD) methods. The ligand- and receptor-based 3D-QSAR models were obtained using Comparative Molecular Similarity Index Analysis (CoMSIA) method. The optimum CoMSIA model with region focusing yielded satisfactory statistical results: leave-one-out cross-validation correlation coefficient (q²) was 0.571 and non-cross-validation correlation coefficient (r²) was 0.951. Furthermore, the results of internal validation such as bootstrapping, leave-many-out cross-validation, and progressive scrambling as well as external validation indicated the rationality and good predictive ability of the best model. In addition, molecular docking elucidated the conformations of compounds and key amino acid residues at the docking pocket, MD simulation further determined the binding process and validated the rationality of docking results. -- Highlights: ► The thyroid hormone activities of HO-PBDEs were studied by 3D-QSAR. ► The binding modes between HO-PBDEs and TRβ were explored. ► 3D-QSAR, molecular docking, and molecular dynamics (MD) methods were performed.
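The leave-one-out statistic quoted above can be sketched as follows. This is only an illustration under stated assumptions: CoMSIA models are normally fitted by partial least squares on many field descriptors, whereas this sketch substitutes an ordinary least-squares line on a single hypothetical descriptor, and it uses the common definition q² = 1 − PRESS / SS_total with the overall mean of the observed activities. The names `fit_line` and `loo_q2` and the data are made up for the example.

```python
def fit_line(xs, ys):
    """Ordinary least-squares slope and intercept for one descriptor."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def loo_q2(xs, ys):
    """Leave-one-out cross-validated q^2 = 1 - PRESS / SS_total."""
    n = len(ys)
    mean_y = sum(ys) / n
    press = 0.0
    for i in range(n):
        # Refit the model without compound i, then predict compound i.
        slope, intercept = fit_line(xs[:i] + xs[i + 1:], ys[:i] + ys[i + 1:])
        press += (ys[i] - (slope * xs[i] + intercept)) ** 2
    ss_total = sum((y - mean_y) ** 2 for y in ys)
    return 1 - press / ss_total


# Hypothetical descriptor values and activities, for illustration only.
descriptor = [1.0, 2.0, 3.0, 4.0, 5.0]
activity = [2.1, 3.9, 6.2, 7.8, 10.1]
q2 = loo_q2(descriptor, activity)
```

Because each prediction comes from a model that never saw the held-out compound, q² is typically lower than the fitted r², which is consistent with the gap between the two values reported in the abstract.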
Chilean Adaptation and Validation of the Early Adolescent Temperament Questionnaire-Revised Version
Hoffmann, Marianela; Pérez, J. Carola; García, Catalina; Rojas, Graciela; Martínez, Vania
2017-01-01
The aim of this study was to develop an adapted version of the Early Adolescent Temperament Questionnaire-Revised (EATQ-R) that would be valid and reliable for assessing temperament and its components in Chileans between 12 and 18 years of age. Originally, Ellis and Rothbart (2001) developed this questionnaire (EATQ-R) to be used in North American adolescents. For the study in Chile, a translation protocol was developed to maintain the original instrument's cultural and linguistic equivalence in the adapted version. Psychometric properties of the EATQ-R, such as factor structure, internal consistency, and convergent validity, were also assessed. The adaptation and validation were carried out in two stages, with two different studies. The first study, which included 612 adolescent students from educational establishments in the cities of Santiago and Concepcion, Chile, developed the Chilean version of the 83-item EATQ-R, which has 13 dimensions, belonging to 4 theoretical factors with adequate internal consistency (Cronbach's alpha = 0.79–0.82). The second study assessed the questionnaire's convergent validity, through its application to 973 adolescent students in Santiago. Results show that the effortful control subscale was significantly inversely related to indicators of adolescent maladjustment, such as substance abuse and behavioral problems. In addition, it was directly associated with indicators of self-concept, including self-esteem and self-efficacy. The opposite pattern was observed when considering negative affect. These findings coincide with current knowledge on the relationship between temperament and adjustment in adolescents. PMID:29326616
A twin study of specific bulimia nervosa symptoms.
Mazzeo, S E; Mitchell, K S; Bulik, C M; Aggen, S H; Kendler, K S; Neale, M C
2010-07-01
Twin studies have suggested that additive genetic factors significantly contribute to liability to bulimia nervosa (BN). However, the diagnostic criteria for BN remain controversial. In this study, an item-factor model was used to examine the BN diagnostic criteria and the genetic and environmental contributions to BN in a population-based twin sample. The validity of the equal environment assumption (EEA) for BN was also tested. Participants were 1024 female twins (MZ n=614, DZ n=410) from the population-based Mid-Atlantic Twin Registry. BN was assessed using symptom-level (self-report) items consistent with DSM-IV and ICD-10 diagnostic criteria. Items assessing BN were included in an item-factor model. The EEA was measured by items assessing similarity of childhood and adolescent environment, which have demonstrated construct validity. Scores on the EEA factor were used to specify the degree to which twins shared environmental experiences in this model. The EEA was not violated for BN. Modeling results indicated that the majority of the variance in BN was due to additive genetic factors. There was substantial variability in additive genetic and environmental contributions to specific BN symptoms. Most notably, vomiting was very strongly influenced by additive genetic factors, while other symptoms were much less heritable, including the influence of weight on self-evaluation. These results highlight the importance of assessing eating disorders at the symptom level. Refinement of eating disorder phenotypes could ultimately lead to improvements in treatment and targeted prevention, by clarifying sources of variation for specific components of symptomatology.
Chaabene, Helmi; Negra, Yassine; Bouguezzi, Raja; Capranica, Laura; Franchini, Emerson; Prieske, Olaf; Hbacha, Hamdi; Granacher, Urs
2018-01-01
The regular monitoring of physical fitness and sport-specific performance is important in elite sports to increase the likelihood of success in competition. This study aimed to systematically review and to critically appraise the methodological quality, validation data, and feasibility of the sport-specific performance assessment in Olympic combat sports like amateur boxing, fencing, judo, karate, taekwondo, and wrestling. A systematic search was conducted in the electronic databases PubMed, Google Scholar, and ScienceDirect up to October 2017. Studies in combat sports were included that reported validation data (e.g., reliability, validity, sensitivity) of sport-specific tests. Overall, 39 studies were eligible for inclusion in this review. The majority of studies (74%) contained sample sizes <30 subjects. Nearly one third of the reviewed studies lacked a sufficient description (e.g., anthropometrics, age, expertise level) of the included participants. Seventy-two percent of studies did not sufficiently report inclusion/exclusion criteria of their participants. In 62% of the included studies, the description and/or inclusion of a familiarization session(s) was either incomplete or nonexistent. Sixty percent of studies did not report any details about the stability of testing conditions. Approximately half of the studies examined reliability measures of the included sport-specific tests (intraclass correlation coefficient [ICC] = 0.43–1.00). Content validity was addressed in all included studies, criterion validity (only the concurrent aspect of it) in approximately half of the studies with correlation coefficients ranging from r = −0.41 to 0.90. Construct validity was reported in 31% of the included studies and predictive validity in only one. Test sensitivity was addressed in 13% of the included studies. The majority of studies (64%) ignored and/or provided incomplete information on test feasibility and methodological limitations of the sport-specific test. 
In 28% of the included studies, insufficient information or a complete lack of information was provided in the respective field of the test application. Several methodological gaps exist in studies that used sport-specific performance tests in Olympic combat sports. Additional research should adopt more rigorous validation procedures in the application and description of sport-specific performance tests in Olympic combat sports. PMID:29692739
Marsh, Herbert W; Martin, Andrew J; Jackson, Susan
2010-08-01
Based on the Physical Self Description Questionnaire (PSDQ) normative archive (n = 1,607 Australian adolescents), 40 of 70 items were selected to construct a new short form (PSDQ-S). The PSDQ-S was evaluated in a new cross-validation sample of 708 Australian adolescents and four additional samples: 349 Australian elite-athlete adolescents, 986 Spanish adolescents, 395 Israeli university students, 760 Australian older adults. Across these six groups, the 11 PSDQ-S factors had consistently high reliabilities and invariant factor structures. Study 1, using a missing-by-design variation of multigroup invariance tests, showed invariance across 40 PSDQ-S items and 70 PSDQ items. Study 2 demonstrated factorial invariance over a 1-year interval (test-retest correlations .57-.90; Mdn = .77), and good convergent and discriminant validity in relation to time. Study 3 showed good and nearly identical support for convergent and discriminant validity of PSDQ and PSDQ-S responses in relation to two other physical self-concept instruments.
Validation of the Impostor Phenomenon among Managers
Rohrmann, Sonja; Bechtoldt, Myriam N.; Leonhardt, Mona
2016-01-01
Following up on earlier investigations, the present research aims at validating the construct impostor phenomenon by taking other personality correlates into account and at examining whether the impostor phenomenon is a construct in its own right. In addition, gender effects as well as associations with dispositional working styles and strain are examined. In an online study we surveyed a sample of N = 242 individuals occupying leadership positions in different sectors. Confirmatory factor analyses provide empirical evidence for the discriminant validity of the impostor phenomenon. In accord with earlier studies we show that the impostor phenomenon is accompanied by higher levels of anxiety, dysphoric moods, emotional instability, a generally negative self-evaluation, and perfectionism. The study does not reveal any gender differences concerning the impostor phenomenon. With respect to working styles, persons with an impostor self-concept tend to show perfectionist as well as procrastinating behaviors. Moreover, they report being more stressed and strained by their work. In sum, the findings show that the impostor phenomenon constitutes a dysfunctional personality style. Practical implications are discussed. PMID:27313554
Jeon, Joonryong
2017-01-01
In this paper, a data compression technology-based intelligent data acquisition (IDAQ) system was developed for structural health monitoring of civil structures, and its validity was tested using random signals (El-Centro seismic waveform). The IDAQ system was structured to include a high-performance CPU with large dynamic memory for multi-input and output in a radio frequency (RF) manner. In addition, embedded software technology (EST) was applied to it to implement the diverse logic needed in the process of acquiring, processing and transmitting data. To use the IDAQ system for the structural health monitoring of civil structures, this study developed an artificial filter bank by which structural dynamic responses (acceleration) were efficiently acquired, and also optimized it on the random El-Centro seismic waveform. All techniques developed in this study have been embedded in our system. The data compression technology-based IDAQ system was shown to acquire valid signals in a compressed size. PMID:28704945
Heo, Gwanghee; Jeon, Joonryong
2017-07-12
In this paper, a data compression technology-based intelligent data acquisition (IDAQ) system was developed for structural health monitoring of civil structures, and its validity was tested using random signals (El-Centro seismic waveform). The IDAQ system was structured to include a high-performance CPU with large dynamic memory for multi-input and output in a radio frequency (RF) manner. In addition, the embedded software technology (EST) has been applied to it to implement diverse logics needed in the process of acquiring, processing and transmitting data. In order to utilize IDAQ system for the structural health monitoring of civil structures, this study developed an artificial filter bank by which structural dynamic responses (acceleration) were efficiently acquired, and also optimized it on the random El-Centro seismic waveform. All techniques developed in this study have been embedded to our system. The data compression technology-based IDAQ system was proven valid in acquiring valid signals in a compressed size.
McClelland, Robyn L; Jorgensen, Neal W; Budoff, Matthew; Blaha, Michael J; Post, Wendy S; Kronmal, Richard A; Bild, Diane E; Shea, Steven; Liu, Kiang; Watson, Karol E; Folsom, Aaron R; Khera, Amit; Ayers, Colby; Mahabadi, Amir-Abbas; Lehmann, Nils; Jöckel, Karl-Heinz; Moebus, Susanne; Carr, J Jeffrey; Erbel, Raimund; Burke, Gregory L
2015-10-13
Several studies have demonstrated the tremendous potential of using coronary artery calcium (CAC) in addition to traditional risk factors for coronary heart disease (CHD) risk prediction. However, to date, no risk score incorporating CAC has been developed. The goal of this study was to derive and validate a novel risk score to estimate 10-year CHD risk using CAC and traditional risk factors. Algorithm development was conducted in the MESA (Multi-Ethnic Study of Atherosclerosis), a prospective community-based cohort study of 6,814 participants age 45 to 84 years, who were free of clinical heart disease at baseline and followed for 10 years. MESA is sex balanced and included 39% non-Hispanic whites, 12% Chinese Americans, 28% African Americans, and 22% Hispanic Americans. External validation was conducted in the HNR (Heinz Nixdorf Recall Study) and the DHS (Dallas Heart Study). Inclusion of CAC in the MESA risk score offered significant improvements in risk prediction (C-statistic 0.80 vs. 0.75; p < 0.0001). External validation in both the HNR and DHS studies provided evidence of very good discrimination and calibration. Harrell's C-statistic was 0.779 in HNR and 0.816 in DHS. Additionally, the difference in estimated 10-year risk between events and nonevents was approximately 8% to 9%, indicating excellent discrimination. Mean calibration, or calibration-in-the-large, was excellent for both studies, with average predicted 10-year risk within one-half of a percent of the observed event rate. An accurate estimate of 10-year CHD risk can be obtained using traditional risk factors and CAC. The MESA risk score, which is available online on the MESA web site for easy use, can be used to aid clinicians when communicating risk to patients and when determining risk-based treatment strategies. Copyright © 2015 American College of Cardiology Foundation. Published by Elsevier Inc. All rights reserved.
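The C-statistic quoted above measures discrimination: the probability that a participant who had an event was assigned a higher predicted risk than one who did not. The sketch below is a simplified binary-outcome version with ties counted as one half. It is an illustration only, not the study's method: Harrell's C, as reported in the abstract, additionally handles follow-up time and censoring, and the function name and the risk values are hypothetical.

```python
def c_statistic(predicted_risk, had_event):
    """Concordance for a binary outcome (simplified; ignores censoring)."""
    event_risks = [r for r, e in zip(predicted_risk, had_event) if e]
    nonevent_risks = [r for r, e in zip(predicted_risk, had_event) if not e]
    if not event_risks or not nonevent_risks:
        raise ValueError("need at least one event and one non-event")
    score = 0.0
    for event_risk in event_risks:
        for nonevent_risk in nonevent_risks:
            if event_risk > nonevent_risk:
                score += 1.0      # concordant pair
            elif event_risk == nonevent_risk:
                score += 0.5      # tied pair counts as half
    return score / (len(event_risks) * len(nonevent_risks))


# Hypothetical predicted 10-year risks and outcomes (not MESA data).
risks = [0.9, 0.8, 0.3, 0.2]
events = [1, 0, 1, 0]
c_value = c_statistic(risks, events)
print(c_value)
```

A value of 0.5 corresponds to chance-level ranking and 1.0 to perfect separation of events from non-events.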
Effect of using truncated versus total foot length to calculate the arch height ratio.
McPoil, Thomas G; Cornwall, Mark W; Vicenzino, Bill; Teyhen, Deydre S; Molloy, Joseph M; Christie, Douglas S; Collins, Natalie
2008-12-01
The purpose of this study was to determine the arch height ratio in a large cohort of subjects as well as to assess the reliability and validity of the foot measurements utilized in the study. Eight hundred and fifty subjects, 393 women and 457 men, consented to participate in the study. The dorsal arch height, total foot length, and the truncated foot length were used to calculate two variations of the arch height ratio. In addition to determining within- and between-rater measurement reliability, radiographs were used to establish validity. The truncated arch height ratio can be estimated using the total foot length, unless toe deformities are present in the individual being assessed. All foot measurements had high levels of intra- and inter-rater reliability and the validity of measuring the dorsal arch height while standing with equal weight on both feet was established. This investigation provides normative values from a large cohort of healthy female and male subjects for two variations of the arch height ratio. The arch height ratio is a reliable and valid measurement that may prove useful to clinicians and researchers for the classification of foot posture.
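The two variations of the arch height ratio can be sketched as follows, assuming each variation divides the dorsal arch height by one of the two foot-length measures (the abstract does not spell out the formula). The function name and the measurements are hypothetical and are not data from the study.

```python
def arch_height_ratio(dorsal_arch_height_cm, foot_length_cm):
    """Return dorsal arch height divided by a foot-length measure (unitless)."""
    if foot_length_cm <= 0:
        raise ValueError("foot length must be positive")
    return dorsal_arch_height_cm / foot_length_cm


# Hypothetical measurements for one subject, in centimetres.
dorsal_arch_height = 6.5
truncated_foot_length = 18.0
total_foot_length = 25.0

ratio_truncated = arch_height_ratio(dorsal_arch_height, truncated_foot_length)
ratio_total = arch_height_ratio(dorsal_arch_height, total_foot_length)
print(f"truncated: {ratio_truncated:.3f}, total: {ratio_total:.3f}")
```

Because the truncated length excludes the toes, the truncated ratio is always the larger of the two for the same foot.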
Validation of a Malay Version of the Smartphone Addiction Scale among Medical Students in Malaysia
Sazlly Lim, Sazlyna Mohd; Wan Sulaiman, Wan Aliaa; Foo, Yoke Loong; Hoo, Fan kee
2015-01-01
Introduction: This study was initiated to determine the psychometric properties of the Smart Phone Addiction Scale (SAS) by translating and validating this scale into the Malay language (SAS-M), which is the main language spoken in Malaysia. This study can distinguish smart phone and internet addiction among multi-ethnic Malaysian medical students. In addition, the reliability and validity of the SAS was also demonstrated. Materials and Methods: A total of 228 participants were selected between August 2014 and September 2014 to complete a set of questionnaires, including the SAS and the modified Kimberly Young Internet addiction test (IAT) in the Malay language. Results: There were 99 males and 129 females with ages ranging from 19 to 22 years old (21.7±1.1) included in this study. Descriptive and factor analyses, intra-class coefficients, t-tests and correlation analyses were conducted to verify the reliability and validity of the SAS. Bartlett’s test of sphericity was significant (p < 0.01), and the Kaiser-Meyer-Olkin measure of sampling adequacy for the SAS-M was 0.92, indicating that factor analysis was appropriate. The internal consistency and concurrent validity of the SAS-M were verified (Cronbach’s alpha = 0.94). All of the subscales of the SAS-M, except for positive anticipation, were significantly related to the Malay version of the IAT. Conclusions: This study developed the first smart phone addiction scale among medical students. This scale was shown to be reliable and valid in the Malay language. PMID:26431511
Montazeri, Ali; Torkan, Behnaz; Omidvari, Sepideh
2007-04-04
The Edinburgh Postnatal Depression Scale (EPDS) is a widely used instrument to measure postnatal depression. This study aimed to translate and to test the reliability and validity of the EPDS in Iran. The English language version of the EPDS was translated into Persian (Iranian language) and was used in this study. The questionnaire was administered to a consecutive sample of 100 women with normal (n = 50) and caesarean section (n = 50) deliveries at two points in time: 6 to 8 weeks and 12 to 14 weeks after delivery. Statistical analysis was performed to test the reliability and validity of the EPDS. Overall 22% of women at time 1 and 18% at time 2 reported experiencing postpartum depression. In general, the Iranian version of the EPDS was found to be acceptable to almost all women. Cronbach's alpha coefficient (to test reliability) was found to be 0.77 at time 1 and 0.86 at time 2. In addition, test-retest reliability was assessed and the intraclass correlation coefficient was found to be 0.80. Validity, assessed using known-groups comparison, showed satisfactory results. The questionnaire discriminated well between sub-groups of women differing in mode of delivery in the expected direction. The factor analysis indicated a three-factor structure that jointly accounted for 58% of the variance. This preliminary validation study of the Iranian version of the EPDS proved that it is an acceptable, reliable and valid measure of postnatal depression. It seems that the EPDS not only measures postpartum depression but also may be measuring something more.
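Several records in this listing report Cronbach's alpha as the index of internal consistency. A minimal sketch of the standard formula, alpha = k/(k-1) × (1 − Σ item variances / variance of total scores), is given below. The function name and the item scores are made up for illustration and are not EPDS data.

```python
from statistics import variance


def cronbach_alpha(items):
    """items: list of k lists, each holding one item's scores for the same respondents."""
    k = len(items)
    if k < 2:
        raise ValueError("need at least two items")
    # Total score per respondent, summing across items.
    totals = [sum(scores) for scores in zip(*items)]
    item_variance_sum = sum(variance(item) for item in items)
    return k / (k - 1) * (1 - item_variance_sum / variance(totals))


# Hypothetical scores: two items answered by four respondents.
item_a = [1, 2, 3, 4]
item_b = [2, 1, 4, 3]
alpha = cronbach_alpha([item_a, item_b])
print(round(alpha, 2))
```

Items that rise and fall together across respondents push alpha towards 1, while unrelated items pull it towards 0.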
Maïano, Christophe; Bégarie, Jérôme; Morin, Alexandre J S; Garbarino, Jean-Marie; Ninot, Grégory
2010-01-01
The purpose of this study was to test the reliability (i.e. internal consistency and test-retest reliability) and construct validity (i.e. content validity, factor validity, measurement invariance, and latent mean invariance) of the Nutrition and Activity Knowledge Scale (NAKS) in a sample of French adolescents with mild to moderate Intellectual Disability (ID). A total sample of 260 adolescents (144 boys and 116 girls), aged between 12 and 18 years old, with mild to moderate ID was involved in two studies. In the first study, analysis of items' content reveals that many words from the original version were not understood or induced confusion. These items were reworded and simplified while retaining their original meaning. In the second study, results provided support for: (i) the factor validity and reliability of a 15-item French version of the NAKS; (ii) the measurement invariance of the resulting NAKS across genders and ID levels; (iii) the partial measurement invariance of the resulting NAKS across age groups and type of school placement. In addition, the latent means of the 15-item French version of the NAKS proved to be invariant across gender, age categories, and ID levels, but to vary across type of school placement (with adolescents schooled in self-contained classes from regular schools presenting higher levels of NAK than adolescents placed in specialized establishments). The present results thus provide preliminary evidence regarding the construct validity of a 15-item French version of the NAKS in a sample of adolescents with ID.
Singer, Sara; Meterko, Mark; Baker, Laurence; Gaba, David; Falwell, Alyson; Rosen, Amy
2007-01-01
Objective: To describe the development of an instrument for assessing workforce perceptions of hospital safety culture and to assess its reliability and validity. Data Sources/Study Setting: Primary data collected between March 2004 and May 2005. Personnel from 105 U.S. hospitals completed a 38-item paper and pencil survey. We received 21,496 completed questionnaires, representing a 51 percent response rate. Study Design: Based on review of existing safety climate surveys, we developed a list of key topics pertinent to maintaining a culture of safety in high-reliability organizations. We developed a draft questionnaire to address these topics and pilot tested it in four preliminary studies of hospital personnel. We modified the questionnaire based on experience and respondent feedback, and distributed the revised version to 42,249 hospital workers. Data Collection: We randomly divided respondents into derivation and validation samples. We applied exploratory factor analysis to responses in the derivation sample. We used those results to create scales in the validation sample, which we subjected to multitrait analysis (MTA). Principal Findings: We identified nine constructs, three organizational factors, two unit factors, three individual factors, and one additional factor. Constructs demonstrated substantial convergent and discriminant validity in the MTA. Cronbach's α coefficients ranged from 0.50 to 0.89. Conclusions: It is possible to measure key salient features of hospital safety climate using a valid and reliable 38-item survey and appropriate hospital sample sizes. This instrument may be used in further studies to better understand the impact of safety climate on patient safety outcomes. PMID:17850530
NG, Chong Guan; CHIN, Soo Cheng; YEE, Anne Hway Ann; LOH, Huai Seng; SULAIMAN, Ahmad Hatim; Sherianne Sook Kuan, WONG; HABIL, Mohamed Hussain
2014-01-01
Background: The Snaith-Hamilton Pleasure Scale (SHAPS) is a self-assessment scale designed to evaluate anhedonia in various psychiatric disorders. In order to facilitate its use in Malaysian settings, our current study aimed to examine the validity of a Malay-translated version of the SHAPS (SHAPS-M). Methods: In this cross-sectional study, a total of 44 depressed patients and 82 healthy subjects were recruited from a university out-patient clinic. All participants were given both the Malay and English versions of the SHAPS, Fawcett-Clark Pleasure Scale (FCPS), General Health Questionnaire 12 (GHQ-12), and the Beck Depression Inventory (BDI) to assess their hedonic state, general mental health condition and levels of depression. Results: The results showed that the SHAPS-M has impressive internal consistency (α = 0.96), concurrent validity and good parallel-form reliability (intraclass coefficient, ICC = 0.65). Conclusion: In addition to demonstrating good psychometric properties, the SHAPS-M is easy to administer. Therefore, it is a valid, reliable, and suitable questionnaire for assessing anhedonia among depressed patients in Malaysia. PMID:25246837
Evaluation Instruments for Quality of Life Related to Melasma: An Integrative Review.
Pollo, Camila Fernandes; Meneguin, Silmara; Miot, Helio Amante
2018-05-21
The aim of this study was to analyze scientific production concerning the validation and cultural adaptation of quality of life evaluation instruments for patients with melasma and to offer a critical reflection on these methods. A literature review was performed based on a search of the Web of Science, Bireme, PubMed, Elsevier Scopus, and Google Scholar databases. All published articles from indexed periodicals in these electronic databases up to December 2015 were included. Eight articles were identified, of which only one (12.5%) referred to the development and validation of a specific instrument for evaluation of the quality of life of melasma patients. An additional six articles (75%) referred to transcultural adjustment and validation of the same instrument in other languages, and another (12.5%) article reported the development of a generic instrument for evaluation of quality of life in patients with pigment disorders. This review revealed only one specific instrument developed and validated in different cultures. Despite being widely used, this instrument did not follow the classic construction steps for psychometric instruments, which paves the way for future studies to develop novel instruments.
Ng, Chong Guan; Chin, Soo Cheng; Yee, Anne Hway Ann; Loh, Huai Seng; Sulaiman, Ahmad Hatim; Sherianne Sook Kuan, Wong; Habil, Mohamed Hussain
2014-05-01
The Snaith-Hamilton Pleasure Scale (SHAPS) is a self-assessment scale designed to evaluate anhedonia in various psychiatric disorders. In order to facilitate its use in Malaysian settings, our current study aimed to examine the validity of a Malay-translated version of the SHAPS (SHAPS-M). In this cross-sectional study, a total of 44 depressed patients and 82 healthy subjects were recruited from a university out-patient clinic. All participants were given both the Malay and English versions of the SHAPS, Fawcett-Clark Pleasure Scale (FCPS), General Health Questionnaire 12 (GHQ-12), and the Beck Depression Inventory (BDI) to assess their hedonic state, general mental health condition and levels of depression. The results showed that the SHAPS-M has impressive internal consistency (α = 0.96), concurrent validity and good parallel-form reliability (intraclass coefficient, ICC = 0.65). In addition to demonstrating good psychometric properties, the SHAPS-M is easy to administer. Therefore, it is a valid, reliable, and suitable questionnaire for assessing anhedonia among depressed patients in Malaysia.
Development and Validation of the Pregnancy and Infant Orientation Questionnaire
Meyer-Bahlburg, Heino F. L.; Dolezal, Curtis; Johnson, Laurel L.; Kessler, Suzanne J.; Schober, Justine M.; Zucker, Kenneth J.
2009-01-01
The purpose of this current study was the construction, scaling, and scale validation of a self-report questionnaire assessing biographical information and motivation for pregnancy and infant care in men and women with disorders of sex development or other gender variations of potential clinical relevance. The overall design of the questionnaire and the initial item pool were derived from related clinical and research experience. Collection of pilot data and, where appropriate, scale construction (via principal components analyses) were based on Canadian convenience samples of heterosexual (HET) and non-HET men and women (N=414). A sample of gender-dysphoric (GDYS) men and women (n=45) was added for validity analysis. Validation of the resulting scales was based on the demonstration of expected scale differences between HET men and women, as well as between HET, non-HET, and GDYS men and between HET, non-HET, and GDYS women, and was successful with one exception. This study concludes that this new questionnaire, in addition to its descriptive sections, provides several distinct scales related to desires for child bearing and child rearing and has good psychometric properties. PMID:19998066
Evaluation Instruments for Quality of Life Related to Melasma: An Integrative Review
Pollo, Camila Fernandes; Meneguin, Silmara; Miot, Helio Amante
2018-01-01
The aim of this study was to analyze scientific production concerning the validation and cultural adaptation of quality of life evaluation instruments for patients with melasma and to offer a critical reflection on these methods. A literature review was performed based on a search of the Web of Science, Bireme, PubMed, Elsevier Scopus, and Google Scholar databases. All published articles from indexed periodicals in these electronic databases up to December 2015 were included. Eight articles were identified, of which only one (12.5%) referred to the development and validation of a specific instrument for evaluation of the quality of life of melasma patients. An additional six articles (75%) referred to transcultural adjustment and validation of the same instrument in other languages, and another (12.5%) article reported the development of a generic instrument for evaluation of quality of life in patients with pigment disorders. This review revealed only one specific instrument developed and validated in different cultures. Despite being widely used, this instrument did not follow the classic construction steps for psychometric instruments, which paves the way for future studies to develop novel instruments. PMID:29791603
Further Validation of the Multidimensional Fatigue Symptom Inventory-Short Form
Stein, Kevin D.; Jacobsen, Paul B.; Blanchard, Chris M.; Thors, Christina
2008-01-01
A growing body of evidence is documenting the multidimensional nature of cancer-related fatigue. Although several multidimensional measures of fatigue have been developed, further validation of these scales is needed. To this end, the current study sought to evaluate the factorial and construct validity of the 30-item Multidimensional Fatigue Symptom Inventory-Short Form (MFSI-SF). A heterogeneous sample of 304 cancer patients (mean age 55 years) completed the MFSI-SF, along with several other measures of psychosocial functioning including the MOS-SF-36 and Fatigue Symptom Inventory, following the fourth cycle of chemotherapy treatment. The results of a confirmatory factor analysis indicated the 5-factor model provided a good fit to the data as evidenced by commonly used goodness of fit indices (CFI 0.90 and IFI 0.90). Additional evidence for the validity of the MFSI-SF was provided via correlations with other relevant instruments (range −0.21 to 0.82). In sum, the current study provides support for the MFSI-SF as a valuable tool for the multidimensional assessment of cancer-related fatigue. PMID:14711465
Externalizing disorders: cluster 5 of the proposed meta-structure for DSM-V and ICD-11.
Krueger, R F; South, S C
2009-12-01
The extant major psychiatric classifications DSM-IV and ICD-10 are purportedly atheoretical and largely descriptive. Although this achieves good reliability, the validity of a medical diagnosis is greatly enhanced by an understanding of the etiology. In an attempt to group mental disorders on the basis of etiology, five clusters have been proposed. We consider the validity of the fifth cluster, externalizing disorders, within this proposal. We reviewed the literature in relation to 11 validating criteria proposed by the Study Group of the DSM-V Task Force, in terms of the extent to which these criteria support the idea of a coherent externalizing spectrum of disorders. This cluster distinguishes itself by the central role of disinhibitory personality in mental disorders spread throughout sections of the current classifications, including substance dependence, antisocial personality disorder and conduct disorder. Shared biomarkers, co-morbidity and course offer additional evidence for a valid cluster of externalizing disorders. Externalizing disorders meet many of the salient criteria proposed by the Study Group of the DSM-V Task Force to suggest a classification cluster.
Crouse, Cecelia A; Yeung, Stephanie; Greenspoon, Susan; McGuckian, Amy; Sikorsky, Julie; Ban, Jeff; Mathies, Richard
2005-08-01
The aim was to present validation studies performed for the implementation of existing and new technologies to increase the efficiency in the forensic DNA Section of the Palm Beach County Sheriff's Office (PBSO) Crime Laboratory. Using federally funded grants, internal support, and an external Process Mapping Team, the PBSO collaborated with forensic vendors, universities, and other forensic laboratories to enhance DNA testing procedures, including validation of the DNA IQ magnetic bead extraction system, robotic DNA extraction using the BioMek2000, and the ABI7000 Sequence Detection System, and is currently evaluating a micro Capillary Array Electrophoresis device. The PBSO successfully validated and implemented both manual and automated Promega DNA IQ magnetic bead extraction systems, which have increased DNA profile results from samples with low DNA template concentrations. The Beckman BioMek2000 DNA robotic workstation has been validated for blood, tissue, bone, hair, epithelial cells (touch evidence), and mixed stains such as semen. There has been a dramatic increase in the number of samples tested per case since implementation of the robotic extraction protocols. The validation of the ABI7000 real-time quantitative polymerase chain reaction (qPCR) technology and the single multiplex short tandem repeat (STR) PowerPlex16 BIO amplification system has provided both a time and a financial benefit. In addition, the qPCR system allows more accurate DNA concentration data and the PowerPlex 16 BIO multiplex generates DNA profile data in half the time when compared to PowerPlex1.1 and PowerPlex2.1 STR systems. The PBSO's future efficiency requirements are being addressed through collaboration with the University of California at Berkeley and the Virginia Division of Forensic Science to validate microcapillary array electrophoresis instrumentation. Initial data demonstrated the electrophoresis of 96 samples in less than twenty minutes.
The PBSO demonstrated, through the validation of more efficient extraction and quantification technology, an increase in the number of evidence samples tested using robotic/DNA IQ magnetic bead DNA extraction, a decrease in the number of negative samples amplified due to qPCR and implementation of a single multiplex amplification system. In addition, initial studies show the microcapillary array electrophoresis device (microCAE) evaluation results provide greater sensitivity and faster STR analysis output than current platforms.
Blast effect on the lower extremities and its mitigation: a computational study.
Dong, Liqiang; Zhu, Feng; Jin, Xin; Suresh, Mahi; Jiang, Binhui; Sevagan, Gopinath; Cai, Yun; Li, Guangyao; Yang, King H
2013-12-01
A series of computational studies were performed to investigate the response of the lower extremities of mounted soldiers under landmine detonation. A numerical human body model newly developed at Wayne State University was used to simulate two types of experimental studies and the model predictions were validated against test data in terms of the tibia axial force as well as bone fracture pattern. Based on the validated model, the minimum axial force causing tibia fracture was found. Then a series of parametric studies was conducted to determine the critical velocity (peak velocity of the floor plate) causing tibia fracture at different upper/lower leg angles. In addition, to limit the load transmission through the vehicular floor, two types of energy absorbing materials, namely IMPAXX® foam and aluminum alloy honeycomb, were selected for floor matting. Their performances in terms of blast effect mitigation were compared using the validated numerical model, and it has been found that honeycomb is a more efficient material for blast injury prevention under the loading conditions studied. © 2013 Elsevier Ltd. All rights reserved.
Development and validation of a two-dimensional fast-response flood estimation model
DOE Office of Scientific and Technical Information (OSTI.GOV)
Judi, David R; Mcpherson, Timothy N; Burian, Steven J
2009-01-01
A finite difference formulation of the shallow water equations using an upwind differencing method was developed maintaining computational efficiency and accuracy such that it can be used as a fast-response flood estimation tool. The model was validated using both laboratory controlled experiments and an actual dam breach. Through the laboratory experiments, the model was shown to give good estimations of depth and velocity when compared to the measured data, as well as when compared to a more complex two-dimensional model. Additionally, the model was compared to high water mark data obtained from the failure of the Taum Sauk dam. The simulated inundation extent agreed well with the observed extent, with the most notable differences resulting from the inability to model sediment transport. The results of these validation studies show that a relatively simple numerical scheme used to solve the complete shallow water equations can be used to accurately estimate flood inundation. Future work will focus on further reducing the computation time needed to provide flood inundation estimates for fast-response analyses. This will be accomplished through the efficient use of multi-core, multi-processor computers coupled with an efficient domain-tracking algorithm, as well as an understanding of the impacts of grid resolution on model results.
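To illustrate what upwind differencing means, the sketch below applies a first-order upwind step to one-dimensional scalar advection with a positive velocity and a periodic boundary. This is a deliberately reduced toy problem, not the authors' two-dimensional shallow water model; the function name, grid, and values are all assumptions made for illustration.

```python
def upwind_step(u, velocity, dt, dx):
    """Advance u by one time step using first-order upwind differencing.

    Assumes velocity >= 0, so information arrives from the left neighbour.
    The boundary is periodic: cell 0 uses the last cell as its neighbour.
    """
    courant = velocity * dt / dx
    if not 0.0 <= courant <= 1.0:
        raise ValueError("Courant number must lie in [0, 1] for stability")
    return [u[i] - courant * (u[i] - u[i - 1]) for i in range(len(u))]


# Hypothetical initial profile: a single pulse in the second cell.
profile = [0.0, 1.0, 0.0, 0.0]
shifted = upwind_step(profile, velocity=1.0, dt=1.0, dx=1.0)
smeared = upwind_step(profile, velocity=1.0, dt=0.5, dx=1.0)
print(shifted, smeared)
```

With a Courant number of exactly 1 the pulse moves one cell without distortion. At smaller values the scheme stays stable but spreads the pulse, which is the usual trade-off of first-order upwind methods.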
Peters, Johanna; Taute, Wolfgang; Bartscher, Kathrin; Döscher, Claas; Höft, Michael; Knöchel, Reinhard; Breitkreutz, Jörg
2017-04-08
Microwave sensor systems using resonance technology at a single resonance in the range of 2-3 GHz have been shown to be a rapid and reliable tool for moisture determination in solid materials including pharmaceutical granules. So far, their application has been limited to lower moisture ranges, or limitations above certain moisture contents had to be accepted. The aim of the present study was to develop a novel multi-resonance sensor system in order to expand the measurement range. Therefore, a novel sensor using additional resonances over a wide frequency band was designed and used to investigate inherent limitations of first-generation sensor systems and material-related limits. Using granule samples with different moisture contents, an experimental protocol for calibration and validation of the method was established. Pursuant to this protocol, a multiple linear regression (MLR) prediction model built by correlating microwave moisture values to the moisture determined by Karl Fischer titration was chosen and rated using conventional criteria such as coefficient of determination (R²) and root mean square error of calibration (RMSEC). Using different operators, different analysis dates and different ambient conditions, the method was fully validated following the guidance of ICH Q2(R1). The study identified explanations for the measurement uncertainties of first-generation sensor systems, which confirmed the approach of overcoming these by using additional resonances. The established prediction model could be validated in the range of 7.6-19.6%, demonstrating its fitness for its future purpose, the determination of moisture content during wet granulation. Copyright © 2017 Elsevier B.V. All rights reserved.
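The two rating criteria named above can be computed from paired reference and predicted values. The sketch below assumes RMSEC is the root of the mean squared calibration residual (dividing by n; some texts divide by the residual degrees of freedom instead). The function names and the moisture values are hypothetical and are not taken from the study.

```python
from math import sqrt


def r_squared(reference, predicted):
    """Coefficient of determination: 1 - SS_residual / SS_total."""
    mean_reference = sum(reference) / len(reference)
    ss_residual = sum((r - p) ** 2 for r, p in zip(reference, predicted))
    ss_total = sum((r - mean_reference) ** 2 for r in reference)
    return 1 - ss_residual / ss_total


def rmsec(reference, predicted):
    """Root mean square error of calibration (dividing by n, an assumption)."""
    squared_errors = [(r - p) ** 2 for r, p in zip(reference, predicted)]
    return sqrt(sum(squared_errors) / len(squared_errors))


# Hypothetical moisture contents in percent: reference values (standing in
# for a titration result) against model predictions for the same samples.
reference_moisture = [8.0, 10.0, 12.0, 14.0]
predicted_moisture = [8.5, 9.5, 12.5, 13.5]

r2_value = r_squared(reference_moisture, predicted_moisture)
rmsec_value = rmsec(reference_moisture, predicted_moisture)
print(r2_value, rmsec_value)
```

R² is unitless, while RMSEC carries the unit of the measured quantity, here percentage points of moisture.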
Extensive validation of the pain disability index in 3 groups of patients with musculoskeletal pain.
Soer, Remko; Köke, Albère J A; Vroomen, Patrick C A J; Stegeman, Patrick; Smeets, Rob J E M; Coppes, Maarten H; Reneman, Michiel F
2013-04-20
A cross-sectional study was performed to validate the pain disability index (PDI) extensively in 3 groups of patients with musculoskeletal pain. The PDI is a widely used and studied instrument for disability related to various pain syndromes, although there is conflicting evidence concerning factor structure, test-retest reliability, and missing items. Additionally, an official translation of the Dutch language version has never been performed. For reliability, internal consistency, factor structure, test-retest reliability and measurement error were calculated. Validity was tested with hypothesized correlations with pain intensity, kinesiophobia, Rand-36 subscales, Depression, Roland-Morris Disability Questionnaire, Quality of Life, and Work Status. Structural validity was tested with independent backward translation and approval from the original authors. One hundred seventy-eight patients with acute back pain, 425 patients with chronic low back pain and 365 with widespread pain were included. Internal consistency of the PDI was good. One factor was identified with factor analyses. Test-retest reliability was good for the PDI (intraclass correlation coefficient, 0.76). Standard error of measurement was 6.5 points and smallest detectable change was 17.9 points. Weak correlations were observed between the PDI and kinesiophobia and depression, fair correlations with pain intensity, work status, and vitality, and moderate correlations with the Rand-36 subscales and the Roland-Morris Disability Questionnaire. The PDI-Dutch language version is internally consistent as a 1-factor structure, and test-retest reliable. The proportion of missing responses seems high for the sexual and professional items. Using the PDI as a 2-factor questionnaire has no additional value and is unreliable.
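The standard error of measurement (SEM) and the smallest detectable change (SDC) are commonly linked by SDC = 1.96 × √2 × SEM. The sketch below applies that relation to the rounded SEM quoted in the abstract. Whether the authors used exactly this formula is an assumption, and the small gap to the reported 17.9 points would be consistent with the SEM having been rounded to 6.5 before publication.

```python
from math import sqrt


def smallest_detectable_change(sem, z=1.96):
    """SDC at the 95% level from the standard error of measurement."""
    # sqrt(2) accounts for measurement error in both test and retest scores.
    return z * sqrt(2) * sem


sem_points = 6.5  # rounded SEM as reported in the abstract
sdc_points = smallest_detectable_change(sem_points)
print(f"{sdc_points:.1f}")
```

In practice, a change in an individual patient's score smaller than the SDC cannot be distinguished from measurement error with 95% confidence.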
Validation of the Minority Stress Scale Among Italian Gay and Bisexual Men
Pala, Andrea Norcini; Dell’Amore, Francesca; Steca, Patrizia; Clinton, Lauren; Sandfort, Theodorus; Rael, Christine
2017-01-01
The experience of sexual orientation stigma (e.g., homophobic discrimination and physical aggression) generates minority stress, a chronic form of psychosocial stress. Minority stress has been shown to have a negative effect on gay and bisexual men’s (GBM’s) mental and physical health, increasing the rates of depression, suicidal ideation, and HIV risk behaviors. In conservative religious settings, such as Italy, sexual orientation stigma can be more frequently and/or more intensively experienced. However, minority stress among Italian GBM remains understudied. The aim of this study was to explore the dimensionality, internal reliability, and convergent validity of the Minority Stress Scale (MSS), a comprehensive instrument designed to assess the manifestations of sexual orientation stigma. The MSS consists of 50 items assessing (a) Structural Stigma, (b) Enacted Stigma, (c) Expectations of Discrimination, (d) Sexual Orientation Concealment, (e) Internalized Homophobia Toward Others, (f) Internalized Homophobia toward Oneself, and (g) Stigma Awareness. We recruited an online sample of 451 Italian GBM to take the MSS. We tested convergent validity using the Perceived Stress Questionnaire. Through exploratory factor analysis, we extracted the 7 theoretical factors and an additional 3-item factor assessing Expectations of Discrimination From Family Members. The MSS factors showed good internal reliability (ordinal α > .81) and good convergent validity. Our scale can be suitable for applications in research settings, psychosocial interventions, and, potentially, in clinical practice. Future studies will be conducted to further investigate the properties of the MSS, exploring the association with additional health-related measures (e.g., depressive symptoms and anxiety). PMID:29479555
Süß, Heinz-Martin; Kretzschmar, André
2018-01-01
The original aim of complex problem solving (CPS) research was to bring the cognitive demands of complex real-life problems into the lab in order to investigate problem solving behavior and performance under controlled conditions. Up until now, the validity of psychometric intelligence constructs has been scrutinized with regard to its importance for CPS performance. At the same time, different CPS measurement approaches competing for the title of the best way to assess CPS have been developed. In the first part of the paper, we investigate the predictability of CPS performance on the basis of the Berlin Intelligence Structure Model and Cattell's investment theory as well as an elaborated knowledge taxonomy. In the first study, 137 students managed a simulated shirt factory ( Tailorshop ; i.e., a complex real life-oriented system) twice, while in the second study, 152 students completed a forestry scenario ( FSYS ; i.e., a complex artificial world system). The results indicate that reasoning - specifically numerical reasoning (Studies 1 and 2) and figural reasoning (Study 2) - are the only relevant predictors among the intelligence constructs. We discuss the results with reference to the Brunswik symmetry principle. Path models suggest that reasoning and prior knowledge influence problem solving performance in the Tailorshop scenario mainly indirectly. In addition, different types of system-specific knowledge independently contribute to predicting CPS performance. The results of Study 2 indicate that working memory capacity, assessed as an additional predictor, has no incremental validity beyond reasoning. We conclude that (1) cognitive abilities and prior knowledge are substantial predictors of CPS performance, and (2) in contrast to former and recent interpretations, there is insufficient evidence to consider CPS a unique ability construct. 
In the second part of the paper, we discuss our results in light of recent CPS research, which predominantly utilizes the minimally complex systems (MCS) measurement approach. We suggest ecologically valid microworlds as an indispensable tool for future CPS research and applications.
Preliminary validation of the Review of Musculoskeletal System (ROMS) questionnaire.
Bershadsky, Boris; Kane, Robert L; Wuerz, Thomas; Jones, Morgan; Brighton, Brian; Stitzlein, Russell; Parker, Richard; Iannotti, Joseph P
2015-04-01
Measurement of clinical outcomes is necessary to define best practice. It requires a validated tool that can be easily applied as part of clinical practice. We present the preliminary validation of a brief self-reported Review of Musculoskeletal System (ROMS) questionnaire that captures functional limitations due to musculoskeletal problems and other medical and emotional conditions. Data were derived from a clinical outcomes database (Orthopaedic Minimal Data Set [OrthoMiDaS]) that combines patient-reported data collected as part of routine care and secondary data extracted from electronic medical records. The study utilized 82,873 encounters collected from 24,116 consecutive patients with problems in the upper and lower extremities. In addition to the ROMS, the study used version 2 of the Short Form-12 (SF-12v2), the Penn Shoulder Score (PSS), the Hip disability and Osteoarthritis Outcome Score (HOOS), and the Knee injury and Osteoarthritis Outcome Score (KOOS) questionnaires. Fifteen cross-sectional samples were used to evaluate the floor and ceiling effects as well as the construct and content validity. Five longitudinal cohorts were used to measure test-retest reliability and responsiveness. Standard statistical tests were applied. The floor and ceiling effects of the ROMS questionnaire in patients with shoulder, hip, and knee problems ranged from 1.3% to 8.5%. Construct-validity tests confirmed convergent and divergent validity of the ROMS. The tests also justified its additional value when the ROMS was used with joint-specific tools. When measuring test-retest reliability of the ROMS scales, intraclass correlation ranged from 0.80 to 0.90 at approximately one week and from 0.71 to 0.87 at approximately four weeks. Responsiveness of the ROMS was greater than that of the SF-12 and less than that of the joint-specific questionnaires. 
The ROMS is compatible with routine clinical process and has good psychometric properties in patients with shoulder, hip, and knee disorders. It can be used as a primary outcome tool for large observational studies and can supplement more specific tools in controlled studies. The ROMS was developed as a tool to measure and monitor the clinical status of the musculoskeletal system in a population of patients during and after treatment as well as over time. Copyright © 2015 by The Journal of Bone and Joint Surgery, Incorporated.
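The test-retest figures above are intraclass correlations. As a minimal sketch of how such a coefficient can be computed, the Python below implements the two-way random-effects, absolute-agreement, single-measurement form, often written ICC(2,1). The abstract does not say which ICC form the authors used, so that choice, the function name `icc_2_1`, and the input layout are assumptions for illustration only.

```python
def icc_2_1(scores):
    """ICC(2,1) for a table with one row per subject and one column per session.

    Assumed form: two-way random effects, absolute agreement, single measurement.
    """
    n = len(scores)      # number of subjects
    k = len(scores[0])   # number of repeated administrations
    grand = sum(sum(row) for row in scores) / (n * k)
    row_means = [sum(row) / k for row in scores]
    col_means = [sum(row[j] for row in scores) / n for j in range(k)]

    # Two-way ANOVA sums of squares: subjects, sessions, residual.
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for row in scores for x in row)
    ss_error = ss_total - ss_rows - ss_cols

    ms_rows = ss_rows / (n - 1)
    ms_cols = ss_cols / (k - 1)
    ms_error = ss_error / ((n - 1) * (k - 1))

    return (ms_rows - ms_error) / (
        ms_rows + (k - 1) * ms_error + k * (ms_cols - ms_error) / n
    )
```

Because this form measures absolute agreement, a constant shift between the first and second administration lowers the coefficient even when subjects keep exactly the same rank order.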
Twinn, S
1997-08-01
Although the complexity of undertaking qualitative research with non-English speaking informants has become increasingly recognized, few empirical studies exist which explore the influence of translation on the findings of the study. The aim of this exploratory study was therefore to examine the influence of translation on the reliability and validity of the findings of a qualitative research study. In-depth interviews were undertaken in Cantonese with a convenience sample of six women to explore their perceptions of factors influencing their uptake of Pap smears. Data analysis involved three stages. The first stage involved the translation and transcription of all the interviews into English independently by two translators as well as transcription into Chinese by a third researcher. The second stage involved content analysis of the three data sets to develop categories and themes and the third stage involved a comparison of the categories and themes generated from the Chinese and English data sets. Despite no significant differences in the major categories generated from the Chinese and English data, some minor differences were identified in the themes generated from the data. More significantly the results of the study demonstrated some important issues to consider when using translation in qualitative research, in particular the complexity of managing data when no equivalent word exists in the target language and the influence of the grammatical style on the analysis. In addition the findings raise questions about the significance of the conceptual framework of the research design and sampling to the validity of the study. The importance of using only one translator to maximize the reliability of the study was also demonstrated. In addition the author suggests the findings demonstrate particular problems in using translation in phenomenological research designs.
Treatment of Neurogenic Cough with Tramadol: A Pilot Study.
Dion, Gregory R; Teng, Stephanie E; Achlatis, Efstratios; Fang, Yixin; Amin, Milan R
2017-07-01
This study employs validated cough assessment tools to prospectively determine the impact of tramadol on cough severity and quality of life in subjects with neurogenic cough. The study was a prospective case series with planned data collection at a tertiary care academic medical center laryngology practice. Sixteen consecutively collected subjects with neurogenic cough prospectively completed pre- and posttreatment validated cough assessment tools, the cough severity index (CSI) and Leicester Cough Questionnaire (LCQ). All subjects in the study reported at least some improvement in their cough symptoms. In a Wilcoxon signed rank test that compared paired results, CSI scores improved from 23 to 14 and LCQ scores improved from 74 to 103 (P = .003 and P = .005, respectively). This small preliminary assessment suggests that tramadol warrants additional evaluation as a treatment for neurogenic cough.
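The paired pre/post comparison above relies on the Wilcoxon signed rank test. The following is a rough sketch of that test in plain Python, using a normal approximation with a tie correction for the two-sided P value. The function name `wilcoxon_signed_rank` is hypothetical, the abstract does not state whether the authors used an exact or approximate P value, and with a sample as small as 16 an exact method may be preferable.

```python
import math


def wilcoxon_signed_rank(before, after):
    """Return (w_plus, w_minus, p_two_sided) for paired samples.

    The P value uses a normal approximation with a tie correction
    (an assumption; exact tables are often preferred for small n).
    """
    diffs = [a - b for b, a in zip(before, after) if a != b]  # drop zero differences
    n = len(diffs)
    if n == 0:
        raise ValueError("all paired differences are zero")

    # Rank absolute differences; tied values share their average rank.
    order = sorted(range(n), key=lambda i: abs(diffs[i]))
    ranks = [0.0] * n
    tie_term = 0.0
    i = 0
    while i < n:
        j = i
        while j + 1 < n and abs(diffs[order[j + 1]]) == abs(diffs[order[i]]):
            j += 1
        avg_rank = (i + j) / 2 + 1
        for pos in range(i, j + 1):
            ranks[order[pos]] = avg_rank
        t = j - i + 1
        tie_term += t**3 - t
        i = j + 1

    w_plus = sum(r for r, d in zip(ranks, diffs) if d > 0)
    w_minus = sum(r for r, d in zip(ranks, diffs) if d < 0)

    mean = n * (n + 1) / 4
    var = n * (n + 1) * (2 * n + 1) / 24 - tie_term / 48
    z = (min(w_plus, w_minus) - mean) / math.sqrt(var)
    p_two_sided = math.erfc(abs(z) / math.sqrt(2))
    return w_plus, w_minus, p_two_sided
```

Calling it with each subject's pretreatment and posttreatment scores in matching order returns the two rank sums and the approximate P value; the sum of the two rank sums always equals n(n + 1)/2 for the n nonzero differences.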
Quality and rigor of the concept mapping methodology: a pooled study analysis.
Rosas, Scott R; Kane, Mary
2012-05-01
The use of concept mapping in research and evaluation has expanded dramatically over the past 20 years. Researchers in academic, organizational, and community-based settings have applied concept mapping successfully without the benefit of systematic analyses across studies to identify the features of a methodologically sound study. Quantitative characteristics and estimates of quality and rigor that could guide future studies are lacking. To address this gap, we conducted a pooled analysis of 69 concept mapping studies to describe characteristics across study phases, generate specific indicators of validity and reliability, and examine the relationship between select study characteristics and quality indicators. Individual study characteristics and estimates were pooled and quantitatively summarized, describing the distribution, variation and parameters for each. In addition, variation in the concept mapping data collection in relation to characteristics and estimates was examined. Overall, results suggest concept mapping yields strong internal representational validity and very strong sorting and rating reliability estimates. Validity and reliability were consistently high despite variation in participation and task completion percentages across data collection modes. The implications of these findings as a practical reference to assess the quality and rigor of future concept mapping studies are discussed. Copyright © 2011 Elsevier Ltd. All rights reserved.
Multiscale GPS tomography during COPS: validation and applications
NASA Astrophysics Data System (ADS)
Champollion, Cédric; Flamant, Cyrille; Masson, Frédéric; Gégout, Pascal; Boniface, Karen; Richard, Evelyne
2010-05-01
Accurate 3D description of the water vapour field is of interest for process studies such as convection initiation. None of the current techniques (LIDAR, satellite, radio soundings, GPS) can provide an all-weather continuous 3D field of moisture. The combination of GPS tomography with radio-soundings (and/or LIDAR) has been used for such process studies, exploiting both the vertical resolution of soundings and the high temporal density of GPS measurements. GPS tomography has been used at small scale (10 km horizontal resolution but in a 50 km² area) for process studies such as the ESCOMPTE experiment (Bastin et al., 2005) and at larger scale (50 km horizontal resolution) during IHOP_2002. But no extensive statistical validation has been done so far. The overarching goal of the COPS field experiment is to advance the quality of forecasts of orographically induced convective precipitation by four-dimensional observations and modeling of its life cycle for identifying the physical and chemical processes responsible for deficiencies in QPF over low-mountain regions. During the COPS field experiment, a network of about 100 GPS stations operated continuously for three months in an area of 500 km² in the East of France (Vosges Mountains) and West of Germany (Black Forest). While the mean spacing between GPS stations is about 50 km, an East-West GPS profile with a spacing of about 10 km is dedicated to high resolution tomography. One major goal of the GPS COPS experiment is to validate the GPS tomography with different spatial resolutions. Validation is based on additional radio-soundings and airborne / ground-based LIDAR measurements. The number and the high quality of vertically resolved water vapor observations provide a unique data set for GPS tomography validation.
Numerous tests have been done on real data to show the type of water vapor structures that can be imaged by GPS tomography, depending on the assimilation of additional data (radio soundings), the resolution of the tomography grid and the density of the GPS network. Finally, some applications to different case studies will be briefly presented.
Standard Specimen Reference Set: Pancreatic — EDRN Public Portal
The primary objective of the EDRN Pancreatic Cancer Working Group Proposal is to create a reference set consisting of well-characterized serum/plasma specimens to use as a resource for the development of biomarkers for the early detection of pancreatic adenocarcinoma. The testing of biomarkers on the same sample set permits direct comparison among them; thereby, allowing the development of a biomarker panel that can be evaluated in a future validation study. Additionally, the establishment of an infrastructure with core data elements and standardized operating procedures for specimen collection, processing and storage, will provide the necessary preparatory platform for larger validation studies when the appropriate marker/panel for pancreatic adenocarcinoma has been identified.
Sulfonamide-containing PTP 1B inhibitors: Docking studies, synthesis and model validation
NASA Astrophysics Data System (ADS)
Niu, Enli; Gan, Qiang; Chen, Xi; Feng, Changgen
2017-01-01
PTP 1B plays an important role in regulating the insulin signaling pathway and is regarded as a valid target for treating diabetes and obesity. In this paper, two novel sulfonamide-containing PTP 1B inhibitors were designed, synthesized under mild conditions, and characterized by FT-IR, 1H NMR, 13C NMR and elemental analysis. Single crystals of compounds 7 and 8 were obtained and their structures were determined by X-ray single crystal diffraction analysis. In addition, their inhibitory activities were predicted by a genetic algorithm and tested in an in vitro enzyme activity assay. Compound 8 showed good inhibitory activity, consistent with the docking studies.
Oetzel, John; Wallerstein, Nina; Solimon, Audrey; Garcia, Bruce; Siemon, Mark; Adeky, Sarah; Apachito, Gracie; Caston, Elissa; Finster, Carolyn; Belone, Lorenda; Tafoya, Greg
2011-06-01
The purpose of this study was to develop a measure of community capacity for American Indian communities. The study included development and testing phases to ensure face, content, construct, and predictive validity. There were 500 participants in two southwest tribes who completed a detailed community profile, which contained 21 common items in five dimensions (communication, sense of community, youth, elders, and language/culture). In addition, subscales of women and leadership were included in one tribe each. Confirmatory factor analysis primarily supported the factorial structure of the instruments, and the seven dimensions were found to correlate with previously validated measures of social capital, historical trauma, community influence, and physical health in expected directions.
Brewin, James; Tang, Jessica; Dasgupta, Prokar; Khan, Muhammad S; Ahmed, Kamran; Bello, Fernando; Kneebone, Roger; Jaye, Peter
2015-07-01
To evaluate the face, content and construct validity of the distributed simulation (DS) environment for technical and non-technical skills training in endourology. To evaluate the educational impact of DS for urology training. DS offers a portable, low-cost simulated operating room environment that can be set up in any open space. A prospective mixed methods design using established validation methodology was conducted in this simulated environment with 10 experienced and 10 trainee urologists. All participants performed a simulated prostate resection in the DS environment. Outcome measures included surveys to evaluate the DS, as well as comparative analyses of experienced and trainee urologists' performance using real-time and 'blinded' video analysis and validated performance metrics. Non-parametric statistical methods were used to compare differences between groups. The DS environment demonstrated face, content and construct validity for both non-technical and technical skills. Kirkpatrick level 1 evidence for the educational impact of the DS environment was shown. Further studies are needed to evaluate the effect of simulated operating room training on real operating room performance. This study has shown the validity of the DS environment for non-technical, as well as technical skills training. DS-based simulation appears to be a valuable addition to traditional classroom-based simulation training. © 2014 The Authors BJU International © 2014 BJU International Published by John Wiley & Sons Ltd.
Heitor Dos Santos, Maria João; Moreira, Sérgio; Carreiras, Joana; Cooper, Cary; Smeed, Matthew; Reis, Maria de Fátima; Pereira Miguel, José
2018-02-12
The main objective of this work was to translate the English version of ASSET (A Shortened Stress Evaluation Tool) into Portuguese and to validate its psychometric properties. Additionally, this work tested the convergent validity of the instrument. The translation and retroversion were conducted by experts and submitted to the authors for approval. Within an observational, cross-sectional study of mental health at the workplace, ASSET together with other scales was applied to a sample of 405 participants. The psychometric validity of the subscales was studied using confirmatory factor analysis. The factorial structure of ASSET is globally supported by the results, with the Perceptions of Your Job and Attitudes Towards your Organisation subscales requiring slight adjustments in the item structure and the Your Health subscales replicating the original structure. The convergent validity also supports the ASSET, showing that all subscales are significantly correlated with variables used to test convergence. Globally, the results constitute an important contribution to ASSET and open the possibility of its use in Portuguese-speaking countries. The results provide evidence of the validity of the instrument and, in particular, of the mental and physical health subscales. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2018. All rights reserved. No commercial use is permitted unless otherwise expressly granted.
The Scientific Status of Projective Techniques.
Lilienfeld, S O; Wood, J M; Garb, H N
2000-11-01
Although projective techniques continue to be widely used in clinical and forensic settings, their scientific status remains highly controversial. In this monograph, we review the current state of the literature concerning the psychometric properties (norms, reliability, validity, incremental validity, treatment utility) of three major projective instruments: Rorschach Inkblot Test, Thematic Apperception Test (TAT), and human figure drawings. We conclude that there is empirical support for the validity of a small number of indexes derived from the Rorschach and TAT. However, the substantial majority of Rorschach and TAT indexes are not empirically supported. The validity evidence for human figure drawings is even more limited. With a few exceptions, projective indexes have not consistently demonstrated incremental validity above and beyond other psychometric data. In addition, we summarize the results of a new meta-analysis intended to examine the capacity of these three instruments to detect child sexual abuse. Although some projective instruments were better than chance at detecting child sexual abuse, there were virtually no replicated findings across independent investigative teams. This meta-analysis also provides the first clear evidence of substantial file drawer effects in the projectives literature, as the effect sizes from published studies markedly exceeded those from unpublished studies. We conclude with recommendations regarding the (a) construction of projective techniques with adequate validity, (b) forensic and clinical use of projective techniques, and (c) education and training of future psychologists regarding projective techniques. © 2000 Association for Psychological Science.
Psychometric and cognitive validation of a social capital measurement tool in Peru and Vietnam.
De Silva, Mary J; Harpham, Trudy; Tuan, Tran; Bartolini, Rosario; Penny, Mary E; Huttly, Sharon R
2006-02-01
Social capital is a relatively new concept which has attracted significant attention in recent years. No consensus has yet been reached on how to measure social capital, resulting in a large number of different tools available. While psychometric validation methods such as factor analysis have been used by a few studies to assess the internal validity of some tools, these techniques rely on data already collected by the tool and are therefore not capable of eliciting what the questions are actually measuring. The Young Lives (YL) study includes quantitative measures of caregiver's social capital in four countries (Vietnam, Peru, Ethiopia, and India) using a short version of the Adapted Social Capital Assessment Tool (SASCAT). A range of different psychometric methods including factor analysis were used to evaluate the construct validity of SASCAT in Peru and Vietnam. In addition, qualitative cognitive interviews with 20 respondents from Peru and 24 respondents from Vietnam were conducted to explore what each question is actually measuring. We argue that psychometric validation techniques alone are not sufficient to adequately validate multi-faceted social capital tools for use in different cultural settings. Psychometric techniques show SASCAT to be a valid tool reflecting known constructs and displaying postulated links with other variables. However, results from the cognitive interviews present a more mixed picture with some questions being appropriately interpreted by respondents, and others displaying significant differences between what the researchers intended them to measure and what they actually do. Using evidence from a range of methods of assessing validity has enabled the modification of an existing instrument into a valid and low cost tool designed to measure social capital within larger surveys in Peru and Vietnam, with the potential for use in other developing countries following local piloting and cultural adaptation of the tool.
Noormohammadpour, Pardis; Tavana, Bahareh; Mansournia, Mohammad Ali; Zeinalizadeh, Mehdi; Mirzashahi, Babak; Rostami, Mohsen; Kordi, Ramin
2018-05-01
Translation and cultural adaptation of the National Institutes of Health (NIH) Task Force's minimal dataset. The purpose of this study was to evaluate validity and reliability of the Farsi version of NIH Task Force's recommended multidimensional minimal dataset for research on chronic low back pain (CLBP). Considering the high treatment cost of CLBP and its increasing prevalence, NIH Pain Consortium developed research standards (including recommendations for definitions, a minimum dataset, and outcome reporting) for studies regarding CLBP. Application of these recommendations could standardize research and improve comparability of different studies in CLBP. This study has three phases: translation of the dataset into Farsi and its cultural adaptation, assessment of the comprehensibility of the pre-final version via a pilot study, and investigation of the reliability and validity of the final version of the translated dataset. Subjects were 250 patients with CLBP. Test-retest reliability, content validity, and convergent validity (correlations among different dimensions of dataset and Farsi versions of Oswestry Disability Index, Roland Morris Disability Questionnaire, Fear-Avoidance Belief Questionnaire, and Beck Depression Inventory-II) were assessed. The Farsi version demonstrated good/excellent convergent validity (the correlation coefficient between impact dimension and ODI was r = 0.75 [P < 0.001], between impact dimension and Roland-Morris Disability Questionnaire was r = 0.80 [P < 0.001], and between psychological dimension and BDI was r = 0.62 [P < 0.001]). The test-retest reliability was also strong (intraclass correlation coefficient values ranged between 0.70 and 0.95) and the internal consistency was good/excellent (Cronbach's alpha coefficients for the two main dimensions, the impact dimension and the psychological dimension, were 0.91 and 0.82 [P < 0.001], respectively). In addition, its face validity and content validity were acceptable.
The Farsi version of minimal dataset for research on CLBP is a reliable and valid instrument for data gathering in patients with CLBP. This minimum dataset can be a step toward standardization of research regarding CLBP.
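Internal consistency in the abstract above is reported as Cronbach's alpha. A minimal sketch of the standard formula, alpha = k/(k-1) × (1 − Σ item variances / variance of total score), is given below in plain Python. The function name `cronbach_alpha` and the row-per-respondent input layout are assumptions for illustration and are not taken from the study.

```python
from statistics import variance


def cronbach_alpha(item_scores):
    """Cronbach's alpha for a table with one row per respondent, one column per item."""
    k = len(item_scores[0])  # number of items in the dimension
    # Sample variance of each item across respondents.
    item_vars = [variance([row[j] for row in item_scores]) for j in range(k)]
    # Sample variance of each respondent's summed score.
    total_var = variance([sum(row) for row in item_scores])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)
```

Alpha approaches 1 when items move together across respondents and falls as items become less correlated, which is why it is computed separately for each dimension rather than for the whole dataset.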
Fitzgibbons, Patrick L; Goldsmith, Jeffrey D; Souers, Rhona J; Fatheree, Lisa A; Volmar, Keith E; Stuart, Lauren N; Nowak, Jan A; Astles, J Rex; Nakhleh, Raouf E
2017-09-01
Laboratories must demonstrate analytic validity before any test can be used clinically, but studies have shown inconsistent practices in immunohistochemical assay validation. To assess changes in immunohistochemistry analytic validation practices after publication of an evidence-based laboratory practice guideline. A survey on current immunohistochemistry assay validation practices and on the awareness and adoption of a recently published guideline was sent to subscribers enrolled in one of 3 relevant College of American Pathologists proficiency testing programs and to additional nonsubscribing laboratories that perform immunohistochemical testing. The results were compared with an earlier survey of validation practices. Analysis was based on responses from 1085 laboratories that perform immunohistochemical staining. Of 1057 responses, 65.4% (691) were aware of the guideline recommendations before this survey was sent and 79.9% (550 of 688) of those have already adopted some or all of the recommendations. Compared with the 2010 survey, a significant number of laboratories now have written validation procedures for both predictive and nonpredictive marker assays and specifications for the minimum numbers of cases needed for validation. There was also significant improvement in compliance with validation requirements, with 99% (100 of 102) having validated their most recently introduced predictive marker assay, compared with 74.9% (326 of 435) in 2010. The difficulty in finding validation cases for rare antigens and resource limitations were cited as the biggest challenges in implementing the guideline. Dissemination of the 2014 evidence-based guideline validation practices had a positive impact on laboratory performance; some or all of the recommendations have been adopted by nearly 80% of respondents.
Validity in Mixed Methods Research in Education: The Application of Habermas' Critical Theory
ERIC Educational Resources Information Center
Long, Haiying
2017-01-01
Mixed methods approach has developed into the third methodological movement in educational research. Validity in mixed methods research as an important issue, however, has not been examined as extensively as that of quantitative and qualitative research. Additionally, the previous discussions of validity in mixed methods research focus on research…
Enhancing the Validity of a Quality of Life Measure for Autistic People
ERIC Educational Resources Information Center
McConachie, Helen; Mason, David; Parr, Jeremy R.; Garland, Deborah; Wilson, Colin; Rodgers, Jacqui
2018-01-01
Accurate measurement of quality of life (QoL) is important for evaluation of autism services and trials of interventions. We undertook psychometric validation of the World Health Organisation measure--WHOQoL-BREF, examined construct validity of the WHO Disabilities module and developed nine additional autism-specific items (ASQoL) from extensive…
Variety and Drift in the Functions and Purposes of Assessment in K-12 Education
ERIC Educational Resources Information Center
Ho, Andrew D.
2014-01-01
Background/Context: The target of assessment validation is not an assessment but the use of an assessment for a purpose. Although the validation literature often provides examples of assessment purposes, comprehensive reviews of these purposes are rare. Additionally, assessment purposes posed for validation are generally described as discrete and…
Yen, Po-Yin; Sousa, Karen H; Bakken, Suzanne
2014-10-01
In a previous study, we developed the Health Information Technology Usability Evaluation Scale (Health-ITUES), which is designed to support customization at the item level. Such customization matches the specific tasks/expectations of a health IT system while retaining comparability at the construct level, and provides evidence of its factorial validity and internal consistency reliability through exploratory factor analysis. In this study, we advanced the development of Health-ITUES to examine its construct validity and predictive validity. The health IT system studied was a web-based communication system that supported nurse staffing and scheduling. Using Health-ITUES, we conducted a cross-sectional study to evaluate users' perception toward the web-based communication system after system implementation. We examined Health-ITUES's construct validity through first and second order confirmatory factor analysis (CFA), and its predictive validity via structural equation modeling (SEM). The sample comprised 541 staff nurses in two healthcare organizations. The CFA (n=165) showed that a general usability factor accounted for 78.1%, 93.4%, 51.0%, and 39.9% of the explained variance in 'Quality of Work Life', 'Perceived Usefulness', 'Perceived Ease of Use', and 'User Control', respectively. The SEM (n=541) supported the predictive validity of Health-ITUES, explaining 64% of the variance in intention for system use. The results of CFA and SEM provide additional evidence for the construct and predictive validity of Health-ITUES. The customizability of Health-ITUES has the potential to support comparisons at the construct level, while allowing variation at the item level. We also illustrate application of Health-ITUES across stages of system development. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions.
NASA Astrophysics Data System (ADS)
Steger, Stefan; Brenning, Alexander; Bell, Rainer; Petschko, Helene; Glade, Thomas
2016-06-01
Empirical models are frequently applied to produce landslide susceptibility maps for large areas. Subsequent quantitative validation results are routinely used as the primary criteria to infer the validity and applicability of the final maps or to select one of several models. This study hypothesizes that such direct deductions can be misleading. The main objective was to explore discrepancies between the predictive performance of a landslide susceptibility model and the geomorphic plausibility of subsequent landslide susceptibility maps while a particular emphasis was placed on the influence of incomplete landslide inventories on modelling and validation results. The study was conducted within the Flysch Zone of Lower Austria (1,354 km²) which is known to be highly susceptible to landslides of the slide-type movement. Sixteen susceptibility models were generated by applying two statistical classifiers (logistic regression and generalized additive model) and two machine learning techniques (random forest and support vector machine) separately for two landslide inventories of differing completeness and two predictor sets. The results were validated quantitatively by estimating the area under the receiver operating characteristic curve (AUROC) with single holdout and spatial cross-validation technique. The heuristic evaluation of the geomorphic plausibility of the final results was supported by findings of an exploratory data analysis, an estimation of odds ratios and an evaluation of the spatial structure of the final maps. The results showed that maps generated by different inventories, classifiers and predictors appeared different while holdout validation revealed similar high predictive performances. Spatial cross-validation proved useful to expose spatially varying inconsistencies of the modelling results while additionally providing evidence for slightly overfitted machine learning-based models.
However, the highest predictive performances were obtained for maps that explicitly expressed geomorphically implausible relationships indicating that the predictive performance of a model might be misleading in the case a predictor systematically relates to a spatially consistent bias of the inventory. Furthermore, we observed that random forest-based maps displayed spatial artifacts. The most plausible susceptibility map of the study area showed smooth prediction surfaces while the underlying model revealed a high predictive capability and was generated with an accurate landslide inventory and predictors that did not directly describe a bias. However, none of the presented models was found to be completely unbiased. This study showed that high predictive performances cannot be equated with a high plausibility and applicability of subsequent landslide susceptibility maps. We suggest that greater emphasis should be placed on identifying confounding factors and biases in landslide inventories. A joint discussion between modelers and decision makers of the spatial pattern of the final susceptibility maps in the field might increase their acceptance and applicability.
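The contrast drawn above between holdout validation and spatial cross-validation can be sketched in a few lines of Python. The code below computes AUROC as the probability that a randomly chosen landslide location scores higher than a randomly chosen non-landslide location, and builds folds that hold out one square grid block at a time. The grid-block partitioning, the function names `auroc` and `spatial_folds`, and the `block_size` parameter are assumptions for illustration; the abstract does not describe how the authors formed their spatial partitions.

```python
import math
from collections import defaultdict


def auroc(labels, scores):
    """Probability that a random positive outranks a random negative (ties count 0.5)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative")
    wins = sum(1.0 if p > q else 0.5 if p == q else 0.0 for p in pos for q in neg)
    return wins / (len(pos) * len(neg))


def spatial_folds(coords, block_size):
    """Yield (train_idx, test_idx), holding out one square grid block at a time."""
    blocks = defaultdict(list)
    for i, (x, y) in enumerate(coords):
        # Assign each observation to the grid cell that contains it.
        blocks[(math.floor(x / block_size), math.floor(y / block_size))].append(i)
    for held_out in sorted(blocks):
        test_idx = blocks[held_out]
        train_idx = [i for b in sorted(blocks) if b != held_out for i in blocks[b]]
        yield train_idx, test_idx
```

In use, a model would be fitted on `train_idx`, scored on `test_idx`, and `auroc` computed per fold. Because test observations come from a block the model has not seen, the spread of per-fold values can reveal spatially varying performance that a single random holdout averages away.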
Vuong, Kylie; Armstrong, Bruce K; Weiderpass, Elisabete; Lund, Eiliv; Adami, Hans-Olov; Veierod, Marit B; Barrett, Jennifer H; Davies, John R; Bishop, D Timothy; Whiteman, David C; Olsen, Catherine M; Hopper, John L; Mann, Graham J; Cust, Anne E; McGeechan, Kevin
2016-08-01
Identifying individuals at high risk of melanoma can optimize primary and secondary prevention strategies. To develop and externally validate a risk prediction model for incident first-primary cutaneous melanoma using self-assessed risk factors. We used unconditional logistic regression to develop a multivariable risk prediction model. Relative risk estimates from the model were combined with Australian melanoma incidence and competing mortality rates to obtain absolute risk estimates. A risk prediction model was developed using the Australian Melanoma Family Study (629 cases and 535 controls) and externally validated using 4 independent population-based studies: the Western Australia Melanoma Study (511 case-control pairs), Leeds Melanoma Case-Control Study (960 cases and 513 controls), Epigene-QSkin Study (44 544, of which 766 with melanoma), and Swedish Women's Lifestyle and Health Cohort Study (49 259 women, of which 273 had melanoma). We validated model performance internally and externally by assessing discrimination using the area under the receiver operating curve (AUC). Additionally, using the Swedish Women's Lifestyle and Health Cohort Study, we assessed model calibration and clinical usefulness. The risk prediction model included hair color, nevus density, first-degree family history of melanoma, previous nonmelanoma skin cancer, and lifetime sunbed use. On internal validation, the AUC was 0.70 (95% CI, 0.67-0.73). On external validation, the AUC was 0.66 (95% CI, 0.63-0.69) in the Western Australia Melanoma Study, 0.67 (95% CI, 0.65-0.70) in the Leeds Melanoma Case-Control Study, 0.64 (95% CI, 0.62-0.66) in the Epigene-QSkin Study, and 0.63 (95% CI, 0.60-0.67) in the Swedish Women's Lifestyle and Health Cohort Study. Model calibration showed close agreement between predicted and observed numbers of incident melanomas across all deciles of predicted risk. 
In the external validation setting, there was higher net benefit when using the risk prediction model to classify individuals as high risk compared with classifying all individuals as high risk. The melanoma risk prediction model performs well and may be useful in prevention interventions reliant on a risk assessment using self-assessed risk factors.
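As an illustration of the discrimination measure reported above, the AUC can be read as the probability that a randomly chosen case receives a higher predicted risk than a randomly chosen control. The sketch below computes it that way; the function name and the toy risk scores are hypothetical and are not taken from the study.

```python
def auc(case_scores, control_scores):
    """Return the AUC as P(case score > control score); ties count as one half."""
    wins = 0.0
    for case in case_scores:
        for control in control_scores:
            if case > control:
                wins += 1.0
            elif case == control:
                wins += 0.5
    # Divide by the number of case-control pairs that were compared.
    return wins / (len(case_scores) * len(control_scores))


# Hypothetical predicted risks for three cases and three controls.
example_auc = auc([0.9, 0.8, 0.4], [0.5, 0.3, 0.2])
```

An AUC of 0.5 corresponds to chance-level discrimination and 1.0 to perfect separation, which is the scale on which the reported values of 0.63 to 0.70 should be read.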
Reliability and validity of the workplace social distance scale.
Yoshii, Hatsumi; Mandai, Nozomu; Saito, Hidemitsu; Akazawa, Kouhei
2014-10-29
Self-stigma, defined by a negative attitude toward oneself combined with the consciousness of being a target of prejudice, is a critical problem for psychiatric patients. Self-stigma studies among psychiatric patients have indicated that high stigma is predictive of detrimental effects such as the delay of treatment and decreases in social participation in patients, and levels of self-stigma should be statistically evaluated. In this study, we developed the Workplace Social Distance Scale (WSDS), rephrasing the eight items of the Japanese version of the Social Distance Scale (SDSJ) to apply to the work setting in Japan. We examined the reliability and validity of the WSDS among 83 psychiatric patients. Factor analysis extracted three factors from the scale items: "work relations," "shallow relationships," and "employment." These factors are similar to the assessment factors of the SDSJ. Cronbach's alpha coefficient for the WSDS was 0.753. The split-half reliability for the WSDS was 0.801, indicating significant correlations. In addition, the WSDS was significantly correlated with the SDSJ. These findings suggest that the WSDS represents an approximation of self-stigma in the workplace among psychiatric patients. Our study assessed the reliability and validity of the WSDS for measuring self-stigma in Japan. Future studies should investigate the reliability and validity of the scale in other countries.
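For readers unfamiliar with the internal-consistency statistic reported above, the following is a minimal sketch of Cronbach's alpha using its standard formula, alpha = k/(k-1) * (1 - sum of item variances / variance of total scores). The function name and the toy responses are hypothetical; the study's own data and software are not described in the abstract.

```python
from statistics import variance


def cronbach_alpha(rows):
    """Cronbach's alpha for a table with one row per respondent, one column per item."""
    k = len(rows[0])                      # number of items
    items = list(zip(*rows))              # transpose: one tuple of scores per item
    item_var_sum = sum(variance(col) for col in items)
    total_var = variance([sum(row) for row in rows])
    return k / (k - 1) * (1 - item_var_sum / total_var)


# Hypothetical answers of four respondents to a two-item scale.
example_alpha = cronbach_alpha([[1, 2], [2, 1], [3, 4], [4, 3]])
```

The sketch uses sample variances throughout and assumes the total-score variance is nonzero.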
Using meta-differential evolution to enhance a calculation of a continuous blood glucose level.
Koutny, Tomas
2016-09-01
We developed a new model of glucose dynamics. The model calculates blood glucose level as a function of transcapillary glucose transport. In previous studies, we validated the model with animal experiments. We used an analytical method to determine model parameters. In this study, we validate the model with subjects with type 1 diabetes. In addition, we combine the analytical method with meta-differential evolution. To validate the model with human patients, we obtained a data set from a type 1 diabetes study that was coordinated by the Jaeb Center for Health Research. We calculated a continuous blood glucose level from continuously measured interstitial fluid glucose level. We used 6 different scenarios to ensure robust validation of the calculation. Over 96% of calculated blood glucose levels fit the A+B zones of the Clarke Error Grid. No data set required any correction of model parameters during the time course of measuring. We successfully verified the possibility of calculating a continuous blood glucose level of subjects with type 1 diabetes. This study signals a successful transition of our research from an animal experiment to a human patient. Researchers can test our model with their data on-line at https://diabetes.zcu.cz. Copyright © 2016 The Author. Published by Elsevier Ireland Ltd. All rights reserved.
Creating a measure of portion control self-efficacy.
Fast, Lindsey C; Harman, Jennifer J; Maertens, Julie A; Burnette, Jeni L; Dreith, Francesca
2015-01-01
Over the last few decades, food portion sizes have steadily increased by as much as 700% (Young & Nestle, 2002). Food portions are often much larger than dietary guidelines recommend, leaving individuals to manage their food consumption on their own and making it necessary to understand individual factors impacting food consumption. In the current paper, we focus on self-efficacy for portion control. Specifically, across three studies, we developed and validated a new measure of portion control self-efficacy (PCSE). The PCSE measure yielded good fit statistics and had acceptable test-retest reliability using two cross-sectional surveys (Studies 1(a) and 1(b)). Results from Study 2 demonstrated construct and predictive validity of the PCSE using the Food Amount Rating Scale (FARS; Dohm & Striegel-Moore, 2002). Study 3 offered additional support for reliability and validity with a sample of overweight and obese adults currently trying to lose weight. Overall, findings indicate that the new PCSE measure is reliable and valid. Individuals often make inaccurate food portion estimates (Slawson & Eck, 1997; Yuhas, Bolland, & Bolland, 1989), which can lead to overeating and weight gain. Thus, the discussion centers on the need to incorporate PCSE in future research and intervention work targeting weight loss, health, and food consumption. Copyright © 2014 Elsevier Ltd. All rights reserved.
Assessing motivation orientations in schizophrenia: Scale development and validation.
Cooper, Shanna; Lavaysse, Lindsey M; Gard, David E
2015-01-30
Motivation deficits are common in several disorders including schizophrenia, and are an important factor in both functioning and treatment adherence. Self-Determination Theory (SDT), a leading macro-theory of motivation, has contributed a number of insights into how motivation is impaired in schizophrenia. Nonetheless, self-report measures of motivation appropriate for people with severe mental illness (including those that emphasize SDT) are generally lacking in the literature. To fill this gap, we adapted and abbreviated the well-validated General Causality Orientation Scale for use with people with schizophrenia and with other severe mental disorders (GCOS-clinical populations; GCOS-CP). In Study 1, we tested the similarity of our measure to the existing GCOS (using a college sample) and then validated this new measure in a schizophrenia and healthy control sample (Study 2). Results from Study 1 (N=360) indicated that the GCOS-CP was psychometrically similar to the original GCOS and provided good convergent and discriminant validity. In Study 2, the GCOS-CP was given to individuals with (N=44) and without schizophrenia (N=42). In line with both laboratory-based and observer-based research, people with schizophrenia showed lower motivational autonomy and higher impersonal/amotivated orientations. Additional applications of the GCOS-CP are discussed. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
Walls, Brittany D; Wallace, Elizabeth R; Brothers, Stacey L; Berry, David T R
2017-12-01
Recent concern about malingered self-report of symptoms of attention-deficit hyperactivity disorder (ADHD) in college students has resulted in an urgent need for scales that can detect feigning of this disorder. The present study provided further validation data for a recently developed validity scale for the Conners' Adult ADHD Rating Scale (CAARS), the CAARS Infrequency Index (CII), as well as for the Inconsistency Index (INC). The sample included 139 undergraduate students: 21 individuals with diagnoses of ADHD, 29 individuals responding honestly, 54 individuals responding randomly (full or half), and 35 individuals instructed to feign. Overall, the INC showed moderate sensitivity to random responding (.44-.63) and fairly high specificity to ADHD (.86-.91). The CII demonstrated modest sensitivity to feigning (.31-.46) and excellent specificity to ADHD (.91-.95). Sequential application of validity scales had correct classification rates of honest (93.1%), ADHD (81.0%), feigning (57.1%), half random (42.3%), and full random (92.9%). The present study suggests that the CII is modestly sensitive (true positive rate) to feigned ADHD symptoms, and highly specific (true negative rate) to ADHD. Additionally, this study highlights the utility of applying the CAARS validity scales in a sequential manner for identifying feigning. (PsycINFO Database Record (c) 2017 APA, all rights reserved).
Mörtberg, Ewa; Reuterskiöld, Lena; Tillfors, Maria; Furmark, Tomas; Öst, Lars-Göran
2017-06-01
Culturally validated rating scales for social anxiety disorder (SAD) are of significant importance when screening for the disorder, as well as for evaluating treatment efficacy. This study examined construct validity and additional psychometric properties of two commonly used scales, the Social Phobia Scale (SPS) and the Social Interaction Anxiety Scale (SIAS), in a clinical SAD population (n = 180) and in a normal population (n = 614) in Sweden. Confirmatory factor analyses of previously reported factor solutions were tested but did not reveal acceptable fit. Exploratory factor analyses (EFA) of the joint structure of the scales in the total population yielded a two-factor model (performance anxiety and social interaction anxiety), whereas EFA in the clinical sample revealed a three-factor solution, a social interaction anxiety factor and two performance anxiety factors. The SPS and SIAS showed good to excellent internal consistency, and discriminated well between patients with SAD and a normal population sample. Both scales showed good convergent validity with an established measure of SAD, whereas the discriminant validity of symptoms of social anxiety and depression could not be confirmed. The optimal cut-off scores for the SPS and SIAS were 18 and 22 points, respectively. It is concluded that the factor structure and the additional psychometric properties of the SPS and SIAS support the use of the scales for assessment in a Swedish population.
1991-09-20
Agreement Number: 91-07-16- VRX See section 3.1 for any additional information about the testing environment. As a result of this validation effort...Washington DC 20301 92-06991 AVF Control Number: AVF-VSR-511.0292 18 February 1992 91-07-16- VRX Ada COMPILER VALIDATION SUMMARY REPORT: Certificate Number...System: Ironics IV9001 board (AMD 29000) (Am29000 bare VME machine) Customer Agreement Number: 91-07-16- VRX See section 3.1 for any additional information
Smart Water Conservation System for Irrigated Landscape. ESTCP Cost and Performance Report
2016-10-01
water use by as much as 70% in support of meeting EO 13693. Additional performance objectives were to validate energy reduction, cost effectiveness... Additional performance objectives were to validate energy reduction, cost effectiveness, and system reliability while maintaining satisfactory plant health... developments. The demonstration was conducted for two different climatic regions in the southwestern part of the United States (U.S.), where a typical
González-de Paz, Luis; Devant-Altimir, Meritxell; Kostov, Belchin; Mitjavila-López, Joan; Navarro-Rubio, M Dolors; Sisó-Almirall, Antoni
2013-12-01
Assessing ethical endorsement is crucial to the study of professional performance and moral conduct. There are no specific instruments that verify patients and professional experiences of ethical practice in the specific area of primary health care (PHC). To study the psychometric properties of two questionnaires to identify professional and patient endorsement of normative ethics. A methodological study conducted in PHC centres from an urban area (Barcelona). A group of items from an ethical code were generated using a qualitative study with focus groups. Items underwent expert validation, item refinement and test-retest reliability. Two groups of items for PHC professionals and patients were validated. The structure of the constructs and the internal consistency were studied after participants completed the questionnaires. Principal component analysis with supplementary variables showed the utility of the validated questionnaires. The patients' questionnaire consisted of 17 general items plus 11 additional items on specific conditions, and the health professional's contained 24 general and 9 specific items. The construct of the questionnaires comprised a three-factor solution for patients and a five-factor solution for professionals. Principal component analysis with supplementary variables showed that patients with higher scores on ethical perception were associated with better opinions on health care quality and more confidence in professionals. In PHC professionals, higher scores were associated with effective knowledge of the code. Both questionnaires showed good psychometric properties and are valid to screen ethical attitudes. The instrument warrants further testing and use with culturally diverse patients and PHC professionals.
Mohammadsalehi, Narges; Mohammadbeigi, Abolfazl; Jadidi, Rahmatollah; Anbari, Zohreh; Ghaderi, Ebrahim; Akbari, Mojtaba
2015-09-01
Reliability and validity are the key concepts in measurement processes. Young internet addiction test (YIAT) is regarded as a valid and reliable questionnaire in English speaking countries for diagnosis of Internet-related behavior disorders. This study aimed at validating the Persian version of YIAT in the Iranian society. A pilot and a cross-sectional study were conducted on 28 and 254 students of Qom University of Medical Sciences, respectively, in order to validate the Persian version of YIAT. Forward and backward translations were conducted to develop a Persian version of the scale. Reliability was measured by test-retest, Cronbach's alpha and interclass correlation coefficient (ICC). Face, content and construct validity were approved by the importance score index, content validity ratio (CVR), content validity index (CVI), correlation matrix and factor analysis. The SPSS software was used for data analysis. The Cronbach's alpha was 0.917 (CI 95%; 0.901 - 0.931). The average of scale-level CVI was calculated to be 0.74; the CVI index for each item was higher than 0.83 and the average of CVI index was equal to 0.89. Factor analysis extracted three factors including personal activities disorder (PAD), emotional and mood disorder (EMD) and social activities disorder (SAD), with more than 55.8% of total variances. The ICC for different factors of Persian version of Young Questionnaire including PAD, EMD and for SAD was r = 0.884; CI 95%; 0.861 - 0.904, r = 0.766; CI 95%; 0.718 - 0.808 and r = 0.745; CI 95%; 0.686 - 0.795, respectively. Our study showed that the Persian version of YIAT is good and usable on Iranian people. The reliability of the instrument was very good. Moreover, the validity of the Persian translated version of the scale was sufficient. In addition, the reliability and validity of the three extracted factors of YIAT were evaluated and were acceptable.
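The abstract reports a content validity ratio (CVR) and item-level content validity index (CVI) without giving formulas. A minimal sketch follows, assuming Lawshe's definition of the CVR and the common convention that the item-level CVI is the share of experts rating an item 3 or 4 on a 4-point relevance scale; both are assumptions about the method, and the function names and panel ratings are hypothetical.

```python
def content_validity_ratio(n_essential, n_experts):
    """Lawshe's CVR: (n_e - N/2) / (N/2), ranging from -1 to 1."""
    half = n_experts / 2
    return (n_essential - half) / half


def item_cvi(relevance_ratings):
    """Item-level CVI: proportion of experts rating the item 3 or 4 (4-point scale)."""
    relevant = sum(1 for rating in relevance_ratings if rating >= 3)
    return relevant / len(relevance_ratings)


# Hypothetical panel of ten experts, eight of whom call the item essential.
example_cvr = content_validity_ratio(8, 10)
# Hypothetical relevance ratings from five experts for one item.
example_cvi = item_cvi([4, 3, 4, 2, 3])
```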
Mohammadsalehi, Narges; Mohammadbeigi, Abolfazl; Jadidi, Rahmatollah; Anbari, Zohreh; Ghaderi, Ebrahim; Akbari, Mojtaba
2015-01-01
Background: Reliability and validity are the key concepts in measurement processes. Young internet addiction test (YIAT) is regarded as a valid and reliable questionnaire in English speaking countries for diagnosis of Internet-related behavior disorders. Objectives: This study aimed at validating the Persian version of YIAT in the Iranian society. Patients and Methods: A pilot and a cross-sectional study were conducted on 28 and 254 students of Qom University of Medical Sciences, respectively, in order to validate the Persian version of YIAT. Forward and backward translations were conducted to develop a Persian version of the scale. Reliability was measured by test-retest, Cronbach’s alpha and interclass correlation coefficient (ICC). Face, content and construct validity were approved by the importance score index, content validity ratio (CVR), content validity index (CVI), correlation matrix and factor analysis. The SPSS software was used for data analysis. Results: The Cronbach’s alpha was 0.917 (CI 95%; 0.901 - 0.931). The average of scale-level CVI was calculated to be 0.74; the CVI index for each item was higher than 0.83 and the average of CVI index was equal to 0.89. Factor analysis extracted three factors including personal activities disorder (PAD), emotional and mood disorder (EMD) and social activities disorder (SAD), with more than 55.8% of total variances. The ICC for different factors of Persian version of Young Questionnaire including PAD, EMD and for SAD was r = 0.884; CI 95%; 0.861 - 0.904, r = 0.766; CI 95%; 0.718 - 0.808 and r = 0.745; CI 95%; 0.686 - 0.795, respectively. Conclusions: Our study showed that the Persian version of YIAT is good and usable on Iranian people. The reliability of the instrument was very good. Moreover, the validity of the Persian translated version of the scale was sufficient. In addition, the reliability and validity of the three extracted factors of YIAT were evaluated and were acceptable. PMID:26495253
Zahoor, Hafiz; Chan, Albert P. C.; Utama, Wahyudi P.; Gao, Ran; Zafar, Irfan
2017-01-01
This study attempts to validate a safety performance (SP) measurement model in the cross-cultural setting of a developing country. In addition, it highlights the variations in investigating the relationship between safety climate (SC) factors and SP indicators. The data were collected from forty under-construction multi-storey building projects in Pakistan. Based on the results of exploratory factor analysis, a SP measurement model was hypothesized. It was tested and validated by conducting confirmatory factor analysis on calibration and validation sub-samples respectively. The study confirmed the significant positive impact of SC on safety compliance and safety participation, and negative impact on number of self-reported accidents/injuries. However, number of near-misses could not be retained in the final SP model because it attained a lower standardized path coefficient value. Moreover, instead of safety participation, safety compliance established a stronger impact on SP. The study uncovered safety enforcement and promotion as a novel SC factor, whereas safety rules and work practices was identified as the most neglected factor. The study contributed to the body of knowledge by unveiling the deviations in existing dimensions of SC and SP. The refined model is expected to concisely measure the SP in the Pakistani construction industry, however, caution must be exercised while generalizing the study results to other developing countries. PMID:28350366
Nabbe, P; Le Reste, J Y; Guillou-Landreat, M; Munoz Perez, M A; Argyriadou, S; Claveria, A; Fernández San Martín, M I; Czachowski, S; Lingner, H; Lygidakis, C; Sowinska, A; Chiron, B; Derriennic, J; Le Prielec, A; Le Floch, B; Montier, T; Van Marwijk, H; Van Royen, P
2017-01-01
Depression occurs frequently in primary care. Its broad clinical variability makes it difficult to diagnose. This makes it essential that family practitioner (FP) researchers have validated tools to minimize bias in studies of everyday practice. Which tools validated against psychiatric examination, according to the major depression criteria of DSM-IV or 5, can be used for research purposes? An international FP team conducted a systematic review using the following databases: Pubmed, Cochrane and Embase, from 2000/01/01 to 2015/10/01. The three databases search identified 770 abstracts: 546 abstracts were analyzed after duplicates had been removed (224 duplicates); 50 of the validity studies were eligible and 4 studies were included. In 4 studies, the following tools were found: GDS-5, GDS-15, GDS-30, CESD-R, HADS, PSC-51 and HSCL-25. Sensitivity, Specificity, Positive Predictive Value, Negative Predictive Value were collected. The Youden index was calculated. Using efficiency data alone to compare these studies could be misleading. Additional reliability, reproducibility and ergonomic data will be essential for making comparisons. This study selected seven tools, usable in primary care research, for the diagnosis of depression. In order to define the best tools in terms of efficiency, reproducibility, reliability and ergonomics for research in primary care, and for care itself, further research will be essential. Copyright © 2016. Published by Elsevier Masson SAS.
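The accuracy measures collected in this review can all be derived from a 2x2 table of tool result against the reference diagnosis. The sketch below shows the standard definitions, including the Youden index J = sensitivity + specificity - 1; the function name and the counts are hypothetical and do not come from the included studies.

```python
def diagnostic_summary(tp, fn, tn, fp):
    """Standard accuracy measures from true/false positive and negative counts."""
    sensitivity = tp / (tp + fn)          # true positive rate
    specificity = tn / (tn + fp)          # true negative rate
    return {
        "sensitivity": sensitivity,
        "specificity": specificity,
        "ppv": tp / (tp + fp),            # positive predictive value
        "npv": tn / (tn + fn),            # negative predictive value
        "youden": sensitivity + specificity - 1,
    }


# Hypothetical counts: 100 patients with depression, 100 without.
example = diagnostic_summary(tp=80, fn=20, tn=90, fp=10)
```

Predictive values depend on the prevalence in the sample, which is one reason a comparison on these figures alone can mislead.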
Zahoor, Hafiz; Chan, Albert P C; Utama, Wahyudi P; Gao, Ran; Zafar, Irfan
2017-03-28
This study attempts to validate a safety performance (SP) measurement model in the cross-cultural setting of a developing country. In addition, it highlights the variations in investigating the relationship between safety climate (SC) factors and SP indicators. The data were collected from forty under-construction multi-storey building projects in Pakistan. Based on the results of exploratory factor analysis, a SP measurement model was hypothesized. It was tested and validated by conducting confirmatory factor analysis on calibration and validation sub-samples respectively. The study confirmed the significant positive impact of SC on safety compliance and safety participation, and negative impact on number of self-reported accidents/injuries. However, number of near-misses could not be retained in the final SP model because it attained a lower standardized path coefficient value. Moreover, instead of safety participation, safety compliance established a stronger impact on SP. The study uncovered safety enforcement and promotion as a novel SC factor, whereas safety rules and work practices was identified as the most neglected factor. The study contributed to the body of knowledge by unveiling the deviations in existing dimensions of SC and SP. The refined model is expected to concisely measure the SP in the Pakistani construction industry, however, caution must be exercised while generalizing the study results to other developing countries.
Scaling the Information Processing Demands of Occupations
ERIC Educational Resources Information Center
Haase, Richard F.; Jome, LaRae M.; Ferreira, Joaquim Armando; Santos, Eduardo J. R.; Connacher, Christopher C.; Sendrowitz, Kerrin
2011-01-01
The purpose of this study was to provide additional validity evidence for a model of person-environment fit based on polychronicity, stimulus load, and information processing capacities. In this line of research the confluence of polychronicity and information processing (e.g., the ability of individuals to process stimuli from the environment…
Perfectionism and the Five-Factor Model of Personality
ERIC Educational Resources Information Center
Rice, Kenneth G.; Ashby, Jeffrey S.; Slaney, Robert B.
2007-01-01
In this study of the discriminant, convergent, and incremental validity of the Almost Perfect Scale-Revised (APS-R), university students completed the APS-R, additional measures of perfectionism, the NEO Five-Factor Inventory-Form S, and measures of self-esteem and depression. The results revealed expected significant, but not completely…
Assessing Students' Communication Skills: Validation of a Global Rating
ERIC Educational Resources Information Center
Scheffer, Simone; Muehlinghaus, Isabel; Froehmel, Annette; Ortwein, Heiderose
2008-01-01
Communication skills training is an accepted part of undergraduate medical programs nowadays. In addition to learning experiences its importance should be emphasised by performance-based assessment. As detailed checklists have been shown to be not well suited for the assessment of communication skills for different reasons, this study aimed to…
The Mindful Attention Awareness Scale (MAAS) in an Argentine Population
ERIC Educational Resources Information Center
Montes, Silvana Andrea; Ledesma, Rubén Daniel; García, Natalia Mariana; Poó, Fernando Martín
2014-01-01
The purpose of this study was to provide evidence of validity for the Mindful Attention Awareness Scale scores in an Argentine sample (Spanish-language version). Results indicated satisfactory psychometric properties (a one-factor structure, good item discrimination, high reliability, and significant correlations with additional measures). This…
Cognitive Predictors of Rapid Picture Naming
ERIC Educational Resources Information Center
Decker, Scott L.; Roberts, Alycia M.; Englund, Julia A.
2013-01-01
Deficits in rapid automatized naming (RAN) have been found to be a sensitive cognitive marker for children with dyslexia. However, there is a lack of consensus regarding the construct validity and theoretical neuro-cognitive processes involved in RAN. Additionally, most studies investigating RAN include a narrow range of cognitive measures. The…
Association Mapping of Leaf Rust Response in Durum Wheat
USDA-ARS's Scientific Manuscript database
Resistance to leaf rust (Puccinia triticina Eriks.) is a main objective for durum wheat (Triticum durum Desf.) breeding. Association mapping on germplasm collections is now being used as an additional approach for the discovery and validation of major genes/QTLs. In this study, a collection of 164 el...
Zachariah, Marianne; Seidling, Hanna M; Neri, Pamela M; Cresswell, Kathrin M; Duke, Jon; Bloomrosen, Meryl; Volk, Lynn A; Bates, David W
2011-01-01
Background: Medication-related decision support can reduce the frequency of preventable adverse drug events. However, the design of current medication alerts often results in alert fatigue and high over-ride rates, thus reducing any potential benefits. Methods: The authors previously reviewed human-factors principles for relevance to medication-related decision support alerts. In this study, instrument items were developed for assessing the appropriate implementation of these human-factors principles in drug–drug interaction (DDI) alerts. User feedback regarding nine electronic medical records was considered during the development process. Content validity, construct validity through correlation analysis, and inter-rater reliability were assessed. Results: The final version of the instrument included 26 items associated with nine human-factors principles. Content validation on three systems resulted in the addition of one principle (Corrective Actions) to the instrument and the elimination of eight items. Additionally, the wording of eight items was altered. Correlation analysis suggests a direct relationship between system age and performance of DDI alerts (p=0.0016). Inter-rater reliability indicated substantial agreement between raters (κ=0.764). Conclusion: The authors developed and gathered preliminary evidence for the validity of an instrument that measures the appropriate use of human-factors principles in the design and display of DDI alerts. Designers of DDI alerts may use the instrument to improve usability and increase user acceptance of medication alerts, and organizations selecting an electronic medical record may find the instrument helpful in meeting their clinicians' usability needs. PMID:21946241
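The abstract reports inter-rater agreement as a kappa statistic without naming the variant. As a rough illustration, the sketch below assumes Cohen's kappa for two raters, kappa = (p_o - p_e) / (1 - p_e), where p_o is the observed agreement and p_e the agreement expected by chance. The function name and the ratings are hypothetical.

```python
from collections import Counter


def cohen_kappa(rater_a, rater_b):
    """Cohen's kappa for two raters who scored the same items."""
    n = len(rater_a)
    # Observed agreement: share of items on which both raters gave the same score.
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Chance agreement from each rater's marginal category frequencies.
    freq_a = Counter(rater_a)
    freq_b = Counter(rater_b)
    expected = sum(freq_a[cat] * freq_b[cat] for cat in freq_a) / (n * n)
    return (observed - expected) / (1 - expected)


# Hypothetical yes/no (1/0) judgements by two raters on four instrument items.
example_kappa = cohen_kappa([1, 1, 0, 0], [1, 0, 0, 0])
```

The sketch assumes the raters do not agree purely by construction (expected agreement below 1), otherwise the denominator is zero.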
Traditional Masculinity and Femininity: Validation of a New Scale Assessing Gender Roles
Kachel, Sven; Steffens, Melanie C.; Niedlich, Claudia
2016-01-01
Gender stereotype theory suggests that men are generally perceived as more masculine than women, whereas women are generally perceived as more feminine than men. Several scales have been developed to measure fundamental aspects of gender stereotypes (e.g., agency and communion, competence and warmth, or instrumentality and expressivity). Although omitted in later versions, Bem's original Sex Role Inventory included the items “masculine” and “feminine” in addition to more specific gender-stereotypical attributes. We argue that it is useful to be able to measure these two core concepts in a reliable, valid, and parsimonious way. We introduce a new and brief scale, the Traditional Masculinity-Femininity (TMF) scale, designed to assess central facets of self-ascribed masculinity-femininity. Studies 1–2 used known-groups approaches (participants differing in gender and sexual orientation) to validate the scale and provide evidence of its convergent validity. As expected, the TMF reliably measured a one-dimensional masculinity-femininity construct. Moreover, the TMF correlated moderately with other gender-related measures. Demonstrating incremental validity, the TMF predicted gender and sexual orientation better than established adjective-based measures. Furthermore, the TMF was connected to criterion characteristics, such as judgments as straight by laypersons for the whole sample, voice pitch characteristics for the female subsample, and contact with gay men for the male subsample, and outperformed other gender-related scales. Taken together, as long as gender differences continue to exist, we suggest that the TMF provides a valuable methodological addition for research into gender stereotypes. PMID:27458394
[Witch hunt and hypotheses loss].
Köhler, Dieter
2015-12-01
At present, randomized controlled trials (RCTs) in clinical research are usually of high quality. But this quality holds only for the included patients (intrinsic validity). It is common to generalize the results to more or less similar patients. Epistemology has rejected this inductive method for 250 years (D Hume, K R Popper). Therefore the external validity of RCT data is low. To solve this problem, additional clinical and pathophysiological data are necessary to generalize the results. Unfortunately, RCTs show lower quality in their premises. This is partly due to the influence of the pharmaceutical industry. A loss of universality of the hypothesis for an RCT fundamentally decreases its extrinsic validity. The article describes this problem with examples. © Georg Thieme Verlag KG Stuttgart · New York.
Aarons, Gregory A; McDonald, Elizabeth J; Connelly, Cynthia D; Newton, Rae R
2007-12-01
The purpose of this study was to examine the factor structure, reliability, and validity of the Family Assessment Device (FAD) among a national sample of Caucasian and Hispanic American families receiving public sector mental health services. A confirmatory factor analysis conducted to test model fit yielded equivocal findings. With few exceptions, indices of model fit, reliability, and validity were poorer for Hispanic Americans compared with Caucasian Americans. Contrary to our expectation, an exploratory factor analysis did not result in a better fitting model of family functioning. Without stronger evidence supporting a reformulation of the FAD, we recommend against such a course of action. Findings highlight the need for additional research on the role of culture in measurement of family functioning.
Roberts, Richard D; Schulze, Ralf; O'Brien, Kristin; MacCann, Carolyn; Reid, John; Maul, Andy
2006-11-01
Emotions measures represent an important means of obtaining construct validity evidence for emotional intelligence (EI) tests because they have the same theoretical underpinnings. Additionally, the extent to which both emotions and EI measures relate to intelligence is poorly understood. The current study was designed to address these issues. Participants (N = 138) completed the Mayer-Salovey-Caruso Emotional Intelligence Test (MSCEIT), two emotions measures, as well as four intelligence tests. Results provide mixed support for the model hypothesized to underlie the MSCEIT, with emotions research and EI measures failing to load on the same factor. The emotions measures loaded on the same factor as intelligence measures. The validity of certain EI components (in particular, Emotion Perception), as currently assessed, appears equivocal. Copyright 2006 APA, all rights reserved.
Developing an index to measure the voluntariness of consent to research.
Dugosh, Karen L; Festinger, David S; Marlowe, Douglas B; Clements, Nicolle T
2014-10-01
The goals of the current study were to expand the content domain and further validate the Coercion Assessment Scale (CAS), a measure of perceived coercion for criminally involved substance abusers being recruited into research. Unlike the few existing measures of this construct, the CAS identifies specific external sources of pressure that may influence one's decision to participate. In Phase 1, we conducted focus groups with criminal justice clients and stakeholders to expand the instrument by identifying additional sources of pressure. In Phase 2, we evaluated the expanded measure (i.e., endorsement rates, reliability, validity) in an ongoing research trial. Results identified new sources of pressure and provided evidence supporting the CAS's utility and reliability over time as well as convergent and discriminative validity. © The Author(s) 2014.
ERIC Educational Resources Information Center
Shindler, John; Taylor, Clint; Cadenas, Herminia; Jones, Albert
This study was a pilot effort to examine the efficacy of an analytic trait scale school climate assessment instrument and democratic change system in two urban high schools. Pilot study results indicate that the instrument shows promising soundness in that it exhibited high levels of validity and reliability. In addition, the analytic trait format…
ERIC Educational Resources Information Center
King, Christopher S.
2013-01-01
This dissertation presents a quantitative analysis of the relationship between intrinsic and extrinsic job satisfaction and the voluntary leaving behavior of IT professionals. In addition, the study adds to the validity and reliability of the Udechukwu and Mujtaba Mathematical Turnover Model. Surveyed within the study for their intrinsic and…
Uses, Gratifications and Avoidances and Voting Decisions in the 1976 Presidential Campaign.
ERIC Educational Resources Information Center
Semlak, William; Williams, Wenmouth, Jr.
In addition to factors analyzed in previous media research, the following three new validity checks were examined in a study of the audience's perceived uses and gratifications: the effect of time differences between administrations in a study, a comparison of respondents who claim to be gratified with those who do not, and the link between media…
ERIC Educational Resources Information Center
Benjamin, Jane; Petersen, Naomi Jeffery; Sink, Chris; Walker, Brenda
The construct validity of the revised Teacher Beliefs Survey (TBS) (S. Woolley and A. Woolley, 1999) was examined, and the populations that can use this instrument reliably were explored in this study. The TBS was designed to identify the dimensions of classroom practices. The revised instrument, which contained additional subscale items, was…
Validity of the alcohol purchase task: a meta-analysis.
Kiselica, Andrew M; Webber, Troy A; Bornovalova, Marina A
2016-05-01
Behavioral economists assess alcohol consumption as a function of unit price. This method allows construction of demand curves and demand indices, which are thought to provide precise numerical estimates of risk for alcohol problems. One of the more commonly used behavioral economic measures is the Alcohol Purchase Task (APT). Although the APT has shown promise as a measure of risk for alcohol problems, the construct validity and incremental utility of the APT remain unclear. This paper presents a meta-analysis of the APT literature. Sixteen studies were included in the meta-analysis. Studies were gathered via searches of the PsycInfo, PubMed, Web of Science and EconLit research databases. Random-effects meta-analyses with inverse variance weighting were used to calculate summary effect sizes for each demand index-drinking outcome relationship. Moderation of these effects by drinking status (regular versus heavy drinkers) was examined. Additionally, tests of the incremental utility of the APT indices in predicting drinking problems above and beyond measuring alcohol consumption were performed. The APT indices were correlated in the expected directions with drinking outcomes, although many effects were small in size. These effects were typically not moderated by the drinking status of the samples. Additionally, the intensity metric demonstrated incremental utility in predicting alcohol use disorder symptoms beyond measuring drinking. The Alcohol Purchase Task appears to have good construct validity, but limited incremental utility in estimating risk for alcohol problems. © 2015 Society for the Study of Addiction.
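As an illustration of the pooling step named in the abstract above (random-effects meta-analysis with inverse variance weighting), the following is a minimal sketch using the DerSimonian-Laird estimator. The choice of estimator, the function name `random_effects_summary`, and the input values are assumptions made for illustration; the abstract does not state which estimator or effect-size transformation the authors used.

```python
import math

def random_effects_summary(effects, variances):
    """Pool study effect sizes with inverse-variance weights.

    Uses the DerSimonian-Laird estimate of between-study variance (tau^2).
    This is an illustrative choice, not necessarily the authors' method.
    """
    k = len(effects)
    w = [1.0 / v for v in variances]                       # fixed-effect weights
    fixed = sum(wi * yi for wi, yi in zip(w, effects)) / sum(w)
    # Cochran's Q: weighted squared deviations from the fixed-effect mean
    q = sum(wi * (yi - fixed) ** 2 for wi, yi in zip(w, effects))
    c = sum(w) - sum(wi ** 2 for wi in w) / sum(w)
    tau2 = max(0.0, (q - (k - 1)) / c) if c > 0 else 0.0   # between-study variance
    w_star = [1.0 / (v + tau2) for v in variances]         # random-effects weights
    pooled = sum(wi * yi for wi, yi in zip(w_star, effects)) / sum(w_star)
    se = math.sqrt(1.0 / sum(w_star))                      # standard error of pooled effect
    return pooled, se, tau2

# Hypothetical effect sizes and sampling variances from three studies
pooled, se, tau2 = random_effects_summary([0.30, 0.30, 0.30], [0.01, 0.02, 0.04])
print(pooled, se, tau2)
```

When the studies agree exactly, as in the usage line, the between-study variance is estimated as zero and the result reduces to the fixed-effect inverse-variance mean.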
Development of a Three-Dimensional, Unstructured Material Response Design Tool
NASA Technical Reports Server (NTRS)
Schulz, Joseph C.; Stern, Eric C.; Muppidi, Suman; Palmer, Grant E.; Schroeder, Olivia
2017-01-01
A preliminary verification and validation of a new material response model is presented. This model, Icarus, is intended to serve as a design tool for the thermal protection systems of re-entry vehicles. Currently, the capability of the model is limited to simulating the pyrolysis of a material as a result of the radiative and convective surface heating imposed on the material from the surrounding high enthalpy gas. Since the major focus behind the development of Icarus has been model extensibility, the hope is that additional physics can be quickly added. This extensibility is critical since thermal protection systems are becoming increasingly complex, e.g. woven carbon polymers. Additionally, as a three-dimensional, unstructured, finite-volume model, Icarus is capable of modeling complex geometries. In this paper, the mathematical and numerical formulation is presented followed by a discussion of the software architecture and some preliminary verification and validation studies.
Predicting herbicide mixture effects on multiple algal species using mixture toxicity models.
Nagai, Takashi
2017-10-01
The validity of the application of mixture toxicity models, concentration addition and independent action, to a species sensitivity distribution (SSD) for calculation of a multisubstance potentially affected fraction was examined in laboratory experiments. Toxicity assays of herbicide mixtures using 5 species of periphytic algae were conducted. Two mixture experiments were designed: a mixture of 5 herbicides with similar modes of action and a mixture of 5 herbicides with dissimilar modes of action, corresponding to the assumptions of the concentration addition and independent action models, respectively. Experimentally obtained mixture effects on 5 algal species were converted to the fraction of affected (>50% effect on growth rate) species. The predictive ability of the concentration addition and independent action models with direct application to SSD depended on the mode of action of chemicals. That is, prediction was better for the concentration addition model than the independent action model for the mixture of herbicides with similar modes of action. In contrast, prediction was better for the independent action model than the concentration addition model for the mixture of herbicides with dissimilar modes of action. Thus, the concentration addition and independent action models could be applied to SSD in the same manner as for a single-species effect. The present study to validate the application of the concentration addition and independent action models to SSD supports the usefulness of the multisubstance potentially affected fraction as the index of ecological risk. Environ Toxicol Chem 2017;36:2624-2630. © 2017 SETAC.
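For readers unfamiliar with the two mixture models named in the abstract above, the sketch below shows their standard textbook forms: concentration addition predicts a mixture ECx from the component ECx values and mixture fractions, and independent action combines the fractional effects of the components multiplicatively. The function names and the numeric inputs are hypothetical, and the abstract does not describe how the authors applied these formulas to the SSD.

```python
import math

def ca_ecx_mixture(fractions, ecx_values):
    """Concentration addition: ECx of a mixture.

    fractions  -- share of each component in the mixture (should sum to 1)
    ecx_values -- ECx of each component alone, in the same concentration unit
    """
    return 1.0 / sum(p / ec for p, ec in zip(fractions, ecx_values))

def ia_mixture_effect(effects):
    """Independent action: combined fractional effect (0..1) of components
    that act independently, given each component's effect alone."""
    return 1.0 - math.prod(1.0 - e for e in effects)

# Hypothetical two-component mixture
print(ca_ecx_mixture([0.5, 0.5], [10.0, 10.0]))  # mixture ECx under concentration addition
print(ia_mixture_effect([0.5, 0.5]))             # combined effect under independent action
```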
Tomizawa, Ryoko; Yamano, Mayumi; Osako, Mitue; Hirabayashi, Naotugu; Oshima, Nobuo; Sigeta, Masahiro; Reeves, Scott
2017-12-01
Few scales currently exist to assess the quality of interprofessional teamwork through team members' perceptions of working together in mental health settings. The purpose of this study was to revise and validate an interprofessional scale to assess the quality of teamwork in inpatient psychiatric units and to use it multi-nationally. A literature review was undertaken to identify evaluative teamwork tools and develop an additional 12 items to ensure a broad global focus. Focus group discussions considered adaptation to different care systems using subjective judgements from 11 participants in a pre-test of items. Data quality, construct validity, reproducibility, and internal consistency were investigated in the survey using an international comparative design. Exploratory factor analysis yielded five factors with 21 items: 'patient/community centred care', 'collaborative communication', 'interprofessional conflict', 'role clarification', and 'environment'. High overall internal consistency, reproducibility, adequate face validity, and reasonable construct validity were shown in the USA and Japan. The revised Collaborative Practice Assessment Tool (CPAT) is a valid measure to assess the quality of interprofessional teamwork in psychiatry and identifies the best strategies to improve team performance. Furthermore, the revised scale will generate more rigorous evidence for collaborative practice in psychiatry internationally.
Multi-analyte validation in heterogeneous solution by ELISA.
Lakshmipriya, Thangavel; Gopinath, Subash C B; Hashim, Uda; Murugaiyah, Vikneswaran
2017-12-01
Enzyme-Linked Immunosorbent Assay (ELISA) is a standard assay that has been widely used to validate the presence of an analyte in solution. With the advancement of ELISA, different strategies have been demonstrated, and it has become a suitable immunoassay for a wide range of analytes. Herein, we attempted to provide additional evidence with ELISA to show its suitability for multi-analyte detection. To demonstrate this, three clinically relevant targets were chosen: a 16 kDa protein from Mycobacterium tuberculosis, human blood clotting Factor IXa (FIXa), and the tumour marker Squamous Cell Carcinoma (SCC) antigen. We adapted the routine steps of conventional ELISA to validate the occurrence of analytes in both homogeneous and heterogeneous solutions. With the homogeneous and heterogeneous solutions, we attained sensitivities of 2, 8 and 1 nM for the 16 kDa protein, FIXa and SCC antigen, respectively. Further, the specific multi-analyte validations showed similar sensitivities in the presence of human serum. The ELISA in this study has proven its applicability for genuine multiple-target validation in heterogeneous solution and can be followed for other target validations. Copyright © 2017 Elsevier B.V. All rights reserved.
Buntragulpoontawee, Montana; Phutrit, Suphatha; Tongprasert, Siam; Wongpakaran, Tinakon; Khunachiva, Jeeranan
2018-03-27
This study evaluated additional psychometric properties of the Thai version of the Disabilities of the Arm, Shoulder and Hand questionnaire (DASH-TH), including test-retest reliability, construct validity, and internal consistency, in patients with carpal tunnel syndrome. To determine construct validity, the Thai EuroQOL questionnaire (EQ-5D-5L) was also administered in order to examine convergent and divergent validity. Fifty patients completed both questionnaires. The DASH-TH showed excellent test-retest reliability (intraclass correlation coefficient = 0.811) and internal consistency (Cronbach's alpha = 0.911). The exploratory factor analysis yielded a six-factor solution, while the confirmatory factor analysis indicated that the hypothesized model adequately fit the data, with a comparative fit index of 0.967 and a Tucker-Lewis index of 0.964. The related subscales between the DASH-TH and the Thai EQ-5D-5L were significantly correlated, indicating the DASH-TH's convergent and discriminant validity. The DASH-TH demonstrated good reliability, internal consistency, construct validity, and multidimensionality in assessing upper extremity function in carpal tunnel syndrome patients.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Rainina, Evguenia I.; McCune, D. E.; Luna, Maria L.
2012-05-31
The goal of this study was to validate the previously observed high biological kill performance of PAEROSOL, a semi-dry, micro-aerosol decontamination technology, against common HAI in a non-human subject trial within a hospital setting of Madigan Army Medical Center (MAMC) on Joint Base Lewis-McChord in Tacoma, Washington. In addition to validating the disinfecting efficacy of PAEROSOL, the objectives of the trial included a demonstration of PAEROSOL environmental safety (i.e., impact on hospital interior materials and electronic equipment exposed during testing) and optimization of PAEROSOL parameters for future deployment.
NASA Technical Reports Server (NTRS)
Dunst, Ben
2011-01-01
The height at which smoke from a wildfire is injected into the atmosphere is an important parameter for climatology, because it determines how far the smoke can be transported. Using the MINX program to analyze MISR (Multi-angle Imaging Spectro-Radiometer) data, I digitized wildfire smoke plumes to add to an existing database of these heights for use by scientists studying smoke transport and plume dynamics. In addition to using MINX to do production digitizing of heights, I assisted in gathering lidar data for an ongoing validation of MINX and helped evaluate those data.
Brunault, Paul; Ballon, Nicolas; Gaillard, Philippe; Réveillère, Christian; Courtois, Robert
2014-01-01
Objective: The concept of food addiction has recently been proposed by applying the Diagnostic and Statistical Manual of Mental Disorders, Fourth Edition, Text Revision, criteria for substance dependence to eating behaviour. Food addiction has received increased attention given that it may play a role in binge eating, eating disorders, and the recent increase in obesity prevalence. Currently, there is no psychometrically sound tool for assessing food addiction in French. Our study aimed to test the psychometric properties of a French version of the Yale Food Addiction Scale (YFAS) by establishing its factor structure and construct validity in a nonclinical population. Method: A total of 553 participants were assessed for food addiction (French version of the YFAS) and binge eating behaviour (Bulimic Investigatory Test Edinburgh and Binge Eating Scale). We tested the scale’s factor structure (factor analysis for dichotomous data based on tetrachoric correlation coefficients), internal consistency, and construct validity with measures of binge eating. Results: Our results supported a 1-factor structure, which accounted for 54.1% of the variance. This tool had adequate reliability and high construct validity with measures of binge eating in this population, both in its diagnosis and symptom count version. A 2-factor structure explained an additional 9.1% of the variance, and could differentiate between patients with high, compared with low, levels of insight regarding addiction symptoms. Conclusions: In our study, we validated a psychometrically sound French version of the YFAS, both in its symptom count and diagnostic version. Future studies should validate this tool in clinical samples. PMID:25007281
Piette, Elizabeth R; Moore, Jason H
2018-01-01
Machine learning methods and conventions are increasingly employed for the analysis of large, complex biomedical data sets, including genome-wide association studies (GWAS). Reproducibility of machine learning analyses of GWAS can be hampered by biological and statistical factors, particularly so for the investigation of non-additive genetic interactions. Application of traditional cross validation to a GWAS data set may result in poor consistency between the training and testing data set splits due to an imbalance of the interaction genotypes relative to the data as a whole. We propose a new cross validation method, proportional instance cross validation (PICV), that preserves the original distribution of an independent variable when splitting the data set into training and testing partitions. We apply PICV to simulated GWAS data with epistatic interactions of varying minor allele frequencies and prevalences and compare performance to that of a traditional cross validation procedure in which individuals are randomly allocated to training and testing partitions. Sensitivity and positive predictive value are significantly improved across all tested scenarios for PICV compared to traditional cross validation. We also apply PICV to GWAS data from a study of primary open-angle glaucoma to investigate a previously-reported interaction, which fails to significantly replicate; PICV however improves the consistency of testing and training results. Application of traditional machine learning procedures to biomedical data may require modifications to better suit intrinsic characteristics of the data, such as the potential for highly imbalanced genotype distributions in the case of epistasis detection. The reproducibility of genetic interaction findings can be improved by considering this variable imbalance in cross validation implementation, such as with PICV. This approach may be extended to problems in other domains in which imbalanced variable distributions are a concern.
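The abstract above does not give the PICV algorithm itself. As a rough illustration of the general idea it describes (splitting data so that each partition preserves the original distribution of a chosen variable), the following sketch assigns samples to folds level by level. It is ordinary stratification on one variable, offered as an assumption about the kind of procedure involved, not as the authors' implementation; the function name `proportional_folds` and the example data are hypothetical.

```python
import random
from collections import defaultdict

def proportional_folds(values, n_folds=5, seed=0):
    """Assign sample indices to folds so that every level of `values`
    (e.g. a hypothetical interaction-genotype label) is spread as evenly
    as possible across the folds."""
    rng = random.Random(seed)
    by_level = defaultdict(list)
    for idx, v in enumerate(values):
        by_level[v].append(idx)

    folds = [[] for _ in range(n_folds)]
    offset = 0  # carry the round-robin position across levels to balance fold sizes
    for level in sorted(by_level):
        idxs = by_level[level]
        rng.shuffle(idxs)  # random allocation within each level
        for j, idx in enumerate(idxs):
            folds[(offset + j) % n_folds].append(idx)
        offset += len(idxs)
    return folds

# Hypothetical imbalanced variable: 90 samples of level 0, 10 of level 1
labels = [0] * 90 + [1] * 10
folds = proportional_folds(labels, n_folds=5)
for fold in folds:
    print(len(fold), sum(labels[i] for i in fold))
```

Each fold can then serve once as the testing partition, with the remaining folds forming the training partition, as in conventional k-fold cross validation.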
Yalın Sapmaz, Şermin; Ergin, Dilek; Şen Celasin, Nesrin; Özek Erkuran, Handan; Karaarslan, Duygu; Öztekin, Siğnem; Uzel Tanrıverdi, Bengisu; Köroğlu, Ertuğrul; Aydemir, Ömer
2017-01-01
The goal of this study was to assess the validity and reliability of the Turkish version of the DSM-5 (Diagnostic and Statistical Manual of Mental Disorders, Fifth Edition) Dissociative Symptoms Severity Scale-Child Form. The scale was prepared by translating and then back-translating the DSM-5 Dissociative Symptoms Severity Scale. The study groups included one group of 30 patients diagnosed with posttraumatic stress disorder who were treated in a child and adolescent psychiatry unit and another group of 83 healthy volunteers from middle and high schools in the community. For assessment, the Adolescent Dissociative Experiences Scale (ADES) was used in addition to the DSM-5 Dissociative Symptoms Severity Scale. Regarding the reliability of the DSM-5 Dissociative Symptoms Severity Scale, Cronbach's alpha was .824 and item-total score correlation coefficients were between .464 and .648. The test-retest correlation coefficient was calculated to be r = .784. In terms of construct validity, one factor accounted for 45.2% of the variance. Furthermore, in terms of concurrent validity, the scale showed a high correlation with the ADES. In conclusion, the Turkish version of the DSM-5 Dissociative Symptoms Severity Scale-Child Form is a valid and reliable tool for both clinical practice and research.
Lingner, Thomas; Kataya, Amr R. A.; Reumann, Sigrun
2012-01-01
We recently developed the first algorithms specifically for plants to predict proteins carrying peroxisome targeting signals type 1 (PTS1) from genome sequences. As validated experimentally, the prediction methods are able to correctly predict unknown peroxisomal Arabidopsis proteins and to infer novel PTS1 tripeptides. The high prediction performance is primarily determined by the large number and sequence diversity of the underlying positive example sequences, which were mainly derived from EST databases. However, a few constructs remained cytosolic in experimental validation studies, indicating sequencing errors in some ESTs. To identify erroneous sequences, we validated subcellular targeting of additional positive example sequences in the present study. Moreover, we analyzed the distribution of prediction scores separately for each orthologous group of PTS1 proteins, which generally resembled normal distributions with group-specific mean values. The cytosolic sequences commonly represented outliers of low prediction scores and were located at the very tail of a fitted normal distribution. Three statistical methods for identifying outliers were compared in terms of sensitivity and specificity. Their combined application allows elimination of erroneous ESTs from positive example data sets. This new post-validation method will further improve the prediction accuracy of both PTS1 and PTS2 protein prediction models for plants, fungi, and mammals. PMID:22415050
Lingner, Thomas; Kataya, Amr R A; Reumann, Sigrun
2012-02-01
We recently developed the first algorithms specifically for plants to predict proteins carrying peroxisome targeting signals type 1 (PTS1) from genome sequences. As validated experimentally, the prediction methods are able to correctly predict unknown peroxisomal Arabidopsis proteins and to infer novel PTS1 tripeptides. The high prediction performance is primarily determined by the large number and sequence diversity of the underlying positive example sequences, which were mainly derived from EST databases. However, a few constructs remained cytosolic in experimental validation studies, indicating sequencing errors in some ESTs. To identify erroneous sequences, we validated subcellular targeting of additional positive example sequences in the present study. Moreover, we analyzed the distribution of prediction scores separately for each orthologous group of PTS1 proteins, which generally resembled normal distributions with group-specific mean values. The cytosolic sequences commonly represented outliers of low prediction scores and were located at the very tail of a fitted normal distribution. Three statistical methods for identifying outliers were compared in terms of sensitivity and specificity. Their combined application allows elimination of erroneous ESTs from positive example data sets. This new post-validation method will further improve the prediction accuracy of both PTS1 and PTS2 protein prediction models for plants, fungi, and mammals.
An integrated bioanalytical method development and validation approach: case studies.
Xue, Y-J; Melo, Brian; Vallejo, Martha; Zhao, Yuwen; Tang, Lina; Chen, Yuan-Shek; Keller, Karin M
2012-10-01
We proposed an integrated bioanalytical method development and validation approach: (1) method screening based on analyte's physicochemical properties and metabolism information to determine the most appropriate extraction/analysis conditions; (2) preliminary stability evaluation using both quality control and incurred samples to establish sample collection, storage and processing conditions; (3) mock validation to examine method accuracy and precision and incurred sample reproducibility; and (4) method validation to confirm the results obtained during method development. This integrated approach was applied to the determination of compound I in rat plasma and compound II in rat and dog plasma. The effectiveness of the approach was demonstrated by the superior quality of three method validations: (1) a zero run failure rate; (2) >93% of quality control results within 10% of nominal values; and (3) 99% incurred sample within 9.2% of the original values. In addition, rat and dog plasma methods for compound II were successfully applied to analyze more than 900 plasma samples obtained from Investigational New Drug (IND) toxicology studies in rats and dogs with near perfect results: (1) a zero run failure rate; (2) excellent accuracy and precision for standards and quality controls; and (3) 98% incurred samples within 15% of the original values. Copyright © 2011 John Wiley & Sons, Ltd.
Güler, Sibel; Turan, F Nesrin
2015-09-30
Our aim was to translate the Quality of Life in Essential Tremor Questionnaire (QUEST) developed by Troster (2005) and to analyse the validity and reliability of this questionnaire. Two hundred twelve consecutive patients with essential tremor (ET) and forty-three control subjects were included in the study. Permission for the translation and validation of the QUEST scale was obtained. The translation was performed according to the guidelines provided by the publisher. After the translation, the final version of the scale was administered to both groups to determine its reliability and validity. The QUEST Physical, Psychosocial, Communication, Hobbies/leisure and Work/finance scores were 0.967, 0.968, 0.933, 0.964 and 0.925, respectively. There were good correlations between each of the QUEST scores that were indicative of good internal consistency. Additionally, we observed that all of the QUEST scores were most strongly related to the right and left arms (p=0.0001). However, we observed that all of the QUEST scores were weakly related to the voice, head and right leg (p=0.0001). These findings support the notion that the Turkish version of the Quality of Life in Essential Tremor (QUEST) questionnaire is a valid and reliable tool for the assessment of the quality of life of patients with ET.
Patient-completed or symptom-based screening tools for endometriosis: a scoping review.
Surrey, Eric; Carter, Cathryn M; Soliman, Ahmed M; Khan, Shahnaz; DiBenedetti, Dana B; Snabes, Michael C
2017-08-01
The objective of this review was to evaluate existing patient-completed screening questionnaires and/or symptom-based predictive models with respect to their potential for use as screening tools for endometriosis in adult women. Validated instruments were of particular interest. We conducted structured searches of PubMed and targeted searches of the gray literature to identify studies reporting on screening instruments used in endometriosis. Studies were screened according to inclusion and exclusion criteria that followed the PICOS (population, intervention, comparison, outcomes, study design) framework. A total of 16 studies were identified, of which 10 described measures for endometriosis in general, 2 described measures for endometriosis at specific sites, and 4 described measures for deep-infiltrating endometriosis. Only 1 study evaluated a questionnaire that was solely patient-completed. Most measures required physician, imaging, or laboratory assessments in addition to patient-completed questionnaires, and several measures relied on complex scoring. Validation for use as a screening tool in adult women with potential endometriosis was lacking in all studies, as most studies focused on diagnosis versus screening. This literature review did not identify any fully validated, symptom-based, patient-reported questionnaires for endometriosis screening in adult women.
Rosella, L; Bowman, C; Pach, B; Morgan, S; Fitzpatrick, T; Goel, V
2016-07-01
Most quality appraisal tools were developed for clinical medicine and tend to be study-specific with a strong emphasis on risk of bias. In order to be more relevant to public health, an appropriate quality appraisal tool needs to be less reliant on the evidence hierarchy and consider practice applicability. Given the broad range of study designs used in public health, the objective of this study was to develop and validate a meta-tool that combines public health-focused principles of appraisal coupled with a set of design-specific companion tools. Several design methods were used to develop and validate the tool including literature review, synthesis, and validation with a reference standard. A search of critical appraisal tools relevant to public health was conducted; core concepts were collated. The resulting framework was piloted during three feedback sessions with public health practitioners. Following subsequent revisions, the final meta-tool, the Meta Quality Appraisal Tool (MetaQAT), was then validated through a content analysis of appraisals conducted by two groups of experienced public health researchers (MetaQAT vs generic appraisal form). The MetaQAT framework consists of four domains: relevancy, reliability, validity, and applicability. In addition, a companion tool was assembled from existing critical appraisal tools to provide study design-specific guidance on validity appraisal. Content analysis showed similar methodological and generalizability concerns were raised by both groups; however, the MetaQAT appraisers commented more extensively on applicability to public health practice. Critical appraisal tools designed for clinical medicine have limitations for use in the context of public health. The meta-tool structure of the MetaQAT allows for rigorous appraisal, while allowing users to simultaneously appraise the multitude of study designs relevant to public health research and assess non-standard domains, such as applicability. 
Copyright © 2015 The Authors. Published by Elsevier Ltd. All rights reserved.
NASA Astrophysics Data System (ADS)
Babaeian, E.; Tuller, M.; Sadeghi, M.; Franz, T.; Jones, S. B.
2017-12-01
Soil Moisture Active Passive (SMAP) soil moisture products are commonly validated based on point-scale reference measurements, despite the exorbitant spatial scale disparity. The difference between the measurement depth of point-scale sensors and the penetration depth of SMAP further complicates evaluation efforts. Cosmic-ray neutron probes (CRNP) with an approximately 500-m radius footprint provide an appealing alternative for SMAP validation. This study is focused on the validation of SMAP level-4 root zone soil moisture products with 9-km spatial resolution based on CRNP observations at twenty U.S. reference sites with climatic conditions ranging from semiarid to humid. The CRNP measurements are often biased by additional hydrogen sources such as surface water, atmospheric vapor, or mineral lattice water, which sometimes yield unrealistic moisture values in excess of the soil water storage capacity. These effects were removed during CRNP data analysis. Comparison of SMAP data with corrected CRNP observations revealed a very high correlation for most of the investigated sites, which opens new avenues for validation of current and future satellite soil moisture products.
Schlautman, Brandon; Fajardo, Diego; Bougie, Tierney; Wiesman, Eric; Polashock, James; Vorsa, Nicholi; Steffan, Shawn; Zalapa, Juan
2015-01-27
The American cranberry, Vaccinium macrocarpon Ait., is an economically important North American fruit crop that is consumed because of its unique flavor and potential health benefits. However, a lack of abundant, genome-wide molecular markers has limited the adoption of modern molecular assisted selection approaches in cranberry breeding programs. To increase the number of available markers in the species, this study identified, tested, and validated microsatellite markers from existing nuclear and transcriptome sequencing data. In total, new primers were designed, synthesized, and tested for 979 SSR loci; 697 of the markers amplified allele patterns consistent with single locus segregation in a diploid organism and were considered polymorphic. Of the 697 polymorphic loci, 507 were selected for additional genetic diversity and segregation analyses in 29 cranberry genotypes. More than 95% of the 507 loci did not display segregation distortion at the p < 0.05 level, and contained moderate to high levels of polymorphism with a polymorphic information content >0.25. This comprehensive collection of developed and validated microsatellite loci represents a substantial addition to the molecular tools available for geneticists, genomicists, and breeders in cranberry and Vaccinium.
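The abstract above uses polymorphic information content (PIC) as its polymorphism threshold (>0.25). For orientation, a minimal sketch of the commonly used PIC formula for a locus with allele frequencies p_i is shown below: PIC = 1 - sum(p_i^2) - sum over i<j of 2 * p_i^2 * p_j^2. The function name and the example frequencies are hypothetical, and the abstract does not state which software or exact formula the authors used.

```python
from itertools import combinations

def pic(allele_freqs):
    """Polymorphic information content for one locus.

    allele_freqs -- allele frequencies at the locus (should sum to 1)
    """
    homozygosity = sum(p * p for p in allele_freqs)
    # Correction term for uninformative matings between identical heterozygotes
    cross = sum(2 * (p * p) * (q * q) for p, q in combinations(allele_freqs, 2))
    return 1.0 - homozygosity - cross

# Hypothetical loci: two equally frequent alleles, and four equally frequent alleles
print(pic([0.5, 0.5]))
print(pic([0.25, 0.25, 0.25, 0.25]))
```

Under this formula a biallelic locus can reach at most 0.375, so a threshold of 0.25 is attainable by both biallelic and multi-allelic markers.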
Stalmeijer, Renée E; Dolmans, Diana H J M; Wolfhagen, Ineke H A P; Muijtjens, Arno M M; Scherpbier, Albert J J A
2008-01-01
Research indicates that the quality of supervision strongly influences the learning of medical students in clinical practice. Clinical teachers need feedback to improve their supervisory skills. The available instruments either lack a clear theoretical framework or are not suitable for providing feedback to individual teachers. We developed an evaluation instrument based on the 'cognitive apprenticeship model'. The aim was to estimate the content validity of the developed instrument. Item relevance was rated on a five-point scale (1 = highly irrelevant, 5 = highly relevant) by three groups of stakeholders in undergraduate clinical teaching: educationalists (N = 12), doctors (N = 16) and students (N = 12). Additionally, stakeholders commented on content, wording and omission of items. The items were generally rated as very relevant (Mean = 4.3, SD = 0.38, response = 95%) and any differences between the stakeholder groups were small. The results led to elimination of 4 items, rewording of 13 items and addition of 1 item. The cognitive apprenticeship model appears to offer a useful framework for the development of an evaluation instrument aimed at providing feedback to individual clinical teachers on the quality of student supervision. Further studies in larger populations will have to establish the instrument's statistical validity and generalizability.
van Bokhorst-de van der Schueren, Marian A E; Guaitoli, Patrícia Realino; Jansma, Elise P; de Vet, Henrica C W
2014-02-01
Numerous nutrition screening tools for the hospital setting have been developed. The aim of this systematic review is to study construct or criterion validity and predictive validity of nutrition screening tools for the general hospital setting. A systematic review of English, French, German, Spanish, Portuguese and Dutch articles identified via MEDLINE, Cinahl and EMBASE (from inception to the 2nd of February 2012). Additional studies were identified by checking reference lists of identified manuscripts. Search terms included key words for malnutrition, screening or assessment instruments, and terms for hospital setting and adults. Data were extracted independently by 2 authors. Only studies expressing the (construct, criterion or predictive) validity of a tool were included. 83 studies (32 screening tools) were identified: 42 studies on construct or criterion validity versus a reference method and 51 studies on predictive validity on outcome (i.e. length of stay, mortality or complications). None of the tools performed consistently well to establish the patients' nutritional status. For the elderly, MNA performed fair to good, for the adults MUST performed fair to good. SGA, NRS-2002 and MUST performed well in predicting outcome in approximately half of the studies reviewed in adults, but not in older patients. Not one single screening or assessment tool is capable of adequate nutrition screening as well as predicting poor nutrition related outcome. Development of new tools seems redundant and will most probably not lead to new insights. New studies comparing different tools within one patient population are required. Copyright © 2013 Elsevier Ltd and European Society for Clinical Nutrition and Metabolism. All rights reserved.
Modelling of individual subject ozone exposure response kinetics.
Schelegle, Edward S; Adams, William C; Walby, William F; Marion, M Susan
2012-06-01
A better understanding of individual subject ozone (O₃) exposure response kinetics will provide insight into how to improve models used in the risk assessment of ambient ozone exposure. To develop a simple two compartment exposure-response model that describes individual subject decrements in forced expiratory volume in one second (FEV₁) induced by the acute inhalation of O₃ lasting up to 8 h. FEV₁ measurements of 220 subjects who participated in 14 previously completed studies were fit to the model using both particle swarm and nonlinear least squares optimization techniques to identify three subject-specific coefficients producing minimum "global" and local errors, respectively. Observed and predicted decrements in FEV₁ of the 220 subjects were used for validation of the model. Further validation was provided by comparing the observed O₃-induced FEV₁ decrements in an additional eight studies with predicted values obtained using model coefficients estimated from the 220 subjects used in cross validation. Overall the individual subject measured and modeled FEV₁ decrements were highly correlated (mean R² of 0.69 ± 0.24). In addition, it was shown that a matrix of individual subject model coefficients can be used to predict the mean and variance of group decrements in FEV₁. This modeling approach provides insight into individual subject O₃ exposure response kinetics and provides a potential starting point for improving the risk assessment of environmental O₃ exposure.
Insights on in vitro models for safety and toxicity assessment of cosmetic ingredients.
Almeida, Andreia; Sarmento, Bruno; Rodrigues, Francisca
2017-03-15
According to the current European legislation, the safety assessment of each individual cosmetic ingredient of any formulation is the basis for the safety evaluation of a cosmetic product. Also, animal testing in the European Union is prohibited for cosmetic ingredients and products since 2004 and 2009, respectively. Additionally, the commercialization of any cosmetic products containing ingredients tested on animal models was forbidden in 2009. In consequence of these boundaries, the European Centre for the Validation of Alternative Methods (ECVAM) proposes a list of validated cell-based in vitro models for predicting the safety and toxicity of cosmetic ingredients. These models have been demonstrated as valuable and effective tools to overcome the limitations of animal in vivo studies. Although the use of in vitro cell-based models for the evaluation of absorption and permeability of cosmetic ingredients is widespread, a detailed study on the properties of these platforms and the in vitro-in vivo correlation compared with human data are required. Moreover, additional efforts must be taken to develop in vitro models to predict carcinogenicity, repeat dose toxicity and reproductive toxicity, for which no alternative in vitro methods are currently available. This review paper summarizes and characterizes the most relevant in vitro models validated by ECVAM employed to predict the safety and toxicology of cosmetic ingredients. Copyright © 2017 Elsevier B.V. All rights reserved.
Schmidt, A F; Nielen, M; Withrow, S J; Selmic, L E; Burton, J H; Klungel, O H; Groenwold, R H H; Kirpensteijn, J
2016-03-01
Canine osteosarcoma is the most common bone cancer, and an important cause of mortality and morbidity, in large purebred dogs. Previously we constructed two multivariable models to predict a dog's 5-month or 1-year mortality risk after surgical treatment for osteosarcoma. According to the 5-month model, dogs with a relatively low risk of 5-month mortality benefited most from additional chemotherapy treatment. In the present study, we externally validated these results using an independent cohort study of 794 dogs. External performance of our prediction models showed some disagreement between observed and predicted risk, mean difference: -0.11 (95% confidence interval [95% CI] -0.29; 0.08) for 5-month risk and 0.25 (95% CI 0.10; 0.40) for 1-year mortality risk. After updating the intercept, agreement improved: -0.0004 (95% CI -0.16; 0.16) and -0.002 (95% CI -0.15; 0.15). The chemotherapy by predicted mortality risk interaction (P-value=0.01) showed that the effectiveness of chemotherapy compared with no chemotherapy was modified by 5-month mortality risk: dogs with a relatively lower risk of mortality benefited most from additional chemotherapy. Chemotherapy effectiveness on 1-year mortality was not significantly modified by predicted risk (P-value=0.28). In conclusion, this external validation study confirmed that our multivariable risk prediction models can predict a patient's mortality risk and that dogs with a relatively lower risk of 5-month mortality seem to benefit most from chemotherapy. Copyright © 2016 Elsevier B.V. All rights reserved.
The Multiple-Use of Accountability Assessments: Implications for the Process of Validation
ERIC Educational Resources Information Center
Koch, Martha J.
2014-01-01
Implications of the multiple-use of accountability assessments for the process of validation are examined. Multiple-use refers to the simultaneous use of results from a single administration of an assessment for its intended use and for one or more additional uses. A theoretical discussion of the issues for validation which emerge from…
2013-01-01
Background: A scale validated in one language is not automatically valid in another language or culture. The purpose of this study was to validate the English version of the UNESP-Botucatu multidimensional composite pain scale (MCPS) to assess postoperative pain in cats. The English version was developed using translation, back-translation, and review by individuals with expertise in feline pain management. In sequence, validity and reliability tests were performed. Results: Of the three domains identified by factor analysis, the internal consistency was excellent for ‘pain expression’ and ‘psychomotor change’ (0.86 and 0.87) but not for ‘physiological variables’ (0.28). Relevant changes in pain scores at clinically distinct time points (e.g., post-surgery, post-analgesic therapy), confirmed the construct validity and responsiveness (Wilcoxon test, p < 0.001). Favorable correlation with the IVAS scores (p < 0.001) and moderate to very good agreement between blinded observers and ‘gold standard’ evaluations, supported criterion validity. The cut-off point for rescue analgesia was > 7 (range 0–30 points) with 96.5% sensitivity and 99.5% specificity. Conclusions: The English version of the UNESP-Botucatu-MCPS is a valid, reliable and responsive instrument for assessing acute pain in cats undergoing ovariohysterectomy, when used by anesthesiologists or anesthesia technicians. The cut-off point for rescue analgesia provides an additional tool for guiding analgesic therapy. PMID:23867090
Brondani, Juliana T; Mama, Khursheed R; Luna, Stelio P L; Wright, Bonnie D; Niyom, Sirirat; Ambrosio, Jennifer; Vogel, Pamela R; Padovani, Carlos R
2013-07-17
A scale validated in one language is not automatically valid in another language or culture. The purpose of this study was to validate the English version of the UNESP-Botucatu multidimensional composite pain scale (MCPS) to assess postoperative pain in cats. The English version was developed using translation, back-translation, and review by individuals with expertise in feline pain management. In sequence, validity and reliability tests were performed. Of the three domains identified by factor analysis, the internal consistency was excellent for 'pain expression' and 'psychomotor change' (0.86 and 0.87) but not for 'physiological variables' (0.28). Relevant changes in pain scores at clinically distinct time points (e.g., post-surgery, post-analgesic therapy), confirmed the construct validity and responsiveness (Wilcoxon test, p < 0.001). Favorable correlation with the IVAS scores (p < 0.001) and moderate to very good agreement between blinded observers and 'gold standard' evaluations, supported criterion validity. The cut-off point for rescue analgesia was > 7 (range 0-30 points) with 96.5% sensitivity and 99.5% specificity. The English version of the UNESP-Botucatu-MCPS is a valid, reliable and responsive instrument for assessing acute pain in cats undergoing ovariohysterectomy, when used by anesthesiologists or anesthesia technicians. The cut-off point for rescue analgesia provides an additional tool for guiding analgesic therapy.
Smith, Kathryn E; Ellison, Jo M; Crosby, Ross D; Engel, Scott G; Mitchell, James E; Crow, Scott J; Peterson, Carol B; Le Grange, Daniel; Wonderlich, Stephen A
2017-09-01
The DSM-5 includes severity specifiers (i.e., mild, moderate, severe, extreme) for anorexia nervosa (AN), bulimia nervosa (BN), and binge-eating disorder (BED), which are determined by weight status (AN) and frequencies of binge-eating episodes (BED) or inappropriate compensatory behaviors (BN). Given limited data regarding the validity of eating disorder (ED) severity specifiers, this study examined the concurrent and predictive validity of severity specifiers in AN, BN, and BED. Adults with AN (n = 109), BN (n = 76), and BED (n = 216) were identified from previous datasets. Concurrent validity was assessed by measures of ED psychopathology, depression, anxiety, quality of life, and physical health. Predictive validity was assessed by ED symptoms at the end of the treatment in BN and BED. Severity categories did not differ in baseline validators, though the mild AN group evidenced greater ED symptoms compared to the severe group. In BN, greater severity was related to greater end of treatment binge-eating and compensatory behaviors, and lower likelihood of abstinence; however, in BED, greater severity was related to lower ED symptoms at the end of the treatment. Results demonstrated limited support for the validity of DSM-5 severity specifiers. Future research is warranted to explore additional validators and possible alternative indicators of severity in EDs. © 2017 Wiley Periodicals, Inc.
One-range addition theorems for derivatives of Slater-type orbitals.
Guseinov, Israfil
2004-06-01
Using addition theorems for STOs introduced by the author with the help of complete orthonormal sets of ψ^(α)-ETOs (Guseinov II (2003) J Mol Model 9:190-194), where α = 1, 0, -1, -2, ..., a large number of one-range addition theorems for first and second derivatives of STOs are established. These addition theorems are especially useful for computation of multicenter-multielectron integrals over STOs that arise in the Hartree-Fock-Roothaan approximation and also in the Hylleraas function method, which play a significant role for the study of electronic structure and electron-nuclei interaction properties of atoms, molecules, and solids. The relationships obtained are valid for arbitrary quantum numbers, screening constants and location of STOs.
Scott, Whitney; McCracken, Lance M
2015-06-01
The Patient Global Impression of Change (PGIC) measure has frequently been used as an indicator of meaningful change in treatments for chronic pain. However, limited research has examined the validity of PGIC items despite their wide adoption in clinical trials for pain. Additionally, research has not yet examined predictors of PGIC ratings following psychologically based treatment for pain. The purpose of the present study was to examine the validity, factor structure, and predictors of PGIC ratings following an interdisciplinary psychologically based treatment for chronic pain. Patients with chronic pain (N = 476) completed standard assessments of pain, daily functioning, and depression before and after a 4-week treatment program based on the principles of acceptance and commitment therapy. Following the program, patients rated 1 item assessing their impression of change overall and several items assessing their impression of more specific changes: physical and social functioning, work-related activities, mood, and pain. Results indicated that the global and specific impression of change items represent a single component. In the context of the acceptance and commitment therapy-based treatment studied here, overall PGIC ratings appeared to be influenced to a greater degree by patients' experienced improvements in physical activities and mood than by improvements in pain. The findings suggest that in addition to a single overall PGIC rating, domain-specific items may be relevant for some treatment trials. This article reports on the validity and predictors of patients' impression of change ratings following interdisciplinary psychologically based treatment for pain. In addition to a single overall PGIC rating, domain-specific items may be important for clinicians and researchers to consider depending on the focus of treatment. Copyright © 2015 American Pain Society. Published by Elsevier Inc. All rights reserved.
Marques, Alda; Jácome, Cristina; Gonçalves, Ana; Silva, Sara; Lucas, Carla; Cruz, Joana; Gabriel, Raquel
2014-06-01
This study aimed to validate the Comprehensive International Classification of Functioning, Disability and Health (ICF) Core Set for obstructive pulmonary diseases (OPDs) from the perspective of patients with chronic obstructive pulmonary disease. A cross-sectional qualitative study was carried out with outpatients with chronic obstructive pulmonary disease using focus groups with an ICF-based approach. Qualitative data were analysed using the meaning condensation procedure by two researchers with expertise in the ICF. Thirty-two participants (37.5% women; 63.8 ± 11.3 years old) were included in six focus groups. A total of 61 (86%) ICF categories of the Comprehensive ICF Core Set for OPD were confirmed. Thirty-nine additional second-level categories not included in the Core Set were identified: 15 from the body functions component, four from the body structures, nine from the activities and participation and 11 from the environmental factors. The majority of the categories included in the Comprehensive ICF Core Set for OPD were confirmed from the patients' perspective. However, additional categories, not included in the Core Set, were also reported. The Core Set categories that were not confirmed and the additional categories need to be investigated further to develop an instrument tailored to patients' needs. This will promote patient-centred assessments and rehabilitation interventions.
Merghati-Khoei, E; Maasoumi, R; Rahdari, F; Bayat, A; Hajmirzaei, S; Lotfi, S; Hajiaghababaei, M; Emami-Razavi, SH; Korte, JE; Atoof, F
2016-01-01
Study design: This is a cross-sectional study. Objectives: The objective of this study was to examine the psychometric properties of the Sexual Adjustment Questionnaire (SAQ) for Iranian people with spinal cord injury. Setting: This study was conducted in the Brain and Spinal Cord Injury Research Center, Tehran University of Medical Sciences, Tehran, Iran. Methods: We assessed the psychometric properties of the SAQ, with 200 participants (men = 146, women = 54) completing the scale. An evaluation of its test–retest reliability was performed over a 2-week period, on a subsample of 30 patients recruited from the overall group. Cronbach’s α-coefficient was computed for assessment of internal consistency reliability. In addition, content and face validity were examined by an expert committee. Construct validity was assessed by examining convergent and discriminant validity. Finally, exploratory factor analysis was used to extract the factor structure of the questionnaire. Results: The Cronbach’s α and intraclass correlation coefficient were 0.77 and 0.72, respectively. With regard to construct validity, there was a significant (P = 0.009) negative correlation (r = −0.28) between the SAQ score and age. Those with lower levels of education scored significantly lower on the SAQ (P = 0.04). The exploratory factor analysis indicated a four-factor structure for the questionnaire, accounting for 68.9% of the observed variance. The expert committee approved the face and content validity of the developed measure. Conclusion: The SAQ is a valid measure for assessing sexual adjustment in people with spinal cord injury. The evaluation of sexual well-being may be useful in clinical trials and practical settings. PMID:25917953
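Several abstracts in this listing report Cronbach's α as the measure of internal consistency. The sketch below shows how that coefficient is commonly computed from an item-by-respondent score matrix. The function name `cronbach_alpha` and the toy scores are hypothetical illustrations and are not data from any of the studies listed here.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondents, each a list of item scores.

    alpha = k / (k - 1) * (1 - sum(item variances) / variance(total scores))
    """
    n_items = len(scores[0])
    if n_items < 2:
        raise ValueError("at least two items are required")
    # Sample variance of each item (column) across respondents
    item_variances = [variance([row[i] for row in scores]) for i in range(n_items)]
    # Sample variance of each respondent's summed score
    total_variance = variance([sum(row) for row in scores])
    return n_items / (n_items - 1) * (1 - sum(item_variances) / total_variance)


# Hypothetical toy data: 5 respondents answering 3 items
toy_scores = [
    [3, 4, 3],
    [4, 5, 5],
    [2, 2, 3],
    [5, 4, 4],
    [1, 2, 2],
]
print(round(cronbach_alpha(toy_scores), 3))
```

The sketch uses sample variances (n - 1 denominator) throughout; as long as the same variance convention is applied to the items and to the totals, the ratio and therefore α are unchanged.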
Risk prediction models of breast cancer: a systematic review of model performances.
Anothaisintawee, Thunyarat; Teerawattananon, Yot; Wiratkapun, Chollathip; Kasamesup, Vijj; Thakkinstian, Ammarin
2012-05-01
An increasing number of risk prediction models have been developed for estimating breast cancer risk in individual women. However, the performance of these models is questionable. We therefore conducted a study to systematically review previous risk prediction models. The results of this review help to identify the most reliable model and indicate the strengths and weaknesses of each model to guide future model development. We searched MEDLINE (PubMed) from 1949 and EMBASE (Ovid) from 1974 until October 2010. Observational studies that constructed models using regression methods were selected. Information about model development and performance was extracted. Twenty-five out of 453 studies were eligible. Of these, 18 developed prediction models and 7 validated existing prediction models. Up to 13 variables were included in the models, and sample sizes for each study ranged from 550 to 2,404,636. Internal validation was performed in four models, while five models had external validation. The Gail and the Rosner and Colditz models were the significant models, which were subsequently modified by other scholars. Calibration performance of most models was fair to good (expected/observed ratio: 0.87-1.12), but discriminatory accuracy was poor to fair both in internal validation (concordance statistics: 0.53-0.66) and in external validation (concordance statistics: 0.56-0.63). Most models yielded relatively poor discrimination in both internal and external validation. This poor discriminatory accuracy of existing models might be because of a lack of knowledge about risk factors, heterogeneous subtypes of breast cancer, and different distributions of risk factors across populations. In addition, the concordance statistic itself is insensitive for measuring improvement in discrimination. Therefore, newer methods such as the net reclassification index should be considered to evaluate the improvement in performance of a newly developed model.
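The two performance measures named in this abstract, the expected/observed (E/O) ratio for calibration and the concordance statistic for discrimination, can be sketched as follows. This is a minimal illustration for binary outcomes, not the procedure used in the review; the function names and the toy risks and outcomes are hypothetical.

```python
def expected_observed_ratio(predicted_risks, outcomes):
    """Calibration-in-the-large: expected events divided by observed events."""
    observed = sum(outcomes)
    if observed == 0:
        raise ValueError("no observed events")
    return sum(predicted_risks) / observed


def concordance_statistic(predicted_risks, outcomes):
    """Fraction of (case, non-case) pairs in which the case has the higher risk.

    Tied predictions count as half a concordant pair.
    """
    cases = [r for r, y in zip(predicted_risks, outcomes) if y == 1]
    controls = [r for r, y in zip(predicted_risks, outcomes) if y == 0]
    if not cases or not controls:
        raise ValueError("need at least one case and one non-case")
    concordant = 0.0
    for case_risk in cases:
        for control_risk in controls:
            if case_risk > control_risk:
                concordant += 1.0
            elif case_risk == control_risk:
                concordant += 0.5
    return concordant / (len(cases) * len(controls))


# Hypothetical toy cohort: predicted risks and observed outcomes (1 = event)
toy_risks = [0.10, 0.40, 0.35, 0.80]
toy_outcomes = [0, 0, 1, 1]
print(concordance_statistic(toy_risks, toy_outcomes))
print(round(expected_observed_ratio(toy_risks, toy_outcomes), 3))
```

A concordance statistic of 0.5 corresponds to chance-level discrimination, which is why the reported range of 0.53-0.66 is described as poor to fair, while an E/O ratio near 1 indicates that the predicted number of events matches the observed number.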
Scheinemann, Katrin; Grotzer, Michael; Kompis, Martin; Kuehni, Claudia E.
2017-01-01
Background: Hearing loss is a potential late effect after childhood cancer. Questionnaires are often used to assess hearing in large cohorts of childhood cancer survivors and it is important to know if they can provide valid measures of hearing loss. We therefore assessed agreement and validity of questionnaire-reported hearing in childhood cancer survivors using medical records as reference. Procedure: In this validation study, we studied 361 survivors of childhood cancer from the Swiss Childhood Cancer Survivor Study (SCCSS) who had been diagnosed after 1989 and had been exposed to ototoxic cancer treatment. Questionnaire-reported hearing was compared to the information in medical records. Hearing loss was defined as ≥ grade 1 according to the SIOP Boston Ototoxicity Scale. We assessed agreement and validity of questionnaire-reported hearing overall and stratified by questionnaire respondents (survivor or parent), sociodemographic characteristics, time between follow-up and questionnaire and severity of hearing loss. Results: Questionnaire reports agreed with medical records in 85% of respondents (kappa 0.62), normal hearing was correctly assessed in 92% of those with normal hearing (n = 249), and hearing loss was correctly assessed in 69% of those with hearing loss (n = 112). Sensitivity of the questionnaires was 92%, 74%, and 39% for assessment of severe, moderate and mild bilateral hearing loss; and 50%, 33% and 10% for severe, moderate and mild unilateral hearing loss, respectively. Results did not differ by sociodemographic characteristics of the respondents, and survivor- and parent-reports were equally valid. Conclusions: Questionnaires are a useful tool to assess hearing in large cohorts of childhood cancer survivors, but underestimate mild and unilateral hearing loss. Further research should investigate whether the addition of questions with higher sensitivity for mild degrees of hearing loss could improve the results. PMID:28333999
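The agreement figures in this abstract can be cross-checked with a short sketch of Cohen's kappa for a 2x2 table. The cell counts below are an assumption: they are reconstructed by rounding the reported percentages (69% of 112 with hearing loss, 92% of 249 with normal hearing) and are not taken from the paper itself. The function name `agreement_stats` is hypothetical.

```python
def agreement_stats(tp, fn, tn, fp):
    """Overall agreement, Cohen's kappa, sensitivity and specificity for a 2x2 table.

    tp/fn: reference-positive subjects reported as positive/negative
    tn/fp: reference-negative subjects reported as negative/positive
    """
    n = tp + fn + tn + fp
    observed = (tp + tn) / n
    # Agreement expected by chance, from the marginal proportions
    chance_positive = ((tp + fn) / n) * ((tp + fp) / n)
    chance_negative = ((tn + fp) / n) * ((tn + fn) / n)
    expected = chance_positive + chance_negative
    return {
        "agreement": observed,
        "kappa": (observed - expected) / (1 - expected),
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
    }


# Counts reconstructed from rounded percentages (assumption, not source data)
stats = agreement_stats(tp=77, fn=35, tn=229, fp=20)
print({name: round(value, 2) for name, value in stats.items()})
```

With these reconstructed counts the sketch gives an overall agreement of about 0.85 and a kappa of about 0.63, close to the reported 0.62; a small gap is expected because the counts were rebuilt from rounded percentages.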
Pien, Li-Chung; Chu, Hsin; Chen, Wen-Chun; Chang, Yu-Shiun; Liao, Yuan-Mei; Chen, Chiung-Hua; Chou, Kuei-Ru
2011-08-01
To examine the psychometric properties of the Chinese version of the Multidimensional Fatigue Symptom Inventory-Short Form (MFSI-SF-C) for use in Chinese-speaking countries. The assessment of fatigue is a challenging task for most researchers because culture may influence perceptions of the meaning of fatigue. The lack of examination of the psychometric properties of the fatigue measures across studies limits the scientific rigour for generating additional research on the concept of 'fatigue.' A cross-sectional study. The study recruited 107 cancer inpatients from two medical centres in Taiwan. The MFSI-SF-C was examined using a two-step process: (1) Translation and back-translation of the instrument; and (2) Examination of internal consistency reliability, test-retest reliability, content validity and construct validity. The results showed that the Cronbach's α of the MFSI-SF-C total scale and subscales ranged between 0·83-0·92. The content validity index was 0·93. The difference between the fatigue of cancer patients and the comparison group of healthy people in the community was significant. The results demonstrated good convergent validity when comparing fatigue with depression and quality of life. Factor analysis confirmed the four dimensions of fatigue: physical, emotional, mental and vigour. Moderate intercorrelations between subscales and high factor loadings also helped to clarify the psychometric meaning. The reliability and validity information presented in this article support the use of the Chinese version of the MFSI-SF as a research instrument for measuring fatigue in Chinese populations. This study also provides evidence that the MFSI-SF possesses robust psychometric properties. The MFSI-SF-C is an effective and comprehensive tool for measuring fatigue in Chinese patients with cancer. © 2011 Blackwell Publishing Ltd.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Trinh, Thi-Kim-Hoang; Laboratoire de Science des Procédés et des Matériaux; Passarello, Jean-Philippe, E-mail: Jean-Philippe.Passarello@lspm.cnrs.fr
This work consists of the adaptation of a non-additive hard sphere theory inspired by Malakhov and Volkov [Polym. Sci., Ser. A 49(6), 745–756 (2007)] to a square-well chain. Using the thermodynamic perturbation theory, an additional term is proposed that describes the effect of perturbing the chain of square well spheres by a non-additive parameter. In order to validate this development, NPT Monte Carlo simulations of thermodynamic and structural properties of the non-additive square well for a pure chain and a binary mixture of chains are performed. Good agreements are observed between the compressibility factors originating from the theory and those from molecular simulations.
Validation of Procedures for Monitoring Crewmember Immune Function
NASA Technical Reports Server (NTRS)
Pierson, Duane; Crucian, Brian; Mehta, Satish; Stowe, Raymond; Uchakin, Peter; Quiriarte, Heather; Sams, Clarence
2010-01-01
The objective of this Supplemental Medical Objective (SMO) is to determine the status of the immune system, physiological stress and latent viral reactivation (a clinical outcome that can be measured) during both short and long-duration spaceflight. In addition, this study will develop and validate an immune monitoring strategy consistent with operational flight requirements and constraints. Pre-mission, in-flight and post-flight blood and saliva samples will be obtained from participating crewmembers. Assays included peripheral immunophenotype, T cell function, cytokine profiles, viral-specific immunity, latent viral reactivation (EBV, CMV, VZV), and stress hormone measurements. To date, 18 short duration (now completed) and 8 long-duration crewmembers have completed the study. The long-duration phase of this study is ongoing. For this presentation, the final data set for the short duration subjects will be discussed.
Matsumoto, David; Yoo, Seung Hee; Hirayama, Satoko; Petrova, Galina
2005-03-01
As one component of emotion regulation, display rules, which reflect the regulation of expressive behavior, have been the topic of many studies. Despite their theoretical and empirical importance, however, to date there is no measure of display rules that assesses a full range of behavioral responses that are theoretically possible when emotion is elicited. This article reports the development of a new measure of display rules that surveys 5 expressive modes: expression, deamplification, amplification, qualification, and masking. Two studies provide evidence for its internal and temporal reliability and for its content, convergent, discriminant, external, and concurrent predictive validity. Additionally, Study 1, involving American, Russian, and Japanese participants, demonstrated predictable cultural differences on each of the expressive modes. Copyright 2005 APA, all rights reserved.
Problem-solving style and multicultural personality dispositions: a study of construct validity.
Houtz, John C; Ponterotto, Joseph G; Burger, Claudia; Marino, Cherylynn
2010-06-01
This exploratory study examined the relationship between problem-solving styles and multicultural personality dispositions among 91 graduate students enrolled in an urban university located in the northeast United States. Problem-solving style was assessed with the three dimensions of the VIEW: an Assessment of Problem Solving Style. Multicultural personality was assessed with the five-factor Multicultural Personality Questionnaire (MPQ); its factors of Cultural Empathy, Open-mindedness, Social Initiative, and Flexibility correlated significantly with Explorer and External problem-solving styles, as predicted. The Emotional Stability subscale also correlated significantly with scores on Explorer style, suggesting that individuals who prefer "thinking in new directions" in problem solving are more likely to report remaining calm under stressful situations. Collectively, study results provided additional evidence of construct validity for the VIEW.
Monitoring genetic damage to ecosystems from hazardous waste
DOE Office of Scientific and Technical Information (OSTI.GOV)
Anderson, S.L.
1992-03-01
Applications of ecological toxicity testing to hazardous waste management have increased dramatically over the last few years, resulting in a greater awareness of the need for improved biomonitoring techniques. Our laboratory is developing advanced techniques to assess the genotoxic effects of environmental contamination on ecosystems. We have developed a novel mutagenesis assay using the nematode Caenorhabditis elegans, which is potentially applicable for multimedia studies in soil, sediment, and water. In addition, we are conducting validation studies of a previously developed anaphase aberration test that utilizes sea urchin embryos. Other related efforts include field validation studies of the new tests, evaluation of their potential ecological relevance, and analysis of their sensitivity relative to that of existing toxicity tests that assess only lethal effects, rather than genetic damage.
Hazing DEOCS 4.1 Construct Validity Summary
2017-08-01
Hazing DEOCS 4.1 Construct Validity Summary DEFENSE EQUAL OPPORTUNITY MANAGEMENT INSTITUTE DIRECTORATE OF...the analysis. Tables 4 – 6 provide additional information regarding the descriptive statistics and reliability of the Hazing items. Table 7 provides
Paiva, Bianca Sakamoto Ribeiro; de Camargos, Mayara Goulart; Demarzo, Marcelo Marcos Piva; Hervás, Gonzalo; Vázquez, Carmelo; Paiva, Carlos Eduardo
2016-09-01
The Pemberton Happiness Index (PHI) is a recently developed integrative measure of well-being that includes components of hedonic, eudaimonic, social, and experienced well-being. The PHI has been validated in several languages, but not in Portuguese. Our aim was to cross-culturally adapt the Universal Portuguese version of the PHI and to assess its psychometric properties in a sample of the Brazilian population using online surveys. An expert committee evaluated 2 versions of the PHI previously translated into Portuguese by the original authors using a standardized form for assessment of semantic/idiomatic, cultural, and conceptual equivalence. Pretesting was conducted employing cognitive debriefing methods. Subsequently, the expert committee evaluated all the documents and reached a final Universal Portuguese PHI version. For the evaluation of the psychometric properties, the data were collected using online surveys in a cross-sectional study. The study population included healthcare professionals and users of the social network site Facebook from several Brazilian geographic areas. In addition to the PHI, participants completed the Satisfaction with Life Scale (SWLS), Diener and Emmons' Positive and Negative Experience Scale (PNES), Psychological Well-being Scale (PWS), and the Subjective Happiness Scale (SHS). Internal consistency, convergent validity, known-group validity, and test-retest reliability were evaluated. Satisfaction with the previous day was correlated with the 10 items assessing experienced well-being using the Cramer V test. Additionally, a cut-off value of PHI to identify a "happy individual" was defined using receiver-operating characteristic (ROC) curve methodology. Data from 1035 Brazilian participants were analyzed (health professionals = 180; Facebook users = 855).
Regarding reliability, the internal consistency (Cronbach alpha = 0.890 and 0.914) and test-retest reliability (intraclass correlation coefficient = 0.814) were both considered adequate. Most of the validity hypotheses formulated a priori (convergent and known-group) were confirmed. A cut-off value higher than 7 in remembered PHI was identified (AUC = 0.780, sensitivity = 69.2%, specificity = 78.2%) as the best for identifying a happy individual. We concluded that the Universal Portuguese version of the PHI is valid and reliable for use in the Brazilian population using online surveys.
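The sensitivity and specificity reported for a cut-off can be illustrated with a minimal Python sketch. It assumes that a score strictly above the cut-off counts as a positive classification; the function name and the example scores and labels are invented for illustration and are not data from the study.

```python
def sensitivity_specificity(scores, is_positive, cutoff):
    """Return (sensitivity, specificity) when score > cutoff is classified positive."""
    tp = fn = tn = fp = 0
    for score, positive in zip(scores, is_positive):
        predicted = score > cutoff
        if positive and predicted:
            tp += 1      # true positive
        elif positive:
            fn += 1      # false negative
        elif predicted:
            fp += 1      # false positive
        else:
            tn += 1      # true negative
    return tp / (tp + fn), tn / (tn + fp)


# Hypothetical scores and reference labels (True = "happy" by an external criterion).
scores = [8.1, 7.5, 6.9, 9.0, 5.2, 7.2, 6.0, 4.8]
happy = [True, True, True, True, False, False, False, False]

sens, spec = sensitivity_specificity(scores, happy, cutoff=7)
print(f"sensitivity={sens:.2f}, specificity={spec:.2f}")
```

Repeating the calculation over a range of cut-offs yields the points of an ROC curve; how the "best" cut-off was chosen in the study is not stated in the abstract.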
Sysko, Robyn; Glasofer, Deborah R.; Hildebrandt, Tom; Klimek, Patrycja; Mitchell, James E.; Berg, Kelly C.; Peterson, Carol B.; Wonderlich, Stephen A.; Walsh, B. Timothy
2016-01-01
Objective Existing measures for DSM-IV eating disorder diagnoses have notable limitations, and there are important differences between DSM-IV and DSM-5 feeding and eating disorders. This study developed and validated a new semi-structured interview, the Eating Disorders Assessment for DSM-5 (EDA-5). Method Two studies evaluated the utility of the EDA-5. Study 1 compared the diagnostic validity of the EDA-5 to the Eating Disorder Examination (EDE) and evaluated the test-retest reliability of the new measure. Study 2 compared the diagnostic validity of an EDA-5 electronic application (“app”) to clinician interview and self-report assessments. Results In Study 1, the kappa for EDE and EDA-5 eating disorder diagnoses was 0.74 across all diagnoses (n = 64), with a range of κ = 0.65 for Other Specified Feeding or Eating Disorder (OSFED)/Unspecified Feeding or Eating Disorder (USFED) to κ = 0.90 for Binge Eating Disorder (BED). The EDA-5 test-retest kappa coefficient was 0.87 across diagnoses. For Study 2, clinical interview versus “app” conditions revealed a kappa of 0.83 for all eating disorder diagnoses (n = 71). Across individual diagnostic categories, kappas ranged from 0.56 for OSFED/USFED to 0.94 for BN. Discussion High rates of agreement were found between diagnoses by EDA-5 and the EDE, and EDA-5 and clinical interviews. As this study supports the validity of the EDA-5 to generate DSM-5 eating disorder diagnoses and the reliability of these diagnoses, the EDA-5 may be an option for the assessment of Anorexia Nervosa, Bulimia Nervosa, and BED. Additional research is needed to evaluate the utility of the EDA-5 in assessing DSM-5 feeding disorders. PMID:25639562
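Agreement statistics of this kind can be sketched as follows. The example assumes the unweighted Cohen's kappa for two sets of categorical diagnoses; the abstract does not state which kappa variant was used, and the function name and diagnosis lists are hypothetical.

```python
from collections import Counter


def cohens_kappa(ratings_a, ratings_b):
    """Unweighted Cohen's kappa for two equally long lists of categorical labels."""
    if len(ratings_a) != len(ratings_b) or not ratings_a:
        raise ValueError("ratings must be non-empty and of equal length")
    n = len(ratings_a)
    # Observed proportion of exact agreement.
    observed = sum(a == b for a, b in zip(ratings_a, ratings_b)) / n
    # Agreement expected by chance from the marginal label frequencies.
    counts_a = Counter(ratings_a)
    counts_b = Counter(ratings_b)
    expected = sum(counts_a[c] * counts_b[c] for c in counts_a) / (n * n)
    # Undefined (division by zero) if both raters use one identical label throughout.
    return (observed - expected) / (1 - expected)


# Hypothetical diagnoses from two assessment methods for eight participants.
interview_1 = ["AN", "AN", "BN", "BN", "BED", "BED", "BN", "AN"]
interview_2 = ["AN", "AN", "BN", "BED", "BED", "BED", "BN", "BN"]

kappa = cohens_kappa(interview_1, interview_2)
print(f"kappa = {kappa:.2f}")
```

Kappa corrects raw percent agreement for the agreement expected from the marginal frequencies alone, which is why it is lower than the 75% raw agreement in this toy example.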
Evaluation and implementation of chemotherapy regimen validation in an electronic health record.
Diaz, Amber H; Bubalo, Joseph S
2014-12-01
Computerized provider order entry of chemotherapy regimens is quickly becoming the standard for prescribing chemotherapy in both inpatient and ambulatory settings. One of the difficulties with implementation of chemotherapy regimen computerized provider order entry lies in verifying the accuracy and completeness of all regimens built in the system library. Our goal was to develop, implement, and evaluate a process for validating chemotherapy regimens in an electronic health record. We describe our experience developing and implementing a process for validating chemotherapy regimens in the setting of a standard, commercially available computerized provider order entry system. The pilot project focused on validating chemotherapy regimens in the adult inpatient oncology setting and adult ambulatory hematologic malignancy setting. A chemotherapy regimen validation process was defined as a result of the pilot project. Over a 27-week pilot period, 32 chemotherapy regimens were validated using the process we developed. Results of the study suggest that validating chemotherapy regimens decreased the amount of time pharmacists spent in daily chemotherapy review. In addition, the number of pharmacist modifications required to make regimens complete and accurate was decreased. Both physician and pharmacy disciplines showed improved satisfaction and confidence levels with chemotherapy regimens after implementation of the validation system. Chemotherapy regimen validation required a considerable amount of planning and time but resulted in increased pharmacist efficiency and improved provider confidence and satisfaction.
Wang, Wenyi; Kim, Marlene T.; Sedykh, Alexander
2015-01-01
Purpose Experimental Blood–Brain Barrier (BBB) permeability models for drug molecules are expensive and time-consuming. As alternative methods, several traditional Quantitative Structure-Activity Relationship (QSAR) models have been developed previously. In this study, we aimed to improve the predictivity of traditional QSAR BBB permeability models by employing relevant public bio-assay data in the modeling process. Methods We compiled a BBB permeability database consisting of 439 unique compounds from various resources. The database was split into a modeling set of 341 compounds and a validation set of 98 compounds. Consensus QSAR modeling workflow was employed on the modeling set to develop various QSAR models. A five-fold cross-validation approach was used to validate the developed models, and the resulting models were used to predict the external validation set compounds. Furthermore, we used previously published membrane transporter models to generate relevant transporter profiles for target compounds. The transporter profiles were used as additional biological descriptors to develop hybrid QSAR BBB models. Results The consensus QSAR models have R2=0.638 for fivefold cross-validation and R2=0.504 for external validation. The consensus model developed by pooling chemical and transporter descriptors showed better predictivity (R2=0.646 for five-fold cross-validation and R2=0.526 for external validation). Moreover, several external bio-assays that correlate with BBB permeability were identified using our automatic profiling tool. Conclusions The BBB permeability models developed in this study can be useful for early evaluation of new compounds (e.g., new drug candidates). The combination of chemical and biological descriptors shows a promising direction to improve the current traditional QSAR models. PMID:25862462
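The five-fold cross-validation step described above can be sketched in plain Python. This is an illustration only: it assumes a single numeric descriptor and an ordinary least-squares line as a stand-in for the QSAR models, and R2 is computed as one minus the ratio of residual to total sum of squares, which may differ from the definition used in the study. All function names and data are hypothetical.

```python
import random


def k_fold_indices(n, k, seed=0):
    """Shuffle the indices 0..n-1 and deal them into k nearly equal folds."""
    idx = list(range(n))
    random.Random(seed).shuffle(idx)
    return [idx[i::k] for i in range(k)]


def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    mx, my = sum(xs) / len(xs), sum(ys) / len(ys)
    sxx = sum((x - mx) ** 2 for x in xs)
    slope = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / sxx
    return slope, my - slope * mx


def r_squared(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean) ** 2 for t in y_true)
    return 1 - ss_res / ss_tot


def cross_validated_r2(xs, ys, k=5, seed=0):
    """Predict every sample from a model fitted without its fold, then score."""
    preds = [0.0] * len(xs)
    for held_out in k_fold_indices(len(xs), k, seed):
        held = set(held_out)
        train = [i for i in range(len(xs)) if i not in held]
        slope, intercept = fit_line([xs[i] for i in train], [ys[i] for i in train])
        for i in held_out:
            preds[i] = slope * xs[i] + intercept
    return r_squared(ys, preds)


# Toy data: a noiseless linear relation, so the cross-validated R2 is ~1.
xs = [float(i) for i in range(20)]
ys = [2.0 * x + 1.0 for x in xs]
print(cross_validated_r2(xs, ys))
```

An external validation set, such as the 98 compounds held out in the study, would be scored once with a model fitted on the full modeling set and never used during fold construction.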
Validation of the mean radiant temperature simulated by the RayMan software in urban environments.
Lee, Hyunjung; Mayer, Helmut
2016-11-01
The RayMan software is applied worldwide in investigations of various issues in human-biometeorology. However, only the simulated mean radiant temperature (Tmrt) has been validated so far, in a few case studies. These are based on Tmrt values that were experimentally determined in urban environments using a globe thermometer or the six-directional method. This study analyses previous Tmrt validations in a comparative manner. Their results are extended by a recent validation of Tmrt in an urban micro-environment in Freiburg (southwest Germany), which can be regarded as relatively heterogeneous due to different shading intensities by tree crowns. In addition, a validation of the physiologically equivalent temperature (PET) simulated by RayMan is conducted for the first time. The validations are based on experimentally determined Tmrt and PET values, which were calculated from meteorological variables measured in the daytime of a clear-sky summer day. In total, the validation results show that RayMan is capable of simulating Tmrt satisfactorily under relatively homogeneous site conditions. However, the inaccuracy of simulated Tmrt increases with lower sun elevation and growing heterogeneity of the simulation site. As Tmrt is the meteorological variable that mostly governs PET in the daytime of clear-sky summer days, the accuracy of simulated Tmrt is mainly responsible for the accuracy of simulated PET. The Tmrt validations result in some recommendations, which concern an update of the physical principles applied in the RayMan software to simulate the short- and long-wave radiant flux densities, especially from vertical building walls and tree crowns.
1991-01-17
Number: 90-03-08- VRX See Section 3.1 for any additional information about the testing environment. As a result of this validation effort, Validation...20301,Q:’ I.. AVF Control Number: AVF-VSR-365.0191 17 January 1991 90-03-08- VRX Ada COMPILER VALIDATION SUMMARY REPORT: Certificate Number: 900228W1.11003
NASA Astrophysics Data System (ADS)
Cataloglu, Erdat
The purpose of this study was to construct a valid and reliable multiple-choice achievement test to assess students' understanding of core concepts of introductory quantum mechanics. Development of the Quantum Mechanics Visualization Instrument (QMVI) occurred across four successive semesters in 1999--2001. During this time 213 undergraduate and graduate students attending the Pennsylvania State University (PSU) at University Park and Arizona State University (ASU) participated in this development and validation study. Participating students were enrolled in four distinct groups of courses: Modern Physics, Undergraduate Quantum Mechanics, Graduate Quantum Mechanics, and Chemistry Quantum Mechanics. Expert panels of professors of physics experienced in teaching quantum mechanics courses and graduate students in physics and science education established the core content and assisted in the validating of successive versions of the 24-question QMVI. Instrument development was guided by procedures outlined in the Standards for Educational and Psychological Testing (AERA-APA-NCME, 1999). Data gathered in this study provided information used in the development of successive versions of the QMVI. Data gathered in the final phase of administration of the QMVI also provided evidence that the intended score interpretation of the QMVI achievement test is valid and reliable. A moderate positive correlation coefficient of 0.49 was observed between the students' QMVI scores and their confidence levels. Analyses of variance indicated that students' scores in Graduate Quantum Mechanics and Undergraduate Quantum Mechanics courses were significantly higher than the mean scores of students in Modern Physics and Chemistry Quantum Mechanics courses (p < 0.05). That finding is consistent with the additional understanding and experience that should be anticipated in graduate students and junior-senior level students over sophomore physics majors and majors in another field. 
The moderate positive correlation coefficient of 0.42 observed between students' QMVI scores and their final course grades was also consistent with expectations in a valid instrument. In addition, the Cronbach-alpha reliability coefficient of the QMVI was found to be 0.82. Limited findings were drawn on students' understanding of introductory quantum mechanics concepts. Data suggested that the construct of quantum mechanics understanding is most likely multidimensional and the Main Topic defined as "Quantum Mechanics Postulates" may be an especially important factor for students in acquiring a successful understanding of quantum mechanics.
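The Cronbach-alpha reliability coefficient mentioned above follows the standard formula alpha = k/(k-1) * (1 - sum of item variances / variance of total scores). A minimal Python sketch is given below; the function name and the 0/1 response matrix (five hypothetical students, four hypothetical items scored correct or incorrect) are assumptions for illustration, not QMVI data.

```python
from statistics import variance


def cronbach_alpha(responses):
    """Cronbach's alpha for a list of respondents, each a list of k item scores."""
    k = len(responses[0])
    items = list(zip(*responses))                      # one tuple of scores per item
    item_variance_sum = sum(variance(item) for item in items)
    total_variance = variance([sum(row) for row in responses])
    return k / (k - 1) * (1 - item_variance_sum / total_variance)


# Hypothetical dichotomous (1 = correct, 0 = incorrect) responses.
responses = [
    [1, 0, 1, 1],
    [1, 1, 1, 0],
    [0, 0, 0, 0],
    [1, 1, 1, 1],
    [0, 0, 1, 0],
]

alpha = cronbach_alpha(responses)
print(f"alpha = {alpha:.3f}")
```

The sample variance is used for both the items and the totals; using the population variance throughout gives the same ratio and therefore the same alpha.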
Fuermaier, Anselm B M; Tucha, Oliver; Koerts, Janneke; Lange, Klaus W; Weisbrod, Matthias; Aschenbrenner, Steffen; Tucha, Lara
2017-12-01
The assessment of performance validity is an essential part of the neuropsychological evaluation of adults with attention-deficit/hyperactivity disorder (ADHD). Most available tools, however, are inaccurate regarding the identification of noncredible performance. This study describes the development of a visuospatial working memory test, including a validity indicator for noncredible cognitive performance of adults with ADHD. Visuospatial working memory of adults with ADHD (n = 48) was first compared to the test performance of healthy individuals (n = 48). Furthermore, a simulation design was performed including 252 individuals who were randomly assigned to either a control group (n = 48) or to 1 of 3 simulation groups who were requested to feign ADHD (n = 204). Additional samples of 27 adults with ADHD and 69 instructed simulators were included to cross-validate findings from the first samples. Adults with ADHD showed impaired visuospatial working memory performance of medium size as compared to healthy individuals. Simulation groups committed significantly more errors and had shorter response times as compared to patients with ADHD. Moreover, binary logistic regression analysis was carried out to derive a validity index that optimally differentiates between true and feigned ADHD. ROC analysis demonstrated high classification rates of the validity index, as shown in excellent specificity (95.8%) and adequate sensitivity (60.3%). The visuospatial working memory test as presented in this study therefore appears sensitive in indicating cognitive impairment of adults with ADHD. Furthermore, the embedded validity index revealed promising results concerning the detection of noncredible cognitive performance of adults with ADHD.
Schmidt, Kathrin S; Mankertz, Joachim
2018-06-01
A sensitive and robust LC-MS/MS method allowing the rapid screening and confirmation of selective androgen receptor modulators in bovine urine was developed and successfully validated according to Commission Decision 2002/657/EC, chapter 3.1.3 'alternative validation', by applying a matrix-comprehensive in-house validation concept. The confirmation of the analytes in the validation samples was achieved both on the basis of the MRM ion ratios as laid down in Commission Decision 2002/657/EC and by comparison of their enhanced product ion (EPI) spectra with a reference mass spectral library by making use of the QTRAP technology. Here, in addition to the MRM survey scan, EPI spectra were generated in a data-dependent way according to an information-dependent acquisition criterion. Moreover, stability studies of the analytes in solution and in matrix according to an isochronous approach proved the stability of the analytes in solution and in matrix for at least the duration of the validation study. To identify factors that have a significant influence on the test method in routine analysis, a factorial effect analysis was performed. To this end, factors considered to be relevant for the method in routine analysis (e.g. operator, storage duration of the extracts before measurement, different cartridge lots and different hydrolysis conditions) were systematically varied on two levels. The examination of the extent to which these factors influence the measurement results of the individual analytes showed that none of the validation factors exerts a significant influence on the measurement results.
McKenna, Stephen P; Wilburn, Jeanette; Twiss, James; Crawford, Sigrid R; Hána, Václav; Karbownik-Lewinska, Malgorzata; Popovic, Vera; Pura, Mikulas; Koltowska-Häggström, Maria
2011-08-02
The Quality of Life in Adult Growth Hormone Deficiency Assessment (QoL-AGHDA) is a disease-specific quality of life measure for individuals who are growth hormone deficient. The present study describes the adaptation of the QoL-AGHDA for use in four Slavic languages: Czech, Polish, Serbian and Slovakian. The study involved three stages in each language: translation, cognitive debriefing and validation. The validation stage assessed internal consistency (Cronbach's alpha), reproducibility (test-retest reliability using Spearman's rank correlations), convergent and divergent validity (correlations with the NHP) and known group validity. The QoL-AGHDA was successfully translated into the target languages with minimal problems. Cognitive debriefing interviewees (n = 15-18) found the measures easy to complete and identified few problems with the content. Internal consistency (Czech Republic = 0.91, Poland = 0.91, Serbia = 0.91 and Slovakia = 0.89) and reproducibility (Czech Republic = 0.91, Poland = 0.91, Serbia = 0.88 and Slovakia = 0.93) were good in all adaptations. Convergent and divergent validity and known group validity data were not available for Slovakia. The QoL-AGHDA correlated as expected with the NHP scales most relevant to GHD. The QoL-AGHDA was able to distinguish between participants based on a range of variables. The QoL-AGHDA was successfully adapted for use in the Czech Republic, Poland, Serbia and Slovakia. Further validation of the Slovakian version would be beneficial. The addition of these new language versions will prove valuable to multinational clinical trials and to clinical practice in the respective countries.
Development and Validation of the Biobanking Attitudes and Knowledge Survey (BANKS)
Wells, Kristen J.; Arevalo, Mariana; Meade, Cathy D.; Gwede, Clement K.; Quinn, Gwendolyn P.; Luque, John S.; Miguel, Gloria San; Watson, Dale; Phillips, Rebecca; Reyes, Carmen; Romo, Margarita; West, Jim; Jacobsen, Paul B.
2014-01-01
Background No validated multi-scale instruments exist that measure community members’ views on biobanking and biospecimen donation. This study describes the development and psychometric properties of the English-language BANKS (Biobanking Attitudes aNd Knowledge Survey). Methods The BANKS was created by item generation through review of scientific literature, focus groups with community members, and input from a community advisory board. Items were refined through cognitive interviews. Content validity was assessed through an expert panel review. Psychometric properties of the BANKS were assessed in a sample of 85 community members. Results The final BANKS includes 3 scales: Attitudes, Knowledge, and Self-Efficacy; as well as 3 single items, which evaluated receptivity and intention to donate a biospecimen for research. Cronbach's alpha coefficients for two scales that use Likert response format indicated high internal consistency (Attitudes: α=.88; Self-Efficacy: α=.95). Content validity indices were moderate, ranging from 0.69 to 0.89. Intention to donate blood and intention to donate urine were positively correlated with attitudes, knowledge, self-efficacy, and receptivity to learning more about biobanking (p's range from .029 to <.001). Conclusions The final BANKS shows evidence of satisfactory reliability and validity, is easy to administer, and is a promising tool to inform biospecimen research. Additional studies should be conducted with larger samples considering biospecimen donation to further assess the instrument's reliability and validity. Impact A valid and reliable instrument measuring community members’ views about biobanking may help researchers evaluate relevant communication interventions to enhance understanding, intention, and actual biospecimen donation. A Spanish-language BANKS is under development. PMID:24609846
Examining the Predictive Validity of NIH Peer Review Scores
Lindner, Mark D.; Nakamura, Richard K.
2015-01-01
The predictive validity of peer review at the National Institutes of Health (NIH) has not yet been demonstrated empirically. It might be assumed that the most efficient and expedient test of the predictive validity of NIH peer review would be an examination of the correlation between percentile scores from peer review and bibliometric indices of the publications produced from funded projects. The present study used a large dataset to examine the rationale for such a study, to determine if it would satisfy the requirements for a test of predictive validity. The results show significant restriction of range in the applications selected for funding. Furthermore, those few applications that are funded with slightly worse peer review scores are not selected at random or representative of other applications in the same range. The funding institutes also negotiate with applicants to address issues identified during peer review. Therefore, the peer review scores assigned to the submitted applications, especially for those few funded applications with slightly worse peer review scores, do not reflect the changed and improved projects that are eventually funded. In addition, citation metrics by themselves are not valid or appropriate measures of scientific impact. The use of bibliometric indices on their own to measure scientific impact would likely increase the inefficiencies and problems with replicability already largely attributed to the current over-emphasis on bibliometric indices. Therefore, retrospective analyses of the correlation between percentile scores from peer review and bibliometric indices of the publications resulting from funded grant applications are not valid tests of the predictive validity of peer review at the NIH. PMID:26039440
van Zijl, Floris V W J; Timman, Reinier; Datema, Frank R
2017-06-01
The nasal obstruction symptom evaluation (NOSE) scale is a validated disease-specific, self-completed questionnaire for the assessment of quality of life related to nasal obstruction. The aim of this study was to validate the Dutch (NL-NOSE) questionnaire. A prospective instrument validation study was performed in a tertiary academic referral center. Guidelines for the cross-cultural adaptation process from the original English language scale into a Dutch language version were followed. Patients undergoing functional septoplasty or septorhinoplasty and asymptomatic controls completed the questionnaire both before and 3 months after surgery to test reliability and validity. Additionally, we explored the possibility of reducing the NOSE scale even further using graded response models. In total, 129 patients and 50 controls were included. Internal consistency (Cronbach's alpha 0.82) and test-retest reliability (intraclass correlation coefficient 0.89) were good. The instrument showed excellent between-group discrimination (Mann-Whitney U = 85, p < 0.001) and high response sensitivity to change (Wilcoxon rank p < 0.001). The NL-NOSE correlated well with the score on a visual analog scale measuring the subjective sensation of nasal obstruction, with the exception of item 4 (trouble sleeping). Item 4 provided the least information to the total scale and item 3 (trouble breathing through nose) the most, particularly in the postoperative group. The Dutch version of the NOSE (NL-NOSE) demonstrated satisfactory reliability and validity. We recommend the use of the NL-NOSE as a validated instrument to measure subjective severity of nasal obstruction in Dutch adult patients.
Korjus, Kristjan; Hebart, Martin N.; Vicente, Raul
2016-01-01
Supervised machine learning methods typically require splitting data into multiple chunks for training, validating, and finally testing classifiers. For finding the best parameters of a classifier, training and validation are usually carried out with cross-validation. This is followed by application of the classifier with optimized parameters to a separate test set for estimating the classifier’s generalization performance. With limited data, this separation of test data creates a difficult trade-off between having more statistical power in estimating generalization performance versus choosing better parameters and fitting a better model. We propose a novel approach that we term “Cross-validation and cross-testing” improving this trade-off by re-using test data without biasing classifier performance. The novel approach is validated using simulated data and electrophysiological recordings in humans and rodents. The results demonstrate that the approach has a higher probability of discovering significant results than the standard approach of cross-validation and testing, while maintaining the nominal alpha level. In contrast to nested cross-validation, which is maximally efficient in re-using data, the proposed approach additionally maintains the interpretability of individual parameters. Taken together, we suggest an addition to currently used machine learning approaches which may be particularly useful in cases where model weights do not require interpretation, but parameters do. PMID:27564393
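For orientation, the standard approach that the abstract contrasts with (cross-validation to choose a parameter, followed by a single evaluation on a separate test set) can be sketched as below. This is not the proposed "cross-validation and cross-testing" method. The one-dimensional k-nearest-neighbour classifier, the simulated data, the candidate values and all names are assumptions chosen only to keep the example self-contained.

```python
import random


def knn_predict(train, x, k):
    """Majority vote (labels 0/1) among the k training points nearest to x."""
    nearest = sorted(train, key=lambda point: abs(point[0] - x))[:k]
    votes = sum(label for _, label in nearest)
    return 1 if 2 * votes > k else 0


def accuracy(train, eval_set, k):
    """Fraction of eval_set classified correctly using train as reference data."""
    return sum(knn_predict(train, x, k) == y for x, y in eval_set) / len(eval_set)


def select_k_by_cv(data, candidates, n_folds=5):
    """Return the candidate k with the best mean cross-validated accuracy."""
    folds = [data[i::n_folds] for i in range(n_folds)]

    def cv_score(k):
        scores = []
        for i, held_out in enumerate(folds):
            train = [p for j, fold in enumerate(folds) if j != i for p in fold]
            scores.append(accuracy(train, held_out, k))
        return sum(scores) / n_folds

    return max(candidates, key=cv_score)


# Simulated two-class data: class 0 centred at 0, class 1 centred at 2.
rng = random.Random(0)
data = [(rng.gauss(2.0 * label, 1.0), label) for label in [0, 1] * 50]
rng.shuffle(data)

# The test set is set aside first and touched only once, after k is chosen.
development, test = data[:70], data[70:]
best_k = select_k_by_cv(development, candidates=[1, 3, 5, 7])
test_accuracy = accuracy(development, test, best_k)
print(best_k, test_accuracy)
```

The trade-off described in the abstract is visible in the split: every sample moved into `test` sharpens the estimate of generalization performance but leaves fewer samples for choosing `best_k`.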
Jones, Jace W; Tudor, Gregory; Bennett, Alexander; Farese, Ann M; Moroni, Maria; Booth, Catherine; MacVittie, Thomas J; Kane, Maureen A
2014-07-01
The potential risk of a radiological catastrophe highlights the need for identifying and validating potential biomarkers that accurately predict radiation-induced organ damage. A key target organ that is acutely sensitive to the effects of irradiation is the gastrointestinal (GI) tract; the resulting injury is referred to as the GI acute radiation syndrome (GI-ARS). Recently, citrulline has been identified as a potential circulating biomarker for radiation-induced GI damage. Prior to biologically validating citrulline as a biomarker for radiation-induced GI injury, there is the important task of developing and validating a quantitation assay for citrulline detection within the radiation animal models used for biomarker validation. Herein, we describe the analytical development and validation of citrulline detection using a liquid chromatography tandem mass spectrometry assay that incorporates stable-label isotope internal standards. Analytical validation for specificity, linearity, lower limit of quantitation, accuracy, intra- and interday precision, extraction recovery, matrix effects, and stability was performed under sample collection and storage conditions according to the Guidance for Industry, Bioanalytical Methods Validation issued by the US Food and Drug Administration. In addition, the method was biologically validated using plasma from well-characterized mouse, minipig, and nonhuman primate GI-ARS models. The results demonstrated that circulating citrulline can be confidently quantified from plasma. Additionally, circulating citrulline displayed a time-dependent response for radiological doses covering GI-ARS across multiple species.
Siemann, Julia; Herrmann, Manfred; Galashan, Daniela
2016-08-01
Usually, incongruent flanker stimuli provoke conflict processing whereas congruent flankers should facilitate task performance. Various behavioral studies reported improved or even absent conflict processing with correctly oriented selective attention. In the present study we attempted to reinvestigate these behavioral effects and to disentangle neuronal activity patterns underlying the attentional cueing effect, taking advantage of a combination of the high temporal resolution of electroencephalography (EEG) and the spatial resolution of functional magnetic resonance imaging (fMRI). Data from 20 participants were acquired in different sessions per method. We expected the conflict-related N200 event-related potential (ERP) component and areas associated with flanker processing to show validity-specific modulations. Additionally, the spatio-temporal dynamics during cued flanker processing were examined using an fMRI-constrained source analysis approach. In the ERP data we found early differences in flanker processing between validity levels. An early centro-parietal relative positivity for incongruent stimuli occurred only with valid cueing during the N200 time window, while a subsequent fronto-central negativity was specific to invalidly cued interference processing. The source analysis additionally pointed to separate neural generators of these effects. Regional sources in visual areas were involved in conflict processing with valid cueing, while a regional source in the anterior cingulate cortex (ACC) seemed to contribute to the ERP differences with invalid cueing. Moreover, the ACC and precentral gyrus demonstrated an early and a late phase of congruency-related activity differences with invalid cueing. We interpret the first effect as reflecting conflict detection and response activation, whereas the latter more likely originated from conflict monitoring and control processes during response competition.
Eisenberg, Dan T A; Kuzawa, Christopher W; Hayes, M Geoffrey
2015-01-01
Telomere length (TL) is commonly measured using quantitative PCR (qPCR). Although qPCR is easier than the Southern blot of terminal restriction fragments (TRF) method of TL measurement, one drawback is that it introduces greater measurement error and thus reduces the statistical power of analyses. To address a potential source of measurement error, we consider the effect of well position on qPCR TL measurements. qPCR TL data from 3,638 people run on a Bio-Rad iCycler iQ are reanalyzed here. To evaluate measurement validity, correspondence with TRF, with age, and between mother and offspring is examined. First, we present evidence for systematic variation in qPCR TL measurements in relation to thermocycler well position. Controlling for these well-position effects consistently improves measurement validity and yields estimated improvements in statistical power equivalent to increasing sample sizes by 16%. We additionally evaluated the linearity of the relationships between telomere and single copy gene control amplicons and between qPCR and TRF measures. We find that, unlike some previous reports, our data exhibit linear relationships. We introduce the standard error in percent, a superior method for quantifying measurement error as compared to the commonly used coefficient of variation. Using this measure, we find that excluding samples with high measurement error does not improve measurement validity in our study. Future studies using block-based thermocyclers should consider well position effects. Since additional information can be gleaned from well position corrections, rerunning analyses of previous results with well position correction could serve as an independent test of the validity of these results.
Wang, Dong-Yu; Done, Susan J; Mc Cready, David R; Leong, Wey L
2014-07-04
Using genome-wide expression profiles of a prospective training cohort of breast cancer patients, ClinicoMolecular Triad Classification (CMTC) was recently developed to classify breast cancers into three clinically relevant groups to aid treatment decisions. CMTC was found to be both prognostic and predictive in a large external breast cancer cohort in that study. This study serves to validate the reproducibility of CMTC and its prognostic value using independent patient cohorts. An independent internal cohort (n = 284) and a new external cohort (n = 2,181) were used to validate the association of CMTC with clinicopathological factors, 12 known gene signatures, two molecular subtype classifiers, and 19 oncogenic signalling pathway activities, and to reproduce the ability of CMTC to predict clinical outcomes of breast cancer. In addition, we updated the outcome data of the original training cohort (n = 147). The original training cohort reached a statistically significant difference (p < 0.05) in disease-free survival between the three CMTC groups after an additional two years of follow-up (median = 55 months). The prognostic value of the triad classification was reproduced in the second independent internal cohort and the new external validation cohort. CMTC achieved even higher prognostic significance when all available patients were analyzed (n = 4,851). Oncogenic pathways Myc, E2F1, Ras and β-catenin were again implicated in the high-risk groups. Both prospective internal cohorts and the independent external cohorts reproduced the triad classification of CMTC and its prognostic significance. CMTC is an independent prognostic predictor, and it outperformed 12 other known prognostic gene signatures, molecular subtype classifications, and all other standard prognostic clinicopathological factors. Our results support further development of the CMTC portfolio into a guide for personalized breast cancer treatments.
NASA Technical Reports Server (NTRS)
Voemel, Holger
2004-01-01
The main goal of our work was to provide in situ water vapor and ozone profiles in the upper troposphere and lower stratosphere as reference measurements for the validation of SAGE III water vapor and ozone retrievals. We used the NOAA/CMDL frost point hygrometer and ECC ozone sondes on small research balloons to provide continuous profiles between the surface and the mid stratosphere. The NOAA/CMDL frost point hygrometer is currently the only lightweight balloon borne instrument capable of measuring water vapor between the lower troposphere and middle stratosphere. The validation measurements were based in the arctic region of Scandinavia for northern hemisphere observations and in New Zealand for southern hemisphere observations and timed to coincide with overpasses of the SAGE III instrument. In addition to SAGE III validation we also tried to coordinate launches with other instruments and studied dehydration and transport processes in the Arctic stratospheric vortex.
Tarescavage, Anthony M; Wygant, Dustin B; Gervais, Roger O; Ben-Porath, Yossef S
2013-01-01
The current study examined the over-reporting Validity Scales of the MMPI-2 Restructured Form (MMPI-2-RF; Ben-Porath & Tellegen, 2008/2011) in relation to the Slick, Sherman, and Iverson (1999) criteria for the diagnosis of Malingered Neurocognitive Dysfunction in a sample of 916 consecutive non-head injury disability claimants. The classification of Malingered Neurocognitive Dysfunction was based on scores from several cognitive symptom validity tests and response bias indicators built into traditional neuropsychological tests. Higher scores on MMPI-2-RF Validity Scales, particularly the Response Bias Scale (Gervais, Ben-Porath, Wygant, & Green, 2007), were associated with probable and definite Malingered Neurocognitive Dysfunction. The MMPI-2-RF's Validity Scales classification accuracy of Malingered Neurocognitive Dysfunction improved when multiple scales were interpreted. Additionally, higher scores on MMPI-2-RF substantive scales measuring distress, internalizing dysfunction, thought dysfunction, and social avoidance were associated with probable and definite Malingered Neurocognitive Dysfunction. Implications for clinical practice and future directions are noted.
Zuba, Anna; Warschburger, Petra
2018-06-01
Anti-fat bias is widespread and is linked to the internalization of weight bias and psychosocial problems. The purpose of this study was to examine the internalization of weight bias among children across weight categories and to evaluate the psychometric properties of the Weight Bias Internalization Scale for Children (WBIS-C). Data were collected from 1484 primary school children and their parents. WBIS-C demonstrated good internal consistency (α = .86) after exclusion of Item 1. The unitary factor structure was supported using exploratory and confirmatory factor analyses (factorial validity). Girls and overweight children reported higher WBIS-C scores in comparison to boys and non-overweight peers (known-groups validity). Convergent validity was shown by significant correlations with psychosocial problems. Internalization of weight bias explained additional variance in different indicators of psychosocial well-being. The results suggest that the WBIS-C is a psychometrically sound and informative tool to assess weight bias internalization among children. Copyright © 2018 Elsevier Ltd. All rights reserved.
Heritage, Brody; Gilbert, Jessica M.; Roberts, Lynne D.
2016-01-01
Job embeddedness is a construct that describes the manner in which employees can be enmeshed in their jobs, reducing their turnover intentions. Recent questions regarding the properties of quantitative job embeddedness measures, and their predictive utility, have been raised. Our study compared two competing reflective measures of job embeddedness, examining their convergent, criterion, and incremental validity, as a means of addressing these questions. Cross-sectional quantitative data from 246 Australian university employees (146 academic; 100 professional) was gathered. Our findings indicated that the two compared measures of job embeddedness were convergent when total scale scores were examined. Additionally, job embeddedness was capable of demonstrating criterion and incremental validity, predicting unique variance in turnover intention. However, this finding was not readily apparent with one of the compared job embeddedness measures, which demonstrated comparatively weaker evidence of validity. We discuss the theoretical and applied implications of these findings, noting that job embeddedness has a complementary place among established determinants of turnover intention. PMID:27199817
Tobbell, Jane; Boduszek, Daniel; Kola-Palmer, Susanna; Vaughan, Joanne; Hargreaves, Janet
2018-04-01
There is global recognition that the inclusion of service users in the education of health and social care students in higher education can lead to more compassionate professional identities which will enable better decision making. However, to date there is no systematic tool to explore learning and service user involvement in the curriculum. To generate and validate a psychometric instrument which will allow educators to evaluate service user pedagogy. Construction and validation of a new scale. 365 undergraduate students from health and social care departments in two universities. A two correlated factor scale. Factor 1 - perceived presence of service users in the taught curriculum and factor 2 - professionals and service users working together (correlation between factor 1 and factor 2 - r = 0.32). The Huddersfield Service User Pedagogy Scale provides a valid instrument for educators to evaluate student learning. In addition, the tool can contribute to student reflections on their shifting professional identities as they progress through their studies. Copyright © 2018 Elsevier Ltd. All rights reserved.
A Decision Tree for Nonmetric Sex Assessment from the Skull.
Langley, Natalie R; Dudzik, Beatrix; Cloutier, Alesia
2018-01-01
This study uses five well-documented cranial nonmetric traits (glabella, mastoid process, mental eminence, supraorbital margin, and nuchal crest) and one additional trait (zygomatic extension) to develop a validated decision tree for sex assessment. The decision tree was built and cross-validated on a sample of 293 U.S. White individuals from the William M. Bass Donated Skeletal Collection. Ordinal scores from the six traits were analyzed using the partition modeling option in JMP Pro 12. A holdout sample of 50 skulls was used to test the model. The most accurate decision tree includes three variables: glabella, zygomatic extension, and mastoid process. This decision tree yielded 93.5% accuracy on the training sample, 94% on the cross-validated sample, and 96% on a holdout validation sample. Linear weighted kappa statistics indicate acceptable agreement among observers for these variables. Mental eminence should be avoided, and definitions and figures should be referenced carefully to score nonmetric traits. © 2017 American Academy of Forensic Sciences.
Maeda, Shunta; Shimada, Hironori; Sato, Tomoya; Tashiro, Kyoko; Tanaka, Yuki
2017-04-01
Cognitive models of social anxiety posit that there are several types of maladaptive beliefs responsible for persistent social anxiety. Although these beliefs are theoretically important, currently there is no validated measure of these beliefs in Japan. In the present study, we translated into Japanese a well-validated measure of these beliefs, the Self-Beliefs Related to Social Anxiety (SBSA) Scale. The psychometric properties of the scale were also examined in two nonclinical samples ( ns = 401 and 30). Using confirmatory factor analysis, the Japanese version of the SBSA was found to have a correlated three-factor structure that consisted of "conditional beliefs," "unconditional beliefs," and "high standard beliefs." In addition, the Japanese version of the SBSA and its subscales demonstrated good internal consistency reliability and test-retest reliability. The Japanese version of the SBSA also demonstrated good convergent and divergent validity. Future applications of the Japanese version of the SBSA are discussed.
Sexual Sensation Seeking: A Validated Scale for Spanish Gay, Lesbian and Bisexual People.
Gil-Llario, María Dolores; Morell-Mengual, Vicente; Giménez-García, Cristina; Salmerón-Sánchez, Pedro; Ballester-Arnal, Rafael
2018-06-07
Sexual Sensation Seeking has been identified as a main predictor of unsafe sex that particularly affects LGB people. This study adapts and validates the Sexual Sensation Seeking Scale for Spanish LGB people. For this purpose, we tested the factor structure in 1237 people aged 17 to 60 years, 880 self-defined as homosexual and 357 as bisexual. The results support the appropriateness of this scale for Spanish LGB people and identify two factors, explaining 49.91% of the variance: "physical sensations attraction" and "sexual experiences". Our findings reveal optimal levels of internal consistency in the total scale (α = 0.81) and each factor (α = 0.84 and α = 0.71). Additional analyses have demonstrated convergent validity for this scale. Important implications of the validated Sexual Sensation Seeking Scale for Spanish LGB people are discussed, with a view to early detection and preventive interventions for HIV and other sexual health problems.
Bernard, Larry C
2010-04-01
There are few multidimensional measures of individual differences in motivation available. The Assessment of Individual Motives-Questionnaire assesses 15 putative dimensions of motivation. The dimensions are based on evolutionary theory and preliminary evidence suggests the motive scales have good psychometric properties. The scales are reliable and there is evidence of their consensual validity (convergence of self-other ratings) and behavioral validity (relationships with self-other reported behaviors of social importance). Additional validity research is necessary, however, especially with respect to current models of personality. The present study tested two general and 24 specific hypotheses based on proposed evolutionary advantages/disadvantages and fitness benefits/costs of the five-factor model of personality together with the new motive scales in a sample of 424 participants (M age=28.8 yr., SD=14.6). Results were largely supportive of the hypotheses. These results support the validity of new motive dimensions and increase understanding of the five-factor model of personality.
Hotchkiss, David R; Aqil, Anwer; Lippeveld, Theo; Mukooyo, Edward
2010-07-03
Sound policy, resource allocation and day-to-day management decisions in the health sector require timely information from routine health information systems (RHIS). In most low- and middle-income countries, the RHIS is viewed as being inadequate in providing quality data and continuous information that can be used to help improve health system performance. In addition, there is limited evidence on the effectiveness of RHIS strengthening interventions in improving data quality and use. The purpose of this study is to evaluate the usefulness of the newly developed Performance of Routine Information System Management (PRISM) framework, which consists of a conceptual framework and associated data collection and analysis tools to assess, design, strengthen and evaluate RHIS. The specific objectives of the study are: a) to assess the reliability and validity of the PRISM instruments and b) to assess the validity of the PRISM conceptual framework. Facility- and worker-level data were collected from 110 health care facilities in twelve districts in Uganda in 2004 and 2007 using records reviews, structured interviews and self-administered questionnaires. The analysis procedures include Cronbach's alpha to assess internal consistency of selected instruments, test-retest analysis to assess the reliability and sensitivity of the instruments, and bivariate and multivariate statistical techniques to assess validity of the PRISM instruments and conceptual framework. Cronbach's alpha analysis suggests high reliability (0.7 or greater) for the indices measuring a promotion of a culture of information, RHIS tasks self-efficacy and motivation. The study results also suggest that a promotion of a culture of information influences RHIS tasks self-efficacy, RHIS tasks competence and motivation, and that self-efficacy and the presence of RHIS staff have a direct influence on the use of RHIS information, a key aspect of RHIS performance. 
The study results provide some empirical support for the reliability and validity of the PRISM instruments and the validity of the PRISM conceptual framework, suggesting that the PRISM approach can be effectively used by RHIS policy makers and practitioners to assess the RHIS and evaluate RHIS strengthening interventions. However, additional studies with larger sample sizes are needed to further investigate the value of the PRISM instruments in exploring the linkages between RHIS data quality and use, and health systems performance.
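As an illustration of the internal-consistency check named in this abstract, the following is a minimal sketch of Cronbach's alpha, assuming the standard formula α = k/(k−1) · (1 − Σ item variances / variance of total scores). The function name and the toy response data are hypothetical and are not taken from the PRISM study.

```python
from statistics import variance


def cronbach_alpha(rows):
    """Cronbach's alpha for a list of respondents, each a list of k item scores.

    Hypothetical helper for illustration; uses sample variances throughout.
    """
    k = len(rows[0])
    if k < 2:
        raise ValueError("alpha needs at least two items")
    # Variance of each item across respondents.
    item_vars = [variance([r[i] for r in rows]) for i in range(k)]
    # Variance of each respondent's total score.
    total_var = variance([sum(r) for r in rows])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


# Toy data (made up): three items that move together perfectly.
consistent = [[1, 1, 1], [2, 2, 2], [3, 3, 3]]
# Toy data (made up): two items that are unrelated to each other.
unrelated = [[1, 1], [1, 2], [2, 1], [2, 2]]
print(cronbach_alpha(consistent), cronbach_alpha(unrelated))
```

Under the 0.7 threshold quoted in the abstract, an index would count as reliable when this value is 0.7 or greater.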
Holloway, Jamie M; Long, Toby; Biasini, Fred
2018-04-02
This study provides information on how two standardized measures based on different theoretical frameworks can be used in collecting information on motor development and performance in 4- and 5-year-olds with autism spectrum disorder (ASD). The purpose of the study was to determine the concurrent validity of the Miller Function and Participation Scales (M-FUN) with the Peabody Developmental Motor Scales, Second Edition (PDMS-2) in young children with ASD. The gross motor sections of the PDMS-2 and the M-FUN were administered to 22 children with ASD between the ages of 48 and 71 months. Concurrent validity between overall motor scores and agreement in identification of motor delay were assessed. A very strong correlation (Pearson's r =.851) was found between the M-FUN scale scores and the PDMS-2 gross motor quotients (GMQs). Strong agreement in identification of children with average motor skills and delayed motor skills at 1.5 standard deviations below the mean was also found. This study supports the concurrent validity of the M-FUN with the PDMS-2 for young children with ASD. While both tests provide information regarding motor delay, the M-FUN may provide additional information regarding the neurological profile of the child.
Mother-child bonding assessment tools
Perrelli, Jaqueline Galdino Albuquerque; Zambaldi, Carla Fonseca; Cantilino, Amaury; Sougey, Everton Botelho
2014-01-01
Objective: To identify and describe research tools used to evaluate bonding between mother and child up to one year of age, as well as to provide information on reliability and validity measures related to these tools. Data source: Research studies available on PUBMED, LILACS, ScienceDirect, PsycINFO and CINAHL databases with the following descriptors: mother-child relations and mother infant relationship, as well as the expressions validity, reliability and scale. Data synthesis: 23 research studies were selected and fully analyzed. Thirteen evaluation research tools were identified concerning mother and child attachment: seven scales, three questionnaires, two inventories and one observation method. Of all the tools analyzed, the Prenatal Attachment Inventory presented the highest validity and reliability measures for assessing the mother-fetus relationship during pregnancy. Concerning the puerperal period, better consistency coefficients were found for the Maternal Attachment Inventory and the Postpartum Bonding Questionnaire. In addition, the latter showed higher sensitivity in identifying amenable and severe disorders in the affective relations between mother and child. Conclusions: The majority of research tools are reliable for studying the phenomenon presented, although there are some limitations regarding construct and criterion-related validity. In addition, only two of them have been translated into Portuguese and adapted to populations of women and children in Brazil, which represents a decisive gap in scientific production in this area. PMID:25479859
NASA Astrophysics Data System (ADS)
Yepes, Pablo P.; Eley, John G.; Liu, Amy; Mirkovic, Dragan; Randeniya, Sharmalee; Titt, Uwe; Mohan, Radhe
2016-04-01
Monte Carlo (MC) methods are acknowledged as the most accurate technique to calculate dose distributions. However, owing to their lengthy calculation times, they are difficult to utilize in the clinic or for large retrospective studies. Track-repeating algorithms, based on MC-generated particle track data in water, accelerate dose calculations substantially, while essentially preserving the accuracy of MC. In this study, we present the validation of an efficient dose calculation algorithm for intensity modulated proton therapy, the fast dose calculator (FDC), based on a track-repeating technique. We validated the FDC algorithm for 23 patients, which included 7 brain, 6 head-and-neck, 5 lung, 1 spine, 1 pelvis and 3 prostate cases. For validation, we compared FDC-generated dose distributions with those from a full-fledged Monte Carlo based on GEANT4 (G4). We compared dose-volume-histograms, 3D-gamma-indices and analyzed a series of dosimetric indices. More than 99% of the voxels in the voxelized phantoms describing the patients have a gamma-index smaller than unity for the 2%/2 mm criteria. In addition, the difference relative to the prescribed dose between the dosimetric indices calculated with FDC and G4 is less than 1%. FDC reduces the calculation times from 5 ms per proton to around 5 μs.
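To make the 2%/2 mm gamma-index criterion concrete, here is a minimal one-dimensional sketch. It is not the authors' 3D implementation. It assumes a brute-force search over all evaluated points and global normalization to the maximum reference dose, neither of which is stated in the abstract, and all function and parameter names are hypothetical.

```python
import math


def gamma_index_1d(ref, ev, spacing_mm, dose_tol=0.02, dta_mm=2.0, norm=None):
    """Gamma value at every reference point of a 1D dose profile.

    ref, ev: dose samples on the same grid (reference and evaluated).
    norm: dose used for the percentage tolerance; defaults to max(ref),
    which is an assumption made for this sketch.
    """
    norm = max(ref) if norm is None else norm
    dose_crit = dose_tol * norm
    gammas = []
    for i, d_ref in enumerate(ref):
        best = math.inf
        for j, d_ev in enumerate(ev):
            dist = (j - i) * spacing_mm
            # Combine scaled distance and scaled dose difference.
            g = math.hypot(dist / dta_mm, (d_ev - d_ref) / dose_crit)
            best = min(best, g)
        gammas.append(best)
    return gammas


def pass_rate(gammas):
    """Fraction of points with gamma no greater than one."""
    return sum(g <= 1.0 for g in gammas) / len(gammas)


# Toy profiles (made up): the evaluated profile is shifted by one 1 mm voxel.
reference = [0.0, 1.0, 2.0, 1.0, 0.0]
shifted = [0.0, 0.0, 1.0, 2.0, 1.0]
print(pass_rate(gamma_index_1d(reference, shifted, spacing_mm=1.0)))
```

A point passes when some nearby evaluated point agrees closely enough in dose, position, or a combination of both; the abstract's figure of more than 99% refers to the share of voxels that pass in this sense.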
DOT National Transportation Integrated Search
2012-10-05
This study is aimed at better understanding the effects of heavy trucks on the performance of asphalt pavements. As the U.S. freight movements are dominated by truck transportations, the addition of new roadway lane-miles in Region VII have not kept ...
Using a soil moisture and precipitation network for satellite validation
USDA-ARS's Scientific Manuscript database
A long term in situ network for the study of soil moisture and precipitation was deployed in north central Iowa, in cooperation between USDA and NASA. A total of 20 dual precipitation gages were established across a watershed landscape with an area of approximately 600 km2. In addition, four soil mo...
Validity of False Belief Tasks in Blind Children
ERIC Educational Resources Information Center
Brambring, Michael; Asbrock, Doreen
2010-01-01
Previous studies have reported that congenitally blind children without any additional impairment reveal a developmental delay of at least 4 years in perspective taking based on testing first-order false-belief tasks. These authors interpret this delay as a sign of autism-like behavior. However, the delay may be caused by testing blind children…
Getting to Know Your Criterion: Examining College Course Grades and GPAs over Time
ERIC Educational Resources Information Center
Marini, Jessica; Shaw, Emily; Young, Linda; Ewing, Maureen
2018-01-01
This study investigated differences in college grading practices (first-year grade point average and course grades) by student and institutional characteristics and by academic discipline to inform and improve our understanding and use as among the most commonly employed criteria in validity and college readiness research. In addition, trends in…
Motivational Orientations of Senior Citizens Participating in the Elderhostel Program.
ERIC Educational Resources Information Center
Bova, Breda Murphy
A study was conducted (1) to analyze the Educational Participation Scale (EPS) factor patterns derived from a sample of senior citizens in order to contribute additional reliability and validity data to the instrument; and (2) to look at reasons that have influenced senior citizens to pursue educational activities, specifically the Elderhostel…
Genetic susceptibility for Alzheimer disease neuritic plaque pathology.
Shulman, Joshua M; Chen, Kewei; Keenan, Brendan T; Chibnik, Lori B; Fleisher, Adam; Thiyyagura, Pradeep; Roontiva, Auttawut; McCabe, Cristin; Patsopoulos, Nikolaos A; Corneveaux, Jason J; Yu, Lei; Huentelman, Matthew J; Evans, Denis A; Schneider, Julie A; Reiman, Eric M; De Jager, Philip L; Bennett, David A
2013-09-01
While numerous genetic susceptibility loci have been identified for clinical Alzheimer disease (AD), it is important to establish whether these variants are risk factors for the underlying disease pathology, including neuritic plaques. To investigate whether AD susceptibility loci from genome-wide association studies affect neuritic plaque pathology and to additionally identify novel risk loci for this trait. Candidate analysis of single-nucleotide polymorphisms and genome-wide association study in a joint clinicopathologic cohort, including 725 deceased subjects from the Religious Orders Study and the Rush Memory and Aging Project (2 prospective, community-based studies), followed by targeted validation in an independent neuroimaging cohort, including 114 subjects from multiple clinical and research centers. A quantitative measure of neuritic plaque pathologic burden, based on assessments of silver-stained tissue averaged from multiple brain regions. Validation based on β-amyloid load by immunocytochemistry, and replication with fibrillar β-amyloid positron emission tomographic imaging with Pittsburgh Compound B or florbetapir. Besides the previously reported APOE and CR1 loci, we found that the ABCA7 (rs3764650; P = .02) and CD2AP (rs9349407; P = .03) AD susceptibility loci are associated with neuritic plaque burden. In addition, among the top results of our genome-wide association study, we discovered a novel variant near the amyloid precursor protein gene (APP, rs2829887) that is associated with neuritic plaques (P = 3.3 × 10⁻⁶). This polymorphism was associated with postmortem β-amyloid load as well as fibrillar β-amyloid in 2 independent cohorts of adults with normal cognition. These findings enhance understanding of AD risk factors by relating validated susceptibility alleles to increased neuritic plaque pathology and implicate common genetic variation at the APP locus in the earliest, presymptomatic stages of AD.
NASA Astrophysics Data System (ADS)
Harshman, Jordan; Stains, Marilyne
2017-05-01
This study presents a review of 39 studies that provide evidence for the structural validity and internal consistency of the Approaches to Teaching Inventory (ATI). In addition to this review, we evaluate many alternative factor structures on a sample of 267 first- and second-year chemistry faculty members participating in a professional development program, a sample of instructors for which the ATI was originally designed. A total of 26 unique factor structures were evaluated. Through robust checking of assumptions, compilations of existing evidence, and new exploratory and confirmatory analyses, we found that there is greater evidence for the structural validity and internal consistency of the 22-item ATI than of the 16-item ATI. Additionally, evidence supporting the original two-factor and four-factor structures proposed by the ATI authors (focusing on information transmission and conceptual change) was not reproducible, and while alternative models were empirically viable, more theoretical justification is warranted. Recommendations for ATI use and general comments regarding best practices of reporting psychometrics in educational research contexts are discussed.
Schmitter, Marc; Kress, Bodo; Leckel, Michael; Henschel, Volkmar; Ohlmann, Brigitte; Rammelsberg, Peter
2008-06-01
This hypothesis-generating study was performed to determine which items in the Research Diagnostic Criteria for Temporomandibular Disorders (RDC/TMD) and additional diagnostic tests have the best predictive accuracy for joint-related diagnoses. One hundred forty-nine TMD patients and 43 symptom-free subjects were examined in clinical examinations and with magnetic resonance imaging (MRI). The importance of each variable of the clinical examination for correct joint-related diagnosis was assessed by using MRI diagnoses. For this purpose, "random forest" statistical software (based on classification trees) was used. Maximum unassisted jaw opening, maximum assisted jaw opening, history of locked jaw, joint sound with and without compression, joint pain, facial pain, pain on palpation of the lateral pterygoid area, and overjet proved suitable for distinguishing between subtypes of joint-related TMD. Measurement of excursion, protrusion, and midline deviation were less important. The validity of clinical TMD examination procedures can be enhanced by using the 16 variables of greatest importance identified in this study. In addition to other variables, maximum unassisted and assisted opening and a history of locked jaw were important when assessing the status of the TMJ.
DOE Office of Scientific and Technical Information (OSTI.GOV)
English, Shawn A.; Briggs, Timothy M.; Nelson, Stacy M.
Simulations of low velocity impact with a flat cylindrical indenter upon a carbon fiber fabric reinforced polymer laminate are rigorously validated. Comparison of the impact energy absorption between the model and experiment is used as the validation metric. Additionally, non-destructive evaluation, including ultrasonic scans and three-dimensional computed tomography, provides qualitative validation of the models. The simulations include delamination, matrix cracks and fiber breaks. An orthotropic damage and failure constitutive model, capable of predicting progressive damage and failure, is developed in conjunction and described. An ensemble of simulations incorporating model parameter uncertainties is used to predict a response distribution which is then compared to experimental output using appropriate statistical methods. Lastly, the model form errors are exposed and corrected for use in an additional blind validation analysis. The result is a quantifiable confidence in material characterization and model physics when simulating low velocity impact in structures of interest.
Programmable stream prefetch with resource optimization
Boyle, Peter; Christ, Norman; Gara, Alan; Mawhinney, Robert; Ohmacht, Martin; Sugavanam, Krishnan
2013-01-08
A stream prefetch engine performs data retrieval in a parallel computing system. The engine receives a load request from at least one processor. The engine evaluates whether a first memory address requested in the load request is present and valid in a table. The engine checks whether there exists valid data corresponding to the first memory address in an array if the first memory address is present and valid in the table. The engine increments a prefetching depth of a first stream that the first memory address belongs to and fetches a cache line associated with the first memory address from the at least one cache memory device if there is not yet valid data corresponding to the first memory address in the array. The engine determines whether prefetching of additional data is needed for the first stream within its prefetching depth. The engine prefetches the additional data if the prefetching is needed.
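The decision sequence in this abstract can be sketched as a toy software model. This is only an illustration under stated assumptions and not the patented hardware design. The class name, the 128-byte line size, the depth cap, the policy for opening a new stream, and the rule of registering one line beyond the current depth are all hypothetical choices made so that the example runs.

```python
LINE = 128  # assumed cache-line size in bytes (not given in the abstract)


class StreamPrefetcher:
    """Toy model: 'table' ties line addresses to streams, 'array' holds fetched lines."""

    def __init__(self, max_depth=4):
        self.table = {}     # line address -> stream id
        self.array = set()  # line addresses whose data is already held
        self.depth = {}     # stream id -> current prefetching depth
        self.max_depth = max_depth
        self.fetched = []   # order in which lines were fetched, for inspection

    def _fetch(self, line):
        self.array.add(line)
        self.fetched.append(line)

    def load(self, addr):
        line = addr - addr % LINE
        stream = self.table.get(line)
        if stream is None:
            # Address not in the table: open a new stream (assumed policy).
            stream = line
            self.table[line] = stream
            self.depth[stream] = 1
            self._fetch(line)
        elif line not in self.array:
            # Known address without valid data: deepen the stream, fetch the line.
            self.depth[stream] = min(self.depth[stream] + 1, self.max_depth)
            self._fetch(line)
        # Prefetch any lines ahead of this one that fall within the depth.
        depth = self.depth[stream]
        for i in range(1, depth + 1):
            ahead = line + i * LINE
            self.table.setdefault(ahead, stream)
            if ahead not in self.array:
                self._fetch(ahead)
        # Register one further line so a later miss can be tied to this stream.
        self.table.setdefault(line + (depth + 1) * LINE, stream)
        return stream


engine = StreamPrefetcher()
for address in (0, 128, 384):
    engine.load(address)
print(engine.depth[0], engine.fetched)
```

In this model a sequential walk is always served from lines already fetched, while a request that jumps ahead of the prefetched data raises the stream's depth, which is one plausible reading of why the engine ties depth increases to missing data.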
CASL Verification and Validation Plan
DOE Office of Scientific and Technical Information (OSTI.GOV)
Mousseau, Vincent Andrew; Dinh, Nam
2016-06-30
This report documents the Consortium for Advanced Simulation of LWRs (CASL) verification and validation plan. The document builds upon input from CASL subject matter experts, most notably the CASL Challenge Problem Product Integrators, CASL Focus Area leaders, and CASL code development and assessment teams. This document will be a living document that will track progress on CASL to do verification and validation for both the CASL codes (including MPACT, CTF, BISON, MAMBA) and for the CASL challenge problems (CIPS, PCI, DNB). The CASL codes and the CASL challenge problems are at differing levels of maturity with respect to validation and verification. The gap analysis will summarize additional work that needs to be done. Additional VVUQ work will be done as resources permit. This report is prepared for the Department of Energy's (DOE's) CASL program in support of milestone CASL.P13.02.
Cheng, Shu-Fen; Rose, Susan
2009-01-01
This study investigated the technical adequacy of curriculum-based measures of written expression (CBM-W) in terms of writing prompts and scoring methods for deaf and hard-of-hearing students. Twenty-two students at the secondary school-level completed 3-min essays within two weeks, which were scored for nine existing and alternative curriculum-based measurement (CBM) scoring methods. The technical features of the nine scoring methods were examined for interrater reliability, alternate-form reliability, and criterion-related validity. The existing CBM scoring method--number of correct minus incorrect word sequences--yielded the highest reliability and validity coefficients. The findings from this study support the use of the CBM-W as a reliable and valid tool for assessing general writing proficiency with secondary students who are deaf or hard of hearing. The CBM alternative scoring methods that may serve as additional indicators of written expression include correct subject-verb agreements, correct clauses, and correct morphemes.
Fitzsimmons-Craft, Ellen E; Bardone-Cone, Anna M; Harney, Megan B
2012-09-01
We constructed and validated a measure of comparison dimensions associated with eating pathology, namely, the body, eating, and exercise comparison orientation measure (BEECOM). Participants were 441 undergraduate women. In Study 1, items were generated and refined via exploratory factor analysis, yielding three interpretable factors (i.e., body, eating, and exercise comparison orientation). Confirmatory factor analysis was then used to confirm the three-factor structure of the BEECOM and to investigate the potential presence of a higher-order factor. Given that the lower-order factors loaded strongly onto a higher-order factor, it is appropriate to use a total BEECOM score, in addition to subscale scores. Further, the BEECOM's scores yielded evidence of internal consistency and construct validity in this sample. Study 2 demonstrated two-week test-retest reliability of the BEECOM among college women. Overall, the BEECOM demonstrated good psychometric properties and may be useful for more comprehensively assessing eating disorder-related social comparison behavior. Copyright © 2012 Elsevier Ltd. All rights reserved.
ERIC Educational Resources Information Center
de Blignieres-Legeraud, Anne; Bjornavold, Jens; Charraud, Anne-Marie; Gerard, Francoise; Diamanti, Stamatina; Freundlinger, Alfred; Bjerknes, Ellen; Covita, Horacio
A workshop aimed to clarify under what conditions the validation of knowledge gained through experience can be considered a professionalizing factor for European Union teachers and trainers by creating a better link between experience and training and between vocational training and qualifications. Seven papers were presented in addition to an…
ERIC Educational Resources Information Center
Harlen, Wynne
2005-01-01
This paper summarizes the findings of a systematic review of research on the reliability and validity of teachers' assessment used for summative purposes. In addition to the main question, the review also addressed the question "What conditions affect the reliability and validity of teachers' summative assessment?" The initial search for studies…
ERIC Educational Resources Information Center
Institute of Electrical and Electronics Engineers, Inc., New York, NY.
The Institute of Electrical and Electronics Engineers (IEEE) validation program is designed to motivate persons practicing in electrical and electronics engineering to pursue quality technical continuing education courses offered by any responsible sponsor. The rapid acceptance of the validation program necessitated the additional development of a…
Forensic Uncertainty Quantification of Explosive Dispersal of Particles
NASA Astrophysics Data System (ADS)
Hughes, Kyle; Park, Chanyoung; Haftka, Raphael; Kim, Nam-Ho
2017-06-01
In addition to the numerical challenges of simulating the explosive dispersal of particles, validation of the simulation is often plagued with poor knowledge of the experimental conditions. The level of experimental detail required for validation is beyond what is usually included in the literature. This presentation proposes the use of forensic uncertainty quantification (UQ) to investigate validation-quality experiments to discover possible sources of uncertainty that may have been missed in initial design of experiments or under-reported. The current experience of the authors has found that by making an analogy to crime scene investigation when looking at validation experiments, valuable insights may be gained. One examines all the data and documentation provided by the validation experimentalists, corroborates evidence, and quantifies large sources of uncertainty a posteriori with empirical measurements. In addition, it is proposed that forensic UQ may benefit from an independent investigator to help remove possible implicit biases and increase the likelihood of discovering unrecognized uncertainty. Forensic UQ concepts will be discussed and then applied to a set of validation experiments performed at Eglin Air Force Base. This work was supported in part by the U.S. Department of Energy, National Nuclear Security Administration, Advanced Simulation and Computing Program.
Longitudinal construct validity of the minimum data set health status index.
Jones, Aaron; Feeny, David; Costa, Andrew P
2018-05-24
The Minimum Data Set Health Status Index (MDS-HSI) is a generic, preference-based health-related quality of life (HRQOL) measure derived by mapping items from the Resident Assessment Instrument - Minimum Data Set (RAI-MDS) assessment onto the Health Utilities Index Mark 2 classification system. While the validity of the MDS-HSI has been examined in cross-sectional settings, the longitudinal validity has not been explored. The objective of this study was to investigate the longitudinal construct validity of the MDS-HSI in a home care population. This study utilized a retrospective cohort of home care patients in the Hamilton-Niagara-Haldimand-Brant health region of Ontario, Canada with at least two RAI-MDS Home Care assessments between January 2010 and December 2014. Convergent validity was assessed by calculating Spearman rank correlations between the change in MDS-HSI and changes in six validated indices of health domains that can be calculated from the RAI-MDS assessment. Known-groups validity was investigated by fitting multivariable linear regression models to estimate the mean change in MDS-HSI associated with clinically important changes in the six health domain indices and 15 disease symptoms from the RAI-MDS Home Care assessment, controlling for age and sex. The cohort contained 25,182 patients with two RAI-MDS Home Care assessments. Spearman correlations between the MDS-HSI change and changes in the health domain indices were all statistically significant and in the hypothesized small to moderate range [0.1 < ρ < 0.5]. Clinically important changes in all of the health domain indices and 13 of the 15 disease symptoms were significantly associated with clinically important changes in the MDS-HSI. The findings of this study support the longitudinal construct validity of the MDS-HSI in home care populations. 
In addition to evaluating changes in HRQOL among home care patients in clinical research, economic evaluation, and health technology assessment, the MDS-HSI may be used in system-level applications using routinely collected population-level data.
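The convergent-validity step described above (a Spearman rank correlation between two sets of change scores) can be sketched as follows. This is a minimal illustration in standard-library Python, not the authors' analysis code; the function names and the toy scores are hypothetical.

```python
from typing import List, Sequence


def change_scores(baseline: Sequence[float], follow_up: Sequence[float]) -> List[float]:
    """Per-patient change between two assessments (follow-up minus baseline)."""
    return [after - before for before, after in zip(baseline, follow_up)]


def average_ranks(values: Sequence[float]) -> List[float]:
    """Rank values from 1..n, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1  # sorted positions i..j share one rank
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def pearson(x: Sequence[float], y: Sequence[float]) -> float:
    """Pearson correlation of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / (var_x * var_y) ** 0.5


def spearman_rho(x: Sequence[float], y: Sequence[float]) -> float:
    """Spearman rank correlation: Pearson correlation of the ranks."""
    return pearson(average_ranks(x), average_ranks(y))


# Hypothetical toy data: a utility score and a domain index at two time points.
utility_change = change_scores([0.50, 0.62, 0.40, 0.71, 0.55],
                               [0.52, 0.60, 0.47, 0.76, 0.64])
domain_change = change_scores([3, 2, 5, 1, 4], [4, 2, 8, 5, 9])
rho = spearman_rho(utility_change, domain_change)
```

Because the statistic depends only on ranks, it is insensitive to the scale of either index, which is one reason it suits comparisons between instruments scored in different units.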
Cronly, Mark; Behan, P; Foley, B; Malone, E; Earley, S; Gallagher, M; Shearan, P; Regan, L
2010-12-01
A confirmatory method has been developed to allow for the analysis of fourteen prohibited medicinal additives in pig and poultry compound feed. These compounds are prohibited for use as feed additives although some are still authorised for use in medicated feed. Feed samples are extracted by acetonitrile with addition of sodium sulfate. The extracts undergo a hexane wash to aid with sample purification. The extracts are then evaporated to dryness and reconstituted in initial mobile phase. The samples undergo an ultracentrifugation step prior to injection onto the LC-MS/MS system and are analysed in a run time of 26 min. The LC-MS/MS system is run in MRM mode with both positive and negative electrospray ionisation. The method was validated over three days and is capable of quantitatively analysing for metronidazole, dimetridazole, ronidazole, ipronidazole, chloramphenicol, sulfamethazine, dinitolimide, ethopabate, carbadox and clopidol. The method is also capable of qualitatively analysing for sulfadiazine, tylosin, virginiamycin and avilamycin. A level of 100 µg kg⁻¹ was used for validation purposes and the method is capable of analysing to this level for all the compounds. Validation criteria of trueness, precision, repeatability and reproducibility along with measurement uncertainty are calculated for all analytes. Copyright (c) 2010 Elsevier B.V. All rights reserved.
Gamifying Self-Management of Chronic Illnesses: A Mixed-Methods Study
Wills, Gary; Ranchhod, Ashok
2016-01-01
Background: Self-management of chronic illnesses is an ongoing issue in health care research. Gamification is a concept that arose in the field of computer science and has been borrowed by many other disciplines. It is perceived by many that gamification can improve the self-management experience of people with chronic illnesses. This paper discusses the validation of a framework (called The Wheel of Sukr) that was introduced to achieve this goal. Objective: This research aims to (1) discuss a gamification framework targeting the self-management of chronic illnesses and (2) validate the framework by diabetic patients, medical professionals, and game experts. Methods: A mixed-method approach was used to validate the framework. Expert interviews (N=8) were conducted in order to validate the themes of the framework. Additionally, diabetic participants completed a questionnaire (N=42) in order to measure their attitudes toward the themes of the framework. Results: The results provide a validation of the framework. This indicates that gamification might improve the self-management of chronic illnesses, such as diabetes. Namely, the eight themes in the Wheel of Sukr (fun, esteem, socializing, self-management, self-representation, motivation, growth, sustainability) were perceived positively by 71% (30/42) of the participants with P value <.001. Conclusions: In this research, both the interviews and the questionnaire yielded positive results that validate the framework (The Wheel of Sukr). Generally, this study indicates an overall acceptance of the notion of gamification in the self-management of diabetes. PMID:27612632
Gamifying Self-Management of Chronic Illnesses: A Mixed-Methods Study.
AlMarshedi, Alaa; Wills, Gary; Ranchhod, Ashok
2016-09-09
Self-management of chronic illnesses is an ongoing issue in health care research. Gamification is a concept that arose in the field of computer science and has been borrowed by many other disciplines. It is perceived by many that gamification can improve the self-management experience of people with chronic illnesses. This paper discusses the validation of a framework (called The Wheel of Sukr) that was introduced to achieve this goal. This research aims to (1) discuss a gamification framework targeting the self-management of chronic illnesses and (2) validate the framework by diabetic patients, medical professionals, and game experts. A mixed-method approach was used to validate the framework. Expert interviews (N=8) were conducted in order to validate the themes of the framework. Additionally, diabetic participants completed a questionnaire (N=42) in order to measure their attitudes toward the themes of the framework. The results provide a validation of the framework. This indicates that gamification might improve the self-management of chronic illnesses, such as diabetes. Namely, the eight themes in the Wheel of Sukr (fun, esteem, socializing, self-management, self-representation, motivation, growth, sustainability) were perceived positively by 71% (30/42) of the participants with P value <.001. In this research, both the interviews and the questionnaire yielded positive results that validate the framework (The Wheel of Sukr). Generally, this study indicates an overall acceptance of the notion of gamification in the self-management of diabetes.
Singer, Sara; Meterko, Mark; Baker, Laurence; Gaba, David; Falwell, Alyson; Rosen, Amy
2007-10-01
To describe the development of an instrument for assessing workforce perceptions of hospital safety culture and to assess its reliability and validity. Primary data collected between March 2004 and May 2005. Personnel from 105 U.S. hospitals completed a 38-item paper-and-pencil survey. We received 21,496 completed questionnaires, representing a 51 percent response rate. Based on review of existing safety climate surveys, we developed a list of key topics pertinent to maintaining a culture of safety in high-reliability organizations. We developed a draft questionnaire to address these topics and pilot tested it in four preliminary studies of hospital personnel. We modified the questionnaire based on experience and respondent feedback, and distributed the revised version to 42,249 hospital workers. We randomly divided respondents into derivation and validation samples. We applied exploratory factor analysis to responses in the derivation sample. We used those results to create scales in the validation sample, which we subjected to multitrait analysis (MTA). We identified nine constructs: three organizational factors, two unit factors, three individual factors, and one additional factor. Constructs demonstrated substantial convergent and discriminant validity in the MTA. Cronbach's alpha coefficients ranged from 0.50 to 0.89. It is possible to measure key salient features of hospital safety climate using a valid and reliable 38-item survey and appropriate hospital sample sizes. This instrument may be used in further studies to better understand the impact of safety climate on patient safety outcomes.
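For readers unfamiliar with the reliability statistic reported above, Cronbach's alpha for a scale of k items is k/(k-1) multiplied by one minus the ratio of the summed item variances to the variance of the total score. The sketch below is a minimal standard-library illustration with hypothetical responses; it is not the survey's scoring code.

```python
from statistics import variance
from typing import Sequence


def cronbach_alpha(responses: Sequence[Sequence[float]]) -> float:
    """Cronbach's alpha for a respondents-by-items table of scores.

    Each inner sequence holds one respondent's answers to the k items.
    """
    k = len(responses[0])
    if k < 2:
        raise ValueError("alpha needs at least two items")
    # Sample variance of each item across respondents.
    item_variances = [variance([row[i] for row in responses]) for i in range(k)]
    # Sample variance of each respondent's total score.
    total_variance = variance([sum(row) for row in responses])
    return k / (k - 1) * (1 - sum(item_variances) / total_variance)


# Hypothetical toy data: three respondents answering a two-item scale.
toy_alpha = cronbach_alpha([[1, 2], [2, 1], [3, 3]])
```

Alpha rises as items covary more strongly, so a scale at the low end of the reported range (0.50) contains items that track each other only loosely.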
Causal inference with measurement error in outcomes: Bias analysis and estimation methods.
Shu, Di; Yi, Grace Y
2017-01-01
Inverse probability weighting estimation has been popularly used to consistently estimate the average treatment effect. Its validity, however, is challenged by the presence of error-prone variables. In this paper, we explore the inverse probability weighting estimation with mismeasured outcome variables. We study the impact of measurement error for both continuous and discrete outcome variables and reveal interesting consequences of the naive analysis which ignores measurement error. When a continuous outcome variable is mismeasured under an additive measurement error model, the naive analysis may still yield a consistent estimator; when the outcome is binary, we derive the asymptotic bias in a closed-form. Furthermore, we develop consistent estimation procedures for practical scenarios where either validation data or replicates are available. With validation data, we propose an efficient method for estimation of average treatment effect; the efficiency gain is substantial relative to usual methods of using validation data. To provide protection against model misspecification, we further propose a doubly robust estimator which is consistent even when either the treatment model or the outcome model is misspecified. Simulation studies are reported to assess the performance of the proposed methods. An application to a smoking cessation dataset is presented.
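As background for the abstract above, one common (unnormalized) form of the inverse probability weighting estimator of the average treatment effect averages T·Y/e − (1 − T)·Y/(1 − e) over units, where T is the treatment indicator, Y the outcome, and e the propensity score. The sketch below assumes the propensity scores are known and uses hypothetical toy data; it is not the authors' procedure and does not implement their corrected estimators. It also shows that this form is linear in the outcome, so an additive error in Y shifts the estimate by the same estimator applied to the error alone.

```python
from typing import Sequence


def ipw_ate(treatment: Sequence[int],
            outcome: Sequence[float],
            propensity: Sequence[float]) -> float:
    """Inverse-probability-weighted estimate of the average treatment effect.

    treatment[i] is 1 (treated) or 0 (control); propensity[i] is the
    probability of treatment for unit i and is assumed known here.
    """
    if not (len(treatment) == len(outcome) == len(propensity)):
        raise ValueError("inputs must have the same length")
    total = 0.0
    for t, y, e in zip(treatment, outcome, propensity):
        if not 0.0 < e < 1.0:
            raise ValueError("propensity scores must lie strictly between 0 and 1")
        # Treated units are weighted by 1/e, controls by 1/(1 - e).
        total += t * y / e - (1 - t) * y / (1 - e)
    return total / len(treatment)


# Hypothetical toy data with a constant propensity of 0.5.
treatment = [1, 0, 1, 0]
outcome = [3.0, 1.0, 5.0, 2.0]
propensity = [0.5, 0.5, 0.5, 0.5]
error = [0.2, -0.1, -0.2, 0.1]  # illustrative additive measurement error
noisy = [y + u for y, u in zip(outcome, error)]

ate_true = ipw_ate(treatment, outcome, propensity)
ate_noisy = ipw_ate(treatment, noisy, propensity)
ate_error = ipw_ate(treatment, error, propensity)
# Linearity: ate_noisy equals ate_true + ate_error (up to rounding).
```

Under the added assumption that the error has mean zero and is unrelated to treatment and covariates, the error term averages out in large samples, which gives some intuition for why the naive analysis can remain consistent in the continuous additive case.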
Martin, Phillip K; Schroeder, Ryan W; Heinrichs, Robin J; Baade, Lyle E
2015-08-01
Previous research has demonstrated that RBS and FBS-r identify non-credible reporters of cognitive symptoms, but the extent to which these scales might be influenced by true neurocognitive dysfunction has not been previously studied. The present study examined the relationship between these cognitive validity scales and neurocognitive performance across seven domains of cognitive functioning, both before and after controlling for PVT status in 120 individuals referred for neuropsychological evaluations. Variance in RBS, but not FBS-r, was significantly accounted for by neurocognitive test performance across most cognitive domains. After controlling for PVT status, however, relationships between neurocognitive test performance and validity scales were no longer significant for RBS, and remained non-significant for FBS-r. Additionally, PVT failure accounted for a significant proportion of the variance in both RBS and FBS-r. Results support both the convergent and discriminant validity of RBS and FBS-r. As neither scale was impacted by true neurocognitive dysfunction, these findings provide further support for the use of RBS and FBS-r in neuropsychological evaluations. © The Author 2015. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Development and validity of the Emotion and Motivation Self-regulation Questionnaire (EMSR-Q).
Alonso-Tapia, Jesús; Panadero Calderón, Ernesto; Díaz Ruiz, Miguel A
2014-07-15
This study has two objectives: first, to develop and validate the "Emotion and Motivation Self-regulation Questionnaire" (EMSR-Q), and second, to analyze (in the context of the questionnaire validation process) the relationships between self-regulation styles (SRS) rooted in goal orientations, and classroom motivational climate (CMC). A total of 664 Secondary Education students from Madrid (Spain) formed the sample of the study. It was divided randomly into two groups to perform confirmatory factor analysis and to cross-validate the results. Both analyses supported a five first-order factor structure, organized around two second-order factors, "Learning self-regulation style" (LSR) and "Avoidance self-regulation style" (ASR) (χ²/df = 2.71; GFI = .89; IFI = .84; CFI = .84; RMSEA = .07). Hypotheses concerning the relationships between SRS, goal orientations and expectancies are supported by additional correlation and factor analyses. Moreover, several regression analyses supported most of the remaining hypotheses concerning the role of self-regulation styles as predictors of classroom motivational climate (CMC) perception, of change in self-regulation attributed to teacher work, and of students' satisfaction with this same work. Theoretical and practical implications are discussed.
Olt, Helen; Jirwe, Maria; Gustavsson, Petter; Emami, Azita
2010-01-01
The purpose of this study was to describe the translation, adaptation, and psychometric evaluation, in relation to validity and reliability, of the Swedish version of the instrument Inventory for Assessing The Process of Cultural Competence Among Healthcare Professionals-Revised (IAPCC-R). Validity tests were conducted on the response processes (N = 15), the content (N = 7), and the internal structure of the instrument (N = 334). Reliability (alpha = .65 for the total scale, varying between -.01 and .65 for the different subscales) was evaluated in terms of internal consistency. Results indicated weak validity and reliability, though it is difficult to conclude whether this is related to adaptation issues or the original construction. The testing of the response process identified problems in relation to respondents' conceptualization of cultural competence. The test of the content identified a weak correspondence between the items and the underlying model. In addition, a confirmatory factor analysis did not confirm the proposed structure of the instrument. This study concludes that this instrument is not valid and reliable for use with a Swedish population of practicing nurses or nursing students.
Validation of the Practice Environment Scale to the Brazilian culture.
Gasparino, Renata C; Guirardello, Edinêis de B
2017-07-01
To validate the Brazilian version of the Practice Environment Scale. The Practice Environment Scale is a tool that evaluates the presence of characteristics that are favourable for professional nursing practice because a better work environment contributes to positive results for patients, professionals and institutions. Methodological study including 209 nurses. Validity was assessed via a confirmatory factor analysis using structural equation modelling, in which the correlations between the instrument and the following variables were tested: burnout, job satisfaction, safety climate, perception of quality of care and intention to leave the job. Subgroups were compared and the reliability was assessed using Cronbach's alpha and the composite reliability. Factor analysis resulted in exclusion of seven items. Significant correlations were obtained between the subscales and all variables in the study. The reliability was considered acceptable. The Brazilian version of the Practice Environment Scale is a valid and reliable tool used to assess the characteristics that promote professional nursing practice. Use of this tool in Brazilian culture should allow managers to implement changes that contribute to the achievement of better results, in addition to identifying and comparing the environments of health institutions. © 2017 John Wiley & Sons Ltd.
NASA Technical Reports Server (NTRS)
Sebok, Angelia; Wickens, Christopher; Sargent, Robert
2015-01-01
One human factors challenge is predicting operator performance in novel situations. Approaches such as drawing on relevant previous experience, and developing computational models to predict operator performance in complex situations, offer potential methods to address this challenge. A few concerns with modeling operator performance are that models need to be realistic, and they need to be tested empirically and validated. In addition, many existing human performance modeling tools are complex and require that an analyst gain significant experience to be able to develop models for meaningful data collection. This paper describes an effort to address these challenges by developing an easy-to-use model-based tool, using models that were developed from a review of existing human performance literature and targeted experimental studies, and performing an empirical validation of key model predictions.
Personality and job performance: the Big Five revisited.
Hurtz, G M; Donovan, J J
2000-12-01
Prior meta-analyses investigating the relation between the Big 5 personality dimensions and job performance have all contained a threat to construct validity, in that much of the data included within these analyses was not derived from actual Big 5 measures. In addition, these reviews did not address the relations between the Big 5 and contextual performance. Therefore, the present study sought to provide a meta-analytic estimate of the criterion-related validity of explicit Big 5 measures for predicting job performance and contextual performance. The results for job performance closely paralleled 2 of the previous meta-analyses, whereas analyses with contextual performance showed more complex relations among the Big 5 and performance. A more critical interpretation of the Big 5-performance relationship is presented, and suggestions for future research aimed at enhancing the validity of personality predictors are provided.
Liebers, Falk; Brandstädt, Felix; Schust, Marianne; Serafin, Patrick; Schäfer, Andreas; Gebhardt, Hansjürgen; Hartmann, Bernd; Steinberg, Ulf
2017-01-01
Introduction: The impact of work-related musculoskeletal disorders is considerable. The assessment of work tasks with physical workloads is crucial to estimate the work-related health risks of exposed employees. Three key indicator methods (KIMs) are available for risk assessment regarding manual lifting, holding and carrying of loads; manual pulling and pushing of loads; and manual handling operations. Three further KIMs for risk assessment regarding whole-body forces, awkward body postures and body movement have been developed de novo. In addition, the development of a newly drafted combined method for mixed exposures is planned. All methods will be validated regarding face validity, reliability, convergent validity, criterion validity and further aspects of utility under practical conditions. Methods and analysis: As part of the joint project MEGAPHYS (multilevel risk assessment of physical workloads), a mixed-methods study is being designed for the validation of KIMs and conducted in companies of different sizes and branches in Germany. Workplaces are documented and analysed by observations, applying KIMs, interviews and assessment of environmental conditions. Furthermore, a survey among the employees at the respective workplaces takes place with standardised questionnaires, interviews and physical examinations. It is intended to include 1200 employees at 120 different workplaces. For analysis of the quality criteria, recommendations of the COSMIN checklist (COnsensus-based Standards for the selection of health Measurement INstruments) will be taken into account. Ethics and dissemination: The study was planned and conducted in accordance with the German Medical Professional Code and the Declaration of Helsinki as well as the German Federal Data Protection Act. The design of the study was approved by ethics committees. We intend to publish the validated KIMs in 2018. 
Results will be published in peer-reviewed journals, presented at international meetings and disseminated to actual users for practical application. PMID:28827239
Klussmann, Andre; Liebers, Falk; Brandstädt, Felix; Schust, Marianne; Serafin, Patrick; Schäfer, Andreas; Gebhardt, Hansjürgen; Hartmann, Bernd; Steinberg, Ulf
2017-08-21
The impact of work-related musculoskeletal disorders is considerable. The assessment of work tasks with physical workloads is crucial to estimate the work-related health risks of exposed employees. Three key indicator methods are available for risk assessment regarding manual lifting, holding and carrying of loads; manual pulling and pushing of loads; and manual handling operations. Three further KIMs for risk assessment regarding whole-body forces, awkward body postures and body movement have been developed de novo. In addition, the development of a newly drafted combined method for mixed exposures is planned. All methods will be validated regarding face validity, reliability, convergent validity, criterion validity and further aspects of utility under practical conditions. As part of the joint project MEGAPHYS (multilevel risk assessment of physical workloads), a mixed-methods study is being designed for the validation of KIMs and conducted in companies of different sizes and branches in Germany. Workplaces are documented and analysed by observations, applying KIMs, interviews and assessment of environmental conditions. Furthermore, a survey among the employees at the respective workplaces takes place with standardised questionnaires, interviews and physical examinations. It is intended to include 1200 employees at 120 different workplaces. For analysis of the quality criteria, recommendations of the COSMIN checklist (COnsensus-based Standards for the selection of health Measurement INstruments) will be taken into account. The study was planned and conducted in accordance with the German Medical Professional Code and the Declaration of Helsinki as well as the German Federal Data Protection Act. The design of the study was approved by ethics committees. We intend to publish the validated KIMs in 2018. Results will be published in peer-reviewed journals, presented at international meetings and disseminated to actual users for practical application. 
Citera, Maryalice; Freeman, Phyllis R; Horowitz, Richard I
2017-01-01
Lyme disease is spreading worldwide, with multiple Borrelia species causing a broad range of clinical symptoms that mimic other illnesses. A validated Lyme disease screening questionnaire would be clinically useful for both providers and patients. Three studies evaluated such a screening tool, namely the Horowitz Multiple Systemic Infectious Disease Syndrome (MSIDS) Questionnaire. The purpose was to see if the questionnaire could accurately distinguish between Lyme patients and healthy individuals. Study 1 examined the construct validity of the scale examining its factor structure and reliability of the questionnaire among 537 individuals being treated for Lyme disease. Study 2 involved an online sample of 999 participants, who self-identified as either healthy (N=217) or suffering from Lyme now (N=782) who completed the Horowitz MSIDS Questionnaire (HMQ) along with an outdoor activity survey. We examined convergent validity among components of the scale and evaluated discriminant validity with the Big Five personality characteristics. The third study compared a sample of 236 patients with confirmed Lyme disease with an online sample of 568 healthy individuals. Factor analysis results identified six underlying latent dimensions; four of these overlapped with critical symptoms identified by Horowitz - neuropathy, cognitive dysfunction, musculoskeletal pain, and fatigue. The HMQ showed acceptable levels of internal reliability using Cronbach's coefficient alpha and exhibited evidence of convergent and divergent validity. Components of the HMQ correlated more highly with each other than with unrelated traits. The results consistently demonstrated that the HMQ accurately differentiated those with Lyme disease from healthy individuals. Three migratory pain survey items (persistent muscular pain, arthritic pain, and nerve pain/paresthesias) robustly identified individuals with verified Lyme disease. 
The results support the use of the HMQ as a valid, efficient, and low-cost screening tool for medical practitioners to decide if additional testing is warranted to distinguish between Lyme disease and other illnesses.
Citera, Maryalice; Freeman, Phyllis R; Horowitz, Richard I
2017-01-01
Purpose: Lyme disease is spreading worldwide, with multiple Borrelia species causing a broad range of clinical symptoms that mimic other illnesses. A validated Lyme disease screening questionnaire would be clinically useful for both providers and patients. Three studies evaluated such a screening tool, namely the Horowitz Multiple Systemic Infectious Disease Syndrome (MSIDS) Questionnaire. The purpose was to see if the questionnaire could accurately distinguish between Lyme patients and healthy individuals. Methods: Study 1 examined the construct validity of the scale examining its factor structure and reliability of the questionnaire among 537 individuals being treated for Lyme disease. Study 2 involved an online sample of 999 participants, who self-identified as either healthy (N=217) or suffering from Lyme now (N=782) who completed the Horowitz MSIDS Questionnaire (HMQ) along with an outdoor activity survey. We examined convergent validity among components of the scale and evaluated discriminant validity with the Big Five personality characteristics. The third study compared a sample of 236 patients with confirmed Lyme disease with an online sample of 568 healthy individuals. Results: Factor analysis results identified six underlying latent dimensions; four of these overlapped with critical symptoms identified by Horowitz – neuropathy, cognitive dysfunction, musculoskeletal pain, and fatigue. The HMQ showed acceptable levels of internal reliability using Cronbach’s coefficient alpha and exhibited evidence of convergent and divergent validity. Components of the HMQ correlated more highly with each other than with unrelated traits. Discussion: The results consistently demonstrated that the HMQ accurately differentiated those with Lyme disease from healthy individuals. Three migratory pain survey items (persistent muscular pain, arthritic pain, and nerve pain/paresthesias) robustly identified individuals with verified Lyme disease. 
The results support the use of the HMQ as a valid, efficient, and low-cost screening tool for medical practitioners to decide if additional testing is warranted to distinguish between Lyme disease and other illnesses. PMID:28919803
Garcia-Lopez, Luis J; Inglés, Cándido J; García-Fernández, José M; Hidalgo, María D; Bermejo, Rosa; Puklek Levpušček, Melita
2011-01-01
This study examined the reliability and validity evidence drawn from the scores of the Spanish version of the Slovenian-developed Social Anxiety Scale for Adolescents (SASA; Puklek, 1997; Puklek & Vidmar, 2000) using a community sample (Study 1) and a clinical sample (Study 2). Confirmatory factor analysis in Study 1 replicated the 2-factor structure found by the original authors in a sample of Slovenian adolescents. Test-retest reliability was adequate. Furthermore, the SASA correlated significantly with other social anxiety scales, supporting concurrent validity evidence in Spanish adolescents. The results of Study 2 confirmed the correlations between the SASA and other social anxiety measures in a clinical sample. In addition, findings revealed that the SASA can effectively discriminate between adolescents with a clinical diagnosis of social anxiety disorder (SAD) and those without this disorder. Finally, cut-off scores for the SASA are provided for Spanish adolescents.
[Development of the Japanese version of the Self-Compassionate Reactions Inventory].
Miyagawa, Yuki; Taniguchi, Junichi
2016-04-01
Self-compassion is defined as being compassionate towards the self in times of suffering, and is composed of the following three components: self-kindness, common humanity, and mindfulness. This article reports the development of the Japanese version of the Self-Compassionate Reactions Inventory (SCRI-J). The SCRI-J measures self-compassion based on the degree to which people choose self-compassionate reactions to 8 hypothetical hardships. Study 1 (N = 179) showed that the SCRI-J had sufficient internal consistency. In terms of its validity, results showed a positive correlation between the SCRI-J and the Japanese version of the Self-Compassion Scale, supporting its concurrent validity. In addition, the SCRI-J was positively correlated with self-esteem and negatively correlated with psychological stress responses. Moreover, the association between the SCRI-J and stress responses remained significant when the effect of self-esteem was removed. In Study 2 (N = 90), the SCRI-J demonstrated high test-retest reliability over 3 weeks. Overall, the present study indicates that the SCRI-J has sufficient reliability and validity as a new scale for self-compassion.
Developmental validation of the PowerPlex(®) Fusion 6C System.
Ensenberger, Martin G; Lenz, Kristy A; Matthies, Learden K; Hadinoto, Gregory M; Schienman, John E; Przech, Angela J; Morganti, Michael W; Renstrom, Daniel T; Baker, Victoria M; Gawrys, Kori M; Hoogendoorn, Marlijn; Steffen, Carolyn R; Martín, Pablo; Alonso, Antonio; Olson, Hope R; Sprecher, Cynthia J; Storts, Douglas R
2016-03-01
The PowerPlex(®) Fusion 6C System is a 27-locus, six-dye multiplex that includes all markers in the expanded CODIS core loci and increases overlap with STR database standards throughout the world. Additionally, it contains two rapidly mutating Y-STRs and is capable of both casework and database workflows, including direct amplification. A multi-laboratory developmental validation study was performed on the PowerPlex(®) Fusion 6C System. Here, we report the results of that study, which followed SWGDAM guidelines and includes data for: species specificity, sensitivity, stability, precision, reproducibility and repeatability, case-type samples, concordance, stutter, DNA mixtures, and PCR-based procedures. Where appropriate we report data from both extracted DNA samples and direct amplification samples from various substrates and collection devices. Samples from all studies were separated on both Applied Biosystems 3500 series and 6-dye capable 3130 series Genetic Analyzers and data is reported for each. Together, the data validate the design and demonstrate the performance of the PowerPlex(®) Fusion 6C System. Copyright © 2015 The Authors. Published by Elsevier Ireland Ltd. All rights reserved.
Xu, Hui; Tracey, Terence J G
2017-03-01
The current study developed an abbreviated version of the Career Indecision Profile-65 (CIP-65; Hacker, Carr, Abrams, & Brown, 2013) by using item response theory. In order to improve the efficiency of the CIP-65 in measuring career indecision, the individual item performance of the CIP-65 was examined with respect to the ordering of response occurrence and gender differential item functioning. The best 5 items of each scale of the CIP-65 (i.e., neuroticism/negative affectivity, choice/commitment anxiety, lack of readiness, and interpersonal conflicts) were retained in the CIP-Short using a sample of 588 college students. A validation sample (N = 174) supported the reliability and structural validity of the CIP-Short. The convergent and divergent validity of the CIP-Short was additionally supported in the findings of a hypothesized differential relational pattern in a separate sample (N = 360). While the current study supported the CIP-Short being a sound brief measure of career indecision, the limitations of this study and suggestions for future research were discussed as well. (PsycINFO Database Record (c) 2017 APA, all rights reserved).
Jiang, Jian; James, Christopher A; Wong, Philip
2016-09-05
An LC-MS/MS method has been developed and validated for the determination of glycine in human cerebrospinal fluid (CSF). The validated method used artificial cerebrospinal fluid as a surrogate matrix for calibration standards. The calibration curve range for the assay was 100-10,000 ng/mL and ¹³C₂,¹⁵N-glycine was used as an internal standard (IS). Pre-validation experiments were performed to demonstrate parallelism with surrogate matrix and standard addition methods. The mean endogenous glycine concentration in a pooled human CSF determined on three days by using artificial CSF as a surrogate matrix and the method of standard addition was found to be 748±30.6 and 768±18.1 ng/mL, respectively. A percentage difference of -2.6% indicated that artificial CSF could be used as a surrogate calibration matrix for the determination of glycine in human CSF. Quality control (QC) samples, except the lower limit of quantitation (LLOQ) QC and low QC samples, were prepared by spiking glycine into aliquots of pooled human CSF sample. The low QC sample was prepared from a separate pooled human CSF sample containing low endogenous glycine concentrations, while the LLOQ QC sample was prepared in artificial CSF. Standard addition was used extensively to evaluate matrix effects during validation. The validated method was used to determine the endogenous glycine concentrations in human CSF samples. Incurred sample reanalysis demonstrated reproducibility of the method. Copyright © 2016 Elsevier B.V. All rights reserved.
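The -2.6% figure in this abstract is consistent with a relative difference between the two reported means. The following is a minimal sketch of that arithmetic, assuming (the abstract does not say so) that the standard-addition mean serves as the reference value; the function name is illustrative only.

```python
def percent_difference(measured: float, reference: float) -> float:
    """Relative difference of `measured` against `reference`, in percent."""
    return (measured - reference) / reference * 100.0

# Mean endogenous glycine concentrations quoted in the abstract (ng/mL).
surrogate_matrix_mean = 748.0   # calibration in artificial CSF
standard_addition_mean = 768.0  # method of standard addition

# Assumption: the standard-addition result is treated as the reference.
diff = percent_difference(surrogate_matrix_mean, standard_addition_mean)
print(f"{diff:.1f}%")  # prints "-2.6%"
```

Taking the surrogate-matrix mean as the reference instead would give a positive value of similar size, so the sign convention shown here is only one plausible reading.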
Soler, Joaquim; Franquesa, Alba; Feliu-Soler, Albert; Cebolla, Ausias; García-Campayo, Javier; Tejedor, Rosa; Demarzo, Marcelo; Baños, Rosa; Pascual, Juan Carlos; Portella, Maria J
2014-11-01
Decentering is defined as the ability to observe one's thoughts and feelings in a detached manner. The Experiences Questionnaire (EQ) is a self-report instrument that originally assessed decentering and rumination. The purpose of this study was to evaluate the psychometric properties of the Spanish version of EQ-Decentering and to explore its clinical usefulness. The 11-item EQ-Decentering subscale was translated into Spanish and psychometric properties were examined in a sample of 921 adult individuals, 231 with psychiatric disorders and 690 without. The subsample of nonpsychiatric participants was also split according to their previous meditative experience (meditative participants, n=341; and nonmeditative participants, n=349). Additionally, differences among these three subgroups were explored to determine clinical validity of the scale. Finally, EQ-Decentering was administered twice in a group of patients with borderline personality disorder, before and after a 10-week mindfulness intervention. Confirmatory factor analysis indicated acceptable model fit, sbχ(2)=243.8836 (p<.001), CFI=.939, GFI=.936, SRMR=.040, and RMSEA=.06 (.060-.077), and psychometric properties were found to be satisfactory (reliability: Cronbach's α=.893; convergent validity: r>.46; and divergent validity: r<-.35). The scale detected changes in decentering after a 10-session intervention in mindfulness (t=-4.692, p<.00001). Differences among groups were significant (F=134.8, p<.000001), where psychiatric participants showed the lowest scores compared to nonpsychiatric meditative and nonmeditative participants. The Spanish version of the EQ-Decentering is a valid and reliable instrument to assess decentering in both clinical and nonclinical samples. In addition, the findings show that EQ-Decentering seems an adequate outcome instrument to detect changes after mindfulness-based interventions. Copyright © 2014. Published by Elsevier Ltd.
Lin, Chung-Ying; Pakpour, Amir H
2017-02-01
The problems of mood disorders are critical in people with epilepsy. Therefore, there is a need to validate a useful tool for the population. The Hospital Anxiety and Depression Scale (HADS) has been used in this population and has been shown to be a satisfactory screening tool. However, more evidence on its construct validity is needed. A total of 1041 people with epilepsy were recruited in this study, and each completed the HADS. Confirmatory factor analysis (CFA) and Rasch analysis were used to understand the construct validity of the HADS. In addition, internal consistency was tested using Cronbach's α, person separation reliability, and item separation reliability. Ordering of the response descriptors and the differential item functioning (DIF) were examined using the Rasch models. The HADS showed that 55.3% of our participants had anxiety; 56.0% had depression based on its cutoffs. CFA and Rasch analyses both showed the satisfactory construct validity of the HADS; the internal consistency was also acceptable (α=0.82 in anxiety and 0.79 in depression; person separation reliability=0.82 in anxiety and 0.73 in depression; item separation reliability=0.98 in anxiety and 0.91 in depression). The difficulties of the four-point Likert scale used in the HADS were monotonically increased, which indicates no disordering response categories. No DIF items across male and female patients and across types of epilepsy were displayed in the HADS. The HADS has promising psychometric properties on construct validity in people with epilepsy. Moreover, the additive item score is supported for calculating the cutoff. Copyright © 2016 British Epilepsy Association. Published by Elsevier Ltd. All rights reserved.
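Several records in this listing report Cronbach's α as the internal-consistency statistic. As a hedged illustration of how that coefficient is usually computed (the standard formula based on item variances and the variance of the summed score), the sketch below uses only the Python standard library. The response matrix is hypothetical and is not data from any study listed here.

```python
from statistics import variance

def cronbach_alpha(rows):
    """Cronbach's alpha for a list of respondents, each a list of item scores."""
    n_items = len(rows[0])
    items = list(zip(*rows))                            # one tuple per item
    item_var_sum = sum(variance(col) for col in items)  # sum of item variances
    total_var = variance([sum(r) for r in rows])        # variance of summed scores
    return n_items / (n_items - 1) * (1 - item_var_sum / total_var)

# Hypothetical responses: 6 respondents x 4 items scored 0-3 (NOT study data).
example = [
    [0, 1, 0, 1],
    [1, 1, 1, 2],
    [2, 2, 1, 2],
    [2, 3, 2, 3],
    [3, 3, 3, 3],
    [1, 0, 1, 1],
]
alpha = cronbach_alpha(example)
print(round(alpha, 2))
```

When every item gives identical scores the function returns exactly 1, which is a convenient sanity check on the formula.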
Schmid, R; Eschen, A; Rüegger-Frey, B; Martin, M
2013-06-01
There is growing evidence that individuals with cognitive impairment and dementia require systematic assessment of needs for the selection of optimal treatments. Currently no valid instrument is applicable for illness-related need assessment in this growing population. The purpose of this study was to develop and validate a new instrument ("Bedürfnisinventar bei Gedächtnisstörungen", BIG-65) that systematically assesses illness-related needs. The development was based on an adequate theoretical framework and standardised procedural guidelines and validated in an appropriate sample of individuals attending a Swiss memory clinic (n = 83). The BIG-65 provides a comprehensive range of biopsychosocial and environmental needs items and offers a dementia-friendly structure for the assessment of illness-related needs. The BIG-65 has high face validity and very high test-retest reliability (rtt = 0.916). On average 3.5 (SD = 3.7) unmet needs were assessed. Most frequently mentioned needs were: "forget less" (50%), "better concentration" (23.2%), "information on illness" (20.7%), "information on treatments" (17.1%), "less worry", "less irritable", "improve mood", "improve orientation" (13.4% each). Needs profiles differed between patients with preclinical (subjective cognitive impairment, mild cognitive impairment) and clinical (dementia) diagnosis. The BIG-65 reliably assesses illness-related needs in individuals with moderate dementia. With decreasing cognitive functions or an MMSE <20 points, additional methods such as observation of the emotional expression may be applied. According to our results, individuals with cognitive impairment and dementia pursue individual strategies to stabilize their quality of life level. In addition to the assessment of objective illness symptoms the selection of optimal treatments may profit from a systematic needs assessment to optimally support patients in their individual quality of life strategies.
Rowland, Jared A; Miskey, Holly M; Brearly, Timothy W; Martindale, Sarah L; Shura, Robert D
2017-05-01
The current study addressed two aims: (i) determine how Word Memory Test (WMT) performance relates to test performance across numerous cognitive domains and (ii) evaluate how current psychiatric disorders or mild traumatic brain injury (mTBI) history affects performance on the WMT after excluding participants with poor symptom validity. Participants were 235 Iraq and Afghanistan-era veterans (Mage = 35.5) who completed a comprehensive neuropsychological battery. Participants were divided into two groups based on WMT performance (Pass = 193, Fail = 42). Tests were grouped into cognitive domains and an average z-score was calculated for each domain. Significant differences were found between those who passed and those who failed the WMT on the memory, attention, executive function, and motor output domain z-scores. WMT failure was associated with a larger performance decrement in the memory domain than the sensation or visuospatial-construction domains. Participants with a current psychiatric diagnosis or mTBI history were significantly more likely to fail the WMT, even after removing participants with poor symptom validity. Results suggest that the WMT is most appropriate for assessing validity in the domains of attention, executive function, motor output and memory, with little relationship to performance in domains of sensation or visuospatial-construction. Comprehensive cognitive batteries would benefit from inclusion of additional performance validity tests in these domains. Additionally, symptom validity did not explain higher rates of WMT failure in individuals with a current psychiatric diagnosis or mTBI history. Further research is needed to better understand how these conditions may affect WMT performance. Published by Oxford University Press 2016. This work is written by (a) US Government employee(s) and is in the public domain in the US.
2011-01-01
Background The SF-12v2 is the improved version of the SF-12v1. This study aimed to validate the SF-12v2 in Iran. Methods A random sample of the general population aged 18 years and over living in Tehran, Iran completed the instrument. Reliability was estimated using internal consistency and validity was assessed using known-groups comparison and convergent validity. In addition the factor structure of the questionnaire was extracted by performing both exploratory and confirmatory factor analyses (EFA and CFA). Results In all, 3685 individuals were studied (1887 male and 1798 female). Internal consistency for both summary measures was satisfactory. Cronbach's α for the Physical Component Summary (PCS-12) was 0.87 and for the Mental Component Summary (MCS-12) it was 0.82. Known-groups comparison showed that the SF-12v2 discriminated well between men and women and those who differed in age and educational status (P < 0.05). Furthermore, as hypothesized the physical functioning, role physical, bodily pain and general health subscales correlated higher with the PCS-12, while the vitality, social functioning, role emotional and mental health subscales correlated higher with the MCS-12. Finally the exploratory factor analysis indicated a two-factor structure (physical and mental health) that jointly accounted for 59.9% of the variance. The confirmatory factor analysis also indicated a good fit to the data for the two-latent structure (physical and mental health). Conclusion Although the findings could not be generalized to the Iranian population, overall the findings suggest that the SF-12v2 is a reliable and valid measure of health related quality of life among Iranians and now could be used in future health outcome studies. However, further studies are recommended to establish its stability, responsiveness to change, and concurrent validity for this health survey in Iran. PMID:21385359
Montazeri, Ali; Vahdaninia, Mariam; Mousavi, Sayed Javad; Asadi-Lari, Mohsen; Omidvari, Sepideh; Tavousi, Mahmoud
2011-03-07
The SF-12v2 is the improved version of the SF-12v1. This study aimed to validate the SF-12v2 in Iran. A random sample of the general population aged 18 years and over living in Tehran, Iran completed the instrument. Reliability was estimated using internal consistency and validity was assessed using known-groups comparison and convergent validity. In addition the factor structure of the questionnaire was extracted by performing both exploratory and confirmatory factor analyses (EFA and CFA). In all, 3685 individuals were studied (1887 male and 1798 female). Internal consistency for both summary measures was satisfactory. Cronbach's α for the Physical Component Summary (PCS-12) was 0.87 and for the Mental Component Summary (MCS-12) it was 0.82. Known-groups comparison showed that the SF-12v2 discriminated well between men and women and those who differed in age and educational status (P < 0.05). Furthermore, as hypothesized the physical functioning, role physical, bodily pain and general health subscales correlated higher with the PCS-12, while the vitality, social functioning, role emotional and mental health subscales correlated higher with the MCS-12. Finally the exploratory factor analysis indicated a two-factor structure (physical and mental health) that jointly accounted for 59.9% of the variance. The confirmatory factor analysis also indicated a good fit to the data for the two-latent structure (physical and mental health). Although the findings could not be generalized to the Iranian population, overall the findings suggest that the SF-12v2 is a reliable and valid measure of health related quality of life among Iranians and now could be used in future health outcome studies. However, further studies are recommended to establish its stability, responsiveness to change, and concurrent validity for this health survey in Iran.
Seçkin, Gül
2011-12-01
This article presents initial psychometric validation of an instrument developed to measure cyber informational and decisional empowerment. The article provides preliminary insights into the extent to which cyber patients view the digital environment of peer-based information and support as a resource for informed and empowered participation in self health care management. Data come from cancer patients (N = 350) who participated in the Study of Virtual Health Networks for Cancer Patients of the 21st Century. Data were first analyzed using exploratory factor analysis with principal component extraction and Varimax rotation. Age-based split-sample analysis (≥ 51 and ≤ 50) was performed on a subsample, which consisted of only women (N = 255), in order to cross-validate psychometric data obtained from the full sample. A confirmatory factor analysis was conducted using AMOS 19.0 to further validate the scale. The composite scale is unidimensional with excellent internal consistency reliability. The highest average scores were obtained for informational empowerment items. The lowest average was for the item that measured empowerment to seek second opinion from additional health care professionals. The ability of this composite measure to provide information about the extent to which computer-connected patients view digital peer support as an empowerment tool makes it a valuable addition to the literature in health informatics, supportive cancer care, and health quality of life research.
Reduction of bias and variance for evaluation of computer-aided diagnostic schemes.
Li, Qiang; Doi, Kunio
2006-04-01
Computer-aided diagnostic (CAD) schemes have been developed to assist radiologists in detecting various lesions in medical images. In addition to the development, an equally important problem is the reliable evaluation of the performance levels of various CAD schemes. It is good to see that more and more investigators are employing more reliable evaluation methods such as leave-one-out and cross validation, instead of less reliable methods such as resubstitution, for assessing their CAD schemes. However, the common applications of leave-one-out and cross-validation evaluation methods do not necessarily imply that the estimated performance levels are accurate and precise. Pitfalls often occur in the use of leave-one-out and cross-validation evaluation methods, and they lead to unreliable estimation of performance levels. In this study, we first identified a number of typical pitfalls for the evaluation of CAD schemes, and conducted a Monte Carlo simulation experiment for each of the pitfalls to demonstrate quantitatively the extent of bias and/or variance caused by the pitfall. Our experimental results indicate that considerable bias and variance may exist in the estimated performance levels of CAD schemes if one employs various flawed leave-one-out and cross-validation evaluation methods. In addition, for promoting and utilizing a high standard for reliable evaluation of CAD schemes, we attempt to make recommendations, whenever possible, for overcoming these pitfalls. We believe that, with the recommended evaluation methods, we can considerably reduce the bias and variance in the estimated performance levels of CAD schemes.
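The abstract does not enumerate its pitfalls, so the following is only an illustrative sketch of one widely discussed flaw: selecting features on the full dataset before running leave-one-out. It is a small Monte Carlo experiment on pure-noise data, where the honest accuracy should sit near chance. The nearest-centroid classifier, the mean-difference feature ranking, and all sizes and names are assumptions made for illustration, not the authors' experimental design.

```python
import random

def centroid_predict(train_x, train_y, x, features):
    """Nearest-centroid prediction restricted to the given feature indices."""
    dist = {}
    for label in (0, 1):
        rows = [r for r, y in zip(train_x, train_y) if y == label]
        centroid = [sum(r[f] for r in rows) / len(rows) for f in features]
        dist[label] = sum((x[f] - c) ** 2 for f, c in zip(features, centroid))
    return 0 if dist[0] <= dist[1] else 1

def top_features(xs, ys, k):
    """Rank features by absolute difference in class means; keep the top k."""
    def score(f):
        m0 = [r[f] for r, y in zip(xs, ys) if y == 0]
        m1 = [r[f] for r, y in zip(xs, ys) if y == 1]
        return abs(sum(m0) / len(m0) - sum(m1) / len(m1))
    return sorted(range(len(xs[0])), key=score, reverse=True)[:k]

def loo_accuracy(xs, ys, k, select_inside_fold):
    """Leave-one-out accuracy with feature selection inside or outside the loop."""
    global_feats = top_features(xs, ys, k)  # sees ALL cases, including the test case
    correct = 0
    for i in range(len(xs)):
        train_x, train_y = xs[:i] + xs[i + 1:], ys[:i] + ys[i + 1:]
        feats = top_features(train_x, train_y, k) if select_inside_fold else global_feats
        correct += centroid_predict(train_x, train_y, xs[i], feats) == ys[i]
    return correct / len(xs)

def simulate(trials=5, n=30, p=200, k=5, seed=0):
    """Average both estimates over several pure-noise datasets."""
    rng = random.Random(seed)
    flawed = proper = 0.0
    for _ in range(trials):
        xs = [[rng.gauss(0, 1) for _ in range(p)] for _ in range(n)]
        ys = [i % 2 for i in range(n)]  # balanced labels, unrelated to the features
        flawed += loo_accuracy(xs, ys, k, select_inside_fold=False)
        proper += loo_accuracy(xs, ys, k, select_inside_fold=True)
    return flawed / trials, proper / trials

flawed_acc, proper_acc = simulate()
print(f"selection outside LOO: {flawed_acc:.2f}, inside LOO: {proper_acc:.2f}")
```

Because the labels carry no signal, any accuracy well above 0.5 in the first figure reflects optimistic bias from letting the held-out case influence feature selection; repeating the selection inside each fold removes that leak.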
A new class of enhanced kinetic sampling methods for building Markov state models
NASA Astrophysics Data System (ADS)
Bhoutekar, Arti; Ghosh, Susmita; Bhattacharya, Swati; Chatterjee, Abhijit
2017-10-01
Markov state models (MSMs) and other related kinetic network models are frequently used to study the long-timescale dynamical behavior of biomolecular and materials systems. MSMs are often constructed bottom-up using brute-force molecular dynamics (MD) simulations when the model contains a large number of states and kinetic pathways that are not known a priori. However, the resulting network generally encompasses only parts of the configurational space, and regardless of any additional MD performed, several states and pathways will still remain missing. This implies that the duration for which the MSM can faithfully capture the true dynamics, which we term as the validity time for the MSM, is always finite and unfortunately much shorter than the MD time invested to construct the model. A general framework that relates the kinetic uncertainty in the model to the validity time, missing states and pathways, network topology, and statistical sampling is presented. Performing additional calculations for frequently-sampled states/pathways may not alter the MSM validity time. A new class of enhanced kinetic sampling techniques is introduced that aims at targeting rare states/pathways that contribute most to the uncertainty so that the validity time is boosted in an effective manner. Examples including straightforward 1D energy landscapes, lattice models, and biomolecular systems are provided to illustrate the application of the method. Developments presented here will be of interest to the kinetic Monte Carlo community as well.
Martins, Danielly da Fonte Carvalho; Florindo, Lorena Coimbra; Machado, Anna Karolina Mouzer da Silva; Todeschini, Vítor; Sangoi, Maximiliano da Silva
2017-11-01
This study presents the development and validation of UV spectrophotometric methods for the determination of pinaverium bromide (PB) in tablet assay and dissolution studies. The methods were satisfactorily validated according to International Conference on Harmonization guidelines. The response was linear (r² > 0.99) in the concentration ranges of 2-14 μg/mL at 213 nm and 10-70 μg/mL at 243 nm. The LOD and LOQ were 0.39 and 1.31 μg/mL, respectively, at 213 nm. For the 243 nm method, the LOD and LOQ were 2.93 and 9.77 μg/mL, respectively. Precision was evaluated by RSD, and the obtained results were lower than 2%. Adequate accuracy was also obtained. The methods proved to be robust using a full factorial design evaluation. For PB dissolution studies, the best conditions were achieved using a United States Pharmacopeia Dissolution Apparatus 2 (paddle) at 50 rpm and with 900 mL 0.1 M hydrochloric acid as the dissolution medium, presenting satisfactory results during the validation tests. In addition, the kinetic parameters of drug release were investigated using model-dependent methods, and the dissolution profiles were best described by the first-order model. Therefore, the proposed methods were successfully applied for the assay and dissolution analysis of PB in commercial tablets.
Rosenblum, Sara; Frisch, Carmit; Deutsh-Castel, Tsofia; Josman, Naomi
2015-01-01
Children with attention-deficit hyperactivity disorder (ADHD) often present with activities of daily living (ADL) performance deficits. This study aimed to compare the performance characteristics of children with ADHD to those of controls based on the Do-Eat assessment tool, and to establish the tool's validity. Participants were 23 children with ADHD and 24 matched controls, aged 6-9 years. In addition to the Do-Eat, the Children Activity Scale-Parent (ChAS-P) and the Behavioral Rating Inventory of Executive Function (BRIEF) were used to measure sensorimotor abilities and executive function (EF). Significant differences were found in the Do-Eat scores between children with ADHD and controls. Significant moderate correlations were found between the Do-Eat sensorimotor scores, the ChAS-P and the BRIEF scores in the ADHD group. Significant correlations were found between performance on the Do-Eat and the ChAS-P questionnaire scores, verifying the tool's ecological validity. A single discriminant function described primarily by four Do-Eat variables, correctly classified 95.5% of the study participants into their respective study groups, establishing the tool's predictive validity within this population. These preliminary findings indicate that the Do-Eat may serve as a reliable and valid tool that provides insight into the daily functioning characteristics of children with ADHD. However, further research on larger samples is indicated.
Validation of sterilizing grade filtration.
Jornitz, M W; Meltzer, T H
2003-01-01
Validation consideration of sterilizing grade filters, namely 0.2 micron, changed when FDA voiced concerns about the validity of Bacterial Challenge tests performed in the past. Such validation exercises are nowadays considered to be filter qualification. Filter validation requires more thorough analysis, especially Bacterial Challenge testing with the actual drug product under process conditions. To do so, viability testing is a necessity to determine the Bacterial Challenge test methodology. In addition to these two compulsory tests, other evaluations such as extractable, adsorption and chemical compatibility tests should be considered. PDA Technical Report # 26, Sterilizing Filtration of Liquids, describes all parameters and aspects required for the comprehensive validation of filters. The report is a most helpful tool for validation of liquid filters used in the biopharmaceutical industry. It sets the cornerstones of validation requirements and other filtration considerations.
PRN 96-1: Tolerance Enforcement Methods - Independent Laboratory Validation by Petitioner
This notice is intended to clarify the requirements for submission of an Independent Laboratory Validation to accompany new pesticide analytical methods and does not contain additional data requirements. This notice supersedes PR Notice 88-5.
Validation study of the in vitro skin irritation test with the LabCyte EPI-MODEL24.
Kojima, Hajime; Ando, Yoko; Idehara, Kenji; Katoh, Masakazu; Kosaka, Tadashi; Miyaoka, Etsuyoshi; Shinoda, Shinsuke; Suzuki, Tamie; Yamaguchi, Yoshihiro; Yoshimura, Isao; Yuasa, Atsuko; Watanabe, Yukihiko; Omori, Takashi
2012-03-01
A validation study on an in vitro skin irritation assay was performed with the reconstructed human epidermis (RhE) LabCyte EPI-MODEL24, developed by Japan Tissue Engineering Co. Ltd (Gamagori, Japan). The protocol that was followed in the current study was an optimised version of the EpiSkin protocol (LabCyte assay). According to the United Nations Globally Harmonised System (UN GHS) of classification for assessing the skin irritation potential of a chemical, 12 irritants and 13 non-irritants were validated by a minimum of six laboratories from the Japanese Society for Alternatives to Animal Experiments (JSAAE) skin irritation assay validation study management team (VMT). The 25 chemicals were listed in the European Centre for the Validation of Alternative Methods (ECVAM) performance standards. The reconstructed tissues were exposed to the chemicals for 15 minutes and incubated for 42 hours in fresh culture medium. Subsequently, the level of interleukin-1 alpha (IL-1α) present in the conditioned medium was measured, and tissue viability was assessed by using the MTT assay. The results of the MTT assay obtained with the LabCyte EPI-MODEL24 (LabCyte MTT assay) demonstrated high within-laboratory and between-laboratory reproducibility, as well as high accuracy for use as a stand-alone assay to distinguish skin irritants from non-irritants. In addition, the IL-1α release measurements in the LabCyte assay were clearly unnecessary for the success of this model in the classification of chemicals for skin irritation potential. 2012 FRAME.
Favato, Giampiero; Noikokyris, Emmanouil; Vecchiato, Riccardo
2017-01-25
Sexually transmitted infection with high-risk, oncogenic strains of human papillomavirus (HPV) still induces a relevant burden of diseases on both men and women. Although vaccines appear to be highly efficacious in preventing the infection of the most common high-risk strains (HPV 6, 11, 16, 18), important questions regarding the appropriate target population for prophylactic vaccination are still debated. Models in the extant literature seem to converge on the cost-effectiveness of high coverage (>80%) of a single cohort of 12-year-old girls. This vaccination strategy should provide an adequate level of indirect protection (herd immunity) to the unvaccinated boys. This argument presupposes the ecological validity of the cost-effectiveness models; the implicit condition that the characteristics of the individuals and the sexual behaviours observed in the models are generalisable to the natural behaviours of the population. The primary aim of this review is to test the ecological validity of the cost-effectiveness models of universal HPV vaccination available in the literature. The ecological validity of each model will be defined by the number of representative characteristics and behaviours taken into consideration. Nine bibliographic databases will be searched: MEDLINE (via PubMed); Scopus; Science Direct; EMBASE via OVID SP, Web of Science, DARE, NHIR EED and HTA (via NHIR CRD); and CINHAL Plus. An additional search for grey literature will be conducted on Google Scholar and Open Grey. A search strategy will be developed for each of the databases. Data will be extracted following a pre-determined spreadsheet and then clustered and prioritised: the main outcomes will report the inputs to the demographic and epidemiological model, while additional outcomes will refer to basic inputs to the cost-effectiveness valuation.
Each study included in the review will be scored by the number of representative characteristics and behaviours taken into consideration (yes or no) on both dimensions. Individual studies' scores will be plotted in a 2 by 2 matrix: studies included in the upper right quadrant will be defined as ecologically valid, since both individuals' characteristics and their sexual behaviours are representative. The proposed systematic review will be the first to assess the ecological validity of cost-effectiveness studies. In the context of sexually transmitted diseases, when this condition is violated, an error in predicting the protective impact of herd immunity would occur. Hence, a vaccination policy informed by ecologically invalid models would potentially expose boys to a residual risk of contracting HPV-induced malignancies. PROSPERO CRD42016034145.
Tozzoli, Rosangela; Maugliani, Antonella; Michelacci, Valeria; Minelli, Fabio; Caprioli, Alfredo; Morabito, Stefano
2018-05-08
In 2006, the European Committee for Standardisation (CEN)/Technical Committee 275 - Food analysis - Horizontal methods/Working Group 6 - Microbiology of the food chain (TC275/WG6), launched the project of validating the method ISO 16654:2001 for the detection of Escherichia coli O157 in foodstuff by the evaluation of its performance, in terms of sensitivity and specificity, through collaborative studies. Previously, a validation study had been conducted to assess the performance of the Method No 164 developed by the Nordic Committee for Food Analysis (NMKL), which aims at detecting E. coli O157 in food as well, and is based on a procedure equivalent to that of the ISO 16654:2001 standard. Therefore, CEN established that the validation data obtained for the NMKL Method 164 could be exploited for the ISO 16654:2001 validation project, integrated with new data obtained through two additional interlaboratory studies on milk and sprouts, run in the framework of the CEN mandate No. M381. The ISO 16654:2001 validation project was led by the European Union Reference Laboratory for Escherichia coli including VTEC (EURL-VTEC), which organized the collaborative validation study on milk in 2012 with 15 participating laboratories and that on sprouts in 2014, with 14 participating laboratories. In both studies, a total of 24 samples were tested by each laboratory. Test materials were spiked with different concentrations of E. coli O157 and the 24 samples corresponded to eight replicates of three levels of contamination: zero, low and high spiking level. The results submitted by the participating laboratories were analyzed to evaluate the sensitivity and specificity of the ISO 16654:2001 method when applied to milk and sprouts. The performance characteristics calculated on the data of the collaborative validation studies run under the CEN mandate No. M381 returned sensitivity and specificity of 100% and 94.4%, respectively for the milk study.
For the sprouts matrix, sensitivity was 75.9% in samples with a low level of contamination and 96.4% in samples spiked with a high level of E. coli O157, and specificity was 99.1%. Copyright © 2018 Elsevier B.V. All rights reserved.
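Sensitivity and specificity in a qualitative detection study of this kind are typically the fraction of spiked samples reported positive and the fraction of blank samples reported negative. The sketch below shows that calculation; the counts are hypothetical placeholders chosen for illustration, since the abstract reports only percentages, and the function names are not taken from the standard.

```python
def sensitivity(true_pos: int, false_neg: int) -> float:
    """Fraction of truly contaminated samples that were detected."""
    return true_pos / (true_pos + false_neg)

def specificity(true_neg: int, false_pos: int) -> float:
    """Fraction of uncontaminated samples that were reported negative."""
    return true_neg / (true_neg + false_pos)

# Hypothetical counts for one contamination level (NOT the study's data).
spiked_detected, spiked_missed = 45, 5
blank_negative, blank_positive = 48, 2

print(f"sensitivity = {sensitivity(spiked_detected, spiked_missed):.1%}")  # 90.0%
print(f"specificity = {specificity(blank_negative, blank_positive):.1%}")  # 96.0%
```

Computing sensitivity separately for each spiking level, as the abstract does for sprouts, only requires calling the function once per level with that level's counts.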
2017-01-01
Background: The Information Assessment Method (IAM) allows clinicians to report the cognitive impact, clinical relevance, intention to use, and expected patient health benefits associated with clinical information received by email. More than 15,000 Canadian physicians and pharmacists use the IAM in continuing education programs. In addition, information providers can use IAM ratings and feedback comments from clinicians to improve their products. Objective: Our general objective was to validate the IAM questionnaire for the delivery of educational material (ecological and logical content validity). Our specific objectives were to measure the relevance and evaluate the representativeness of IAM items for assessing information received by email. Methods: A 3-part mixed methods study was conducted (convergent design). In part 1 (quantitative longitudinal study), the relevance of IAM items was measured. Participants were 5596 physician members of the Canadian Medical Association who used the IAM. A total of 234,196 ratings were collected in 2012. The relevance of IAM items with respect to their main construct was calculated using descriptive statistics (relevance ratio R). In part 2 (qualitative descriptive study), the representativeness of IAM items was evaluated. A total of 15 family physicians completed semistructured face-to-face interviews. For each construct, we evaluated the representativeness of IAM items using a deductive-inductive thematic qualitative data analysis. In part 3 (mixing quantitative and qualitative parts), results from quantitative and qualitative analyses were reviewed, juxtaposed in a table, discussed with experts, and integrated. Thus, our final results are derived from the views of users (ecological content validation) and experts (logical content validation). Results: Of the 23 IAM items, 21 were validated for content, while 2 were removed.
In part 1 (quantitative results), 21 items were deemed relevant, while 2 items were deemed not relevant (R=4.86% [N=234,196] and R=3.04% [n=45,394], respectively). In part 2 (qualitative results), 22 items were deemed representative, while 1 item was not representative. In part 3 (mixing quantitative and qualitative results), the content validity of 21 items was confirmed, and the 2 nonrelevant items were excluded. A fully validated version was generated (IAM-v2014). Conclusions: This study produced a content validated IAM questionnaire that is used by clinicians and information providers to assess the clinical information delivered in continuing education programs. PMID:28292738
Irvine, Karen-Amanda; Ferguson, Adam R.; Mitchell, Kathleen D.; Beattie, Stephanie B.; Lin, Amity; Stuck, Ellen D.; Huie, J. Russell; Nielson, Jessica L.; Talbott, Jason F.; Inoue, Tomoo; Beattie, Michael S.; Bresnahan, Jacqueline C.
2014-01-01
The IBB scale is a recently developed forelimb scale for the assessment of fine control of the forelimb and digits after cervical spinal cord injury [SCI; (1)]. The present paper describes the assessment of inter-rater reliability and face, concurrent and construct validity of this scale following SCI. It demonstrates that the IBB is a reliable and valid scale that is sensitive to severity of SCI and to recovery over time. In addition, the IBB correlates with other outcome measures and is highly predictive of biological measures of tissue pathology. Multivariate analysis using principal component analysis (PCA) demonstrates that the IBB is highly predictive of the syndromic outcome after SCI (2), and is among the best predictors of bio-behavioral function, based on strong construct validity. Altogether, the data suggest that the IBB, especially in concert with other measures, is a reliable and valid tool for assessing neurological deficits in fine motor control of the distal forelimb, and represents a powerful addition to multivariate outcome batteries aimed at documenting recovery of function after cervical SCI in rats. PMID:25071704