validation set consisting: Topics by Science.gov

Sample records for validation set consisting

The Utrecht questionnaire (U-CEP) measuring knowledge on clinical epidemiology proved to be valid.

PubMed

Kortekaas, Marlous F; Bartelink, Marie-Louise E L; de Groot, Esther; Korving, Helen; de Wit, Niek J; Grobbee, Diederick E; Hoes, Arno W

2017-02-01

Knowledge on clinical epidemiology is crucial to practice evidence-based medicine. We describe the development and validation of the Utrecht questionnaire on knowledge on Clinical epidemiology for Evidence-based Practice (U-CEP); an assessment tool to be used in the training of clinicians. The U-CEP was developed in two formats: two sets of 25 questions and a combined set of 50. The validation was performed among postgraduate general practice (GP) trainees, hospital trainees, GP supervisors, and experts. Internal consistency, internal reliability (item-total correlation), item discrimination index, item difficulty, content validity, construct validity, responsiveness, test-retest reliability, and feasibility were assessed. The questionnaire was externally validated. Internal consistency was good with a Cronbach alpha of 0.8. The median item-total correlation and mean item discrimination index were satisfactory. Both sets were perceived as relevant to clinical practice. Construct validity was good. Both sets were responsive but failed on test-retest reliability. One set took 24 minutes and the other 33 minutes to complete, on average. External GP trainees had comparable results. The U-CEP is a valid questionnaire to assess knowledge on clinical epidemiology, which is a prerequisite for practicing evidence-based medicine in daily clinical practice. Copyright © 2016 Elsevier Inc. All rights reserved.
NASA Ocean Altimeter Pathfinder Project. Report 2; Data Set Validation

NASA Technical Reports Server (NTRS)

Koblinsky, C. J.; Ray, Richard D.; Beckley, Brian D.; Bremmer, Anita; Tsaoussi, Lucia S.; Wang, Yan-Ming

1999-01-01

The NOAA/NASA Pathfinder program was created by the Earth Observing System (EOS) Program Office to determine how existing satellite-based data sets can be processed and used to study global change. The data sets are designed to be long time-series data processed with stable calibration and community consensus algorithms to better assist the research community. The Ocean Altimeter Pathfinder Project involves the reprocessing of all altimeter observations with a consistent set of improved algorithms, based on the results from TOPEX/POSEIDON (T/P), into easy-to-use data sets for the oceanographic community for climate research. Details are currently presented in two technical reports: Report# 1: Data Processing Handbook Report #2: Data Set Validation This report describes the validation of the data sets against a global network of high quality tide gauge measurements and provides an estimate of the error budget. The first report describes the processing schemes used to produce the geodetic consistent data set comprised of SEASAT, GEOSAT, ERS-1, TOPEX/ POSEIDON, and ERS-2 satellite observations.
An Experimental Study of the Internal Consistency of Judgments Made in Bookmark Standard Setting

ERIC Educational Resources Information Center

Clauser, Brian E.; Baldwin, Peter; Margolis, Melissa J.; Mee, Janet; Winward, Marcia

2017-01-01

Validating performance standards is challenging and complex. Because of the difficulties associated with collecting evidence related to external criteria, validity arguments rely heavily on evidence related to internal criteria--especially evidence that expert judgments are internally consistent. Given its importance, it is somewhat surprising…
An Exploratory Factor Analysis and Construct Validity of the Resident Choice Assessment Scale with Paid Carers of Adults with Intellectual Disabilities and Challenging Behavior in Community Settings

ERIC Educational Resources Information Center

Ratti, Victoria; Vickerstaff, Victoria; Crabtree, Jason; Hassiotis, Angela

2017-01-01

Introduction: The Resident Choice Assessment Scale (RCAS) is used to assess choice availability for adults with intellectual disabilities (ID). The aim of the study was to explore the factor structure, construct validity, and internal consistency of the measure in community settings to further validate this tool. Method: 108 paid carers of adults…
American Alcohol Photo Stimuli (AAPS): A standardized set of alcohol and matched non-alcohol images.

PubMed

Stauffer, Christopher S; Dobberteen, Lily; Woolley, Joshua D

2017-11-01

Photographic stimuli are commonly used to assess cue reactivity in the research and treatment of alcohol use disorder. The stimuli used are often non-standardized, not properly validated, and poorly controlled. There are no previously published, validated, American-relevant sets of alcohol images created in a standardized fashion. We aimed to: 1) make available a standardized, matched set of photographic alcohol and non-alcohol beverage stimuli, 2) establish face validity, the extent to which the stimuli are subjectively viewed as what they are purported to be, and 3) establish construct validity, the degree to which a test measures what it claims to be measuring. We produced a standardized set of 36 images consisting of American alcohol and non-alcohol beverages matched for basic color, form, and complexity. A total of 178 participants (95 male, 82 female, 1 genderqueer) rated each image for appetitiveness. An arrow-probe task, in which matched pairs were categorized after being presented for 200 ms, assessed face validity. Criteria for construct validity were met if variation in AUDIT scores were associated with variation in performance on tasks during alcohol image presentation. Overall, images were categorized with >90% accuracy. Participants' AUDIT scores correlated significantly with alcohol "want" and "like" ratings [r(176) = 0.27, p = <0.001; r(176) = 0.36, p = <0.001] and arrow-probe latency [r(176) = -0.22, p = 0.004], but not with non-alcohol outcomes. Furthermore, appetitive ratings and arrow-probe latency for alcohol, but not non-alcohol, differed significantly for heavy versus light drinkers. Our image set provides valid and reliable alcohol stimuli for both explicit and implicit tests of cue reactivity. The use of standardized, validated, reliable image sets may improve consistency across research and treatment paradigms.
The Drug Abuse Screening Test preserves its excellent psychometric properties in psychiatric patients evaluated in an emergency setting.

PubMed

Giguère, Charles-Édouard; Potvin, Stéphane

2017-01-01

Substance use disorders (SUDs) are significant risk factors for psychiatric relapses and hospitalizations in psychiatric populations. Unfortunately, no instrument has been validated for the screening of SUDs in psychiatric emergency settings. The Drug Abuse Screening Test (DAST) is widely used in the addiction field, but is has not been validated in that particular context. The objective of the current study is to examine the psychometric properties of the DAST administered to psychiatric populations evaluated in an emergency setting. The DAST was administered to 912 psychiatric patients in an emergency setting, of which 119 had a SUD (excluding those misusing alcohol only). The internal consistency, the construct validity, the test-retest reliability and the predictive validity (using SUD diagnoses) of the DAST were examined. The convergent validity was also examined, using a validated impulsivity scale. Regarding the internal consistency of the DAST, the Cronbach's alpha was 0.88. The confirmatory factor analysis showed that the DAST has one underlying factor. The test-retest reliability analysis produced a correlation coefficient of 0.86. ROC curve analyses produced an area under the curve of 0.799. Interestingly, a sex effect was observed. Finally, the convergent validity analysis showed that the DAST total score is specifically correlated with the sensation seeking dimension of impulsivity. The results of this validation study shows that the DAST preserves its excellent psychometric properties in psychiatric populations evaluated in an emergency setting. These results should encourage the use of the DAST in this unstable clinical situation. Copyright © 2016 Elsevier Ltd. All rights reserved.
Older adult mistreatment risk screening: contribution to the validation of a screening tool in a domestic setting.

PubMed

Lindenbach, Jeannette M; Larocque, Sylvie; Lavoie, Anne-Marise; Garceau, Marie-Luce

2012-06-01

ABSTRACTThe hidden nature of older adult mistreatment renders its detection in the domestic setting particularly challenging. A validated screening instrument that can provide a systematic assessment of risk factors can facilitate this detection. One such instrument, the "expanded Indicators of Abuse" tool, has been previously validated in the Hebrew language in a hospital setting. The present study has contributed to the validation of the "e-IOA" in an English-speaking community setting in Ontario, Canada. It consisted of two phases: (a) a content validity review and adaptation of the instrument by experts throughout Ontario, and (b) an inter-rater reliability assessment by home visiting nurses. The adaptation, the "Mistreatment of Older Adult Risk Factors" tool, offers a comprehensive tool for screening in the home setting. This instrument is significant to professional practice as practitioners working with older adults will be better equipped to assess for risk of mistreatment.
An Application-Based Discussion of Construct Validity and Internal Consistency Reliability.

ERIC Educational Resources Information Center

Taylor, Dianne L.; Campbell, Kathleen T.

Several techniques for conducting studies of measurement integrity are explained and illustrated using a heuristic data set from a study of teachers' participation in decision making (D. L. Taylor, 1991). The sample consisted of 637 teachers. It is emphasized that validity and reliability are characteristics of data, and do not inure to tests as…
Development and validation of the Australian version of the Birth Satisfaction Scale-Revised (BSS-R).

PubMed

Jefford, Elaine; Hollins Martin, Caroline J; Martin, Colin R

2018-02-01

The 10-item Birth Satisfaction Scale-Revised (BSS-R) has recently been endorsed by international expert consensus for global use as the birth satisfaction outcome measure of choice. English-language versions of the tool include validated UK and US versions; however, the instrument has not, to date, been contextualised and validated in an Australian English-language version. The current investigation sought to develop and validate an English-language version of the tool for use within the Australian context. A two-stage study. Following review and modification by expert panel, the Australian BSS-R (A-BSS-R) was (Stage 1) evaluated for factor structure, internal consistency, known-groups discriminant validity and divergent validity. Stage 2 directly compared the A-BSS-R data set with the original UK data set to determine the invariance characteristics of the new instrument. Participants were a purposive sample of Australian postnatal women (n = 198). The A-BSS-R offered a good fit to data consistent with the BSS-R tridimensional measurement model and was found to be conceptually and measurement equivalent to the UK version. The A-BSS-R demonstrated excellent known-groups discriminant validity, generally good divergent validity and overall good internal consistency. The A-BSS-R represents a robust and valid measure of the birth satisfaction concept suitable for use within Australia and appropriate for application to International comparative studies.
A mixed methods approach to adapting and evaluating the functional assessment of HIV infection (FAHI), Swahili version, for use with low literacy populations.

PubMed

Nyongesa, Moses K; Sigilai, Antipa; Hassan, Amin S; Thoya, Janet; Odhiambo, Rachael; Van de Vijver, Fons J R; Newton, Charles R J C; Abubakar, Amina

2017-01-01

Despite bearing the largest HIV-related burden, little is known of the Health-Related Quality of Life (HRQoL) among people living with HIV in sub-Saharan Africa. One of the factors contributing to this gap in knowledge is the lack of culturally adapted and validated measures of HRQoL that are relevant for this setting. We set out to adapt the Functional Assessment of HIV Infection (FAHI) Questionnaire, an HIV-specific measure of HRQoL, and evaluate its internal consistency and validity. The three phase mixed-methods study took place in a rural setting at the Kenyan Coast. Phase one involved a scoping review to describe the evidence base of the reliability and validity of FAHI as well as the geographical contexts in which it has been administered. Phase two involved in-depth interviews (n = 38) to explore the content validity, and initial piloting for face validation of the adapted FAHI. Phase three was quantitative (n = 103) and evaluated the internal consistency, convergent and construct validities of the adapted interviewer-administered questionnaire. In the first phase of the study, we identified 16 studies that have used the FAHI. Most (82%) were conducted in North America. Only seven (44%) of the reviewed studies reported on the psychometric properties of the FAHI. In the second phase, most of the participants (37 out of 38) reported satisfaction with word clarity and content coverage whereas 34 (89%) reported satisfaction with relevance of the items, confirming the face validity of the adapted questionnaire during initial piloting. Our participants indicated that HIV impacted on their physical, functional, emotional, and social wellbeing. Their responses overlapped with items in four of the five subscales of the FAHI Questionnaire establishing its content validity. In the third phase, the internal consistency of the scale was found to be satisfactory with subscale Cronbach's α ranging from 0.55 to 0.78. The construct and convergent validity of the tool were supported by acceptable factor loadings for most of the items on the respective sub-scales and confirmation of expected significant correlations of the FAHI subscale scores with scores of a measure of common mental disorders. The adapted interviewer-administered Swahili version of FAHI questionnaire showed initial strong evidence of good psychometric properties with satisfactory internal consistency and acceptable validity (content, face, and convergent validity). It gives impetus for further validation work, especially construct validity, in similar settings before it can be used for research and clinical purposes in the entire East African region.
Validation of tsunami inundation model TUNA-RP using OAR-PMEL-135 benchmark problem set

NASA Astrophysics Data System (ADS)

Koh, H. L.; Teh, S. Y.; Tan, W. K.; Kh'ng, X. Y.

2017-05-01

A standard set of benchmark problems, known as OAR-PMEL-135, is developed by the US National Tsunami Hazard Mitigation Program for tsunami inundation model validation. Any tsunami inundation model must be tested for its accuracy and capability using this standard set of benchmark problems before it can be gainfully used for inundation simulation. The authors have previously developed an in-house tsunami inundation model known as TUNA-RP. This inundation model solves the two-dimensional nonlinear shallow water equations coupled with a wet-dry moving boundary algorithm. This paper presents the validation of TUNA-RP against the solutions provided in the OAR-PMEL-135 benchmark problem set. This benchmark validation testing shows that TUNA-RP can indeed perform inundation simulation with accuracy consistent with that in the tested benchmark problem set.
Systematic Screening at the Middle School Level: Score Reliability and Validity of the Student Risk Screening Scale

ERIC Educational Resources Information Center

Lane, Kathleen Lynne; Parks, Robin J.; Kalberg, Jemma Robertson; Carter, Erik W.

2007-01-01

This article presents findings of two studies, one conducted with middle school students (n = 500) in a rural setting and a second conducted with middle school students (n = 528) in an urban setting, of the reliability and validity of the "Student Risk Screening Scale" (SRSS; Drummond, 1994). Results revealed high internal consistency, test-retest…
Reliability and Validity of a Procedure To Measure Diagnostic Reasoning and Problem-Solving Skills Taught in Predoctoral Orthodontic Education.

ERIC Educational Resources Information Center

Albanese, Mark A.; Jacobs, Richard M.

Preliminary psychometric data assessing the reliability and validity of a method used to measure the diagnostic reasoning and problem-solving skills of predoctoral students in orthodontia are described. The measurement approach consisted of sets of patient demographic data and dental photos and x-rays, accompanied by a set of 33 multiple-choice…
A mixed methods approach to adapting and evaluating the functional assessment of HIV infection (FAHI), Swahili version, for use with low literacy populations

PubMed Central

Sigilai, Antipa; Hassan, Amin S.; Thoya, Janet; Odhiambo, Rachael; Van de Vijver, Fons J. R.; Newton, Charles R. J. C.; Abubakar, Amina

2017-01-01

Background Despite bearing the largest HIV-related burden, little is known of the Health-Related Quality of Life (HRQoL) among people living with HIV in sub-Saharan Africa. One of the factors contributing to this gap in knowledge is the lack of culturally adapted and validated measures of HRQoL that are relevant for this setting. Aims We set out to adapt the Functional Assessment of HIV Infection (FAHI) Questionnaire, an HIV-specific measure of HRQoL, and evaluate its internal consistency and validity. Methods The three phase mixed-methods study took place in a rural setting at the Kenyan Coast. Phase one involved a scoping review to describe the evidence base of the reliability and validity of FAHI as well as the geographical contexts in which it has been administered. Phase two involved in-depth interviews (n = 38) to explore the content validity, and initial piloting for face validation of the adapted FAHI. Phase three was quantitative (n = 103) and evaluated the internal consistency, convergent and construct validities of the adapted interviewer-administered questionnaire. Results In the first phase of the study, we identified 16 studies that have used the FAHI. Most (82%) were conducted in North America. Only seven (44%) of the reviewed studies reported on the psychometric properties of the FAHI. In the second phase, most of the participants (37 out of 38) reported satisfaction with word clarity and content coverage whereas 34 (89%) reported satisfaction with relevance of the items, confirming the face validity of the adapted questionnaire during initial piloting. Our participants indicated that HIV impacted on their physical, functional, emotional, and social wellbeing. Their responses overlapped with items in four of the five subscales of the FAHI Questionnaire establishing its content validity. In the third phase, the internal consistency of the scale was found to be satisfactory with subscale Cronbach’s α ranging from 0.55 to 0.78. The construct and convergent validity of the tool were supported by acceptable factor loadings for most of the items on the respective sub-scales and confirmation of expected significant correlations of the FAHI subscale scores with scores of a measure of common mental disorders. Conclusion The adapted interviewer-administered Swahili version of FAHI questionnaire showed initial strong evidence of good psychometric properties with satisfactory internal consistency and acceptable validity (content, face, and convergent validity). It gives impetus for further validation work, especially construct validity, in similar settings before it can be used for research and clinical purposes in the entire East African region. PMID:28380073
Ultrasonic inspection of a glued laminated timber fabricated with defects

Treesearch

Robert Emerson; David Pollock; David McLean; Kenneth Fridley; Robert Ross; Roy Pellerin

2001-01-01

The Federal Highway Administration (FHWA) set up a validation test to compare the effectiveness of various nondestructive inspection techniques for detecting artificial defects in glulam members. The validation test consisted of a glulam beam fabricated with artificial defects known to FHWA personnel but not originally known to the scientists performing the validation...
Validation of the Spanish version of the Index of Spouse Abuse.

PubMed

Plazaola-Castaño, Juncal; Ruiz-Pérez, Isabel; Escribà-Agüir, Vicenta; Jiménez-Martín, Juan Manuel; Hernández-Torres, Elisa

2009-04-01

Partner violence against women is a major public health problem. Although there are currently a number of validated screening and diagnostic tools that can be used to evaluate this type of violence, such tools are not available in Spain. The aim of this study is to analyze the validity and reliability of the Spanish version of the Index of Spouse Abuse (ISA). A cross-sectional study was carried out in 2005 in two health centers in Granada, Spain, in 390 women between 18 and 70 years old. Analyses of the factorial structure, internal consistency, test-retest reliability, and construct validity were conducted. Cutoff points for each subscale were also defined. For the construct validity analysis, the SF-36 perceived general health dimension, the Rosenberg Self-Esteem Scale and the Goldberg 12-item General Health Questionnaire were included. The psychometric analysis shows that the instrument has good internal consistency, reproducibility, and construct validity. The scale is useful for the analysis of partner violence against women in both a research setting and a healthcare setting.
The 11-item Medication Adherence Reasons Scale: reliability and factorial validity among patients with hypertension in Malaysian primary healthcare settings

PubMed Central

Shima, Razatul; Farizah, Hairi; Majid, Hazreen Abdul

2015-01-01

INTRODUCTION The aim of this study was to assess the reliability and validity of a modified Malaysian version of the Medication Adherence Reasons Scale (MAR-Scale). METHODS In this cross-sectional study, the 15-item MAR-Scale was administered to 665 patients with hypertension who attended one of the four government primary healthcare clinics in the Hulu Langat and Klang districts of Selangor, Malaysia, between early December 2012 and end-March 2013. The construct validity was examined in two phases. Phase I consisted of translation of the MAR-Scale from English to Malay, a content validity check by an expert panel, a face validity check via a small preliminary test among patients with hypertension, and exploratory factor analysis (EFA). Phase II involved internal consistency reliability calculations and confirmatory factor analysis (CFA). RESULTS EFA verified five existing factors that were previously identified (i.e. issues with medication management, multiple medications, belief in medication, medication availability, and the patient’s forgetfulness and convenience), while CFA extracted four factors (medication availability issues were not extracted). The final modified MAR-Scale model, which had 11 items and a four-factor structure, provided good evidence of convergent and discriminant validities. Cronbach’s alpha coefficient was > 0.7, indicating good internal consistency of the items in the construct. The results suggest that the modified MAR-Scale has good internal consistencies and construct validity. CONCLUSION The validated modified MAR-Scale (Malaysian version) was found to be suitable for use among patients with hypertension receiving treatment in primary healthcare settings. However, the comprehensive measurement of other factors that can also lead to non-adherence requires further exploration. PMID:25902719
The 11-item Medication Adherence Reasons Scale: reliability and factorial validity among patients with hypertension in Malaysian primary healthcare settings.

PubMed

Shima, Razatul; Farizah, Hairi; Majid, Hazreen Abdul

2015-08-01

The aim of this study was to assess the reliability and validity of a modified Malaysian version of the Medication Adherence Reasons Scale (MAR-Scale). In this cross-sectional study, the 15-item MAR-Scale was administered to 665 patients with hypertension who attended one of the four government primary healthcare clinics in the Hulu Langat and Klang districts of Selangor, Malaysia, between early December 2012 and end-March 2013. The construct validity was examined in two phases. Phase I consisted of translation of the MAR-Scale from English to Malay, a content validity check by an expert panel, a face validity check via a small preliminary test among patients with hypertension, and exploratory factor analysis (EFA). Phase II involved internal consistency reliability calculations and confirmatory factor analysis (CFA). EFA verified five existing factors that were previously identified (i.e. issues with medication management, multiple medications, belief in medication, medication availability, and the patient's forgetfulness and convenience), while CFA extracted four factors (medication availability issues were not extracted). The final modified MAR-Scale model, which had 11 items and a four-factor structure, provided good evidence of convergent and discriminant validities. Cronbach's alpha coefficient was > 0.7, indicating good internal consistency of the items in the construct. The results suggest that the modified MAR-Scale has good internal consistencies and construct validity. The validated modified MAR-Scale (Malaysian version) was found to be suitable for use among patients with hypertension receiving treatment in primary healthcare settings. However, the comprehensive measurement of other factors that can also lead to non-adherence requires further exploration.
Design and content validation of a set of SMS to promote seeking of specialized mental health care within the Allillanchu Project.

PubMed

Toyama, M; Diez-Canseco, F; Busse, P; Del Mastro, I; Miranda, J J

2018-01-01

The aim of this study was to design and develop a set of, short message service (SMS) to promote specialized mental health care seeking within the framework of the Allillanchu Project. The design phase consisted of 39 interviews with potential recipients of the SMS, about use of cellphones, and perceptions and motivations towards seeking mental health care. After the data collection, the research team developed a set of seven SMS for validation. The content validation phase consisted of 24 interviews. The participants answered questions regarding their understanding of the SMS contents and rated its appeal. The seven SMS subjected to content validation were tailored to the recipient using their name. The reminder message included the working hours of the psychology service at the patient's health center. The motivational messages addressed perceived barriers and benefits when seeking mental health services. The average appeal score of the seven SMS was 9.0 (SD±0.4) of 10 points. Participants did not make significant suggestions to change the wording of the messages. Five SMS were chosen to be used. This approach is likely to be applicable to other similar low-resource settings, and the methodology used can be adapted to develop SMS for other chronic conditions.
Improving machine learning reproducibility in genetic association studies with proportional instance cross validation (PICV).

PubMed

Piette, Elizabeth R; Moore, Jason H

2018-01-01

Machine learning methods and conventions are increasingly employed for the analysis of large, complex biomedical data sets, including genome-wide association studies (GWAS). Reproducibility of machine learning analyses of GWAS can be hampered by biological and statistical factors, particularly so for the investigation of non-additive genetic interactions. Application of traditional cross validation to a GWAS data set may result in poor consistency between the training and testing data set splits due to an imbalance of the interaction genotypes relative to the data as a whole. We propose a new cross validation method, proportional instance cross validation (PICV), that preserves the original distribution of an independent variable when splitting the data set into training and testing partitions. We apply PICV to simulated GWAS data with epistatic interactions of varying minor allele frequencies and prevalences and compare performance to that of a traditional cross validation procedure in which individuals are randomly allocated to training and testing partitions. Sensitivity and positive predictive value are significantly improved across all tested scenarios for PICV compared to traditional cross validation. We also apply PICV to GWAS data from a study of primary open-angle glaucoma to investigate a previously-reported interaction, which fails to significantly replicate; PICV however improves the consistency of testing and training results. Application of traditional machine learning procedures to biomedical data may require modifications to better suit intrinsic characteristics of the data, such as the potential for highly imbalanced genotype distributions in the case of epistasis detection. The reproducibility of genetic interaction findings can be improved by considering this variable imbalance in cross validation implementation, such as with PICV. This approach may be extended to problems in other domains in which imbalanced variable distributions are a concern.

Development and validation of a Response Bias Scale (RBS) for the MMPI-2.

PubMed

Gervais, Roger O; Ben-Porath, Yossef S; Wygant, Dustin B; Green, Paul

2007-06-01

This study describes the development of a Minnesota Multiphasic Personality Inventory (MMPI-2) scale designed to detect negative response bias in forensic neuropsychological or disability assessment settings. The Response Bias Scale (RBS) consists of 28 MMPI-2 items that discriminated between persons who passed or failed the Word Memory Test (WMT), Computerized Assessment of Response Bias (CARB), and/or Test of Memory Malingering (TOMM) in a sample of 1,212 nonhead-injury disability claimants. Incremental validity of the RBS was evaluated by comparing its ability to detect poor performance on four separate symptom validity tests with that of the F and F(P) scales and the Fake Bad Scale (FBS). The RBS consistently outperformed F, F(P), and FBS. Study results suggest that the RBS may be a useful addition to existing MMPI-2 validity scales and indices in detecting symptom complaints predominantly associated with cognitive response bias and overreporting in forensic neuropsychological and disability assessment settings.
Validation of a global scale to assess the quality of interprofessional teamwork in mental health settings.

PubMed

Tomizawa, Ryoko; Yamano, Mayumi; Osako, Mitue; Hirabayashi, Naotugu; Oshima, Nobuo; Sigeta, Masahiro; Reeves, Scott

2017-12-01

Few scales currently exist to assess the quality of interprofessional teamwork through team members' perceptions of working together in mental health settings. The purpose of this study was to revise and validate an interprofessional scale to assess the quality of teamwork in inpatient psychiatric units and to use it multi-nationally. A literature review was undertaken to identify evaluative teamwork tools and develop an additional 12 items to ensure a broad global focus. Focus group discussions considered adaptation to different care systems using subjective judgements from 11 participants in a pre-test of items. Data quality, construct validity, reproducibility, and internal consistency were investigated in the survey using an international comparative design. Exploratory factor analysis yielded five factors with 21 items: 'patient/community centred care', 'collaborative communication', 'interprofessional conflict', 'role clarification', and 'environment'. High overall internal consistency, reproducibility, adequate face validity, and reasonable construct validity were shown in the USA and Japan. The revised Collaborative Practice Assessment Tool (CPAT) is a valid measure to assess the quality of interprofessional teamwork in psychiatry and identifies the best strategies to improve team performance. Furthermore, the revised scale will generate more rigorous evidence for collaborative practice in psychiatry internationally.
A Reexamination of the Psychometric Properties of the "School-Wide Evaluation Tool" (SET)

ERIC Educational Resources Information Center

Vincent, Claudia; Spaulding, Scott; Tobin, Tary Jeanne

2010-01-01

As a follow-up to Horner et al., this study focuses on the internal consistency and validity of the School-wide Evaluation Tool (SET) at all school levels. Analyzing SET data from 833 elementary, 264 middle, and 93 high schools, the authors focused on (a) describing commonalities and differences in SET data across the school levels, (b) assessing…
Development process of an assessment tool for disruptive behavior problems in cross-cultural settings: the Disruptive Behavior International Scale – Nepal version (DBIS-N)

PubMed Central

Burkey, Matthew D.; Ghimire, Lajina; Adhikari, Ramesh P.; Kohrt, Brandon A.; Jordans, Mark J. D.; Haroz, Emily; Wissow, Lawrence

2017-01-01

Systematic processes are needed to develop valid measurement instruments for disruptive behavior disorders (DBDs) in cross-cultural settings. We employed a four-step process in Nepal to identify and select items for a culturally valid assessment instrument: 1) We extracted items from validated scales and local free-list interviews. 2) Parents, teachers, and peers (n=30) rated the perceived relevance and importance of behavior problems. 3) Highly rated items were piloted with children (n=60) in Nepal. 4) We evaluated internal consistency of the final scale. We identified 49 symptoms from 11 scales, and 39 behavior problems from free-list interviews (n=72). After dropping items for low ratings of relevance and severity and for poor item-test correlation, low frequency, and/or poor acceptability in pilot testing, 16 items remained for the Disruptive Behavior International Scale—Nepali version (DBIS-N). The final scale had good internal consistency (α=0.86). A 4-step systematic approach to scale development including local participation yielded an internally consistent scale that included culturally relevant behavior problems. PMID:28093575
A novel classifier based on three preoperative tumor markers predicting the cancer-specific survival of gastric cancer (CEA, CA19-9 and CA72-4).

PubMed

Guo, Jing; Chen, Shangxiang; Li, Shun; Sun, Xiaowei; Li, Wei; Zhou, Zhiwei; Chen, Yingbo; Xu, Dazhi

2018-01-12

Several studies have highlighted the prognostic value of the individual and the various combinations of the tumor markers for gastric cancer (GC). Our study was designed to assess establish a new novel model incorporating carcino-embryonic antigen (CEA), carbohydrate antigen 19-9 (CA19-9), carbohydrate antigen 72-4 (CA72-4). A total of 1,566 GC patients (Primary cohort) between Jan 2000 and July 2013 were analyzed. The Primary cohort was randomly divided into Training set (n=783) and Validation set (n=783). A three-tumor marker classifier was developed in the Training set and validated in the Validation set by multivariate regression and risk-score analysis. We have identified a three-tumor marker classifier (including CEA, CA19-9 and CA72-4) for the cancer specific survival (CSS) of GC (p<0.001). Consistent results were obtained in the both Training set and Validation set. Multivariate analysis showed that the classifier was an independent predictor of GC (All p value <0.001 in the Training set, Validation set and Primary cohort). Furthermore, when the leave-one-out approach was performed, the classifier showed superior predictive value to the individual or two of them (with the highest AUC (Area Under Curve); 0.618 for the Training set, and 0.625 for the Validation set), which ascertained its predictive value. Our three-tumor marker classifier is closely associated with the CSS of GC and may serve as a novel model for future decisions concerning treatments.
The Immune System as a Model for Pattern Recognition and Classification

PubMed Central

Carter, Jerome H.

2000-01-01

Objective: To design a pattern recognition engine based on concepts derived from mammalian immune systems. Design: A supervised learning system (Immunos-81) was created using software abstractions of T cells, B cells, antibodies, and their interactions. Artificial T cells control the creation of B-cell populations (clones), which compete for recognition of “unknowns.” The B-cell clone with the “simple highest avidity” (SHA) or “relative highest avidity” (RHA) is considered to have successfully classified the unknown. Measurement: Two standard machine learning data sets, consisting of eight nominal and six continuous variables, were used to test the recognition capabilities of Immunos-81. The first set (Cleveland), consisting of 303 cases of patients with suspected coronary artery disease, was used to perform a ten-way cross-validation. After completing the validation runs, the Cleveland data set was used as a training set prior to presentation of the second data set, consisting of 200 unknown cases. Results: For cross-validation runs, correct recognition using SHA ranged from a high of 96 percent to a low of 63.2 percent. The average correct classification for all runs was 83.2 percent. Using the RHA metric, 11.2 percent were labeled “too close to determine” and no further attempt was made to classify them. Of the remaining cases, 85.5 percent were correctly classified. When the second data set was presented, correct classification occurred in 73.5 percent of cases when SHA was used and in 80.3 percent of cases when RHA was used. Conclusions: The immune system offers a viable paradigm for the design of pattern recognition systems. Additional research is required to fully exploit the nuances of immune computation. PMID:10641961
Measuring the quality of motivational interviewing in primary health care encounters: The development and validation of the motivational interviewing assessment scale (MIAS).

PubMed

Campiñez Navarro, Manuel; Pérula de Torres, Luis Ángel; Bosch Fontcuberta, Josep M; Barragán Brun, Nieves; Arbonies Ortiz, Juan Carlos; Novo Rodríguez, Jesús Manuel; Bóveda Fontán, Julia; Martín Alvarez, Remedios; Prados Castillejo, Jose Antonio; Rivas Doutreleau, Gabriela Renée; Domingo Peña, Carmen; Castro Moreno, Jaime Jesús; Romero Rodríguez, Esperanza María

2016-09-01

Motivational interviewing (MI) is a collaborative, goal-oriented method to help patients change behaviour. Tools that are often used to measure MI are the motivational interviewing skills code' (MISC), the 'motivational interviewing treatment integrity' (MITI) and the 'behaviour change counselling index' (BECCI). The first two instruments have not been designed to be used in primary healthcare (PHC) settings. The BECCI actually is time-consuming. The motivational interviewing assessment scale (MIAS, 'EVEM' in Spanish) was developed to measure MI in PHC encounters as an alternative to the previous instruments. To validate MIAS as an instrument to assess the quality of MI in PHC settings. (a) Sixteen experts in MI participated in the design, face and consensus validity, using a Delphi-type methodology. (b) 27 PHC centres located in Spain. four experts in MI tested its psychometric properties with 332 video recordings coming from the Dislip-EM study (consultations provided by 37 practitioners). dimensionality, internal consistency, reliability (intra-class correlation coefficient-ICC), sensitivity to change and convergent validity with the BECCI scale. A 14-item scale was obtained after the validation process. Factor analysis: two factors explained 76.6% of the total variance. Internal consistency, α = 0.99. Reliability: intra-rater ICC = 0.96; inter-rater ICC = 0.97. Sensitivity to change: means before and after training were 23.63 versus 38.57 (P < 0.001). Spearman's coefficient between the MIAS and the BECCI scale was 0.98 (P < 0.001). The MIAS is a consistent and reliable instrument to assess the use of MI in PHC settings. [Box: see text].
Investigating the Incremental Validity of Cognitive Variables in Early Mathematics Screening

ERIC Educational Resources Information Center

Clarke, Ben; Shanley, Lina; Kosty, Derek; Baker, Scott K.; Cary, Mari Strand; Fien, Hank; Smolkowski, Keith

2018-01-01

The purpose of this study was to investigate the incremental validity of a set of domain general cognitive measures added to a traditional screening battery of early numeracy measures. The sample consisted of 458 kindergarten students of whom 285 were designated as severely at-risk for mathematics difficulty. Hierarchical multiple regression…
Development of the 3-SET 4P questionnaire for evaluating former ICU patients' physical and psychosocial problems over time: a pilot study.

PubMed

Akerman, Eva; Fridlund, Bengt; Ersson, Anders; Granberg-Axéll, Anetth

2009-04-01

Current studies reveal a lack of consensus for the evaluation of physical and psychosocial problems after ICU stay and their changes over time. The aim was to develop and evaluate the validity and reliability of a questionnaire for assessing physical and psychosocial problems over time for patients following ICU recovery. Thirty-nine patients completed the questionnaire, 17 were retested. The questionnaire was constructed in three sets: physical problems, psychosocial problems and follow-up care. Face and content validity were tested by nurses, researchers and patients. The questionnaire showed good construct validity in all three sets and had strong factor loadings (explained variance >70%, factor loadings >0.5) for all three sets. There was good concurrent validity compared with the SF 12 (r(s)>0.5). Internal consistency was shown to be reliable (Cronbach's alpha 0.70-0.85). Stability reliability on retesting was good for the physical and psychosocial sets (r(s)>0.5). The 3-set 4P questionnaire was a first step in developing an instrument for assessment of former ICU patients' problems over time. The sample size was small and thus, further studies are needed to confirm these findings.
A new test set for validating predictions of protein-ligand interaction.

PubMed

Nissink, J Willem M; Murray, Chris; Hartshorn, Mike; Verdonk, Marcel L; Cole, Jason C; Taylor, Robin

2002-12-01

We present a large test set of protein-ligand complexes for the purpose of validating algorithms that rely on the prediction of protein-ligand interactions. The set consists of 305 complexes with protonation states assigned by manual inspection. The following checks have been carried out to identify unsuitable entries in this set: (1) assessing the involvement of crystallographically related protein units in ligand binding; (2) identification of bad clashes between protein side chains and ligand; and (3) assessment of structural errors, and/or inconsistency of ligand placement with crystal structure electron density. In addition, the set has been pruned to assure diversity in terms of protein-ligand structures, and subsets are supplied for different protein-structure resolution ranges. A classification of the set by protein type is available. As an illustration, validation results are shown for GOLD and SuperStar. GOLD is a program that performs flexible protein-ligand docking, and SuperStar is used for the prediction of favorable interaction sites in proteins. The new CCDC/Astex test set is freely available to the scientific community (http://www.ccdc.cam.ac.uk). Copyright 2002 Wiley-Liss, Inc.
Ensemble Methods for Classification of Physical Activities from Wrist Accelerometry.

PubMed

Chowdhury, Alok Kumar; Tjondronegoro, Dian; Chandran, Vinod; Trost, Stewart G

2017-09-01

To investigate whether the use of ensemble learning algorithms improve physical activity recognition accuracy compared to the single classifier algorithms, and to compare the classification accuracy achieved by three conventional ensemble machine learning methods (bagging, boosting, random forest) and a custom ensemble model comprising four algorithms commonly used for activity recognition (binary decision tree, k nearest neighbor, support vector machine, and neural network). The study used three independent data sets that included wrist-worn accelerometer data. For each data set, a four-step classification framework consisting of data preprocessing, feature extraction, normalization and feature selection, and classifier training and testing was implemented. For the custom ensemble, decisions from the single classifiers were aggregated using three decision fusion methods: weighted majority vote, naïve Bayes combination, and behavior knowledge space combination. Classifiers were cross-validated using leave-one subject out cross-validation and compared on the basis of average F1 scores. In all three data sets, ensemble learning methods consistently outperformed the individual classifiers. Among the conventional ensemble methods, random forest models provided consistently high activity recognition; however, the custom ensemble model using weighted majority voting demonstrated the highest classification accuracy in two of the three data sets. Combining multiple individual classifiers using conventional or custom ensemble learning methods can improve activity recognition accuracy from wrist-worn accelerometer data.
A new test for the assessment of working memory in clinical settings: Validation and norming of a month ordering task.

PubMed

Buekenhout, Imke; Leitão, José; Gomes, Ana A

2018-05-24

Month ordering tasks have been used in experimental settings to obtain measures of working memory (WM) capacity in older/clinical groups based solely on their face validity. We sought to assess the appropriateness of using a month ordering task in other contexts, including clinical settings, as a psychometrically sound WM assessment. To this end, we constructed a month ordering task (ucMOT), studied its reliability (internal consistency and temporal stability), and gathered construct-related and criterion-related validity evidence for its use as a WM assessment. The ucMOT proved to be internally consistent and temporally stable, and analyses of the criterion-related validity evidence revealed that its scores predicted the efficiency of language comprehension processes known to depend crucially on WM resources, namely, processes involved in pronoun interpretation. Furthermore, all ucMOT items discriminated between younger and older age groups; the global scores were significantly correlated with scores on well-established WM tasks and presented lower correlations with instruments that evaluate different (although related) processes, namely, inhibition and processing speed. We conclude that the ucMOT possesses solid psychometric properties. Accordingly, we acquired normative data for the Portuguese population, which we present as a regression-based algorithm that yields z scores adjusted for age, gender, and years of formal education. (PsycINFO Database Record (c) 2018 APA, all rights reserved).
Standardization, evaluation and early-phase method validation of an analytical scheme for batch-consistency N-glycosylation analysis of recombinant produced glycoproteins.

PubMed

Zietze, Stefan; Müller, Rainer H; Brecht, René

2008-03-01

In order to set up a batch-to-batch-consistency analytical scheme for N-glycosylation analysis, several sample preparation steps including enzyme digestions and fluorophore labelling and two HPLC-methods were established. The whole method scheme was standardized, evaluated and validated according to the requirements on analytical testing in early clinical drug development by usage of a recombinant produced reference glycoprotein (RGP). The standardization of the methods was performed by clearly defined standard operation procedures. During evaluation of the methods, the major interest was in the loss determination of oligosaccharides within the analytical scheme. Validation of the methods was performed with respect to specificity, linearity, repeatability, LOD and LOQ. Due to the fact that reference N-glycan standards were not available, a statistical approach was chosen to derive accuracy from the linearity data. After finishing the validation procedure, defined limits for method variability could be calculated and differences observed in consistency analysis could be separated into significant and incidental ones.
Unlocking the potential of publicly available microarray data using inSilicoDb and inSilicoMerging R/Bioconductor packages.

PubMed

Taminau, Jonatan; Meganck, Stijn; Lazar, Cosmin; Steenhoff, David; Coletta, Alain; Molter, Colin; Duque, Robin; de Schaetzen, Virginie; Weiss Solís, David Y; Bersini, Hugues; Nowé, Ann

2012-12-24

With an abundant amount of microarray gene expression data sets available through public repositories, new possibilities lie in combining multiple existing data sets. In this new context, analysis itself is no longer the problem, but retrieving and consistently integrating all this data before delivering it to the wide variety of existing analysis tools becomes the new bottleneck. We present the newly released inSilicoMerging R/Bioconductor package which, together with the earlier released inSilicoDb R/Bioconductor package, allows consistent retrieval, integration and analysis of publicly available microarray gene expression data sets. Inside the inSilicoMerging package a set of five visual and six quantitative validation measures are available as well. By providing (i) access to uniformly curated and preprocessed data, (ii) a collection of techniques to remove the batch effects between data sets from different sources, and (iii) several validation tools enabling the inspection of the integration process, these packages enable researchers to fully explore the potential of combining gene expression data for downstream analysis. The power of using both packages is demonstrated by programmatically retrieving and integrating gene expression studies from the InSilico DB repository [https://insilicodb.org/app/].
Spanish Translation and Cross-Language Validation of a Sleep Habits Questionnaire for Use in Clinical and Research Settings

PubMed Central

Baldwin, Carol M.; Choi, Myunghan; McClain, Darya Bonds; Celaya, Alma; Quan, Stuart F.

2012-01-01

Study Objectives: To translate, back-translate and cross-language validate (English/Spanish) the Sleep Heart Health Study Sleep Habits Questionnaire for use with Spanish-speakers in clinical and research settings. Methods: Following rigorous translation and back-translation, this cross-sectional cross-language validation study recruited bilingual participants from academic, clinic, and community-based settings (N = 50; 52% women; mean age 38.8 ± 12 years; 90% of Mexican heritage). Participants completed English and Spanish versions of the Sleep Habits Questionnaire, the Epworth Sleepiness Scale, and the Acculturation Rating Scale for Mexican Americans II one week apart in randomized order. Psychometric properties were assessed, including internal consistency, convergent validity, scale equivalence, language version intercorrelations, and exploratory factor analysis using PASW (Version18) software. Grade level readability of the sleep measure was evaluated. Results: All sleep categories (duration, snoring, apnea, insomnia symptoms, other sleep symptoms, sleep disruptors, restless legs syndrome) showed Cronbach α, Spearman-Brown coefficients and intercorrelations ≥ 0.700, suggesting robust internal consistency, correlation, and agreement between language versions. The Epworth correlated significantly with snoring, apnea, sleep symptoms, restless legs, and sleep disruptors) on both versions, supporting convergent validity. Items loaded on 4 factors accounted for 68% and 67% of the variance on the English and Spanish versions, respectively. Conclusions: The Spanish-language Sleep Habits Questionnaire demonstrates conceptual and content equivalency. It has appropriate measurement properties and should be useful for assessing sleep health in community-based clinics and intervention studies among Spanish-speaking Mexican Americans. Both language versions showed readability at the fifth grade level. Further testing is needed with larger samples. Citation: Baldwin CM; Choi M; McClain DB; Celaya A; Quan SF. Spanish translation and cross-language validation of a Sleep Habits Questionnaire for use in clinical and research settings. J Clin Sleep Med 2012;8(2):137-146. PMID:22505858
The Validity of a New Structured Assessment of Gastrointestinal Symptoms Scale (SAGIS) for Evaluating Symptoms in the Clinical Setting.

PubMed

Koloski, N A; Jones, M; Hammer, J; von Wulffen, M; Shah, A; Hoelz, H; Kutyla, M; Burger, D; Martin, N; Gurusamy, S R; Talley, N J; Holtmann, G

2017-08-01

The clinical assessments of patients with gastrointestinal symptoms can be time-consuming, and the symptoms captured during the consultation may be influenced by a variety of patient and non-patient factors. To facilitate standardized symptom assessment in the routine clinical setting, we developed the Structured Assessment of Gastrointestinal Symptom (SAGIS) instrument to precisely characterize symptoms in a routine clinical setting. We aimed to validate SAGIS including its reliability, construct and discriminant validity, and utility in the clinical setting. Development of the SAGIS consisted of initial interviews with patients referred for the diagnostic work-up of digestive symptoms and relevant complaints identified. The final instrument consisted of 22 items as well as questions on extra intestinal symptoms and was given to 1120 consecutive patients attending a gastroenterology clinic randomly split into derivation (n = 596) and validation datasets (n = 551). Discriminant validity along with test-retest reliability was assessed. The time taken to perform a clinical assessment with and without the SAGIS was recorded along with doctor satisfaction with this tool. Exploratory factor analysis conducted on the derivation sample suggested five symptom constructs labeled as abdominal pain/discomfort (seven items), gastroesophageal reflux disease/regurgitation symptoms (four items), nausea/vomiting (three items), diarrhea/incontinence (five items), and difficult defecation and constipation (2 items). Confirmatory factor analysis conducted on the validation sample supported the initially developed five-factor measurement model ([Formula: see text], p < 0.0001, χ 2 /df = 4.6, CFI = 0.90, TLI = 0.88, RMSEA = 0.08). All symptom groups demonstrated differentiation between disease groups. The SAGIS was shown to be reliable over time and resulted in a 38% reduction of the time required for clinical assessment. The SAGIS instrument has excellent psychometric properties and supports the clinical assessment of and symptom-based categorization of patients with a wide spectrum of gastrointestinal symptoms.
Improving School Improvement: Development and Validation of the CSIS-360, a 360-Degree Feedback Assessment for School Improvement Specialists

ERIC Educational Resources Information Center

McDougall, Christie M.

2013-01-01

The purpose of the mixed methods study was to develop and validate the CSIS-360, a 360-degree feedback assessment to measure competencies of school improvement specialists from multiple perspectives. The study consisted of eight practicing school improvement specialists from a variety of settings. The specialists nominated 23 constituents to…
Farsi version of social skills rating system-secondary student form: cultural adaptation, reliability and construct validity.

PubMed

Eslami, Ahmad Ali; Amidi Mazaheri, Maryam; Mostafavi, Firoozeh; Abbasi, Mohamad Hadi; Noroozi, Ensieh

2014-01-01

Assessment of social skills is a necessary requirement to develop and evaluate the effectiveness of cognitive and behavioral interventions. This paper reports the cultural adaptation and psychometric properties of the Farsi version of the social skills rating system-secondary students form (SSRS-SS) questionnaire (Gresham and Elliot, 1990), in a normative sample of secondary school students. A two-phase design was used that phase 1 consisted of the linguistic adaptation and in phase 2, using cross-sectional sample survey data, the construct validity and reliability of the Farsi version of the SSRS-SS were examined in a sample of 724 adolescents aged from 13 to 19 years. Content validity index was excellent, and the floor/ceiling effects were low. After deleting five of the original SSRS-SS items, the findings gave support for the item convergent and divergent validity. Factor analysis revealed four subscales. RESULTS showed good internal consistency (0.89) and temporal stability (0.91) for the total scale score. Findings demonstrated support for the use of the 27-item Farsi version in the school setting. Directions for future research regarding the applicability of the scale in other settings and populations of adolescents are discussed.
Standard Specimen Reference Set: Pancreatic — EDRN Public Portal

Cancer.gov

The primary objective of the EDRN Pancreatic Cancer Working Group Proposal is to create a reference set consisting of well-characterized serum/plasma specimens to use as a resource for the development of biomarkers for the early detection of pancreatic adenocarcinoma. The testing of biomarkers on the same sample set permits direct comparison among them; thereby, allowing the development of a biomarker panel that can be evaluated in a future validation study. Additionally, the establishment of an infrastructure with core data elements and standardized operating procedures for specimen collection, processing and storage, will provide the necessary preparatory platform for larger validation studies when the appropriate marker/panel for pancreatic adenocarcinoma has been identified.
The Chinese version of the Outcome Expectations for Exercise scale: validation study.

PubMed

Lee, Ling-Ling; Chiu, Yu-Yun; Ho, Chin-Chih; Wu, Shu-Chen; Watson, Roger

2011-06-01

Estimates of the reliability and validity of the English nine-item Outcome Expectations for Exercise (OEE) scale have been tested and found to be valid for use in various settings, particularly among older people, with good internal consistency and validity. Data on the use of the OEE scale among older Chinese people living in the community and how cultural differences might affect the administration of the OEE scale are limited. To test the validity and reliability of the Chinese version of the Outcome Expectations for Exercise scale among older people. A cross-sectional validation study was designed to test the Chinese version of the OEE scale (OEE-C). Reliability was examined by testing both the internal consistency for the overall scale and the squared multiple correlation coefficient for the single item measure. The validity of the scale was tested on the basis of both a traditional psychometric test and a confirmatory factor analysis using structural equation modelling. The Mokken Scaling Procedure (MSP) was used to investigate if there were any hierarchical, cumulative sets of items in the measure. The OEE-C scale was tested in a group of older people in Taiwan (n=108, mean age=77.1). There was acceptable internal consistency (alpha=.85) and model fit in the scale. Evidence of the validity of the measure was demonstrated by the tests for criterion-related validity and construct validity. There was a statistically significant correlation between exercise outcome expectations and exercise self-efficacy (r=.34, p<.01). An analysis of the Mokken Scaling Procedure found that nine items of the scale were all retained in the analysis and the resulting scale was reliable and statistically significant (p=.0008). The results obtained in the present study provided acceptable levels of reliability and validity evidence for the Chinese Outcome Expectations for Exercise scale when used with older people in Taiwan. Future testing of the OEE-C scale needs to be carried out to see whether these results are generalisable to older Chinese people living in urban areas. Copyright © 2010 Elsevier Ltd. All rights reserved.

Health promoting behaviors in adolescence: validation of the Portuguese version of the Adolescent Lifestyle Profile.

PubMed

Sousa, Pedro; Gaspar, Pedro; Fonseca, Helena; Hendricks, Constance; Murdaugh, Carolyn

2015-01-01

Reliable and valid instruments are essential for understanding health-promoting behaviors in adolescents. This study analyzed the psychometric properties of the Portuguese version of the Adolescent Lifestyle Profile (ALP). A linguistic and cultural translation of the ALP was conducted with 236 adolescents from two different settings: a community (n=141) and a clinical setting (n=95). Internal consistency reliability and confirmatory factor analysis were performed. Results showed an adequate fit to data, yielding a 36-item, seven-factor structure (CMIN/DF=1.667, CFI=0.807, GFI=0.822, RMR=0.051, RMSEA=0.053, PNFI=0.575, PCFI=0.731). The ALP presented a high internal consistency (α=0.866), with the subscales presenting moderate reliability values (from 0.492 to 0.747). The highest values were in Interpersonal Relations (3.059±0.523) and Positive Life Perspective (2.985±0.588). Some gender differences were found. Findings showed that adolescents from the clinic reported an overall healthier lifestyle than those from the community setting (2.598±0.379 vs. 2.504±0.346; t=1.976, p=0.049). The ALP Portuguese version is a psychometrically reliable, valid, and useful measurement instrument for assessing health-promoting lifestyles in adolescence. The ALP is cross-culturally validated and can decisively contribute to a better understanding of adolescent health promotion needs. Additional research is needed to evaluate the instrument's predictive validity, as well as its clinical relevance for practice and research. Copyright © 2015 Sociedade Brasileira de Pediatria. Published by Elsevier Editora Ltda. All rights reserved.
Spanish translation and cross-language validation of a sleep habits questionnaire for use in clinical and research settings.

PubMed

Baldwin, Carol M; Choi, Myunghan; McClain, Darya Bonds; Celaya, Alma; Quan, Stuart F

2012-04-15

To translate, back-translate and cross-language validate (English/Spanish) the Sleep Heart Health Study Sleep Habits Questionnaire for use with Spanish-speakers in clinical and research settings. Following rigorous translation and back-translation, this cross-sectional cross-language validation study recruited bilingual participants from academic, clinic, and community-based settings (N = 50; 52% women; mean age 38.8 ± 12 years; 90% of Mexican heritage). Participants completed English and Spanish versions of the Sleep Habits Questionnaire, the Epworth Sleepiness Scale, and the Acculturation Rating Scale for Mexican Americans II one week apart in randomized order. Psychometric properties were assessed, including internal consistency, convergent validity, scale equivalence, language version intercorrelations, and exploratory factor analysis using PASW (Version18) software. Grade level readability of the sleep measure was evaluated. All sleep categories (duration, snoring, apnea, insomnia symptoms, other sleep symptoms, sleep disruptors, restless legs syndrome) showed Cronbach α, Spearman-Brown coefficients and intercorrelations ≥ 0.700, suggesting robust internal consistency, correlation, and agreement between language versions. The Epworth correlated significantly with snoring, apnea, sleep symptoms, restless legs, and sleep disruptors) on both versions, supporting convergent validity. Items loaded on 4 factors accounted for 68% and 67% of the variance on the English and Spanish versions, respectively. The Spanish-language Sleep Habits Questionnaire demonstrates conceptual and content equivalency. It has appropriate measurement properties and should be useful for assessing sleep health in community-based clinics and intervention studies among Spanish-speaking Mexican Americans. Both language versions showed readability at the fifth grade level. Further testing is needed with larger samples.
Robust QCT/FEA Models of Proximal Femur Stiffness and Fracture Load During a Sideways Fall on the Hip

PubMed Central

Dragomir-Daescu, Dan; Buijs, Jorn Op Den; McEligot, Sean; Dai, Yifei; Entwistle, Rachel C.; Salas, Christina; Melton, L. Joseph; Bennet, Kevin E.; Khosla, Sundeep; Amin, Shreyasee

2013-01-01

Clinical implementation of quantitative computed tomography-based finite element analysis (QCT/FEA) of proximal femur stiffness and strength to assess the likelihood of proximal femur (hip) fractures requires a unified modeling procedure, consistency in predicting bone mechanical properties, and validation with realistic test data that represent typical hip fractures, specifically, a sideways fall on the hip. We, therefore, used two sets (n = 9, each) of cadaveric femora with bone densities varying from normal to osteoporotic to build, refine, and validate a new class of QCT/FEA models for hip fracture under loading conditions that simulate a sideways fall on the hip. Convergence requirements of finite element models of the first set of femora led to the creation of a new meshing strategy and a robust process to model proximal femur geometry and material properties from QCT images. We used a second set of femora to cross-validate the model parameters derived from the first set. Refined models were validated experimentally by fracturing femora using specially designed fixtures, load cells, and high speed video capture. CT image reconstructions of fractured femora were created to classify the fractures. The predicted stiffness (cross-validation R2 = 0.87), fracture load (cross-validation R2 = 0.85), and fracture patterns (83% agreement) correlated well with experimental data. PMID:21052839
Data selection techniques in the interpretation of MAGSAT data over Australia

NASA Technical Reports Server (NTRS)

Johnson, B. D.; Dampney, C. N. G.

1983-01-01

The MAGSAT data require critical selection in order to produce a self-consistent data set suitable for map construction and subsequent interpretation. Interactive data selection techniques are described which involve the use of a special-purpose profile-oriented data base and a colour graphics display. The careful application of these data selection techniques permits validation every data value and ensures that the best possible self-consistent data set is being used to construct the maps of the magnetic field measured at satellite altitudes over Australia.
The Abbott RealTime High Risk HPV test is a clinically validated human papillomavirus assay for triage in the referral population and use in primary cervical cancer screening in women 30 years and older: a review of validation studies.

PubMed

Poljak, Mario; Oštrbenk, Anja

2013-01-01

Human papillomavirus (HPV) testing has become an essential part of current clinical practice in the management of cervical cancer and precancerous lesions. We reviewed the most important validation studies of a next-generation real-time polymerase chain reaction-based assay, the RealTime High Risk HPV test (RealTime)(Abbott Molecular, Des Plaines, IL, USA), for triage in referral population settings and for use in primary cervical cancer screening in women 30 years and older published in peer-reviewed journals from 2009 to 2013. RealTime is designed to detect 14 high-risk HPV genotypes with concurrent distinction of HPV-16 and HPV-18 from 12 other HPV genotypes. The test was launched on the European market in January 2009 and is currently used in many laboratories worldwide for routine detection of HPV. We concisely reviewed validation studies of a next-generation real-time polymerase chain reaction (PCR)-based assay: the Abbott RealTime High Risk HPV test. Eight validation studies of RealTime in referral settings showed its consistently high absolute clinical sensitivity for both CIN2+ (range 88.3-100%) and CIN3+ (range 93.0-100%), as well as comparative clinical sensitivity relative to the currently most widely used HPV test: the Qiagen/Digene Hybrid Capture 2 HPV DNA Test (HC2). Due to the significantly different composition of the referral populations, RealTime absolute clinical specificity for CIN2+ and CIN3+ varied greatly across studies, but was comparable relative to HC2. Four validation studies of RealTime performance in cervical cancer screening settings showed its consistently high absolute clinical sensitivity for both CIN2+ and CIN3+, as well as comparative clinical sensitivity and specificity relative to HC2 and GP5+/6+ PCR. RealTime has been extensively evaluated in the last 4 years. RealTime can be considered clinically validated for triage in referral population settings and for use in primary cervical cancer screening in women 30 years and older.
Translation and Initial Validation of the Chinese (Cantonese) Version of Community Integration Measure for Use in Patients with Chronic Stroke

PubMed Central

Ng, Shamay S. M.; Ng, Gabriel Y. F.

2014-01-01

Objectives. To (1) translate and culturally adapt the English version Community Integration Measure into Chinese (Cantonese), (2) report the results of initial validation of the Chinese (Cantonese) version of CIM (CIM-C) including the content validity, internal consistency, test-retest reliability, and factor structure of CIM-C for use in stroke survivors in a Chinese community setting, and (3) investigate the level of community integration of stroke survivors living in Hong Kong. Design. Cross-sectional study. Setting. University-based rehabilitation centre. Participants. 62 (n = 62) subjects with chronic stroke. Methods. The CIM-C was produced after forward-backward translation, expert panel review, and pretesting. 25 (n = 25) of the same subjects were reassessed after a 1-week interval. Results. The items of the CIM-C demonstrated high internal consistency with a Cronbach's α of 0.84. The CIM-C showed good test-retest reliability with an intraclass correlation coefficient (ICC) of 0.84 (95% confidence interval, 0.64–0.93). A 3-factor structure of the CIM-C including “relationship and engagement,” “sense of knowing,” and “independent living,” was consistent with the original theoretical model. Hong Kong stroke survivors revealed a high level of community integration as measured by the CIM-C (mean (SD): 43.48 (5.79)). Conclusions. The CIM-C is a valid and reliable measure for clinical use. PMID:24995317
Translating and validating a Training Needs Assessment tool into Greek

PubMed Central

Markaki, Adelais; Antonakis, Nikos; Hicks, Carolyn M; Lionis, Christos

2007-01-01

Background The translation and cultural adaptation of widely accepted, psychometrically tested tools is regarded as an essential component of effective human resource management in the primary care arena. The Training Needs Assessment (TNA) is a widely used, valid instrument, designed to measure professional development needs of health care professionals, especially in primary health care. This study aims to describe the translation, adaptation and validation of the TNA questionnaire into Greek language and discuss possibilities of its use in primary care settings. Methods A modified version of the English self-administered questionnaire consisting of 30 items was used. Internationally recommended methodology, mandating forward translation, backward translation, reconciliation and pretesting steps, was followed. Tool validation included assessing item internal consistency, using the alpha coefficient of Cronbach. Reproducibility (test – retest reliability) was measured by the kappa correlation coefficient. Criterion validity was calculated for selected parts of the questionnaire by correlating respondents' research experience with relevant research item scores. An exploratory factor analysis highlighted how the items group together, using a Varimax (oblique) rotation and subsequent Cronbach's alpha assessment. Results The psychometric properties of the Greek version of the TNA questionnaire for nursing staff employed in primary care were good. Internal consistency of the instrument was very good, Cronbach's alpha was found to be 0.985 (p < 0.001) and Kappa coefficient for reproducibility was found to be 0.928 (p < 0.0001). Significant positive correlations were found between respondents' current performance levels on each of the research items and amount of research involvement, indicating good criterion validity in the areas tested. Factor analysis revealed seven factors with eigenvalues of > 1.0, KMO (Kaiser-Meyer-Olkin) measure of sampling adequacy = 0.680 and Bartlett's test of sphericity, p < 0.001. Conclusion The translated and adapted Greek version is comparable with the original English instrument in terms of validity and reliability and it is suitable to assess professional development needs of nursing staff in Greek primary care settings. PMID:17474989
Case definitions for chronic fatigue syndrome/myalgic encephalomyelitis (CFS/ME): a systematic review

PubMed Central

Brurberg, Kjetil Gundro; Fønhus, Marita Sporstøl; Larun, Lillebeth; Flottorp, Signe; Malterud, Kirsti

2014-01-01

Objective To identify case definitions for chronic fatigue syndrome/myalgic encephalomyelitis (CFS/ME), and explore how the validity of case definitions can be evaluated in the absence of a reference standard. Design Systematic review. Setting International. Participants A literature search, updated as of November 2013, led to the identification of 20 case definitions and inclusion of 38 validation studies. Primary and secondary outcome measure Validation studies were assessed for risk of bias and categorised according to three validation models: (1) independent application of several case definitions on the same population, (2) sequential application of different case definitions on patients diagnosed with CFS/ME with one set of diagnostic criteria or (3) comparison of prevalence estimates from different case definitions applied on different populations. Results A total of 38 studies contributed data of sufficient quality and consistency for evaluation of validity, with CDC-1994/Fukuda as the most frequently applied case definition. No study rigorously assessed the reproducibility or feasibility of case definitions. Validation studies were small with methodological weaknesses and inconsistent results. No empirical data indicated that any case definition specifically identified patients with a neuroimmunological condition. Conclusions Classification of patients according to severity and symptom patterns, aiming to predict prognosis or effectiveness of therapy, seems useful. Development of further case definitions of CFS/ME should be given a low priority. Consistency in research can be achieved by applying diagnostic criteria that have been subjected to systematic evaluation. PMID:24508851
Development and validation of a radiomics nomogram for progression-free survival prediction in stage IV EGFR-mutant non-small cell lung cancer

NASA Astrophysics Data System (ADS)

Song, Jiangdian; Zang, Yali; Li, Weimin; Zhong, Wenzhao; Shi, Jingyun; Dong, Di; Fang, Mengjie; Liu, Zaiyi; Tian, Jie

2017-03-01

Accurately predict the risk of disease progression and benefit of tyrosine kinase inhibitors (TKIs) therapy for stage IV non-small cell lung cancer (NSCLC) patients with activing epidermal growth factor receptor (EGFR) mutations by current staging methods are challenge. We postulated that integrating a classifier consisted of multiple computed tomography (CT) phenotypic features, and other clinicopathological risk factors into a single model could improve risk stratification and prediction of progression-free survival (PFS) of EGFR TKIs for these patients. Patients confirmed as stage IV EGFR-mutant NSCLC received EGFR TKIs with no resection; pretreatment contrast enhanced CT performed at approximately 2 weeks before the treatment was enrolled. A six-CT-phenotypic-feature-based classifier constructed by the LASSO Cox regression model, and three clinicopathological factors: pathologic N category, performance status (PS) score, and intrapulmonary metastasis status were used to construct a nomogram in a training set of 115 patients. The prognostic and predictive accuracy of this nomogram was then subjected to an external independent validation of 107 patients. PFS between the training and independent validation set is no statistical difference by Mann-Whitney U test (P = 0.2670). PFS of the patients could be predicted with good consistency compared with the actual survival. C-index of the proposed individualized nomogram in the training set (0·707, 95%CI: 0·643, 0·771) and the independent validation set (0·715, 95%CI: 0·650, 0·780) showed the potential of clinical prognosis to predict PFS of stage IV EGFR-mutant NSCLC from EGFR TKIs. The individualized nomogram might facilitate patient counselling and individualise management of patients with this disease.
Design of psychosocial factors questionnaires: a systematic measurement approach

PubMed Central

Vargas, Angélica; Felknor, Sarah A

2012-01-01

Background Evaluation of psychosocial factors requires instruments that measure dynamic complexities. This study explains the design of a set of questionnaires to evaluate work and non-work psychosocial risk factors for stress-related illnesses. Methods The measurement model was based on a review of literature. Content validity was performed by experts and cognitive interviews. Pilot testing was carried out with a convenience sample of 132 workers. Cronbach’s alpha evaluated internal consistency and concurrent validity was estimated by Spearman correlation coefficients. Results Three questionnaires were constructed to evaluate exposure to work and non-work risk factors. Content validity improved the questionnaires coherence with the measurement model. Internal consistency was adequate (α=0.85–0.95). Concurrent validity resulted in moderate correlations of psychosocial factors with stress symptoms. Conclusions Questionnaires´ content reflected a wide spectrum of psychosocial factors sources. Cognitive interviews improved understanding of questions and dimensions. The structure of the measurement model was confirmed. PMID:22628068
Development and validation of a stock addiction inventory (SAI).

PubMed

Youn, HyunChul; Choi, Jung-Seok; Kim, Dai-Jin; Choi, Sam-Wook

2016-01-01

Investing in financial markets is promoted and protected by the government as an essential economic activity, but can turn into a gambling addiction problem. Until now, few scales have widely been used to identify gambling addicts in financial markets. This study aimed to develop a self-rating scale to distinguish them. In addition, the reliability and validity of the stock addiction inventory (SAI) were demonstrated. A set of questionnaires, including the SAI, south oaks gambling screen (SOGS), and DSM-5 diagnostic criteria, for gambling disorder was completed by 1005 participants. Factor analysis, internal consistency testing, t tests, analysis of variance, and partial correlation analysis were conducted to verify the reliability and validity of SAI. The factor analysis results showed the final SAI consisting of two factors and nine items. The internal consistency and concurrent validity of SAI were verified. The Cronbach's α for the total scale was 0.892, and the SAI and its factors were significantly correlated with SOGS. This study developed a specific scale for financial market investments or trading; this scale proved to be reliable and valid. Our scale expands the understanding of gambling addiction in financial markets and provides a diagnostic reference.
Development and validation of the ASPIRE-VA coaching fidelity checklist (ACFC): a tool to help ensure delivery of high-quality weight management interventions.

PubMed

Damschroder, Laura J; Goodrich, David E; Kim, Hyungjin Myra; Holleman, Robert; Gillon, Leah; Kirsh, Susan; Richardson, Caroline R; Lutes, Lesley D

2016-09-01

Practical and valid instruments are needed to assess fidelity of coaching for weight loss. The purpose of this study was to develop and validate the ASPIRE Coaching Fidelity Checklist (ACFC). Classical test theory guided ACFC development. Principal component analyses were used to determine item groupings. Psychometric properties, internal consistency, and inter-rater reliability were evaluated for each subscale. Criterion validity was tested by predicting weight loss as a function of coaching fidelity. The final 19-item ACFC consists of two domains (session process and session structure) and five subscales (sets goals and monitor progress, assess and personalize self-regulatory content, manages the session, creates a supportive and empathetic climate, and stays on track). Four of five subscales showed high internal consistency (Cronbach alphas > 0.70) for group-based coaching; only two of five subscales had high internal reliability for phone-based coaching. All five sub-scales were positively and significantly associated with weight loss for group- but not for phone-based coaching. The ACFC is a reliable and valid instrument that can be used to assess fidelity and guide skill-building for weight management interventionists.
Development and validation of a new instrument for testing functional health literacy in Japanese adults.

PubMed

Nakagami, Katsuyuki; Yamauchi, Toyoaki; Noguchi, Hiroyuki; Maeda, Tohru; Nakagami, Tomoko

2014-06-01

This study aimed to develop a reliable and valid measure of functional health literacy in a Japanese clinical setting. Test development consisted of three phases: generation of an item pool, consultation with experts to assess content validity, and comparison with external criteria (the Japanese Health Knowledge Test) to assess criterion validity. A trial version of the test was administered to 535 Japanese outpatients. Internal consistency reliability, calculated by Cronbach's alpha, was 0.81, and concurrent validity was moderate. Receiver Operating Characteristics and Item Response Theory were used to classify patients as having adequate, marginal, or inadequate functional health literacy. Both inadequate and marginal functional health literacy were associated with older age, lower income, lower educational attainment, and poor health knowledge. The time required to complete the test was 10-15 min. This test should enable health workers to better identify patients with inadequate health literacy. © 2013 Wiley Publishing Asia Pty Ltd.
Assessing the validity and intra-observer agreement of the MIDAM-LTC; an instrument measuring factors that influence personal dignity in long-term care facilities

PubMed Central

2014-01-01

Background Patients who are cared for in long-term care facilities are vulnerable to lose personal dignity. An instrument measuring factors that influence dignity can be used to better target dignity-conserving care to an individual patient, but no such instrument is yet available for the long-term care setting. The aim of this study was to create the Measurement Instrument for Dignity AMsterdam - for Long-Term Care facilities (MIDAM-LTC) and to assess its validity and intra-observer agreement. Methods Thirteen items specific for the LTC setting were added to the earlier developed, more general MIDAM. The MIDAM-LTC consisted of 39 symptoms or experiences for which presence as well as influence on dignity were asked, and a single item score for overall personal dignity. Questionnaires containing the MIDAM-LTC were administered face-to-face at two moments (with a 1-week interval) to 95 nursing home residents residing on general medical wards of six nursing homes in the Netherlands. Constructs related to dignity (WHO Well-Being Five Index, quality of life and physical health status) were also measured. Ten residents answered the questions while thinking aloud. Content validity, construct validity and intra-observer agreement were examined. Results Nine of the 39 items barely exerted influence on dignity. Eight of them could be omitted from the MIDAM-LTC, because the thinking aloud method revealed sensible explanations for their small influence on dignity. Residents reported that they missed no important items. Hypotheses to support construct validity, about the strength of correlations between on the one hand personal dignity and on the other hand well-being, quality of life or physical health status, were confirmed. On average, 83% of the scores given for each item’s influence on dignity were practically consistent over 1 week, and more than 80% of the residents gave consistent scores for the single item score for overall dignity. Conclusion The MIDAM-LTC has good content validity, construct validity and intra-observer agreement. By omitting 8 items from the instrument, a good balance between comprehensiveness and feasibility is realised. The MIDAM-LTC allows researchers to examine the concept of dignity more closely in the LTC setting, and can assist caregivers in providing dignity-conserving care. PMID:24512296
Functional Status Score for the Intensive Care Unit (FSS-ICU): An International Clinimetric Analysis of Validity, Responsiveness, and Minimal Important Difference

PubMed Central

Huang, Minxuan; Chan, Kitty S.; Zanni, Jennifer M.; Parry, Selina M.; Neto, Saint-Clair G. B.; Neto, Jose A. A.; da Silva, Vinicius Z. M.; Kho, Michelle E.; Needham, Dale M.

2017-01-01

Objective To evaluate the internal consistency, validity, responsiveness, and minimal important difference of the Functional Status Score for the Intensive Care Unit (FSS-ICU), a physical function measure designed for the intensive care unit (ICU). Design Clinimetric analysis. Settings Five international data sets from the United States, Australia, and Brazil. Patients 819 ICU patients. Intervention None. Measurements and Main Results Clinimetric analyses were initially conducted separately for each data source and time point to examine generalizability of findings, with pooled analyses performed thereafter to increase power of analyses. The FSS-ICU demonstrated good to excellent internal consistency. There was good convergent and discriminant validity, with significant and positive correlations (r = 0.30 to 0.95) between FSS-ICU and other physical function measures, and generally weaker correlations with non-physical measures (|r| = 0.01 to 0.70). Known group validity was demonstrated by significantly higher FSS-ICU scores among patients without ICU-acquired weakness (Medical Research Council sumscore ≥48 versus <48) and with hospital discharge to home (versus healthcare facility). FSS-ICU at ICU discharge predicted post-ICU hospital length of stay and discharge location. Responsiveness was supported via increased FSS-ICU scores with improvements in muscle strength. Distribution-based methods indicated a minimal important difference of 2.0 to 5.0. Conclusions The FSS-ICU has good internal consistency and is a valid and responsive measure of physical function for ICU patients. The estimated minimal important difference can be used in sample size calculations and in interpreting studies comparing the physical function of groups of ICU patients. PMID:27488220
Validation of reference genes for RT-qPCR analysis in Herbaspirillum seropedicae.

PubMed

Pessoa, Daniella Duarte Villarinho; Vidal, Marcia Soares; Baldani, José Ivo; Simoes-Araujo, Jean Luiz

2016-08-01

The RT-qPCR technique needs a validated set of reference genes for ensuring the consistency of the results from the gene expression. Expression stabilities for 9 genes from Herbaspirillum seropedicae, strain HRC54, grown with different carbon sources were calculated using geNorm and NormFinder, and the gene rpoA showed the best stability values. Copyright © 2016 Elsevier B.V. All rights reserved.
Profile Similarity Metrics Increase Personality Scale Validity (Briefing Charts)

DTIC Science & Technology

2016-04-15

REPORT DATE (DD-MM-YYYY) April 2016 2. REPORT TYPE Final 3. DATES COVERED (From - To) April 2015 – August 2015 4. TITLE AND SUBTITLE... temperament scales are used in employment settings to predict performance because they are generally valid and reduce adverse impact. This research...personality and temperament scales against job continuance outcomes. Analyses documented that: PSMs consistently accounted for over 90% of the variance in
An examination of three sets of MMPI-2 personality disorder scales.

PubMed

Jones, Alvin

2005-08-01

Three sets of personality disorder scales (PD scales) can be scored for the MMPI-2 (Butcher, Dahlstrom, Graham, Tellegen, & Kaemmer, 1989). Two sets (Levitt & Gotts, 1995; Morey, Waugh, & Blashfield, 1985) are derived from the MMPI (Hathaway & McKinley, 1983), and a third set (Somwaru & Ben-Porath, 1995) is based on the MMPI-2. There is no validity research for the Levitt and Gotts scale, and limited validity research is available for the Somwaru and Ben-Porath scales. There is a large body of research suggesting that the Morey et al. scales have good to excellent convergent validity when compared to a variety of other measures of personality disorders. Since the Morey et al. scales have established validity, there is a question if additional sets of PD scales are needed. The primary purpose of this research was to determine if the PD scales developed by Levitt and Gotts and those developed by Somwaru and Ben-Porath contribute incrementally to the scales developed by Morey et al. in predicting corresponding scales on the MCMI-II (Millon, 1987). In a sample of 494 individuals evaluated at an Army medical center, a hierarchical regression analysis demonstrated that the Somwaru and Ben-Porath Borderline, Antisocial, and Schizoid PD scales and the Levitt and Gotts Narcissistic and Histrionic scales contributed significantly and meaningfully to the Morey et al. scales in predicting the corresponding MCMI-II (Millon, 1987) scale. However, only the Somwaru and Ben-Porath scales demonstrated acceptable internal consistency and convergent validity.
Assessing vocational outcome expectancy in individuals with serious mental illness: a factor-analytic approach.

PubMed

Iwanaga, Kanako; Umucu, Emre; Wu, Jia-Rung; Yaghmaian, Rana; Lee, Hui-Ling; Fitzgerald, Sandra; Chan, Fong

2017-07-04

Self-determination theory (SDT) and self-efficacy theory (SET) can be used to conceptualize self-determined motivation to engage in mental health and vocational rehabilitation (VR) services and to predict recovery. To incorporate SDT and SET as a framework for vocational recovery, developing and validating SDT/SET measures in vocational rehabilitation is warranted. Outcome expectancy is an important SDT/SET variable affecting rehabilitation engagement and recovery. The purpose of this study was to validate the Vocational Outcome Expectancy Scale (VOES) for use within the SDT/SET vocational recovery framework. One hundred and twenty-four individuals with serious mental illness (SMI) participated in this study. Measurement structure of the VOES was evaluated using exploratory factor analysis (EFA) and confirmatory factor analysis (CFA). Both EFA and CFA results supported a two-factor structure: (a) positive outcome expectancy, and (b) negative outcome expectancy. The internal consistency reliability coefficients for both factors were acceptable. In addition, positive outcome expectancy correlated stronger than negative outcome expectancy with other SDT/SET constructs in the expected directions. The VOES is a brief, reliable and valid instrument for assessing vocational outcome expectancy in individuals with SMI that can be integrated into SDT/SET as a vocational rehabilitation engagement and recovery model in psychiatric rehabilitation.
Use of the Environment and Policy Evaluation and Observation as a Self-Report Instrument (EPAO-SR) to measure nutrition and physical activity environments in child care settings: validity and reliability evidence.

PubMed

Ward, Dianne S; Mazzucca, Stephanie; McWilliams, Christina; Hales, Derek

2015-09-26

Early care and education (ECE) centers are important settings influencing young children's diet and physical activity (PA) behaviors. To better understand their impact on diet and PA behaviors as well as to evaluate public health programs aimed at ECE settings, we developed and tested the Environment and Policy Assessment and Observation - Self-Report (EPAO-SR), a self-administered version of the previously validated, researcher-administered EPAO. Development of the EPAO-SR instrument included modification of items from the EPAO, community advisory group and expert review, and cognitive interviews with center directors and classroom teachers. Reliability and validity data were collected across 4 days in 3-5 year old classrooms in 50 ECE centers in North Carolina. Center teachers and directors completed relevant portions of the EPAO-SR on multiple days according to a standardized protocol, and trained data collectors completed the EPAO for 4 days in the centers. Reliability and validity statistics calculated included percent agreement, kappa, correlation coefficients, coefficients of variation, deviations, mean differences, and intraclass correlation coefficients (ICC), depending on the response option of the item. Data demonstrated a range of reliability and validity evidence for the EPAO-SR instrument. Reporting from directors and classroom teachers was consistent and similar to the observational data. Items that produced strongest reliability and validity estimates included beverages served, outside time, and physical activity equipment, while items such as whole grains served and amount of teacher-led PA had lower reliability (observation and self-report) and validity estimates. To overcome lower reliability and validity estimates, some items need administration on multiple days. This study demonstrated appropriate reliability and validity evidence for use of the EPAO-SR in the field. The self-administered EPAO-SR is an advancement of the measurement of ECE settings and can be used by researchers and practitioners to assess the nutrition and physical activity environments of ECE settings.

Cross-Study Homogeneity of Psoriasis Gene Expression in Skin across a Large Expression Range

PubMed Central

Kerkof, Keith; Timour, Martin; Russell, Christopher B.

2013-01-01

Background In psoriasis, only limited overlap between sets of genes identified as differentially expressed (psoriatic lesional vs. psoriatic non-lesional) was found using statistical and fold-change cut-offs. To provide a framework for utilizing prior psoriasis data sets we sought to understand the consistency of those sets. Methodology/Principal Findings Microarray expression profiling and qRT-PCR were used to characterize gene expression in PP and PN skin from psoriasis patients. cDNA (three new data sets) and cRNA hybridization (four existing data sets) data were compared using a common analysis pipeline. Agreement between data sets was assessed using varying qualitative and quantitative cut-offs to generate a DEG list in a source data set and then using other data sets to validate the list. Concordance increased from 67% across all probe sets to over 99% across more than 10,000 probe sets when statistical filters were employed. The fold-change behavior of individual genes tended to be consistent across the multiple data sets. We found that genes with <2-fold change values were quantitatively reproducible between pairs of data-sets. In a subset of transcripts with a role in inflammation changes detected by microarray were confirmed by qRT-PCR with high concordance. For transcripts with both PN and PP levels within the microarray dynamic range, microarray and qRT-PCR were quantitatively reproducible, including minimal fold-changes in IL13, TNFSF11, and TNFRSF11B and genes with >10-fold changes in either direction such as CHRM3, IL12B and IFNG. Conclusions/Significance Gene expression changes in psoriatic lesions were consistent across different studies, despite differences in patient selection, sample handling, and microarray platforms but between-study comparisons showed stronger agreement within than between platforms. We could use cut-offs as low as log10(ratio) = 0.1 (fold-change = 1.26), generating larger gene lists that validate on independent data sets. The reproducibility of PP signatures across data sets suggests that different sample sets can be productively compared. PMID:23308107
The semantics of Chemical Markup Language (CML): dictionaries and conventions.

PubMed

Murray-Rust, Peter; Townsend, Joe A; Adams, Sam E; Phadungsukanan, Weerapong; Thomas, Jens

2011-10-14

The semantic architecture of CML consists of conventions, dictionaries and units. The conventions conform to a top-level specification and each convention can constrain compliant documents through machine-processing (validation). Dictionaries conform to a dictionary specification which also imposes machine validation on the dictionaries. Each dictionary can also be used to validate data in a CML document, and provide human-readable descriptions. An additional set of conventions and dictionaries are used to support scientific units. All conventions, dictionaries and dictionary elements are identifiable and addressable through unique URIs.
The semantics of Chemical Markup Language (CML): dictionaries and conventions

PubMed Central

2011-01-01

The semantic architecture of CML consists of conventions, dictionaries and units. The conventions conform to a top-level specification and each convention can constrain compliant documents through machine-processing (validation). Dictionaries conform to a dictionary specification which also imposes machine validation on the dictionaries. Each dictionary can also be used to validate data in a CML document, and provide human-readable descriptions. An additional set of conventions and dictionaries are used to support scientific units. All conventions, dictionaries and dictionary elements are identifiable and addressable through unique URIs. PMID:21999509
Multiyear Plan for Validation of EnergyPlus Multi-Zone HVAC System Modeling using ORNL's Flexible Research Platform

DOE Office of Scientific and Technical Information (OSTI.GOV)

Im, Piljae; Bhandari, Mahabir S.; New, Joshua Ryan

This document describes the Oak Ridge National Laboratory (ORNL) multiyear experimental plan for validation and uncertainty characterization of whole-building energy simulation for a multi-zone research facility using a traditional rooftop unit (RTU) as a baseline heating, ventilating, and air conditioning (HVAC) system. The project’s overarching objective is to increase the accuracy of energy simulation tools by enabling empirical validation of key inputs and algorithms. Doing so is required to inform the design of increasingly integrated building systems and to enable accountability for performance gaps between design and operation of a building. The project will produce documented data sets that canmore » be used to validate key functionality in different energy simulation tools and to identify errors and inadequate assumptions in simulation engines so that developers can correct them. ASHRAE Standard 140, Method of Test for the Evaluation of Building Energy Analysis Computer Programs (ASHRAE 2004), currently consists primarily of tests to compare different simulation programs with one another. This project will generate sets of measured data to enable empirical validation, incorporate these test data sets in an extended version of Standard 140, and apply these tests to the Department of Energy’s (DOE) EnergyPlus software (EnergyPlus 2016) to initiate the correction of any significant deficiencies. The fitness-for-purpose of the key algorithms in EnergyPlus will be established and demonstrated, and vendors of other simulation programs will be able to demonstrate the validity of their products. The data set will be equally applicable to validation of other simulation engines as well.« less
Clinical audit project in undergraduate medical education curriculum: an assessment validation study

PubMed Central

Steketee, Carole; Mak, Donna

2016-01-01

Objectives To evaluate the merit of the Clinical Audit Project (CAP) in an assessment program for undergraduate medical education using a systematic assessment validation framework. Methods A cross-sectional assessment validation study at one medical school in Western Australia, with retrospective qualitative analysis of the design, development, implementation and outcomes of the CAP, and quantitative analysis of assessment data from four cohorts of medical students (2011- 2014). Results The CAP is fit for purpose with clear external and internal alignment to expected medical graduate outcomes. Substantive validity in students’ and examiners’ response processes is ensured through relevant methodological and cognitive processes. Multiple validity features are built-in to the design, planning and implementation process of the CAP. There is evidence of high internal consistency reliability of CAP scores (Cronbach’s alpha > 0.8) and inter-examiner consistency reliability (intra-class correlation>0.7). Aggregation of CAP scores is psychometrically sound, with high internal consistency indicating one common underlying construct. Significant but moderate correlations between CAP scores and scores from other assessment modalities indicate validity of extrapolation and alignment between the CAP and the overall target outcomes of medical graduates. Standard setting, score equating and fair decision rules justify consequential validity of CAP scores interpretation and use. Conclusions This study provides evidence demonstrating that the CAP is a meaningful and valid component in the assessment program. This systematic framework of validation can be adopted for all levels of assessment in medical education, from individual assessment modality, to the validation of an assessment program as a whole. PMID:27716612
Clinical audit project in undergraduate medical education curriculum: an assessment validation study.

PubMed

Tor, Elina; Steketee, Carole; Mak, Donna

2016-09-24

To evaluate the merit of the Clinical Audit Project (CAP) in an assessment program for undergraduate medical education using a systematic assessment validation framework. A cross-sectional assessment validation study at one medical school in Western Australia, with retrospective qualitative analysis of the design, development, implementation and outcomes of the CAP, and quantitative analysis of assessment data from four cohorts of medical students (2011- 2014). The CAP is fit for purpose with clear external and internal alignment to expected medical graduate outcomes. Substantive validity in students' and examiners' response processes is ensured through relevant methodological and cognitive processes. Multiple validity features are built-in to the design, planning and implementation process of the CAP. There is evidence of high internal consistency reliability of CAP scores (Cronbach's alpha > 0.8) and inter-examiner consistency reliability (intra-class correlation>0.7). Aggregation of CAP scores is psychometrically sound, with high internal consistency indicating one common underlying construct. Significant but moderate correlations between CAP scores and scores from other assessment modalities indicate validity of extrapolation and alignment between the CAP and the overall target outcomes of medical graduates. Standard setting, score equating and fair decision rules justify consequential validity of CAP scores interpretation and use. This study provides evidence demonstrating that the CAP is a meaningful and valid component in the assessment program. This systematic framework of validation can be adopted for all levels of assessment in medical education, from individual assessment modality, to the validation of an assessment program as a whole.
UNCLES: method for the identification of genes differentially consistently co-expressed in a specific subset of datasets.

PubMed

Abu-Jamous, Basel; Fa, Rui; Roberts, David J; Nandi, Asoke K

2015-06-04

Collective analysis of the increasingly emerging gene expression datasets are required. The recently proposed binarisation of consensus partition matrices (Bi-CoPaM) method can combine clustering results from multiple datasets to identify the subsets of genes which are consistently co-expressed in all of the provided datasets in a tuneable manner. However, results validation and parameter setting are issues that complicate the design of such methods. Moreover, although it is a common practice to test methods by application to synthetic datasets, the mathematical models used to synthesise such datasets are usually based on approximations which may not always be sufficiently representative of real datasets. Here, we propose an unsupervised method for the unification of clustering results from multiple datasets using external specifications (UNCLES). This method has the ability to identify the subsets of genes consistently co-expressed in a subset of datasets while being poorly co-expressed in another subset of datasets, and to identify the subsets of genes consistently co-expressed in all given datasets. We also propose the M-N scatter plots validation technique and adopt it to set the parameters of UNCLES, such as the number of clusters, automatically. Additionally, we propose an approach for the synthesis of gene expression datasets using real data profiles in a way which combines the ground-truth-knowledge of synthetic data and the realistic expression values of real data, and therefore overcomes the problem of faithfulness of synthetic expression data modelling. By application to those datasets, we validate UNCLES while comparing it with other conventional clustering methods, and of particular relevance, biclustering methods. We further validate UNCLES by application to a set of 14 real genome-wide yeast datasets as it produces focused clusters that conform well to known biological facts. Furthermore, in-silico-based hypotheses regarding the function of a few previously unknown genes in those focused clusters are drawn. The UNCLES method, the M-N scatter plots technique, and the expression data synthesis approach will have wide application for the comprehensive analysis of genomic and other sources of multiple complex biological datasets. Moreover, the derived in-silico-based biological hypotheses represent subjects for future functional studies.
Legitimacy and Justice Perceptions

ERIC Educational Resources Information Center

Mueller, Charles W.; Landsman, Miriam J.

2004-01-01

Consistent with the theoretical argument of Hegtvedt and Johnson, we empirically examine the relationship between collectivity-generated legitimacy of reward procedures and individual-level justice perceptions about reward distributions. Using data from a natural setting, we find that collectivity sources of validity (authorization and…
Structural design guidelines for concrete bridge decks reinforced with corrosion-resistant reinforcing bars.

DOT National Transportation Integrated Search

2014-10-01

This research program develops and validates structural design guidelines and details for concrete bridge decks with : corrosion-resistant reinforcing (CRR) bars. A two-phase experimental program was conducted where a control test set consistent : wi...
Developing Enhanced Blood–Brain Barrier Permeability Models: Integrating External Bio-Assay Data in QSAR Modeling

PubMed Central

Wang, Wenyi; Kim, Marlene T.; Sedykh, Alexander

2015-01-01

Purpose Experimental Blood–Brain Barrier (BBB) permeability models for drug molecules are expensive and time-consuming. As alternative methods, several traditional Quantitative Structure-Activity Relationship (QSAR) models have been developed previously. In this study, we aimed to improve the predictivity of traditional QSAR BBB permeability models by employing relevant public bio-assay data in the modeling process. Methods We compiled a BBB permeability database consisting of 439 unique compounds from various resources. The database was split into a modeling set of 341 compounds and a validation set of 98 compounds. Consensus QSAR modeling workflow was employed on the modeling set to develop various QSAR models. A five-fold cross-validation approach was used to validate the developed models, and the resulting models were used to predict the external validation set compounds. Furthermore, we used previously published membrane transporter models to generate relevant transporter profiles for target compounds. The transporter profiles were used as additional biological descriptors to develop hybrid QSAR BBB models. Results The consensus QSAR models have R2=0.638 for fivefold cross-validation and R2=0.504 for external validation. The consensus model developed by pooling chemical and transporter descriptors showed better predictivity (R2=0.646 for five-fold cross-validation and R2=0.526 for external validation). Moreover, several external bio-assays that correlate with BBB permeability were identified using our automatic profiling tool. Conclusions The BBB permeability models developed in this study can be useful for early evaluation of new compounds (e.g., new drug candidates). The combination of chemical and biological descriptors shows a promising direction to improve the current traditional QSAR models. PMID:25862462
Minimally Invasive Surgery Survey: A Survey of Surgical Team Members' Perceptions for Successful Minimally Invasive Surgery.

PubMed

Yurteri-Kaplan, Ladin A; Andriani, Leslie; Kumar, Anagha; Saunders, Pamela A; Mete, Mihriye M; Sokol, Andrew I

To develop a valid and reliable survey to measure surgical team members' perceptions regarding their institution's requirements for successful minimally invasive surgery (MIS). Questionnaire development and validation study (Canadian Task Force classification II-2). Three hospital types: rural, urban/academic, and community/academic. Minimally invasive staff (team members). Development and validation of a minimally invasive surgery survey (MISS). Using the Safety Attitudes questionnaire as a guide, we developed questions assessing study participants' attitudes regarding the requirements for successful MIS. The questions were closed-ended and responses based on a 5-point Likert scale. The large pool of questions was then given to 4 focus groups made up of 3 to 6 individuals. Each focus group consisted of individuals from a specific profession (e.g., surgeons, anesthesiologists, nurses, and surgical technicians). Questions were revised based on focus group recommendations, resulting in a final 52-question set. The question set was then distributed to MIS team members. Individuals were included if they had participated in >10 MIS cases and worked in the MIS setting in the past 3 months. Participants in the trial population were asked to repeat the questionnaire 4 weeks later to evaluate internal consistency. Participants' demographics, including age, gender, specialty, profession, and years of experience, were captured in the questionnaire. Factor analysis with varimax rotation was performed to determine domains (questions evaluating similar themes). For internal consistency and reliability, domains were tested using interitem correlations and Cronbach's α. Cronbach's α > .6 was considered internally consistent. Kendall's correlation coefficient τ closer to 1 and with p < .05 was considered significant for the test-retest reliability. Two hundred fifty participants answered the initial question set. Of those, 53 were eliminated because they did not meet inclusion criteria or failed to answer all questions, leaving 197 participants. Most participants were women (68% vs 32%), and 42% were between the ages 30 and 39 years. Factor analysis identified 6 domains: collaboration, error reporting, job proficiency/efficiency, problem-solving, job satisfaction, and situational awareness. Interitem correlations testing for redundancy for each domain ranged from .2 to .7, suggesting similar themed questions while avoiding redundancy. Cronbach's α, testing internal consistency, was .87. Sixty-two participants from the original cohort repeated the question set at 4 weeks. Forty-three were analyzed for test-retest reliability after excluding those who did not meet inclusion criteria. The final questions showed high test-retest reliability (τ = .3-.7, p < .05). The final questionnaire was made up of 29 questions from the original 52 question set. The MISS is a reliable and valid tool that can be used to measure how surgical team members conceptualize the requirements for successful MIS. The MISS revealed that participants identified 6 important domains of a successful workenvironment: collaboration, error reporting, job proficiency/efficiency, problem-solving, job satisfaction, and situational awareness. The questionnaire can be used to understand and align various surgical team members' goals and expectations and may help improve quality of care in the MIS setting. Copyright © 2017 American Association of Gynecologic Laparoscopists. Published by Elsevier Inc. All rights reserved.
A Brief Measure of Narcissism Among Female Juvenile Delinquents and Community Youths: The Narcissistic Personality Inventory-13.

PubMed

Pechorro, Pedro; Maroco, João; Ray, James V; Gonçalves, Rui Abrunhosa; Nunes, Cristina

2018-06-01

Research on narcissism has a long tradition, but there is limited knowledge regarding its application among female youth, especially for forensic samples of incarcerated female youth. Drawing on 377 female adolescents (103 selected from forensic settings and 274 selected from school settings) from Portugal, the current study is the first to examine simultaneously the psychometric properties of a brief version of the Narcissistic Personality Inventory (NPI-13) among females drawn from incarcerated and community settings. The results support the three-factor structure model of narcissism after the removal of one item due to its low factor loading. Internal consistency, convergent validity, and discriminant validity showed promising results. In terms of criterion-related validity, significant associations were found with criterion-related variables such as age of criminal onset, conduct disorder, crime severity, violent crimes, and alcohol and drug use. The findings provide support for use of the NPI-13 among female juveniles.
The Therapeutic Environment Screening Survey for Nursing Homes (TESS-NH): an observational instrument for assessing the physical environment of institutional settings for persons with dementia.

PubMed

Sloane, Philip D; Mitchell, C Madeline; Weisman, Gerald; Zimmerman, Sheryl; Foley, Kristie M Long; Lynn, Mary; Calkins, Margaret; Lawton, M Powell; Teresi, Jeanne; Grant, Leslie; Lindeman, David; Montgomery, Rhonda

2002-03-01

To develop an observational instrument that describes the ability of physical environments of institutional settings to address therapeutic goals for persons with dementia. A National Institute on Aging workgroup identified and subsequently revised items that evaluated exit control, maintenance, cleanliness, safety, orientation/cueing, privacy, unit autonomy, outdoor access, lighting, noise, visual/tactile stimulation, space/seating, and familiarity/homelikeness. The final instrument contains 84 discrete items and one global rating. A summary scale, the Special Care Unit Environmental Quality Scale (SCUEQS), consists of 18 items. Lighting items were validated using portable light meters. Concurrent criterion validation compared SCUEQS scores with the Professional Environmental Assessment Protocol (PEAP). Interrater kappa statistics for 74% of items were above.60. For another 10% of items, kappas could not be calculated due to empty cells, but interrater agreement was above 80%. The SCUEQS demonstrated an interrater reliability of.93, a test--retest reliability of.88, and an internal consistency of.81--.83. Light meter ratings correlated significantly with the Therapeutic Environment Screening Survey for Nursing Homes (TESS-NH) lighting items (r =.29--.38, p =.01--.04), and the SCUEQS correlated significantly with global PEAP ratings (r =.52, p <.01). The TESS-NH efficiently assesses discrete elements of the physical environment and has strong reliability and validity. The SCUEQS provides a quantitative measure of environmental quality in institutional settings.
Validity of the SAT for Predicting First-Year Grades: 2008 SAT Validity Sample. Statistical Report No. 2011-5

ERIC Educational Resources Information Center

Patterson, Brian F.; Mattern, Krista D.

2011-01-01

The findings for the 2008 sample are largely consistent with the previous reports. SAT scores were found to be correlated with FYGPA (r = 0.54), with a magnitude similar to HSGPA (r = 0.56). The best set of predictors of FYGPA remains SAT scores and HSGPA (r = 0.63), as the addition of the SAT sections to the correlation of HSGPA alone with FYGPA…
Evidence-based nursing-sensitive indicators for patients hospitalized with depression in Thailand.

PubMed

Thapinta, Darawan; Anders, Robert L; Mahatnirunkul, Suwat; Srikosai, Soontaree

2010-12-01

The aim of this study was to develop and validate nursing-sensitive indicators for patients hospitalized with depression in Thailand. The initial draft, consisting of 12 categories with 37 subcategories, was then evaluated by experts in the US and Thailand. Hospital records were then utilized to evaluate the feasibility and efficacy of the indicators. The finalized instrument consisted of 11 categories with 43 items with a validity of .98 and internal consistency of .88. This is the first set of indicators developed to evaluate nursing-sensitivity for patients hospitalized with a diagnosis of depression in Thailand. Having nursing indicators for depressed patients provides nurses with concrete tools to evaluate their work with depressed patients, allowing these staff to assess their work in a very specific, methodical, and consistent manner. When problems are discovered, both the staff and administration can work to address these issues through training, procedural changes, and departmental shifts.
Psychometric analyses and internal consistency of the PHEEM questionnaire to measure the clinical learning environment in the clerkship of a Medical School in Chile.

PubMed

Riquelme, Arnoldo; Herrera, Cristian; Aranis, Carolina; Oporto, Jorge; Padilla, Oslando

2009-06-01

The Spanish version of the Postgraduate Hospital Educational Environment Measure (PHEEM) was evaluated in this study to determine its psychometric properties, validity and internal consistency to measure the clinical learning environment in the hospital setting of Pontificia Universidad Católica de Chile Medical School's Internship. The 40-item PHEEM questionnaire was translated from English to Spanish and retranslated to English. Content validity was tested by a focus group and minor differences in meaning were adjusted. The PHEEM was administered to clerks in years 6 and 7. Construct validity was carried out using exploratory factor analysis followed by a Varimax rotation. Internal consistency was measured using Cronbach's alpha. A total of 125 out of 220 students responded to the PHEEM. The overall response rate was 56.8% and compliances with each item ranged from 99.2% to 100%. Analyses indicate that five factors instrument accounting for 58% of the variance and internal consistency of the 40-item questionnaire is 0.955 (Cronbach's alpha). The 40-item questionnaire had a mean score of 98.21 +/- 21.2 (maximum score of 160). The Spanish version of PHEEM is a multidimensional, valid and highly reliable instrument measuring the educational environment among undergraduate medical students working in hospital-based clerkships.
Validity and reliability of the Malay version of the Hill-Bone compliance to high blood pressure therapy scale for use in primary healthcare settings in Malaysia: A cross-sectional study.

PubMed

Cheong, A T; Tong, S F; Sazlina, S G

2015-01-01

Hill-Bone compliance to high blood pressure therapy scale (HBTS) is one of the useful scales in primary care settings. It has been tested in America, Africa and Turkey with variable validity and reliability. The aim of this paper was to determine the validity and reliability of the Malay version of HBTS (HBTS-M) for the Malaysian population. HBTS comprises three subscales assessing compliance to medication, appointment and salt intake. The content validity of HBTS to the local population was agreed through consensus of expert panel. The 14 items used in the HBTS were adapted to reflect the local situations. It was translated into Malay and then back-translated into English. The translated version was piloted in 30 participants. This was followed by structural and predictive validity, and internal consistency testing in 262 patients with hypertension, who were on antihypertensive agent(s) for at least 1 year in two primary healthcare clinics in Kuala Lumpur, Malaysia. Exploratory factor analyses and the correlation between HBTS-M total score and blood pressure were performed. The Cronbach's alpha was calculated accordingly. Factor analysis revealed a three-component structure represented by two components on medication adherence and one on salt intake adherence. The Kaiser-Meyer-Olkin statistic was 0.764. The variance explained by each factors were 23.6%, 10.4% and 9.8%, respectively. However, the internal consistency for each component was suboptimal with Cronbach's alpha of 0.64, 0.55 and 0.29, respectively. Although there were two components representing medication adherence, the theoretical concepts underlying each concept cannot be differentiated. In addition, there was no correlation between the HBTS-M total score and blood pressure. HBTS-M did not conform to the structural and predictive validity of the original scale. Its reliability on assessing medication and salt intake adherence would most probably to be suboptimal in the Malaysian primary care setting.
The FORBIO Climate data set for climate analyses

NASA Astrophysics Data System (ADS)

Delvaux, C.; Journée, M.; Bertrand, C.

2015-06-01

In the framework of the interdisciplinary FORBIO Climate research project, the Royal Meteorological Institute of Belgium is in charge of providing high resolution gridded past climate data (i.e. temperature and precipitation). This climate data set will be linked to the measurements on seedlings, saplings and mature trees to assess the effects of climate variation on tree performance. This paper explains how the gridded daily temperature (minimum and maximum) data set was generated from a consistent station network between 1980 and 2013. After station selection, data quality control procedures were developed and applied to the station records to ensure that only valid measurements will be involved in the gridding process. Thereafter, the set of unevenly distributed validated temperature data was interpolated on a 4 km × 4 km regular grid over Belgium. The performance of different interpolation methods has been assessed. The method of kriging with external drift using correlation between temperature and altitude gave the most relevant results.
Electric dipole moment of diatomic molecules by configuration interaction. IV.

NASA Technical Reports Server (NTRS)

Green, S.

1972-01-01

The theory of basis set dependence in configuration interaction calculations is discussed, taking into account a perturbation model which is valid for small changes in the self-consistent field orbitals. It is found that basis set corrections are essentially additive through first order. It is shown that an error found in a previously published dipole moment calculation by Green (1972) for the metastable first excited state of CO was indeed due to an inadequate basis set as claimed.
The Consumer Assessment of Healthcare Providers and Systems (CAHPS) cultural competence (CC) item set.

PubMed

Weech-Maldonado, Robert; Carle, Adam; Weidmer, Beverly; Hurtado, Margarita; Ngo-Metzger, Quyen; Hays, Ron D

2012-09-01

There is a need for reliable and valid measures of cultural competence (CC) from the patient's perspective. This paper evaluates the reliability and validity of the Consumer Assessments of Healthcare Providers and Systems (CAHPS) CC item set. Using 2008 survey data, we assessed the internal consistency of the CAHPS CC scales using the Cronbach α's and examined the validity of the measures using exploratory and confirmatory factor analysis, multitrait scaling analysis, and regression analysis. A random stratified sample (based on race/ethnicity and language) of 991 enrollees, younger than 65 years, from 2 Medicaid managed care plans in California and New York. CAHPS CC item set after excluding screener items and ratings. Confirmatory factor analysis (Comparative Fit Index=0.98, Tucker Lewis Index=0.98, and Root Mean Square Error or Approximation=0.06) provided support for a 7-factor structure: Doctor Communication--Positive Behaviors, Doctor Communication--Negative Behaviors, Doctor Communication--Health Promotion, Doctor Communication--Alternative Medicine, Shared Decision-Making, Equitable Treatment, and Trust. Item-total correlations (corrected for item overlap) for the 7 scales exceeded 0.40. Exploratory factor analysis showed support for 1 additional factor: Access to Interpreter Services. Internal consistency reliability estimates ranged from 0.58 (Alternative Medicine) to 0.92 (Positive Behaviors) and was 0.70 or higher for 4 of the 8 composites. All composites were positively and significantly associated with the overall doctor rating. The CAHPS CC 26-item set demonstrates adequate measurement properties and can be used as a supplemental item set to the CAHPS Clinician and Group Surveys in assessing culturally competent care from the patient's perspective.

A score to estimate the likelihood of detecting advanced colorectal neoplasia at colonoscopy

PubMed Central

Kaminski, Michal F; Polkowski, Marcin; Kraszewska, Ewa; Rupinski, Maciej; Butruk, Eugeniusz; Regula, Jaroslaw

2014-01-01

Objective This study aimed to develop and validate a model to estimate the likelihood of detecting advanced colorectal neoplasia in Caucasian patients. Design We performed a cross-sectional analysis of database records for 40-year-old to 66-year-old patients who entered a national primary colonoscopy-based screening programme for colorectal cancer in 73 centres in Poland in the year 2007. We used multivariate logistic regression to investigate the associations between clinical variables and the presence of advanced neoplasia in a randomly selected test set, and confirmed the associations in a validation set. We used model coefficients to develop a risk score for detection of advanced colorectal neoplasia. Results Advanced colorectal neoplasia was detected in 2544 of the 35 918 included participants (7.1%). In the test set, a logistic-regression model showed that independent risk factors for advanced colorectal neoplasia were: age, sex, family history of colorectal cancer, cigarette smoking (p<0.001 for these four factors), and Body Mass Index (p=0.033). In the validation set, the model was well calibrated (ratio of expected to observed risk of advanced neoplasia: 1.00 (95% CI 0.95 to 1.06)) and had moderate discriminatory power (c-statistic 0.62). We developed a score that estimated the likelihood of detecting advanced neoplasia in the validation set, from 1.32% for patients scoring 0, to 19.12% for patients scoring 7–8. Conclusions Developed and internally validated score consisting of simple clinical factors successfully estimates the likelihood of detecting advanced colorectal neoplasia in asymptomatic Caucasian patients. Once externally validated, it may be useful for counselling or designing primary prevention studies. PMID:24385598
A score to estimate the likelihood of detecting advanced colorectal neoplasia at colonoscopy.

PubMed

Kaminski, Michal F; Polkowski, Marcin; Kraszewska, Ewa; Rupinski, Maciej; Butruk, Eugeniusz; Regula, Jaroslaw

2014-07-01

This study aimed to develop and validate a model to estimate the likelihood of detecting advanced colorectal neoplasia in Caucasian patients. We performed a cross-sectional analysis of database records for 40-year-old to 66-year-old patients who entered a national primary colonoscopy-based screening programme for colorectal cancer in 73 centres in Poland in the year 2007. We used multivariate logistic regression to investigate the associations between clinical variables and the presence of advanced neoplasia in a randomly selected test set, and confirmed the associations in a validation set. We used model coefficients to develop a risk score for detection of advanced colorectal neoplasia. Advanced colorectal neoplasia was detected in 2544 of the 35,918 included participants (7.1%). In the test set, a logistic-regression model showed that independent risk factors for advanced colorectal neoplasia were: age, sex, family history of colorectal cancer, cigarette smoking (p<0.001 for these four factors), and Body Mass Index (p=0.033). In the validation set, the model was well calibrated (ratio of expected to observed risk of advanced neoplasia: 1.00 (95% CI 0.95 to 1.06)) and had moderate discriminatory power (c-statistic 0.62). We developed a score that estimated the likelihood of detecting advanced neoplasia in the validation set, from 1.32% for patients scoring 0, to 19.12% for patients scoring 7-8. Developed and internally validated score consisting of simple clinical factors successfully estimates the likelihood of detecting advanced colorectal neoplasia in asymptomatic Caucasian patients. Once externally validated, it may be useful for counselling or designing primary prevention studies. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions.
Reliability and Validity of the Self Efficacy Expectations and Outcome Expectations After ICD Implantation Scales

PubMed Central

Dougherty, Cynthia M.; Johnston, Sandra K.; Thompson, Elaine Adams

2009-01-01

The purpose of this study was to assess the reliability and validity characteristics of two new scales that measure self-efficacy expectations (SE-ICD) and outcome expectations (OE-ICD) in survivors (n=168) of sudden cardiac arrest (SCA), all of whom received an implantable cardioverter defibrillator (ICD). Cronbach's alpha reliability demonstrated good internal consistency (SE-ICD α = 0.93 and OE-ICD α = 0.81). Correlations with other self-efficacy instruments (general self-efficacy and social self-efficacy) were consistently high. The instruments were responsive to change across time with effect sizes of 0.46 for SE-ICD, and 0.26 for OE-ICD. These reliable, valid, and responsive instruments for measurement of self-efficacy expectations and outcome expectations after an ICD can be used in research and clinical settings. PMID:17693214
Psychometric Properties and Normative values of Early Maladaptive Schema Questionnaires Set for Children and Adolescents (SQS).

PubMed

Güner, Olcay

2017-03-01

The Early Maladaptive Schema Questionnaires Set for Children and Adolescents (SQS) was developed to assess early maladaptive schemas in children between the ages of 10 and 16 in Turkey. The SQS consists of five questionnaires that represent five schema domains in Young's schema theory. Psychometric properties (n = 983) and normative values (n = 2250) of SQS were investigated in children and adolescents between the ages of 10 and 16. Both exploratory and confirmatory factor analyses were performed. Results revealed 15 schema factors under five schema domains, with good fit indexes. A total of 14 schema factors were in line with Young's early maladaptive schemas. In addition to these factors, one new schema emerged: self-disapproval. Reliability analyses showed that SQS has high internal consistency and consistency over a 1-month interval. Correlations of SQS with the Adjective Check List (ACL), the Inventory of Parent and Peer Attachment (IPPA), the Symptom Assessment (SA-45) and the Young Schema Questionnaire (YSQ) were investigated to assess criterion validity, and the correlations revealed encouraging results. SQS significantly differentiated between children who have clinical diagnoses (n = 78) and children who have no diagnosis (n = 100). Finally, general normative values (n = 2,250) were determined for age groups, gender and age/gender groups. In conclusion, the early maladaptive schema questionnaires set for children and adolescents turned out to be a reliable and valid questionnaire with standard scores.Copyright © 2016 John Wiley & Sons, Ltd. The early maladaptive schema questionnaires set for children and adolescents (SQS) is a psychometrically reliable and valid measure of early maladaptive schemas for children between the ages of 10 and 16. SQS consists of five schema domains that represent Young's schema domains including 15 early maladaptive schemas and 97 items. Normative values for each schema were determined for age, gender and age/gender groups. Clinically, SQS presents valuable information about early maladaptive schemas during childhood and adolescence, before such schemas become more pervasive and persistent. Copyright © 2016 John Wiley & Sons, Ltd.
Development of Survey Scales for Measuring Exposure and Behavioral Responses to Disruptive Intraoperative Behavior.

PubMed

Villafranca, Alexander; Hamlin, Colin; Rodebaugh, Thomas L; Robinson, Sandra; Jacobsohn, Eric

2017-09-10

Disruptive intraoperative behavior has detrimental effects to clinicians, institutions, and patients. How clinicians respond to this behavior can either exacerbate or attenuate its effects. Previous investigations of disruptive behavior have used survey scales with significant limitations. The study objective was to develop appropriate scales to measure exposure and responses to disruptive behavior. We obtained ethics approval. The scales were developed in a sequence of steps. They were pretested using expert reviews, computational linguistic analysis, and cognitive interviews. The scales were then piloted on Canadian operating room clinicians. Factor analysis was applied to half of the data set for question reduction and grouping. Item response analysis and theoretical reviews ensured that important questions were not eliminated. Internal consistency was evaluated using Cronbach α. Model fit was examined on the second half of the data set using confirmatory factor analysis. Content validity of the final scales was re-evaluated. Consistency between observed relationships and theoretical predictions was assessed. Temporal stability was evaluated on a subsample of 38 respondents. A total of 1433 and 746 clinicians completed the exposure and response scales, respectively. Content validity indices were excellent (exposure = 0.96, responses = 1.0). Internal consistency was good (exposure = 0.93, responses = 0.87). Correlations between the exposure scale and secondary measures were consistent with expectations based on theory. Temporal stability was acceptable (exposure = 0.77, responses = 0.73). We have developed scales measuring exposure and responses to disruptive behavior. They generate valid and reliable scores when surveying operating room clinicians, and they overcome the limitations of previous tools. These survey scales are freely available.
Development of a Quality of Meals and Meal Service Set of Indicators for Residential Facilities for Elderly.

PubMed

Van Damme, N; Buijck, B; Van Hecke, A; Verhaeghe, S; Goossens, E; Beeckman, D

2016-01-01

To develop a content validated set of indicators to evaluate the quality of meals and meal service in residential facilities for elderly. Inadequate food intake is an important risk factor for malnutrition in residential facilities for elderly. Through better meeting the needs and preferences of residents and optimization of meals and meal service, residents' food intake can improve. No indicators were available which could help to guide strategies to improve the quality of meals and meal service. The indicator set was developed according to the Indicator Development Manual of the Dutch Institute for Health Care Improvement (CBO). The working group consisted of three nurse researchers and one expert in gastrology and had expertise in elderly care, malnutrition, indicator development, and food quality. A preliminary list of potential indicators was compiled using the literature and the working group's expertise. Criteria necessary to measure the indicator in practice were developed for each potential indicator. In a double Delphi procedure, the list of potential indicators and respective criteria were analyzed for content validity, using a multidisciplinary expert panel of 11 experts in elderly meal care. A preliminary list of 20 quality indicators, including 45 criteria, was submitted to the expert panel in a double Delphi procedure. After the second Delphi round, 13 indicators and 25 criteria were accepted as having content validity. The content validity index (CVI) ranged from 0.83 to 1. The indicator set consisted of six structural, four result, and three outcome indicators covering the quality domains food, service and choice, as well as nutritional screening. The criteria measure diverse aspects of meal care which are part of the responsibility of kitchen staff and health care professionals. The 'quality of meals and meal service' set of indicators is a resource to map meal quality in residential facilities for elderly. As soon as feasibility tests in practice are completed, the indicator set can be used to guide meal and meal service quality improvement projects in collaboration with kitchen staff and health care professionals. These improvement projects will help to improve food intake and reduce the risk of malnutrition among elders living in residential facilities.
Development and validation of an early childhood development scale for use in low-resourced settings.

PubMed

McCoy, Dana Charles; Sudfeld, Christopher R; Bellinger, David C; Muhihi, Alfa; Ashery, Geofrey; Weary, Taylor E; Fawzi, Wafaie; Fink, Günther

2017-02-09

Low-cost, cross-culturally comparable measures of the motor, cognitive, and socioemotional skills of children under 3 years remain scarce. In the present paper, we aim to develop a new caregiver-reported early childhood development (ECD) scale designed to be implemented as part of household surveys in low-resourced settings. We evaluate the acceptability, test-retest reliability, internal consistency, and discriminant validity of the new ECD items, subscales, and full scale in a sample of 2481 18- to 36-month-old children from peri-urban and rural Tanzania. We also compare total and subscale scores with performance on the Bayley Scales of Infant Development (BSID-III) in a subsample of 1036 children. Qualitative interviews from 10 mothers and 10 field workers are used to inform quantitative data. Adequate levels of acceptability and internal consistency were found for the new scale and its motor, cognitive, and socioemotional subscales. Correlations between the new scale and the BSID-III were high (r > .50) for the motor and cognitive subscales, but low (r < .20) for the socioemotional subscale. The new scale discriminated between children's skills based on age, stunting status, caregiver-reported disability, and adult stimulation. Test-retest reliability scores were variable among a subset of items tested. Results of this study provide empirical support from a low-income country setting for the acceptability, reliability, and validity of a new caregiver-reported ECD scale. Additional research is needed to test these and other caregiver reported items in children in the full 0 to 3 year range across multiple cultural and linguistic settings.
STOPP/START Medication Criteria Modified for US Nursing Home Setting

PubMed Central

Khodyakov, Dmitry; Ochoa, Aileen; Olivieri-Mui, Brianne L.; Bouwmeester, Carla; Zarowitz, Barbara J.; Patel, Meenakshi; Ching, Diana; Briesacher, Becky

2016-01-01

STRUCTURED ABSTRACT BACKGROUND/OBJECTIVES A barrier to assessing the quality of prescribing in nursing homes (NH) is the lack of explicit criteria for this setting. Our objective was to develop a set of prescribing indicators measurable with available data from electronic nursing home databases by adapting the European-based 2014 STOPP/START criteria of potentially inappropriate and underused medications for the US setting. DESIGN A two-stage expert panel process. In first stage, investigator team reviewed 114 criteria for compatibility and measurability. In second stage, we convened an online modified e-Delphi (OMD) panel to rate the validity of criteria and two webinars to identify criteria with highest relevance to US NHs. PARTICIPANTS Seventeen experts with recognized reputations in NH care participated in the e-Delphi panel and 12 in the webinar. MEASUREMENTS Compatibility and measurability were assessed by comparing criteria to US terminology/setting standards and data elements in NH databases. Validity was rated with a 9-point Likert-type scale (1=not valid at all, 9=highly valid). Mean, median, interpercentile ranges, and agreement were determined for each criterion score. Relevance was determined by ranking the mean panel ratings on criteria that reached agreement; half of the criteria with the highest mean values were reviewed and approved by the webinar participants. RESULTS Fifty-three STOPP/START criteria were deemed as compatible with US setting and measurable using data from electronic NH databases. E-Delphi panelists rated 48 criteria as valid for US NHs. Twenty-four criteria were deemed as most relevant, consisting of 22 measures of potentially inappropriate medications and 2 measures of underused medications. CONCLUSION This study created the first explicit criteria for assessing the quality of prescribing in US NHs. PMID:28008599
A Psychometric Assessment of the "Businessweek," "U.S. News & World Report," and "Financial Times" Rankings of Business Schools' MBA Programs

ERIC Educational Resources Information Center

Iacobucci, Dawn

2013-01-01

This research investigates the reliability and validity of three major publications' rankings of MBA programs. Each set of rankings showed reasonable consistency over time, both at the level of the overall rankings and for most of the facets from which the rankings are derived. Each set of rankings also showed some levels of convergent and…
Cross-cultural adaptation, reliability and construct validity of the Tampa scale for kinesiophobia for temporomandibular disorders (TSK/TMD-Br) into Brazilian Portuguese.

PubMed

Aguiar, A S; Bataglion, C; Visscher, C M; Bevilaqua Grossi, D; Chaves, T C

2017-07-01

Fear of movement (kinesiophobia) seems to play an important role in the development of chronic pain. However, for temporomandibular disorders (TMD), there is a scarcity of studies about this topic. The Tampa Scale for Kinesiophobia for TMD (TSK/TMD) is the most widely used instrument to measure fear of movement and it is not available in Brazilian Portuguese. The purpose of this study was to culturally adapt the TSK/TMD to Brazilian Portuguese and to assess its psychometric properties regarding internal consistency, reliability, and construct and structural validity. A total of 100 female patients with chronic TMD participated in the validation process of the TSK/TMD-Br. The intraclass correlation coefficient (ICC) was used for statistical analysis of reliability (test-retest), Cronbach's alpha for internal consistency, Spearman's rank correlation for construct validity and confirmatory factor analysis (CFA) for structural validity. CFA endorsed the pre-specified model with two domains and 12-items (Activity Avoidance - AA/Somatic Focus - SF) and all items obtained a loading factor greater than 0·4. Acceptable levels of reliability were found (ICC > 0·75) for all questions and domains of the TSK/TMD-Br. For internal consistency, Cronbach's α of 0·78 for both domains were found. Moderate correlations (0·40 < r < 0.60) were observed for 84% of the analyses conducted between TSK/TMD-Br scores versus catastrophising, depression and jaw functional limitation. TSK/TMD-Br 12 items and two-factor demonstrated sound psychometric properties (transcultural validity, reliability, internal consistency and structural validity). In such a way, the instrument can be used in clinical settings and for research purposes. © 2017 John Wiley & Sons Ltd.
Development and Validation of a Spanish Version of the Grit-S Scale

PubMed Central

Arco-Tirado, Jose L.; Fernández-Martín, Francisco D.; Hoyle, Rick H.

2018-01-01

This paper describes the development and initial validation of a Spanish version of the Short Grit (Grit-S) Scale. The Grit-S Scale was adapted and translated into Spanish using the Translation, Review, Adjudication, Pre-testing, and Documentation model and responses to a preliminary set of items from a large sample of university students (N = 1,129). The resultant measure was validated using data from a large stratified random sample of young adults (N = 1,826). Initial validation involved evaluating the internal consistency of the adapted scale and its subscales and comparing the factor structure of the adapted version to that of the original scale. The results were comparable to results from similar analyses of the English version of the scale. Although the internal consistency of the subscales was low, the internal consistency of the full scale was well-within the acceptable range. A two-factor model offered an acceptable account of the data; however, when a single correlated error involving two highly similar items was included, a single factor model fit the data very well. The results support the use of overall scores from the Spanish Grit-S Scale in future research. PMID:29467705
A Finnish validation study of the SCL-90.

PubMed

Holi, M M; Sammallahti, P R; Aalberg, V A

1998-01-01

The Symptom Check-List-90 (SCL-90) is a widely used psychiatric questionnaire which has not yet been validated in Finland. We investigated the utility of the translated version of the SCL-90 in the Finnish population, and set community norms for it. The internal consistency of the original subscales was checked and found to be good. Discriminant function analysis, based on the nine original subscales, showed that the power of the SCL-90 to discriminate between patients and the community is good. Factor analysis of the items of the questionnaire yielded a very strong unrotated first factor, suggesting that a general factor may be present. This together with the fact that high intercorrelations were found between the nine original subscales suggests that the instrument is not multidimensional. The SCL-90 may be useful in a research setting as an instrument for measuring the change in symptomatic distress, or as a screening instrument. The American community norms should be used with caution, as the Finnish community sample scored consistently higher on all subscales.
Routine development of objectively derived search strategies.

PubMed

Hausner, Elke; Waffenschmidt, Siw; Kaiser, Thomas; Simon, Michael

2012-02-29

Over the past few years, information retrieval has become more and more professionalized, and information specialists are considered full members of a research team conducting systematic reviews. Research groups preparing systematic reviews and clinical practice guidelines have been the driving force in the development of search strategies, but open questions remain regarding the transparency of the development process and the available resources. An empirically guided approach to the development of a search strategy provides a way to increase transparency and efficiency. Our aim in this paper is to describe the empirically guided development process for search strategies as applied by the German Institute for Quality and Efficiency in Health Care (Institut für Qualität und Wirtschaftlichkeit im Gesundheitswesen, or "IQWiG"). This strategy consists of the following steps: generation of a test set, as well as the development, validation and standardized documentation of the search strategy. We illustrate our approach by means of an example, that is, a search for literature on brachytherapy in patients with prostate cancer. For this purpose, a test set was generated, including a total of 38 references from 3 systematic reviews. The development set for the generation of the strategy included 25 references. After application of textual analytic procedures, a strategy was developed that included all references in the development set. To test the search strategy on an independent set of references, the remaining 13 references in the test set (the validation set) were used. The validation set was also completely identified. Our conclusion is that an objectively derived approach similar to that used in search filter development is a feasible way to develop and validate reliable search strategies. Besides creating high-quality strategies, the widespread application of this approach will result in a substantial increase in the transparency of the development process of search strategies.
Identifying dyspepsia in the Greek population: translation and validation of a questionnaire.

PubMed

Anastasiou, Foteini; Antonakis, Nikos; Chaireti, Georgia; Theodorakis, Pavlos N; Lionis, Christos

2006-03-04

Studies on clinical issues, including diagnostic strategies, are considered to be the core content of general practice research. The use of standardised instruments is regarded as an important component for the development of Primary Health Care research capacity. Demand for epidemiological cross-cultural comparisons in the international setting and the use of common instruments and definitions valid to each culture is bigger than ever. Dyspepsia is a common complaint in primary practice but little is known with respect to its incidence in Greece. There are some references about the Helicobacter Pylori infection in patients with functional dyspepsia or gastric ulcer in Greece but there is no specific instrument for the identification of dyspepsia. This paper reports on the validation and translation into Greek, of an English questionnaire for the identification of dyspepsia in the general population and discusses several possibilities of its use in the Greek primary care. The selected English postal questionnaire for the identification of people with dyspepsia in the general population consists of 30 items and was developed in 1995. The translation and cultural adaptation of the questionnaire has been performed according to international standards. For the validation of the instrument the internal consistency of the items was established using the alpha coefficient of Chronbach, the reproducibility (test - retest reliability) was measured by kappa correlation coefficient and the criterion validity was calculated against the diagnosis of the patients' records using also kappa correlation coefficient. The final Greek version of the postal questionnaire for the identification of dyspepsia in the general population was reliably translated. The internal consistency of the questionnaire was good, Chronbach's alpha was found to be 0.88 (95% CI: 0.81-0.93), suggesting that all items were appropriate to measure. Kappa coefficient for reproducibility (test - retest reliability) was found 0.66 (95% CI: 0.62-0.71), whereas the kappa analysis for criterion validity was 0.63 (95% CI: 0.36-0.89). This study indicates that the Greek translation is comparable with the English-language version in terms of validity and reliability, and is suitable for epidemiological research within the Greek primary health care setting.
Validity and Reliability of Farsi Version of Youth Sport Environment Questionnaire

PubMed Central

Eshghi, Mohammad Ali; Kordi, Ramin; Memari, Amir Hossein; Ghaziasgar, Ahmad; Mansournia, Mohammad-Ali; Zamani Sani, Seyed Hojjat

2015-01-01

The Youth Sport Environment Questionnaire (YSEQ) had been developed from Group Environment Questionnaire, a well-known measure of team cohesion. The aim of this study was to adapt and examine the reliability and validity of the Farsi version of the YSEQ. This version was completed by 455 athletes aged 13–17 years. Results of confirmatory factor analysis indicated that two-factor solution showed a good fit to the data. The results also revealed that the Farsi YSEQ showed high internal consistency, test-retest reliability, and good concurrent validity. This study indicated that the Farsi version of the YSEQ is a valid and reliable measure to assess team cohesion in sport setting. PMID:26464900
International validation of quality indicators for evaluating priority setting in low income countries: process and key lessons.

PubMed

Kapiriri, Lydia

2017-06-19

While there have been efforts to develop frameworks to guide healthcare priority setting; there has been limited focus on evaluation frameworks. Moreover, while the few frameworks identify quality indicators for successful priority setting, they do not provide the users with strategies to verify these indicators. Kapiriri and Martin (Health Care Anal 18:129-147, 2010) developed a framework for evaluating priority setting in low and middle income countries. This framework provides BOTH parameters for successful priority setting and proposes means of their verification. Before its use in real life contexts, this paper presents results from a validation process of the framework. The framework validation involved 53 policy makers and priority setting researchers at the global, national and sub-national levels (in Uganda). They were requested to indicate the relative importance of the proposed parameters as well as the feasibility of obtaining the related information. We also pilot tested the proposed means of verification. Almost all the respondents evaluated all the parameters, including the contextual factors, as 'very important'. However, some respondents at the global level thought 'presence of incentives to comply', 'reduced disagreements', 'increased public understanding,' 'improved institutional accountability' and 'meeting the ministry of health objectives', which could be a reflection of their levels of decision making. All the proposed means of verification were assessed as feasible with the exception of meeting observations which would require an insider. These findings results were consistent with those obtained from the pilot testing. These findings are relevant to policy makers and researchers involved in priority setting in low and middle income countries. To the best of our knowledge, this is one of the few initiatives that has involved potential users of a framework (at the global and in a Low Income Country) in its validation. The favorable validation of all the parameters at the national and sub-national levels implies that the framework has potential usefulness at those levels, as is. The parameters that were disputed at the global level necessitate further discussion when using the framework at that level. The next step is to use the validated framework in evaluating actual priority setting at the different levels.
Reliability and construct validity of the Participation in Life Activities Scale for children and adolescents with asthma: an instrument evaluation study.

PubMed

Kintner, Eileen K; Sikorskii, Alla

2008-06-04

The purpose of this study was to evaluate the reliability and construct validity of the Participation in Life Activities Scale, an instrument designed to measure older school-age child and early adolescent level of involvement in chosen pursuits. A cross-sectional design was used. The convenience sample consisted of 313 school-age children and early adolescents with asthma, ages 9-15 years. The self-report summative scale of interest is a 3-indicator survey. Higher scores are reflective of higher levels of participation. Internal consistency reliability and construct validity for the entire sample and sub groups of the sample were evaluated. The instrument was deemed sound for the entire sample as well as sub groups based on sex, race, age, socioeconomic status, and severity of illness. Cronbach's alpha coefficient for internal consistency reliability for the entire sample was .74. Exploratory factor analysis indicated a single component solution (loadings .79-.85) accounting for 66% of the explained variance. Construct validity was established by testing the posed relationship between participation in life activities scores and severity of illness. Confirmatory factor analysis revealed a good fit between the data and specified model, chi2(10, n = 302) = 8.074, p = .62. This instrument could be used (a) in clinical settings to diagnose restricted participation in desired activities, guide decision-making about treatment plans to increase participation, and motivate behavioral change in the management of asthma; and (b) in research settings to explore factors influencing and consequences of restricted and unrestricted participation, and as an outcome measure to evaluate the effectiveness of programs designed to foster child and early adolescent management of asthma.
Somali Piracy and Anti-Shipping Activity Messages: Lessons for a Successful Counterpiracy Strategy

DTIC Science & Technology

2014-06-01

solving the differences found in each organization’s reporting. This suggestion seems valid and demonstrates an unbiased approach. 19 The studies...agrees the IMB data sets are not free from debate and criticism, yet claims the reports provide the only consistent and reliable set of figures to...different approach to using piracy incident reports. Hastings (2009) explores the political and economic landscapes of failed and weak states to determine
Cascade Back-Propagation Learning in Neural Networks

NASA Technical Reports Server (NTRS)

Duong, Tuan A.

2003-01-01

The cascade back-propagation (CBP) algorithm is the basis of a conceptual design for accelerating learning in artificial neural networks. The neural networks would be implemented as analog very-large-scale integrated (VLSI) circuits, and circuits to implement the CBP algorithm would be fabricated on the same VLSI circuit chips with the neural networks. Heretofore, artificial neural networks have learned slowly because it has been necessary to train them via software, for lack of a good on-chip learning technique. The CBP algorithm is an on-chip technique that provides for continuous learning in real time. Artificial neural networks are trained by example: A network is presented with training inputs for which the correct outputs are known, and the algorithm strives to adjust the weights of synaptic connections in the network to make the actual outputs approach the correct outputs. The input data are generally divided into three parts. Two of the parts, called the "training" and "cross-validation" sets, respectively, must be such that the corresponding input/output pairs are known. During training, the cross-validation set enables verification of the status of the input-to-output transformation learned by the network to avoid over-learning. The third part of the data, termed the "test" set, consists of the inputs that are required to be transformed into outputs; this set may or may not include the training set and/or the cross-validation set. Proposed neural-network circuitry for on-chip learning would be divided into two distinct networks; one for training and one for validation. Both networks would share the same synaptic weights.
Measuring implementation behaviour of menu guidelines in the childcare setting: confirmatory factor analysis of a theoretical domains framework questionnaire (TDFQ).

PubMed

Seward, Kirsty; Wolfenden, Luke; Wiggers, John; Finch, Meghan; Wyse, Rebecca; Oldmeadow, Christopher; Presseau, Justin; Clinton-McHarg, Tara; Yoong, Sze Lin

2017-04-04

While there are number of frameworks which focus on supporting the implementation of evidence based approaches, few psychometrically valid measures exist to assess constructs within these frameworks. This study aimed to develop and psychometrically assess a scale measuring each domain of the Theoretical Domains Framework for use in assessing the implementation of dietary guidelines within a non-health care setting (childcare services). A 75 item 14-domain Theoretical Domains Framework Questionnaire (TDFQ) was developed and administered via telephone interview to 202 centre based childcare service cooks who had a role in planning the service menu. Confirmatory factor analysis (CFA) was undertaken to assess the reliability, discriminant validity and goodness of fit of the 14-domain theoretical domain framework measure. For the CFA, five iterative processes of adjustment were undertaken where 14 items were removed, resulting in a final measure consisting of 14 domains and 61 items. For the final measure: the Chi-Square goodness of fit statistic was 3447.19; the Standardized Root Mean Square Residual (SRMR) was 0.070; the Root Mean Square Error of Approximation (RMSEA) was 0.072; and the Comparative Fit Index (CFI) had a value of 0.78. While only one of the three indices support goodness of fit of the measurement model tested, a 14-domain model with 61 items showed good discriminant validity and internally consistent items. Future research should aim to assess the psychometric properties of the developed TDFQ in other community-based settings.

Multidisciplinary approach to evaluate landslide susceptibility along highway in northern Calabria, Italy

NASA Astrophysics Data System (ADS)

Muto, Francesco; Conforti, Massimo; Critelli, Salvatore; Fabbricatore, Davide; Filomena, Luciana; Rago, Valeria; Robustelli, Gaetano; Scarciglia, Fabio; Versace, Pasquale

2014-05-01

The interaction of landslides with linear infrastructures is often the cause of disasters. In Italy landslide impact on roads, railways and buildings cause millions of Euro per year in damage and restoration as well. The proposed study is aimed to the landslide susceptibility evaluation using a multidisciplinary approach: geological and geomorphological survey, statistical analysis and GIS technique, along a section of highway "A3 (Salerno-Reggio Calabria)" between Cosenza Sud and Altilia, northern Calabria. This study is included in a wider research project, named: PON01-01503, Landslides Early Warning-Sistemi integrati per il monitoraggio e la mitigazione del rischio idrogeologico lungo le grandi vie di comunicazione - aimed at the hydrogeological risk mitigation and at the early warning along the highways. The work was first based on air-photo interpretations and field investigations, in order to realize the geological map, geomorphological map and landslide inventory map. In the study area the geomorphology is strongly controlled by its bedrock geology and tectonics. The bedrock geology consists of Neogene sedimentary rocks that cover a thick stack of allochthonous nappes. These nappes consist of crystalline rocks mainly gneiss, phyllite and schist. A total of 835 landslides were mapped and the type of movement are represented mainly by slides and complex and subordinately flow. In order to estimate and validate landslide susceptibility the landslides were divided in two group. One group (training set) was used to prepare susceptibility map and the second group (validation set) to validate the map. Then, the selection of predisposing factors was performed, according with the geological and geomorphological settings of the study area: lithology, distance from tectonic elements, land use, slope, aspect, stream power index (SPI) and plan curvature. In order to evaluate landslide susceptibility Conditional Analysis was applied to Unique Conditions Units (UCUs), that are a unique combination of the predisposing factors. Subsequently, the landslide area is determined within each UCU and the landslide density is computed. The outcome of the study was a classification of the study area into four susceptibility classes, ranked from low to very high. The results showed that the 33% of the study area is characterized by a high to very high degree of susceptibility. The validation procedure results, obtained by crossing the group of the landslide of validation set with the susceptibility map, showed that the predictive model is generally satisfactory; therefore, over 75% of the landslide of validation set is correctly classified falling in high and very high susceptibility classes. The consistency of the model is also suggested by computing the seed cell area index (SCAI) because the high and very high susceptibility classes have very low SCAI values, whereas the SCAI values of the very low and low susceptibility classes are very high. Finally, the landslide susceptibility map provides the baseline information for further evaluations of landslide hazards and related risks.
An early-biomarker algorithm predicts lethal graft-versus-host disease and survival

PubMed Central

Hartwell, Matthew J.; Özbek, Umut; Holler, Ernst; Major-Monfried, Hannah; Reddy, Pavan; Aziz, Mina; Hogan, William J.; Ayuk, Francis; Efebera, Yvonne A.; Hexner, Elizabeth O.; Bunworasate, Udomsak; Qayed, Muna; Ordemann, Rainer; Wölfl, Matthias; Mielke, Stephan; Chen, Yi-Bin; Devine, Steven; Jagasia, Madan; Kitko, Carrie L.; Litzow, Mark R.; Kröger, Nicolaus; Locatelli, Franco; Morales, George; Nakamura, Ryotaro; Reshef, Ran; Rösler, Wolf; Weber, Daniela; Yanik, Gregory A.; Levine, John E.; Ferrara, James L.M.

2017-01-01

BACKGROUND. No laboratory test can predict the risk of nonrelapse mortality (NRM) or severe graft-versus-host disease (GVHD) after hematopoietic cellular transplantation (HCT) prior to the onset of GVHD symptoms. METHODS. Patient blood samples on day 7 after HCT were obtained from a multicenter set of 1,287 patients, and 620 samples were assigned to a training set. We measured the concentrations of 4 GVHD biomarkers (ST2, REG3α, TNFR1, and IL-2Rα) and used them to model 6-month NRM using rigorous cross-validation strategies to identify the best algorithm that defined 2 distinct risk groups. We then applied the final algorithm in an independent test set (n = 309) and validation set (n = 358). RESULTS. A 2-biomarker model using ST2 and REG3α concentrations identified patients with a cumulative incidence of 6-month NRM of 28% in the high-risk group and 7% in the low-risk group (P < 0.001). The algorithm performed equally well in the test set (33% vs. 7%, P < 0.001) and the multicenter validation set (26% vs. 10%, P < 0.001). Sixteen percent, 17%, and 20% of patients were at high risk in the training, test, and validation sets, respectively. GVHD-related mortality was greater in high-risk patients (18% vs. 4%, P < 0.001), as was severe gastrointestinal GVHD (17% vs. 8%, P < 0.001). The same algorithm can be successfully adapted to define 3 distinct risk groups at GVHD onset. CONCLUSION. A biomarker algorithm based on a blood sample taken 7 days after HCT can consistently identify a group of patients at high risk for lethal GVHD and NRM. FUNDING. The National Cancer Institute, American Cancer Society, and the Doris Duke Charitable Foundation. PMID:28194439
The performance of seven QPrediction risk scores in an independent external sample of patients from general practice: a validation study

PubMed Central

Hippisley-Cox, Julia; Coupland, Carol; Brindle, Peter

2014-01-01

Objectives To validate the performance of a set of risk prediction algorithms developed using the QResearch database, in an independent sample from general practices contributing to the Clinical Research Data Link (CPRD). Setting Prospective open cohort study using practices contributing to the CPRD database and practices contributing to the QResearch database. Participants The CPRD validation cohort consisted of 3.3 million patients, aged 25–99 years registered at 357 general practices between 1 Jan 1998 and 31 July 2012. The validation statistics for QResearch were obtained from the original published papers which used a one-third sample of practices separate to those used to derive the score. A cohort from QResearch was used to compare incidence rates and baseline characteristics and consisted of 6.8 million patients from 753 practices registered between 1 Jan 1998 and until 31 July 2013. Outcome measures Incident events relating to seven different risk prediction scores: QRISK2 (cardiovascular disease); QStroke (ischaemic stroke); QDiabetes (type 2 diabetes); QFracture (osteoporotic fracture and hip fracture); QKidney (moderate and severe kidney failure); QThrombosis (venous thromboembolism); QBleed (intracranial bleed and upper gastrointestinal haemorrhage). Measures of discrimination and calibration were calculated. Results Overall, the baseline characteristics of the CPRD and QResearch cohorts were similar though QResearch had higher recording levels for ethnicity and family history. The validation statistics for each of the risk prediction scores were very similar in the CPRD cohort compared with the published results from QResearch validation cohorts. For example, in women, the QDiabetes algorithm explained 50% of the variation within CPRD compared with 51% on QResearch and the receiver operator curve value was 0.85 on both databases. The scores were well calibrated in CPRD. Conclusions Each of the algorithms performed practically as well in the external independent CPRD validation cohorts as they had in the original published QResearch validation cohorts. PMID:25168040
Hospital Anxiety and Depression Scale: Factor Structure, Internal Consistency and Convergent Validity in Patients with Dizziness.

PubMed

Piker, Erin G; Kaylie, David M; Garrison, Douglas; Tucci, Debara L

2015-01-01

Psychiatric comorbidities, particularly anxiety-related pathologies, are often observed in dizzy patients. The Hospital Anxiety and Depression Scale (HADS) is a widely used self-report instrument used to screen for anxiety and depression in medical outpatient settings. The purpose of this study was to assess the factor structure, internal consistency and convergent validity of the HADS in an unselected group of patients with dizziness. The HADS and the Dizziness Handicap Inventory (DHI) were administered to 205 dizzy patients. An exploratory factor analysis was conducted and indicated a 3-factor structure, inconsistent with the 2-subscale structure (i.e. anxiety and depression) of the HADS. The total scale was found to be internally consistent, and convergent validity, as assessed using the DHI, was acceptable. Overall findings suggest that the HADS should not be used as a tool for psychiatric differential diagnosis, but rather as a helpful screener for general psychiatric distress in the two domains of psychiatric illness most germane in dizzy patients. © 2015 S. Karger AG, Basel.
Risk assessment for juvenile justice: a meta-analysis.

PubMed

Schwalbe, Craig S

2007-10-01

Risk assessment instruments are increasingly employed by juvenile justice settings to estimate the likelihood of recidivism among delinquent juveniles. In concert with their increased use, validation studies documenting their predictive validity have increased in number. The purpose of this study was to assess the average predictive validity of juvenile justice risk assessment instruments and to identify risk assessment characteristics that are associated with higher predictive validity. A search of the published and grey literature yielded 28 studies that estimated the predictive validity of 28 risk assessment instruments. Findings of the meta-analysis were consistent with effect sizes obtained in larger meta-analyses of criminal justice risk assessment instruments and showed that brief risk assessment instruments had smaller effect sizes than other types of instruments. However, this finding is tentative owing to limitations of the literature.
A comparison of accuracy validation methods for genomic and pedigree-based predictions of swine litter size traits using Large White and simulated data.

PubMed

Putz, A M; Tiezzi, F; Maltecca, C; Gray, K A; Knauer, M T

2018-02-01

The objective of this study was to compare and determine the optimal validation method when comparing accuracy from single-step GBLUP (ssGBLUP) to traditional pedigree-based BLUP. Field data included six litter size traits. Simulated data included ten replicates designed to mimic the field data in order to determine the method that was closest to the true accuracy. Data were split into training and validation sets. The methods used were as follows: (i) theoretical accuracy derived from the prediction error variance (PEV) of the direct inverse (iLHS), (ii) approximated accuracies from the accf90(GS) program in the BLUPF90 family of programs (Approx), (iii) correlation between predictions and the single-step GEBVs from the full data set (GEBV Full ), (iv) correlation between predictions and the corrected phenotypes of females from the full data set (Y c ), (v) correlation from method iv divided by the square root of the heritability (Y ch ) and (vi) correlation between sire predictions and the average of their daughters' corrected phenotypes (Y cs ). Accuracies from iLHS increased from 0.27 to 0.37 (37%) in the Large White. Approximation accuracies were very consistent and close in absolute value (0.41 to 0.43). Both iLHS and Approx were much less variable than the corrected phenotype methods (ranging from 0.04 to 0.27). On average, simulated data showed an increase in accuracy from 0.34 to 0.44 (29%) using ssGBLUP. Both iLHS and Y ch approximated the increase well, 0.30 to 0.46 and 0.36 to 0.45, respectively. GEBV Full performed poorly in both data sets and is not recommended. Results suggest that for within-breed selection, theoretical accuracy using PEV was consistent and accurate. When direct inversion is infeasible to get the PEV, correlating predictions to the corrected phenotypes divided by the square root of heritability is adequate given a large enough validation data set. © 2017 Blackwell Verlag GmbH.
The CO₂ GAP Project--CO₂ GAP as a prognostic tool in emergency departments.

PubMed

Shetty, Amith L; Lai, Kevin H; Byth, Karen

2010-12-01

To determine whether CO₂ GAP [(a-ET) PCO₂] value differs consistently in patients presenting with shortness of breath to the ED requiring ventilatory support. To determine a cut-off value of CO₂ GAP, which is consistently associated with measured outcome and to compare its performance against other derived variables. This prospective observational study was conducted in ED on a convenience sample of 412 from 759 patients who underwent concurrent arterial blood gas and ETCO₂ (end-tidal CO₂) measurement. They were randomized to test sample of 312 patients and validation set of 100 patients. The primary outcome of interest was the need for ventilatory support and secondary outcomes were admission to high dependency unit or death during stay in ED. The randomly selected training set was used to select cut-points for the possible predictors; that is, CO₂ GAP, CO₂ gradient, physiologic dead space and A-a gradient. The sensitivity, specificity and predictive values of these predictors were validated in the test set of 100 patients. Analysis of the receiver operating characteristic curves revealed the CO₂ GAP performed significantly better than the arterial-alveolar gradient in patients requiring ventilator support (area under the curve 0.950 vs 0.726). A CO₂ GAP ≥10 was associated with assisted ventilation outcomes when applied to the validation test set (100% sensitivity 70% specificity). The CO₂ GAP [(a-ET) PCO₂] differs significantly in patients requiring assisted ventilation when presenting with shortness of breath to EDs and further research addressing the prognostic value of CO₂ GAP in this specific aspect is required. © 2010 The Authors. EMA © 2010 Australasian College for Emergency Medicine and Australasian Society for Emergency Medicine.
Cross-Cultural Adaptation of the Profile Fitness Mapping Neck Questionnaire to Brazilian Portuguese: Internal Consistency, Reliability, and Construct and Structural Validity.

PubMed

Ferreira, Mariana Cândido; Björklund, Martin; Dach, Fabiola; Chaves, Thais Cristina

The purpose of this study was to adapt and evaluate the psychometric properties of the ProFitMap-neck to Brazilian Portuguese. The cross-cultural adaptation consisted of 5 stages, and 180 female patients with chronic neck pain participated in the study. A subsample (n = 30) answered the pretest, and another subsample (n = 100) answered the questionnaire a second time. Internal consistency, test-retest reliability, and construct validity (hypothesis testing and structural validity) were estimated. For construct validity, the scores of the questionnaire were correlated with the Neck Disability Index (NDI), and the Hospital Anxiety and Depression Scale (HADS), the Tampa Scale of Kinesiophobia (TSK), and the 36-item Short-Form Health Survey (SF-36). Internal consistency was determined by adequate Cronbach's α values (α > 0.70). Strong reliability was identified by high intraclass correlation coefficients (ICC > 0.75). Construct validity was identified by moderate and strong correlations of the Br-ProFitMap-neck with total NDI score (-0.56 50%, Kaiser-Meyer-Olkin index > 0.50, eigenvalue > 1, and factor loadings > 0.2. Br-ProFitMap-neck had adequate psychometric properties and can be used in clinical settings, as well as research, in patients with chronic neck pain. Copyright © 2017. Published by Elsevier Inc.
Validation of Social Cognition Rating Tools in Indian Setting (SOCRATIS): A new test-battery to assess social cognition.

PubMed

Mehta, Urvakhsh M; Thirthalli, Jagadisha; Naveen Kumar, C; Mahadevaiah, Mahesh; Rao, Kiran; Subbakrishna, Doddaballapura K; Gangadhar, Bangalore N; Keshavan, Matcheri S

2011-09-01

Social cognition is a cognitive domain that is under substantial cultural influence. There are no culturally appropriate standardized tools in India to comprehensively test social cognition. This study describes validation of tools for three social cognition constructs: theory of mind, social perception and attributional bias. Theory of mind tests included adaptations of, (a) two first order tasks [Sally-Anne and Smarties task], (b) two second order tasks [Ice cream van and Missing cookies story], (c) two metaphor-irony tasks and (d) the faux pas recognition test. Internal, Personal, and Situational Attributions Questionnaire (IPSAQ) and Social Cue Recognition Test were adapted to assess attributional bias and social perception, respectively. These tests were first modified to suit the Indian cultural context without changing the constructs to be tested. A panel of experts then rated the tests on likert scales as to (1) whether the modified tasks tested the same construct as in the original and (2) whether they were culturally appropriate. The modified tests were then administered to groups of actively symptomatic and remitted schizophrenia patients as well as healthy comparison subjects. All tests of the Social Cognition Rating Tools in Indian Setting had good content validity and known groups validity. In addition, the social cure recognition test in Indian setting had good internal consistency and concurrent validity. Copyright © 2011 Elsevier B.V. All rights reserved.
The application of neural networks to the SSME startup transient

NASA Technical Reports Server (NTRS)

Meyer, Claudia M.; Maul, William A.

1991-01-01

Feedforward neural networks were used to model three parameters during the Space Shuttle Main Engine startup transient. The three parameters were the main combustion chamber pressure, a controlled parameter, the high pressure oxidizer turbine discharge temperature, a redlined parameter, and the high pressure fuel pump discharge pressure, a failure-indicating performance parameter. Network inputs consisted of time windows of data from engine measurements that correlated highly to the modeled parameter. A standard backpropagation algorithm was used to train the feedforward networks on two nominal firings. Each trained network was validated with four additional nominal firings. For all three parameters, the neural networks were able to accurately predict the data in the validation sets as well as the training set.
Validity in work-based assessment: expanding our horizons.

PubMed

Govaerts, Marjan; van der Vleuten, Cees P M

2013-12-01

Although work-based assessments (WBA) may come closest to assessing habitual performance, their use for summative purposes is not undisputed. Most criticism of WBA stems from approaches to validity consistent with the quantitative psychometric framework. However, there is increasing research evidence that indicates that the assumptions underlying the predictive, deterministic framework of psychometrics may no longer hold. In this discussion paper we argue that meaningfulness and appropriateness of current validity evidence can be called into question and that we need alternative strategies to assessment and validity inquiry that build on current theories of learning and performance in complex and dynamic workplace settings. Drawing from research in various professional fields we outline key issues within the mechanisms of learning, competence and performance in the context of complex social environments and illustrate their relevance to WBA. In reviewing recent socio-cultural learning theory and research on performance and performance interpretations in work settings, we demonstrate that learning, competence (as inferred from performance) as well as performance interpretations are to be seen as inherently contextualised, and can only be under-stood 'in situ'. Assessment in the context of work settings may, therefore, be more usefully viewed as a socially situated interpretive act. We propose constructivist-interpretivist approaches towards WBA in order to capture and understand contextualised learning and performance in work settings. Theoretical assumptions underlying interpretivist assessment approaches call for a validity theory that provides the theoretical framework and conceptual tools to guide the validation process in the qualitative assessment inquiry. Basic principles of rigour specific to qualitative research have been established, and they can and should be used to determine validity in interpretivist assessment approaches. If used properly, these strategies generate trustworthy evidence that is needed to develop the validity argument in WBA, allowing for in-depth and meaningful information about professional competence. © 2013 John Wiley & Sons Ltd.
Validation of geometric accuracy of Global Land Survey (GLS) 2000 data

USGS Publications Warehouse

Rengarajan, Rajagopalan; Sampath, Aparajithan; Storey, James C.; Choate, Michael J.

2015-01-01

The Global Land Survey (GLS) 2000 data were generated from Geocover™ 2000 data with the aim of producing a global data set of accuracy better than 25 m Root Mean Square Error (RMSE). An assessment and validation of accuracy of GLS 2000 data set, and its co-registration with Geocover™ 2000 data set is presented here. Since the availability of global data sets that have higher nominal accuracy than the GLS 2000 is a concern, the data sets were assessed in three tiers. In the first tier, the data were compared with the Geocover™ 2000 data. This comparison provided a means of localizing regions of higher differences. In the second tier, the GLS 2000 data were compared with systematically corrected Landsat-7 scenes that were obtained in a time period when the spacecraft pointing information was extremely accurate. These comparisons localize regions where the data are consistently off, which may indicate regions of higher errors. The third tier consisted of comparing the GLS 2000 data against higher accuracy reference data. The reference data were the Digital Ortho Quads over the United States, orthorectified SPOT data over Australia, and high accuracy check points obtained using triangulation bundle adjustment of Landsat-7 images over selected sites around the world. The study reveals that the geometric errors in Geocover™ 2000 data have been rectified in GLS 2000 data, and that the accuracy of GLS 2000 data can be expected to be better than 25 m RMSE for most of its constituent scenes.
A new biodegradation prediction model specific to petroleum hydrocarbons.

PubMed

Howard, Philip; Meylan, William; Aronson, Dallas; Stiteler, William; Tunkel, Jay; Comber, Michael; Parkerton, Thomas F

2005-08-01

A new predictive model for determining quantitative primary biodegradation half-lives of individual petroleum hydrocarbons has been developed. This model uses a fragment-based approach similar to that of several other biodegradation models, such as those within the Biodegradation Probability Program (BIOWIN) estimation program. In the present study, a half-life in days is estimated using multiple linear regression against counts of 31 distinct molecular fragments. The model was developed using a data set consisting of 175 compounds with environmentally relevant experimental data that was divided into training and validation sets. The original fragments from the Ministry of International Trade and Industry BIOWIN model were used initially as structural descriptors and additional fragments were then added to better describe the ring systems found in petroleum hydrocarbons and to adjust for nonlinearity within the experimental data. The training and validation sets had r2 values of 0.91 and 0.81, respectively.
Implicit leadership theories in applied settings: factor structure, generalizability, and stability over time.

PubMed

Epitropaki, Olga; Martin, Robin

2004-04-01

The present empirical investigation had a 3-fold purpose: (a) to cross-validate L. R. Offermann, J. K. Kennedy, and P. W. Wirtz's (1994) scale of Implicit Leadership Theories (ILTs) in several organizational settings and to further provide a shorter scale of ILTs in organizations; (b) to assess the generalizability of ILTs across different employee groups, and (c) to evaluate ILTs' change over time. Two independent samples were used for the scale validation (N1 = 500 and N2 = 439). A 6-factor structure (Sensitivity, Intelligence, Dedication, Dynamism, Tyranny, and Masculinity) was found to most accurately represent ELTs in organizational settings. Regarding the generalizability of ILTs, although the 6-factor structure was consistent across different employee groups, there was only partial support for total factorial invariance. Finally, evaluation of gamma, beta, and alpha change provided support for ILTs' stability over time.
Evolution of an Implementation-Ready Interprofessional Pain Assessment Reference Model

PubMed Central

Collins, Sarah A; Bavuso, Karen; Swenson, Mary; Suchecki, Christine; Mar, Perry; Rocha, Roberto A.

2017-01-01

Standards to increase consistency of comprehensive pain assessments are important for safety, quality, and analytics activities, including meeting Joint Commission requirements and learning the best management strategies and interventions for the current prescription Opioid epidemic. In this study we describe the development and validation of a Pain Assessment Reference Model ready for implementation on EHR forms and flowsheets. Our process resulted in 5 successive revisions of the reference model, which more than doubled the number of data elements to 47. The organization of the model evolved during validation sessions with panels totaling 48 subject matter experts (SMEs) to include 9 sets of data elements, with one set recommended as a minimal data set. The reference model also evolved when implemented into EHR forms and flowsheets, indicating specifications such as cascading logic that are important to inform secondary use of data. PMID:29854125
Performance of local orbital basis sets in the self-consistent Sternheimer method for dielectric matrices of extended systems

NASA Astrophysics Data System (ADS)

Hübener, H.; Pérez-Osorio, M. A.; Ordejón, P.; Giustino, F.

2012-09-01

We present a systematic study of the performance of numerical pseudo-atomic orbital basis sets in the calculation of dielectric matrices of extended systems using the self-consistent Sternheimer approach of [F. Giustino et al., Phys. Rev. B 81, 115105 (2010)]. In order to cover a range of systems, from more insulating to more metallic character, we discuss results for the three semiconductors diamond, silicon, and germanium. Dielectric matrices of silicon and diamond calculated using our method fall within 1% of reference planewaves calculations, demonstrating that this method is promising. We find that polarization orbitals are critical for achieving good agreement with planewaves calculations, and that only a few additional ζ's are required for obtaining converged results, provided the split norm is properly optimized. Our present work establishes the validity of local orbital basis sets and the self-consistent Sternheimer approach for the calculation of dielectric matrices in extended systems, and prepares the ground for future studies of electronic excitations using these methods.
Development and validation of a public attitudes toward epilepsy (PATE) scale.

PubMed

Lim, Kheng-Seang; Wu, Cathie; Choo, Wan-Yuen; Tan, Chong-Tin

2012-06-01

A quantitative scale of public attitudes toward epilepsy is essential to determine the magnitude of social stigma against epilepsy. This study aims to develop and validate a cross-culturally applicable scale of public attitudes toward epilepsy. A set of questions was selected from questionnaires identified from a literature review, following which a panel review determined the final version, consisting of 18 items. A 1-5 Likert scale was used for scoring. Additional questions, related to perception of the productivity of people with epilepsy and of a modified epilepsy stigma scale, were added as part of construct validation. One hundred and thirty heterogeneous respondents were collected, consisting of various age groups, ethnicity and occupation status levels. After item and factor analyses, the final version consisted of 14 items. Psychometric properties of the scale were first determined using factor analysis, which revealed a general and a personal domain, with good internal consistency (Cronbach's coefficient 0.868 and 0.633, respectively). Construct validation was demonstrated. The mean score for the personal domain was higher than that for the general domain (2.72±0.56 and 2.09±0.59, respectively). The mean scores of those with tertiary education were significantly lower for the general domain, but not for the personal domain. Age was positively correlated with the mean scores in the personal domain, but not in the general domain. This scale is a reliable and valid scale to assess public attitudes toward epilepsy, in both the general and personal domains. Copyright © 2012 Elsevier Inc. All rights reserved.
Development and validation of the Vietnamese primary care assessment tool

PubMed Central

2018-01-01

Objective To adapt the consumer version of the Primary Care Assessment Tool (PCAT) for Vietnam and determine its internal consistency and validity. Design A quantitative cross sectional study. Setting 56 communes in 3 representative provinces of central Vietnam. Participants Total of 3289 people who used health care services at health facility at least once over the past two years. Results The Vietnamese adult expanded consumer version of the PCAT (VN PCAT-AE) is an instrument for evaluation of primary care in Vietnam with 70 items comprising six scales representing four core primary care domains, and three additional scales representing three derivative domains. Sixteen other items from the original tool were not included in the final instrument, due to problems with missing values, floor or ceiling effects, and item-total correlations. All the retained scales have a Cronbach’s alpha above 0.70 except for the subscale of Family Centeredness. Conclusions The VN PCAT-AE demonstrates adequate internal consistency and validity to be used as an effective tool for measuring the quality of primary care in Vietnam from the consumer perspective. Additional work in the future to optimize valid measurement in all domains consistent with the original version of the tool may be helpful as the primary care system in Vietnam further develops. PMID:29324851
Dyspnoea-12: a translation and linguistic validation study in a Swedish setting

PubMed Central

Ekström, Magnus

2017-01-01

Background Dyspnoea consists of multiple dimensions including the intensity, unpleasantness, sensory qualities and emotional responses which may differ between patient groups, settings and in relation to treatment. The Dyspnoea-12 is a validated and convenient instrument for multidimensional measurement in English. We aimed to take forward a Swedish version of the Dyspnoea-12. Methods The linguistic validation of the Dyspnoea-12 was performed (Mapi Language Services, Lyon, France). The standardised procedure involved forward and backward translations by three independent certified translators and revisions after feedback from an in-country linguistic consultant, the developerand three native physicians. The understanding and convenience of the translated version was evaluated using qualitative in-depth interviews with five patients with dyspnoea. Results A Swedish version of the Dyspnoea-12 was elaborated and evaluated carefully according to international guidelines. The Swedish version, ‘Dyspné−12’, has the same layout as the original version, including 12 items distributed on seven physical and five affective items. The Dyspnoea-12 is copyrighted by the developer but can be used free of charge after permission for not industry-funded research. Conclusion A Swedish version of the Dyspnoea-12 is now available for clinical validation and multidimensional measurement across diseases and settings with the aim of improved evaluation and management of dyspnoea. PMID:28592574
Transcultural adaptation and validation of the Korean version of Caregiver Priorities & Child Health Index of Life with Disabilities (CPCHILD).

PubMed

Sung, Ki Hyuk; Kwon, Soon-Sun; Narayanan, Unni G; Chung, Chin Youb; Lee, Kyoung Min; Lee, Seung Yeol; Lee, Damian J; Park, Moon Seok

2015-01-01

The aim of this study was to translate and transculturally adapt the Caregiver Priorities & Child Health Index of Life with Disabilities (CPCHILD) questionnaire into Korean language, and to test the reliability and validity, including the internal consistency, known-group validity and factor analysis of the Korean version of the CPCHILD. A Korean version of CPCHILD was produced according to internationally accepted guidelines. For validity testing, 194 consecutive parents or caregivers of children with cerebral palsy (CP) were recruited and completed the questionnaire. Internal consistency, test-retest reliability, and known-groups validity were evaluated and factor analysis was performed to validate the Korean version of the CPCHILD. In terms of internal consistency, a Cronbach's alpha was above 0.90 in all domains of the CPCHILD (range 0.921 to 0.966), except the 5th domain (0.628). In terms of known-groups validity, the total score of the CPCHILD was significantly different according to the Gross Motor Function Classification System (GMFCS) level (p < 0.001). Intra-class correlation coefficient spanned from 0.517 to 0.801. Factor analysis showed that the five-factor solution of the CPCHILD explained 76.7% of the variance with 59.0, 6.5, 5.1, 4.2 and 3.2% of variance by each components number. The Korean version of CPCHILD was found to be a reliable and valid questionnaire of caregivers' perspectives on the health-related quality of life in severely affected children with CP. However, the Korean version of CPCHILD contains some redundant items, and factor analysis suggested a five-domain questionnaire. Implication for Rehabilitation The Korean version of CPCHILD is a reliable, internally consistent, valid instrument for assessing the health-related quality of life in severely affected children with CP from the perspective of caregivers. After the transcultural adaptation and validation of the Korean CPCHILD, it can be reliably used in clinical and research settings to evaluate the health-related quality of life in Korean patients with CP.

Validation of the German version of the Ford Insomnia Response to Stress Test.

PubMed

Dieck, Arne; Helbig, Susanne; Drake, Christopher L; Backhaus, Jutta

2018-06-01

The purpose of this study was to assess the psychometric properties of a German version of the Ford Insomnia Response to Stress Test with groups with and without sleep problems. Three studies were analysed. Data set 1 was based on an initial screening for a sleep training program (n = 393), data set 2 was based on a study to test the test-retest reliability of the Ford Insomnia Response to Stress Test (n = 284) and data set 3 was based on a study to examine the influence of competitive sport on sleep (n = 37). Data sets 1 and 2 were used to test internal consistency, factor structure, convergent validity, discriminant validity and test-retest reliability of the Ford Insomnia Response to Stress Test. Content validity was tested using data set 3. Cronbach's alpha of the Ford Insomnia Response to Stress Test was good (α = 0.80) and test-retest reliability was satisfactory (r = 0.72). Overall, the one-factor model showed the best fit. Furthermore, significant positive correlations between the Ford Insomnia Response to Stress Test and impaired sleep quality, depression and stress reactivity were in line with the expectations regarding the convergent validity. Subjects with sleep problems had significantly higher scores in the Ford Insomnia Response to Stress Test than subjects without sleep problems (P < 0.01). Competitive athletes with higher scores in the Ford Insomnia Response to Stress Test had significantly lower sleep quality (P = 0.01), demonstrating that vulnerability for stress-induced sleep disturbances accompanies poorer sleep quality in stressful episodes. The findings show that the German version of the Ford Insomnia Response to Stress Test is a reliable and valid questionnaire to assess the vulnerability to stress-induced sleep disturbances. © 2017 European Sleep Research Society.
Assessing the reliability, predictive and construct validity of historical, clinical and risk management-20 (HCR-20) in Mexican psychiatric inpatients.

PubMed

Sada, Andrea; Robles-García, Rebeca; Martínez-López, Nicolás; Hernández-Ramírez, Rafael; Tovilla-Zarate, Carlos-Alfonso; López-Munguía, Fernando; Suárez-Alvarez, Enrique; Ayala, Xochitl; Fresán, Ana

2016-08-01

Assessing dangerousness to gauge the likelihood of future violent behaviour has become an integral part of clinical mental health practice in forensic and non-forensic psychiatric settings, one of the most effective instruments for this being the Historical, Clinical and Risk Management-20 (HCR-20). To examine the HCR-20 factor structure in Mexican psychiatric inpatients and to obtain its predictive validity and reliability for use in this population. In total, 225 patients diagnosed with psychotic, affective or personality disorders were included. The HCR-20 was applied at hospital admission and violent behaviours were assessed during psychiatric hospitalization using the Overt Aggression Scale (OAS). Construct validity, predictive validity and internal consistency were determined. Violent behaviour remains more severe in patients classified in the high-risk group during hospitalization. Fifteen items displayed adequate communalities in the original designated domains of the HCR-20 and internal consistency of the instruments was high. The HCR-20 is a suitable instrument for predicting violence risk in Mexican psychiatric inpatients.
The MMPI-2-RF Personality Psychopathology Five (PSY-5-RF) scales: development and validity research.

PubMed

Harkness, Allan R; McNulty, John L; Finn, Jacob A; Reynolds, Shannon M; Shields, Susan M; Arbisi, Paul

2014-01-01

This article describes the development, internal psychometric, and external validation studies on scales designed to measure the Personality Psychopathology Five (PSY-5) from MMPI-2 Restructured Form (MMPI-2-RF) items. Diverse and comprehensive data sets, representing various clinical and nonclinical populations, were classified into development and validation research samples. Item selection, retention, and exclusion procedures are detailed. The final set of PSY-5-RF scales contain 104 items, with no item overlap between scales (same as the original MMPI-2 PSY-5 scales), and no item overlap with the Demoralization scale. Internal consistency estimates are comparable to the longer MMPI-2 PSY-5 scales. Appropriate convergent and discriminant validity findings utilizing various self-report, collateral rating, and record review data are reported and discussed. A particular emphasis is offered for the unique aspects of the PSY-5 model: psychoticism and disconstraint. The findings are connected to the broader PSY-5 literature and the recommended review of systems (Harkness, Reynolds, & Lilienfeld, this issue) presented in this series of articles.
Tutorial: Crystal orientations and EBSD — Or which way is up?

DOE Office of Scientific and Technical Information (OSTI.GOV)

Britton, T.B., E-mail: b.britton@imperial.ac.uk; Jiang, J.; Guo, Y.

2016-07-15

Electron backscatter diffraction (EBSD) is an automated technique that can measure the orientation of crystals in a sample very rapidly. There are many sophisticated software packages that present measured data. Unfortunately, due to crystal symmetry and differences in the set-up of microscope and EBSD software, there may be accuracy issues when linking the crystal orientation to a particular microstructural feature. In this paper we outline a series of conventions used to describe crystal orientations and coordinate systems. These conventions have been used to successfully demonstrate that a consistent frame of reference is used in the sample, unit cell, pole figuremore » and diffraction pattern frames of reference. We establish a coordinate system rooted in measurement of the diffraction pattern and subsequently link this to all other coordinate systems. A fundamental outcome of this analysis is to note that the beamshift coordinate system needs to be precisely defined for consistent 3D microstructure analysis. This is supported through a series of case studies examining particular features of the microscope settings and/or unambiguous crystallographic features. These case studies can be generated easily in most laboratories and represent an opportunity to demonstrate confidence in use of recorded orientation data. Finally, we include a simple software tool, written in both MATLAB® and Python, which the reader can use to compare consistency with their own microscope set-up and which may act as a springboard for further offline analysis. - Highlights: • Presentation of conventions used to describe crystal orientations • Three case studies that outline how conventions are consistent • Demonstrates a pathway for calibration and validation of EBSD based orientation measurements • EBSD computer code supplied for validation by the reader.« less
Rank Order Entropy: why one metric is not enough

PubMed Central

McLellan, Margaret R.; Ryan, M. Dominic; Breneman, Curt M.

2011-01-01

The use of Quantitative Structure-Activity Relationship models to address problems in drug discovery has a mixed history, generally resulting from the mis-application of QSAR models that were either poorly constructed or used outside of their domains of applicability. This situation has motivated the development of a variety of model performance metrics (r2, PRESS r2, F-tests, etc) designed to increase user confidence in the validity of QSAR predictions. In a typical workflow scenario, QSAR models are created and validated on training sets of molecules using metrics such as Leave-One-Out or many-fold cross-validation methods that attempt to assess their internal consistency. However, few current validation methods are designed to directly address the stability of QSAR predictions in response to changes in the information content of the training set. Since the main purpose of QSAR is to quickly and accurately estimate a property of interest for an untested set of molecules, it makes sense to have a means at hand to correctly set user expectations of model performance. In fact, the numerical value of a molecular prediction is often less important to the end user than knowing the rank order of that set of molecules according to their predicted endpoint values. Consequently, a means for characterizing the stability of predicted rank order is an important component of predictive QSAR. Unfortunately, none of the many validation metrics currently available directly measure the stability of rank order prediction, making the development of an additional metric that can quantify model stability a high priority. To address this need, this work examines the stabilities of QSAR rank order models created from representative data sets, descriptor sets, and modeling methods that were then assessed using Kendall Tau as a rank order metric, upon which the Shannon Entropy was evaluated as a means of quantifying rank-order stability. Random removal of data from the training set, also known as Data Truncation Analysis (DTA), was used as a means for systematically reducing the information content of each training set while examining both rank order performance and rank order stability in the face of training set data loss. The premise for DTA ROE model evaluation is that the response of a model to incremental loss of training information will be indicative of the quality and sufficiency of its training set, learning method, and descriptor types to cover a particular domain of applicability. This process is termed a “rank order entropy” evaluation, or ROE. By analogy with information theory, an unstable rank order model displays a high level of implicit entropy, while a QSAR rank order model which remains nearly unchanged during training set reductions would show low entropy. In this work, the ROE metric was applied to 71 data sets of different sizes, and was found to reveal more information about the behavior of the models than traditional metrics alone. Stable, or consistently performing models, did not necessarily predict rank order well. Models that performed well in rank order did not necessarily perform well in traditional metrics. In the end, it was shown that ROE metrics suggested that some QSAR models that are typically used should be discarded. ROE evaluation helps to discern which combinations of data set, descriptor set, and modeling methods lead to usable models in prioritization schemes, and provides confidence in the use of a particular model within a specific domain of applicability. PMID:21875058
Probability machines: consistent probability estimation using nonparametric learning machines.

PubMed

Malley, J D; Kruppa, J; Dasgupta, A; Malley, K G; Ziegler, A

2012-01-01

Most machine learning approaches only provide a classification for binary responses. However, probabilities are required for risk estimation using individual patient characteristics. It has been shown recently that every statistical learning machine known to be consistent for a nonparametric regression problem is a probability machine that is provably consistent for this estimation problem. The aim of this paper is to show how random forests and nearest neighbors can be used for consistent estimation of individual probabilities. Two random forest algorithms and two nearest neighbor algorithms are described in detail for estimation of individual probabilities. We discuss the consistency of random forests, nearest neighbors and other learning machines in detail. We conduct a simulation study to illustrate the validity of the methods. We exemplify the algorithms by analyzing two well-known data sets on the diagnosis of appendicitis and the diagnosis of diabetes in Pima Indians. Simulations demonstrate the validity of the method. With the real data application, we show the accuracy and practicality of this approach. We provide sample code from R packages in which the probability estimation is already available. This means that all calculations can be performed using existing software. Random forest algorithms as well as nearest neighbor approaches are valid machine learning methods for estimating individual probabilities for binary responses. Freely available implementations are available in R and may be used for applications.
Broadening Perspectives on Clinical Performance Assessment: Rethinking the Nature of In-Training Assessment

ERIC Educational Resources Information Center

Govaerts, Marjan J. B.; van der Vleuten, Cees P. M.; Schuwirth, Lambert W. T.; Muijtjens, Arno M. M.

2007-01-01

Context: In-training assessment (ITA), defined as multiple assessments of performance in the setting of day-to-day practice, is an invaluable tool in assessment programmes which aim to assess professional competence in a comprehensive and valid way. Research on clinical performance ratings, however, consistently shows weaknesses concerning…
Development and Validation of a Response Bias Scale (RBS) for the MMPI-2

ERIC Educational Resources Information Center

Gervais, Roger O.; Ben-Porath, Yossef S.; Wygant, Dustin B.; Green, Paul

2007-01-01

This study describes the development of a Minnesota Multiphasic Personality Inventory (MMPI-2) scale designed to detect negative response bias in forensic neuropsychological or disability assessment settings. The Response Bias Scale (RBS) consists of 28 MMPI-2 items that discriminated between persons who passed or failed the Word Memory Test…
Genomic predictability of single-step GBLUP for production traits in US Holstein

USDA-ARS?s Scientific Manuscript database

The objective of this study was to validate genomic predictability of single-step genomic BLUP for 305-day protein yield for US Holsteins. The genomic relationship matrix was created with the Algorithm of Proven and Young (APY) with 18,359 core animals. The full data set consisted of phenotypes coll...
A comprehensive clinical assessment tool to inform policy and practice: applications of the minimum data set.

PubMed

Mor, Vincent

2004-04-01

The Minimum Data Set (MDS) for nursing home (NH) resident assessment, designed to assess elders functional status and care needs, exemplifies how the information needs of clinical practice are congruent with those of research. Building on a review of the published literature, this article describes the development of the MDS, its reliability and validity testing, as well as the variety of different policy and research uses to which it has been applied. Interrater reliability of items and internal consistency of MDS summary scales is generally good to excellent. Validation studies reveal good correspondence to research quality instruments for cognition, activities of daily living, and diagnoses with more variable results for vision, pain, mood, and behavior scales. To date, no consistent evidence suggests that applications of MDS data for case-mix reimbursement and quality indicator monitoring systematically bias the data. Although facility variation in data quality could compromise some applications, creation of the MDS as a clinical tool for care planning provides an example of how assessment tools with clinical use can be used in administrative databases for research and policy applications.
MMPI-2 Symptom Validity (FBS) Scale: psychometric characteristics and limitations in a Veterans Affairs neuropsychological setting.

PubMed

Gass, Carlton S; Odland, Anthony P

2014-01-01

The Minnesota Multiphasic Personality Inventory-2 (MMPI-2) Symptom Validity (Fake Bad Scale [FBS]) Scale is widely used to assist in determining noncredible symptom reporting, despite a paucity of detailed research regarding its itemmetric characteristics. Originally designed for use in civil litigation, the FBS is often used in a variety of clinical settings. The present study explored its fundamental psychometric characteristics in a sample of 303 patients who were consecutively referred for a comprehensive examination in a Veterans Affairs (VA) neuropsychology clinic. FBS internal consistency (reliability) was .77. Its underlying factor structure consisted of three unitary dimensions (Tiredness/Distractibility, Stomach/Head Discomfort, and Claimed Virtue of Self/Others) accounting for 28.5% of the total variance. The FBS's internal structure showed factoral discordance, as Claimed Virtue was negatively related to most of the FBS and to its somatic complaint components. Scores on this 12-item FBS component reflected a denial of socially undesirable attitudes and behaviors (Antisocial Practices Scale) that is commonly expressed by the 1,138 males in the MMPI-2 normative sample. These 12 items significantly reduced FBS reliability, introducing systematic error variance. In this VA neuropsychological referral setting, scores on the FBS have ambiguous meaning because of its structural discordance.
Prospective evaluation of 64 serum autoantibodies as biomarkers for early detection of colorectal cancer in a true screening setting.

PubMed

Chen, Hongda; Werner, Simone; Butt, Julia; Zörnig, Inka; Knebel, Phillip; Michel, Angelika; Eichmüller, Stefan B; Jäger, Dirk; Waterboer, Tim; Pawlita, Michael; Brenner, Hermann

2016-03-29

Novel blood-based screening tests are strongly desirable for early detection of colorectal cancer (CRC). We aimed to identify and evaluate autoantibodies against tumor-associated antigens as biomarkers for early detection of CRC. 380 clinically identified CRC patients and samples of participants with selected findings from a cohort of screening colonoscopy participants in 2005-2013 (N=6826) were included in this analysis. Sixty-four serum autoantibody markers were measured by multiplex bead-based serological assays. A two-step approach with selection of biomarkers in a training set, and validation of findings in a validation set, the latter exclusively including participants from the screening setting, was applied. Anti-MAGEA4 exhibited the highest sensitivity for detecting early stage CRC and advanced adenoma. Multi-marker combinations substantially increased sensitivity at the price of a moderate loss of specificity. Anti-TP53, anti-IMPDH2, anti-MDM2 and anti-MAGEA4 were consistently included in the best-performing 4-, 5-, and 6-marker combinations. This four-marker panel yielded a sensitivity of 26% (95% CI, 13-45%) for early stage CRC at a specificity of 90% (95% CI, 83-94%) in the validation set. Notably, it also detected 20% (95% CI, 13-29%) of advanced adenomas. Taken together, the identified biomarkers could contribute to the development of a useful multi-marker blood-based test for CRC early detection.
The ToMenovela – A Photograph-Based Stimulus Set for the Study of Social Cognition with High Ecological Validity

PubMed Central

Herbort, Maike C.; Iseev, Jenny; Stolz, Christopher; Roeser, Benedict; Großkopf, Nora; Wüstenberg, Torsten; Hellweg, Rainer; Walter, Henrik; Dziobek, Isabel; Schott, Björn H.

2016-01-01

We present the ToMenovela, a stimulus set that has been developed to provide a set of normatively rated socio-emotional stimuli showing varying amount of characters in emotionally laden interactions for experimental investigations of (i) cognitive and (ii) affective Theory of Mind (ToM), (iii) emotional reactivity, and (iv) complex emotion judgment with respect to Ekman’s basic emotions (happiness, anger, disgust, fear, sadness, surprise, Ekman and Friesen, 1975). Stimuli were generated with focus on ecological validity and consist of 190 scenes depicting daily-life situations. Two or more of eight main characters with distinct biographies and personalities are depicted on each scene picture. To obtain an initial evaluation of the stimulus set and to pave the way for future studies in clinical populations, normative data on each stimulus of the set was obtained from a sample of 61 neurologically and psychiatrically healthy participants (31 female, 30 male; mean age 26.74 ± 5.84), including a visual analog scale rating of Ekman’s basic emotions (happiness, anger, disgust, fear, sadness, surprise) and free-text descriptions of the content of each scene. The ToMenovela is being developed to provide standardized material of social scenes that are available to researchers in the study of social cognition. It should facilitate experimental control while keeping ecological validity high. PMID:27994562
Calibrating EASY-Care independence scale to improve accuracy

PubMed Central

Jotheeswaran, A. T.; Dias, Amit; Philp, Ian; Patel, Vikram; Prince, Martin

2016-01-01

Background there is currently limited support for the reliability and validity of the EASY-Care independence scale, with little work carried out in low- or middle-income countries. Therefore, we assessed the internal construct validity and hierarchical and classical scaling properties among frail dependent older people in the community. Objective we assessed the internal construct validity and hierarchical and classical scaling properties among frail dependent older people in the community. Methods three primary care physicians administered EASY-Care comprehensive geriatric assessment for 150 frail and/or dependent older people in the primary care setting. A Mokken model was applied to investigate hierarchical scaling properties of EASY-Care independence scale, and internal consistency (Cronbach's alpha) of the scale was also examined. Results we found that EASY-Care independence scale is highly internally consistent and is a strong hierarchical scale, hence providing strong evidence for unidimensionality. However, two items in the scale (unable to use telephone and manage finances) had much lower item Loevinger H coefficients than others. Exclusion of these two items improved the overall internal consistency of the scale. Conclusions the strong performance of the EASY-Care independence scale among community-dwelling frail older people is encouraging. This study confirms that EASY-Care independence scale is highly internally consistent and a strong hierarchical scale. PMID:27496925
The Validation of Version 8 Ozone Profiles: Is SBUV Ready for Prime Time?

NASA Technical Reports Server (NTRS)

McPeters, R. D.; Wellemeyer, C. G.; Ahn, C.

2004-01-01

Ozone profile data are now available from a series of BUV instruments - SBUV on Nimbus 7 and SBW/2 instruments on NOAA 9, NOAA 11, and NOAA 16. The data have been processed through the new version 8 algorithm, which is designed to be more accurate and, more importantly, to reduce the influence of the a priori on ozone trends. As a part of the version 8 reprocessing we have attempted to apply a consistent calibration to the individual instruments so that their data records can be used together in a time series analysis. Validation consists of examining not only the mean difference from external datasets (i.e trends) but also consistency in the interannual variability of the data. Here we validate the v8 BUV data through comparison with ECC sondes, lidar and microwave measurements, and with SAGE II and HALOE satellite data records. We find that individual profiles generally agree with external data sets within +/-10% between 30 hPa and 1 hPa (approx. 24 - 50 km) and frequently agree within +/-5%. The interannual variability of the BUV ozone time series agrees well with that of SAGE II . On the average, different B W instruments usually agree within +/-5% with each other, though the relative error increases near the ends of the Nimbus 7 and NOAA 16 data records as a result of instrument problems. The combined v8 BUV data sets cover the 1979-2003 time period giving daily global coverage of the ozone vertical distribution to better accuracy than has ever been possible before.
The psychometric properties of the 'Hospital Survey on Patient Safety Culture' in Dutch hospitals.

PubMed

Smits, Marleen; Christiaans-Dingelhoff, Ingrid; Wagner, Cordula; Wal, Gerrit van der; Groenewegen, Peter P

2008-11-07

In many different countries the Hospital Survey on Patient Safety Culture (HSOPS) is used to assess the safety culture in hospitals. Accordingly, the questionnaire has been translated into Dutch for application in the Netherlands. The aim of this study was to examine the underlying dimensions and psychometric properties of the questionnaire in Dutch hospital settings, and to compare these results with the original questionnaire used in USA hospital settings. The HSOPS was completed by 583 staff members of four general hospitals, three teaching hospitals, and one university hospital in the Netherlands. Confirmatory factor analyses were performed to examine the applicability of the factor structure of the American questionnaire to the Dutch data. Explorative factor analyses were performed to examine whether another composition of items and factors would fit the data better. Supplementary psychometric analyses were performed, including internal consistency and construct validity. The confirmatory factor analyses were based on the 12-factor model of the original questionnaire and resulted in a few low reliability scores. 11 Factors were drawn with explorative factor analyses, with acceptable reliability scores and a good construct validity. Two items were removed from the questionnaire. The composition of the factors was very similar to that of the original questionnaire. A few items moved to another factor and two factors turned out to combine into a six-item dimension. All other dimensions consisted of two to five items. The Dutch translation of the HSOPS consists of 11 factors with acceptable reliability and good construct validity. and is similar to the original HSOPS factor structure.
The modified patient enablement instrument: a Portuguese cross-cultural adaptation, validity and reliability study.

PubMed

Remelhe, Mafalda; Teixeira, Pedro M; Lopes, Irene; Silva, Luís; Correia de Sousa, Jaime

2017-01-12

Enabling patients with asthma to obtain the knowledge, confidence and skills they need in order to assume a major role in the management of their disease is cost effective. It should be an integral part of any plan for long-term control of asthma. The modified Patient Enablement Instrument (mPEI) is an easily administered questionnaire that was adapted in the United Kingdom to measure patient enablement in asthma, but its applicability in Portugal is not known. Validity and reliability of questionnaires should be tested before use in settings different from those of the original version. The purpose of this study was to test the applicability of the mPEI to Portuguese asthma patients after translation and cross-cultural adaptation, and to verify the structural validity, internal consistency and reproducibility of the instrument. The mPEI was translated to Portuguese and back translated to English. Its content validity was assessed by a debriefing interview with 10 asthma patients. The translated instrument was then administered to a random sample of 142 patients with persistent asthma. Structural validity and internal consistency were assessed. For reproducibility analysis, 86 patients completed the instrument again 7 days later. Item-scale correlations and exploratory factor analysis were used to assess structural validity. Cronbach's alpha was used to test internal consistency, and the intra-class correlation coefficient was used for the analysis of reproducibility. All items of the Portuguese version of the mPEI were found to be equivalent to the original English version. There were strong item-scale correlations that confirmed construct validity, with a one component structure and good internal consistency (Cronbach's alpha >0.8) as well as high test-retest reliability (ICC=0.85). The mPEI showed sound psychometric properties for the evaluation of enablement in patients with asthma making it a reliable instrument for use in research and clinical practice in Portugal. Further studies are needed to confirm its responsiveness.
Reliability and validity of the Treatment Satisfaction Questionnaire for Medication among Portuguese-speaking Brazilian patients with hypertension.

PubMed

Sauer Liberato, Ana Carolina; Cunha Matheus Rodrigues, Roberta; Kim, MyoungJin; Mallory, Caroline

2016-07-01

This study examined the reliability and validity of the Brazilian Portuguese version of the Treatment Satisfaction Questionnaire for Medication (version 1.4) among patients with hypertension. Understanding the patient experience with treatment satisfaction will contribute to improved medication adherence and control of hypertension. Hypertension is a serious problem in Brazil that is associated with chronic illness controlled, in part, by consistent adherence to medications. Patient satisfaction with medication treatment is associated with adherence to medication. The Treatment Satisfaction Questionnaire for Medication (version 1.4) is a promising instrument for measuring medication; however, to date there has been no report of the reliability and validity of the instrument with Portuguese-speaking adults with hypertension in Brazil. Cross-sectional descriptive exploratory study. A convenience sample of 300 patients with hypertension in an outpatient setting in the southeast region of São Paulo state in Brazil completed the Treatment Satisfaction Questionnaire for Medication (version 1.4). The instrument, comprised of four subscales, was evaluated for reliability using correlation analyses and internal consistency. Confirmatory factor analysis was used to determine factorial validity. Correlational analyses, internal consistency (Cronbach's alpha) and hierarchical confirmatory factor analysis demonstrate adequate support for the four-factor dimensionality, reliability and factorial validity of the Treatment Satisfaction Questionnaire for Medication (version 1.4). This study provides modest evidence for internal consistency and factorial validity of the Treatment Satisfaction Questionnaire for Medication (version 1.4) in Portuguese-speaking adult Brazilians with hypertension. Future testing should focus on extending reliability testing, discriminant validity and potential translation and literacy issues in this population. Within known limitations, clinicians will find the Treatment Satisfaction Questionnaire for Medication (version 1.4) useful for identifying adult Portuguese-speaking Brazilian patients at risk of poor adherence and tailoring adherence interventions to promote hypertension control. © 2016 John Wiley & Sons Ltd.
Reliability and validity of the Dutch pediatric Voice Handicap Index.

PubMed

Veder, Laura; Pullens, Bas; Timmerman, Marieke; Hoeve, Hans; Joosten, Koen; Hakkesteegt, Marieke

2017-05-01

The pediatric voice handicap index (pVHI) has been developed to provide a better insight into the parents' perception of their child's voice related quality of life. The purpose of the present study was to validate the Dutch pVHI by evaluating its internal consistency and reliability. Furthermore, we determined the optimal cut-off point for a normal pVHI score. All items of the English pVHI were translated into Dutch. Parents of children in our dysphonic and control group were asked to fill out the questionnaire. For the test re-test analysis we used a different study group who filled out the pVHI twice as part of a large follow up study. Internal consistency was analyzed through Cronbach's α coefficient. The test-retest reliability was assessed by determining Pearson's correlation coefficient. Mann-Whitney test was used to compare the scores of the questionnaire of the control group with the dysphonic group. By calculating receiver operating characteristic (ROC) curves, sensitivity and specificity we were able to set a cut-off point. We obtained data from 122 asymptomatic children and from 79 dysphonic children. The scores of the questionnaire significantly differed between both groups. The internal consistency showed an overall Cronbach α coefficient of 0.96 and an excellent test-retest reliability of the total pVHI questionnaire with a Pearson's correlation coefficient of 0.90. A cut-off point for the total pVHI questionnaire was set at 7 points with a specificity of 85% and sensitivity of 100%. A cut-off point for the VAS score was set at 13 with a specificity of 93% and sensitivity of 97%. The Dutch pVHI is a valid and reliable tool for the assessment of children with voice problems. By setting a cut-off point for the score of the total pVHI questionnaire of 7 points and the VAS score of 13, the pVHI might be used as a screening tool to assess dysphonic complaints and the pVHI might be a useful and complementary tool to identify children with dysphonia. Copyright © 2017 Elsevier B.V. All rights reserved.
Validation of the Rapid Estimate for Adolescent Literacy in Medicine Short Form (REALM-TeenS)

PubMed Central

Colvin, Kimberly F.; Chisolm, Deena J.; Arnold, Connie; Hancock, Jill; Davis, Terry

2017-01-01

BACKGROUND: This study was designed to develop and validate a brief adolescent health literacy assessment tool (Rapid Estimate of Adolescent Literacy in Medicine Short Form [REALM-TeenS]). METHODS: We combined datasets from 2 existing research studies that used the REALM-Teen (n = 665) and conducted an item response theory analysis. The correlation between scores on the original 66-item REALM-Teen and the proposed REALM-TeenS was calculated, along with the decision consistency across forms with respect to grade level assignment of each adolescent and coefficient α. The proposed REALM-TeenS was validated with original REALM-Teen data from a third independent study (n = 174). RESULTS: Items with the largest discriminations across the scale, from low to high health literacy, were selected for inclusion in REALM-TeenS. From those, a set of 10 items was selected that maintained a reasonable level of SE across ability estimates and correlated highly (r = 0.92) with the original REALM-Teen scores. The coefficient α for the 10-item REALM-TeenS was .82. There was no evidence of model misfit (root mean square error of approximation < 0.001). In the validation sample, REALM-TeenS scores correlated highly with scores on the original REALM-Teen (r = 0.92), and the decision consistency across both forms was 80%. In pilot testing, administration took ∼20 seconds. CONCLUSIONS: The REALM-TeenS offers researchers and clinicians a brief validated screening tool that can be used to assess adolescent health literacy in a variety of settings. Scoring guidelines ensure that reading level assessment is appropriate by age and grade. PMID:28557740

Validation of the Rapid Estimate for Adolescent Literacy in Medicine Short Form (REALM-TeenS).

PubMed

Manganello, Jennifer A; Colvin, Kimberly F; Chisolm, Deena J; Arnold, Connie; Hancock, Jill; Davis, Terry

2017-05-01

This study was designed to develop and validate a brief adolescent health literacy assessment tool (Rapid Estimate of Adolescent Literacy in Medicine Short Form [REALM-TeenS]). We combined datasets from 2 existing research studies that used the REALM-Teen ( n = 665) and conducted an item response theory analysis. The correlation between scores on the original 66-item REALM-Teen and the proposed REALM-TeenS was calculated, along with the decision consistency across forms with respect to grade level assignment of each adolescent and coefficient α. The proposed REALM-TeenS was validated with original REALM-Teen data from a third independent study ( n = 174). Items with the largest discriminations across the scale, from low to high health literacy, were selected for inclusion in REALM-TeenS. From those, a set of 10 items was selected that maintained a reasonable level of SE across ability estimates and correlated highly ( r = 0.92) with the original REALM-Teen scores. The coefficient α for the 10-item REALM-TeenS was .82. There was no evidence of model misfit (root mean square error of approximation < 0.001). In the validation sample, REALM-TeenS scores correlated highly with scores on the original REALM-Teen ( r = 0.92), and the decision consistency across both forms was 80%. In pilot testing, administration took ∼20 seconds. The REALM-TeenS offers researchers and clinicians a brief validated screening tool that can be used to assess adolescent health literacy in a variety of settings. Scoring guidelines ensure that reading level assessment is appropriate by age and grade. Copyright © 2017 by the American Academy of Pediatrics.
Brazilian validation of the Alberta Infant Motor Scale.

PubMed

Valentini, Nadia Cristina; Saccani, Raquel

2012-03-01

The Alberta Infant Motor Scale (AIMS) is a well-known motor assessment tool used to identify potential delays in infants' motor development. Although Brazilian researchers and practitioners have used the AIMS in laboratories and clinical settings, its translation to Portuguese and validation for the Brazilian population is yet to be investigated. This study aimed to translate and validate all AIMS items with respect to internal consistency and content, criterion, and construct validity. A cross-sectional and longitudinal design was used. A cross-cultural translation was used to generate a Brazilian-Portuguese version of the AIMS. In addition, a validation process was conducted involving 22 professionals and 766 Brazilian infants (aged 0-18 months). The results demonstrated language clarity and internal consistency for the motor criteria (motor development score, α=.90; prone, α=.85; supine, α=.92; sitting, α=.84; and standing, α=.86). The analysis also revealed high discriminative power to identify typical and atypical development (motor development score, P<.001; percentile, P=.04; classification criterion, χ(2)=6.03; P=.05). Temporal stability (P=.07) (rho=.85, P<.001) was observed, and predictive power (P<.001) was limited to the group of infants aged from 3 months to 9 months. Limited predictive validity was observed, which may have been due to the restricted time that the groups were followed longitudinally. In sum, the translated version of AIMS presented adequate validity and reliability.
Identifying dyspepsia in the Greek population: translation and validation of a questionnaire

PubMed Central

Anastasiou, Foteini; Antonakis, Nikos; Chaireti, Georgia; Theodorakis, Pavlos N; Lionis, Christos

2006-01-01

Background Studies on clinical issues, including diagnostic strategies, are considered to be the core content of general practice research. The use of standardised instruments is regarded as an important component for the development of Primary Health Care research capacity. Demand for epidemiological cross-cultural comparisons in the international setting and the use of common instruments and definitions valid to each culture is bigger than ever. Dyspepsia is a common complaint in primary practice but little is known with respect to its incidence in Greece. There are some references about the Helicobacter Pylori infection in patients with functional dyspepsia or gastric ulcer in Greece but there is no specific instrument for the identification of dyspepsia. This paper reports on the validation and translation into Greek, of an English questionnaire for the identification of dyspepsia in the general population and discusses several possibilities of its use in the Greek primary care. Methods The selected English postal questionnaire for the identification of people with dyspepsia in the general population consists of 30 items and was developed in 1995. The translation and cultural adaptation of the questionnaire has been performed according to international standards. For the validation of the instrument the internal consistency of the items was established using the alpha coefficient of Chronbach, the reproducibility (test – retest reliability) was measured by kappa correlation coefficient and the criterion validity was calculated against the diagnosis of the patients' records using also kappa correlation coefficient. Results The final Greek version of the postal questionnaire for the identification of dyspepsia in the general population was reliably translated. The internal consistency of the questionnaire was good, Chronbach's alpha was found to be 0.88 (95% CI: 0.81–0.93), suggesting that all items were appropriate to measure. Kappa coefficient for reproducibility (test – retest reliability) was found 0.66 (95% CI: 0.62–0.71), whereas the kappa analysis for criterion validity was 0.63 (95% CI: 0.36–0.89). Conclusion This study indicates that the Greek translation is comparable with the English-language version in terms of validity and reliability, and is suitable for epidemiological research within the Greek primary health care setting. PMID:16515708
Nurses' knowledge and attitudes towards aged sexuality: validity and internal consistency of the Dutch version of the Aging Sexual Knowledge and Attitudes Scale.

PubMed

Mahieu, Lieslot; de Casterlé, Bernadette Dierckx; Van Elssen, Kim; Gastmans, Chris

2013-11-01

This paper reports a study testing the content and face validity and internal consistency of the Dutch version of the Aging Sexual Knowledge and Attitudes Scale. The ability of older residents to sexually express themselves is known to be influenced by the knowledge and attitudes of nursing home staff towards later-life sexuality. Although the Aging Sexual Knowledge and Attitudes Scale is a widely used instrument to measure this, there is no validated, Dutch translation available. Instrument development. Following a standard forward/backward translation into Dutch, the scale was further adapted for use in Flemish nursing home settings. Content and face validity and user-friendliness were assessed. The psychometric properties were determined by means of an exploratory study. Data were collected from March-April 2011 at eight Flemish nursing homes. Reliability was assessed using internal consistency and item-total correlations. Both subscales of the Flemish adaptation showed acceptable content validity. The face validity and user-friendliness were deemed favourable with hardly any remarks given by the expert panel. The Cronbach's α was 0.80 and 0.88 for the knowledge and attitude subscales, respectively. The item-total correlations ranged from 0.21-0.48 for the knowledge section and from 0.09-0.68 for the attitude subscale. We conclude from our study that the Dutch version of the scale has acceptable to good psychometric properties. The Flemish adaptation therefore seems to be a valuable instrument for studying nursing staff's knowledge and attitudes towards aged sexuality in Flanders. © 2013 Blackwell Publishing Ltd.
Community validation of the IDEA study cognitive screen in rural Tanzania.

PubMed

Gray, William K; Paddick, Stella Maria; Collingwood, Cecilia; Kisoli, Aloyce; Mbowe, Godfrey; Mkenda, Sarah; Lissu, Carolyn; Rogathi, Jane; Kissima, John; Walker, Richard W; Mushi, Declare; Chaote, Paul; Ogunniyi, Adesola; Dotchin, Catherine L

2016-11-01

The dementia diagnosis gap in sub-Saharan Africa (SSA) is large, partly because of difficulties in screening for cognitive impairment in the community. As part of the Identification and Intervention for Dementia in Elderly Africans (IDEA) study, we aimed to validate the IDEA cognitive screen in a community-based sample in rural Tanzania METHODS: Study participants were recruited from people who attended screening days held in villages within the rural Hai district of Tanzania. Criterion validity was assessed against the gold standard clinical dementia diagnosis using DSM-IV criteria. Construct validity was assessed against, age, education, sex and grip strength and instrumental activities of daily living (IADLs). Internal consistency and floor and ceiling effects were also examined. During community screening, the IDEA cognitive screen had high criterion validity, with an area under the receiver operating characteristic curve of 0.855 (95% CI 0.794 to 0.915). Higher scores on the screen were significantly correlated with lower age, male sex, having attended school, better grip strength and improved performance in activities of daily living. Factor analysis revealed a single factor with an eigenvalue greater than one, although internal consistency was only moderate (Cronbach's alpha = 0.534). The IDEA cognitive screen had high criterion and construct validity and is suitable for use as a cognitive screening instrument in a community setting in SSA. Only moderate internal consistency may partly reflect the multi-domain nature of dementia as diagnosed clinically. Copyright © 2016 John Wiley & Sons, Ltd. Copyright © 2016 John Wiley & Sons, Ltd.
Cerebrospinal Fluid Peptides as Potential Parkinson Disease Biomarkers: A Staged Pipeline for Discovery and Validation*

PubMed Central

Shi, Min; Movius, James; Dator, Romel; Aro, Patrick; Zhao, Yanchun; Pan, Catherine; Lin, Xiangmin; Bammler, Theo K.; Stewart, Tessandra; Zabetian, Cyrus P.; Peskind, Elaine R.; Hu, Shu-Ching; Quinn, Joseph F.; Galasko, Douglas R.; Zhang, Jing

2015-01-01

Finding robust biomarkers for Parkinson disease (PD) is currently hampered by inherent technical limitations associated with imaging or antibody-based protein assays. To circumvent the challenges, we adapted a staged pipeline, starting from our previous proteomic profiling followed by high-throughput targeted mass spectrometry (MS), to identify peptides in human cerebrospinal fluid (CSF) for PD diagnosis and disease severity correlation. In this multicenter study consisting of training and validation sets, a total of 178 subjects were randomly selected from a retrospective cohort, matching age and sex between PD patients, healthy controls, and neurological controls with Alzheimer disease (AD). From ∼14,000 unique peptides displaying differences between PD and healthy control in proteomic investigations, 126 peptides were selected based on relevance and observability in CSF using bioinformatic analysis and MS screening, and then quantified by highly accurate and sensitive selected reaction monitoring (SRM) in the CSF of 30 PD patients versus 30 healthy controls (training set), followed by diagnostic (receiver operating characteristics) and disease severity correlation analyses. The most promising candidates were further tested in an independent cohort of 40 PD patients, 38 AD patients, and 40 healthy controls (validation set). A panel of five peptides (derived from SPP1, LRP1, CSF1R, EPHA4, and TIMP1) was identified to provide an area under curve (AUC) of 0.873 (sensitivity = 76.7%, specificity = 80.0%) for PD versus healthy controls in the training set. The performance was essentially confirmed in the validation set (AUC = 0.853, sensitivity = 82.5%, specificity = 82.5%). Additionally, this panel could also differentiate the PD and AD groups (AUC = 0.990, sensitivity = 95.0%, specificity = 97.4%). Furthermore, a combination of two peptides belonging to proteins TIMP1 and APLP1 significantly correlated with disease severity as determined by the Unified Parkinson's Disease Rating Scale motor scores in both the training (r = 0.381, p = 0.038)j and the validation (r = 0.339, p = 0.032) sets. The novel panel of CSF peptides, if validated in independent cohorts, could be used to assist in clinical diagnosis of PD and has the potential to help monitoring or predicting disease progression. PMID:25556233
Construct validity test of evaluation tool for professional behaviors of entry-level occupational therapy students in the United States.

PubMed

Yuen, Hon K; Azuero, Andres; Lackey, Kaitlin W; Brown, Nicole S; Shrestha, Sangita

2016-01-01

This study aimed to test the construct validity of an instrument to measure student professional behaviors in entry-level occupational therapy (OT) students in the academic setting. A total of 718 students from 37 OT programs across the United States answered a self-assessment survey of professional behavior that we developed. The survey consisted of ranking 28 attributes, each on a 5-point Likert scale. A split-sample approach was used for exploratory and then confirmatory factor analysis. A three-factor solution with nine items was extracted using exploratory factor analysis [EFA] (n=430, 60%). The factors were 'Commitment to Learning' (2 items), 'Skills for Learning' (4 items), and 'Cultural Competence' (3 items). Confirmatory factor analysis (CFA) on the validation split (n=288, 40%) indicated fair fit for this three-factor model (fit indices: CFI=0.96, RMSEA=0.06, and SRMR=0.05). Internal consistency reliability estimates of each factor and the instrument ranged from 0.63 to 0.79. Results of the CFA in a separate validation dataset provided robust measures of goodness-of-fit for the three-factor solution developed in the EFA, and indicated that the three-factor model fitted the data well enough. Therefore, we can conclude that this student professional behavior evaluation instrument is a structurally validated tool to measure professional behaviors reported by entry-level OT students. The internal consistency reliability of each individual factor and the whole instrument was considered to be adequate to good.
Psychometric properties of the Postgraduate Hospital Educational Environment Measure in an Iranian hospital setting.

PubMed

Shokoohi, Shahrzad; Emami, Amir Hossein; Mohammadi, Aeen; Ahmadi, Soleiman; Mojtahedzadeh, Rita

2014-01-01

Background Students' perceptions of the educational environment are an important construct in assessing and enhancing the quality of medical training programs. Reliable and valid measurement, however, can be problematic - especially as instruments developed and tested in one culture are translated for use in another. Materials and method This study sought to explore the psychometric properties of the Postgraduate Hospital Educational Environment Measure (PHEEM) for use in an Iranian hospital training setting. We translated the instrument into Persian and ensured its content validity by back translation and expert review prior to administering it to 127 residents of Urmia University of Medical Science. Results Overall internal consistency of the translated measure was good (a=0.94). Principal components analysis revealed five factors accounting for 52.8% of the variance. Conclusion The Persian version of the PHEEM appears to be a reliable and potentially valid instrument for use in Iranian medical schools and may find favor in evaluating the educational environments of residency programs nationwide.
Psychometric properties of the postgraduate hospital educational environment measure in an Iranian hospital setting.

PubMed

Shokoohi, Shahrzad; Hossein Emami, Amir; Mohammadi, Aeen; Ahmadi, Soleiman; Mojtahedzadeh, Rita

2014-01-01

Students' perceptions of the educational environment are an important construct in assessing and enhancing the quality of medical training programs. Reliable and valid measurement, however, can be problematic - especially as instruments developed and tested in one culture are translated for use in another. This study sought to explore the psychometric properties of the Postgraduate Hospital Educational Environment Measure (PHEEM) for use in an Iranian hospital training setting. We translated the instrument into Persian and ensured its content validity by back translation and expert review prior to administering it to 127 residents of Urmia University of Medical Science. Overall internal consistency of the translated measure was good (a=0.94). Principal components analysis revealed five factors accounting for 52.8% of the variance. The Persian version of the PHEEM appears to be a reliable and potentially valid instrument for use in Iranian medical schools and may find favor in evaluating the educational environments of residency programs nationwide.
Development and validity of a scale to measure workplace culture of health.

PubMed

Kwon, Youngbum; Marzec, Mary L; Edington, Dee W

2015-05-01

To describe the development of and test the validity and reliability of the Workplace Culture of Health (COH) scale. Exploratory factor analysis and confirmatory factor analysis were performed on data from a health care organization (N = 627). To verify the factor structure, confirmatory factor analysis was performed on a second data set from a medical equipment manufacturer (N = 226). The COH scale included a structure of five orthogonal factors: senior leadership and polices, programs and rewards, quality assurance, supervisor support, and coworker support. With regard to construct validity (convergent and discriminant) and reliability, two different US companies showed the same factorial structure, satisfactory fit statistics, and suitable internal and external consistency. The COH scale represents a reliable and valid scale to assess the workplace environment and culture for supporting health.
[Internal consistency and criterion validity and reliability of the Mexican Version of the Child Behavior Checklist 1.5-5 (CBCL/1.5-5)].

PubMed

Albores-Gallo, Lilia; Hernández-Guzmán, Laura; Hasfura-Buenaga, Cecilia; Navarro-Luna, Enrique

To investigate the validity and internal consistency of the Mexican version of the CBCL/1.5 -5 that assesses the most common psychopathology in pre-school children in clinical and epidemiological settings. A total of 438 parents from two groups, clinical-psychiatric (N= 62) and community (N= 376) completed the CBCL/1.5-5/Mexican version. The internal consistency was high for total problems α=0.95, and internalized α=0.89 and externalized α=0.91 subscales. The test re-test (one week) using the intraclass correlation coefficient (ICC) was ≥ 0.95 for the internalized, externalized, and total problems subscales. The ROC curve for the criterion status of clinically-referred vs. non-referred using the total problems scale ≥ 24 resulted in an AUC (area under curve) of 0.77, a specificity 0.73, and a sensitivity of 0.70. The CBCL/1.5 -5/Mexican version is a reliable and valid tool. Copyright Â© 2016 Sociedad Chilena de Pediatría. Publicado por Elsevier España, S.L.U. All rights reserved.
Predicting free-living energy expenditure using a miniaturized ear-worn sensor: an evaluation against doubly labeled water.

PubMed

Bouarfa, Loubna; Atallah, Louis; Kwasnicki, Richard Mark; Pettitt, Claire; Frost, Gary; Yang, Guang-Zhong

2014-02-01

Accurate estimation of daily total energy expenditure (EE)is a prerequisite for assisted weight management and assessing certain health conditions. The use of wearable sensors for predicting free-living EE is challenged by consistent sensor placement, user compliance, and estimation methods used. This paper examines whether a single ear-worn accelerometer can be used for EE estimation under free-living conditions.An EE prediction model as first derived and validated in a controlled setting using healthy subjects involving different physical activities. Ten different activities were assessed showing a tenfold cross validation error of 0.24. Furthermore, the EE prediction model shows a mean absolute deviation(MAD) below 1.2 metabolic equivalent of tasks. The same model was applied to a free-living setting with a different population for further validation. The results were compared against those derived from doubly labeled water. In free-living settings, the predicted daily EE has a correlation of 0.74, p 0.008, and a MAD of 272 kcal day. These results demonstrate that laboratory-derived prediction models can be used to predict EE under free-living conditions [corrected].
The Ryerson Audio-Visual Database of Emotional Speech and Song (RAVDESS): A dynamic, multimodal set of facial and vocal expressions in North American English

PubMed Central

Russo, Frank A.

2018-01-01

The RAVDESS is a validated multimodal database of emotional speech and song. The database is gender balanced consisting of 24 professional actors, vocalizing lexically-matched statements in a neutral North American accent. Speech includes calm, happy, sad, angry, fearful, surprise, and disgust expressions, and song contains calm, happy, sad, angry, and fearful emotions. Each expression is produced at two levels of emotional intensity, with an additional neutral expression. All conditions are available in face-and-voice, face-only, and voice-only formats. The set of 7356 recordings were each rated 10 times on emotional validity, intensity, and genuineness. Ratings were provided by 247 individuals who were characteristic of untrained research participants from North America. A further set of 72 participants provided test-retest data. High levels of emotional validity and test-retest intrarater reliability were reported. Corrected accuracy and composite "goodness" measures are presented to assist researchers in the selection of stimuli. All recordings are made freely available under a Creative Commons license and can be downloaded at https://doi.org/10.5281/zenodo.1188976. PMID:29768426
Perceived functional ability assessed with the spinal function sort: is it valid for European rehabilitation settings in patients with non-specific non-acute low back pain?

PubMed Central

Hilfiker, R.; Kool, J. P.; Bachmann, S.; Hagen, K. B.

2010-01-01

The aim of this study involving 170 patients suffering from non-specific low back pain was to test the validity of the spinal function sort (SFS) in a European rehabilitation setting. The SFS, a picture-based questionnaire, assesses perceived functional ability of work tasks involving the spine. All measurements were taken by a blinded research assistant; work status was assessed with questionnaires. Our study demonstrated a high internal consistency shown by a Cronbach’s alpha of 0.98, reasonable evidence for unidimensionality, spearman correlations of >0.6 with work activities, and discriminating power for work status at 3 and 12 months by ROC curve analysis (area under curve = 0.760 (95% CI 0.689–0.822), respectively, 0.801 (95% CI 0.731–0.859). The standardised response mean within the two treatment groups was 0.18 and −0.31. As a result, we conclude that the perceived functional ability for work tasks can be validly assessed with the SFS in a European rehabilitation setting in patients with non-specific low back pain, and is predictive for future work status. PMID:20490874
Nutrition screening tools: does one size fit all? A systematic review of screening tools for the hospital setting.

PubMed

van Bokhorst-de van der Schueren, Marian A E; Guaitoli, Patrícia Realino; Jansma, Elise P; de Vet, Henrica C W

2014-02-01

Numerous nutrition screening tools for the hospital setting have been developed. The aim of this systematic review is to study construct or criterion validity and predictive validity of nutrition screening tools for the general hospital setting. A systematic review of English, French, German, Spanish, Portuguese and Dutch articles identified via MEDLINE, Cinahl and EMBASE (from inception to the 2nd of February 2012). Additional studies were identified by checking reference lists of identified manuscripts. Search terms included key words for malnutrition, screening or assessment instruments, and terms for hospital setting and adults. Data were extracted independently by 2 authors. Only studies expressing the (construct, criterion or predictive) validity of a tool were included. 83 studies (32 screening tools) were identified: 42 studies on construct or criterion validity versus a reference method and 51 studies on predictive validity on outcome (i.e. length of stay, mortality or complications). None of the tools performed consistently well to establish the patients' nutritional status. For the elderly, MNA performed fair to good, for the adults MUST performed fair to good. SGA, NRS-2002 and MUST performed well in predicting outcome in approximately half of the studies reviewed in adults, but not in older patients. Not one single screening or assessment tool is capable of adequate nutrition screening as well as predicting poor nutrition related outcome. Development of new tools seems redundant and will most probably not lead to new insights. New studies comparing different tools within one patient population are required. Copyright © 2013 Elsevier Ltd and European Society for Clinical Nutrition and Metabolism. All rights reserved.
Good validity of the international spinal cord injury quality of life basic data set.

PubMed

Post, M W M; Adriaansen, J J E; Charlifue, S; Biering-Sørensen, F; van Asbeck, F W A

2016-04-01

Cross-sectional validation study. To examine the construct and concurrent validity of the International Spinal Cord Injury (SCI) Quality of Life (QoL) Basic Data Set. Dutch community. People 28-65 years of age, who obtained their SCI between 18 and 35 years of age, were at least 10 years post SCI and were wheelchair users in daily life. MEASURE(S): The International SCI QoL Basic Data Set consists of three single items on satisfaction with life as a whole, physical health and psychological health (0=complete dissatisfaction; 10=complete satisfaction). Reference measures were the Mental Health Inventory-5 and three items of the World Health Organization Quality of Life measure. Data of 261 participants were available. Mean time after SCI was 24.1 years (s.d. 9.1); 90.4% had a traumatic SCI, 81.5% a motor complete SCI and 40% had tetraplegia. Mean age was 47.9 years (s.d. 8.8) and 73.2% were male. Mean scores were 6.9 (s.d. 1.9) for general QoL, 5.8 (s.d. 2.2) for physical health and 7.1 (s.d. 1.9) for psychological health. No floor or ceiling effects were found. Strong inter-correlations (0.48-0.71) were found between the items, and Cronbach's alpha of the scale was good (0.81). Correlations with the reference measures showed the strongest correlations between the WHOQOL general satisfaction item and general QoL (0.64), the WHOQOL health and daily activities items and physical health (0.69 and 0.60) and the Mental Health Inventory-5 and psychological health (0.70). This first validity study of the International SCI QoL Basic Data Set shows that it appears valid for persons with SCI.
Consistency of QSAR models: Correct split of training and test sets, ranking of models and performance parameters.

PubMed

Rácz, A; Bajusz, D; Héberger, K

2015-01-01

Recent implementations of QSAR modelling software provide the user with numerous models and a wealth of information. In this work, we provide some guidance on how one should interpret the results of QSAR modelling, compare and assess the resulting models, and select the best and most consistent ones. Two QSAR datasets are applied as case studies for the comparison of model performance parameters and model selection methods. We demonstrate the capabilities of sum of ranking differences (SRD) in model selection and ranking, and identify the best performance indicators and models. While the exchange of the original training and (external) test sets does not affect the ranking of performance parameters, it provides improved models in certain cases (despite the lower number of molecules in the training set). Performance parameters for external validation are substantially separated from the other merits in SRD analyses, highlighting their value in data fusion.
Cross-cultural adaptation and validation of the Norwegian pain catastrophizing scale in patients with low back pain.

PubMed

Fernandes, Linda; Storheim, Kjersti; Lochting, Ida; Grotle, Margreth

2012-06-22

Pain catastrophizing has been found to be an important predictor of disability and days lost from work in patients with low back pain. The most commonly used outcome measure to identify pain catastrophizing is the Pain Catastrophizing Scale (PCS). To enable the use of the PCS in clinical settings and research in Norwegian speaking patients, the PCS had to be translated. The purpose of this study was therefore to translate and cross-culturally adapt the PCS into Norwegian and to test internal consistency, construct validity and reproducibility of the PCS. The PCS was translated before it was tested for psychometric properties. Patients with subacute or chronic non-specific low back pain aged 18 years or more were recruited from primary and secondary care. Validity of the PCS was assessed by evaluating data quality (missing, floor and ceiling effects), principal components analysis, internal consistency (Cronbach's alpha), and construct validity (Spearman's rho). Reproducibility analyses included standard error of measurement, minimum detectable change, limits of agreement, and intraclass correlation coefficients. A total of 38 men and 52 women (n = 90), with a mean (SD) age of 47.6 (11.7) years, were included for baseline testing. A subgroup of 61 patients was included for test-retest assessments. The Norwegian PCS was easy-to-comprehend. The principal components analysis supported a three-factor structure, internal consistency was satisfactory for the PCS total score (α 0.90) and the subscales rumination (α 0.83) and helplessness (α 0.86), but not for the subscale magnification (α 0.53). In total, 86% of the correlation analyses were in accordance with predefined hypothesis. The reliability analyses showed intraclass correlation coefficients of 0.74 - 0.87 for the PCS total score and subscales. The PCS total score (range 0-52 points) showed a standard error of measurement of 4.6 points and a 95% minimum detectable change estimate of 12.8 points. The Norwegian PCS total score showed acceptable psychometric properties in terms of comprehensibility, consistency, construct validity, and reproducibility when applied to patients with subacute or chronic LBP from different clinical settings. Our study support the use of the PCS total score for clinical or research purposes identifying or evaluating pain catastrophizing.
Automatic, semi-automatic and manual validation of urban drainage data.

PubMed

Branisavljević, N; Prodanović, D; Pavlović, D

2010-01-01

Advances in sensor technology and the possibility of automated long distance data transmission have made continuous measurements the preferable way of monitoring urban drainage processes. Usually, the collected data have to be processed by an expert in order to detect and mark the wrong data, remove them and replace them with interpolated data. In general, the first step in detecting the wrong, anomaly data is called the data quality assessment or data validation. Data validation consists of three parts: data preparation, validation scores generation and scores interpretation. This paper will present the overall framework for the data quality improvement system, suitable for automatic, semi-automatic or manual operation. The first two steps of the validation process are explained in more detail, using several validation methods on the same set of real-case data from the Belgrade sewer system. The final part of the validation process, which is the scores interpretation, needs to be further investigated on the developed system.
Spanish Adaptation and Validation of the Outcome Questionnaire OQ-30.2

PubMed Central

Errázuriz, Paula; Opazo, Sebastián; Behn, Alex; Silva, Oscar; Gloger, Sergio

2017-01-01

This study assessed the psychometric properties of a Spanish version of the Shortened Outcome Questionnaire (OQ-30.2, Lambert et al., 2004) validated with a sample of 546 patients in an outpatient mental health clinic and 100 non-clinical adults in Chile. Our results show that this measure has similar normative data to the original measure, with a cutoff score for the Chilean population set at 43.36, and the reliable change index at 14. This Spanish OQ-30.2 has good internal consistency (α = 0.90), has concurrent validity with the Depressive, Anxious, and Somatoform disorders measuring scale (Alvarado and Vera, 1991), and is sensitive to change during psychotherapy. Consistent with previous studies, factorial analyses showed that both, the one-factor solution for a general scale and the three-factor solution containing three theoretical scales yielded poor fit estimates. Overall, our results are similar to past research on the OQ-45 and the OQ-30. The short version has adequate psychometric properties, comparable to those of the OQ-45, but provides a gain in application time that could be relevant in the setting of psychotherapy research with large samples, frequent assessments over time, and/or samples that may require more assistance completing items (e.g., low-literacy). We conclude that this measure will be a valuable instrument for research and clinical practice. PMID:28559857

Development and initial validation of the interprofessional team learning profiling questionnaire.

PubMed

Nisbet, Gillian; Dunn, Stewart; Lincoln, Michelle; Shaw, Joanne

2016-05-01

Informal workplace interprofessional learning occurs as health professionals interact with each other as part of everyday work practice. Participation in interprofessional team meetings is a practical way to foster learning. However, a gap exists in the availability of a reliable and valid instrument that adequately captures the nuances of informal workplace interprofessional learning in this setting. The purpose of this study was to develop a questionnaire to measure the different components of interprofessional learning that contribute to the quality of interprofessional learning within the interprofessional team meeting. Questionnaire items were developed from a review of the literature and interviews with health professionals. Exploratory factor analysis was used to determine the underlying factor structure. Two hundred and eighty-five health professionals completed a 98-item questionnaire. After elimination of unreliable items, the remaining items (n = 41) loaded onto four factors named personal and professional capacity; turning words into action-"walk the talk"; the rhetoric of interprofessional learning-"talk the talk"; and inclusiveness. Internal consistency was high for all sub-scales (Cronbach's alpha 0.91, 0.87, 0.83, and 0.83, respectively). Content, construct, and concurrent validity were assessed. The instrument developed in this study indicated consistency and robust psychometric properties. Future studies that further test the psychometric properties of the questionnaire will help to establish the usefulness of this measure in establishing evidence for the perceived effectiveness of interprofessional learning in a healthcare setting.
hEIDI: An Intuitive Application Tool To Organize and Treat Large-Scale Proteomics Data.

PubMed

Hesse, Anne-Marie; Dupierris, Véronique; Adam, Claire; Court, Magali; Barthe, Damien; Emadali, Anouk; Masselon, Christophe; Ferro, Myriam; Bruley, Christophe

2016-10-07

Advances in high-throughput proteomics have led to a rapid increase in the number, size, and complexity of the associated data sets. Managing and extracting reliable information from such large series of data sets require the use of dedicated software organized in a consistent pipeline to reduce, validate, exploit, and ultimately export data. The compilation of multiple mass-spectrometry-based identification and quantification results obtained in the context of a large-scale project represents a real challenge for developers of bioinformatics solutions. In response to this challenge, we developed a dedicated software suite called hEIDI to manage and combine both identifications and semiquantitative data related to multiple LC-MS/MS analyses. This paper describes how, through a user-friendly interface, hEIDI can be used to compile analyses and retrieve lists of nonredundant protein groups. Moreover, hEIDI allows direct comparison of series of analyses, on the basis of protein groups, while ensuring consistent protein inference and also computing spectral counts. hEIDI ensures that validated results are compliant with MIAPE guidelines as all information related to samples and results is stored in appropriate databases. Thanks to the database structure, validated results generated within hEIDI can be easily exported in the PRIDE XML format for subsequent publication. hEIDI can be downloaded from http://biodev.extra.cea.fr/docs/heidi .
Large scale study of multiple-molecule queries

PubMed Central

2009-01-01

Background In ligand-based screening, as well as in other chemoinformatics applications, one seeks to effectively search large repositories of molecules in order to retrieve molecules that are similar typically to a single molecule lead. However, in some case, multiple molecules from the same family are available to seed the query and search for other members of the same family. Multiple-molecule query methods have been less studied than single-molecule query methods. Furthermore, the previous studies have relied on proprietary data and sometimes have not used proper cross-validation methods to assess the results. In contrast, here we develop and compare multiple-molecule query methods using several large publicly available data sets and background. We also create a framework based on a strict cross-validation protocol to allow unbiased benchmarking for direct comparison in future studies across several performance metrics. Results Fourteen different multiple-molecule query methods were defined and benchmarked using: (1) 41 publicly available data sets of related molecules with similar biological activity; and (2) publicly available background data sets consisting of up to 175,000 molecules randomly extracted from the ChemDB database and other sources. Eight of the fourteen methods were parameter free, and six of them fit one or two free parameters to the data using a careful cross-validation protocol. All the methods were assessed and compared for their ability to retrieve members of the same family against the background data set by using several performance metrics including the Area Under the Accumulation Curve (AUAC), Area Under the Curve (AUC), F1-measure, and BEDROC metrics. Consistent with the previous literature, the best parameter-free methods are the MAX-SIM and MIN-RANK methods, which score a molecule to a family by the maximum similarity, or minimum ranking, obtained across the family. One new parameterized method introduced in this study and two previously defined methods, the Exponential Tanimoto Discriminant (ETD), the Tanimoto Power Discriminant (TPD), and the Binary Kernel Discriminant (BKD), outperform most other methods but are more complex, requiring one or two parameters to be fit to the data. Conclusion Fourteen methods for multiple-molecule querying of chemical databases, including novel methods, (ETD) and (TPD), are validated using publicly available data sets, standard cross-validation protocols, and established metrics. The best results are obtained with ETD, TPD, BKD, MAX-SIM, and MIN-RANK. These results can be replicated and compared with the results of future studies using data freely downloadable from http://cdb.ics.uci.edu/. PMID:20298525
The measurement of collaboration within healthcare settings: a systematic review of measurement properties of instruments.

PubMed

Walters, Stephen John; Stern, Cindy; Robertson-Malt, Suzanne

2016-04-01

There is a growing call by consumers and governments for healthcare to adopt systems and approaches to care to improve patient safety. Collaboration within healthcare settings is an important factor for improving systems of care. By using validated measurement instruments a standardized approach to assessing collaboration is possible, otherwise it is only an assumption that collaboration is occurring in any healthcare setting. The objective of this review was to evaluate and compare measurement properties of instruments that measure collaboration within healthcare settings, specifically those which have been psychometrically tested and validated. Participants could be healthcare professionals, the patient or any non-professional who contributes to a patient's care, for example, family members, chaplains or orderlies. The term participant type means the designation of any one participant; for example 'nurse', 'social worker' or 'administrator'. More than two participant types was mandatory. The focus of this review was the validity of tools used to measure collaboration within healthcare settings. The types of studies considered for inclusion were validation studies, but quantitative study designs such as randomized controlled trials, controlled trials and case studies were also eligible for inclusion. Studies that focused on Interprofessional Education, were published as an abstract only, contained patient self-reporting only or were not about care delivery were excluded. The outcome of interest was validation and interpretability of the instrument being assessed and included content validity, construct validity and reliability. Interpretability is characterized by statistics such as mean and standard deviation which can be translated to a qualitative meaning. The search strategy aimed to find both published and unpublished studies. A three-step search strategy was utilized in this review. The databases searched included PubMed, CINAHL, Embase, Cochrane Central Register of Controlled Trials, Emerald Fulltext, MD Consult Australia, PsycARTICLES, Psychology and Behavioural Sciences Collection, PsycINFO, Informit Health Databases, Scopus, UpToDate and Web of Science. The search for unpublished studies included EThOS (Electronic Thesis Online Service), Index to Theses and ProQuest- Dissertations and Theses. The assessment of methodological quality of the included studies was undertaken using the COSMIN checklist which is a validated tool that assesses the process of design and validation of healthcare measurement instruments. An Excel spreadsheet version of COSMIN was developed for data collection which included a worksheet for extracting participant characteristics and interpretability data. Statistical pooling of data was not possible for this review. Therefore, the findings are presented in a narrative form including tables and figures to aid in data presentation. To make a synthesis of the assessments of methodological quality of the different studies, each instrument was rated by accounting for the number of studies performed with an instrument, the appraisal of methodological quality and the consistency of results between studies. Twenty-one studies of 12 instruments were included in the review. The studies were diverse in their theoretical underpinnings, target population/setting and measurement objectives. Measurement objectives included: investigating beliefs, behaviors, attitudes, perceptions and relationships associated with collaboration; measuring collaboration between different levels of care or within a multi-rater/target group; assessing collaboration across teams; or assessing internal participation of both teams and patients.Studies produced validity or interpretability data but none of the studies assessed all validity and reliability properties. However, most of the included studies produced a factor structure or referred to prior factor analysis. A narrative synthesis of the individual study factor structures was generated consisting of nine headings: organizational settings, support structures, purpose and goals; communication; reflection on process; cooperation; coordination; role interdependence and partnership; relationships; newly created professional activities; and professional flexibility. Among the many instruments that measure collaboration within healthcare settings, the quality of each instrument varies; instruments are designed for specific populations and purposes, and are validated in various settings. Selecting an instrument requires careful consideration of the qualities of each. Therefore, referring to systematic reviews of measurement properties of instruments may be helpful to clinicians or researchers in instrument selection. Systematic reviews of measurement properties of instruments are valuable in aiding in instrument selection. This systematic review may be useful in instrument selection for the measurement of collaboration within healthcare settings with a complex mix of participant types. Evaluating collaboration provides important information on the strengths and limitations of different healthcare settings and the opportunities for continuous improvement via any remedial actions initiated. Development of a tool that can be used to measure collaboration within teams of healthcare professionals and non-professionals is important for practice. The use of different statistical modelling techniques, such as Item Response Theory modelling and the translation of models into Computer Adaptive Tests, may prove useful. Measurement equivalence is an important consideration for future instrument development and validation. Further development of the COSMIN tool should include appraisal for measurement equivalence. Researchers developing and validating measurement tools should consider multi-method research designs.
Analysis Of The IJCNN 2011 UTL Challenge

DTIC Science & Technology

2012-01-13

large datasets from various application domains: handwriting recognition, image recognition, video processing, text processing, and ecology. The goal...validation and final evaluation sets consist of 4096 examples each. Dataset Domain Features Sparsity Devel. Transf. AVICENNA Handwriting 120 0% 150205...documents [3]. Transfer learning methods could accelerate the application of handwriting recognizers to historical manuscript by reducing the need for
Maternal sensitivity and attachment security in Thailand: cross-cultural validation of Western measures.

PubMed

Chaimongkol, Nujjaree N; Flick, Louise H

2006-01-01

The purpose of this study was to examine the psychometric properties of Thai versions of the Maternal Behavior Q-Sort (MBQS), Caldwell's HOME, and the Attachment Q-set (AQS). A sample of 110 Thai mother-infant dyads were studied. The Content Validity Index (CVIs) of the Thai MBQS, HOME and AQS were between 91% and 99%. Internal consistency of the HOME was .71. Interobserver reliability of the MBQS, HOME, and AQS were .95, .87, and .87, respectively. Convergent validity was supported by finding a positive correlation between the MBQS and the HOME (r = .29, p < .001). A positive correlation of .45 (p < .001) between the scores of the MBQS and the AQS indicated concurrent validity of these scales. Study findings indicate the Thai MBQS, HOME, and AQS are reliable and valid in this Thai sample and suggest that the Thai versions reflect concepts similar to those in the original English versions.
Development and validation of Australian aphasia rehabilitation best practice statements using the RAND/UCLA appropriateness method

PubMed Central

Power, Emma; Thomas, Emma; Worrall, Linda; Rose, Miranda; Togher, Leanne; Nickels, Lyndsey; Hersh, Deborah; Godecke, Erin; O'Halloran, Robyn; Lamont, Sue; O'Connor, Claire; Clarke, Kim

2015-01-01

Objectives To develop and validate a national set of best practice statements for use in post-stroke aphasia rehabilitation. Design Literature review and statement validation using the RAND/UCLA Appropriateness Method (RAM). Participants A national Community of Practice of over 250 speech pathologists, researchers, consumers and policymakers developed a framework consisting of eight areas of care in aphasia rehabilitation. This framework provided the structure for the development of a care pathway containing aphasia rehabilitation best practice statements. Nine speech pathologists with expertise in aphasia rehabilitation participated in two rounds of RAND/UCLA appropriateness ratings of the statements. Panellists consisted of researchers, service managers, clinicians and policymakers. Main outcome measures Statements that achieved a high level of agreement and an overall median score of 7–9 on a nine-point scale were rated as ‘appropriate’. Results 74 best practice statements were extracted from the literature and rated across eight areas of care (eg, receiving the right referrals, providing intervention). At the end of Round 1, 71 of the 74 statements were rated as appropriate, no statements were rated as inappropriate, and three statements were rated as uncertain. All 74 statements were then rated again in the face-to-face second round. 16 statements were added through splitting existing items or adding new statements. Seven statements were deleted leaving 83 statements. Agreement was reached for 82 of the final 83 statements. Conclusions This national set of 82 best practice statements across eight care areas for the rehabilitation of people with aphasia is the first to be validated by an expert panel. These statements form a crucial component of the Australian Aphasia Rehabilitation Pathway (AARP) (http://www.aphasiapathway.com.au) and provide the basis for more consistent implementation of evidence-based practice in stroke rehabilitation. PMID:26137883
Development and validation of Australian aphasia rehabilitation best practice statements using the RAND/UCLA appropriateness method.

PubMed

Power, Emma; Thomas, Emma; Worrall, Linda; Rose, Miranda; Togher, Leanne; Nickels, Lyndsey; Hersh, Deborah; Godecke, Erin; O'Halloran, Robyn; Lamont, Sue; O'Connor, Claire; Clarke, Kim

2015-07-02

To develop and validate a national set of best practice statements for use in post-stroke aphasia rehabilitation. Literature review and statement validation using the RAND/UCLA Appropriateness Method (RAM). A national Community of Practice of over 250 speech pathologists, researchers, consumers and policymakers developed a framework consisting of eight areas of care in aphasia rehabilitation. This framework provided the structure for the development of a care pathway containing aphasia rehabilitation best practice statements. Nine speech pathologists with expertise in aphasia rehabilitation participated in two rounds of RAND/UCLA appropriateness ratings of the statements. Panellists consisted of researchers, service managers, clinicians and policymakers. Statements that achieved a high level of agreement and an overall median score of 7-9 on a nine-point scale were rated as 'appropriate'. 74 best practice statements were extracted from the literature and rated across eight areas of care (eg, receiving the right referrals, providing intervention). At the end of Round 1, 71 of the 74 statements were rated as appropriate, no statements were rated as inappropriate, and three statements were rated as uncertain. All 74 statements were then rated again in the face-to-face second round. 16 statements were added through splitting existing items or adding new statements. Seven statements were deleted leaving 83 statements. Agreement was reached for 82 of the final 83 statements. This national set of 82 best practice statements across eight care areas for the rehabilitation of people with aphasia is the first to be validated by an expert panel. These statements form a crucial component of the Australian Aphasia Rehabilitation Pathway (AARP) (http://www.aphasiapathway.com.au) and provide the basis for more consistent implementation of evidence-based practice in stroke rehabilitation. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions.
Empirical gradient threshold technique for automated segmentation across image modalities and cell lines.

PubMed

Chalfoun, J; Majurski, M; Peskin, A; Breen, C; Bajcsy, P; Brady, M

2015-10-01

New microscopy technologies are enabling image acquisition of terabyte-sized data sets consisting of hundreds of thousands of images. In order to retrieve and analyze the biological information in these large data sets, segmentation is needed to detect the regions containing cells or cell colonies. Our work with hundreds of large images (each 21,000×21,000 pixels) requires a segmentation method that: (1) yields high segmentation accuracy, (2) is applicable to multiple cell lines with various densities of cells and cell colonies, and several imaging modalities, (3) can process large data sets in a timely manner, (4) has a low memory footprint and (5) has a small number of user-set parameters that do not require adjustment during the segmentation of large image sets. None of the currently available segmentation methods meet all these requirements. Segmentation based on image gradient thresholding is fast and has a low memory footprint. However, existing techniques that automate the selection of the gradient image threshold do not work across image modalities, multiple cell lines, and a wide range of foreground/background densities (requirement 2) and all failed the requirement for robust parameters that do not require re-adjustment with time (requirement 5). We present a novel and empirically derived image gradient threshold selection method for separating foreground and background pixels in an image that meets all the requirements listed above. We quantify the difference between our approach and existing ones in terms of accuracy, execution speed, memory usage and number of adjustable parameters on a reference data set. This reference data set consists of 501 validation images with manually determined segmentations and image sizes ranging from 0.36 Megapixels to 850 Megapixels. It includes four different cell lines and two image modalities: phase contrast and fluorescent. Our new technique, called Empirical Gradient Threshold (EGT), is derived from this reference data set with a 10-fold cross-validation method. EGT segments cells or colonies with resulting Dice accuracy index measurements above 0.92 for all cross-validation data sets. EGT results has also been visually verified on a much larger data set that includes bright field and Differential Interference Contrast (DIC) images, 16 cell lines and 61 time-sequence data sets, for a total of 17,479 images. This method is implemented as an open-source plugin to ImageJ as well as a standalone executable that can be downloaded from the following link: https://isg.nist.gov/. © 2015 The Authors Journal of Microscopy © 2015 Royal Microscopical Society.
Quick screening tool for patients with severe negative emotional reactions to chronic illness: psychometric study of the negative emotions due to chronic illness screening test (NECIS).

PubMed

Huang, Yun-Hsin; Wu, Chih-Hsun; Chen, Hsiu-Jung; Cheng, Yih-Ru; Hung, Fu-Chien; Leung, Kai-Kuan; Lue, Bee-Horng; Chen, Ching-Yu; Chiu, Tai-Yuan; Wu, Yin-Chang

2018-01-16

Severe negative emotional reactions to chronic illness are maladaptive to patients and they need to be addressed in a primary care setting. The psychometric properties of a quick screening tool-the Negative Emotions due to Chronic Illness Screening Test (NECIS)-for general emotional problems among patients with chronic illness being treated in a primary care setting was investigated. Three studies including 375 of patients with chronic illness were used to assess and analyze internal consistency, test-retest reliability, criterion-related validity, a cut-off point for distinguishing maladaptive emotions and clinical application validity of NECIS. Self-report questionnaires were used. Internal consistency (Cronbach's α) ranged from 0.78 to 0.82, and the test-retest reliability was 0.71 (P < 0.001). Criterion-related validity was 0.51 (P < 0.001). Based on the 'severe maladaptation' and 'moderate maladaptation' groups defined by using the 'Worsening due to Chronic Illness' index as the analysis reference, the receiver-operating characteristic curve analysis revealed an area under the curve of 0.81 and 0.82 (ps < 0.001), and a cut-off point of 19/20 was the most satisfactory for distinguishing those with overly negative emotions, with a sensitivity and specificity of 83.3 and 69.0%, and 68.5 and 83.0%, respectively. The clinical application validity analysis revealed that low NECIS group showed significantly better adaptation to chronic illness on the scales of subjective health, general satisfaction with life, self-efficacy of self-care for disease, illness perception and stressors in everyday life. The NECIS has satisfactory psychometric properties for use in the primary care setting. © The Author 2017. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Cross-cultural validation and psychometric evaluation of the Participation and Environment Measure for Children and Youth in Korea.

PubMed

Jeong, Yunwha; Law, Mary; Stratford, Paul; DeMatteo, Carol; Kim, Hwan

2016-11-01

To develop the Korean version of the Participation and Environment Measure for Children and Youth (KPEM-CY) and examine its psychometric properties. The PEM-CY was cross-culturally translated into Korean using a specific guideline: pre-review of participation items, forward/backward translation, expert committee review, pre-test of the KPEM-CY and final review. To establish internal consistency, test-retest reliability and construct validity of the KPEM-CY, 80 parents of children with disabilities aged 5-13 years were recruited in South Korea. Across the home, school and community settings, 76% of participation items and 29% of environment items were revised to improve their fit with Korean culture. Internal consistency was moderate to excellent (0.67-0.92) for different summary scores. Test-retest reliability was excellent (>0.75) in the summary scores of participation frequency and extent of involvement across the three settings and moderate to excellent (0.53-0.95) in all summary scores at home. Child's age, type of school and annual income were the factors that significantly influenced specific dimensions of participation and environment across all settings. Results indicated that the KPEM-CY is equivalent to the original PEM-CY and has initial evidence of reliability and validity for use with Korean children with disabilities. Implications for rehabilitation Because 'participation' is a key outcome of the rehabilitation, measuring comprehensive participation of children with disabilities is necessary. The PEM-CY is a parent-report survey measure to assess comprehensive participation of children and youth and environment, which affect their participation, at home, school and in the community. A cross-cultural adaptation process is mandatory to adapt the measurement tool to a new culture or country. The Korean PEM-CY has both reliability and validity and can therefore generate useful clinical data for Korean children with disabilities.
Quantitative parameters of CT texture analysis as potential markersfor early prediction of spontaneous intracranial hemorrhage enlargement.

PubMed

Shen, Qijun; Shan, Yanna; Hu, Zhengyu; Chen, Wenhui; Yang, Bing; Han, Jing; Huang, Yanfang; Xu, Wen; Feng, Zhan

2018-04-30

To objectively quantify intracranial hematoma (ICH) enlargement by analysing the image texture of head CT scans and to provide objective and quantitative imaging parameters for predicting early hematoma enlargement. We retrospectively studied 108 ICH patients with baseline non-contrast computed tomography (NCCT) and 24-h follow-up CT available. Image data were assessed by a chief radiologist and a resident radiologist. Consistency analysis between observers was tested. The patients were divided into training set (75%) and validation set (25%) by stratified sampling. Patients in the training set were dichotomized according to 24-h hematoma expansion ≥ 33%. Using the Laplacian of Gaussian bandpass filter, we chose different anatomical spatial domains ranging from fine texture to coarse texture to obtain a series of derived parameters (mean grayscale intensity, variance, uniformity) in order to quantify and evaluate all data. The parameters were externally validated on validation set. Significant differences were found between the two groups of patients within variance at V 1.0 and in uniformity at U 1.0 , U 1.8 and U 2.5 . The intraclass correlation coefficients for the texture parameters were between 0.67 and 0.99. The area under the ROC curve between the two groups of ICH cases was between 0.77 and 0.92. The accuracy of validation set by CTTA was 0.59-0.85. NCCT texture analysis can objectively quantify the heterogeneity of ICH and independently predict early hematoma enlargement. • Heterogeneity is helpful in predicting ICH enlargement. • CTTA could play an important role in predicting early ICH enlargement. • After filtering, fine texture had the best diagnostic performance. • The histogram-based uniformity parameters can independently predict ICH enlargement. • CTTA is more objective, more comprehensive, more independently operable, than previous methods.
Free kick instead of cross-validation in maximum-likelihood refinement of macromolecular crystal structures

DOE Office of Scientific and Technical Information (OSTI.GOV)

Pražnikar, Jure; University of Primorska,; Turk, Dušan, E-mail: dusan.turk@ijs.si

2014-12-01

The maximum-likelihood free-kick target, which calculates model error estimates from the work set and a randomly displaced model, proved superior in the accuracy and consistency of refinement of crystal structures compared with the maximum-likelihood cross-validation target, which calculates error estimates from the test set and the unperturbed model. The refinement of a molecular model is a computational procedure by which the atomic model is fitted to the diffraction data. The commonly used target in the refinement of macromolecular structures is the maximum-likelihood (ML) function, which relies on the assessment of model errors. The current ML functions rely on cross-validation. Theymore » utilize phase-error estimates that are calculated from a small fraction of diffraction data, called the test set, that are not used to fit the model. An approach has been developed that uses the work set to calculate the phase-error estimates in the ML refinement from simulating the model errors via the random displacement of atomic coordinates. It is called ML free-kick refinement as it uses the ML formulation of the target function and is based on the idea of freeing the model from the model bias imposed by the chemical energy restraints used in refinement. This approach for the calculation of error estimates is superior to the cross-validation approach: it reduces the phase error and increases the accuracy of molecular models, is more robust, provides clearer maps and may use a smaller portion of data for the test set for the calculation of R{sub free} or may leave it out completely.« less
Reliability, validity, and responsiveness of the Persian version of Shoulder Activity Scale in a group of patients with shoulder disorders.

PubMed

Negahban, Hossein; Mohtasebi, Elham; Goharpey, Shahin

2015-01-01

The aim of this methodological study was to cross-culturally translate the Shoulder Activity Scale (SAS) into the Persian and determine its clinimetric properties including reliability, validity, and responsiveness in patients with shoulder disorders. Persian version of the SAS was obtained after standard forward-backward translation. Three questionnaires were completed by the respondents: SAS, shoulder pain and disability index (SPADI), and Short-Form 36 Health Survey (SF-36). The patients completed the SAS, 1 week after the first visit to evaluate the test-retest reliability. Construct validity was evaluated by examining the associations between the scores on the SAS and the scores obtained from the SPADI, SF-36, and age of the patients. To assess responsiveness, data were collected in the first visit and then again after 4 weeks physiotherapy intervention. Test-retest reliability and internal consistency were assessed using Intra-class Correlation Coefficient (ICC) and Cronbach's alpha, respectively. To evaluate construct validity, Spearman's rank correlation was used. The ability of the SAS to detect changes was evaluated by the receiver-operating characteristics method. No problem or language difficulties were reported during translation process. Test-retest reliability of the SAS was excellent with an ICC of 0.98. Also, the marginal Cronbach's alpha level of 0.64 was obtained. The correlation between the SAS and the SPADI was low, proving divergent validity, whereas the correlations between the SAS and the SF-36/age were moderate proving convergent validity. A marginally acceptable responsiveness was achieved for the Persian SAS. The study provides some evidences to support the test-retest reliability, internal consistency, construct validity, and responsiveness of the Persian version of the SAS in patients with shoulder disorders. Therefore, it seems that this instrument is a useful measure of shoulder activity level in research setting and clinical practice. The shoulder activity scale (SAS) is a reliable, valid, and responsive measure of shoulder activity level in Persian-speaking patients with different shoulder disorders. The results on clinimetric properties of the Persian SAS are comparable with its original, English version. Persian version of the SAS can be used in "clinical" and "research" settings of patients with shoulder disorders.
Validation of the modified Parenting Strategies for Eating and Physical Activity Scale-Diet (PEAS-Diet) in Latino children.

PubMed

Soto, Sandra C; Arredondo, Elva M; Horton, Lucy A; Ayala, Guadalupe X

2016-03-01

Research shows that Latino parenting practices influence children's dietary and weight outcomes. Most studies use parent-reported data, however data from children may provide additional insight into how parents influence their children's diet and weight outcomes. The Parenting Strategies for Eating and Activity Scale (PEAS) has been validated in Latino adults, but not in children. This study evaluated the factor structure and concurrent and predictive validity of a modified version of the PEAS (PEAS-Diet) among Latino children. Data were collected from 361 children ages 7-13 from Imperial County, California, enrolled in a randomized controlled trial to promote healthy eating. The PEAS-Diet included 25 candidate items targeting six parenting practices pertaining to children's eating behaviors: (a) monitoring; (b) disciplining; (c) control; (d) permissiveness; (e) reinforcing; and (f) limit-setting. Children were on average ten years old (±2), 50% boys, 93% self-identified as Latino, 81% were US-born, and 55% completed English versus Spanish-language interviews. Using varimax rotation on baseline data with the total sample, six items were removed due to factor loadings <.40 and/or cross-loading (>.32 on more than one component). Parallel analysis and interpretability suggested a 5-factor solution explaining 59.46% of the variance. The subscale "limit-setting" was removed from the scale. The final scale consisted of 19 items and 5 subscales. Internal consistency of the subscales ranged from α = .63-.82. Confirmatory factor analyses provided additional evidence for the 5-factor scale using data collected 4 and 6 months post-baseline among the control group (n = 164, n = 161, respectively). Concurrent validity with dietary intake was established for monitoring, control, permissiveness, and reinforcing subscales in the expected directions. Predictive validity was not established. Results indicated that with the reported changes, the interview-administered PEAS-Diet is valid among Latino children aged 7-13 years. Copyright © 2015 Elsevier Ltd. All rights reserved.
Inverse consistent non-rigid image registration based on robust point set matching

PubMed Central

2014-01-01

Background Robust point matching (RPM) has been extensively used in non-rigid registration of images to robustly register two sets of image points. However, except for the location at control points, RPM cannot estimate the consistent correspondence between two images because RPM is a unidirectional image matching approach. Therefore, it is an important issue to make an improvement in image registration based on RPM. Methods In our work, a consistent image registration approach based on the point sets matching is proposed to incorporate the property of inverse consistency and improve registration accuracy. Instead of only estimating the forward transformation between the source point sets and the target point sets in state-of-the-art RPM algorithms, the forward and backward transformations between two point sets are estimated concurrently in our algorithm. The inverse consistency constraints are introduced to the cost function of RPM and the fuzzy correspondences between two point sets are estimated based on both the forward and backward transformations simultaneously. A modified consistent landmark thin-plate spline registration is discussed in detail to find the forward and backward transformations during the optimization of RPM. The similarity of image content is also incorporated into point matching in order to improve image matching. Results Synthetic data sets, medical images are employed to demonstrate and validate the performance of our approach. The inverse consistent errors of our algorithm are smaller than RPM. Especially, the topology of transformations is preserved well for our algorithm for the large deformation between point sets. Moreover, the distance errors of our algorithm are similar to that of RPM, and they maintain a downward trend as whole, which demonstrates the convergence of our algorithm. The registration errors for image registrations are evaluated also. Again, our algorithm achieves the lower registration errors in same iteration number. The determinant of the Jacobian matrix of the deformation field is used to analyse the smoothness of the forward and backward transformations. The forward and backward transformations estimated by our algorithm are smooth for small deformation. For registration of lung slices and individual brain slices, large or small determinant of the Jacobian matrix of the deformation fields are observed. Conclusions Results indicate the improvement of the proposed algorithm in bi-directional image registration and the decrease of the inverse consistent errors of the forward and the reverse transformations between two images. PMID:25559889
A general contact mechanical formulation of multilayered structures and its application to deconvolute thickness/mechanical properties of glue used in surface force apparatus.

PubMed

Math, Souvik; Horn, Roger; Jayaram, Vikram; Biswas, Sanjay Kumar

2007-04-15

Currently data obtained from surface force apparatus experiments are convoluted with the mechanical response of glue of unknown thickness, used to bond mica sheets to the substrates. This paper describes a formulation to precisely deconvolute out the forces between the mica sheets by determining the thickness of glue, knowing the mechanical properties of the glue. The formulation consists of a general solution based on the noniterative Hankel transform of the Laplace equation. The generality is achieved by treating all the layers except the one in contact as an effective lumped system consisting of a set of springs in series, where each spring represents a layer. The solution is validated by nanoindentation of trilayer systems consisting of layers with widely diverse mechanical properties, some differing from each other by three orders of magnitude. SFA experiments are done with carefully metered slabs of glue. The proposed method is validated by comparing the actual glue thicknesses with those determined using the present analysis.
Validation of the Italian version of the dissociative experience scale for adolescents and young adults.

PubMed

De Pasquale, Concetta; Sciacca, Federica; Hichy, Zira

2016-01-01

The Dissociative Experience Scale for adolescent (A-DES), a 30-item, multidimensional, self-administered questionnaire, was validated using a large sample of American young people sample. We reported the linguistic validation process and the metric validity of the Italian version of A-DES in the Italy. A set of questionnaires was provided to a total of 633 participants from March 2015 to April 2016. The participants consisted of 282 boys and 351 girls, and their average age was between 18 and 24 years old. The translation process consisted of two consecutive steps: forward-backward translation and acceptability testing. The psychometric testing was applied to Italian students who were recruited from the Italian Public Schools and Universities in Sicily. Informed consent was obtained from all participants at the research. All individuals completed the A-DES. Reliability and validity were tested. The translated version was validated on a total of 633 Italian students. The reliability of A-DES total is .926. It is composed by 4 subscales: Dissociative amnesia, Absorption and imaginative involvement, Depersonalization and derealization, and Passive influence. The reliability of each subscale is: .756 for dissociative amnesia, .659 for absorption and imaginative involvement, .850 for depersonalization and derealization, and .743 for passive influence. The Italian version of the A-DES constitutes a useful instrument to measure dissociative experience in adolescents and young adults in Italy.
Development, pilot testing and psychometric validation of a short version of the coronary artery disease education questionnaire: The CADE-Q SV.

PubMed

Ghisi, Gabriela Lima de Melo; Sandison, Nicole; Oh, Paul

2016-03-01

To develop, pilot test and psychometrically validate a shorter version of the coronary artery disease education questionnaire (CADE-Q), called CADE-Q SV. Based on previous versions of the CADE-Q, cardiac rehabilitation (CR) experts developed 20 items divided into 5 knowledge domains to comprise the first version of the CADE-Q SV. To establish content validity, they were reviewed by an expert panel (N=12). Refined items were pilot-tested in 20 patients, in which clarity was provided. A final version was generated and psychometrically-tested in 132CR patients. Test-retest reliability was assessed via the intraclass correlation coefficient (ICC), the internal consistency using Cronbach's alpha, and criterion validity with regard to patients' education and duration in CR. All ICC coefficients meet the minimum recommended standard. All domains were considered internally consistent (α>0.7). Criterion validity was supported by significant differences in mean scores by educational level (p<0.01) and duration in CR (p<0.05). Knowledge about exercise and nutrition was higher than knowledge about medical condition. The CADE-Q SV was demonstrated to have good reliability and validity. This is a short, quick and appropriate tool for application in clinical and research settings, assessing patients' knowledge during CR and as part of education programming. Copyright © 2015. Published by Elsevier Ireland Ltd.
Development and psychometric validation of a scale to assess information needs in cardiac rehabilitation: the INCR Tool.

PubMed

Ghisi, Gabriela Lima de Melo; Grace, Sherry L; Thomas, Scott; Evans, Michael F; Oh, Paul

2013-06-01

To develop and psychometrically validate a tool to assess information needs in cardiac rehabilitation (CR) patients. After a literature search, 60 information items divided into 11 areas of needs were identified. To establish content validity, they were reviewed by an expert panel (N=10). Refined items were pilot-tested in 34 patients on a 5-point Likert-scale from 1 "really not helpful" to 5 "very important". A final version was generated and psychometrically tested in 203 CR patients. Test-retest reliability was assessed via the intraclass correlation coefficient (ICC), the internal consistency using Cronbach's alpha, and criterion validity was assessed with regard to patient's education and duration in CR. Five items were excluded after ICC analysis as well as one area of needs. All 10 areas were considered internally consistent (Cronbach's alpha>0.7). Criterion validity was supported by significant differences in mean scores by educational level (p<0.05) and duration in CR (p<0.001). The mean total score was 4.08 ± 0.53. Patients rated safety as their greatest information need. The INCR Tool was demonstrated to have good reliability and validity. This is an appropriate tool for application in clinical and research settings, assessing patients' needs during CR and as part of education programming. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.

Improving Escalation of Care: Development and Validation of the Quality of Information Transfer Tool.

PubMed

Johnston, Maximilian J; Arora, Sonal; Pucher, Philip H; Reissis, Yannis; Hull, Louise; Huddy, Jeremy R; King, Dominic; Darzi, Ara

2016-03-01

To develop and provide validity and feasibility evidence for the QUality of Information Transfer (QUIT) tool. Prompt escalation of care in the setting of patient deterioration can prevent further harm. Escalation and information transfer skills are not currently measured in surgery. This study comprised 3 phases: the development (phase 1), validation (phase 2), and feasibility analysis (phase 3) of the QUIT tool. Phase 1 involved identification of core skills needed for successful escalation of care through literature review and 33 semistructured interviews with stakeholders. Phase 2 involved the generation of validity evidence for the tool using a simulated setting. Thirty surgeons assessed a deteriorating postoperative patient in a simulated ward and escalated their care to a senior colleague. The face and content validity were assessed using a survey. Construct and concurrent validity of the tool were determined by comparing performance scores using the QUIT tool with those measured using the Situation-Background-Assessment-Recommendation (SBAR) tool. Phase 3 was conducted using direct observation of escalation scenarios on surgical wards in 2 hospitals. A 7-category assessment tool was developed from phase 1 consisting of 24 items. Twenty-one of 24 items had excellent content validity (content validity index >0.8). All 7 categories and 18 of 24 (P < 0.05) items demonstrated construct validity. The correlation between the QUIT and SBAR tools used was strong indicating concurrent validity (r = 0.694, P < 0.001). Real-time scoring of escalation referrals was feasible and indicated that doctors currently have better information transfer skills than nurses when faced with a deteriorating patient. A validated tool to assess information transfer for deteriorating surgical patients was developed and tested using simulation and real-time clinical scenarios. It may improve the quality and safety of patient care on the surgical ward.
CODEMamb - an observational communication behavior assessment tool for use in ambulatory dementia care.

PubMed

Knebel, Maren; Haberstroh, Julia; Kümmel, Anne; Pantel, Johannes; Schröder, Johannes

2016-12-01

Communication improves well-being and quality of life for both people with dementia and their professional and family caregivers. Individualized communication, as required in informed consent procedures and psychosocial interventions, can improve quality of life, especially in ambulatory settings. However, few valid and reliable instruments exist that enable communication to be assessed and communication and behavioral resources to be identified. We, therefore, extended and adapted the newly developed observational instrument CODEM for use in ambulatory settings (CODEM amb ). Reliability and validity of the new instrument were studied in a total of 171 patients, whereby principal component analysis revealed three important factors: relationship aspects, verbal communication behavior and nonverbal communication behavior. CODEM amb [Formula: see text]s internal consistency, interrater and retest reliability were satisfactory to excellent. Convergent validity indices, as shown by examining correlations with similar but not identical constructs (CERAD-NP verbal subscales), were medium-high, while the divergent validity index (constructional praxis) was relatively low. The relationship to peer-rating remained nonsignificant. Criterion validity was investigated in groups of patients in accordance with their cognitive status. As expected, verbal communication abilities deteriorate faster than the relationship aspects of communication as the disease progresses. In summary, CODEM amb is a reliable and valid instrument that can be used to collect important information with the ultimate aim of supporting communication with people with dementia.
Development and validation of brief scales to measure emotional and behavioural problems among Chinese adolescents

PubMed Central

Shen, Minxue; Hu, Ming; Sun, Zhenqiu

2017-01-01

Objectives To develop and validate brief scales to measure common emotional and behavioural problems among adolescents in the examination-oriented education system and collectivistic culture of China. Setting Middle schools in Hunan province. Participants 5442 middle school students aged 11–19 years were sampled. 4727 valid questionnaires were collected and used for validation of the scales. The final sample included 2408 boys and 2319 girls. Primary and secondary outcome measures The tools were assessed by the item response theory, classical test theory (reliability and construct validity) and differential item functioning. Results Four scales to measure anxiety, depression, study problem and sociality problem were established. Exploratory factor analysis showed that each scale had two solutions. Confirmatory factor analysis showed acceptable to good model fit for each scale. Internal consistency and test–retest reliability of all scales were above 0.7. Item response theory showed that all items had acceptable discrimination parameters and most items had appropriate difficulty parameters. 10 items demonstrated differential item functioning with respect to gender. Conclusions Four brief scales were developed and validated among adolescents in middle schools of China. The scales have good psychometric properties with minor differential item functioning. They can be used in middle school settings, and will help school officials to assess the students’ emotional/behavioural problems. PMID:28062469
Prospective evaluation of 64 serum autoantibodies as biomarkers for early detection of colorectal cancer in a true screening setting

PubMed Central

Chen, Hongda; Werner, Simone; Butt, Julia; Zörnig, Inka; Knebel, Phillip; Michel, Angelika; Eichmüller, Stefan B.; Jäger, Dirk; Waterboer, Tim; Pawlita, Michael; Brenner, Hermann

2016-01-01

Novel blood-based screening tests are strongly desirable for early detection of colorectal cancer (CRC). We aimed to identify and evaluate autoantibodies against tumor-associated antigens as biomarkers for early detection of CRC. 380 clinically identified CRC patients and samples of participants with selected findings from a cohort of screening colonoscopy participants in 2005–2013 (N=6826) were included in this analysis. Sixty-four serum autoantibody markers were measured by multiplex bead-based serological assays. A two-step approach with selection of biomarkers in a training set, and validation of findings in a validation set, the latter exclusively including participants from the screening setting, was applied. Anti-MAGEA4 exhibited the highest sensitivity for detecting early stage CRC and advanced adenoma. Multi-marker combinations substantially increased sensitivity at the price of a moderate loss of specificity. Anti-TP53, anti-IMPDH2, anti-MDM2 and anti-MAGEA4 were consistently included in the best-performing 4-, 5-, and 6-marker combinations. This four-marker panel yielded a sensitivity of 26% (95% CI, 13–45%) for early stage CRC at a specificity of 90% (95% CI, 83–94%) in the validation set. Notably, it also detected 20% (95% CI, 13–29%) of advanced adenomas. Taken together, the identified biomarkers could contribute to the development of a useful multi-marker blood-based test for CRC early detection. PMID:26909861
Development and psychometric evaluation of the Professional Practice Environment (PPE) scale.

PubMed

Erickson, Jeanette Ives; Duffy, Mary E; Gibbons, M Patricia; Fitzmaurice, Joan; Ditomassi, Marianne; Jones, Dorothy

2004-01-01

To describe the Professional Practice Environment (PPE) scale, its conceptual development and psychometric evaluation, and its uses in measuring eight characteristics of the professional practice environment in an acute care setting. The 38-item PPE Scale was validated on a sample of 849 professional practice staff at the Massachusetts General Hospital in Boston. Psychometric analysis included: item analysis, principal components analysis (PCA) with varimax rotation and Kaiser normalization, and internal consistency reliability using Cronbach's alpha coefficient. Eight components were shown, confirming the original conceptually derived model's structure and accounting for 61% of explained variance. Cronbach's alpha coefficients for the eight PPE subscales ranged from .78 to .88. Findings showed the 38-item PPE Scale was reliable and valid for use in health outcomes research to examine the professional practice environment of staff working in acute care settings.
Development and validation of the Smartphone Addiction Inventory (SPAI).

PubMed

Lin, Yu-Hsuan; Chang, Li-Ren; Lee, Yang-Han; Tseng, Hsien-Wei; Kuo, Terry B J; Chen, Sue-Huei

2014-01-01

The aim of this study was to develop a self-administered scale based on the special features of smartphone. The reliability and validity of the Smartphone Addiction Inventory (SPAI) was demonstrated. A total of 283 participants were recruited from Dec. 2012 to Jul. 2013 to complete a set of questionnaires, including a 26-item SPAI modified from the Chinese Internet Addiction Scale and phantom vibration and ringing syndrome questionnaire. There were 260 males and 23 females, with ages 22.9 ± 2.0 years. Exploratory factor analysis, internal-consistency test, test-retest, and correlation analysis were conducted to verify the reliability and validity of the SPAI. Correlations between each subscale and phantom vibration and ringing were also explored. Exploratory factor analysis yielded four factors: compulsive behavior, functional impairment, withdrawal and tolerance. Test-retest reliabilities (intraclass correlations = 0.74-0.91) and internal consistency (Cronbach's α = 0.94) were all satisfactory. The four subscales had moderate to high correlations (0.56-0.78), but had no or very low correlation to phantom vibration/ringing syndrome. This study provides evidence that the SPAI is a valid and reliable, self-administered screening tool to investigate smartphone addiction. Phantom vibration and ringing might be independent entities of smartphone addiction.
Validating Dimensions of Psychosis Symptomatology: Neural Correlates and 20-year Outcomes

PubMed Central

Kotov, Roman; Foti, Dan; Li, Kaiqiao; Bromet, Evelyn J.; Hajcak, Greg; Ruggero, Camilo J.

2016-01-01

Heterogeneity of psychosis presents significant challenges for classification. Between two and 12 symptom dimensions have been proposed, and consensus is lacking. The present study sought to identify uniquely informative models by comparing the validity of these alternatives. An epidemiologic cohort of 628 first-admission inpatients with psychosis was interviewed 6 times over two decades and completed an electrophysiological assessment of error processing at year 20. We first analyzed a comprehensive set of 49 symptoms rated by interviewers at baseline, progressively extracting from one to 12 factors. Next, we compared the ability of resulting factor solutions to (a) account for concurrent neural dysfunction and (b) predict 20-year role, social, residential, and global functioning, and life satisfaction. A four-factor model showed incremental validity with all outcomes, and more complex models did not improve explanatory power. The four dimensions—reality distortion, disorganization, inexpressivity, and apathy/asociality—were replicable in 5 follow-ups, internally consistent, stable across assessments, and showed strong discriminant validity. These results reaffirm the value of separating disorganization and reality distortion, are consistent with recent findings distinguishing inexpressivity and apathy/asociality, and suggest that these four dimensions are fundamental to understanding neural abnormalities and long-term outcomes in psychosis. PMID:27819471
Portuguese Version of the Pain Beliefs and Perceptions Inventory: A Multicenter Validation Study.

PubMed

Azevedo, Luís Filipe; Sampaio, Rute; Camila Dias, Cláudia; Romão, José; Lemos, Laurinda; Agualusa, Luís; Vaz-Serra, Sílvia; Patto, Teresa; Costa-Pereira, Altamiro; Castro-Lopes, José Manuel

2017-07-01

We aimed to perform the translation, cultural adaptation, and validation of the Pain Beliefs and Perceptions Inventory (PBPI) for the European Portuguese language and chronic pain population. This is a longitudinal multicenter validation study. A Portuguese version of the PBPI (PBPI-P) was created through a process of translation, back translation, and expert panel evaluation. The PBPI-P was administered to a total of 122 patients from 13 chronic pain clinics in Portugal, at baseline and after 7 days. Internal consistency and test-retest reliability were assessed by Cronbach's alpha (α) and intraclass correlation coefficient (ICC). Construct (convergent and discriminant) validity was assessed based on a set of previously developed theoretical hypotheses about interrelations between the PBPI-P and other measures. Exploratory and confirmatory factor analyses were performed to test the theoretical structure of the PBPI-P. The internal consistency and test-retest reliability coefficients for each respective subscale were α = 0.620 and ICC = 0.801 for mystery; α = 0.744 and ICC = 0.841 for permanence; α = 0.778 and ICC = 0.791 for constancy; and α = 0.764 and ICC = 0.881 for self-blame. Exploratory and confirmatory factor analysis revealed a four-factor structure (performance, constancy, self-blame, and mystery) that explained 63% of the variance. The construct validity of the PBPI-P was shown to be adequate, with more than 90% of the previously defined hypotheses regarding interrelations with other measures confirmed. The PBPI-P has been shown to be adequate and to have excellent reliability, internal consistency, and validity. It may contribute to a better pain assessment and is suitable for research and clinical use. © 2016 World Institute of Pain.
DOE Office of Scientific and Technical Information (OSTI.GOV)

Pritychenko, B.

The precision of double-beta ββ-decay experimental half lives and their uncertainties is reanalyzed. The method of Benford's distributions has been applied to nuclear reaction, structure and decay data sets. First-digit distribution trend for ββ-decay T 2v 1/2 is consistent with large nuclear reaction and structure data sets and provides validation of experimental half-lives. A complementary analysis of the decay uncertainties indicates deficiencies due to small size of statistical samples, and incomplete collection of experimental information. Further experimental and theoretical efforts would lead toward more precise values of-decay half-lives and nuclear matrix elements.
Correlation consistent valence basis sets for use with the Stuttgart-Dresden-Bonn relativistic effective core potentials: The atoms Ga-Kr and In-Xe

NASA Astrophysics Data System (ADS)

Martin, Jan M. L.; Sundermann, Andreas

2001-02-01

We propose large-core correlation-consistent (cc) pseudopotential basis sets for the heavy p-block elements Ga-Kr and In-Xe. The basis sets are of cc-pVTZ and cc-pVQZ quality, and have been optimized for use with the large-core (valence-electrons only) Stuttgart-Dresden-Bonn (SDB) relativistic pseudopotentials. Validation calculations on a variety of third-row and fourth-row diatomics suggest them to be comparable in quality to the all-electron cc-pVTZ and cc-pVQZ basis sets for lighter elements. Especially the SDB-cc-pVQZ basis set in conjunction with a core polarization potential (CPP) yields excellent agreement with experiment for compounds of the later heavy p-block elements. For accurate calculations on Ga (and, to a lesser extent, Ge) compounds, explicit treatment of 13 valence electrons appears to be desirable, while it seems inevitable for In compounds. For Ga and Ge, we propose correlation consistent basis sets extended for (3d) correlation. For accurate calculations on organometallic complexes of interest to homogenous catalysis, we recommend a combination of the standard cc-pVTZ basis set for first- and second-row elements, the presently derived SDB-cc-pVTZ basis set for heavier p-block elements, and for transition metals, the small-core [6s5p3d] Stuttgart-Dresden basis set-relativistic effective core potential combination supplemented by (2f1g) functions with exponents given in the Appendix to the present paper.
Assessment of performance validity in the Stroop Color and Word Test in mild traumatic brain injury patients: a criterion-groups validation design.

PubMed

Guise, Brian J; Thompson, Matthew D; Greve, Kevin W; Bianchini, Kevin J; West, Laura

2014-03-01

The current study assessed performance validity on the Stroop Color and Word Test (Stroop) in mild traumatic brain injury (TBI) using criterion-groups validation. The sample consisted of 77 patients with a reported history of mild TBI. Data from 42 moderate-severe TBI and 75 non-head-injured patients with other clinical diagnoses were also examined. TBI patients were categorized on the basis of Slick, Sherman, and Iverson (1999) criteria for malingered neurocognitive dysfunction (MND). Classification accuracy is reported for three indicators (Word, Color, and Color-Word residual raw scores) from the Stroop across a range of injury severities. With false-positive rates set at approximately 5%, sensitivity was as high as 29%. The clinical implications of these findings are discussed. © 2012 The British Psychological Society.
Reliability and validity of the Euthanasia Attitude Scale (EAS) for Hong Kong medical doctors.

PubMed

Tang, Wai-Kiu; Mak, Kwok-Kei; Kam, Philip Ming-Ho; Ho, Joanna Wing-Kiu; Chan, Denise Che-Ying; Suen, To-Lam; Lau, Michael Chak-Kwan; Cheng, Adrian Ka-Chun; Wan, Yuen-Ting; Wan, Ho-Yan; Hussain, Assad

2010-08-01

This study aimed to examine the reliability and validity of the Euthanasia Attitude Scale (EAS) in Hong Kong medical doctors. A total of 107 medical doctors (61.7% men) participated in a survey at clinical settings in 2008. The 21-item EAS was used to assess their attitudes toward euthanasia. The mean (standard deviation) and median of the EAS were 63.60 (60.31) and 63.00. Total EAS scores correlated well with ''Ethical Considerations,'' ''Practical Considerations,'' and ''Treasuring Life'' (Spearman rho =.37-.96, P < .001) but not ''Naturalistic Beliefs.'' The construct validity of the 3-factor model was appropriate (Kaiser-Meyer-Olkin [KMO] value = 0.90) and showed high internal consistency (Cronbach alpha =.79-.92). Euthanasia Attitude Scale may be a reliable and valid measure for assessing the attitudes toward euthanasia in medical professionals.
Validation of the PedsQL Epilepsy Module: A pediatric epilepsy-specific health-related quality of life measure.

PubMed

Modi, Avani C; Junger, Katherine F; Mara, Constance A; Kellermann, Tanja; Barrett, Lauren; Wagner, Janelle; Mucci, Grace A; Bailey, Laurie; Almane, Dace; Guilfoyle, Shanna M; Urso, Lauryn; Hater, Brooke; Hustzi, Heather; Smith, Gigi; Herrmann, Bruce; Perry, M Scott; Zupanc, Mary; Varni, James W

2017-11-01

To validate a brief and reliable epilepsy-specific, health-related quality of life (HRQOL) measure in children with various seizure types, treatments, and demographic characteristics. This national validation study was conducted across five epilepsy centers in the United States. Youth 5-18 years and caregivers of youth 2-18 years diagnosed with epilepsy completed the PedsQL Epilepsy Module and additional questionnaires to establish reliability and validity of the epilepsy-specific HRQOL instrument. Demographic and medical data were collected through chart reviews. Factor analysis was conducted, and internal consistency (Cronbach's alphas), test-retest reliability, and construct validity were assessed. Questionnaires were analyzed from 430 children with epilepsy (M age = 9.9 years; range 2-18 years; 46% female; 62% white: non-Hispanic; 76% monotherapy, 54% active seizures) and their caregivers. The final PedsQL Epilepsy Module is a 29-item measure with five subscales (i.e., Impact, Cognitive, Sleep, Executive Functioning, and Mood/Behavior) with parallel child and caregiver reports. Internal consistency coefficients ranged from 0.70-0.94. Construct validity and convergence was demonstrated in several ways, including strong relationships with seizure outcomes, antiepileptic drug (AED) side effects, and well-established measures of executive, cognitive, and emotional/behavioral functioning. The PedsQL Epilepsy Module is a reliable measure of HRQOL with strong evidence of its validity across the epilepsy spectrum in both clinical and research settings. Wiley Periodicals, Inc. © 2017 International League Against Epilepsy.
Elders Health Empowerment Scale: Spanish adaptation and psychometric analysis.

PubMed

Serrani Azcurra, Daniel Jorge Luis

2014-01-01

Empowerment refers to patient skills that allow them to become primary decision-makers in control of daily self-management of health problems. As important the concept as it is, particularly for elders with chronic diseases, few available instruments have been validated for use with Spanish speaking people. Translate and adapt the Health Empowerment Scale (HES) for a Spanish-speaking older adults sample and perform its psychometric validation. The HES was adapted based on the Diabetes Empowerment Scale-Short Form. Where "diabetes" was mentioned in the original tool, it was replaced with "health" terms to cover all kinds of conditions that could affect health empowerment. Statistical and Psychometric Analyses were conducted on 648 urban-dwelling seniors. The HES had an acceptable internal consistency with a Cronbach's α of 0.89. The convergent validity was supported by significant Pearson's Coefficient correlations between the HES total and item scores and the General Self Efficacy Scale (r= 0.77), Swedish Rheumatic Disease Empowerment Scale (r= 0.69) and Making Decisions Empowerment Scale (r= 0.70). Construct validity was evaluated using item analysis, half-split test and corrected item to total correlation coefficients; with good internal consistency (α> 0.8). The content validity was supported by Scale and Item Content Validity Index of 0.98 and 1.0, respectively. HES had acceptable face validity and reliability coefficients; which added to its ease administration and users' unbiased comprehension, could set it as a suitable tool in evaluating elder's outpatient empowerment-based medical education programs.
Elders Health Empowerment Scale

PubMed Central

2014-01-01

Introduction: Empowerment refers to patient skills that allow them to become primary decision-makers in control of daily self-management of health problems. As important the concept as it is, particularly for elders with chronic diseases, few available instruments have been validated for use with Spanish speaking people. Objective: Translate and adapt the Health Empowerment Scale (HES) for a Spanish-speaking older adults sample and perform its psychometric validation. Methods: The HES was adapted based on the Diabetes Empowerment Scale-Short Form. Where "diabetes" was mentioned in the original tool, it was replaced with "health" terms to cover all kinds of conditions that could affect health empowerment. Statistical and Psychometric Analyses were conducted on 648 urban-dwelling seniors. Results: The HES had an acceptable internal consistency with a Cronbach's α of 0.89. The convergent validity was supported by significant Pearson's Coefficient correlations between the HES total and item scores and the General Self Efficacy Scale (r= 0.77), Swedish Rheumatic Disease Empowerment Scale (r= 0.69) and Making Decisions Empowerment Scale (r= 0.70). Construct validity was evaluated using item analysis, half-split test and corrected item to total correlation coefficients; with good internal consistency (α> 0.8). The content validity was supported by Scale and Item Content Validity Index of 0.98 and 1.0, respectively. Conclusions: HES had acceptable face validity and reliability coefficients; which added to its ease administration and users' unbiased comprehension, could set it as a suitable tool in evaluating elder's outpatient empowerment-based medical education programs. PMID:25767307
The Smartphone Addiction Scale: Development and Validation of a Short Version for Adolescents

PubMed Central

Kwon, Min; Kim, Dai-Jin; Cho, Hyun; Yang, Soo

2013-01-01

Objective This study was designed to investigate the revised and short version of the smartphone addiction scale and the proof of its validity in adolescents. In addition, it suggested cutting off the values by gender in order to determine smartphone addiction and elaborate the characteristics of smartphone usage in adolescents. Method A set of questionnaires were provided to a total of 540 selected participants from April to May of 2013. The participants consisted of 343 boys and 197 girls, and their average age was 14.5 years old. The content validity was performed on a selection of shortened items, while an internal-consistency test was conducted for the verification of its reliability. The concurrent validity was confirmed using SAS, SAPS and KS-scale. Receiver operating characteristics analysis was conducted to suggest cut-off. Results The 10 final questions were selected using content validity. The internal consistency and concurrent validity of SAS were verified with a Cronbach's alpha of 0.911. The SAS-SV was significantly correlated with the SAS, SAPS and KS-scale. The SAS-SV scores of gender (p<.001) and self-evaluation of smartphone addiction (p<.001) showed significant difference. The ROC analysis results showed an area under a curve (AUC) value of 0.963(0.888–1.000), a cut-off value of 31, sensitivity value of 0.867 and specificity value of 0.893 in boys while an AUC value of 0.947(0.887–1.000), a cut-off value of 33, sensitivity value of 0.875, and a specificity value of 0.886 in girls. Conclusions The SAS-SV showed good reliability and validity for the assessment of smartphone addiction. The smartphone addiction scale short version, which was developed and validated in this study, could be used efficiently for the evaluation of smartphone addiction in community and research areas. PMID:24391787
The smartphone addiction scale: development and validation of a short version for adolescents.

PubMed

Kwon, Min; Kim, Dai-Jin; Cho, Hyun; Yang, Soo

2013-01-01

This study was designed to investigate the revised and short version of the smartphone addiction scale and the proof of its validity in adolescents. In addition, it suggested cutting off the values by gender in order to determine smartphone addiction and elaborate the characteristics of smartphone usage in adolescents. A set of questionnaires were provided to a total of 540 selected participants from April to May of 2013. The participants consisted of 343 boys and 197 girls, and their average age was 14.5 years old. The content validity was performed on a selection of shortened items, while an internal-consistency test was conducted for the verification of its reliability. The concurrent validity was confirmed using SAS, SAPS and KS-scale. Receiver operating characteristics analysis was conducted to suggest cut-off. The 10 final questions were selected using content validity. The internal consistency and concurrent validity of SAS were verified with a Cronbach's alpha of 0.911. The SAS-SV was significantly correlated with the SAS, SAPS and KS-scale. The SAS-SV scores of gender (p<.001) and self-evaluation of smartphone addiction (p<.001) showed significant difference. The ROC analysis results showed an area under a curve (AUC) value of 0.963(0.888-1.000), a cut-off value of 31, sensitivity value of 0.867 and specificity value of 0.893 in boys while an AUC value of 0.947(0.887-1.000), a cut-off value of 33, sensitivity value of 0.875, and a specificity value of 0.886 in girls. The SAS-SV showed good reliability and validity for the assessment of smartphone addiction. The smartphone addiction scale short version, which was developed and validated in this study, could be used efficiently for the evaluation of smartphone addiction in community and research areas.
Psychometric evaluation of 3-set 4P questionnaire.

PubMed

Akerman, Eva; Fridlund, Bengt; Samuelson, Karin; Baigi, Amir; Ersson, Anders

2013-02-01

This is a further development of a specific questionnaire, the 3-set 4P, to be used for measuring former ICU patients' physical and psychosocial problems after intensive care and the need for follow-up. The aim was to psychometrically test and evaluate the 3-set 4P questionnaire in a larger population. The questionnaire consists of three sets: "physical", "psychosocial" and "follow-up". The questionnaires were sent by mail to all patients with more than 24-hour length of stay on four ICUs in Sweden. Construct validity was measured with exploratory factor analysis with Varimax rotation. This resulted in three factors for the "physical set", five factors for the "psychosocial set" and four factors for the "follow-up set" with strong factor loadings and a total explained variance of 62-77.5%. Thirteen questions in the SF-36 were used for concurrent validity showing Spearman's r(s) 0.3-0.6 in eight questions and less than 0.2 in five. Test-retest was used for stability reliability. In set follow-up the correlation was strong to moderate and in physical and psychosocial sets the correlations were moderate to fair. This may have been because the physical and psychosocial status changed rapidly during the test period. All three sets had good homogeneity. In conclusion, the 3-set 4P showed overall acceptable results, but it has to be further modified in different cultures before being considered a fully operational instrument for use in clinical practice. Copyright © 2012 Elsevier Ltd. All rights reserved.
Reliability and Validity of Instruments for Assessing Perinatal Depression in African Settings: Systematic Review and Meta-Analysis

PubMed Central

Tsai, Alexander C.; Scott, Jennifer A.; Hung, Kristin J.; Zhu, Jennifer Q.; Matthews, Lynn T.; Psaros, Christina; Tomlinson, Mark

2013-01-01

Background A major barrier to improving perinatal mental health in Africa is the lack of locally validated tools for identifying probable cases of perinatal depression or for measuring changes in depression symptom severity. We systematically reviewed the evidence on the reliability and validity of instruments to assess perinatal depression in African settings. Methods and Findings Of 1,027 records identified through searching 7 electronic databases, we reviewed 126 full-text reports. We included 25 unique studies, which were disseminated in 26 journal articles and 1 doctoral dissertation. These enrolled 12,544 women living in nine different North and sub-Saharan African countries. Only three studies (12%) used instruments developed specifically for use in a given cultural setting. Most studies provided evidence of criterion-related validity (20 [80%]) or reliability (15 [60%]), while fewer studies provided evidence of construct validity, content validity, or internal structure. The Edinburgh postnatal depression scale (EPDS), assessed in 16 studies (64%), was the most frequently used instrument in our sample. Ten studies estimated the internal consistency of the EPDS (median estimated coefficient alpha, 0.84; interquartile range, 0.71-0.87). For the 14 studies that estimated sensitivity and specificity for the EPDS, we constructed 2 x 2 tables for each cut-off score. Using a bivariate random-effects model, we estimated a pooled sensitivity of 0.94 (95% confidence interval [CI], 0.68-0.99) and a pooled specificity of 0.77 (95% CI, 0.59-0.88) at a cut-off score of ≥9, with higher cut-off scores yielding greater specificity at the cost of lower sensitivity. Conclusions The EPDS can reliably and validly measure perinatal depression symptom severity or screen for probable postnatal depression in African countries, but more validation studies on other instruments are needed. In addition, more qualitative research is needed to adequately characterize local understandings of perinatal depression-like syndromes in different African contexts. PMID:24340036
Detection of overreported psychopathology with the MMPI-2-RF [corrected] validity scales.

PubMed

Sellbom, Martin; Bagby, R Michael

2010-12-01

We examined the utility of the validity scales on the recently released Minnesota Multiphasic Personality Inventory-2 Restructured Form (MMPI-2 RF; Ben-Porath & Tellegen, 2008) to detect overreported psychopathology. This set of validity scales includes a newly developed scale and revised versions of the original MMPI-2 validity scales. We used an analogue, experimental simulation in which MMPI-2 RF responses (derived from archived MMPI-2 protocols) of undergraduate students instructed to overreport psychopathology (in either a coached or noncoached condition) were compared with those of psychiatric inpatients who completed the MMPI-2 under standardized instructions. The MMPI-2 RF validity scale Infrequent Psychopathology Responses best differentiated the simulation groups from the sample of patients, regardless of experimental condition. No other validity scale added consistent incremental predictive utility to Infrequent Psychopathology Responses in distinguishing the simulation groups from the sample of patients. Classification accuracy statistics confirmed the recommended cut scores in the MMPI-2 RF manual (Ben-Porath & Tellegen, 2008).

Psychometric properties of the Brunel Mood Scale in Chinese adolescents and adults.

PubMed

Zhang, Chun-Qing; Si, Gangyan; Chung, Pak-Kwong; Du, Mengmeng; Terry, Peter C

2014-01-01

Building on the work of Terry and colleagues (Terry, P. C., Lane, A. M., Lane, H. J., & Keohane, L. (1999). Development and validation of a mood measure for adolescents. Journal of Sports Sciences, 17, 861-872; Terry, P. C., Lane, A. M., & Fogarty, G. J. (2003). Construct validity of the Profile of Mood States-Adolescents for use with adults. Psychology of Sport & Exercise, 4, 125-139.), the present study examined the validity and internal consistency reliability of the Chinese version of the Brunel Mood Scale (BRUMS-C) among 2,548 participants, comprising adolescent athletes (n = 520), adult athletes (n = 434), adolescent students (n = 673), and adult students (n = 921). Both adolescent and adult athletes completed the BRUMS-C before, during, or after regular training and both adolescent and adult students completed the BRUMS-C in a classroom setting. Confirmatory factor analyses (CFAs) provided support for the factorial validity of a 23-item six-factor model, with one item removed from the hypothesised measurement model. Internal consistency reliabilities were satisfactory for all subscales across each of the four samples. Criterion validity was supported with strong relationships between the BRUMS-C, abbreviated POMS, and Chinese Affect Scale consistent with theoretical predictions. Multi-sample CFAs showed the BRUMS-C to be invariant at the configural, metric, strong, and structural levels for all samples. Furthermore, latent mean difference analyses showed that athletes reported significantly higher levels of fatigue than students while maintaining almost the same levels of vigour, and adolescent students reported significantly higher levels of depressed mood than the other three samples.
Developmental monitoring using caregiver reports in a resource-limited setting: the case of Kilifi, Kenya

PubMed Central

Abubakar, A; Holding, P; Van de Vijver, F; Bomu, G; Van Baar, A

2010-01-01

Aim: The main aim of the current study was to evaluate the reliability, validity and acceptability of developmental monitoring using caregiver reports among mothers in a rural African setting. Methods: A structured interview for parents of children aged 24 months and less was developed through both participant consultation and a review of literature. The reliability and validity of the schedule was evaluated through a 10-month monitoring programme of 95 children, aged 2–10 months. The acceptability of the process was evaluated by studying retention rates and by organizing focus group discussions with participating mothers. Results: The structured interview ‘Developmental Milestones Checklist’ consisted of 66 items covering three broad domains of child functioning: motor, language and personal–social development. The interview yielded scores of developmental achievements that showed high internal consistency and excellent test–retest reliability. The results were sensitive to maturational changes and nutritional deficiencies. In addition, acceptable retention rates of approximately 80% were found. Participating mothers reported that they found the procedures both acceptable and beneficial. Conclusion: Developmental monitoring using caregiver report is a viable method to identify and monitor at-risk children in Sub-Saharan Africa. PMID:20353499
Cross-cultural adaptation and psychometric testing of the Quality of Dying and Death Questionnaire for the Spanish population.

PubMed

Gutiérrez Sánchez, Daniel; Cuesta-Vargas, Antonio I

2018-04-01

Many measurements have been developed to assess the quality of death (QoD). Among these, the Quality of Dying and Death Questionnaire (QODD) is the most widely studied and best validated. Informal carers and health professionals who care for the patient during their last days of life can complete this assessment tool. The aim of the study is to carry out a cross-cultural adaptation and a psychometric analysis of the QODD for the Spanish population. The translation was performed using a double forward and backward method. An expert panel evaluated the content validity. The questionnaire was tested in a sample of 72 Spanish-speaking adult carers of deceased cancer patients. A psychometric analysis was performed to evaluate internal consistency, divergent criterion-related validity with the Mini-Suffering State Examination (MSSE) and concurrent criterion-related validity with the Palliative Outcome Scale (POS). Some items were deleted and modified to create the Spanish version of the QODD (QODD-ESP-26). The instrument was readable and acceptable. The content validity index was 0.96, suggesting that all items are relevant for the measure of the QoD. This questionnaire showed high internal consistency (Cronbach's α coefficient = 0.88). Divergent validity with MSSE (r = -0.64) and convergent validity with POS (r = -0.61) were also demonstrated. The QODD-ESP-26 is a valid and reliable instrument for the assessment of the QoD of deceased cancer patients that can be used in a clinical and research setting. Copyright © 2018 Elsevier Ltd. All rights reserved.
Observations on CFD Verification and Validation from the AIAA Drag Prediction Workshops

NASA Technical Reports Server (NTRS)

Morrison, Joseph H.; Kleb, Bil; Vassberg, John C.

2014-01-01

The authors provide observations from the AIAA Drag Prediction Workshops that have spanned over a decade and from a recent validation experiment at NASA Langley. These workshops provide an assessment of the predictive capability of forces and moments, focused on drag, for transonic transports. It is very difficult to manage the consistency of results in a workshop setting to perform verification and validation at the scientific level, but it may be sufficient to assess it at the level of practice. Observations thus far: 1) due to simplifications in the workshop test cases, wind tunnel data are not necessarily the “correct” results that CFD should match, 2) an average of core CFD data are not necessarily a better estimate of the true solution as it is merely an average of other solutions and has many coupled sources of variation, 3) outlier solutions should be investigated and understood, and 4) the DPW series does not have the systematic build up and definition on both the computational and experimental side that is required for detailed verification and validation. Several observations regarding the importance of the grid, effects of physical modeling, benefits of open forums, and guidance for validation experiments are discussed. The increased variation in results when predicting regions of flow separation and increased variation due to interaction effects, e.g., fuselage and horizontal tail, point out the need for validation data sets for these important flow phenomena. Experiences with a recent validation experiment at NASA Langley are included to provide guidance on validation experiments.
Development and community-based validation of the IDEA study Instrumental Activities of Daily Living (IDEA-IADL) questionnaire.

PubMed

Collingwood, Cecilia; Paddick, Stella-Maria; Kisoli, Aloyce; Dotchin, Catherine L; Gray, William K; Mbowe, Godfrey; Mkenda, Sarah; Urasa, Sarah; Mushi, Declare; Chaote, Paul; Walker, Richard W

2014-01-01

The dementia diagnosis gap in sub-Saharan Africa (SSA) is large, partly due to difficulties in assessing function, an essential step in diagnosis. As part of the Identification and Intervention for Dementia in Elderly Africans (IDEA) study, to develop, pilot, and validate an Instrumental Activities of Daily Living (IADL) questionnaire for use in a rural Tanzanian population to assist in the identification of people with dementia alongside cognitive screening. The questionnaire was developed at a workshop for rural primary healthcare workers, based on culturally appropriate roles and usual activities of elderly people in this community. It was piloted in 52 individuals under follow-up from a dementia prevalence study. Validation subsequently took place during a community dementia-screening programme. Construct validation against gold standard clinical dementia diagnosis using DSM-IV criteria was carried out on a stratified sample of the cohort and validity assessed using area under the receiver operating characteristic (AUROC) curve analysis. An 11-item questionnaire (IDEA-IADL) was developed after pilot testing. During formal validation on 130 community-dwelling elderly people who presented for screening, the AUROC curve was 0.896 for DSM-IV dementia when used in isolation and 0.937 when used in conjunction with the IDEA cognitive screen, previously validated in Tanzania. The internal consistency was 0.959. Performance on the IDEA-IADL was not biased with regard to age, gender or education level. The IDEA-IADL questionnaire appears to be a useful aid to dementia screening in this setting. Further validation in other healthcare settings in SSA is required.
Translation of the Neck Disability Index and validation of the Greek version in a sample of neck pain patients.

PubMed

Trouli, Marianna N; Vernon, Howard T; Kakavelakis, Kyriakos N; Antonopoulou, Maria D; Paganas, Aristofanis N; Lionis, Christos D

2008-07-22

Neck pain is a highly prevalent condition resulting in major disability. Standard scales for measuring disability in patients with neck pain have a pivotal role in research and clinical settings. The Neck Disability Index (NDI) is a valid and reliable tool, designed to measure disability in activities of daily living due to neck pain. The purpose of our study was the translation and validation of the NDI in a Greek primary care population with neck complaints. The original version of the questionnaire was used. Based on international standards, the translation strategy comprised forward translations, reconciliation, backward translation and pre-testing steps. The validation procedure concerned the exploration of internal consistency (Cronbach alpha), test-retest reliability (Intraclass Correlation Coefficient, Bland and Altman method), construct validity (exploratory factor analysis) and responsiveness (Spearman correlation coefficient, Standard Error of Measurement and Minimal Detectable Change) of the questionnaire. Data quality was also assessed through completeness of data and floor/ceiling effects. The translation procedure resulted in the Greek modified version of the NDI. The latter was culturally adapted through the pre-testing phase. The validation procedure raised a large amount of missing data due to low applicability, which were assessed with two methods. Floor or ceiling effects were not observed. Cronbach alpha was calculated as 0.85, which was interpreted as good internal consistency. Intraclass correlation coefficient was found to be 0.93 (95% CI 0.84-0.97), which was considered as very good test-retest reliability. Factor analysis yielded one factor with Eigenvalue 4.48 explaining 44.77% of variance. The Spearman correlation coefficient (0.3; P = 0.02) revealed some relation between the change score in the NDI and Global Rating of Change (GROC). The SEM and MDC were calculated as 0.64 and 1.78 respectively. The Greek version of the NDI measures disability in patients with neck pain in a reliable, valid and responsive manner. It is considered a useful tool for research and clinical settings in Greek Primary Health Care.
Translation of the Neck Disability Index and validation of the Greek version in a sample of neck pain patients

PubMed Central

Trouli, Marianna N; Vernon, Howard T; Kakavelakis, Kyriakos N; Antonopoulou, Maria D; Paganas, Aristofanis N; Lionis, Christos D

2008-01-01

Background Neck pain is a highly prevalent condition resulting in major disability. Standard scales for measuring disability in patients with neck pain have a pivotal role in research and clinical settings. The Neck Disability Index (NDI) is a valid and reliable tool, designed to measure disability in activities of daily living due to neck pain. The purpose of our study was the translation and validation of the NDI in a Greek primary care population with neck complaints. Methods The original version of the questionnaire was used. Based on international standards, the translation strategy comprised forward translations, reconciliation, backward translation and pre-testing steps. The validation procedure concerned the exploration of internal consistency (Cronbach alpha), test-retest reliability (Intraclass Correlation Coefficient, Bland and Altman method), construct validity (exploratory factor analysis) and responsiveness (Spearman correlation coefficient, Standard Error of Measurement and Minimal Detectable Change) of the questionnaire. Data quality was also assessed through completeness of data and floor/ceiling effects. Results The translation procedure resulted in the Greek modified version of the NDI. The latter was culturally adapted through the pre-testing phase. The validation procedure raised a large amount of missing data due to low applicability, which were assessed with two methods. Floor or ceiling effects were not observed. Cronbach alpha was calculated as 0.85, which was interpreted as good internal consistency. Intraclass correlation coefficient was found to be 0.93 (95% CI 0.84–0.97), which was considered as very good test-retest reliability. Factor analysis yielded one factor with Eigenvalue 4.48 explaining 44.77% of variance. The Spearman correlation coefficient (0.3; P = 0.02) revealed some relation between the change score in the NDI and Global Rating of Change (GROC). The SEM and MDC were calculated as 0.64 and 1.78 respectively. Conclusion The Greek version of the NDI measures disability in patients with neck pain in a reliable, valid and responsive manner. It is considered a useful tool for research and clinical settings in Greek Primary Health Care. PMID:18647393
[Computerized system validation of clinical researches].

PubMed

Yan, Charles; Chen, Feng; Xia, Jia-lai; Zheng, Qing-shan; Liu, Daniel

2015-11-01

Validation is a documented process that provides a high degree of assurance. The computer system does exactly and consistently what it is designed to do in a controlled manner throughout the life. The validation process begins with the system proposal/requirements definition, and continues application and maintenance until system retirement and retention of the e-records based on regulatory rules. The objective to do so is to clearly specify that each application of information technology fulfills its purpose. The computer system validation (CSV) is essential in clinical studies according to the GCP standard, meeting product's pre-determined attributes of the specifications, quality, safety and traceability. This paper describes how to perform the validation process and determine relevant stakeholders within an organization in the light of validation SOPs. Although a specific accountability in the implementation of the validation process might be outsourced, the ultimate responsibility of the CSV remains on the shoulder of the business process owner-sponsor. In order to show that the compliance of the system validation has been properly attained, it is essential to set up comprehensive validation procedures and maintain adequate documentations as well as training records. Quality of the system validation should be controlled using both QC and QA means.
Measuring psychological distress in older Aboriginal and Torres Strait Islanders Australians: a comparison of the K-10 and K-5.

PubMed

McNamara, Bridgette J; Banks, Emily; Gubhaju, Lina; Williamson, Anna; Joshy, Grace; Raphael, Beverley; Eades, Sandra J

2014-12-01

To assess the cross-cultural validity of two Kessler psychological distress scales (K-10 and K-5) by examining their measurement properties among older Aboriginal and Torres Strait Islanders and comparing them to those in non-Aboriginal individuals from NSW Australia. Self-reported questionnaire data from the 45 and Up Study for 1,631 Aboriginal and 231,774 non-Aboriginal people were used to examine the factor structure, convergent validity, internal consistency and levels of missing data of K-10 and K-5. We found excellent agreement in classification of distress of Aboriginal participants by K-10 and K-5 (weighted kappa=0.87), high internal consistency (Cronbach's alpha K-10: 0.93, K-5: 0.88), and factor structures consistent with those for the total Australian population. Convergent validity was evidenced by a strong graded relationship between the level of distress and the odds of: problems with daily activities due to emotional problems; current treatment for depression or anxiety; and poor quality of life. K-10 and K-5 scales are promising tools for measuring psychological distress among Aboriginal and Torres Strait Islanders aged 45 and over in research and clinical settings. © 2014 Public Health Association of Australia.
Spanish translation and validation of four short pelvic floor disorders questionnaires.

PubMed

Treszezamsky, Alejandro D; Karp, Deborah; Dick-Biascoechea, Madeline; Ehsani, Nazanin; Dancz, Christina; Montoya, T Ignacio; Olivera, Cedric K; Smith, Aimee L; Cardenas, Rosa; Fashokun, Tola; Bradley, Catherine S

2013-04-01

Globally, Spanish is the primary language for 329 million people; however, most urogynecologic questionnaires are available in English. We set out to develop valid Spanish translations of the Questionnaire for Urinary Incontinence Diagnosis (QUID), the Three Incontinence Questions (3IQ), and the short Pelvic Floor Distress Inventory (PFDI-20) and Pelvic Floor Impact Questionnaire (PFIQ-7). The TRAPD method (translation, review, adjudication, pretesting, and documentation) was used for translation. Eight native Spanish-speaking translators developed Spanish versions collaboratively. These were pretested with cognitive interviews and revised until optimal. For validation, bilingual patients at seven clinics completed Spanish and English questionnaire versions in randomized order. Participants completed a second set of questionnaires later. The Spanish versions' internal consistency and reliability and Spanish-English agreement were measured using Cronbach's alpha, weighted kappa, and intraclass correlation coefficients. A total of 78 subjects were included; 94.9 % self-identified as Hispanic and 73.1 % spoke Spanish as their primary language. The proportion of per-item missing responses was similar in both languages (median 1.3 %). Internal consistency for Spanish PFDI-20 subscales was acceptable to good and for PFIQ-7 and QUID excellent. Test-retest reliability per item was moderate to near perfect for PFDI-20, substantial to near perfect for PFIQ-7 and 3IQ, and substantial for QUID. Spanish-English agreement for individual items was substantial to near perfect for all questionnaires (kappa range 0.64-0.95) and agreement for PFDI-20, PFIQ-7, and QUID subscales scores was high [intraclass correlation coefficient (ICC) range 0.92-0.99]. We obtained valid Spanish translations of the PFDI-20, PFIQ-7, QUID, and 3IQ. These results support their use as clinical and research assessment tools in Spanish-speaking populations.
A model for flexi-bar to evaluate intervertebral disc and muscle forces in exercises.

PubMed

Abdollahi, Masoud; Nikkhoo, Mohammad; Ashouri, Sajad; Asghari, Mohsen; Parnianpour, Mohamad; Khalaf, Kinda

2016-10-01

This study developed and validated a lumped parameter model for the FLEXI-BAR, a popular training instrument that provides vibration stimulation. The model which can be used in conjunction with musculoskeletal-modeling software for quantitative biomechanical analyses, consists of 3 rigid segments, 2 torsional springs, and 2 torsional dashpots. Two different sets of experiments were conducted to determine the model's key parameters including the stiffness of the springs and the damping ratio of the dashpots. In the first set of experiments, the free vibration of the FLEXI-BAR with an initial displacement at its end was considered, while in the second set, forced oscillations of the bar were studied. The properties of the mechanical elements in the lumped parameter model were derived utilizing a non-linear optimization algorithm which minimized the difference between the model's prediction and the experimental data. The results showed that the model is valid (8% error) and can be used for simulating exercises with the FLEXI-BAR for excitations in the range of the natural frequency. The model was then validated in combination with AnyBody musculoskeletal modeling software, where various lumbar disc, spinal muscles and hand muscles forces were determined during different FLEXI-BAR exercise simulations. Copyright © 2016 IPEM. Published by Elsevier Ltd. All rights reserved.
Guidelines for Developing and Reporting Machine Learning Predictive Models in Biomedical Research: A Multidisciplinary View

PubMed Central

2016-01-01

Background As more and more researchers are turning to big data for new opportunities of biomedical discoveries, machine learning models, as the backbone of big data analysis, are mentioned more often in biomedical journals. However, owing to the inherent complexity of machine learning methods, they are prone to misuse. Because of the flexibility in specifying machine learning models, the results are often insufficiently reported in research articles, hindering reliable assessment of model validity and consistent interpretation of model outputs. Objective To attain a set of guidelines on the use of machine learning predictive models within clinical settings to make sure the models are correctly applied and sufficiently reported so that true discoveries can be distinguished from random coincidence. Methods A multidisciplinary panel of machine learning experts, clinicians, and traditional statisticians were interviewed, using an iterative process in accordance with the Delphi method. Results The process produced a set of guidelines that consists of (1) a list of reporting items to be included in a research article and (2) a set of practical sequential steps for developing predictive models. Conclusions A set of guidelines was generated to enable correct application of machine learning models and consistent reporting of model specifications and results in biomedical research. We believe that such guidelines will accelerate the adoption of big data analysis, particularly with machine learning methods, in the biomedical research community. PMID:27986644
Response set bias, internal consistency and construct validity of the Oswestry Low Back Pain Disability Questionnaire

PubMed Central

Tibbles, Anthony C; Waalen, Judith K; Hains, François C

1998-01-01

Background: The Oswestry Low Back Pain Disability Questionnaire (ODQ) is a widely used 10-item paper and pencil measure of disability resulting from low back pain. However, few studies have assessed the psychometric properties of the instrument. This study evaluated the response set bias, the internal consistency, and the construct validity of the ODQ. Objectives: The original ODQ was compared to seven modified versions to examine whether a response set bias existed. The internal consistency of the ODQ was assessed using the Cronbach alpha. Finally, the relationship between scores on the ODQ and the Roland Morris Functional Disability Scal (RM) was examined. Methods: Seven modified versions of the ODQ were developed from the original. One of the eight versions was randomly allocated to 102 adult patients presenting with low lack pain. There was no attempt to select patients on the basis of pain intensity or prior treatment so as to maximize the range and diversity of low back pain sufferers. Results: Results suggest that the responses given on the eight versions of the ODQ are a function of content and not of the format in which the items are presented. The ODQ also has strong internal consisstency (alpha = 0.85) and is strongly correlated to the RM (r = .70, p = .0005). The ODQ is a significant predictor of the RM scores (T-9.45, p = .0005) and duration of symptoms (T = -2.17, p = .0325). Conclusion: The ODQ appears to possess stable psychometric properties. The use of more than one version provides practitioners with a means of repeatedly assessing the disability levels of patients suffering from low back pain over the course of treatment.
Rapid, Reliable Shape Setting of Superelastic Nitinol for Prototyping Robots

PubMed Central

Gilbert, Hunter B.; Webster, Robert J.

2016-01-01

Shape setting Nitinol tubes and wires in a typical laboratory setting for use in superelastic robots is challenging. Obtaining samples that remain superelastic and exhibit desired precurvatures currently requires many iterations, which is time consuming and consumes a substantial amount of Nitinol. To provide a more accurate and reliable method of shape setting, in this paper we propose an electrical technique that uses Joule heating to attain the necessary shape setting temperatures. The resulting high power heating prevents unintended aging of the material and yields consistent and accurate results for the rapid creation of prototypes. We present a complete algorithm and system together with an experimental analysis of temperature regulation. We experimentally validate the approach on Nitinol tubes that are shape set into planar curves. We also demonstrate the feasibility of creating general space curves by shape setting a helical tube. The system demonstrates a mean absolute temperature error of 10°C. PMID:27648473
Rapid, Reliable Shape Setting of Superelastic Nitinol for Prototyping Robots.

PubMed

Gilbert, Hunter B; Webster, Robert J

Shape setting Nitinol tubes and wires in a typical laboratory setting for use in superelastic robots is challenging. Obtaining samples that remain superelastic and exhibit desired precurvatures currently requires many iterations, which is time consuming and consumes a substantial amount of Nitinol. To provide a more accurate and reliable method of shape setting, in this paper we propose an electrical technique that uses Joule heating to attain the necessary shape setting temperatures. The resulting high power heating prevents unintended aging of the material and yields consistent and accurate results for the rapid creation of prototypes. We present a complete algorithm and system together with an experimental analysis of temperature regulation. We experimentally validate the approach on Nitinol tubes that are shape set into planar curves. We also demonstrate the feasibility of creating general space curves by shape setting a helical tube. The system demonstrates a mean absolute temperature error of 10°C.
Multi-criteria development and incorporation into decision tools for health technology adoption.

PubMed

Poulin, Paule; Austen, Lea; Scott, Catherine M; Waddell, Cameron D; Dixon, Elijah; Poulin, Michelle; Lafrenière, René

2013-01-01

When introducing new health technologies, decision makers must integrate research evidence with local operational management information to guide decisions about whether and under what conditions the technology will be used. Multi-criteria decision analysis can support the adoption or prioritization of health interventions by using criteria to explicitly articulate the health organization's needs, limitations, and values in addition to evaluating evidence for safety and effectiveness. This paper seeks to describe the development of a framework to create agreed-upon criteria and decision tools to enhance a pre-existing local health technology assessment (HTA) decision support program. The authors compiled a list of published criteria from the literature, consulted with experts to refine the criteria list, and used a modified Delphi process with a group of key stakeholders to review, modify, and validate each criterion. In a workshop setting, the criteria were used to create decision tools. A set of user-validated criteria for new health technology evaluation and adoption was developed and integrated into the local HTA decision support program. Technology evaluation and decision guideline tools were created using these criteria to ensure that the decision process is systematic, consistent, and transparent. This framework can be used by others to develop decision-making criteria and tools to enhance similar technology adoption programs. The development of clear, user-validated criteria for evaluating new technologies adds a critical element to improve decision-making on technology adoption, and the decision tools ensure consistency, transparency, and real-world relevance.
Parton Distributions based on a Maximally Consistent Dataset

NASA Astrophysics Data System (ADS)

Rojo, Juan

2016-04-01

The choice of data that enters a global QCD analysis can have a substantial impact on the resulting parton distributions and their predictions for collider observables. One of the main reasons for this has to do with the possible presence of inconsistencies, either internal within an experiment or external between different experiments. In order to assess the robustness of the global fit, different definitions of a conservative PDF set, that is, a PDF set based on a maximally consistent dataset, have been introduced. However, these approaches are typically affected by theory biases in the selection of the dataset. In this contribution, after a brief overview of recent NNPDF developments, we propose a new, fully objective, definition of a conservative PDF set, based on the Bayesian reweighting approach. Using the new NNPDF3.0 framework, we produce various conservative sets, which turn out to be mutually in agreement within the respective PDF uncertainties, as well as with the global fit. We explore some of their implications for LHC phenomenology, finding also good consistency with the global fit result. These results provide a non-trivial validation test of the new NNPDF3.0 fitting methodology, and indicate that possible inconsistencies in the fitted dataset do not affect substantially the global fit PDFs.
Breast Cancer Screening Beliefs Questionnaire: Psychometric properties assessment of the Arabic version.

PubMed

Kwok, Cannas; Endrawes, Gihane; Lee, Chun Fan

2016-02-01

The aim of the study was to report the psychometric properties of the Arabic version of the Breast Cancer Screening Beliefs Questionnaire (BCSBQ). A convenience sample of 251 Arabic-Australian women was recruited from a number of Arabic community organizations. Construct validity was examined by Cuzick's non-parametric test while Cronbach α was used to assess internal consistency reliability. Explanatory factor analysis was conducted to study the factor structure. The results indicated that the Arabic version of the BCSBQ had satisfactory validity and internal consistency. The Cronbach's alpha of the three subscales ranged between 0.810 and 0.93. The frequency of breast cancer screening practices (breast awareness, clinical breast-examination and mammography) were significantly associated with attitudes towards general health check-up and perceived barriers to mammographic screening. Exploratory factor analysis showed a similar fit for the hypothesized three-factor structure with our data set. The Arabic version of the BCBSQ is a culturally appropriate, valid and reliable instrument for assessing the beliefs, knowledge and attitudes to breast cancer and breast cancer screening practices among Arabic-Australian women. Copyright © 2015 Elsevier Ltd. All rights reserved.
Development of a Decisional Balance Scale for Young Adult Marijuana Use

PubMed Central

Elliott, Jennifer C.; Carey, Kate B.; Scott-Sheldon, Lori A. J.

2010-01-01

This study describes the development and validation of a decisional balance scale for marijuana use in young adults. Scale development was accomplished in four phases. First, 53 participants (70% female, 68% freshman) provided qualitative data that yielded content for an initial set of 47 items. In the second phase, an exploratory factor analysis on the responses of 260 participants (52% female, 68% freshman) revealed two factors, corresponding to pros and cons. Items that did not load well on the factors were omitted, resulting in a reduced set of 36 items. In the third phase, 182 participants (49% female, 37% freshmen) completed the revised scale and an evaluation of factor structure led to scale revisions and model respecification to create a good-fitting model. The final scales consisted of 8 pros (α = 0.91) and 16 cons (α = 0.93), and showed evidence of validity. In the fourth phase (N = 248, 66% female, 70% freshman), we confirmed the factor structure, and provided further evidence for reliability and validity. The Marijuana Decisional Balance Scale enhances our ability to study motivational factors associated with marijuana use among young adults. PMID:21261405
Measurement properties of depression questionnaires in patients with diabetes: a systematic review.

PubMed

van Dijk, Susan E M; Adriaanse, Marcel C; van der Zwaan, Lennart; Bosmans, Judith E; van Marwijk, Harm W J; van Tulder, Maurits W; Terwee, Caroline B

2018-06-01

To conduct a systematic review on measurement properties of questionnaires measuring depressive symptoms in adult patients with type 1 or type 2 diabetes. A systematic review of the literature in MEDLINE, EMbase and PsycINFO was performed. Full text, original articles, published in any language up to October 2016 were included. Eligibility for inclusion was independently assessed by three reviewers who worked in pairs. Methodological quality of the studies was evaluated by two independent reviewers using the COnsensus-based Standards for the selection of health Measurement INstruments (COSMIN) checklist. Quality of the questionnaires was rated per measurement property, based on the number and quality of the included studies and the reported results. Of 6286 unique hits, 21 studies met our criteria evaluating nine different questionnaires in multiple settings and languages. The methodological quality of the included studies was variable for the different measurement properties: 9/15 studies scored 'good' or 'excellent' on internal consistency, 2/5 on reliability, 0/1 on content validity, 10/10 on structural validity, 8/11 on hypothesis testing, 1/5 on cross-cultural validity, and 4/9 on criterion validity. For the CES-D, there was strong evidence for good internal consistency, structural validity, and construct validity; moderate evidence for good criterion validity; and limited evidence for good cross-cultural validity. The PHQ-9 and WHO-5 also performed well on several measurement properties. However, the evidence for structural validity of the PHQ-9 was inconclusive. The WHO-5 was less extensively researched and originally not developed to measure depression. Currently, the CES-D is best supported for measuring depressive symptoms in diabetes patients.

Dyspnoea-12: a translation and linguistic validation study in a Swedish setting.

PubMed

Sundh, Josefin; Ekström, Magnus

2017-06-06

Dyspnoea consists of multiple dimensions including the intensity, unpleasantness, sensory qualities and emotional responses which may differ between patient groups, settings and in relation to treatment. The Dyspnoea-12 is a validated and convenient instrument for multidimensional measurement in English. We aimed to take forward a Swedish version of the Dyspnoea-12. The linguistic validation of the Dyspnoea-12 was performed (Mapi Language Services, Lyon, France). The standardised procedure involved forward and backward translations by three independent certified translators and revisions after feedback from an in-country linguistic consultant, the developerand three native physicians. The understanding and convenience of the translated version was evaluated using qualitative in-depth interviews with five patients with dyspnoea. A Swedish version of the Dyspnoea-12 was elaborated and evaluated carefully according to international guidelines. The Swedish version, 'Dyspné-12', has the same layout as the original version, including 12 items distributed on seven physical and five affective items. The Dyspnoea-12 is copyrighted by the developer but can be used free of charge after permission for not industry-funded research. A Swedish version of the Dyspnoea-12 is now available for clinical validation and multidimensional measurement across diseases and settings with the aim of improved evaluation and management of dyspnoea. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2017. All rights reserved. No commercial use is permitted unless otherwise expressly granted.
The thought-action fusion scale: further evidence for its reliability and validity.

PubMed

Rassin, E; Merckelbach, H; Muris, P; Schmidt, H

2001-05-01

Thought-action fusion (TAF) refers to a set of cognitive biases that are thought to play a role in the development of obsessional phenomena. To measure these biases, R. Shafran, D. S. Thordarson, and S. Rachman (1996; Journal of Anxiety Disorders, 10, 379-391) developed the TAF-scale. They concluded that the TAF-scale possesses adequate psychometric qualities. The current study sought to further explore the reliability and validity of the TAF-scale. Results indicate that the TAF-scale has good internal consistency. TAF-scores correlated with self-reports of obsessional problems. Furthermore, mean scores in a mixed sample of anxiety disordered patients were higher than those in a normal sample. However, temporal consistency was somewhat disappointing. Also, the question remains whether TAF is specific to obsessive-compulsive disorder or taps more pervasive biases that play a role in a variety of disorders.
Identifying Wrist Fracture Patients with High Accuracy by Automatic Categorization of X-ray Reports

PubMed Central

de Bruijn, Berry; Cranney, Ann; O’Donnell, Siobhan; Martin, Joel D.; Forster, Alan J.

2006-01-01

The authors performed this study to determine the accuracy of several text classification methods to categorize wrist x-ray reports. We randomly sampled 751 textual wrist x-ray reports. Two expert reviewers rated the presence (n = 301) or absence (n = 450) of an acute fracture of wrist. We developed two information retrieval (IR) text classification methods and a machine learning method using a support vector machine (TC-1). In cross-validation on the derivation set (n = 493), TC-1 outperformed the two IR based methods and six benchmark classifiers, including Naive Bayes and a Neural Network. In the validation set (n = 258), TC-1 demonstrated consistent performance with 93.8% accuracy; 95.5% sensitivity; 92.9% specificity; and 87.5% positive predictive value. TC-1 was easy to implement and superior in performance to the other classification methods. PMID:16929046
Testing the Zimbardo Time Perspective Inventory in the Chinese context.

PubMed

Wang, Ya; Chen, Xing-Jie; Cui, Ji-Fang; Liu, Lu-Lu

2015-09-01

In this study, the authors evaluated the Chinese version of the Zimbardo Time Perspective Inventory (ZTPI). The ZTPI was tested among a sample of 303 university students. A subsample of 51 participants was then asked to complete the ZTPI again along with another set of questionnaires. The five-factor model of a 20-item short version of the ZTPI showed good model fit, internal consistency, and test-retest reliability. The 20-item Chinese version of the ZTPI also provided good validity, showing correlations with other variables in expected directions. Past-Positive was positively correlated with reappraisal and negatively correlated with suppression emotion regulation strategies, and Present-Hedonistic was positively correlated with reappraisal emotion regulation strategies. These findings indicate that the ZTPI is a reliable and valid instrument for measuring time perspective in the Chinese setting. © 2015 The Institute of Psychology, Chinese Academy of Sciences and Wiley Publishing Asia Pty Ltd.
Condition monitoring of an electro-magnetic brake using an artificial neural network

NASA Astrophysics Data System (ADS)

Gofran, T.; Neugebauer, P.; Schramm, D.

2017-10-01

This paper presents a data-driven approach to Condition Monitoring of Electromagnetic brakes without use of additional sensors. For safe and efficient operation of electric motor a regular evaluation and replacement of the friction surface of the brake is required. One such evaluation method consists of direct or indirect sensing of the air-gap between pressure plate and magnet. A larger gap is generally indicative of worn surface(s). Traditionally this has been accomplished by the use of additional sensors - making existing systems complex, cost- sensitive and difficult to maintain. In this work a feed-forward Artificial Neural Network (ANN) is learned with the electrical data of the brake by supervised learning method to estimate the air-gap. The ANN model is optimized on the training set and validated using the test set. The experimental results of estimated air-gap with accuracy of over 95% demonstrate the validity of the proposed approach.
45 CFR 162.1011 - Valid code sets.

Code of Federal Regulations, 2012 CFR

2012-10-01

... 45 Public Welfare 1 2012-10-01 2012-10-01 false Valid code sets. 162.1011 Section 162.1011 Public Welfare DEPARTMENT OF HEALTH AND HUMAN SERVICES ADMINISTRATIVE DATA STANDARDS AND RELATED REQUIREMENTS ADMINISTRATIVE REQUIREMENTS Code Sets § 162.1011 Valid code sets. Each code set is valid within the dates...
45 CFR 162.1011 - Valid code sets.

Code of Federal Regulations, 2013 CFR

2013-10-01

... 45 Public Welfare 1 2013-10-01 2013-10-01 false Valid code sets. 162.1011 Section 162.1011 Public Welfare DEPARTMENT OF HEALTH AND HUMAN SERVICES ADMINISTRATIVE DATA STANDARDS AND RELATED REQUIREMENTS ADMINISTRATIVE REQUIREMENTS Code Sets § 162.1011 Valid code sets. Each code set is valid within the dates...
45 CFR 162.1011 - Valid code sets.

Code of Federal Regulations, 2010 CFR

2010-10-01

... 45 Public Welfare 1 2010-10-01 2010-10-01 false Valid code sets. 162.1011 Section 162.1011 Public Welfare DEPARTMENT OF HEALTH AND HUMAN SERVICES ADMINISTRATIVE DATA STANDARDS AND RELATED REQUIREMENTS ADMINISTRATIVE REQUIREMENTS Code Sets § 162.1011 Valid code sets. Each code set is valid within the dates...
45 CFR 162.1011 - Valid code sets.

Code of Federal Regulations, 2014 CFR

2014-10-01

... 45 Public Welfare 1 2014-10-01 2014-10-01 false Valid code sets. 162.1011 Section 162.1011 Public Welfare Department of Health and Human Services ADMINISTRATIVE DATA STANDARDS AND RELATED REQUIREMENTS ADMINISTRATIVE REQUIREMENTS Code Sets § 162.1011 Valid code sets. Each code set is valid within the dates...
45 CFR 162.1011 - Valid code sets.

Code of Federal Regulations, 2011 CFR

2011-10-01

... 45 Public Welfare 1 2011-10-01 2011-10-01 false Valid code sets. 162.1011 Section 162.1011 Public Welfare DEPARTMENT OF HEALTH AND HUMAN SERVICES ADMINISTRATIVE DATA STANDARDS AND RELATED REQUIREMENTS ADMINISTRATIVE REQUIREMENTS Code Sets § 162.1011 Valid code sets. Each code set is valid within the dates...
Psychometric evaluation of the Revised Professional Practice Environment (RPPE) scale.

PubMed

Erickson, Jeanette Ives; Duffy, Mary E; Ditomassi, Marianne; Jones, Dorothy

2009-05-01

The purpose was to examine the psychometric properties of the Revised Professional Practice Environment (RPPE) scale. Despite renewed focus on studying health professionals' practice environments, there are still few reliable and valid instruments available to assist nurse administrators in decision making. A psychometric evaluation using a random-sample cross-validation procedure (calibration sample [CS], n = 775; validation sample [VS], n = 775) was undertaken. Cronbach alpha internal consistency reliability of the total score (r = 0.93 [CS] and 0.92 [VS]), resulting subscale scores (r range: 0.80-0.87 [CS], 0.81-0.88 [VS]), and principal components analyses with Varimax rotation and Kaiser normalization (8 components, 59.2% variance [CS], 59.7% [VS]) produced almost identical results in both samples. The multidimensional RPPE is a psychometrically sound measure of 8 components of the professional practice environment in the acute care setting and sufficiently reliable and valid for use as independent subscales in healthcare research.
Reasoning, Problem Solving, and Intelligence.

DTIC Science & Technology

1980-04-01

designed to test the validity of their model of response choice in analogical reason- ing. In the first experiment, they set out to demonstrate that...second experiment were somewhat consistent with the prediction. The third experiment used a concept-formation design in which subjects were required to... designed to show interrelationships between various forms of inductive reasoning. Their model fits were highly comparable to those of Rumelhart and
Dutch version of the Fear of Pain Questionnaire for adolescents with chronic pain.

PubMed

Dekker, Carolien; Bastiaenen, Caroline H G; de Vries, Janneke E; Simons, Laura E; Goossens, Mariëlle E J B; Verbunt, Jeanine A M C F

2018-06-01

Fear of pain is important in the development and maintenance of chronic pain. The Fear of Pain Questionnaire-Child version has been developed to assess pain related fear in children and adolescents. Translating the original questionnaire into Dutch, and investigating internal consistency and construct validity to enable use in the Dutch pain rehabilitation setting for treatment and research. Cross-sectional validation study: After forward and back translation of the FOPQ-C, adolescents (11-22 years old) with chronic musculoskeletal pain completed an assessment containing the Dutch Fear of Pain Questionnaire, and questionnaires about demographics, pain catastrophizing, functional disability, and pain intensity. Internal consistency and construct validity were evaluated through exploratory factor analysis (principal axis factoring with oblique rotation) and hypotheses testing using pain catastrophizing, functional disability, and pain intensity as comparative constructs. Eighty-six adolescents completed the assessment. Exploratory factor analysis resulted in a two-factor structure, explaining 43% of the variance. Internal consistency was strong (Cronbach's α = 0.92 total scale, α = 0.88 factor 1, and α = .86 factor 2). Five out of 6 hypotheses were confirmed. The Dutch version demonstrated good internal consistency and good construct validity in a population of adolescents with chronic musculoskeletal pain. Implications for rehabilitation The Fear of Pain Questionnaire-Child version was developed to measure fear of pain and avoidance in children and adolescents with chronic pain. Identification of fear of pain and activities that are being avoided are important during screening and assessment of the adolescent for chronic pain rehabilitation treatment. The presence of fear of pain and/or avoidance behavior is important information to shape and target multidisciplinary rehabilitation treatment.
Using simple artificial intelligence methods for predicting amyloidogenesis in antibodies

PubMed Central

2010-01-01

Background All polypeptide backbones have the potential to form amyloid fibrils, which are associated with a number of degenerative disorders. However, the likelihood that amyloidosis would actually occur under physiological conditions depends largely on the amino acid composition of a protein. We explore using a naive Bayesian classifier and a weighted decision tree for predicting the amyloidogenicity of immunoglobulin sequences. Results The average accuracy based on leave-one-out (LOO) cross validation of a Bayesian classifier generated from 143 amyloidogenic sequences is 60.84%. This is consistent with the average accuracy of 61.15% for a holdout test set comprised of 103 AM and 28 non-amyloidogenic sequences. The LOO cross validation accuracy increases to 81.08% when the training set is augmented by the holdout test set. In comparison, the average classification accuracy for the holdout test set obtained using a decision tree is 78.64%. Non-amyloidogenic sequences are predicted with average LOO cross validation accuracies between 74.05% and 77.24% using the Bayesian classifier, depending on the training set size. The accuracy for the holdout test set was 89%. For the decision tree, the non-amyloidogenic prediction accuracy is 75.00%. Conclusions This exploratory study indicates that both classification methods may be promising in providing straightforward predictions on the amyloidogenicity of a sequence. Nevertheless, the number of available sequences that satisfy the premises of this study are limited, and are consequently smaller than the ideal training set size. Increasing the size of the training set clearly increases the accuracy, and the expansion of the training set to include not only more derivatives, but more alignments, would make the method more sound. The accuracy of the classifiers may also be improved when additional factors, such as structural and physico-chemical data, are considered. The development of this type of classifier has significant applications in evaluating engineered antibodies, and may be adapted for evaluating engineered proteins in general. PMID:20144194
Using simple artificial intelligence methods for predicting amyloidogenesis in antibodies.

PubMed

David, Maria Pamela C; Concepcion, Gisela P; Padlan, Eduardo A

2010-02-08

All polypeptide backbones have the potential to form amyloid fibrils, which are associated with a number of degenerative disorders. However, the likelihood that amyloidosis would actually occur under physiological conditions depends largely on the amino acid composition of a protein. We explore using a naive Bayesian classifier and a weighted decision tree for predicting the amyloidogenicity of immunoglobulin sequences. The average accuracy based on leave-one-out (LOO) cross validation of a Bayesian classifier generated from 143 amyloidogenic sequences is 60.84%. This is consistent with the average accuracy of 61.15% for a holdout test set comprised of 103 AM and 28 non-amyloidogenic sequences. The LOO cross validation accuracy increases to 81.08% when the training set is augmented by the holdout test set. In comparison, the average classification accuracy for the holdout test set obtained using a decision tree is 78.64%. Non-amyloidogenic sequences are predicted with average LOO cross validation accuracies between 74.05% and 77.24% using the Bayesian classifier, depending on the training set size. The accuracy for the holdout test set was 89%. For the decision tree, the non-amyloidogenic prediction accuracy is 75.00%. This exploratory study indicates that both classification methods may be promising in providing straightforward predictions on the amyloidogenicity of a sequence. Nevertheless, the number of available sequences that satisfy the premises of this study are limited, and are consequently smaller than the ideal training set size. Increasing the size of the training set clearly increases the accuracy, and the expansion of the training set to include not only more derivatives, but more alignments, would make the method more sound. The accuracy of the classifiers may also be improved when additional factors, such as structural and physico-chemical data, are considered. The development of this type of classifier has significant applications in evaluating engineered antibodies, and may be adapted for evaluating engineered proteins in general.
Danish VISA-A questionnaire with validation and reliability testing for Danish-speaking Achilles tendinopathy patients.

PubMed

Iversen, J V; Bartels, E M; Jørgensen, J E; Nielsen, T G; Ginnerup, C; Lind, M C; Langberg, H

2016-12-01

The VISA-A questionnaire has proven to be a valid and reliable tool for assessing severity of Achilles tendinopathy (AT). The aim was to translate and cross-culturally adapt the VISA-A questionnaire for a Danish-speaking AT population, and subsequently perform validity and reliability tests. Translation and following cross-cultural adaptation was performed as translation, synthesis, reverse translation, expert review, and pretesting. The final Danish version (VISA-A-DK) was tested for reliability on healthy controls (n = 75) and patients (n = 36). Tests for internal consistency, validity, and structure were performed on 71 patients. VISA-A-DK showed good reliability for patients (r = 0.80 ICC = 0.79) and healthy individuals (r = 0.98 ICC = 0.97). Internal consistency was 0.73 (Cronbach's alpha). The mean VISA-A-DK score in AT patients was 51 [47-55]. This was significantly lower than healthy controls with a score of 93 (90-95). Criterion validity was considered good when comparing the scores of the Danish version with the original version in both healthy individuals and patients. VISA-A-DK is a valid and reliable instrument and has shown compatible to the original version in assessment of AT patients. VISA-A-DK is a useful tool in the assessment of AT, both in research and in a clinical setting. © 2015 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Multiple Score Comparison: a network meta-analysis approach to comparison and external validation of prognostic scores.

PubMed

Haile, Sarah R; Guerra, Beniamino; Soriano, Joan B; Puhan, Milo A

2017-12-21

Prediction models and prognostic scores have been increasingly popular in both clinical practice and clinical research settings, for example to aid in risk-based decision making or control for confounding. In many medical fields, a large number of prognostic scores are available, but practitioners may find it difficult to choose between them due to lack of external validation as well as lack of comparisons between them. Borrowing methodology from network meta-analysis, we describe an approach to Multiple Score Comparison meta-analysis (MSC) which permits concurrent external validation and comparisons of prognostic scores using individual patient data (IPD) arising from a large-scale international collaboration. We describe the challenges in adapting network meta-analysis to the MSC setting, for instance the need to explicitly include correlations between the scores on a cohort level, and how to deal with many multi-score studies. We propose first using IPD to make cohort-level aggregate discrimination or calibration scores, comparing all to a common comparator. Then, standard network meta-analysis techniques can be applied, taking care to consider correlation structures in cohorts with multiple scores. Transitivity, consistency and heterogeneity are also examined. We provide a clinical application, comparing prognostic scores for 3-year mortality in patients with chronic obstructive pulmonary disease using data from a large-scale collaborative initiative. We focus on the discriminative properties of the prognostic scores. Our results show clear differences in performance, with ADO and eBODE showing higher discrimination with respect to mortality than other considered scores. The assumptions of transitivity and local and global consistency were not violated. Heterogeneity was small. We applied a network meta-analytic methodology to externally validate and concurrently compare the prognostic properties of clinical scores. Our large-scale external validation indicates that the scores with the best discriminative properties to predict 3 year mortality in patients with COPD are ADO and eBODE.
Validation of the Penn Acoustic Neuroma Quality-of-Life Scale (PANQOL) for Spanish-Speaking Patients.

PubMed

Medina, Maria Del Mar; Carrillo, Alvaro; Polo, Ruben; Fernandez, Borja; Alonso, Daniel; Vaca, Miguel; Cordero, Adela; Perez, Cecilia; Muriel, Alfonso; Cobeta, Ignacio

2017-04-01

Objective To perform translation, cross-cultural adaptation, and validation of the Penn Acoustic Neuroma Quality-of-Life Scale (PANQOL) to the Spanish language. Study Design Prospective study. Setting Tertiary neurotologic referral center. Subjects and Methods PANQOL was translated and translated back, and a pretest trial was performed. The study included 27 individuals diagnosed with vestibular schwannoma. Inclusion criteria were adults with untreated vestibular schwannoma, diagnosed in the past 12 months. Feasibility, internal consistency, test-retest reliability, construct validity, and ceiling and floor effects were assessed for the present study. Results The mean overall score of the PANQOL was 69.21 (0-100 scale, lowest to highest quality of life). Cronbach's α was 0.87. Intraclass correlation coefficient was performed for each item, with an overall score of 0.92. The κ coefficient scores were between moderate and almost perfect in more than 92% of patients. Anxiety and energy domains of the PANQOL were correlated with both physical and mental components of the SF-12. Hearing, balance, and pain domains were correlated with the SF-12 physical component. Facial and general domains were not significantly correlated with any component of the SF-12. Furthermore, the overall score of the PANQOL was correlated with the physical component of the SF-12. Conclusion Feasibility, internal consistency, reliability, and construct validity outcomes in the current study support the validity of the Spanish version of the PANQOL.
The use of the FACT-H&N (v4) in clinical settings within a developing country: a mixed method study.

PubMed

Bilal, Sobia; Doss, Jennifer Geraldine; Rogers, Simon N

2014-12-01

In the last decade there has been an increasing awareness about 'quality of life' (QOL) of cancer survivors in developing countries. The study aimed to cross-culturally adapt and validate the FACT-H&N (v4) in Urdu language for Pakistani head and neck cancer patients. In this study the 'same language adaptation method' was used. Cognitive debriefing through in-depth interviews of 25 patients to assess semantic, operational and conceptual equivalence was done. The validation phase included 50 patients to evaluate the psychometric properties. The translated FACT-H&N was easily comprehended (100%). Cronbach's alpha for FACT-G subscales ranged from 0.726 - 0.969. The head and neck subscale and Pakistani questions subscale showed low internal consistency (0.426 and 0.541 respectively). Instrument demonstrated known-group validity in differentiating patients of different clinical stages, treatment status and tumor sites (p < 0.05). Most FACT summary scales correlated strongly with each other (r > 0.75) and showed convergent validity (r > 0.90), with little discriminant validity. Factor analysis revealed 6 factors explaining 85.1% of the total variance with very good (>0.8) Kaiser-Meyer-Olkin and highly significant Bartlett's Test of Sphericity (p < 0.001). The cross-culturally adapted FACT-H&N into Urdu language showed adequate reliability and validity to be incorporated in Pakistani clinical settings for head and neck cancer patients. Copyright © 2014 European Association for Cranio-Maxillo-Facial Surgery. Published by Elsevier Ltd. All rights reserved.
Development of the Human Factors Skills for Healthcare Instrument: a valid and reliable tool for assessing interprofessional learning across healthcare practice settings.

PubMed

Reedy, Gabriel B; Lavelle, Mary; Simpson, Thomas; Anderson, Janet E

2017-10-01

A central feature of clinical simulation training is human factors skills, providing staff with the social and cognitive skills to cope with demanding clinical situations. Although these skills are critical to safe patient care, assessing their learning is challenging. This study aimed to develop, pilot and evaluate a valid and reliable structured instrument to assess human factors skills, which can be used pre- and post-simulation training, and is relevant across a range of healthcare professions. Through consultation with a multi-professional expert group, we developed and piloted a 39-item survey with 272 healthcare professionals attending training courses across two large simulation centres in London, one specialising in acute care and one in mental health, both serving healthcare professionals working across acute and community settings. Following psychometric evaluation, the final 12-item instrument was evaluated with a second sample of 711 trainees. Exploratory factor analysis revealed a 12-item, one-factor solution with good internal consistency (α=0.92). The instrument had discriminant validity, with newly qualified trainees scoring significantly lower than experienced trainees ( t (98)=4.88, p<0.001) and was sensitive to change following training in acute and mental health settings, across professional groups (p<0.001). Confirmatory factor analysis revealed an adequate model fit (RMSEA=0.066). The Human Factors Skills for Healthcare Instrument provides a reliable and valid method of assessing trainees' human factors skills self-efficacy across acute and mental health settings. This instrument has the potential to improve the assessment and evaluation of human factors skills learning in both uniprofessional and interprofessional clinical simulation training.

Psychometric evaluation of the Arabic language person-centred climate questionnaire-staff version.

PubMed

Aljuaid, Mohammed; Elmontsri, Mustafa; Edvardsson, David; Rawaf, Salman; Majeed, Azeem

2018-05-01

To evaluate the psychometric properties of the Arabic language person-centred climate questionnaire-staff version. There have been increasing calls for a person-centred rather than a disease-centred approach to health care. A limited number of tools measure the extent to which care is delivered in a person-centred manner, and none of these tools have been validated for us in Arab settings. The validated form of the person-centred climate questionnaire-staff version was translated into Arabic and distributed to 152 health care staff in teaching and non-teaching hospitals in Saudi Arabia. Statistical estimates of validity and reliability were used for psychometric evaluation. Items on the Arabic form of the person-centred climate questionnaire-staff version had high reliability (Cronbach's alpha .98). Cronbach's alpha values for the three sub-scales (safety, everydayness and community), were .96, .97 and .95 respectively. Internal consistency was also high and measures of validity were very good. Arabic form of the person-centred climate questionnaire-staff version provides a valid and reliable way to measure the degree of perceived person-centredness. The tool can be used for comparing levels of person-centredness between wards, units, and public and private hospitals. The tool can also be used to measure the extent of person-centredness in health care settings in other Arab countries. © 2017 John Wiley & Sons Ltd.
Development and evaluation of the Korean Health Literacy Instrument.

PubMed

Kang, Soo Jin; Lee, Tae Wha; Paasche-Orlow, Michael K; Kim, Gwang Suk; Won, Hee Kwan

2014-01-01

The purpose of this study is to develop and validate the Korean Health Literacy Instrument, which measures the capacity to understand and use health-related information and make informed health decisions in Korean adults. In Phase 1, 33 initial items were generated to measure functional, interactive, and critical health literacy with prose, document, and numeracy tasks. These items included content from health promotion, disease management, and health navigation contexts. Content validity assessment was conducted by an expert panel, and 11 items were excluded. In Phase 2, the 22 remaining items were administered to a convenience sample of 292 adults from community and clinical settings. Exploratory factor and item difficulty and discrimination analyses were conducted and four items with low discrimination were deleted. In Phase 3, the remaining 18 items were administered to a convenience sample of 315 adults 40-64 years of age from community and clinical settings. A confirmatory factor analysis was performed to test the construct validity of the instrument. The Korean Health Literacy Instrument has a range of 0 to 18. The mean score in our validation study was 11.98. The instrument exhibited an internal consistency reliability coefficient of 0.82, and a test-retest reliability of 0.89. The instrument is suitable for screening individuals who have limited health literacy skills. Future studies are needed to further define the psychometric properties and predictive validity of the Korean Health Literacy Instrument.
Reliability and validity of the adolescent health profile-types.

PubMed

Riley, A W; Forrest, C B; Starfield, B; Green, B; Kang, M; Ensminger, M

1998-08-01

The purpose of this study was to demonstrate the preliminary reliability and validity of a set 13 profiles of adolescent health that describe distinct patterns of health and health service requirements on four domains of health. Reliability and validity were tested in four ethnically diverse population samples of urban and rural youths aged 11 to 17-years-old in public schools (N = 4,066). The reliability of the classification procedure and construct validity were examined in terms of the predicted and actual distributions of age, gender, race, socioeconomic status, and family type. School achievement, medical conditions, and the proportion of youths with a psychiatric disorder also were examined as tests of construct validity. The classification method was shown to produce consistent results across the four populations in terms of proportions of youths assigned with specific sociodemographic characteristics. Variations in health described by specific profiles showed expected relations to sociodemographic characteristics, family structure, school achievement, medical disorders, and psychiatric disorders. This taxonomy of health profile-types appears to effectively describe a set of patterns that characterize adolescent health. The profile-types provide a unique and practical method for identifying subgroups having distinct needs for health services, with potential utility for health policy and planning. Such integrative reporting methods are critical for more effective utilization of health status instruments in health resource planning and policy development.
Validating a new methodology for strain estimation from cardiac cine MRI

NASA Astrophysics Data System (ADS)

Elnakib, Ahmed; Beache, Garth M.; Gimel'farb, Georgy; Inanc, Tamer; El-Baz, Ayman

2013-10-01

This paper focuses on validating a novel framework for estimating the functional strain from cine cardiac magnetic resonance imaging (CMRI). The framework consists of three processing steps. First, the left ventricle (LV) wall borders are segmented using a level-set based deformable model. Second, the points on the wall borders are tracked during the cardiac cycle based on solving the Laplace equation between the LV edges. Finally, the circumferential and radial strains are estimated at the inner, mid-wall, and outer borders of the LV wall. The proposed framework is validated using synthetic phantoms of the material strains that account for the physiological features and the LV response during the cardiac cycle. Experimental results on simulated phantom images confirm the accuracy and robustness of our method.
Development and Validation of a Measure of Quality of Life for the Young Elderly in Sri Lanka.

PubMed

de Silva, Sudirikku Hennadige Padmal; Jayasuriya, Anura Rohan; Rajapaksa, Lalini Chandika; de Silva, Ambepitiyawaduge Pubudu; Barraclough, Simon

2016-01-01

Sri Lanka has one of the fastest aging populations in the world. Measurement of quality of life (QoL) in the elderly needs instruments developed that encompass the sociocultural settings. An instrument was developed to measure QoL in the young elderly in Sri Lanka (QLI-YES), using accepted methods to generate and reduce items. The measure was validated using a community sample. Construct, criterion and predictive validity and reliability were tested. A first-order model of 24 items with 6 domains was found to have good fit indices (CMIN/df = 1.567, RMR = 0.05, CFI = 0.95, and RMSEA = 0.053). Both criterion and predictive validity were demonstrated. Good internal consistency reliability (Cronbach's α = 0.93) was shown. The development of the QLI-YES using a societal perspective relevant to the social and cultural beliefs has resulted in a robust and valid instrument to measure QoL for the young elderly in Sri Lanka. © 2015 APJPH.
Brief reasons for living inventory: a psychometric investigation.

PubMed

Cwik, Jan Christopher; Siegmann, Paula; Willutzki, Ulrike; Nyhuis, Peter; Wolter, Marcus; Forkmann, Thomas; Glaesmer, Heide; Teismann, Tobias

2017-11-06

The present study aimed at validating the German version of the Brief Reasons for Living inventory (BRFL). Validity and reliability were established in a community (n = 339) and a clinical sample (n = 272). Convergent and discriminant validity were investigated, and confirmatory factor analyses were conducted for the complete BRFL as well as for a 10-item version excluding conditional items on child-related concerns. Furthermore, it was assessed how BRFL scores moderate the association between depression and suicide ideation. Results indicated an adequate fit of the data to the original factor structure. The total scale and the subscales of the German version of the BRFL had sufficient internal consistency, as well as good convergent and divergent validity. The BRFL demonstrated clinical utility by differentiating between participants with vs. without suicide ideation. Reasons for living proved to moderate the association between depression and suicide ideation. Results provide preliminary evidence that the BRFL may be a reliable and valid measure of adaptive reasons for living that can be used in clinic and research settings.
Development and Validation of a Measure of Quality of Life for the Young Elderly in Sri Lanka

PubMed Central

de Silva, Sudirikku Hennadige Padmal; Jayasuriya, Anura Rohan; Rajapaksa, Lalini Chandika; de Silva, Ambepitiyawaduge Pubudu; Barraclough, Simon

2016-01-01

Sri Lanka has one of the fastest aging populations in the world. Measurement of quality of life (QoL) in the elderly needs instruments developed that encompass the sociocultural settings. An instrument was developed to measure QoL in the young elderly in Sri Lanka (QLI-YES), using accepted methods to generate and reduce items. The measure was validated using a community sample. Construct, criterion and predictive validity and reliability were tested. A first-order model of 24 items with 6 domains was found to have good fit indices (CMIN/df = 1.567, RMR = 0.05, CFI = 0.95, and RMSEA = 0.053). Both criterion and predictive validity were demonstrated. Good internal consistency reliability (Cronbach’s α = 0.93) was shown. The development of the QLI-YES using a societal perspective relevant to the social and cultural beliefs has resulted in a robust and valid instrument to measure QoL for the young elderly in Sri Lanka. PMID:26712893
VALIDITY AND RELIABILITY OF THE SPIRITUAL COPING STRATEGIES SCALE ARABIC VERSION IN SAUDI PATIENTS UNDERGOING HAEMODIALYSIS.

PubMed

Cruz, Jonas P; Baldacchino, Donia R; Alquwez, Nahed

2016-06-01

Patients often resort to religious and spiritual activities to cope with physical and mental challenges. The effect of spiritual coping on overall health, adaptation and health-related quality of life among patients undergoing haemodialysis (HD) is well documented. Thus, it is essential to establish a valid and reliable instrument that can assess both the religious and non-religious coping methods in patients undergoing HD. This study aimed to assess the validity and reliability of the Spiritual Coping Strategies Scale Arabic version (SCS-A) in Saudi patients undergoing HD. A convenience sample of 60 Saudi patients undergoing HD was recruited for this descriptive, cross-sectional study. Data were collected between May and June 2015. Forward-backward translation was used to formulate the SCS-A. The SCS-A, Muslim Religiosity Scale and the Quality of Life Index Dialysis Version III were used to procure the data. Internal consistency reliability, stability reliability, factor analysis and construct validity tests were performed. Analyses were set at the 0.05 level of significance. The SCS-A showed an acceptable internal consistency and strong stability reliability over time. The EFA produced two factors (non-religious and religious coping). Satisfactory construct validity was established by the convergent and divergent validity and known-groups method. The SCS-A is a reliable and valid tool that can be used to measure the religious and non-religious coping strategies of patients undergoing HD in Saudi Arabia and other Muslim and Arabic-speaking countries. © 2016 European Dialysis and Transplant Nurses Association/European Renal Care Association.
Reliability and validity of the Lithuanian Tinnitus Handicap Inventory.

PubMed

Ulozienė, Ingrida; Balnytė, Renata; Alzbutienė, Giedrė; Arechvo, Irina; Vaitkus, Antanas; Šileikaitė, Milda; Šaferis, Viktoras; Ulozas, Virgilijus

2016-01-01

The aim of this study was to determine the reliability and validity of the Lithuanian version of the Tinnitus Handicap Inventory (THI), a self-report measure of perceived tinnitus handicap. A cross-sectional psychometric validation study was performed in the University Hospital. A total of 248 subjects reporting chronic tinnitus as their primary complaint or secondary to hearing loss were encluded in the study and filled in the Lithuanian version of THI. For assessment of construct validity a subgroup of 55 participants completed the Lithuanian version of the Hospital Anxiety and Depression Scale as a measure of self-perceived levels of anxiety and depression. Test-retest and internal consistency reliability as well as construct validity were calculated. The Lithuanian version of the THI and its subscales showed a robust internal consistency reliability (Cronbach's alpha=0.93) comparable to the original version. Statistically significant correlations were observed between the Lithuanian translation of the THI and the measures of self-perceived levels of anxiety and depression using HADS. Confirmatory factor analysis demonstrated that the three subscales of the THI Lithuanian version corresponded to three different factors, which strongly correlated between themselves. The results suggest that the Lithuanian version of THI maintains its original validity and may serve as reliable and valid measure of general tinnitus related distress that can be used in a clinical setting to quantify the impact of tinnitus on daily living. Copyright © 2016 The Lithuanian University of Health Sciences. Production and hosting by Elsevier Urban & Partner Sp. z o.o. All rights reserved.
The PDB_REDO server for macromolecular structure model optimization.

PubMed

Joosten, Robbie P; Long, Fei; Murshudov, Garib N; Perrakis, Anastassis

2014-07-01

The refinement and validation of a crystallographic structure model is the last step before the coordinates and the associated data are submitted to the Protein Data Bank (PDB). The success of the refinement procedure is typically assessed by validating the models against geometrical criteria and the diffraction data, and is an important step in ensuring the quality of the PDB public archive [Read et al. (2011 ▶), Structure, 19, 1395-1412]. The PDB_REDO procedure aims for 'constructive validation', aspiring to consistent and optimal refinement parameterization and pro-active model rebuilding, not only correcting errors but striving for optimal interpretation of the electron density. A web server for PDB_REDO has been implemented, allowing thorough, consistent and fully automated optimization of the refinement procedure in REFMAC and partial model rebuilding. The goal of the web server is to help practicing crystallo-graphers to improve their model prior to submission to the PDB. For this, additional steps were implemented in the PDB_REDO pipeline, both in the refinement procedure, e.g. testing of resolution limits and k-fold cross-validation for small test sets, and as new validation criteria, e.g. the density-fit metrics implemented in EDSTATS and ligand validation as implemented in YASARA. Innovative ways to present the refinement and validation results to the user are also described, which together with auto-generated Coot scripts can guide users to subsequent model inspection and improvement. It is demonstrated that using the server can lead to substantial improvement of structure models before they are submitted to the PDB.
[Validation and reliability study of the parent concerns about surgery questionnaire: What worries parents?

PubMed

Gironés Muriel, Alberto; Campos Segovia, Ana; Ríos Gómez, Patricia

2018-01-01

The study of mediating variables and psychological responses to child surgery involves the evaluation of both the patient and the parents as regards different stressors. To have a reliable and reproducible valid evaluation tool that assesses the level of paternal involvement in relation to different stressors in the setting of surgery. A self-report questionnaire study was completed by 123 subjects of both sexes, subdivided into 2populations, due to their relationship with the hospital setting. The items were determined by a group of experts and analysed using the Lawshe validity index to determine a first validity of content. Subsequently, the reliability of the tool was determined by an item-re-item analysis of the 2sub-populations. A factorial analysis was performed to analyse the construct validity with the maximum likelihood and rotation of varimax type factors. A questionnaire of paternal concern was offered, consisting of 21 items with a Cronbach coefficient of 0.97, giving good precision and stability. The posterior factor analysis gives an adequate validity to the questionnaire, with the determination of 10 common stressors that cover 74.08% of the common and non-common variance of the questionnaire. The proposed questionnaire is reliable, valid and easy-to-apply and is developed to assess the level of paternal concern about the surgery of a child and to be able to apply measures and programs through the prior assessment of these elements. Copyright © 2016 Asociación Española de Pediatría. Publicado por Elsevier España, S.L.U. All rights reserved.
The stroke impairment assessment set: its internal consistency and predictive validity.

PubMed

Tsuji, T; Liu, M; Sonoda, S; Domen, K; Chino, N

2000-07-01

To study the scale quality and predictive validity of the Stroke Impairment Assessment Set (SIAS) developed for stroke outcome research. Rasch analysis of the SIAS; stepwise multiple regression analysis to predict discharge functional independence measure (FIM) raw scores from demographic data, the SIAS scores, and the admission FIM scores; cross-validation of the prediction rule. Tertiary rehabilitation center in Japan. One hundred ninety stroke inpatients for the study of the scale quality and the predictive validity; a second sample of 116 stroke inpatients for the cross-validation study. Mean square fit statistics to study the degree of fit to the unidimensional model; logits to express item difficulties; discharge FIM scores for the study of predictive validity. The degree of misfit was acceptable except for the shoulder range of motion (ROM), pain, visuospatial function, and speech items; and the SIAS items could be arranged on a common unidimensional scale. The difficulty patterns were identical at admission and at discharge except for the deep tendon reflexes, ROM, and pain items. They were also similar for the right- and left-sided brain lesion groups except for the speech and visuospatial items. For the prediction of the discharge FIM scores, the independent variables selected were age, the SIAS total scores, and the admission FIM scores; and the adjusted R2 was .64 (p < .0001). Stability of the predictive equation was confirmed in the cross-validation sample (R2 = .68, p < .001). The unidimensionality of the SIAS was confirmed, and the SIAS total scores proved useful for stroke outcome prediction.
Syntactic and Semantic Validation without a Metadata Management System

NASA Technical Reports Server (NTRS)

Pollack, Janine; Gokey, Christopher D.; Kendig, David; Olsen, Lola; Wharton, Stephen W. (Technical Monitor)

2001-01-01

The ability to maintain quality information is essential to securing the confidence in any system for which the information serves as a data source. NASA's Global Change Master Directory (GCMD), an online Earth science data locator, holds over 9000 data set descriptions and is in a constant state of flux as metadata are created and updated on a daily basis. In such a system, the importance of maintaining the consistency and integrity of these-metadata is crucial. The GCMD has developed a metadata management system utilizing XML, controlled vocabulary, and Java technologies to ensure the metadata not only adhere to valid syntax, but also exhibit proper semantics.
Internal consistency, concurrent validity, and discriminant validity of a measure of public support for policies for active living in transportation (PAL-T) in a population-based sample of adults.

PubMed

Fuller, Daniel; Gauvin, Lise; Fournier, Michel; Kestens, Yan; Daniel, Mark; Morency, Patrick; Drouin, Louis

2012-04-01

Active living is a broad conceptualization of physical activity that incorporates domains of exercise; recreational, household, and occupational activities; and active transportation. Policy makers develop and implement a variety of transportation policies that can influence choices about how to travel from one location to another. In making such decisions, policy makers act in part in response to public opinion or support for proposed policies. Measures of the public's support for policies aimed at promoting active transportation can inform researchers and policy makers. This study examined the internal consistency, and concurrent and discriminant validity of a newly developed measure of the public's support for policies for active living in transportation (PAL-T). A series of 17 items representing potential policies for promoting active transportation was generated. Two samples of participants (n = 2,001 and n = 2,502) from Montreal, Canada, were recruited via random digit dialling. Analyses were conducted on the combined data set (n = 4,503). Participants were aged 18 through 94 years (58% female). The concurrent and discriminant validity of the PAL-T was assessed by examining relationships with physical activity and smoking. To explore the usability of the PAL-T, predicted scale scores were compared to the summed values of responses. Results showed that the internal consistency of the PAL-T was 0.70. Multilevel regression demonstrated no relationship between the PAL-T and smoking status (p > 0.05) but significant relationships with utilitarian walking (p < 0.05) and cycling (p < 0.01) for at least 30 minutes on 5 days/week. The PAL-T has acceptable internal consistency and good concurrent and discriminant validity. Measuring public opinion can inform policy makers and support advocacy efforts aimed at making built environments more suitable for active transportation while allowing researchers to examine the antecedents and consequences of public support for policies.
Fundamentals of endoscopic surgery: creation and validation of the hands-on test.

PubMed

Vassiliou, Melina C; Dunkin, Brian J; Fried, Gerald M; Mellinger, John D; Trus, Thadeus; Kaneva, Pepa; Lyons, Calvin; Korndorffer, James R; Ujiki, Michael; Velanovich, Vic; Kochman, Michael L; Tsuda, Shawn; Martinez, Jose; Scott, Daniel J; Korus, Gary; Park, Adrian; Marks, Jeffrey M

2014-03-01

The Fundamentals of Endoscopic Surgery™ (FES) program consists of online materials and didactic and skills-based tests. All components were designed to measure the skills and knowledge required to perform safe flexible endoscopy. The purpose of this multicenter study was to evaluate the reliability and validity of the hands-on component of the FES examination, and to establish the pass score. Expert endoscopists identified the critical skill set required for flexible endoscopy. They were then modeled in a virtual reality simulator (GI Mentor™ II, Simbionix™ Ltd., Airport City, Israel) to create five tasks and metrics. Scores were designed to measure both speed and precision. Validity evidence was assessed by correlating performance with self-reported endoscopic experience (surgeons and gastroenterologists [GIs]). Internal consistency of each test task was assessed using Cronbach's alpha. Test-retest reliability was determined by having the same participant perform the test a second time and comparing their scores. Passing scores were determined by a contrasting groups methodology and use of receiver operating characteristic curves. A total of 160 participants (17 % GIs) performed the simulator test. Scores on the five tasks showed good internal consistency reliability and all had significant correlations with endoscopic experience. Total FES scores correlated 0.73, with participants' level of endoscopic experience providing evidence of their validity, and their internal consistency reliability (Cronbach's alpha) was 0.82. Test-retest reliability was assessed in 11 participants, and the intraclass correlation was 0.85. The passing score was determined and is estimated to have a sensitivity (true positive rate) of 0.81 and a 1-specificity (false positive rate) of 0.21. The FES hands-on skills test examines the basic procedural components required to perform safe flexible endoscopy. It meets rigorous standards of reliability and validity required for high-stakes examinations, and, together with the knowledge component, may help contribute to the definition and determination of competence in endoscopy.
A rotorcraft flight database for validation of vision-based ranging algorithms

NASA Technical Reports Server (NTRS)

Smith, Phillip N.

1992-01-01

A helicopter flight test experiment was conducted at the NASA Ames Research Center to obtain a database consisting of video imagery and accurate measurements of camera motion, camera calibration parameters, and true range information. The database was developed to allow verification of monocular passive range estimation algorithms for use in the autonomous navigation of rotorcraft during low altitude flight. The helicopter flight experiment is briefly described. Four data sets representative of the different helicopter maneuvers and the visual scenery encountered during the flight test are presented. These data sets will be made available to researchers in the computer vision community.
Experiment module concepts study. Volume 3: Module and subsystem design

NASA Technical Reports Server (NTRS)

Hunter, J. R.; Chiarappa, D. J.

1970-01-01

The final common module set exhibiting wide commonality is described. The set consists of three types of modules: one free flying module and two modules that operate attached to the space station. The common module designs provide for the experiment program as defined. The feasibility, economy, and practicality of these modules hinges on factors that do not affect the approach or results of the commonality process, but are important to the validity of the common module concepts. Implementation of the total experiment program requires thirteen common modules: five CM-1, five CM-3, and three CM-4 modules.
Temporal and Geographic variation in the validity and internal consistency of the Nursing Home Resident Assessment Minimum Data Set 2.0.

PubMed

Mor, Vincent; Intrator, Orna; Unruh, Mark Aaron; Cai, Shubing

2011-04-15

The Minimum Data Set (MDS) for nursing home resident assessment has been required in all U.S. nursing homes since 1990 and has been universally computerized since 1998. Initially intended to structure clinical care planning, uses of the MDS expanded to include policy applications such as case-mix reimbursement, quality monitoring and research. The purpose of this paper is to summarize a series of analyses examining the internal consistency and predictive validity of the MDS data as used in the "real world" in all U.S. nursing homes between 1999 and 2007. We used person level linked MDS and Medicare denominator and all institutional claim files including inpatient (hospital and skilled nursing facilities) for all Medicare fee-for-service beneficiaries entering U.S. nursing homes during the period 1999 to 2007. We calculated the sensitivity and positive predictive value (PPV) of diagnoses taken from Medicare hospital claims and from the MDS among all new admissions from hospitals to nursing homes and the internal consistency (alpha reliability) of pairs of items within the MDS that logically should be related. We also tested the internal consistency of commonly used MDS based multi-item scales and examined the predictive validity of an MDS based severity measure viz. one year survival. Finally, we examined the correspondence of the MDS discharge record to hospitalizations and deaths seen in Medicare claims, and the completeness of MDS assessments upon skilled nursing facility (SNF) admission. Each year there were some 800,000 new admissions directly from hospital to US nursing homes and some 900,000 uninterrupted SNF stays. Comparing Medicare enrollment records and claims with MDS records revealed reasonably good correspondence that improved over time (by 2006 only 3% of deaths had no MDS discharge record, only 5% of SNF stays had no MDS, but over 20% of MDS discharges indicating hospitalization had no associated Medicare claim). The PPV and sensitivity levels of Medicare hospital diagnoses and MDS based diagnoses were between .6 and .7 for major diagnoses like CHF, hypertension, diabetes. Internal consistency, as measured by PPV, of the MDS ADL items with other MDS items measuring impairments and symptoms exceeded .9. The Activities of Daily Living (ADL) long form summary scale achieved an alpha inter-consistency level exceeding .85 and multi-item scale alpha levels of .65 were achieved for well being and mood, and .55 for behavior, levels that were sustained even after stratification by ADL and cognition. The Changes in Health, End-stage disease and Symptoms and Signs (CHESS) index, a summary measure of frailty was highly predictive of one year survival. The MDS demonstrates a reasonable level of consistency both in terms of how well MDS diagnoses correspond to hospital discharge diagnoses and in terms of the internal consistency of functioning and behavioral items. The level of alpha reliability and validity demonstrated by the scales suggest that the data can be useful for research and policy analysis. However, while improving, the MDS discharge tracking record should still not be used to indicate Medicare hospitalizations or mortality. It will be important to monitor the performance of the MDS 3.0 with respect to consistency, reliability and validity now that it has replaced version 2.0, using these results as a baseline that should be exceeded.
Stimulus-driven attentional capture by subliminal onset cues.

PubMed

Schoeberl, Tobias; Fuchs, Isabella; Theeuwes, Jan; Ansorge, Ulrich

2015-04-01

In two experiments, we tested whether subliminal abrupt onset cues capture attention in a stimulus-driven way. An onset cue was presented 16 ms prior to the stimulus display that consisted of clearly visible color targets. The onset cue was presented either at the same side as the target (the valid cue condition) or on the opposite side of the target (the invalid cue condition). Because the onset cue was presented 16 ms before other placeholders were presented, the cue was subliminal to the participant. To ensure that this subliminal cue captured attention in a stimulus-driven way, the cue's features did not match the top-down attentional control settings of the participants: (1) The color of the cue was always different than the color of the non-singleton targets ensuring that a top-down set for a specific color or for a singleton would not match the cue, and (2) colored targets and distractors had the same objective luminance (measured by the colorimeter) and subjective lightness (measured by flicker photometry), preventing a match between the top-down set for target and cue contrast. Even though a match between the cues and top-down settings was prevented, in both experiments, the cues captured attention, with faster response times in valid than invalid cue conditions (Experiments 1 and 2) and faster response times in valid than the neutral conditions (Experiment 2). The results support the conclusion that subliminal cues capture attention in a stimulus-driven way.
Measurement of latent cognitive abilities involved in concept identification learning.

PubMed

Thomas, Michael L; Brown, Gregory G; Gur, Ruben C; Moore, Tyler M; Patt, Virginie M; Nock, Matthew K; Naifeh, James A; Heeringa, Steven; Ursano, Robert J; Stein, Murray B

2015-01-01

We used cognitive and psychometric modeling techniques to evaluate the construct validity and measurement precision of latent cognitive abilities measured by a test of concept identification learning: the Penn Conditional Exclusion Test (PCET). Item response theory parameters were embedded within classic associative- and hypothesis-based Markov learning models and were fitted to 35,553 Army soldiers' PCET data from the Army Study to Assess Risk and Resilience in Servicemembers (Army STARRS). Data were consistent with a hypothesis-testing model with multiple latent abilities-abstraction and set shifting. Latent abstraction ability was positively correlated with number of concepts learned, and latent set-shifting ability was negatively correlated with number of perseverative errors, supporting the construct validity of the two parameters. Abstraction was most precisely assessed for participants with abilities ranging from 1.5 standard deviations below the mean to the mean itself. Measurement of set shifting was acceptably precise only for participants making a high number of perseverative errors. The PCET precisely measures latent abstraction ability in the Army STARRS sample, especially within the range of mildly impaired to average ability. This precision pattern is ideal for a test developed to measure cognitive impairment as opposed to cognitive strength. The PCET also measures latent set-shifting ability, but reliable assessment is limited to the impaired range of ability, reflecting that perseverative errors are rare among cognitively healthy adults. Integrating cognitive and psychometric models can provide information about construct validity and measurement precision within a single analytical framework.

Measuring potential predictors of burnout and engagement among young veterinary professionals; construction of a customised questionnaire (the Vet-DRQ).

PubMed

Mastenbroek, N J J M; Demerouti, E; van Beukelen, P; Muijtjens, A M M; Scherpbier, A J J A; Jaarsma, A D C

2014-02-15

The Job Demands-Resources model (JD-R model) was used as the theoretical basis of a tailormade questionnaire to measure the psychosocial work environment and personal resources of recently graduated veterinary professionals. According to the JD-R model, two broad categories of work characteristics that determine employee wellbeing can be distinguished: job demands and job resources. Recently, the JD-R model has been expanded by integrating personal resource measures into the model. Three semistructured group interviews with veterinarians active in different work domains were conducted to identify relevant job demands, job resources and personal resources. These demands and resources were organised in themes (constructs). For measurement purposes, a set of questions ('a priori scale') was selected from the literature for each theme. The full set of a priori scales was included in a questionnaire that was administered to 1760 veterinary professionals. Exploratory factor analysis and reliability analysis were conducted to arrive at the final set of validated scales (final scales). 860 veterinarians (73 per cent females) participated. The final set of scales consisted of seven job demands scales (32 items), nine job resources scales (41 items), and six personal resources scales (26 items) which were considered to represent the most relevant potential predictors of work-related wellbeing in this occupational group. The procedure resulted in a tailormade questionnaire: the Veterinary Job Demands and Resources Questionnaire (Vet-DRQ). The use of valid theory and validated scales enhances opportunities for comparative national and international research.
Modification and Evaluation of a Velopharyngeal Insufficiency Quality of Life Instrument

PubMed Central

Skirko, Jonathan R.; Weaver, Edward M; Perkins, Jonathan; Kinter, Sara; Sie, Kathleen C.Y.

2018-01-01

Objective Modify the existing 45-item velopharyngeal insufficiency (VPI) quality of life (QOL) instrument (VPIQL), assess the modified instrument for reliability and provide further validation. There are patient and parent versions of the instrument. Design Validation convenience sample from a previously conducted pilot study. Setting Two academic tertiary referral medical centers. Participants De-identified data were used from 29 subjects with VPI and 29 control subjects age 5–17 years, and parents. Outcome measures Subjects and parents completed VPIQL and a generic pediatric QOL instrument (PedsQL4-0). Data Analysis Twenty-two items were removed from the VPIQL for ceiling effects, floor effects, and redundancy, to produce the modified instrument, VPI Effects on Life Outcomes (VELO) instrument. VELO was tested for internal consistency (Chronbach’s alpha), discriminant validity (paired t-test with control subjects), and concurrent validity (Pearson correlation with the PedsQL4-0). These analyses were also completed for parents. Results The 45-item VPIQL instrument was reduced to the 23-item VELO instrument. The VELO had excellent internal consistency (Chronbach’s alpha 0.96 for parents and 0.95 for VPI subjects). The VELO discriminated well between VPI and control subjects, with mean score (SD) was significantly lower (worse) for VPI subjects (67.6 [23.9]) than for control subjects (97.0 [5.2]) (p<0.0001). The VELO total score was significantly correlated with the PedsQL4.0 (r=0.73) among subjects with VPI. Similar results were seen in parent responses. Conclusions The VELO is a 23-item QOL instrument that was designed to measure and follow QOL in subjects with VPI, with less burden than the original VPIQL. VELO demonstrates internal consistency, disciminant validty, and concurrent validity with the PedsQL4-0. PMID:23069823
The Second SeaWiFS HPLC Analysis Round-Robin Experiment (SeaHARRE-2)

NASA Technical Reports Server (NTRS)

2005-01-01

Eight international laboratories specializing in the determination of marine pigment concentrations using high performance liquid chromatography (HPLC) were intercompared using in situ samples and a variety of laboratory standards. The field samples were collected primarily from eutrophic waters, although mesotrophic waters were also sampled to create a dynamic range in chlorophyll concentration spanning approximately two orders of magnitude (0.3 25.8 mg m-3). The intercomparisons were used to establish the following: a) the uncertainties in quantitating individual pigments and higher-order variables (sums, ratios, and indices); b) an evaluation of spectrophotometric versus HPLC uncertainties in the determination of total chlorophyll a; and c) the reduction in uncertainties as a result of applying quality assurance (QA) procedures associated with extraction, separation, injection, degradation, detection, calibration, and reporting (particularly limits of detection and quantitation). In addition, the remote sensing requirements for the in situ determination of total chlorophyll a were investigated to determine whether or not the average uncertainty for this measurement is being satisfied. The culmination of the activity was a validation of the round-robin methodology plus the development of the requirements for validating an individual HPLC method. The validation process includes the measurements required to initially demonstrate a pigment is validated, and the measurements that must be made during sample analysis to confirm a method remains validated. The so-called performance-based metrics developed here describe a set of thresholds for a variety of easily-measured parameters with a corresponding set of performance categories. The aggregate set of performance parameters and categories establish a) the overall performance capability of the method, and b) whether or not the capability is consistent with the required accuracy objectives.
Development and Validation of the Smartphone Addiction Inventory (SPAI)

PubMed Central

Lin, Yu-Hsuan; Chang, Li-Ren; Lee, Yang-Han; Tseng, Hsien-Wei; Kuo, Terry B. J.; Chen, Sue-Huei

2014-01-01

Objective The aim of this study was to develop a self-administered scale based on the special features of smartphone. The reliability and validity of the Smartphone Addiction Inventory (SPAI) was demonstrated. Methods A total of 283 participants were recruited from Dec. 2012 to Jul. 2013 to complete a set of questionnaires, including a 26-item SPAI modified from the Chinese Internet Addiction Scale and phantom vibration and ringing syndrome questionnaire. There were 260 males and 23 females, with ages 22.9±2.0 years. Exploratory factor analysis, internal-consistency test, test-retest, and correlation analysis were conducted to verify the reliability and validity of the SPAI. Correlations between each subscale and phantom vibration and ringing were also explored. Results Exploratory factor analysis yielded four factors: compulsive behavior, functional impairment, withdrawal and tolerance. Test–retest reliabilities (intraclass correlations = 0.74–0.91) and internal consistency (Cronbach's α = 0.94) were all satisfactory. The four subscales had moderate to high correlations (0.56–0.78), but had no or very low correlation to phantom vibration/ringing syndrome. Conclusion This study provides evidence that the SPAI is a valid and reliable, self-administered screening tool to investigate smartphone addiction. Phantom vibration and ringing might be independent entities of smartphone addiction. PMID:24896252
Measurements of Pollution in the Troposphere (MOPITT) Validation Exercises During Summer 2004 Field Campaigns over North America

NASA Technical Reports Server (NTRS)

Emmons, L. K.; Pfister, G. G.; Edwards, D. P.; Gille, J. C.; Sachse, G.; Blake, D.; Wofsy, S.; Gerbig, C.; Matross, D.; Nedelec, P.

2007-01-01

Measurements of carbon monoxide (CO) made as part of three aircraft experiments during the summer of 2004 over North America have been used for the continued validation of the CO retrievals from the Measurements of Pollution in the Troposphere (MOPITT) instrument on board the Terra satellite. Vertical profiles measured during the NASA INTEX-A campaign, designed to be coincident with MOPITT overpasses, as well as measurements made during the COBRA-2004 and MOZAIC experiments, provided valuable validation comparisons. On average, the MOPITT CO retrievals are biased slightly high for these North America locations. While the mean bias differs between the different aircraft experiments (e.g., 7.0 ppbv for MOZAIC to 18.4 ppbv for COBRA at 700 hPa), the standard deviations are quite large, so the results for the three data sets can be considered consistent. On average, it is estimated that MOPITT is 7- 14% high at 700 hPa and 03% high at 350 hPa. These results are consistent with the validation results for the Carr, Colorado, Harvard Forest, Massachusetts, and Poker Flats, Alaska, aircraft profiles for "phase 2" presented by Emmons et al. (2004) and are generally within the design criteria of 10% accuracy.
Validating dimensions of psychosis symptomatology: Neural correlates and 20-year outcomes.

PubMed

Kotov, Roman; Foti, Dan; Li, Kaiqiao; Bromet, Evelyn J; Hajcak, Greg; Ruggero, Camilo J

2016-11-01

Heterogeneity of psychosis presents significant challenges for classification. Between 2 and 12 symptom dimensions have been proposed, and consensus is lacking. The present study sought to identify uniquely informative models by comparing the validity of these alternatives. An epidemiologic cohort of 628 first-admission inpatients with psychosis was interviewed 6 times over 2 decades and completed an electrophysiological assessment of error processing at year 20. We first analyzed a comprehensive set of 49 symptoms rated by interviewers at baseline, progressively extracting from 1 to 12 factors. Next, we compared the ability of resulting factor solutions to (a) account for concurrent neural dysfunction and (b) predict 20-year role, social, residential, and global functioning, and life satisfaction. A four-factor model showed incremental validity with all outcomes, and more complex models did not improve explanatory power. The 4 dimensions-reality distortion, disorganization, inexpressivity, and apathy/asociality-were replicable in 5 follow-ups, internally consistent, stable across assessments, and showed strong discriminant validity. These results reaffirm the value of separating disorganization and reality distortion, are consistent with recent findings distinguishing inexpressivity and apathy/asociality, and suggest that these 4 dimensions are fundamental to understanding neural abnormalities and long-term outcomes in psychosis. (PsycINFO Database Record (c) 2016 APA, all rights reserved).
A comprehensive approach to psychometric assessment of instruments used in dementia educational interventions for health professionals: a cross-sectional study.

PubMed

Wang, Yao; Xiao, Lily Dongxia; He, Guo-Ping

2015-02-01

Suboptimal care for people with dementia in hospital settings has been reported and is attributed to the lack of knowledge and inadequate attitudes in dementia care among health professionals. Educational interventions have been widely used to improve care outcomes; however, Chinese-language instruments used in dementia educational interventions for health professionals are lacking. The aims of this study were to select, translate and evaluate instruments used in dementia educational interventions for Chinese health professionals in acute-care hospitals. A cross-sectional study design was used. A modified stratified random sampling was used to recruit 442 participants from different levels of hospitals in Changsha, China. Dementia care competence was used as a framework for the selection and evaluation of Alzheimer's Disease Knowledge Scale and Dementia Care Attitudes Scale for health professionals in the study. These two scales were translated into Chinese using forward and back translation method. Content validity, test-retest reliability and internal consistency were assessed. Construct validity was tested using exploratory factor analysis. Known-group validity was established by comparing scores of Alzheimer's Disease Knowledge Scale and Dementia Care Attitudes Scale in two sub-groups. A person-centred care scale was utilised as a gold standard to establish concurrent validity of these two scales. Results demonstrated acceptable content validity, internal consistency, test-retest reliability and concurrent validity. Exploratory factor analysis presented a single-factor structure of the Chinese Alzheimer's Disease Knowledge Scale and a two-factor structure of the Chinese Dementia Care Attitudes Scale, supporting the conceptual dimensions of the original scales. The Chinese Alzheimer's Disease Knowledge Scale and Chinese Dementia Care Attitudes Scale demonstrated known-group validity evidenced by significantly higher scores identified from the sub-group with a longer work experience compared to those in the sub-group with less work experience. The use of dementia care competence as a framework to inform the selection and evaluation of instruments used in dementia educational interventions for health professionals has wide applicability in other areas. The results support that Chinese Alzheimer's Disease Knowledge Scale and Chinese Dementia Care Attitudes Scale are reliable and valid instruments for health professionals to use in acute-care settings. Copyright © 2014 Elsevier Ltd. All rights reserved.
Puzzling with online games (BAM-COG): reliability, validity, and feasibility of an online self-monitor for cognitive performance in aging adults.

PubMed

Aalbers, Teun; Baars, Maria A E; Olde Rikkert, Marcel G M; Kessels, Roy P C

2013-12-03

Online interventions are aiming increasingly at cognitive outcome measures but so far no easy and fast self-monitors for cognition have been validated or proven reliable and feasible. This study examines a new instrument called the Brain Aging Monitor-Cognitive Assessment Battery (BAM-COG) for its alternate forms reliability, face and content validity, and convergent and divergent validity. Also, reference values are provided. The BAM-COG consists of four easily accessible, short, yet challenging puzzle games that have been developed to measure working memory ("Conveyer Belt"), visuospatial short-term memory ("Sunshine"), episodic recognition memory ("Viewpoint"), and planning ("Papyrinth"). A total of 641 participants were recruited for this study. Of these, 397 adults, 40 years and older (mean 54.9, SD 9.6), were eligible for analysis. Study participants played all games three times with 14 days in between sets. Face and content validity were based on expert opinion. Alternate forms reliability (AFR) was measured by comparing scores on different versions of the BAM-COG and expressed with an intraclass correlation (ICC: two-way mixed; consistency at 95%). Convergent validity (CV) was provided by comparing BAM-COG scores to gold-standard paper-and-pencil and computer-assisted cognitive assessment. Divergent validity (DV) was measured by comparing BAM-COG scores to the National Adult Reading Test IQ (NART-IQ) estimate. Both CV and DV are expressed as Spearman rho correlation coefficients. Three out of four games showed adequate results on AFR, CV, and DV measures. The games Conveyer Belt, Sunshine, and Papyrinth have AFR ICCs of .420, .426, and .645 respectively. Also, these games had good to very good CV correlations: rho=.577 (P=.001), rho=.669 (P<.001), and rho=.400 (P=.04), respectively. Last, as expected, DV correlations were low: rho=-.029 (P=.44), rho=-.029 (P=.45), and rho=-.134 (P=.28) respectively. The game Viewpoint provided less desirable results with an AFR ICC of .167, CV rho=.202 (P=.15), and DV rho=-.162 (P=.21). This study provides evidence for the use of the BAM-COG test battery as a feasible, reliable, and valid tool to monitor cognitive performance in healthy adults in an online setting. Three out of four games have good psychometric characteristics to measure working memory, visuospatial short-term memory, and planning capacity.
Refining and validating the Social Interaction Anxiety Scale and the Social Phobia Scale.

PubMed

Carleton, R Nicholas; Collimore, Kelsey C; Asmundson, Gordon J G; McCabe, Randi E; Rowa, Karen; Antony, Martin M

2009-01-01

The Social Interaction Anxiety Scale and Social Phobia Scale are companion measures for assessing symptoms of social anxiety and social phobia. The scales have good reliability and validity across several samples, however, exploratory and confirmatory factor analyses have yielded solutions comprising substantially different item content and factor structures. These discrepancies are likely the result of analyzing items from each scale separately or simultaneously. The current investigation sets out to assess items from those scales, both simultaneously and separately, using exploratory and confirmatory factor analyses in an effort to resolve the factor structure. Participants consisted of a clinical sample (n 5353; 54% women) and an undergraduate sample (n 5317; 75% women) who completed the Social Interaction Anxiety Scale and Social Phobia Scale, along with additional fear-related measures to assess convergent and discriminant validity. A three-factor solution with a reduced set of items was found to be most stable, irrespective of whether the items from each scale are assessed together or separately. Items from the Social Interaction Anxiety Scale represented one factor, whereas items from the Social Phobia Scale represented two other factors. Initial support for scale and factor validity, along with implications and recommendations for future research, is provided. (c) 2009 Wiley-Liss, Inc.
Development and initial validation of the appropriate antibiotic use self-efficacy scale.

PubMed

Hill, Erin M; Watkins, Kaitlin

2018-06-04

While there are various medication self-efficacy scales that exist, none assess self-efficacy for appropriate antibiotic use. The Appropriate Antibiotic Use Self-Efficacy Scale (AAUSES) was developed, pilot tested, and its psychometric properties were examined. Following pilot testing of the scale, a 28-item questionnaire was examined using a sample (n = 289) recruited through the Amazon Mechanical Turk platform. Participants also completed other scales and items, which were used in assessing discriminant, convergent, and criterion-related validity. Test-retest reliability was also examined. After examining the scale and removing items that did not assess appropriate antibiotic use, an exploratory factor analysis was conducted on 13 items from the original scale. Three factors were retained that explained 65.51% of the variance. The scale and its subscales had adequate internal consistency. The scale had excellent test-retest reliability, as well as demonstrated convergent, discriminant, and criterion-related validity. The AAUSES is a valid and reliable scale that assesses three domains of appropriate antibiotic use self-efficacy. The AAUSES may have utility in clinical and research settings in understanding individuals' beliefs about appropriate antibiotic use and related behavioral correlates. Future research is needed to examine the scale's utility in these settings. Copyright © 2018 Elsevier B.V. All rights reserved.
Predicting human skin absorption of chemicals: development of a novel quantitative structure activity relationship.

PubMed

Luo, Wen; Medrek, Sarah; Misra, Jatin; Nohynek, Gerhard J

2007-02-01

The objective of this study was to construct and validate a quantitative structure-activity relationship model for skin absorption. Such models are valuable tools for screening and prioritization in safety and efficacy evaluation, and risk assessment of drugs and chemicals. A database of 340 chemicals with percutaneous absorption was assembled. Two models were derived from the training set consisting 306 chemicals (90/10 random split). In addition to the experimental K(ow) values, over 300 2D and 3D atomic and molecular descriptors were analyzed using MDL's QsarIS computer program. Subsequently, the models were validated using both internal (leave-one-out) and external validation (test set) procedures. Using the stepwise regression analysis, three molecular descriptors were determined to have significant statistical correlation with K(p) (R2 = 0.8225): logK(ow), X0 (quantification of both molecular size and the degree of skeletal branching), and SsssCH (count of aromatic carbon groups). In conclusion, two models to estimate skin absorption were developed. When compared to other skin absorption QSAR models in the literature, our model incorporated more chemicals and explored a large number of descriptors. Additionally, our models are reasonably predictive and have met both internal and external statistical validations.
Development of a gridded meteorological dataset over Java island, Indonesia 1985-2014.

PubMed

Yanto; Livneh, Ben; Rajagopalan, Balaji

2017-05-23

We describe a gridded daily meteorology dataset consisting of precipitation, minimum and maximum temperature over Java Island, Indonesia at 0.125°×0.125° (~14 km) resolution spanning 30 years from 1985-2014. Importantly, this data set represents a marked improvement from existing gridded data sets over Java with higher spatial resolution, derived exclusively from ground-based observations unlike existing satellite or reanalysis-based products. Gap-infilling and gridding were performed via the Inverse Distance Weighting (IDW) interpolation method (radius, r, of 25 km and power of influence, α, of 3 as optimal parameters) restricted to only those stations including at least 3,650 days (~10 years) of valid data. We employed MSWEP and CHIRPS rainfall products in the cross-validation. It shows that the gridded rainfall presented here produces the most reasonable performance. Visual inspection reveals an increasing performance of gridded precipitation from grid, watershed to island scale. The data set, stored in a network common data form (NetCDF), is intended to support watershed-scale and island-scale studies of short-term and long-term climate, hydrology and ecology.
Development and community-based validation of the IDEA study Instrumental Activities of Daily Living (IDEA-IADL) questionnaire

PubMed Central

Collingwood, Cecilia; Paddick, Stella-Maria; Kisoli, Aloyce; Dotchin, Catherine L.; Gray, William K.; Mbowe, Godfrey; Mkenda, Sarah; Urasa, Sarah; Mushi, Declare; Chaote, Paul; Walker, Richard W.

2014-01-01

Background The dementia diagnosis gap in sub-Saharan Africa (SSA) is large, partly due to difficulties in assessing function, an essential step in diagnosis. Objectives As part of the Identification and Intervention for Dementia in Elderly Africans (IDEA) study, to develop, pilot, and validate an Instrumental Activities of Daily Living (IADL) questionnaire for use in a rural Tanzanian population to assist in the identification of people with dementia alongside cognitive screening. Design The questionnaire was developed at a workshop for rural primary healthcare workers, based on culturally appropriate roles and usual activities of elderly people in this community. It was piloted in 52 individuals under follow-up from a dementia prevalence study. Validation subsequently took place during a community dementia-screening programme. Construct validation against gold standard clinical dementia diagnosis using DSM-IV criteria was carried out on a stratified sample of the cohort and validity assessed using area under the receiver operating characteristic (AUROC) curve analysis. Results An 11-item questionnaire (IDEA-IADL) was developed after pilot testing. During formal validation on 130 community-dwelling elderly people who presented for screening, the AUROC curve was 0.896 for DSM-IV dementia when used in isolation and 0.937 when used in conjunction with the IDEA cognitive screen, previously validated in Tanzania. The internal consistency was 0.959. Performance on the IDEA-IADL was not biased with regard to age, gender or education level. Conclusions The IDEA-IADL questionnaire appears to be a useful aid to dementia screening in this setting. Further validation in other healthcare settings in SSA is required. PMID:25537940
Assessing the accuracy and stability of variable selection methods for random forest modeling in ecology.

PubMed

Fox, Eric W; Hill, Ryan A; Leibowitz, Scott G; Olsen, Anthony R; Thornbrugh, Darren J; Weber, Marc H

2017-07-01

Random forest (RF) modeling has emerged as an important statistical learning method in ecology due to its exceptional predictive performance. However, for large and complex ecological data sets, there is limited guidance on variable selection methods for RF modeling. Typically, either a preselected set of predictor variables are used or stepwise procedures are employed which iteratively remove variables according to their importance measures. This paper investigates the application of variable selection methods to RF models for predicting probable biological stream condition. Our motivating data set consists of the good/poor condition of n = 1365 stream survey sites from the 2008/2009 National Rivers and Stream Assessment, and a large set (p = 212) of landscape features from the StreamCat data set as potential predictors. We compare two types of RF models: a full variable set model with all 212 predictors and a reduced variable set model selected using a backward elimination approach. We assess model accuracy using RF's internal out-of-bag estimate, and a cross-validation procedure with validation folds external to the variable selection process. We also assess the stability of the spatial predictions generated by the RF models to changes in the number of predictors and argue that model selection needs to consider both accuracy and stability. The results suggest that RF modeling is robust to the inclusion of many variables of moderate to low importance. We found no substantial improvement in cross-validated accuracy as a result of variable reduction. Moreover, the backward elimination procedure tended to select too few variables and exhibited numerous issues such as upwardly biased out-of-bag accuracy estimates and instabilities in the spatial predictions. We use simulations to further support and generalize results from the analysis of real data. A main purpose of this work is to elucidate issues of model selection bias and instability to ecologists interested in using RF to develop predictive models with large environmental data sets.
Procedure-specific assessment tool for flexible pharyngo-laryngoscopy: gathering validity evidence and setting pass-fail standards.

PubMed

Melchiors, Jacob; Petersen, K; Todsen, T; Bohr, A; Konge, Lars; von Buchwald, Christian

2018-06-01

The attainment of specific identifiable competencies is the primary measure of progress in the modern medical education system. The system, therefore, requires a method for accurately assessing competence to be feasible. Evidence of validity needs to be gathered before an assessment tool can be implemented in the training and assessment of physicians. This evidence of validity must according to the contemporary theory on validity be gathered from specific sources in a structured and rigorous manner. The flexible pharyngo-laryngoscopy (FPL) is central to the otorhinolaryngologist. We aim to evaluate the flexible pharyngo-laryngoscopy assessment tool (FLEXPAT) created in a previous study and to establish a pass-fail level for proficiency. Eighteen physicians with different levels of experience (novices, intermediates, and experienced) were recruited to the study. Each performed an FPL on two patients. These procedures were video recorded, blinded, and assessed by two specialists. The score was expressed as the percentage of a possible max score. Cronbach's α was used to analyze internal consistency of the data, and a generalizability analysis was performed. The scores of the three different groups were explored, and a pass-fail level was determined using the contrasting groups' standard setting method. Internal consistency was strong with a Cronbach's α of 0.86. We found a generalizability coefficient of 0.72 sufficient for moderate stakes assessment. We found a significant difference between the novice and experienced groups (p < 0.001) and strong correlation between experience and score (Pearson's r = 0.75). The pass/fail level was established at 72% of the maximum score. Applying this pass-fail level in the test population resulted in half of the intermediary group receiving a failing score. We gathered validity evidence for the FLEXPAT according to the contemporary framework as described by Messick. Our results support a claim of validity and are comparable to other studies exploring clinical assessment tools. The high rate of physicians underperforming in the intermediary group demonstrates the need for continued educational intervention. Based on our work, we recommend the use of the FLEXPAT in clinical assessment of FPL and the application of a pass-fail level of 72% for proficiency.
Evidence of Validity for the Japanese Version of the Foot and Ankle Ability Measure

PubMed Central

Uematsu, Daisuke; Suzuki, Hidetomo; Sasaki, Shogo; Nagano, Yasuharu; Shinozuka, Nobuyuki; Sunagawa, Norihiko; Fukubayashi, Toru

2015-01-01

Context: The Foot and Ankle Ability Measure (FAAM) is a valid, reliable, and self-reported outcome instrument for the foot and ankle region. Objective: To provide evidence for translation, cross-cultural adaptation, validity, and reliability of the Japanese version of the FAAM (FAAM-J). Design: Cross-sectional study. Setting: Collegiate athletic training/sports medicine clinical setting. Patients or Other Participants: Eighty-three collegiate athletes. Main Outcome Measure(s): All participants completed the Activities of Daily Living and Sports subscales of the FAAM-J and the Physical Functioning and Mental Health subscales of the Japanese version of the Short Form-36v2 (SF-36). Also, 19 participants (23%) whose conditions were expected to be stable completed another FAAM-J 2 to 6 days later for test-retest reliability. We analyzed the scores of those subscales for convergent and divergent validity, internal consistency, and test-retest reliability. Results: The Activities of Daily Living and Sports subscales of the FAAM-J had correlation coefficients of 0.86 and 0.75, respectively, with the Physical Functioning section of the SF-36 for convergent validity. For divergent validity, the correlation coefficients with Mental Health of the SF-36 were 0.29 and 0.27 for each subscale, respectively. Cronbach α for internal consistency was 0.99 for the Activities of Daily Living and 0.98 for the Sports subscale. A 95% confidence interval with a single measure was ±8.1 and ±14.0 points for each subscale. The test-retest reliability measures revealed intraclass correlation coefficient values of 0.87 for the Activities of Daily Living and 0.91 for the Sports subscales with minimal detectable changes of ±6.8 and ±13.7 for the respective subscales. Conclusions: The FAAM was successfully translated for a Japanese version, and the FAAM-J was adapted cross-culturally. Thus, the FAAM-J can be used as a self-reported outcome measure for Japanese-speaking individuals; however, the scores must be interpreted with caution, especially when applied to different populations and other types of injury than those included in this study. PMID:25310247
Estimation model for habitual 24-hour urinary-sodium excretion using simple questionnaires from normotensive Koreans.

PubMed

Kong, Ji-Sook; Lee, Yeon-Kyung; Kim, Mi Kyung; Choi, Mi-Kyeong; Heo, Young-Ran; Hyun, Taisun; Kim, Sun Mee; Lyu, Eun-Soon; Oh, Se-Young; Park, Hae-Ryun; Rhee, Moo-Yong; Ro, Hee-Kyong; Song, Mi Kyung

2018-01-01

This study was conducted to develop an equation for estimation of 24-h urinary-sodium excretion that can serve as an alternative to 24-h dietary recall and 24-h urine collection for normotensive Korean adults. In total, data on 640 healthy Korean adults aged 19 to 69 years from 4 regions of the country were collected as a training set. In order to externally validate the equation developed from that training set, 200 subjects were recruited independently as a validation set. Due to heterogeneity by gender, we constructed a gender-specific equation for estimation of 24-h urinary-sodium excretion by using a multivariable linear regression model and assessed the performance of the developed equation in validation set. The best model consisted of age, body weight, dietary behavior ('eating salty food', 'Kimchi consumption', 'Korean soup or stew consumption', 'soy sauce or red pepper paste consumption'), and smoking status in men, and age, body weight, dietary behavior ('salt preference', 'eating salty food', 'checking sodium content for processed foods', 'nut consumption'), and smoking status in women, respectively. When this model was tested in the external validation set, the mean bias between the measured and estimated 24-h urinary-sodium excretion from Bland-Altman plots was -1.92 (95% CI: -113, 110) mmol/d for men and -1.51 (95% CI: -90.6, 87.6) mmol/d for women. The cut-points of sodium intake calculated based on the equations were ≥4,000 mg/d for men and ≥3,500 mg/d for women, with 89.8 and 76.6% sensitivity and 29.3 and 64.2% specificity, respectively. In this study, a habitual 24-hour urinary-sodium-excretion-estimation model of normotensive Korean adults based on anthropometric and lifestyle factors was developed and showed feasibility for an asymptomatic population.
Estimation model for habitual 24-hour urinary-sodium excretion using simple questionnaires from normotensive Koreans

PubMed Central

Choi, Mi-Kyeong; Heo, Young-Ran; Hyun, Taisun; Kim, Sun Mee; Lyu, Eun-Soon; Oh, Se-Young; Park, Hae-Ryun; Rhee, Moo-Yong; Ro, Hee-Kyong; Song, Mi Kyung

2018-01-01

This study was conducted to develop an equation for estimation of 24-h urinary-sodium excretion that can serve as an alternative to 24-h dietary recall and 24-h urine collection for normotensive Korean adults. In total, data on 640 healthy Korean adults aged 19 to 69 years from 4 regions of the country were collected as a training set. In order to externally validate the equation developed from that training set, 200 subjects were recruited independently as a validation set. Due to heterogeneity by gender, we constructed a gender-specific equation for estimation of 24-h urinary-sodium excretion by using a multivariable linear regression model and assessed the performance of the developed equation in validation set. The best model consisted of age, body weight, dietary behavior (‘eating salty food’, ‘Kimchi consumption’, ‘Korean soup or stew consumption’, ‘soy sauce or red pepper paste consumption’), and smoking status in men, and age, body weight, dietary behavior (‘salt preference’, ‘eating salty food’, ‘checking sodium content for processed foods’, ‘nut consumption’), and smoking status in women, respectively. When this model was tested in the external validation set, the mean bias between the measured and estimated 24-h urinary-sodium excretion from Bland-Altman plots was -1.92 (95% CI: -113, 110) mmol/d for men and -1.51 (95% CI: -90.6, 87.6) mmol/d for women. The cut-points of sodium intake calculated based on the equations were ≥4,000 mg/d for men and ≥3,500 mg/d for women, with 89.8 and 76.6% sensitivity and 29.3 and 64.2% specificity, respectively. In this study, a habitual 24-hour urinary-sodium-excretion-estimation model of normotensive Korean adults based on anthropometric and lifestyle factors was developed and showed feasibility for an asymptomatic population. PMID:29447201
[Assessment of Work Engagement in Patients with Hematological Malignancies: Psychometric Properties of the German Version of the Utrecht Work Engagement Scale 9 (UWES-9)].

PubMed

Sautier, L P; Scherwath, A; Weis, J; Sarkar, S; Bosbach, M; Schendel, M; Ladehoff, N; Koch, U; Mehnert, A

2015-10-01

Our purpose was the psychometric evaluation of the German version of the Utrecht Work Engagement Scale-9 (UWES-9), a self-assessment tool measuring work-related resources consisting of 9 items. Based on a sample of 179 patients with hematological malignancies in in-patient and rehabilitative oncological settings, we tested the dimensional structure by confirmatory and explorative factor analysis. We further evaluated reliability, item characteristics, and construct validity of the UWES-9. The confirmatory factor analysis showed acceptable fit for both a 1-dimensional factor structure and the original 3-factor model. Based on an explorative principal component analysis, we were able to replicate the 1-dimensional factor accounting for 67% of the total variance and showing very high internal consistency (α=0.94) and high factor loads (0.73-0.88). The construct validity was further supported by significant positive correlations between work engagement and meaning of work, corporate feeling, commitment to the workplace, and job satisfaction. The German version of the UWES-9 shows good psychometric qualities in measuring dedication to work in patients with hematological malignancies in in-patient and rehabilitative oncological settings. © Georg Thieme Verlag KG Stuttgart · New York.
Least-Squares Regression and Spectral Residual Augmented Classical Least-Squares Chemometric Models for Stability-Indicating Analysis of Agomelatine and Its Degradation Products: A Comparative Study.

PubMed

Naguib, Ibrahim A; Abdelrahman, Maha M; El Ghobashy, Mohamed R; Ali, Nesma A

2016-01-01

Two accurate, sensitive, and selective stability-indicating methods are developed and validated for simultaneous quantitative determination of agomelatine (AGM) and its forced degradation products (Deg I and Deg II), whether in pure forms or in pharmaceutical formulations. Partial least-squares regression (PLSR) and spectral residual augmented classical least-squares (SRACLS) are two chemometric models that are being subjected to a comparative study through handling UV spectral data in range (215-350 nm). For proper analysis, a three-factor, four-level experimental design was established, resulting in a training set consisting of 16 mixtures containing different ratios of interfering species. An independent test set consisting of eight mixtures was used to validate the prediction ability of the suggested models. The results presented indicate the ability of mentioned multivariate calibration models to analyze AGM, Deg I, and Deg II with high selectivity and accuracy. The analysis results of the pharmaceutical formulations were statistically compared to the reference HPLC method, with no significant differences observed regarding accuracy and precision. The SRACLS model gives comparable results to the PLSR model; however, it keeps the qualitative spectral information of the classical least-squares algorithm for analyzed components.

Derivation & validation of glycosylated haemoglobin (HbA1c) cut-off value as a diagnostic test for type 2 diabetes in south Indian population

PubMed Central

Mohan, Alladi; Reddy, S. Aparna; Sachan, Alok; Sarma, K.V.S.; Kumar, D. Prabath; Panchagnula, Mahesh V.; Rao, P.V.L.N. Srinivasa; Kumar, B. Siddhartha; Krishnaprasanthi, P.

2016-01-01

Background & Objectives: Glycosylated haemoglobin (HbA1c) has been in use for more than a decade, as a diagnostic test for type 2 diabetes. Validity of HbA1c needs to be established in the ethnic population in which it is intended to be used. The objective of this study was to derive and validate a HbA1c cut-off value for the diagnosis of type 2 diabetes in the ethnic population of Rayalaseema area of south India. Methods: In this cross-sectional study, consecutive patients suspected to have type 2 diabetes underwent fasting plasma glucose (FPG) and 2 h post-load plasma glucose (2 h-PG) measurements after a 75 g glucose load and HbA1c estimation. They were classified as having diabetes as per the American Diabetes Association criteria [(FPG ≥7 mmol/l (≥126 mg/dl) and/or 2 h-PG ≥11.1 mmol/l (≥200 mg/dl)]. In the training data set (n = 342), optimum cut-off value of HbA1c for defining type 2 diabetes was derived by receiver-operator characteristic (ROC) curve method using oral glucose tolerance test results as gold standard. This cut-off was validated in a validation data set (n = 341). Results: On applying HbA1c cut-off value of >6.3 per cent (45 mmol/mol) to the training data set, sensitivity, specificity, positive predictive value (PPV) and negative predictive value (NPV) for diagnosing type 2 diabetes were calculated to be 90.6, 85.2, 80.8 and 93.0 per cent, respectively. When the same cut-off value was applied to the validation data set, sensitivity, specificity, PPV and NPV were 88.8, 81.9, 74.0 and 92.7 per cent, respectively, although the latter were consistently smaller than the proportions for the training data set, the differences being not significant. Interpretation & conclusions: HbA1c >6.3 per cent (45 mmol/mol) appears to be the optimal cut-off value for the diagnosis of type 2 diabetes applicable to the ethnic population of Rayalaseema area of Andhra Pradesh state in south India. PMID:27934801
Clinical Significance of Tissue Factor Pathway Inhibitor 2, a Serum Biomarker Candidate for Ovarian Clear Cell Carcinoma

PubMed Central

Arakawa, Noriaki; Kobayashi, Hiroshi; Yonemoto, Naohiro; Masuishi, Yusuke; Ino, Yoko; Shigetomi, Hiroshi; Furukawa, Naoto; Ohtake, Norihisa; Miyagi, Yohei; Hirahara, Fumiki; Hirano, Hisashi; Miyagi, Etsuko

2016-01-01

Background There is currently no reliable serum biomarker for ovarian clear cell carcinoma (CCC), a highly lethal histological subtype of epithelial ovarian cancer (EOC). Previously, using a proteome-based approach, we identified tissue factor pathway inhibitor 2 (TFPI2) as a candidate serum biomarker for CCC. In this study, we sought to evaluate the clinical diagnostic performance of TFPI2 in preoperative prediction of CCC. Methods Serum TFPI2 levels were measured in serum samples from a retrospective training set consisting of patients with benign and borderline ovarian tumors, EOC subtypes, and uterine diseases. Via receiver operating characteristic (ROC) analyses, we compared the diagnostic performance of TFPI2 with that of CA125 in discrimination of patients with ovarian CCC from other patient groups. The observed diagnostic performances were examined in a prospective validation set. Results The 268-patient training set included 29 patients with ovarian CCC. Unlike CA125, which was also elevated in patients with endometriosis and several EOC subtypes, serum TFPI2 levels were specifically elevated only in ovarian CCC patients, consistent with the mRNA expression pattern in tumor tissues. The area under the ROC curve (AUC) of serum TFPI2 was obviously higher than that of CA125 for discrimination of CCC from other ovarian diseases (AUC = 0.891 versus 0.595). Applying a cut-off value of 280 pg/mL, TFPI2 could distinguish early-stage (FIGO I and II) CCC from endometriosis with 72.2% sensitivity, 93.3% specificity, and 88.8% accuracy. Similar results were confirmed in an independent 156-patient prospective validation set. Conclusions TFPI2 is a useful serum biomarker for preoperative clinical diagnosis of CCC. PMID:27798689
Clinical Significance of Tissue Factor Pathway Inhibitor 2, a Serum Biomarker Candidate for Ovarian Clear Cell Carcinoma.

PubMed

Arakawa, Noriaki; Kobayashi, Hiroshi; Yonemoto, Naohiro; Masuishi, Yusuke; Ino, Yoko; Shigetomi, Hiroshi; Furukawa, Naoto; Ohtake, Norihisa; Miyagi, Yohei; Hirahara, Fumiki; Hirano, Hisashi; Miyagi, Etsuko

2016-01-01

There is currently no reliable serum biomarker for ovarian clear cell carcinoma (CCC), a highly lethal histological subtype of epithelial ovarian cancer (EOC). Previously, using a proteome-based approach, we identified tissue factor pathway inhibitor 2 (TFPI2) as a candidate serum biomarker for CCC. In this study, we sought to evaluate the clinical diagnostic performance of TFPI2 in preoperative prediction of CCC. Serum TFPI2 levels were measured in serum samples from a retrospective training set consisting of patients with benign and borderline ovarian tumors, EOC subtypes, and uterine diseases. Via receiver operating characteristic (ROC) analyses, we compared the diagnostic performance of TFPI2 with that of CA125 in discrimination of patients with ovarian CCC from other patient groups. The observed diagnostic performances were examined in a prospective validation set. The 268-patient training set included 29 patients with ovarian CCC. Unlike CA125, which was also elevated in patients with endometriosis and several EOC subtypes, serum TFPI2 levels were specifically elevated only in ovarian CCC patients, consistent with the mRNA expression pattern in tumor tissues. The area under the ROC curve (AUC) of serum TFPI2 was obviously higher than that of CA125 for discrimination of CCC from other ovarian diseases (AUC = 0.891 versus 0.595). Applying a cut-off value of 280 pg/mL, TFPI2 could distinguish early-stage (FIGO I and II) CCC from endometriosis with 72.2% sensitivity, 93.3% specificity, and 88.8% accuracy. Similar results were confirmed in an independent 156-patient prospective validation set. TFPI2 is a useful serum biomarker for preoperative clinical diagnosis of CCC.
The SPAI-18, a brief version of the social phobia and anxiety inventory: reliability and validity in clinically referred and non-referred samples.

PubMed

de Vente, Wieke; Majdandžić, Mirjana; Voncken, Marisol J; Beidel, Deborah C; Bögels, Susan M

2014-03-01

We developed a new version of the Social Phobia and Anxiety Inventory (SPAI) in order to have a brief instrument for measuring social anxiety and social anxiety disorder (SAD) with a strong conceptual foundation. In the construction phase, a set of items representing 5 core aspects of social anxiety was selected by a panel of social anxiety experts. The selected item pool was validated using factor analysis, reliability analysis, and diagnostic analysis in a sample of healthy participants (N = 188) and a sample of clinically referred participants diagnosed with SAD (N = 98). This procedure resulted in an abbreviated version of the Social Phobia Subscale of the SPAI consisting of 18 items (i.e. the SPAI-18), which correlated strongly with the Social Phobia Subscale of the original SPAI (both groups r = .98). Internal consistency and diagnostic characteristics using a clinical cut-off score > 48 were good to excellent (Cronbach's alpha healthy group = .93; patient group = .91; sensitivity: .94; specificity: .88). The SPAI-18 was further validated in a community sample of parents-to-be without SAD (N = 237) and with SAD (N = 65). Internal consistency was again excellent (both groups Cronbach's alpha = .93) and a screening cut-off of > 36 proved to result in good sensitivity and specificity. The SPAI-18 also correlated strongly with other social anxiety instruments, supporting convergent validity. In sum, the SPAI-18 is a psychometrically sound instrument with good screening capacity for social anxiety disorder in clinical as well as community samples. Copyright © 2013 Elsevier Ltd. All rights reserved.
A Novel Tool Improves Existing Estimates of Recent Tuberculosis Transmission in Settings of Sparse Data Collection.

PubMed

Kasaie, Parastu; Mathema, Barun; Kelton, W David; Azman, Andrew S; Pennington, Jeff; Dowdy, David W

2015-01-01

In any setting, a proportion of incident active tuberculosis (TB) reflects recent transmission ("recent transmission proportion"), whereas the remainder represents reactivation. Appropriately estimating the recent transmission proportion has important implications for local TB control, but existing approaches have known biases, especially where data are incomplete. We constructed a stochastic individual-based model of a TB epidemic and designed a set of simulations (derivation set) to develop two regression-based tools for estimating the recent transmission proportion from five inputs: underlying TB incidence, sampling coverage, study duration, clustered proportion of observed cases, and proportion of observed clusters in the sample. We tested these tools on a set of unrelated simulations (validation set), and compared their performance against that of the traditional 'n-1' approach. In the validation set, the regression tools reduced the absolute estimation bias (difference between estimated and true recent transmission proportion) in the 'n-1' technique by a median [interquartile range] of 60% [9%, 82%] and 69% [30%, 87%]. The bias in the 'n-1' model was highly sensitive to underlying levels of study coverage and duration, and substantially underestimated the recent transmission proportion in settings of incomplete data coverage. By contrast, the regression models' performance was more consistent across different epidemiological settings and study characteristics. We provide one of these regression models as a user-friendly, web-based tool. Novel tools can improve our ability to estimate the recent TB transmission proportion from data that are observable (or estimable) by public health practitioners with limited available molecular data.
A Novel Tool Improves Existing Estimates of Recent Tuberculosis Transmission in Settings of Sparse Data Collection

PubMed Central

Kasaie, Parastu; Mathema, Barun; Kelton, W. David; Azman, Andrew S.; Pennington, Jeff; Dowdy, David W.

2015-01-01

In any setting, a proportion of incident active tuberculosis (TB) reflects recent transmission (“recent transmission proportion”), whereas the remainder represents reactivation. Appropriately estimating the recent transmission proportion has important implications for local TB control, but existing approaches have known biases, especially where data are incomplete. We constructed a stochastic individual-based model of a TB epidemic and designed a set of simulations (derivation set) to develop two regression-based tools for estimating the recent transmission proportion from five inputs: underlying TB incidence, sampling coverage, study duration, clustered proportion of observed cases, and proportion of observed clusters in the sample. We tested these tools on a set of unrelated simulations (validation set), and compared their performance against that of the traditional ‘n-1’ approach. In the validation set, the regression tools reduced the absolute estimation bias (difference between estimated and true recent transmission proportion) in the ‘n-1’ technique by a median [interquartile range] of 60% [9%, 82%] and 69% [30%, 87%]. The bias in the ‘n-1’ model was highly sensitive to underlying levels of study coverage and duration, and substantially underestimated the recent transmission proportion in settings of incomplete data coverage. By contrast, the regression models’ performance was more consistent across different epidemiological settings and study characteristics. We provide one of these regression models as a user-friendly, web-based tool. Novel tools can improve our ability to estimate the recent TB transmission proportion from data that are observable (or estimable) by public health practitioners with limited available molecular data. PMID:26679499
Mapping health outcome measures from a stroke registry to EQ-5D weights.

PubMed

Ghatnekar, Ola; Eriksson, Marie; Glader, Eva-Lotta

2013-03-07

To map health outcome related variables from a national register, not part of any validated instrument, with EQ-5D weights among stroke patients. We used two cross-sectional data sets including patient characteristics, outcome variables and EQ-5D weights from the national Swedish stroke register. Three regression techniques were used on the estimation set (n=272): ordinary least squares (OLS), Tobit, and censored least absolute deviation (CLAD). The regression coefficients for "dressing", "toileting", "mobility", "mood", "general health" and "proxy-responders" were applied to the validation set (n=272), and the performance was analysed with mean absolute error (MAE) and mean square error (MSE). The number of statistically significant coefficients varied by model, but all models generated consistent coefficients in terms of sign. Mean utility was underestimated in all models (least in OLS) and with lower variation (least in OLS) compared to the observed. The maximum attainable EQ-5D weight ranged from 0.90 (OLS) to 1.00 (Tobit and CLAD). Health states with utility weights <0.5 had greater errors than those with weights ≥ 0.5 (P<0.01). This study indicates that it is possible to map non-validated health outcome measures from a stroke register into preference-based utilities to study the development of stroke care over time, and to compare with other conditions in terms of utility.
Evidence-Based Diagnostic Algorithm for Glioma: Analysis of the Results of Pathology Panel Review and Molecular Parameters of EORTC 26951 and 26882 Trials.

PubMed

Kros, Johan M; Huizer, Karin; Hernández-Laín, Aurelio; Marucci, Gianluca; Michotte, Alex; Pollo, Bianca; Rushing, Elisabeth J; Ribalta, Teresa; French, Pim; Jaminé, David; Bekka, Nawal; Lacombe, Denis; van den Bent, Martin J; Gorlia, Thierry

2015-06-10

With the rapid discovery of prognostic and predictive molecular parameters for glioma, the status of histopathology in the diagnostic process should be scrutinized. Our project aimed to construct a diagnostic algorithm for gliomas based on molecular and histologic parameters with independent prognostic values. The pathology slides of 636 patients with gliomas who had been included in EORTC 26951 and 26882 trials were reviewed using virtual microscopy by a panel of six neuropathologists who independently scored 18 histologic features and provided an overall diagnosis. The molecular data for IDH1, 1p/19q loss, EGFR amplification, loss of chromosome 10 and chromosome arm 10q, gain of chromosome 7, and hypermethylation of the promoter of MGMT were available for some of the cases. The slides were divided in discovery (n = 426) and validation sets (n = 210). The diagnostic algorithm resulting from analysis of the discovery set was validated in the latter. In 66% of cases, consensus of overall diagnosis was present. A diagnostic algorithm consisting of two molecular markers and one consensus histologic feature was created by conditional inference tree analysis. The order of prognostic significance was: 1p/19q loss, EGFR amplification, and astrocytic morphology, which resulted in the identification of four diagnostic nodes. Validation of the nodes in the validation set confirmed the prognostic value (P < .001). We succeeded in the creation of a timely diagnostic algorithm for anaplastic glioma based on multivariable analysis of consensus histopathology and molecular parameters. © 2015 by American Society of Clinical Oncology.
Greek cultural adaption and validation of the Kujala anterior knee pain scale in patients with patellofemoral pain syndrome.

PubMed

Papadopoulos, Costas; Constantinou, Antonis; Cheimonidou, Areti-Zoi; Stasinopoulos, Dimitrios

2017-04-01

To cross-culturally adapt and validate the Greek version of the Kujala anterior knee pain scale (KAKPS). The Greek KAKPS was translated from the original English version following standard forward and backward translation procedures. The survey was then conducted in clinical settings by a questionnaire comprising the Greek KAKPS and patellofemoral pain syndrome (PFPS) severity scale. A total of 130 (62 women and 68 men) Greek-reading patients between 18 and 45 years old with anterior knee pain (AKP) for at least four weeks were recruited from physical therapy clinics. To establish test-retest reliability, the patients were asked to complete the KAKPS at initial visit and 2-3 days after the initial visit. The Greek version of the PFPS severity scale was also administered once at initial visit. Internal consistency of the translated instrument was measured using Cronbach's α. An intraclass correlation coefficient was used to assess the test-retest reliability of the KAKPS. Concurrent validity was measured by correlating the KAKPS with the PFPS severity scale using Pearson's correlation coefficient. The results showed that the Greek KAKPS has good internal consistency (Cronbach's α = 0.942), test-retest reliability (ICC = 0.921) and concurrent validity (r > 0.7). This study has shown that the Greek KAKPS has good internal consistency, test-retest reliability and concurrent validity when correlated with the PFPS severity scale in adult patients with AKP for at least four weeks. Implications for rehabilitation The Greek version of the KAKPS has been found to be reliable and valid when used in adult patients with AKP for at least four weeks. The results of the psychometric characteristics were compatible with those of the original English version. The KAKPS could be applied in a Greek-speaking population to assess functional limitations and symptoms in patients aged 18-45 years old with AKP for at least four weeks.
Measuring quality of life in dyspeptic patients: development and validation of a new specific health status questionnaire: final report from the Italian QPD project involving 4000 patients.

PubMed

Bamfi, F; Olivieri, A; Arpinelli, F; De Carli, G; Recchia, G; Gandolfi, L; Norberto, L; Pacini, F; Surrenti, C; Irvine, S H; Apolone, G

1999-03-01

Despite the fact that gastrointestinal disorders represent one of the most common reasons for medical consultations, formal assessment of patients' health-related quality of life (HRQOL) has been carried out only in a few studies, and in most cases generic questionnaires have been adopted. Because the specific issue of living with dyspeptic problems has been addressed in very few cases and no questionnaire has been shown to be appropriate for the Italian setting, a prospective project was launched to develop a specific HRQOL questionnaire for dyspepsia sufferers tailored to Italian patients but also appropriate in other cultural settings. The project consisted in a 3-yr, three-phase survey, in which different versions of the quality of life in peptic disease questionnaire (QPD) were developed through expert and patient focus groups and empiric field studies and then administered to patients recruited in five multicenter studies. Standard psychometric techniques were used to evaluate the validity, reliability, responsiveness, and patient acceptability of the QPD. Three different versions of the QPD questionnaire were self-administered to more than 4000 patients. The final 30-item version, measuring three health concepts related to dyspeptic disease (anxiety induced by pain, social restriction, symptom perception), fulfilled the recommended psychometric criteria in terms of reliability and validity, correlated with health concepts measured with a well-known independent generic HRQOL instrument (the SF-36 Health Survey questionnaire) and was relatively invariant to diagnosis and sociodemographic variables; it also correlated with a measure of gastric pain frequency and was able to detect meaningful differences over time. Although further validation studies in different cultural and linguistic settings are mandatory before any firm conclusions can be drawn regarding the cross-cultural validity of the QPD, the data obtained provide evidence of the psychometric validity and robustness of the questionnaire when used in a fairly large, well-characterized population of Italian dyspeptic patients.
Measuring theory of mind in children. Psychometric properties of the ToM Storybooks.

PubMed

Blijd-Hoogewys, E M A; van Geert, P L C; Serra, M; Minderaa, R B

2008-11-01

Although research on Theory-of-Mind (ToM) is often based on single task measurements, more comprehensive instruments result in a better understanding of ToM development. The ToM Storybooks is a new instrument measuring basic ToM-functioning and associated aspects. There are 34 tasks, tapping various emotions, beliefs, desires and mental-physical distinctions. Four studies on the validity and reliability of the test are presented, in typically developing children (n = 324, 3-12 years) and children with PDD-NOS (n = 30). The ToM Storybooks have good psychometric qualities. A component analysis reveals five components corresponding with the underlying theoretical constructs. The internal consistency, test-retest reliability, inter-rater reliability, construct validity and convergent validity are good. The ToM Storybooks can be used in research as well as in clinical settings.
The clinical nurse specialist in an Irish hospital.

PubMed

Wickham, Sheelagh

2011-01-01

This study was set in an acute Irish health care setting and aimed to explore the activity of the clinical nurse specialist (CNS) in this setting. Quantitative methodology, using a valid and reliable questionnaire, provided descriptive statistics that gave accurate data on the total population of CNSs in the health care setting. The study was set in an acute-care 750-bed hospital that had 25 CNSs in practice. The sample consisted of all 25 CNSs who are the total population of CNSs working in the acute health care institution. The findings show the CNS to be active in the roles of researcher, educator, communicator, change agent, leader, and clinical specialist, but the level of activity varies between different roles. There is variety in the activity of CNSs in the various roles and to what extent they enact the role. The findings merit further study on CNS role activity and possible variables that influence role activity.
Designing a valid randomized pragmatic primary care implementation trial: the my own health report (MOHR) project.

PubMed

Krist, Alex H; Glenn, Beth A; Glasgow, Russell E; Balasubramanian, Bijal A; Chambers, David A; Fernandez, Maria E; Heurtin-Roberts, Suzanne; Kessler, Rodger; Ory, Marcia G; Phillips, Siobhan M; Ritzwoller, Debra P; Roby, Dylan H; Rodriguez, Hector P; Sabo, Roy T; Sheinfeld Gorin, Sherri N; Stange, Kurt C

2013-06-25

There is a pressing need for greater attention to patient-centered health behavior and psychosocial issues in primary care, and for practical tools, study designs and results of clinical and policy relevance. Our goal is to design a scientifically rigorous and valid pragmatic trial to test whether primary care practices can systematically implement the collection of patient-reported information and provide patients needed advice, goal setting, and counseling in response. This manuscript reports on the iterative design of the My Own Health Report (MOHR) study, a cluster randomized delayed intervention trial. Nine pairs of diverse primary care practices will be randomized to early or delayed intervention four months later. The intervention consists of fielding the MOHR assessment--addresses 10 domains of health behaviors and psychosocial issues--and subsequent provision of needed counseling and support for patients presenting for wellness or chronic care. As a pragmatic participatory trial, stakeholder groups including practice partners and patients have been engaged throughout the study design to account for local resources and characteristics. Participatory tasks include identifying MOHR assessment content, refining the study design, providing input on outcomes measures, and designing the implementation workflow. Study outcomes include the intervention reach (percent of patients offered and completing the MOHR assessment), effectiveness (patients reporting being asked about topics, setting change goals, and receiving assistance in early versus delayed intervention practices), contextual factors influencing outcomes, and intervention costs. The MOHR study shows how a participatory design can be used to promote the consistent collection and use of patient-reported health behavior and psychosocial assessments in a broad range of primary care settings. While pragmatic in nature, the study design will allow valid comparisons to answer the posed research question, and findings will be broadly generalizable to a range of primary care settings. Per the pragmatic explanatory continuum indicator summary (PRECIS) framework, the study design is substantially more pragmatic than other published trials. The methods and findings should be of interest to researchers, practitioners, and policy makers attempting to make healthcare more patient-centered and relevant. Clinicaltrials.gov: NCT01825746.
Brazilian Portuguese version of the Revised Fibromyalgia Impact Questionnaire (FIQR-Br): cross-cultural validation, reliability, and construct and structural validation.

PubMed

Lupi, Jaqueline Basilio; Carvalho de Abreu, Daniela Cristina; Ferreira, Mariana Candido; Oliveira, Renê Donizeti Ribeiro de; Chaves, Thais Cristina

2017-08-01

This study aimed to culturally adapt and validate the Revised Fibromyalgia Impact Questionnaire (FIQR) to Brazilian Portuguese, by the use of analysis of internal consistency, reliability, and construct and structural validity. A total of 100 female patients with fibromyalgia participated in the validation process of the Brazilian Portuguese version of the FIQR (FIQR-Br).The intraclass correlation coefficient (ICC) was used for statistical analysis of reliability (test-retest), Cronbach's alpha for internal consistency, Pearson's rank correlation for construct validity, and confirmatory factor analysis (CFA) for structural validity. It was verified excellent levels of reliability, with ICC greater than 0.75 for all questions and domains of the FIQR-Br. For internal consistency, alpha values greater than 0.70 for the items and domains of the questionnaire were observed. Moderate (0.40 < r < 0.70) and strong (r > 0.70) correlations were observed for the scores of domains and total score between the FIQR-Br and FIQ-Br. The structure of the three domains of the FIQR-Br was confirmed by CFA. The results of this study suggest that that the FIQR-Br is a reliable and valid instrument for assessing fibromyalgia-related impact, and supports its use in clinical settings and research. The structure of the three domains of the FIQR-Br was also confirmed. Implications for Rehabilitation Fibromyalgia is a chronic musculoskeletal disorder characterized by widespread and diffuse pain, fatigue, sleep disturbances, and depression. The disease significantly impairs patients' quality of life and can be highly disabling. To be used in multicenter research efforts, the Revised Fibromyalgia Impact Questionnaire (FIQR) must be cross-culturally validated and psychometrically tested. This paper will make available a new version of the FIQR-Br since another version already exists, but there are concerns about its measurement properties. The availability of an instrument adapted to and validated for Brazilian Portuguese may make it possible to reliably verify the effects of rehabilitation programs on disability from fibromyalgia. The FIQR-Br showed results comparable with other versions of the FIQR in other languages, thereby enabling comparison of effects of rehabilitation interventions on disability from fibromyalgia conducted in Brazil with results of studies carried out in other parts of the world.
Asymmetry of Peak Thicknesses between the Superior and Inferior Retinal Nerve Fiber Layers for Early Glaucoma Detection: A Simple Screening Method.

PubMed

Bae, Hyoung Won; Lee, Sang Yeop; Kim, Sangah; Park, Chan Keum; Lee, Kwanghyun; Kim, Chan Yun; Seong, Gong Je

2018-01-01

To assess whether the asymmetry in the peripapillary retinal nerve fiber layer (pRNFL) thickness between superior and inferior hemispheres on optical coherence tomography (OCT) is useful for early detection of glaucoma. The patient population consisted of Training set (a total of 60 subjects with early glaucoma and 59 normal subjects) and Validation set (30 subjects with early glaucoma and 30 normal subjects). Two kinds of ratios were employed to measure the asymmetry between the superior and inferior pRNFL thickness using OCT. One was the ratio of the superior to inferior peak thicknesses (peak pRNFL thickness ratio; PTR), and the other was the ratio of the superior to inferior average thickness (average pRNFL thickness ratio; ATR). The diagnostic abilities of the PTR and ATR were compared to the color code classification in OCT. Using the optimal cut-off values of the PTR and ATR obtained from the Training set, the two ratios were independently validated for diagnostic capability. For the Training set, the sensitivities/specificities of the PTR, ATR, quadrants color code classification, and clock-hour color code classification were 81.7%/93.2%, 71.7%/74.6%, 75.0%/93.2%, and 75.0%/79.7%, respectively. The PTR showed a better diagnostic performance for early glaucoma detection than the ATR and the clock-hour color code classification in terms of areas under the receiver operating characteristic curves (AUCs) (0.898, 0.765, and 0.773, respectively). For the Validation set, the PTR also showed the best sensitivity and AUC. The PTR is a simple method with considerable diagnostic ability for early glaucoma detection. It can, therefore, be widely used as a new screening method for early glaucoma. © Copyright: Yonsei University College of Medicine 2018
Structure of the tropical lower stratosphere as revealed by three reanalysis data sets

DOE Office of Scientific and Technical Information (OSTI.GOV)

Pawson, S.; Fiorino, M.

1996-05-01

While the skill of climate simulation models has advanced over the last decade, mainly through improvements in modeling, further progress will depend on the availability and the quality of comprehensive validation data sets covering long time periods. A new source of such validation data is atmospheric {open_quotes}reanalysis{close_quotes} where a fixed, state-of-the-art global atmospheric model/data assimilation system is run through archived and recovered observations to produce a consistent set of atmospheric analyses. Although reanalysis will be free of non-physical variability caused by changes in the models and/or the assimilation procedure, it is necessary to assess its quality. A region for stringentmore » testing of the quality of reanalysis is the tropical lower stratosphere. This portion of the atmosphere is sparse in observations but displays the prominent quasi-biennial oscillation (QBO) and an annual cycle, neither of which is fully understood, but which are likely coupled dynamically. We first consider the performance of three reanalyses, from NCEP/NCAR, NASA and ECMWF, against rawinsonde data in depicting the QBO and then examine the structure of the tropical lower stratosphere in NCEP and ECMWF data sets in detail. While the annual cycle and the QBO in wind and temperature are quite successfully represented, the mean meridional circulations in NCEP and ECMWF data sets contain unusual features which may be due to the assimilation process rather than being physically based. Further, the models capture the long-term temperature fluctuations associated with volcanic eruptions, even though the physical mechanisms are not included, thus implying that the model does not mask prominent stratospheric signals in the observational data. We conclude that reanalysis offers a unique opportunity to better understand the dynamics of QBO and can be applied to climate model validation.« less
Making and Executing Decisions for Safe and Independent Living (MED-SAIL): Development and Validation of a Brief Screening Tool

PubMed Central

Mills, Whitney L.; Regev, Tziona; Kunik, Mark E.; Wilson, Nancy L.; Moye, Jennifer; McCullough, Laurence B.; Naik, Aanand D.

2017-01-01

Objectives Older adults prefer to remain in their own homes for as long as possible. The purpose of this article is to describe the development and preliminary validation of Making and Executing Decisions for Safe and Independent Living (MED-SAIL), a brief screening tool for capacity to live safely and independently in the community. Design Prospective preliminary validation study. Setting Outpatient geriatrics clinic located in a community-based hospital. Participants Forty-nine community-dwelling older adults referred to the clinic for a comprehensive capacity assessment. Measurements We examined internal consistency, criterion-based validity, concurrent validity, and accuracy of classification for MED-SAIL. Results The items included in MED-SAIL demonstrated internal consistency (5 items; α = 0.85). MED-SAIL was significantly correlated with the Independent Living Scales (r = 0.573, p ≤ 0.001) and instrumental activities of daily living (r = 0.440, p ≤ 0.01). The Mann-Whitney U test revealed significant differences between the no capacity and partial/full capacity classifications on MED-SAIL (U(48) = 60.5, Z = −0.38, p <0.0001). The area under the curve was 0.864 (95% confidence interval: 0.84–0.99). Conclusions This study demonstrated the validity of MED-SAIL as a brief screening tool to identify older adults with impaired capacity for remaining safe and independent in their current living environment. MED-SAIL is useful tool for health and social service providers in the community for the purpose of referral for definitive capacity evaluation. PMID:23567420
The PDB_REDO server for macromolecular structure model optimization

PubMed Central

Joosten, Robbie P.; Long, Fei; Murshudov, Garib N.; Perrakis, Anastassis

2014-01-01

The refinement and validation of a crystallographic structure model is the last step before the coordinates and the associated data are submitted to the Protein Data Bank (PDB). The success of the refinement procedure is typically assessed by validating the models against geometrical criteria and the diffraction data, and is an important step in ensuring the quality of the PDB public archive [Read et al. (2011 ▶), Structure, 19, 1395–1412]. The PDB_REDO procedure aims for ‘constructive validation’, aspiring to consistent and optimal refinement parameterization and pro-active model rebuilding, not only correcting errors but striving for optimal interpretation of the electron density. A web server for PDB_REDO has been implemented, allowing thorough, consistent and fully automated optimization of the refinement procedure in REFMAC and partial model rebuilding. The goal of the web server is to help practicing crystallographers to improve their model prior to submission to the PDB. For this, additional steps were implemented in the PDB_REDO pipeline, both in the refinement procedure, e.g. testing of resolution limits and k-fold cross-validation for small test sets, and as new validation criteria, e.g. the density-fit metrics implemented in EDSTATS and ligand validation as implemented in YASARA. Innovative ways to present the refinement and validation results to the user are also described, which together with auto-generated Coot scripts can guide users to subsequent model inspection and improvement. It is demonstrated that using the server can lead to substantial improvement of structure models before they are submitted to the PDB. PMID:25075342
Systematic feature selection improves accuracy of methylation-based forensic age estimation in Han Chinese males.

PubMed

Feng, Lei; Peng, Fuduan; Li, Shanfei; Jiang, Li; Sun, Hui; Ji, Anquan; Zeng, Changqing; Li, Caixia; Liu, Fan

2018-03-23

Estimating individual age from biomarkers may provide key information facilitating forensic investigations. Recent progress has shown DNA methylation at age-associated CpG sites as the most informative biomarkers for estimating the individual age of an unknown donor. Optimal feature selection plays a critical role in determining the performance of the final prediction model. In this study we investigate methylation levels at 153 age-associated CpG sites from 21 previously reported genomic regions using the EpiTYPER system for their predictive power on individual age in 390 Han Chinese males ranging from 15 to 75 years of age. We conducted a systematic feature selection using a stepwise backward multiple linear regression analysis as well as an exhaustive searching algorithm. Both approaches identified the same subset of 9 CpG sites, which in linear combination provided the optimal model fitting with mean absolute deviation (MAD) of 2.89 years of age and explainable variance (R 2 ) of 0.92. The final model was validated in two independent Han Chinese male samples (validation set 1, N = 65, MAD = 2.49, R 2  = 0.95, and validation set 2, N = 62, MAD = 3.36, R 2  = 0.89). Other competing models such as support vector machine and artificial neural network did not outperform the linear model to any noticeable degree. The validation set 1 was additionally analyzed using Pyrosequencing technology for cross-platform validation and was termed as validation set 3. Directly applying our model, in which the methylation levels were detected by the EpiTYPER system, to the data from pyrosequencing technology showed, however, less accurate results in terms of MAD (validation set 3, N = 65 Han Chinese males, MAD = 4.20, R 2  = 0.93), suggesting the presence of a batch effect between different data generation platforms. This batch effect could be partially overcome by a z-score transformation (MAD = 2.76, R 2  = 0.93). Overall, our systematic feature selection identified 9 CpG sites as the optimal subset for forensic age estimation and the prediction model consisting of these 9 markers demonstrated high potential in forensic practice. An age estimator implementing our prediction model allowing missing markers is freely available at http://liufan.big.ac.cn/AgePrediction. Copyright © 2018 Elsevier B.V. All rights reserved.
Emotional and tangible social support in a German population-based sample: Development and validation of the Brief Social Support Scale (BS6).

PubMed

Beutel, Manfred E; Brähler, Elmar; Wiltink, Jörg; Michal, Matthias; Klein, Eva M; Jünger, Claus; Wild, Philipp S; Münzel, Thomas; Blettner, Maria; Lackner, Karl; Nickels, Stefan; Tibubos, Ana N

2017-01-01

Aim of the study was the development and validation of the psychometric properties of a six-item bi-factorial instrument for the assessment of social support (emotional and tangible support) with a population-based sample. A cross-sectional data set of N = 15,010 participants enrolled in the Gutenberg Health Study (GHS) in 2007-2012 was divided in two sub-samples. The GHS is a population-based, prospective, observational single-center cohort study in the Rhein-Main-Region in western Mid-Germany. The first sub-sample was used for scale development by performing an exploratory factor analysis. In order to test construct validity, confirmatory factor analyses were run to compare the extracted bi-factorial model with the one-factor solution. Reliability of the scales was indicated by calculating internal consistency. External validity was tested by investigating demographic characteristics health behavior, and distress using analysis of variance, Spearman and Pearson correlation analysis, and logistic regression analysis. Based on an exploratory factor analysis, a set of six items was extracted representing two independent factors. The two-factor structure of the Brief Social Support Scale (BS6) was confirmed by the results of the confirmatory factor analyses. Fit indices of the bi-factorial model were good and better compared to the one-factor solution. External validity was demonstrated for the BS6. The BS6 is a reliable and valid short scale that can be applied in social surveys due to its brevity to assess emotional and practical dimensions of social support.

Development of a brief instrument for assessing healthcare employee satisfaction in a low-income setting.

PubMed

Alpern, Rachelle; Canavan, Maureen E; Thompson, Jennifer T; McNatt, Zahirah; Tatek, Dawit; Lindfield, Tessa; Bradley, Elizabeth H

2013-01-01

Ethiopia is one of 57 countries identified by the World Health Report 2006 as having a severely limited number of health care professionals. In recognition of this shortage, the Ethiopian Federal Ministry of Health, through the Ethiopian Hospital Management Initiative, prioritized the need to improve retention of health care workers. Accordingly, we sought to develop the Satisfaction of Employees in Health Care (SEHC) survey for use in hospitals and health centers throughout Ethiopia. Literature reviews and cognitive interviews were used to generate a staff satisfaction survey for use in the Ethiopian healthcare setting. We pretested the survey in each of the six hospitals and four health centers across Ethiopia (98% response rate). We assessed content validity and convergent validity using factor analysis and examined reliability using the Cronbach alpha coefficients to assess internal consistency. The final survey was comprised of 18 questions about specific aspects of an individual's work and two overall staff satisfaction questions. We found support for content validity, as data from the 18 responses factored into three factors, which we characterized as 1) relationship with management and supervisors, 2) job content, and 3) relationships with coworkers. Summary scores for two factors (relationship with management and supervisors and job content) were significantly associated (P-value, <0.001) with the two overall satisfaction items. Cronbach's alpha coefficients showed good to excellent internal consistency (Cronbach alpha coefficients >0.70) for the items in the three summary scores. The introduction of consistent and reliable measures of staff satisfaction is crucial to understand and improve employee retention rates, which threaten the successful achievement of the Millennium Development Goals in low-income countries. The use of the SEHC survey in Ethiopian healthcare facilities has ample leadership support, which is essential for addressing problems that reduce staff satisfaction and exacerbate excessive workforce shortages.
Predicting cognitive function from clinical measures of physical function and health status in older adults.

PubMed

Bolandzadeh, Niousha; Kording, Konrad; Salowitz, Nicole; Davis, Jennifer C; Hsu, Liang; Chan, Alison; Sharma, Devika; Blohm, Gunnar; Liu-Ambrose, Teresa

2015-01-01

Current research suggests that the neuropathology of dementia-including brain changes leading to memory impairment and cognitive decline-is evident years before the onset of this disease. Older adults with cognitive decline have reduced functional independence and quality of life, and are at greater risk for developing dementia. Therefore, identifying biomarkers that can be easily assessed within the clinical setting and predict cognitive decline is important. Early recognition of cognitive decline could promote timely implementation of preventive strategies. We included 89 community-dwelling adults aged 70 years and older in our study, and collected 32 measures of physical function, health status and cognitive function at baseline. We utilized an L1-L2 regularized regression model (elastic net) to identify which of the 32 baseline measures were strongly predictive of cognitive function after one year. We built three linear regression models: 1) based on baseline cognitive function, 2) based on variables consistently selected in every cross-validation loop, and 3) a full model based on all the 32 variables. Each of these models was carefully tested with nested cross-validation. Our model with the six variables consistently selected in every cross-validation loop had a mean squared prediction error of 7.47. This number was smaller than that of the full model (115.33) and the model with baseline cognitive function (7.98). Our model explained 47% of the variance in cognitive function after one year. We built a parsimonious model based on a selected set of six physical function and health status measures strongly predictive of cognitive function after one year. In addition to reducing the complexity of the model without changing the model significantly, our model with the top variables improved the mean prediction error and R-squared. These six physical function and health status measures can be easily implemented in a clinical setting.
When less is more: validating a brief scale to rate interprofessional team competencies.

PubMed

Lie, Désirée A; Richter-Lagha, Regina; Forest, Christopher P; Walsh, Anne; Lohenry, Kevin

2017-01-01

There is a need for validated and easy-to-apply behavior-based tools for assessing interprofessional team competencies in clinical settings. The seven-item observer-based Modified McMaster-Ottawa scale was developed for the Team Objective Structured Clinical Encounter (TOSCE) to assess individual and team performance in interprofessional patient encounters. We aimed to improve scale usability for clinical settings by reducing item numbers while maintaining generalizability; and to explore the minimum number of observed cases required to achieve modest generalizability for giving feedback. We administered a two-station TOSCE in April 2016 to 63 students split into 16 newly-formed teams, each consisting of four professions. The stations were of similar difficulty. We trained sixteen faculty to rate two teams each. We examined individual and team performance scores using generalizability (G) theory and principal component analysis (PCA). The seven-item scale shows modest generalizability (.75) with individual scores. PCA revealed multicollinearity and singularity among scale items and we identified three potential items for removal. Reducing items for individual scores from seven to four (measuring Collaboration, Roles, Patient/Family-centeredness, and Conflict Management) changed scale generalizability from .75 to .73. Performance assessment with two cases is associated with reasonable generalizability (.73). Students in newly-formed interprofessional teams show a learning curve after one patient encounter. Team scores from a two-station TOSCE demonstrate low generalizability whether the scale consisted of four (.53) or seven items (.55). The four-item Modified McMaster-Ottawa scale for assessing individual performance in interprofessional teams retains the generalizability and validity of the seven-item scale. Observation of students in teams interacting with two different patients provides reasonably reliable ratings for giving feedback. The four-item scale has potential for assessing individual student skills and the impact of IPE curricula in clinical practice settings. IPE: Interprofessional education; SP: Standardized patient; TOSCE: Team objective structured clinical encounter.
Measuring cancer-specific child adjustment difficulties: Development and validation of the Children's Oncology Child Adjustment Scale (ChOCs).

PubMed

Burke, Kylie; McCarthy, Maria; Lowe, Cherie; Sanders, Matthew R; Lloyd, Erin; Bowden, Madeleine; Williams, Lauren

2017-03-01

Childhood cancer is associated with child adjustment difficulties including, eating and sleep disturbance, and emotional and other behavioral difficulties. However, there is a lack of validated instruments to measure the specific child adjustment issues associated with pediatric cancer treatments. The aim of this study was to develop and evaluate the reliability and validity of a parent-reported, child adjustment scale. One hundred thirty-two parents from two pediatric oncology centers who had children (aged 2-10 years) diagnosed with cancer completed the newly developed measure and additional measures of child behavior, sleep, diet, and quality of life. Children were more than 4 weeks postdiagnosis and less than 12 months postactive treatment. Factor structure, internal consistency, and construct (convergent) validity analyses were conducted. Principal component analysis revealed five distinct and theoretically coherent factors: Sleep Difficulties, Impact of Child's Illness, Eating Difficulties, Hospital-Related Behavior Difficulties, and General Behavior Difficulties. The final 25-item measure, the Children's Oncology Child Adjustment Scale (ChOCs), demonstrated good internal consistency (α = 0.79-0.91). Validity of the ChOCs was demonstrated by significant correlations between the subscales and measures of corresponding constructs. The ChOCs provides a new measure of child adjustment difficulties designed specifically for pediatric oncology. Preliminary analyses indicate strong theoretical and psychometric properties. Future studies are required to further examine reliability and validity of the scale, including test-retest reliability, discriminant validity, as well as change sensitivity and generalizability across different oncology samples and ages of children. The ChOCs shows promise as a measure of child adjustment relevant for oncology clinical settings and research purposes. © 2016 Wiley Periodicals, Inc.
Development and validation of the Multidimensional Home Environment Scale (MHES) for adolescents and their mothers.

PubMed

Tabbakh, Tamara; Freeland-Graves, Jeanne

2016-08-01

The home environment is an important setting for the development of weight status in adolescence. At present a limited number of valid and reliable tools are available to evaluate the weight-related comprehensive home environment of this population. The goal of this research was to develop the Multidimensional Home Environment Scale which measures multiple components of the home. It includes psychological, social, and environmental domains from the perspective of an adolescent and the mother. Items were generated based on a literature review and then assessed for content validity by an expert panel and focus group in the target population. Internal consistency reliability was determined using Cronbach's α. Principal components analysis with varimax rotation was employed for assessment of construct validity. Temporal stability was evaluated using paired sample t-tests and bivariate correlations between responses at two different times, 1-2weeks apart. Associations between adolescent and mother responses were utilized for convergent validity. The final versions contained 32-items for adolescents and 36-items for mothers; these were administered to 218 adolescents and mothers. The subscales on the questionnaires exhibited high construct validity, internal consistency reliability (adolescent: α=0.82, mother: α=0.83) and test-retest reliability (adolescent: r=0.90, p<0.01; mother: r=0.91, p<0.01). Total home environment scores were computed, with greater scores reflecting a better health environment. These results verify the utility of the MHES as a valid and reliable instrument. This promising tool can be utilized to capture the comprehensive home environment of young adolescents (11-14years old). Copyright © 2016 Elsevier Ltd. All rights reserved.
Measuring leprosy-related stigma - a pilot study to validate a toolkit of instruments.

PubMed

Rensen, Carin; Bandyopadhyay, Sudhakar; Gopal, Pala K; Van Brakel, Wim H

2011-01-01

Stigma negatively affects the quality of life of leprosy-affected people. Instruments are needed to assess levels of stigma and to monitor and evaluate stigma reduction interventions. We conducted a validation study of such instruments in Tamil Nadu and West Bengal, India. Four instruments were tested in a 'Community Based Rehabilitation' (CBR) setting, the Participation Scale, Internalised Scale of Mental Illness (ISMI) adapted for leprosy-affected persons, Explanatory Model Interview Catalogue (EMIC) for leprosy-affected and non-affected persons and the General Self-Efficacy (GSE) Scale. We evaluated the following components of validity, construct validity, internal consistency, test-retest reproducibility and reliability to distinguish between groups. Construct validity was tested by correlating instrument scores and by triangulating quantitative and qualitative findings. Reliability was evaluated by comparing levels of stigma among people affected by leprosy and community controls, and among affected people living in CBR project areas and those in non-CBR areas. For the Participation, ISMI and EMIC scores significant differences were observed between those affected by leprosy and those not affected (p = 0.0001), and between affected persons in the CBR and Control group (p < 0.05). The internal consistency of the instruments measured with Cronbach's α ranged from 0.83 to 0.96 and was very good for all instruments. Test-retest reproducibility coefficients were 0.80 for the Participation score, 0.70 for the EMIC score, 0.62 for the ISMI score and 0.50 for the GSE score. The construct validity of all instruments was confirmed. The Participation and EMIC Scales met all validity criteria, but test-retest reproducibility of the ISMI and GSE Scales needs further evaluation with a shorter test-retest interval and longer training and additional adaptations for the latter.
The German Version of the Gaze Anxiety Rating Scale (GARS): Reliability and Validity

PubMed Central

Domes, Gregor; Marx, Lisa; Spenthof, Ines; Heinrichs, Markus

2016-01-01

Objective Fear of eye gaze and avoidance of eye contact are core features of social anxiety disorders (SAD). To measure self-reported fear and avoidance of eye gaze, the Gaze Anxiety Rating Scale (GARS) has been developed and validated in recent years in its English version. The main objectives of the present study were to psychometrically evaluate the German translation of the GARS concerning its reliability, factorial structure, and validity. Methods Three samples of participants were enrolled in the study. (1) A non-patient sample (n = 353) completed the GARS and a set of trait questionnaires to assess internal consistency, test-retest reliability, factorial structure, and concurrent and divergent validity. (2) A sample of patients with SAD (n = 33) was compared to a healthy control group (n = 30) regarding their scores on the GARS and the trait measures. Results The German GARS fear and avoidance scales exhibited excellent internal consistency and high stability over 2 and 4 months, as did the original version. The English version’s factorial structure was replicated, yielding two categories of situations: (1) everyday situations and (2) situations involving high evaluative threat. GARS fear and avoidance displayed convergent validity with trait measures of social anxiety and were markedly higher in patients with GSAD than in healthy controls. Fear and avoidance of eye contact in situations involving high levels of evaluative threat related more closely to social anxiety than to gaze anxiety in everyday situations. Conclusions The German version of the GARS has demonstrated reliability and validity similar to the original version, and is thus well suited to capture fear and avoidance of eye contact in different social situations as a valid self-report measure of social anxiety and related disorders in the social domain for use in both clinical practice and research. PMID:26937638
The German Version of the Gaze Anxiety Rating Scale (GARS): Reliability and Validity.

PubMed

Domes, Gregor; Marx, Lisa; Spenthof, Ines; Heinrichs, Markus

2016-01-01

Fear of eye gaze and avoidance of eye contact are core features of social anxiety disorders (SAD). To measure self-reported fear and avoidance of eye gaze, the Gaze Anxiety Rating Scale (GARS) has been developed and validated in recent years in its English version. The main objectives of the present study were to psychometrically evaluate the German translation of the GARS concerning its reliability, factorial structure, and validity. Three samples of participants were enrolled in the study. (1) A non-patient sample (n = 353) completed the GARS and a set of trait questionnaires to assess internal consistency, test-retest reliability, factorial structure, and concurrent and divergent validity. (2) A sample of patients with SAD (n = 33) was compared to a healthy control group (n = 30) regarding their scores on the GARS and the trait measures. The German GARS fear and avoidance scales exhibited excellent internal consistency and high stability over 2 and 4 months, as did the original version. The English version's factorial structure was replicated, yielding two categories of situations: (1) everyday situations and (2) situations involving high evaluative threat. GARS fear and avoidance displayed convergent validity with trait measures of social anxiety and were markedly higher in patients with GSAD than in healthy controls. Fear and avoidance of eye contact in situations involving high levels of evaluative threat related more closely to social anxiety than to gaze anxiety in everyday situations. The German version of the GARS has demonstrated reliability and validity similar to the original version, and is thus well suited to capture fear and avoidance of eye contact in different social situations as a valid self-report measure of social anxiety and related disorders in the social domain for use in both clinical practice and research.
Development and psychometric evaluation of the nursing instructors' clinical teaching performance inventory.

PubMed

A Farahani, Mansoureh; Emamzadeh Ghasemi, Hormat Sadat; Nikpaima, Nasrin; Fereidooni, Zhila; Rasoli, Maryam

2014-10-29

Evaluation of nursing instructors' clinical teaching performance is a prerequisite to the quality assurance of nursing education. One of the most common procedures for this purpose is using student evaluations. This study was to develop and evaluate the psychometric properties of Nursing Instructors' Clinical Teaching Performance Inventory (NICTPI). The primary items of the inventory were generated by reviewing the published literature and the existing questionnaires as well as consulting with the members of the Faculties Evaluation Committee of the study setting. Psychometric properties were assessed by calculating its content validity ratio and index, and test-retest correlation coefficient as well as conducting an exploratory factor analysis and an internal consistency assessment. The content validity ratios and indices of the items were respectively higher than 0.85 and 0.79. The final version of the inventory consisted of 25 items, and in the exploratory factor analysis, items were loaded on three factors which jointly accounting for 72.85% of the total variance. The test-retest correlation coefficient and the Cronbach's alpha of the inventory were 0.93 and 0.973, respectively. The results revealed that the developed inventory is an appropriate, valid, and reliable instrument for evaluating nursing instructors' clinical teaching performance.
A New Standard in Dementia Knowledge Measurement: Comparative Validation of the Dementia Knowledge Assessment Scale and the Alzheimer's Disease Knowledge Scale.

PubMed

Annear, Michael J; Eccleston, Claire E; McInerney, Frances J; Elliott, Kate-Ellen J; Toye, Christine M; Tranter, Bruce K; Robinson, Andrew L

2016-06-01

To compare the psychometric performance of the Dementia Knowledge Assessment Scale (DKAS) and the Alzheimer's Disease Knowledge Scale (ADKS) when administered to a large international cohort before and after online dementia education. Comparative psychometric analysis with pre- and posteducation scale responses. The setting for this research encompassed 7,909 individuals from 124 countries who completed the 9-week Understanding Dementia Massive Open Online Course (MOOC). Volunteer respondents who completed the DKAS and ADKS before (n = 3,649) and after (n = 878) completion of the Understanding Dementia MOOC. Assessment and comparison of the DKAS and ADKS included evaluation of scale development procedures, interscale correlations, response distribution, internal consistency, and construct validity. The DKAS had superior internal consistency, wider response distribution with less ceiling effect, and better discrimination between pre- and posteducation scores and occupational cohorts than the ADKS. The 27-item DKAS is a reliable and preliminarily valid measure of dementia knowledge that is psychometrically and conceptually sound, overcomes limitations of existing instruments, and can be administered to diverse cohorts to measure baseline understanding and knowledge change. © 2016, Copyright the Authors Journal compilation © 2016, The American Geriatrics Society.
Design and internal validation of an obstetric early warning score: secondary analysis of the Intensive Care National Audit and Research Centre Case Mix Programme database.

PubMed

Carle, C; Alexander, P; Columb, M; Johal, J

2013-04-01

We designed and internally validated an aggregate weighted early warning scoring system specific to the obstetric population that has the potential for use in the ward environment. Direct obstetric admissions from the Intensive Care National Audit and Research Centre's Case Mix Programme Database were randomly allocated to model development (n = 2240) or validation (n = 2200) sets. Physiological variables collected during the first 24 h of critical care admission were analysed. Logistic regression analysis for mortality in the model development set was initially used to create a statistically based early warning score. The statistical score was then modified to create a clinically acceptable early warning score. Important features of this clinical obstetric early warning score are that the variables are weighted according to their statistical importance, a surrogate for the FI O2 /Pa O2 relationship is included, conscious level is assessed using a simplified alert/not alert variable, and the score, trigger thresholds and response are consistent with the new non-obstetric National Early Warning Score system. The statistical and clinical early warning scores were internally validated using the validation set. The area under the receiver operating characteristic curve was 0.995 (95% CI 0.992-0.998) for the statistical score and 0.957 (95% CI 0.923-0.991) for the clinical score. Pre-existing empirically designed early warning scores were also validated in the same way for comparison. The area under the receiver operating characteristic curve was 0.955 (95% CI 0.922-0.988) for Swanton et al.'s Modified Early Obstetric Warning System, 0.937 (95% CI 0.884-0.991) for the obstetric early warning score suggested in the 2003-2005 Report on Confidential Enquiries into Maternal Deaths in the UK, and 0.973 (95% CI 0.957-0.989) for the non-obstetric National Early Warning Score. This highlights that the new clinical obstetric early warning score has an excellent ability to discriminate survivors from non-survivors in this critical care data set. Further work is needed to validate our new clinical early warning score externally in the obstetric ward environment. Anaesthesia © 2013 The Association of Anaesthetists of Great Britain and Ireland.
Development and Validation of a Smartphone Addiction Scale (SAS)

PubMed Central

Kwon, Min; Lee, Joon-Yeop; Won, Wang-Youn; Park, Jae-Woo; Min, Jung-Ah; Hahn, Changtae; Gu, Xinyu; Choi, Ji-Hye; Kim, Dai-Jin

2013-01-01

Objective The aim of this study was to develop a self-diagnostic scale that could distinguish smartphone addicts based on the Korean self-diagnostic program for Internet addiction (K-scale) and the smartphone's own features. In addition, the reliability and validity of the smartphone addiction scale (SAS) was demonstrated. Methods A total of 197 participants were selected from Nov. 2011 to Jan. 2012 to accomplish a set of questionnaires, including SAS, K-scale, modified Kimberly Young Internet addiction test (Y-scale), visual analogue scale (VAS), and substance dependence and abuse diagnosis of DSM-IV. There were 64 males and 133 females, with ages ranging from 18 to 53 years (M = 26.06; SD = 5.96). Factor analysis, internal-consistency test, t-test, ANOVA, and correlation analysis were conducted to verify the reliability and validity of SAS. Results Based on the factor analysis results, the subscale “disturbance of reality testing” was removed, and six factors were left. The internal consistency and concurrent validity of SAS were verified (Cronbach's alpha = 0.967). SAS and its subscales were significantly correlated with K-scale and Y-scale. The VAS of each factor also showed a significant correlation with each subscale. In addition, differences were found in the job (p<0.05), education (p<0.05), and self-reported smartphone addiction scores (p<0.001) in SAS. Conclusions This study developed the first scale of the smartphone addiction aspect of the diagnostic manual. This scale was proven to be relatively reliable and valid. PMID:23468893
Development and validation of a smartphone addiction scale (SAS).

PubMed

Kwon, Min; Lee, Joon-Yeop; Won, Wang-Youn; Park, Jae-Woo; Min, Jung-Ah; Hahn, Changtae; Gu, Xinyu; Choi, Ji-Hye; Kim, Dai-Jin

2013-01-01

The aim of this study was to develop a self-diagnostic scale that could distinguish smartphone addicts based on the Korean self-diagnostic program for Internet addiction (K-scale) and the smartphone's own features. In addition, the reliability and validity of the smartphone addiction scale (SAS) was demonstrated. A total of 197 participants were selected from Nov. 2011 to Jan. 2012 to accomplish a set of questionnaires, including SAS, K-scale, modified Kimberly Young Internet addiction test (Y-scale), visual analogue scale (VAS), and substance dependence and abuse diagnosis of DSM-IV. There were 64 males and 133 females, with ages ranging from 18 to 53 years (M = 26.06; SD = 5.96). Factor analysis, internal-consistency test, t-test, ANOVA, and correlation analysis were conducted to verify the reliability and validity of SAS. Based on the factor analysis results, the subscale "disturbance of reality testing" was removed, and six factors were left. The internal consistency and concurrent validity of SAS were verified (Cronbach's alpha = 0.967). SAS and its subscales were significantly correlated with K-scale and Y-scale. The VAS of each factor also showed a significant correlation with each subscale. In addition, differences were found in the job (p<0.05), education (p<0.05), and self-reported smartphone addiction scores (p<0.001) in SAS. This study developed the first scale of the smartphone addiction aspect of the diagnostic manual. This scale was proven to be relatively reliable and valid.
Climate change vulnerability for species-Assessing the assessments.

PubMed

Wheatley, Christopher J; Beale, Colin M; Bradbury, Richard B; Pearce-Higgins, James W; Critchlow, Rob; Thomas, Chris D

2017-09-01

Climate change vulnerability assessments are commonly used to identify species at risk from global climate change, but the wide range of methodologies available makes it difficult for end users, such as conservation practitioners or policymakers, to decide which method to use as a basis for decision-making. In this study, we evaluate whether different assessments consistently assign species to the same risk categories and whether any of the existing methodologies perform well at identifying climate-threatened species. We compare the outputs of 12 climate change vulnerability assessment methodologies, using both real and simulated species, and validate the methods using historic data for British birds and butterflies (i.e. using historical data to assign risks and more recent data for validation). Our results show that the different vulnerability assessment methods are not consistent with one another; different risk categories are assigned for both the real and simulated sets of species. Validation of the different vulnerability assessments suggests that methods incorporating historic trend data into the assessment perform best at predicting distribution trends in subsequent time periods. This study demonstrates that climate change vulnerability assessments should not be used interchangeably due to the poor overall agreement between methods when considering the same species. The results of our validation provide more support for the use of trend-based rather than purely trait-based approaches, although further validation will be required as data become available. © 2017 The Authors. Global Change Biology Published by John Wiley & Sons Ltd.
Physician Enabling Skills Questionnaire

PubMed Central

Hudon, Catherine; Lambert, Mireille; Almirall, José

2015-01-01

Abstract Objective To evaluate the reliability and validity of the newly developed Physician Enabling Skills Questionnaire (PESQ) by assessing its internal consistency, test-retest reliability, concurrent validity with patient-centred care, and predictive validity with patient activation and patient enablement. Design Validation study. Setting Saguenay, Que. Participants One hundred patients with at least 1 chronic disease who presented in a waiting room of a regional health centre family medicine unit. Main outcome measures Family physicians’ enabling skills, measured with the PESQ at 2 points in time (ie, while in the waiting room at the family medicine unit and 2 weeks later through a mail survey); patient-centred care, assessed with the Patient Perception of Patient-Centredness instrument; patient activation, assessed with the Patient Activation Measure; and patient enablement, assessed with the Patient Enablement Instrument. Results The internal consistency of the 6 subscales of the PESQ was adequate (Cronbach α = .69 to .92). The test-retest reliability was very good (r = 0.90; 95% CI 0.84 to 0.93). Concurrent validity with the Patient Perception of Patient-Centredness instrument was good (r = −0.67; 95% CI −0.78 to −0.53; P < .001). The PESQ accounts for 11% of the total variance with the Patient Activation Measure (r2 = 0.11; P = .002) and 19% of the variance with the Patient Enablement Instrument (r2 = 0.19; P < .001). Conclusion The newly developed PESQ presents good psychometric properties, allowing for its use in practice and research. PMID:26889507
Exploring the use of the Dementia Management Strategies Scale in caregivers of persons with dementia in Singapore.

PubMed

Tan, Louisa; Yap, Philip; Ng, Wai Yee; Luo, Nan

2013-01-01

Well-being in persons with dementia (PWD) depends much on the quality and type of care received. The Dementia Management Strategies Scale (DMSS) is a useful instrument to appraise care styles of caregivers. The present study expanded on previous research by refining and establishing the scale's content validity and psychometric properties in the Singapore context. Five family caregivers and four dementia care professionals (nurse, occupational therapist, social worker and doctor) reviewed the DMSS for content validity. Two hundred and forty-six family caregivers completed questionnaires which assessed caregiver and patient characteristics, and dementia management strategies with DMSS. Internal consistency reliability was assessed and construct validity was evaluated through Pearson's correlation with extant instruments. Eight items from the 28-item DMSS were omitted after content review as they were deemed inappropriate in our socio-cultural setting. A factor analysis with Varimax rotation confirmed a two-factor structure (positive and negative dimensions) for the revised DMSS (rDMSS). The two subscales showed good internal consistency (Cronbach's alpha .89 and .87). Moderate to strong correlations (.35-.53) with the scales, Zarit Burden Instrument, Revised Memory and Behavioural Problems Checklist, General Health Questionnaire, Short Sense of Competence Scale, Gains in Alzheimer's Care Instrument and Positive Aspects of Caregiving established convergent and divergent construct validity of rDMSS. The shortened 20-item rDMSS is a psychometrically valid instrument which can serve as a measure of dementia care strategy from the perspective of the caregiver in Singapore.
Chemometric and biological validation of a capillary electrophoresis metabolomic experiment of Schistosoma mansoni infection in mice.

PubMed

Garcia-Perez, Isabel; Angulo, Santiago; Utzinger, Jürg; Holmes, Elaine; Legido-Quigley, Cristina; Barbas, Coral

2010-07-01

Metabonomic and metabolomic studies are increasingly utilized for biomarker identification in different fields, including biology of infection. The confluence of improved analytical platforms and the availability of powerful multivariate analysis software have rendered the multiparameter profiles generated by these omics platforms a user-friendly alternative to the established analysis methods where the quality and practice of a procedure is well defined. However, unlike traditional assays, validation methods for these new multivariate profiling tools have yet to be established. We propose a validation for models obtained by CE fingerprinting of urine from mice infected with the blood fluke Schistosoma mansoni. We have analysed urine samples from two sets of mice infected in an inter-laboratory experiment where different infection methods and animal husbandry procedures were employed in order to establish the core biological response to a S. mansoni infection. CE data were analysed using principal component analysis. Validation of the scores consisted of permutation scrambling (100 repetitions) and a manual validation method, using a third of the samples (not included in the model) as a test or prediction set. The validation yielded 100% specificity and 100% sensitivity, demonstrating the robustness of these models with respect to deciphering metabolic perturbations in the mouse due to a S. mansoni infection. A total of 20 metabolites across the two experiments were identified that significantly discriminated between S. mansoni-infected and noninfected control samples. Only one of these metabolites, allantoin, was identified as manifesting different behaviour in the two experiments. This study shows the reproducibility of CE-based metabolic profiling methods for disease characterization and screening and highlights the importance of much needed validation strategies in the emerging field of metabolomics.
Reliability and Validity of Survey Instruments to Measure Work-Related Fatigue in the Emergency Medical Services Setting: A Systematic Review.

PubMed

Patterson, P Daniel; Weaver, Matthew D; Fabio, Anthony; Teasley, Ellen M; Renn, Megan L; Curtis, Brett R; Matthews, Margaret E; Kroemer, Andrew J; Xun, Xiaoshuang; Bizhanova, Zhadyra; Weiss, Patricia M; Sequeira, Denisse J; Coppler, Patrick J; Lang, Eddy S; Higgins, J Stephen

2018-02-15

This study sought to systematically search the literature to identify reliable and valid survey instruments for fatigue measurement in the Emergency Medical Services (EMS) occupational setting. A systematic review study design was used and searched six databases, including one website. The research question guiding the search was developed a priori and registered with the PROSPERO database of systematic reviews: "Are there reliable and valid instruments for measuring fatigue among EMS personnel?" (2016:CRD42016040097). The primary outcome of interest was criterion-related validity. Important outcomes of interest included reliability (e.g., internal consistency), and indicators of sensitivity and specificity. Members of the research team independently screened records from the databases. Full-text articles were evaluated by adapting the Bolster and Rourke system for categorizing findings of systematic reviews, and the rated data abstracted from the body of literature as favorable, unfavorable, mixed/inconclusive, or no impact. The Grading of Recommendations, Assessment, Development and Evaluation (GRADE) methodology was used to evaluate the quality of evidence. The search strategy yielded 1,257 unique records. Thirty-four unique experimental and non-experimental studies were determined relevant following full-text review. Nineteen studies reported on the reliability and/or validity of ten different fatigue survey instruments. Eighteen different studies evaluated the reliability and/or validity of four different sleepiness survey instruments. None of the retained studies reported sensitivity or specificity. Evidence quality was rated as very low across all outcomes. In this systematic review, limited evidence of the reliability and validity of 14 different survey instruments to assess the fatigue and/or sleepiness status of EMS personnel and related shift worker groups was identified.
In-flight results of adaptive attitude control law for a microsatellite

NASA Astrophysics Data System (ADS)

Pittet, C.; Luzi, A. R.; Peaucelle, D.; Biannic, J.-M.; Mignot, J.

2015-06-01

Because satellites usually do not experience large changes of mass, center of gravity or inertia in orbit, linear time invariant (LTI) controllers have been widely used to control their attitude. But, as the pointing requirements become more stringent and the satellite's structure more complex with large steerable and/or deployable appendices and flexible modes occurring in the control bandwidth, one unique LTI controller is no longer sufficient. One solution consists in designing several LTI controllers, one for each set point, but the switching between them is difficult to tune and validate. Another interesting solution is to use adaptive controllers, which could present at least two advantages: first, as the controller automatically and continuously adapts to the set point without changing the structure, no switching logic is needed in the software; second, performance and stability of the closed-loop system can be assessed directly on the whole flight domain. To evaluate the real benefits of adaptive control for satellites, in terms of design, validation and performances, CNES selected it as end-of-life experiment on PICARD microsatellite. This paper describes the design, validation and in-flight results of the new adaptive attitude control law, compared to nominal control law.
Data preparation and evaluation techniques for x-ray diffraction microscopy.

PubMed

Steinbrener, Jan; Nelson, Johanna; Huang, Xiaojing; Marchesini, Stefano; Shapiro, David; Turner, Joshua J; Jacobsen, Chris

2010-08-30

The post-experiment processing of X-ray Diffraction Microscopy data is often time-consuming and difficult. This is mostly due to the fact that even if a preliminary result has been reconstructed, there is no definitive answer as to whether or not a better result with more consistently retrieved phases can still be obtained. We show here that the first step in data analysis, the assembly of two-dimensional diffraction patterns from a large set of raw diffraction data, is crucial to obtaining reconstructions of highest possible consistency. We have developed software that automates this process and results in consistently accurate diffraction patterns. We have furthermore derived some criteria of validity for a tool commonly used to assess the consistency of reconstructions, the phase retrieval transfer function, and suggest a modified version that has improved utility for judging reconstruction quality.

The Consumer Motivation Scale: A detailed review of item generation, exploration, confirmation, and validation procedures.

PubMed

Barbopoulos, I; Johansson, L-O

2017-08-01

This data article offers a detailed description of analyses pertaining to the development of the Consumer Motivation Scale (CMS), from item generation and the extraction of factors, to confirmation of the factor structure and validation of the emergent dimensions. The established goal structure - consisting of the sub-goals Value for Money, Quality, Safety, Stimulation, Comfort, Ethics, and Social Acceptance - is shown to be related to a variety of consumption behaviors in different contexts and for different products, and should thereby prove useful in standard marketing research, as well as in the development of tailored marketing strategies, and the segmentation of consumer groups, settings, brands, and products.
Sparse brain network using penalized linear regression

NASA Astrophysics Data System (ADS)

Lee, Hyekyoung; Lee, Dong Soo; Kang, Hyejin; Kim, Boong-Nyun; Chung, Moo K.

2011-03-01

Sparse partial correlation is a useful connectivity measure for brain networks when it is difficult to compute the exact partial correlation in the small-n large-p setting. In this paper, we formulate the problem of estimating partial correlation as a sparse linear regression with a l1-norm penalty. The method is applied to brain network consisting of parcellated regions of interest (ROIs), which are obtained from FDG-PET images of the autism spectrum disorder (ASD) children and the pediatric control (PedCon) subjects. To validate the results, we check their reproducibilities of the obtained brain networks by the leave-one-out cross validation and compare the clustered structures derived from the brain networks of ASD and PedCon.
Dark Energy Survey Year 1 Results: The Photometric Data Set for Cosmology

NASA Astrophysics Data System (ADS)

Drlica-Wagner, A.; Sevilla-Noarbe, I.; Rykoff, E. S.; Gruendl, R. A.; Yanny, B.; Tucker, D. L.; Hoyle, B.; Carnero Rosell, A.; Bernstein, G. M.; Bechtol, K.; Becker, M. R.; Benoit-Lévy, A.; Bertin, E.; Carrasco Kind, M.; Davis, C.; de Vicente, J.; Diehl, H. T.; Gruen, D.; Hartley, W. G.; Leistedt, B.; Li, T. S.; Marshall, J. L.; Neilsen, E.; Rau, M. M.; Sheldon, E.; Smith, J.; Troxel, M. A.; Wyatt, S.; Zhang, Y.; Abbott, T. M. C.; Abdalla, F. B.; Allam, S.; Banerji, M.; Brooks, D.; Buckley-Geer, E.; Burke, D. L.; Capozzi, D.; Carretero, J.; Cunha, C. E.; D’Andrea, C. B.; da Costa, L. N.; DePoy, D. L.; Desai, S.; Dietrich, J. P.; Doel, P.; Evrard, A. E.; Fausti Neto, A.; Flaugher, B.; Fosalba, P.; Frieman, J.; García-Bellido, J.; Gerdes, D. W.; Giannantonio, T.; Gschwend, J.; Gutierrez, G.; Honscheid, K.; James, D. J.; Jeltema, T.; Kuehn, K.; Kuhlmann, S.; Kuropatkin, N.; Lahav, O.; Lima, M.; Lin, H.; Maia, M. A. G.; Martini, P.; McMahon, R. G.; Melchior, P.; Menanteau, F.; Miquel, R.; Nichol, R. C.; Ogando, R. L. C.; Plazas, A. A.; Romer, A. K.; Roodman, A.; Sanchez, E.; Scarpine, V.; Schindler, R.; Schubnell, M.; Smith, M.; Smith, R. C.; Soares-Santos, M.; Sobreira, F.; Suchyta, E.; Tarle, G.; Vikram, V.; Walker, A. R.; Wechsler, R. H.; Zuntz, J.; DES Collaboration

2018-04-01

We describe the creation, content, and validation of the Dark Energy Survey (DES) internal year-one cosmology data set, Y1A1 GOLD, in support of upcoming cosmological analyses. The Y1A1 GOLD data set is assembled from multiple epochs of DES imaging and consists of calibrated photometric zero-points, object catalogs, and ancillary data products—e.g., maps of survey depth and observing conditions, star–galaxy classification, and photometric redshift estimates—that are necessary for accurate cosmological analyses. The Y1A1 GOLD wide-area object catalog consists of ∼ 137 million objects detected in co-added images covering ∼ 1800 {\\deg }2 in the DES grizY filters. The 10σ limiting magnitude for galaxies is g=23.4, r=23.2, i=22.5, z=21.8, and Y=20.1. Photometric calibration of Y1A1 GOLD was performed by combining nightly zero-point solutions with stellar locus regression, and the absolute calibration accuracy is better than 2% over the survey area. DES Y1A1 GOLD is the largest photometric data set at the achieved depth to date, enabling precise measurements of cosmic acceleration at z ≲ 1.
Study on the Algorithm of Judgment Matrix in Analytic Hierarchy Process

NASA Astrophysics Data System (ADS)

Lu, Zhiyong; Qin, Futong; Jin, Yican

2017-10-01

A new algorithm is proposed for the non-consistent judgment matrix in AHP. A primary judgment matrix is generated firstly through pre-ordering the targeted factor set, and a compared matrix is built through the top integral function. Then a relative error matrix is created by comparing the compared matrix with the primary judgment matrix which is regulated under the control of the relative error matrix and the dissimilar degree of the matrix step by step. Lastly, the targeted judgment matrix is generated to satisfy the requirement of consistence and the least dissimilar degree. The feasibility and validity of the proposed method are verified by simulation results.
A combined ligand-based and target-based drug design approach for G-protein coupled receptors: application to salvinorin A, a selective kappa opioid receptor agonist

NASA Astrophysics Data System (ADS)

Singh, Nidhi; Chevé, Gwénaël; Ferguson, David M.; McCurdy, Christopher R.

2006-08-01

Combined ligand-based and target-based drug design approaches provide a synergistic advantage over either method individually. Therefore, we set out to develop a powerful virtual screening model to identify novel molecular scaffolds as potential leads for the human KOP (hKOP) receptor employing a combined approach. Utilizing a set of recently reported derivatives of salvinorin A, a structurally unique KOP receptor agonist, a pharmacophore model was developed that consisted of two hydrogen bond acceptor and three hydrophobic features. The model was cross-validated by randomizing the data using the CatScramble technique. Further validation was carried out using a test set that performed well in classifying active and inactive molecules correctly. Simultaneously, a bovine rhodopsin based "agonist-bound" hKOP receptor model was also generated. The model provided more accurate information about the putative binding site of salvinorin A based ligands. Several protein structure-checking programs were used to validate the model. In addition, this model was in agreement with the mutation experiments carried out on KOP receptor. The predictive ability of the model was evaluated by docking a set of known KOP receptor agonists into the active site of this model. The docked scores correlated reasonably well with experimental p K i values. It is hypothesized that the integration of these two independently generated models would enable a swift and reliable identification of new lead compounds that could reduce time and cost of hit finding within the drug discovery and development process, particularly in the case of GPCRs.
Validation of motion correction techniques for liver CT perfusion studies

PubMed Central

Chandler, A; Wei, W; Anderson, E F; Herron, D H; Ye, Z; Ng, C S

2012-01-01

Objectives Motion in images potentially compromises the evaluation of temporally acquired CT perfusion (CTp) data; image registration should mitigate this, but first requires validation. Our objective was to compare the relative performance of manual, rigid and non-rigid registration techniques to correct anatomical misalignment in acquired liver CTp data sets. Methods 17 data sets in patients with liver tumours who had undergone a CTp protocol were evaluated. Each data set consisted of a cine acquisition during a breath-hold (Phase 1), followed by six further sets of cine scans (each containing 11 images) acquired during free breathing (Phase 2). Phase 2 images were registered to a reference image from Phase 1 cine using two semi-automated intensity-based registration techniques (rigid and non-rigid) and a manual technique (the only option available in the relevant vendor CTp software). The performance of each technique to align liver anatomy was assessed by four observers, independently and blindly, on two separate occasions, using a semi-quantitative visual validation study (employing a six-point score). The registration techniques were statistically compared using an ordinal probit regression model. Results 306 registrations (2448 observer scores) were evaluated. The three registration techniques were significantly different from each other (p=0.03). On pairwise comparison, the semi-automated techniques were significantly superior to the manual technique, with non-rigid significantly superior to rigid (p<0.0001), which in turn was significantly superior to manual registration (p=0.04). Conclusion Semi-automated registration techniques achieved superior alignment of liver anatomy compared with the manual technique. We hope this will translate into more reliable CTp analyses. PMID:22374283
Use of the recognition heuristic depends on the domain's recognition validity, not on the recognition validity of selected sets of objects.

PubMed

Pohl, Rüdiger F; Michalkiewicz, Martha; Erdfelder, Edgar; Hilbig, Benjamin E

2017-07-01

According to the recognition-heuristic theory, decision makers solve paired comparisons in which one object is recognized and the other not by recognition alone, inferring that recognized objects have higher criterion values than unrecognized ones. However, success-and thus usefulness-of this heuristic depends on the validity of recognition as a cue, and adaptive decision making, in turn, requires that decision makers are sensitive to it. To this end, decision makers could base their evaluation of the recognition validity either on the selected set of objects (the set's recognition validity), or on the underlying domain from which the objects were drawn (the domain's recognition validity). In two experiments, we manipulated the recognition validity both in the selected set of objects and between domains from which the sets were drawn. The results clearly show that use of the recognition heuristic depends on the domain's recognition validity, not on the set's recognition validity. In other words, participants treat all sets as roughly representative of the underlying domain and adjust their decision strategy adaptively (only) with respect to the more general environment rather than the specific items they are faced with.
Development and evaluation of an instrument for assessing brief behavioral change interventions.

PubMed

Strayer, Scott M; Martindale, James R; Pelletier, Sandra L; Rais, Salehin; Powell, Jon; Schorling, John B

2011-04-01

To develop an observational coding instrument for evaluating the fidelity and quality of brief behavioral change interventions based on the behavioral theories of the 5 A's, Stages of Change and Motivational Interviewing. Content and face validity were assessed prior to an intervention where psychometric properties were evaluated with a prospective cohort of 116 medical students. Properties assessed included the inter-rater reliability of the instrument, internal consistency of the full scale and sub-scales and descriptive statistics of the instrument. Construct validity was assessed based on student's scores. Inter-rater reliability for the instrument was 0.82 (intraclass correlation). Internal consistency for the full scale was 0.70 (KR20). Internal consistencies for the sub-scales were as follows: MI intervention component (KR20=.7); stage-appropriate MI-based intervention (KR20=.55); MI spirit (KR20=.5); appropriate assessment (KR20=.45) and appropriate assisting (KR20=.56). The instrument demonstrated good inter-rater reliability and moderate overall internal consistency when used to assess performing brief behavioral change interventions by medical students. This practical instrument can be used with minimal training and demonstrates promising psychometric properties when evaluated with medical students counseling standardized patients. Further testing is required to evaluate its usefulness in clinical settings. Copyright © 2010 Elsevier Ireland Ltd. All rights reserved.
Retrieval of overviews of systematic reviews in MEDLINE was improved by the development of an objectively derived and validated search strategy.

PubMed

Lunny, Carole; McKenzie, Joanne E; McDonald, Steve

2016-06-01

Locating overviews of systematic reviews is difficult because of an absence of appropriate indexing terms and inconsistent terminology used to describe overviews. Our objective was to develop a validated search strategy to retrieve overviews in MEDLINE. We derived a test set of overviews from the references of two method articles on overviews. Two population sets were used to identify discriminating terms, that is, terms that appear frequently in the test set but infrequently in two population sets of references found in MEDLINE. We used text mining to conduct a frequency analysis of terms appearing in the titles and abstracts. Candidate terms were combined and tested in MEDLINE in various permutations, and the performance of strategies measured using sensitivity and precision. Two search strategies were developed: a sensitivity-maximizing strategy, achieving 93% sensitivity (95% confidence interval [CI]: 87, 96) and 7% precision (95% CI: 6, 8), and a sensitivity-and-precision-maximizing strategy, achieving 66% sensitivity (95% CI: 58, 74) and 21% precision (95% CI: 17, 25). The developed search strategies enable users to more efficiently identify overviews of reviews compared to current strategies. Consistent language in describing overviews would aid in their identification, as would a specific MEDLINE Publication Type. Copyright © 2015 Elsevier Inc. All rights reserved.
Recursive SVM biomarker selection for early detection of breast cancer in peripheral blood.

PubMed

Zhang, Fan; Kaufman, Howard L; Deng, Youping; Drabier, Renee

2013-01-01

Breast cancer is worldwide the second most common type of cancer after lung cancer. Traditional mammography and Tissue Microarray has been studied for early cancer detection and cancer prediction. However, there is a need for more reliable diagnostic tools for early detection of breast cancer. This can be a challenge due to a number of factors and logistics. First, obtaining tissue biopsies can be difficult. Second, mammography may not detect small tumors, and is often unsatisfactory for younger women who typically have dense breast tissue. Lastly, breast cancer is not a single homogeneous disease but consists of multiple disease states, each arising from a distinct molecular mechanism and having a distinct clinical progression path which makes the disease difficult to detect and predict in early stages. In the paper, we present a Support Vector Machine based on Recursive Feature Elimination and Cross Validation (SVM-RFE-CV) algorithm for early detection of breast cancer in peripheral blood and show how to use SVM-RFE-CV to model the classification and prediction problem of early detection of breast cancer in peripheral blood.The training set which consists of 32 health and 33 cancer samples and the testing set consisting of 31 health and 34 cancer samples were randomly separated from a dataset of peripheral blood of breast cancer that is downloaded from Gene Express Omnibus. First, we identified the 42 differentially expressed biomarkers between "normal" and "cancer". Then, with the SVM-RFE-CV we extracted 15 biomarkers that yield zero cross validation score. Lastly, we compared the classification and prediction performance of SVM-RFE-CV with that of SVM and SVM Recursive Feature Elimination (SVM-RFE). We found that 1) the SVM-RFE-CV is suitable for analyzing noisy high-throughput microarray data, 2) it outperforms SVM-RFE in the robustness to noise and in the ability to recover informative features, and 3) it can improve the prediction performance (Area Under Curve) in the testing data set from 0.5826 to 0.7879. Further pathway analysis showed that the biomarkers are associated with Signaling, Hemostasis, Hormones, and Immune System, which are consistent with previous findings. Our prediction model can serve as a general model for biomarker discovery in early detection of other cancers. In the future, Polymerase Chain Reaction (PCR) is planned for validation of the ability of these potential biomarkers for early detection of breast cancer.
Comparison of Random Forest and Support Vector Machine classifiers using UAV remote sensing imagery

NASA Astrophysics Data System (ADS)

Piragnolo, Marco; Masiero, Andrea; Pirotti, Francesco

2017-04-01

Since recent years surveying with unmanned aerial vehicles (UAV) is getting a great amount of attention due to decreasing costs, higher precision and flexibility of usage. UAVs have been applied for geomorphological investigations, forestry, precision agriculture, cultural heritage assessment and for archaeological purposes. It can be used for land use and land cover classification (LULC). In literature, there are two main types of approaches for classification of remote sensing imagery: pixel-based and object-based. On one hand, pixel-based approach mostly uses training areas to define classes and respective spectral signatures. On the other hand, object-based classification considers pixels, scale, spatial information and texture information for creating homogeneous objects. Machine learning methods have been applied successfully for classification, and their use is increasing due to the availability of faster computing capabilities. The methods learn and train the model from previous computation. Two machine learning methods which have given good results in previous investigations are Random Forest (RF) and Support Vector Machine (SVM). The goal of this work is to compare RF and SVM methods for classifying LULC using images collected with a fixed wing UAV. The processing chain regarding classification uses packages in R, an open source scripting language for data analysis, which provides all necessary algorithms. The imagery was acquired and processed in November 2015 with cameras providing information over the red, blue, green and near infrared wavelength reflectivity over a testing area in the campus of Agripolis, in Italy. Images were elaborated and ortho-rectified through Agisoft Photoscan. The ortho-rectified image is the full data set, and the test set is derived from partial sub-setting of the full data set. Different tests have been carried out, using a percentage from 2 % to 20 % of the total. Ten training sets and ten validation sets are obtained from each test set. The control dataset consist of an independent visual classification done by an expert over the whole area. The classes are (i) broadleaf, (ii) building, (iii) grass, (iv) headland access path, (v) road, (vi) sowed land, (vii) vegetable. The RF and SVM are applied to the test set. The performances of the methods are evaluated using the three following accuracy metrics: Kappa index, Classification accuracy and Classification Error. All three are calculated in three different ways: with K-fold cross validation, using the validation test set and using the full test set. The analysis indicates that SVM gets better results in terms of good scores using K-fold cross or validation test set. Using the full test set, RF achieves a better result in comparison to SVM. It also seems that SVM performs better with smaller training sets, whereas RF performs better as training sets get larger.
AMSR2 Soil Moisture Product Validation

NASA Technical Reports Server (NTRS)

Bindlish, R.; Jackson, T.; Cosh, M.; Koike, T.; Fuiji, X.; de Jeu, R.; Chan, S.; Asanuma, J.; Berg, A.; Bosch, D.;

2017-01-01

The Advanced Microwave Scanning Radiometer 2 (AMSR2) is part of the Global Change Observation Mission-Water (GCOM-W) mission. AMSR2 fills the void left by the loss of the Advanced Microwave Scanning Radiometer Earth Observing System (AMSR-E) after almost 10 years. Both missions provide brightness temperature observations that are used to retrieve soil moisture. Merging AMSR-E and AMSR2 will help build a consistent long-term dataset. Before tackling the integration of AMSR-E and AMSR2 it is necessary to conduct a thorough validation and assessment of the AMSR2 soil moisture products. This study focuses on validation of the AMSR2 soil moisture products by comparison with in situ reference data from a set of core validation sites. Three products that rely on different algorithms were evaluated; the JAXA Soil Moisture Algorithm (JAXA), the Land Parameter Retrieval Model (LPRM), and the Single Channel Algorithm (SCA). Results indicate that overall the SCA has the best performance based upon the metrics considered.

Development and validation of an eating norms inventory. Americans' lay-beliefs about appropriate eating.

PubMed

Fisher, Robert J; Dubé, Laurette

2011-10-01

What do American adults believe about what, where, when, how much, and how often it is appropriate to eat? Such normative beliefs originate from family and friends through socialization processes, but they are also influenced by governments, educational institutions, and businesses. Norms therefore provide an important link between the social environment and individual attitudes and behaviors. This paper reports on five studies that identify, develop, and validate measures of normative beliefs about eating. In study 1 we use an inductive method to identify what American adults believe are appropriate or desirable eating behaviors. Studies 2 and 3 are used to purify and assess the discriminant and nomological validity of the proposed set of 18 unidimensional eating norms. Study 4 assesses predictive validity and finds that acting in a norm-consistent fashion is associated with lower Body Mass Index (BMI), and greater body satisfaction and subjective health. Study 5 assesses the underlying social desirability and perceived healthiness of the norms. Copyright © 2011 Elsevier Ltd. All rights reserved.
Reliability, validity, and interpretation of the dependence scale in mild to moderately severe Alzheimer's disease.

PubMed

Lenderking, William R; Wyrwich, Kathleen W; Stolar, Marilyn; Howard, Kellee A; Leibman, Chris; Buchanan, Jacqui; Lacey, Loretto; Kopp, Zoe; Stern, Yaakov

2013-12-01

The Dependence Scale (DS) was designed to measure dependence on others among patients with Alzheimer's disease (AD). The objectives of this research were primarily to strengthen the psychometric evidence for the use of the DS in AD studies. Patients with mild to moderately severe AD were examined in 3 study databases. Within each data set, internal consistency, validity, and responsiveness were examined, and structural equation models were fit. The DS has strong psychometric properties. The DS scores differed significantly across known groups and demonstrated moderate to strong correlations with measures hypothesized to be related to dependence (|r| ≥ .31). Structural equation modeling supported the validity of the DS concept. An anchor-based DS responder definition to interpret a treatment benefit over time was identified. The DS is a reliable, valid, and interpretable measure of dependence associated with AD and is shown to be related to--but provides information distinct from--cognition, functioning, and behavior.
Content validation using an expert panel: assessment process for assistive technology adopted by farmers with disabilities.

PubMed

Mathew, S N; Field, W E; French, B F

2011-07-01

This article reports the use of an expert panel to perform content validation of an experimental assessment process for the safety of assistive technology (AT) adopted by farmers with disabilities. The validation process was conducted by a panel of six experts experienced in the subject matter, i.e., design, use, and assessment of AT for farmers with disabilities. The exercise included an evaluation session and two focus group sessions. The evaluation session consisted of using the assessment process under consideration by the panel to evaluate a set of nine ATs fabricated by a farmer on his farm site. The expert panel also participated in the focus group sessions conducted immediately before and after the evaluation session. The resulting data were analyzed using discursive analysis, and the results were incorporated into the final assessment process. The method and the results are presented with recommendations for the use of expert panels in research projects and validation of assessment tools.
The effects of competition on achievement motivation in Chinese classrooms.

PubMed

Lam, Shui-fong; Yim, Pui-shan; Law, Josephine S F; Cheung, Rebecca W Y

2004-06-01

Laboratory studies have consistently found that competition induces performance goals and affects learning motivation. However, the ecological validity of these results is yet to be established. There is a need for investigation of whether the results hold in both the classroom context and non-Western culture. The study investigated the effects of competition on learning motivation among Chinese students in an authentic classroom setting. The participants were 52 students of grade 7 from two Hong Kong secondary schools. They were randomly assigned to either competitive or non-competitive conditions in a 2-hour Chinese typewriting course. Students in the competitive condition performed better in easy tasks than their counterparts in the non-competitive condition. However, they were more performance-oriented and more likely to sacrifice learning opportunities for better performance. They were also prone to have worse self-evaluation after failure. Although there were no statistically significant differences between the two conditions in task enjoyment and achievement attribution, the direction of the differences was consistently unfavourable to students in the competitive condition. The findings were consistent with the predictions of goal theory. Competitiveness induces performance goals and worse self-evaluation after failure among Chinese students in a classroom setting, as was found with Western students in a laboratory setting.
Psychometric Evaluation of the MMPI-2/MMPI-2-RF Restructured Clinical Scales in an Israeli Sample.

PubMed

Shkalim, Eleanor

2015-10-01

The current study cross-culturally evaluated the psychometric properties of the Minnesota Multiphasic Personality Inventory-2 (MMPI-2)/MMPI-2-Restructured Form Restructured Clinical (RC) Scales in psychiatric settings in Israel with a sample of 100 men and 133 women. Participants were administered the MMPI-2 and were rated by their therapists on a 188-item Patient Description Form. Results indicated that in most instances the RC Scales demonstrated equivalent or better internal consistencies and improved intercorrelation patterns relative to their clinical counterparts. Furthermore, external analyses revealed comparable or improved convergent validity (with the exceptions of Antisocial Behavior [RC4] and Ideas of Persecution [RC6] among men), and mostly greater discriminant validity. Overall, the findings indicate that consistent with previous findings, the RC Scales generally exhibit comparable to improved psychometric properties over the Clinical Scales. Implications of the results, limitations, and recommendations for future research are discussed. © The Author(s) 2014.
A psychometric analysis of the reading the mind in the eyes test: toward a brief form for research and applied settings

PubMed Central

Olderbak, Sally; Wilhelm, Oliver; Olaru, Gabriel; Geiger, Mattis; Brenneman, Meghan W.; Roberts, Richard D.

2015-01-01

The Reading the Mind in the Eyes Test is a popular measure of individual differences in Theory of Mind that is often applied in the assessment of particular clinical populations (primarily, individuals on the autism spectrum). However, little is known about the test's psychometric properties, including factor structure, internal consistency, and convergent validity evidence. We present a psychometric analysis of the test followed by an evaluation of other empirically proposed and statistically identified structures. We identified, and cross-validated in a second sample, an adequate short-form solution that is homogeneous with adequate internal consistency, and is moderately related to Cognitive Empathy, Emotion Perception, and strongly related to Vocabulary. We recommend the use of this short-form solution in normal adults as a more precise measure over the original version. Future revisions of the test should seek to reduce the test's reliance on one's vocabulary and evaluate the short-form structure in clinical populations. PMID:26500578
DNA barcode identification of black cohosh herbal dietary supplements.

PubMed

Baker, David A; Stevenson, Dennis W; Little, Damon P

2012-01-01

Black cohosh (Actaea racemosa) herbal dietary supplements are commonly consumed to treat menopausal symptoms, but there are reports of adverse events and toxicities associated with their use. Accidental misidentification and/or deliberate adulteration results in harvesting other related species that are then marketed as black cohosh. Some of these species are known to be toxic to humans. We have identified two matK nucleotides that consistently distinguish black cohosh from related species. Using these nucleotides, an assay was able to correctly identify all of the black cohosh samples in the validation set. None of the other Actaea species in the validation set were falsely identified as black cohosh. Of 36 dietary supplements sequenced, 27 (75%) had a sequence that exactly matched black cohosh. The remaining nine samples (25%) had a sequence identical to that of three Asian Actaea species (A. cimicifuga, A. dahurica, and A. simplex). Manufacturers should routinely test plant material using a reliable assay to ensure accurate labeling.
Rapid construction of a whole-genome transposon insertion collection for Shewanella oneidensis by Knockout Sudoku.

PubMed

Baym, Michael; Shaket, Lev; Anzai, Isao A; Adesina, Oluwakemi; Barstow, Buz

2016-11-10

Whole-genome knockout collections are invaluable for connecting gene sequence to function, yet traditionally, their construction has required an extraordinary technical effort. Here we report a method for the construction and purification of a curated whole-genome collection of single-gene transposon disruption mutants termed Knockout Sudoku. Using simple combinatorial pooling, a highly oversampled collection of mutants is condensed into a next-generation sequencing library in a single day, a 30- to 100-fold improvement over prior methods. The identities of the mutants in the collection are then solved by a probabilistic algorithm that uses internal self-consistency within the sequencing data set, followed by rapid algorithmically guided condensation to a minimal representative set of mutants, validation, and curation. Starting from a progenitor collection of 39,918 mutants, we compile a quality-controlled knockout collection of the electroactive microbe Shewanella oneidensis MR-1 containing representatives for 3,667 genes that is functionally validated by high-throughput kinetic measurements of quinone reduction.

On the granular fingering instability: controlled triggering in laboratory experiments and numerical simulations

NASA Astrophysics Data System (ADS)

Vriend, Nathalie; Tsang, Jonny; Arran, Matthew; Jin, Binbin; Johnsen, Alexander

2017-11-01

When a mixture of small, smooth particles and larger, coarse particles is released on a rough inclined plane, the initial uniform front may break up in distinct fingers which elongate over time. This fingering instability is sensitive to the unique arrangement of individual particles and is driven by granular segregation (Pouliquen et al., 1997). Variability in initial conditions create significant limitations for consistent experimental and numerical validation of newly developed theoretical models (Baker et al., 2016) for finger formation. We present an experimental study using a novel tool that sets the initial fingering width of the instability. By changing this trigger width between experiments, we explore the response of the avalanche breakup to perturbations of different widths. Discrete particle simulations (using MercuryDPM, Thornton et al., 2012) are conducted under a similar setting, reproducing the variable finger width, allowing validation between experiments and numerical simulations. A good agreement between simulations and experiments is obtained, and ongoing theoretical work is briefly introduced. NMV acknowledges the Royal Society Dorothy Hodgkin Research Fellowship.
Cross-cultural validation of the revised temperament and character inventory in the Bulgarian language.

PubMed

Tilov, Boris; Dimitrova, Donka; Stoykova, Maria; Tornjova, Bianka; Foreva, Gergana; Stoyanov, Drozdstoj

2012-12-01

Health-care professions have long been considered prone to work-related stress, yet recent research in Bulgaria indicates alarmingly high levels of burnout. Cloninger's inventory is used to analyse and evaluate correlation between personality characteristics and degree of burnout syndrome manifestation among the risk categories of health-care professionals. The primary goal of this study was to test the conceptual validity and cross-cultural applicability of the revised TCI (TCI-R), developed in the United States, in a culturally, socially and economically diverse setting. Linguistic validation, test-retest studies, statistical and expert analyses were performed to assess cross-cultural applicability of the revised Cloninger's temperament and character inventory in Bulgarian, its reliability and internal consistency and construct validity. The overall internal consistency of TCI-R and its scales as well as the interscale and test-retest correlations prove that the translated version of the questionnaire is acceptable and cross-culturally applicable for the purposes of studying organizational stress and burnout risk in health-care professionals. In general the cross-cultural adaptation process, even if carried out in a rigorous way, does not always lead to the best target version and suggests it would be useful to develop new scales specific to each culture and, at the same time, to think about the trans-cultural adaptation. © 2012 Blackwell Publishing Ltd.
Psychometric properties of the Dutch version of the London Measure of Unplanned Pregnancy in women with pregnancies ending in birth.

PubMed

Goossens, Joline; Verhaeghe, Sofie; Van Hecke, Ann; Barrett, Geraldine; Delbaere, Ilse; Beeckman, Dimitri

2018-01-01

To evaluate the psychometric properties of the Dutch version of the London Measure of Unplanned Pregnancy in women with pregnancies ending in birth. A two-phase psychometric evaluation design was set-up. Phase I comprised the translation from English into Dutch and pretesting with 6 women using cognitive interviews. In phase II, the reliability and validity of the Dutch version of the LMUP was assessed in 517 women giving birth recently. Reliability (internal consistency) was assessed using Cronbach's alpha, inter-item correlations, and corrected item-total correlations. Construct validity was assessed using principal components analysis and hypothesis testing. Exploratory Mokken scale analysis was carried out. 517 women aged 15-45 completed the Dutch version of the LMUP. Reliability testing showed acceptable internal consistency (alpha = 0.74, positive inter-item correlations between all items, all corrected item-total correlations >0.20). Validity testing confirmed the unidimensional structure of the scale and all hypotheses were confirmed. The overall Loevinger's H coefficient was 0.57, representing a 'strong' scale. The Dutch version of the LMUP is a reliable and valid measure that can be used in the Dutch-speaking population in Belgium to assess pregnancy planning. Future research is necessary to assess the stability of the Dutch version of the LMUP, and to evaluate its psychometric properties in women with abortions.
Site characterization in densely fractured dolomite: Comparison of methods

USGS Publications Warehouse

Muldoon, M.; Bradbury, K.R.

2005-01-01

One of the challenges in characterizing fractured-rock aquifers is determining whether the equivalent porous medium approximation is valid at the problem scale. Detailed hydrogeologic characterization completed at a small study site in a densely fractured dolomite has yielded an extensive data set that was used to evaluate the utility of the continuum and discrete-fracture approaches to aquifer characterization. There are two near-vertical sets of fractures at the site; near-horizontal bedding-plane partings constitute a third fracture set. Eighteen boreholes, including five coreholes, were drilled to a depth of ???10.6 m. Borehole geophysical logs revealed several laterally extensive horizontal fractures and dissolution zones. Flowmeter and short-interval packer testing identified which of these features were hydraulically important. A monitoring system, consisting of short-interval piezometers and multilevel samplers, was designed to monitor four horizontal fractures and two dissolution zones. The resulting network consisted of >70 sampling points and allowed detailed monitoring of head distributions in three dimensions. Comparison of distributions of hydraulic head - and hydraulic conductivity determined by these two approaches suggests that even in a densely fractured-carbonate aquifer, a characterization approach using traditional long-interval monitoring wells is inadequate to characterize ground water movement for the purposes of regulatory monitoring or site remediation. In addition, traditional multiwell pumping tests yield an average or bulk hydraulic conductivity that is not adequate for predicting rapid ground water travel times through the fracture network, and the pumping test response does not appear to be an adequate tool for assessing whether the porous medium approximation is valid. Copyright ?? 2005 National Ground Water Association.
Supervised group Lasso with applications to microarray data analysis

PubMed Central

Ma, Shuangge; Song, Xiao; Huang, Jian

2007-01-01

Background A tremendous amount of efforts have been devoted to identifying genes for diagnosis and prognosis of diseases using microarray gene expression data. It has been demonstrated that gene expression data have cluster structure, where the clusters consist of co-regulated genes which tend to have coordinated functions. However, most available statistical methods for gene selection do not take into consideration the cluster structure. Results We propose a supervised group Lasso approach that takes into account the cluster structure in gene expression data for gene selection and predictive model building. For gene expression data without biological cluster information, we first divide genes into clusters using the K-means approach and determine the optimal number of clusters using the Gap method. The supervised group Lasso consists of two steps. In the first step, we identify important genes within each cluster using the Lasso method. In the second step, we select important clusters using the group Lasso. Tuning parameters are determined using V-fold cross validation at both steps to allow for further flexibility. Prediction performance is evaluated using leave-one-out cross validation. We apply the proposed method to disease classification and survival analysis with microarray data. Conclusion We analyze four microarray data sets using the proposed approach: two cancer data sets with binary cancer occurrence as outcomes and two lymphoma data sets with survival outcomes. The results show that the proposed approach is capable of identifying a small number of influential gene clusters and important genes within those clusters, and has better prediction performance than existing methods. PMID:17316436
Site characterization in densely fractured dolomite: comparison of methods.

PubMed

Muldoon, Maureen; Bradbury, Ken R

2005-01-01

One of the challenges in characterizing fractured-rock aquifers is determining whether the equivalent porous medium approximation is valid at the problem scale. Detailed hydrogeologic characterization completed at a small study site in a densely fractured dolomite has yielded an extensive data set that was used to evaluate the utility of the continuum and discrete-fracture approaches to aquifer characterization. There are two near-vertical sets of fractures at the site; near-horizontal bedding-plane partings constitute a third fracture set. Eighteen boreholes, including five coreholes, were drilled to a depth of approximately 10.6 m. Borehole geophysical logs revealed several laterally extensive horizontal fractures and dissolution zones. Flowmeter and short-interval packer testing identified which of these features were hydraulically important. A monitoring system, consisting of short-interval piezometers and multilevel samplers, was designed to monitor four horizontal fractures and two dissolution zones. The resulting network consisted of >70 sampling points and allowed detailed monitoring of head distributions in three dimensions. Comparison of distributions of hydraulic head and hydraulic conductivity determined by these two approaches suggests that even in a densely fractured-carbonate aquifer, a characterization approach using traditional long-interval monitoring wells is inadequate to characterize ground water movement for the purposes of regulatory monitoring or site remediation. In addition, traditional multiwell pumping tests yield an average or bulk hydraulic conductivity that is not adequate for predicting rapid ground water travel times through the fracture network, and the pumping test response does not appear to be an adequate tool for assessing whether the porous medium approximation is valid.
Psychometric Evaluation of the Revised Michigan Diabetes Knowledge Test (V.2016) in Arabic: Translation and Validation

PubMed Central

Alhaiti, Ali Hassan; Alotaibi, Alanod Raffa; Jones, Linda Katherine; DaCosta, Cliff

2016-01-01

Objective. To translate the revised Michigan Diabetes Knowledge Test into the Arabic language and examine its psychometric properties. Setting. Of the 139 participants recruited through King Fahad Medical City in Riyadh, Saudi Arabia, 34 agreed to the second-round sample for retesting purposes. Methods. The translation process followed the World Health Organization's guidelines for the translation and adaptation of instruments. All translations were examined for their validity and reliability. Results. The translation process revealed excellent results throughout all stages. The Arabic version received 0.75 for internal consistency via Cronbach's alpha test and excellent outcomes in terms of the test-retest reliability of the instrument with a mean of 0.90 infraclass correlation coefficient. It also received positive content validity index scores. The item-level content validity index for all instrument scales fell between 0.83 and 1 with a mean scale-level index of 0.96. Conclusion. The Arabic version is proven to be a reliable and valid measure of patient's knowledge that is ready to be used in clinical practices. PMID:27995149
Defining physicians' readiness to screen and manage intimate partner violence in Greek primary care settings.

PubMed

Papadakaki, Maria; Prokopiadou, Dimitra; Petridou, Eleni; Kogevinas, Manolis; Lionis, Christos

2012-06-01

The current article aims to translate the PREMIS (Physician Readiness to Manage Intimate Partner Violence) survey into the Greek language and test its validity and reliability in a sample of primary care physicians. The validation study was conducted in 2010 and involved all the general practitioners serving two adjacent prefectures of Greece (n = 80). Maximum-likelihood factor analysis (MLF) was used to extract key survey factors. The instrument was further assessed for the following psychometric properties: (a) scale reliability, (b) item-specific reliability, (c) test-retest reliability, (d) scale construct validity, and (e) internal predictive validity. The MLF analysis of 23 opinion items revealed a seven-factor solution (preparation, constraint, workplace issues, screening, self-efficacy, alcohol/drugs, victim understanding), which was statistically sound (p = .293). Most of the newly derived scales displayed satisfactory internal consistency (α ≥ .60), high item-specific reliability, strong construct, and internal predictive validity (F = 2.82; p = .004), and high repeatability when retested with 20 individuals (intraclass correlation coefficient [ICC] > .70). The tool was found appropriate to facilitate the identification of competence deficits and the evaluation of training initiatives.
C-reactive protein and N-terminal prohormone brain natriuretic peptide as biomarkers in acute exacerbations of COPD leading to hospitalizations.

PubMed

Chen, Yu-Wei Roy; Chen, Virginia; Hollander, Zsuzsanna; Leipsic, Jonathon A; Hague, Cameron J; DeMarco, Mari L; FitzGerald, J Mark; McManus, Bruce M; Ng, Raymond T; Sin, Don D

2017-01-01

There are currently no accepted and validated blood tests available for diagnosing acute exacerbations of chronic obstructive pulmonary disease (AECOPD). In this study, we sought to determine the discriminatory power of blood C-reactive protein (CRP) and N-terminal prohormone brain natriuretic peptide (NT-proBNP) in the diagnosis of AECOPD requiring hospitalizations. The study cohort consisted of 468 patients recruited in the COPD Rapid Transition Program who were hospitalized with a primary diagnosis of AECOPD, and 110 stable COPD patients who served as controls. Logistic regression was used to build a classification model to separate AECOPD from convalescent or stable COPD patients. Performance was assessed using an independent validation set of patients who were not included in the discovery set. Serum CRP and whole blood NT-proBNP concentrations were highest at the time of hospitalization and progressively decreased over time. Of the 3 classification models, the one with both CRP and NT-proBNP had the highest AUC in discriminating AECOPD (cross-validated AUC of 0.80). These data were replicated in a validation cohort with an AUC of 0.88. A combination of CRP and NT-proBNP can reasonably discriminate AECOPD requiring hospitalization versus clinical stability and can be used to rapidly diagnose patients requiring hospitalization for AECOPD.
Development of a Short Questionnaire to Measure an Extended Set of Job Demands, Job Resources, and Positive Health Outcomes: The New Brief Job Stress Questionnaire

PubMed Central

INOUE, Akiomi; KAWAKAMI, Norito; SHIMOMITSU, Teruichi; TSUTSUMI, Akizumi; HARATANI, Takashi; YOSHIKAWA, Toru; SHIMAZU, Akihito; ODAGIRI, Yuko

2014-01-01

This study aimed to investigate the reliability and construct validity of a new version of the Brief Job Stress Questionnaire (New BJSQ), which measures an extended set of psychosocial factors at work by adding new scales/items to the current version of the BJSQ. Additional scales/items were extensively collected from theoretical job stress models and similar questionnaires in several countries. Scales/items were field-tested and refined through a pilot internet survey. Finally, an 84-item questionnaire (141 items in total when combined with the current BJSQ) was developed. A nationally representative survey was administered to employees in Japan (n=1,633) to examine the reliability and construct validity. Most scales showed acceptable levels of internal consistency and test-retest reliability. Principal component analyses showed that the first factor explained 50% or greater proportion of the variance in most scales. A scale factor analysis and a correlation analysis showed that these scales fit the theoretical expectations. These findings provided a piece of evidence that the New BJSQ scales are reliable and valid. Although more detailed content and construct validity should be examined in future study, the New BJSQ is a useful instrument to evaluate psychosocial work environment and positive mental health outcomes in the current workplace. PMID:24492763
Development of a short questionnaire to measure an extended set of job demands, job resources, and positive health outcomes: the new brief job stress questionnaire.

PubMed

Inoue, Akiomi; Kawakami, Norito; Shimomitsu, Teruichi; Tsutsumi, Akizumi; Haratani, Takashi; Yoshikawa, Toru; Shimazu, Akihito; Odagiri, Yuko

2014-01-01

This study aimed to investigate the reliability and construct validity of a new version of the Brief Job Stress Questionnaire (New BJSQ), which measures an extended set of psychosocial factors at work by adding new scales/items to the current version of the BJSQ. Additional scales/items were extensively collected from theoretical job stress models and similar questionnaires in several countries. Scales/items were field-tested and refined through a pilot internet survey. Finally, an 84-item questionnaire (141 items in total when combined with the current BJSQ) was developed. A nationally representative survey was administered to employees in Japan (n=1,633) to examine the reliability and construct validity. Most scales showed acceptable levels of internal consistency and test-retest reliability. Principal component analyses showed that the first factor explained 50% or greater proportion of the variance in most scales. A scale factor analysis and a correlation analysis showed that these scales fit the theoretical expectations. These findings provided a piece of evidence that the New BJSQ scales are reliable and valid. Although more detailed content and construct validity should be examined in future study, the New BJSQ is a useful instrument to evaluate psychosocial work environment and positive mental health outcomes in the current workplace.
The development and validation of the On-the-job Learning Styles Questionnaire for the Nursing Profession.

PubMed

Berings, Marjolein G M C; Poell, Rob F; Simons, P Robert-Jan; van Veldhoven, Marc J P M

2007-06-01

This paper is a report of a study to develop and test the psychometric properties of the On-the-job Learning Style Questionnaire for the Nursing Profession. Although numerous questionnaires measuring learning styles have been developed, none are suitable for working environments. Existing instruments do not meet the requirements for use in workplace settings and tend to ignore the influence of different learning situations. The questionnaire was constructed using a situation-response design, measuring learning activities in different on-the-job learning situations. Content validity was ensured by basing the questionnaire on interview studies. The questionnaire was distributed to 912 Registered Nurses working in different departments of 13 general hospitals in the Netherlands at the end of 2005. The response rate was 41% (372 questionnaires). The internal factor structure of the questionnaire was partly based on the learning activities in which nurses participate and partly on the learning situation in which they are performed. The internal consistency was good. The situation-response design of the questionnaire demonstrated its added value. Construct validity was estimated using intercorrelations between the scales, and criterion validity was estimated based on the relationships of the scales with perceived professional competence. The On-the-job Learning Styles Questionnaire for the Nursing Profession is well suited to describing nurses' learning styles in on-the-job settings and has satisfactory psychometric properties.
Assessing the validity and reliability of the Pool Activity Level (PAL) Checklist for use with older people with dementia.

PubMed

Wenborn, Jennifer; Challis, David; Pool, Jackie; Burgess, Jane; Elliott, Nicola; Orrell, Martin

2008-03-01

Activity is key to maintaining physical and mental health and well-being. However, as dementia affects the ability to engage in activity, care-givers can find it difficult to provide appropriate activities. The Pool Activity Level (PAL) Checklist guides the selection of appropriate, personally meaningful activities. The aim of this study was to assess the reliability and validity of the PAL Checklist when used with older people with dementia. A postal questionnaire sent to activity providers assessed content validity. Validity and reliability were measured in a sample of 60 older people with dementia. The questionnaire response rate was 83% (102/122). Most respondents felt no important items were missing. Seven of the nine activities were ranked as 'very important' or 'essential' by at least 77% of the sample, indicating very good content validity. Correlation with measures of cognition, severity of dementia and activity performance demonstrated strong concurrent validity. Inter-item correlation indicated strong construct validity. Cronbach's alpha coefficient measured internal consistency as excellent (0.95). All items achieved acceptable test-retest reliability, and the majority demonstrated acceptable inter-rater reliability. We conclude that the PAL Checklist demonstrates adequate validity and reliability when used with older people with dementia and appears a useful tool for a variety of care settings.
The property of the Japanese version of the Recovery Knowledge Inventory (RKI) among mental health service providers: a cross sectional study.

PubMed

Chiba, Rie; Umeda, Maki; Goto, Kyohei; Miyamoto, Yuki; Yamaguchi, Sosei; Kawakami, Norito

2017-01-01

The Recovery Knowledge Inventory (RKI) is one of the influential scales to assess knowledge and attitude toward recovery-oriented practices among mental health service providers. In the present study, we aimed to develop a Japanese version of RKI and examine the validity and reliability. We translated RKI into Japanese by reference to the guidelines for translating and adapting psychometric scales. A cross-sectional questionnaire survey was conducted with mental health service providers. Of a total of 475 eligible professionals, we used data from the 299 participants without missing value for the analyses (valid response rate = 62.9%). The questionnaire included Japanese RKI, Recovery Attitudes Questionnaire, The positive attitudes scale, and Japanese-language version of the Social Distance Scale. To examine the factorial validity of RKI, explanatory factor analysis and confirmatory factor analysis was employed. Convergent validity was assessed by calculating Pearson's correlation coefficients between the total RKI score and the scores for the other three scales. We also calculated Cronbach's α coefficients for the total score and for each domain of RKI to assess internal consistency reliability. The participants' mean age was 40.4 years and 30.4% were men. 20-item RKI did not provide any adequate or interpretable factor solutions at any number of factors by EFAs. Thus four items (#1, 4, 5, and 13) were subsequently eliminated in stages, then 16-item RKI was employed as a consequence for further analyses. EFA with four factor structures yielded marginally interpretable constitution. Each factor represented the knowledge regarding psychiatric symptoms and recovery; knowledge about the recovery process; the understanding of what is important for recovery; and the understanding of the challenges and responsibility in recovery, respectively. Subsequent CFA suggested good fit to the data. Good convergent validity and understandable internal consistency reliability were also observed. The Japanese 16-item RKI revealed reasonable factorial validity, good convergent validity, and understandable internal consistency reliability among mental health professionals. Japanese cultural settings seemed to influence the four-factor structure in the present study. It can be used for future study in Japan, while future large-scale research is required to ensure robust verification.
Reliability and validity of a self-administered tool for online neuropsychological testing: The Amsterdam Cognition Scan.

PubMed

Feenstra, Heleen E M; Murre, Jaap M J; Vermeulen, Ivar E; Kieffer, Jacobien M; Schagen, Sanne B

2018-04-01

To facilitate large-scale assessment of a variety of cognitive abilities in clinical studies, we developed a self-administered online neuropsychological test battery: the Amsterdam Cognition Scan (ACS). The current studies evaluate in a group of adult cancer patients: test-retest reliability of the ACS and the influence of test setting (home or hospital), and the relationship between our online and a traditional test battery (concurrent validity). Test-retest reliability was studied in 96 cancer patients (57 female; M age = 51.8 years) who completed the ACS twice. Intraclass correlation coefficients (ICCs) were used to assess consistency over time. The test setting was counterbalanced between home and hospital; influence on test performance was assessed by repeated measures analyses of variance. Concurrent validity was studied in 201 cancer patients (112 female; M age = 53.5 years) who completed both the online and an equivalent traditional neuropsychological test battery. Spearman or Pearson correlations were used to assess consistency between online and traditional tests. ICCs of the online tests ranged from .29 to .76, with an ICC of .78 for the ACS total score. These correlations are generally comparable with the test-retest correlations of the traditional tests as reported in the literature. Correlating online and traditional test scores, we observed medium to large concurrent validity (r/ρ = .42 to .70; total score r = .78), except for a visuospatial memory test (ρ = .36). Correlations were affected-as expected-by design differences between online tests and their offline counterparts. Although development and optimization of the ACS is an ongoing process, and reliability can be optimized for several tests, our results indicate that it is a highly usable tool to obtain (online) measures of various cognitive abilities. The ACS is expected to facilitate efficient gathering of data on cognitive functioning in the near future.
Numerical analysis of the dynamic interaction between wheel set and turnout crossing using the explicit finite element method

NASA Astrophysics Data System (ADS)

Xin, L.; Markine, V. L.; Shevtsov, I. Y.

2016-03-01

A three-dimensional (3-D) explicit dynamic finite element (FE) model is developed to simulate the impact of the wheel on the crossing nose. The model consists of a wheel set moving over the turnout crossing. Realistic wheel, wing rail and crossing geometries have been used in the model. Using this model the dynamic responses of the system such as the contact forces between the wheel and the crossing, crossing nose displacements and accelerations, stresses in rail material as well as in sleepers and ballast can be obtained. Detailed analysis of the wheel set and crossing interaction using the local contact stress state in the rail is possible as well, which provides a good basis for prediction of the long-term behaviour of the crossing (fatigue analysis). In order to tune and validate the FE model field measurements conducted on several turnouts in the railway network in the Netherlands are used here. The parametric study including variations of the crossing nose geometries performed here demonstrates the capabilities of the developed model. The results of the validation and parametric study are presented and discussed.
Development and Preliminary Validation of Refugee Trauma History Checklist (RTHC)—A Brief Checklist for Survey Studies

PubMed Central

Gottvall, Maria; Vaez, Marjan

2017-01-01

A high proportion of refugees have been subjected to potentially traumatic experiences (PTEs), including torture. PTEs, and torture in particular, are powerful predictors of mental ill health. This paper reports the development and preliminary validation of a brief refugee trauma checklist applicable for survey studies. Methods: A pool of 232 items was generated based on pre-existing instruments. Conceptualization, item selection and item refinement was conducted based on existing literature and in collaboration with experts. Ten cognitive interviews using a Think Aloud Protocol (TAP) were performed in a clinical setting, and field testing of the proposed checklist was performed in a total sample of n = 137 asylum seekers from Syria. Results: The proposed refugee trauma history checklist (RTHC) consists of 2 × 8 items, concerning PTEs that occurred before and during the respondents’ flight, respectively. Results show low item non-response and adequate psychometric properties Conclusions: RTHC is a usable tool for providing self-report data on refugee trauma history surveys of community samples. The core set of included events can be augmented and slight modifications can be applied to RTHC for use also in other refugee populations and settings. PMID:28976937
Development of a gridded meteorological dataset over Java island, Indonesia 1985–2014

PubMed Central

Yanto; Livneh, Ben; Rajagopalan, Balaji

2017-01-01

We describe a gridded daily meteorology dataset consisting of precipitation, minimum and maximum temperature over Java Island, Indonesia at 0.125°×0.125° (~14 km) resolution spanning 30 years from 1985–2014. Importantly, this data set represents a marked improvement from existing gridded data sets over Java with higher spatial resolution, derived exclusively from ground-based observations unlike existing satellite or reanalysis-based products. Gap-infilling and gridding were performed via the Inverse Distance Weighting (IDW) interpolation method (radius, r, of 25 km and power of influence, α, of 3 as optimal parameters) restricted to only those stations including at least 3,650 days (~10 years) of valid data. We employed MSWEP and CHIRPS rainfall products in the cross-validation. It shows that the gridded rainfall presented here produces the most reasonable performance. Visual inspection reveals an increasing performance of gridded precipitation from grid, watershed to island scale. The data set, stored in a network common data form (NetCDF), is intended to support watershed-scale and island-scale studies of short-term and long-term climate, hydrology and ecology. PMID:28534871
Avoiding Deontic Explosion by Contextually Restricting Aggregation

NASA Astrophysics Data System (ADS)

Meheus, Joke; Beirlaen, Mathieu; van de Putte, Frederik

In this paper, we present an adaptive logic for deontic conflicts, called P2.1 r , that is based on Goble's logic SDL a P e - a bimodal extension of Goble's logic P that invalidates aggregation for all prima facie obligations. The logic P2.1 r has several advantages with respect to SDL a P e. For consistent sets of obligations it yields the same results as Standard Deontic Logic and for inconsistent sets of obligations, it validates aggregation "as much as possible". It thus leads to a richer consequence set than SDL a P e. The logic P2.1 r avoids Goble's criticisms against other non-adjunctive systems of deontic logic. Moreover, it can handle all the 'toy examples' from the literature as well as more complex ones.
Sources of self-efficacy belief: development and validation of two scales.

PubMed

Liu, Ou Lydia; Wilson, Mark

2010-01-01

Self-efficacy belief has been an instrumental affective factor in predicting student behavior and achievement in academic settings. Although there is abundant literature on efficacy belief per se, the sources of efficacy belief have not been fully researched. Very few instruments exist to quantify the sources of efficacy-beliefs. To fill this void, we developed two scales for the two main sources of self-efficacy belief: past performance and social persuasion. Pilot test data were collected from 255 middle school students. A self-efficacy measure was also administered to the students as a criterion measure. The Rasch rating scale model was used to analyze the data. Information on item fit, item design, content validity, external validity, internal consistency, and person separation reliability was examined. The two scales displayed satisfactory psychometric properties. Applications and limitations of these two scales are also discussed.

Evaluating process in child and family interventions: aggression prevention as an example.

PubMed

Tolan, Patrick H; Hanish, Laura D; McKay, Mary M; Dickey, Mitchell H

2002-06-01

This article reports on 2 studies designed to develop and validate a set of measures for use in evaluating processes of child and family interventions. In Study 1 responses from 187 families attending an outpatient clinic for child behavior problems were factor analyzed to identify scales, consistent across sources: Alliance (Satisfactory Relationship with Interventionist and Program Satisfaction), Parenting Skill Attainment, Child Cooperation During Session, Child Prosocial Behavior, and Child Aggressive Behavior. Study 2 focused on patterns of scale scores among 78 families taking part in a 22-week preventive intervention designed to affect family relationships, parenting, and child antisocial and prosocial behaviors. The factor structure identified in Study 1 was replicated. Scale construct validity was demonstrated through across-source convergence, sensitivity to intervention change, and ability to discriminate individual differences. Path analysis validated the scales' utility in explaining key aspects of the intervention process. Implications for evaluating processes in family interventions are discussed.
Psychometric testing of the modified Care Dependency Scale among hospitalized school-aged children in Germany.

PubMed

Tork, Hanan; Lohrmann, Christa; Dassen, Theo

2008-03-01

The objectives of this study were to examine the psychometric properties of the modified Care Dependency Scale in a pediatric setting and to explore the extent of dependency of school-aged children regarding their self-care. The data were collected from 130 hospitalized children, aged 6-12 years. The reliability was determined by Cronbach's alpha, which showed a high level of consistency. The subsequent inter-rater reliability revealed moderate-to-substantial agreement. The criterion-related validity was tested by comparing the sum scores of the Care Dependency Scale for Paediatrics and the Visual Analog Scale. Factor analysis was used to investigate the construct validity and resulted in a one-factor solution. In conclusion, this study provides evidence that the Care Dependency Scale for Paediatrics is a valid and reliable measure that offers a comprehensive assessment from a nursing perspective and enables nurses to help children acquire independence.
Validation of the Implementation Leadership Scale (ILS) in Substance Use Disorder Treatment Organizations

PubMed Central

Ehrhart, Mark G.; Torres, Elisa M.; Finn, Natalie K.; Roesch, Scott C.

2016-01-01

There have been recent calls for pragmatic measures to assess factors that influence evidence-based practice (EBP) implementation processes and outcomes. The Implementation Leadership Scale (ILS) is a brief and efficient measure that can be used for research or organizational development purposes to assess leader behaviors and actions that actively support effective EBP implementation. The ILS was developed and validated in mental health settings. This study validates the ILS factor structure with providers in alcohol and other drug (AOD) use treatment agencies. Participants were 323 service providers working in 72 workgroups from three AOD use treatment agencies. Confirmatory factor analyses and reliability analyses were conducted to examine the psychometric properties of the ILS. Convergent and discriminant validity were also assessed. Confirmatory factor analyses demonstrated good fit to the hypothesized first and second order factor structure. Internal consistency reliability was excellent. Convergent and discriminant validity was supported. The ILS psychometric characteristics, reliability, and validity were supported in AOD use treatment agencies. The ILS is a brief and pragmatic measure that can be used for research and practice to assess leadership for EBP implementation in AOD use treatment agencies. PMID:27431044
Validation of the Implementation Leadership Scale (ILS) in Substance use Disorder Treatment Organizations.

PubMed

Aarons, Gregory A; Ehrhart, Mark G; Torres, Elisa M; Finn, Natalie K; Roesch, Scott C

2016-09-01

There have been recent calls for pragmatic measures to assess factors that influence evidence-based practice (EBP) implementation processes and outcomes. The Implementation Leadership Scale (ILS) is a brief and efficient measure that can be used for research or organizational development purposes to assess leader behaviors and actions that actively support effective EBP implementation. The ILS was developed and validated in mental health settings. This study validates the ILS factor structure with providers in alcohol and other drug (AOD) use treatment agencies. Participants were 323 service providers working in 72 workgroups from three AOD use treatment agencies. Confirmatory factor analyses and reliability analyses were conducted to examine the psychometric properties of the ILS. Convergent and discriminant validity were also assessed. Confirmatory factor analyses demonstrated good fit to the hypothesized first and second order factor structure. Internal consistency reliability was excellent. Convergent and discriminant validity was supported. The ILS psychometric characteristics, reliability, and validity were supported in AOD use treatment agencies. The ILS is a brief and pragmatic measure that can be used for research and practice to assess leadership for EBP implementation in AOD use treatment agencies. Copyright © 2016 Elsevier Inc. All rights reserved.
Reliability and Construct Validity of the Portuguese Version of the Psychological Capital Questionnaire.

PubMed

Antunes, Ana Cristina; Caetano, António; Pina E Cunha, Miguel

2017-06-01

The Psychological Capital Questionnaire (PCQ) is the most commonly used measure for assessing psychological capital in work settings. Although several studies confirmed its factorial validity, most validation studies only examined the four-factor structure preconized by Luthans, Youssef, and Avolio, not attending to empirical evidence on alternative factorial structures. The present study aimed to test the psychometric properties of the Portuguese version of the PCQ, by using two independent samples (NS1 = 542; NS2 = 115) of Portuguese employees. We conducted a series of confirmatory factor analyses and found that, unlike previous findings, a five-factor solution of the PCQ best fitted the data. The evidence obtained also supported the existence of a second-order factor, psychological capital. The coefficients of internal consistency, as measured by Cronbach's alpha, were adequate and test-retest reliability suggested that the PCQ presented a lower stability than personality factors. Convergent validity, assessed with average variance extracted, revealed problems in the optimism subscale. The discriminant validity of the PCQ was confirmed by its correlations with Positive and Negative Affect and Big Five personality factors. Hierarchical regression analyses showed that this measure has incremental validity over personality and affect when predicting job performance.
Validity and reliability of an adapted Thai version of Scoliosis Research Society-22 questionnaire for adolescent idiopathic scoliosis.

PubMed

Sathira-Angkura, Vera; Pithankuakul, Kongkit; Sakulpipatana, Susana; Piyaskulkaew, Chaiwat; Kunakornsawat, Sombat

2012-04-20

Cross-sectional observational study to investigate psychometric properties of an adapted Thai version of the refined Scoliosis Research Society-22 (SRS-22) questionnaire. To evaluate the reliability and validity of the adapted Thai version of the refined SRS-22 questionnaire. The SRS-22 questionnaire is a valid instrument for assessing the health-related quality of life for patients with adolescent idiopathic scoliosis. Recently, the questionnaire has been translated and validated in many languages for non-English-speaking countries. Translation/retranslation of the English version of the SRS-22 was conducted, and the cross-cultural adaptation process was performed. The Thai version SRS-22 and previously validated Thai version Short-Form survey version 2.0 (SF-36V2) questionnaires were administered to 77 patients with adolescent idiopathic scoliosis who had surgical treatment. Fifty-eight patients (52 adolescent girls) had filled out the first set of questionnaires. Thirty patients of the first-time responders completed the second set of questionnaires. The mean age at the time of operation was 14.6 years and the mean age at the time of the final follow-up was 18.7 years. The mean preoperative scoliosis curve magnitude was 55.4° (range, 30°-95°) and postoperative curve magnitude was 20.1° (range, 0°-60°). Internal consistency was determined with Cronbach α coefficient. Intraclass correlation coefficient was used for test-retest reliability. Concurrent validity was evaluated by comparing SRS-22 domains with relevant domains in the SF-36V2 questionnaire, using the Pearson correlation coefficient. The mean overall Cronbach α coefficient of the adapted Thai version SRS-22 was 0.76. The 2 of corresponding domains (mental health = 0.80 and self-image = 0.83) had satisfactory internal consistency and the remaining domains (pain = 0.78; function/activity = 0.74; and satisfaction = 0.76) were good. The intraclass correlation coefficient for 5 domains was ranged from 0.79 to 0.90, which demonstrated the satisfactory test/retest reproducibility. The concurrent validity, determined by the Pearson correlation coefficient between SRS-22 and SF-36V2 domains, had a good correlation for 15 relevant comparisons (r = 0.50-0.75). The adapted Thai version of the SRS-22 questionnaire had validity and reliability, which can be used to assess the outcome of treatment among Thai-speaking patients with adolescent idiopathic scoliosis.
Puzzling With Online Games (BAM-COG): Reliability, Validity, and Feasibility of an Online Self-Monitor for Cognitive Performance in Aging Adults

PubMed Central

Baars, Maria A E; Olde Rikkert, Marcel G M; Kessels, Roy P C

2013-01-01

Background Online interventions are aiming increasingly at cognitive outcome measures but so far no easy and fast self-monitors for cognition have been validated or proven reliable and feasible. Objective This study examines a new instrument called the Brain Aging Monitor–Cognitive Assessment Battery (BAM-COG) for its alternate forms reliability, face and content validity, and convergent and divergent validity. Also, reference values are provided. Methods The BAM-COG consists of four easily accessible, short, yet challenging puzzle games that have been developed to measure working memory (“Conveyer Belt”), visuospatial short-term memory (“Sunshine”), episodic recognition memory (“Viewpoint”), and planning (“Papyrinth”). A total of 641 participants were recruited for this study. Of these, 397 adults, 40 years and older (mean 54.9, SD 9.6), were eligible for analysis. Study participants played all games three times with 14 days in between sets. Face and content validity were based on expert opinion. Alternate forms reliability (AFR) was measured by comparing scores on different versions of the BAM-COG and expressed with an intraclass correlation (ICC: two-way mixed; consistency at 95%). Convergent validity (CV) was provided by comparing BAM-COG scores to gold-standard paper-and-pencil and computer-assisted cognitive assessment. Divergent validity (DV) was measured by comparing BAM-COG scores to the National Adult Reading Test IQ (NART-IQ) estimate. Both CV and DV are expressed as Spearman rho correlation coefficients. Results Three out of four games showed adequate results on AFR, CV, and DV measures. The games Conveyer Belt, Sunshine, and Papyrinth have AFR ICCs of .420, .426, and .645 respectively. Also, these games had good to very good CV correlations: rho=.577 (P=.001), rho=.669 (P<.001), and rho=.400 (P=.04), respectively. Last, as expected, DV correlations were low: rho=−.029 (P=.44), rho=−.029 (P=.45), and rho=−.134 (P=.28) respectively. The game Viewpoint provided less desirable results with an AFR ICC of .167, CV rho=.202 (P=.15), and DV rho=−.162 (P=.21). Conclusions This study provides evidence for the use of the BAM-COG test battery as a feasible, reliable, and valid tool to monitor cognitive performance in healthy adults in an online setting. Three out of four games have good psychometric characteristics to measure working memory, visuospatial short-term memory, and planning capacity. PMID:24300212
Guidelines for Developing and Reporting Machine Learning Predictive Models in Biomedical Research: A Multidisciplinary View.

PubMed

Luo, Wei; Phung, Dinh; Tran, Truyen; Gupta, Sunil; Rana, Santu; Karmakar, Chandan; Shilton, Alistair; Yearwood, John; Dimitrova, Nevenka; Ho, Tu Bao; Venkatesh, Svetha; Berk, Michael

2016-12-16

As more and more researchers are turning to big data for new opportunities of biomedical discoveries, machine learning models, as the backbone of big data analysis, are mentioned more often in biomedical journals. However, owing to the inherent complexity of machine learning methods, they are prone to misuse. Because of the flexibility in specifying machine learning models, the results are often insufficiently reported in research articles, hindering reliable assessment of model validity and consistent interpretation of model outputs. To attain a set of guidelines on the use of machine learning predictive models within clinical settings to make sure the models are correctly applied and sufficiently reported so that true discoveries can be distinguished from random coincidence. A multidisciplinary panel of machine learning experts, clinicians, and traditional statisticians were interviewed, using an iterative process in accordance with the Delphi method. The process produced a set of guidelines that consists of (1) a list of reporting items to be included in a research article and (2) a set of practical sequential steps for developing predictive models. A set of guidelines was generated to enable correct application of machine learning models and consistent reporting of model specifications and results in biomedical research. We believe that such guidelines will accelerate the adoption of big data analysis, particularly with machine learning methods, in the biomedical research community. ©Wei Luo, Dinh Phung, Truyen Tran, Sunil Gupta, Santu Rana, Chandan Karmakar, Alistair Shilton, John Yearwood, Nevenka Dimitrova, Tu Bao Ho, Svetha Venkatesh, Michael Berk. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 16.12.2016.
Quality of life for parents of children with influenza-like illness: development and validation of Care-ILI-QoL.

PubMed

Chow, Maria Yui Kwan; Morrow, Angela; Heron, Leon; Yin, Jiehui Kevin; Booy, Robert; Leask, Julie

2014-04-01

Influenza-like illnesses (ILI) cause paediatric morbidity and affect the quality of life (QoL) of children and their parents. We have developed a disease-specific questionnaire (Care-ILI-QoL) to measure the QoL of caregivers of children with ILI. The drafting of the Care-ILI-QoL questionnaire was based on a systematic review, a quantitative survey, qualitative interviews with parents, and meetings with paediatricians. Children aged 6-48 months recruited from childcare centres in Sydney, Australia, were followed up during the 2011 influenza season. Care-ILI-QoL and SF-12v2 Acute Form were administered to the parent of a sick child 2 weeks after the onset of ILI, and again 2 weeks after the child had recovered. Exploratory factor analysis was conducted. Internal consistency, concurrent validity, discriminant validity, homogeneity of items, and responsiveness were tested. Out of the 125 children enrolled from 48 childcare centres, 55 children had ILI (total 75 ILI episodes). Care-ILI-QoL was reduced from 25 to 16 items covering four factors: Daily Activities, Perceived Support, Social Life, and Emotions (Cronbach's alphas 0.90, 0.92, 0.78, and 0.72, respectively). Care-ILI-QoL has satisfactory concurrent and discriminant validity, good internal consistency, and excellent responsiveness. Total QoL and factor scores correlated well with SF-12v2 scores. Total QoL scores were significantly lower in parents who perceived their child as very/extremely sick, sacrificed 10 hours or more in work or recreation in caring for the child, or whose child had two or more general practitioner visits. Total QoL and factor scores were significantly higher after the child had recovered than when the child had ILI. Care-ILI-QoL is the first ILI-specific QoL instrument for parents and is demonstrated to be valid and reliable in a developed country setting where the child is affected by ILI. It has the potential to be applied in clinical and research settings to assist measurement of disease burden, as a needs assessment tool for resources or to inform policy changes.
Validation of Online Versions of Tinnitus Questionnaires Translated into Swedish.

PubMed

Müller, Karolina; Edvall, Niklas K; Idrizbegovic, Esma; Huhn, Robert; Cima, Rilana; Persson, Viktor; Leineweber, Constanze; Westerlund, Hugo; Langguth, Berthold; Schlee, Winfried; Canlon, Barbara; Cederroth, Christopher R

2016-01-01

Background: Due to the lack of objective measures for assessing tinnitus, its clinical evaluation largely relies on the use of questionnaires and psychoacoustic tests. A global assessment of tinnitus burden would largely benefit from holistic approaches that not only incorporate measures of tinnitus but also take into account associated fears, emotional aspects (stress, anxiety, and depression), and quality of life. In Sweden, only a few instruments are available for assessing tinnitus, and the existing tools lack validation. Therefore, we translated a set of questionnaires into Swedish and evaluated their reliability and validity in a group of tinnitus subjects. Methods: We translated the English versions of the Tinnitus Functional Index (TFI), the Fear of Tinnitus Questionnaire (FTQ), the Tinnitus Catastrophizing Scale (TCS), the Perceived Stress Questionnaire (PSQ-30), and the Tinnitus Sample Case History Questionnaire (TSCHQ) into Swedish. These translations were delivered via the internet with the already existing Swedish versions of the Tinnitus Handicap Inventory (THI), the Hospital Anxiety and Depression Scale (HADS), the Hyperacusis Questionnaire (HQ), and the World Health Organization Quality of Life questionnaire (WHOQoL-BREF). Psychometric properties were evaluated by means of internal consistency [Cronbach's alpha (α)] and test-retest reliability across a 9-week interval [Intraclass Correlation Coefficient (ICC), Cohen's kappa] in order to establish construct as well as clinical validity using a sample of 260 subjects from a population-based cohort. Results: Internal consistency was acceptable for all questionnaires (α > 0.7) with the exception of the "social relationships" subscale of the WHOQoL-BREF. Test-retest reliability was generally acceptable (ICC > 0.70, Cohens kappa > 0.60) for the tinnitus-related questionnaires, except for the TFI "sense of control" subscale and 15 items of the TSCHQ. Spearmen rank correlations showed that almost all questionnaires on tinnitus are significantly related, indicating that these questionnaires measure different aspects of the same construct. The data supported good clinical validity of the tinnitus-related questionnaires. Conclusion: Our results suggest that most Swedish adaptations of the questionnaires are suitable for clinical and research settings and should facilitate the assessment of treatment outcomes using a more holistic approach by including measures of tinnitus fears, emotional burden, and quality of life.
Validation of Online Versions of Tinnitus Questionnaires Translated into Swedish

PubMed Central

Müller, Karolina; Edvall, Niklas K.; Idrizbegovic, Esma; Huhn, Robert; Cima, Rilana; Persson, Viktor; Leineweber, Constanze; Westerlund, Hugo; Langguth, Berthold; Schlee, Winfried; Canlon, Barbara; Cederroth, Christopher R.

2016-01-01

Background: Due to the lack of objective measures for assessing tinnitus, its clinical evaluation largely relies on the use of questionnaires and psychoacoustic tests. A global assessment of tinnitus burden would largely benefit from holistic approaches that not only incorporate measures of tinnitus but also take into account associated fears, emotional aspects (stress, anxiety, and depression), and quality of life. In Sweden, only a few instruments are available for assessing tinnitus, and the existing tools lack validation. Therefore, we translated a set of questionnaires into Swedish and evaluated their reliability and validity in a group of tinnitus subjects. Methods: We translated the English versions of the Tinnitus Functional Index (TFI), the Fear of Tinnitus Questionnaire (FTQ), the Tinnitus Catastrophizing Scale (TCS), the Perceived Stress Questionnaire (PSQ-30), and the Tinnitus Sample Case History Questionnaire (TSCHQ) into Swedish. These translations were delivered via the internet with the already existing Swedish versions of the Tinnitus Handicap Inventory (THI), the Hospital Anxiety and Depression Scale (HADS), the Hyperacusis Questionnaire (HQ), and the World Health Organization Quality of Life questionnaire (WHOQoL-BREF). Psychometric properties were evaluated by means of internal consistency [Cronbach's alpha (α)] and test–retest reliability across a 9-week interval [Intraclass Correlation Coefficient (ICC), Cohen's kappa] in order to establish construct as well as clinical validity using a sample of 260 subjects from a population-based cohort. Results: Internal consistency was acceptable for all questionnaires (α > 0.7) with the exception of the “social relationships” subscale of the WHOQoL-BREF. Test–retest reliability was generally acceptable (ICC > 0.70, Cohens kappa > 0.60) for the tinnitus-related questionnaires, except for the TFI “sense of control” subscale and 15 items of the TSCHQ. Spearmen rank correlations showed that almost all questionnaires on tinnitus are significantly related, indicating that these questionnaires measure different aspects of the same construct. The data supported good clinical validity of the tinnitus-related questionnaires. Conclusion: Our results suggest that most Swedish adaptations of the questionnaires are suitable for clinical and research settings and should facilitate the assessment of treatment outcomes using a more holistic approach by including measures of tinnitus fears, emotional burden, and quality of life. PMID:27920720
The Psychometric Properties and the Development of the Indicators of Quality Nursing Work Environments in Taiwan.

PubMed

Lin, Chiou-Fen; Lu, Meei-Shiow; Huang, Hsiu-Ying

2016-03-01

The nursing shortage in medical institutions in Taiwan averaged 9% in 2012, considerably higher than the 5% indicated in the literature. As a result, many hospitals have been forced to close wards or reduce beds. Despite the acute need, the percentage of registered nurses who are employed as nurses in Taiwan (60.4%) is considerably lower than those in Canada or the United States. This low rate may be because of the poor working environment for nurses in Taiwan. This study aimed to develop a set of nursing work environment quality indicators for Taiwan and to test the reliability and validity of the resulting survey tool. Multiple methods were used in this study. In Phase 1, we organized an expert panel, reviewed the literature, and conducted seven rounds of expert panel discussion and six focus group discussions with nursing directors. The goal was to draft indicators representing a quality nursing work environment to fit current conditions in Taiwan. In Phase 2, we conducted an expert review for content validity, held three public hearings, and conducted a survey. Four hundred twenty-seven questionnaires were sent out, with 381 returned. The goal was to test the content validity, construct validity, and internal consistency reliability. The study produced a set of indicators of a quality nursing work environment with eight dimensions and 65 items. The content validity index for importance and suitability dimensions were 1.0, whereas the internal consistency was 0.91. The eight dimensions were safe practice environment (16 items), quality and quantity of staff (four items), salary and welfare (seven items), professional specialization and teamwork (seven items), work simplification (five items), informatics (five items), career development (nine items), and support and caring (12 items). The overall load for the indicators was 77.57%. The developed indicators may be used to evaluate the quality of nursing work environments. Furthermore, the indicators may be used in hospital surveys to establish baseline conditions and for outcome research that measures improvement in nursing work environments after interventions.
The Child-care Food and Activity Practices Questionnaire (CFAPQ): development and first validation steps.

PubMed

Gubbels, Jessica S; Sleddens, Ester Fc; Raaijmakers, Lieke Ch; Gies, Judith M; Kremers, Stef Pj

2016-08-01

To develop and validate a questionnaire to measure food-related and activity-related practices of child-care staff, based on existing, validated parenting practices questionnaires. A selection of items from the Comprehensive Feeding Practices Questionnaire (CFPQ) and the Preschooler Physical Activity Parenting Practices (PPAPP) questionnaire was made to include items most suitable for the child-care setting. The converted questionnaire was pre-tested among child-care staff during cognitive interviews and pilot-tested among a larger sample of child-care staff. Factor analyses with Varimax rotation and internal consistencies were used to examine the scales. Spearman correlations, t tests and ANOVA were used to examine associations between the scales and staff's background characteristics (e.g. years of experience, gender). Child-care centres in the Netherlands. The qualitative pre-test included ten child-care staff members. The quantitative pilot test included 178 child-care staff members. The new questionnaire, the Child-care Food and Activity Practices Questionnaire (CFAPQ), consists of sixty-three items (forty food-related and twenty-three activity-related items), divided over twelve scales (seven food-related and five activity-related scales). The CFAPQ scales are to a large extent similar to the original CFPQ and PPAPP scales. The CFAPQ scales show sufficient internal consistency with Cronbach's α ranging between 0·53 and 0·96, and average corrected item-total correlations within acceptable ranges (0·30-0·89). Several of the scales were significantly associated with child-care staff's background characteristics. Scale psychometrics of the CFAPQ indicate it is a valid questionnaire that assesses child-care staff's practices related to both food and activities.
A microsensor array for quantification of lubricant contaminants using a back propagation artificial neural network

NASA Astrophysics Data System (ADS)

Zhu, Xiaoliang; Du, Li; Liu, Bendong; Zhe, Jiang

2016-06-01

We present a method based on an electrochemical sensor array and a back propagation artificial neural network for detection and quantification of four properties of lubrication oil, namely water (0, 500 ppm, 1000 ppm), total acid number (TAN) (13.1, 13.7, 14.4, 15.6 mg KOH g-1), soot (0, 1%, 2%, 3%) and sulfur content (1.3%, 1.37%, 1.44%, 1.51%). The sensor array, consisting of four micromachined electrochemical sensors, detects the four properties with overlapping sensitivities. A total set of 36 oil samples containing mixtures of water, soot, and sulfuric acid with different concentrations were prepared for testing. The sensor array’s responses were then divided to three sets: training sets (80% data), validation sets (10%) and testing sets (10%). Several back propagation artificial neural network architectures were trained with the training and validation sets; one architecture with four input neurons, 50 and 5 neurons in the first and second hidden layer, and four neurons in the output layer was selected. The selected neural network was then tested using the four sets of testing data (10%). Test results demonstrated that the developed artificial neural network is able to quantitatively determine the four lubrication properties (water, TAN, soot, and sulfur content) with a maximum prediction error of 18.8%, 6.0%, 6.7%, and 5.4%, respectively, indicting a good match between the target and predicted values. With the developed network, the sensor array could be potentially used for online lubricant oil condition monitoring.
Using the epigenetic field defect to detect prostate cancer in biopsy negative patients.

PubMed

Truong, Matthew; Yang, Bing; Livermore, Andrew; Wagner, Jennifer; Weeratunga, Puspha; Huang, Wei; Dhir, Rajiv; Nelson, Joel; Lin, Daniel W; Jarrard, David F

2013-06-01

We determined whether a novel combination of field defect DNA methylation markers could predict the presence of prostate cancer using histologically normal transrectal ultrasound guided biopsy cores. Methylation was assessed using quantitative Pyrosequencing® in a training set consisting of 65 nontumor and tumor associated prostate tissues from University of Wisconsin. A multiplex model was generated using multivariate logistic regression and externally validated in blinded fashion in a set of 47 nontumor and tumor associated biopsy specimens from University of Washington. We observed robust methylation differences in all genes at all CpGs assayed (p <0.0001). Regression models incorporating individual genes (EVX1, CAV1 and FGF1) and a gene combination (EVX1 and FGF1) discriminated nontumor from tumor associated tissues in the original training set (AUC 0.796-0.898, p <0.001). On external validation uniplex models incorporating EVX1, CAV1 or FGF1 discriminated tumor from nontumor associated biopsy negative specimens (AUC 0.702, 0.696 and 0.658, respectively, p <0.05). A multiplex model (EVX1 and FGF1) identified patients with prostate cancer (AUC 0.774, p = 0.001) and had a negative predictive value of 0.909. Comparison between 2 separate cores in patients in this validation set revealed similar methylation defects, indicating detection of a widespread field defect. A widespread epigenetic field defect can be used to detect prostate cancer in patients with histologically negative biopsies. To our knowledge this assay is unique, in that it detects alterations in nontumor cells. With further validation this marker combination (EVX1 and FGF1) has the potential to decrease the need for repeat prostate biopsies, a procedure associated with cost and complications. Copyright © 2013 American Urological Association Education and Research, Inc. Published by Elsevier Inc. All rights reserved.
Children and Young People-Mental Health Safety Assessment Tool (CYP-MH SAT) study: Protocol for the development and psychometric evaluation of an assessment tool to identify immediate risk of self-harm and suicide in children and young people (10–19 years) in acute paediatric hospital settings

PubMed Central

Walker, Gemma M; Carter, Tim; Aubeeluck, Aimee; Witchell, Miranda; Coad, Jane

2018-01-01

Introduction Currently, no standardised, evidence-based assessment tool for assessing immediate self-harm and suicide in acute paediatric inpatient settings exists. Aim The aim of this study is to develop and test the psychometric properties of an assessment tool that identifies immediate risk of self-harm and suicide in children and young people (10–19 years) in acute paediatric hospital settings. Methods and analysis Development phase: This phase involved a scoping review of the literature to identify and extract items from previously published suicide and self-harm risk assessment scales. Using a modified electronic Delphi approach, these items will then be rated according to their relevance for assessment of immediate suicide or self-harm risk by expert professionals. Inclusion of items will be determined by 65%–70% consensus between raters. Subsequently, a panel of expert members will convene to determine the face validity, appropriate phrasing, item order and response format for the finalised items. Psychometric testing phase: The finalised items will be tested for validity and reliability through a multicentre, psychometric evaluation. Psychometric testing will be undertaken to determine the following: internal consistency, inter-rater reliability, convergent, divergent validity and concurrent validity. Ethics and dissemination Ethical approval was provided by the National Health Service East Midlands—Derby Research Ethics Committee (17/EM/0347) and full governance clearance received by the Health Research Authority and local participating sites. Findings from this study will be disseminated to professionals and the public via peer-reviewed journal publications, popular social media and conference presentations. PMID:29654046
Validation of Patient-Reported Outcomes Measurement Information System Short Forms for Use in Childhood-Onset Systemic Lupus Erythematosus.

PubMed

Jones, Jordan T; Carle, Adam C; Wootton, Janet; Liberio, Brianna; Lee, Jiha; Schanberg, Laura E; Ying, Jun; Morgan DeWitt, Esi; Brunner, Hermine I

2017-01-01

To validate the pediatric Patient-Reported Outcomes Measurement Information System short forms (PROMIS-SFs) in childhood-onset systemic lupus erythematosus (SLE) in a clinical setting. At 3 study visits, childhood-onset SLE patients completed the PROMIS-SFs (anger, anxiety, depressive symptoms, fatigue, physical function-mobility, physical function-upper extremity, pain interference, and peer relationships) using the PROMIS assessment center, and health-related quality of life (HRQoL) legacy measures (Pediatric Quality of Life Inventory, Childhood Health Assessment Questionnaire, Simple Measure of Impact of Lupus Erythematosus in Youngsters [SMILEY], and visual analog scales [VAS] of pain and well-being). Physicians rated childhood-onset SLE activity on a VAS and completed the Systemic Lupus Erythematosus Disease Activity Index 2000. Using a global rating scale of change (GRC) between study visits, physicians rated change of childhood-onset SLE activity (GRC-MD1: better/same/worse) and change of patient overall health (GRC-MD2: better/same/worse). Questionnaire scores were compared in support of validity and responsiveness to change (external standards: GRC-MD1, GRC-MD2). In this population-based cohort (n = 100) with a mean age of 15.8 years (range 10-20 years), the PROMIS-SFs were completed in less than 5 minutes in a clinical setting. The PROMIS-SF scores correlated at least moderately (Pearson's r ≥ 0.5) with those of legacy HRQoL measures, except for the SMILEY. Measures of childhood-onset SLE activity did not correlate with the PROMIS-SFs. Responsiveness to change of the PROMIS-SFs was supported by path, mixed-model, and correlation analyses. To assess HRQoL in childhood-onset SLE, the PROMIS-SFs demonstrated feasibility, internal consistency, construct validity, and responsiveness to change in a clinical setting. © 2016, American College of Rheumatology.
The development and psychometrical evaluation of a set of instruments to evaluate the effectiveness of diabetes patient education.

PubMed

Duprez, Veerle; De Pover, Marleen; De Spiegelaere, Marc; Beeckman, Dimitri

2014-02-01

To develop a set of psychometrically sound instruments to assess knowledge, self-management and self-efficacy of diabetic patients. Furthermore, a survey to evaluate the satisfaction about diabetes education for patients was developed and tested. Treatment and secondary prevention of diabetes require a complex combination of care components. Patients' education has been accepted to improve diabetes knowledge, self-management and self-efficacy. Psychometrically sound instruments are needed to measure these patient-centred outcomes. Psychometric instrument validation. The first phase included a systematic literature review to develop the instruments. Content validity was evaluated using a two-round Delphi procedure involving diabetes experts. The content validity of the instruments was excellent. In a second phase, a convenience sample of 188 diabetic patients in two hospitals in one specific care region in Belgium participated in the psychometric evaluation. The criterion-related validity and internal consistency reliability were evaluated. The study produced a 21-item knowledge instrument, reflecting knowledge about 'glycemic control' and 'medico-social management aspects'. The self-management instrument included 32 statements, reflecting 'treatment and compliance' and 'general lifestyle'. The self-efficacy instrument included 30 items, reflecting 'nutrition', 'treatment' and 'regimen'. The patient satisfaction survey included 36 items, reflecting satisfaction about the relationship among the diabetes specialist, the diabetes educator, podiatrist and dietician. An instrument set with sound psychometric characteristics was developed to assess knowledge, self-management and self-efficacy of diabetic patients. Future studies should focus on the association between the instrument outcomes and clinical patient outcomes. The current instrument can support the design of educational interventions and training programmes and reduce inconsistencies in the information that patients receive. Furthermore, the instruments can be used for benchmarking the quality of diabetic patient education. © 2013 Blackwell Publishing Ltd.
A Prognostic Model for One-year Mortality in Patients Requiring Prolonged Mechanical Ventilation

PubMed Central

Carson, Shannon S.; Garrett, Joanne; Hanson, Laura C.; Lanier, Joyce; Govert, Joe; Brake, Mary C.; Landucci, Dante L.; Cox, Christopher E.; Carey, Timothy S.

2009-01-01

Objective A measure that identifies patients who are at high risk of mortality after prolonged ventilation will help physicians communicate prognosis to patients or surrogate decision-makers. Our objective was to develop and validate a prognostic model for 1-year mortality in patients ventilated for 21 days or more. Design Prospective cohort study. Setting University-based tertiary care hospital Patients 300 consecutive medical, surgical, and trauma patients requiring mechanical ventilation for at least 21 days were prospectively enrolled. Measurements and Main Results Predictive variables were measured on day 21 of ventilation for the first 200 patients and entered into logistic regression models with 1-year and 3-month mortality as outcomes. Final models were validated using data from 100 subsequent patients. One-year mortality was 51% in the development set and 58% in the validation set. Independent predictors of mortality included requirement for vasopressors, hemodialysis, platelet count ≤150 ×109/L, and age ≥50. Areas under the ROC curve for the development model and validation model were 0.82 (se 0.03) and 0.82 (se 0.05) respectively. The model had sensitivity of 0.42 (se 0.12) and specificity of 0.99 (se 0.01) for identifying patients who had ≥90% risk of death at 1 year. Observed mortality was highly consistent with both 3- and 12-month predicted mortality. These four predictive variables can be used in a simple prognostic score that clearly identifies low risk patients (no risk factors, 15% mortality) and high risk patients (3 or 4 risk factors, 97% mortality). Conclusions Simple clinical variables measured on day 21 of mechanical ventilation can identify patients at highest and lowest risk of death from prolonged ventilation. PMID:18552692
Directed Design of Experiments for Validating Probability of Detection Capability of a Testing System

NASA Technical Reports Server (NTRS)

Generazio, Edward R. (Inventor)

2012-01-01

A method of validating a probability of detection (POD) testing system using directed design of experiments (DOE) includes recording an input data set of observed hit and miss or analog data for sample components as a function of size of a flaw in the components. The method also includes processing the input data set to generate an output data set having an optimal class width, assigning a case number to the output data set, and generating validation instructions based on the assigned case number. An apparatus includes a host machine for receiving the input data set from the testing system and an algorithm for executing DOE to validate the test system. The algorithm applies DOE to the input data set to determine a data set having an optimal class width, assigns a case number to that data set, and generates validation instructions based on the case number.

Cross-cultural validation and psychometric testing of the Norwegian version of the TeamSTEPPS® teamwork perceptions questionnaire.

PubMed

Ballangrud, Randi; Husebø, Sissel Eikeland; Hall-Lord, Marie Louise

2017-12-02

Teamwork is an integrated part of today's specialized and complex healthcare and essential to patient safety, and is considered as a core competency to improve twenty-first century healthcare. Teamwork measurements and evaluations show promising results to promote good team performance, and are recommended for identifying areas for improvement. The validated TeamSTEPPS® Teamwork Perception Questionnaire (T-TPQ) was found suitable for cross-cultural validation and testing in a Norwegian context. T-TPQ is a self-report survey that examines five dimensions of perception of teamwork within healthcare settings. The aim of the study was to translate and cross-validate the T-TPQ into Norwegian, and test the questionnaire for psychometric properties among healthcare personnel. The T-TPQ was translated and adapted to a Norwegian context according to a model of a back-translation process. A total of 247 healthcare personnel representing different professionals and hospital settings responded to the questionnaire. A confirmatory factor analysis was carried out to test the factor structure. Cronbach's alpha was used to establish internal consistency, and an Intraclass Correlation Coefficient was used to assess the test - retest reliability. A confirmatory factor analysis showed an acceptable fitting model (χ 2 (df) 969.46 (546), p < 0.001, Root Mean Square Error of Approximation (RMSEA) = 0.056, Tucker-Lewis Index (TLI) = 0.88, Comparative fit index (CFI) = 0.89, which indicates that each set of the items that was supposed to accompany each teamwork dimension clearly represents that specific construct. The Cronbach's alpha demonstrated acceptable values on the five subscales (0.786-0.844), and test-retest showed a reliability parameter, with Intraclass Correlation Coefficient scores from 0.672 to 0.852. The Norwegian version of T-TPQ was considered to be acceptable regarding the validity and reliability for measuring Norwegian individual healthcare personnel's perception of group level teamwork within their unit. However, it needs to be further tested, preferably in a larger sample and in different clinical settings.
Post-decision biases reveal a self-consistency principle in perceptual inference.

PubMed

Luu, Long; Stocker, Alan A

2018-05-15

Making a categorical judgment can systematically bias our subsequent perception of the world. We show that these biases are well explained by a self-consistent Bayesian observer whose perceptual inference process is causally conditioned on the preceding choice. We quantitatively validated the model and its key assumptions with a targeted set of three psychophysical experiments, focusing on a task sequence where subjects first had to make a categorical orientation judgment before estimating the actual orientation of a visual stimulus. Subjects exhibited a high degree of consistency between categorical judgment and estimate, which is difficult to reconcile with alternative models in the face of late, memory related noise. The observed bias patterns resemble the well-known changes in subjective preferences associated with cognitive dissonance, which suggests that the brain's inference processes may be governed by a universal self-consistency constraint that avoids entertaining 'dissonant' interpretations of the evidence. © 2018, Luu et al.
Data preparation and evaluation techniques for x-ray diffraction microscopy

DOE PAGES

Steinbrener, Jan; Nelson, Johanna; Huang, Xiaojing; ...

2010-01-01

The post-experiment processing of X-ray Diffraction Microscopy data is often time-consuming and difficult. This is mostly due to the fact that even if a preliminary result has been reconstructed, there is no definitive answer as to whether or not a better result with more consistently retrieved phases can still be obtained. In addition, we show here that the first step in data analysis, the assembly of two-dimensional diffraction patterns from a large set of raw diffraction data, is crucial to obtaining reconstructions of highest possible consistency. We have developed software that automates this process and results in consistently accurate diffractionmore » patterns. We have furthermore derived some criteria of validity for a tool commonly used to assess the consistency of reconstructions, the phase retrieval transfer function, and suggest a modified version that has improved utility for judging reconstruction quality.« less
Interplanetary medium data book

NASA Technical Reports Server (NTRS)

King, J. H.

1977-01-01

Unresolved questions on the physics of solar wind and its effects on magnetospheric processes and cosmic ray propagation were addressed with hourly averaged interplanetary plasma and magnetic field data. This composite data set is described with its content and extent, sources, limits of validity, and the mutual consistency studies and normalizations to which the input data were subjected. Hourly averaged parameters were presented in the form of digital listings and 27-day plots. The listings are contained in a separately bound appendix.
Combustion Integration Rack (CIR) Testing

NASA Image and Video Library

2015-02-18

Fluids and Combustion Facility (FCF), Combustion Integration Rack (CIR) during testing in the Structural Dynamics Laboratory (SDL). The Fluids and Combustion Facility (FCF) is a set of two International Space Station (ISS) research facilities designed to support physical and biological experiments in support of technology development and validation in space. The FCF consists of two modular, reconfigurable racks called the Combustion Integration Rack (CIR) and the Fluids Integration Rack (FIR). The CIR and FIR were developed at NASAʼs Glenn Research Center.
A Multi-Scale Structural Health Monitoring Approach for Damage Detection, Diagnosis and Prognosis in Aerospace Structures

DTIC Science & Technology

2012-01-20

ultrasonic Lamb waves to plastic strain and fatigue life. Theory was developed and validated to predict second harmonic generation for specific mode... Fatigue and damage generation and progression are processes consisting of a series of interrelated events that span large scales of space and time...strain and fatigue life A set of experiments were completed that worked to relate the acoustic nonlinearity measured with Lamb waves to both the
Meta-Analysis of Armed Service Vocational Aptitude Battery Composite Validity Data

DTIC Science & Technology

1988-01-01

before delving into this many-faceted topic. easurement of human abi lit i es has I cnp, been of interest to 2 scientists. Sir Francis Galton set up...those types of abilities gauged the level of an individual’s intellect (Anastasi, 1982). More refined measures of intelligence began principally with...Binet, who in 1905 developed an intelligence test based on what he felt were the essential components of intelligence . He believed intellect consists of
Flow process in combustors

NASA Technical Reports Server (NTRS)

Gouldin, F. C.

1982-01-01

Fluid mechanical effects on combustion processes in steady flow combustors, especially gas turbine combustors were investigated. Flow features of most interest were vorticity, especially swirl, and turbulence. Theoretical analyses, numerical calculations, and experiments were performed. The theoretical and numerical work focused on noncombusting flows, while the experimental work consisted of both reacting and nonreacting flow studies. An experimental data set, e.g., velocity, temperature and composition, was developed for a swirl flow combustor for use by combustion modelers for development and validation work.
Multicentre prospective validation of a urinary peptidome-based classifier for the diagnosis of type 2 diabetic nephropathy

PubMed Central

Siwy, Justyna; Schanstra, Joost P.; Argiles, Angel; Bakker, Stephan J.L.; Beige, Joachim; Boucek, Petr; Brand, Korbinian; Delles, Christian; Duranton, Flore; Fernandez-Fernandez, Beatriz; Jankowski, Marie-Luise; Al Khatib, Mohammad; Kunt, Thomas; Lajer, Maria; Lichtinghagen, Ralf; Lindhardt, Morten; Maahs, David M; Mischak, Harald; Mullen, William; Navis, Gerjan; Noutsou, Marina; Ortiz, Alberto; Persson, Frederik; Petrie, John R.; Roob, Johannes M.; Rossing, Peter; Ruggenenti, Piero; Rychlik, Ivan; Serra, Andreas L.; Snell-Bergeon, Janet; Spasovski, Goce; Stojceva-Taneva, Olivera; Trillini, Matias; von der Leyen, Heiko; Winklhofer-Roob, Brigitte M.; Zürbig, Petra; Jankowski, Joachim

2014-01-01

Background Diabetic nephropathy (DN) is one of the major late complications of diabetes. Treatment aimed at slowing down the progression of DN is available but methods for early and definitive detection of DN progression are currently lacking. The ‘Proteomic prediction and Renin angiotensin aldosterone system Inhibition prevention Of early diabetic nephRopathy In TYpe 2 diabetic patients with normoalbuminuria trial’ (PRIORITY) aims to evaluate the early detection of DN in patients with type 2 diabetes (T2D) using a urinary proteome-based classifier (CKD273). Methods In this ancillary study of the recently initiated PRIORITY trial we aimed to validate for the first time the CKD273 classifier in a multicentre (9 different institutions providing samples from 165 T2D patients) prospective setting. In addition we also investigated the influence of sample containers, age and gender on the CKD273 classifier. Results We observed a high consistency of the CKD273 classification scores across the different centres with areas under the curves ranging from 0.95 to 1.00. The classifier was independent of age (range tested 16–89 years) and gender. Furthermore, the use of different urine storage containers did not affect the classification scores. Analysis of the distribution of the individual peptides of the classifier over the nine different centres showed that fragments of blood-derived and extracellular matrix proteins were the most consistently found. Conclusion We provide for the first time validation of this urinary proteome-based classifier in a multicentre prospective setting and show the suitability of the CKD273 classifier to be used in the PRIORITY trial. PMID:24589724
Development of an instrument to measure patient perception of the quality of nursing care and related hospital services at the national hospital of sri lanka.

PubMed

Senarat, Upul; Gunawardena, Nalika S

2011-06-01

This study aimed to develop and validate an instrument to measure patient perception of quality of nursing care and related hospital services in a tertiary care setting. We compiled an instrument with 72 items that patients may perceive as quality of nursing care and related hospital services, following an extensive literature search, discussions with patients and care pro-I viders and a brainstorming session with an expert panel. A cross-sectional study was conducted at the National Hospital of Sri Lanka. A sample (n = 120) of patients stayed in general surgical or medical units responded to the interviewer administered instrument upon discharge. Item analysis and principal component factor analysis were performed to assess validity, and internal consistency was calculated to measure reliability. Of the 72 items, 18 had greater than 20% of responses as 'not relevant'. A further 11 items were eliminated since item-total correlations were less than .2. Factor analysis was performed on remaining 43 items which resulted in 36 items classifying into eight factors accounting for 71% of the variation. Factor loadings in the final solution after Varimax rotation were interpersonal aspects (.68-.85), efficiency (.62-.79), competency (.66-.68), comfort (.60-.84), physical environment (.65-.82), cleanliness (.81-.85), personalized information (.76-.83), and general instructions (.61-.78). The instrument had high Internal consistency (Cronbach's alpha = .91). We developed a comprehensive, reliable and valid, 36-item instrument that may be used to measure patient perception of quality of nursing care in tertiary care settings. Copyright © 2011 Korean Society of Nursing Science. Published by Elsevier B.V. All rights reserved.
Assessing Discriminative Performance at External Validation of Clinical Prediction Models

PubMed Central

Nieboer, Daan; van der Ploeg, Tjeerd; Steyerberg, Ewout W.

2016-01-01

Introduction External validation studies are essential to study the generalizability of prediction models. Recently a permutation test, focusing on discrimination as quantified by the c-statistic, was proposed to judge whether a prediction model is transportable to a new setting. We aimed to evaluate this test and compare it to previously proposed procedures to judge any changes in c-statistic from development to external validation setting. Methods We compared the use of the permutation test to the use of benchmark values of the c-statistic following from a previously proposed framework to judge transportability of a prediction model. In a simulation study we developed a prediction model with logistic regression on a development set and validated them in the validation set. We concentrated on two scenarios: 1) the case-mix was more heterogeneous and predictor effects were weaker in the validation set compared to the development set, and 2) the case-mix was less heterogeneous in the validation set and predictor effects were identical in the validation and development set. Furthermore we illustrated the methods in a case study using 15 datasets of patients suffering from traumatic brain injury. Results The permutation test indicated that the validation and development set were homogenous in scenario 1 (in almost all simulated samples) and heterogeneous in scenario 2 (in 17%-39% of simulated samples). Previously proposed benchmark values of the c-statistic and the standard deviation of the linear predictors correctly pointed at the more heterogeneous case-mix in scenario 1 and the less heterogeneous case-mix in scenario 2. Conclusion The recently proposed permutation test may provide misleading results when externally validating prediction models in the presence of case-mix differences between the development and validation population. To correctly interpret the c-statistic found at external validation it is crucial to disentangle case-mix differences from incorrect regression coefficients. PMID:26881753
Assessing Discriminative Performance at External Validation of Clinical Prediction Models.

PubMed

Nieboer, Daan; van der Ploeg, Tjeerd; Steyerberg, Ewout W

2016-01-01

External validation studies are essential to study the generalizability of prediction models. Recently a permutation test, focusing on discrimination as quantified by the c-statistic, was proposed to judge whether a prediction model is transportable to a new setting. We aimed to evaluate this test and compare it to previously proposed procedures to judge any changes in c-statistic from development to external validation setting. We compared the use of the permutation test to the use of benchmark values of the c-statistic following from a previously proposed framework to judge transportability of a prediction model. In a simulation study we developed a prediction model with logistic regression on a development set and validated them in the validation set. We concentrated on two scenarios: 1) the case-mix was more heterogeneous and predictor effects were weaker in the validation set compared to the development set, and 2) the case-mix was less heterogeneous in the validation set and predictor effects were identical in the validation and development set. Furthermore we illustrated the methods in a case study using 15 datasets of patients suffering from traumatic brain injury. The permutation test indicated that the validation and development set were homogenous in scenario 1 (in almost all simulated samples) and heterogeneous in scenario 2 (in 17%-39% of simulated samples). Previously proposed benchmark values of the c-statistic and the standard deviation of the linear predictors correctly pointed at the more heterogeneous case-mix in scenario 1 and the less heterogeneous case-mix in scenario 2. The recently proposed permutation test may provide misleading results when externally validating prediction models in the presence of case-mix differences between the development and validation population. To correctly interpret the c-statistic found at external validation it is crucial to disentangle case-mix differences from incorrect regression coefficients.
Shared decision making in Swedish community mental health services - an evaluation of three self-reporting instruments.

PubMed

Rosenberg, David; Schön, Ulla-Karin; Nyholm, Maria; Grim, Katarina; Svedberg, Petra

2017-04-01

Despite the potential impact of shared decision making on users satisfaction with care and quality in health care decisions, there is a lack of knowledge and skills regarding how to work with shared decision making among health care providers. The aim of this study was to evaluate the psychometric properties of three instruments that measure varied dimensions of shared decision making, based on self-reports by clients, in a Swedish community mental health context. The study sample consisted of 121 clients with experience of community mental health care, and involved in a wide range of decisions regarding both social support and treatment. The questionnaires were examined for face and content validity, internal consistency, test-retest reliability and construct validity. The instruments displayed good face and content validity, satisfactory internal consistency and a moderate to good level of stability in test-retest reliability with fair to moderate construct correlations, in a sample of clients with serious mental illness and experience of community mental health services in Sweden. The questionnaires are considered to be relevant to the decision making process, user-friendly and appropriate in a Swedish community mental health care context. They functioned well in settings where non-medical decisions, regarding social and support services, are the primary focus. The use of instruments that measure various dimensions of the self-reported experience of clients, can be a key factor in developing knowledge of how best to implement shared decision making in mental health services.
Development and Validation of a Measure of Maladaptive Social-Evaluative Beliefs Characteristic of Social Anxiety Disorder in Youth: The Report of Youth Social Cognitions (RYSC).

PubMed

Wong, Quincy J J; Certoma, Sarah P; McLellan, Lauren F; Halldorsson, Brynjar; Reyes, Natasha; Boulton, Kelsie; Hudson, Jennifer L; Rapee, Ronald M

2017-12-28

Recent research has started to examine the applicability of influential adult models of the maintenance of social anxiety disorder (SAD) to youth. This research is limited by the lack of psychometrically validated measures of underlying constructs that are developmentally appropriate for youth. One key construct in adult models of SAD is maladaptive social-evaluative beliefs. The current study aimed to develop and validate a measure of these beliefs in youth, known as the Report of Youth Social Cognitions (RYSC). The RYSC was developed with a clinical sample of youth with anxiety disorders (N = 180) and cross-validated in a community sample of youth (N = 305). In the clinical sample, the RYSC exhibited a 3-factor structure (negative evaluation, revealing self, and positive impression factors), good internal consistency, and construct validity. In the community sample, the 3-factor structure and the internal consistency of the RYSC were replicated, but the test of construct validity showed that the RYSC had similarly strong associations with social anxiety and depressed affect. The RYSC had good test-retest reliability overall, although the revealing self subscale showed lower temporal stability which improved when only older participants were considered (age ≥9 years). The RYSC in general was also shown to discriminate between youth with and without SAD although the revealing self subscale again performed suboptimally but improved when only older participants were considered. These findings provide psychometric support for the RYSC and justifies its use with youth in research and clinical settings requiring the assessment of maladaptive social-evaluative beliefs. (PsycINFO Database Record (c) 2017 APA, all rights reserved).
Measuring Advance Care Planning: Optimizing the Advance Care Planning Engagement Survey.

PubMed

Sudore, Rebecca L; Heyland, Daren K; Barnes, Deborah E; Howard, Michelle; Fassbender, Konrad; Robinson, Carole A; Boscardin, John; You, John J

2017-04-01

A validated 82-item Advance Care Planning (ACP) Engagement Survey measures a broad range of behaviors. However, concise surveys are needed. The objective of this study was to validate shorter versions of the survey. The survey included 57 process (e.g., readiness) and 25 action items (e.g., discussions). For item reduction, we systematically eliminated questions based on face validity, item nonresponse, redundancy, ceiling effects, and factor analysis. We assessed internal consistency (Cronbach's alpha) and construct validity with cross-sectional correlations and the ability of the progressively shorter survey versions to detect change one week after exposure to an ACP intervention (Pearson correlation coefficients). Five hundred one participants (four Canadian and three US sites) were included in item reduction (mean age 69 years [±10], 41% nonwhite). Because of high correlations between readiness and action items, all action items were removed. Because of high correlations and ceiling effects, two process items were removed. Successive factor analysis then created 55-, 34-, 15-, nine-, and four-item versions; 664 participants (from three US ACP clinical trials) were included in validity analysis (age 65 years [±8], 72% nonwhite, 34% Spanish speaking). Cronbach's alphas were high for all versions (four items 0.84-55 items 0.97). Compared with the original survey, cross-sectional correlations were high (four items 0.85; 55 items 0.97) as were delta correlations (four items 0.68; 55 items 0.93). Shorter versions of the ACP Engagement Survey are valid, internally consistent, and able to detect change across a broad range of ACP behaviors for English and Spanish speakers. Shorter ACP surveys can efficiently measure broad ACP behaviors in research and clinical settings. Published by Elsevier Inc.
Psychometric validation and reliability analysis of a Spanish version of the patient satisfaction with cancer-related care measure: a patient navigation research program study.

PubMed

Jean-Pierre, Pascal; Fiscella, Kevin; Winters, Paul C; Paskett, Electra; Wells, Kristen; Battaglia, Tracy

2012-09-01

Patient satisfaction (PS), a key measure of quality of cancer care, is a core study outcome of the multi-site National Cancer Institute-funded Patient Navigation Research Program. Despite large numbers of underserved monolingual Spanish speakers (MSS) residing in USA, there is no validated Spanish measure of PS that spans the whole spectrum of cancer-related care. The present study reports on the validation of the Patient Satisfaction with Cancer Care (PSCC) measure for Spanish (PSCC-Sp) speakers receiving diagnostic and therapeutic cancer-related care. Original PSCC items were professionally translated and back translated to ensure cultural appropriateness, meaningfulness, and equivalence. Then, the resulting 18-item PSCC-Sp measure was administered to 285 MSS. We evaluated latent structure and internal consistency of the PSCC-Sp using principal components analysis (PCA) and Cronbach coefficient alpha (α). We used correlation analyses to demonstrate divergence and convergence of the PSCC-Sp with a Spanish version of the Patient Satisfaction with Interpersonal Relationship with Navigator (PSN-I-Sp) measure and patients' demographics. The PCA revealed a coherent set of items that explicates 47% of the variance in PS. Reliability assessment demonstrated that the PSCC-Sp had high internal consistency (α = 0.92). The PSCC-Sp demonstrated good face validity and convergent and divergent validities as indicated by moderate correlations with the PSN-I-Sp (p = 0.003) and nonsignificant correlations with marital status and household income (all p(s) > 0.05). The PSCC-Sp is a valid and reliable measure of PS and should be tested in other MSS populations.
Cross-cultural adaptation and validation of the Physical Therapy Outpatient Satisfaction Survey in an Italian musculoskeletal population

PubMed Central

2013-01-01

Background Although patient satisfaction is a relevant outcome measure for health care providers, few satisfaction questionnaires have been generally available to physical therapists or have been validated in an Italian population for use in the outpatient setting. The aim of this study was to translate, culturally adapt, and validate the Italian version of the Physical Therapy Outpatient Satisfaction Survey (PTOPS). Methods The Italian version of the PTOPS (PTOPS-I) was developed through forward-backward translation, review, and field-testing a pre-final version. The reliability of the final questionnaire was measured by internal consistency and test-retest stability at 7 days. Factor analysis was also used to explore construct validity. Concurrent validity was measured by comparing PTOPS-I with a 5-point Likert-type scale measure assessing the Global Perceived Effect (GPE) of the treatment and with a Visual Analogue Scale (VAS). Results 354 outpatients completed the PTOPS-I, and 56 took the re-test. The internal consistency (Cronbach’s alpha) of the original domains (Enhancers, Detractors, Location, and Cost) was 0.758 for Enhancers, 0.847 for Detractors, 0.885 for Location, and 0.706 for Cost. The test-retest stability (Intra-class Correlation Coefficients) was 0.769 for Enhancers, 0.893 for Detractors, 0.862 for Location, and 0.862 for Cost. The factor analysis of the Italian version revealed a structure into four domains, named Depersonalization, Inaccessibility, Ambience, and Cost. Concurrent validity with GPE was significantly demonstrated for all domains except Inaccessibility. Irrelevant or non-significant correlations were observed with VAS. Conclusion The PTOPS-I showed good psychometric properties. Its use can be suggested for Italian-speaking outpatients who receive physical therapy. PMID:23560848
Development and psychometric testing of an abridged version of Dundee Ready Educational Environment Measure (DREEM).

PubMed

Jeyashree, Kathiresan; Shewade, Hemant Deepak; Kathirvel, Soundappan

2018-04-17

Dundee Ready Educational Environment Measure (DREEM) is a 50-item tool to assess the educational environment of medical institutions as perceived by the students. This cross-sectional study developed and validated an abridged version of the DREEM-50 with an aim to have a less resource-intensive (time, manpower), yet valid and reliable, version of DREEM-50 while also avoiding respondent fatigue. A methodology similar to that used in the development of WHO-BREF was adopted to develop the abridged version of DREEM. Medical students (n = 418) from a private teaching hospital in Madurai, India, were divided into two groups. Group I (n = 277) participated in the development of the abridged version. This was performed by domain-wise selection of items that had the highest item-total correlation. Group II (n = 141) participated in the testing of the abridged version for construct validity, internal consistency and test-retest reliability. Confirmatory factor analysis was performed to assess the construct validity of DREEM-12. The abridged version had 12 items (DREEM-12) spread over all five domains in DREEM-50. DREEM-12 explained 77.4% of the variance in DREEM-50 scores. Correlation between total scores of DREEM-50 and DREEM-12 was 0.88 (p < 0.001). Confirmatory factor analysis of DREEM-12 construct was statistically significant (LR test of model vs. saturated p = 0.0006). The internal consistency of DREEM-12 was 0.83. The test-retest reliability of DREEM-12 was 0.595, p < 0.001. DREEM-12 is a valid and reliable tool for use in educational research. Future research using DREEM-12 will establish its validity and reliability across different settings.
NASA Ocean Altimeter Pathfinder Project. Report 1; Data Processing Handbook

NASA Technical Reports Server (NTRS)

Koblinsky, C. J.; Beckley, Brian D.; Ray, Richard D.; Wang, Yan-Ming; Tsaoussi, Lucia; Brenner, Anita; Williamson, Ron

1998-01-01

The NOAA/NASA Pathfinder program was created by the Earth Observing System (EOS) Program Office to determine how satellite-based data sets can be processed and used to study global change. The data sets are designed to be long time-sedes data processed with stable calibration and community consensus algorithms to better assist the research community. The Ocean Altimeter Pathfinder Project involves the reprocessing of all altimeter observations with a consistent set of improved algorithms, based on the results from TOPEX/POSEIDON (T/P), into easy-to-use data sets for the oceanographic community for climate research. This report describes the processing schemes used to produce a consistent data set and two of the products derived f rom these data. Other reports have been produced that: a) describe the validation of these data sets against tide gauge measurements and b) evaluate the statistical properties of the data that are relevant to climate change. The use of satellite altimetry for earth observations was proposed in the early 1960s. The first successful space based radar altimeter experiment was flown on SkyLab in 1974. The first successful satellite radar altimeter was flown aboard the Geos-3 spacecraft between 1975 and 1978. While a useful data set was collected from this mission for geophysical studies, the noise in the radar measured and incomplete global coverage precluded ft from inclusion in the Ocean Altimeter Pathfinder program. This program initiated its analysis with the Seasat mission, which was the first satellite radar altimeter flown for oceanography.
Psychometric properties of the Sexual Adjustment Questionnaire (SAQ) in the Iranian population with spinal cord injury

PubMed Central

Merghati-Khoei, E; Maasoumi, R; Rahdari, F; Bayat, A; Hajmirzaei, S; Lotfi, S; Hajiaghababaei, M; Emami-Razavi, SH; Korte, JE; Atoof, F

2016-01-01

Study design This is a cross-sectional study Objectives The objective of this study was to examine the psychometric properties of the Sexual Adjustment Questionnaire (SAQ) for Iranian people with spinal cord injury Setting This study was conducted in the brain and Spinal Cord Injury Research Center, Tehran University of Medical Sciences, Tehran, Iran Methods We assessed the psychometric properties of the SAQ, with 200 participants (men = 146, women = 54) completing the scale. An evaluation of its test–retest reliability was performed over a 2-weeks period, on a subsample of 30 patients recruited from the overall group. Cronbach’s α-coefficient was computed for assessment of internal consistency reliability. In addition, content and face validity were examined by an expert committee. Construct validity was assessed by examining convergent and discriminant validity. Finally, exploratory factor analysis was used to extract the factor structure of the questionnaire. Results The Cronbach’s α and intraclass correlation coefficient were 0.77 and 0.72 retrospectively. With regard to construct validity, there was a significant (P = 0.009) negative correlation (r = − 0.28) between the SAQ score and age. Those with lower levels of educations scored significantly lower on the SAQ (P = 0.04). The exploratory factor analysis indicated a four-factor structure for the questionnaire, accounting for 68.9% of the observed variance. The expert committee approved the face and content validity of the developed measure. Conclusion The SAQ is a valid measure for assessing sexual adjustment in people with spinal cord injury. The evaluation of sexual well-being may be useful in clinical trials and practical settings. PMID:25917953

Patient Experience and Satisfaction with Inpatient Service: Development of Short Form Survey Instrument Measuring the Core Aspect of Inpatient Experience

PubMed Central

Wong, Eliza L. Y.; Coulter, Angela; Hewitson, Paul; Cheung, Annie W. L.; Yam, Carrie H. K.; Lui, Siu fai; Tam, Wilson W. S.; Yeoh, Eng-kiong

2015-01-01

Patient experience reflects quality of care from the patients’ perspective; therefore, patients’ experiences are important data in the evaluation of the quality of health services. The development of an abbreviated, reliable and valid instrument for measuring inpatients’ experience would reflect the key aspect of inpatient care from patients’ perspective as well as facilitate quality improvement by cultivating patient engagement and allow the trends in patient satisfaction and experience to be measured regularly. The study developed a short-form inpatient instrument and tested its ability to capture a core set of inpatients’ experiences. The Hong Kong Inpatient Experience Questionnaire (HKIEQ) was established in 2010; it is an adaptation of the General Inpatient Questionnaire of the Care Quality Commission created by the Picker Institute in United Kingdom. This study used a consensus conference and a cross-sectional validation survey to create and validate a short-form of the Hong Kong Inpatient Experience Questionnaire (SF-HKIEQ). The short-form, the SF-HKIEQ, consisted of 18 items derived from the HKIEQ. The 18 items mainly covered relational aspects of care under four dimensions of the patient’s journey: hospital staff, patient care and treatment, information on leaving the hospital, and overall impression. The SF-HKIEQ had a high degree of face validity, construct validity and internal reliability. The validated SF-HKIEQ reflects the relevant core aspects of inpatients’ experience in a hospital setting. It provides a quick reference tool for quality improvement purposes and a platform that allows both healthcare staff and patients to monitor the quality of hospital care over time. PMID:25860775
Time Domain Tool Validation Using ARES I-X Flight Data

NASA Technical Reports Server (NTRS)

Hough, Steven; Compton, James; Hannan, Mike; Brandon, Jay

2011-01-01

The ARES I-X vehicle was launched from NASA's Kennedy Space Center (KSC) on October 28, 2009 at approximately 11:30 EDT. ARES I-X was the first test flight for NASA s ARES I launch vehicle, and it was the first non-Shuttle launch vehicle designed and flown by NASA since Saturn. The ARES I-X had a 4-segment solid rocket booster (SRB) first stage and a dummy upper stage (US) to emulate the properties of the ARES I US. During ARES I-X pre-flight modeling and analysis, six (6) independent time domain simulation tools were developed and cross validated. Each tool represents an independent implementation of a common set of models and parameters in a different simulation framework and architecture. Post flight data and reconstructed models provide the means to validate a subset of the simulations against actual flight data and to assess the accuracy of pre-flight dispersion analysis. Post flight data consists of telemetered Operational Flight Instrumentation (OFI) data primarily focused on flight computer outputs and sensor measurements as well as Best Estimated Trajectory (BET) data that estimates vehicle state information from all available measurement sources. While pre-flight models were found to provide a reasonable prediction of the vehicle flight, reconstructed models were generated to better represent and simulate the ARES I-X flight. Post flight reconstructed models include: SRB propulsion model, thrust vector bias models, mass properties, base aerodynamics, and Meteorological Estimated Trajectory (wind and atmospheric data). The result of the effort is a set of independently developed, high fidelity, time-domain simulation tools that have been cross validated and validated against flight data. This paper presents the process and results of high fidelity aerospace modeling, simulation, analysis and tool validation in the time domain.
Electrostatics of cysteine residues in proteins: Parameterization and validation of a simple model

PubMed Central

Salsbury, Freddie R.; Poole, Leslie B.; Fetrow, Jacquelyn S.

2013-01-01

One of the most popular and simple models for the calculation of pKas from a protein structure is the semi-macroscopic electrostatic model MEAD. This model requires empirical parameters for each residue to calculate pKas. Analysis of current, widely used empirical parameters for cysteine residues showed that they did not reproduce expected cysteine pKas; thus, we set out to identify parameters consistent with the CHARMM27 force field that capture both the behavior of typical cysteines in proteins and the behavior of cysteines which have perturbed pKas. The new parameters were validated in three ways: (1) calculation across a large set of typical cysteines in proteins (where the calculations are expected to reproduce expected ensemble behavior); (2) calculation across a set of perturbed cysteines in proteins (where the calculations are expected to reproduce the shifted ensemble behavior); and (3) comparison to experimentally determined pKa values (where the calculation should reproduce the pKa within experimental error). Both the general behavior of cysteines in proteins and the perturbed pKa in some proteins can be predicted reasonably well using the newly determined empirical parameters within the MEAD model for protein electrostatics. This study provides the first general analysis of the electrostatics of cysteines in proteins, with specific attention paid to capturing both the behavior of typical cysteines in a protein and the behavior of cysteines whose pKa should be shifted, and validation of force field parameters for cysteine residues. PMID:22777874
Mapping health outcome measures from a stroke registry to EQ-5D weights

PubMed Central

2013-01-01

Purpose To map health outcome related variables from a national register, not part of any validated instrument, with EQ-5D weights among stroke patients. Methods We used two cross-sectional data sets including patient characteristics, outcome variables and EQ-5D weights from the national Swedish stroke register. Three regression techniques were used on the estimation set (n = 272): ordinary least squares (OLS), Tobit, and censored least absolute deviation (CLAD). The regression coefficients for “dressing“, “toileting“, “mobility”, “mood”, “general health” and “proxy-responders” were applied to the validation set (n = 272), and the performance was analysed with mean absolute error (MAE) and mean square error (MSE). Results The number of statistically significant coefficients varied by model, but all models generated consistent coefficients in terms of sign. Mean utility was underestimated in all models (least in OLS) and with lower variation (least in OLS) compared to the observed. The maximum attainable EQ-5D weight ranged from 0.90 (OLS) to 1.00 (Tobit and CLAD). Health states with utility weights <0.5 had greater errors than those with weights ≥0.5 (P < 0.01). Conclusion This study indicates that it is possible to map non-validated health outcome measures from a stroke register into preference-based utilities to study the development of stroke care over time, and to compare with other conditions in terms of utility. PMID:23496957
Review of TRMM/GPM Rainfall Algorithm Validation

NASA Technical Reports Server (NTRS)

Smith, Eric A.

2004-01-01

A review is presented concerning current progress on evaluation and validation of standard Tropical Rainfall Measuring Mission (TRMM) precipitation retrieval algorithms and the prospects for implementing an improved validation research program for the next generation Global Precipitation Measurement (GPM) Mission. All standard TRMM algorithms are physical in design, and are thus based on fundamental principles of microwave radiative transfer and its interaction with semi-detailed cloud microphysical constituents. They are evaluated for consistency and degree of equivalence with one another, as well as intercompared to radar-retrieved rainfall at TRMM's four main ground validation sites. Similarities and differences are interpreted in the context of the radiative and microphysical assumptions underpinning the algorithms. Results indicate that the current accuracies of the TRMM Version 6 algorithms are approximately 15% at zonal-averaged / monthly scales with precisions of approximately 25% for full resolution / instantaneous rain rate estimates (i.e., level 2 retrievals). Strengths and weaknesses of the TRMM validation approach are summarized. Because the dew of convergence of level 2 TRMM algorithms is being used as a guide for setting validation requirements for the GPM mission, it is important that the GPM algorithm validation program be improved to ensure concomitant improvement in the standard GPM retrieval algorithms. An overview of the GPM Mission's validation plan is provided including a description of a new type of physical validation model using an analytic 3-dimensional radiative transfer model.
Validation of the Neonatal Satisfaction Survey (NSS-8) in six Norwegian neonatal intensive care units: a quantitative cross-sectional study.

PubMed

Hagen, Inger Hilde; Svindseth, Marit Følsvik; Nesset, Erik; Orner, Roderick; Iversen, Valentina Cabral

2018-03-27

The experience of having their new-borns admitted to an intensive care unit (NICU) can be extremely distressing. Subsequent risk of post-incident-adjustment difficulties are increased for parents, siblings, and affected families. Patient and next of kin satisfaction surveys provide key indicators of quality in health care. Methodically constructed and validated survey tools are in short supply and parents' experiences of care in Neonatal Intensive Care Units is under-researched. This paper reports a validation of the Neonatal Satisfaction Survey (NSS-8) in six Norwegian NICUs. Parents' survey returns were collected using the Neonatal Satisfaction Survey (NSS-13). Data quality and psychometric properties were systematically assessed using exploratory factor analysis, tests of internal consistency, reliability, construct, convergent and discriminant validity. Each set of hospital returns were subjected to an apostasy analysis before an overall satisfaction rate was calculated. The survey sample of 568 parents represents 45% of total eligible population for the period of the study. Missing data accounted for 1,1% of all returns. Attrition analysis shows congruence between sample and total population. Exploratory factor analysis identified eight factors of concern to parents,"Care and Treatment", "Doctors", "Visits", "Information", "Facilities", "Parents' Anxiety", "Discharge" and "Sibling Visits". All factors showed satisfactory internal consistency, good reliability (Cronbach's alpha ranged from 0.70-0.94). For the whole scale of 51 items α 0.95. Convergent validity using Spearman's rank between the eight factors and question measuring overall satisfaction was significant on all factors. Discriminant validity was established for all factors. Overall satisfaction rates ranged from 86 to 90% while for each of the eight factors measures of satisfaction varied between 64 and 86%. The NSS-8 questionnaire is a valid and reliable scale for measuring parents' assessment of quality of care in NICU. Statistical analysis confirms the instrument's capacity to gauge parents' experiences of NICU. Further research is indicated to validate the survey questionnaire in other Nordic countries and beyond.
Quality indicators for pharmaceutical care: a comprehensive set with national scores for Dutch community pharmacies.

PubMed

Teichert, Martina; Schoenmakers, Tim; Kylstra, Nico; Mosk, Berend; Bouvy, Marcel L; van de Vaart, Frans; De Smet, Peter A G M; Wensing, Michel

2016-08-01

Background The quality of pharmaceutical care in community pharmacies in the Netherlands has been assessed annually since 2008. The initial set has been further developed with pharmacists and patient organizations, the healthcare inspectorate, the government and health insurance companies. The set over 2012 was the first set of quality indicators for community pharmacies which was validated and supported by all major stakeholders. The aims of this study were to describe the validated set of quality indicators for community pharmacies and to report their scores over 2012. In subanalyses the score development over 5 years was described for those indicators, that have been surveyed before and remained unchanged. Methods Community pharmacists in the Netherlands were invited in 2013 to provide information for the set of 2012. Quality indicators were mapped by categories relevant for pharmaceutical care and defined for structures, processes and dispensing outcomes. Scores for categorically-measured quality indicators were presented as the percentage of pharmacies reporting the presence of a quality aspect. For numerical quality indicators, the mean of all reported scores was expressed. In subanalyses for those indicators that had been questioned previously, scores were collected from earlier measurements for pharmacies providing their scores in 2012. Multilevel analysis was used to assess the consistency of scores within one pharmacy over time by the intra-class correlation coefficient (ICC). Results For the set in 2012, 1739 Dutch community pharmacies (88 % of the total) provided information for 66 quality indicators in 10 categories. Indicator scores on the presence of quality structures showed relatively high quality levels. Scores for processes and dispensing outcomes were lower. Subanalyses showed that overall indicators scores improved within pharmacies, but this development differed between pharmacies. Conclusions A set of validated quality indicators provided insight into the quality of pharmaceutical care in the Netherlands. The quality of pharmaceutical care improved over time. As of 2012 quality structures were present in at least 80 % of the community pharmacies. Variation in scores on care processes and outcomes between individual pharmacies and over time can initiate future research to better understand and facilitate quality improvement in community pharmacies.
Boredom proneness--the development and correlates of a new scale.

PubMed

Farmer, R; Sundberg, N D

1986-01-01

This article reports the development, validation, and correlates of a self-report measure of boredom proneness. The 28-item Boredom Proneness (BP) Scale demonstrates satisfactory levels of internal consistency (coefficient alpha = .79) and test-retest reliability (r = .83) over a 1-week interval. Evidence of validity for the BP is supported by correlations with other boredom measures and from a set of studies evaluating interest and attention in the classroom. Other hypothesized relationships with boredom were tested, with significant positive associations found with depression, hopelessness, perceived effort, loneliness, and amotivational orientation. Additional findings indicate boredom proneness to be negatively related to life satisfaction and autonomy orientation. The relationship of boredom to other affective states is discussed, and directions for future research are outlined.
Developing a primary care patient measure of safety (PC PMOS): a modified Delphi process and face validity testing.

PubMed

Hernan, Andrea L; Giles, Sally J; O'Hara, Jane K; Fuller, Jeffrey; Johnson, Julie K; Dunbar, James A

2016-04-01

Patients are a valuable source of information about ways to prevent harm in primary care and are in a unique position to provide feedback about the factors that contribute to safety incidents. Unlike in the hospital setting, there are currently no tools that allow the systematic capture of this information from patients. The aim of this study was to develop a quantitative primary care patient measure of safety (PC PMOS). A two-stage approach was undertaken to develop questionnaire domains and items. Stage 1 involved a modified Delphi process. An expert panel reached consensus on domains and items based on three sources of information (validated hospital PMOS, previous research conducted by our study team and literature on threats to patient safety). Stage 2 involved testing the face validity of the questionnaire developed during stage 1 with patients and primary care staff using the 'think aloud' method. Following this process, the questionnaire was revised accordingly. The PC PMOS was received positively by both patients and staff during face validity testing. Barriers to completion included the length, relevance and clarity of questions. The final PC PMOS consisted of 50 items across 15 domains. The contributory factors to safety incidents centred on communication, access to care, patient-related factors, organisation and care planning, task performance and information flow. This is the first tool specifically designed for primary care settings, which allows patients to provide feedback about factors contributing to potential safety incidents. The PC PMOS provides a way for primary care organisations to learn about safety from the patient perspective and make service improvements with the aim of reducing harm in this setting. Future research will explore the reliability and construct validity of the PC PMOS. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://www.bmj.com/company/products-services/rights-and-licensing/
Adaptation and validation of the Spanish version of the graded chronic pain scale.

PubMed

Ferrer-Peña, Raúl; Gil-Martínez, Alfonso; Pardo-Montero, Joaquín; Jiménez-Penick, Virginia; Gallego-Izquierdo, Tomás; La Touche, Roy

2016-01-01

To adapt the Graded Chronic Pain Scale for use in Primary care patients in Spain, and to assess its psychometric properties. Clinical measures observational study investigating the severity of chronic pain. The methodology included a process of translation and back-translation following the international guidelines. Study participants were 75 patients who experienced lower back pain for more than six months and were sent to Primary Care physiotherapy units. Internal consistency, construct validity, test-retest reliability, floor and ceiling effects, and answering capacity were analysed. The Spanish version of the Graded Chronic Pain Scale had a high internal consistency, with a Cronbach's alpha of 0.87 and intraclass correlation coefficient of 0.81. Regarding construct validity, it was identified that two factors explained 72.37% of the variance. Convergent validity showed a moderate positive correlation with the Visual Analogue Scale, the activity avoidance subscale of the Tampa Scale of Kinesophobia, the Pain Catastrophizing Scale, the Roland-Morris Low Back Pain and Disability Questionnaire, and the FearAvoidance Beliefs Questionnaire. A moderate negative correlation was identified with the Chronic Pain Self-Efficacy Scale. The mean time of questionnaire administration was 2minutes and 28seconds. The Spanish version of the Graded Chronic Pain Scale appears to be a valid, reliable, and useful tool for measuring chronic pain at an early stage in Primary Care settings in Spain. Copyright © 2015 Elsevier España, S.L.U. and Sociedad Española de Reumatología y Colegio Mexicano de Reumatología. All rights reserved.
Latent structure and reliability analysis of the measure of body apperception: cross-validation for head and neck cancer patients.

PubMed

Jean-Pierre, Pascal; Fundakowski, Christopher; Perez, Enrique; Jean-Pierre, Shadae E; Jean-Pierre, Ashley R; Melillo, Angelica B; Libby, Rachel; Sargi, Zoukaa

2013-02-01

Cancer and its treatments are associated with psychological distress that can negatively impact self-perception, psychosocial functioning, and quality of life. Patients with head and neck cancers (HNC) are particularly susceptible to psychological distress. This study involved a cross-validation of the Measure of Body Apperception (MBA) for HNC patients. One hundred and twenty-two English-fluent HNC patients between 20 and 88 years of age completed the MBA on a Likert scale ranging from "1 = disagree" to "4 = agree." We assessed the latent structure and internal consistency reliability of the MBA using Principal Components Analysis (PCA) and Cronbach's coefficient alpha (α), respectively. We determined convergent and divergent validities of the MBA using correlations with the Hospital Anxiety and Depression Scale (HADS), observer disfigurement rating, and patients' clinical and demographic variables. The PCA revealed a coherent set of items that explained 38 % of the variance. The Kaiser-Meyer-Olkin measure of sampling adequacy was 0.73 and the Bartlett's test of sphericity was statistically significant (χ (2) (28) = 253.64; p < 0.001), confirming the suitability of the data for dimension reduction analysis. The MBA had good internal consistency reliability (α = 0.77) and demonstrated adequate convergent and divergent validities based on statistically significant moderate correlations with the HADS (p < 0.01) and observer rating of disfigurement (p < 0.026) and nonstatistically significant correlations with patients' clinical and demographic variables: tumor location, age at diagnosis, and birth place (all p (s) > 0.05). The MBA is a valid and reliable screening measure of body apperception for HNC patients.
Translation, cross-cultural adaptation and validation of the Brazilian version of the Nonarthritic Hip Score.

PubMed

Del Castillo, Letícia Nunes Carreras; Leporace, Gustavo; Cardinot, Themis Moura; Levy, Roger Abramino; Oliveira, Liszt Palmeira de

2013-01-01

CONTEXT AND OBJECTIVE The Nonarthritic Hip Score (NAHS) is a clinical evaluation questionnaire that was developed in the English language to evaluate hip function in young and physically active patients. The aims of this study were to translate this questionnaire into the Brazilian Portuguese language, to adapt it to Brazilian culture and to validate it. DESIGN AND SETTING Cohort study conducted between 2008 and 2010, at Universidade do Estado do Rio de Janeiro (UERJ). METHODS Questions about physical activities and household chores were modified to better fit Brazilian culture. Reproducibility, internal consistency and validity (correlations with the Algofunctional Lequesne Index and the Western Ontario and McMaster Universities Arthritis Index [WOMAC]) were tested. The NAHS-Brazil, Lequesne and WOMAC questionnaires were applied to 64 young and physically active patients (mean age, 40.9 years; 31 women). RESULTS The intraclass correlation coefficient (which measures reproducibility) was 0.837 (P < 0.001). Bland-Altman plots revealed a mean error in the difference between the two measurements of 0.42. The internal consistency was confirmed through a Cronbach alpha of 0.944. The validity between NAHS-Brazil and Lequesne and between NAHS-Brazil and WOMAC showed high correlations, r = 0.7340 and r = 0.9073, respectively. NAHS-Brazil showed good validity with no floor or ceiling effects. CONCLUSION The NAHS was translated into the Brazilian Portuguese language and was cross-culturally adapted to Brazilian culture. It was shown to be a useful tool in clinical practice for assessing the quality of life of young and physically active patients with hip pain.
Cross-cultural application of the Korean version of Ureteral Stent Symptoms Questionnaire.

PubMed

Park, Jinsung; Shin, Dong Wook; You, Changhee; Chung, Kyung Jin; Han, Deok Hyun; Joshi, Hrishi B; Park, Hyung Keun

2012-11-01

We validated the Korean version of the Ureteral Stent Symptoms Questionnaire (USSQ) in patients with an indwelling ureteral stent. Linguistic validation of the original USSQ was performed through a standard process including translation, back translation, and pilot study. A total of 65 patients who underwent ureteroscopic surgery were asked to complete the Korean USSQ as well as EuroQOL (male and female), the International Prostate Symptom Score (male), and Urogenital Distress Inventory-6 (female). Patients were evaluated at weeks 1 and 2 after stent placement and at week 4 after removal. Sixty-four healthy subjects without a ureteral stent were also asked to complete the Korean USSQ once. The psychometric properties of the questionnaire were analyzed. Internal consistencies (Cronbach α coefficients: 0.73-0.83) and test-retest reliability (Spearman correlation coefficient: ≥0.6) were satisfactory for urinary symptom, body pain, general health, and work performance domains. Most USSQ domains showed moderate correlations with each other. Convergent validity determined by correlation between other instruments and corresponding USSQ domain was satisfactory. Sensitivity to change and discriminant validity were also good in most domains (P<0.01). Only a small proportion of the study population had an active sexual life, with the stent in situ, limiting its analysis. The Korean version of the USSQ is a reliable and valid instrument that can be self-administered by Korean patients with a ureteral stent in the clinical and research settings. Further clinical studies in the Korean settings would be useful to provide robust data on sensitivity to change.
Implementing the Science Assessment Standards: Developing and validating a set of laboratory assessment tasks in high school biology

NASA Astrophysics Data System (ADS)

Saha, Gouranga Chandra

Very often a number of factors, especially time, space and money, deter many science educators from using inquiry-based, hands-on, laboratory practical tasks as alternative assessment instruments in science. A shortage of valid inquiry-based laboratory tasks for high school biology has been cited. Driven by this need, this study addressed the following three research questions: (1) How can laboratory-based performance tasks be designed and developed that are doable by students for whom they are designed/written? (2) Do student responses to the laboratory-based performance tasks validly represent at least some of the intended process skills that new biology learning goals want students to acquire? (3) Are the laboratory-based performance tasks psychometrically consistent as individual tasks and as a set? To answer these questions, three tasks were used from the six biology tasks initially designed and developed by an iterative process of trial testing. Analyses of data from 224 students showed that performance-based laboratory tasks that are doable by all students require careful and iterative process of development. Although the students demonstrated more skill in performing than planning and reasoning, their performances at the item level were very poor for some items. Possible reasons for the poor performances have been discussed and suggestions on how to remediate the deficiencies have been made. Empirical evidences for validity and reliability of the instrument have been presented both from the classical and the modern validity criteria point of view. Limitations of the study have been identified. Finally implications of the study and directions for further research have been discussed.
Use of Attribute Driven Incremental Discretization and Logic Learning Machine to build a prognostic classifier for neuroblastoma patients.

PubMed

Cangelosi, Davide; Muselli, Marco; Parodi, Stefano; Blengio, Fabiola; Becherini, Pamela; Versteeg, Rogier; Conte, Massimo; Varesio, Luigi

2014-01-01

Cancer patient's outcome is written, in part, in the gene expression profile of the tumor. We previously identified a 62-probe sets signature (NB-hypo) to identify tissue hypoxia in neuroblastoma tumors and showed that NB-hypo stratified neuroblastoma patients in good and poor outcome 1. It was important to develop a prognostic classifier to cluster patients into risk groups benefiting of defined therapeutic approaches. Novel classification and data discretization approaches can be instrumental for the generation of accurate predictors and robust tools for clinical decision support. We explored the application to gene expression data of Rulex, a novel software suite including the Attribute Driven Incremental Discretization technique for transforming continuous variables into simplified discrete ones and the Logic Learning Machine model for intelligible rule generation. We applied Rulex components to the problem of predicting the outcome of neuroblastoma patients on the bases of 62 probe sets NB-hypo gene expression signature. The resulting classifier consisted in 9 rules utilizing mainly two conditions of the relative expression of 11 probe sets. These rules were very effective predictors, as shown in an independent validation set, demonstrating the validity of the LLM algorithm applied to microarray data and patients' classification. The LLM performed as efficiently as Prediction Analysis of Microarray and Support Vector Machine, and outperformed other learning algorithms such as C4.5. Rulex carried out a feature selection by selecting a new signature (NB-hypo-II) of 11 probe sets that turned out to be the most relevant in predicting outcome among the 62 of the NB-hypo signature. Rules are easily interpretable as they involve only few conditions. Our findings provided evidence that the application of Rulex to the expression values of NB-hypo signature created a set of accurate, high quality, consistent and interpretable rules for the prediction of neuroblastoma patients' outcome. We identified the Rulex weighted classification as a flexible tool that can support clinical decisions. For these reasons, we consider Rulex to be a useful tool for cancer classification from microarray gene expression data.
3D reconstruction from non-uniform point clouds via local hierarchical clustering

NASA Astrophysics Data System (ADS)

Yang, Jiaqi; Li, Ruibo; Xiao, Yang; Cao, Zhiguo

2017-07-01

Raw scanned 3D point clouds are usually irregularly distributed due to the essential shortcomings of laser sensors, which therefore poses a great challenge for high-quality 3D surface reconstruction. This paper tackles this problem by proposing a local hierarchical clustering (LHC) method to improve the consistency of point distribution. Specifically, LHC consists of two steps: 1) adaptive octree-based decomposition of 3D space, and 2) hierarchical clustering. The former aims at reducing the computational complexity and the latter transforms the non-uniform point set into uniform one. Experimental results on real-world scanned point clouds validate the effectiveness of our method from both qualitative and quantitative aspects.
Anomaly Detection for Beam Loss Maps in the Large Hadron Collider

NASA Astrophysics Data System (ADS)

Valentino, Gianluca; Bruce, Roderik; Redaelli, Stefano; Rossi, Roberto; Theodoropoulos, Panagiotis; Jaster-Merz, Sonja

2017-07-01

In the LHC, beam loss maps are used to validate collimator settings for cleaning and machine protection. This is done by monitoring the loss distribution in the ring during infrequent controlled loss map campaigns, as well as in standard operation. Due to the complexity of the system, consisting of more than 50 collimators per beam, it is difficult to identify small changes in the collimation hierarchy, which may be due to setting errors or beam orbit drifts with such methods. A technique based on Principal Component Analysis and Local Outlier Factor is presented to detect anomalies in the loss maps and therefore provide an automatic check of the collimation hierarchy.
Validation of the Economic and Health Outcomes Model of Type 2 Diabetes Mellitus (ECHO-T2DM).

PubMed

Willis, Michael; Johansen, Pierre; Nilsson, Andreas; Asseburg, Christian

2017-03-01

The Economic and Health Outcomes Model of Type 2 Diabetes Mellitus (ECHO-T2DM) was developed to address study questions pertaining to the cost-effectiveness of treatment alternatives in the care of patients with type 2 diabetes mellitus (T2DM). Naturally, the usefulness of a model is determined by the accuracy of its predictions. A previous version of ECHO-T2DM was validated against actual trial outcomes and the model predictions were generally accurate. However, there have been recent upgrades to the model, which modify model predictions and necessitate an update of the validation exercises. The objectives of this study were to extend the methods available for evaluating model validity, to conduct a formal model validation of ECHO-T2DM (version 2.3.0) in accordance with the principles espoused by the International Society for Pharmacoeconomics and Outcomes Research (ISPOR) and the Society for Medical Decision Making (SMDM), and secondarily to evaluate the relative accuracy of four sets of macrovascular risk equations included in ECHO-T2DM. We followed the ISPOR/SMDM guidelines on model validation, evaluating face validity, verification, cross-validation, and external validation. Model verification involved 297 'stress tests', in which specific model inputs were modified systematically to ascertain correct model implementation. Cross-validation consisted of a comparison between ECHO-T2DM predictions and those of the seminal National Institutes of Health model. In external validation, study characteristics were entered into ECHO-T2DM to replicate the clinical results of 12 studies (including 17 patient populations), and model predictions were compared to observed values using established statistical techniques as well as measures of average prediction error, separately for the four sets of macrovascular risk equations supported in ECHO-T2DM. Sub-group analyses were conducted for dependent vs. independent outcomes and for microvascular vs. macrovascular vs. mortality endpoints. All stress tests were passed. ECHO-T2DM replicated the National Institutes of Health cost-effectiveness application with numerically similar results. In external validation of ECHO-T2DM, model predictions agreed well with observed clinical outcomes. For all sets of macrovascular risk equations, the results were close to the intercept and slope coefficients corresponding to a perfect match, resulting in high R 2 and failure to reject concordance using an F test. The results were similar for sub-groups of dependent and independent validation, with some degree of under-prediction of macrovascular events. ECHO-T2DM continues to match health outcomes in clinical trials in T2DM, with prediction accuracy similar to other leading models of T2DM.
Investigating the different mechanisms of genotoxic and non-genotoxic carcinogens by a gene set analysis.

PubMed

Lee, Won Jun; Kim, Sang Cheol; Lee, Seul Ji; Lee, Jeongmi; Park, Jeong Hill; Yu, Kyung-Sang; Lim, Johan; Kwon, Sung Won

2014-01-01

Based on the process of carcinogenesis, carcinogens are classified as either genotoxic or non-genotoxic. In contrast to non-genotoxic carcinogens, many genotoxic carcinogens have been reported to cause tumor in carcinogenic bioassays in animals. Thus evaluating the genotoxicity potential of chemicals is important to discriminate genotoxic from non-genotoxic carcinogens for health care and pharmaceutical industry safety. Additionally, investigating the difference between the mechanisms of genotoxic and non-genotoxic carcinogens could provide the foundation for a mechanism-based classification for unknown compounds. In this study, we investigated the gene expression of HepG2 cells treated with genotoxic or non-genotoxic carcinogens and compared their mechanisms of action. To enhance our understanding of the differences in the mechanisms of genotoxic and non-genotoxic carcinogens, we implemented a gene set analysis using 12 compounds for the training set (12, 24, 48 h) and validated significant gene sets using 22 compounds for the test set (24, 48 h). For a direct biological translation, we conducted a gene set analysis using Globaltest and selected significant gene sets. To validate the results, training and test compounds were predicted by the significant gene sets using a prediction analysis for microarrays (PAM). Finally, we obtained 6 gene sets, including sets enriched for genes involved in the adherens junction, bladder cancer, p53 signaling pathway, pathways in cancer, peroxisome and RNA degradation. Among the 6 gene sets, the bladder cancer and p53 signaling pathway sets were significant at 12, 24 and 48 h. We also found that the DDB2, RRM2B and GADD45A, genes related to the repair and damage prevention of DNA, were consistently up-regulated for genotoxic carcinogens. Our results suggest that a gene set analysis could provide a robust tool in the investigation of the different mechanisms of genotoxic and non-genotoxic carcinogens and construct a more detailed understanding of the perturbation of significant pathways.
Investigating the Different Mechanisms of Genotoxic and Non-Genotoxic Carcinogens by a Gene Set Analysis

PubMed Central

Lee, Won Jun; Kim, Sang Cheol; Lee, Seul Ji; Lee, Jeongmi; Park, Jeong Hill; Yu, Kyung-Sang; Lim, Johan; Kwon, Sung Won

2014-01-01

Based on the process of carcinogenesis, carcinogens are classified as either genotoxic or non-genotoxic. In contrast to non-genotoxic carcinogens, many genotoxic carcinogens have been reported to cause tumor in carcinogenic bioassays in animals. Thus evaluating the genotoxicity potential of chemicals is important to discriminate genotoxic from non-genotoxic carcinogens for health care and pharmaceutical industry safety. Additionally, investigating the difference between the mechanisms of genotoxic and non-genotoxic carcinogens could provide the foundation for a mechanism-based classification for unknown compounds. In this study, we investigated the gene expression of HepG2 cells treated with genotoxic or non-genotoxic carcinogens and compared their mechanisms of action. To enhance our understanding of the differences in the mechanisms of genotoxic and non-genotoxic carcinogens, we implemented a gene set analysis using 12 compounds for the training set (12, 24, 48 h) and validated significant gene sets using 22 compounds for the test set (24, 48 h). For a direct biological translation, we conducted a gene set analysis using Globaltest and selected significant gene sets. To validate the results, training and test compounds were predicted by the significant gene sets using a prediction analysis for microarrays (PAM). Finally, we obtained 6 gene sets, including sets enriched for genes involved in the adherens junction, bladder cancer, p53 signaling pathway, pathways in cancer, peroxisome and RNA degradation. Among the 6 gene sets, the bladder cancer and p53 signaling pathway sets were significant at 12, 24 and 48 h. We also found that the DDB2, RRM2B and GADD45A, genes related to the repair and damage prevention of DNA, were consistently up-regulated for genotoxic carcinogens. Our results suggest that a gene set analysis could provide a robust tool in the investigation of the different mechanisms of genotoxic and non-genotoxic carcinogens and construct a more detailed understanding of the perturbation of significant pathways. PMID:24497971

Evaluation of a Serum Lung Cancer Biomarker Panel.

PubMed

Mazzone, Peter J; Wang, Xiao-Feng; Han, Xiaozhen; Choi, Humberto; Seeley, Meredith; Scherer, Richard; Doseeva, Victoria

2018-01-01

A panel of 3 serum proteins and 1 autoantibody has been developed to assist with the detection of lung cancer. We aimed to validate the accuracy of the biomarker panel in an independent test set and explore the impact of adding a fourth serum protein to the panel, as well as the impact of combining molecular and clinical variables. The training set of serum samples was purchased from commercially available biorepositories. The testing set was from a biorepository at the Cleveland Clinic. All lung cancer and control subjects were >50 years old and had smoked a minimum of 20 pack-years. A panel of biomarkers including CEA (carcinoembryonic antigen), CYFRA21-1 (cytokeratin-19 fragment 21-1), CA125 (carbohydrate antigen 125), HGF (hepatocyte growth factor), and NY-ESO-1 (New York esophageal cancer-1 antibody) was measured using immunoassay techniques. The multiple of the median method, multivariate logistic regression, and random forest modeling was used to analyze the results. The training set consisted of 604 patient samples (268 with lung cancer and 336 controls) and the testing set of 400 patient samples (155 with lung cancer and 245 controls). With a threshold established from the training set, the sensitivity and specificity of both the 4- and 5-biomarker panels on the testing set was 49% and 96%, respectively. Models built on the testing set using only clinical variables had an area under the receiver operating characteristic curve of 0.68, using the biomarker panel 0.81 and by combining clinical and biomarker variables 0.86. This study validates the accuracy of a panel of proteins and an autoantibody in a population relevant to lung cancer detection and suggests a benefit to combining clinical features with the biomarker results.
Evaluation of a Serum Lung Cancer Biomarker Panel

PubMed Central

Mazzone, Peter J; Wang, Xiao-Feng; Han, Xiaozhen; Choi, Humberto; Seeley, Meredith; Scherer, Richard; Doseeva, Victoria

2018-01-01

Background: A panel of 3 serum proteins and 1 autoantibody has been developed to assist with the detection of lung cancer. We aimed to validate the accuracy of the biomarker panel in an independent test set and explore the impact of adding a fourth serum protein to the panel, as well as the impact of combining molecular and clinical variables. Methods: The training set of serum samples was purchased from commercially available biorepositories. The testing set was from a biorepository at the Cleveland Clinic. All lung cancer and control subjects were >50 years old and had smoked a minimum of 20 pack-years. A panel of biomarkers including CEA (carcinoembryonic antigen), CYFRA21-1 (cytokeratin-19 fragment 21-1), CA125 (carbohydrate antigen 125), HGF (hepatocyte growth factor), and NY-ESO-1 (New York esophageal cancer-1 antibody) was measured using immunoassay techniques. The multiple of the median method, multivariate logistic regression, and random forest modeling was used to analyze the results. Results: The training set consisted of 604 patient samples (268 with lung cancer and 336 controls) and the testing set of 400 patient samples (155 with lung cancer and 245 controls). With a threshold established from the training set, the sensitivity and specificity of both the 4- and 5-biomarker panels on the testing set was 49% and 96%, respectively. Models built on the testing set using only clinical variables had an area under the receiver operating characteristic curve of 0.68, using the biomarker panel 0.81 and by combining clinical and biomarker variables 0.86. Conclusions: This study validates the accuracy of a panel of proteins and an autoantibody in a population relevant to lung cancer detection and suggests a benefit to combining clinical features with the biomarker results. PMID:29371783
Reconstruction of an 8-lead surface ECG from two subcutaneous ICD vectors.

PubMed

Wilson, David G; Cronbach, Peter L; Panfilo, D; Greenhut, Saul E; Stegemann, Berthold P; Morgan, John M

2017-06-01

Techniques exist which allow surface ECGs to be reconstructed from reduced lead sets. We aimed to reconstruct an 8-lead ECG from two independent S-ICD sensing electrodes vectors as proof of this principle. Participants with ICDs (N=61) underwent 3minute ECGs using a TMSi Porti7 multi-channel signal recorder (TMS international, The Netherlands) with electrodes in the standard S-ICD and 12-lead positions. Participants were randomised to either a training (N=31) or validation (N=30) group. The transformation used was a linear combination of the 2 independent S-ICD vectors to each of the 8 independent leads of the 12-lead ECG, with coefficients selected that minimized the root mean square error (RMSE) between recorded and derived ECGs when applied to the training group. The transformation was then applied to the validation group and agreement between the recorded and derived lead pairs was measured by Pearson correlation coefficient (r) and normalised RMSE (NRMSE). In total, 27 patients with complete data sets were included in the validation set consisting of 57,888 data points from 216 full lead sets. The distribution of the r and NRMSE were skewed. Mean r=0.770 (SE 0.024), median r=0.925. NRMSE mean=0.233 (SE 0.015) median=0.171. We have demonstrated that the reconstruction of an 8-lead ECG from two S-ICD vectors is possible. If perfected, the ability to generate accurate multi-lead surface ECG data from an S-ICD would potentially allow recording and review of clinical arrhythmias at follow-up. Copyright © 2017 Elsevier B.V. All rights reserved.
Psychometric properties of the Japanese version of the Social Phobia Inventory.

PubMed

Nagata, Toshihiko; Nakajima, Takenori; Teo, Alan R; Yamada, Hisashi; Yoshimura, Chiho

2013-04-01

The aim of the current study was to study the psychometric properties of the Japanese version of the Social Phobia Inventory (SPIN-J) among Japanese subjects with social anxiety disorder (SAD). The sample consisted of 86 subjects with SAD and 86 controls. Diagnosis was based on a modified version of the Structured Clinical Interview for the DSM-IV. In addition to the SPIN-J, clinician-administered and self-rating scales, including the Japanese versions of the Liebowitz Social Anxiety Scale, the Social Phobia Scale, and the Social Interaction Anxiety Scale, were used. The SPIN-J showed adequate internal consistency (0.82-0.96) for the total and subscales. Correlations between the SPIN-J and the Liebowitz Social Anxiety Scale, the Social Phobia Scale, and the Social Interaction Anxiety Scale ranged from 0.83 to 0.89 and indicated adequate concurrent validity. A cut-off point of 22 between subjects with SAD and controls showed a sensitivity of 96.5% and specificity of 87.2%, indicating robust discriminant validity. The SPIN-J showed adequate reliability and validity for use as a screening tool for social anxiety disorder in Japanese clinical settings. © 2013 The Authors. Psychiatry and Clinical Neurosciences © 2013 Japanese Society of Psychiatry and Neurology.
The development and validation of a Real Time Location System to reliably monitor everyday activities in natural contexts.

PubMed

Judah, Gaby; de Witt Huberts, Jessie; Drassal, Allan; Aunger, Robert

2017-01-01

The accurate measurement of behaviour is vitally important to many disciplines and practitioners of various kinds. While different methods have been used (such as observation, diaries, questionnaire), none are able to accurately monitor behaviour over the long term in the natural context of people's own lives. The aim of this work was therefore to develop and test a reliable system for unobtrusively monitoring various behaviours of multiple individuals within the same household over a period of several months. A commercial Real Time Location System was adapted to meet these requirements and subsequently validated in three households by monitoring various bathroom behaviours. The results indicate that the system is robust, can monitor behaviours over the long-term in different households and can reliably distinguish between individuals. Precision rates were high and consistent. Recall rates were less consistent across households and behaviours, although recall rates improved considerably with practice at set-up of the system. The achieved precision and recall rates were comparable to the rates observed in more controlled environments using more valid methods of ground truthing. These initial findings indicate that the system is a valuable, flexible and robust system for monitoring behaviour in its natural environment that would allow new research questions to be addressed.
Multi-laboratory validation study of multilocus variable-number tandem repeat analysis (MLVA) for Salmonella enterica serovar Enteritidis, 2015

PubMed Central

Peters, Tansy; Bertrand, Sophie; Björkman, Jonas T; Brandal, Lin T; Brown, Derek J; Erdõsi, Tímea; Heck, Max; Ibrahem, Salha; Johansson, Karin; Kornschober, Christian; Kotila, Saara M; Le Hello, Simon; Lienemann, Taru; Mattheus, Wesley; Nielsen, Eva Møller; Ragimbeau, Catherine; Rumore, Jillian; Sabol, Ashley; Torpdahl, Mia; Trees, Eija; Tuohy, Alma; de Pinna, Elizabeth

2017-01-01

Multilocus variable-number tandem repeat analysis (MLVA) is a rapid and reproducible typing method that is an important tool for investigation, as well as detection, of national and multinational outbreaks of a range of food-borne pathogens. Salmonella enterica serovar Enteritidis is the most common Salmonella serovar associated with human salmonellosis in the European Union/European Economic Area and North America. Fourteen laboratories from 13 countries in Europe and North America participated in a validation study for MLVA of S. Enteritidis targeting five loci. Following normalisation of fragment sizes using a set of reference strains, a blinded set of 24 strains with known allele sizes was analysed by each participant. The S. Enteritidis 5-loci MLVA protocol was shown to produce internationally comparable results as more than 90% of the participants reported less than 5% discrepant MLVA profiles. All 14 participating laboratories performed well, even those where experience with this typing method was limited. The raw fragment length data were consistent throughout, and the inter-laboratory validation helped to standardise the conversion of raw data to repeat numbers with at least two countries updating their internal procedures. However, differences in assigned MLVA profiles remain between well-established protocols and should be taken into account when exchanging data. PMID:28277220
Functional assignment of solute-binding proteins of ABC transporters using a fluorescence-based thermal shift assay.

DOE Office of Scientific and Technical Information (OSTI.GOV)

Giulliani, S. E.; Frank, A. E.; Collart, F. R.

2008-12-08

We have used a fluorescence-based thermal shift (FTS) assay to identify amino acids that bind to solute-binding proteins in the bacterial ABC transporter family. The assay was validated with a set of six proteins with known binding specificity and was consistently able to map proteins with their known binding ligands. The assay also identified additional candidate binding ligands for several of the amino acid-binding proteins in the validation set. We extended this approach to additional targets and demonstrated the ability of the FTS assay to unambiguously identify preferential binding for several homologues of amino acid-binding proteins with known specificity andmore » to functionally annotate proteins of unknown binding specificity. The assay is implemented in a microwell plate format and provides a rapid approach to validate an anticipated function or to screen proteins of unknown function. The ABC-type transporter family is ubiquitous and transports a variety of biological compounds, but the current annotation of the ligand-binding proteins is limited to mostly generic descriptions of function. The results illustrate the feasibility of the FTS assay to improve the functional annotation of binding proteins associated with ABC-type transporters and suggest this approach that can also be extended to other protein families.« less
Integrated cosmological probes: concordance quantified

DOE Office of Scientific and Technical Information (OSTI.GOV)

Nicola, Andrina; Amara, Adam; Refregier, Alexandre, E-mail: andrina.nicola@phys.ethz.ch, E-mail: adam.amara@phys.ethz.ch, E-mail: alexandre.refregier@phys.ethz.ch

2017-10-01

Assessing the consistency of parameter constraints derived from different cosmological probes is an important way to test the validity of the underlying cosmological model. In an earlier work [1], we computed constraints on cosmological parameters for ΛCDM from an integrated analysis of CMB temperature anisotropies and CMB lensing from Planck, galaxy clustering and weak lensing from SDSS, weak lensing from DES SV as well as Type Ia supernovae and Hubble parameter measurements. In this work, we extend this analysis and quantify the concordance between the derived constraints and those derived by the Planck Collaboration as well as WMAP9, SPT andmore » ACT. As a measure for consistency, we use the Surprise statistic [2], which is based on the relative entropy. In the framework of a flat ΛCDM cosmological model, we find all data sets to be consistent with one another at a level of less than 1σ. We highlight that the relative entropy is sensitive to inconsistencies in the models that are used in different parts of the analysis. In particular, inconsistent assumptions for the neutrino mass break its invariance on the parameter choice. When consistent model assumptions are used, the data sets considered in this work all agree with each other and ΛCDM, without evidence for tensions.« less
Dark Energy Survey Year 1 Results: The Photometric Data Set for Cosmology

DOE Office of Scientific and Technical Information (OSTI.GOV)

Drlica-Wagner, A.; Sevilla-Noarbe, I.; Rykoff, E. S.

In this paper, we describe the creation, content, and validation of the Dark Energy Survey (DES) internal year-one cosmology data set, Y1A1 GOLD, in support of upcoming cosmological analyses. The Y1A1 GOLD data set is assembled from multiple epochs of DES imaging and consists of calibrated photometric zero-points, object catalogs, and ancillary data products—e.g., maps of survey depth and observing conditions, star–galaxy classification, and photometric redshift estimates—that are necessary for accurate cosmological analyses. The Y1A1 GOLD wide-area object catalog consists ofmore » $$\\sim 137$$ million objects detected in co-added images covering $$\\sim 1800\\,{\\deg }^{2}$$ in the DES grizY filters. The 10σ limiting magnitude for galaxies is $g=23.4$, $r=23.2$, $i=22.5$, $z=21.8$, and $Y=20.1$. Photometric calibration of Y1A1 GOLD was performed by combining nightly zero-point solutions with stellar locus regression, and the absolute calibration accuracy is better than 2% over the survey area. Finally, DES Y1A1 GOLD is the largest photometric data set at the achieved depth to date, enabling precise measurements of cosmic acceleration at z ≲ 1.« less
The consecutive dry days to trigger rainfall over West Africa

NASA Astrophysics Data System (ADS)

Lee, J. H.

2018-01-01

In order to resolve contradictions in addressing a soil moisture-precipitation feedback mechanism over West Africa and to clarify the impact of antecedent soil moisture on subsequent rainfall evolution, we first validated various data sets (SMOS satellite soil moisture observations, NOAH land surface model, TRMM rainfall, CMORPH rainfall and HadGEM climate models) with the Analyses Multidisciplinaires de la Mousson Africaine (AMMA) field campaign data. Based on this analysis, it was suggested that biases of data sets might cause contradictions in studying mechanisms. Thus, by taking into account uncertainties in data, it was found that the approach of consecutive dry days (i.e. a relative comparison of time-series) showed consistency across various data sets, while the direct comparison approach for soil moisture state and rainfall did not. Thus, it was discussed that it may be difficult to directly relate rain with soil moisture as the absolute value, however, it may be reasonable to compare a temporal progress of the variables. Based upon the results consistently showing a positive relationship between the consecutive dry days and rainfall, this study supports a negative feedback often neglected by climate model structure. This approach is less sensitive to interpretation errors arising from systematic errors in data sets, as this measures a temporal gradient of soil moisture state.
Dark Energy Survey Year 1 Results: The Photometric Data Set for Cosmology

DOE PAGES

Drlica-Wagner, A.; Sevilla-Noarbe, I.; Rykoff, E. S.; ...

2018-04-03

In this paper, we describe the creation, content, and validation of the Dark Energy Survey (DES) internal year-one cosmology data set, Y1A1 GOLD, in support of upcoming cosmological analyses. The Y1A1 GOLD data set is assembled from multiple epochs of DES imaging and consists of calibrated photometric zero-points, object catalogs, and ancillary data products—e.g., maps of survey depth and observing conditions, star–galaxy classification, and photometric redshift estimates—that are necessary for accurate cosmological analyses. The Y1A1 GOLD wide-area object catalog consists ofmore » $$\\sim 137$$ million objects detected in co-added images covering $$\\sim 1800\\,{\\deg }^{2}$$ in the DES grizY filters. The 10σ limiting magnitude for galaxies is $g=23.4$, $r=23.2$, $i=22.5$, $z=21.8$, and $Y=20.1$. Photometric calibration of Y1A1 GOLD was performed by combining nightly zero-point solutions with stellar locus regression, and the absolute calibration accuracy is better than 2% over the survey area. Finally, DES Y1A1 GOLD is the largest photometric data set at the achieved depth to date, enabling precise measurements of cosmic acceleration at z ≲ 1.« less
Multiple Versus Single Set Validation of Multivariate Models to Avoid Mistakes.

PubMed

Harrington, Peter de Boves

2018-01-02

Validation of multivariate models is of current importance for a wide range of chemical applications. Although important, it is neglected. The common practice is to use a single external validation set for evaluation. This approach is deficient and may mislead investigators with results that are specific to the single validation set of data. In addition, no statistics are available regarding the precision of a derived figure of merit (FOM). A statistical approach using bootstrapped Latin partitions is advocated. This validation method makes an efficient use of the data because each object is used once for validation. It was reviewed a decade earlier but primarily for the optimization of chemometric models this review presents the reasons it should be used for generalized statistical validation. Average FOMs with confidence intervals are reported and powerful, matched-sample statistics may be applied for comparing models and methods. Examples demonstrate the problems with single validation sets.
Assessing the accuracy and stability of variable selection ...

EPA Pesticide Factsheets

Random forest (RF) modeling has emerged as an important statistical learning method in ecology due to its exceptional predictive performance. However, for large and complex ecological datasets there is limited guidance on variable selection methods for RF modeling. Typically, either a preselected set of predictor variables are used, or stepwise procedures are employed which iteratively add/remove variables according to their importance measures. This paper investigates the application of variable selection methods to RF models for predicting probable biological stream condition. Our motivating dataset consists of the good/poor condition of n=1365 stream survey sites from the 2008/2009 National Rivers and Stream Assessment, and a large set (p=212) of landscape features from the StreamCat dataset. Two types of RF models are compared: a full variable set model with all 212 predictors, and a reduced variable set model selected using a backwards elimination approach. We assess model accuracy using RF's internal out-of-bag estimate, and a cross-validation procedure with validation folds external to the variable selection process. We also assess the stability of the spatial predictions generated by the RF models to changes in the number of predictors, and argue that model selection needs to consider both accuracy and stability. The results suggest that RF modeling is robust to the inclusion of many variables of moderate to low importance. We found no substanti
Analysis of NASA Common Research Model Dynamic Data

NASA Technical Reports Server (NTRS)

Balakrishna, S.; Acheson, Michael J.

2011-01-01

Recent NASA Common Research Model (CRM) tests at the Langley National Transonic Facility (NTF) and Ames 11-foot Transonic Wind Tunnel (11-foot TWT) have generated an experimental database for CFD code validation. The database consists of force and moment, surface pressures and wideband wing-root dynamic strain/wing Kulite data from continuous sweep pitch polars. The dynamic data sets, acquired at 12,800 Hz sampling rate, are analyzed in this study to evaluate CRM wing buffet onset and potential CRM wing flow separation.
Development and validation of PSPSQ 2.0 measuring patient satisfaction with pharmacist services.

PubMed

Sakharkar, Prashant; Bounthavong, Mark; Hirsch, Jan D; Morello, Candis M; Chen, Timothy C; Law, Anandi V

2015-01-01

The extant literature reveals a lack of psychometrically validated tools measuring patient satisfaction with pharmacist clinical services. The Patient Satisfaction with Pharmacist Services Questionnaire (PSPSQ 2.0) was developed to address this need using a mixed methods approach. To assess the psychometric properties of the PSPSQ 2.0, an instrument developed to measure patient satisfaction with clinical services provided by pharmacists. Validation studies were conducted in two Veterans Affairs (VA)-based and two community-based (diabetes and psychiatric care) disease management/medication therapy management clinics. The PSPSQ 2.0 consisted of 22-items related to three domains identified as quality of care, patient-pharmacist relationship and overall satisfaction using a 4-point, Likert-type scale. It was administered to participants following their session with a pharmacist at the clinics. Collected data were analyzed for descriptive statistics, internal consistency, and validity using exploratory factor analysis. A total of 149 patients completed the survey. Patients from VA clinics were on average 61 years old, mostly white (63%), and predominantly male (95%). Patients from non-VA clinics were on average 47 years old, mostly White (47%) and male (53%). Non-VA patients mostly had Medicaid (42%) and commercial health insurance (31%), whereas VA patients retained benefits with the US Department of Veterans Affairs. Reliability of the scale using internal consistency metrics revealed a Cronbach's alpha of 0.98, 0.98 and 0.95 for VA, diabetes, and psychiatric care clinics, respectively, whereas the Cronbach's alpha for the pooled sample was 0.96. Factor analyses resulted in a three-factor solution accounting for 91% and 69% variance for diabetes and psychiatric care clinics, respectively; however, VA clinics and pooled sample yielded only 2-factor solution with 80% and 66% variance, respectively, with more items loading on patient-pharmacist relationship domain. The results suggest that the PSPSQ 2.0 can serve as a reliable and valid tool for measuring patient satisfaction with pharmacists providing clinical services in VA- and non-VA settings upon further validation. Copyright © 2015 Elsevier Inc. All rights reserved.
Assessment of tautomer distribution using the condensed reaction graph approach

NASA Astrophysics Data System (ADS)

Gimadiev, T. R.; Madzhidov, T. I.; Nugmanov, R. I.; Baskin, I. I.; Antipin, I. S.; Varnek, A.

2018-03-01

We report the first direct QSPR modeling of equilibrium constants of tautomeric transformations (logK T ) in different solvents and at different temperatures, which do not require intermediate assessment of acidity (basicity) constants for all tautomeric forms. The key step of the modeling consisted in the merging of two tautomers in one sole molecular graph ("condensed reaction graph") which enables to compute molecular descriptors characterizing entire equilibrium. The support vector regression method was used to build the models. The training set consisted of 785 transformations belonging to 11 types of tautomeric reactions with equilibrium constants measured in different solvents and at different temperatures. The models obtained perform well both in cross-validation (Q2 = 0.81 RMSE = 0.7 logK T units) and on two external test sets. Benchmarking studies demonstrate that our models outperform results obtained with DFT B3LYP/6-311 ++ G(d,p) and ChemAxon Tautomerizer applicable only in water at room temperature.
Affective Pictures and the Open Library of Affective Foods (OLAF): Tools to Investigate Emotions toward Food in Adults.

PubMed

Miccoli, Laura; Delgado, Rafael; Guerra, Pedro; Versace, Francesco; Rodríguez-Ruiz, Sonia; Fernández-Santaella, M Carmen

2016-01-01

Recently, several sets of standardized food pictures have been created, supplying both food images and their subjective evaluations. However, to date only the OLAF (Open Library of Affective Foods), a set of food images and ratings we developed in adolescents, has the specific purpose of studying emotions toward food. Moreover, some researchers have argued that food evaluations are not valid across individuals and groups, unless feelings toward food cues are compared with feelings toward intense experiences unrelated to food, that serve as benchmarks. Therefore the OLAF presented here, comprising a set of original food images and a group of standardized highly emotional pictures, is intended to provide valid between-group judgments in adults. Emotional images (erotica, mutilations, and neutrals from the International Affective Picture System/IAPS) additionally ensure that the affective ratings are consistent with emotion research. The OLAF depicts high-calorie sweet and savory foods and low-calorie fruits and vegetables, portraying foods within natural scenes matching the IAPS features. An adult sample evaluated both food and affective pictures in terms of pleasure, arousal, dominance, and food craving, following standardized affective rating procedures. The affective ratings for the emotional pictures corroborated previous findings, thus confirming the reliability of evaluations for the food images. Among the OLAF images, high-calorie sweet and savory foods elicited the greatest pleasure, although they elicited, as expected, less arousal than erotica. The observed patterns were consistent with research on emotions and confirmed the reliability of OLAF evaluations. The OLAF and affective pictures constitute a sound methodology to investigate emotions toward food within a wider motivational framework. The OLAF is freely accessible at digibug.ugr.es.
Affective Pictures and the Open Library of Affective Foods (OLAF): Tools to Investigate Emotions toward Food in Adults

PubMed Central

Guerra, Pedro; Versace, Francesco; Rodríguez-Ruiz, Sonia; Fernández-Santaella, M. Carmen

2016-01-01

Recently, several sets of standardized food pictures have been created, supplying both food images and their subjective evaluations. However, to date only the OLAF (Open Library of Affective Foods), a set of food images and ratings we developed in adolescents, has the specific purpose of studying emotions toward food. Moreover, some researchers have argued that food evaluations are not valid across individuals and groups, unless feelings toward food cues are compared with feelings toward intense experiences unrelated to food, that serve as benchmarks. Therefore the OLAF presented here, comprising a set of original food images and a group of standardized highly emotional pictures, is intended to provide valid between-group judgments in adults. Emotional images (erotica, mutilations, and neutrals from the International Affective Picture System/IAPS) additionally ensure that the affective ratings are consistent with emotion research. The OLAF depicts high-calorie sweet and savory foods and low-calorie fruits and vegetables, portraying foods within natural scenes matching the IAPS features. An adult sample evaluated both food and affective pictures in terms of pleasure, arousal, dominance, and food craving, following standardized affective rating procedures. The affective ratings for the emotional pictures corroborated previous findings, thus confirming the reliability of evaluations for the food images. Among the OLAF images, high-calorie sweet and savory foods elicited the greatest pleasure, although they elicited, as expected, less arousal than erotica. The observed patterns were consistent with research on emotions and confirmed the reliability of OLAF evaluations. The OLAF and affective pictures constitute a sound methodology to investigate emotions toward food within a wider motivational framework. The OLAF is freely accessible at digibug.ugr.es. PMID:27513636
Development and Validation of the Behavioral Tendencies Questionnaire

PubMed Central

Van Dam, Nicholas T.; Brown, Anna; Mole, Tom B.; Davis, Jake H.; Britton, Willoughby B.; Brewer, Judson A.

2015-01-01

At a fundamental level, taxonomy of behavior and behavioral tendencies can be described in terms of approach, avoid, or equivocate (i.e., neither approach nor avoid). While there are numerous theories of personality, temperament, and character, few seem to take advantage of parsimonious taxonomy. The present study sought to implement this taxonomy by creating a questionnaire based on a categorization of behavioral temperaments/tendencies first identified in Buddhist accounts over fifteen hundred years ago. Items were developed using historical and contemporary texts of the behavioral temperaments, described as “Greedy/Faithful”, “Aversive/Discerning”, and “Deluded/Speculative”. To both maintain this categorical typology and benefit from the advantageous properties of forced-choice response format (e.g., reduction of response biases), binary pairwise preferences for items were modeled using Latent Class Analysis (LCA). One sample (n1 = 394) was used to estimate the item parameters, and the second sample (n2 = 504) was used to classify the participants using the established parameters and cross-validate the classification against multiple other measures. The cross-validated measure exhibited good nomothetic span (construct-consistent relationships with related measures) that seemed to corroborate the ideas present in the original Buddhist source documents. The final 13-block questionnaire created from the best performing items (the Behavioral Tendencies Questionnaire or BTQ) is a psychometrically valid questionnaire that is historically consistent, based in behavioral tendencies, and promises practical and clinical utility particularly in settings that teach and study meditation practices such as Mindfulness Based Stress Reduction (MBSR). PMID:26535904
Validity and reliability of an Arabic version of the state-trait anxiety inventory in a Saudi dental setting

PubMed Central

Bahammam, Maha A.

2016-01-01

Objectives: To test the psychometric properties of an adapted Arabic version of the state trait anxiety-form Y (STAI-Y) in Saudi adult dental patients. Methods: In this cross-sectional study, the published Arabic version of the STAI-Y was evaluated by 2 experienced bilingual professionals for its compatibility with Saudi culture and revised prior to testing. Three hundred and eighty-seven patients attending dental clinics for treatment at the Faculty of Dentistry Hospital, King Abdullah University, Jeddah, Kingdom of Saudi Arabia, participated in the study. The Arabic version of the modified dental anxiety scale (MDAS) and visual analogue scale (VAS) ratings of anxiety were used to assess the concurrent criterion validity. Results: The Arabic version of the STAI-Y had high internal consistency reliability (Cronbach’s alpha: 0.989) for state and trait subscales. Factor analysis indicated unidimensionality of the scale. Correlations between STAI-Y scores and both MDAS and VAS scores indicated strong concurrent criterion validity. Discriminant validity was supported by the findings that higher anxiety levels were present among females as opposed to males, younger individuals as compared to older individuals, and patients who do not visit the dentist unless they have a need as opposed to more frequent visitors to the dental office. Conclusion: The Arabic version of the STAI-Y has an adequate internal consistency reliability, generally similar to that reported in the international literature, suggesting it is appropriate for assessing dental anxiety in Arabic speaking populations. PMID:27279514

Development and validation of Big Four personality scales for the Schedule for Nonadaptive and Adaptive Personality--Second Edition (SNAP-2).

PubMed

Calabrese, William R; Rudick, Monica M; Simms, Leonard J; Clark, Lee Anna

2012-09-01

Recently, integrative, hierarchical models of personality and personality disorder (PD)--such as the Big Three, Big Four, and Big Five trait models--have gained support as a unifying dimensional framework for describing PD. However, no measures to date can simultaneously represent each of these potentially interesting levels of the personality hierarchy. To unify these measurement models psychometrically, we sought to develop Big Five trait scales within the Schedule for Nonadaptive and Adaptive Personality--Second Edition (SNAP-2). Through structural and content analyses, we examined relations between the SNAP-2, the Big Five Inventory (BFI), and the NEO Five-Factor Inventory (NEO-FFI) ratings in a large data set (N = 8,690), including clinical, military, college, and community participants. Results yielded scales consistent with the Big Four model of personality (i.e., Neuroticism, Conscientiousness, Introversion, and Antagonism) and not the Big Five, as there were insufficient items related to Openness. Resulting scale scores demonstrated strong internal consistency and temporal stability. Structural validity and external validity were supported by strong convergent and discriminant validity patterns between Big Four scale scores and other personality trait scores and expectable patterns of self-peer agreement. Descriptive statistics and community-based norms are provided. The SNAP-2 Big Four Scales enable researchers and clinicians to assess personality at multiple levels of the trait hierarchy and facilitate comparisons among competing big-trait models. PsycINFO Database Record (c) 2012 APA, all rights reserved.
Interest in Aesthetic Rhinoplasty Scale.

PubMed

Naraghi, Mohsen; Atari, Mohammad

2017-04-01

Interest in cosmetic surgery is increasing, with rhinoplasty being one of the most popular surgical procedures. It is essential that surgeons identify patients with existing psychological conditions before any procedure. This study aimed to develop and validate the Interest in Aesthetic Rhinoplasty Scale (IARS). Four studies were conducted to develop the IARS and to evaluate different indices of validity (face, content, construct, criterion, and concurrent validities) and reliability (internal consistency, split-half coefficient, and temporal stability) of the scale. The four study samples included a total of 463 participants. Statistical analysis revealed satisfactory psychometric properties in all samples. Scores on the IARS were negatively correlated with self-esteem scores ( r = -0.296; p < 0.01) and positively associated with scores for psychopathologic symptoms ( r = 0.164; p < 0.05), social dysfunction ( r = 0.268; p < 0.01), and depression ( r = 0.308; p < 0.01). The internal and test-retest coefficients of consistency were found to be high (α = 0.93; intraclass coefficient = 0.94). Rhinoplasty patients were found to have significantly higher IARS scores than nonpatients ( p < 0.001). Findings of the present studies provided evidence for face, content, construct, criterion, and concurrent validities and internal and test-retest reliability of the IARS. This evidence supports the use of the scale in clinical and research settings. Thieme Medical Publishers 333 Seventh Avenue, New York, NY 10001, USA.
Validation of the Korean Version of the Breast Cancer Screening Beliefs Questionnaire.

PubMed

Kwok, Cannas; Lee, Mi-Joung; Lee, Chun Fan

Korean immigrant women have been consistently reported as having low participation in breast cancer screening practices. A valid and reliable instrument to explore factors that affect their cancer screening behaviors is essential. The aim of this study was to report the psychometric properties of the Korean version of the Breast Cancer Screening Beliefs Questionnaire (BCSBQ). A convenience sample of 249 Korean Australian women was recruited through a number of Korean community organizations in Sydney. Exploratory factor analysis supports a similar fit for the original 3-factor structure of our data set. A significant association was found between the attitudes of these women toward general health checkups and the frequency of their performance of the breast awareness practices and having mammograms. Furthermore, it was found that knowledge and perceptions about the breast cancer scales were significantly associated with education level and that barriers to mammographic screening were much less evident among women who engaged in the 3 screening practices. The results indicated that the Korean version of the BCSBQ had satisfactory validity and internal consistency. The Cronbach's α of the 3 subscales ranged between .80 and .88. The Korean version of the BCSBQ was confirmed to be a culturally appropriate, valid, and reliable instrument for assessing the beliefs, knowledge, and attitudes to breast cancer and breast cancer screening practices among women of Korean background living in Australia. The Korean version of the BCBSQ can provide nurses with insights into the development of culturally sensitive breast health education programs.
Low-pleasure beliefs in patients with schizophrenia and individuals with social anhedonia.

PubMed

Yang, Yin; Yang, Zhuo-Ya; Zou, Ying-Min; Shi, Hai-Song; Wang, Yi; Xie, Dong-Jie; Zhang, Rui-Ting; Lui, Simon S Y; Cohen, Alex C; Strauss, Gregory P; Cheung, Eric F C; Chan, Raymond C K

2018-05-24

Anhedonia in schizophrenia has been suggested to comprise a set of low-pleasure beliefs, defined as beliefs that certain things/activities were not pleasurable or that one does not feel pleasant generally. However, no instrument has been intentionally developed to specifically measure low-pleasure beliefs, and there is a paucity of empirical evidence for low-pleasure beliefs and their relationship with anhedonia in both patients with schizophrenia and individuals with high social anhedonia. We developed and validated the Beliefs About Pleasure Scale (BAPS) using non-clinical (Studies 1, 2 & 3), chronic schizophrenia (Study 2), and first episode schizophrenia (Study 3) samples. Across these studies, we examined psychometric properties of the BAPS, including temporal stability, internal consistency, factor structure, and convergent validity. The 22 BAPS items loaded onto 4 factors, namely the "Devaluation of Pleasure", the "Pleasurable Activity Expectancies", the "Negative Outcomes Expectancies", and the "Attention to Pleasure". The measure demonstrated good internal consistency and convergent validity in each sample. Moreover, both individual with schizophrenia and non-clinical participants with high social anhedonia scored higher on the BAPS than controls (Study 3), supporting construct validity. These findings provide preliminary evidence for the presence of low-pleasure beliefs in both clinical and subclinical groups and suggest that the BAPS has promising initial psychometric properties. The BAPS will be useful for exploring the cognitive component of anhedonia and provides a novel assessment for mechanism of change in psychosocial treatment studies. Copyright © 2018. Published by Elsevier B.V.
Development and Validation of the Behavioral Tendencies Questionnaire.

PubMed

Van Dam, Nicholas T; Brown, Anna; Mole, Tom B; Davis, Jake H; Britton, Willoughby B; Brewer, Judson A

2015-01-01

At a fundamental level, taxonomy of behavior and behavioral tendencies can be described in terms of approach, avoid, or equivocate (i.e., neither approach nor avoid). While there are numerous theories of personality, temperament, and character, few seem to take advantage of parsimonious taxonomy. The present study sought to implement this taxonomy by creating a questionnaire based on a categorization of behavioral temperaments/tendencies first identified in Buddhist accounts over fifteen hundred years ago. Items were developed using historical and contemporary texts of the behavioral temperaments, described as "Greedy/Faithful", "Aversive/Discerning", and "Deluded/Speculative". To both maintain this categorical typology and benefit from the advantageous properties of forced-choice response format (e.g., reduction of response biases), binary pairwise preferences for items were modeled using Latent Class Analysis (LCA). One sample (n1 = 394) was used to estimate the item parameters, and the second sample (n2 = 504) was used to classify the participants using the established parameters and cross-validate the classification against multiple other measures. The cross-validated measure exhibited good nomothetic span (construct-consistent relationships with related measures) that seemed to corroborate the ideas present in the original Buddhist source documents. The final 13-block questionnaire created from the best performing items (the Behavioral Tendencies Questionnaire or BTQ) is a psychometrically valid questionnaire that is historically consistent, based in behavioral tendencies, and promises practical and clinical utility particularly in settings that teach and study meditation practices such as Mindfulness Based Stress Reduction (MBSR).
Adaptation of clinical prediction models for application in local settings.

PubMed

Kappen, Teus H; Vergouwe, Yvonne; van Klei, Wilton A; van Wolfswinkel, Leo; Kalkman, Cor J; Moons, Karel G M

2012-01-01

When planning to use a validated prediction model in new patients, adequate performance is not guaranteed. For example, changes in clinical practice over time or a different case mix than the original validation population may result in inaccurate risk predictions. To demonstrate how clinical information can direct updating a prediction model and development of a strategy for handling missing predictor values in clinical practice. A previously derived and validated prediction model for postoperative nausea and vomiting was updated using a data set of 1847 patients. The update consisted of 1) changing the definition of an existing predictor, 2) reestimating the regression coefficient of a predictor, and 3) adding a new predictor to the model. The updated model was then validated in a new series of 3822 patients. Furthermore, several imputation models were considered to handle real-time missing values, so that possible missing predictor values could be anticipated during actual model use. Differences in clinical practice between our local population and the original derivation population guided the update strategy of the prediction model. The predictive accuracy of the updated model was better (c statistic, 0.68; calibration slope, 1.0) than the original model (c statistic, 0.62; calibration slope, 0.57). Inclusion of logistical variables in the imputation models, besides observed patient characteristics, contributed to a strategy to deal with missing predictor values at the time of risk calculation. Extensive knowledge of local, clinical processes provides crucial information to guide the process of adapting a prediction model to new clinical practices.
The Child Adolescent Bullying Scale (CABS): Psychometric evaluation of a new measure.

PubMed

Strout, Tania D; Vessey, Judith A; DiFazio, Rachel L; Ludlow, Larry H

2018-06-01

While youth bullying is a significant public health problem, healthcare providers have been limited in their ability to identify bullied youths due to the lack of a reliable, and valid instrument appropriate for use in clinical settings. We conducted a multisite study to evaluate the psychometric properties of a new 22-item instrument for assessing youths' experiences of being bullied, the Child Adolescent Bullying Scale (CABS). The 20 items summed to produce the measure's score were evaluated here. Diagnostic performance was assessed through evaluation of sensitivity, specificity, predictive values, and area under receiver operating characteristic (AUROC) curve. A sample of 352 youths from diverse racial, ethnic, and geographic backgrounds (188 female, 159 male, 5 transgender, sample mean age 13.5 years) were recruited from two clinical sites. Participants completed the CABS and existing youth bullying measures. Analyses grounded in classical test theory, including assessments of reliability and validity, item analyses, and principal components analysis, were conducted. The diagnostic performance and test characteristics of the CABS were also evaluated. The CABS is comprised of one component, accounting for 67% of observed variance. Analyses established evidence of internal consistency reliability (Cronbach's α = 0.97), construct and convergent validity. Sensitivity was 84%, specificity was 65%, and the AUROC curve was 0.74 (95% CI: 0.69-0.80). Findings suggest that the CABS holds promise as a reliable, valid tool for healthcare provider use in screening for bullying exposure in the clinical setting. © 2018 Wiley Periodicals, Inc.
Measuring Access to Information and Technology: Environmental Factors Affecting Persons With Neurologic Disorders.

PubMed

Hahn, Elizabeth A; Garcia, Sofia F; Lai, Jin-Shei; Miskovic, Ana; Jerousek, Sara; Semik, Patrick; Wong, Alex; Heinemann, Allen W

2016-08-01

To develop and validate a patient-reported measure of access to information and technology (AIT) for persons with spinal cord injury, stroke, or traumatic brain injury. A mixed-methods approach was used to develop items, refine them through cognitive interviews, and evaluate their psychometric properties. Item responses were evaluated with the Rasch rating scale model. Correlational and analysis-of-variance methods were used to evaluate construct validity. Community-dwelling individuals participated in telephone interviews or traveled to the academic medical centers where this research took place. Individuals with a diagnosis of spinal cord injury, stroke, or traumatic brain injury (aged ≥18y, English speaking) participated in cognitive interviews (n=12 persons), field testing of the items (n=305 persons), and validation testing of the final set of items (n=604 persons). Not applicable. A set of items to measure AIT for people with disabilities. A user-friendly multimedia touchscreen was used for self-administration of the items. A 23-item AIT measure demonstrated good evidence of internal consistency reliability, and content and construct validity. This new AIT measure will enable researchers and clinicians to determine to what extent environmental factors influence health outcomes and social participation in people with disabilities. The AIT measure could also provide disability advocates with more specific and detailed information about environmental factors to lobby for elimination of barriers. Copyright © 2016 American Congress of Rehabilitation Medicine. Published by Elsevier Inc. All rights reserved.
Applying Classification Trees to Hospital Administrative Data to Identify Patients with Lower Gastrointestinal Bleeding

PubMed Central

Siddique, Juned; Ruhnke, Gregory W.; Flores, Andrea; Prochaska, Micah T.; Paesch, Elizabeth; Meltzer, David O.; Whelan, Chad T.

2015-01-01

Background Lower gastrointestinal bleeding (LGIB) is a common cause of acute hospitalization. Currently, there is no accepted standard for identifying patients with LGIB in hospital administrative data. The objective of this study was to develop and validate a set of classification algorithms that use hospital administrative data to identify LGIB. Methods Our sample consists of patients admitted between July 1, 2001 and June 30, 2003 (derivation cohort) and July 1, 2003 and June 30, 2005 (validation cohort) to the general medicine inpatient service of the University of Chicago Hospital, a large urban academic medical center. Confirmed cases of LGIB in both cohorts were determined by reviewing the charts of those patients who had at least 1 of 36 principal or secondary International Classification of Diseases, Ninth revision, Clinical Modification (ICD-9-CM) diagnosis codes associated with LGIB. Classification trees were used on the data of the derivation cohort to develop a set of decision rules for identifying patients with LGIB. These rules were then applied to the validation cohort to assess their performance. Results Three classification algorithms were identified and validated: a high specificity rule with 80.1% sensitivity and 95.8% specificity, a rule that balances sensitivity and specificity (87.8% sensitivity, 90.9% specificity), and a high sensitivity rule with 100% sensitivity and 91.0% specificity. Conclusion These classification algorithms can be used in future studies to evaluate resource utilization and assess outcomes associated with LGIB without the use of chart review. PMID:26406318
Measuring teamwork in health care settings: a review of survey instruments.

PubMed

Valentine, Melissa A; Nembhard, Ingrid M; Edmondson, Amy C

2015-04-01

Teamwork in health care settings is widely recognized as an important factor in providing high-quality patient care. However, the behaviors that comprise effective teamwork, the organizational factors that support teamwork, and the relationship between teamwork and patient outcomes remain empirical questions in need of rigorous study. To identify and review survey instruments used to assess dimensions of teamwork so as to facilitate high-quality research on this topic. We conducted a systematic review of articles published before September 2012 to identify survey instruments used to measure teamwork and to assess their conceptual content, psychometric validity, and relationships to outcomes of interest. We searched the ISI Web of Knowledge database, and identified relevant articles using the search terms team, teamwork, or collaboration in combination with survey, scale, measure, or questionnaire. We found 39 surveys that measured teamwork. Surveys assessed different dimensions of teamwork. The most commonly assessed dimensions were communication, coordination, and respect. Of the 39 surveys, 10 met all of the criteria for psychometric validity, and 14 showed significant relationships to nonself-report outcomes. Evidence of psychometric validity is lacking for many teamwork survey instruments. However, several psychometrically valid instruments are available. Researchers aiming to advance research on teamwork in health care should consider using or adapting one of these instruments before creating a new one. Because instruments vary considerably in the behavioral processes and emergent states of teamwork that they capture, researchers must carefully evaluate the conceptual consistency between instrument, research question, and context.
Validation of the Minority Stress Scale Among Italian Gay and Bisexual Men.

PubMed

Pala, Andrea Norcini; Dell'Amore, Francesca; Steca, Patrizia; Clinton, Lauren; Sandfort, Theodorus; Rael, Christine

2017-12-01

The experience of sexual orientation stigma (e.g., homophobic discrimination and physical aggression) generates minority stress, a chronic form of psychosocial stress. Minority stress has been shown to have a negative effect on gay and bisexual men's (GBM's) mental and physical health, increasing the rates of depression, suicidal ideation, and HIV risk behaviors. In conservative religious settings, such as Italy, sexual orientation stigma can be more frequently and/or more intensively experienced. However, minority stress among Italian GBM remains understudied. The aim of this study was to explore the dimensionality, internal reliability, and convergent validity of the Minority Stress Scale (MSS), a comprehensive instrument designed to assess the manifestations of sexual orientation stigma. The MSS consists of 50 items assessing (a) Structural Stigma, (b) Enacted Stigma, (c) Expectations of Discrimination, (d) Sexual Orientation Concealment, (e) Internalized Homophobia Toward Others, (f) Internalized Homophobia toward Oneself, and (g) Stigma Awareness. We recruited an online sample of 451 Italian GBM to take the MSS. We tested convergent validity using the Perceived Stress Questionnaire. Through exploratory factor analysis, we extracted the 7 theoretical factors and an additional 3-item factor assessing Expectations of Discrimination From Family Members. The MSS factors showed good internal reliability (ordinal α > .81) and good convergent validity. Our scale can be suitable for applications in research settings, psychosocial interventions, and, potentially, in clinical practice. Future studies will be conducted to further investigate the properties of the MSS, exploring the association with additional health-related measures (e.g., depressive symptoms and anxiety).
Methods for Geometric Data Validation of 3d City Models

NASA Astrophysics Data System (ADS)

Wagner, D.; Alam, N.; Wewetzer, M.; Pries, M.; Coors, V.

2015-12-01

Geometric quality of 3D city models is crucial for data analysis and simulation tasks, which are part of modern applications of the data (e.g. potential heating energy consumption of city quarters, solar potential, etc.). Geometric quality in these contexts is however a different concept as it is for 2D maps. In the latter case, aspects such as positional or temporal accuracy and correctness represent typical quality metrics of the data. They are defined in ISO 19157 and should be mentioned as part of the metadata. 3D data has a far wider range of aspects which influence their quality, plus the idea of quality itself is application dependent. Thus, concepts for definition of quality are needed, including methods to validate these definitions. Quality on this sense means internal validation and detection of inconsistent or wrong geometry according to a predefined set of rules. A useful starting point would be to have correct geometry in accordance with ISO 19107. A valid solid should consist of planar faces which touch their neighbours exclusively in defined corner points and edges. No gaps between them are allowed, and the whole feature must be 2-manifold. In this paper, we present methods to validate common geometric requirements for building geometry. Different checks based on several algorithms have been implemented to validate a set of rules derived from the solid definition mentioned above (e.g. water tightness of the solid or planarity of its polygons), as they were developed for the software tool CityDoctor. The method of each check is specified, with a special focus on the discussion of tolerance values where they are necessary. The checks include polygon level checks to validate the correctness of each polygon, i.e. closeness of the bounding linear ring and planarity. On the solid level, which is only validated if the polygons have passed validation, correct polygon orientation is checked, after self-intersections outside of defined corner points and edges are detected, among additional criteria. Self-intersection might lead to different results, e.g. intersection points, lines or areas. Depending on the geometric constellation, they might represent gaps between bounding polygons of the solids, overlaps, or violations of the 2-manifoldness. Not least due to the floating point problem in digital numbers, tolerances must be considered in some algorithms, e.g. planarity and solid self-intersection. Effects of different tolerance values and their handling is discussed; recommendations for suitable values are given. The goal of the paper is to give a clear understanding of geometric validation in the context of 3D city models. This should also enable the data holder to get a better comprehension of the validation results and their consequences on the deployment fields of the validated data set.
The development and exploratory analysis of the Back Pain Attitudes Questionnaire (Back-PAQ)

PubMed Central

Darlow, Ben; Perry, Meredith; Mathieson, Fiona; Stanley, James; Melloh, Markus; Marsh, Reginald; Baxter, G David; Dowell, Anthony

2014-01-01

Objectives To develop an instrument to assess attitudes and underlying beliefs about back pain, and subsequently investigate its internal consistency and underlying structures. Design The instrument was developed by a multidisciplinary team of clinicians and researchers based on analysis of qualitative interviews with people experiencing acute and chronic back pain. Exploratory analysis was conducted using data from a population-based cross-sectional survey. Setting Qualitative interviews with community-based participants and subsequent postal survey. Participants Instrument development informed by interviews with 12 participants with acute back pain and 11 participants with chronic back pain. Data for exploratory analysis collected from New Zealand residents and citizens aged 18 years and above. 1000 participants were randomly selected from the New Zealand Electoral Roll. 602 valid responses were received. Measures The 34-item Back Pain Attitudes Questionnaire (Back-PAQ) was developed. Internal consistency was evaluated by the Cronbach α coefficient. Exploratory analysis investigated the structure of the data using Principal Component Analysis. Results The 34-item long form of the scale had acceptable internal consistency (α=0.70; 95% CI 0.66 to 0.73). Exploratory analysis identified five two-item principal components which accounted for 74% of the variance in the reduced data set: ‘vulnerability of the back’; ‘relationship between back pain and injury’; ‘activity participation while experiencing back pain’; ‘prognosis of back pain’ and ‘psychological influences on recovery’. Internal consistency was acceptable for the reduced 10-item scale (α=0.61; 95% CI 0.56 to 0.66) and the identified components (α between 0.50 and 0.78). Conclusions The 34-item long form of the scale may be appropriate for use in future cross-sectional studies. The 10-item short form may be appropriate for use as a screening tool, or an outcome assessment instrument. Further testing of the 10-item Back-PAQ's construct validity, reliability, responsiveness to change and predictive ability needs to be conducted. PMID:24860003
Validation of the Italian version of the Apathy Evaluation Scale (AES-I) in institutionalized geriatric patients.

PubMed

Borgi, Marta; Caccamo, Floriana; Giuliani, Alessandro; Piergentili, Alessandro; Sessa, Sonia; Reda, Emilia; Alleva, Enrico; Cirulli, Francesca; Miraglia, Fabio

2016-01-01

Apathy is a very common symptom in the institutionalized elderly and represents a condition of both clinical and public health importance. The Apathy Evaluation Scale (AES) has been shown to be a valid and reliable tool for characterizing, quantifying and differentiating apathy in various health conditions. The aims of this study were to establish the validity and reliability of the Italian version of the AES, and to assess the severity of apathy in a sample of Italian institutionalized geriatric patients. Data were collected from clinical interviews using the AES informant version (AES-I). Associations between measures of apathy and depression, cognitive functioning and perceived quality of life were evaluated, as well as the effects of the living environment on apathetic symptoms. Multiple forms of reliability and validity (i.e. test-retest, internal consistency, discriminability of apathy rating from a standard measure of depression) were satisfied. Our results also show that the characteristics of the care setting may affect the severity of apathetic symptoms. The AES-I Italian version is a reliable and valid instrument for measuring apathy in Italian patients, also allowing a direct comparison with data gathered in other countries.
Development and preliminary validation of a self-report measure of psychopathic personality traits in noncriminal populations.

PubMed

Lilienfeld, S O; Andrews, B P

1996-06-01

Research on psychopathology has been hindered by persisting difficulties and controversies regarding its assessment. The primary goals of this set of studies were to (a) develop, and initiate the construct validation of, a self-report measure that assesses the major personality traits of psychopathy in noncriminal populations and (b) clarify the nature of these traits via an exploratory approach to test construction. This measure, the Psychopathic Personality Inventory (PPI), was developed by writing items to assess a large number of personality domains relevant to psychopathy and performing successive item-level factor analyses and revisions on three undergraduate samples. The PPI total score and its eight subscales were found to possess satisfactory internal consistency and test-retest reliability. In four studies with undergraduates, the PPI and its subscales exhibited a promising pattern of convergent and discriminant validity with self-report, psychiatric interview, observer rating, and family history data. In addition, the PPI total score demonstrated incremental validity relative to several commonly used self-report psychopathy-related measures. Future construct validation studies, unresolved conceptual issues regarding the assessment of psychopathy, and potential research uses of the PPI are outlined.
Development and validation of an instrument to measure nurse educator perceived confidence in clinical teaching.

PubMed

Nguyen, Van N B; Forbes, Helen; Mohebbi, Mohammadreza; Duke, Maxine

2017-12-01

Teaching nursing in clinical environments is considered complex and multi-faceted. Little is known about the role of the clinical nurse educator, specifically the challenges related to transition from clinician, or in some cases, from newly-graduated nurse to that of clinical nurse educator, as occurs in developing countries. Confidence in the clinical educator role has been associated with successful transition and the development of role competence. There is currently no valid and reliable instrument to measure clinical nurse educator confidence. This study was conducted to develop and psychometrically test an instrument to measure perceived confidence among clinical nurse educators. A multi-phase, multi-setting survey design was used. A total of 468 surveys were distributed, and 363 were returned. Data were analyzed using exploratory and confirmatory factor analyses. The instrument was successfully tested and modified in phase 1, and factorial validity was subsequently confirmed in phase 2. There was strong evidence of internal consistency, reliability, content, and convergent validity of the Clinical Nurse Educator Skill Acquisition Assessment instrument. The resulting instrument is applicable in similar contexts due to its rigorous development and validation process. © 2017 The Authors. Nursing & Health Sciences published by John Wiley & Sons Australia, Ltd.
Towards Automatic Validation and Healing of Citygml Models for Geometric and Semantic Consistency

NASA Astrophysics Data System (ADS)

Alam, N.; Wagner, D.; Wewetzer, M.; von Falkenhausen, J.; Coors, V.; Pries, M.

2013-09-01

A steadily growing number of application fields for large 3D city models have emerged in recent years. Like in many other domains, data quality is recognized as a key factor for successful business. Quality management is mandatory in the production chain nowadays. Automated domain-specific tools are widely used for validation of business-critical data but still common standards defining correct geometric modeling are not precise enough to define a sound base for data validation of 3D city models. Although the workflow for 3D city models is well-established from data acquisition to processing, analysis and visualization, quality management is not yet a standard during this workflow. Processing data sets with unclear specification leads to erroneous results and application defects. We show that this problem persists even if data are standard compliant. Validation results of real-world city models are presented to demonstrate the potential of the approach. A tool to repair the errors detected during the validation process is under development; first results are presented and discussed. The goal is to heal defects of the models automatically and export a corrected CityGML model.
Validation of Malay Version of Snaith-Hamilton Pleasure Scale: Comparison between Depressed Patients and Healthy Subjects at an Out-Patient Clinic in Malaysia

PubMed Central

NG, Chong Guan; CHIN, Soo Cheng; YEE, Anne Hway Ann; LOH, Huai Seng; SULAIMAN, Ahmad Hatim; Sherianne Sook Kuan, WONG; HABIL, Mohamed Hussain

2014-01-01

Background: The Snaith-Hamilton Pleasure Scale (SHAPS) is a self-assessment scale designed to evaluate anhedonia in various psychiatric disorders. In order to facilitate its use in Malaysian settings, our current study aimed to examine the validity of a Malay-translated version of the SHAPS (SHAPS-M). Methods: In this cross-sectional study, a total of 44 depressed patients and 82 healthy subjects were recruited from a university out-patient clinic. All participants were given both the Malay and English versions of the SHAPS, Fawcett-Clark Pleasure Scale (FCPS), General Health Questionnaire 12 (GHQ-12), and the Beck Depression Inventory (BDI) to assess their hedonic state, general mental health condition and levels of depression. Results: The results showed that the SHAPS-M has impressive internal consistency (α = 0.96), concurrent validity and good parallel-form reliability (intraclass coefficient, ICC = 0.65). Conclusion: In addition to demonstrating good psychometric properties, the SHAPS-M is easy to administer. Therefore, it is a valid, reliable, and suitable questionnaire for assessing anhedonia among depressed patients in Malaysia. PMID:25246837
A new validated method for the simultaneous determination of benzocaine, propylparaben and benzyl alcohol in a bioadhesive gel by HPLC.

PubMed

Pérez-Lozano, P; García-Montoya, E; Orriols, A; Miñarro, M; Ticó, J R; Suñé-Negre, J M

2005-10-04

A new HPLC-RP method has been developed and validated for the simultaneous determination of benzocaine, two preservatives (propylparaben (nipasol) and benzyl alcohol) and degradation products of benzocaine in a semisolid pharmaceutical dosage form (benzocaine gel). The method uses a Nucleosil 120 C18 column and gradient elution. The mobile phase consisted of a mixture of methanol and glacial acetic acid (10%, v/v) at different proportion according to a time-schedule programme, pumped at a flow rate of 2.0 ml min(-1). The DAD detector was set at 258 nm. The validation study was carried out fulfilling the ICH guidelines in order to prove that the new analytical method, meets the reliability characteristics, and these characteristics showed the capacity of analytical method to keep, throughout the time, the fundamental criteria for validation: selectivity, linearity, precision, accuracy and sensitivity. The method was applied during the quality control of benzocaine gel in order to quantify the drug (benzocaine), preservatives and degraded products and proved to be suitable for rapid and reliable quality control method.
Validation of Malay Version of Snaith-Hamilton Pleasure Scale: Comparison between Depressed Patients and Healthy Subjects at an Out-Patient Clinic in Malaysia.

PubMed

Ng, Chong Guan; Chin, Soo Cheng; Yee, Anne Hway Ann; Loh, Huai Seng; Sulaiman, Ahmad Hatim; Sherianne Sook Kuan, Wong; Habil, Mohamed Hussain

2014-05-01

The Snaith-Hamilton Pleasure Scale (SHAPS) is a self-assessment scale designed to evaluate anhedonia in various psychiatric disorders. In order to facilitate its use in Malaysian settings, our current study aimed to examine the validity of a Malay-translated version of the SHAPS (SHAPS-M). In this cross-sectional study, a total of 44 depressed patients and 82 healthy subjects were recruited from a university out-patient clinic. All participants were given both the Malay and English versions of the SHAPS, Fawcett-Clark Pleasure Scale (FCPS), General Health Questionnaire 12 (GHQ-12), and the Beck Depression Inventory (BDI) to assess their hedonic state, general mental health condition and levels of depression. The results showed that the SHAPS-M has impressive internal consistency (α = 0.96), concurrent validity and good parallel-form reliability (intraclass coefficient, ICC = 0.65). In addition to demonstrating good psychometric properties, the SHAPS-M is easy to administer. Therefore, it is a valid, reliable, and suitable questionnaire for assessing anhedonia among depressed patients in Malaysia.

Toward validation of a structural approach to conceptualizing psychopathology: A special section of the Journal of Abnormal Psychology.

PubMed

Krueger, Robert F; Tackett, Jennifer L; MacDonald, Angus

2016-11-01

Traditionally, psychopathology has been conceptualized in terms of polythetic categories derived from committee deliberations and enshrined in authoritative psychiatric nosologies-most notably the Diagnostic and Statistical Manual of Mental Disorders (DSM; American Psychiatric Association [APA], 2013). As the limitations of this form of classification have become evident, empirical data have been increasingly relied upon to investigate the structure of psychopathology. These efforts have borne fruit in terms of an increasingly consistent set of psychopathological constructs closely connected with similar personality constructs. However, the work of validating these constructs using convergent sources of data is an ongoing enterprise. This special section collects several new efforts to use structural approaches to study the validity of this empirically based organizational scheme for psychopathology. Inasmuch as a structural approach reflects the natural organization of psychopathology, it has great potential to facilitate comprehensive organization of information on the correlates of psychopathology, providing evidence for the convergent and discriminant validity of an empirical approach to classification. Here, we highlight several themes that emerge from this burgeoning literature. (PsycINFO Database Record (c) 2016 APA, all rights reserved).
The Brighton musculoskeletal Patient-Reported Outcome Measure (BmPROM): An assessment of validity, reliability, and responsiveness.

PubMed

Bryant, Elizabeth; Murtagh, Shemane; Finucane, Laura; McCrum, Carol; Mercer, Christopher; Smith, Toby; Canby, Guy; Rowe, David A; Moore, Ann P

2018-05-11

In response for the need of a freely available, stand-alone, validated outcome measure for use within musculoskeletal (MSK) physiotherapy practice, sensitive enough to measure clinical effectiveness, we developed an MSK patient reported outcome measure. This study examined the validity and reliability of the newly developed Brighton musculoskeletal Patient-Reported Outcome Measure (BmPROM) within physiotherapy outpatient settings. Two hundred twenty-four patients attending physiotherapy outpatient departments in South East England with an MSK condition participated in this study. The BmPROM was assessed for user friendliness (rated feedback, N = 224), reliability (internal consistency and test-retest reliability, n = 42), validity (internal and external construct validity, N = 224), and responsiveness (internal, n = 25). Exploratory factor analysis indicated that a two-factor model provides a good fit to the data. Factors were representative of "Functionality" and "Wellbeing". Correlations observed between the BmPROM and SF-36 domains provided evidence of convergent validity. Reliability results indicated that both subscales were internally consistent with alphas above the acceptable limits for both "Functionality" (α = .85, 95% CI [.81, .88]) and 'Wellbeing' (α = .80, 95% CI [.75, .84]). Test-retest analyses (n = 42) demonstrated a high degree of reliability between "Functionality" (ICC = .84; 95% CI [.72, .91]) and "Wellbeing" scores (ICC = .84; 95% CI [.72, .91]). Further examination of test-retest reliability through the Bland-Altman analysis demonstrated that the difference between "Functionality" and "Wellbeing" test scores did not vary as a function of absolute test score. Large treatment effect sizes were found for both subscales (Functionality d = 1.10; Wellbeing 1.03). The BmPROM is a reliable and valid outcome measure for use in evaluating physiotherapy treatment of MSK conditions. Copyright © 2018 John Wiley & Sons, Ltd.
Developing a Risk-scoring Model for Ankylosing Spondylitis Based on a Combination of HLA-B27, Single-nucleotide Polymorphism, and Copy Number Variant Markers.

PubMed

Jung, Seung-Hyun; Cho, Sung-Min; Yim, Seon-Hee; Kim, So-Hee; Park, Hyeon-Chun; Cho, Mi-La; Shim, Seung-Cheol; Kim, Tae-Hwan; Park, Sung-Hwan; Chung, Yeun-Jun

2016-12-01

To develop a genotype-based ankylosing spondylitis (AS) risk prediction model that is more sensitive and specific than HLA-B27 typing. To develop the AS genetic risk scoring (AS-GRS) model, 648 individuals (285 cases and 363 controls) were examined for 5 copy number variants (CNV), 7 single-nucleotide polymorphisms (SNP), and an HLA-B27 marker by TaqMan assays. The AS-GRS model was developed using logistic regression and validated with a larger independent set (576 cases and 680 controls). Through logistic regression, we built the AS-GRS model consisting of 5 genetic components: HLA-B27, 3 CNV (1q32.2, 13q13.1, and 16p13.3), and 1 SNP (rs10865331). All significant associations of genetic factors in the model were replicated in the independent validation set. The discriminative ability of the AS-GRS model measured by the area under the curve was excellent: 0.976 (95% CI 0.96-0.99) in the model construction set and 0.951 (95% CI 0.94-0.96) in the validation set. The AS-GRS model showed higher specificity and accuracy than the HLA-B27-only model when the sensitivity was set to over 94%. When we categorized the individuals into quartiles based on the AS-GRS scores, OR of the 4 groups (low, intermediate-1, intermediate-2, and high risk) showed an increasing trend with the AS-GRS scores (r 2 = 0.950) and the highest risk group showed a 494× higher risk of AS than the lowest risk group (95% CI 237.3-1029.1). Our AS-GRS could be used to identify individuals at high risk for AS before major symptoms appear, which may improve the prognosis for them through early treatment.
The Imperial Paediatric Emergency Training Toolkit (IPETT) for use in paediatric emergency training: development and evaluation of feasibility and validity.

PubMed

Lambden, Simon; DeMunter, Claudine; Dowson, Anne; Cooper, Mehrengise; Gautama, Sanjay; Sevdalis, Nick

2013-06-01

To develop and test the feasibility, reliability, and validity of a practical toolkit for the assessment and feedback of skills required to manage paediatric emergencies in critical care settings. The Imperial Paediatric Emergency Training Toolkit (IPETT) was developed based on current evidence-base and expert input. IPETT assesses both technical and non-technical skills. The technical component covers skills in the areas of clinical assessment, airway and breathing, cardiovascular, and drugs. The non-technical component is based on the validated NOTECHS tool and covers communication and interaction, cooperation and team skills, leadership and managerial skills, and decision-making. The reliability (internal consistency), content validity (inter-correlations between different skills) and concurrent validity (correlations between global technical and non-technical scores) of IPETT were prospectively evaluated in 45 simulated paediatric crises carried out in a PICU with anaesthetic and paediatric trainees (N=52). Non-parametric analyses were carried out. Significance was set at P<0.05. Cronbach alpha reliability coefficients were overall acceptable for the technical (alpha range=0.638-0.810) and good for the non-technical (alpha range=0.701-0.899) component of IPETT. The median inter-skill correlation was rho=0.564 and rho=0.549 for the technical and non-technical components, respectively. These indicate good content validity, as the skills were inter-related but not redundant. We also demonstrate a correlation between the global technical and non-technical scores (rho=0.471) - all Ps<0.05 during the assessments. IPETT offers a psychometrically viable and feasible to use tool in the context of paediatric emergencies training. This study shows that assessment of technical and non-technical skills in combination may offer a more clinically relevant model for training in paediatric emergencies. Further validation should aim to demonstrate skill retention over time and skill transfer from simulation-based training to real emergencies. Copyright © 2013. Published by Elsevier Ireland Ltd.
External Validation of the Acoustic Voice Quality Index Version 03.01 With Extended Representativity.

PubMed

Barsties, Ben; Maryn, Youri

2016-07-01

The Acoustic Voice Quality Index (AVQI) is an objective method to quantify the severity of overall voice quality in concatenated continuous speech and sustained phonation segments. Recently, AVQI was successfully modified to be more representative and ecologically valid because the internal consistency of AVQI was balanced out through equal proportion of the 2 speech types. The present investigation aims to explore its external validation in a large data set. An expert panel of 12 speech-language therapists rated the voice quality of 1058 concatenated voice samples varying from normophonia to severe dysphonia. The Spearman rank-order correlation coefficients (r) were used to measure concurrent validity. The AVQI's diagnostic accuracy was evaluated with several estimates of its receiver operating characteristics (ROC). Finally, 8 of the 12 experts were chosen because of reliability criteria. A strong correlation was identified between AVQI and auditoryperceptual rating (r = 0.815, P = .000). It indicated that 66.4% of the auditory-perceptual rating's variation was explained by AVQI. Additionally, the ROC results showed again the best diagnostic outcome at a threshold of AVQI = 2.43. This study highlights external validation and diagnostic precision of the AVQI version 03.01 as a robust and ecologically valid measurement to objectify voice quality. © The Author(s) 2016.
How well should probabilistic seismic hazard maps work?

NASA Astrophysics Data System (ADS)

Vanneste, K.; Stein, S.; Camelbeeck, T.; Vleminckx, B.

2016-12-01

Recent large earthquakes that gave rise to shaking much stronger than shown in earthquake hazard maps have stimulated discussion about how well these maps forecast future shaking. These discussions have brought home the fact that although the maps are designed to achieve certain goals, we know little about how well they actually perform. As for any other forecast, this question involves verification and validation. Verification involves assessing how well the algorithm used to produce hazard maps implements the conceptual PSHA model ("have we built the model right?"). Validation asks how well the model forecasts the shaking that actually occurs ("have we built the right model?"). We explore the verification issue by simulating the shaking history of an area with assumed distribution of earthquakes, frequency-magnitude relation, temporal occurrence model, and ground-motion prediction equation. We compare the "observed" shaking at many sites over time to that predicted by a hazard map generated for the same set of parameters. PSHA predicts that the fraction of sites at which shaking will exceed that mapped is p = 1 - exp(t/T), where t is the duration of observations and T is the map's return period. This implies that shaking in large earthquakes is typically greater than shown on hazard maps, as has occurred in a number of cases. A large number of simulated earthquake histories yield distributions of shaking consistent with this forecast, with a scatter about this value that decreases as t/T increases. The median results are somewhat lower than predicted for small values of t/T and approach the predicted value for larger values of t/T. Hence, the algorithm appears to be internally consistent and can be regarded as verified for this set of simulations. Validation is more complicated because a real observed earthquake history can yield a fractional exceedance significantly higher or lower than that predicted while still being consistent with the hazard map in question. As a result, given that in the real world we have only a single sample, it is hard to assess whether a misfit between a map and observations arises by chance or reflects a biased map.
Psychometric properties of the Depression Anxiety and Stress Scale-21 in older primary care patients.

PubMed

Gloster, Andrew T; Rhoades, Howard M; Novy, Diane; Klotsche, Jens; Senior, Ashley; Kunik, Mark; Wilson, Nancy; Stanley, Melinda A

2008-10-01

The Depression Anxiety Stress Scale (DASS) was designed to efficiently measure the core symptoms of anxiety and depression and has demonstrated positive psychometric properties in adult samples of anxiety and depression patients and student samples. Despite these findings, the psychometric properties of the DASS remain untested in older adults, for whom the identification of efficient measures of these constructs is especially important. To determine the psychometric properties of the DASS 21-item version in older adults, we analyzed data from 222 medical patients seeking treatment to manage worry. Consistent with younger samples, a three-factor structure best fit the data. Results also indicated good internal consistency, excellent convergent validity, and good discriminative validity, especially for the Depression scale. Receiver operating curve analyses indicated that the DASS-21 predicted the diagnostic presence of generalized anxiety disorder and depression as well as other commonly used measures. These data suggest that the DASS may be used with older adults in lieu of multiple scales designed to measure similar constructs, thereby reducing participant burden and facilitating assessment in settings with limited assessment resources.
Validation of a screening tool for the rapid and reliable detection of CGG trinucleotide repeat expansions in FMR1.

PubMed

Basehore, Monica J; Marlowe, Natalia M; Jones, Julie R; Behlendorf, Deborah E; Laver, Thomas A; Friez, Michael J

2012-06-01

Most individuals with intellectual disability and/or autism are tested for Fragile X syndrome at some point in their lifetime. Greater than 99% of individuals with Fragile X have an expanded CGG trinucleotide repeat motif in the promoter region of the FMR1 gene, and diagnostic testing involves determining the size of the CGG repeat as well as methylation status when an expansion is present. Using a previously described triplet repeat-primed polymerase chain reaction, we have performed additional validation studies using two cohorts with previous diagnostic testing results available for comparison purposes. The first cohort (n=88) consisted of both males and females and had a high percentage of abnormal samples, while the second cohort (n=624) consisted of only females and was not enriched for expansion mutations. Data from each cohort were completely concordant with the results previously obtained during the course of diagnostic testing. This study further demonstrates the utility of using laboratory-developed triplet repeat-primed FMR1 testing in a clinical setting.
Parametric adaptive filtering and data validation in the bar GW detector AURIGA

NASA Astrophysics Data System (ADS)

Ortolan, A.; Baggio, L.; Cerdonio, M.; Prodi, G. A.; Vedovato, G.; Vitale, S.

2002-04-01

We report on our experience gained in the signal processing of the resonant GW detector AURIGA. Signal amplitude and arrival time are estimated by means of a matched-adaptive Wiener filter. The detector noise, entering in the filter set-up, is modelled as a parametric ARMA process; to account for slow non-stationarity of the noise, the ARMA parameters are estimated on an hourly basis. A requirement of the set-up of an unbiased Wiener filter is the separation of time spans with 'almost Gaussian' noise from non-Gaussian and/or strongly non-stationary time spans. The separation algorithm consists basically of a variance estimate with the Chauvenet convergence method and a threshold on the Curtosis index. The subsequent validation of data is strictly connected with the separation procedure: in fact, by injecting a large number of artificial GW signals into the 'almost Gaussian' part of the AURIGA data stream, we have demonstrated that the effective probability distributions of the signal-to-noise ratio χ2 and the time of arrival are those that are expected.
Validation of the Chinese Challenging Behaviour Scale: clinical correlates of challenging behaviours in nursing home residents with dementia.

PubMed

Lam, Chi Leung; Chan, W C; Mok, Cycbie C M; Li, S W; Lam, Linda C W

2006-08-01

Behavioural and psychological symptoms of dementia (BPSD) are associated with considerable burden to patients with dementia and their caregivers. Formal caregivers in residential care settings face different challenges when delivering care. This study aimed at assessing the clinical correlates of challenging BPSD using the Chinese version of the Challenging Behaviour Scale (CCBS) designed for residential care settings. One hundred and twenty-five participants were recruited from three care-and-attention homes in Hong Kong. The CCBS was administered together with the Cantonese version of Mini-Mental State Examination (MMSE), Clinical Dementia Rating (CDR), Disability Assessment for Dementia (DAD) and Neuropsychiatric Inventory (NPI) to explore the relationships between challenging behaviour and important clinical correlates. The CCBS had good internal consistency (alpha = 0.86), inter-rater (ICC = 0.79) and test-retest reliability (ICC = 0.98). A four-factor structure is demonstrated by factor analysis: hyperactivity behaviours, hypoactivity behaviours, verbally aggressive and aberrant behaviours. Challenging behaviours were associated with male gender, cognitive impairment, functional disability, neuropsychiatric symptoms, and higher caregiver's workload. The CCBS is a valid and reliable measure to assess BPSD in residential care settings in local Chinese community. It is useful in evaluating the challenges faced by formal caregivers during daily care of the dementia patients.
Ligand-based and structure-based approaches in identifying ideal pharmacophore against c-Jun N-terminal kinase-3.

PubMed

Kumar, B V S Suneel; Kotla, Rohith; Buddiga, Revanth; Roy, Jyoti; Singh, Sardar Shamshair; Gundla, Rambabu; Ravikumar, Muttineni; Sarma, Jagarlapudi A R P

2011-01-01

Structure and ligand based pharmacophore modeling and docking studies carried out using diversified set of c-Jun N-terminal kinase-3 (JNK3) inhibitors are presented in this paper. Ligand based pharmacophore model (LBPM) was developed for 106 inhibitors of JNK3 using a training set of 21 compounds to reveal structural and chemical features necessary for these molecules to inhibit JNK3. Hypo1 consisted of two hydrogen bond acceptors (HBA), one hydrogen bond donor (HBD), and a hydrophobic (HY) feature with a correlation coefficient (r²) of 0.950. This pharmacophore model was validated using test set containing 85 inhibitors and had a good r² of 0.846. All the molecules were docked using Glide software and interestingly, all the docked conformations showed hydrogen bond interactions with important hinge region amino acids (Gln155 and Met149)and these interactions were compared with Hypo1 features. The results of ligand based pharmacophore model (LBPM)and docking studies are validated each other. The structure based pharmacophore model (SBPM) studies have identified additional features, two hydrogen bond donors and one hydrogen bond acceptor. The combination of these methodologies is useful in designing ideal pharmacophore which provides a powerful tool for the discovery of novel and selective JNK3 inhibitors.
Evaluation of multiple forcing data sets for precipitation and shortwave radiation over major land areas of China

NASA Astrophysics Data System (ADS)

Yang, Fan; Lu, Hui; Yang, Kun; He, Jie; Wang, Wei; Wright, Jonathon S.; Li, Chengwei; Han, Menglei; Li, Yishan

2017-11-01

Precipitation and shortwave radiation play important roles in climatic, hydrological and biogeochemical cycles. Several global and regional forcing data sets currently provide historical estimates of these two variables over China, including the Global Land Data Assimilation System (GLDAS), the China Meteorological Administration (CMA) Land Data Assimilation System (CLDAS) and the China Meteorological Forcing Dataset (CMFD). The CN05.1 precipitation data set, a gridded analysis based on CMA gauge observations, also provides high-resolution historical precipitation data for China. In this study, we present an intercomparison of precipitation and shortwave radiation data from CN05.1, CMFD, CLDAS and GLDAS during 2008-2014. We also validate all four data sets against independent ground station observations. All four forcing data sets capture the spatial distribution of precipitation over major land areas of China, although CLDAS indicates smaller annual-mean precipitation amounts than CN05.1, CMFD or GLDAS. Time series of precipitation anomalies are largely consistent among the data sets, except for a sudden decrease in CMFD after August 2014. All forcing data indicate greater temporal variations relative to the mean in dry regions than in wet regions. Validation against independent precipitation observations provided by the Ministry of Water Resources (MWR) in the middle and lower reaches of the Yangtze River indicates that CLDAS provides the most realistic estimates of spatiotemporal variability in precipitation in this region. CMFD also performs well with respect to annual mean precipitation, while GLDAS fails to accurately capture much of the spatiotemporal variability and CN05.1 contains significant high biases relative to the MWR observations. Estimates of shortwave radiation from CMFD are largely consistent with station observations, while CLDAS and GLDAS greatly overestimate shortwave radiation. All three forcing data sets capture the key features of the spatial distribution, but estimates from CLDAS and GLDAS are systematically higher than those from CMFD over most of mainland China. Based on our evaluation metrics, CLDAS slightly outperforms GLDAS. CLDAS is also closer than GLDAS to CMFD with respect to temporal variations in shortwave radiation anomalies, with substantial differences among the time series. Differences in temporal variations are especially pronounced south of 34° N. Our findings provide valuable guidance for a variety of stakeholders, including land-surface modelers and data providers.
Modern modeling techniques had limited external validity in predicting mortality from traumatic brain injury.

PubMed

van der Ploeg, Tjeerd; Nieboer, Daan; Steyerberg, Ewout W

2016-10-01

Prediction of medical outcomes may potentially benefit from using modern statistical modeling techniques. We aimed to externally validate modeling strategies for prediction of 6-month mortality of patients suffering from traumatic brain injury (TBI) with predictor sets of increasing complexity. We analyzed individual patient data from 15 different studies including 11,026 TBI patients. We consecutively considered a core set of predictors (age, motor score, and pupillary reactivity), an extended set with computed tomography scan characteristics, and a further extension with two laboratory measurements (glucose and hemoglobin). With each of these sets, we predicted 6-month mortality using default settings with five statistical modeling techniques: logistic regression (LR), classification and regression trees, random forests (RFs), support vector machines (SVM) and neural nets. For external validation, a model developed on one of the 15 data sets was applied to each of the 14 remaining sets. This process was repeated 15 times for a total of 630 validations. The area under the receiver operating characteristic curve (AUC) was used to assess the discriminative ability of the models. For the most complex predictor set, the LR models performed best (median validated AUC value, 0.757), followed by RF and support vector machine models (median validated AUC value, 0.735 and 0.732, respectively). With each predictor set, the classification and regression trees models showed poor performance (median validated AUC value, <0.7). The variability in performance across the studies was smallest for the RF- and LR-based models (inter quartile range for validated AUC values from 0.07 to 0.10). In the area of predicting mortality from TBI, nonlinear and nonadditive effects are not pronounced enough to make modern prediction methods beneficial. Copyright © 2016 Elsevier Inc. All rights reserved.
Validation of Procedures for Monitoring Crewmember Immune Function

NASA Technical Reports Server (NTRS)

Pierson, Duane; Crucian, Brian; Mehta, Satish; Stowe, Raymond; Uchakin, Peter; Quiriarte, Heather; Sams, Clarence

2010-01-01

The objective of this Supplemental Medical Objective (SMO) is to determine the status of the immune system, physiological stress and latent viral reactivation (a clinical outcome that can be measured) during both short and long-duration spaceflight. In addition, this study will develop and validate an immune monitoring strategy consistent with operational flight requirements and constraints. Pre-mission, in-flight and post-flight blood and saliva samples will be obtained from participating crewmembers. Assays included peripheral immunophenotype, T cell function, cytokine profiles, viral-specific immunity, latent viral reactivation (EBV, CMV, VZV), and stress hormone measurements. To date, 18 short duration (now completed) and 8 long-duration crewmembers have completed the study. The long-duration phase of this study is ongoing. For this presentation, the final data set for the short duration subjects will be discussed.
TIE: an ability test of emotional intelligence.

PubMed

Śmieja, Magdalena; Orzechowski, Jarosław; Stolarski, Maciej S

2014-01-01

The Test of Emotional Intelligence (TIE) is a new ability scale based on a theoretical model that defines emotional intelligence as a set of skills responsible for the processing of emotion-relevant information. Participants are provided with descriptions of emotional problems, and asked to indicate which emotion is most probable in a given situation, or to suggest the most appropriate action. Scoring is based on the judgments of experts: professional psychotherapists, trainers, and HR specialists. The validation study showed that the TIE is a reliable and valid test, suitable for both scientific research and individual assessment. Its internal consistency measures were as high as .88. In line with theoretical model of emotional intelligence, the results of the TIE shared about 10% of common variance with a general intelligence test, and were independent of major personality dimensions.
Assessing leadership decision-making styles: psychometric properties of the Leadership Judgement Indicator.

PubMed

Faraci, Palmira; Lock, Michael; Wheeler, Robert

2013-01-01

This study aimed to validate the Italian version of the Leadership Judgement Indicator, an unconventional instrument devoted to measurement of leaders' judgments and preferred styles, ie, directive, consultative, consensual, or delegative, when dealing with a range of decision-making scenarios. After forward-translation and back-translation, its psychometric properties were estimated for 299 managers at various levels, who were asked to put themselves in the position of leader and to rate the appropriateness of certain ways of responding to challenge. Differences between several groups of managers, ranked in order of seniority, provided evidence for discriminant validity. Internal consistency was adequate. The findings show that the Italian adaptation of the Leadership Judgement Indicator has promising psychometric qualities, suggesting its suitability for use to improve outcomes in both organizational and selection settings.
The Multimodal Assessment of Adult Attachment Security: Developing the Biometric Attachment Test.

PubMed

Parra, Federico; Miljkovitch, Raphaële; Persiaux, Gwenaelle; Morales, Michelle; Scherer, Stefan

2017-04-06

Attachment theory has been proven essential for mental health, including psychopathology, development, and interpersonal relationships. Validated psychometric instruments to measure attachment abound but suffer from shortcomings common to traditional psychometrics. Recent developments in multimodal fusion and machine learning pave the way for new automated and objective psychometric instruments for adult attachment that combine psychophysiological, linguistic, and behavioral analyses in the assessment of the construct. The aim of this study was to present a new exposure-based, automatic, and objective adult-attachment assessment, the Biometric Attachment Test (BAT), which exposes participants to a short standardized set of visual and music stimuli, whereas their immediate reactions and verbal responses, captured by several computer sense modalities, are automatically analyzed for scoring and classification. We also aimed to empirically validate two of its assumptions: its capacity to measure attachment security and the viability of using themes as placeholders for rotating stimuli. A total of 59 French participants from the general population were assessed using the Adult Attachment Questionnaire (AAQ), the Adult Attachment Projective Picture System (AAP), and the Attachment Multiple Model Interview (AMMI) as ground truth for attachment security. They were then exposed to three different BAT stimuli sets, whereas their faces, voices, heart rate (HR), and electrodermal activity (EDA) were recorded. Psychophysiological features, such as skin-conductance response (SCR) and Bayevsky stress index; behavioral features, such as gaze and facial expressions; as well as linguistic and paralinguistic features, were automatically extracted. An exploratory analysis was conducted using correlation matrices to uncover the features that are most associated with attachment security. A confirmatory analysis was conducted by creating a single composite effects index and by testing it for correlations with attachment security. The stability of the theory-consistent features across three different stimuli sets was explored using repeated measures analysis of variances (ANOVAs). In total, 46 theory-consistent correlations were found during the exploration (out of 65 total significant correlations). For example, attachment security as measured by the AAP was correlated with positive facial expressions (r=.36, P=.01). AMMI's security with the father was inversely correlated with the low frequency (LF) of HRV (r=-.87, P=.03). Attachment security to partners as measured by the AAQ was inversely correlated with anger facial expression (r=-.43, P=.001). The confirmatory analysis showed that the composite effects index was significantly correlated to security in the AAP (r=.26, P=.05) and the AAQ (r=.30, P=.04) but not in the AMMI. Repeated measures ANOVAs conducted individually on each of the theory-consistent features revealed that only 7 of the 46 (15%) features had significantly different values among responses to three different stimuli sets. We were able to validate two of the instrument's core assumptions: its capacity to measure attachment security and the viability of using themes as placeholders for rotating stimuli. Future validation of other of its dimensions, as well as the ongoing development of its scoring and classification algorithms is discussed. ©Federico Parra, Raphaële Miljkovitch, Gwenaelle Persiaux, Michelle Morales, Stefan Scherer. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 06.04.2017.
The Multimodal Assessment of Adult Attachment Security: Developing the Biometric Attachment Test

PubMed Central

Miljkovitch, Raphaële; Persiaux, Gwenaelle; Morales, Michelle; Scherer, Stefan

2017-01-01

Background Attachment theory has been proven essential for mental health, including psychopathology, development, and interpersonal relationships. Validated psychometric instruments to measure attachment abound but suffer from shortcomings common to traditional psychometrics. Recent developments in multimodal fusion and machine learning pave the way for new automated and objective psychometric instruments for adult attachment that combine psychophysiological, linguistic, and behavioral analyses in the assessment of the construct. Objective The aim of this study was to present a new exposure-based, automatic, and objective adult-attachment assessment, the Biometric Attachment Test (BAT), which exposes participants to a short standardized set of visual and music stimuli, whereas their immediate reactions and verbal responses, captured by several computer sense modalities, are automatically analyzed for scoring and classification. We also aimed to empirically validate two of its assumptions: its capacity to measure attachment security and the viability of using themes as placeholders for rotating stimuli. Methods A total of 59 French participants from the general population were assessed using the Adult Attachment Questionnaire (AAQ), the Adult Attachment Projective Picture System (AAP), and the Attachment Multiple Model Interview (AMMI) as ground truth for attachment security. They were then exposed to three different BAT stimuli sets, whereas their faces, voices, heart rate (HR), and electrodermal activity (EDA) were recorded. Psychophysiological features, such as skin-conductance response (SCR) and Bayevsky stress index; behavioral features, such as gaze and facial expressions; as well as linguistic and paralinguistic features, were automatically extracted. An exploratory analysis was conducted using correlation matrices to uncover the features that are most associated with attachment security. A confirmatory analysis was conducted by creating a single composite effects index and by testing it for correlations with attachment security. The stability of the theory-consistent features across three different stimuli sets was explored using repeated measures analysis of variances (ANOVAs). Results In total, 46 theory-consistent correlations were found during the exploration (out of 65 total significant correlations). For example, attachment security as measured by the AAP was correlated with positive facial expressions (r=.36, P=.01). AMMI’s security with the father was inversely correlated with the low frequency (LF) of HRV (r=−.87, P=.03). Attachment security to partners as measured by the AAQ was inversely correlated with anger facial expression (r=−.43, P=.001). The confirmatory analysis showed that the composite effects index was significantly correlated to security in the AAP (r=.26, P=.05) and the AAQ (r=.30, P=.04) but not in the AMMI. Repeated measures ANOVAs conducted individually on each of the theory-consistent features revealed that only 7 of the 46 (15%) features had significantly different values among responses to three different stimuli sets. Conclusions We were able to validate two of the instrument’s core assumptions: its capacity to measure attachment security and the viability of using themes as placeholders for rotating stimuli. Future validation of other of its dimensions, as well as the ongoing development of its scoring and classification algorithms is discussed. PMID:28385683
Electrostatics of cysteine residues in proteins: parameterization and validation of a simple model.

PubMed

Salsbury, Freddie R; Poole, Leslie B; Fetrow, Jacquelyn S

2012-11-01

One of the most popular and simple models for the calculation of pK(a) s from a protein structure is the semi-macroscopic electrostatic model MEAD. This model requires empirical parameters for each residue to calculate pK(a) s. Analysis of current, widely used empirical parameters for cysteine residues showed that they did not reproduce expected cysteine pK(a) s; thus, we set out to identify parameters consistent with the CHARMM27 force field that capture both the behavior of typical cysteines in proteins and the behavior of cysteines which have perturbed pK(a) s. The new parameters were validated in three ways: (1) calculation across a large set of typical cysteines in proteins (where the calculations are expected to reproduce expected ensemble behavior); (2) calculation across a set of perturbed cysteines in proteins (where the calculations are expected to reproduce the shifted ensemble behavior); and (3) comparison to experimentally determined pK(a) values (where the calculation should reproduce the pK(a) within experimental error). Both the general behavior of cysteines in proteins and the perturbed pK(a) in some proteins can be predicted reasonably well using the newly determined empirical parameters within the MEAD model for protein electrostatics. This study provides the first general analysis of the electrostatics of cysteines in proteins, with specific attention paid to capturing both the behavior of typical cysteines in a protein and the behavior of cysteines whose pK(a) should be shifted, and validation of force field parameters for cysteine residues. Copyright © 2012 Wiley Periodicals, Inc.
Situating Standard Setting within Argument-Based Validity

ERIC Educational Resources Information Center

Papageorgiou, Spiros; Tannenbaum, Richard J.

2016-01-01

Although there has been substantial work on argument-based approaches to validation as well as standard-setting methodologies, it might not always be clear how standard setting fits into argument-based validity. The purpose of this article is to address this lack in the literature, with a specific focus on topics related to argument-based…

Validity Of The Nintendo Wii Balance Board To Assess Weight Bearing Asymmetry During Sit-To-Stand And Return-To-Sit Task

PubMed Central

Abujaber, Sumayeh; Gillispie, Gregory; Marmon, Adam; Zeni, Joseph

2015-01-01

Weight bearing asymmetry is common in patients with unilateral lower limb musculoskeletal pathologies. The Nintendo Wii Balance Board (WBB) has been suggested as a low-cost and widely-available tool to measure weight bearing asymmetry in a clinical environment; however no study has evaluated the validity of this tool during dynamic tasks. Therefore, the purpose of this study was to determine the concurrent validity of force measurements acquired from the WBB as compared to laboratory force plates. Thirty-five individuals before, or within 1 year of total joint arthroplasty performed a sit-to-stand and return-to-sit task in two conditions. First, subjects performed the task with both feet placed on a single WBB. Second, the task was repeated with each foot placed on an individual laboratory force plate. Peak vertical ground reaction force (VGRF) under each foot and the inter-limb symmetry ratio were calculated. Validity was examined using Intraclass Correlation Coefficients (ICC), regression analysis, 95% limits of agreement and Bland-Altman plots. Force plates and the WBB exhibited excellent agreement for all outcome measurements (ICC =0.83–0.99). Bland-Altman plots showed no obvious relationship between the difference and the mean for the peak VGRF, but there was a consistent trend in which VGRF on the unaffected side was lower and VGRF on the affected side was higher when using the WBB. However, these consistent biases can be adjusted for by utilizing regression equations that estimate the force plate values based on the WBB force. The WBB may serve as a valid, suitable, and low-cost alternative to expensive, laboratory force plates for measuring weight bearing asymmetry in clinical settings. PMID:25715680
Validity of the Nintendo Wii Balance Board to assess weight bearing asymmetry during sit-to-stand and return-to-sit task.

PubMed

Abujaber, Sumayeh; Gillispie, Gregory; Marmon, Adam; Zeni, Joseph

2015-02-01

Weight bearing asymmetry is common in patients with unilateral lower limb musculoskeletal pathologies. The Nintendo Wii Balance Board (WBB) has been suggested as a low-cost and widely-available tool to measure weight bearing asymmetry in a clinical environment; however no study has evaluated the validity of this tool during dynamic tasks. Therefore, the purpose of this study was to determine the concurrent validity of force measurements acquired from the WBB as compared to laboratory force plates. Thirty-five individuals before, or within 1 year of total joint arthroplasty performed a sit-to-stand and return-to-sit task in two conditions. First, subjects performed the task with both feet placed on a single WBB. Second, the task was repeated with each foot placed on an individual laboratory force plate. Peak vertical ground reaction force (VGRF) under each foot and the inter-limb symmetry ratio were calculated. Validity was examined using Intraclass Correlation Coefficients (ICC), regression analysis, 95% limits of agreement and Bland-Altman plots. Force plates and the WBB exhibited excellent agreement for all outcome measurements (ICC=0.83-0.99). Bland-Altman plots showed no obvious relationship between the difference and the mean for the peak VGRF, but there was a consistent trend in which VGRF on the unaffected side was lower and VGRF on the affected side was higher when using the WBB. However, these consistent biases can be adjusted for by utilizing regression equations that estimate the force plate values based on the WBB force. The WBB may serve as a valid, suitable, and low-cost alternative to expensive, laboratory force plates for measuring weight bearing asymmetry in clinical settings. Copyright © 2015 Elsevier B.V. All rights reserved.
Reliability and validity of the Persian lower extremity functional scale (LEFS) in a heterogeneous sample of outpatients with lower limb musculoskeletal disorders.

PubMed

Negahban, Hossein; Hessam, Masumeh; Tabatabaei, Saeid; Salehi, Reza; Sohani, Soheil Mansour; Mehravar, Mohammad

2014-01-01

The aim was to culturally translate and validate the Persian lower extremity functional scale (LEFS) in a heterogeneous sample of outpatients with lower extremity musculoskeletal disorders (n = 304). This is a prospective methodological study. After a standard forward-backward translation, psychometric properties were assessed in terms of test-retest reliability, internal consistency, construct validity, dimensionality, and ceiling or floor effects. The acceptable level of intraclass correlation coefficient >0.70 and Cronbach's alpha coefficient >0.70 was obtained for the Persian LEFS. Correlations between Persian LEFS and Short-Form 36 Health Survey (SF-36) subscales of Physical Health component (rs range = 0.38-0.78) were higher than correlations between Persian LEFS and SF-36 subscales of Mental Health component (rs range = 0.15-0.39). A corrected item--total correlation of >0.40 (Spearman's rho) was obtained for all items of the Persian LEFS. Horn's parallel analysis detected a total of two factors. No ceiling or floor effects were detected for the Persian LEFS. The Persian version of the LEFS is a reliable and valid instrument that can be used to measure functional status in Persian-speaking patients with different musculoskeletal disorders of the lower extremity. Implications for Rehabilitation The Persian lower extremity functional scale (LEFS) is a reliable, internally consistent and valid instrument, with no ceiling or floor effects, to determine functional status of heterogeneous patients with musculoskeletal disorders of the lower extremity. The Persian version of the LEFS can be used in clinical and research settings to measure function in Iranian patients with different musculoskeletal disorders of the lower extremity.
Quality of life in patients with cognitive impairment: validation of the Quality of Life-Alzheimer's Disease scale in Portugal.

PubMed

Bárrios, Helena; Verdelho, Ana; Narciso, Sofia; Gonçalves-Pereira, Manuel; Logsdon, Rebecca; de Mendonça, Alexandre

2013-07-01

Quality of Life-Alzheimer's Disease (QOL-AD) is a widely used scale for the study of quality of life in patients with dementia. The aim of this study is the transcultural adaptation and validation of the QOL-AD scale in Portugal. Translation and transcultural adaptation was performed according to state-of-the-art recommendations. For the validation study, 104 patient/caregiver pairs were enrolled. Patients had mild cognitive impairment or mild-to-moderate dementia (due to Alzheimer's disease or vascular dementia). Participants were recruited in a dementia outpatient clinic setting and a long-term care dementia ward. An additional comparison group of 22 patients without cognitive impairment, and their proxies, was recruited in a family practice outpatient clinic. Sociodemographic information on patients and caregivers was obtained. Acceptability, reliability, and construct validity were analyzed. Internal consistency of the Portuguese version of QOL-AD was good for both patient and caregiver report (Cronbach's α = 0.867 and 0.858, respectively). Construct validity was confirmed by the correlation of patient reported QOL-AD with patient geriatric depression scale scores (ρ = -0.702, p < 0.001) and satisfaction with life scale scores (ρ = 0.543, p < 0.001). Caregiver ratings were correlated with neuropsychiatric inventory (NPI) total score (ρ = -0.404, p < 0.001), NPI-distress (ρ = -0.346, p < 0.001), and patient Mini-Mental State Examination (ρ = 0.319, p < 0.01). QOL-AD patient ratings were higher than caregiver ratings (p < 0.001). Both patient- and caregiver-rated QOL-AD scores were lower in patients with cognitive impairment than in the comparison group without cognitive impairment (p < 0.01). A Portuguese version of QOL-AD with consistent psychometric properties was obtained and is proposed as a useful tool for research and clinical purposes.
Development and evaluation of the Expressions of Moral Injury Scale-Military Version.

PubMed

Currier, Joseph M; Farnsworth, Jacob K; Drescher, Kent D; McDermott, Ryon C; Sims, Brook M; Albright, David L

2018-05-01

There is consensus that military personnel can encounter a far more diverse set of challenges than researchers and clinicians have historically appreciated. Moral injury (MI) represents an emerging construct to capture behavioural, social, and spiritual suffering that may transcend and overlap with mental health diagnoses (e.g., post-traumatic stress disorder and major depressive disorder). The Expressions of Moral Injury Scale-Military Version (EMIS-M) was developed to provide a reliable and valid means for assessing the warning signs of a MI in military populations. Drawing on independent samples of veterans who had served in a war-zone environment, factor analytic results revealed 2 distinct factors related to MI expressions directed at both self (9 items) and others (8 items). These subscales generated excellent internal consistency and temporal stability over a 6-month period. When compared to measures of post-traumatic stress disorder, major depressive disorder, and other theoretically relevant constructs (e.g., forgiveness, social support, moral emotions, and combat exposure), EMIS-M scores demonstrated strong convergent, divergent, and incremental validity. In addition, although structural equation modelling findings supported a possible general MI factor in Study 2, the patterns of associations for self- and other-directed expressions yielded evidence for differential validity with varying forms of forgiveness and combat exposure. As such, the EMIS-M provides a face valid, psychometrically validated tool for assessing expressions of apparent MI subtypes in research and clinical settings. Looking ahead, the EMIS-M will hopefully advance the scientific understanding of MI while supporting innovation for clinicians to tailor evidence-based treatments and/or develop novel approaches for addressing MI in their work. Copyright © 2017 John Wiley & Sons, Ltd.
Psychometric evaluation of the Swedish language Person-centred Climate Questionnaire-family version.

PubMed

Lindahl, Jeanette; Elmqvist, Carina; Thulesius, Hans; Edvardsson, David

2015-12-01

In a holistic view of care, the family is important for the patient as well as for the staff and integration of family members in health care is a growing trend. Yet, family participation in the care is sparsely investigated and valid assessment instruments are needed. Data were collected from 200 family members participating in an intervention study at an emergency department (ED) in Sweden. The Person-centred Climate Questionnaire-Family (PCQ-F) is a measure for how family members perceive the psychosocial climate. PCQ-F is a self-report instrument that contains 17 items assessing safety, everydayness and hospitality--three subscale dimensions that mirror the Swedish patient version of the questionnaire, the PCQ-P. The aim of this study was to evaluate the psychometric properties of the Swedish version of the PCQ-F in an ED context. The psychometric properties of the PCQ-F were evaluated using statistical estimates of validity and reliability and showed high content validity and internal consistency. Cronbach's Alpha was >0.7 and item-total correlations were >0.3 and <0.7. In terms of psychometrics, the findings in this study indicate that the PCQ-F can be used with satisfactory validity and reliability to explore to what degree family members perceive ED settings as being person-centred, safe, welcoming and hospitable within an everyday and decorated physical environment. As the PCQ already exists in a valid and reliable patient (PCQ-P) and staff (PCQ-S) version, this new family member version is a significant addition to the literature as it enables further comparative studies of how diverse care settings are perceived by different stakeholders. © 2015 Nordic College of Caring Science.
The adaptation of the Sheffield Profile for Assessment and Referral for Care (SPARC) to the Polish clinical setting for needs assessment of advanced cancer patients.

PubMed

Leppert, Wojciech; Majkowicz, Mikolaj; Ahmedzai, Sam H

2012-12-01

Assessment of the needs of advanced cancer patients is a very important issue in palliative care. The aim of the study was to adapt the Sheffield Profile for Assessment and Referral for Care (SPARC) to the Polish environment and evaluate its usefulness in needs assessment of patients with advanced cancer. A forward-back translation of the SPARC to Polish was done. The SPARC was used once in 58 consecutive patients with advanced cancer during follow-up. The patients were enrolled from a palliative care unit (25 patients), home care (18 patients), and a day care center (15 patients). The reliability was evaluated by establishing the internal consistency using Cronbach's alpha coefficients. Content validity was analyzed in accordance with the theories of needs by Murray and Maslow as a nonstatistical method of validity assessment. Factor analysis with principal components extraction and varimax rotation of raw data was used to reduce the set of data and assess the construct validity. There were differences regarding religious and spiritual issues and independence and activity between patients in the palliative care unit (worse results) and those at the day care center (better scores). Communication and need for more information items were associated with psychological, social, spiritual, and treatment issues. Cronbach's alpha coefficients and factor analysis demonstrated, respectively, satisfactory reliability and construct validity of the tool. The study demonstrated that the Polish version of the SPARC is a valid and reliable tool recommended for the needs assessment and symptom evaluation of patients with advanced cancer. Copyright © 2012 U.S. Cancer Pain Relief Committee. Published by Elsevier Inc. All rights reserved.
RNA-seq reveals more consistent reference genes for gene expression studies in human non-melanoma skin cancers

PubMed Central

Tan, Jean-Marie; Payne, Elizabeth J.; Lin, Lynlee L.; Sinnya, Sudipta; Raphael, Anthony P.; Lambie, Duncan; Frazer, Ian H.; Dinger, Marcel E.; Soyer, H. Peter

2017-01-01

Identification of appropriate reference genes (RGs) is critical to accurate data interpretation in quantitative real-time PCR (qPCR) experiments. In this study, we have utilised next generation RNA sequencing (RNA-seq) to analyse the transcriptome of a panel of non-melanoma skin cancer lesions, identifying genes that are consistently expressed across all samples. Genes encoding ribosomal proteins were amongst the most stable in this dataset. Validation of this RNA-seq data was examined using qPCR to confirm the suitability of a set of highly stable genes for use as qPCR RGs. These genes will provide a valuable resource for the normalisation of qPCR data for the analysis of non-melanoma skin cancer. PMID:28852586
Neutron Reference Benchmark Field Specification: ACRR Free-Field Environment (ACRR-FF-CC-32-CL).

DOE Office of Scientific and Technical Information (OSTI.GOV)

Vega, Richard Manuel; Parma, Edward J.; Griffin, Patrick J.

2015-07-01

This report was put together to support the International Atomic Energy Agency (IAEA) REAL- 2016 activity to validate the dosimetry community’s ability to use a consistent set of activation data and to derive consistent spectral characterizations. The report captures details of integral measurements taken in the Annular Core Research Reactor (ACRR) central cavity free-field reference neutron benchmark field. The field is described and an “a priori” calculated neutron spectrum is reported, based on MCNP6 calculations, and a subject matter expert (SME) based covariance matrix is given for this “a priori” spectrum. The results of 31 integral dosimetry measurements in themore » neutron field are reported.« less
Innovative learning model for improving students’ argumentation skill and concept understanding on science

NASA Astrophysics Data System (ADS)

Nafsiati Astuti, Rini

2018-04-01

Argumentation skill is the ability to compose and maintain arguments consisting of claims, supports for evidence, and strengthened-reasons. Argumentation is an important skill student needs to face the challenges of globalization in the 21st century. It is not an ability that can be developed by itself along with the physical development of human, but it must be developed under nerve like process, giving stimulus so as to require a person to be able to argue. Therefore, teachers should develop students’ skill of arguing in science learning in the classroom. The purpose of this study is to obtain an innovative learning model that are valid in terms of content and construct in improving the skills of argumentation and concept understanding of junior high school students. The assessment of content validity and construct validity was done through Focus Group Discussion (FGD), using the content and construct validation sheet, book model, learning video, and a set of learning aids for one meeting. Assessment results from 3 (three) experts showed that the learning model developed in the category was valid. The validity itself shows that the developed learning model has met the content requirement, the student needs, state of the art, strong theoretical and empirical foundation and construct validity, which has a connection of syntax stages and components of learning model so that it can be applied in the classroom activities
Goal setting as an outcome measure: A systematic review.

PubMed

Hurn, Jane; Kneebone, Ian; Cropley, Mark

2006-09-01

Goal achievement has been considered to be an important measure of outcome by clinicians working with patients in physical and neurological rehabilitation settings. This systematic review was undertaken to examine the reliability, validity and sensitivity of goal setting and goal attainment scaling approaches when used with working age and older people. To review the reliability, validity and sensitivity of both goal setting and goal attainment scaling when employed as an outcome measure within a physical and neurological working age and older person rehabilitation environment, by examining the research literature covering the 36 years since goal-setting theory was proposed. Data sources included a computer-aided literature search of published studies examining the reliability, validity and sensitivity of goal setting/goal attainment scaling, with further references sourced from articles obtained through this process. There is strong evidence for the reliability, validity and sensitivity of goal attainment scaling. Empirical support was found for the validity of goal setting but research demonstrating its reliability and sensitivity is limited. Goal attainment scaling appears to be a sound measure for use in physical rehabilitation settings with working age and older people. Further work needs to be carried out with goal setting to establish its reliability and sensitivity as a measurement tool.
Design and validation of a comprehensive fecal incontinence questionnaire.

PubMed

Macmillan, Alexandra K; Merrie, Arend E H; Marshall, Roger J; Parry, Bryan R

2008-10-01

Fecal incontinence can have a profound effect on quality of life. Its prevalence remains uncertain because of stigma, lack of consistent definition, and dearth of validated measures. This study was designed to develop a valid clinical and epidemiologic questionnaire, building on current literature and expertise. Patients and experts undertook face validity testing. Construct validity, criterion validity, and test-retest reliability was undertaken. Construct validity comprised factor analysis and internal consistency of the quality of life scale. The validity of known groups was tested against 77 control subjects by using regression models. Questionnaire results were compared with a stool diary for criterion validity. Test-retest reliability was calculated from repeated questionnaire completion. The questionnaire achieved good face validity. It was completed by 104 patients. The quality of life scale had four underlying traits (factor analysis) and high internal consistency (overall Cronbach alpha = 0.97). Patients and control subjects answered the questionnaire significantly differently (P < 0.01) in known-groups validity testing. Criterion validity assessment found mean differences close to zero. Median reliability for the whole questionnaire was 0.79 (range, 0.35-1). This questionnaire compares favorably with other available instruments, although the interpretation of stool consistency requires further research. Its sensitivity to treatment still needs to be investigated.
MABAL: a Novel Deep-Learning Architecture for Machine-Assisted Bone Age Labeling.

PubMed

Mutasa, Simukayi; Chang, Peter D; Ruzal-Shapiro, Carrie; Ayyala, Rama

2018-02-05

Bone age assessment (BAA) is a commonly performed diagnostic study in pediatric radiology to assess skeletal maturity. The most commonly utilized method for assessment of BAA is the Greulich and Pyle method (Pediatr Radiol 46.9:1269-1274, 2016; Arch Dis Child 81.2:172-173, 1999) atlas. The evaluation of BAA can be a tedious and time-consuming process for the radiologist. As such, several computer-assisted detection/diagnosis (CAD) methods have been proposed for automation of BAA. Classical CAD tools have traditionally relied on hard-coded algorithmic features for BAA which suffer from a variety of drawbacks. Recently, the advent and proliferation of convolutional neural networks (CNNs) has shown promise in a variety of medical imaging applications. There have been at least two published applications of using deep learning for evaluation of bone age (Med Image Anal 36:41-51, 2017; JDI 1-5, 2017). However, current implementations are limited by a combination of both architecture design and relatively small datasets. The purpose of this study is to demonstrate the benefits of a customized neural network algorithm carefully calibrated to the evaluation of bone age utilizing a relatively large institutional dataset. In doing so, this study will aim to show that advanced architectures can be successfully trained from scratch in the medical imaging domain and can generate results that outperform any existing proposed algorithm. The training data consisted of 10,289 images of different skeletal age examinations, 8909 from the hospital Picture Archiving and Communication System at our institution and 1383 from the public Digital Hand Atlas Database. The data was separated into four cohorts, one each for male and female children above the age of 8, and one each for male and female children below the age of 10. The testing set consisted of 20 radiographs of each 1-year-age cohort from 0 to 1 years to 14-15+ years, half male and half female. The testing set included left-hand radiographs done for bone age assessment, trauma evaluation without significant findings, and skeletal surveys. A 14 hidden layer-customized neural network was designed for this study. The network included several state of the art techniques including residual-style connections, inception layers, and spatial transformer layers. Data augmentation was applied to the network inputs to prevent overfitting. A linear regression output was utilized. Mean square error was used as the network loss function and mean absolute error (MAE) was utilized as the primary performance metric. MAE accuracies on the validation and test sets for young females were 0.654 and 0.561 respectively. For older females, validation and test accuracies were 0.662 and 0.497 respectively. For young males, validation and test accuracies were 0.649 and 0.585 respectively. Finally, for older males, validation and test set accuracies were 0.581 and 0.501 respectively. The female cohorts were trained for 900 epochs each and the male cohorts were trained for 600 epochs. An eightfold cross-validation set was employed for hyperparameter tuning. Test error was obtained after training on a full data set with the selected hyperparameters. Using our proposed customized neural network architecture on our large available data, we achieved an aggregate validation and test set mean absolute errors of 0.637 and 0.536 respectively. To date, this is the best published performance on utilizing deep learning for bone age assessment. Our results support our initial hypothesis that customized, purpose-built neural networks provide improved performance over networks derived from pre-trained imaging data sets. We build on that initial work by showing that the addition of state-of-the-art techniques such as residual connections and inception architecture further improves prediction accuracy. This is important because the current assumption for use of residual and/or inception architectures is that a large pre-trained network is required for successful implementation given the relatively small datasets in medical imaging. Instead we show that a small, customized architecture incorporating advanced CNN strategies can indeed be trained from scratch, yielding significant improvements in algorithm accuracy. It should be noted that for all four cohorts, testing error outperformed validation error. One reason for this is that our ground truth for our test set was obtained by averaging two pediatric radiologist reads compared to our training data for which only a single read was used. This suggests that despite relatively noisy training data, the algorithm could successfully model the variation between observers and generate estimates that are close to the expected ground truth.
Validation of the Edinburgh Postnatal Depression Scale (EPDS) on the Thai–Myanmar border

PubMed Central

Ing, Harriet; Fellmeth, Gracia; White, Jitrachote; Stein, Alan; Simpson, Julie A; McGready, Rose

2017-01-01

Postnatal depression is common and may have severe consequences for women and their children. Locally validated screening tools are required to identify at-risk women in marginalised populations. The Edinburgh Postnatal Depression Scale (EPDS) is one of the most frequently used tools globally. This cross-sectional study assessed the validity and acceptability of the EPDS in Karen and Burmese among postpartum migrant and refugee women on the Thai–Myanmar border. The EPDS was administered to participants and results compared with a diagnostic interview. Local staff provided feedback on the acceptability of the EPDS through a focus group discussion. Results from 670 women showed high accuracy and reasonable internal consistency of the EPDS. However, acceptability to local staff was low, limiting the utility of the EPDS in this setting despite its good psychometrics. Further work is required to identify a tool that is acceptable and sensitive to cultural manifestations of depression in this vulnerable population. PMID:28699396
AFSS: athlete's foot severity score. A proposal and validation.

PubMed

Cohen, A D; Wolak, A; Alkan, M; Shalev, R; Vardy, D A

2002-04-01

We developed a simple scoring system to evaluate the severity of tinea pedis (Athlete's foot severity score, AFSS). The AFSS consists of a clinical evaluation, using a three-point scale, of erythema and scaling in the plantar and interdigital spaces of the feet, and counts of interdigital spaces involved. Each foot is evaluated separately. The validity of the AFSS was assessed in 224 soldiers of the Israel Defense Force using mycological cultures as the main outcome measure and subjective assessment of pruritus as the secondary outcome measure. Mycological examinations were performed in 106 patients who had clinical evidence of tinea pedis. AFSS was significantly associated with culture results (P<0.0001), as well as with the presence of pruritus (P=0.002), and pruritus scores (P=0.025). We conclude the AFSS is valid for the clinical evaluation of tinea pedis severity in military settings. The application of AFSS to civilian morbidity should be subjected to further evaluation. AFSS: Schweregrad-Beurteilung des Athletenfusses. Ein Vorschlag
Investigating the incremental validity of cognitive variables in early mathematics screening.

PubMed

Clarke, Ben; Shanley, Lina; Kosty, Derek; Baker, Scott K; Cary, Mari Strand; Fien, Hank; Smolkowski, Keith

2018-03-26

The purpose of this study was to investigate the incremental validity of a set of domain general cognitive measures added to a traditional screening battery of early numeracy measures. The sample consisted of 458 kindergarten students of whom 285 were designated as severely at-risk for mathematics difficulty. Hierarchical multiple regression results indicated that Wechsler Abbreviated Scales of Intelligence (WASI) Matrix Reasoning and Vocabulary subtests, and Digit Span Forward and Backward measures explained a small, but unique portion of the variance in kindergarten students' mathematics performance on the Test of Early Mathematics Ability-Third Edition (TEMA-3) when controlling for Early Numeracy Curriculum Based Measurement (EN-CBM) screening measures (R² change = .01). Furthermore, the incremental validity of the domain general cognitive measures was relatively stronger for the severely at-risk sample. We discuss results from the study in light of instructional decision-making and note the findings do not justify adding domain general cognitive assessments to mathematics screening batteries. (PsycINFO Database Record (c) 2018 APA, all rights reserved).
The development and validation of the Dieting Intentions Scale (DIS).

PubMed

Cruwys, Tegan; Platow, Michael J; Rieger, Elizabeth; Byrne, Don G

2013-03-01

This article presents information on the psychometric properties of the Dieting Intentions Scale (DIS), a new scale of dieting that predicts future behavioral efforts to lose weight. We begin by reviewing recent research indicating theoretical and empirical problems with traditional approaches to measuring dieting. The DIS addresses several of these problems by (a) focusing on naturalistic dieting behavior and (b) being future-oriented. Four validation studies are presented with a total of 741 participants. We demonstrate that the DIS has predictive utility for dieting behaviors and is positively correlated with other measures related to eating, weight, and shape. Furthermore, the DIS demonstrates discriminant validity by not being related to constructs such as self-esteem and social desirability. The DIS also has high internal consistency, with a 1-factor solution replicated with confirmatory factor analysis. The potential uses of the scale in both research and clinical settings are considered. PsycINFO Database Record (c) 2013 APA, all rights reserved.
An atomic model of brome mosaic virus using direct electron detection and real-space optimization.

PubMed

Wang, Zhao; Hryc, Corey F; Bammes, Benjamin; Afonine, Pavel V; Jakana, Joanita; Chen, Dong-Hua; Liu, Xiangan; Baker, Matthew L; Kao, Cheng; Ludtke, Steven J; Schmid, Michael F; Adams, Paul D; Chiu, Wah

2014-09-04

Advances in electron cryo-microscopy have enabled structure determination of macromolecules at near-atomic resolution. However, structure determination, even using de novo methods, remains susceptible to model bias and overfitting. Here we describe a complete workflow for data acquisition, image processing, all-atom modelling and validation of brome mosaic virus, an RNA virus. Data were collected with a direct electron detector in integrating mode and an exposure beyond the traditional radiation damage limit. The final density map has a resolution of 3.8 Å as assessed by two independent data sets and maps. We used the map to derive an all-atom model with a newly implemented real-space optimization protocol. The validity of the model was verified by its match with the density map and a previous model from X-ray crystallography, as well as the internal consistency of models from independent maps. This study demonstrates a practical approach to obtain a rigorously validated atomic resolution electron cryo-microscopy structure.
Emotional suppression and breast cancer: validation research on the Spanish Adaptation of the Courtauld Emotional Control Scale (CECS).

PubMed

Durá, Estrella; Andreu, Yolanda; Galdón, Maria José; Ibáñez, Elena; Pérez, Sandra; Ferrando, Maite; Murgui, Sergio; Martínez, Paula

2010-05-01

Emotional suppression has played an important role in the research on psychosocial factors related to cancer. It has been argued to be an important psychological factor predicting worse psychosocial adjustment in people with cancer and it may mediate health outcomes. The reference instrument in the research on emotional suppression is the Courtauld Emotional Control Scale (CECS). The present study analysed construct validity of a new Spanish adaptation of the CECS in a sample of 175 breast cancer patients. The results confirmed the proposal by Watson and Greer claiming that the CECS is composed of three subscales that measure different dimensions, but not independent, from emotional control. The present Spanish version of the CECS showed high internal consistency in each subseale as well as the total score. According to Derogatis (BSI-18) criteria, emotional suppression predicts clinically significant distress. In short, our results support the reliability, validity and utility of this Spanish adaptation of the CECS in clinical and research settings.
Validation of the Mobile Information Software Evaluation Tool (MISET) With Nursing Students.

PubMed

Secco, M Loretta; Furlong, Karen E; Doyle, Glynda; Bailey, Judy

2016-07-01

This study evaluated the Mobile Information Software Evaluation Tool (MISET) with a sample of Canadian undergraduate nursing students (N = 240). Psychometric analyses determined how well the MISET assessed the extent that nursing students find mobile device-based information resources useful and supportive of learning in the clinical and classroom settings. The MISET has a valid three-factor structure with high explained variance (74.7%). Internal consistency reliabilities were high for the MISET total (.90) and three subscales: Usefulness/Helpfulness, Information Literacy Support, and Use of Evidence-Based Sources (.87 to .94). Construct validity evidence included significantly higher mean total MISET, Helpfulness/Usefulness, and Information Literacy Support scores for senior students and those with higher computer competence. The MISET is a promising tool to evaluate mobile information technologies and information literacy support; however, longitudinal assessment of changes in scores over time would determine scale sensitivity and responsiveness. [J Nurs Educ. 2016;55(7):385-390.]. Copyright 2016, SLACK Incorporated.

Some links on this page may take you to non-federal websites. Their policies may differ from this site.