Ban, Jong-Wook; Emparanza, José Ignacio; Urreta, Iratxe; Burls, Amanda
2016-01-01
Background Many new clinical prediction rules are derived and validated. But the design and reporting quality of clinical prediction research has been less than optimal. We aimed to assess whether design characteristics of validation studies were associated with the overestimation of clinical prediction rules’ performance. We also aimed to evaluate whether validation studies clearly reported important methodological characteristics. Methods Electronic databases were searched for systematic reviews of clinical prediction rule studies published between 2006 and 2010. Data were extracted from the eligible validation studies included in the systematic reviews. A meta-analytic meta-epidemiological approach was used to assess the influence of design characteristics on predictive performance. From each validation study, it was assessed whether 7 design and 7 reporting characteristics were properly described. Results A total of 287 validation studies of clinical prediction rule were collected from 15 systematic reviews (31 meta-analyses). Validation studies using case-control design produced a summary diagnostic odds ratio (DOR) 2.2 times (95% CI: 1.2–4.3) larger than validation studies using cohort design and unclear design. When differential verification was used, the summary DOR was overestimated by twofold (95% CI: 1.2 -3.1) compared to complete, partial and unclear verification. The summary RDOR of validation studies with inadequate sample size was 1.9 (95% CI: 1.2 -3.1) compared to studies with adequate sample size. Study site, reliability, and clinical prediction rule was adequately described in 10.1%, 9.4%, and 7.0% of validation studies respectively. Conclusion Validation studies with design shortcomings may overestimate the performance of clinical prediction rules. The quality of reporting among studies validating clinical prediction rules needs to be improved. PMID:26730980
Ban, Jong-Wook; Emparanza, José Ignacio; Urreta, Iratxe; Burls, Amanda
2016-01-01
Many new clinical prediction rules are derived and validated. But the design and reporting quality of clinical prediction research has been less than optimal. We aimed to assess whether design characteristics of validation studies were associated with the overestimation of clinical prediction rules' performance. We also aimed to evaluate whether validation studies clearly reported important methodological characteristics. Electronic databases were searched for systematic reviews of clinical prediction rule studies published between 2006 and 2010. Data were extracted from the eligible validation studies included in the systematic reviews. A meta-analytic meta-epidemiological approach was used to assess the influence of design characteristics on predictive performance. From each validation study, it was assessed whether 7 design and 7 reporting characteristics were properly described. A total of 287 validation studies of clinical prediction rule were collected from 15 systematic reviews (31 meta-analyses). Validation studies using case-control design produced a summary diagnostic odds ratio (DOR) 2.2 times (95% CI: 1.2-4.3) larger than validation studies using cohort design and unclear design. When differential verification was used, the summary DOR was overestimated by twofold (95% CI: 1.2 -3.1) compared to complete, partial and unclear verification. The summary RDOR of validation studies with inadequate sample size was 1.9 (95% CI: 1.2 -3.1) compared to studies with adequate sample size. Study site, reliability, and clinical prediction rule was adequately described in 10.1%, 9.4%, and 7.0% of validation studies respectively. Validation studies with design shortcomings may overestimate the performance of clinical prediction rules. The quality of reporting among studies validating clinical prediction rules needs to be improved.
Clinical instruments: reliability and validity critical appraisal.
Brink, Yolandi; Louw, Quinette A
2012-12-01
RATIONALE, AIM AND OBJECTIVES: There is a lack of health care practitioners using objective clinical tools with sound psychometric properties. There is also a need for researchers to improve their reporting of the validity and reliability results of these clinical tools. Therefore, to promote the use of valid and reliable tools or tests for clinical evaluation, this paper reports on the development of a critical appraisal tool to assess the psychometric properties of objective clinical tools. A five-step process was followed to develop the new critical appraisal tool: (1) preliminary conceptual decisions; (2) defining key concepts; (3) item generation; (4) assessment of face validity; and (5) formulation of the final tool. The new critical appraisal tool consists of 13 items, of which five items relate to both validity and reliability studies, four items to validity studies only and four items to reliability studies. The 13 items could be scored as 'yes', 'no' or 'not applicable'. This critical appraisal tool will aid both the health care practitioner to critically appraise the relevant literature and researchers to improve the quality of reporting of the validity and reliability of objective clinical tools. © 2011 Blackwell Publishing Ltd.
Santani, Avni; Murrell, Jill; Funke, Birgit; Yu, Zhenming; Hegde, Madhuri; Mao, Rong; Ferreira-Gonzalez, Andrea; Voelkerding, Karl V; Weck, Karen E
2017-06-01
- The number of targeted next-generation sequencing (NGS) panels for genetic diseases offered by clinical laboratories is rapidly increasing. Before an NGS-based test is implemented in a clinical laboratory, appropriate validation studies are needed to determine the performance characteristics of the test. - To provide examples of assay design and validation of targeted NGS gene panels for the detection of germline variants associated with inherited disorders. - The approaches used by 2 clinical laboratories for the development and validation of targeted NGS gene panels are described. Important design and validation considerations are examined. - Clinical laboratories must validate performance specifications of each test prior to implementation. Test design specifications and validation data are provided, outlining important steps in validation of targeted NGS panels by clinical diagnostic laboratories.
ERIC Educational Resources Information Center
Shriver, Mark D.; Frerichs, Lynae J.; Williams, Melissa; Lancaster, Blake M.
2013-01-01
Direct observation is often considered the "gold standard" for assessing the function, frequency, and intensity of problem behavior. Currently, the literature investigating the construct validity of direct observation conducted in the clinic setting reveals conflicting results. Previous studies on the construct validity of clinic-based…
A call for change: clinical evaluation of student registered nurse anesthetists.
Collins, Shawn; Callahan, Margaret Faut
2014-02-01
The ability to integrate theory with practice is integral to a student's success. A common reason for attrition from a nurse anesthesia program is clinical issues. To document clinical competence, students are evaluated using various tools. For use of a clinical evaluation tool as possible evidence for a student's dismissal, an important psychometric property to ensure is instrument validity. Clinical evaluation instruments of nurse anesthesia programs are not standardized among programs, which suggests a lack of instrument validity. The lack of established validity of the instruments used to evaluate students' clinical progress brings into question their ability to detect a student who is truly in jeopardy of attrition. Given this possibility, clinical instrument validity warrants research to be fair to students and improve attrition rates based on valid data. This ex post facto study evaluated a 17-item clinical instrument tool to demonstrate the need for validity of clinical evaluation tools. It also compared clinical scores with scores on the National Certification Examination.
Muhamad, Zailani; Ramli, Ayiesah; Amat, Salleh
2015-05-01
The aim of this study was to determine the content validity, internal consistency, test-retest reliability and inter-rater reliability of the Clinical Competency Evaluation Instrument (CCEVI) in assessing the clinical performance of physiotherapy students. This study was carried out between June and September 2013 at University Kebangsaan Malaysia (UKM), Kuala Lumpur, Malaysia. A panel of 10 experts were identified to establish content validity by evaluating and rating each of the items used in the CCEVI with regards to their relevance in measuring students' clinical competency. A total of 50 UKM undergraduate physiotherapy students were assessed throughout their clinical placement to determine the construct validity of these items. The instrument's reliability was determined through a cross-sectional study involving a clinical performance assessment of 14 final-year undergraduate physiotherapy students. The content validity index of the entire CCEVI was 0.91, while the proportion of agreement on the content validity indices ranged from 0.83-1.00. The CCEVI construct validity was established with factor loading of ≥0.6, while internal consistency (Cronbach's alpha) overall was 0.97. Test-retest reliability of the CCEVI was confirmed with a Pearson's correlation range of 0.91-0.97 and an intraclass coefficient correlation range of 0.95-0.98. Inter-rater reliability of the CCEVI domains ranged from 0.59 to 0.97 on initial and subsequent assessments. This pilot study confirmed the content validity of the CCEVI. It showed high internal consistency, thereby providing evidence that the CCEVI has moderate to excellent inter-rater reliability. However, additional refinement in the wording of the CCEVI items, particularly in the domains of safety and documentation, is recommended to further improve the validity and reliability of the instrument.
ERIC Educational Resources Information Center
Daviss, W. Burleson; Birmaher, Boris; Melhem, Nadine A.; Axelson, David A.; Michaels, Shana M.; Brent, David A.
2006-01-01
Background: Previous measures of pediatric depression have shown inconsistent validity in groups with differing demographics, comorbid diagnoses, and clinic or non-clinic origins. The current study re-examines the criterion validity of child- and parent-versions of the Mood and Feelings Questionnaire (MFQ-C, MFQ-P) in a heterogeneous sample of…
Development and validation of a Clinical Assessment Tool for Nursing Education (CAT-NE).
Skúladóttir, Hafdís; Svavarsdóttir, Margrét Hrönn
2016-09-01
The aim of this study was to develop a valid assessment tool to guide clinical education and evaluate students' performance in clinical nursing education. The development of the Clinical Assessment Tool for Nursing Education (CAT-NE) was based on the theory of nursing as professional caring and the Bologna learning outcomes. Benson and Clark's four steps of instrument development and validation guided the development and assessment of the tool. A mixed-methods approach with individual structured cognitive interviewing and quantitative assessments was used to validate the tool. Supervisory teachers, a pedagogical consultant, clinical expert teachers, clinical teachers, and nursing students at the University of Akureyri in Iceland participated in the process. This assessment tool is valid to assess the clinical performance of nursing students; it consists of rubrics that list the criteria for the students' expected performance. According to the students and their clinical teachers, the assessment tool clarified learning objectives, enhanced the focus of the assessment process, and made evaluation more objective. Training clinical teachers on how to assess students' performances in clinical studies and use the tool enhanced the quality of clinical assessment in nursing education. Copyright © 2016 Elsevier Ltd. All rights reserved.
Boerboom, T B B; Dolmans, D H J M; Jaarsma, A D C; Muijtjens, A M M; Van Beukelen, P; Scherpbier, A J J A
2011-01-01
Feedback to aid teachers in improving their teaching requires validated evaluation instruments. When implementing an evaluation instrument in a different context, it is important to collect validity evidence from multiple sources. We examined the validity and reliability of the Maastricht Clinical Teaching Questionnaire (MCTQ) as an instrument to evaluate individual clinical teachers during short clinical rotations in veterinary education. We examined four sources of validity evidence: (1) Content was examined based on theory of effective learning. (2) Response process was explored in a pilot study. (3) Internal structure was assessed by confirmatory factor analysis using 1086 student evaluations and reliability was examined utilizing generalizability analysis. (4) Relations with other relevant variables were examined by comparing factor scores with other outcomes. Content validity was supported by theory underlying the cognitive apprenticeship model on which the instrument is based. The pilot study resulted in an additional question about supervision time. A five-factor model showed a good fit with the data. Acceptable reliability was achievable with 10-12 questionnaires per teacher. Correlations between the factors and overall teacher judgement were strong. The MCTQ appears to be a valid and reliable instrument to evaluate clinical teachers' performance during short rotations.
Deichmann Nielsen, Lea; Bech, Per; Hounsgaard, Lise; Alkier Gildberg, Frederik
2017-08-01
Unstructured risk assessment, as well as confounders (underlying reasons for the patient's risk behaviour and alliance), risk behaviour, and parameters of alliance, have been identified as factors that prolong the duration of mechanical restraint among forensic mental health inpatients. To clinically validate a new, structured short-term risk assessment instrument called the Mechanical Restraint-Confounders, Risk, Alliance Score (MR-CRAS), with the intended purpose of supporting the clinicians' observation and assessment of the patient's readiness to be released from mechanical restraint. The content and layout of MR-CRAS and its user manual were evaluated using face validation by forensic mental health clinicians, content validation by an expert panel, and pilot testing within two, closed forensic mental health inpatient units. The three sub-scales (Confounders, Risk, and a parameter of Alliance) showed excellent content validity. The clinical validations also showed that MR-CRAS was perceived and experienced as a comprehensible, relevant, comprehensive, and useable risk assessment instrument. MR-CRAS contains 18 clinically valid items, and the instrument can be used to support the clinical decision-making regarding the possibility of releasing the patient from mechanical restraint. The present three studies have clinically validated a short MR-CRAS scale that is currently being psychometrically tested in a larger study.
Steele, John C; Clark, Hadleigh J; Hong, Catherine H L; Jurge, Sabine; Muthukrishnan, Arvind; Kerr, A Ross; Wray, David; Prescott-Clements, Linda; Felix, David H; Sollecito, Thomas P
2015-08-01
To explore international consensus for the validation of clinical competencies for advanced training in Oral Medicine. An electronic survey of clinical competencies was designed. The survey was sent to and completed by identified international stakeholders during a 10-week period. To be validated, an individual competency had to achieve 90% or greater consensus to keep it in its current format. Stakeholders from 31 countries responded. High consensus agreement was achieved with 93 of 101 (92%) competencies exceeding the benchmark for agreement. Only 8 warranted further attention and were reviewed by a focus group. No additional competencies were suggested. This is the first international validated study of clinical competencies for advanced training in Oral Medicine. These validated clinical competencies could provide a model for countries developing an advanced training curriculum for Oral Medicine and also inform review of existing curricula. Copyright © 2015 Elsevier Inc. All rights reserved.
Springate, David A; Kontopantelis, Evangelos; Ashcroft, Darren M; Olier, Ivan; Parisi, Rosa; Chamapiwa, Edmore; Reeves, David
2014-01-01
Lists of clinical codes are the foundation for research undertaken using electronic medical records (EMRs). If clinical code lists are not available, reviewers are unable to determine the validity of research, full study replication is impossible, researchers are unable to make effective comparisons between studies, and the construction of new code lists is subject to much duplication of effort. Despite this, the publication of clinical codes is rarely if ever a requirement for obtaining grants, validating protocols, or publishing research. In a representative sample of 450 EMR primary research articles indexed on PubMed, we found that only 19 (5.1%) were accompanied by a full set of published clinical codes and 32 (8.6%) stated that code lists were available on request. To help address these problems, we have built an online repository where researchers using EMRs can upload and download lists of clinical codes. The repository will enable clinical researchers to better validate EMR studies, build on previous code lists and compare disease definitions across studies. It will also assist health informaticians in replicating database studies, tracking changes in disease definitions or clinical coding practice through time and sharing clinical code information across platforms and data sources as research objects.
Springate, David A.; Kontopantelis, Evangelos; Ashcroft, Darren M.; Olier, Ivan; Parisi, Rosa; Chamapiwa, Edmore; Reeves, David
2014-01-01
Lists of clinical codes are the foundation for research undertaken using electronic medical records (EMRs). If clinical code lists are not available, reviewers are unable to determine the validity of research, full study replication is impossible, researchers are unable to make effective comparisons between studies, and the construction of new code lists is subject to much duplication of effort. Despite this, the publication of clinical codes is rarely if ever a requirement for obtaining grants, validating protocols, or publishing research. In a representative sample of 450 EMR primary research articles indexed on PubMed, we found that only 19 (5.1%) were accompanied by a full set of published clinical codes and 32 (8.6%) stated that code lists were available on request. To help address these problems, we have built an online repository where researchers using EMRs can upload and download lists of clinical codes. The repository will enable clinical researchers to better validate EMR studies, build on previous code lists and compare disease definitions across studies. It will also assist health informaticians in replicating database studies, tracking changes in disease definitions or clinical coding practice through time and sharing clinical code information across platforms and data sources as research objects. PMID:24941260
Hickey, Graeme L; Blackstone, Eugene H
2016-08-01
Clinical risk-prediction models serve an important role in healthcare. They are used for clinical decision-making and measuring the performance of healthcare providers. To establish confidence in a model, external model validation is imperative. When designing such an external model validation study, thought must be given to patient selection, risk factor and outcome definitions, missing data, and the transparent reporting of the analysis. In addition, there are a number of statistical methods available for external model validation. Execution of a rigorous external validation study rests in proper study design, application of suitable statistical methods, and transparent reporting. Copyright © 2016 The American Association for Thoracic Surgery. Published by Elsevier Inc. All rights reserved.
McKown, Clark
2007-03-01
In this study, the validity of 5 tests of children's social-emotional cognition, defined as their encoding, memory, and interpretation of social information, was tested. Participants were 126 clinic-referred children between the ages of 5 and 17. All 5 tests were evaluated in terms of their (a) concurrent validity, (b) incremental validity, and (c) clinical usefulness in predicting social functioning. Tests included measures of nonverbal sensitivity, social language, and social problem solving. Criterion measures included parent and teacher report of social functioning. Analyses support the concurrent validity of all measures, and the incremental validity and clinical usefulness of tests of pragmatic language and problem solving.
Andrade, E L; Bento, A F; Cavalli, J; Oliveira, S K; Freitas, C S; Marcon, R; Schwanke, R C; Siqueira, J M; Calixto, J B
2016-10-24
This review presents a historical overview of drug discovery and the non-clinical stages of the drug development process, from initial target identification and validation, through in silico assays and high throughput screening (HTS), identification of leader molecules and their optimization, the selection of a candidate substance for clinical development, and the use of animal models during the early studies of proof-of-concept (or principle). This report also discusses the relevance of validated and predictive animal models selection, as well as the correct use of animal tests concerning the experimental design, execution and interpretation, which affect the reproducibility, quality and reliability of non-clinical studies necessary to translate to and support clinical studies. Collectively, improving these aspects will certainly contribute to the robustness of both scientific publications and the translation of new substances to clinical development.
Cook, Karon F; Jensen, Sally E; Schalet, Benjamin D; Beaumont, Jennifer L; Amtmann, Dagmar; Czajkowski, Susan; Dewalt, Darren A; Fries, James F; Pilkonis, Paul A; Reeve, Bryce B; Stone, Arthur A; Weinfurt, Kevin P; Cella, David
2016-05-01
To present an overview of a series of studies in which the clinical validity of the National Institutes of Health's Patient Reported Outcome Measurement Information System (NIH; PROMIS) measures was evaluated, by domain, across six clinical populations. Approximately 1,500 individuals at baseline and 1,300 at follow-up completed PROMIS measures. The analyses reported in this issue were conducted post hoc, pooling data across six previous studies, and accommodating the different designs of the six, within-condition, parent studies. Changes in T-scores, standardized response means, and effect sizes were calculated in each study. When a parent study design allowed, known groups validity was calculated using a linear mixed model. The results provide substantial support for the clinical validity of nine PROMIS measures in a range of chronic conditions. The cross-condition focus of the analyses provided a unique and multifaceted perspective on how PROMIS measures function in "real-world" clinical settings and provides external anchors that can support comparative effectiveness research. The current body of clinical validity evidence for the nine PROMIS measures indicates the success of NIH PROMIS in developing measures that are effective across a range of chronic conditions. Copyright © 2016 Elsevier Inc. All rights reserved.
Jansen, Marleen E; Rigter, T; Rodenburg, W; Fleur, T M C; Houwink, E J F; Weda, M; Cornel, Martina C
2017-01-01
Advances from pharmacogenetics (PGx) have not been implemented into health care to the expected extent. One gap that will be addressed in this study is a lack of reporting on clinical validity and clinical utility of PGx-tests. A systematic review of current reporting in scientific literature was conducted on publications addressing PGx in the context of statins and muscle toxicity. Eighty-nine publications were included and information was selected on reported measures of effect, arguments, and accompanying conclusions. Most authors report associations to quantify the relationship between a genetic variation an outcome, such as adverse drug responses. Conclusions on the implementation of a PGx-test are generally based on these associations, without explicit mention of other measures relevant to evaluate the test's clinical validity and clinical utility. To gain insight in the clinical impact and select useful tests, additional outcomes are needed to estimate the clinical validity and utility, such as cost-effectiveness.
Curcio, Cristiane Schumann Silva; Lucchetti, Giancarlo; Moreira-Almeida, Alexander
2015-04-01
Despite Brazil's high levels of religious involvement, there is a scarcity of validated religiousness/spirituality (R/S) measures in Portuguese, particularly multidimensional ones. This study presents the validation of the Portuguese version of the "Brief Multidimensional Measure in Religiousness and Spirituality" (BMMRS) within the Brazilian context. Inpatients (262) and caregivers (389) at two hospitals of Brazil answered the BMMRS, the DUREL-p, and a sociodemographic questionnaire. The internal and convergent validity and test-retest reliability for major dimensions were good. Discriminant validity was high (except for the Forgiveness dimension). The Portuguese version of the BMMRS is a reliable and valid instrument to assess multiple R/S dimensions in clinical and non-clinical samples.
Löfmark, Anna; Mårtensson, Gunilla
2017-03-01
The aim of the present study was to establish the validity of the tool Assessment of Clinical Education (AssCE). The tool is widely used in Sweden and some Nordic countries for assessing nursing students' performance in clinical education. It is important that the tools in use be subjected to regular audit and critical reviews. The validation process, performed in two stages, was concluded with a high level of congruence. In the first stage, Delphi technique was used to elaborate the AssCE tool using a group of 35 clinical nurse lecturers. After three rounds, we reached consensus. In the second stage, a group of 46 clinical nurse lecturers representing 12 universities in Sweden and Norway audited the revised version of the AssCE in relation to learning outcomes from the last clinical course at their respective institutions. Validation of the revised AssCE was established with high congruence between the factors in the AssCE and examined learning outcomes. The revised AssCE tool seems to meet its objective to be a validated assessment tool for use in clinical nursing education. Copyright © 2016 Elsevier Ltd. All rights reserved.
A Construct Validity Study of Clinical Competence: A Multitrait Multimethod Matrix Approach
ERIC Educational Resources Information Center
Baig, Lubna; Violato, Claudio; Crutcher, Rodney
2010-01-01
Introduction: The purpose of the study was to adduce evidence for estimating the construct validity of clinical competence measured through assessment instruments used for high-stakes examinations. Methods: Thirty-nine international physicians (mean age = 41 + 6.5 y) participated in high-stakes examination and 3-month supervised clinical practice…
Imaging biomarker roadmap for cancer studies
O’Connor, James P. B.; Aboagye, Eric O.; Adams, Judith E.; Aerts, Hugo J. W. L.; Barrington, Sally F.; Beer, Ambros J.; Boellaard, Ronald; Bohndiek, Sarah E.; Brady, Michael; Brown, Gina; Buckley, David L.; Chenevert, Thomas L.; Clarke, Laurence P.; Collette, Sandra; Cook, Gary J.; deSouza, Nandita M.; Dickson, John C.; Dive, Caroline; Evelhoch, Jeffrey L.; Faivre-Finn, Corinne; Gallagher, Ferdia A.; Gilbert, Fiona J.; Gillies, Robert J.; Goh, Vicky; Griffiths, John R.; Groves, Ashley M.; Halligan, Steve; Harris, Adrian L.; Hawkes, David J.; Hoekstra, Otto S.; Huang, Erich P.; Hutton, Brian F.; Jackson, Edward F.; Jayson, Gordon C.; Jones, Andrew; Koh, Dow-Mu; Lacombe, Denis; Lambin, Philippe; Lassau, Nathalie; Leach, Martin O.; Lee, Ting-Yim; Leen, Edward L.; Lewis, Jason S.; Liu, Yan; Lythgoe, Mark F.; Manoharan, Prakash; Maxwell, Ross J.; Miles, Kenneth A.; Morgan, Bruno; Morris, Steve; Ng, Tony; Padhani, Anwar R.; Parker, Geoff J. M.; Partridge, Mike; Pathak, Arvind P.; Peet, Andrew C.; Punwani, Shonit; Reynolds, Andrew R.; Robinson, Simon P.; Shankar, Lalitha K.; Sharma, Ricky A.; Soloviev, Dmitry; Stroobants, Sigrid; Sullivan, Daniel C.; Taylor, Stuart A.; Tofts, Paul S.; Tozer, Gillian M.; van Herk, Marcel; Walker-Samuel, Simon; Wason, James; Williams, Kaye J.; Workman, Paul; Yankeelov, Thomas E.; Brindle, Kevin M.; McShane, Lisa M.; Jackson, Alan; Waterton, John C.
2017-01-01
Imaging biomarkers (IBs) are integral to the routine management of patients with cancer. IBs used daily in oncology include clinical TNM stage, objective response and left ventricular ejection fraction. Other CT, MRI, PET and ultrasonography biomarkers are used extensively in cancer research and drug development. New IBs need to be established either as useful tools for testing research hypotheses in clinical trials and research studies, or as clinical decision-making tools for use in healthcare, by crossing ‘translational gaps’ through validation and qualification. Important differences exist between IBs and biospecimen-derived biomarkers and, therefore, the development of IBs requires a tailored ‘roadmap’. Recognizing this need, Cancer Research UK (CRUK) and the European Organisation for Research and Treatment of Cancer (EORTC) assembled experts to review, debate and summarize the challenges of IB validation and qualification. This consensus group has produced 14 key recommendations for accelerating the clinical translation of IBs, which highlight the role of parallel (rather than sequential) tracks of technical (assay) validation, biological/clinical validation and assessment of cost-effectiveness; the need for IB standardization and accreditation systems; the need to continually revisit IB precision; an alternative framework for biological/clinical validation of IBs; and the essential requirements for multicentre studies to qualify IBs for clinical use. PMID:27725679
Assessing Clinical Reasoning (ASCLIRE): Instrument Development and Validation
ERIC Educational Resources Information Center
Kunina-Habenicht, Olga; Hautz, Wolf E.; Knigge, Michel; Spies, Claudia; Ahlers, Olaf
2015-01-01
Clinical reasoning is an essential competency in medical education. This study aimed at developing and validating a test to assess diagnostic accuracy, collected information, and diagnostic decision time in clinical reasoning. A norm-referenced computer-based test for the assessment of clinical reasoning (ASCLIRE) was developed, integrating the…
Identification and Validation of Established and Novel Biomarkers for Infections in Burns
2017-10-01
in burn patients have been proposed, but not validated. In our four site study , we are enrolling severely burned adults and children , and...identify the early stages of infection prior to clinical detection. This multicenter study will enable us to identify novel biomarkers, validate whether...a multicenter study 3. Develop a model of prediction of infection using clinical data and proteomic information. Relevance: 5% of combat-sustained
2018-01-01
Artificial intelligence (AI) is projected to substantially influence clinical practice in the foreseeable future. However, despite the excitement around the technologies, it is yet rare to see examples of robust clinical validation of the technologies and, as a result, very few are currently in clinical use. A thorough, systematic validation of AI technologies using adequately designed clinical research studies before their integration into clinical practice is critical to ensure patient benefit and safety while avoiding any inadvertent harms. We would like to suggest several specific points regarding the role that peer-reviewed medical journals can play, in terms of study design, registration, and reporting, to help achieve proper and meaningful clinical validation of AI technologies designed to make medical diagnosis and prediction, focusing on the evaluation of diagnostic accuracy efficacy. Peer-reviewed medical journals can encourage investigators who wish to validate the performance of AI systems for medical diagnosis and prediction to pay closer attention to the factors listed in this article by emphasizing their importance. Thereby, peer-reviewed medical journals can ultimately facilitate translating the technological innovations into real-world practice while securing patient safety and benefit. PMID:29805337
Park, Seong Ho; Kressel, Herbert Y
2018-05-28
Artificial intelligence (AI) is projected to substantially influence clinical practice in the foreseeable future. However, despite the excitement around the technologies, it is yet rare to see examples of robust clinical validation of the technologies and, as a result, very few are currently in clinical use. A thorough, systematic validation of AI technologies using adequately designed clinical research studies before their integration into clinical practice is critical to ensure patient benefit and safety while avoiding any inadvertent harms. We would like to suggest several specific points regarding the role that peer-reviewed medical journals can play, in terms of study design, registration, and reporting, to help achieve proper and meaningful clinical validation of AI technologies designed to make medical diagnosis and prediction, focusing on the evaluation of diagnostic accuracy efficacy. Peer-reviewed medical journals can encourage investigators who wish to validate the performance of AI systems for medical diagnosis and prediction to pay closer attention to the factors listed in this article by emphasizing their importance. Thereby, peer-reviewed medical journals can ultimately facilitate translating the technological innovations into real-world practice while securing patient safety and benefit.
Mollayeva, Tatyana; Thurairajah, Pravheen; Burton, Kirsteen; Mollayeva, Shirin; Shapiro, Colin M; Colantonio, Angela
2016-02-01
This review appraises the process of development and the measurement properties of the Pittsburgh sleep quality index (PSQI), gauging its potential as a screening tool for sleep dysfunction in non-clinical and clinical samples; it also compares non-clinical and clinical populations in terms of PSQI scores. MEDLINE, Embase, PsycINFO, and HAPI databases were searched. Critical appraisal of studies of measurement properties was performed using COSMIN. Of 37 reviewed studies, 22 examined construct validity, 19 - known-group validity, 15 - internal consistency, and three - test-retest reliability. Study quality ranged from poor to excellent, with the majority designated fair. Internal consistency, based on Cronbach's alpha, was good. Discrepancies were observed in factor analytic studies. In non-clinical and clinical samples with known differences in sleep quality, the PSQI global scores and all subscale scores, with the exception of sleep disturbance, differed significantly. The best evidence synthesis for the PSQI showed strong reliability and validity, and moderate structural validity in a variety of samples, suggesting the tool fulfills its intended utility. A taxonometric analysis can contribute to better understanding of sleep dysfunction as either a dichotomous or continuous construct. Copyright © 2015 Elsevier Ltd. All rights reserved.
Tavoli, Azadeh; Melyani, Mahdiyeh; Bakhtiari, Maryam; Ghaedi, Gholam Hossein; Montazeri, Ali
2009-07-09
The Brief Fear of Negative Evaluation Scale (BFNE) is a commonly used instrument to measure social anxiety. This study aimed to translate and to test the reliability and validity of the BFNE in Iran. The English language version of the BFNE was translated into Persian (Iranian language) and was used in this study. The questionnaire was administered to a consecutive sample of 235 students with (n = 33, clinical group) and without social phobia (n = 202, non-clinical group). In addition to the BFNE, two standard instruments were used to measure social phobia severity: the Social Phobia Inventory (SPIN), and the Social Interaction Anxiety Scale (SIAS). All participants completed a brief background information questionnaire, the SPIN, the SIAS and the BFNE scales. Statistical analysis was performed to test the reliability and validity of the BFNE. In all 235 students were studied (111 male and 124 female). The mean age for non-clinical group was 22.2 (SD = 2.1) years and for clinical sample it was 22.4 (SD = 1.8) years. Cronbach's alpha coefficient (to test reliability) was acceptable for both non-clinical and clinical samples (alpha = 0.90 and 0.82 respectively). In addition, 3-week test-retest reliability was performed in non-clinical sample and the intraclass correlation coefficient (ICC) was quite high (ICC = 0.71). Validity as performed using convergent and discriminant validity showed satisfactory results. The questionnaire correlated well with established measures of social phobia such as the SPIN (r = 0.43, p < 0.001) and the SIAS (r = 0.54, p < 0.001). Also the BFNE discriminated well between men and women with and without social phobia in the expected direction. Factor analysis supported a two-factor solution corresponding to positive and reverse-worded items. This validation study of the Iranian version of BFNE proved that it is an acceptable, reliable and valid measure of social phobia. However, since the scale showed a two-factor structure and this does not confirm to the theoretical basis for the BFNE, thus we suggest the use of the BFNE-II when it becomes available in Iran. The validation study of the BFNE-II is in progress.
Johnco, Carly; Knight, Ashleigh; Tadic, Dusanka; Wuthrich, Viviana M
2015-07-01
The Geriatric Anxiety Inventory is a 20-item geriatric-specific measure of anxiety severity. While studies suggest good internal consistency and convergent validity, divergent validity from measures of depression are weak. Clinical cutoffs have been developed that vary across studies due to the small clinical samples used. A six-item short form (GAI-SF) has been developed, and while this scale is promising, the research assessing the psychometrics of this scale is limited. This study examined the psychometric properties of GAI and GAI-SF in a large sample of 197 clinical geriatric participants with a comorbid anxiety and unipolar mood disorder, and a non-clinical control sample (N = 59). The internal consistency and convergent validity with other measures of anxiety was adequate for GAI and GAI-SF. Divergent validity from depressive symptoms was good in the clinical sample but weak in the total and non-clinical samples. Divergent validity from cognitive functioning was good in all samples. The one-factor structure was replicated for both measures. Receiver Operating Characteristic analyses indicated that the GAI is more accurate at identifying clinical status than the GAI-SF, although the sensitivity and specificity for the recommended cutoffs was adequate for both measures. Both GAI and GAI-SF show good psychometric properties for identifying geriatric anxiety. The GAI-SF may be a useful alternative screening measure for identifying anxiety in older adults.
Liou, Shwu-Ru; Liu, Hsiu-Chen; Tsai, Hsiu-Min; Tsai, Ying-Huang; Lin, Yu-Ching; Chang, Chia-Hao; Cheng, Ching-Yu
2016-03-01
The purpose of the study was to develop and psychometrically test the Nurses Clinical Reasoning Scale. Clinical reasoning is an essential skill for providing safe and quality patient care. Identifying pre-graduates' and nurses' needs and designing training courses to improve their clinical reasoning competence becomes a critical task. However, there is no instrument focusing on clinical reasoning in the nursing profession. Cross-sectional design was used. This study included the development of the scale, a pilot study that preliminary tested the readability and reliability of the developed scale and a main study that implemented and tested the psychometric properties of the developed scale. The Nurses Clinical Reasoning Scale was developed based on the Clinical Reasoning Model. The scale includes 15 items using a Likert five-point scale. Data were collected from 2013-2014. Two hundred and fifty-one participants comprising clinical nurses and nursing pre-graduates completed and returned the questionnaires in the main study. The instrument was tested for internal consistency and test-retest reliability. Its validity was tested with content, construct and known-groups validity. One factor emerged from the factor analysis. The known-groups validity was confirmed. The Cronbach's alpha for the entire instrument was 0·9. The reliability and validity of the Nurses Clinical Reasoning Scale were supported. The scale is a useful tool and can be easily administered for the self-assessment of clinical reasoning competence of clinical nurses and future baccalaureate nursing graduates. Study limitations and further recommendations are discussed. © 2015 John Wiley & Sons Ltd.
Evidence for the Criterion Validity and Clinical Utility of the Pathological Narcissism Inventory
ERIC Educational Resources Information Center
Thomas, Katherine M.; Wright, Aidan G. C.; Lukowitsky, Mark R.; Donnellan, M. Brent; Hopwood, Christopher J.
2012-01-01
In this study, the authors evaluated aspects of criterion validity and clinical utility of the grandiosity and vulnerability components of the Pathological Narcissism Inventory (PNI) using two undergraduate samples (N = 299 and 500). Criterion validity was assessed by evaluating the correlations of narcissistic grandiosity and narcissistic…
Poljak, Mario; Oštrbenk, Anja
2013-01-01
Human papillomavirus (HPV) testing has become an essential part of current clinical practice in the management of cervical cancer and precancerous lesions. We reviewed the most important validation studies of a next-generation real-time polymerase chain reaction-based assay, the RealTime High Risk HPV test (RealTime)(Abbott Molecular, Des Plaines, IL, USA), for triage in referral population settings and for use in primary cervical cancer screening in women 30 years and older published in peer-reviewed journals from 2009 to 2013. RealTime is designed to detect 14 high-risk HPV genotypes with concurrent distinction of HPV-16 and HPV-18 from 12 other HPV genotypes. The test was launched on the European market in January 2009 and is currently used in many laboratories worldwide for routine detection of HPV. We concisely reviewed validation studies of a next-generation real-time polymerase chain reaction (PCR)-based assay: the Abbott RealTime High Risk HPV test. Eight validation studies of RealTime in referral settings showed its consistently high absolute clinical sensitivity for both CIN2+ (range 88.3-100%) and CIN3+ (range 93.0-100%), as well as comparative clinical sensitivity relative to the currently most widely used HPV test: the Qiagen/Digene Hybrid Capture 2 HPV DNA Test (HC2). Due to the significantly different composition of the referral populations, RealTime absolute clinical specificity for CIN2+ and CIN3+ varied greatly across studies, but was comparable relative to HC2. Four validation studies of RealTime performance in cervical cancer screening settings showed its consistently high absolute clinical sensitivity for both CIN2+ and CIN3+, as well as comparative clinical sensitivity and specificity relative to HC2 and GP5+/6+ PCR. RealTime has been extensively evaluated in the last 4 years. RealTime can be considered clinically validated for triage in referral population settings and for use in primary cervical cancer screening in women 30 years and older.
Development and Validation of a Photonumeric Scale for Assessment of Chin Retrusion.
Sykes, Jonathan M; Carruthers, Alastair; Hardas, Bhushan; Murphy, Diane K; Jones, Derek; Carruthers, Jean; Donofrio, Lisa; Creutz, Lela; Marx, Ann; Dill, Sara
2016-10-01
A validated scale is needed for objective and reproducible comparisons of chin appearance before and after chin augmentation in practice and clinical studies. To describe the development and validation of the 5-point photonumeric Allergan Chin Retrusion Scale. The Allergan Chin Retrusion Scale was developed to include an assessment guide, verbal descriptors, morphed images, and real subject images for each scale grade. The clinical significance of a 1-point score difference was evaluated in a review of multiple image pairs representing varying differences in severity. Interrater and intrarater reliability was evaluated in a live-subject validation study (N = 298) completed during 2 sessions occurring 3 weeks apart. A difference of ≥1 point on the scale was shown to reflect a clinically meaningful difference (mean [95% confidence interval] absolute score difference, 1.07 [0.94-1.20] for clinically different image pairs and 0.51 [0.39-0.63] for not clinically different pairs). Intrarater agreement between the 2 live-subject validation sessions was substantial (mean weighted kappa = 0.79). Interrater agreement was substantial during the second rating session (0.68, primary end point). The Allergan Chin Retrusion Scale is a validated and reliable scale for physician rating of severity of chin retrusion.
Bender, Amy M; Lawson, Doug; Werthner, Penny; Samuels, Charles H
2018-06-04
Previous research has established that general sleep screening questionnaires are not valid and reliable in an athlete population. The Athlete Sleep Screening Questionnaire (ASSQ) was developed to address this need. While the initial validation of the ASSQ has been established, the clinical validity of the ASSQ has yet to be determined. The main objective of the current study was to evaluate the clinical validity of the ASSQ. Canadian National Team athletes (N = 199; mean age 24.0 ± 4.2 years, 62% females; from 23 sports) completed the ASSQ. A subset of athletes (N = 46) were randomized to the clinical validation sub-study which required subjects to complete an ASSQ at times 2 and 3 and to have a clinical sleep interview by a sleep medicine physician (SMP) who rated each subjects' category of clinical sleep problem and provided recommendations to improve sleep. To assess clinical validity, the SMP category of clinical sleep problem was compared to the ASSQ. The internal consistency (Cronbach's alpha = 0.74) and test-retest reliability (r = 0.86) of the ASSQ were acceptable. The ASSQ demonstrated good agreement with the SMP (Cohen's kappa = 0.84) which yielded a diagnostic sensitivity of 81%, specificity of 93%, positive predictive value of 87%, and negative predictive value of 90%. There were 25.1% of athletes identified to have clinically relevant sleep disturbances that required further clinical sleep assessment. Sleep improved from time 1 at baseline to after the recommendations at time 3. Sleep screening athletes with the ASSQ provides a method of accurately determining which athletes would benefit from preventative measures and which athletes suffer from clinically significant sleep problems. The process of sleep screening athletes and providing recommendations improves sleep and offers a clinical intervention output that is simple and efficient for teams and athletes to implement.
Sharland, Michael J; Waring, Stephen C; Johnson, Brian P; Taran, Allise M; Rusin, Travis A; Pattock, Andrew M; Palcher, Jeanette A
2018-01-01
Assessing test performance validity is a standard clinical practice and although studies have examined the utility of cognitive/memory measures, few have examined attention measures as indicators of performance validity beyond the Reliable Digit Span. The current study further investigates the classification probability of embedded Performance Validity Tests (PVTs) within the Brief Test of Attention (BTA) and the Conners' Continuous Performance Test (CPT-II), in a large clinical sample. This was a retrospective study of 615 patients consecutively referred for comprehensive outpatient neuropsychological evaluation. Non-credible performance was defined two ways: failure on one or more PVTs and failure on two or more PVTs. Classification probability of the BTA and CPT-II into non-credible groups was assessed. Sensitivity, specificity, positive predictive value, and negative predictive value were derived to identify clinically relevant cut-off scores. When using failure on two or more PVTs as the indicator for non-credible responding compared to failure on one or more PVTs, highest classification probability, or area under the curve (AUC), was achieved by the BTA (AUC = .87 vs. .79). CPT-II Omission, Commission, and Total Errors exhibited higher classification probability as well. Overall, these findings corroborate previous findings, extending them to a large clinical sample. BTA and CPT-II are useful embedded performance validity indicators within a clinical battery but should not be used in isolation without other performance validity indicators.
Liaw, Sok Ying; Rashasegaran, Ahtherai; Wong, Lai Fun; Deneen, Christopher Charles; Cooper, Simon; Levett-Jones, Tracy; Goh, Hongli Sam; Ignacio, Jeanette
2018-03-01
The development of clinical reasoning skills in recognising and responding to clinical deterioration is essential in pre-registration nursing education. Simulation has been increasingly used by educators to develop this skill. To develop and evaluate the psychometric properties of a Clinical Reasoning Evaluation Simulation Tool (CREST) for measuring clinical reasoning skills in recognising and responding to clinical deterioration in a simulated environment. A scale development with psychometric testing and mixed methods study. Nursing students and academic staff were recruited at a university. A three-phase prospective study was conducted. Phase 1 involved the development and content validation of the CREST; Phase 2 included the psychometric testing of the tool with 15 second-year and 15 third-year nursing students who undertook the simulation-based assessment; Phase 3 involved the usability testing of the tool with nine academic staff through a survey questionnaire and focus group discussion. A 10-item CREST was developed based on a model of clinical reasoning. A content validity of 0.93 was obtained from the validation of 15 international experts. The construct validity was supported as the third-year students demonstrated significantly higher (p<0.001) clinical reasoning scores than the second-year students. The concurrent validity was also supported with significant positive correlations between global rating scores and almost all subscale scores, and the total scores. The predictive validity was supported with an existing tool. The internal consistency was high with a Cronbach's alpha of 0.92. A high inter-rater reliability was demonstrated with an intraclass correlation coefficient of 0.88. The usability of the tool was rated positively by the nurse educators but the need to ease the scoring process was highlighted. A valid and reliable tool was developed to measure the effectiveness of simulation in developing clinical reasoning skills for recognising and responding to clinical deterioration. Copyright © 2017. Published by Elsevier Ltd.
Braido, Fulvio; Santus, Pierachille; Corsico, Angelo Guido; Di Marco, Fabiano; Melioli, Giovanni; Scichilone, Nicola; Solidoro, Paolo
2018-01-01
The purposes of this study were development and validation of an expert system (ES) aimed at supporting the diagnosis of chronic obstructive lung disease (COLD). A questionnaire and a WebFlex code were developed and validated in silico. An expert panel pilot validation on 60 cases and a clinical validation on 241 cases were performed. The developed questionnaire and code validated in silico resulted in a suitable tool to support the medical diagnosis. The clinical validation of the ES was performed in an academic setting that included six different reference centers for respiratory diseases. The results of the ES expressed as a score associated with the risk of suffering from COLD were matched and compared with the final clinical diagnoses. A set of 60 patients were evaluated by a pilot expert panel validation with the aim of calculating the sample size for the clinical validation study. The concordance analysis between these preliminary ES scores and diagnoses performed by the experts indicated that the accuracy was 94.7% when both experts and the system confirmed the COLD diagnosis and 86.3% when COLD was excluded. Based on these results, the sample size of the validation set was established in 240 patients. The clinical validation, performed on 241 patients, resulted in ES accuracy of 97.5%, with confirmed COLD diagnosis in 53.6% of the cases and excluded COLD diagnosis in 32% of the cases. In 11.2% of cases, a diagnosis of COLD was made by the experts, although the imaging results showed a potential concomitant disorder. The ES presented here (COLD ES ) is a safe and robust supporting tool for COLD diagnosis in primary care settings.
Tavoli, Azadeh; Melyani, Mahdiyeh; Bakhtiari, Maryam; Ghaedi, Gholam Hossein; Montazeri, Ali
2009-01-01
Background The Brief Fear of Negative Evaluation Scale (BFNE) is a commonly used instrument to measure social anxiety. This study aimed to translate and to test the reliability and validity of the BFNE in Iran. Methods The English language version of the BFNE was translated into Persian (Iranian language) and was used in this study. The questionnaire was administered to a consecutive sample of 235 students with (n = 33, clinical group) and without social phobia (n = 202, non-clinical group). In addition to the BFNE, two standard instruments were used to measure social phobia severity: the Social Phobia Inventory (SPIN), and the Social Interaction Anxiety Scale (SIAS). All participants completed a brief background information questionnaire, the SPIN, the SIAS and the BFNE scales. Statistical analysis was performed to test the reliability and validity of the BFNE. Results In all 235 students were studied (111 male and 124 female). The mean age for non-clinical group was 22.2 (SD = 2.1) years and for clinical sample it was 22.4 (SD = 1.8) years. Cronbach's alpha coefficient (to test reliability) was acceptable for both non-clinical and clinical samples (α = 0.90 and 0.82 respectively). In addition, 3-week test-retest reliability was performed in non-clinical sample and the intraclass correlation coefficient (ICC) was quite high (ICC = 0.71). Validity as performed using convergent and discriminant validity showed satisfactory results. The questionnaire correlated well with established measures of social phobia such as the SPIN (r = 0.43, p < 0.001) and the SIAS (r = 0.54, p < 0.001). Also the BFNE discriminated well between men and women with and without social phobia in the expected direction. Factor analysis supported a two-factor solution corresponding to positive and reverse-worded items. Conclusion This validation study of the Iranian version of BFNE proved that it is an acceptable, reliable and valid measure of social phobia. However, since the scale showed a two-factor structure and this does not confirm to the theoretical basis for the BFNE, thus we suggest the use of the BFNE-II when it becomes available in Iran. The validation study of the BFNE-II is in progress. PMID:19589161
Development and Validation of a Photonumeric Scale for Evaluation of Volume Deficit of the Hand
Donofrio, Lisa; Hardas, Bhushan; Murphy, Diane K.; Carruthers, Jean; Carruthers, Alastair; Sykes, Jonathan M.; Creutz, Lela; Marx, Ann; Dill, Sara
2016-01-01
BACKGROUND A validated scale is needed for objective and reproducible comparisons of hand appearance before and after treatment in practice and clinical studies. OBJECTIVE To describe the development and validation of the 5-point photonumeric Allergan Hand Volume Deficit Scale. METHODS The scale was developed to include an assessment guide, verbal descriptors, morphed images, and real-subject images for each grade. The clinical significance of a 1-point score difference was evaluated in a review of image pairs representing varying differences in severity. Interrater and intrarater reliability was evaluated in a live-subject validation study (N = 296) completed during 2 sessions occurring 3 weeks apart. RESULTS A score difference of ≥1 point was shown to reflect a clinically significant difference (mean [95% confidence interval] absolute score difference, 1.12 [0.99–1.26] for clinically different image pairs and 0.45 [0.33–0.57] for not clinically different pairs). Intrarater agreement between the 2 validation sessions was almost perfect (mean weighted kappa = 0.83). Interrater agreement was almost perfect during the second session (0.82, primary end point). CONCLUSION The Allergan Hand Volume Deficit Scale is a validated and reliable scale for physician rating of hand volume deficit. PMID:27661741
Development and Validation of a Photonumeric Scale for Evaluation of Facial Skin Texture
Carruthers, Alastair; Hardas, Bhushan; Murphy, Diane K.; Carruthers, Jean; Jones, Derek; Sykes, Jonathan M.; Creutz, Lela; Marx, Ann; Dill, Sara
2016-01-01
BACKGROUND A validated scale is needed for objective and reproducible comparisons of facial skin roughness before and after aesthetic treatment in practice and in clinical studies. OBJECTIVE To describe the development and validation of the 5-point photonumeric Allergan Skin Roughness Scale. METHODS The scale was developed to include an assessment guide, verbal descriptors, morphed images, and real subject images for each grade. The clinical significance of a 1-point score difference was evaluated in a review of image pairs representing varying differences in severity. Interrater and intrarater reliability was evaluated in a live-subject validation study (N = 290) completed during 2 sessions occurring 3 weeks apart. RESULTS A score difference of ≥1 point was shown to reflect a clinically meaningful difference (mean [95% confidence interval] absolute score difference 1.09 [0.96–1.23] for clinically different image pairs and 0.53 [0.38–0.67] for not clinically different pairs). Intrarater agreement between the 2 validation sessions was almost perfect (weighted kappa = 0.83). Interrater agreement was almost perfect during the second rating session (0.81, primary end point). CONCLUSION The Allergan Skin Roughness Scale is a validated and reliable scale for physician rating of midface skin roughness. PMID:27661744
Development and Validation of a Photonumeric Scale for Evaluation of Volume Deficit of the Temple
Jones, Derek; Hardas, Bhushan; Murphy, Diane K.; Donofrio, Lisa; Sykes, Jonathan M.; Carruthers, Alastair; Creutz, Lela; Marx, Ann; Dill, Sara
2016-01-01
BACKGROUND A validated scale is needed for objective and reproducible comparisons of temple appearance before and after aesthetic treatment in practice and clinical studies. OBJECTIVE To describe the development and validation of the 5-point photonumeric Allergan Temple Hollowing Scale. METHODS The scale was developed to include an assessment guide, verbal descriptors, morphed images, and real subject images for each grade. The clinical significance of a 1-point score difference was evaluated in a review of image pairs representing varying differences in severity. Interrater and intrarater reliability was evaluated in a live-subject validation study (N = 298) completed during 2 sessions occurring 3 weeks apart. RESULTS A score difference of ≥1 point was shown to reflect a clinically significant difference (mean [95% confidence interval] absolute score difference, 1.1 [0.94–1.26] for clinically different image pairs and 0.67 [0.51–0.83] for not clinically different pairs). Intrarater agreement between the 2 validation sessions was almost perfect (mean weighted kappa = 0.86). Interrater agreement was almost perfect during the second session (0.81, primary endpoint). CONCLUSION The Allergan Temple Hollowing Scale is a validated and reliable scale for physician rating of temple volume deficit. PMID:27661742
The Chinese Version of the Self-Report Family Inventory: Reliability and Validity.
ERIC Educational Resources Information Center
Shek, Daniel T. L.; Lai, Kelly Y. C.
2001-01-01
Reliability and validity of Chinese Self-Report Family Inventory (C-SFI) were examined in three studies. Study 1 showed C-SFI was temporally stable and internally consistent. Study 2 indicated C-SFI could discriminate between clinical and nonclinical groups. Study 3 gave support for internal consistency, concurrent validity and construct validity.…
ERIC Educational Resources Information Center
Graham, Roseanna
2010-01-01
This study evaluated the reliability, validity, and educational usefulness of a comprehensive, multidisciplinary Objective Structured Clinical Examination (OSCE) in dental education. The OSCE was administered to dental students at the Columbia University College of Dental Medicine (CDM) before they entered clinical training. Participants in this…
Wu, Xi Vivien; Enskär, Karin; Pua, Lay Hoon; Heng, Doreen Gek Noi; Wang, Wenru
2016-09-22
A major focus in nursing education is on the judgement of clinical performance, and it is a complex process due to the diverse nature of nursing practice. A holistic approach in assessment of competency is advocated. Difficulties in the development of valid and reliable assessment measures in nursing competency have resulted in the development of assessment instruments with an increase in face and content validity, but few studies have tested these instruments psychometrically. It is essential to develop a holistic assessment tool to meet the needs of the clinical education. The study aims to develop a Holistic Clinical Assessment Tool (HCAT) and test its psychometric properties. The HCAT was developed based on the systematic literature review and the findings of qualitative studies. An expert panel was invited to evaluate the content validity of the tool. A total of 130 final-year nursing undergraduate students were recruited to evaluate the psychometric properties (i.e. factor structure, internal consistency and test-retest reliability) of the tool. The HCAT has good content validity with content validity index of .979. The exploratory factor analysis reveals a four-factor structure of the tool. The internal consistency and test-retest reliability of the HCAT are satisfactory with Cronbach alpha ranging from .789 to .965 and Intraclass Correlation Coefficient ranging from .881 to .979 for the four subscales and total scale. HCAT has the potential to be used as a valid measure to evaluate clinical competence in nursing students, and provide specific and ongoing feedback to enhance the holistic clinical learning experience. In addition, HCAT functions as a tool for self-reflection, peer-assessment and guides preceptors in clinical teaching and assessment.
2014-01-01
Background Patient-reported outcome validation needs to achieve validity and reliability standards. Among reliability analysis parameters, test-retest reliability is an important psychometric property. Retested patients must be in a clinically stable condition. This is particularly problematic in palliative care (PC) settings because advanced cancer patients are prone to a faster rate of clinical deterioration. The aim of this study was to evaluate the methods by which multi-symptom and health-related qualities of life (HRQoL) based on patient-reported outcomes (PROs) have been validated in oncological PC settings with regards to test-retest reliability. Methods A systematic search of PubMed (1966 to June 2013), EMBASE (1980 to June 2013), PsychInfo (1806 to June 2013), CINAHL (1980 to June 2013), and SCIELO (1998 to June 2013), and specific PRO databases was performed. Studies were included if they described a set of validation studies. Studies were included if they described a set of validation studies for an instrument developed to measure multi-symptom or multidimensional HRQoL in advanced cancer patients under PC. The COSMIN checklist was used to rate the methodological quality of the study designs. Results We identified 89 validation studies from 746 potentially relevant articles. From those 89 articles, 31 measured test-retest reliability and were included in this review. Upon critical analysis of the overall quality of the criteria used to determine the test-retest reliability, 6 (19.4%), 17 (54.8%), and 8 (25.8%) of these articles were rated as good, fair, or poor, respectively, and no article was classified as excellent. Multi-symptom instruments were retested over a shortened interval when compared to the HRQoL instruments (median values 24 hours and 168 hours, respectively; p = 0.001). Validation studies that included objective confirmation of clinical stability in their design yielded better results for the test-retest analysis with regard to both pain and global HRQoL scores (p < 0.05). The quality of the statistical analysis and its description were of great concern. Conclusion Test-retest reliability has been infrequently and poorly evaluated. The confirmation of clinical stability was an important factor in our analysis, and we suggest that special attention be focused on clinical stability when designing a PRO validation study that includes advanced cancer patients under PC. PMID:24447633
A systematic review of the quality of homeopathic clinical trials
Jonas, Wayne B; Anderson, Rachel L; Crawford, Cindy C; Lyons, John S
2001-01-01
Background While a number of reviews of homeopathic clinical trials have been done, all have used methods dependent on allopathic diagnostic classifications foreign to homeopathic practice. In addition, no review has used established and validated quality criteria allowing direct comparison of the allopathic and homeopathic literature. Methods In a systematic review, we compared the quality of clinical-trial research in homeopathy to a sample of research on conventional therapies using a validated and system-neutral approach. All clinical trials on homeopathic treatments with parallel treatment groups published between 1945–1995 in English were selected. All were evaluated with an established set of 33 validity criteria previously validated on a broad range of health interventions across differing medical systems. Criteria covered statistical conclusion, internal, construct and external validity. Reliability of criteria application is greater than 0.95. Results 59 studies met the inclusion criteria. Of these, 79% were from peer-reviewed journals, 29% used a placebo control, 51% used random assignment, and 86% failed to consider potentially confounding variables. The main validity problems were in measurement where 96% did not report the proportion of subjects screened, and 64% did not report attrition rate. 17% of subjects dropped out in studies where this was reported. There was practically no replication of or overlap in the conditions studied and most studies were relatively small and done at a single-site. Compared to research on conventional therapies the overall quality of studies in homeopathy was worse and only slightly improved in more recent years. Conclusions Clinical homeopathic research is clearly in its infancy with most studies using poor sampling and measurement techniques, few subjects, single sites and no replication. Many of these problems are correctable even within a "holistic" paradigm given sufficient research expertise, support and methods. PMID:11801202
Methodological challenges of validating a clinical decision-making tool in the practice environment.
Brennan, Caitlin W; Daly, Barbara J
2015-04-01
Validating a measurement tool intended for use in the practice environment poses challenges that may not be present when validating a tool intended solely for research purposes. The aim of this article is to describe the methodological challenges of validating a clinical decision-making tool, the Oncology Acuity Tool, which nurses use to make nurse assignment and staffing decisions prospectively each shift. Data were derived from a larger validation study, during which several methodological challenges arose. Revisions to the tool, including conducting iterative feedback cycles with end users, were necessary before the validation study was initiated. The "true" value of patient acuity is unknown, and thus, two approaches to inter-rater reliability assessment were used. Discordant perspectives existed between experts and end users. Balancing psychometric rigor with clinical relevance may be achieved through establishing research-practice partnerships, seeking active and continuous feedback with end users, and weighing traditional statistical rules of thumb with practical considerations. © The Author(s) 2014.
Circulating tumor cells: clinical validity and utility.
Cabel, Luc; Proudhon, Charlotte; Gortais, Hugo; Loirat, Delphine; Coussy, Florence; Pierga, Jean-Yves; Bidard, François-Clément
2017-06-01
Circulating tumor cells (CTCs) are rare tumor cells and have been investigated as diagnostic, prognostic and predictive biomarkers in many types of cancer. Although CTCs are not currently used in clinical practice, CTC studies have accumulated a high level of clinical validity, especially in breast, lung, prostate and colorectal cancers. In this review, we present an overview of the current clinical validity of CTCs in metastatic and non-metastatic disease, and the main concepts and studies investigating the clinical utility of CTCs. In particular, this review will focus on breast, lung, colorectal and prostate cancer. Three major topics concerning the clinical utility of CTC are discussed-(1) treatment based on CTCs used as liquid biopsy, (2) treatment based on CTC count or CTC variations, and (3) treatment based on CTC biomarker expression. A summary of published or ongoing phase II and III trials is also presented.
USDA-ARS?s Scientific Manuscript database
BACKGROUND: Previous studies have suggested that for clinical purposes, subjects with fasting triglycerides (TGs) between 89-180 mg/dl (1-2 mmol/l) would benefit from postprandial TGs testing. OBJECTIVE: To determine the postprandial TG response in 2 independent studies and validate who should benef...
Nguyen, Van N B; Forbes, Helen; Mohebbi, Mohammadreza; Duke, Maxine
2017-12-01
Teaching nursing in clinical environments is considered complex and multi-faceted. Little is known about the role of the clinical nurse educator, specifically the challenges related to transition from clinician, or in some cases, from newly-graduated nurse to that of clinical nurse educator, as occurs in developing countries. Confidence in the clinical educator role has been associated with successful transition and the development of role competence. There is currently no valid and reliable instrument to measure clinical nurse educator confidence. This study was conducted to develop and psychometrically test an instrument to measure perceived confidence among clinical nurse educators. A multi-phase, multi-setting survey design was used. A total of 468 surveys were distributed, and 363 were returned. Data were analyzed using exploratory and confirmatory factor analyses. The instrument was successfully tested and modified in phase 1, and factorial validity was subsequently confirmed in phase 2. There was strong evidence of internal consistency, reliability, content, and convergent validity of the Clinical Nurse Educator Skill Acquisition Assessment instrument. The resulting instrument is applicable in similar contexts due to its rigorous development and validation process. © 2017 The Authors. Nursing & Health Sciences published by John Wiley & Sons Australia, Ltd.
Rusli, B N; Amrina, K; Trived, S; Loh, K P; Shashi, M
2017-10-01
The 21-item English version of the Depression Anxiety Stress Scale (DASS-21) has been proposed as a method for assessing self-perceived depression, anxiety and stress over the past week in various clinical and nonclinical populations. Several Malay versions of the DASS-21 have been validated in various populations with varying success. One particular Malay version has been validated in various occupational groups (such as nurses and automotive workers) but not among male clinic outpatient attendees in Malaysia. To validate the Malay version of the DASS-21 (Malay-DASS-21) among male outpatient clinic attendees in Johor. A validation study with a random sample of 402 male respondents attending the outpatient clinic of a major public outpatient clinic in Johor Bahru and Segamat was carried out from January to March 2016. Construct validity of the Malay-DASS-21 was examined using Exploratory Factor Analysis (KMO = 0.947; Bartlett's test of sphericity is significant, p<0.001) through Principal Component Analysis and orthogonal (varimax) rotation with Kaiser Normalization to confirm the psychometric properties of the Malay-DASS- 21 and the internal consistency reliability using Cronbach's alpha. Construct validity of the Malay-DASS-21 based on eigenvalues and factor loadings to confirm the three factor structure (depression, anxiety, and stress) was acceptable. The internal consistency reliability of the factor construct was very impressive with Cronbach's alpha values in the range of 0.837 to 0.863. The present study showed that the Malay- DASS-21 has acceptable psychometric construct and high internal consistency reliability to measure self-perceived depression, anxiety and stress over the past week in male outpatient clinic attendees in Johor. Further studies are necessary to revalidate the Malay-DASS-21 across different populations and cultures, and using confirmatory factor analyses.
ERIC Educational Resources Information Center
Wijnen-Meijer, M.; Van der Schaaf, M.; Booij, E.; Harendza, S.; Boscardin, C.; Wijngaarden, J. Van; Ten Cate, Th. J.
2013-01-01
There is a need for valid methods to assess the readiness for clinical practice of medical graduates. This study evaluates the validity of Utrecht Hamburg Trainee Responsibility for Unfamiliar Situations Test (UHTRUST), an authentic simulation procedure to assess whether medical trainees are ready to be entrusted with unfamiliar clinical tasks…
Validation of the Clinical Learning Environment Inventory.
Chan, Dominic S
2003-08-01
One hundred eight preregistration nursing students took part in this survey study, which assessed their perceptions of the clinical learning environment. Statistical data based on the sample confirmed the reliability and validity of the Clinical Learning Environment Inventory (CLEI), which was developed using the concept of classroom learning environment studies. The study also found that there were significant differences between students' actual and preferred perceptions of the clinical learning environments. In terms of the CLEI scales, students preferred a more positive and favorable clinical environment than they perceived as being actually present. The achievement of certain outcomes of clinical field placements might be enhanced by attempting to change the actual clinical environment in ways that make it more congruent with that preferred by the students.
Henry, Teresa R; Penn, Lara D; Conerty, Jason R; Wright, Francesca E; Gorman, Gregory; Pack, Brian W
2016-11-01
Non-clinical dose formulations (also known as pre-clinical or GLP formulations) play a key role in early drug development. These formulations are used to introduce active pharmaceutical ingredients (APIs) into test organisms for both pharmacokinetic and toxicological studies. Since these studies are ultimately used to support dose and safety ranges in human studies, it is important to understand not only the concentration and PK/PD of the active ingredient but also to generate safety data for likely process impurities and degradation products of the active ingredient. As such, many in the industry have chosen to develop and validate methods which can accurately detect and quantify the active ingredient along with impurities and degradation products. Such methods often provide trendable results which are predictive of stability, thus leading to the name; stability indicating methods. This document provides an overview of best practices for those choosing to include development and validation of such methods as part of their non-clinical drug development program. This document is intended to support teams who are either new to stability indicating method development and validation or who are less familiar with the requirements of validation due to their position within the product development life cycle.
First-in-human Phase 1 CRISPR Gene Editing Cancer Trials: Are We Ready?
Baylis, Francoise; McLeod, Marcus
2017-01-01
A prospective first-in-human Phase 1 CRISPR gene editing trial in the United States for patients with melanoma, synovial sarcoma, and multiple myeloma offers hope that gene editing tools may usefully treat human disease. An overarching ethical challenge with first-in-human Phase 1 clinical trials, however, is knowing when it is ethically acceptable to initiate such trials on the basis of safety and efficacy data obtained from pre-clinical studies. If the pre-clinical studies that inform trial design are themselves poorly designed - as a result of which the quality of pre-clinical evidence is deficient - then the ethical requirement of scientific validity for clinical research may not be satisfied. In turn, this could mean that the Phase 1 clinical trial will be unsafe and that trial participants will be exposed to risk for no potential benefit. To assist sponsors, researchers, clinical investigators and reviewers in deciding when it is ethically acceptable to initiate first-in-human Phase 1 CRISPR gene editing clinical trials, structured processes have been developed to assess and minimize translational distance between pre-clinical and clinical research. These processes draw attention to various features of internal validity, construct validity, and external validity. As well, the credibility of supporting evidence is to be critically assessed with particular attention to optimism bias, financial conflicts of interest and publication bias. We critically examine the pre-clinical evidence used to justify the first-inhuman Phase 1 CRISPR gene editing cancer trial in the United States using these tools. We conclude that the proposed trial cannot satisfy the ethical requirement of scientific validity because the supporting pre-clinical evidence used to inform trial design is deficient. Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.org.
ERIC Educational Resources Information Center
Woodburn, Jim; Sutcliffe, Nick
1996-01-01
The Objective Structured Clinical Examination (OSCE), initially developed for undergraduate medical education, has been adapted for assessment of clinical skills in podiatry students. A 12-month pilot study found the test had relatively low levels of reliability, high construct and criterion validity, and good stability of performance over time.…
Mayo, Ann M
2015-01-01
It is important for CNSs and other APNs to consider the reliability and validity of instruments chosen for clinical practice, evidence-based practice projects, or research studies. Psychometric testing uses specific research methods to evaluate the amount of error associated with any particular instrument. Reliability estimates explain more about how well the instrument is designed, whereas validity estimates explain more about scores that are produced by the instrument. An instrument may be architecturally sound overall (reliable), but the same instrument may not be valid. For example, if a specific group does not understand certain well-constructed items, then the instrument does not produce valid scores when used with that group. Many instrument developers may conduct reliability testing only once, yet continue validity testing in different populations over many years. All CNSs should be advocating for the use of reliable instruments that produce valid results. Clinical nurse specialists may find themselves in situations where reliability and validity estimates for some instruments that are being utilized are unknown. In such cases, CNSs should engage key stakeholders to sponsor nursing researchers to pursue this most important work.
The validity and clinical utility of purging disorder.
Keel, Pamela K; Striegel-Moore, Ruth H
2009-12-01
To review evidence of the validity and clinical utility of Purging Disorder and examine options for the Diagnostic and Statistical Manual of Mental Disorders fifth edition (DSM-V). Articles were identified by computerized and manual searches and reviewed to address five questions about Purging Disorder: Is there "ample" literature? Is the syndrome clearly defined? Can it be measured and diagnosed reliably? Can it be differentiated from other eating disorders? Is there evidence of syndrome validity? Although empirical classification and concurrent validity studies provide emerging support for the distinctiveness of Purging Disorder, questions remain about definition, diagnostic reliability in clinical settings, and clinical utility (i.e., prognostic validity). We discuss strengths and weaknesses associated with various options for the status of Purging Disorder in the DSM-V ranging from making no changes from DSM-IV to designating Purging Disorder a diagnosis on equal footing with Anorexia Nervosa and Bulimia Nervosa.
Walenkamp, Monique M J; Bentohami, Abdelali; Slaar, Annelie; Beerekamp, M S H Suzan; Maas, Mario; Jager, L C Cara; Sosef, Nico L; van Velde, Romuald; Ultee, Jan M; Steyerberg, Ewout W; Goslings, J C Carel; Schep, Niels W L
2016-01-01
Although only 39% of patients with wrist trauma have sustained a fracture, the majority of patients is routinely referred for radiography. The purpose of this study was to derive and externally validate a clinical decision rule that selects patients with acute wrist trauma in the Emergency Department (ED) for radiography. This multicenter prospective study consisted of three components: (1) derivation of a clinical prediction model for detecting wrist fractures in patients following wrist trauma; (2) external validation of this model; and (3) design of a clinical decision rule. The study was conducted in the EDs of five Dutch hospitals: one academic hospital (derivation cohort) and four regional hospitals (external validation cohort). We included all adult patients with acute wrist trauma. The main outcome was fracture of the wrist (distal radius, distal ulna or carpal bones) diagnosed on conventional X-rays. A total of 882 patients were analyzed; 487 in the derivation cohort and 395 in the validation cohort. We derived a clinical prediction model with eight variables: age; sex, swelling of the wrist; swelling of the anatomical snuffbox, visible deformation; distal radius tender to palpation; pain on radial deviation and painful axial compression of the thumb. The Area Under the Curve at external validation of this model was 0.81 (95% CI: 0.77-0.85). The sensitivity and specificity of the Amsterdam Wrist Rules (AWR) in the external validation cohort were 98% (95% CI: 95-99%) and 21% (95% CI: 15%-28). The negative predictive value was 90% (95% CI: 81-99%). The Amsterdam Wrist Rules is a clinical prediction rule with a high sensitivity and negative predictive value for fractures of the wrist. Although external validation showed low specificity and 100 % sensitivity could not be achieved, the Amsterdam Wrist Rules can provide physicians in the Emergency Department with a useful screening tool to select patients with acute wrist trauma for radiography. The upcoming implementation study will further reveal the impact of the Amsterdam Wrist Rules on the anticipated reduction of X-rays requested, missed fractures, Emergency Department waiting times and health care costs.
Developing an instrument to measure effective factors on Clinical Learning.
Dadgaran, Ideh; Shirazi, Mandana; Mohammadi, Aeen; Ravari, Ali
2016-07-01
Although nursing students spend a large part of their learning period in the clinical environment, clinical learning has not been perceived by its nature yet. To develop an instrument to measure effective factors on clinical learning in nursing students. This is a mixed methods study performed in 2 steps. First, the researchers defined "clinical learning" in nursing students through qualitative content analysis and designed items of the questionnaire based on semi-structured individual interviews with nursing students. Then, as the second step, psychometric properties of the questionnaire were evaluated using the face validity, content validity, construct validity, and internal consistency evaluated on 227 students from fourth or higher semesters. All the interviews were recorded and transcribed, and then, they were analyzed using Max Qualitative Data Analysis and all of qualitative data were analyzed using SPSS 14. To do the study, we constructed the preliminary questionnaire containing 102 expressions. After determination of face and content validities by qualitative and quantitative approaches, the expressions of the questionnaire were reduced to 45. To determine the construct validity, exploratory factor analysis was applied. The results indicated that the maximum variance percentage (40.55%) was defined by the first 3 factors while the rest of the total variance percentage (59.45%) was determined by the other 42 factors. Results of exploratory factor analysis of this questionnaire indicated the presence of 3 instructor-staff, students, and educational related factors. Finally, 41 expressions were kept in 3 factor groups. The α-Cronbach coefficient (0.93) confirmed the high internal consistency of the questionnaire. Results indicated that the prepared questionnaire was an efficient instrument in the study of the effective factors on clinical learning as viewed by nursing students since it involves 41 expressions and properties such as instrument design based on perception and experiences of the nursing students about effective factors on clinical learning, definition of facilitator and preventive factors of the clinical learning, simple scoring, suitable validity and reliability, and applicability in different occasions.
Mucci, A; Galderisi, S; Merlotti, E; Rossi, A; Rocca, P; Bucci, P; Piegari, G; Chieffi, M; Vignapiano, A; Maj, M
2015-07-01
The Brief Negative Symptom Scale (BNSS) was developed to address the main limitations of the existing scales for the assessment of negative symptoms of schizophrenia. The initial validation of the scale by the group involved in its development demonstrated good convergent and discriminant validity, and a factor structure confirming the two domains of negative symptoms (reduced emotional/verbal expression and anhedonia/asociality/avolition). However, only relatively small samples of patients with schizophrenia were investigated. Further independent validation in large clinical samples might be instrumental to the broad diffusion of the scale in clinical research. The present study aimed to examine the BNSS inter-rater reliability, convergent/discriminant validity and factor structure in a large Italian sample of outpatients with schizophrenia. Our results confirmed the excellent inter-rater reliability of the BNSS (the intraclass correlation coefficient ranged from 0.81 to 0.98 for individual items and was 0.98 for the total score). The convergent validity measures had r values from 0.62 to 0.77, while the divergent validity measures had r values from 0.20 to 0.28 in the main sample (n=912) and in a subsample without clinically significant levels of depression and extrapyramidal symptoms (n=496). The BNSS factor structure was supported in both groups. The study confirms that the BNSS is a promising measure for quantifying negative symptoms of schizophrenia in large multicenter clinical studies. Copyright © 2015 Elsevier Masson SAS. All rights reserved.
ERIC Educational Resources Information Center
Bianchini, Kevin J.; Etherton, Joseph L.; Greve, Kevin W.; Heinly, Matthew T.; Meyers, John E.
2008-01-01
The purpose of this study was to determine the accuracy of "Minnesota Multiphasic Personality Inventory" 2nd edition (MMPI-2; Butcher, Dahlstrom, Graham, Tellegen, & Kaemmer, 1989) validity indicators in the detection of malingering in clinical patients with chronic pain using a hybrid clinical-known groups/simulator design. The…
Lee-Hsieh, Jane; O'Brien, Anthony; Liu, Chieh-Yu; Cheng, Su-Fen; Lee, Yea-Wen; Kao, Yu-Hsiu
2016-03-01
Few studies have examined the perceptions of clinical teaching behaviors among both nurse preceptors and preceptees. To develop a Clinical Teaching Behavior Inventory (CTBI) for nurse preceptors' self-evaluation, and for new graduate nurse preceptee evaluation of preceptor clinical teaching behaviors and to test the validity and reliability of the CTBI. This study used mixed research techniques in five phases. Phase I: based on a literature review, the researchers developed an instrument to measure clinical teaching behaviors. Phase II: 17 focus group interviews were conducted with 63 preceptors and 24 new graduate nurses from five hospitals across Taiwan. Clinical teaching behavior themes were extracted from the focus group data and integrated into the domains and items of the CTBI. Phase III: two rounds of an expert Delphi study were conducted to determine the content validity of the instrument. Phase IV: a total of 290 nurse preceptors and 260 new graduate nurses were recruited voluntarily in the same five hospitals in Taiwan. Of these, 521 completed questionnaires to test the construct validity of CTBI by using confirmatory factory analysis. Phase V: the internal consistency and reliability of the instrument were tested. CTBI consists of 23 items in six domains: (1) 'Committing to Teaching'; (2) 'Building a Learning Atmosphere'; (3) 'Using Appropriate Teaching Strategies'; (4) 'Guiding Inter-professional Communication'; (5) 'Providing Feedback and Evaluation'; and (6) 'Showing Concern and Support'. The confirmatory factor analysis yielded a good fit and reliable scores for the CTBI-23 model. The CTBI-23 is a valid and reliable instrument for identifying the clinical teaching behaviors of a preceptor as perceived by preceptors and new graduate preceptees. The CTBI-23 depicts clinical teaching behaviors of nurse preceptors in Taiwan. Copyright © 2015 Elsevier Ltd. All rights reserved.
An Italian multicentre validation study of the coma recovery scale-revised.
Estraneo, A; Moretta, P; De Tanti, A; Gatta, G; Giacino, J T; Trojano, L
2015-10-01
Rate of misdiagnosis of disorders of consciousness (DoC) can be reduced by employing validated clinical diagnostic tools, such as the Coma Recovery Scale-Revised (CRS-R). An Italian version of the CRS-R has been recently developed, but its applicability across different clinical settings, and its concurrent validity and diagnostic sensitivity have not been estimated yet. To perform a multicentre validation study of the Italian version of the Coma Recovery Scale-Revised (CRS-R). Analysis of inter-rater reliability, concurrent validity and diagnostic sensitivity of the scale. One Intensive Care Unit, 8 Post-acute rehabilitation centres and 2 Long-term facilities Twenty-seven professionals (physicians, N.=11; psychologists, N.=5; physiotherapists, N.=3; speech therapists, N.=6; nurses, N.=2) from 11 Italian Centres. CRS-R and Disability Rating Scale (DRS) applied to 122 patients with clinical diagnosis of Vegetative State (VS) or Minimally Conscious State (MCS). CRS-R has good-to-excellent inter-rater reliability for all subscales, particularly for the communication subscale. The Italian version of the CRS-R showed a high sensitivity and specificity in detecting MCS with reference to clinical consensus diagnosis. The CRS-R showed good concurrent validity with the Disability Rating Scale, which had very low specificity with reference to clinical consensus diagnosis. The Italian version of the CRS-R is a valid scale for use from the sub-acute to chronic stages of DoC. It can be administered reliably by all members of the rehabilitation team with different specialties, levels of experience and settings. The present study promote use of the Italian version of the CRS-R to improve diagnosis of DoC patients, and plan tailored rehabilitation treatment.
Lucena-Santos, Paola; Trindade, Inês A; Oliveira, Margareth; Pinto-Gouveia, José
2017-05-19
Given the clinical usefulness of the CFQ-BI (Cognitive Fusion Questionnaire-Body Image; the only existing measure to assess the body-image-related cognitive fusion), the present study aimed to confirm its one-factor structure, to verify its measurement invariance between clinical and non-clinical samples, to analyze its internal consistency and sensitivity to detect differences between samples, as well as to explore the incremental and convergent validities of the CFQ-BI scores in Brazilian samples. This was a cross-sectional study, which was conducted in clinical (women with overweight or obesity in treatment for weight loss) and non-clinical samples (women from the general population). The one-factor structure was confirmed showing factorial measurement invariance across clinical and non-clinical samples. The CFQ-BI scores presented an excellent internal consistency, were able to discriminate clinical and non-clinical samples, and were positively associated with binge eating severity, general cognitive fusion, and psychological inflexibility. Furthermore, body-image-related cognitive fusion scores (CFQ-BI) presented incremental validity over a general measure of cognitive fusion in the prediction of binge eating symptoms. This study demonstrated that CFQ-BI is a short scale with reliable and robust scores in Brazilian samples, presenting incremental and convergent validities, measurement invariance, and sensitivity to detect differences between clinical and non-clinical groups of women, enabling comparative studies between them.
ERIC Educational Resources Information Center
Langer, David A.; Wood, Jeffrey J.; Bergman, R. Lindsey; Piacentini, John C.
2010-01-01
The present study examines the construct validity of separation anxiety disorder (SAD), social phobia (SoP), panic disorder (PD), and generalized anxiety disorder (GAD) in a clinical sample of children. Participants were 174 children, 6 to 17 years old (94 boys) who had undergone a diagnostic evaluation at a university hospital based clinic.…
Factorial Validity and Invariance of the GHQ-12 among Clinical and Nonclinical Samples
ERIC Educational Resources Information Center
Fernandes, Helder Miguel; Vasconcelos-Raposo, Jose
2013-01-01
The purpose of this study was to examine the internal reliability, factorial validity, and measurement invariance of a Brazilian-Portuguese version of the General Health Questionnaire-12 (GHQ-12) across clinical and nonclinical groups. The clinical sample consisted of 228 chronic hemodialysis patients (41.7% female), with a mean age of 48.23 (SD =…
Crary, Michael A.; Carnaby, Giselle D.; Sia, Isaac
2017-01-01
Background The aim of this study was to compare spontaneous swallow frequency analysis (SFA) with clinical screening protocols for identification of dysphagia in acute stroke. Methods In all, 62 patients with acute stroke were evaluated for spontaneous swallow frequency rates using a validated acoustic analysis technique. Independent of SFA, these same patients received a routine nurse-administered clinical dysphagia screening as part of standard stroke care. Both screening tools were compared against a validated clinical assessment of dysphagia for acute stroke. In addition, psychometric properties of SFA were compared against published, validated clinical screening protocols. Results Spontaneous SFA differentiates patients with versus without dysphagia after acute stroke. Using a previously identified cut point based on swallows per minute, spontaneous SFA demonstrated superior ability to identify dysphagia cases compared with a nurse-administered clinical screening tool. In addition, spontaneous SFA demonstrated equal or superior psychometric properties to 4 validated, published clinical dysphagia screening tools. Conclusions Spontaneous SFA has high potential to identify dysphagia in acute stroke with psychometric properties equal or superior to clinical screening protocols. PMID:25088166
Crary, Michael A; Carnaby, Giselle D; Sia, Isaac
2014-09-01
The aim of this study was to compare spontaneous swallow frequency analysis (SFA) with clinical screening protocols for identification of dysphagia in acute stroke. In all, 62 patients with acute stroke were evaluated for spontaneous swallow frequency rates using a validated acoustic analysis technique. Independent of SFA, these same patients received a routine nurse-administered clinical dysphagia screening as part of standard stroke care. Both screening tools were compared against a validated clinical assessment of dysphagia for acute stroke. In addition, psychometric properties of SFA were compared against published, validated clinical screening protocols. Spontaneous SFA differentiates patients with versus without dysphagia after acute stroke. Using a previously identified cut point based on swallows per minute, spontaneous SFA demonstrated superior ability to identify dysphagia cases compared with a nurse-administered clinical screening tool. In addition, spontaneous SFA demonstrated equal or superior psychometric properties to 4 validated, published clinical dysphagia screening tools. Spontaneous SFA has high potential to identify dysphagia in acute stroke with psychometric properties equal or superior to clinical screening protocols. Copyright © 2014 National Stroke Association. Published by Elsevier Inc. All rights reserved.
Wu, Danny T Y; Smart, Nikolas; Ciemins, Elizabeth L; Lanham, Holly J; Lindberg, Curt; Zheng, Kai
2017-01-01
To develop a workflow-supported clinical documentation system, it is a critical first step to understand clinical workflow. While Time and Motion studies has been regarded as the gold standard of workflow analysis, this method can be resource consuming and its data may be biased due to the cognitive limitation of human observers. In this study, we aimed to evaluate the feasibility and validity of using EHR audit trail logs to analyze clinical workflow. Specifically, we compared three known workflow changes from our previous study with the corresponding EHR audit trail logs of the study participants. The results showed that EHR audit trail logs can be a valid source for clinical workflow analysis, and can provide an objective view of clinicians' behaviors, multi-dimensional comparisons, and a highly extensible analysis framework.
Visualizing and Validating Metadata Traceability within the CDISC Standards.
Hume, Sam; Sarnikar, Surendra; Becnel, Lauren; Bennett, Dorine
2017-01-01
The Food & Drug Administration has begun requiring that electronic submissions of regulated clinical studies utilize the Clinical Data Information Standards Consortium data standards. Within regulated clinical research, traceability is a requirement and indicates that the analysis results can be traced back to the original source data. Current solutions for clinical research data traceability are limited in terms of querying, validation and visualization capabilities. This paper describes (1) the development of metadata models to support computable traceability and traceability visualizations that are compatible with industry data standards for the regulated clinical research domain, (2) adaptation of graph traversal algorithms to make them capable of identifying traceability gaps and validating traceability across the clinical research data lifecycle, and (3) development of a traceability query capability for retrieval and visualization of traceability information.
Visualizing and Validating Metadata Traceability within the CDISC Standards
Hume, Sam; Sarnikar, Surendra; Becnel, Lauren; Bennett, Dorine
2017-01-01
The Food & Drug Administration has begun requiring that electronic submissions of regulated clinical studies utilize the Clinical Data Information Standards Consortium data standards. Within regulated clinical research, traceability is a requirement and indicates that the analysis results can be traced back to the original source data. Current solutions for clinical research data traceability are limited in terms of querying, validation and visualization capabilities. This paper describes (1) the development of metadata models to support computable traceability and traceability visualizations that are compatible with industry data standards for the regulated clinical research domain, (2) adaptation of graph traversal algorithms to make them capable of identifying traceability gaps and validating traceability across the clinical research data lifecycle, and (3) development of a traceability query capability for retrieval and visualization of traceability information. PMID:28815125
Willis, Brian H; Riley, Richard D
2017-09-20
An important question for clinicians appraising a meta-analysis is: are the findings likely to be valid in their own practice-does the reported effect accurately represent the effect that would occur in their own clinical population? To this end we advance the concept of statistical validity-where the parameter being estimated equals the corresponding parameter for a new independent study. Using a simple ('leave-one-out') cross-validation technique, we demonstrate how we may test meta-analysis estimates for statistical validity using a new validation statistic, Vn, and derive its distribution. We compare this with the usual approach of investigating heterogeneity in meta-analyses and demonstrate the link between statistical validity and homogeneity. Using a simulation study, the properties of Vn and the Q statistic are compared for univariate random effects meta-analysis and a tailored meta-regression model, where information from the setting (included as model covariates) is used to calibrate the summary estimate to the setting of application. Their properties are found to be similar when there are 50 studies or more, but for fewer studies Vn has greater power but a higher type 1 error rate than Q. The power and type 1 error rate of Vn are also shown to depend on the within-study variance, between-study variance, study sample size, and the number of studies in the meta-analysis. Finally, we apply Vn to two published meta-analyses and conclude that it usefully augments standard methods when deciding upon the likely validity of summary meta-analysis estimates in clinical practice. © 2017 The Authors. Statistics in Medicine published by John Wiley & Sons Ltd. © 2017 The Authors. Statistics in Medicine published by John Wiley & Sons Ltd.
Chew, Keng Sheng; Kueh, Yee Cheng; Abdul Aziz, Adlihafizi
2017-03-21
Despite their importance on diagnostic accuracy, there is a paucity of literature on questionnaire tools to assess clinicians' awareness toward cognitive errors. A validation study was conducted to develop a questionnaire tool to evaluate the Clinician's Awareness Towards Cognitive Errors (CATChES) in clinical decision making. This questionnaire is divided into two parts. Part A is to evaluate the clinicians' awareness towards cognitive errors in clinical decision making while Part B is to evaluate their perception towards specific cognitive errors. Content validation for both parts was first determined followed by construct validation for Part A. Construct validation for Part B was not determined as the responses were set in a dichotomous format. For content validation, all items in both Part A and Part B were rated as "excellent" in terms of their relevance in clinical settings. For construct validation using exploratory factor analysis (EFA) for Part A, a two-factor model with total variance extraction of 60% was determined. Two items were deleted. Then, the EFA was repeated showing that all factor loadings are above the cut-off value of >0.5. The Cronbach's alpha for both factors are above 0.6. The CATChES questionnaire tool is a valid questionnaire tool aimed to evaluate the awareness among clinicians toward cognitive errors in clinical decision making.
Bridges, Patricia H; Carter, Vincent; Rehm, Stephanie; Tintl, Sara Bowers; Halperin, Rebecca; Kniesly, Elizabeth; Pelino, Soni
2013-01-01
Conduct a pilot study to establish the reliability and validity of a survey instrument that directly measures the objectives and content of the APTA CIECP; and measure the self-reported frequency of use of the behaviors taught in the APTA CIECP. Eighteen (18) APTA credentialed CIs. Develop a web-based survey consisting of 58 items representative of the behaviors taught in the APTA CIECP and 8 demographic characteristics. Establish the content validity and reliability of the survey instrument. Conduct a descriptive analysis of the frequency of self-reported use of the behaviors. The APTA Clinical Instructor Education Board (CIEB) reviewed the items and determined that the items matched the objectives and content of the APTA CIECP, thereby establishing content validity. Cronbach's alpha coefficients ranging from 0.79-0.90 confirmed the reliability. The overall mean for all items on a 1-6 scale was 4.81. The content validity and reliability of the survey instrument were established. The outcomes of this pilot study suggest that when measured by a valid and reliable instrument that is representative of the objectives and content of the CIECP, the behaviors taught in the CIECP are being applied in the clinical setting by APTA credentialed clinical instructors.
de Klerk, Susan; Buchanan, Helen; Jerosch-Herold, Christina
Systematic review. The Disabilities of the Arm Shoulder and Hand Questionnaire has multiple language versions from many countries around the world. In addition there is extensive research evidence of its psychometric properties. The purpose of this study was to systematically review the evidence available on the validity and clinical utility of the Disabilities of the Arm Shoulder and Hand as a measure of activity and participation in patients with musculoskeletal hand injuries in developing country contexts. We registered the review with international prospective register of systematic reviews prior to conducting a comprehensive literature search and extracting descriptive data. Two reviewers independently assessed methodological quality with the Consensus-Based Standards for the Selection of Health Measurement Instruments critical appraisal tool, the checklist to operationalize measurement characteristics of patient-rated outcome measures and the multidimensional model of clinical utility. Fourteen studies reporting 12 language versions met the eligibility criteria. Two language versions (Persian and Turkish) had an overall rating of good, and one (Thai) had an overall rating of excellent for cross-cultural validity. The remaining 9 language versions had an overall poor rating for cross-cultural validity. Content and construct validity and clinical utility yielded similar results. Poor quality ratings for validity and clinical utility were due to insufficient documentation of results and inadequate psychometric testing. With the increase in migration and globalization, hand therapists are likely to require a range of culturally adapted and translated versions of the Disabilities of the Arm Shoulder and Hand. Recommendations include rigorous application and reporting of cross-cultural adaptation, appropriate psychometric testing, and testing of clinical utility in routine clinical practice. Copyright © 2017 Hanley & Belfus. Published by Elsevier Inc. All rights reserved.
Walenkamp, Monique M J; Bentohami, Abdelali; Slaar, Annelie; Beerekamp, M Suzan H; Maas, Mario; Jager, L Cara; Sosef, Nico L; van Velde, Romuald; Ultee, Jan M; Steyerberg, Ewout W; Goslings, J Carel; Schep, Niels W L
2015-12-18
Although only 39 % of patients with wrist trauma have sustained a fracture, the majority of patients is routinely referred for radiography. The purpose of this study was to derive and externally validate a clinical decision rule that selects patients with acute wrist trauma in the Emergency Department (ED) for radiography. This multicenter prospective study consisted of three components: (1) derivation of a clinical prediction model for detecting wrist fractures in patients following wrist trauma; (2) external validation of this model; and (3) design of a clinical decision rule. The study was conducted in the EDs of five Dutch hospitals: one academic hospital (derivation cohort) and four regional hospitals (external validation cohort). We included all adult patients with acute wrist trauma. The main outcome was fracture of the wrist (distal radius, distal ulna or carpal bones) diagnosed on conventional X-rays. A total of 882 patients were analyzed; 487 in the derivation cohort and 395 in the validation cohort. We derived a clinical prediction model with eight variables: age; sex, swelling of the wrist; swelling of the anatomical snuffbox, visible deformation; distal radius tender to palpation; pain on radial deviation and painful axial compression of the thumb. The Area Under the Curve at external validation of this model was 0.81 (95 % CI: 0.77-0.85). The sensitivity and specificity of the Amsterdam Wrist Rules (AWR) in the external validation cohort were 98 % (95 % CI: 95-99 %) and 21 % (95 % CI: 15 %-28). The negative predictive value was 90 % (95 % CI: 81-99 %). The Amsterdam Wrist Rules is a clinical prediction rule with a high sensitivity and negative predictive value for fractures of the wrist. Although external validation showed low specificity and 100 % sensitivity could not be achieved, the Amsterdam Wrist Rules can provide physicians in the Emergency Department with a useful screening tool to select patients with acute wrist trauma for radiography. The upcoming implementation study will further reveal the impact of the Amsterdam Wrist Rules on the anticipated reduction of X-rays requested, missed fractures, Emergency Department waiting times and health care costs. This study was registered in the Dutch Trial Registry, reference number NTR2544 on October 1(st), 2010.
Morasco, Benjamin J; Gfeller, Jeffrey D; Elder, Katherine A
2007-06-01
In this psychometric study, we compared the recently developed Validity Scales from the Revised NEO Personality Inventory (NEO PI-R; Costa & McCrae, 1992b) with the MMPI-2 (Butcher, Dahstrom, Graham, Tellegen, & Kaemmer, 1989) Validity Scales. We collected data from clients (n = 74) who completed comprehensive psychological evaluations at a university-based outpatient mental health clinic. Correlations between the Validity Scales of the NEO-PI-R and MMPI-2 were significant and in the expected directions. The relationships provide support for convergent and discriminant validity of the NEO-PI-R Validity Scales. The percent agreement of invalid responding on the two measures was high, although the diagnostic agreement was modest (kappa = .22-.33). Finally, clients who responded in an invalid manner on the NEO-PI-R Validity Scales produced significantly different clinical profiles on the NEO-PI-R and MMPI-2 than clients with valid protocols. These results provide additional support for the clinical utility of the NEO-PI-R Validity Scales as indicators of response bias.
Clinical validation of the nursing diagnosis of ineffective protection in haemodialysis patients.
de Sá Tinôco, Jéssica Dantas; de Paiva, Maria das Graças Mariano Nunes; de Queiroz Frazão, Cecília Maria Farias; Lucio, Kadyjina Daiane Batista; Fernandes, Maria Isabel da Conceição Dias; de Oliveira Lopes, Marcos Venicios; de Carvalho Lira, Ana Luisa Brandão
2018-01-01
To evaluate the clinical validity of indicators of the nursing diagnosis of "ineffective protection" in haemodialysis patients. Haemodialysis patients have reduced protection. Studies on the nursing diagnosis of "ineffective protection" are scarce in the literature. The use of indicators to diagnose "ineffective protection" could improve the care of haemodialysis patients. The clinical usefulness of the indicators requires clinical validation. This was a diagnostic accuracy study. This study assessed a sample of 200 patients undergoing haemodialysis in a reference clinic for nephrology during the first half of 2015. Operational definitions were created for each clinical indicator based on concept analysis and content validation by experts for these indicators. Diagnostic accuracy measurement was performed with latent class analysis with randomised effects. The clinical indicator of "fatigue" had high sensitivity (p = .999) and specificity (p = 1.000) for the identification of "ineffective protection." Additionally, "maladaptive response to stress" (p = .711) and "coagulation change" (p = .653) were sensitive indicators. The main indicators that showed high specificity were "fever" (p = .987), "increased number of hospitalisations" (p = .911), "weakness" (p = .937), "infected vascular access" (p = .962) and "vascular access dysfunction" (p = .722). A set of nine clinical indicators of "ineffective protection" were accurate and statistically significant for haemodialysis patients. Three clinical indicators showed sensitivity, and six indicators showed specificity. Accurate measures for nursing diagnoses can help nurses confirm or rule out the probability of the occurrence of "ineffective protection" in patients undergoing haemodialysis. © 2017 John Wiley & Sons Ltd.
Fardell, Joanna E; Jones, Georden; Smith, Allan Ben; Lebel, Sophie; Thewes, Belinda; Costa, Daniel; Tiller, Kerry; Simard, Sébastien; Feldstain, Andrea; Beattie, Sara; McCallum, Megan; Butow, Phyllis
2018-02-01
Fear of cancer recurrence (FCR) is a common concern among cancer survivors. Identifying survivors with clinically significant FCR requires validated screening measures and clinical cut-offs. We evaluated the Fear of Cancer Recurrence Inventory-Short Form (FCRI-SF) clinical cut-off in 2 samples. Level of FCR in study 1 participants (from an Australian randomized controlled trial: ConquerFear) was compared with FCRI-SF scores. Based on a biopsychosocial interview, clinicians rated participants as having nonclinical, subclinical, or clinical FCR. Study 2 participants (from a Canadian FCRI-English validation study) were classified as having clinical or nonclinical FCR by using the semistructured clinical interview for FCR (SIFCR). Receiver operating characteristic analyses evaluated the screening ability of the FCRI-SF against clinician ratings (study 1) and the SIFCR (study 2). In study 1, 167 cancer survivors (mean age: 53 years, SD = 10.1) participated. Clinicians rated 43% as having clinical FCR. In study 2, 40 cancer survivors (mean age: 68 years, SD = 7.0) participated; 25% met criteria for clinical FCR according to the SIFCR. For both studies 1 and 2, receiver operating characteristic analyses suggested a cut-off ≥22 on the FCRI-SF identified cancer survivors with clinical levels of FCR with adequate sensitivity and specificity. Establishing clinical cut-offs on FCR screening measures is crucial to tailoring individual care and conducting rigorous research. Our results suggest using a higher cut-off on the FCRI-SF than previously reported to identify clinically significant FCR. Continued evaluation and validation of the FCRI-SF cut-off is required across diverse cancer populations. Copyright © 2017 John Wiley & Sons, Ltd.
The Clinical Impression of Severity Index for Parkinson's Disease: international validation study.
Martínez-Martín, Pablo; Rodríguez-Blázquez, Carmen; Forjaz, Maria João; de Pedro, Jesús
2009-01-30
This study sought to provide further information about the psychometric properties of the Clinical Impression of Severity Index for Parkinson's Disease (CISI-PD), in a large, international, cross-culturally diverse sample. Six hundred and fourteen patients with PD participated in the study. Apart from the CISI-PD, assessments were based on Hoehn & Yahr (HY) staging, the Scales for Outcomes in PD-Motor (SCOPA-M), -Cognition (SCOPA-COG) and -Psychosocial (SCOPA-PS), the Cumulative Illness Rating Scale-Geriatrics, and the Hospital Anxiety and Depression Scale. The total CISI-PD score displayed no floor or ceiling effects. Internal consistency was 0.81, the test-retest intraclass correlation coefficient was 0.84, and item homogeneity was 0.52. Exploratory and confirmatory factor analysis (CFI = 0.99, RMSEA = 0.07) confirmed CISI-PD's unifactorial structure. The CISI-PD showed adequate convergent validity with SCOPA-COG and SCOPA-M (r(S) = 0.46-0.85, respectively) and discriminative validity for HY stages and disease duration (P < 0.0001). In a multiple regression model, main CISI-PD predictors were SCOPA-M, disease duration, and depression. The results obtained were not only comparable to but also extended those yielded by the preliminary validation study, thus showing that the CISI-PD is a valid instrument to measure clinical impression of severity in PD. Its simplicity and easy application make it an attractive and useful tool for clinical practice and research.
Kruizinga, Ingrid; Jansen, Wilma; de Haan, Carolien L.; Raat, Hein
2012-01-01
Background The KIPPPI (Brief Instrument Psychological and Pedagogical Problem Inventory) is a Dutch questionnaire that measures psychosocial and pedagogical problems in 2-year olds and consists of a KIPPPI Total score, Wellbeing scale, Competence scale, and Autonomy scale. This study examined the reliability, validity, screening accuracy and clinical application of the KIPPPI. Methods Parents of 5959 2-year-old children in the Rotterdam area, the Netherlands, were invited to participate in the study. Parents of 3164 children (53.1% of all invited parents) completed the questionnaire. The internal consistency was evaluated and in subsamples the test-retest reliability and concurrent validity with regard to the Child Behavioral Checklist (CBCL). Discriminative validity was evaluated by comparing scores of parents who worried about their child’s upbringing and parent’s that did not. Screening accuracy of the KIPPPI was evaluated against the CBCL by calculating the Receiver Operating Characteristic (ROC) curves. The clinical application was evaluated by the relation between KIPPPI scores and the clinical decision made by the child health professionals. Results Psychometric properties of the KIPPPI Total score, Wellbeing scale, Competence scale and Autonomy scale were respectively: Cronbach’s alphas: 0.88, 0.86, 0.83, 0.58. Test-retest correlations: 0.80, 0.76, 0.73, 0.60. Concurrent validity was as hypothesised. The KIPPPI was able to discriminate between parents that worried about their child and parents that did not. Screening accuracy was high (>0.90) for the KIPPPI Total score and for the Wellbeing scale. The KIPPPI scale scores and clinical decision of the child health professional were related (p<0.05), indicating a good clinical application. Conclusion The results in this large-scale study of a diverse general population sample support the reliability, validity and clinical application of the KIPPPI Total score, Wellbeing scale and Competence scale. Also, the screening accuracy of the KIPPPI Total score and Wellbeing scale were supported. The Autonomy scale needs further study. PMID:23185388
López-Villalobos, José A; Andrés-De Llano, Jesús; López-Sánchez, María V; Rodríguez-Molinero, Luis; Garrido-Redondo, Mercedes; Sacristán-Martín, Ana M; Martínez-Rivera, María T; Alberola-López, Susana
2017-02-01
The aim of this research is to analyze Attention Deficit Hyperactivity Disorder Rating Scales IV (ADHD RS-IV) criteria validity and its clinical usefulness for the assessment of Attention Deficit Hyperactivity Disorder (ADHD) as a function of assessment method and age. A sample was obtained from an epidemiological study (n = 1095, 6-16 years). Clinical cases of ADHD (ADHD-CL) were selected by dimensional ADHD RS-IV and later by clinical interview (DSM-IV). ADHD-CL cases were compared with four categorical results of ADHD RS-IV provided by parents (CATPA), teachers (CATPR), either parents or teachers (CATPAOPR) and both parents and teachers (CATPA&PR). Criterion validity and clinical usefulness of the answer modalities to ADHD RS-IV were studied. ADHD-CL rate was 6.9% in childhood, 6.2% in preadolescence and 6.9% in adolescence. Alternative methods to the clinical interview led to increased numbers of ADHD cases in all age groups analyzed, in the following sequence: CATPAOPR> CATPRO> CATPA> CATPA&PR> ADHD-CL. CATPA&PR was the procedure with the greatest validity, specificity and clinical usefulness in all three age groups, particularly in the childhood. Isolated use of ADHD RS-IV leads to an increase in ADHD cases compared to clinical interview, and varies depending on the procedure used.
Reliable Digit Span: A Systematic Review and Cross-Validation Study
ERIC Educational Resources Information Center
Schroeder, Ryan W.; Twumasi-Ankrah, Philip; Baade, Lyle E.; Marshall, Paul S.
2012-01-01
Reliable Digit Span (RDS) is a heavily researched symptom validity test with a recent literature review yielding more than 20 studies ranging in dates from 1994 to 2011. Unfortunately, limitations within some of the research minimize clinical generalizability. This systematic review and cross-validation study was conducted to address these…
LeBeau, Richard T; Mesri, Bita; Craske, Michelle G
2016-10-30
With DSM-5, the APA began providing guidelines for anxiety disorder severity assessment that incorporates newly developed self-report scales. The scales share a common template, are brief, and are free of copyright restrictions. Initial validation studies have been promising, but the English-language versions of the scales have not been formally validated in clinical samples. Forty-seven individuals with a principal diagnosis of Social Anxiety Disorder (SAD) completed a diagnostic assessment, as well as the DSM-5 SAD severity scale and several previously validated measures. The scale demonstrated internal consistency, convergent validity, and discriminant validity. The next steps in the validation process are outlined. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Kim, Sun Hee; Yoo, So Yeon; Kim, Yae Young
2018-02-01
This study was conducted to evaluate the validity and reliability of the Korean version of the clinical learning environment, supervision and nurse teacher evaluation scale (CLES+T) that measures the clinical learning environment and the conditions associated with supervision and nurse teachers. The English CLES+T was translated into Korean with forward and back translation. Survey data were collected from 434 nursing students who had more than four days of clinical practice in Korean hospitals. Internal consistency reliability and construct validity using confirmatory and exploratory factor analysis were conducted. SPSS 20.0 and AMOS 22.0 programs were used for data analysis. The exploratory factor analysis revealed seven factors for the thirty three-item scale. Confirmatory factor analysis supported good convergent and discriminant validities. The Cronbach's alpha for the overall scale was .94 and for the seven subscales ranged from .78 to .94. The findings suggest that the 33-items Korean CLES+T is an appropriate instrument to measure Korean nursing students'clinical learning environment with good validity and reliability. © 2018 Korean Society of Nursing Science.
Ahmed, Adil; Vairavan, Srinivasan; Akhoundi, Abbasali; Wilson, Gregory; Chiofolo, Caitlyn; Chbat, Nicolas; Cartin-Ceba, Rodrigo; Li, Guangxi; Kashani, Kianoush
2015-10-01
Timely detection of acute kidney injury (AKI) facilitates prevention of its progress and potentially therapeutic interventions. The study objective is to develop and validate an electronic surveillance tool (AKI sniffer) to detect AKI in 2 independent retrospective cohorts of intensive care unit (ICU) patients. The primary aim is to compare the sensitivity, specificity, and positive and negative predictive values of AKI sniffer performance against a reference standard. This study is conducted in the ICUs of a tertiary care center. The derivation cohort study subjects were Olmsted County, MN, residents admitted to all Mayo Clinic ICUs from July 1, 2010, through December 31, 2010, and the validation cohort study subjects were all patients admitted to a Mayo Clinic, Rochester, campus medical/surgical ICU on January 12, 2010, through March 23, 2010. All included records were reviewed by 2 independent investigators who adjudicated AKI using the Acute Kidney Injury Network criteria; disagreements were resolved by a third reviewer. This constituted the reference standard. An electronic algorithm was developed; its precision and reliability were assessed in comparison with the reference standard in 2 separate cohorts, derivation and validation. Of 1466 screened patients, a total of 944 patients were included in the study: 482 for derivation and 462 for validation. Compared with the reference standard in the validation cohort, the sensitivity and specificity of the AKI sniffer were 88% and 96%, respectively. The Cohen κ (95% confidence interval) agreement between the electronic and the reference standard was 0.84 (0.78-0.89) and 0.85 (0.80-0.90) in the derivation and validation cohorts. Acute kidney injury can reliably and accurately be detected electronically in ICU patients. The presented method is applicable for both clinical (decision support) and research (enrollment for clinical trials) settings. Prospective validation is required. Copyright © 2015 Elsevier Inc. All rights reserved.
Vreugdenhil, Jettie; Spek, Bea
2018-03-01
Clinical reasoning in patient care is a skill that cannot be observed directly. So far, no reliable, valid instrument exists for the assessment of nursing students' clinical reasoning skills in hospital practice. Lasater's clinical judgment rubric (LCJR), based on Tanner's model "Thinking like a nurse" has been tested, mainly in academic simulation settings. The aim is to develop a Dutch version of the LCJR (D-LCJR) and to test its psychometric properties when used in a hospital traineeship context. A mixed-model approach was used to develop and to validate the instrument. Ten dedicated educational units in a university hospital. A well-mixed group of 52 nursing students, nurse coaches and nurse educators. A Delphi panel developed the D-LCJR. Students' clinical reasoning skills were assessed "live" by nurse coaches, nurse educators and students who rated themselves. The psychometric properties tested during the assessment process are reliability, reproducibility, content validity and construct validity by testing two hypothesis: 1) a positive correlation between assessed and self-reported sum scores (convergent validity) and 2) a linear relation between experience and sum score (clinical validity). The obtained D-LCJR was found to be internally consistent, Cronbach's alpha 0.93. The rubric is also reproducible with intraclass correlations between 0.69 and 0.78. Experts judged it to be content valid. The two hypothesis were both tested significant, supporting evidence for construct validity. The translated and modified LCJR, is a promising tool for the evaluation of nursing students' development in clinical reasoning in hospital traineeships, by students, nurse coaches and nurse educators. More evidence on construct validity is necessary, in particular for students at the end of their hospital traineeship. Based on our research, the D-LCJR applied in hospital traineeships is a usable and reliable tool. Copyright © 2017 Elsevier Ltd. All rights reserved.
Iglesias-Parra, Maria Rosa; García-Guerrero, Alfonso; García-Mayor, Silvia; Kaknani-Uttumchandani, Shakira; León-Campos, Álvaro; Morales-Asencio, José Miguel
2015-07-01
To develop an evaluation system of clinical competencies for the practicum of nursing students based on the Nursing Interventions Classification (NIC). Psychometric validation study: the first two phases addressed definition and content validation, and the third phase consisted of a cross-sectional study for analyzing reliability. The study population was undergraduate nursing students and clinical tutors. Through the Delphi technique, 26 competencies and 91 interventions were isolated. Cronbach's α was 0.96. Factor analysis yielded 18 factors that explained 68.82% of the variance. Overall inter-item correlation was 0.26, and total-item correlation ranged between 0.66 and 0.19. A competency system for the nursing practicum, structured on the NIC, is a reliable method for assessing and evaluating clinical competencies. Further evaluations in other contexts are needed. The availability of standardized language systems in the nursing discipline supposes an ideal framework to develop the nursing curricula. © 2015 Sigma Theta Tau International.
Clinical Validation of the "Sedentary Lifestyle" Nursing Diagnosis in Secondary School Students
ERIC Educational Resources Information Center
de Oliveira, Marcos Renato; da Silva, Viviane Martins; Guedes, Nirla Gomes; de Oliveira Lopes, Marcos Venícios
2016-01-01
This study clinically validated the nursing diagnosis of "sedentary lifestyle" (SL) among 564 Brazilian adolescents. Measures of diagnostic accuracy were calculated for defining characteristics, and Mantel--Haenszel analysis was used to identify related factors. The measures of diagnostic accuracy showed that the following defining…
Ustün, B; Compton, W; Mager, D; Babor, T; Baiyewu, O; Chatterji, S; Cottler, L; Göğüş, A; Mavreas, V; Peters, L; Pull, C; Saunders, J; Smeets, R; Stipec, M R; Vrasti, R; Hasin, D; Room, R; Van den Brink, W; Regier, D; Blaine, J; Grant, B F; Sartorius, N
1997-09-25
The WHO Study on the reliability and validity of the alcohol and drug use disorder instruments in an international study which has taken place in centres in ten countries, aiming to test the reliability and validity of three diagnostic instruments for alcohol and drug use disorders: the Composite International Diagnostic Interview (CIDI), the Schedules for Clinical Assessment in Neuropsychiatry (SCAN) and a special version of the Alcohol Use Disorder and Associated Disabilities Interview schedule-alcohol/drug-revised (AUDADIS-ADR). The purpose of the reliability and validity (R&V) study is to further develop the alcohol and drug sections of these instruments so that a range of substance-related diagnoses can be made in a systematic, consistent, and reliable way. The study focuses on new criteria proposed in the tenth revision of the International Classification of Diseases (ICD-10) and the fourth revision of the diagnostic and statistical manual of mental disorders (DSM-IV) for dependence, harmful use and abuse categories for alcohol and psychoactive substance use disorders. A systematic study including a scientifically rigorous measure of reliability (i.e. 1 week test-retest reliability) and validity (i.e. comparison between clinical and non-clinical measures) has been undertaken. Results have yielded useful information on reliability and validity of these instruments at diagnosis, criteria and question level. Overall the diagnostic concordance coefficients (kappa, kappa) were very good for dependence disorders (0.7-0.9), but were somewhat lower for the abuse and harmful use categories. The comparisons among instruments and independent clinical evaluations and debriefing interviews gave important information about possible sources of unreliability, and provided useful clues on the applicability and consistency of nosological concepts across cultures.
[Design and validation of an instrument to assess families at risk for health problems].
Puschel, Klaus; Repetto, Paula; Solar, María Olga; Soto, Gabriela; González, Karla
2012-04-01
There is a paucity of screening instruments with a high clinical predictive value to identify families at risk and therefore, develop focused interventions in primary care. To develop an easy to apply screening instrument with a high clinical predictive value to identify families with a higher health vulnerability. In the first stage of the study an instrument with a high content validity was designed through a review of existent instruments, qualitative interviews with families and expert opinions following a Delphi approach of three rounds. In the second stage, concurrent validity was tested through a comparative analysis between the pilot instrument and a family clinical interview conducted to 300 families randomly selected from a population registered at a primary care clinic in Santiago. The sampling was blocked based on the presence of diabetes, depression, child asthma, behavioral disorders, presence of an older person or the lack of previous conditions among family members. The third stage, was directed to test the clinical predictive validity of the instrument by comparing the baseline vulnerability obtained by the instrument and the change in clinical status and health related quality of life perceptions of the family members after nine months of follow-up. The final SALUFAM instrument included 13 items and had a high internal consistency (Cronbach's alpha: 0.821), high test re-test reproducibility (Pearson correlation: 0.84) and a high clinical predictive value for clinical deterioration (Odds ratio: 1.826; 95% confidence intervals: 1.101-3.029). SALUFAM instrument is applicable, replicable, has a high content validity, concurrent validity and clinical predictive value.
Keogh, Claire; Wallace, Emma; O’Brien, Kirsty K.; Galvin, Rose; Smith, Susan M.; Lewis, Cliona; Cummins, Anthony; Cousins, Grainne; Dimitrov, Borislav D.; Fahey, Tom
2014-01-01
PURPOSE We describe the methodology used to create a register of clinical prediction rules relevant to primary care. We also summarize the rules included in the register according to various characteristics. METHODS To identify relevant articles, we searched the MEDLINE database (PubMed) for the years 1980 to 2009 and supplemented the results with searches of secondary sources (books on clinical prediction rules) and personal resources (eg, experts in the field). The rules described in relevant articles were classified according to their clinical domain, the stage of development, and the clinical setting in which they were studied. RESULTS Our search identified clinical prediction rules reported between 1965 and 2009. The largest share of rules (37.2%) were retrieved from PubMed. The number of published rules increased substantially over the study decades. We included 745 articles in the register; many contained more than 1 clinical prediction rule study (eg, both a derivation study and a validation study), resulting in 989 individual studies. In all, 434 unique rules had gone through derivation; however, only 54.8% had been validated and merely 2.8% had undergone analysis of their impact on either the process or outcome of clinical care. The rules most commonly pertained to cardiovascular disease, respiratory, and musculoskeletal conditions. They had most often been studied in the primary care or emergency department settings. CONCLUSIONS Many clinical prediction rules have been derived, but only about half have been validated and few have been assessed for clinical impact. This lack of thorough evaluation for many rules makes it difficult to retrieve and identify those that are ready for use at the point of patient care. We plan to develop an international web-based register of clinical prediction rules and computer-based clinical decision support systems. PMID:25024245
Development and Validation of the Body-Focused Shame and Guilt Scale
Weingarden, Hilary; Renshaw, Keith D.; Tangney, June P.; Wilhelm, Sabine
2015-01-01
Body shame is described as central in clinical literature on body dysmorphic disorder (BDD). However, empirical investigations of body shame within BDD are rare. One potential reason for the scarcity of such research may be that existing measures of body shame focus on eating and weight-based content. Within BDD, however, body shame likely focuses more broadly on shame felt in response to perceived appearance flaws in one’s body parts. We describe the development and validation of the Body-Focused Shame and Guilt Scale (BF-SGS), a measure of BDD-relevant body shame, across two studies: a two time-point study of undergraduates, and a follow-up study in two Internet-recruited clinical samples (BDD, obsessive compulsive disorder) and healthy controls. Across both studies, the BF-SGS shame subscale demonstrated strong reliability and construct validity, with Study 2 providing initial clinical norms. PMID:26640760
Lievaart, Marien; Franken, Ingmar H A; Hovens, Johannes E
2016-03-01
The most commonly used instrument for measuring anger is the State-Trait Anger Expression Inventory-2 (STAXI-2; Spielberger, 1999). This study further examines the validity of the STAXI-2 and compares anger scores between several clinical and nonclinical samples. Reliability, concurrent, and construct validity were investigated in Dutch undergraduate students (N = 764), a general population sample (N = 1211), and psychiatric outpatients (N = 226). The results support the reliability and validity of the STAXI-2. Concurrent validity was strong, with meaningful correlations between the STAXI-2 scales and anger-related constructs in both clinical and nonclinical samples. Importantly, patients showed higher experience and expression of anger than the general population sample. Additionally, forensic outpatients with addiction problems reported higher Anger Expression-Out than general psychiatric outpatients. Our conclusion is that the STAXI-2 is a suitable instrument to measure both the experience and the expression of anger in both general and clinical populations. © 2016 Wiley Periodicals, Inc.
An NCI-FDA Interagency Oncology Task Force (IOTF) Molecular Diagnostics Workshop was held on October 30, 2008 in Cambridge, MA, to discuss requirements for analytical validation of protein-based multiplex technologies in the context of its intended use. This workshop developed through NCI's Clinical Proteomic Technologies for Cancer initiative and the FDA focused on technology-specific analytical validation processes to be addressed prior to use in clinical settings. In making this workshop unique, a case study approach was used to discuss issues related to
ERIC Educational Resources Information Center
Abdekhodaie, Zahra; Tabatabaei, Seyed Mahmood; Gholizadeh, Mortaza
2012-01-01
In this study, the prevalence of attention-deficit hyperactivity disorder (ADHD) in kindergarten children in northeast Iran was investigated, and the criterion validity of Conners' parent-teacher questionnaire was evaluated through the use of clinical interviews. This study was a cross-sectional descriptive research project with children in…
ERIC Educational Resources Information Center
Carlson, Thomas Stone; McGeorge, Christi R.; Toomey, Russell B.
2013-01-01
This study established the validity and factor structure of the Affirmative Training Inventory (ATI; T. S. Carlson, C. R. McGeorge & M. Rock, unpublished) as a measure of lesbian, gay, and bisexual (LGB) affirmative clinical training. Additionally, this study examined the latent associations among the subscales of the ATI and the Sexual…
The use of video clips in teleconsultation for preschool children with movement disorders.
Gorter, Hetty; Lucas, Cees; Groothuis-Oudshoorn, Karin; Maathuis, Carel; van Wijlen-Hempel, Rietje; Elvers, Hans
2013-01-01
To investigate the reliability and validity of video clips in assessing movement disorders in preschool children. The study group included 27 children with neuromotor concerns. The explorative validity group included children with motor problems (n = 21) or with typical development (n = 9). Hempel screening was used for live observation of the child, full recording, and short video clips. The explorative study tested the validity of the clinical classifications "typical" or "suspect." Agreement between live observation and the full recording was almost perfect; Agreement for the clinical classification "typical" or "suspect" was substantial. Agreement between the full recording and short video clips was substantial to moderate. The explorative validity study, based on short video clips and the presence of a neuromotor developmental disorder, showed substantial agreement. Hempel screening enables reliable and valid observation of video clips, but further research is necessary to demonstrate the predictive value.
Kossioni, A E; Lyrakos, G; Ntinalexi, I; Varela, R; Economu, I
2014-05-01
The aim of this study was to develop and validate according to psychometric standards a self-administered instrument to measure the students' self-perceptions of the undergraduate clinical dental environment (DECLEI). The initial questionnaire was developed using feedback from dental students, experts' opinion and an extensive literature review. Critical incident technique (CIT) analysis was used to generate items and identify domains. Thirty clinical dental students participated in a pilot validation that generated a 67-item questionnaire. To develop a shorter and more practical version of the instrument, DECLEI-67 was distributed to 153 clinical students at the University of Athens and its English version to 51 students from various dental schools, attending the 2012 European Dental Students Association meeting. This final procedure aimed to select items, identify subscales and measure internal consistency and discriminant validity. A total of 202 students returned the questionnaires (response rate 99%). The final instrument included 24 items divided into three subscales: (i) organisation and learning opportunities, (ii) professionalism and communication and (iii) satisfaction and commitment to the dental studies. Cronbach's α for the total questionnaire was 0.89. The interscale correlations ranged from 0.39 to 0.48. The instrument identified differences related to school of origin, age and duration of clinical experience. An interpretation of the scores (range 0–100) has been proposed. The 24-item DECLEI seemed to be a practical and valid instrument to measure a dental school's undergraduate clinical learning environment.
Dudas, Robert A; Colbert, Jorie M; Goldstein, Seth; Barone, Michael A
2012-01-01
Medical knowledge is one of six core competencies in medicine. Medical student assessments should be valid and reliable. We assessed the relationship between faculty and resident global assessment of pediatric medical student knowledge and performance on a standardized test in medical knowledge. Retrospective cross-sectional study of medical students on a pediatric clerkship in academic year 2008-2009 at one academic health center. Faculty and residents rated students' clinical knowledge on a 5-point Likert scale. The inter-rater reliability of clinical knowledge ratings was assessed by calculating the intra-class correlation coefficient (ICC) for residents' ratings, faculty ratings, and both rating types combined. Convergent validity between clinical knowledge ratings and scores on the National Board of Medical Examiners (NBME) clinical subject examination in pediatrics was assessed with Pearson product moment correlation correction and the coefficient of the determination. There was moderate agreement for global clinical knowledge ratings by faculty and moderate agreement for ratings by residents. The agreement was also moderate when faculty and resident ratings were combined. Global ratings of clinical knowledge had high convergent validity with pediatric examination scores when students were rated by both residents and faculty. Our findings provide evidence for convergent validity of global assessment of medical students' clinical knowledge with NBME subject examination scores in pediatrics. Copyright © 2012 Academic Pediatric Association. Published by Elsevier Inc. All rights reserved.
The Clinical Interview Schedule-Revised (CIS-R)-Malay Version, Clinical Validation.
Subramaniam, Kavitha; Krishnaswamy, Saroja; Jemain, Abdul Aziz; Hamid, Abdul; Patel, Vikram
2006-01-01
Use of instruments or questionnaires in different cultural settings without proper validation can result in inaccurate results. Issues like reliability, validity, feasibility and acceptability should be considered in the use of an instrument. The study aims to determine the usefulness of the CIS-R Malay version in detecting common mental health problems specifically to establish the validity. The CIS-R instrument (PROQSY* format) was translated through the back translation process into Malay. Inter rater reliability was established for raters who were medical students. Cases and controls for the study were psychiatric in patients, out patient and relatives or friends accompanying the patients to the clinic or visiting the inpatients. The Malay version of CIS-R was administered to all cases and controls. All cases and controls involved in the study were rated by psychiatrists for psychiatric morbidity using the SCID as a guideline. Specificity and sensitivity of the CIS-R to the assessment by the psychiatrist were determined. The Malay version of CIS-R showed 100% sensitivity and 96.15% specificity at a cut off score of 9. The CIS-R can be a useful instrument for clinical and research use in the Malaysian population for diagnosing common mental disorders like depression and anxiety.
Mikkonen, Kristina; Elo, Satu; Miettunen, Jouko; Saarikoski, Mikko; Kääriäinen, Maria
2017-08-01
The purpose of this study was to develop and test the psychometric properties of the new Cultural and Linguistic Diversity scale, which is designed to be used with the newly validated Clinical Learning Environment, Supervision and Nurse Teacher scale for assessing international nursing students' clinical learning environments. In various developed countries, clinical placements are known to present challenges in the professional development of international nursing students. A cross-sectional survey. Data were collected from eight Finnish universities of applied sciences offering nursing degree courses taught in English during 2015-2016. All the relevant students (N = 664) were invited and 50% chose to participate. Of the total data submitted by the participants, 28% were used for scale validation. The construct validity of the two scales was tested by exploratory factor analysis, while their validity with respect to convergence and discriminability was assessed using Spearman's correlation. Construct validation of the Clinical Learning Environment, Supervision and Nurse Teacher scale yielded an eight-factor model with 34 items, while validation of the Cultural and Linguistic Diversity scale yielded a five-factor model with 21 items. A new scale was developed to improve evidence-based mentorship of international nursing students in clinical learning environments. The instrument will be useful to educators seeking to identify factors that affect the learning of international students. © 2017 John Wiley & Sons Ltd.
Biomarkers and Surrogate Endpoints: Lessons Learned From Glaucoma
Medeiros, Felipe A.
2017-01-01
With the recent progress in imaging technologies for assessment of structural damage in glaucoma, a debate has emerged on whether these measurements can be used as valid surrogate endpoints in clinical trials evaluating new therapies for the disease. A discussion of surrogates should be grounded on knowledge acquired from their use in other areas of medicine as well as regulatory requirements. This article reviews the conditions for valid surrogacy in the context of glaucoma clinical trials and critically evaluates the role of biomarkers such as IOP and imaging measurements as potential surrogates for clinically relevant outcomes. Valid surrogate endpoints must be able to predict a clinically relevant endpoint, such as loss of vision or decrease in quality of life. In addition, the effect of a proposed treatment on the surrogate must capture the effect of the treatment on the clinically relevant endpoint. Despite its widespread use in clinical trials, no proper validation of IOP as a surrogate endpoint has yet been conducted for any class of IOP-lowering treatments. Although strong evidence has accumulated about imaging measurements as predictors of relevant functional outcomes in glaucoma, there is still insufficient evidence to support their use as valid surrogate endpoints. However, imaging biomarkers could potentially be used as part of composite endpoints in glaucoma trials, overcoming weaknesses of the use of structural or functional endpoints in isolation. Efforts should be taken to properly design and conduct studies that can provide proper validation of potential biomarkers in glaucoma clinical trials. PMID:28475699
Biomarkers and Surrogate Endpoints: Lessons Learned From Glaucoma.
Medeiros, Felipe A
2017-05-01
With the recent progress in imaging technologies for assessment of structural damage in glaucoma, a debate has emerged on whether these measurements can be used as valid surrogate endpoints in clinical trials evaluating new therapies for the disease. A discussion of surrogates should be grounded on knowledge acquired from their use in other areas of medicine as well as regulatory requirements. This article reviews the conditions for valid surrogacy in the context of glaucoma clinical trials and critically evaluates the role of biomarkers such as IOP and imaging measurements as potential surrogates for clinically relevant outcomes. Valid surrogate endpoints must be able to predict a clinically relevant endpoint, such as loss of vision or decrease in quality of life. In addition, the effect of a proposed treatment on the surrogate must capture the effect of the treatment on the clinically relevant endpoint. Despite its widespread use in clinical trials, no proper validation of IOP as a surrogate endpoint has yet been conducted for any class of IOP-lowering treatments. Although strong evidence has accumulated about imaging measurements as predictors of relevant functional outcomes in glaucoma, there is still insufficient evidence to support their use as valid surrogate endpoints. However, imaging biomarkers could potentially be used as part of composite endpoints in glaucoma trials, overcoming weaknesses of the use of structural or functional endpoints in isolation. Efforts should be taken to properly design and conduct studies that can provide proper validation of potential biomarkers in glaucoma clinical trials.
Falasinnu, Titilola; Gilbert, Mark; Gustafson, Paul; Shoveller, Jean
2016-02-01
One component of effective sexually transmitted infections (STIs) control is ensuring those at highest risk of STIs have access to clinical services because terminating transmission in this group will prevent most future cases. Here, we describe the results of a validation study of a clinical prediction rule for identifying individuals at increased risk for chlamydia and gonorrhoea infection derived in Vancouver, British Columbia (BC), against a population of asymptomatic patients attending sexual health clinics in other geographical settings in BC. We examined electronic records (2000-2012) from clinic visits at seven sexual health clinics in geographical locations outside Vancouver. The model's calibration and discrimination were examined by the area under the receiver operating characteristic curve (AUC) and the Hosmer-Lemeshow (H-L) statistic, respectively. We also examined the sensitivity and proportion of patients that would need to be screened at different cut-offs of the risk score. The prevalence of infection was 5.3% (n=10 425) in the geographical validation population. The prediction rule showed good performance in this population (AUC, 0.69; H-L p=0.26). Possible risk scores ranged from -2 to 27. We identified a risk score cut-off point of ≥8 that detected cases with a sensitivity of 86% by screening 63% of the geographical validation population. The prediction rule showed good generalisability in STI clinics outside of Vancouver with improved discriminative performance compared with temporal validation. The prediction rule has the potential for augmenting triaging services in STI clinics and enhancing targeted testing in population-based screening programmes. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://www.bmj.com/company/products-services/rights-and-licensing/
Developing an instrument to measure effective factors on Clinical Learning
DADGARAN, IDEH; SHIRAZI, MANDANA; MOHAMMADI, AEEN; RAVARI, ALI
2016-01-01
Introduction Although nursing students spend a large part of their learning period in the clinical environment, clinical learning has not been perceived by its nature yet. To develop an instrument to measure effective factors on clinical learning in nursing students. Methods This is a mixed methods study performed in 2 steps. First, the researchers defined “clinical learning” in nursing students through qualitative content analysis and designed items of the questionnaire based on semi-structured individual interviews with nursing students. Then, as the second step, psychometric properties of the questionnaire were evaluated using the face validity, content validity, construct validity, and internal consistency evaluated on 227 students from fourth or higher semesters. All the interviews were recorded and transcribed, and then, they were analyzed using Max Qualitative Data Analysis and all of qualitative data were analyzed using SPSS 14. Results To do the study, we constructed the preliminary questionnaire containing 102 expressions. After determination of face and content validities by qualitative and quantitative approaches, the expressions of the questionnaire were reduced to 45. To determine the construct validity, exploratory factor analysis was applied. The results indicated that the maximum variance percentage (40.55%) was defined by the first 3 factors while the rest of the total variance percentage (59.45%) was determined by the other 42 factors. Results of exploratory factor analysis of this questionnaire indicated the presence of 3 instructor-staff, students, and educational related factors. Finally, 41 expressions were kept in 3 factor groups. The α-Cronbach coefficient (0.93) confirmed the high internal consistency of the questionnaire. Conclusion Results indicated that the prepared questionnaire was an efficient instrument in the study of the effective factors on clinical learning as viewed by nursing students since it involves 41 expressions and properties such as instrument design based on perception and experiences of the nursing students about effective factors on clinical learning, definition of facilitator and preventive factors of the clinical learning, simple scoring, suitable validity and reliability, and applicability in different occasions. PMID:27382579
Riley, Richard D.
2017-01-01
An important question for clinicians appraising a meta‐analysis is: are the findings likely to be valid in their own practice—does the reported effect accurately represent the effect that would occur in their own clinical population? To this end we advance the concept of statistical validity—where the parameter being estimated equals the corresponding parameter for a new independent study. Using a simple (‘leave‐one‐out’) cross‐validation technique, we demonstrate how we may test meta‐analysis estimates for statistical validity using a new validation statistic, Vn, and derive its distribution. We compare this with the usual approach of investigating heterogeneity in meta‐analyses and demonstrate the link between statistical validity and homogeneity. Using a simulation study, the properties of Vn and the Q statistic are compared for univariate random effects meta‐analysis and a tailored meta‐regression model, where information from the setting (included as model covariates) is used to calibrate the summary estimate to the setting of application. Their properties are found to be similar when there are 50 studies or more, but for fewer studies Vn has greater power but a higher type 1 error rate than Q. The power and type 1 error rate of Vn are also shown to depend on the within‐study variance, between‐study variance, study sample size, and the number of studies in the meta‐analysis. Finally, we apply Vn to two published meta‐analyses and conclude that it usefully augments standard methods when deciding upon the likely validity of summary meta‐analysis estimates in clinical practice. © 2017 The Authors. Statistics in Medicine published by John Wiley & Sons Ltd. PMID:28620945
Cohen Freue, Gabriela V.; Meredith, Anna; Smith, Derek; Bergman, Axel; Sasaki, Mayu; Lam, Karen K. Y.; Hollander, Zsuzsanna; Opushneva, Nina; Takhar, Mandeep; Lin, David; Wilson-McManus, Janet; Balshaw, Robert; Keown, Paul A.; Borchers, Christoph H.; McManus, Bruce; Ng, Raymond T.; McMaster, W. Robert
2013-01-01
Recent technical advances in the field of quantitative proteomics have stimulated a large number of biomarker discovery studies of various diseases, providing avenues for new treatments and diagnostics. However, inherent challenges have limited the successful translation of candidate biomarkers into clinical use, thus highlighting the need for a robust analytical methodology to transition from biomarker discovery to clinical implementation. We have developed an end-to-end computational proteomic pipeline for biomarkers studies. At the discovery stage, the pipeline emphasizes different aspects of experimental design, appropriate statistical methodologies, and quality assessment of results. At the validation stage, the pipeline focuses on the migration of the results to a platform appropriate for external validation, and the development of a classifier score based on corroborated protein biomarkers. At the last stage towards clinical implementation, the main aims are to develop and validate an assay suitable for clinical deployment, and to calibrate the biomarker classifier using the developed assay. The proposed pipeline was applied to a biomarker study in cardiac transplantation aimed at developing a minimally invasive clinical test to monitor acute rejection. Starting with an untargeted screening of the human plasma proteome, five candidate biomarker proteins were identified. Rejection-regulated proteins reflect cellular and humoral immune responses, acute phase inflammatory pathways, and lipid metabolism biological processes. A multiplex multiple reaction monitoring mass-spectrometry (MRM-MS) assay was developed for the five candidate biomarkers and validated by enzyme-linked immune-sorbent (ELISA) and immunonephelometric assays (INA). A classifier score based on corroborated proteins demonstrated that the developed MRM-MS assay provides an appropriate methodology for an external validation, which is still in progress. Plasma proteomic biomarkers of acute cardiac rejection may offer a relevant post-transplant monitoring tool to effectively guide clinical care. The proposed computational pipeline is highly applicable to a wide range of biomarker proteomic studies. PMID:23592955
Construct Validity of the Anxiety Sensitivity Index-3 in Clinical Samples
ERIC Educational Resources Information Center
Kemper, Christoph J.; Lutz, Johannes; Bahr, Tobias; Ruddel, Heinz; Hock, Michael
2012-01-01
Using two clinical samples of patients, the presented studies examined the construct validity of the recently revised Anxiety Sensitivity Index-3 (ASI-3). Confirmatory factor analyses established a clear three-factor structure that corresponds to the postulated subdivision of the construct into correlated somatic, social, and cognitive components.…
[Development of Autogenic Training Clinical Effectiveness Scale (ATCES)].
Ikezuki, Makoto; Miyauchi, Yuko; Yamaguchi, Hajime; Koshikawa, Fusako
2002-02-01
The purpose of the present study was to develop a scale measuring clinical effectiveness of autogenic training. In Study 1, 167 undergraduates completed a survey of items concerning physical and mental states, which were thought to vary in the course of autogenic training. With item and factor analyses, 20 items were selected, and the resulting scale (ATCES) had high discrimination and clear factor structure. In Study 2, reliability and concurrent and clinical validity of the scale were examined with three groups of respondents: 85 mentally healthy, 31 control, 13 clinical persons. The scale showed a high test-retest correlation (r = .83) and alpha coefficient (alpha = .86). ATCES had a Pearson correlation coefficient of r = .56 with General Health Questionnaire (GHQ-12), and r = .73 with trait anxiety (STAI-T). And ATCES successfully discriminated the mentally healthy and clinical groups in terms of clinical effectiveness. These results demonstrated high reliability and sufficient concurrent and clinical validity of the new scale.
Validity and reliability of the NAB Naming Test.
Sachs, Bonnie C; Rush, Beth K; Pedraza, Otto
2016-05-01
Confrontation naming is commonly assessed in neuropsychological practice, but few standardized measures of naming exist and those that do are susceptible to the effects of education and culture. The Neuropsychological Assessment Battery (NAB) Naming Test is a 31-item measure used to assess confrontation naming. Despite adequate psychometric information provided by the test publisher, there has been limited independent validation of the test. In this study, we investigated the convergent and discriminant validity, internal consistency, and alternate forms reliability of the NAB Naming Test in a sample of adults (Form 1: n = 247, Form 2: n = 151) clinically referred for neuropsychological evaluation. Results indicate adequate-to-good internal consistency and alternate forms reliability. We also found strong convergent validity as demonstrated by relationships with other neurocognitive measures. We found preliminary evidence that the NAB Naming Test demonstrates a more pronounced ceiling effect than other commonly used measures of naming. To our knowledge, this represents the largest published independent validation study of the NAB Naming Test in a clinical sample. Our findings suggest that the NAB Naming Test demonstrates adequate validity and reliability and merits consideration in the test arsenal of clinical neuropsychologists.
Development and evaluation of an automated fall risk assessment system.
Lee, Ju Young; Jin, Yinji; Piao, Jinshi; Lee, Sun-Mi
2016-04-01
Fall risk assessment is the first step toward prevention, and a risk assessment tool with high validity should be used. This study aimed to develop and validate an automated fall risk assessment system (Auto-FallRAS) to assess fall risks based on electronic medical records (EMRs) without additional data collected or entered by nurses. This study was conducted in a 1335-bed university hospital in Seoul, South Korea. The Auto-FallRAS was developed using 4211 fall-related clinical data extracted from EMRs. Participants included fall patients and non-fall patients (868 and 3472 for the development study; 752 and 3008 for the validation study; and 58 and 232 for validation after clinical application, respectively). The system was evaluated for predictive validity and concurrent validity. The final 10 predictors were included in the logistic regression model for the risk-scoring algorithm. The results of the Auto-FallRAS were shown as high/moderate/low risk on the EMR screen. The predictive validity analyzed after clinical application of the Auto-FallRAS was as follows: sensitivity = 0.95, NPV = 0.97 and Youden index = 0.44. The validity of the Morse Fall Scale assessed by nurses was as follows: sensitivity = 0.68, NPV = 0.88 and Youden index = 0.28. This study found that the Auto-FallRAS results were better than were the nurses' predictions. The advantage of the Auto-FallRAS is that it automatically analyzes information and shows patients' fall risk assessment results without requiring additional time from nurses. © The Author 2016. Published by Oxford University Press in association with the International Society for Quality in Health Care; all rights reserved.
Liebe, J D; Hübner, U; Straede, M C; Thye, J
2015-01-01
Availability and usage of individual IT applications have been studied intensively in the past years. Recently, IT support of clinical processes is attaining increasing attention. The underlying construct that describes the IT support of clinical workflows is clinical information logistics. This construct needs to be better understood, operationalised and measured. It is therefore the aim of this study to propose and develop a workflow composite score (WCS) for measuring clinical information logistics and to examine its quality based on reliability and validity analyses. We largely followed the procedural model of MacKenzie and colleagues (2011) for defining and conceptualising the construct domain, for developing the measurement instrument, assessing the content validity, pretesting the instrument, specifying the model, capturing the data and computing the WCS and testing the reliability and validity. Clinical information logistics was decomposed into the descriptors data and information, function, integration and distribution, which embraced the framework validated by an analysis of the international literature. This framework was refined selecting representative clinical processes. We chose ward rounds, pre- and post-surgery processes and discharge as sample processes that served as concrete instances for the measurements. They are sufficiently complex, represent core clinical processes and involve different professions, departments and settings. The score was computed on the basis of data from 183 hospitals of different size, ownership, location and teaching status. Testing the reliability and validity yielded encouraging results: the reliability was high with r(split-half) = 0.89, the WCS discriminated between groups; the WCS correlated significantly and moderately with two EHR models and the WCS received good evaluation results by a sample of chief information officers (n = 67). These findings suggest the further utilisation of the WCS. As the WCS does not assume ideal workflows as a gold standard but measures IT support of clinical workflows according to validated descriptors a high portability of the WCS to other hospitals in other countries is very likely. The WCS will contribute to a better understanding of the construct clinical information logistics.
2016 Revisions to the 2010/2011 fibromyalgia diagnostic criteria.
Wolfe, Frederick; Clauw, Daniel J; Fitzcharles, Mary-Ann; Goldenberg, Don L; Häuser, Winfried; Katz, Robert L; Mease, Philip J; Russell, Anthony S; Russell, Irwin Jon; Walitt, Brian
2016-12-01
The provisional criteria of the American College of Rheumatology (ACR) 2010 and the 2011 self-report modification for survey and clinical research are widely used for fibromyalgia diagnosis. To determine the validity, usefulness, potential problems, and modifications required for the criteria, we assessed multiple research reports published in 2010-2016 in order to provide a 2016 update to the criteria. We reviewed 14 validation studies that compared 2010/2011 criteria with ACR 1990 classification and clinical criteria, as well as epidemiology, clinical, and databank studies that addressed important criteria-level variables. Based on definitional differences between 1990 and 2010/2011 criteria, we interpreted 85% sensitivity and 90% specificity as excellent agreement. Against 1990 and clinical criteria, the median sensitivity and specificity of the 2010/2011 criteria were 86% and 90%, respectively. The 2010/2011 criteria led to misclassification when applied to regional pain syndromes, but when a modified widespread pain criterion (the "generalized pain criterion") was added misclassification was eliminated. Based on the above data and clinic usage data, we developed a (2016) revision to the 2010/2011 fibromyalgia criteria. Fibromyalgia may now be diagnosed in adults when all of the following criteria are met: CONCLUSIONS: The fibromyalgia criteria have good sensitivity and specificity. This revision combines physician and questionnaire criteria, minimizes misclassification of regional pain disorders, and eliminates the previously confusing recommendation regarding diagnostic exclusions. The physician-based criteria are valid for individual patient diagnosis. The self-report version of the criteria is not valid for clinical diagnosis in individual patients but is valid for research studies. These changes allow the criteria to function as diagnostic criteria, while still being useful for classification. Copyright © 2016 Elsevier Inc. All rights reserved.
2016-10-01
Study (PASS). We are in the process of evaluating these three biomarker panels in tissue, blood, and urine samples with well annotated clinical and...impacting both the initial choice of therapy and decision-making during AS. The objective of the study is to utilize analytically validated assays that...predict reclassification from Gleason 6 cancer to Gleason 7 or greater. The analysis plan was determined before specimens were selected for the study
2016-10-01
Study (PASS). We are in the process of evaluating these three biomarker panels in tissue, blood, and urine samples with well annotated clinical and...choice of therapy and decision-making during AS. The objective of the study is to utilize analytically validated assays that take into account tumor...Gleason 6 cancer to Gleason 7 or greater. The analysis plan was determined before specimens were selected for the study , and included 7 breaking
Place, Skyler; Rubin, Channah; Gorrostieta, Cristina; Mead, Caroline; Kane, John; Marx, Brian P; Feast, Joshua; Deckersbach, Thilo; Pentland, Alex “Sandy”; Nierenberg, Andrew; Azarbayejani, Ali
2017-01-01
Background There is a critical need for real-time tracking of behavioral indicators of mental disorders. Mobile sensing platforms that objectively and noninvasively collect, store, and analyze behavioral indicators have not yet been clinically validated or scalable. Objective The aim of our study was to report on models of clinical symptoms for post-traumatic stress disorder (PTSD) and depression derived from a scalable mobile sensing platform. Methods A total of 73 participants (67% [49/73] male, 48% [35/73] non-Hispanic white, 33% [24/73] veteran status) who reported at least one symptom of PTSD or depression completed a 12-week field trial. Behavioral indicators were collected through the noninvasive mobile sensing platform on participants’ mobile phones. Clinical symptoms were measured through validated clinical interviews with a licensed clinical social worker. A combination hypothesis and data-driven approach was used to derive key features for modeling symptoms, including the sum of outgoing calls, count of unique numbers texted, absolute distance traveled, dynamic variation of the voice, speaking rate, and voice quality. Participants also reported ease of use and data sharing concerns. Results Behavioral indicators predicted clinically assessed symptoms of depression and PTSD (cross-validated area under the curve [AUC] for depressed mood=.74, fatigue=.56, interest in activities=.75, and social connectedness=.83). Participants reported comfort sharing individual data with physicians (Mean 3.08, SD 1.22), mental health providers (Mean 3.25, SD 1.39), and medical researchers (Mean 3.03, SD 1.36). Conclusions Behavioral indicators passively collected through a mobile sensing platform predicted symptoms of depression and PTSD. The use of mobile sensing platforms can provide clinically validated behavioral indicators in real time; however, further validation of these models and this platform in large clinical samples is needed. PMID:28302595
Mangueira, Suzana de Oliveira; Lopes, Marcos Venícios de Oliveira
2016-10-01
To evaluate the clinical validity indicators for the nursing diagnosis of dysfunctional family processes related to alcohol abuse. Alcoholism is a chronic disease that negatively affects family relationships. Studies on the nursing diagnosis of dysfunctional family processes are scarce in the literature. This diagnosis is currently composed of 115 defining characteristics, hindering their use in practice and highlighting the need for clinical validation. This was a diagnostic accuracy study. A sample of 110 alcoholics admitted to a reference centre for alcohol treatment was assessed during the second half of 2013 for the presence or absence of the defining characteristics of the diagnosis. Operational definitions were created for each defining characteristic based on concept analysis and experts evaluated the content of these definitions. Diagnostic accuracy measures were calculated from latent class models with random effects. All 89 clinical indicators were found in the sample and a set of 24 clinical indicators was identified as clinically valid for a diagnostic screening for family dysfunction from the report of alcoholics. Main clinical indicators with high specificity included sexual abuse, disturbance in academic performance in children and manipulation. The main indicators that showed high sensitivity values were distress, loss, anxiety, low self-esteem, confusion, embarrassment, insecurity, anger, loneliness, deterioration in family relationships and disturbance in family dynamics. Eighteen clinical indicators showed a high capacity for diagnostic screening for alcoholics (high sensitivity) and six indicators can be used for confirmatory diagnosis (high specificity). © 2016 John Wiley & Sons Ltd.
[Psychometric Characteristics of the Clinical Nursing Mentors' Behavior Scale].
Zhao, Rong; Chen, Yan-Hua; Yu, Hui-Ting; Xiao, Lu; Wen, Jing; Yeh, Tzu-Pei
2017-08-01
The behavior of mentors impacts the quality and experience of nursing students who are studying in clinical placement. Accurately assessing the behavior of mentors is fundamental to training, regulating, guiding, and improving their behavior and quality of teaching. To test the validity and reliability of the Clinical Nursing Mentors' Behavior Scale (CNMBS) among mentors. This study included three stages. During the first stage, seven Chinese experts were invited to evaluate content validity. During the second stage, the test-retest reliability was examined with 63 mentors. During the third stage, a cross-sectional study was conducted. Seven hundred and sixty-six nursing mentors from five hospitals in Beijing, Shenzhen, and Sichuan completed the survey either online or in hard copy form. The data collected from the questionnaire were analyzed using item analysis, construct validity, internal consistency and discriminant validity, with the results used to determine the psychometric characteristics of the CNMBS. The content validity index for the CNMBS was .91. The intra-class correlation coefficient was .89; the range of the item discrimination critical ratio was 9.42-22.43 (p < .001), and the item-total correlation was .35- .70 (p < .001). The three factors of "guiding personal growth", "promoting professional development", and "providing psychosocial support" and a total of 23 items were identified, with item factor loadings ranging from .51 to .79. The three factors explained 50.99% of total variance. The internal consistency of the CNMBS earned a Cronbach's α coefficient of .92, while those of the three subscales were .89, .86 and .75, respectively. The Clinical Nursing Mentors' Behavior Scale demonstrated high validity and reliability, supporting the CNMBS as a valid tool for assessing the teaching behavior of mentors.
Sfendla, Anis; Zouini, Btissame; Lemrani, Dina; Berman, Anne H; Senhaji, Meftaha; Kerekes, Nóra
2017-04-01
The study aimed to validate the Arabic version of the Drug Use Disorders Identification Test (DUDIT) by (1) assessing its factor structure, (2) determining structural validity, (3) evaluating item-total and inter-item correlation, and (4) assessing its predictive validity. The study population included 169 prison inmates, 51 patients with clinical diagnosis of substance used disorder, and 53 students (N = 273). All participants completed the self-report version of the Arabic DUDIT. After exploratory factor analysis, internal consistency of the Arabic DUDIT was determined and external validation was performed. Principal factor analysis showed that Arabic DUDIT exhibited only one factor, which explained 66.9% of the variance. Reliability based on Cronbach's alpha was .95. When compared to the DSM-IV substance use disorder diagnosis in a clinical sample, DUDIT had an area under the curve (AUC) of .98, with a sensitivity of .98 and a specificity of .90. The Arabic version of DUDIT is a valid and reliable tool for screening for drug use in Arabic-speaking countries.
Grooten, Wilhelmus Johannes Andreas; Sandberg, Lisa; Ressman, John; Diamantoglou, Nicolas; Johansson, Elin; Rasmussen-Barr, Eva
2018-01-08
Clinical examinations are subjective and often show a low validity and reliability. Objective and highly reliable quantitative assessments are available in laboratory settings using 3D motion analysis, but these systems are too expensive to use for simple clinical examinations. Qinematic™ is an interactive movement analyses system based on the Kinect camera and is an easy-to-use clinical measurement system for assessing posture, balance and side-bending. The aim of the study was to test the test-retest the reliability and construct validity of Qinematic™ in a healthy population, and to calculate the minimal clinical differences for the variables of interest. A further aim was to identify the discriminative validity of Qinematic™ in people with low-back pain (LBP). We performed a test-retest reliability study (n = 37) with around 1 week between the occasions, a construct validity study (n = 30) in which Qinematic™ was tested against a 3D motion capture system, and a discriminative validity study, in which a group of people with LBP (n = 20) was compared to healthy controls (n = 17). We tested a large range of psychometric properties of 18 variables in three sections: posture (head and pelvic position, weight distribution), balance (sway area and velocity in single- and double-leg stance), and side-bending. The majority of the variables in the posture and balance sections, showed poor/fair reliability (ICC < 0.4) and poor/fair validity (Spearman <0.4), with significant differences between occasions, between Qinematic™ and the 3D-motion capture system. In the clinical study, Qinematic™ did not differ between people with LPB and healthy for these variables. For one variable, side-bending to the left, there was excellent reliability (ICC =0.898), excellent validity (r = 0.943), and Qinematic™ could differentiate between LPB and healthy individuals (p = 0.012). This paper shows that a novel software program (Qinematic™) based on the Kinect camera for measuring balance, posture and side-bending has poor psychometric properties, indicating that the variables on balance and posture should not be used for monitoring individual changes over time or in research. Future research on the dynamic tasks of Qinematic™ is warranted.
Sydora, Beate C; Fast, Hilary; Campbell, Sandy; Yuksel, Nese; Lewis, Jacqueline E; Ross, Sue
2016-09-01
The Menopause-Specific Quality of Life (MENQOL) questionnaire was developed as a validated research tool to measure condition-specific QOL in early postmenopausal women. We conducted a comprehensive scoping review to explore the extent of MENQOL's use in research and clinical practice to assess its value in providing effective, adequate, and comparable participant assessment information. Thirteen biomedical and clinical databases were systematically searched with "menqol" as a search term to find articles using MENQOL or its validated derivative MENQOL-Intervention as investigative or clinical tools from 1996 to November 2014 inclusive. Review articles, conference abstracts, proceedings, dissertations, and incomplete trials were excluded. Additional articles were collected from references within key articles. Three independent reviewers extracted data reflecting study design, intervention, sample characteristics, MENQOL questionnaire version, modifications and language, recall period, and analysis detail. Data analyses included categorization and descriptive statistics. The review included 220 eligible papers of various study designs, covering 39 countries worldwide and using MENQOL translated into more than 25 languages. A variety of modifications to the original questionnaire were identified, including omission or addition of items and alterations to the validated methodological analysis. No papers were found that described MENQOL's use in clinical practice. Our study found an extensive and steadily increasing use of MENQOL in clinical and epidemiological research over 18 years postpublication. Our results stress the importance of proper reporting and validation of translations and variations to ensure outcome comparison and transparency of MENQOL's use. The value of MENQOL in clinical practice remains unknown.
Reliability, Validity and Treatment Sensitivity of the Schizophrenia Cognition Rating Scale
Keefe, Richard S.E.; Davis, Vicki G.; Spagnola, Nathan B.; Hilt, Dana; Dgetluck, Nancy; Ruse, Stacy; Patterson, Thomas L.; Narasimhan, Meera; Harvey, Philip D.
2014-01-01
Cognitive functioning can be assessed with performance-based assessments such as neuropsychological tests and with interview-based assessments. Both assessment methods have the potential to assess whether treatments for schizophrenia improve clinically relevant aspects of cognitive impairment. However, little is known about the reliability, validity and treatment responsiveness of interview-based measures, especially in the context of clinical trials. Data from two studies were utilized to assess these features of the Schizophrenia Cognition Rating Scale (SCoRS). One of the studies was a validation study involving 79 patients with schizophrenia assessed at 3 academic research centers in the US. The other study was a 32-site clinical trial conducted in the US and Europe comparing the effects of encenicline, an alpha-7 nicotine agonist, to placebo in 319 patients with schizophrenia. The SCoRS interviewer ratings demonstrated excellent test-retest reliability in several different circumstances, including those that did not involve treatment (ICC> 0.90), and during treatment (ICC>0.80). SCoRS interviewer ratings were related to cognitive performance as measured by the MCCB (r= −0.35), and demonstrated significant sensitivity to treatment with encenicline compared to placebo (P<.001). These data suggest that the SCoRS has potential as a clinically relevant measure in clinical trials aiming to improve cognition in schizophrenia, and may be useful for clinical practice. The weaknesses of the SCoRS include its reliance on informant information, which is not available for some patients, and reduced validity when patient self-report is the sole information source. PMID:25028065
Reliability, validity and treatment sensitivity of the Schizophrenia Cognition Rating Scale.
Keefe, Richard S E; Davis, Vicki G; Spagnola, Nathan B; Hilt, Dana; Dgetluck, Nancy; Ruse, Stacy; Patterson, Thomas D; Narasimhan, Meera; Harvey, Philip D
2015-02-01
Cognitive functioning can be assessed with performance-based assessments such as neuropsychological tests and with interview-based assessments. Both assessment methods have the potential to assess whether treatments for schizophrenia improve clinically relevant aspects of cognitive impairment. However, little is known about the reliability, validity and treatment responsiveness of interview-based measures, especially in the context of clinical trials. Data from two studies were utilized to assess these features of the Schizophrenia Cognition Rating Scale (SCoRS). One of the studies was a validation study involving 79 patients with schizophrenia assessed at 3 academic research centers in the US. The other study was a 32-site clinical trial conducted in the US and Europe comparing the effects of encenicline, an alpha-7 nicotine agonist, to placebo in 319 patients with schizophrenia. The SCoRS interviewer ratings demonstrated excellent test-retest reliability in several different circumstances, including those that did not involve treatment (ICC> 0.90), and during treatment (ICC>0.80). SCoRS interviewer ratings were related to cognitive performance as measured by the MCCB (r=-0.35), and demonstrated significant sensitivity to treatment with encenicline compared to placebo (P<.001). These data suggest that the SCoRS has potential as a clinically relevant measure in clinical trials aiming to improve cognition in schizophrenia, and may be useful for clinical practice. The weaknesses of the SCoRS include its reliance on informant information, which is not available for some patients, and reduced validity when patient's self-report is the sole information source. Copyright © 2014 Elsevier B.V. and ECNP. All rights reserved.
Fleeman, N; McLeod, C; Bagust, A; Beale, S; Boland, A; Dundar, Y; Jorgensen, A; Payne, K; Pirmohamed, M; Pushpakom, S; Walley, T; de Warren-Penny, P; Dickson, R
2010-01-01
To determine whether testing for cytochrome P450 (CYP) polymorphisms in adults entering antipsychotic treatment for schizophrenia leads to improvement in outcomes, is useful in medical, personal or public health decision-making, and is a cost-effective use of health-care resources. The following electronic databases were searched for relevant published literature: Cochrane Controlled Trials Register, Cochrane Database of Systematic Reviews, Database of Abstracts of Reviews of Effectiveness, EMBASE, Health Technology Assessment database, ISI Web of Knowledge, MEDLINE, PsycINFO, NHS Economic Evaluation Database, Health Economic Evaluation Database, Cost-effectiveness Analysis (CEA) Registry and the Centre for Health Economics website. In addition, publicly available information on various genotyping tests was sought from the internet and advisory panel members. A systematic review of analytical validity, clinical validity and clinical utility of CYP testing was undertaken. Data were extracted into structured tables and narratively discussed, and meta-analysis was undertaken when possible. A review of economic evaluations of CYP testing in psychiatry and a review of economic models related to schizophrenia were also carried out. For analytical validity, 46 studies of a range of different genotyping tests for 11 different CYP polymorphisms (most commonly CYP2D6) were included. Sensitivity and specificity were high (99-100%). For clinical validity, 51 studies were found. In patients tested for CYP2D6, an association between genotype and tardive dyskinesia (including Abnormal Involuntary Movement Scale scores) was found. The only other significant finding linked the CYP2D6 genotype to parkinsonism. One small unpublished study met the inclusion criteria for clinical utility. One economic evaluation assessing the costs and benefits of CYP testing for prescribing antidepressants and 28 economic models of schizophrenia were identified; none was suitable for developing a model to examine the cost-effectiveness of CYP testing. Tests for determining genotypes appear to be accurate although not all aspects of analytical validity were reported. Given the absence of convincing evidence from clinical validity studies, the lack of clinical utility and economic studies, and the unsuitability of published schizophrenia models, no model was developed; instead key features and data requirements for economic modelling are presented. Recommendations for future research cover both aspects of research quality and data that will be required to inform the development of future economic models.
Ruff, Jessica; Wang, Tiffany L; Quatman-Yates, Catherine C; Phieffer, Laura S; Quatman, Carmen E
2015-02-01
Commercially available gaming systems (CAGS) such as the Wii Balance Board (WBB) and Microsoft Xbox with Kinect (Xbox Kinect) are increasingly used as balance training and rehabilitation tools. The purpose of this review was to answer the question, "Are commercially available gaming systems valid and reliable instruments for use as clinical diagnostic and functional assessment tools in orthopaedic settings?" and provide a summary of relevant studies, identify their strengths and weaknesses, and generate conclusions regarding general validity/reliability of WBB and Xbox Kinect in orthopaedics. A systematic search was performed using MEDLINE (1996-2013) and Scopus (1996-2013). Inclusion criteria were minimum of 5 subjects, full manuscript provided in English or translated, and studies incorporating investigation of CAG measurement properties. Exclusion criteria included reviews, systematic reviews, summary/clinical commentaries, or case studies; conference proceedings/presentations; cadaveric studies; studies of non-reversible, non-orthopaedic-related musculoskeletal disease; non-human trials; and therapeutic studies not reporting comparative evaluation to already established functional assessment criteria. All studies meeting inclusion and exclusion criteria were appraised for quality by two independent reviewers. Evidence levels (I-V) were assigned to each study based on established methodological criteria. 3 Level II, 7 level III, and 1 Level IV studies met inclusion criteria and provided information related to the use of the WBB and Xbox Kinect as clinical assessment tools in the field of orthopaedics. Studies have used the WBB in a variety of clinical applications, including the measurement of center of pressure (COP), measurement of medial-to-lateral (M/L) or anterior-to-posterior (A/P) symmetry, assessment anatomic landmark positioning, and assessment of fall risk. However, no uniform protocols or outcomes were used to evaluate the quality of the WBB as a clinical assessment tool; therefore a wide range of sensitivities, specificities, accuracies, and validities were reported. Currently it is not possible to make a universal generalization about the clinical utility of CAGS in the field of orthopaedics. However, there is evidence to support using the WBB and the Xbox Kinect as tools to obtain reliable and valid COP measurements. The Wii Fit Game may specifically provide reliable and valid measurements for predicting fall risk. Copyright © 2014 Elsevier Ltd. All rights reserved.
Baldwin, Carol M.; Choi, Myunghan; McClain, Darya Bonds; Celaya, Alma; Quan, Stuart F.
2012-01-01
Study Objectives: To translate, back-translate and cross-language validate (English/Spanish) the Sleep Heart Health Study Sleep Habits Questionnaire for use with Spanish-speakers in clinical and research settings. Methods: Following rigorous translation and back-translation, this cross-sectional cross-language validation study recruited bilingual participants from academic, clinic, and community-based settings (N = 50; 52% women; mean age 38.8 ± 12 years; 90% of Mexican heritage). Participants completed English and Spanish versions of the Sleep Habits Questionnaire, the Epworth Sleepiness Scale, and the Acculturation Rating Scale for Mexican Americans II one week apart in randomized order. Psychometric properties were assessed, including internal consistency, convergent validity, scale equivalence, language version intercorrelations, and exploratory factor analysis using PASW (Version18) software. Grade level readability of the sleep measure was evaluated. Results: All sleep categories (duration, snoring, apnea, insomnia symptoms, other sleep symptoms, sleep disruptors, restless legs syndrome) showed Cronbach α, Spearman-Brown coefficients and intercorrelations ≥ 0.700, suggesting robust internal consistency, correlation, and agreement between language versions. The Epworth correlated significantly with snoring, apnea, sleep symptoms, restless legs, and sleep disruptors) on both versions, supporting convergent validity. Items loaded on 4 factors accounted for 68% and 67% of the variance on the English and Spanish versions, respectively. Conclusions: The Spanish-language Sleep Habits Questionnaire demonstrates conceptual and content equivalency. It has appropriate measurement properties and should be useful for assessing sleep health in community-based clinics and intervention studies among Spanish-speaking Mexican Americans. Both language versions showed readability at the fifth grade level. Further testing is needed with larger samples. Citation: Baldwin CM; Choi M; McClain DB; Celaya A; Quan SF. Spanish translation and cross-language validation of a Sleep Habits Questionnaire for use in clinical and research settings. J Clin Sleep Med 2012;8(2):137-146. PMID:22505858
Validation of the SETOC Instrument--Student Evaluation of Teaching in Outpatient Clinics
ERIC Educational Resources Information Center
Zuberi, Rukhsana W.; Bordage, Georges; Norman, Geoffrey R.
2007-01-01
Purpose: There is a paucity of evaluation forms specifically developed and validated for outpatient settings. The purpose of this study was to develop and validate an instrument specifically for evaluating outpatient teaching, to provide reliable and valid ratings for individual and group feedback to faculty, and to identify outstanding teachers…
Vanderploeg, Rodney D; Cooper, Douglas B; Belanger, Heather G; Donnell, Alison J; Kennedy, Jan E; Hopewell, Clifford A; Scott, Steven G
2014-01-01
To develop and cross-validate internal validity scales for the Neurobehavioral Symptom Inventory (NSI). Four existing data sets were used: (1) outpatient clinical traumatic brain injury (TBI)/neurorehabilitation database from a military site (n = 403), (2) National Department of Veterans Affairs TBI evaluation database (n = 48 175), (3) Florida National Guard nonclinical TBI survey database (n = 3098), and (4) a cross-validation outpatient clinical TBI/neurorehabilitation database combined across 2 military medical centers (n = 206). Secondary analysis of existing cohort data to develop (study 1) and cross-validate (study 2) internal validity scales for the NSI. The NSI, Mild Brain Injury Atypical Symptoms, and Personality Assessment Inventory scores. Study 1: Three NSI validity scales were developed, composed of 5 unusual items (Negative Impression Management [NIM5]), 6 low-frequency items (LOW6), and the combination of 10 nonoverlapping items (Validity-10). Cut scores maximizing sensitivity and specificity on these measures were determined, using a Mild Brain Injury Atypical Symptoms score of 8 or more as the criterion for invalidity. Study 2: The same validity scale cut scores again resulted in the highest classification accuracy and optimal balance between sensitivity and specificity in the cross-validation sample, using a Personality Assessment Inventory Negative Impression Management scale with a T score of 75 or higher as the criterion for invalidity. The NSI is widely used in the Department of Defense and Veterans Affairs as a symptom-severity assessment following TBI, but is subject to symptom overreporting or exaggeration. This study developed embedded NSI validity scales to facilitate the detection of invalid response styles. The NSI Validity-10 scale appears to hold considerable promise for validity assessment when the NSI is used as a population-screening tool.
Larsen, Camilla Marie; Juul-Kristensen, Birgit; Lund, Hans; Søgaard, Karen
2014-10-01
The aims were to compile a schematic overview of clinical scapular assessment methods and critically appraise the methodological quality of the involved studies. A systematic, computer-assisted literature search using Medline, CINAHL, SportDiscus and EMBASE was performed from inception to October 2013. Reference lists in articles were also screened for publications. From 50 articles, 54 method names were identified and categorized into three groups: (1) Static positioning assessment (n = 19); (2) Semi-dynamic (n = 13); and (3) Dynamic functional assessment (n = 22). Fifteen studies were excluded for evaluation due to no/few clinimetric results, leaving 35 studies for evaluation. Graded according to the COnsensus-based Standards for the selection of health Measurement INstruments (COSMIN checklist), the methodological quality in the reliability and validity domains was "fair" (57%) to "poor" (43%), with only one study rated as "good". The reliability domain was most often investigated. Few of the assessment methods in the included studies that had "fair" or "good" measurement property ratings demonstrated acceptable results for both reliability and validity. We found a substantially larger number of clinical scapular assessment methods than previously reported. Using the COSMIN checklist the methodological quality of the included measurement properties in the reliability and validity domains were in general "fair" to "poor". None were examined for all three domains: (1) reliability; (2) validity; and (3) responsiveness. Observational evaluation systems and assessment of scapular upward rotation seem suitably evidence-based for clinical use. Future studies should test and improve the clinimetric properties, and especially diagnostic accuracy and responsiveness, to increase utility for clinical practice.
The, Bertram; Reininga, Inge H F; El Moumni, Mostafa; Eygendaal, Denise
2013-10-01
The modern standard of evaluating treatment results includes the use of rating systems. Elbow-specific rating systems are frequently used in studies aiming at elbow-specific pathology. However, proper validation studies seem to be relatively sparse. In addition, these scoring systems might not always be used for appropriate populations of interest. Both of these issues might give rise to invalid conclusions being reported in the literature. Our aim was to investigate the extent to which the available elbow-specific outcome measurement tools have been validated and the quality of the validation itself. We also aimed to provide characteristics of the populations used for validation of these scales to enable clinicians to use them appropriately. A literature search identified 17 studies of 12 different elbow-specific scoring systems. These were assessed for validity, reliability, and responsiveness characteristics. The quality of these assessments was rated according to the Consensus Based Standards for the Selection of Health Measurement Instruments (COSMIN) checklist criteria, a standardized and validated tool developed specifically for this purpose. Currently, the only elbow-specific rating system that is validated using high-quality methodology is the Oxford Elbow Score, a patient-administered outcome measure tool that has been validated on heterogeneous study populations. Other rating systems still have to be proven in the future to be as good as the Oxford Elbow Score for clinical or research purposes. Additional validation studies are needed. Copyright © 2013 Journal of Shoulder and Elbow Surgery Board of Trustees. Published by Mosby, Inc. All rights reserved.
Sideline coverage: when to get radiographs? A review of clinical decision tools.
Gould, Sara J; Cardone, Dennis A; Munyak, John; Underwood, Philipp J; Gould, Stephen A
2014-05-01
Sidelines coverage presents unique challenges in the evaluation of injured athletes. Health care providers may be confronted with the question of when to obtain radiographs following an injury. Given that most sidelines coverage occurs outside the elite level, radiographs are not readily available at the time of injury, and the decision of when to send a player for radiographs must be made based on physical examination. Clinical tools have been developed to aid in identifying injuries that are likely to result in radiographically important fractures or dislocations. A search for the keywords x-ray and decision rule along with the anatomic locations shoulder, elbow, wrist, knee, and ankle was performed using the PubMed database. No limits were set regarding year of publication. We selected meta-analyses, randomized controlled trials, and survey results. Our selection focused on the largest, most well-studied published reports. We also attempted to include studies that reported the application of the rules to the field of sports medicine. Retrospective literature review. Level 4. The Ottawa Foot and Ankle Rules have been validated and implemented and are appropriate for use in both pediatric and adult populations. The Ottawa Knee Rules have been widely studied, validated, and accepted for evaluation of knee injuries. There are promising studies of decision rules for clinically important fractures of the wrist, but these studies have not been validated. The elbow has been evaluated with good outcomes via the elbow extension test, which has been validated in both single and multicenter studies. Currently, there are no reliable clinical decision tools for traumatic sports injuries to the shoulder to aid in the decision of when to obtain radiographs. Clinical decision tools have been developed to aid in the diagnosis and management of injuries commonly sustained during sporting events. Tools that have been appropriately validated in populations outside the initial study population can assist sports medicine physicians in the decision of when to get radiographs from the sidelines.
Judd, Belinda K; Scanlan, Justin N; Alison, Jennifer A; Waters, Donna; Gordon, Christopher J
2016-08-05
Despite the recent widespread adoption of simulation in clinical education in physiotherapy, there is a lack of validated tools for assessment in this setting. The Assessment of Physiotherapy Practice (APP) is a comprehensive tool used in clinical placement settings in Australia to measure professional competence of physiotherapy students. The aim of the study was to evaluate the validity of the APP for student assessment in simulation settings. A total of 1260 APPs were collected, 971 from students in simulation and 289 from students in clinical placements. Rasch analysis was used to examine the construct validity of the APP tool in three different simulation assessment formats: longitudinal assessment over 1 week of simulation; longitudinal assessment over 2 weeks; and a short-form (25 min) assessment of a single simulation scenario. Comparison with APPs from 5 week clinical placements in hospital and clinic-based settings were also conducted. The APP demonstrated acceptable fit to the expectations of the Rasch model for the 1 and 2 week clinical simulations, exhibiting unidimensional properties that were able to distinguish different levels of student performance. For the short-form simulation, nine of the 20 items recorded greater than 25 % of scores as 'not-assessed' by clinical educators which impacted on the suitability of the APP tool in this simulation format. The APP was a valid assessment tool when used in longitudinal simulation formats. A revised APP may be required for assessment in short-form simulation scenarios.
2017-10-01
mRNA and has been shown in many studies to improve predictive accuracy for cancer on initial biopsy,3,7-9 and to be correlated with more aggressive... Study (PASS). We are in the process of evaluating these three biomarker panels in tissue, blood, and urine samples with well annotated clinical and...during AS. The objective of the study is to utilize analytically validated assays that take into account tumor heterogeneity to measure biomarkers in
Use of patient-reported outcome measures in foot and ankle research.
Hunt, Kenneth J; Hurwit, Daniel
2013-08-21
In the orthopaedic literature, there is a wide range of clinical outcome measurement tools that have been used in evaluating foot and ankle procedures, disorders, and outcomes, with no broadly accepted consensus as to which tools are preferred. The purpose of this study was to determine the frequency and distribution of the various outcome instruments used in the foot and ankle literature, and to identify trends for use of these instruments over time. We conducted a systematic review of all original clinical articles reporting on foot and/or ankle topics in six orthopaedic journals over a ten-year period (2002 to 2011). All clinical patient-reported outcome rating instruments used in these articles were recorded, as were study date, study design, clinical topic, and level of evidence. A total of 878 clinical foot and ankle articles that used at least one patient-reported outcome measure were identified among 16,513 total articles published during the ten-year period. There were 139 unique clinical outcome scales used, and the five most popular scales (as a percentage of foot/ankle outcome articles) were the American Orthopaedic Foot & Ankle Society (AOFAS) scales (55.9%), visual analog scale (VAS) for pain (22.9%), Short Form-36 (SF-36) Health Survey (13.7%), Foot Function Index (FFI) (5.5%), and American Academy of Orthopaedic Surgeons (AAOS) outcomes instruments (3.3%). The majority of articles described Level-IV studies (70.1%); only 9.4% reported Level-I studies. A considerable variety of outcome measurement tools are used in the foot and ankle clinical literature, with a small proportion used consistently. The AOFAS scales continue to be used at a high rate relative to other scales that have been validated. Data from the present study underscore the need for a paradigm shift toward the use of consistent, valid, and reliable outcome measures for studies of foot and ankle procedures and disorders. It is not clear which existing validated outcome instruments will emerge as widely used and clinically meaningful. These data support the need for a paradigm shift toward the consistent use of valid and reliable outcome measures for foot and ankle clinical research.
Development and validation of the Child Oral Health Impact Profile - Preschool version.
Ruff, R R; Sischo, L; Chinn, C H; Broder, H L
2017-09-01
The Child Oral Health Impact Profile (COHIP) is a validated instrument created to measure the oral health-related quality of life of school-aged children. The purpose of this study was to develop and validate a preschool version of the COHIP (COHIP-PS) for children aged 2-5. The COHIP-PS was developed and validated using a multi-stage process consisting of item selection, face validity testing, item impact testing, reliability and validity testing, and factor analysis. A cross-sectional convenience sample of caregivers having children 2-5 years old from four groups completed item clarity and impact forms. Groups were recruited from pediatric health clinics or preschools/daycare centers, speech clinics, dental clinics, or cleft/craniofacial centers. Participants had a variety of oral health-related conditions, including caries, congenital orofacial anomalies, and speech/language deficiencies such as articulation and language disorders. COHIP-PS. The COHIP-PS was found to have acceptable internal validity (a = 0.71) and high test-retest reliability (0.87), though internal validity was below the accepted threshold for the community sample. While discriminant validity results indicated significant differences across study groups, the overall magnitude of differences was modest. Results from confirmatory factor analyses support the use of a four-factor model consisting of 11 items across oral health, functional well-being, social-emotional well-being, and self-image domains. Quality of life is an integral factor in understanding and assessing children's well-being. The COHIP-PS is a validated oral health-related quality of life measure for preschool children with cleft or other oral conditions. Copyright© 2017 Dennis Barber Ltd.
Explanatory Versus Pragmatic Trials: An Essential Concept in Study Design and Interpretation.
Merali, Zamir; Wilson, Jefferson R
2017-11-01
Randomized clinical trials often represent the highest level of clinical evidence available to evaluate the efficacy of an intervention in clinical medicine. Although the process of randomization serves to maximize internal validity, the external validity, or generalizability, of such studies depends on several factors determined at the design phase of the trial including eligibility criteria, study setting, and outcomes of interest. In general, explanatory trials are optimized to demonstrate the efficacy of an intervention in a highly selected patient group; however, findings from these studies may not be generalizable to the larger clinical problem. In contrast, pragmatic trials attempt to understand the real-world benefit of an intervention by incorporating design elements that allow for greater generalizability and clinical applicability of study results. In this article we describe the explanatory-pragmatic continuum for clinical trials in greater detail. Further, a well-accepted tool for grading trials on this continuum is described, and applied, to 2 recently published trials pertaining to the surgical management of lumbar degenerative spondylolisthesis.
Xiao, Lan; Lv, Nan; Rosas, Lisa G; Au, David; Ma, Jun
2017-02-01
To validate clinic weights in electronic health records against researcher-measured weights for outcome assessment in weight loss trials. Clinic and researcher-measured weights from a published trial (BE WELL) were compared using Lin's concordance correlation coefficient, Bland and Altman's limits of agreement, and polynomial regression model. Changes in clinic and researcher-measured weights in BE WELL and another trial, E-LITE, were analyzed using growth curve modeling. Among BE WELL (n = 330) and E-LITE (n = 241) participants, 96% and 90% had clinic weights (mean [SD] of 5.8 [6.1] and 3.7 [3.9] records) over 12 and 15 months of follow-up, respectively. The concordance correlation coefficient was 0.99, and limits of agreement plots showed no pattern between or within treatment groups, suggesting overall good agreement between researcher-measured and nearest-in-time clinic weights up to 3 months. The 95% confidence intervals for predicted percent differences fell within ±3% for clinic weights within 3 months of the researcher-measured weights. Furthermore, the growth curve slopes for clinic and researcher-measured weights by treatment group did not differ significantly, suggesting similar inferences about treatment effects over time, in both trials. Compared with researcher-measured weights, close-in-time clinic weights showed high agreement and inference validity. Clinic weights could be a valid pragmatic outcome measure in weight loss studies. © 2017 The Obesity Society.
Singh, Devita; Deogracias, Joseph J; Johnson, Laurel L; Bradley, Susan J; Kibblewhite, Sarah J; Owen-Anderson, Allison; Peterson-Badali, Michele; Meyer-Bahlburg, Heino F L; Zucker, Kenneth J
2010-01-01
This study aimed to provide further validity evidence for the dimensional measurement of gender identity and gender dysphoria in both adolescents and adults. Adolescents and adults with gender identity disorder (GID) were compared to clinical control (CC) adolescents and adults on the Gender Identity/Gender Dysphoria Questionnaire for Adolescents and Adults (GIDYQ-AA), a 27-item scale originally developed by Deogracias et al. (2007). In Study 1, adolescents with GID (n = 44) were compared to CC adolescents (n = 98); and in Study 2, adults with GID (n = 41) were compared to CC adults (n = 94). In both studies, clients with GID self-reported significantly more gender dysphoria than did the CCs, with excellent sensitivity and specificity rates. In both studies, degree of self-reported gender dysphoria was significantly correlated with recall of cross-gender behavior in childhood-a test of convergent validity. The research and clinical utility of the GIDYQ-AA is discussed, including directions for further research in distinct clinical populations.
Steel, Amie; Adams, Jon
2011-06-01
The approach of evidence-based medicine (EBM), providing a paradigm to validate information sources and a process for critiquing their value, is an important platform for guiding practice. Researchers have explored the application and value of information sources in clinical practice with regard to a range of health professions; however, naturopathic practice has been overlooked. An exploratory study of naturopaths' perspectives of the application and value of information sources has been undertaken. Semi-structured interviews with 12 naturopaths in current clinical practice, concerning the information sources used in clinical practice and their perceptions of these sources. Thematic analysis identified differences in the application of the variety of information sources used, depending upon the perceived validity. Internet databases were viewed as highly valid. Textbooks, formal education and interpersonal interactions were judged based upon a variety of factors, whilst validation of general internet sites and manufacturers information was required prior to use. The findings of this study will provide preliminary aid to those responsible for supporting naturopaths' information use and access. In particular, it may assist publishers, medical librarians and professional associations in developing strategies to expand the clinically useful information sources available to naturopaths. © 2011 The authors. Health Information and Libraries Journal © 2011 Health Libraries Group.
Nursing Outcomes for Patients with Risk of Perioperative Positioning Injury.
de Lima, Luciana Bjorklund; E Cardozo, Michelle Cardoso; Bernardes, Daniela de Souza; Rabelo-Silva, Eneida Rejane
2018-04-16
To select and refine the outcomes and indicators of Nursing Outcomes Classification for the diagnosis of risk for perioperative positioning injury. Validation study on expert consensus and refinement through pilot study. Eight outcomes and 35 indicators were selected in consensus. After clinical testing was performed, in which 10 patients were assessed at five different times. Eight outcomes and 33 indicators remained in the protocol. This study made it possible to select the most relevant outcomes and indicators to be measured for this diagnosis in clinical practice. Validation studies by consensus and clinical testing are important to promote the accuracy, creating opportunities to legitimize, and improve the concepts of taxonomies. © 2018 NANDA International, Inc.
Parvizi, Mohammad Mahdi; Amini, Mitra; Dehghani, Mohammad Reza; Jafari, Peyman; Parvizi, Zahra
2016-01-01
Purpose Evaluation is the main component in design and implementation of educational activities and rapid growth of educational institution programs. Outpatient medical education and clinical training environment is one of the most important parts of training of medical residents. This study aimed to determine the validity and reliability of the Persian version of Ambulatory Care Learning Educational Environment Measure (ACLEEM) questionnaire, as an instrument for assessment of educational environments in residency medical clinics. Materials and methods This study was performed on 180 residents in Shiraz University of Medical Sciences, Shiraz, Iran, in 2014–2015. The questionnaire designers’ electronic permission (by email) and the residents’ verbal consent were obtained before distributing the questionnaires. The study data were gathered using ACLEEM questionnaire developed by Arnoldo Riquelme in 2013. The data were analyzed using the SPSS statistical software, version 14, and MedCalc® software. Then, the construct validity, including convergent and discriminant validities, of the Persian version of ACLEEM questionnaire was assessed. Its internal consistency was also checked by Cronbach’s alpha coefficient. Results Five team members who were experts in medical education were consulted to test the cultural adaptation, linguistic equivalency, and content validity of the Persian version of the questionnaire. Content validity indexes were >0.9 in all items. In factor analysis of the instrument, the Kaiser–Meyer–Olkin index was 0.928 and Barlett’s sphericity test yielded the following results: X2=6,717.551, df =1,225, and P≤0.001. Besides, Cronbach’s alpha coefficient of ACLEEM questionnaire was 0.964. Cronbach’s alpha coefficients were also >0.80 in all the three domains of the questionnaire. Overall, the Persian version of ACLEEM showed excellent convergent validity and acceptable discriminant validity, except for the clinical training domain. Conclusion According to the results, the Persian version of ACLEEM questionnaire was a valid and reliable instrument for Iranian residents to assess specialized clinics and residency ambulatory settings. PMID:27729824
Parvizi, Mohammad Mahdi; Amini, Mitra; Dehghani, Mohammad Reza; Jafari, Peyman; Parvizi, Zahra
2016-01-01
Evaluation is the main component in design and implementation of educational activities and rapid growth of educational institution programs. Outpatient medical education and clinical training environment is one of the most important parts of training of medical residents. This study aimed to determine the validity and reliability of the Persian version of Ambulatory Care Learning Educational Environment Measure (ACLEEM) questionnaire, as an instrument for assessment of educational environments in residency medical clinics. This study was performed on 180 residents in Shiraz University of Medical Sciences, Shiraz, Iran, in 2014-2015. The questionnaire designers' electronic permission (by email) and the residents' verbal consent were obtained before distributing the questionnaires. The study data were gathered using ACLEEM questionnaire developed by Arnoldo Riquelme in 2013. The data were analyzed using the SPSS statistical software, version 14, and MedCalc ® software. Then, the construct validity, including convergent and discriminant validities, of the Persian version of ACLEEM questionnaire was assessed. Its internal consistency was also checked by Cronbach's alpha coefficient. Five team members who were experts in medical education were consulted to test the cultural adaptation, linguistic equivalency, and content validity of the Persian version of the questionnaire. Content validity indexes were >0.9 in all items. In factor analysis of the instrument, the Kaiser-Meyer-Olkin index was 0.928 and Barlett's sphericity test yielded the following results: X 2 =6,717.551, df =1,225, and P ≤0.001. Besides, Cronbach's alpha coefficient of ACLEEM questionnaire was 0.964. Cronbach's alpha coefficients were also >0.80 in all the three domains of the questionnaire. Overall, the Persian version of ACLEEM showed excellent convergent validity and acceptable discriminant validity, except for the clinical training domain. According to the results, the Persian version of ACLEEM questionnaire was a valid and reliable instrument for Iranian residents to assess specialized clinics and residency ambulatory settings.
Longitudinal construct validity of the minimum data set health status index.
Jones, Aaron; Feeny, David; Costa, Andrew P
2018-05-24
The Minimum Data Set Health Status Index (MDS-HSI) is a generic, preference-based health-related quality of life (HRQOL) measure derived by mapping items from the Resident Assessment Instrument - Minimum Data Set (RAI-MDS) assessment onto the Health Utilities Index Mark 2 classification system. While the validity of the MDS-HSI has been examined in cross-sectional settings, the longitudinal validity has not been explored. The objective of this study was to investigate the longitudinal construct validity of the MDS-HSI in a home care population. This study utilized a retrospective cohort of home care patients in the Hamilton-Niagara-Haldimand-Brant health region of Ontario, Canada with at least two RAI-MDS Home Care assessments between January 2010 and December 2014. Convergent validity was assessed by calculating Spearman rank correlations between the change in MDS-HSI and changes in six validated indices of health domains that can be calculated from the RAI-MDS assessment. Known-groups validity was investigated by fitting multivariable linear regression models to estimate the mean change in MDS-HSI associated with clinically important changes in the six health domain indices and 15 disease symptoms from the RAI-MDS Home Care assessment, controlling for age and sex. The cohort contained 25,182 patients with two RAI-MDS Home Care assessments. Spearman correlations between the MDS-HSI change and changes in the health domain indices were all statistically significant and in the hypothesized small to moderate range [0.1 < ρ < 0.5]. Clinically important changes in all of the health domain indices and 13 of the 15 disease symptoms were significantly associated with clinically important changes in the MDS-HSI. The findings of this study support the longitudinal construct validity of the MDS-HSI in home care populations. In addition to evaluating changes in HRQOL among home care patients in clinical research, economic evaluation, and health technology assessment, the MDS-HSI may be used in system-level applications using routinely collected population-level data.
Validity of a computerized population registry of dementia based on clinical databases.
Mar, J; Arrospide, A; Soto-Gordoa, M; Machón, M; Iruin, Á; Martinez-Lage, P; Gabilondo, A; Moreno-Izco, F; Gabilondo, A; Arriola, L
2018-05-08
The handling of information through digital media allows innovative approaches for identifying cases of dementia through computerized searches within the clinical databases that include systems for coding diagnoses. The aim of this study was to analyze the validity of a dementia registry in Gipuzkoa based on the administrative and clinical databases existing in the Basque Health Service. This is a descriptive study based on the evaluation of available data sources. First, through review of medical records, the diagnostic validity was evaluated in 2 samples of cases identified and not identified as dementia. The sensitivity, specificity and positive and negative predictive value of the diagnosis of dementia were measured. Subsequently, the cases of living dementia in December 31, 2016 were searched in the entire Gipuzkoa population to collect sociodemographic and clinical variables. The validation samples included 986 cases and 327 no cases. The calculated sensitivity was 80.2% and the specificity was 99.9%. The negative predictive value was 99.4% and positive value was 95.1%. The cases in Gipuzkoa were 10,551, representing 65% of the cases predicted according to the literature. Antipsychotic medication were taken by a 40% and a 25% of the cases were institutionalized. A registry of dementias based on clinical and administrative databases is valid and feasible. Its main contribution is to show the dimension of dementia in the health system. Copyright © 2018 Sociedad Española de Neurología. Publicado por Elsevier España, S.L.U. All rights reserved.
Clinical utility and validity of minoxidil response testing in androgenetic alopecia.
Goren, Andy; Shapiro, Jerry; Roberts, Janet; McCoy, John; Desai, Nisha; Zarrab, Zoulikha; Pietrzak, Aldona; Lotti, Torello
2015-01-01
Clinical response to 5% topical minoxidil for the treatment of androgenetic alopecia (AGA) is typically observed after 3-6 months. Approximately 40% of patients will regrow hair. Given the prolonged treatment time required to elicit a response, a diagnostic test for ruling out nonresponders would have significant clinical utility. Two studies have previously reported that sulfotransferase enzyme activity in plucked hair follicles predicts a patient's response to topical minoxidil therapy. The aim of this study was to assess the clinical utility and validity of minoxidil response testing. In this communication, the present authors conducted an analysis of completed and ongoing studies of minoxidil response testing. The analysis confirmed the clinical utility of a sulfotransferase enzyme test in successfully ruling out 95.9% of nonresponders to topical minoxidil for the treatment of AGA. © 2014 Wiley Periodicals, Inc.
Adamowska, Sylwia; Sylwia, Adamowska; Adamowski, Tomasz; Tomasz, Adamowski; Frydecka, Dorota; Dorota, Frydecka; Kiejna, Andrzej; Andrzej, Kiejna
2014-10-01
Since over forty years structuralized interviews for clinical and epidemiological research in child and adolescent psychiatry are being developed that should increase validity and reliability of diagnoses according to classification systems (DSM and ICD). The aim of the study is to assess the validity of the Polish version of MINI-KID (Mini International Neuropsychiatric Interview for Children and Adolescents) in comparison to clinical diagnosis made by a specialist in the field of child and adolescent psychiatry. There were 140 patients included in the study (93 boys, 66.4%, mean age 11.8±3.0 and 47 girls 33.5%, mean age 14.0±2.9). All the patients were diagnosed by the specialist in the field of child and adolescent psychiatry according to ICD-10 criteria and by the independent interviewer with the Polish version of MINI-KID (version 2.0, 2001). There was higher agreement between clinical diagnoses and diagnoses based on MINI-KID interview with respect to eating disorders and externalizing disorders (κ 0.43-0.56) and lower in internalizing disorders (κ 0.13-0.45). In the clinical interview, there was smaller number of diagnostic categories (maximum 3 diagnoses per one patient) in comparison to MINI-KID (maximum 10 diagnoses per one patient), and the smaller percentage of patients with one diagnosis (65,7%) in comparison to MINI-KID interview (72%). Our study has shown satisfactory validity parameters of MINI-KID questionnaire, promoting its use for clinical and epidemiological settings. The Mini International Neuropsychiatry Interview for Children and Adolescent (MINI-KID) is the first structuralized diagnostic interview for assessing mental status in children and adolescents, which has been translated into Polish language. Our validation study demonstrated satisfactory psychometric properties of the questionnaire, enabling its use in clinical practice and in research projects. Copyright © 2014 Elsevier Inc. All rights reserved.
Rosenson, Robert S; Miller, Kate; Bayliss, Martha; Sanchez, Robert J; Baccara-Dinet, Marie T; Chibedi-De-Roche, Daniela; Taylor, Beth; Khan, Irfan; Manvelian, Garen; White, Michelle; Jacobson, Terry A
2017-04-01
The Statin-Associated Muscle Symptom Clinical Index (SAMS-CI) is a method for assessing the likelihood that a patient's muscle symptoms (e.g., myalgia or myopathy) were caused or worsened by statin use. The objectives of this study were to prepare the SAMS-CI for clinical use, estimate its inter-rater reliability, and collect feedback from physicians on its practical application. For content validity, we conducted structured in-depth interviews with its original authors as well as with a panel of independent physicians. Estimation of inter-rater reliability involved an analysis of 30 written clinical cases which were scored by a sample of physicians. A separate group of physicians provided feedback on the clinical use of the SAMS-CI and its potential utility in practice. Qualitative interviews with providers supported the content validity of the SAMS-CI. Feedback on the clinical use of the SAMS-CI included several perceived benefits (such as brevity, clear wording, and simple scoring process) and some possible concerns (workflow issues and applicability in primary care). The inter-rater reliability of the SAMS-CI was estimated to be 0.77 (confidence interval 0.66-0.85), indicating high concordance between raters. With additional provider feedback, a revised SAMS-CI instrument was created suitable for further testing, both in the clinical setting and in prospective validation studies. With standardized questions, vetted language, easily interpreted scores, and demonstrated reliability, the SAMS aims to estimate the likelihood that a patient's muscle symptoms were attributable to statins. The SAMS-CI may support better detection of statin-associated muscle symptoms in clinical practice, optimize treatment for patients experiencing muscle symptoms, and provide a useful tool for further clinical research.
Galvin, Rose; Joyce, Doireann; Downey, Eithne; Boland, Fiona; Fahey, Tom; Hill, Arnold K
2014-10-03
The number of primary care referrals of women with breast symptoms to symptomatic breast units (SBUs) has increased exponentially in the past decade in Ireland. The aim of this study is to develop and validate a clinical prediction rule (CPR) to identify women with breast cancer so that a more evidence based approach to referral from primary care to these SBUs can be developed. We analysed routine data from a prospective cohort of consecutive women reviewed at a SBU with breast symptoms. The dataset was split into a derivation and validation cohort. Regression analysis was used to derive a CPR from the patient's history and clinical findings. Validation of the CPR consisted of estimating the number of breast cancers predicted to occur compared with the actual number of observed breast cancers across deciles of risk. A total of 6,590 patients were included in the derivation study and 4.9% were diagnosed with breast cancer. Independent clinical predictors for breast cancer were: increasing age by year (adjusted odds ratio 1.08, 95% CI 1.07-1.09); presence of a lump (5.63, 95% CI 4.2-7.56); nipple change (2.77, 95% CI 1.68-4.58) and nipple discharge (2.09, 95% CI 1.1-3.97). Validation of the rule (n = 911) demonstrated that the probability of breast cancer was higher with an increasing number of these independent variables. The Hosmer-Lemeshow goodness of fit showed no overall significant difference between the expected and the observed numbers of breast cancer (χ(2)HL: 6.74, p-value: 0.56). This study derived and validated a CPR for breast cancer in women attending an Irish national SBU. We found that increasing age, presence of a lump, nipple discharge and nipple change are all associated with increased risk of breast cancer. Further validation of the rule is necessary as well as an assessment of its impact on referral practice.
Designing and validation of a yoga-based intervention for schizophrenia.
Govindaraj, Ramajayam; Varambally, Shivarama; Sharma, Manjunath; Gangadhar, Bangalore Nanjundaiah
2016-06-01
Schizophrenia is a chronic mental illness which causes significant distress and dysfunction. Yoga has been found to be effective as an add-on therapy in schizophrenia. Modules of yoga used in previous studies were based on individual researcher's experience. This study aimed to develop and validate a specific generic yoga-based intervention module for patients with schizophrenia. The study was conducted at NIMHANS Integrated Centre for Yoga (NICY). A yoga module was designed based on traditional and contemporary yoga literature as well as published studies. The yoga module along with three case vignettes of adult patients with schizophrenia was sent to 10 yoga experts for their validation. Experts (n = 10) gave their opinion on the usefulness of a yoga module for patients with schizophrenia with some modifications. In total, 87% (13 of 15 items) of the items in the initial module were retained, with modification in the remainder as suggested by the experts. A specific yoga-based module for schizophrenia was designed and validated by experts. Further studies are needed to confirm efficacy and clinical utility of the module. Additional clinical validation is suggested.
Development of the Clinical Teaching Effectiveness Questionnaire in the United States.
Wormley, Michelle E; Romney, Wendy; Greer, Anna E
2017-01-01
The purpose of this study was to develop a valid measure for assessing clinical teaching effectiveness within the field of physical therapy. The Clinical Teaching Effectiveness Questionnaire (CTEQ) was developed via a 4-stage process, including (1) initial content development, (2) content analysis with 8 clinical instructors with over 5 years of clinical teaching experience, (3) pilot testing with 205 clinical instructors from 2 universities in the Northeast of the United States, and (4) psychometric evaluation, including principal component analysis. The scale development process resulted in a 30-item questionnaire with 4 sections that relate to clinical teaching: learning experiences, learning environment, communication, and evaluation. The CTEQ provides a preliminary valid measure for assessing clinical teaching effectiveness in physical therapy practice.
A semi-automatic method for left ventricle volume estimate: an in vivo validation study
NASA Technical Reports Server (NTRS)
Corsi, C.; Lamberti, C.; Sarti, A.; Saracino, G.; Shiota, T.; Thomas, J. D.
2001-01-01
This study aims to the validation of the left ventricular (LV) volume estimates obtained by processing volumetric data utilizing a segmentation model based on level set technique. The validation has been performed by comparing real-time volumetric echo data (RT3DE) and magnetic resonance (MRI) data. A validation protocol has been defined. The validation protocol was applied to twenty-four estimates (range 61-467 ml) obtained from normal and pathologic subjects, which underwent both RT3DE and MRI. A statistical analysis was performed on each estimate and on clinical parameters as stroke volume (SV) and ejection fraction (EF). Assuming MRI estimates (x) as a reference, an excellent correlation was found with volume measured by utilizing the segmentation procedure (y) (y=0.89x + 13.78, r=0.98). The mean error on SV was 8 ml and the mean error on EF was 2%. This study demonstrated that the segmentation technique is reliably applicable on human hearts in clinical practice.
Validation of a quality-of-life instrument for patients with nonmelanoma skin cancer.
Rhee, John S; Matthews, B Alex; Neuburg, Marcy; Logan, Brent R; Burzynski, Mary; Nattinger, Ann B
2006-01-01
To validate a disease-specific quality-of-life instrument--the Skin Cancer Index--intended to measure quality-of-life issues relevant to patients with nonmelanoma skin cancer. Internal reliability, convergent and divergent validity with existing scales, and factor analyses were performed in a cross-sectional study of 211 patients presenting with cervicofacial nonmelanoma skin cancer to a dermatologic surgery clinic. Factor analyses of the Skin Cancer Index confirmed a multidimensional scale with 3 distinct subscales-emotional, social, and appearance. Excellent internal validity of the 3 subscales was demonstrated. Substantial evidence was observed for convergent validity with the Dermatology Life Quality Index, Rosenberg Self-Esteem Scale, Lerman's Cancer Worry Scale, and Medical Outcomes Survey Short-Form 12 domains for vitality, emotion, social function, and mental health. These findings validate a new disease-specific quality-of-life instrument for patients with cervicofacial nonmelanoma skin cancer. Studies on the responsiveness of the Skin Cancer Index to clinical intervention are currently under way.
Lam, Lucia L.; Ghadessi, Mercedeh; Erho, Nicholas; Vergara, Ismael A.; Alshalalfa, Mohammed; Buerki, Christine; Haddad, Zaid; Sierocinski, Thomas; Triche, Timothy J.; Skinner, Eila C.; Davicioni, Elai; Daneshmand, Siamak; Black, Peter C.
2014-01-01
Background Nearly half of muscle-invasive bladder cancer patients succumb to their disease following cystectomy. Selecting candidates for adjuvant therapy is currently based on clinical parameters with limited predictive power. This study aimed to develop and validate genomic-based signatures that can better identify patients at risk for recurrence than clinical models alone. Methods Transcriptome-wide expression profiles were generated using 1.4 million feature-arrays on archival tumors from 225 patients who underwent radical cystectomy and had muscle-invasive and/or node-positive bladder cancer. Genomic (GC) and clinical (CC) classifiers for predicting recurrence were developed on a discovery set (n = 133). Performances of GC, CC, an independent clinical nomogram (IBCNC), and genomic-clinicopathologic classifiers (G-CC, G-IBCNC) were assessed in the discovery and independent validation (n = 66) sets. GC was further validated on four external datasets (n = 341). Discrimination and prognostic abilities of classifiers were compared using area under receiver-operating characteristic curves (AUCs). All statistical tests were two-sided. Results A 15-feature GC was developed on the discovery set with area under curve (AUC) of 0.77 in the validation set. This was higher than individual clinical variables, IBCNC (AUC = 0.73), and comparable to CC (AUC = 0.78). Performance was improved upon combining GC with clinical nomograms (G-IBCNC, AUC = 0.82; G-CC, AUC = 0.86). G-CC high-risk patients had elevated recurrence probabilities (P < .001), with GC being the best predictor by multivariable analysis (P = .005). Genomic-clinicopathologic classifiers outperformed clinical nomograms by decision curve and reclassification analyses. GC performed the best in validation compared with seven prior signatures. GC markers remained prognostic across four independent datasets. Conclusions The validated genomic-based classifiers outperform clinical models for predicting postcystectomy bladder cancer recurrence. This may be used to better identify patients who need more aggressive management. PMID:25344601
Leveraging biospecimen resources for discovery or validation of markers for early cancer detection.
Schully, Sheri D; Carrick, Danielle M; Mechanic, Leah E; Srivastava, Sudhir; Anderson, Garnet L; Baron, John A; Berg, Christine D; Cullen, Jennifer; Diamandis, Eleftherios P; Doria-Rose, V Paul; Goddard, Katrina A B; Hankinson, Susan E; Kushi, Lawrence H; Larson, Eric B; McShane, Lisa M; Schilsky, Richard L; Shak, Steven; Skates, Steven J; Urban, Nicole; Kramer, Barnett S; Khoury, Muin J; Ransohoff, David F
2015-04-01
Validation of early detection cancer biomarkers has proven to be disappointing when initial promising claims have often not been reproducible in diagnostic samples or did not extend to prediagnostic samples. The previously reported lack of rigorous internal validity (systematic differences between compared groups) and external validity (lack of generalizability beyond compared groups) may be effectively addressed by utilizing blood specimens and data collected within well-conducted cohort studies. Cohort studies with prediagnostic specimens (eg, blood specimens collected prior to development of clinical symptoms) and clinical data have recently been used to assess the validity of some early detection biomarkers. With this background, the Division of Cancer Control and Population Sciences (DCCPS) and the Division of Cancer Prevention (DCP) of the National Cancer Institute (NCI) held a joint workshop in August 2013. The goal was to advance early detection cancer research by considering how the infrastructure of cohort studies that already exist or are being developed might be leveraged to include appropriate blood specimens, including prediagnostic specimens, ideally collected at periodic intervals, along with clinical data about symptom status and cancer diagnosis. Three overarching recommendations emerged from the discussions: 1) facilitate sharing of existing specimens and data, 2) encourage collaboration among scientists developing biomarkers and those conducting observational cohort studies or managing healthcare systems with cohorts followed over time, and 3) conduct pilot projects that identify and address key logistic and feasibility issues regarding how appropriate specimens and clinical data might be collected at reasonable effort and cost within existing or future cohorts. © Published by Oxford University Press 2015.
Leveraging Biospecimen Resources for Discovery or Validation of Markers for Early Cancer Detection
Carrick, Danielle M.; Mechanic, Leah E.; Srivastava, Sudhir; Anderson, Garnet L.; Baron, John A.; Berg, Christine D.; Cullen, Jennifer; Diamandis, Eleftherios P.; Doria-Rose, V. Paul; Goddard, Katrina A. B.; Hankinson, Susan E.; Kushi, Lawrence H.; Larson, Eric B.; McShane, Lisa M.; Schilsky, Richard L.; Shak, Steven; Skates, Steven J.; Urban, Nicole; Kramer, Barnett S.; Khoury, Muin J.; Ransohoff, David F.
2015-01-01
Validation of early detection cancer biomarkers has proven to be disappointing when initial promising claims have often not been reproducible in diagnostic samples or did not extend to prediagnostic samples. The previously reported lack of rigorous internal validity (systematic differences between compared groups) and external validity (lack of generalizability beyond compared groups) may be effectively addressed by utilizing blood specimens and data collected within well-conducted cohort studies. Cohort studies with prediagnostic specimens (eg, blood specimens collected prior to development of clinical symptoms) and clinical data have recently been used to assess the validity of some early detection biomarkers. With this background, the Division of Cancer Control and Population Sciences (DCCPS) and the Division of Cancer Prevention (DCP) of the National Cancer Institute (NCI) held a joint workshop in August 2013. The goal was to advance early detection cancer research by considering how the infrastructure of cohort studies that already exist or are being developed might be leveraged to include appropriate blood specimens, including prediagnostic specimens, ideally collected at periodic intervals, along with clinical data about symptom status and cancer diagnosis. Three overarching recommendations emerged from the discussions: 1) facilitate sharing of existing specimens and data, 2) encourage collaboration among scientists developing biomarkers and those conducting observational cohort studies or managing healthcare systems with cohorts followed over time, and 3) conduct pilot projects that identify and address key logistic and feasibility issues regarding how appropriate specimens and clinical data might be collected at reasonable effort and cost within existing or future cohorts. PMID:25688116
ERIC Educational Resources Information Center
McGill, Ryan J.
2015-01-01
The current study examined the incremental validity of the clinical clusters from the Woodcock-Johnson III Tests of Cognitive Abilities (WJ-III COG) for predicting scores on the Woodcock-Johnson III Tests of Achievement (WJ-III ACH). All participants were children and adolescents (N = 4,722) drawn from the nationally representative WJ-III…
Development and validation of a nursing professionalism evaluation model in a career ladder system.
Kim, Yeon Hee; Jung, Young Sun; Min, Ja; Song, Eun Young; Ok, Jung Hui; Lim, Changwon; Kim, Kyunghee; Kim, Ji-Su
2017-01-01
The clinical ladder system categorizes the degree of nursing professionalism and rewards and is an important human resource tool for managing nursing. We developed a model to evaluate nursing professionalism, which determines the clinical ladder system levels, and verified its validity. Data were collected using a clinical competence tool developed in this study, and existing methods such as the nursing professionalism evaluation tool, peer reviews, and face-to-face interviews to evaluate promotions and verify the presented content in a medical institution. Reliability and convergent and discriminant validity of the clinical competence evaluation tool were verified using SmartPLS software. The validity of the model for evaluating overall nursing professionalism was also analyzed. Clinical competence was determined by five dimensions of nursing practice: scientific, technical, ethical, aesthetic, and existential. The structural model explained 66% of the variance. Clinical competence scales, peer reviews, and face-to-face interviews directly determined nursing professionalism levels. The evaluation system can be used for evaluating nurses' professionalism in actual medical institutions from a nursing practice perspective. A conceptual framework for establishing a human resources management system for nurses and a tool for evaluating nursing professionalism at medical institutions is provided.
de Menezes, Tarciana Nobre; Oliveira, Elaine Cristina Tôrres; de Sousa Fischer, Milena Abreu Tavares
2014-02-01
Self-reported information has been used as an easy and quick method to estimate the prevalence of systemic hypertension in populations. However, verification of whether self-reports of the disease are consistent with clinical diagnosis is essential for proper use of this information. This study aimed to verify the validity and concordance between self-reported and clinical diagnosis of hypertension in the elderly population of a city in northeastern Brazil. This was a cross-sectional and population-based study. The prevalence of diagnosed and self-reported hypertension and the validity and concordance between self-reported and clinical diagnosis and their distribution according to demographic and socioeconomic variables were assessed. The validity of self-reported hypertension was determined by sensitivity, specificity, and positive and negative predictive value. Overall, 795 elderly patients were evaluated (69.1% women). There was a high prevalence of hypertension among the elderly (diagnosed: 75.1%, 95% confidence interval (CI) = 71.1%-77.9%; self-reported: 59.7%, 95% CI = 56.3%-63.1%). For self-reported hypertension, sensitivity was substantial (77.1%), specificity was excellent (93.4%), positive predictive value was excellent (97.3%), and negative predictive value was moderate (57.2%). There was a moderate concordance between self-reported and clinical diagnosis of hypertension (kappa = 0.59; P < 0.001). Reasonable validity and moderate concordance of self-reported information on hypertension was observed, which reinforces the idea that this information can be used as strategy for detecting the disease prevalence in this population. However, because of nonachievement of excellence in the validity and reliability of the measured blood pressure, this information should be carefully considered for the strategic planning of health services.
Dimitrov, Borislav D; Motterlini, Nicola; Fahey, Tom
2015-01-01
Objective Estimating calibration performance of clinical prediction rules (CPRs) in systematic reviews of validation studies is not possible when predicted values are neither published nor accessible or sufficient or no individual participant or patient data are available. Our aims were to describe a simplified approach for outcomes prediction and calibration assessment and evaluate its functionality and validity. Study design and methods: Methodological study of systematic reviews of validation studies of CPRs: a) ABCD2 rule for prediction of 7 day stroke; and b) CRB-65 rule for prediction of 30 day mortality. Predicted outcomes in a sample validation study were computed by CPR distribution patterns (“derivation model”). As confirmation, a logistic regression model (with derivation study coefficients) was applied to CPR-based dummy variables in the validation study. Meta-analysis of validation studies provided pooled estimates of “predicted:observed” risk ratios (RRs), 95% confidence intervals (CIs), and indexes of heterogeneity (I2) on forest plots (fixed and random effects models), with and without adjustment of intercepts. The above approach was also applied to the CRB-65 rule. Results Our simplified method, applied to ABCD2 rule in three risk strata (low, 0–3; intermediate, 4–5; high, 6–7 points), indicated that predictions are identical to those computed by univariate, CPR-based logistic regression model. Discrimination was good (c-statistics =0.61–0.82), however, calibration in some studies was low. In such cases with miscalibration, the under-prediction (RRs =0.73–0.91, 95% CIs 0.41–1.48) could be further corrected by intercept adjustment to account for incidence differences. An improvement of both heterogeneities and P-values (Hosmer-Lemeshow goodness-of-fit test) was observed. Better calibration and improved pooled RRs (0.90–1.06), with narrower 95% CIs (0.57–1.41) were achieved. Conclusion Our results have an immediate clinical implication in situations when predicted outcomes in CPR validation studies are lacking or deficient by describing how such predictions can be obtained by everyone using the derivation study alone, without any need for highly specialized knowledge or sophisticated statistics. PMID:25931829
Validation of the Older Adult Social Evaluative Scale (OASES) as a measure of social anxiety.
Kok, Brian C; Ma, Vanessa K; Gould, Christine E
2018-03-21
Social anxiety disorder (SAD) (formerly called social phobia) is among the most common mental health diagnoses among older adults; however, the research on late-life social anxiety is scarce. A limited number of studies have examined the assessment and diagnosis of social anxiety disorder in this population, and there are few social anxiety measures that are validated for use with older adults. One such measure, the Older Adult Social Evaluative Scale (OASES), was designed for use with this population, but until now has lacked validation against a gold-standard diagnostic interview. Using a sample of 47 community-dwelling older adults (aged 60 years and over) with anxiety, the present study compared OASES performance to that of the Structured Clinical Interview for DSM-5 Disorders (SCID-5), as well as other measures of anxiety and depression. The OASES demonstrated convergent validity with other measures of anxiety, and demonstrated discriminant validity on other measures (e.g. depression, somatic symptoms). Receiver operating characteristic (ROC) analysis revealed that a cut-point of ≥76 optimized sensitivity and specificity compared to SCID-5 derived diagnoses of social anxiety disorder. This study is the first study to provide psychometric validation for the OASES and one of the first to administer the SCID-5 to an older adult sample. In addition to establishing a clinically significant cut-off, this study also describes the clinical utility of the OASES, which can be used to identify distressing situations, track anxiety severity, and monitor behavioral avoidance across a variety of social situations.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Leichner, P.K.
This report summarizes research in beta-particle dosimetry, quantitative single-photon emission computed tomography (SPECT), the clinical implementation of these two areas of research in radioimmunotherapy (RIT), and postgraduate training provided since the inception of this grant on July 15, 1989. To improve beta-particle dosimetry, a point source function was developed that is valid for a wide range of beta emitters. Analytical solutions for beta-particle dose rates within out outside slabs of finite thickness were validated in experimental tumors and are now being used in clinical RIT. Quantitative SPECT based on the circular harmonic transform (CHT) algorithm was validated in phantom, experimental,more » and clinical studies. This has led to improved macrodosimetry in clinical RIT. In dosimetry at the multi-cellular level studies were made of the HepG2 human hepatoblastoma grown subcutaneously in nude mice. Histologic sections and autoradiographs were prepared to quantitate activity distributions of radiolabeled antibodies. Absorbed-dose calculations are being carried out for {sup 131}I and {sup 90}Y beta particles for these antibody distributions.« less
The Anomalous Sentences Repetition Test: Replication and Validation Study.
ERIC Educational Resources Information Center
Weeks, David J.
1986-01-01
Presents a brief clinical test, derived from earlier neuropsychological instruments, with evidence for its reliability, interscorer agreement, and validity. The latter is based upon correlations with both CAT scan measures of cortical atrophy and ventricular enlargement, as well as correlations with seven other previously validated cognitive…
Interpreting SF-12 mental component score: an investigation of its convergent validity with CESD-10.
Yu, Doris S F; Yan, Elsie C W; Chow, Choi Kai
2015-09-01
To examine the convergent validity of Mental Component Scale of the Short-Form 12 (SF-12 MCS) with the Center for Epidemiologic Studies Depression Scale (CESD-10). The CESD-10 is a screening tool for probably clinically significant depression in the Chinese population. Data were obtained from a household survey carried out in Hong Kong. A two-stage stratified sampling method successfully interviewed 1795 adult subjects from 1239 households. Data on SF-12 MCS and the CESD-10 were extracted. Receiver operating characteristics (ROC) analyses were performed to examine the convergent validity of SF-12 MCS against the CESD-10 threshold for probably clinically significant depression for the younger to middle-aged, late middle-aged and older population cohorts. ROC analysis indicated the excellent convergent validity of SF-12 MCS with the CESD-10 threshold for identifying probably clinically significant depression, with the area under curve ranged from 0.81 to 0.85. The optimal cutoff scores for depression among the younger to middle age group, late middle age group and older age group were 48.1, 50.2 and 50.2, respectively, with sensitivities ranged from 77 to 83 % and specificities ranged from 73 to 78 %. Bootstrapping estimates of the mean difference indicated no significant difference in the optimal cutoff scores between these age cohorts. SF-12 is a widely adopted measure to capture the health profile of Chinese population. The study findings indicated the satisfactory performance of the SF-12 MCS in identifying probably clinical depression. Future study is warrant to examine the diagnostic validity of the SF-12 MCS by using gold standard to assess clinical depression.
Román-Cereto, Montserrat; García-Mayor, Silvia; Kaknani-Uttumchandani, Shakira; García-Gámez, Marina; León-Campos, Alvaro; Fernández-Ordóñez, Eloisa; Ruiz-García, Maria Luisa; Martí-García, C; López-Leiva, Inmaculada; Lasater, Kathie; Morales-Asencio, José Miguel
2018-05-01
The clinical judgment and decision-making abilities of nurses can influence many health outcomes, hence the importance of addressing these qualities in university studies. In this respect, clinical simulation is a commonly employed teaching method. The evaluation of simulation activities requires standardised instruments, such as the Lasater Clinical Judgment Rubric, which is widely used for this purpose, although a culturally adapted and validated version in Spain is not available. To obtain a Spanish culturally adapted and validated version of the rubric for undergraduate students of nursing. Cultural adaptation and psychometric validation study carried out with undergraduate nursing students in the simulation laboratories at the University of Málaga (Spain). A process of translation/back-translation and cultural adaptation was carried out in accordance with international standards. The rubric was empirically evaluated in standardised scenarios with high and medium-fidelity simulators. Each student took part in two different simulation sessions, led by two instructors. In each simulation, the data were collected by two independent observers. 152 observations were obtained from 76 students. The interobserver reliability was high, with an intraclass correlation coefficient of 0.93 (95% CI 0.92-0.95) (p = 0.0001) and Cronbach's alpha of 0.93. According to the confirmatory factor analysis, the fit of the model was satisfactory in all indices, with a χ 2 /df value of 1.08, GFI 0.96, TLI 0.99, NFI 0.97 and RMSEA 0.24 (90% CI 0.000-0.066). The rubric obtained is culturally adapted to the Spanish educational context, and is valid and reliable for nursing students. Further prospective studies should be undertaken to evaluate the responsiveness, potential for transfer to clinical practice and cost-benefit ratios of different simulation designs. Copyright © 2018 Elsevier Ltd. All rights reserved.
Oei, Tian P S; Boschen, Mark J
2009-10-01
Previous research has established efficacy of cognitive behavioral therapy (CBT) for anxiety disorders, yet it has not been widely assessed in routine community clinic practices. Efficacy research sacrifices external validity to achieve maximum internal validity. Recently, effectiveness research has been advocated as more ecologically valid for assessing routine clinical work in community clinics. Furthermore, there is a lack of effectiveness research in group CBT. This study aims to extend existing research on the effectiveness of CBT from individual therapy into group therapy delivery. It aimed also to examine outcome using not only symptom measures, but also measures of related symptoms, cognitions, and life quality and satisfaction. Results from a cohort of patients with various anxiety disorders demonstrated that treatment was effective in reducing anxiety symptoms to an extent comparable with other effectiveness studies. Despite this, only 43% of individuals showed reliable change, and 17% were 'recovered' from their anxiety symptoms, and the post-treatment measures were still significantly different from the level of anxiety symptoms observed in the general population.
1984-05-01
Satisfaction Measures Between Clinics.... 39 Lessons Learned From the Pilot Studies ...................... 42 Telephonic Versus Clinic Survey...Between Clinics. 63 Comments from Survey Participants ....................... 64 Lessons Learned from the Study ............................. 67...attempted to apply principles learned from a review of the multitude of studies conducted in the area of patient satisfaction. Validated dimensions of
Psychometric Testing of the Greek Version of the Clinical Learning Environment-Teacher (CLES+T).
Papastavrou, Evridiki; Dimitriadou, Maria; Tsangari, Haritini
2015-09-01
Clinical practice is an important part of nursing education, and robust instruments are required to evaluate the effectiveness of the hospital setting as a learning environment. The study aim is the psychometric test of the Clinical Learning Environment+Teacher (CLES+T) scale-Greek version. 463 students practicing in acute care hospitals participated in the study. The reliability of the instrument was estimated with Cronbach's alpha coefficients. The construct validity was evaluated using exploratory factor analysis (EFA) with Varimax rotation. Convergent validity was examined by measuring the bivariate correlations between the scale/subscales. Content, validity and semantic equivalence were examined through reviews by a panel of experts. The total scale showed high internal consistency (α=0.95). EFA was identical to the original scale, had eigen values larger than one and explained a total of 67.4% of the variance. The factor with the highest eigen value and the largest percentage of variance explained was "supervisory relationship", with an original eigenvalue of 13.1 (6.8 after Varimax rotation) and an explanation of around 38% of the variance (or 20% after rotation). Convergent validity was examined by measuring the bivariate correlations between the scale and a question that measured the general satisfaction. The Greek version of the CLES+T is a valid and reliable instrument that can be used to examine students' perceptions of the clinical learning environment.
The Danish Neuro-Oncology Registry: establishment, completeness and validity.
Hansen, Steinbjørn; Nielsen, Jan; Laursen, René J; Rasmussen, Birthe Krogh; Nørgård, Bente Mertz; Gradel, Kim Oren; Guldberg, Rikke
2016-08-30
The Danish Neuro-Oncology Registry (DNOR) is a nationwide clinical cancer database that has prospectively registered data on patients with gliomas since January 2009. The purpose of this study was to describe the establishment of the DNOR and further to evaluate the database completeness of patient registration and validity of data. The completeness of the number of patients registered in the database was evaluated in the study period from January 2009 through December 2014 by comparing cases reported to the DNOR with the Danish National Patient Registry and the Danish Pathology Registry. The data validity of important clinical variables was evaluated by a random sample of 100 patients from the DNOR using the medical records as reference. A total of 2241 patients were registered in the DNOR by December 2014 with an overall patient completeness of 92 %, which increased during the study period (from 78 % in 2009 to 96 % in 2014). Medical records were available for all patients in the validity analyses. Most variables showed a high agreement proportion (56-100 %), with a fair to good chance-corrected agreement (k = 0.43-1.0). The completeness of patient registration was very high (92 %) and the validity of the most important patient data was good. The DNOR is a newly established national database, which is a reliable source for future scientific studies and clinical quality assessments among patients with gliomas.
Kumar, Y Kiran; Mehta, Shashi Bhushan; Ramachandra, Manjunath
2017-01-01
The purpose of this work is to provide some validation methods for evaluating the hemodynamic assessment of Cerebral Arteriovenous Malformation (CAVM). This article emphasizes the importance of validating noninvasive measurements for CAVM patients, which are designed using lumped models for complex vessel structure. The validation of the hemodynamics assessment is based on invasive clinical measurements and cross-validation techniques with the Philips proprietary validated software's Qflow and 2D Perfursion. The modeling results are validated for 30 CAVM patients for 150 vessel locations. Mean flow, diameter, and pressure were compared between modeling results and with clinical/cross validation measurements, using an independent two-tailed Student t test. Exponential regression analysis was used to assess the relationship between blood flow, vessel diameter, and pressure between them. Univariate analysis is used to assess the relationship between vessel diameter, vessel cross-sectional area, AVM volume, AVM pressure, and AVM flow results were performed with linear or exponential regression. Modeling results were compared with clinical measurements from vessel locations of cerebral regions. Also, the model is cross validated with Philips proprietary validated software's Qflow and 2D Perfursion. Our results shows that modeling results and clinical results are nearly matching with a small deviation. In this article, we have validated our modeling results with clinical measurements. The new approach for cross-validation is proposed by demonstrating the accuracy of our results with a validated product in a clinical environment.
Jessen, Marie K; Skibsted, Simon; Shapiro, Nathan I
2017-06-01
The aim of this study was to validate the association between number of organ dysfunctions and mortality in emergency department (ED) patients with suspected infection. This study was conducted at two medical care center EDs. The internal validation set was a prospective cohort study conducted in Boston, USA. The external validation set was a retrospective case-control study conducted in Aarhus, Denmark. The study included adult patients (>18 years) with clinically suspected infection. Laboratory results and clinical data were used to assess organ dysfunctions. Inhospital mortality was the outcome measure. Multivariate logistic regression was used to determine the independent mortality odds for number and types of organ dysfunctions. We enrolled 4952 (internal) and 483 (external) patients. The mortality rate significantly increased with increasing number of organ dysfunctions: internal validation: 0 organ dysfunctions: 0.5% mortality, 1: 3.6%, 2: 9.5%, 3: 17%, and 4 or more: 37%; external validation: 2.2, 6.7, 17, 41, and 57% mortality (both P<0.001 for trend). Age-adjusted and comorbidity-adjusted number of organ dysfunctions remained an independent predictor. The effect of specific types of organ dysfunction on mortality was most pronounced for hematologic [odds ratio (OR) 3.3 (95% confidence interval (CI) 2.0-5.4)], metabolic [OR 3.3 (95% CI 2.4-4.6); internal validation], and cardiovascular dysfunctions [OR 14 (95% CI 3.7-50); external validation]. The number of organ dysfunctions predicts sepsis mortality.
Mansutti, Irene; Saiani, Luisa; Grassetti, Luca; Palese, Alvisa
2017-03-01
The clinical learning environment is fundamental to nursing education paths, capable of affecting learning processes and outcomes. Several instruments have been developed in nursing education, aimed at evaluating the quality of the clinical learning environments; however, no systematic review of the psychometric properties and methodological quality of these studies has been performed to date. The aims of the study were: 1) to identify validated instruments evaluating the clinical learning environments in nursing education; 2) to evaluate critically the methodological quality of the psychometric property estimation used; and 3) to compare psychometric properties across the instruments available. A systematic review of the literature (using the Preferred Reporting Items for Systematic Reviews and Meta-Analysis guidelines) and an evaluation of the methodological quality of psychometric properties (using the COnsensus-based Standards for the selection of health Measurement INstruments guidelines). The Medline and CINAHL databases were searched. Eligible studies were those that satisfied the following criteria: a) validation studies of instruments evaluating the quality of clinical learning environments; b) in nursing education; c) published in English or Italian; d) before April 2016. The included studies were evaluated for the methodological quality of the psychometric properties measured and then compared in terms of both the psychometric properties and the methodological quality of the processes used. The search strategy yielded a total of 26 studies and eight clinical learning environment evaluation instruments. A variety of psychometric properties have been estimated for each instrument, with differing qualities in the methodology used. Concept and construct validity were poorly assessed in terms of their significance and rarely judged by the target population (nursing students). Some properties were rarely considered (e.g., reliability, measurement error, criterion validity), whereas others were frequently estimated, but using different coefficients and statistical analyses (e.g., internal consistency, structural validity), thus rendering comparison across instruments difficult. Moreover, the methodological quality adopted in the property assessments was poor or fair in most studies, compromising the goodness of the psychometric values estimated. Clinical learning placements represent the key strategies in educating the future nursing workforce: instruments evaluating the quality of the settings, as well as their capacity to promote significant learning, are strongly recommended. Studies estimating psychometric properties, using an increased quality of research methodologies are needed in order to support nursing educators in the process of clinical placements accreditation and quality improvement. Copyright © 2017 Elsevier Ltd. All rights reserved.
Brief reasons for living inventory: a psychometric investigation.
Cwik, Jan Christopher; Siegmann, Paula; Willutzki, Ulrike; Nyhuis, Peter; Wolter, Marcus; Forkmann, Thomas; Glaesmer, Heide; Teismann, Tobias
2017-11-06
The present study aimed at validating the German version of the Brief Reasons for Living inventory (BRFL). Validity and reliability were established in a community (n = 339) and a clinical sample (n = 272). Convergent and discriminant validity were investigated, and confirmatory factor analyses were conducted for the complete BRFL as well as for a 10-item version excluding conditional items on child-related concerns. Furthermore, it was assessed how BRFL scores moderate the association between depression and suicide ideation. Results indicated an adequate fit of the data to the original factor structure. The total scale and the subscales of the German version of the BRFL had sufficient internal consistency, as well as good convergent and divergent validity. The BRFL demonstrated clinical utility by differentiating between participants with vs. without suicide ideation. Reasons for living proved to moderate the association between depression and suicide ideation. Results provide preliminary evidence that the BRFL may be a reliable and valid measure of adaptive reasons for living that can be used in clinic and research settings.
Validation and Diagnostic Efficiency of the Mini-SPIN in Spanish-Speaking Adolescents
Garcia-Lopez, LuisJoaquín; Moore, Harry T. A.
2015-01-01
Objectives Social Anxiety Disorder (SAD) is one of the most common mental disorders in adolescence. Many validated psychometric tools are available to diagnose individuals with SAD efficaciously. However, there is a demand for shortened self-report instruments that identify adolescents at risk of developing SAD. We validate the Mini-SPIN and its diagnostic efficiency in overcoming this problem in Spanish-speaking adolescents in Spain. Methods The psychometric properties of the 3-item Mini-SPIN scale for adolescents were assessed in a community (study 1) and clinical sample (study 2). Results Study 1 consisted of 573 adolescents, and found the Mini-SPIN to have appropriate internal consistency and high construct validity. Study 2 consisted of 354 adolescents (147 participants diagnosed with SAD and 207 healthy controls). Data revealed that the Mini-SPIN has good internal consistency, high construct validity and adequate diagnostic efficiency. Conclusions Our findings suggest that the Mini-SPIN has good psychometric properties on clinical and healthy control adolescents and general population, which indicates that it can be used as a screening tool in Spanish-speaking adolescents. Cut-off scores are provided. PMID:26317695
Validation and Diagnostic Efficiency of the Mini-SPIN in Spanish-Speaking Adolescents.
Garcia-Lopez, LuisJoaquín; Moore, Harry T A
2015-01-01
Social Anxiety Disorder (SAD) is one of the most common mental disorders in adolescence. Many validated psychometric tools are available to diagnose individuals with SAD efficaciously. However, there is a demand for shortened self-report instruments that identify adolescents at risk of developing SAD. We validate the Mini-SPIN and its diagnostic efficiency in overcoming this problem in Spanish-speaking adolescents in Spain. The psychometric properties of the 3-item Mini-SPIN scale for adolescents were assessed in a community (study 1) and clinical sample (study 2). Study 1 consisted of 573 adolescents, and found the Mini-SPIN to have appropriate internal consistency and high construct validity. Study 2 consisted of 354 adolescents (147 participants diagnosed with SAD and 207 healthy controls). Data revealed that the Mini-SPIN has good internal consistency, high construct validity and adequate diagnostic efficiency. Our findings suggest that the Mini-SPIN has good psychometric properties on clinical and healthy control adolescents and general population, which indicates that it can be used as a screening tool in Spanish-speaking adolescents. Cut-off scores are provided.
Granholm, Anders; Perner, Anders; Krag, Mette; Hjortrup, Peter Buhl; Haase, Nicolai; Holst, Lars Broksø; Marker, Søren; Collet, Marie Oxenbøll; Jensen, Aksel Karl Georg; Møller, Morten Hylander
2017-03-09
Mortality prediction scores are widely used in intensive care units (ICUs) and in research, but their predictive value deteriorates as scores age. Existing mortality prediction scores are imprecise and complex, which increases the risk of missing data and decreases the applicability bedside in daily clinical practice. We propose the development and validation of a new, simple and updated clinical prediction rule: the Simplified Mortality Score for use in the Intensive Care Unit (SMS-ICU). During the first phase of the study, we will develop and internally validate a clinical prediction rule that predicts 90-day mortality on ICU admission. The development sample will comprise 4247 adult critically ill patients acutely admitted to the ICU, enrolled in 5 contemporary high-quality ICU studies/trials. The score will be developed using binary logistic regression analysis with backward stepwise elimination of candidate variables, and subsequently be converted into a point-based clinical prediction rule. The general performance, discrimination and calibration of the score will be evaluated, and the score will be internally validated using bootstrapping. During the second phase of the study, the score will be externally validated in a fully independent sample consisting of 3350 patients included in the ongoing Stress Ulcer Prophylaxis in the Intensive Care Unit trial. We will compare the performance of the SMS-ICU to that of existing scores. We will use data from patients enrolled in studies/trials already approved by the relevant ethical committees and this study requires no further permissions. The results will be reported in accordance with the Transparent Reporting of multivariate prediction models for Individual Prognosis Or Diagnosis (TRIPOD) statement, and submitted to a peer-reviewed journal. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://www.bmj.com/company/products-services/rights-and-licensing/.
Light, Gregory A.; Swerdlow, Neal R.; Thomas, Michael L.; Calkins, Monica E.; Green, Michael F.; Greenwood, Tiffany A.; Gur, Raquel E.; Gur, Ruben C.; Lazzeroni, Laura C.; Nuechterlein, Keith H.; Pela, Marlena; Radant, Allen D.; Seidman, Larry J.; Sharp, Richard F.; Siever, Larry J.; Silverman, Jeremy M.; Sprock, Joyce; Stone, William S.; Sugar, Catherine A.; Tsuang, Debby W.; Tsuang, Ming T.; Braff, David L.; Turetsky, Bruce I.
2014-01-01
Mismatch negativity (MMN) and P3a are auditory event-related potential (ERP) components that show robust deficits in schizophrenia (SZ) patients and exhibit qualities of endophenotypes, including substantial heritability, test-retest reliability, and trait-like stability. These measures also fulfill criteria for use as cognition and function-linked biomarkers in outcome studies, but have not yet been validated for use in large-scale multi-site clinical studies. This study tested the feasibility of adding MMN and P3a to the ongoing Consortium on the Genetics of Schizophrenia (COGS) study. The extent to which demographic, clinical, cognitive, and functional characteristics contribute to variability in MMN and P3a amplitudes was also examined. Participants (HCS n=824, SZ n=966) underwent testing at 5 geographically distributed COGS laboratories. Valid ERP data was obtained from 91% of HCS and 91% of SZ patients. Highly significant MMN (d=0.96) and P3a (d=0.93) amplitude reductions were observed in SZ patients, comparable in magnitude to those observed in single-lab studies with no appreciable differences across laboratories. Demographic characteristics accounted for 26% and 18% of the variance in MMN and P3a amplitudes, respectively. Significant relationships were observed among demographically-adjusted MMN and P3a measures and medication status as well as several clinical, cognitive, and functional characteristics of the SZ patients. This study demonstrates that MMN and P3a ERP biomarkers can be feasibly used in multi-site clinical studies. As with many clinical tests of brain function, demographic factors contribute to MMN and P3a amplitudes and should be carefully considered in future biomarker-informed clinical studies. PMID:25449710
Light, Gregory A; Swerdlow, Neal R; Thomas, Michael L; Calkins, Monica E; Green, Michael F; Greenwood, Tiffany A; Gur, Raquel E; Gur, Ruben C; Lazzeroni, Laura C; Nuechterlein, Keith H; Pela, Marlena; Radant, Allen D; Seidman, Larry J; Sharp, Richard F; Siever, Larry J; Silverman, Jeremy M; Sprock, Joyce; Stone, William S; Sugar, Catherine A; Tsuang, Debby W; Tsuang, Ming T; Braff, David L; Turetsky, Bruce I
2015-04-01
Mismatch negativity (MMN) and P3a are auditory event-related potential (ERP) components that show robust deficits in schizophrenia (SZ) patients and exhibit qualities of endophenotypes, including substantial heritability, test-retest reliability, and trait-like stability. These measures also fulfill criteria for use as cognition and function-linked biomarkers in outcome studies, but have not yet been validated for use in large-scale multi-site clinical studies. This study tested the feasibility of adding MMN and P3a to the ongoing Consortium on the Genetics of Schizophrenia (COGS) study. The extent to which demographic, clinical, cognitive, and functional characteristics contribute to variability in MMN and P3a amplitudes was also examined. Participants (HCS n=824, SZ n=966) underwent testing at 5 geographically distributed COGS laboratories. Valid ERP recordings were obtained from 91% of HCS and 91% of SZ patients. Highly significant MMN (d=0.96) and P3a (d=0.93) amplitude reductions were observed in SZ patients, comparable in magnitude to those observed in single-lab studies with no appreciable differences across laboratories. Demographic characteristics accounted for 26% and 18% of the variance in MMN and P3a amplitudes, respectively. Significant relationships were observed among demographically-adjusted MMN and P3a measures and medication status as well as several clinical, cognitive, and functional characteristics of the SZ patients. This study demonstrates that MMN and P3a ERP biomarkers can be feasibly used in multi-site clinical studies. As with many clinical tests of brain function, demographic factors contribute to MMN and P3a amplitudes and should be carefully considered in future biomarker-informed clinical studies. Published by Elsevier B.V.
Erdodi, Laszlo A; Sagar, Sanya; Seke, Kristian; Zuccato, Brandon G; Schwartz, Eben S; Roth, Robert M
2018-06-01
This study was designed to develop performance validity indicators embedded within the Delis-Kaplan Executive Function Systems (D-KEFS) version of the Stroop task. Archival data from a mixed clinical sample of 132 patients (50% male; M Age = 43.4; M Education = 14.1) clinically referred for neuropsychological assessment were analyzed. Criterion measures included the Warrington Recognition Memory Test-Words and 2 composites based on several independent validity indicators. An age-corrected scaled score ≤6 on any of the 4 trials reliably differentiated psychometrically defined credible and noncredible response sets with high specificity (.87-.94) and variable sensitivity (.34-.71). An inverted Stroop effect was less sensitive (.14-.29), but comparably specific (.85-90) to invalid performance. Aggregating the newly developed D-KEFS Stroop validity indicators further improved classification accuracy. Failing the validity cutoffs was unrelated to self-reported depression or anxiety. However, it was associated with elevated somatic symptom report. In addition to processing speed and executive function, the D-KEFS version of the Stroop task can function as a measure of performance validity. A multivariate approach to performance validity assessment is generally superior to univariate models. (PsycINFO Database Record (c) 2018 APA, all rights reserved).
Methodological review of the quality of reach out and read: does it "work"?
Yeager Pelatti, Christina; Pentimonti, Jill M; Justice, Laura M
2014-04-01
A considerable percentage of American children and adults fail to learn adequate literacy skills and read below a third grade level. Shared book reading is perhaps the single most important activity to prepare young children for success in reading. The primary objective of this manuscript was to critically review the methodological quality of Read Out and Read (ROR), a clinically based literacy program/intervention that teaches parents strategies to incorporate while sharing books with children as a method of preventing reading difficulties and academic struggles. A PubMed search was conducted. Articles that met three criteria were considered. First, the study must be clinically based and include parent contact with a pediatrician. Second, parental counseling ("anticipatory guidance") about the importance of parent-child book reading must be included. Third, only experimental or quasi-experimental studies were included; no additional criteria were used. Published articles from any year and peer-reviewed journal were considered. Study quality was determined using a modified version of the Downs and Black (1998) checklist assessing four categories: (1) Reporting, (2) External Validity, (3) Internal Validity-Bias, and (4) Internal Validity-Confounding. We were also interested in whether quality differed based on study design, children's age, sample size, and study outcome. Eleven studies met the inclusion criteria. The overall quality of evidence was variable across all studies; Reporting and External Validity categories were relatively strong while methodological concerns were found in the area of internal validity. Quality scores differed on the four study characteristics. Implications related to clinical practice and future studies are discussed.
Forzley, Brian; Er, Lee; Chiu, Helen Hl; Djurdjev, Ognjenka; Martinusen, Dan; Carson, Rachel C; Hargrove, Gaylene; Levin, Adeera; Karim, Mohamud
2018-02-01
End-stage kidney disease is associated with poor prognosis. Health care professionals must be prepared to address end-of-life issues and identify those at high risk for dying. A 6-month mortality prediction model for patients on dialysis derived in the United States is used but has not been externally validated. We aimed to assess the external validity and clinical utility in an independent cohort in Canada. We examined the performance of the published 6-month mortality prediction model, using discrimination, calibration, and decision curve analyses. Data were derived from a cohort of 374 prevalent dialysis patients in two regions of British Columbia, Canada, which included serum albumin, age, peripheral vascular disease, dementia, and answers to the "the surprise question" ("Would I be surprised if this patient died within the next year?"). The observed mortality in the validation cohort was 11.5% at 6 months. The prediction model had reasonable discrimination (c-stat = 0.70) but poor calibration (calibration-in-the-large = -0.53 (95% confidence interval: -0.88, -0.18); calibration slope = 0.57 (95% confidence interval: 0.31, 0.83)) in our data. Decision curve analysis showed the model only has added value in guiding clinical decision in a small range of threshold probabilities: 8%-20%. Despite reasonable discrimination, the prediction model has poor calibration in this external study cohort; thus, it may have limited clinical utility in settings outside of where it was derived. Decision curve analysis clarifies limitations in clinical utility not apparent by receiver operating characteristic curve analysis. This study highlights the importance of external validation of prediction models prior to routine use in clinical practice.
Moskoei, Sara; Mohtashami, Jamileh; Ghalenoeei, Mahdie; Nasiri, Maliheh; Tafreshi, Mansoreh Zaghari
2017-01-01
Introduction Evaluation of clinical competency in nurses has a distinct importance in healthcare due to its significant impact on improving the quality of patient care and creation of opportunities for professional promotion. This is a psychometric study for development of the “Clinical Competency of Mental Health Nursing”(CCMHN) rating scale. Methods In this methodological research that was conducted in 2015, in Tehran, Iran, the main items were developed after literature review and the validity and reliability of the tool were identified. The face, content (content validity ratio and content validity index) and construct validities were calculated. For face and content validity, experts’ comments were used. Exploratory factor analysis was used to determine the construct validity. The reliability of scale was determined by the internal consistency and inter-rater correlation. The collected data were analyzed by SPSS version 16, using descriptive statistical analysis. Results A scale with 45 items in two parts including Emotional/Moral and Specific Care competencies was developed. Content validity ratio and content validity index were 0.88, 0.97 respectively. Exploratory factor analysis indicated two factors: The first factor with 23.93 eigenvalue and second factor with eigenvalue 2.58. Cronbach’s alpha coefficient for determination of internal consistency was 0.98 and the ICC for confirmation inter-rater correlation was 0.98. Conclusion A scale with 45 items and two areas was developed with appropriate validity and reliability. This scale can be used to assess the clinical competency in nursing students and mental health nurses. PMID:28607650
Heinig, Katja; Miya, Kazuhiro; Kamei, Tomonori; Guerini, Elena; Fraier, Daniela; Yu, Li; Bansal, Surendra; Morcos, Peter N
2016-07-01
Alectinib is a novel anaplastic lymphoma kinase (ALK) inhibitor for treatment of patients with ALK-positive non-small-cell lung cancer who have progressed on or are intolerant to crizotinib. To support clinical development, concentrations of alectinib and metabolite M4 were determined in plasma from patients and healthy subjects. LC-MS/MS methods were developed and validated in two different laboratories: Chugai used separate assays for alectinib and M4 in a pivotal Phase I/II study while Roche established a simultaneous assay for both analytes for another pivotal study and all other studies. Cross-validation assessment revealed a bias between the two bioanalytical laboratories, which was confirmed with the clinical PK data between both pivotal studies using the different bioanalytical methods.
Deutsch, Judith E; Romney, Wendy; Reynolds, Jan; Manal, Tara Jo
2015-10-08
PTNow.org is an evidence-based, on-line portal created by a professional membership association to promote use of evidence in practice and to help decrease unwarranted variation in practice. The site contains synthesis documents designed to promote efficient clinical reasoning. These documents were written and peer-reviewed by teams of content experts and master clinicians. The purpose of this paper is to report on the content and construct validity as well as usability of the site. Physical therapist participants used clinical summaries (available in 3 formats--as a full summary with hyperlinks, "quick takes" with hyperlinks, and a portable two-page version) on the PTNow.org site to answer knowledge acquisition and clinical reasoning questions related to four patient scenarios. They also responded to questions about ease of use related to website navigation and about format and completeness of information using a 1-5 Likert scale. Responses were coded to reflect how participants used the site and then were summarized descriptively. Preferences for clinical summary format were analyzed using an analysis of variance (ANOVA) and a Dunnett T3 post hoc analysis. Seventeen participants completed the study. Clinical relevance and completeness ratings by experienced clinicians, which were used as the measure of content validity, ranged from 3.1 to 4.6 on a 5 point scale. Construct validity based on the information on the PTNow.org site was supported for knowledge acquisition questions 66 % of the time and for clinical reasoning questions 40 % of the time. Usability ratings for the full clinical summary were 4.6 (1.2); for the quick takes, 3.5 (.98); and for the portable clinical summary, 4.0 (.45). Participants preferred the full clinical summary over the other two formats (F = 5.908, P = 0.007). One hundred percent of the participants stated that they would recommend the PTNow site to their colleagues. Prelimary evidence supported both content validity and construct validity of knowledge acquisition, and partially supported construct validity of clinical reasoning for the clinical summaries on the PTNow.org site. Usability was supported, with users preferring the full clinical summary over the other two formats. Iterative design is ongoing.
Federal Register 2010, 2011, 2012, 2013, 2014
2012-05-24
...; Comment Request; Web-Based Assessment of the Clinical Studies Support Center (CSSC) Summary: Under the... current valid OMB control number. Proposed Collection: Title: Web-Based Assessment of the Clinical Studies... Operations and Procedures (MOP); coordinating meeting space and logistics for in-person meetings, Web...
Shedden, Kerby; Taylor, Jeremy M.G.; Enkemann, Steve A.; Tsao, Ming S.; Yeatman, Timothy J.; Gerald, William L.; Eschrich, Steve; Jurisica, Igor; Venkatraman, Seshan E.; Meyerson, Matthew; Kuick, Rork; Dobbin, Kevin K.; Lively, Tracy; Jacobson, James W.; Beer, David G.; Giordano, Thomas J.; Misek, David E.; Chang, Andrew C.; Zhu, Chang Qi; Strumpf, Dan; Hanash, Samir; Shepherd, Francis A.; Ding, Kuyue; Seymour, Lesley; Naoki, Katsuhiko; Pennell, Nathan; Weir, Barbara; Verhaak, Roel; Ladd-Acosta, Christine; Golub, Todd; Gruidl, Mike; Szoke, Janos; Zakowski, Maureen; Rusch, Valerie; Kris, Mark; Viale, Agnes; Motoi, Noriko; Travis, William; Sharma, Anupama
2009-01-01
Although prognostic gene expression signatures for survival in early stage lung cancer have been proposed, for clinical application it is critical to establish their performance across different subject populations and in different laboratories. Here we report a large, training-testing, multi-site blinded validation study to characterize the performance of several prognostic models based on gene expression for 442 lung adenocarcinomas. The hypotheses proposed examined whether microarray measurements of gene expression either alone or combined with basic clinical covariates (stage, age, sex) can be used to predict overall survival in lung cancer subjects. Several models examined produced risk scores that substantially correlated with actual subject outcome. Most methods performed better with clinical data, supporting the combined use of clinical and molecular information when building prognostic models for early stage lung cancer. This study also provides the largest available set of microarray data with extensive pathological and clinical annotation for lung adenocarcinomas. PMID:18641660
The Myotonometer: Not a Valid Measurement Tool for Active Hamstring Musculotendinous Stiffness.
Pamukoff, Derek N; Bell, Sarah E; Ryan, Eric D; Blackburn, J Troy
2016-05-01
Hamstring musculotendinous stiffness (MTS) is associated with lower-extremity injury risk (ie, hamstring strain, anterior cruciate ligament injury) and is commonly assessed using the damped oscillatory technique. However, despite a preponderance of studies that measure MTS reliably in laboratory settings, there are no valid clinical measurement tools. A valid clinical measurement technique is needed to assess MTS and permit identification of individuals at heightened risk of injury and track rehabilitation progress. To determine the validity and reliability of the Myotonometer for measuring active hamstring MTS. Descriptive laboratory study. Laboratory. 33 healthy participants (15 men, age 21.33 ± 2.94 y, height 172.03 ± 16.36 cm, mass 74.21 ± 16.36 kg). Hamstring MTS was assessed using the damped oscillatory technique and the Myotonometer. Intraclass correlations were used to determine the intrasession, intersession, and interrater reliability of the Myotonometer. Criterion validity was assessed via Pearson product-moment correlation between MTS measures obtained from the Myotonometer and from the damped oscillatory technique. The Myotonometer demonstrated good intrasession (ICC3,1 = .807) and interrater reliability (ICC2,k = .830) and moderate intersession reliability (ICC2,k = .693). However, it did not provide a valid measurement of MTS compared with the damped oscillatory technique (r = .346, P = .061). The Myotonometer does not provide a valid measure of active hamstring MTS. Although the Myotonometer does not measure active MTS, it possesses good reliability and portability and could be used clinically to measure tissue compliance, muscle tone, or spasticity associated with multiple musculoskeletal disorders. Future research should focus on portable and clinically applicable tools to measure active hamstring MTS in efforts to prevent and monitor injuries.
van Stiphout, Ruud G P M; Valentini, Vincenzo; Buijsen, Jeroen; Lammering, Guido; Meldolesi, Elisa; van Soest, Johan; Leccisotti, Lucia; Giordano, Alessandro; Gambacorta, Maria A; Dekker, Andre; Lambin, Philippe
2014-11-01
To develop and externally validate a predictive model for pathologic complete response (pCR) for locally advanced rectal cancer (LARC) based on clinical features and early sequential (18)F-FDG PETCT imaging. Prospective data (i.a. THUNDER trial) were used to train (N=112, MAASTRO Clinic) and validate (N=78, Università Cattolica del S. Cuore) the model for pCR (ypT0N0). All patients received long-course chemoradiotherapy (CRT) and surgery. Clinical parameters were age, gender, clinical tumour (cT) stage and clinical nodal (cN) stage. PET parameters were SUVmax, SUVmean, metabolic tumour volume (MTV) and maximal tumour diameter, for which response indices between pre-treatment and intermediate scan were calculated. Using multivariate logistic regression, three probability groups for pCR were defined. The pCR rates were 21.4% (training) and 23.1% (validation). The selected predictive features for pCR were cT-stage, cN-stage, response index of SUVmean and maximal tumour diameter during treatment. The models' performances (AUC) were 0.78 (training) and 0.70 (validation). The high probability group for pCR resulted in 100% correct predictions for training and 67% for validation. The model is available on the website www.predictcancer.org. The developed predictive model for pCR is accurate and externally validated. This model may assist in treatment decisions during CRT to select complete responders for a wait-and-see policy, good responders for extra RT boost and bad responders for additional chemotherapy. Copyright © 2014 The Authors. Published by Elsevier Ireland Ltd.. All rights reserved.
Chevat, Catherine; Viala-Danten, Muriel; Dias-Barbosa, Carla; Nguyen, Van Hung
2009-01-01
Background Influenza is among the most common infectious diseases. The main protection against influenza is vaccination. A self-administered questionnaire was developed and validated for use in clinical trials to assess subjects' perception and acceptance of influenza vaccination and its subsequent injection site reactions (ISR). Methods The VAPI questionnaire was developed based on interviews with vaccinees. The initial version was administered to subjects in international clinical trials comparing intradermal with intramuscular influenza vaccination. Item reduction and scale construction were carried out using principal component and multitrait analyses (n = 549). Psychometric validation of the final version was conducted per country (n = 5,543) and included construct and clinical validity and internal consistency reliability. All subjects gave their written informed consent before being interviewed or included in the clinical studies. Results The final questionnaire comprised 4 dimensions ("bother from ISR"; "arm movement"; "sleep"; "acceptability") grouping 16 items, and 5 individual items (anxiety before vaccination; bother from pain during vaccination; satisfaction with injection system; willingness to be vaccinated next year; anxiety about vaccination next year). Construct validity was confirmed for all scales in most of the countries. Internal consistency reliability was good for all versions (Cronbach's alpha ranging from 0.68 to 0.94), as was clinical validity: scores were positively correlated with the severity of ISR and pain. Conclusion The VAPI questionnaire is a valid and reliable tool, assessing the acceptance of vaccine injection and reactions following vaccination. Trial registration NCT00258934, NCT00383526, NCT00383539. PMID:19261173
Validity of three clinical performance assessments of internal medicine clerks.
Hull, A L; Hodder, S; Berger, B; Ginsberg, D; Lindheim, N; Quan, J; Kleinhenz, M E
1995-06-01
To analyze the construct validity of three methods to assess the clinical performances of internal medicine clerks. A multitrait-multimethod (MTMM) study was conducted at the Case Western Reserve University School of Medicine to determine the convergent and divergent validity of a clinical evaluation form (CEF) completed by faculty and residents, an objective structured clinical examination (OSCE), and the medicine subject test of the National Board of Medical Examiners. Three traits were involved in the analysis: clinical skills, knowledge, and personal characteristics. A correlation matrix was computed for 410 third-year students who completed the clerkship between August 1988 and July 1991. There was a significant (p < .01) convergence of the four correlations that assessed the same traits by using different methods. However, the four convergent correlations were of moderate magnitude (ranging from .29 to .47). Divergent validity was assessed by comparing the magnitudes of the convergence correlations with the magnitudes of correlations among unrelated assessments (i.e., different traits by different methods). Seven of nine possible coefficients were smaller than the convergent coefficients, suggesting evidence of divergent validity. A significant CEF method effect was identified. There was convergent validity and some evidence of divergent validity with a significant method effect. The findings were similar for correlations corrected for attenuation. Four conclusions were reached: (1) the reliability of the OSCE must be improved, (2) the CEF ratings must be redesigned to further discriminate among the specific traits assessed, (3) additional methods to assess personal characteristics must be instituted, and (4) several assessment methods should be used to evaluate individual student performances.
Garcia-Lopez, Luis J; Inglés, Cándido J; García-Fernández, José M; Hidalgo, María D; Bermejo, Rosa; Puklek Levpušček, Melita
2011-01-01
This study examined the reliability and validity evidence drawn from the scores of the Spanish version of the Slovenian-developed Social Anxiety Scale for Adolescents (SASA; Puklek, 1997; Puklek & Vidmar, 2000) using a community sample (Study 1) and a clinical sample (Study 2). Confirmatory factor analysis in Study 1 replicated the 2-factor structure found by the original authors in a sample of Slovenian adolescents. Test-retest reliability was adequate. Furthermore, the SASA correlated significantly with other social anxiety scales, supporting concurrent validity evidence in Spanish adolescents. The results of Study 2 confirmed the correlations between the SASA and other social anxiety measures in a clinical sample. In addition, findings revealed that the SASA can effectively discriminate between adolescents with a clinical diagnosis of social anxiety disorder (SAD) and those without this disorder. Finally, cut-off scores for the SASA are provided for Spanish adolescents.
Evidence-based dentistry: analysis of dental anxiety scales for children.
Al-Namankany, A; de Souza, M; Ashley, P
2012-03-09
To review paediatric dental anxiety measures (DAMs) and assess the statistical methods used for validation and their clinical implications. A search of four computerised databases between 1960 and January 2011 associated with DAMs, using pre-specified search terms, to assess the method of validation including the reliability as intra-observer agreement 'repeatability or stability' and inter-observer agreement 'reproducibility' and all types of validity. Fourteen paediatric DAMs were predominantly validated in schools and not in the clinical setting while five of the DAMs were not validated at all. The DAMs that were validated were done so against other paediatric DAMs which may not have been validated previously. Reliability was not assessed in four of the DAMs. However, all of the validated studies assessed reliability which was usually 'good' or 'acceptable'. None of the current DAMs used a formal sample size technique. Diversity was seen between the studies ranging from a few simple pictograms to lists of questions reported by either the individual or an observer. To date there is no scale that can be considered as a gold standard, and there is a need to further develop an anxiety scale with a cognitive component for children and adolescents.
A psychometric study of the Fear of Sleep Inventory-Short Form (FoSI-SF).
Pruiksma, Kristi E; Taylor, Daniel J; Ruggero, Camilo; Boals, Adriel; Davis, Joanne L; Cranston, Christopher; DeViva, Jason C; Zayfert, Claudia
2014-05-15
Fear of sleep may play a significant role in sleep disturbances in individuals with posttraumatic stress disorder (PTSD). This report describes a psychometric study of the Fear of Sleep Inventory (FoSI), which was developed to measure this construct. The psychometric properties of the FoSI were examined in a non-clinical sample of 292 college students (Study I) and in a clinical sample of 67 trauma-exposed adults experiencing chronic nightmares (Study II). Data on the 23 items of the FoSI were subjected to exploratory factor analyses (EFA) to identify items uniquely assessing fear of sleep. Next, reliability and validity of a 13-item version of the FoSI was examined in both samples. A 13-item Short-Form version (FoSI-SF) was identified as having a clear 2-factor structure with high internal consistency in both the non-clinical (α = 0.76-0.94) and clinical (α = 0.88-0.91) samples. Both studies demonstrated good convergent validity with measures of PTSD (0.48-0.61) and insomnia (0.39-0.48) and discriminant validity with a measure of sleep hygiene (0.19-0.27). The total score on the FoSI-SF was significantly higher in the clinical sample (mean = 17.90, SD = 12.56) than in the non-clinical sample (mean = 4.80, SD = 7.72); t(357) = 8.85 p < 0.001. Although all items are recommended for clinical purposes, the data support the use of the 13-item FoSI-SF for research purposes. Replication of the factor structure in clinical samples is needed. Results are discussed in terms of limitations of this study and directions for further research.
Zhou, Ting; Yang, Kaixiang; Thapa, Sudip; Fu, Qiang; Jiang, Yongsheng; Yu, Shiying
2017-04-01
The assessment of quality of life (QOL) is an important part of cachexia management for cancer patients. Functional assessment of anorexia-cachexia therapy (FAACT), a specific QOL instrument for cachexia patients, has not been validated in Chinese population. The aim of this study was to validate the FAACT scale in Chinese cancer patients for its future use. Eligible cancer patients were included in our study. Patients' demographic and clinical characteristics were collected from the electronic medical records. Patients were asked to complete the Chinese version of FAACT scale and the MD Anderson symptom inventory (MDASI), and then the reliability and validity were analyzed. A total of 285 patients were enrolled in our study, data of 241 patients were evaluated. Coefficients of Cronbach's alpha, test-retest and split-half analyses were all greater than 0.8, which indicated an excellent reliability for FAACT scale. In item-subscale correlation analysis and factor analysis, good construct validity for FAACT scale was found. The correlation between FAACT and MDASI interference subscale showed reasonable criterion-related validity, and for further clinical validation, the FAACT scale showed excellent discriminative validity for distinguishing patients in different cachexia status and in different performance status. The Chinese version of FAACT scale has good reliability and validity and is suitable for measuring QOL of cachexia patients in Chinese population.
Outcome Measures in Spinal Cord Injury
Alexander, Marcalee S.; Anderson, Kim; Biering-Sorensen, Fin; Blight, Andrew R.; Brannon, Ruth; Bryce, Thomas; Creasey, Graham; Catz, Amiram; Curt, Armin; Donovan, William; Ditunno, John; Ellaway, Peter; Finnerup, Nanna B.; Graves, Daniel E.; Haynes, Beth Ann; Heinemann, Allen W.; Jackson, Amie B.; Johnston, Mark; Kalpakjian, Claire Z.; Kleitman, Naomi; Krassioukov, Andrei; Krogh, Klaus; Lammertse, Daniel; Magasi, Susan; Mulcahey, MJ; Schurch, Brigitte; Sherwood, Arthur; Steeves, John D.; Stiens, Steven; Tulsky, David S.; van Hedel, Hubertus J.A.; Whiteneck, Gale
2009-01-01
Study Design review by the Spinal Cord Outcomes Partnership Endeavor (SCOPE), which is a broad-based international consortium of scientists and clinical researchers representing academic institutions, industry, government agencies, not-for-profit organizations and foundations. Objectives assessment of current and evolving tools for evaluating human spinal cord injury (SCI) outcomes for both clinical diagnosis and clinical research studies. Methods a framework for the appraisal of evidence of metric properties was used to examine outcome tools or tests for accuracy, sensitivity, reliability and validity for human SCI. Results imaging, neurological, functional, autonomic, sexual health, bladder/bowel, pain, and psycho-social tools were evaluated. Several specific tools for human SCI studies have or are being developed to allow the more accurate determination for a clinically meaningful benefit (improvement in functional outcome or quality of life) being achieved as a result of a therapeutic intervention. Conclusion significant progress has been made, but further validation studies are required to identify the most appropriate tools for specific targets in a human SCI study or clinical trial. PMID:19381157
ERIC Educational Resources Information Center
Queen, Alexander H.; Ehrenreich-May, Jill; Hershorin, Eugene R.
2012-01-01
This study examines the validity of a brief screening tool for adolescent panic disorder (PD) in a primary care setting. A total of 165 participants (ages 12-17 years) seen in two pediatric primary care clinics completed the Autonomic Nervous System Questionnaire (ANS; Stein et al. in Psychosomatic Med 61:359-364, 40). A subset of those screening…
Methodology Series Module 9: Designing Questionnaires and Clinical Record Forms - Part II.
Setia, Maninder Singh
2017-01-01
This article is a continuation of the previous module on designing questionnaires and clinical record form in which we have discussed some basic points about designing the questionnaire and clinical record forms. In this section, we will discuss the reliability and validity of questionnaires. The different types of validity are face validity, content validity, criterion validity, and construct validity. The different types of reliability are test-retest reliability, inter-rater reliability, and intra-rater reliability. Some of these parameters are assessed by subject area experts. However, statistical tests should be used for evaluation of other parameters. Once the questionnaire has been designed, the researcher should pilot test the questionnaire. The items in the questionnaire should be changed based on the feedback from the pilot study participants and the researcher's experience. After the basic structure of the questionnaire has been finalized, the researcher should assess the validity and reliability of the questionnaire or the scale. If an existing standard questionnaire is translated in the local language, the researcher should assess the reliability and validity of the translated questionnaire, and these values should be presented in the manuscript. The decision to use a self- or interviewer-administered, paper- or computer-based questionnaire depends on the nature of the questions, literacy levels of the target population, and resources.
Methodology Series Module 9: Designing Questionnaires and Clinical Record Forms – Part II
Setia, Maninder Singh
2017-01-01
This article is a continuation of the previous module on designing questionnaires and clinical record form in which we have discussed some basic points about designing the questionnaire and clinical record forms. In this section, we will discuss the reliability and validity of questionnaires. The different types of validity are face validity, content validity, criterion validity, and construct validity. The different types of reliability are test-retest reliability, inter-rater reliability, and intra-rater reliability. Some of these parameters are assessed by subject area experts. However, statistical tests should be used for evaluation of other parameters. Once the questionnaire has been designed, the researcher should pilot test the questionnaire. The items in the questionnaire should be changed based on the feedback from the pilot study participants and the researcher's experience. After the basic structure of the questionnaire has been finalized, the researcher should assess the validity and reliability of the questionnaire or the scale. If an existing standard questionnaire is translated in the local language, the researcher should assess the reliability and validity of the translated questionnaire, and these values should be presented in the manuscript. The decision to use a self- or interviewer-administered, paper- or computer-based questionnaire depends on the nature of the questions, literacy levels of the target population, and resources. PMID:28584367
[Selection of risk and diagnosis in diabetic polyneuropathy. Validation of method of new systems].
Jurado, Jerónimo; Caula, Jacinto; Pou i Torelló, Josep Maria
2006-06-30
In a previous study we developed a specific algorithm, the polyneuropathy selection method (PSM) with 4 parameters (age, HDL-C, HbA1c, and retinopathy), to select patients at risk of diabetic polyneuropathy (DPN). We also developed a simplified method for DPN diagnosis: outpatient polyneuropathy diagnosis (OPD), with 4 variables (symptoms and 3 objective tests). To confirm the validity of conventional tests for DPN diagnosis; to validate the discriminatory power of the PSM and the diagnostic value of OPD by evaluating their relationship to electrodiagnosis studies and objective clinical neurological assessment; and to evaluate the correlation of DPN and pro-inflammatory status. Cross-sectional, crossed association for PSM validation. Paired samples for OPD validation. Primary care in 3 counties. Random sample of 75 subjects from the type-2 diabetes census for PSM evaluation. Thirty DPN patients and 30 non-DPN patients (from 2 DM2 sub-groups in our earlier study) for OPD evaluation. The gold standard for DPN diagnosis will be studied by means of a clinical neurological study (symptoms, physical examination, and sensitivity tests) and electrodiagnosis studies (sensitivity and motor EMG). Risks of neuropathy, macroangiopathy and pro-inflammatory status (PCR, TNF soluble fraction and total TGF-beta1) will be studied in every subject. Electrodiagnosis studies should confirm the validity of conventional tests for DPN diagnosis. PSM and OPD will be valid methods for selecting patients at risk and diagnosing DPN. There will be a significant relationship between DPN and pro-inflammatory tests.
Lemeunier, Nadège; da Silva-Oolup, S; Chow, N; Southerst, D; Carroll, L; Wong, J J; Shearer, H; Mastragostino, P; Cox, J; Côté, E; Murnaghan, K; Sutton, D; Côté, P
2017-09-01
To determine the reliability and validity of clinical tests to assess the anatomical integrity of the cervical spine in adults with neck pain and its associated disorders. We updated the systematic review of the 2000-2010 Bone and Joint Decade Task Force on Neck Pain and its Associated Disorders. We also searched the literature to identify studies on the reliability and validity of Doppler velocimetry for the evaluation of cervical arteries. Two independent reviewers screened and critically appraised studies. We conducted a best evidence synthesis of low risk of bias studies and ranked the phases of investigations using the classification proposed by Sackett and Haynes. We screened 9022 articles and critically appraised 8 studies; all 8 studies had low risk of bias (three reliability and five validity Phase II-III studies). Preliminary evidence suggests that the extension-rotation test may be reliable and has adequate validity to rule out pain arising from facet joints. The evidence suggests variable reliability and preliminary validity for the evaluation of cervical radiculopathy including neurological examination (manual motor testing, dermatomal sensory testing, deep tendon reflexes, and pathological reflex testing), Spurling's and the upper limb neurodynamic tests. No evidence was found for doppler velocimetry. Little evidence exists to support the use of clinical tests to evaluate the anatomical integrity of the cervical spine in adults with neck pain and its associated disorders. We found preliminary evidence to support the use of the extension-rotation test, neurological examination, Spurling's and the upper limb neurodynamic tests.
Torlakovic, Emina E; Cheung, Carol C; D'Arrigo, Corrado; Dietel, Manfred; Francis, Glenn D; Gilks, C Blake; Hall, Jacqueline A; Hornick, Jason L; Ibrahim, Merdol; Marchetti, Antonio; Miller, Keith; van Krieken, J Han; Nielsen, Soren; Swanson, Paul E; Vyberg, Mogens; Zhou, Xiaoge; Taylor, Clive R
2017-03-01
Validation of immunohistochemistry (IHC) assays is a subject that is of great importance to clinical practice as well as basic research and clinical trials. When applied to clinical practice and focused on patient safety, validation of IHC assays creates objective evidence that IHC assays used for patient care are "fit-for-purpose." Validation of IHC assays needs to be properly informed by and modeled to assess the purpose of the IHC assay, which will further determine what sphere of validation is required, as well as the scope, type, and tier of technical validation. These concepts will be defined in this review, part 3 of the 4-part series "Evolution of Quality Assurance for Clinical Immunohistochemistry in the Era of Precision Medicine."
Mueller, Gerhard; Mylonas, Demetrius; Schumacher, Petra
2018-07-01
Within nursing education, the clinical learning environment is of a high importance in regards to the development of competencies and abilities. The organization, atmosphere, and supervision in the clinical learning environment are only a few factors that influence this development. In Austria there is currently no valid instrument available for the evaluation of influencing factors. The aim of the study was to test the construct validity with principal component analysis as well as the internal consistency of the German Clinical Learning Environment, Supervision and Teacher Scale (CLES+T scale) in Austria. The present validation study has a descriptive-quantitative cross-sectional design. The sample consisted of 385 nursing students from thirteen training institutions in Austria. The data collection was carried out online between March and April 2016. Starting with a polychoric correlation matrix, a parallel analysis with principal component extraction and promax rotation was carried out due to the ordinal data. The exploratory ordinal factor analysis supported a four-component solution and explained 73% of the total variance. The internal consistency of all 25 items reached a Cronbach's α of 0.95 and the four components ranged between 0.83 and 0.95. The German version of the CLES+T scale seems to be a useful instrument for identifying potential areas of improvement in clinical practice in order to derive specific quality measures for the practical learning environment. Copyright © 2018 Elsevier Ltd. All rights reserved.
[The use of systematic review to develop a self-management program for CKD].
Lee, Yu-Chin; Wu, Shu-Fang Vivienne; Lee, Mei-Chen; Chen, Fu-An; Yao, Yen-Hong; Wang, Chin-Ling
2014-12-01
Chronic kidney disease (CKD) has become a public health issue of international concern due to its high prevalence. The concept of self-management has been comprehensively applied in education programs that address chronic diseases. In recent years, many studies have used self-management programs in CKD interventions and have investigated the pre- and post-intervention physiological and psychological effectiveness of this approach. However, a complete clinical application program in the self-management model has yet to be developed for use in clinical renal care settings. A systematic review is used to develop a self-management program for CKD. Three implementation steps were used in this study. These steps include: (1) A systematic literature search and review using databases including CEPS (Chinese Electronic Periodical Services) of Airiti, National Digital Library of Theses and Dissertations in Taiwan, CINAHL, Pubmed, Medline, Cochrane Library, and Joanna Briggs Institute. A total of 22 studies were identified as valid and submitted to rigorous analysis. Of these, 4 were systematic literature reviews, 10 were randomized experimental studies, and 8 were non-randomized experimental studies. (2) Empirical evidence then was used to draft relevant guidelines on clinical application. (3) Finally, expert panels tested the validity of the draft to ensure the final version was valid for application in practice. This study designed a self-management program for CKD based on the findings of empirical studies. The content of this program included: design principles, categories, elements, and the intervention measures used in the self-management program. This program and then was assessed using the content validity index (CVI) and a four-point Liker's scale. The content validity score was .98. The guideline of self-management program to CKD was thus developed. This study developed a self-management program applicable to local care of CKD. It is hoped that the guidelines developed in this study offer a reference for clinical caregivers to improve their healthcare practices.
Psychometric validation of the Italian Rehabilitation Complexity Scale-Extended version 13
Agosti, Maurizio; Merlo, Andrea; Maini, Maurizio; Lombardi, Francesco; Tedeschi, Claudio; Benedetti, Maria Grazia; Basaglia, Nino; Contini, Mara; Nicolotti, Domenico; Brianti, Rodolfo
2017-01-01
In Italy, at present, a well-known problem is inhomogeneous provision of rehabilitative services, as stressed by MoH, requiring appropriate criteria and parameters to plan rehabilitation actions. According to the Italian National Rehabilitation Plan, Comorbidity, Disability and Clinical Complexity should be assessed to define the patient’s real needs. However, to date, clinical complexity is still difficult to measure with shared and validated tools. The study aims to psychometrically validate the Italian Rehabilitation Complexity Scale-Extended v13 (RCS-E v13), in order to meet the guidelines requirements. An observational multicentre prospective cohort study, involving 8 intensive rehabilitation facilities of the Emilia-Romagna Region and 1712 in-patients, [823 male (48%) and 889 female (52%), mean age 68.34 years (95% CI 67.69–69.00 years)] showing neurological, orthopaedic and cardiological problems, was carried out. The construct and concurrent validity of the RCS-E v13 was confirmed through its correlation to Barthel Index (disability) and Cumulative Illness Rating Scale (comorbidity) and appropriate admission criteria (not yet published), respectively. Furthermore, the factor analysis indicated two different components (“Basic Care or Risk—Equipment” and “Medical—Nursing Needs and Therapy Disciplines”) of the RCS-E v13. In conclusion, the Italian RCS-E v13 appears to be a useful tool to assess clinical complexity in the Italian rehab scenario case-mix and its psychometric validation may have an important clinical rehabilitation impact allowing the assessment of the rehabilitation needs considering all three dimensions (disability, comorbidity and clinical complexity) as required by the Guidelines and the inhomogeneity could be reduced. PMID:29045409
2011-01-01
Background Computerized Clinical Records, which are incorporated in primary health care practice, have great potential for research. In order to use this information, data quality and reliability must be assessed to prevent compromising the validity of the results. The aim of this study is to validate the diagnosis of hypertension and diabetes mellitus in the computerized clinical records of primary health care, taking the diagnosis criteria established in the most prominently used clinical guidelines as the gold standard against which what measure the sensitivity, specificity, and determine the predictive values. The gold standard for diabetes mellitus was the diagnostic criteria established in 2003 American Diabetes Association Consensus Statement for diabetic subjects. The gold standard for hypertension was the diagnostic criteria established in the Joint National Committee published in 2003. Methods A cross-sectional multicentre validation study of diabetes mellitus and hypertension diagnoses in computerized clinical records of primary health care was carried out. Diagnostic criteria from the most prominently clinical practice guidelines were considered for standard reference. Sensitivity, specificity, positive and negative predictive values, and global agreement (with kappa index), were calculated. Results were shown overall and stratified by sex and age groups. Results The agreement for diabetes mellitus with the reference standard as determined by the guideline was almost perfect (κ = 0.990), with a sensitivity of 99.53%, a specificity of 99.49%, a positive predictive value of 91.23% and a negative predictive value of 99.98%. Hypertension diagnosis showed substantial agreement with the reference standard as determined by the guideline (κ = 0.778), the sensitivity was 85.22%, the specificity 96.95%, the positive predictive value 85.24%, and the negative predictive value was 96.95%. Sensitivity results were worse in patients who also had diabetes and in those aged 70 years or over. Conclusions Our results substantiate the validity of using diagnoses of diabetes and hypertension found within the computerized clinical records for epidemiologic studies. PMID:22035202
Validating archetypes for the Multiple Sclerosis Functional Composite.
Braun, Michael; Brandt, Alexander Ulrich; Schulz, Stefan; Boeker, Martin
2014-08-03
Numerous information models for electronic health records, such as openEHR archetypes are available. The quality of such clinical models is important to guarantee standardised semantics and to facilitate their interoperability. However, validation aspects are not regarded sufficiently yet. The objective of this report is to investigate the feasibility of archetype development and its community-based validation process, presuming that this review process is a practical way to ensure high-quality information models amending the formal reference model definitions. A standard archetype development approach was applied on a case set of three clinical tests for multiple sclerosis assessment: After an analysis of the tests, the obtained data elements were organised and structured. The appropriate archetype class was selected and the data elements were implemented in an iterative refinement process. Clinical and information modelling experts validated the models in a structured review process. Four new archetypes were developed and publicly deployed in the openEHR Clinical Knowledge Manager, an online platform provided by the openEHR Foundation. Afterwards, these four archetypes were validated by domain experts in a team review. The review was a formalised process, organised in the Clinical Knowledge Manager. Both, development and review process turned out to be time-consuming tasks, mostly due to difficult selection processes between alternative modelling approaches. The archetype review was a straightforward team process with the goal to validate archetypes pragmatically. The quality of medical information models is crucial to guarantee standardised semantic representation in order to improve interoperability. The validation process is a practical way to better harmonise models that diverge due to necessary flexibility left open by the underlying formal reference model definitions.This case study provides evidence that both community- and tool-enabled review processes, structured in the Clinical Knowledge Manager, ensure archetype quality. It offers a pragmatic but feasible way to reduce variation in the representation of clinical information models towards a more unified and interoperable model.
Validating archetypes for the Multiple Sclerosis Functional Composite
2014-01-01
Background Numerous information models for electronic health records, such as openEHR archetypes are available. The quality of such clinical models is important to guarantee standardised semantics and to facilitate their interoperability. However, validation aspects are not regarded sufficiently yet. The objective of this report is to investigate the feasibility of archetype development and its community-based validation process, presuming that this review process is a practical way to ensure high-quality information models amending the formal reference model definitions. Methods A standard archetype development approach was applied on a case set of three clinical tests for multiple sclerosis assessment: After an analysis of the tests, the obtained data elements were organised and structured. The appropriate archetype class was selected and the data elements were implemented in an iterative refinement process. Clinical and information modelling experts validated the models in a structured review process. Results Four new archetypes were developed and publicly deployed in the openEHR Clinical Knowledge Manager, an online platform provided by the openEHR Foundation. Afterwards, these four archetypes were validated by domain experts in a team review. The review was a formalised process, organised in the Clinical Knowledge Manager. Both, development and review process turned out to be time-consuming tasks, mostly due to difficult selection processes between alternative modelling approaches. The archetype review was a straightforward team process with the goal to validate archetypes pragmatically. Conclusions The quality of medical information models is crucial to guarantee standardised semantic representation in order to improve interoperability. The validation process is a practical way to better harmonise models that diverge due to necessary flexibility left open by the underlying formal reference model definitions. This case study provides evidence that both community- and tool-enabled review processes, structured in the Clinical Knowledge Manager, ensure archetype quality. It offers a pragmatic but feasible way to reduce variation in the representation of clinical information models towards a more unified and interoperable model. PMID:25087081
The use of the FACT-H&N (v4) in clinical settings within a developing country: a mixed method study.
Bilal, Sobia; Doss, Jennifer Geraldine; Rogers, Simon N
2014-12-01
In the last decade there has been an increasing awareness about 'quality of life' (QOL) of cancer survivors in developing countries. The study aimed to cross-culturally adapt and validate the FACT-H&N (v4) in Urdu language for Pakistani head and neck cancer patients. In this study the 'same language adaptation method' was used. Cognitive debriefing through in-depth interviews of 25 patients to assess semantic, operational and conceptual equivalence was done. The validation phase included 50 patients to evaluate the psychometric properties. The translated FACT-H&N was easily comprehended (100%). Cronbach's alpha for FACT-G subscales ranged from 0.726 - 0.969. The head and neck subscale and Pakistani questions subscale showed low internal consistency (0.426 and 0.541 respectively). Instrument demonstrated known-group validity in differentiating patients of different clinical stages, treatment status and tumor sites (p < 0.05). Most FACT summary scales correlated strongly with each other (r > 0.75) and showed convergent validity (r > 0.90), with little discriminant validity. Factor analysis revealed 6 factors explaining 85.1% of the total variance with very good (>0.8) Kaiser-Meyer-Olkin and highly significant Bartlett's Test of Sphericity (p < 0.001). The cross-culturally adapted FACT-H&N into Urdu language showed adequate reliability and validity to be incorporated in Pakistani clinical settings for head and neck cancer patients. Copyright © 2014 European Association for Cranio-Maxillo-Facial Surgery. Published by Elsevier Ltd. All rights reserved.
Pu, Xia; Ye, Yuanqing; Wu, Xifeng
2014-01-01
Despite the advances made in cancer management over the past few decades, improvements in cancer diagnosis and prognosis are still poor, highlighting the need for individualized strategies. Toward this goal, risk prediction models and molecular diagnostic tools have been developed, tailoring each step of risk assessment from diagnosis to treatment and clinical outcomes based on the individual's clinical, epidemiological, and molecular profiles. These approaches hold increasing promise for delivering a new paradigm to maximize the efficiency of cancer surveillance and efficacy of treatment. However, they require stringent study design, methodology development, comprehensive assessment of biomarkers and risk factors, and extensive validation to ensure their overall usefulness for clinical translation. In the current study, the authors conducted a systematic review using breast cancer as an example and provide general guidelines for risk prediction models and molecular diagnostic tools, including development, assessment, and validation. © 2013 American Cancer Society.
Validity and reliability of the iPhone to measure rib hump in scoliosis.
Balg, Frederic; Juteau, Mathieu; Theoret, Chantal; Svotelis, Amy; Grenier, Guillaume
2014-12-01
This was a prospective blinded validity and reliability analysis. The aim of this study was validation and reliability evaluation of the Scoligauge iPhone app. The scoliometer is used to clinically measure the rib hump in scoliosis as a means to evaluate the axial trunk rotation. The increasing availability of smartphone with built-in accelerometer led to the development of a vast number of applications to measure angles. Of these, the Scoligauge mimics a scoliometer. The aim of this study was to compare the validity of the Scoligauge iPhone application without an associated adapter with the traditional scoliometer and to test the reliability of the application in a clinical setting. Two observers measured the rib hump deformity on 34 consecutive patients with idiopathic scoliosis with an average Cobb angle of 24.2 ± 13.5 degrees (range, 4 to 65 degrees). Measurements were made with an iPhone without the adapter and with a scoliometer. The validity as well as the interobserver and intraobserver reliability were calculated using the intraclass coefficient (ICC) and the Bland-Altman test. The mean difference between the scoliometer and the Scoligauge application was 0.4 degrees [95% confidence interval (CI) of ± 3.1 degrees] with an ICC of 0.947 (P < 0.001). The intraobserver and interobserver ICC were 0.961 (P < 0.001) and 0.901 (P < 0.001), respectively. The mean intraobserver difference was 0.0 degrees (95% CI of ± 2.7 degrees) and the mean interobserver difference was 0.1 degrees (95% CI of ± 4.4 degrees). The intraobserver and interobserver reliability of the Scoligauge iPhone app, as well as its validity compared with the scoliometer, are excellent. The mean differences between measurements are small and clinically not significant. Thus, the Scoligauge application is valid for clinical evaluation even without special adapter. Level I (Diagnostic Study).
ERIC Educational Resources Information Center
Lin, Keh-chung; Chen, Hui-fang; Chen, Chia-ling; Wang, Tien-ni; Wu, Ching-yi; Hsieh, Yu-wei; Wu, Li-ling
2012-01-01
This study examined criterion-related validity and clinimetric properties of the Pediatric Motor Activity Log (PMAL) in children with cerebral palsy. Study participants were 41 children (age range: 28-113 months) and their parents. Criterion-related validity was evaluated by the associations between the PMAL and criterion measures at baseline and…
Plochg, Thomas; Arah, Onyebuchi A; Botje, Daan; Thompson, Caroline A; Klazinga, Niek S; Wagner, Cordula; Mannion, Russell; Lombarts, Kiki
2014-04-01
Clinical management is hypothesized to be critical for hospital management and hospital performance. The aims of this study were to develop and validate professional involvement scales for measuring the level of clinical management by physicians and nurses in European hospitals. Testing of validity and reliability of scales derived from a questionnaire of 21 items was developed on the basis of a previous study and expert opinion and administered in a cross-sectional seven-country research project 'Deepening our Understanding of Quality improvement in Europe' (DUQuE). A sample of 3386 leading physicians and nurses working in 188 hospitals located in Czech Republic, France, Germany, Poland, Portugal, Spain and Turkey. Validity and reliability of professional involvement scales and subscales. Psychometric analysis yielded four subscales for leading physicians: (i) Administration and budgeting, (ii) Managing medical practice, (iii) Strategic management and (iv) Managing nursing practice. Only the first three factors applied well to the nurses. Cronbach's alpha for internal consistency ranged from 0.74 to 0.86 for the physicians, and from 0.61 to 0.81 for the nurses. Except for the 0.74 correlation between 'Administration and budgeting' and 'Managing medical practice' among physicians, all inter-scale correlations were <0.70 (range 0.43-0.61). Under testing for construct validity, the subscales were positively correlated with 'formal management roles' of physicians and nurses. The professional involvement scales appear to yield reliable and valid data in European hospital settings, but the scale 'Managing medical practice' for nurses needs further exploration. The measurement instrument can be used for international research on clinical management.
The reliability and validity of ultrasound to quantify muscles in older adults: a systematic review
Scafoglieri, Aldo; Jager‐Wittenaar, Harriët; Hobbelen, Johannes S.M.; van der Schans, Cees P.
2017-01-01
Abstract This review evaluates the reliability and validity of ultrasound to quantify muscles in older adults. The databases PubMed, Cochrane, and Cumulative Index to Nursing and Allied Health Literature were systematically searched for studies. In 17 studies, the reliability (n = 13) and validity (n = 8) of ultrasound to quantify muscles in community‐dwelling older adults (≥60 years) or a clinical population were evaluated. Four out of 13 reliability studies investigated both intra‐rater and inter‐rater reliability. Intraclass correlation coefficient (ICC) scores for reliability ranged from −0.26 to 1.00. The highest ICC scores were found for the vastus lateralis, rectus femoris, upper arm anterior, and the trunk (ICC = 0.72 to 1.000). All included validity studies found ICC scores ranging from 0.92 to 0.999. Two studies describing the validity of ultrasound to predict lean body mass showed good validity as compared with dual‐energy X‐ray absorptiometry (r 2 = 0.92 to 0.96). This systematic review shows that ultrasound is a reliable and valid tool for the assessment of muscle size in older adults. More high‐quality research is required to confirm these findings in both clinical and healthy populations. Furthermore, ultrasound assessment of small muscles needs further evaluation. Ultrasound to predict lean body mass is feasible; however, future research is required to validate prediction equations in older adults with varying function and health. PMID:28703496
Development and testing of the cancer multidisciplinary team meeting observational tool (MDT-MOT)
Harris, Jenny; Taylor, Cath; Sevdalis, Nick; Jalil, Rozh; Green, James S.A.
2016-01-01
Abstract Objective To develop a tool for independent observational assessment of cancer multidisciplinary team meetings (MDMs), and test criterion validity, inter-rater reliability/agreement and describe performance. Design Clinicians and experts in teamwork used a mixed-methods approach to develop and refine the tool. Study 1 observers rated pre-determined optimal/sub-optimal MDM film excerpts and Study 2 observers independently rated video-recordings of 10 MDMs. Setting Study 2 included 10 cancer MDMs in England. Participants Testing was undertaken by 13 health service staff and a clinical and non-clinical observer. Intervention None. Main Outcome Measures Tool development, validity, reliability/agreement and variability in MDT performance. Results Study 1: Observers were able to discriminate between optimal and sub-optimal MDM performance (P ≤ 0.05). Study 2: Inter-rater reliability was good for 3/10 domains. Percentage of absolute agreement was high (≥80%) for 4/10 domains and percentage agreement within 1 point was high for 9/10 domains. Four MDTs performed well (scored 3+ in at least 8/10 domains), 5 MDTs performed well in 6–7 domains and 1 MDT performed well in only 4 domains. Leadership and chairing of the meeting, the organization and administration of the meeting, and clinical decision-making processes all varied significantly between MDMs (P ≤ 0.01). Conclusions MDT-MOT demonstrated good criterion validity. Agreement between clinical and non-clinical observers (within one point on the scale) was high but this was inconsistent with reliability coefficients and warrants further investigation. If further validated MDT-MOT might provide a useful mechanism for the routine assessment of MDMs by the local workforce to drive improvements in MDT performance. PMID:27084499
Development and testing of the cancer multidisciplinary team meeting observational tool (MDT-MOT).
Harris, Jenny; Taylor, Cath; Sevdalis, Nick; Jalil, Rozh; Green, James S A
2016-06-01
To develop a tool for independent observational assessment of cancer multidisciplinary team meetings (MDMs), and test criterion validity, inter-rater reliability/agreement and describe performance. Clinicians and experts in teamwork used a mixed-methods approach to develop and refine the tool. Study 1 observers rated pre-determined optimal/sub-optimal MDM film excerpts and Study 2 observers independently rated video-recordings of 10 MDMs. Study 2 included 10 cancer MDMs in England. Testing was undertaken by 13 health service staff and a clinical and non-clinical observer. None. Tool development, validity, reliability/agreement and variability in MDT performance. Study 1: Observers were able to discriminate between optimal and sub-optimal MDM performance (P ≤ 0.05). Study 2: Inter-rater reliability was good for 3/10 domains. Percentage of absolute agreement was high (≥80%) for 4/10 domains and percentage agreement within 1 point was high for 9/10 domains. Four MDTs performed well (scored 3+ in at least 8/10 domains), 5 MDTs performed well in 6-7 domains and 1 MDT performed well in only 4 domains. Leadership and chairing of the meeting, the organization and administration of the meeting, and clinical decision-making processes all varied significantly between MDMs (P ≤ 0.01). MDT-MOT demonstrated good criterion validity. Agreement between clinical and non-clinical observers (within one point on the scale) was high but this was inconsistent with reliability coefficients and warrants further investigation. If further validated MDT-MOT might provide a useful mechanism for the routine assessment of MDMs by the local workforce to drive improvements in MDT performance. © The Author 2016. Published by Oxford University Press in association with the International Society for Quality in Health Care; all rights reserved.
Commissioning and validation of COMPASS system for VMAT patient specific quality assurance
NASA Astrophysics Data System (ADS)
Pimthong, J.; Kakanaporn, C.; Tuntipumiamorn, L.; Laojunun, P.; Iampongpaiboon, P.
2016-03-01
Pre-treatment patient specific quality assurance (QA) of advanced treatment techniques such as volumetric modulated arc therapy (VMAT) is one of important QA in radiotherapy. The fast and reliable dosimetric device is required. The objective of this study is to commission and validate the performance of COMPASS system for dose verification of VMAT technique. The COMPASS system is composed of an array of ionization detectors (MatriXX) mounted to the gantry using a custom holder and software for the analysis and visualization of QA results. We validated the COMPASS software for basic and advanced clinical application. For the basic clinical study, the simple open field in various field sizes were validated in homogeneous phantom. And the advanced clinical application, the fifteen prostate and fifteen nasopharyngeal cancers VMAT plans were chosen to study. The treatment plans were measured by the MatriXX. The doses and dose-volume histograms (DVHs) reconstructed from the fluence measurements were compared to the TPS calculated plans. And also, the doses and DVHs computed using collapsed cone convolution (CCC) Algorithm were compared with Eclipse TPS calculated plans using Analytical Anisotropic Algorithm (AAA) that according to dose specified in ICRU 83 for PTV.
Kilner, T M; Brace, S J; Cooke, M W; Stallard, N; Bleetman, A; Perkins, G D
2011-05-01
The term "big bang" major incidents is used to describe sudden, usually traumatic,catastrophic events, involving relatively large numbers of injured individuals, where demands on clinical services rapidly outstrip the available resources. Triage tools support the pre-hospital provider to prioritise which patients to treat and/or transport first based upon clinical need. The aim of this review is to identify existing triage tools and to determine the extent to which their reliability and validity have been assessed. A systematic review of the literature was conducted to identify and evaluate published data validating the efficacy of the triage tools. Studies using data from trauma patients that report on the derivation, validation and/or reliability of the specific pre-hospital triage tools were eligible for inclusion.Purely descriptive studies, reviews, exercises or reports (without supporting data) were excluded. The search yielded 1982 papers. After initial scrutiny of title and abstract, 181 papers were deemed potentially applicable and from these 11 were identified as relevant to this review (in first figure). There were two level of evidence one studies, three level of evidence two studies and six level of evidence three studies. The two level of evidence one studies were prospective validations of Clinical Decision Rules (CDR's) in children in South Africa, all the other studies were retrospective CDR derivation, validation or cohort studies. The quality of the papers was rated as good (n=3), fair (n=7), poor (n=1). There is limited evidence for the validity of existing triage tools in big bang major incidents.Where evidence does exist it focuses on sensitivity and specificity in relation to prediction of trauma death or severity of injury based on data from single or small number patient incidents. The Sacco system is unique in combining survivability modelling with the degree by which the system is overwhelmed in the triage decision system. The practicalities, training implications, performance characteristics and reliance on computer technology during a mass casualty incident require further evaluation. 2010 Elsevier Ltd. All rights reserved.
Sinniah, Aishvarya; Oei, Tian P S; Chinna, Karuthan; Shah, Shamsul A; Maniam, T; Subramaniam, Ponnusamy
2015-01-01
The PANSI is a measure designed to assess the risk and protective factors related to suicidal behaviors. The present study evaluated the psychometric properties and factor structure of the Positive and Negative Suicide Ideation (PANSI) Inventory in a sample of clinical outpatients at a major hospital in Malaysia. In this study, 283 psychiatric patients and 200 medical (non-psychiatric) patients participated. All the patients completed the PANSI and seven other self-report instruments. Confirmative factor analysis supported the 2-factor oblique model. The internal consistency of the two subscales of PANSI-Negative and the PANSI-Positive were 0.93 and 0.84, respectively. In testing construct validity, PANSI showed sizable correlation with the other seven scales. Criterion validity was supported by scores on PANSI which differentiated psychiatric patients from medical patients. Logistic regression analyses showed PANSI can be used to classify the patients into suicidal or non-suicidal. The PANSI is a reliable and valid instrument to measure the severity of suicidal ideation among clinical outpatients in Malaysia.
Sinniah, Aishvarya; Oei, Tian P. S.; Chinna, Karuthan; Shah, Shamsul A.; Maniam, T.; Subramaniam, Ponnusamy
2015-01-01
The PANSI is a measure designed to assess the risk and protective factors related to suicidal behaviors. The present study evaluated the psychometric properties and factor structure of the Positive and Negative Suicide Ideation (PANSI) Inventory in a sample of clinical outpatients at a major hospital in Malaysia. In this study, 283 psychiatric patients and 200 medical (non-psychiatric) patients participated. All the patients completed the PANSI and seven other self-report instruments. Confirmative factor analysis supported the 2-factor oblique model. The internal consistency of the two subscales of PANSI-Negative and the PANSI-Positive were 0.93 and 0.84, respectively. In testing construct validity, PANSI showed sizable correlation with the other seven scales. Criterion validity was supported by scores on PANSI which differentiated psychiatric patients from medical patients. Logistic regression analyses showed PANSI can be used to classify the patients into suicidal or non-suicidal. The PANSI is a reliable and valid instrument to measure the severity of suicidal ideation among clinical outpatients in Malaysia. PMID:26733920
Baldwin, Carol M; Choi, Myunghan; McClain, Darya Bonds; Celaya, Alma; Quan, Stuart F
2012-04-15
To translate, back-translate and cross-language validate (English/Spanish) the Sleep Heart Health Study Sleep Habits Questionnaire for use with Spanish-speakers in clinical and research settings. Following rigorous translation and back-translation, this cross-sectional cross-language validation study recruited bilingual participants from academic, clinic, and community-based settings (N = 50; 52% women; mean age 38.8 ± 12 years; 90% of Mexican heritage). Participants completed English and Spanish versions of the Sleep Habits Questionnaire, the Epworth Sleepiness Scale, and the Acculturation Rating Scale for Mexican Americans II one week apart in randomized order. Psychometric properties were assessed, including internal consistency, convergent validity, scale equivalence, language version intercorrelations, and exploratory factor analysis using PASW (Version18) software. Grade level readability of the sleep measure was evaluated. All sleep categories (duration, snoring, apnea, insomnia symptoms, other sleep symptoms, sleep disruptors, restless legs syndrome) showed Cronbach α, Spearman-Brown coefficients and intercorrelations ≥ 0.700, suggesting robust internal consistency, correlation, and agreement between language versions. The Epworth correlated significantly with snoring, apnea, sleep symptoms, restless legs, and sleep disruptors) on both versions, supporting convergent validity. Items loaded on 4 factors accounted for 68% and 67% of the variance on the English and Spanish versions, respectively. The Spanish-language Sleep Habits Questionnaire demonstrates conceptual and content equivalency. It has appropriate measurement properties and should be useful for assessing sleep health in community-based clinics and intervention studies among Spanish-speaking Mexican Americans. Both language versions showed readability at the fifth grade level. Further testing is needed with larger samples.
Gould, Sara J.; Cardone, Dennis A.; Munyak, John; Underwood, Philipp J.; Gould, Stephen A.
2014-01-01
Context: Sidelines coverage presents unique challenges in the evaluation of injured athletes. Health care providers may be confronted with the question of when to obtain radiographs following an injury. Given that most sidelines coverage occurs outside the elite level, radiographs are not readily available at the time of injury, and the decision of when to send a player for radiographs must be made based on physical examination. Clinical tools have been developed to aid in identifying injuries that are likely to result in radiographically important fractures or dislocations. Evidence Acquisition: A search for the keywords x-ray and decision rule along with the anatomic locations shoulder, elbow, wrist, knee, and ankle was performed using the PubMed database. No limits were set regarding year of publication. We selected meta-analyses, randomized controlled trials, and survey results. Our selection focused on the largest, most well-studied published reports. We also attempted to include studies that reported the application of the rules to the field of sports medicine. Study Design: Retrospective literature review. Level of Evidence: Level 4. Results: The Ottawa Foot and Ankle Rules have been validated and implemented and are appropriate for use in both pediatric and adult populations. The Ottawa Knee Rules have been widely studied, validated, and accepted for evaluation of knee injuries. There are promising studies of decision rules for clinically important fractures of the wrist, but these studies have not been validated. The elbow has been evaluated with good outcomes via the elbow extension test, which has been validated in both single and multicenter studies. Currently, there are no reliable clinical decision tools for traumatic sports injuries to the shoulder to aid in the decision of when to obtain radiographs. Conclusion: Clinical decision tools have been developed to aid in the diagnosis and management of injuries commonly sustained during sporting events. Tools that have been appropriately validated in populations outside the initial study population can assist sports medicine physicians in the decision of when to get radiographs from the sidelines. PMID:24790698
Schiffman, Eric L.; Truelove, Edmond L.; Ohrbach, Richard; Anderson, Gary C.; John, Mike T.; List, Thomas; Look, John O.
2011-01-01
AIMS The purpose of the Research Diagnostic Criteria for Temporomandibular Disorders (RDC/TMD) Validation Project was to assess the diagnostic validity of this examination protocol. An overview is presented, including Axis I and II methodology and descriptive statistics for the study participant sample. This paper details the development of reliable methods to establish the reference standards for assessing criterion validity of the Axis I RDC/TMD diagnoses. Validity testing for the Axis II biobehavioral instruments was based on previously validated reference standards. METHODS The Axis I reference standards were based on the consensus of 2 criterion examiners independently performing a comprehensive history, clinical examination, and evaluation of imaging. Intersite reliability was assessed annually for criterion examiners and radiologists. Criterion exam reliability was also assessed within study sites. RESULTS Study participant demographics were comparable to those of participants in previous studies using the RDC/TMD. Diagnostic agreement of the criterion examiners with each other and with the consensus-based reference standards was excellent with all kappas ≥ 0.81, except for osteoarthrosis (moderate agreement, k = 0.53). Intrasite criterion exam agreement with reference standards was excellent (k ≥ 0.95). Intersite reliability of the radiologists for detecting computed tomography-disclosed osteoarthrosis and magnetic resonance imaging-disclosed disc displacement was good to excellent (k = 0.71 and 0.84, respectively). CONCLUSION The Validation Project study population was appropriate for assessing the reliability and validity of the RDC/TMD Axis I and II. The reference standards used to assess the validity of Axis I TMD were based on reliable and clinically credible methods. PMID:20213028
Anxiety measures validated in perinatal populations: a systematic review.
Meades, Rose; Ayers, Susan
2011-09-01
Research and screening of anxiety in the perinatal period is hampered by a lack of psychometric data on self-report anxiety measures used in perinatal populations. This paper aimed to review self-report measures that have been validated with perinatal women. A systematic search was carried out of four electronic databases. Additional papers were obtained through searching identified articles. Thirty studies were identified that reported validation of an anxiety measure with perinatal women. Most commonly validated self-report measures were the General Health Questionnaire (GHQ), State-Trait Anxiety Inventory (STAI), and Hospital Anxiety and Depression Scales (HADS). Of the 30 studies included, 11 used a clinical interview to provide criterion validity. Remaining studies reported one or more other forms of validity (factorial, discriminant, concurrent and predictive) or reliability. The STAI shows criterion, discriminant and predictive validity and may be most useful for research purposes as a specific measure of anxiety. The Kessler 10 (K-10) may be the best short screening measure due to its ability to differentiate anxiety disorders. The Depression Anxiety Stress Scales 21 (DASS-21) measures multiple types of distress, shows appropriate content, and remains to be validated against clinical interview in perinatal populations. Nineteen studies did not report sensitivity or specificity data. The early stages of research into perinatal anxiety, the multitude of measures in use, and methodological differences restrict comparison of measures across studies. There is a need for further validation of self-report measures of anxiety in the perinatal period to enable accurate screening and detection of anxiety symptoms and disorders. Copyright © 2010 Elsevier B.V. All rights reserved.
Burden-Teh, E; Phillips, R C; Thomas, K S; Ratib, S; Grindlay, D; Murphy, R
2017-11-06
The diagnosis of psoriasis in adults and children is made clinically, for both patient management and the selection of participants in research. Diagnostic criteria provide a structure for clinical assessment, which in turn helps standardize patient recruitment into clinical trials and case definitions in observational studies. The aim of this systematic review was to identify and critically appraise the published studies to date that had a primary research aim to develop or validate diagnostic criteria for psoriasis. A search of Ovid MEDLINE and Ovid Embase was conducted in October 2016. The primary objective was to record the sensitivity and specificity of diagnostic criteria for psoriasis. Secondary objectives included diagnostic recommendations, applicability to children and study characteristics. Diagnostic accuracy studies were critically appraised for risk of bias using the QUADAS-2 tool. Twenty-three studies met the inclusion criteria. None detailed clinical examination-based diagnostic criteria. The included criteria varied from genetic and molecular diagnostic models to skin imaging, histopathology, and questionnaire-based, computer-aided and traditional Chinese medicine criteria. High sensitivity and specificity (> 90%) were reported in many studies. However, the study authors often did not specify how the criteria would be used clinically or in research. This review identified studies with varying risk of bias, and due to each study developing separate criteria meta-analysis was not possible. Clinical examination-based diagnostic criteria are currently lacking for psoriasis. Future research could follow an international collaborative approach and employ study designs allowing high-quality diagnostic accuracy testing. Existing and newly developed criteria require validation. © 2017 British Association of Dermatologists.
Nicholson, Amanda; Mahon, James; Boland, Angela; Beale, Sophie; Dwan, Kerry; Fleeman, Nigel; Hockenhull, Juliet; Dundar, Yenal
2015-10-01
There is no single definitive test to identify prostate cancer in men. Biopsies are commonly used to obtain samples of prostate tissue for histopathological examination. However, this approach frequently misses cases of cancer, meaning that repeat biopsies may be necessary to obtain a diagnosis. The PROGENSA(®) prostate cancer antigen 3 (PCA3) assay (Hologic Gen-Probe, Marlborough, MA, USA) and the Prostate Health Index (phi; Beckman Coulter Inc., Brea, CA, USA) are two new tests (a urine test and a blood test, respectively) that are designed to be used to help clinicians decide whether or not to recommend a repeat biopsy. To evaluate the clinical effectiveness and cost-effectiveness of the PCA3 assay and the phi in the diagnosis of prostate cancer. Multiple publication databases and trial registers were searched in May 2014 (from 2000 to May 2014), including MEDLINE, EMBASE, The Cochrane Library, ISI Web of Science, Medion, Aggressive Research Intelligence Facility database, ClinicalTrials.gov, International Standard Randomised Controlled Trial Number Register and World Health Organization International Clinical Trials Registry Platform. The assessment of clinical effectiveness involved three separate systematic reviews, namely reviews of the analytical validity, the clinical validity of these tests and the clinical utility of these tests. The assessment of cost-effectiveness comprised a systematic review of full economic evaluations and the development of a de novo economic model. The perspective of the evaluation was the NHS in England and Wales. Men suspected of having prostate cancer for whom the results of an initial prostate biopsy were negative or equivocal. The use of the PCA3 score or phi in combination with existing tests (including histopathology results, prostate-specific antigen level and digital rectal examination), multiparametric magnetic resonance imaging and clinical judgement. In addition to documents published by the manufacturers, six studies were identified for inclusion in the analytical validity review. The review identified issues concerning the precision of the PCA3 assay measurements. It also highlighted issues relating to the storage requirements and stability of samples intended for analysis using the phi assay. Fifteen studies met the inclusion criteria for the clinical validity review. These studies reported results for 10 different clinical comparisons. There was insufficient evidence to enable the identification of appropriate test threshold values for use in a clinical setting. In addition, the implications of adding either the PCA3 assay or the phi to clinical assessment were not clear. Furthermore, the addition of the PCA3 assay or the phi to clinical assessment plus magnetic resonance imaging was not found to improve discrimination. No published papers met the inclusion criteria for either the clinical utility review or the cost-effectiveness review. The results from the cost-effectiveness analyses indicated that using either the PCA3 assay or the phi in the NHS was not cost-effective. The main limitations of the systematic review of clinical validity are that the review conclusions are over-reliant on findings from one study, the descriptions of clinical assessment vary widely within reviewed studies and many of the reported results for the clinical validity outcomes do not include either standard errors or confidence intervals. The clinical benefit of using the PCA3 assay or the phi in combination with existing tests, scans and clinical judgement has not yet been confirmed. The results from the cost-effectiveness analyses indicate that the use of these tests in the NHS would not be cost-effective. This study is registered as PROSPERO CRD42014009595. The National Institute for Health Research Health Technology Assessment programme.
Statistical considerations on prognostic models for glioma
Molinaro, Annette M.; Wrensch, Margaret R.; Jenkins, Robert B.; Eckel-Passow, Jeanette E.
2016-01-01
Given the lack of beneficial treatments in glioma, there is a need for prognostic models for therapeutic decision making and life planning. Recently several studies defining subtypes of glioma have been published. Here, we review the statistical considerations of how to build and validate prognostic models, explain the models presented in the current glioma literature, and discuss advantages and disadvantages of each model. The 3 statistical considerations to establishing clinically useful prognostic models are: study design, model building, and validation. Careful study design helps to ensure that the model is unbiased and generalizable to the population of interest. During model building, a discovery cohort of patients can be used to choose variables, construct models, and estimate prediction performance via internal validation. Via external validation, an independent dataset can assess how well the model performs. It is imperative that published models properly detail the study design and methods for both model building and validation. This provides readers the information necessary to assess the bias in a study, compare other published models, and determine the model's clinical usefulness. As editors, reviewers, and readers of the relevant literature, we should be cognizant of the needed statistical considerations and insist on their use. PMID:26657835
Predictive validity of the Braden Scale, Norton Scale, and Waterlow Scale in the Czech Republic.
Šateková, Lenka; Žiaková, Katarína; Zeleníková, Renáta
2017-02-01
The aim of this study was to determine the predictive validity of the Braden, Norton, and Waterlow scales in 2 long-term care departments in the Czech Republic. Assessing the risk for developing pressure ulcers is the first step in their prevention. At present, many scales are used in clinical practice, but most of them have not been properly validated yet (for example, the Modified Norton Scale in the Czech Republic). In the Czech Republic, only the Braden Scale has been validated so far. This is a prospective comparative instrument testing study. A random sample of 123 patients was recruited. The predictive validity of the pressure ulcer risk assessment scales was evaluated based on sensitivity, specificity, positive and negative predictive values, and the area under the receiver operating characteristic curve. The data were collected from April to August 2014. In the present study, the best predictive validity values were observed for the Norton Scale, followed by the Braden Scale and the Waterlow Scale, in that order. We recommended that the above 3 pressure ulcer risk assessment scales continue to be evaluated in the Czech clinical setting. © 2016 John Wiley & Sons Australia, Ltd.
Ertuğ, Nurcan
2018-06-01
The aim of this study was to determine the validity and reliability of the Turkish version of the V-scale, which measures nurses' attitudes towards vital signs monitoring in the detection of clinical deterioration. This validity and reliability study was conducted at a tertiary hospital in Ankara, Turkey, in 2016. A total of 169 ward nurses participated in the study. Exploratory factor analysis, Cronbach's alpha coefficient, and the intraclass correlation coefficient were used to determine the validity and reliability of the scale. A 5-factor, 16-item scale explained 60.823% of the total variance according to the validity analysis. Our version matched the original scale in terms of the number of items and factor structure. Cronbach's alpha coefficient of the Turkish version of the V-scale was 0.764. The test-retest reliability results were 0.855 for the overall intraclass correlation coefficient, and the t-test result was P > 0.05. The V-scale is a reliable and valid instrument to measure Turkish nurses' attitudes towards vital signs monitoring in the detection of clinical deterioration. © 2018 John Wiley & Sons Australia, Ltd.
The city of hope-quality of life-ostomy questionnaire: persian translation and validation.
Anaraki, F; Vafaie, M; Behboo, R; Esmaeilpour, S; Maghsoodi, N; Safaee, A; Grant, M
2014-07-01
Since there is no disease-specific instrument for measuring quality-of-life (QOL) in Ostomy patients in Persian language. This study was designed to translate and evaluate the validity and reliability of City of Hope-quality of life-Ostomy questionnaire (COH-QOL-Ostomy questionnaire). This study was designed as cross-sectional study. Reliability of the subscales and the summary scores were demonstrated by intra-class correlation coefficients. Pearson's correlations of an item with its own scale and other scales were calculated to evaluated convergent and discriminant validity. Clinical validity was also evaluated by known-group comparisons. Cronbach's alpha coefficient for all subscales was about 0.70 or higher. Results of interscale correlation were satisfactory and each subscale only measured a single and specified trait. All subscales met the standards of convergent and discriminant validity. Known group comparison analysis showed significant differences in social and spiritual well-being. The findings confirmed the reliability and validity of Persian version of COH-QOL-Ostomy questionnaire. The instrument was also well received by the Iranian patients. It can be considered as a valuable instrument to assess the different aspects of health related quality-of-life in Ostomy patients and used in clinical research in the future.
Development of a clinical feeding assessment scale for very young infants in South Africa
2016-01-01
Background There is a need for validated neonatal feeding assessment instruments in South Africa. A locally developed instrument may contribute to standardised evaluation procedures of high-risk neonates and address needs in resource constrained developing settings. Objective The aim of the study was to develop and validate the content of a clinical feeding assessment scale to diagnose oropharyngeal dysphagia (OPD) in neonates. Method The Neonatal Feeding Assessment Scale (NFAS) was developed using the Delphi method. Five international and South African speech-language therapists (SLTs) formed the expert panel, participating in two rounds of electronic questionnaires to develop and validate the content of the NFAS. Results All participants agreed on the need for the development of a valid clinical feeding assessment instrument to use with the neonatal population. The initial NFAS consisted of 240 items across 8 sections, and after the Delphi process was implemented, the final format was reduced to 211 items across 6 sections. The final format of the NFAS is scored using a binary scoring system guiding the clinician to diagnose the presence or absence of OPD. All members agreed on the format, the scoring system and the feeding constructs addressed in the revised final format of the NFAS. Conclusion The Delphi method and the diverse clinical and research experience of participants could be integrated to develop the NFAS which may be used in clinical practice in South Africa or similar developing contexts. Because of demographically different work settings marked by developed versus developing contexts, participants did not have the same expectations of a clinical dysphagia assessment. The international participants contributed to evidence-based content development. Local participants considered the contextual challenges of South African SLTs entering the field with basic competencies in neonatal dysphagia management, thereby justifying a comprehensive clinical instrument. The NFAS is aimed at clinicians working in Neonatal Intensive Care Units where they manage large caseloads of high-risk neonates. Further validation of the NFAS is recommended to determine its criterion validity in comparison with a widely accepted standard such as the modified barium swallow study. PMID:27796101
Gruen, Margaret E.; Griffith, Emily H.; Thomson, Andrea E.; Simpson, Wendy; Lascelles, B. Duncan X.
2015-01-01
Introduction Degenerative joint disease and associated pain are common in cats, particularly in older cats. There is a need for treatment options, however evaluation of putative therapies is limited by a lack of suitable, validated outcome measures that can be used in the target population of client owned cats. The objectives of this study were to evaluate low-dose daily meloxicam for the treatment of pain associated with degenerative joint disease in cats, and further validate two clinical metrology instruments, the Feline Musculoskeletal Pain Index (FMPI) and the Client Specific Outcome Measures (CSOM). Methods Sixty-six client owned cats with degenerative joint disease and owner-reported impairments in mobility were screened and enrolled into a double-masked, placebo-controlled, randomized clinical trial. Following a run-in baseline period, cats were given either placebo or meloxicam for 21 days, then in a masked washout, cats were all given placebo for 21 days. Subsequently, cats were given the opposite treatment, placebo or meloxicam, for 21 days. Cats wore activity monitors throughout the study, owners completed clinical metrology instruments following each period. Results Activity counts were increased in cats during treatment with daily meloxicam (p<0.0001) compared to baseline. The FMPI results and activity count data offer concurrent validation for the FMPI, though the relationship between baseline activity counts and FMPI scores at baseline was poor (R2=0.034). The CSOM did not show responsiveness for improvement in this study, and the relationship between baseline activity counts and CSOM scores at baseline was similarly poor (R2=0.042). Conclusions Refinements to the FMPI, including abbreviation of the instrument and scoring as percent of possible score are recommended. This study offered further validation of the FMPI as a clinical metrology instrument for use in detecting therapeutic efficacy in cats with degenerative joint disease. PMID:26162101
Clinical Validity of the ADI-R in a US-Based Latino Population
ERIC Educational Resources Information Center
Vanegas, Sandra B.; Magaña, Sandra; Morales, Miguel; McNamara, Ellyn
2016-01-01
The Autism Diagnostic Interview-Revised (ADI-R) has been validated as a tool to aid in the diagnosis of Autism; however, given the growing diversity in the United States, the ADI-R must be validated for different languages and cultures. This study evaluates the validity of the ADI-R in a US-based Latino, Spanish-speaking population of 50 children…
Virues-Ortega, Javier; Montaño-Fidalgo, Montserrat; Froján-Parga, María Xesús; Calero-Elvira, Ana
2011-12-01
This study analyzes the interobserver agreement and hypothesis-based known-group validity of the Therapist's Verbal Behavior Category System (SISC-INTER). The SISC-INTER is a behavioral observation protocol comprised of a set of verbal categories representing putative behavioral functions of the in-session verbal behavior of a therapist (e.g., discriminative, reinforcing, punishing, and motivational operations). The complete therapeutic process of a clinical case of an individual with marital problems was recorded (10 sessions, 8 hours), and data were arranged in a temporal sequence using 10-min periods. Hypotheses based on the expected performance of the putative behavioral functions portrayed by the SISC-INTER codes across prevalent clinical activities (i.e., assessing, explaining, Socratic method, providing clinical guidance) were tested using autoregressive integrated moving average (ARIMA) models. Known-group validity analyses provided support to all hypotheses. The SISC-INTER may be a useful tool to describe therapist-client interaction in operant terms. The utility of reliable and valid protocols for the descriptive analysis of clinical practice in terms of verbal behavior is discussed. Copyright © 2011. Published by Elsevier Ltd.
Ruiz, Francisco J; García-Beltrán, Diana M; Suárez-Falcón, Juan C
2017-10-01
The General Health Questionnaire - 12 (GHQ-12) is a widely used screening self-report for emotional disorders among adults. However, there is little evidence concerning the validity of the GHQ-12 in Colombia and its factorial invariance between nonclinical and clinical samples. Accordingly, the current study aims to explore the GHQ-12 validity in Colombian nonclinical and clinical samples. The GHQ-12 was administered to a total of 1641 participants, including a sample of undergraduates, one of general population, and a clinical sample. The internal consistency of the GHQ-12 across samples was good (overall alpha of .90). The one-factor model showed a good fit to the data and was considered theoretically more coherent than the two-factor model with positive and negative items loading in separate factors. Metric and scalar invariance were observed across nonclinical and clinical samples. The GHQ-12 scores were strongly and positively related to emotional symptoms and experiential avoidance, and negatively related to life satisfaction. According to the receiver operating characteristic (ROC) curves, a threshold score of 11/12 was optimal to identify emotional disorders. In conclusion, the GHQ-12 is a valid screening self-report in Colombia that provides scores that can be compared across clinical and nonclinical participants. Copyright © 2017 Elsevier B.V. All rights reserved.
Validity of the Kinect for Gait Assessment: A Focused Review
Springer, Shmuel; Yogev Seligmann, Galit
2016-01-01
Gait analysis may enhance clinical practice. However, its use is limited due to the need for expensive equipment which is not always available in clinical settings. Recent evidence suggests that Microsoft Kinect may provide a low cost gait analysis method. The purpose of this report is to critically evaluate the literature describing the concurrent validity of using the Kinect as a gait analysis instrument. An online search of PubMed, CINAHL, and ProQuest databases was performed. Included were studies in which walking was assessed with the Kinect and another gold standard device, and consisted of at least one numerical finding of spatiotemporal or kinematic measures. Our search identified 366 papers, from which 12 relevant studies were retrieved. The results demonstrate that the Kinect is valid only for some spatiotemporal gait parameters. Although the kinematic parameters measured by the Kinect followed the trend of the joint trajectories, they showed poor validity and large errors. In conclusion, the Kinect may have the potential to be used as a tool for measuring spatiotemporal aspects of gait, yet standardized methods should be established, and future examinations with both healthy subjects and clinical participants are required in order to integrate the Kinect as a clinical gait analysis tool. PMID:26861323
ClinicalTrials.gov and Drugs@FDA: A comparison of results reporting for new drug approval trials
Schwartz, Lisa M.; Woloshin, Steven; Zheng, Eugene; Tse, Tony; Zarin, Deborah A.
2016-01-01
Background Pharmaceutical companies and other trial sponsors must submit certain trial results to ClinicalTrials.gov. The validity of these results is unclear. Purpose To validate results posted on ClinicalTrials.gov against publicly-available FDA reviews on Drugs@FDA. Data sources ClinicalTrials.gov (registry and results database) and Drugs@FDA (medical/statistical reviews). Study selection 100 parallel-group, randomized trials for new drug approvals (1/2013 – 7/2014) with results posted on ClinicalTrials.gov (3/15/2015). Data extraction Two assessors systematically extracted, and another verified, trial design, primary/secondary outcomes, adverse events, and deaths. Results The 100 trials were mostly phase 3 (90%) double-blind (92%), placebo-controlled (73%), representing 32 drugs from 24 companies. Of 137 primary outcomes from ClinicalTrials.gov, 134 (98%) had corresponding data in Drugs@FDA, 130 (95%) had concordant definitions, and 107 (78%) had concordant results; most differences were nominal (i.e. relative difference < 10%). Of 100 trials, primary outcome results in 14 could not be validated . Of 1,927 secondary outcomes from ClinicalTrials.gov, 1,061 (55%) definitions could be validated and 367 (19%) had results. Of 96 trials with ≥ 1 serious adverse event in either source, 14 could be compared and 7 were discordant. Of 62 trials with ≥ 1 death in either source, 25 could be compared and 17 were discordant. Limitations Unknown generalizability to uncontrolled or crossover trial results. Conclusion Primary outcome definitions and results were largely concordant between ClinicalTrials.gov and Drugs@FDA. Half of secondary outcomes could not be validated because Drugs@FDA only includes “key outcomes” for regulatory decision-making; nor could serious adverse events and deaths because Drugs@FDA frequently only includes results aggregated across multiple trials. PMID:27294570
Bowen, Raffick A R; Adcock, Dorothy M
2016-12-01
Blood collection tubes (BCTs) are an often under-recognized variable in the preanalytical phase of clinical laboratory testing. Unfortunately, even the best-designed and manufactured BCTs may not work well in all clinical settings. Clinical laboratories, in collaboration with healthcare providers, should carefully evaluate BCTs prior to putting them into clinical use to determine their limitations and ensure that patients are not placed at risk because of inaccuracies due to poor tube performance. Selection of the best BCTs can be achieved through comparing advertising materials, reviewing the literature, observing the device at a scientific meeting, receiving a demonstration, evaluating the device under simulated conditions, or testing the device with patient samples. Although many publications have discussed method validations, few detail how to perform experiments for tube verification and validation. This article highlights the most common and impactful variables related to BCTs and discusses the validation studies that a typical clinical laboratory should perform when selecting BCTs. We also present a brief review of how in vitro diagnostic devices, particularly BCTs, are regulated in the United States, the European Union, and Canada. The verification and validation of BCTs will help to avoid the economic and human costs associated with incorrect test results, including poor patient care, unnecessary testing, and delays in test results. We urge laboratorians, tube manufacturers, diagnostic companies, and other researchers to take all the necessary steps to protect against the adverse effects of BCT components and their additives on clinical assays. Copyright © 2016 The Canadian Society of Clinical Chemists. Published by Elsevier Inc. All rights reserved.
Are You "Tilting at Windmills" or Undertaking a Valid Clinical Trial?
Zariffa, Jose; Kramer, John L.K.
2011-01-01
In this review, several aspects surrounding the choice of a therapeutic intervention and the conduct of clinical trials are discussed. Some of the background for why human studies have evolved to their current state is also included. Specifically, the following questions have been addressed: 1) What criteria should be used to determine whether a scientific discovery or invention is worthy of translation to human application? 2) What recent scientific advance warrants a deeper understanding of clinical trials by everyone? 3) What are the different types and phases of a clinical trial? 4) What characteristics of a human disorder should be noted, tracked, or stratified for a clinical trial and what inclusion /exclusion criteria are important to enrolling appropriate trial subjects? 5) What are the different study designs that can be used in a clinical trial program? 6) What confounding factors can alter the accurate interpretation of clinical trial outcomes? 7) What are the success rates of clinical trials and what can we learn from previous clinical trials? 8) What are the essential principles for the conduct of valid clinical trials? PMID:21786433
Junghaenel, Doerte U.; Schneider, Stefan; Stone, Arthur A.; Christodoulou, Christopher; Broderick, Joan E.
2014-01-01
Objective This study examined the ecological validity and clinical utility of NIH Patient Reported-Outcomes Measurement Information System (PROMIS®) instruments for anger, depression, and fatigue in women with premenstrual symptoms. Methods One-hundred women completed daily diaries and weekly PROMIS assessments over 4 weeks. Weekly assessments were administered through Computerized Adaptive Testing (CAT). Weekly CATs and corresponding daily scores were compared to evaluate ecological validity. To test clinical utility, we examined if CATs could detect changes in symptom levels, if these changes mirrored those obtained from daily scores, and if CATs could identify clinically meaningful premenstrual symptom change. Results PROMIS CAT scores were higher in the pre-menstrual than the baseline (ps < .0001) and post-menstrual (ps < .0001) weeks. The correlations between CATs and aggregated daily scores ranged from .73 to .88 supporting ecological validity. Mean CAT scores showed systematic changes in accordance with the menstrual cycle and the magnitudes of the changes were similar to those obtained from the daily scores. Finally, Receiver Operating Characteristic (ROC) analyses demonstrated the ability of the CATs to discriminate between women with and without clinically meaningful premenstrual symptom change. Conclusions PROMIS CAT instruments for anger, depression, and fatigue demonstrated validity and utility in premenstrual symptom assessment. The results provide encouraging initial evidence of the utility of PROMIS instruments for the measurement of affective premenstrual symptoms. PMID:24630180
Construct validity of the Iowa Gambling Task.
Buelow, Melissa T; Suhr, Julie A
2009-03-01
The Iowa Gambling Task (IGT) was created to assess real-world decision making in a laboratory setting and has been applied to various clinical populations (i.e., substance abuse, schizophrenia, pathological gamblers) outside those with orbitofrontal cortex damage, for whom it was originally developed. The current review provides a critical examination of lesion, functional neuroimaging, developmental, and clinical studies in order to examine the construct validity of the IGT. The preponderance of evidence provides support for the use of the IGT to detect decision making deficits in clinical populations, in the context of a more comprehensive evaluation. The review includes a discussion of three critical issues affecting the validity of the IGT, as it has recently become available as a clinical instrument: the lack of a concise definition as to what aspect of decision making the IGT measures, the lack of data regarding reliability of the IGT, and the influence of personality and state mood on IGT performance.
Klein, Eric A; Cooperberg, Matthew R; Magi-Galluzzi, Cristina; Simko, Jeffry P; Falzarano, Sara M; Maddala, Tara; Chan, June M; Li, Jianbo; Cowan, Janet E; Tsiatis, Athanasios C; Cherbavaz, Diana B; Pelham, Robert J; Tenggara-Hunter, Imelda; Baehner, Frederick L; Knezevic, Dejan; Febbo, Phillip G; Shak, Steven; Kattan, Michael W; Lee, Mark; Carroll, Peter R
2014-09-01
Prostate tumor heterogeneity and biopsy undersampling pose challenges to accurate, individualized risk assessment for men with localized disease. To identify and validate a biopsy-based gene expression signature that predicts clinical recurrence, prostate cancer (PCa) death, and adverse pathology. Gene expression was quantified by reverse transcription-polymerase chain reaction for three studies-a discovery prostatectomy study (n=441), a biopsy study (n=167), and a prospectively designed, independent clinical validation study (n=395)-testing retrospectively collected needle biopsies from contemporary (1997-2011) patients with low to intermediate clinical risk who were candidates for active surveillance (AS). The main outcome measures defining aggressive PCa were clinical recurrence, PCa death, and adverse pathology at prostatectomy. Cox proportional hazards regression models were used to evaluate the association between gene expression and time to event end points. Results from the prostatectomy and biopsy studies were used to develop and lock a multigene-expression-based signature, called the Genomic Prostate Score (GPS); in the validation study, logistic regression was used to test the association between the GPS and pathologic stage and grade at prostatectomy. Decision-curve analysis and risk profiles were used together with clinical and pathologic characteristics to evaluate clinical utility. Of the 732 candidate genes analyzed, 288 (39%) were found to predict clinical recurrence despite heterogeneity and multifocality, and 198 (27%) were predictive of aggressive disease after adjustment for prostate-specific antigen, Gleason score, and clinical stage. Further analysis identified 17 genes representing multiple biological pathways that were combined into the GPS algorithm. In the validation study, GPS predicted high-grade (odds ratio [OR] per 20 GPS units: 2.3; 95% confidence interval [CI], 1.5-3.7; p<0.001) and high-stage (OR per 20 GPS units: 1.9; 95% CI, 1.3-3.0; p=0.003) at surgical pathology. GPS predicted high-grade and/or high-stage disease after controlling for established clinical factors (p<0.005) such as an OR of 2.1 (95% CI, 1.4-3.2) when adjusting for Cancer of the Prostate Risk Assessment score. A limitation of the validation study was the inclusion of men with low-volume intermediate-risk PCa (Gleason score 3+4), for whom some providers would not consider AS. Genes representing multiple biological pathways discriminate PCa aggressiveness in biopsy tissue despite tumor heterogeneity, multifocality, and limited sampling at time of biopsy. The biopsy-based 17-gene GPS improves prediction of the presence or absence of adverse pathology and may help men with PCa make more informed decisions between AS and immediate treatment. Prostate cancer (PCa) is often present in multiple locations within the prostate and has variable characteristics. We identified genes with expression associated with aggressive PCa to develop a biopsy-based, multigene signature, the Genomic Prostate Score (GPS). GPS was validated for its ability to predict men who have high-grade or high-stage PCa at diagnosis and may help men diagnosed with PCa decide between active surveillance and immediate definitive treatment. Copyright © 2014 European Association of Urology. Published by Elsevier B.V. All rights reserved.
Newton, Katherine M; Peissig, Peggy L; Kho, Abel Ngo; Bielinski, Suzette J; Berg, Richard L; Choudhary, Vidhu; Basford, Melissa; Chute, Christopher G; Kullo, Iftikhar J; Li, Rongling; Pacheco, Jennifer A; Rasmussen, Luke V; Spangler, Leslie; Denny, Joshua C
2013-06-01
Genetic studies require precise phenotype definitions, but electronic medical record (EMR) phenotype data are recorded inconsistently and in a variety of formats. To present lessons learned about validation of EMR-based phenotypes from the Electronic Medical Records and Genomics (eMERGE) studies. The eMERGE network created and validated 13 EMR-derived phenotype algorithms. Network sites are Group Health, Marshfield Clinic, Mayo Clinic, Northwestern University, and Vanderbilt University. By validating EMR-derived phenotypes we learned that: (1) multisite validation improves phenotype algorithm accuracy; (2) targets for validation should be carefully considered and defined; (3) specifying time frames for review of variables eases validation time and improves accuracy; (4) using repeated measures requires defining the relevant time period and specifying the most meaningful value to be studied; (5) patient movement in and out of the health plan (transience) can result in incomplete or fragmented data; (6) the review scope should be defined carefully; (7) particular care is required in combining EMR and research data; (8) medication data can be assessed using claims, medications dispensed, or medications prescribed; (9) algorithm development and validation work best as an iterative process; and (10) validation by content experts or structured chart review can provide accurate results. Despite the diverse structure of the five EMRs of the eMERGE sites, we developed, validated, and successfully deployed 13 electronic phenotype algorithms. Validation is a worthwhile process that not only measures phenotype performance but also strengthens phenotype algorithm definitions and enhances their inter-institutional sharing.
Tompkins, D. Andrew; Bigelow, George E.; Harrison, Joseph A.; Johnson, Rolley E.; Fudala, Paul J.; Strain, Eric C.
2009-01-01
Introduction The Clinical Opiate Withdrawal Scale (COWS) is an 11-item clinician-administered scale assessing opioid withdrawal. Though commonly used in clinical practice, it has not been systematically validated. The present study validated the COWS in comparison to the validated Clinical Institute Narcotic Assessment (CINA) scale. Method Opioid-dependent volunteers were enrolled in a residential trial and stabilized on morphine 30 mg given subcutaneously four times daily. Subjects then underwent double-blind, randomized challenges of intramuscularly administered placebo and naloxone (0.4 mg) on separate days, during which the COWS, CINA, and visual analog scale (VAS) assessments were concurrently obtained. Subjects completing both challenges were included (N=46). Correlations between mean peak COWS and CINA scores as well as self-report VAS questions were calculated. Results Mean peak COWS and CINA scores of 7.6 and 24.4, respectively, occurred on average 30 minutes post-injection of naloxone. Mean COWS and CINA scores 30 minutes after placebo injection were 1.3 and 18.9, respectively. The Pearson correlation coefficient for peak COWS and CINA scores during the naloxone challenge session was 0.85 (p<0.001). Peak COWS scores also correlated well with peak VAS self-report scores of bad drug effect (r=0.57, p<0.001) and feeling sick (r=0.57, p<0.001), providing additional evidence of concurrent validity. Placebo was not associated with any significant elevation of COWS, CINA, or VAS scores, indicating discriminant validity. Cronbach’s alpha for the COWS was 0.78, indicating good internal consistency (reliability). Discussion COWS, CINA, and certain VAS items are all valid measurement tools for acute opiate withdrawal. PMID:19647958
Anonymization of electronic medical records for validating genome-wide association studies
Loukides, Grigorios; Gkoulalas-Divanis, Aris; Malin, Bradley
2010-01-01
Genome-wide association studies (GWAS) facilitate the discovery of genotype–phenotype relations from population-based sequence databases, which is an integral facet of personalized medicine. The increasing adoption of electronic medical records allows large amounts of patients’ standardized clinical features to be combined with the genomic sequences of these patients and shared to support validation of GWAS findings and to enable novel discoveries. However, disseminating these data “as is” may lead to patient reidentification when genomic sequences are linked to resources that contain the corresponding patients’ identity information based on standardized clinical features. This work proposes an approach that provably prevents this type of data linkage and furnishes a result that helps support GWAS. Our approach automatically extracts potentially linkable clinical features and modifies them in a way that they can no longer be used to link a genomic sequence to a small number of patients, while preserving the associations between genomic sequences and specific sets of clinical features corresponding to GWAS-related diseases. Extensive experiments with real patient data derived from the Vanderbilt's University Medical Center verify that our approach generates data that eliminate the threat of individual reidentification, while supporting GWAS validation and clinical case analysis tasks. PMID:20385806
Eglseer, Doris; Halfens, Ruud J G; Lohrmann, Christa
2017-05-01
The aims of this study were to evaluate the association between the use of clinical guidelines and the use of validated screening tools, evaluate the nutritional screening policy in hospitals, and examine the association between the use of validated screening tools and the prevalence of malnutrition and nutritional interventions in hospitalized patients. This was a cross-sectional, multicenter study. Data were collected using a standardized questionnaire on three levels: institution (presence of a guideline for malnutrition), department (use of a validated screening tool), and patient (e.g., malnutrition prevalence). In all, 53 hospitals with 5255 patients participated. About 45% of the hospitals indicated that they have guidelines for malnutrition. Of the departments surveyed, 38.6% used validated screening tools as part of a standard procedure. The nutritional status of 74.5% of the patients was screened during admission, mostly on the basis of clinical observation and patient weight. A validated screening tool was used for 21.2% of the patients. Significant differences between wards with and without validated screening tools were found with regard to malnutrition prevalence (P = 0.002) and the following interventions: referral to a dietitian (P < 0.001), provision of energy-enriched snacks (P = 0.038), adjustment of consistency (food/drinks; P = 0.004), monitoring of the nutritional intake (P = 0.001), and adjustment of the meal ambiance (P < 0.001). Nutritional screening with validated tools in hospitalized patients remains poor. Generally, the nutritional status of patients is screened with unreliable parameters such as clinical observation and body mass index. The results of the present study suggest that the use of validated malnutrition screening tools is associated with better nutritional care and lower malnutrition prevalence rates in hospitalized patients. Copyright © 2017 Elsevier Inc. All rights reserved.
ERIC Educational Resources Information Center
Reiter, Harold I.; Rosenfeld, Jack; Nandagopal, Kiruthiga; Eva, Kevin W.
2004-01-01
Context: Various research studies have examined the question of whether expert or non-expert raters, faculty or students, evaluators or standardized patients, give more reliable and valid summative assessments of performance on Objective Structured Clinical Examinations (OSCEs). Less studied has been the question of whether or not non-faculty…
Scott, Jan; Geoffroy, Pierre Alexis; Sportiche, Sarah; Brichant-Petit-Jean, Clara; Gard, Sebastien; Kahn, Jean-Pierre; Azorin, Jean-Michel; Henry, Chantal; Etain, Bruno; Bellivier, Frank
2017-01-15
It is increasingly recognised that reliable and valid assessments of lithium response are needed in order to target more efficiently the use of this medication in bipolar disorders (BD) and to identify genotypes, endophenotypes and biomarkers of response. In a large, multi-centre, clinically representative sample of 300 cases of BD, we assess external clinical validators of lithium response phenotypes as defined using three different recommended approaches to scoring the Alda lithium response scale. The scale comprises an A scale (rating lithium response) and a B scale (assessing confounders). Analysis of the two continuous scoring methods (A scale score minus the B scale score, or A scale score in those with a low B scale score) demonstrated that 21-23% of the explained variance in lithium response was accounted for by a positive family history of BD I and the early introduction of lithium. Categorical definitions of response suggest poor response is also associated with a positive history of alcohol and/or substance use comorbidities. High B scale scores were significantly associated with longer duration of illness prior to receiving lithium and the presence of psychotic symptoms. The original sample was not recruited specifically to study lithium response. The Alda scale is designed to assess response retrospectively. This cross-validation study identifies different clinical phenotypes of lithium response when defined by continuous or categorical measures. Future clinical, genetic and biomarker studies should report both the findings and the method employed to assess lithium response according to the Alda scale. Copyright © 2016 Elsevier B.V. All rights reserved.
Modified Moral Distress Scale (MDS-11): Validation Study Among Italian Nurses.
Badolamenti, Sondra; Fida, Roberto; Biagioli, Valentina; Caruso, Rosario; Zaghini, Francesco; Sili, Alessandro; Rea, Teresa
2017-01-01
Moral distress (MD) has significant implications on individual and organizational health. However there is a lack of an instrument to assess it among Italian nurses. The main aim of this study was to validate a brief instrument to assess MD, developed from the Corley's Moral Distress Scale (MDS). The modified MDS scale was subjected to content and cultural validity. The scale was administered to 347 nurses. Psychometric analysis were performed to assess construct validity. The scale consists of 11 items, investigating MD in nursing practice in different clinical settings. The dimensionality of the scale was investigated through exploratory factor analysis (EFA), which showed a two-dimensional structure labeled futility and potential damage. The futility refers to feelings of powerlessness and ineffectiveness in some clinical situations; the potential damage dimension captures feelings of powerlessness when nurses are forced to tolerate or perform perceived abusive clinical proceedings. Nurses who experienced higher MD, were more lilely to experience burnout. The modified MDS showed good psychometric properties, and it is valid and reliable for assessing moral distress among Italian nurses. Hence, the modified MDS allows to monitor the distress experienced by nurses and it is an important contribution to the scientific community and all those dealing with well-being of health workers.
Cunningham, Jennifer; Wallston, Kenneth A; Wilkins, Consuelo H; Hull, Pamela C; Miller, Stephania T
2015-12-01
This study describes the development and psychometric evaluation of HPV Clinical Trial Survey for Parents with Children Aged 9 to 15 (CTSP-HPV) using traditional instrument development methods and community engagement principles. An expert panel and parental input informed survey content and parents recommended study design changes (e.g., flyer wording). A convenience sample of 256 parents completed the final survey measuring parental willingness to consent to HPV clinical trial (CT) participation and other factors hypothesized to influence willingness (e.g., HPV vaccine benefits). Cronbach's a, Spearman correlations, and multiple linear regression were used to estimate internal consistency, convergent and discriminant validity, and predictively validity, respectively. Internal reliability was confirmed for all scales (a ≥ 0.70.). Parental willingness was positively associated (p < 0.05) with trust in medical researchers, adolescent CT knowledge, HPV vaccine benefits, advantages of adolescent CTs (r range 0.33-0.42), supporting convergent validity. Moderate discriminant construct validity was also demonstrated. Regression results indicate reasonable predictive validity with the six scales accounting for 31% of the variance in parents' willingness. This instrument can inform interventions based on factors that influence parental willingness, which may lead to the eventual increase in trial participation. Further psychometric testing is warranted. © 2015 Wiley Periodicals, Inc.
Aubin, Michèle; Verreault, René; Savoie, Maryse; LeMay, Sylvie; Hadjistavropoulos, Thomas; Fillion, Lise; Beaulieu, Marie; Viens, Chantal; Bergeron, Rénald; Vézina, Lucie; Misson, Lucie; Fuchs-Lacelle, Shannon
2008-01-01
This study presents the validation of the French Canadian version (PACLSAC-F) of the Pain Assessment Checklist for Seniors with Limited Ability to Communicate (PACSLAC). Unlike the published validation of the English version of the PACSLAC, which was validated retrospectively, the French version was validated prospectively. The PACSLAC-F was completed by nurses working in long-term care facilities after observing 86 seniors, with severe cognitive impairment, in calm, painful or distressing but non-painful situations. The test-retest and inter-observer reliability, the internal consistency, and the discriminent validity were found to be satisfactory. To evaluate the convergent validity with the DOLOPLUS-2 and the clinical relevance of the PACSLAC, it was also completed by nurses during their work shift, with 26 additional patients, for three days per week during a period of four weeks. These results encourage us to test the PACSLAC in a comprehensive program of pain management targeting this population.
Development and validation of PSPSQ 2.0 measuring patient satisfaction with pharmacist services.
Sakharkar, Prashant; Bounthavong, Mark; Hirsch, Jan D; Morello, Candis M; Chen, Timothy C; Law, Anandi V
2015-01-01
The extant literature reveals a lack of psychometrically validated tools measuring patient satisfaction with pharmacist clinical services. The Patient Satisfaction with Pharmacist Services Questionnaire (PSPSQ 2.0) was developed to address this need using a mixed methods approach. To assess the psychometric properties of the PSPSQ 2.0, an instrument developed to measure patient satisfaction with clinical services provided by pharmacists. Validation studies were conducted in two Veterans Affairs (VA)-based and two community-based (diabetes and psychiatric care) disease management/medication therapy management clinics. The PSPSQ 2.0 consisted of 22-items related to three domains identified as quality of care, patient-pharmacist relationship and overall satisfaction using a 4-point, Likert-type scale. It was administered to participants following their session with a pharmacist at the clinics. Collected data were analyzed for descriptive statistics, internal consistency, and validity using exploratory factor analysis. A total of 149 patients completed the survey. Patients from VA clinics were on average 61 years old, mostly white (63%), and predominantly male (95%). Patients from non-VA clinics were on average 47 years old, mostly White (47%) and male (53%). Non-VA patients mostly had Medicaid (42%) and commercial health insurance (31%), whereas VA patients retained benefits with the US Department of Veterans Affairs. Reliability of the scale using internal consistency metrics revealed a Cronbach's alpha of 0.98, 0.98 and 0.95 for VA, diabetes, and psychiatric care clinics, respectively, whereas the Cronbach's alpha for the pooled sample was 0.96. Factor analyses resulted in a three-factor solution accounting for 91% and 69% variance for diabetes and psychiatric care clinics, respectively; however, VA clinics and pooled sample yielded only 2-factor solution with 80% and 66% variance, respectively, with more items loading on patient-pharmacist relationship domain. The results suggest that the PSPSQ 2.0 can serve as a reliable and valid tool for measuring patient satisfaction with pharmacists providing clinical services in VA- and non-VA settings upon further validation. Copyright © 2015 Elsevier Inc. All rights reserved.
Standard Specimen Reference Set: Colon — EDRN Public Portal
The Early Detection Research Network, Great Lakes-New England Clinical, Epidemiological and Validation Center (GLNE CVC) announces the availability of serum, plasma and urine samples for the early detection for colon cancer and validation studies.
2013-01-01
Background The Scale to Assess Unawareness in Mental Disorder (SUMD) is widely used in clinical trials and epidemiological studies but more rarely in clinical practice because of its length (74 items). In clinical practice, it is necessary to provide shorter instruments. The aim of this study was to investigate the validity and reliability of the abbreviated version of the SUMD. Methods Design: We used data from four cross-sectional studies conducted in several psychiatric hospitals in France. Inclusion criteria: a diagnosis of schizophrenia based on DSM-IV criteria. Data collection: socio-demographic and clinical data (including duration of illness, Positive and Negative Syndrome Scale, and the Calgary Depression Scale); quality of life; SUMD. Statistical analysis: confirmatory factor analyses, item-dimension correlations, Cronbach’s alpha coefficients, Rasch statistics, relationships between the SUMD and other parameters. We tested two different scoring models and considered the response ‘not applicable’ as ‘0’ or as missing data. Results Five hundred and thirty-one patients participated in this study. The 3-factor structure of the SUMD (awareness of the disease, consequences and need for treatment; awareness of positive symptoms; and awareness of negative symptoms) was confirmed using LISREL confirmatory factor analysis for the two models. Internal item consistency and reliability were satisfactory for all dimensions. External validity testing revealed that dimension scores correlated significantly with all PANSS scores, especially with the G12 item (lack of judgement and awareness). Significant associations with age, disease duration, education level, and living arrangements showed good discriminant validity. Conclusion The abbreviated version of the SUMD appears to be a valid and reliable instrument for measuring insight in patients with schizophrenia and may be used by clinicians to accurately assess insight in clinical settings. PMID:24053640
Wang, Zhenlei; Jiang, Ji; Hu, Pei; Zhao, Qian
2017-02-01
Fotagliptin is a novel dipeptidyl peptidase IV inhibitor under clinical development for the treatment of Type II diabetes mellitus. The objective of this study was to develop and validate a specific and sensitive ultra-performance liquid chromatography (UPLC)-MS/MS method for simultaneous determination of fotagliptin and its two major metabolites in human plasma and urine. Methodology & results: After being pretreated using an automatized procedure, the plasma and urine samples were separated and detected using a UPLC-ESI-MS/MS method, which was validated following the international guidelines. A selective and sensitive UPLC-MS/MS method was first developed and validated for quantifying fotagliptin and its metabolite in human plasma and urine. The method was successfully applied to support the clinical study of fotagliptin in Chinese healthy subjects.
A Comparative Study of Adolescent Risk Assessment Instruments: Predictive and Incremental Validity
ERIC Educational Resources Information Center
Welsh, Jennifer L.; Schmidt, Fred; McKinnon, Lauren; Chattha, H. K.; Meyers, Joanna R.
2008-01-01
Promising new adolescent risk assessment tools are being incorporated into clinical practice but currently possess limited evidence of predictive validity regarding their individual and/or combined use in risk assessments. The current study compares three structured adolescent risk instruments, Youth Level of Service/Case Management Inventory…
Knowledge and skills of cancer clinical trials nurses in Australia.
Scott, Kathleen; White, Kate; Johnson, Catherine; Roydhouse, Jessica K
2012-05-01
This paper is a report of the development and testing of a questionnaire measuring knowledge and skills of cancer clinical trials nurse in Australia. The role of cancer clinical trials nurse, widely acknowledged as an integral member of the clinical research team, has evolved in recent years. Elements of the clinical trials nurse role in cancer have previously been described. To evaluate specific cancer clinical trials nurse educational and training needs, the development of a valid and reliable tool is required. In 2009, a study was conducted in three stages. Stage I: questionnaire development and pilot testing; stage II: focus group; stage III: national survey. Internal consistency reliability testing and multi-trait analysis of item convergent/divergent validity were employed. Regression analysis was used to identify predictors of clinical trials nurse knowledge and skills. The national survey was a 48-item questionnaire, measuring six clinical trial knowledge and seven skills sub-scales. Of 61 respondents, 90% were women, with mean age 43 years, 19 years as a Registered Nurse and 5 years as a cancer clinical trials nurse. Self-reported knowledge and skills were satisfactory to good. Internal consistency reliability was high (Cronbach's alpha: knowledge = 0·98; skills = 0·90). Criteria for item convergent/divergent validity were met. Number of years as cancer clinical trials nurse was positively related to self-reported knowledge and skills. Preliminary data suggest that the national survey is reliable and valid. Data have contributed to better understanding the knowledge and skills of cancer clinical trials nurse in Australia and development of a postgraduate course in clinical trials. © 2011 Blackwell Publishing Ltd.
Allen, Michael H; Daniel, David G; Revicki, Dennis A; Canuso, Carla M; Turkoz, Ibrahim; Fu, Dong-Jing; Alphs, Larry; Ishak, K Jack; Bartko, John J; Lindenmayer, Jean-Pierre
2012-01-01
The Clinical Global Impression for Schizoaffective Disorder scale is a new rating scale adapted from the Clinical Global Impression scale for use in patients with schizoaffective disorder. The psychometric characteristics of the Clinical Global Impression for Schizoaffective Disorder are described. Content validity was assessed using an investigator questionnaire. Inter-rater reliability was determined with 12 sets of videotaped interviews rated independently by two trained individuals. Test-retest reliability was assessed using 30 randomly selected raters from clinical trials who evaluated the same videos on separate occasions two weeks apart. Convergent and divergent validity and effect size were evaluated by comparing scores between the Clinical Global Impression for Schizoaffective Disorder and the Positive and Negative Syndrome Scale, 21-item Hamilton Rating Scale for Depression, and Young Mania Rating Scale scales using pooled patient data from two clinical trials. Clinical Global Impression for Schizoaffective Disorder scores were then linked to corresponding Positive and Negative Syndrome Scale scores. Content validity was strong. Inter-rater agreement was good to excellent for most scales and subscales (intra-class correlation coefficient ≥ 0.50). Test-retest showed good reproducibility, with intraclass correlation coefficients ranging from 0.444 to 0.898. Spearman correlations between Clinical Global Impression for Schizoaffective Disorder domains and corresponding symptom scales were 0.60 or greater, and effect sizes for Clinical Global Impression for Schizoaffective Disorder overall and domain scores were similar to Positive and Negative Syndrome Scale Young Mania Rating Scale, and 21-item Hamilton Rating Scale for Depression scores. Raters anticipated that the scale might be less effective in distinguishing negative from depressive symptoms, and, in fact, the results here may reflect that clinical reality. Multiple lines of evidence support the reliability and validity of the Clinical Global Impression for Schizoaffective Disorder for studies in schizoaffective disorder.
Li, Wenkui; Doherty, John P; Kulmatycki, Kenneth; Smith, Harold T; Tse, Francis Ls
2012-06-01
In support of a pilot clinical trial using acetaminophen as the model compound to assess dried blood spot (DBS) sampling as the method for clinical pharmacokinetic sample collection, a novel sensitive LC-MS/MS method was developed and validated for the simultaneous determination of acetaminophen and its major metabolites, acetaminophen glucuronide and sulfate, in human DBS samples collected by subjects via fingerprick. The validated assay dynamic range was from 50.0 to 5000 ng/ml for each compound using a 1/8´´ (3-mm) disc punched from a DBS sample. Baseline separation of the three analytes was achieved to eliminate the possible impact of insource fragmentation of the conjugated metabolites on the analysis of the parent. The overall extraction efficiency was from 61.3 to 78.8% for the three analytes by direct extraction using methanol. The validated method was successfully implemented in the pilot clinical study with the obtained pharmacokinetic parameters in agreement with the values reported in literature.
Matimba, Alice; Li, Fang; Livshits, Alina; Cartwright, Cher S; Scully, Stephen; Fridley, Brooke L; Jenkins, Gregory; Batzler, Anthony; Wang, Liewei; Weinshilboum, Richard; Lennard, Lynne
2014-01-01
Aim We investigated candidate genes associated with thiopurine metabolism and clinical response in childhood acute lymphoblastic leukemia. Materials & methods We performed genome-wide SNP association studies of 6-thioguanine and 6-mercaptopurine cytotoxicity using lymphoblastoid cell lines. We then genotyped the top SNPs associated with lymphoblastoid cell line cytotoxicity, together with tagSNPs for genes in the ‘thiopurine pathway’ (686 total SNPs), in DNA from 589 Caucasian UK ALL97 patients. Functional validation studies were performed by siRNA knockdown in cancer cell lines. Results SNPs in the thiopurine pathway genes ABCC4, ABCC5, IMPDH1, ITPA, SLC28A3 and XDH, and SNPs located within or near ATP6AP2, FRMD4B, GNG2, KCNMA1 and NME1, were associated with clinical response and measures of thiopurine metabolism. Functional validation showed shifts in cytotoxicity for these genes. Conclusion The clinical response to thiopurines may be regulated by variation in known thiopurine pathway genes and additional novel genes outside of the thiopurine pathway. PMID:24624911
Protein mass spectra data analysis for clinical biomarker discovery: a global review.
Roy, Pascal; Truntzer, Caroline; Maucort-Boulch, Delphine; Jouve, Thomas; Molinari, Nicolas
2011-03-01
The identification of new diagnostic or prognostic biomarkers is one of the main aims of clinical cancer research. In recent years there has been a growing interest in using high throughput technologies for the detection of such biomarkers. In particular, mass spectrometry appears as an exciting tool with great potential. However, to extract any benefit from the massive potential of clinical proteomic studies, appropriate methods, improvement and validation are required. To better understand the key statistical points involved with such studies, this review presents the main data analysis steps of protein mass spectra data analysis, from the pre-processing of the data to the identification and validation of biomarkers.
A Psychometric Study of the Fear of Sleep Inventory-Short Form (FoSI-SF)
Pruiksma, Kristi E.; Taylor, Daniel J.; Ruggero, Camilo; Boals, Adriel; Davis, Joanne L.; Cranston, Christopher; DeViva, Jason C.; Zayfert, Claudia
2014-01-01
Study Objectives: Fear of sleep may play a significant role in sleep disturbances in individuals with posttraumatic stress disorder (PTSD). This report describes a psychometric study of the Fear of Sleep Inventory (FoSI), which was developed to measure this construct. Methods: The psychometric properties of the FoSI were examined in a non-clinical sample of 292 college students (Study I) and in a clinical sample of 67 trauma-exposed adults experiencing chronic nightmares (Study II). Data on the 23 items of the FoSI were subjected to exploratory factor analyses (EFA) to identify items uniquely assessing fear of sleep. Next, reliability and validity of a 13-item version of the FoSI was examined in both samples. Results: A 13-item Short-Form version (FoSI-SF) was identified as having a clear 2-factor structure with high internal consistency in both the non-clinical (α = 0.76–0.94) and clinical (α = 0.88-0.91) samples. Both studies demonstrated good convergent validity with measures of PTSD (0.48-0.61) and insomnia (0.39-0.48) and discriminant validity with a measure of sleep hygiene (0.19-0.27). The total score on the FoSI-SF was significantly higher in the clinical sample (mean = 17.90, SD = 12.56) than in the non-clinical sample (mean = 4.80, SD = 7.72); t357 = 8.85 p < 0.001. Conclusions: Although all items are recommended for clinical purposes, the data support the use of the 13-item FoSI-SF for research purposes. Replication of the factor structure in clinical samples is needed. Results are discussed in terms of limitations of this study and directions for further research. Citation: Pruiksma KE, Taylor DJ, Ruggero C, Boals A, Davis JL, Cranston C, DeViva JC, Zayfert C. A psychometric study of the Fear of Sleep Inventory-short form (FoSI-SF). J Clin Sleep Med 2014;10(5):551-558. PMID:24812541
Leveraging the power of pooled data for cancer outcomes research.
Hugh-Yeun, Kiara; Cheung, Winson Y
2016-08-02
Clinical trials continue to be the gold standard for determining the efficacy of novel cancer treatments, but they may also expose participants to the potential risks of unpredictable or severe toxicities. The development of validated tools that better inform patients of the benefits and risks associated with clinical trial participation can facilitate the informed consent process. The design and validation of such instruments are strengthened when we leverage the power of pooled data analysis for cancer outcomes research. In a recent study published in the Journal of Clinical Oncology entitled "Determinants of early mortality among 37,568 patients with colon cancer who participated in 25 clinical trials from the adjuvant colon cancer endpoints database," using a large pooled analysis of over 30,000 study participants who were enrolled in clinical trials of adjuvant therapy for early-stage colon cancer, we developed and validated a nomogram depicting the predictors of early cancer mortality. This database of pooled individual-level data allowed for a comprehensive analysis of poor prognostic factors associated with early death; furthermore, it enabled the creation of a nomogram that was able to reliably capture and quantify the benefit-to-risk profile for patients who are considering clinical trial participation. This tool can facilitate treatment decision-making discussions. As China and other Asian countries continue to conduct oncology clinical trials, efforts to collate patient-level information from these studies into a large data repository should be strongly considered since pooled data can increase future capacity for cancer outcomes research, which, in turn, can enhance patient-physician discussions and optimize clinical care.
Blok, E J; Bastiaannet, E; van den Hout, W B; Liefers, G J; Smit, V T H B M; Kroep, J R; van de Velde, C J H
2018-01-01
Gene expression profiles with prognostic capacities have shown good performance in multiple clinical trials. However, with multiple assays available and numerous types of validation studies performed, the added value for daily clinical practice is still unclear. In Europe, the MammaPrint, OncotypeDX, PAM50/Prosigna and Endopredict assays are commercially available. In this systematic review, we aim to assess these assays on four important criteria: Assay development and methodology, clinical validation, clinical utility and economic value. We performed a literature search covering PubMed, Embase, Web of Science and Cochrane, for studies related to one or more of the four selected assays. We identified 147 papers for inclusion in this review. MammaPrint and OncotypeDX both have evidence available, including level IA clinical trial results for both assays. Both assays provide prognostic information. Predictive value has only been shown for OncotypeDX. In the clinical utility studies, a higher reduction in chemotherapy was achieved by OncotypeDX, although the number of available studies differ considerably between tests. On average, economic evaluations estimate that genomic testing results in a moderate increase in total costs, but that these costs are acceptable in relation to the expected improved patient outcome. PAM50/prosigna and EndoPredict showed comparable prognostic capacities, but with less economical and clinical utility studies. Furthermore, for these assays no level IA trial data are available yet. In summary, all assays have shown excellent prognostic capacities. The differences in the quantity and quality of evidence are discussed. Future studies shall focus on the selection of appropriate subgroups for testing and long-term outcome of validation trials, in order to determine the place of these assays in daily clinical practice. Copyright © 2017 The Authors. Published by Elsevier Ltd.. All rights reserved.
Kim, Shin-Jeong; Kim, Sunghee; Kang, Kyung-Ah; Oh, Jina; Lee, Myung-Nam
2016-02-01
The lack of reliable and valid tools to evaluate learning outcomes during simulations has limited the adoption and progress of simulation-based nursing education. This study had two aims: (a) to develop a simulation evaluation tool (SET(c-dehydration)) to assess students' clinical judgment in caring for children with dehydration based on the Lasater Clinical Judgment Rubric (LCJR) and (b) to examine its reliability and validity. Undergraduate nursing students from two nursing schools in South Korea participated in this study from March 3 through June 10, 2014. The SET(c-dehydration) was developed, and 120 nursing students' clinical judgment was evaluated. Descriptive statistics, Cronbach's alpha, Cohen's kappa coefficient, and confirmatory factor analysis (CFA) were used to analyze the data. A 41-item version of the SET(c-dehydration) with three subscales was developed. Cohen's kappa (measuring inter-observer reliability) of the sessions ranged from .73 to .95, and Cronbach's alpha was .87. The mean total rating of the SET(c-dehydration) by the instructors was 1.92 (±.25), and the mean scores for the four LCJR dimensions of clinical judgment were as follows: noticing (1.74±.27), interpreting (1.85±.43), responding (2.17±.32), and reflecting (1.79±.35). CFA, which was performed to test construct validity, showed that the four dimensions of the SET(c-dehydration) was an appropriate framework. The SET(c-dehydration) provides a means to evaluate clinical judgment in simulation education. Its reliability and validity should be examined further. Copyright © 2015 Elsevier Ltd. All rights reserved.
[An instrument for assessing clinical aptitude in cervicovaginitis in the family medicine practice].
Arrieta-Pérez, Raúl Tomás; Lona-Calixto, Beatriz
2011-01-01
the cervicovaginitis is one of the first twelve causes on demand at primary care medicine thus the family physician must be able to identify and treat it. The objective was to validate a constructed instrument for measuring the clinical aptitude on cervicovaginitis. cross-sectional, descriptive, prolective study was carried out. An instrument with five clinical cases was done. It has seven indicators, whose answers were true, false and I do not know. The validity content was done by three family physicians and a Gynecologist, with experience in education. The trustworthiness was determined by means of the test of Kuder-Richardson formula 20 with the results obtained in a pilot test in 50 family medicine residents. the instrument was constituted by five clinical cases with 140 Items distributed in seven indicators with 20 items for each indicator and a total of 70 true answers and 70 false answers; seven categories for the degree of clinical aptitude settled down. The trustworthiness of the instrument was 0.81. the instrument is valid and reliable to identify the clinical aptitude of the family physician on cervicovaginitis.
Morf, Carolyn C; Schürch, Eva; Küfner, Albrecht; Siegrist, Philip; Vater, Aline; Back, Mitja; Mestel, Robert; Schröder-Abé, Michela
2017-06-01
The Pathological Narcissism Inventory (PNI) is a multidimensional measure for assessing grandiose and vulnerable features in narcissistic pathology. The aim of the present research was to construct and validate a German translation of the PNI and to provide further information on the PNI's nomological net. Findings from a first study confirm the psychometric soundness of the PNI and replicate its seven-factor first-order structure. A second-order structure was also supported but with several equivalent models. A second study investigating associations with a broad range of measures ( DSM Axis I and II constructs, emotions, personality traits, interpersonal and dysfunctional behaviors, and well-being) supported the concurrent validity of the PNI. Discriminant validity with the Narcissistic Personality Inventory was also shown. Finally, in a third study an extension in a clinical inpatient sample provided further evidence that the PNI is a useful tool to assess the more pathological end of narcissism.
Émond, Marcel; Guimont, Chantal; Chauny, Jean-Marc; Daoust, Raoul; Bergeron, Éric; Vanier, Laurent; Moore, Lynne; Plourde, Miville; Kuimi, Batomen; Boucher, Valérie; Allain-Boulé, Nadine; Le Sage, Natalie
2017-01-01
Background: About 75% of patients with minor thoracic injury are discharged after an emergency department visit. However, complications such as delayed hemothorax can occur. We sought to derive and validate a clinical decision rule to predict hemothorax in patients discharged from the emergency department. Methods: We conducted a 6-year prospective cohort study in 4 university-affiliated emergency departments. Patients aged 16 years or older presenting with a minor thoracic injury were assessed at 5 time points (initial visit and 7, 14, 30 and 90 d after the injury). Radiologists' reports were reviewed for the presence of hemothorax. We used log-binomial regression models to identify predictors of hemothorax. Results: A total of 1382 patients were included: 830 in the derivation phase and 552 in the validation phase. Of these, 151 (10.9%) had hemothorax at the 14-day follow-up. Patients 65 years of age or older represented 25.3% (210/830) and 23.7% (131/552) of the derivation and validation cohorts, respectively. The final clinical decision rule included a combination of age (> 70 yr, 2 points; 45-70 yr, 1 point), fracture of any high to mid thorax rib (ribs 3-9, 2 points) and presence of 3 or more rib fractures (1 point). Twenty (30.8%) of the 65 high-risk patients (score ≥ 4) experienced hemothorax during the follow-up period. The clinical decision rule had a high specificity (90.7%, 95% confidence interval 87.7%-93.1%) in this high-risk group, thus guiding appropriate post-emergency care. Interpretation: One patient out of every 10 presented with delayed hemothorax after discharge from the emergency department. Implementation of this validated clinical decision rule for minor thoracic injury could guide emergency discharge plans. PMID:28611156
Burns, Ted M.; Conaway, Mark; Sanders, Donald B.
2010-01-01
Objective: To study the concurrent and construct validity and test-retest reliability in the practice setting of an outcome measure for myasthenia gravis (MG). Methods: Eleven centers participated in the validation study of the Myasthenia Gravis Composite (MGC) scale. Patients with MG were evaluated at 2 consecutive visits. Concurrent and construct validities of the MGC were assessed by evaluating MGC scores in the context of other MG-specific outcome measures. We used numerous potential indicators of clinical improvement to assess the sensitivity and specificity of the MGC for detecting clinical improvement. Test-retest reliability was performed on patients at the University of Virginia. Results: A total of 175 patients with MG were enrolled at 11 sites from July 1, 2008, to January 31, 2009. A total of 151 patients were seen in follow-up. Total MGC scores showed excellent concurrent validity with other MG-specific scales. Analyses of sensitivities and specificities of the MGC revealed that a 3-point improvement in total MGC score was optimal for signifying clinical improvement. A 3-point improvement in the MGC also appears to represent a meaningful improvement to most patients, as indicated by improved 15-item myasthenia gravis quality of life scale (MG-QOL15) scores. The psychometric properties were no better for an individualized subscore made up of the 2 functional domains that the patient identified as most important to treat. The test-retest reliability coefficient of the MGC was 98%, with a lower 95% confidence interval of 97%, indicating excellent test-retest reliability. Conclusions: The Myasthenia Gravis Composite is a reliable and valid instrument for measuring clinical status of patients with myasthenia gravis in the practice setting and in clinical trials. PMID:20439845
Ward, Amanda M
2016-11-01
Episodic future thinking is defined as the ability to mentally simulate a future event. Although episodic future thinking has been studied extensively in neuroscience, this construct has not been explored in depth from the perspective of clinical neuropsychology. The aim of this critical narrative review is to assess the validity and clinical implications of episodic future thinking. A systematic review of episodic future thinking literature was conducted. PubMed and PsycInfo were searched through July 2015 for review and empirical articles with the following search terms: "episodic future thinking," "future mental simulation," "imagining the future," "imagining new experiences," "future mental time travel," "future autobiographical experience," and "prospection." The review discusses evidence that episodic future thinking is important for adaptive functioning, which has implications for neurological populations. To determine the validity of episodic future thinking, the construct is evaluated with respect to related constructs, such as imagination, episodic memory, autobiographical memory, prospective memory, narrative construction, and working memory. Although it has been minimally investigated, there is evidence of convergent and discriminant validity for episodic future thinking. Research has not addressed the incremental validity of episodic future thinking. Practical considerations of episodic future thinking tasks and related constructs in a clinical neuropsychological setting are considered. The utility of episodic future thinking is currently unknown due to the lack of research investigating the validity of episodic future thinking. Future work is discussed, which could determine whether episodic future thinking is an important missing piece in standard clinical neuropsychological assessment. (PsycINFO Database Record (c) 2016 APA, all rights reserved).
Tarescavage, Anthony M; Wygant, Dustin B; Boutacoff, Lana I; Ben-Porath, Yossef S
2013-12-01
In the current study, we examined the reliability, validity, and clinical utility of Minnesota Multiphasic Personality Inventory-2-Restructured Form (MMPI-2-RF; Ben-Porath & Tellegen, 2011) scores in a sample of 759 bariatric surgery candidates. We provide descriptives for all scales, internal consistency and standard error of measurement estimates for all substantive scales, external correlates of substantive scales using chart review and self-report criteria, and relative risk ratios to assess the clinical utility of the instrument. Results generally support the reliability, validity, and clinical utility of MMPI-2-RF scale scores in the psychological evaluation of bariatric surgery candidates. Limitations, future directions, and practical application of these results are discussed. (c) 2013 APA, all rights reserved.
Schiffman, Eric L; Truelove, Edmond L; Ohrbach, Richard; Anderson, Gary C; John, Mike T; List, Thomas; Look, John O
2010-01-01
The purpose of the Research Diagnostic Criteria for Temporomandibular Disorders (RDC/TMD) Validation Project was to assess the diagnostic validity of this examination protocol. The aim of this article is to provide an overview of the project's methodology, descriptive statistics, and data for the study participant sample. This article also details the development of reliable methods to establish the reference standards for assessing criterion validity of the Axis I RDC/TMD diagnoses. The Axis I reference standards were based on the consensus of two criterion examiners independently performing a comprehensive history, clinical examination, and evaluation of imaging. Intersite reliability was assessed annually for criterion examiners and radiologists. Criterion examination reliability was also assessed within study sites. Study participant demographics were comparable to those of participants in previous studies using the RDC/TMD. Diagnostic agreement of the criterion examiners with each other and with the consensus-based reference standards was excellent with all kappas > or = 0.81, except for osteoarthrosis (moderate agreement, k = 0.53). Intrasite criterion examiner agreement with reference standards was excellent (k > or = 0.95). Intersite reliability of the radiologists for detecting computed tomography-disclosed osteoarthrosis and magnetic resonance imaging-disclosed disc displacement was good to excellent (k = 0.71 and 0.84, respectively). The Validation Project study population was appropriate for assessing the reliability and validity of the RDC/TMD Axis I and II. The reference standards used to assess the validity of Axis I TMD were based on reliable and clinically credible methods.
Jones, B E; South, B R; Shao, Y; Lu, C C; Leng, J; Sauer, B C; Gundlapalli, A V; Samore, M H; Zeng, Q
2018-01-01
Identifying pneumonia using diagnosis codes alone may be insufficient for research on clinical decision making. Natural language processing (NLP) may enable the inclusion of cases missed by diagnosis codes. This article (1) develops a NLP tool that identifies the clinical assertion of pneumonia from physician emergency department (ED) notes, and (2) compares classification methods using diagnosis codes versus NLP against a gold standard of manual chart review to identify patients initially treated for pneumonia. Among a national population of ED visits occurring between 2006 and 2012 across the Veterans Affairs health system, we extracted 811 physician documents containing search terms for pneumonia for training, and 100 random documents for validation. Two reviewers annotated span- and document-level classifications of the clinical assertion of pneumonia. An NLP tool using a support vector machine was trained on the enriched documents. We extracted diagnosis codes assigned in the ED and upon hospital discharge and calculated performance characteristics for diagnosis codes, NLP, and NLP plus diagnosis codes against manual review in training and validation sets. Among the training documents, 51% contained clinical assertions of pneumonia; in the validation set, 9% were classified with pneumonia, of which 100% contained pneumonia search terms. After enriching with search terms, the NLP system alone demonstrated a recall/sensitivity of 0.72 (training) and 0.55 (validation), and a precision/positive predictive value (PPV) of 0.89 (training) and 0.71 (validation). ED-assigned diagnostic codes demonstrated lower recall/sensitivity (0.48 and 0.44) but higher precision/PPV (0.95 in training, 1.0 in validation); the NLP system identified more "possible-treated" cases than diagnostic coding. An approach combining NLP and ED-assigned diagnostic coding classification achieved the best performance (sensitivity 0.89 and PPV 0.80). System-wide application of NLP to clinical text can increase capture of initial diagnostic hypotheses, an important inclusion when studying diagnosis and clinical decision-making under uncertainty. Schattauer GmbH Stuttgart.
McGaghie, William C; Cohen, Elaine R; Wayne, Diane B
2011-01-01
United States Medical Licensing Examination (USMLE) scores are frequently used by residency program directors when evaluating applicants. The objectives of this report are to study the chain of reasoning and evidence that underlies the use of USMLE Step 1 and 2 scores for postgraduate medical resident selection decisions and to evaluate the validity argument about the utility of USMLE scores for this purpose. This is a research synthesis using the critical review approach. The study first describes the chain of reasoning that underlies a validity argument about using test scores for a specific purpose. It continues by summarizing correlations of USMLE Step 1 and 2 scores and reliable measures of clinical skill acquisition drawn from nine studies involving 393 medical learners from 2005 to 2010. The integrity of the validity argument about using USMLE Step 1 and 2 scores for postgraduate residency selection decisions is tested. The research synthesis shows that USMLE Step 1 and 2 scores are not correlated with reliable measures of medical students', residents', and fellows' clinical skill acquisition. The validity argument about using USMLE Step 1 and 2 scores for postgraduate residency selection decisions is neither structured, coherent, nor evidence based. The USMLE score validity argument breaks down on grounds of extrapolation and decision/interpretation because the scores are not associated with measures of clinical skill acquisition among advanced medical students, residents, and subspecialty fellows. Continued use of USMLE Step 1 and 2 scores for postgraduate medical residency selection decisions is discouraged.
Lew, Charles Chin Han; Yandell, Rosalie; Fraser, Robert J L; Chua, Ai Ping; Chong, Mary Foong Fong; Miller, Michelle
2017-07-01
Malnutrition is associated with poor clinical outcomes among hospitalized patients. However, studies linking malnutrition with poor clinical outcomes in the intensive care unit (ICU) often have conflicting findings due in part to the inappropriate diagnosis of malnutrition. We primarily aimed to determine whether malnutrition diagnosed by validated nutrition assessment tools such as the Subjective Global Assessment (SGA) or Mini Nutritional Assessment (MNA) is independently associated with poorer clinical outcomes in the ICU and if the use of nutrition screening tools demonstrate a similar association. PubMed, CINAHL, Scopus, and Cochrane Library were systematically searched for eligible studies. Search terms included were synonyms of malnutrition, nutritional status, screening, assessment, and intensive care unit. Eligible studies were case-control or cohort studies that recruited adults in the ICU; conducted the SGA, MNA, or used nutrition screening tools before or within 48 hours of ICU admission; and reported the prevalence of malnutrition and relevant clinical outcomes including mortality, length of stay (LOS), and incidence of infection (IOI). Twenty of 1168 studies were eligible. The prevalence of malnutrition ranged from 38% to 78%. Malnutrition diagnosed by nutrition assessments was independently associated with increased ICU LOS, ICU readmission, IOI, and the risk of hospital mortality. The SGA clearly had better predictive validity than the MNA. The association between malnutrition risk determined by nutrition screening was less consistent. Malnutrition is independently associated with poorer clinical outcomes in the ICU. Compared with nutrition assessment tools, the predictive validity of nutrition screening tools were less consistent.
ERIC Educational Resources Information Center
Wittich, Christopher M.; Pawlina, Wojciech; Drake, Richard L.; Szostek, Jason H.; Reed, Darcy A.; Lachman, Nirusha; McBride, Jennifer M.; Mandrekar, Jayawant N.; Beckman, Thomas J.
2013-01-01
Improving professional attitudes and behaviors requires critical self reflection. Research on reflection is necessary to understand professionalism among medical students. The aims of this prospective validation study at the Mayo Medical School and Cleveland Clinic Lerner College of Medicine were: (1) to develop and validate a new instrument for…
Clinical audit project in undergraduate medical education curriculum: an assessment validation study
Steketee, Carole; Mak, Donna
2016-01-01
Objectives To evaluate the merit of the Clinical Audit Project (CAP) in an assessment program for undergraduate medical education using a systematic assessment validation framework. Methods A cross-sectional assessment validation study at one medical school in Western Australia, with retrospective qualitative analysis of the design, development, implementation and outcomes of the CAP, and quantitative analysis of assessment data from four cohorts of medical students (2011- 2014). Results The CAP is fit for purpose with clear external and internal alignment to expected medical graduate outcomes. Substantive validity in students’ and examiners’ response processes is ensured through relevant methodological and cognitive processes. Multiple validity features are built-in to the design, planning and implementation process of the CAP. There is evidence of high internal consistency reliability of CAP scores (Cronbach’s alpha > 0.8) and inter-examiner consistency reliability (intra-class correlation>0.7). Aggregation of CAP scores is psychometrically sound, with high internal consistency indicating one common underlying construct. Significant but moderate correlations between CAP scores and scores from other assessment modalities indicate validity of extrapolation and alignment between the CAP and the overall target outcomes of medical graduates. Standard setting, score equating and fair decision rules justify consequential validity of CAP scores interpretation and use. Conclusions This study provides evidence demonstrating that the CAP is a meaningful and valid component in the assessment program. This systematic framework of validation can be adopted for all levels of assessment in medical education, from individual assessment modality, to the validation of an assessment program as a whole. PMID:27716612
Hasanpour, Neda; Attarbashi Moghadam, Behrouz; Sami, Ramin; Tavakol, Kamran
2016-08-01
The clinical COPD questionnaire (CCQ) has been developed to measure the health status of COPD patients. The aim of this study was to translate CCQ into the Persian language and assess the validity and reliability of the translated version. We used a forward-backward procedure to translate the questionnaire. In a cross-sectional study 100 COPD patients and 50 healthy subjects over 40 years old were selected to assess the reliability and construct validity of the instrument. The face and content validity were used for the questionnaire validity. Validity was examined in a population of patients with COPD, using the Persian validated version of the St George's Respiratory Questionnaire (PSGRQ). In order to assess the questionnaire's reliability, the Intraclass correlation coefficient (ICC) and Cronbach's alpha were calculated. Test-retest reliability was tested by re-administering the Persian version of the CCQ (PCCQ) after 1 week. Test-retest carry out of data demonstrates that the PCCQ has excellent reliability (ICC for all 3 domains were higher than 0.9). Internal consistency was found by Cronbach's alpha to be 0.96, 0.94, 0.97, and 0.98 for the symptom, mental state, functional state and total scores respectively. In addition, the correlation between the components of PCCQ and PSGRQ showed satisfactory construct validity. Analyzing the data from healthy subjects and patients divulged that the PCCQ has acceptable discriminant validity. In general, the PCCQ had satisfactory reliability and validity for assessing health-related quality of life status of Iranian COPD patients.
Tor, Elina; Steketee, Carole; Mak, Donna
2016-09-24
To evaluate the merit of the Clinical Audit Project (CAP) in an assessment program for undergraduate medical education using a systematic assessment validation framework. A cross-sectional assessment validation study at one medical school in Western Australia, with retrospective qualitative analysis of the design, development, implementation and outcomes of the CAP, and quantitative analysis of assessment data from four cohorts of medical students (2011- 2014). The CAP is fit for purpose with clear external and internal alignment to expected medical graduate outcomes. Substantive validity in students' and examiners' response processes is ensured through relevant methodological and cognitive processes. Multiple validity features are built-in to the design, planning and implementation process of the CAP. There is evidence of high internal consistency reliability of CAP scores (Cronbach's alpha > 0.8) and inter-examiner consistency reliability (intra-class correlation>0.7). Aggregation of CAP scores is psychometrically sound, with high internal consistency indicating one common underlying construct. Significant but moderate correlations between CAP scores and scores from other assessment modalities indicate validity of extrapolation and alignment between the CAP and the overall target outcomes of medical graduates. Standard setting, score equating and fair decision rules justify consequential validity of CAP scores interpretation and use. This study provides evidence demonstrating that the CAP is a meaningful and valid component in the assessment program. This systematic framework of validation can be adopted for all levels of assessment in medical education, from individual assessment modality, to the validation of an assessment program as a whole.
[Computerized system validation of clinical researches].
Yan, Charles; Chen, Feng; Xia, Jia-lai; Zheng, Qing-shan; Liu, Daniel
2015-11-01
Validation is a documented process that provides a high degree of assurance. The computer system does exactly and consistently what it is designed to do in a controlled manner throughout the life. The validation process begins with the system proposal/requirements definition, and continues application and maintenance until system retirement and retention of the e-records based on regulatory rules. The objective to do so is to clearly specify that each application of information technology fulfills its purpose. The computer system validation (CSV) is essential in clinical studies according to the GCP standard, meeting product's pre-determined attributes of the specifications, quality, safety and traceability. This paper describes how to perform the validation process and determine relevant stakeholders within an organization in the light of validation SOPs. Although a specific accountability in the implementation of the validation process might be outsourced, the ultimate responsibility of the CSV remains on the shoulder of the business process owner-sponsor. In order to show that the compliance of the system validation has been properly attained, it is essential to set up comprehensive validation procedures and maintain adequate documentations as well as training records. Quality of the system validation should be controlled using both QC and QA means.
The Depression Anxiety and Stress Scale (DASS): The Study of Validity and Reliability
ERIC Educational Resources Information Center
Akin, Ahmet; Cetin, Bayram
2007-01-01
This study investigated the validity and reliability of the Turkish version of the Depression Anxiety Stress Scale (DASS). The sample of the study consisted of 590 university students, 121 English teachers and 136 emotionally disturbed individuals who sought treatment in various clinics and counseling centers. Factor loadings of the scale ranged…
Kimball, A B; Naegeli, A N; Edson-Heredia, E; Lin, C-Y; Gaich, C; Nikaï, E; Wyrwich, K; Yosipovitch, G
2016-07-01
Itching is a profoundly distressing symptom for many patients with psoriasis, but it has not been rigorously studied using validated tools for this condition. This study investigated the psychometric properties of the Itch Numeric Rating Scale (Itch NRS), a single-item patient-reported outcome (PRO) measuring the worst itching severity due to psoriasis in the past 24 h. Using disease-specific clinician-rated and PRO data from one phase II and three phase III randomized clinical studies of subjects with moderate-to-severe plaque psoriasis, the Itch NRS was evaluated for test-retest reliability, construct validity and responsiveness. A responder definition was explored using anchor- and distribution-based methods. Test-retest reliability analyses supported the reproducibility of the measure (intraclass correlation coefficient range 0·71-0·74). To support the construct validity of the Itch NRS, large cross-sectional correlations with the Dermatology Life Quality Index (DLQI) Symptoms and Feelings domain (r ≥ 0·60 at baseline and r ≥ 0·80 at week 12) supported a priori hypotheses, while large correlations (r ≥ 0·71) between changes in Itch NRS scores and changes in DLQI Symptoms and Feelings domain scores from baseline to week 12 established responsiveness. A 4-point change was optimal for demonstrating a level of clinically meaningful improvement in itch severity after 12 weeks of treatment, which corresponds with marked clinical improvements in plaque psoriasis. The Itch NRS demonstrated sufficient reliability, validity and responsiveness, and appropriate interpretation standards for evaluating change over time in itch severity among patients with moderate-to-severe plaque psoriasis when validated using clinical trial data for this condition. © 2016 The Authors. British Journal of Dermatology published by John Wiley & Sons Ltd on behalf of British Association of Dermatologists.
Reliability and Validity of Bedside Version of Persian WAB (P-WAB-1).
Nilipour, Reza; Pourshahbaz, Abbas; Ghoreyshi, Zahra Sadat
2014-10-01
In this study, we reported the reliability and validity of Bedside version of Persian WAB (P-WAB-1) adapted from Western Aphasia Battery (WAB-R) (1,2). P-WAB-1 is a clinical linguistic measuring tool to determine severity and type of aphasia in brain damaged patients based on Aphasia Quotient (AQ) as a functional measure. For the purposes of a quick clinical screening of aphasia in Persian, we adapted the bedside version of WAB-R to assess the performance of Persian aphasic patients. The data we reported on adaptation, validity and reliability of P-WAB-1 are based on faithful translation and criterion validity ratio (CVR) taken from the expert panel and the performance of 60 consecutive brain damaged patients referred to different university clinics for rehabilitation and 30 healthy subjects as norms and 40 age-matched epileptic patients as the control group. Based on the results of this study, P-WAB-1 has internal consistency (a=0.71) and test-retest reliability (r=.65 P<0.001) and the subtests are sensitive enough to contribute to Aphasia Quotient (AQ) as a functional measure of severity of aphasia in Iranian brain damaged patients. Based on AQ results, our aphasic patients were classified into four distinct groups of severity. P-WAB-1 is the first clinical linguistic test to determine severity of aphasia based on an operational index and can be considered as a valid baseline for screening and diagnosis of aphasia among Persian speaking brain damaged patients. This study is the initial step on adaptation of different versions of WAB-R to measure the severity of aphasia using AQ, LQ and CQ as operational measures and to classify Persian speaking aphasic patients into different types.
Vaughan, Brett
2018-01-01
Clinical teaching evaluations are common in health profession education programs to ensure students are receiving a quality clinical education experience. Questionnaires students use to evaluate their clinical teachers have been developed in professions such as medicine and nursing. The development of a questionnaire that is specifically for the osteopathy on-campus, student-led clinic environment is warranted. Previous work developed the 30-item Osteopathy Clinical Teaching Questionnaire. The current study utilised Rasch analysis to investigate the construct validity of the Osteopathy Clinical Teaching Questionnaire and provide evidence for the validity argument through fit to the Rasch model. Senior osteopathy students at four institutions in Australia, New Zealand and the United Kingdom rated their clinical teachers using the Osteopathy Clinical Teaching Questionnaire. Three hundred and ninety-nine valid responses were received and the data were evaluated for fit to the Rasch model. Reliability estimations (Cronbach's alpha and McDonald's omega) were also evaluated for the final model. The initial analysis demonstrated the data did not fit the Rasch model. Accordingly, modifications to the questionnaire were made including removing items, removing person responses, and rescoring one item. The final model contained 12 items and fit to the Rasch model was adequate. Support for unidimensionality was demonstrated through both the Principal Components Analysis/t-test, and the Cronbach's alpha and McDonald's omega reliability estimates. Analysis of the questionnaire using McDonald's omega hierarchical supported a general factor (quality of clinical teaching in osteopathy). The evidence for unidimensionality and the presence of a general factor support the calculation of a total score for the questionnaire as a sufficient statistic. Further work is now required to investigate the reliability of the 12-item Osteopathy Clinical Teaching Questionnaire to provide evidence for the validity argument.
Hobden, Breanne; Schwandt, Melanie L.; Carey, Mariko; Lee, Mary R.; Farokhnia, Mehdi; Bouhlal, Sofia; Oldmeadow, Christopher; Leggio, Lorenzo
2017-01-01
Background The Montgomery-Asberg Depression Rating Scale (MADRS) is commonly used to examine depressive symptoms in clinical settings, including facilities treating patients for alcohol addiction. No studies have examined the validity of the MADRS compared to an established clinical diagnostic tool of depression in this population. This study aimed to examine: 1) the validity of the MADRS compared to a clinical diagnosis of a depressive disorder (using the Structured Clinical Interview for DSM-IV (SCID)) in patients seeking treatment for alcohol dependence (AD); 2) whether the validity of the MADRS differs by type of SCID-based diagnosis of depression; and 3) which items contribute to the optimal predictive model of the MADRS compared to a SCID diagnosis of a depressive disorder. Methods Individuals seeking treatment for AD and admitted to an inpatient unit were administered the MADRS at day 2 of their detoxification program. Clinical diagnoses of AD and depression were made via the Structured Clinical Interview for the Diagnostic and Statistical Manual of Mental Disorders-IV at the beginning of treatment. Results In total, 803 participants were included in the study. The MADRS demonstrated low overall accuracy relative to the clinical diagnosis of depression with an area under the curve of 0.68. The optimal threshold for balancing sensitivity and specificity identified by the Euclidean distance was >14. This cut-point demonstrated a sensitivity of 66%, a specificity of 60%, a positive predictive value of 50% and a negative predictive value of 75%. The MADRS performed slightly better for major depressive disorders compared to alcohol-induced depression. Items related to lassitude, concentration and appetite slightly decreased the accuracy of the MADRS. Conclusion The MADRS does not appear to be an appropriate substitute for a diagnostic tool among alcohol-dependent patients. The MADRS may, however, still be a useful screening tool assuming careful consideration of cut-off scores. PMID:28421616
Defining Surrogate Endpoints for Clinical Trials in Severe Falciparum Malaria
Plewes, Katherine; Maude, Richard J.; Hanson, Josh; Herdman, M. Trent; Leopold, Stije J.; Ngernseng, Thatsanun; Charunwatthana, Prakaykaew; Phu, Nguyen Hoan; Ghose, Aniruddha; Hasan, M. Mahtab Uddin; Fanello, Caterina I.; Faiz, Md Abul; Hien, Tran Tinh; Day, Nicholas P. J.; White, Nicholas J.; Dondorp, Arjen M.
2017-01-01
Background Clinical trials in severe falciparum malaria require a large sample size to detect clinically meaningful differences in mortality. This means few interventions can be evaluated at any time. Using a validated surrogate endpoint for mortality would provide a useful alternative allowing a smaller sample size. Here we evaluate changes in coma score and plasma lactate as surrogate endpoints for mortality in severe falciparum malaria. Methods Three datasets of clinical studies in severe malaria were re-evaluated: studies from Chittagong, Bangladesh (adults), the African ‘AQUAMAT’ trial comparing artesunate and quinine (children), and the Vietnamese ‘AQ’ study (adults) comparing artemether with quinine. The absolute change, relative change, slope of the normalization over time, and time to normalization were derived from sequential measurements of plasma lactate and coma score, and validated for their use as surrogate endpoint, including the proportion of treatment effect on mortality explained (PTE) by these surrogate measures. Results Improvements in lactate concentration or coma scores over the first 24 hours of admission, were strongly prognostic for survival in all datasets. In hyperlactataemic patients in the AQ study (n = 173), lower mortality with artemether compared to quinine closely correlated with faster reduction in plasma lactate concentration, with a high PTE of the relative change in plasma lactate at 8 and 12 hours of 0.81 and 0.75, respectively. In paediatric patients enrolled in the ‘AQUAMAT’ study with cerebral malaria (n = 785), mortality was lower with artesunate compared to quinine, but this was not associated with faster coma recovery. Conclusions The relative changes in plasma lactate concentration assessed at 8 or 12 hours after admission are valid surrogate endpoints for severe malaria studies on antimalarial drugs or adjuvant treatments aiming at improving the microcirculation. Measures of coma recovery are not valid surrogate endpoints for mortality. PMID:28052109
Defining Surrogate Endpoints for Clinical Trials in Severe Falciparum Malaria.
Jeeyapant, Atthanee; Kingston, Hugh W; Plewes, Katherine; Maude, Richard J; Hanson, Josh; Herdman, M Trent; Leopold, Stije J; Ngernseng, Thatsanun; Charunwatthana, Prakaykaew; Phu, Nguyen Hoan; Ghose, Aniruddha; Hasan, M Mahtab Uddin; Fanello, Caterina I; Faiz, Md Abul; Hien, Tran Tinh; Day, Nicholas P J; White, Nicholas J; Dondorp, Arjen M
2017-01-01
Clinical trials in severe falciparum malaria require a large sample size to detect clinically meaningful differences in mortality. This means few interventions can be evaluated at any time. Using a validated surrogate endpoint for mortality would provide a useful alternative allowing a smaller sample size. Here we evaluate changes in coma score and plasma lactate as surrogate endpoints for mortality in severe falciparum malaria. Three datasets of clinical studies in severe malaria were re-evaluated: studies from Chittagong, Bangladesh (adults), the African 'AQUAMAT' trial comparing artesunate and quinine (children), and the Vietnamese 'AQ' study (adults) comparing artemether with quinine. The absolute change, relative change, slope of the normalization over time, and time to normalization were derived from sequential measurements of plasma lactate and coma score, and validated for their use as surrogate endpoint, including the proportion of treatment effect on mortality explained (PTE) by these surrogate measures. Improvements in lactate concentration or coma scores over the first 24 hours of admission, were strongly prognostic for survival in all datasets. In hyperlactataemic patients in the AQ study (n = 173), lower mortality with artemether compared to quinine closely correlated with faster reduction in plasma lactate concentration, with a high PTE of the relative change in plasma lactate at 8 and 12 hours of 0.81 and 0.75, respectively. In paediatric patients enrolled in the 'AQUAMAT' study with cerebral malaria (n = 785), mortality was lower with artesunate compared to quinine, but this was not associated with faster coma recovery. The relative changes in plasma lactate concentration assessed at 8 or 12 hours after admission are valid surrogate endpoints for severe malaria studies on antimalarial drugs or adjuvant treatments aiming at improving the microcirculation. Measures of coma recovery are not valid surrogate endpoints for mortality.
Place, Skyler; Blanch-Hartigan, Danielle; Rubin, Channah; Gorrostieta, Cristina; Mead, Caroline; Kane, John; Marx, Brian P; Feast, Joshua; Deckersbach, Thilo; Pentland, Alex Sandy; Nierenberg, Andrew; Azarbayejani, Ali
2017-03-16
There is a critical need for real-time tracking of behavioral indicators of mental disorders. Mobile sensing platforms that objectively and noninvasively collect, store, and analyze behavioral indicators have not yet been clinically validated or scalable. The aim of our study was to report on models of clinical symptoms for post-traumatic stress disorder (PTSD) and depression derived from a scalable mobile sensing platform. A total of 73 participants (67% [49/73] male, 48% [35/73] non-Hispanic white, 33% [24/73] veteran status) who reported at least one symptom of PTSD or depression completed a 12-week field trial. Behavioral indicators were collected through the noninvasive mobile sensing platform on participants' mobile phones. Clinical symptoms were measured through validated clinical interviews with a licensed clinical social worker. A combination hypothesis and data-driven approach was used to derive key features for modeling symptoms, including the sum of outgoing calls, count of unique numbers texted, absolute distance traveled, dynamic variation of the voice, speaking rate, and voice quality. Participants also reported ease of use and data sharing concerns. Behavioral indicators predicted clinically assessed symptoms of depression and PTSD (cross-validated area under the curve [AUC] for depressed mood=.74, fatigue=.56, interest in activities=.75, and social connectedness=.83). Participants reported comfort sharing individual data with physicians (Mean 3.08, SD 1.22), mental health providers (Mean 3.25, SD 1.39), and medical researchers (Mean 3.03, SD 1.36). Behavioral indicators passively collected through a mobile sensing platform predicted symptoms of depression and PTSD. The use of mobile sensing platforms can provide clinically validated behavioral indicators in real time; however, further validation of these models and this platform in large clinical samples is needed. ©Skyler Place, Danielle Blanch-Hartigan, Channah Rubin, Cristina Gorrostieta, Caroline Mead, John Kane, Brian P Marx, Joshua Feast, Thilo Deckersbach, Alex “Sandy” Pentland, Andrew Nierenberg, Ali Azarbayejani. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 16.03.2017.
A reliability and validity study of the Palliative Performance Scale
Ho, Francis; Lau, Francis; Downing, Michael G; Lesperance, Mary
2008-01-01
Background The Palliative Performance Scale (PPS) was first introduced in1996 as a new tool for measurement of performance status in palliative care. PPS has been used in many countries and has been translated into other languages. Methods This study evaluated the reliability and validity of PPS. A web-based, case scenarios study with a test-retest format was used to determine reliability. Fifty-three participants were recruited and randomly divided into two groups, each evaluating 11 cases at two time points. The validity study was based on the content validation of 15 palliative care experts conducted over telephone interviews, with discussion on five themes: PPS as clinical assessment tool, the usefulness of PPS, PPS scores affecting decision making, the problems in using PPS, and the adequacy of PPS instruction. Results The intraclass correlation coefficients for absolute agreement were 0.959 and 0.964 for Group 1, at Time-1 and Time-2; 0.951 and 0.931 for Group 2, at Time-1 and Time-2 respectively. Results showed that the participants were consistent in their scoring over the two times, with a mean Cohen's kappa of 0.67 for Group 1 and 0.71 for Group 2. In the validity study, all experts agreed that PPS is a valuable clinical assessment tool in palliative care. Many of them have already incorporated PPS as part of their practice standard. Conclusion The results of the reliability study demonstrated that PPS is a reliable tool. The validity study found that most experts did not feel a need to further modify PPS and, only two experts requested that some performance status measures be defined more clearly. Areas of PPS use include prognostication, disease monitoring, care planning, hospital resource allocation, clinical teaching and research. PPS is also a good communication tool between palliative care workers. PMID:18680590
Assessment of Semi-Structured Clinical Interview for Mobile Phone Addiction Disorder.
Alavi, Seyyed Salman; Mohammadi, Mohammad Reza; Jannatifard, Fereshteh; Mohammadi Kalhori, Soroush; Sepahbodi, Ghazal; BabaReisi, Mohammad; Sajedi, Sahar; Farshchi, Mojtaba; KhodaKarami, Rasul; Hatami Kasvaee, Vahid
2016-04-01
The Diagnostic and Statistical Manual of Mental Disorders, Fourth Edition, Text Revision (DSM-IV-TR) classified mobile phone addiction disorder under "impulse control disorder not elsewhere classified". This study surveyed the diagnostic criteria of DSM-IV-TR for the diagnosis of mobile phone addiction in correspondence with Iranian society and culture. Two hundred fifty students of Tehran universities were entered into this descriptive-analytical and cross-sectional study. Quota sampling method was used. At first, semi- structured clinical interview (based on DSM-IV-TR) was performed for all the cases, and another specialist reevaluated the interviews. Data were analyzed using content validity, inter-scorer reliability (Kappa coefficient) and test-retest via SPSS18 software. The content validity of the semi- structured clinical interview matched the DSM-IV-TR criteria for behavioral addiction. Moreover, their content was appropriate, and two items, including "SMS pathological use" and "High monthly cost of using the mobile phone" were added to promote its validity. Internal reliability (Kappa) and test-retest reliability were 0.55 and r = 0.4 (p<0. 01) respectively. The results of this study revealed that semi- structured diagnostic criteria of DSM-IV-TR are valid and reliable for diagnosing mobile phone addiction, and this instrument is an effective tool to diagnose this disorder.
Clinical validity of prototype personality disorder ratings in adolescents.
Defife, Jared A; Haggerty, Greg; Smith, Scott W; Betancourt, Luis; Ahmed, Zain; Ditkowsky, Keith
2015-01-01
A growing body of research shows that personality pathology in adolescents is clinically distinctive and frequently stable into adulthood. A reliable and useful method for rating personality pathology in adolescent patients has the potential to enhance conceptualization, dissemination, and treatment effectiveness. The aim of this study is to examine the clinical validity of a prototype matching approach (derived from the Shedler Westen Assessment Procedure-Adolescent Version) for quantifying personality pathology in an adolescent inpatient sample. Sixty-six adolescent inpatients and their parents or legal guardians completed forms of the Child Behavior Checklist (CBCL) assessing emotional and behavioral problems. Clinical criterion variables including suicide history, substance use, and fights with peers were also assessed. Patients' individual and group therapists on the inpatient unit completed personality prototype ratings. Prototype diagnoses demonstrated substantial reliability (median intraclass correlation coefficient =.75) across independent ratings from individual and group therapists. Personality prototype ratings correlated with the CBCL scales and clinical criterion variables in anticipated and meaningful ways. As seen in prior research with adult samples, prototype personality ratings show clinical validity across independent clinician raters previously unfamiliar with the approach, and they are meaningfully related to clinical symptoms, behavioral problems, and adaptive functioning.
Clinical Validity of Prototype Personality Disorder Ratings in Adolescents
DeFife, Jared A.; Haggerty, Greg; Smith, Scott W.; Betancourt, Luis; Ahmed, Zain; Ditkowsky, Keith
2015-01-01
A growing body of research shows that personality pathology in adolescents is clinically distinctive and frequently stable into adulthood. A reliable and useful method for rating personality pathology in adolescent patients has the potential to enhance conceptualization, dissemination, and treatment effectiveness. The aim of this study is to examine the clinical validity of a prototype matching approach (derived from the Shedler Westen Assessment Procedure – Adolescent Version) for quantifying personality pathology in an adolescent inpatient sample. Sixty-six adolescent inpatients and their parents or legal guardians completed forms of the Child Behavior Checklist (CBCL) assessing emotional and behavioral problems. Clinical criterion variables including suicide history, substance use, and fights with peers were also assessed. Patients’ individual and group therapists on the inpatient unit completed personality prototype ratings. Prototype diagnoses demonstrated substantial reliability (median ICC = .75) across independent ratings from individual and group therapists. Personality prototype ratings correlated with the CBCL scales and clinical criterion variables in anticipated and meaningful ways. As seen in prior research with adult samples, prototype personality ratings show clinical validity across independent clinician raters previously unfamiliar with the approach, and they are meaningfully related to clinical symptoms, behavioral problems, and adaptive functioning. PMID:25457971
2004-01-01
Background Evaluation is a challenging but necessary part of the development cycle of clinical information systems like the electronic medical records (EMR) system. It is believed that such evaluations should include multiple perspectives, be comparative and employ both qualitative and quantitative methods. Self-administered questionnaires are frequently used as a quantitative evaluation method in medical informatics, but very few validated questionnaires address clinical use of EMR systems. Methods We have developed a task-oriented questionnaire for evaluating EMR systems from the clinician's perspective. The key feature of the questionnaire is a list of 24 general clinical tasks. It is applicable to physicians of most specialties and covers essential parts of their information-oriented work. The task list appears in two separate sections, about EMR use and task performance using the EMR, respectively. By combining these sections, the evaluator may estimate the potential impact of the EMR system on health care delivery. The results may also be compared across time, site or vendor. This paper describes the development, performance and validation of the questionnaire. Its performance is shown in two demonstration studies (n = 219 and 80). Its content is validated in an interview study (n = 10), and its reliability is investigated in a test-retest study (n = 37) and a scaling study (n = 31). Results In the interviews, the physicians found the general clinical tasks in the questionnaire relevant and comprehensible. The tasks were interpreted concordant to their definitions. However, the physicians found questions about tasks not explicitly or only partially supported by the EMR systems difficult to answer. The two demonstration studies provided unambiguous results and low percentages of missing responses. In addition, criterion validity was demonstrated for a majority of task-oriented questions. Their test-retest reliability was generally high, and the non-standard scale was found symmetric and ordinal. Conclusion This questionnaire is relevant for clinical work and EMR systems, provides reliable and interpretable results, and may be used as part of any evaluation effort involving the clinician's perspective of an EMR system. PMID:15018620
Laerum, Hallvard; Faxvaag, Arild
2004-02-09
Evaluation is a challenging but necessary part of the development cycle of clinical information systems like the electronic medical records (EMR) system. It is believed that such evaluations should include multiple perspectives, be comparative and employ both qualitative and quantitative methods. Self-administered questionnaires are frequently used as a quantitative evaluation method in medical informatics, but very few validated questionnaires address clinical use of EMR systems. We have developed a task-oriented questionnaire for evaluating EMR systems from the clinician's perspective. The key feature of the questionnaire is a list of 24 general clinical tasks. It is applicable to physicians of most specialties and covers essential parts of their information-oriented work. The task list appears in two separate sections, about EMR use and task performance using the EMR, respectively. By combining these sections, the evaluator may estimate the potential impact of the EMR system on health care delivery. The results may also be compared across time, site or vendor. This paper describes the development, performance and validation of the questionnaire. Its performance is shown in two demonstration studies (n = 219 and 80). Its content is validated in an interview study (n = 10), and its reliability is investigated in a test-retest study (n = 37) and a scaling study (n = 31). In the interviews, the physicians found the general clinical tasks in the questionnaire relevant and comprehensible. The tasks were interpreted concordant to their definitions. However, the physicians found questions about tasks not explicitly or only partially supported by the EMR systems difficult to answer. The two demonstration studies provided unambiguous results and low percentages of missing responses. In addition, criterion validity was demonstrated for a majority of task-oriented questions. Their test-retest reliability was generally high, and the non-standard scale was found symmetric and ordinal. This questionnaire is relevant for clinical work and EMR systems, provides reliable and interpretable results, and may be used as part of any evaluation effort involving the clinician's perspective of an EMR system.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Cho, Daniel S.; Linte, Cristian; Chen, Elvis C. S.
Purpose: Although robot-assisted coronary artery bypass grafting (RA-CABG) has gained more acceptance worldwide, its success still depends on the surgeon's experience and expertise, and the conversion rate to full sternotomy is in the order of 15%-25%. One of the reasons for conversion is poor pre-operative planning, which is based solely on pre-operative computed tomography (CT) images. In this paper, the authors propose a technique to estimate the global peri-operative displacement of the heart and to predict the intra-operative target vessel location, validated via both an in vitro and a clinical study. Methods: As the peri-operative heart migration during RA-CABG hasmore » never been reported in the literatures, a simple in vitro validation study was conducted using a heart phantom. To mimic the clinical workflow, a pre-operative CT as well as peri-operative ultrasound images at three different stages in the procedure (Stage{sub 0}--following intubation; Stage{sub 1}--following lung deflation; and Stage{sub 2}--following thoracic insufflation) were acquired during the experiment. Following image acquisition, a rigid-body registration using iterative closest point algorithm with the robust estimator was employed to map the pre-operative stage to each of the peri-operative ones, to estimate the heart migration and predict the peri-operative target vessel location. Moreover, a clinical validation of this technique was conducted using offline patient data, where a Monte Carlo simulation was used to overcome the limitations arising due to the invisibility of the target vessel in the peri-operative ultrasound images. Results: For the in vitro study, the computed target registration error (TRE) at Stage{sub 0}, Stage{sub 1}, and Stage{sub 2} was 2.1, 3.3, and 2.6 mm, respectively. According to the offline clinical validation study, the maximum TRE at the left anterior descending (LAD) coronary artery was 4.1 mm at Stage{sub 0}, 5.1 mm at Stage{sub 1}, and 3.4 mm at Stage{sub 2}. Conclusions: The authors proposed a method to measure and validate peri-operative shifts of the heart during RA-CABG. In vitro and clinical validation studies were conducted and yielded a TRE in the order of 5 mm for all cases. As the desired clinical accuracy imposed by this procedure is on the order of one intercostal space (10-15 mm), our technique suits the clinical requirements. The authors therefore believe this technique has the potential to improve the pre-operative planning by updating peri-operative migration patterns of the heart and, consequently, will lead to reduced conversion to conventional open thoracic procedures.« less
Schut, Henk; Stroebe, Margaret S.; Wilson, Stewart; Birrell, John
2016-01-01
Objective This study assessed the validity of the Indicator of Bereavement Adaptation Cruse Scotland (IBACS). Designed for use in clinical and non-clinical settings, the IBACS measures severity of grief symptoms and risk of developing complications. Method N = 196 (44 male, 152 female) help-seeking, bereaved Scottish adults participated at two timepoints: T1 (baseline) and T2 (after 18 months). Four validated assessment instruments were administered: CORE-R, ICG-R, IES-R, SCL-90-R. Discriminative ability was assessed using ROC curve analysis. Concurrent validity was tested through correlation analysis at T1. Predictive validity was assessed using correlation analyses and ROC curve analysis. Optimal IBACS cutoff values were obtained by calculating a maximal Youden index J in ROC curve analysis. Clinical implications were compared across instruments. Results ROC curve analysis results (AUC = .84, p < .01, 95% CI between .77 and .90) indicated the IBACS is a good diagnostic instrument for assessing complicated grief. Positive correlations (p < .01, 2-tailed) with all four instruments at T1 demonstrated the IBACS' concurrent validity, strongest with complicated grief measures (r = .82). Predictive validity was shown to be fair in T2 ROC curve analysis results (n = 67, AUC = .78, 95% CI between .65 and .92; p < .01). Predictive validity was also supported by stable positive correlations between IBACS and other instruments at T2. Clinical indications were found not to differ across instruments. Conclusions The IBACS offers effective grief symptom and risk assessment for use by non-clinicians. Indications are sufficient to support intake assessment for a stepped model of bereavement intervention. PMID:27741246
Cooper, Dale J; Scammell, Brigitte E; Batt, Mark E; Palmer, Debbie
2018-01-17
The impracticalities and comparative expense of carrying out a clinical assessment is an obstacle in many large epidemiological studies. The purpose of this study was to develop and validate a series of electronic self-reported line drawing instruments based on the modified Beighton scoring system for the assessment of self-reported generalised joint hypermobility. Five sets of line drawings were created to depict the 9-point Beighton score criteria. Each instrument consisted of an explanatory question whereby participants were asked to select the line drawing which best represented their joints. Fifty participants completed the self-report online instrument on two occasions, before attending a clinical assessment. A blinded expert clinical observer then assessed participants' on two occasions, using a standardised goniometry measurement protocol. Validity of the instrument was assessed by participant-observer agreement and reliability by participant repeatability and observer repeatability using unweighted Cohen's kappa (k). Validity and reliability were assessed for each item in the self-reported instrument separately, and for the sum of the total scores. An aggregate score for generalised joint hypermobility was determined based on a Beighton score of 4 or more out of 9. Observer-repeatability between the two clinical assessments demonstrated perfect agreement (k 1.00; 95% CI 1.00, 1.00). Self-reported participant-repeatability was lower but it was still excellent (k 0.91; 95% CI 0.74, 1.00). The participant-observer agreement was excellent (k 0.96; 95% CI 0.87, 1.00). Validity was excellent for the self-report instrument, with a good sensitivity of 0.87 (95% CI 0.81, 0.91) and excellent specificity of 0.99 (95% CI 0.98, 1.00). The self-reported instrument provides a valid and reliable assessment of the presence of generalised joint hypermobility and may have practical use in epidemiological studies.
Standards for testing and clinical validation of seizure detection devices.
Beniczky, Sándor; Ryvlin, Philippe
2018-06-01
To increase the quality of studies on seizure detection devices, we propose standards for testing and clinical validation of such devices. We identified 4 key features that are important for studies on seizure detection devices: subjects, recordings, data analysis and alarms, and reference standard. For each of these features, we list the specific aspects that need to be addressed in the studies, and depending on these, studies are classified into 5 phases (0-4). We propose a set of outcome measures that need to be reported, and we propose standards for reporting the results. These standards will help in designing and reporting studies on seizure detection devices, they will give readers clear information on the level of evidence provided by the studies, and they will help regulatory bodies in assessing the quality of the validation studies. These standards are flexible, allowing classification of the studies into one of the 5 phases. We propose actions that can facilitate development of novel methods and devices. Wiley Periodicals, Inc. © 2018 International League Against Epilepsy.
Current progress in patient-specific modeling
2010-01-01
We present a survey of recent advancements in the emerging field of patient-specific modeling (PSM). Researchers in this field are currently simulating a wide variety of tissue and organ dynamics to address challenges in various clinical domains. The majority of this research employs three-dimensional, image-based modeling techniques. Recent PSM publications mostly represent feasibility or preliminary validation studies on modeling technologies, and these systems will require further clinical validation and usability testing before they can become a standard of care. We anticipate that with further testing and research, PSM-derived technologies will eventually become valuable, versatile clinical tools. PMID:19955236
Lin, Keh-chung; Chen, Hui-fang; Chen, Chia-ling; Wang, Tien-ni; Wu, Ching-yi; Hsieh, Yu-wei; Wu, Li-ling
2012-01-01
This study examined criterion-related validity and clinimetric properties of the Pediatric Motor Activity Log (PMAL) in children with cerebral palsy. Study participants were 41 children (age range: 28-113 months) and their parents. Criterion-related validity was evaluated by the associations between the PMAL and criterion measures at baseline and posttreatment, including the self-care, mobility, and cognition subscale, the total performance of the Functional Independence Measure in children (WeeFIM), and the grasping and visual-motor integration of the Peabody Developmental Motor Scales. Pearson correlation coefficients were calculated. Responsiveness was examined using the paired t test and the standardized response mean, the minimal detectable change was captured at the 90% confidence level, and the minimal clinically important change was estimated using anchor-based and distribution-based approaches. The PMAL-QOM showed fair concurrent validity at pretreatment and posttreatment and predictive validity, whereas the PMAL-AOU had fair concurrent validity at posttreatment only. The PMAL-AOU and PMAL-QOM were both markedly responsive to change after treatment. Improvement of at least 0.67 points on the PMAL-AOU and 0.66 points on the PMAL-QOM can be considered as a true change, not measurement error. A mean change has to exceed the range of 0.39-0.94 on the PMAL-AOU and the range of 0.38-0.74 on the PMAL-QOM to be regarded as clinically important change. Copyright © 2011 Elsevier Ltd. All rights reserved.
Shirasaki, Osamu; Asou, Yosuke; Takahashi, Yukio
2007-12-01
Owing to fast or stepwise cuff deflation, or measuring at places other than the upper arm, the clinical accuracy of most recent automated sphygmomanometers (auto-BPMs) cannot be validated by one-arm simultaneous comparison, which would be the only accurate validation method based on auscultation. Two main alternative methods are provided by current standards, that is, two-arm simultaneous comparison (method 1) and one-arm sequential comparison (method 2); however, the accuracy of these validation methods might not be sufficient to compensate for the suspicious accuracy in lateral blood pressure (BP) differences (LD) and/or BP variations (BPV) between the device and reference readings. Thus, the Japan ISO-WG for sphygmomanometer standards has been studying a new method that might improve validation accuracy (method 3). The purpose of this study is to determine the appropriateness of method 3 by comparing immunity to LD and BPV with those of the current validation methods (methods 1 and 2). The validation accuracy of the above three methods was assessed in human participants [N=120, 45+/-15.3 years (mean+/-SD)]. An oscillometric automated monitor, Omron HEM-762, was used as the tested device. When compared with the others, methods 1 and 3 showed a smaller intra-individual standard deviation of device error (SD1), suggesting their higher reproducibility of validation. The SD1 by method 2 (P=0.004) significantly correlated with the participant's BP, supporting our hypothesis that the increased SD of device error by method 2 is at least partially caused by essential BPV. Method 3 showed a significantly (P=0.0044) smaller interparticipant SD of device error (SD2), suggesting its higher interparticipant consistency of validation. Among the methods of validation of the clinical accuracy of auto-BPMs, method 3, which showed the highest reproducibility and highest interparticipant consistency, can be proposed as being the most appropriate.
Jørgensen, Line Dahl; Willadsen, Elisabeth
2017-01-01
The purpose of this study was to develop and validate a clinically useful speech-language screening procedure for young children with cleft palate ± cleft lip (CP) to identify those in need of speech-language intervention. Twenty-two children with CP were assigned to a +/- need for intervention conditions based on assessment of consonant inventory using a real-time listening procedure in combination with parent-reported expressive vocabulary. These measures allowed evaluation of early speech-language skills found to correlate significantly with later speech-language performance in longitudinal studies of children with CP. The external validity of this screening procedure was evaluated by comparing the +/- need for intervention assignment determined by the screening procedure to experienced speech-language pathologist (SLP)s' clinical judgement of whether or not a child needed early intervention. The results of real-time listening assessment showed good-excellent inter-rater agreement on different consonant inventory measures. Furthermore, there was almost perfect agreement between the children selected for intervention with the screening procedure and the clinical judgement of experienced SLPs indicate that the screening procedure is a valid way of identifying children with CP who need early intervention.
Clinical validation of nursing outcome mobility in patients with cerebrovascular accidents.
Moreira, Rafaella Pessoa; Araujo, Thelma Leite de; Lopes, Marcos Venicios de Oliveira; Cavalcante, Tahissa Frota; Guedes, Nirla Gomes; Chaves, Emília Soares; Portela, Regiane Campos; Holanda, Rose-Eloise
2016-12-15
To clinically validate the nursing outcome Mobility in patients with cerebrovascular accidents. Descriptive study, conducted in July 2011, with 38 outpatients, in northeastern Brazil. Data collection took place by evaluating two pairs of specialist nurses, where one pair used the instrument containing the constitutive and operational definitions of the indicators and magnitudes of the Mobility Outcome and the other pair without such definitions. When analyzing the evaluations among nurses, all indicators showed significant differences by the Friedman test (p <0.05). The constitutive and operational definitions submitted to the validation process provide greater accuracy in assessing the cerebrovascular accident patient's mobility state.
Construct Validity of the WISC-IV[superscript UK] with a Large Referred Irish Sample
ERIC Educational Resources Information Center
Watkins, Marley W.; Canivez, Gary L.; James, Trevor; James, Kate; Good, Rebecca
2013-01-01
Irish educational psychologists frequently use the Wechsler Intelligence Scale for Children-Fourth U.K. Edition (WISC-IV[superscript UK]) in clinical assessments of children with learning difficulties. Unfortunately, reliability and validity studies of the WISC-IV[superscript UK] have not yet been reported. This study examined the construct…
The City of Hope-Quality of Life-Ostomy Questionnaire: Persian Translation and Validation
Anaraki, F; Vafaie, M; Behboo, R; Esmaeilpour, S; Maghsoodi, N; Safaee, A; Grant, M
2014-01-01
Background: Since there is no disease-specific instrument for measuring quality-of-life (QOL) in Ostomy patients in Persian language. Aim: This study was designed to translate and evaluate the validity and reliability of City of Hope-quality of life-Ostomy questionnaire (COH-QOL-Ostomy questionnaire). Subjects and Methods: This study was designed as cross-sectional study. Reliability of the subscales and the summary scores were demonstrated by intra-class correlation coefficients. Pearson's correlations of an item with its own scale and other scales were calculated to evaluated convergent and discriminant validity. Clinical validity was also evaluated by known-group comparisons. Results: Cronbach's alpha coefficient for all subscales was about 0.70 or higher. Results of interscale correlation were satisfactory and each subscale only measured a single and specified trait. All subscales met the standards of convergent and discriminant validity. Known group comparison analysis showed significant differences in social and spiritual well-being. Conclusion: The findings confirmed the reliability and validity of Persian version of COH-QOL-Ostomy questionnaire. The instrument was also well received by the Iranian patients. It can be considered as a valuable instrument to assess the different aspects of health related quality-of-life in Ostomy patients and used in clinical research in the future. PMID:25221719
Rosing, H.; Hillebrand, M. J. X.; Blesson, S.; Mengesha, B.; Diro, E.; Hailu, A.; Schellens, J. H. M.; Beijnen, J. H.
2016-01-01
To facilitate future pharmacokinetic studies of combination treatments against leishmaniasis in remote regions in which the disease is endemic, a simple cheap sampling method is required for miltefosine quantification. The aims of this study were to validate a liquid chromatography-tandem mass spectrometry method to quantify miltefosine in dried blood spot (DBS) samples and to validate its use with Ethiopian patients with visceral leishmaniasis (VL). Since hematocrit (Ht) levels are typically severely decreased in VL patients, returning to normal during treatment, the method was evaluated over a range of clinically relevant Ht values. Miltefosine was extracted from DBS samples using a simple method of pretreatment with methanol, resulting in >97% recovery. The method was validated over a calibration range of 10 to 2,000 ng/ml, and accuracy and precision were within ±11.2% and ≤7.0% (≤19.1% at the lower limit of quantification), respectively. The method was accurate and precise for blood spot volumes between 10 and 30 μl and for Ht levels of 20 to 35%, although a linear effect of Ht levels on miltefosine quantification was observed in the bioanalytical validation. DBS samples were stable for at least 162 days at 37°C. Clinical validation of the method using paired DBS and plasma samples from 16 VL patients showed a median observed DBS/plasma miltefosine concentration ratio of 0.99, with good correlation (Pearson's r = 0.946). Correcting for patient-specific Ht levels did not further improve the concordance between the sampling methods. This successfully validated method to quantify miltefosine in DBS samples was demonstrated to be a valid and practical alternative to venous blood sampling that can be applied in future miltefosine pharmacokinetic studies with leishmaniasis patients, without Ht correction. PMID:26787691
[Methodologic and clinical comparison of four different ergospirometry systems].
Winter, U J; Fritsch, J; Gitt, A K; Pothoff, G; Berge, P G; Hilger, H H
1994-01-01
The clinician who uses cardio-pulmonary exercise testing (CPX) systems relies on the technical informations from the device producers. In this paper, the practicability, the accuracy and the safety of four different, available CPX systems are compared in the clinical area, using clinically orientated criteria. The exercise tests were performed in healthy subjects, in patients with cardiac and/or pulmonary disease as well as in young or old people. The comparison study showed, that there were partially large differences in device design and measurement accuracy. Furthermore, our investigation demonstrated that beneath repetitive calibrations of the CPX systems a frequent validation of the devices by means of a metabolic simulator is necessary. Problems in calibration can be caused by an inadequate performance or by unclean calibration gases. Problems in validation can be due to incompatibility of the CPX device and the validator. The comparison study of the four different systems showed that in the future standards for CPX testing should be defined.
Wolf, Erika J.; Miller, Mark W.; Orazem, Robert J.; Weierich, Mariann R.; Castillo, Diane T.; Milford, Jaime; Kaloupek, Danny G.; Keane, Terence M.
2008-01-01
This study examined the psychometric properties of the Minnesota Multiphasic Personality Inventory-2 (MMPI-2) Restructured Clinical Scales (RCSs) in individuals with posttraumatic stress disorder (PTSD) receiving clinical services at Veterans Affairs medical centers. Study 1 included 1,098 men who completed the MMPI-2 and were assessed for a range of psychological disorders via structured clinical interview. Study 2 included 136 women who completed the MMPI-2 and were interviewed with the Clinician Administered Scale for PTSD. The utility of the RCSs was compared to that of the Clinical Scales (CSs) and the Keane PTSD (PK) scale. The RCSs demonstrated good psychometric properties along with patterns of associations with other measures of psychopathology that corresponded to current theory regarding the structure of comorbidity. A notable advantage of the RCSs compared to the MMPI-2 CSs was their enhanced construct validity and clinical utility in the assessment of comorbid internalizing and externalizing psychopathology. The PK scale demonstrated incremental validity in the prediction of PTSD beyond that of the RCSs or CSs. PMID:19086756
Ramsperger, Robert; Meckler, Stefan; Heger, Tanja; van Uem, Janet; Hucker, Svenja; Braatz, Ulrike; Graessner, Holm; Berg, Daniela; Manoli, Yiannos; Serrano, J Artur; Ferreira, Joaquim J; Hobert, Markus A; Maetzler, Walter
2016-05-01
Dyskinesias in Parkinson's disease (PD) patients are a common side effect of long-term dopaminergic therapy and are associated with motor dysfunctions, including gait and balance deficits. Although promising compounds have been developed to treat these symptoms, clinical trials have failed. This failure may, at least partly, be explained by the lack of objective and continuous assessment strategies. This study tested the clinical validity and ecological effect of an algorithm that detects and quantifies dyskinesias of the legs using a single ankle-worn sensor. Twenty-three PD patients (seven with leg dyskinesias) and 13 control subjects were investigated in the lab. Participants performed purposeful daily activity-like tasks while being video-taped. Clinical evaluation was performed using the leg dyskinesia item of the Unified Dyskinesia Rating Scale. The ecological effect of the developed algorithm was investigated in a multi-center, 12-week, home-based sub-study that included three patients with and seven without dyskinesias. In the lab-based sub-study, the sensor-based algorithm exhibited a specificity of 98%, a sensitivity of 85%, and an accuracy of 0.96 for the detection of dyskinesias and a correlation level of 0.61 (p < 0.001) with the clinical severity score. In the home-based sub-study, all patients could be correctly classified regarding the presence or absence of leg dyskinesias, supporting the ecological relevance of the algorithm. This study provides evidence of clinical validity and ecological effect of an algorithm derived from a single sensor on the ankle for detecting leg dyskinesias in PD patients. These results should motivate the investigation of leg dyskinesias in larger studies using wearable sensors. Copyright © 2016 Elsevier Ltd. All rights reserved.
Comparison of Clinical, Maternal, and Self Pubertal Assessments: Implications for Health Studies
Goldberg, Mandy; Schechter, Sarah; Houghton, Lauren C.; White, Melissa L.; O’Toole, Karen; Chung, Wendy K.; Daly, Mary B.; Keegan, Theresa H.M.; Andrulis, Irene L.; Bradbury, Angela R.; Schwartz, Lisa; Knight, Julia A.; John, Esther M.; Buys, Saundra S.
2016-01-01
BACKGROUND: Most epidemiologic studies of puberty have only 1 source of pubertal development information (maternal, self or clinical). Interpretation of results across studies requires data on reliability and validity across sources. METHODS: The LEGACY Girls Study, a 5-site prospective study of girls aged 6 to 13 years (n = 1040) collected information on breast and pubic hair development from mothers (for all daughters) and daughters (if ≥10 years) according to Tanner stage (T1–5) drawings. At 2 LEGACY sites, girls (n = 282) were also examined in the clinic by trained professionals. We assessed agreement (κ) and validity (sensitivity and specificity) with the clinical assessment (gold standard) for both the mothers’ and daughters’ assessment in the subcohort of 282. In the entire cohort, we examined the agreement between mothers and daughters. RESULTS: Compared with clinical assessment, sensitivity of maternal assessment for breast development was 77.2 and specificity was 94.3. In girls aged ≥11 years, self-assessment had higher sensitivity and specificity than maternal report. Specificity for both mothers and self, but not sensitivity, was significantly lower for overweight girls. In the overall cohort, maternal and daughter agreement for breast development and pubic hair development (T2+ vs T1) were similar (0.66, [95% confidence interval 0.58–0.75] and 0.69 [95% confidence interval 0.61–0.77], respectively), but declined with age. Mothers were more likely to report a lower Tanner stage for both breast and pubic hair compared with self-assessments. CONCLUSIONS: These differences in validity should be considered in studies measuring pubertal changes longitudinally when they do not have access to clinical assessments. PMID:27279647
Elvén, Maria; Hochwälder, Jacek; Dean, Elizabeth; Söderlund, Anne
2015-05-01
A biopsychosocial approach and behaviour change strategies have long been proposed to serve as a basis for addressing current multifaceted health problems. This emphasis has implications for clinical reasoning of health professionals. This study's aim was to develop and validate a conceptual model to guide physiotherapists' clinical reasoning focused on clients' behaviour change. Phase 1 consisted of the exploration of existing research and the research team's experiences and knowledge. Phases 2a and 2b consisted of validation and refinement of the model based on input from physiotherapy students in two focus groups (n = 5 per group) and from experts in behavioural medicine (n = 9). Phase 1 generated theoretical and evidence bases for the first version of a model. Phases 2a and 2b established the validity and value of the model. The final model described clinical reasoning focused on clients' behaviour change as a cognitive, reflective, collaborative and iterative process with multiple interrelated levels that included input from the client and physiotherapist, a functional behavioural analysis of the activity-related target behaviour and the selection of strategies for behaviour change. This unique model, theory- and evidence-informed, has been developed to help physiotherapists to apply clinical reasoning systematically in the process of behaviour change with their clients.
The Utrecht questionnaire (U-CEP) measuring knowledge on clinical epidemiology proved to be valid.
Kortekaas, Marlous F; Bartelink, Marie-Louise E L; de Groot, Esther; Korving, Helen; de Wit, Niek J; Grobbee, Diederick E; Hoes, Arno W
2017-02-01
Knowledge on clinical epidemiology is crucial to practice evidence-based medicine. We describe the development and validation of the Utrecht questionnaire on knowledge on Clinical epidemiology for Evidence-based Practice (U-CEP); an assessment tool to be used in the training of clinicians. The U-CEP was developed in two formats: two sets of 25 questions and a combined set of 50. The validation was performed among postgraduate general practice (GP) trainees, hospital trainees, GP supervisors, and experts. Internal consistency, internal reliability (item-total correlation), item discrimination index, item difficulty, content validity, construct validity, responsiveness, test-retest reliability, and feasibility were assessed. The questionnaire was externally validated. Internal consistency was good with a Cronbach alpha of 0.8. The median item-total correlation and mean item discrimination index were satisfactory. Both sets were perceived as relevant to clinical practice. Construct validity was good. Both sets were responsive but failed on test-retest reliability. One set took 24 minutes and the other 33 minutes to complete, on average. External GP trainees had comparable results. The U-CEP is a valid questionnaire to assess knowledge on clinical epidemiology, which is a prerequisite for practicing evidence-based medicine in daily clinical practice. Copyright © 2016 Elsevier Inc. All rights reserved.
Yousuf, Naveed; Violato, Claudio; Zuberi, Rukhsana W
2015-01-01
CONSTRUCT: Authentic standard setting methods will demonstrate high convergent validity evidence of their outcomes, that is, cutoff scores and pass/fail decisions, with most other methods when compared with each other. The objective structured clinical examination (OSCE) was established for valid, reliable, and objective assessment of clinical skills in health professions education. Various standard setting methods have been proposed to identify objective, reliable, and valid cutoff scores on OSCEs. These methods may identify different cutoff scores for the same examinations. Identification of valid and reliable cutoff scores for OSCEs remains an important issue and a challenge. Thirty OSCE stations administered at least twice in the years 2010-2012 to 393 medical students in Years 2 and 3 at Aga Khan University are included. Psychometric properties of the scores are determined. Cutoff scores and pass/fail decisions of Wijnen, Cohen, Mean-1.5SD, Mean-1SD, Angoff, borderline group and borderline regression (BL-R) methods are compared with each other and with three variants of cluster analysis using repeated measures analysis of variance and Cohen's kappa. The mean psychometric indices on the 30 OSCE stations are reliability coefficient = 0.76 (SD = 0.12); standard error of measurement = 5.66 (SD = 1.38); coefficient of determination = 0.47 (SD = 0.19), and intergrade discrimination = 7.19 (SD = 1.89). BL-R and Wijnen methods show the highest convergent validity evidence among other methods on the defined criteria. Angoff and Mean-1.5SD demonstrated least convergent validity evidence. The three cluster variants showed substantial convergent validity with borderline methods. Although there was a high level of convergent validity of Wijnen method, it lacks the theoretical strength to be used for competency-based assessments. The BL-R method is found to show the highest convergent validity evidences for OSCEs with other standard setting methods used in the present study. We also found that cluster analysis using mean method can be used for quality assurance of borderline methods. These findings should be further confirmed by studies in other settings.
Cognitive assessment: A challenge for occupational therapists in Brazil
Conti, Juliana
2017-01-01
Cognitive impairment is a common dysfunction after neurological injury. Cognitive assessment tools can help the therapist understand how impairments are affecting functional status and quality of life. Objective The aim of the study was to identify instruments for cognitive assessment that Occupational Therapists (OT) can use in clinical practice. Methods The instruments published in English and Portuguese between 1999 and 2016 were systematically reviewed. Results The search identified 17 specific instruments for OT not validated in Brazilian Portuguese, 10 non-specific instruments for OT not validated in Brazilian Portuguese, and 25 instruments validated for Portuguese, only one of which was specific for OT (Lowenstein Occupational Therapy Cognitive Assessment). Conclusion There are few assessment cognitive tools validated for use in the Brazilian culture and language. The majority of the instruments appear not to be validated for use by OT in clinical practice. PMID:29213503
Junghaenel, Doerte U; Schneider, Stefan; Stone, Arthur A; Christodoulou, Christopher; Broderick, Joan E
2014-04-01
This study examined the ecological validity and clinical utility of NIH Patient Reported-Outcomes Measurement Information System (PROMIS®) instruments for anger, depression, and fatigue in women with premenstrual symptoms. One-hundred women completed daily diaries and weekly PROMIS assessments over 4weeks. Weekly assessments were administered through Computerized Adaptive Testing (CAT). Weekly CATs and corresponding daily scores were compared to evaluate ecological validity. To test clinical utility, we examined if CATs could detect changes in symptom levels, if these changes mirrored those obtained from daily scores, and if CATs could identify clinically meaningful premenstrual symptom change. PROMIS CAT scores were higher in the pre-menstrual than the baseline (ps<.0001) and post-menstrual (ps<.0001) weeks. The correlations between CATs and aggregated daily scores ranged from .73 to .88 supporting ecological validity. Mean CAT scores showed systematic changes in accordance with the menstrual cycle and the magnitudes of the changes were similar to those obtained from the daily scores. Finally, Receiver Operating Characteristic (ROC) analyses demonstrated the ability of the CATs to discriminate between women with and without clinically meaningful premenstrual symptom change. PROMIS CAT instruments for anger, depression, and fatigue demonstrated validity and utility in premenstrual symptom assessment. The results provide encouraging initial evidence of the utility of PROMIS instruments for the measurement of affective premenstrual symptoms. Copyright © 2014 Elsevier Inc. All rights reserved.
Freitag, Christine M
2014-05-01
Autism Spectrum Disorder (ASD) in DSM-5 comprises the former DSM-IV-TR diagnoses of Autistic Disorder, Asperger's Disorder and PDD-nos. The criteria for ASD in DSM-5 were considerably revised from those of ICD-10 and DSM-IV-TR. The present article compares the diagnostic criteria, presents studies on the validity and reliability of ASD, and discusses open questions. It ends with a clinical and research perspective.
Park, Dong-Jin; Kang, Ji-Hyoun; Lee, Jeong-Won; Lee, Kyung-Eun; Wen, Lihui; Kim, Tae-Jong; Park, Yong-Wook; Nam, Tai-Seung; Kim, Myung-Sun; Lee, Shin-Seok
2013-07-01
The aim of this study was to assess and validate the Korean version of the Boston Carpal Tunnel Questionnaire (K-BCTQ) in patients with carpal tunnel syndrome (CTS). After translation and cultural adaptation of the BCTQ to a Korean version, the K-BCTQ was administered to 54 patients with CTS; it was administered again after 2 weeks to assess reliability. Additionally, we administered K-DASH and EQ-5D to assess construct-validity. In a prospective study of responsiveness to clinical change, 29 of 54 patients were treated by ultrasonography-guided local corticosteroid injection therapy. The internal consistency of the K-BCTQ was high (Cronbach's alpha: 0.915) and the intra-class correlation coefficients were 0.931 for the symptom severity scale (P<0.001) and 0.844 for the functional severity scale (P<0.001). The construct-validity between the symptom severity scale and the K-DASH, and between the functional severity scale and the K-DASH were significantly correlated (both P<0.001). Clinical improvement was noted in 29 patients with injection therapy. The effect size of symptom severity was 0.67, and that of functional severity was 0.58. In conclusion, the K-BCTQ shows good reliability, construct-validity, and acceptable responsiveness after local corticosteroid injection therapy (Clinical trial number, KCT0000050).
Chen, Hongda; Knebel, Phillip; Brenner, Hermann
2016-07-01
Search for biomarkers for early detection of cancer is a very active area of research, but most studies are done in clinical rather than screening settings. We aimed to empirically evaluate the role of study setting for early detection marker identification and validation. A panel of 92 candidate cancer protein markers was measured in 35 clinically identified colorectal cancer patients and 35 colorectal cancer patients identified at screening colonoscopy. For each case group, we selected 38 controls without colorectal neoplasms at screening colonoscopy. Single-, two- and three-marker combinations discriminating cases and controls were identified in each setting and subsequently validated in the alternative setting. In all scenarios, a higher number of predictive biomarkers were initially detected in the clinical setting, but a substantially lower proportion of identified biomarkers could subsequently be confirmed in the screening setting. Confirmation rates were 50.0%, 84.5%, and 74.2% for one-, two-, and three-marker algorithms identified in the screening setting and were 42.9%, 18.6%, and 25.7% for algorithms identified in the clinical setting. Validation of early detection markers of cancer in a true screening setting is important to limit the number of false-positive findings. Copyright © 2016 The Authors. Published by Elsevier Inc. All rights reserved.
Matulis, Simone; Loos, Laura; Langguth, Nadine; Schreiber, Franziska; Gutermann, Jana; Gawrilow, Caterina; Steil, Regina
2015-01-01
Background The Trauma Symptom Checklist for Children (TSC-C) is the most widely used self-report scale to assess trauma-related symptoms in children and adolescents on six clinical scales. The purpose of the present study was to develop a German version of the TSC-C and to investigate its psychometric properties, such as factor structure, reliability, and validity, in a sample of German adolescents. Method A normative sample of N=583 and a clinical sample of N=41 adolescents with a history of physical or sexual abuse aged between 13 and 21 years participated in the study. Results The Confirmatory Factor Analysis on the six-factor model (anger, anxiety, depression, dissociation, posttraumatic stress, and sexual concerns with the subdimensions preoccupation and distress) revealed acceptable to good fit statistics in the normative sample. One item had to be excluded from the German version of the TSC-C because the factor loading was too low. All clinical scales presented acceptable to good reliability, with Cronbach's α's ranging from .80 to .86 in the normative sample and from .72 to .87 in the clinical sample. Concurrent validity was also demonstrated by the high correlations between the TSC-C scales and instruments measuring similar psychopathology. TSC-C scores reliably differentiated between adolescents with trauma history and those without trauma history, indicating discriminative validity. Conclusions In conclusion, the German version of the TSC-C is a reliable and valid instrument for assessing trauma-related symptoms on six different scales in adolescents aged between 13 and 21 years. PMID:26498182
Matulis, Simone; Loos, Laura; Langguth, Nadine; Schreiber, Franziska; Gutermann, Jana; Gawrilow, Caterina; Steil, Regina
2015-01-01
The Trauma Symptom Checklist for Children (TSC-C) is the most widely used self-report scale to assess trauma-related symptoms in children and adolescents on six clinical scales. The purpose of the present study was to develop a German version of the TSC-C and to investigate its psychometric properties, such as factor structure, reliability, and validity, in a sample of German adolescents. A normative sample of N=583 and a clinical sample of N=41 adolescents with a history of physical or sexual abuse aged between 13 and 21 years participated in the study. The Confirmatory Factor Analysis on the six-factor model (anger, anxiety, depression, dissociation, posttraumatic stress, and sexual concerns with the subdimensions preoccupation and distress) revealed acceptable to good fit statistics in the normative sample. One item had to be excluded from the German version of the TSC-C because the factor loading was too low. All clinical scales presented acceptable to good reliability, with Cronbach's α's ranging from .80 to .86 in the normative sample and from .72 to .87 in the clinical sample. Concurrent validity was also demonstrated by the high correlations between the TSC-C scales and instruments measuring similar psychopathology. TSC-C scores reliably differentiated between adolescents with trauma history and those without trauma history, indicating discriminative validity. In conclusion, the German version of the TSC-C is a reliable and valid instrument for assessing trauma-related symptoms on six different scales in adolescents aged between 13 and 21 years.
Rajpal, Saurabh; Alshawabkeh, Laith; Opotowsky, Alexander R
2017-06-01
There is an increasing number of adult patients with congenital heart disease (CHD). While several biomarkers have been validated and integrated into general cardiology clinical practice, these tests are often applied to adults with CHD in the absence of disease-specific validation. Although these patients are often grouped into a single population, there is heterogeneous pathophysiology, variable disease chronicity, extensive multisystem involvement, and a low event rate relative to acquired heart disease. These stand as challenges to systematic investigation and clinical application of biomarkers for adults with CHD. This paper reviews recent studies investigating the use of biomarkers in this population, with emphasis on biomarkers applied in clinical adult CHD care. A handful of biomarkers have been integrated into adult CHD practice, such as iron studies in cyanotic heart disease and stool alpha-1 antitrypsin for diagnosis of protein losing enteropathy in the Fontan circulation. Use of kidney and liver tests has been studied in prognostication of adult CHD patients. A few other biomarkers like natriuretic peptides and troponins seem likely to provide useful information in other ACHD situations based on limited disease-specific data and extrapolation from acquired heart disease. More research is needed to support the robust validity of most existing clinical biomarkers in adult congenital cardiology practice. Until data from larger, prospectively enrolled cohorts are available, clinical use of biomarkers in these patients will require careful interpretation with attention to underlying pathophysiology, as well as detailed understanding of potential pitfalls of specific assays and clinical contexts.
Low-pleasure beliefs in patients with schizophrenia and individuals with social anhedonia.
Yang, Yin; Yang, Zhuo-Ya; Zou, Ying-Min; Shi, Hai-Song; Wang, Yi; Xie, Dong-Jie; Zhang, Rui-Ting; Lui, Simon S Y; Cohen, Alex C; Strauss, Gregory P; Cheung, Eric F C; Chan, Raymond C K
2018-05-24
Anhedonia in schizophrenia has been suggested to comprise a set of low-pleasure beliefs, defined as beliefs that certain things/activities were not pleasurable or that one does not feel pleasant generally. However, no instrument has been intentionally developed to specifically measure low-pleasure beliefs, and there is a paucity of empirical evidence for low-pleasure beliefs and their relationship with anhedonia in both patients with schizophrenia and individuals with high social anhedonia. We developed and validated the Beliefs About Pleasure Scale (BAPS) using non-clinical (Studies 1, 2 & 3), chronic schizophrenia (Study 2), and first episode schizophrenia (Study 3) samples. Across these studies, we examined psychometric properties of the BAPS, including temporal stability, internal consistency, factor structure, and convergent validity. The 22 BAPS items loaded onto 4 factors, namely the "Devaluation of Pleasure", the "Pleasurable Activity Expectancies", the "Negative Outcomes Expectancies", and the "Attention to Pleasure". The measure demonstrated good internal consistency and convergent validity in each sample. Moreover, both individual with schizophrenia and non-clinical participants with high social anhedonia scored higher on the BAPS than controls (Study 3), supporting construct validity. These findings provide preliminary evidence for the presence of low-pleasure beliefs in both clinical and subclinical groups and suggest that the BAPS has promising initial psychometric properties. The BAPS will be useful for exploring the cognitive component of anhedonia and provides a novel assessment for mechanism of change in psychosocial treatment studies. Copyright © 2018. Published by Elsevier B.V.
Cerebrospinal Fluid Biomarkers for Huntington's Disease.
Byrne, Lauren M; Wild, Edward J
2016-01-01
Cerebrospinal fluid (CSF) is enriched in brain-derived components and represents an accessible and appealing means of interrogating the CNS milieu to study neurodegenerative diseases and identify biomarkers to facilitate the development of novel therapeutics. Many such CSF biomarkers have been proposed for Huntington's disease (HD) but none has been validated for clinical trial use. Across many studies proposing dozens of biomarker candidates, there is a notable lack of statistical power, consistency, rigor and validation. Here we review proposed CSF biomarkers including neurotransmitters, transglutaminase activity, kynurenine pathway metabolites, oxidative stress markers, inflammatory markers, neuroendocrine markers, protein markers of neuronal death, proteomic approaches and mutant huntingtin protein itself. We reflect on the need for large-scale, standardized CSF collections with detailed phenotypic data to validate and qualify much-needed CSF biomarkers for clinical trial use in HD.
Patterson, Fiona; Lievens, Filip; Kerrin, Máire; Munro, Neil; Irish, Bill
2013-01-01
Background The selection methodology for UK general practice is designed to accommodate several thousand applicants per year and targets six core attributes identified in a multi-method job-analysis study Aim To evaluate the predictive validity of selection methods for entry into postgraduate training, comprising a clinical problem-solving test, a situational judgement test, and a selection centre. Design and setting A three-part longitudinal predictive validity study of selection into training for UK general practice. Method In sample 1, participants were junior doctors applying for training in general practice (n = 6824). In sample 2, participants were GP registrars 1 year into training (n = 196). In sample 3, participants were GP registrars sitting the licensing examination after 3 years, at the end of training (n = 2292). The outcome measures include: assessor ratings of performance in a selection centre comprising job simulation exercises (sample 1); supervisor ratings of trainee job performance 1 year into training (sample 2); and licensing examination results, including an applied knowledge examination and a 12-station clinical skills objective structured clinical examination (OSCE; sample 3). Results Performance ratings at selection predicted subsequent supervisor ratings of job performance 1 year later. Selection results also significantly predicted performance on both the clinical skills OSCE and applied knowledge examination for licensing at the end of training. Conclusion In combination, these longitudinal findings provide good evidence of the predictive validity of the selection methods, and are the first reported for entry into postgraduate training. Results show that the best predictor of work performance and training outcomes is a combination of a clinical problem-solving test, a situational judgement test, and a selection centre. Implications for selection methods for all postgraduate specialties are considered. PMID:24267856
Patterson, Fiona; Lievens, Filip; Kerrin, Máire; Munro, Neil; Irish, Bill
2013-11-01
The selection methodology for UK general practice is designed to accommodate several thousand applicants per year and targets six core attributes identified in a multi-method job-analysis study To evaluate the predictive validity of selection methods for entry into postgraduate training, comprising a clinical problem-solving test, a situational judgement test, and a selection centre. A three-part longitudinal predictive validity study of selection into training for UK general practice. In sample 1, participants were junior doctors applying for training in general practice (n = 6824). In sample 2, participants were GP registrars 1 year into training (n = 196). In sample 3, participants were GP registrars sitting the licensing examination after 3 years, at the end of training (n = 2292). The outcome measures include: assessor ratings of performance in a selection centre comprising job simulation exercises (sample 1); supervisor ratings of trainee job performance 1 year into training (sample 2); and licensing examination results, including an applied knowledge examination and a 12-station clinical skills objective structured clinical examination (OSCE; sample 3). Performance ratings at selection predicted subsequent supervisor ratings of job performance 1 year later. Selection results also significantly predicted performance on both the clinical skills OSCE and applied knowledge examination for licensing at the end of training. In combination, these longitudinal findings provide good evidence of the predictive validity of the selection methods, and are the first reported for entry into postgraduate training. Results show that the best predictor of work performance and training outcomes is a combination of a clinical problem-solving test, a situational judgement test, and a selection centre. Implications for selection methods for all postgraduate specialties are considered.
Validation of the 4P's Plus screen for substance use in pregnancy validation of the 4P's Plus.
Chasnoff, I J; Wells, A M; McGourty, R F; Bailey, L K
2007-12-01
The purpose of this study is to validate the 4P's Plus screen for substance use in pregnancy. A total of 228 pregnant women enrolled in prenatal care underwent screening with the 4P's Plus and received a follow-up clinical assessment for substance use. Statistical analyses regarding reliability, sensitivity, specificity, and positive and negative predictive validity of the 4Ps Plus were conducted. The overall reliability for the five-item measure was 0.62. Seventy-four (32.5%) of the women had a positive screen. Sensitivity and specificity were very good, at 87 and 76%, respectively. Positive predictive validity was low (36%), but negative predictive validity was quite high (97%). Of the 31 women who had a positive clinical assessment, 45% were using less than 1 day per week. The 4P's Plus reliably and effectively screens pregnant women for risk of substance use, including those women typically missed by other perinatal screening methodologies.
Development and Validation of Personality Disorder Spectra Scales for the MMPI-2-RF.
Sellbom, Martin; Waugh, Mark H; Hopwood, Christopher J
2018-01-01
The purpose of this study was to develop and validate a set of MMPI-2-RF (Ben-Porath & Tellegen, 2008/2011) personality disorder (PD) spectra scales. These scales could serve the purpose of assisting with DSM-5 PD diagnosis and help link categorical and dimensional conceptions of personality pathology within the MMPI-2-RF. We developed and provided initial validity results for scales corresponding to the 10 PD constructs listed in the DSM-5 using data from student, community, clinical, and correctional samples. Initial validation efforts indicated good support for criterion validity with an external PD measure as well as with dimensional personality traits included in the DSM-5 alternative model for PDs. Construct validity results using psychosocial history and therapists' ratings in a large clinical sample were generally supportive as well. Overall, these brief scales provide clinicians using MMPI-2-RF data with estimates of DSM-5 PD constructs that can support cross-model connections between categorical and dimensional assessment approaches.
The Arabic version of the Mayo-Portland Adaptability Inventory 4: a validation study.
Hamed, Razan; Tariah, Hashem Abu; Malkawi, Somaya; Holm, Margo B
2012-09-01
The Mayo-Portland Adaptability Inventory 4 (MPAI-4) is a valid and reliable assessment tool to detect clinical impairments in patients with acquired brain injury. The tool is widely used by rehabilitation therapists worldwide, given its good psychometric properties and its availability in several languages. The purpose of this study was to translate the tool into Arabic and to examine its validity and reliability with multiple sclerosis and stroke patients. A total of 128 participants were enrolled in this study: 49 with multiple sclerosis, 17 with stroke, and 62 healthy adults. The psychometric properties of discriminative and convergent construct validity as well as test-retest reliability were tested. The translated tool, the Arabic-MPAI-4 (A-MPAI-4), significantly discriminated among the three subgroups (F=50.93, P<0.001), correlated moderately but significantly with the Arabic version of the Performance Assessment of Self-Care Skills Self-Report as a measure of functional independence in daily activities (r=-0.35, P<0.001), and showed good stability over time (r=0.73, P<0.001). The A-MPAI-4 is a valid and reliable tool for clinical use with multiple sclerosis and stroke patients who speak Arabic.
Alladin, Assen; Sabatini, Linda; Amundson, Jon K
2007-04-01
This paper briefly surveys the trend of and controversy surrounding empirical validation in psychotherapy. Empirical validation of hypnotherapy has paralleled the practice of validation in psychotherapy and the professionalization of clinical psychology, in general. This evolution in determining what counts as evidence for bona fide clinical practice has gone from theory-driven clinical approaches in the 1960s and 1970s through critical attempts at categorization of empirically supported therapies in the 1990s on to the concept of evidence-based practice in 2006. Implications of this progression in professional psychology are discussed in the light of hypnosis's current quest for validation and empirical accreditation.
Monga, Ash K; Tracey, Michael R; Subbaroyan, Jeyakumar
2012-08-01
The aim of this manuscript was to provide a systematic literature review of clinical trial evidence for a range of electrical stimulation therapies in the treatment of lower urinary tract symptoms (LUTS). The databases MEDLINE, BIOSIS Previews, Inside Conferences, and EMBASE were searched. Original clinical studies with greater than 15 subjects were included. Seventy-three studies were included, representing implanted sacral nerve stimulation (SNS), percutaneous posterior tibial nerve stimulation (PTNS), and transcutaneous electrical stimulation (TENS) therapy modalities. Median mean reductions in incontinence episodes and voiding frequency were similar for implanted SNS and PTNS. However, long-term follow-up data to validate the sustained benefit of PTNS are lacking. Despite a substantial body of research devoted to SNS validation, it is not possible to definitively define the appropriate role of this therapy owing largely to study design flaws that inhibited rigorous intention to treat analyses for the majority of these studies.
Hu, Alan Shiun Yew; Donohue, Peter O'; Gunnarsson, Ronny K; de Costa, Alan
2018-03-14
Valid and user-friendly prediction models for conversion to open cholecystectomy allow for proper planning prior to surgery. The Cairns Prediction Model (CPM) has been in use clinically in the original study site for the past three years, but has not been tested at other sites. A retrospective, single-centred study collected ultrasonic measurements and clinical variables alongside with conversion status from consecutive patients who underwent laparoscopic cholecystectomy from 2013 to 2016 in The Townsville Hospital, North Queensland, Australia. An area under the curve (AUC) was calculated to externally validate of the CPM. Conversion was necessary in 43 (4.2%) out of 1035 patients. External validation showed an area under the curve of 0.87 (95% CI 0.82-0.93, p = 1.1 × 10 -14 ). In comparison with most previously published models, which have an AUC of approximately 0.80 or less, the CPM has the highest AUC of all published prediction models both for internal and external validation. Crown Copyright © 2018. Published by Elsevier Inc. All rights reserved.
Rey-Martinez, Jorge; Pérez-Fernández, Nicolás
2016-12-01
The proposed validation goal of 0.9 in intra-class correlation coefficient was reached with the results of this study. With the obtained results we consider that the developed software (RombergLab) is a validated balance assessment software. The reliability of this software is dependent of the used force platform technical specifications. Develop and validate a posturography software and share its source code in open source terms. Prospective non-randomized validation study: 20 consecutive adults underwent two balance assessment tests, six condition posturography was performed using a clinical approved software and force platform and the same conditions were measured using the new developed open source software using a low cost force platform. Intra-class correlation index of the sway area obtained from the center of pressure variations in both devices for the six conditions was the main variable used for validation. Excellent concordance between RombergLab and clinical approved force platform was obtained (intra-class correlation coefficient =0.94). A Bland and Altman graphic concordance plot was also obtained. The source code used to develop RombergLab was published in open source terms.
Oh, HyunSoo; Lee, Seul; Kim, JiSun; Lee, EunJu; Min, HyoNam; Cho, OkJa; Seo, WhaSook
2015-07-01
This study was conducted to develop a family relocation stress scale by modifying the Son's Relocation Stress Syndrome Scale, to examine its clinical validity and reliability and to confirm its suitability for measuring family relocation stress. The transfer of ICU patients to general wards is a significant anxiety-producing event for family members. However, no relocation stress scale has been developed specifically for families. A nonexperimental, correlation design was adopted. The study subjects were 95 family members of 95 ICU patients at a university hospital located in Incheon, South Korea. Face and construct validities of the devised family relocation stress scale were examined. Construct validity was examined using factor analysis and by using a nomological validity test. Reliability was also examined. Face and content validity of the scale were verified by confirming that its items adequately measured family relocation stress. Factor analysis yielded four components, and the total variance explained by these four components was 63·0%, which is acceptable. Nomological validity was well supported by significant relationships between relocation stress and degree of preparation for relocation, patient self-care ability, family burden and satisfaction with the relocation process. The devised scale was also found to have good reliability. The family relocation stress scale devised in this study was found to have good validity and reliability, and thus, is believed to offer a means of assessing family relocation stress. The findings of this study provide a reliable and valid assessment tool when nurses prepare families for patient transfer from an ICU to a ward setting, and may also provide useful information to those developing an intervention programme for family relocation stress management. © 2015 John Wiley & Sons Ltd.
Vision Test Validation Study for the Health Examination Survey Among Youths 12-17 years.
ERIC Educational Resources Information Center
Roberts, Jean
A validation study of the vision test battery used in the Health Examination Survey of 1966-1970 was conducted among 210 youths 12-17 years-old who had been part of the larger survey. The study was designed to discover the degree of correspondence between survey test results and clinical examination by an opthalmologist in determining the…
Khoury, Joseph D; Wang, Wei-Lien; Prieto, Victor G; Medeiros, L Jeffrey; Kalhor, Neda; Hameed, Meera; Broaddus, Russell; Hamilton, Stanley R
2018-02-01
Biomarkers that guide therapy selection are gaining unprecedented importance as targeted therapy options increase in scope and complexity. In conjunction with high-throughput molecular techniques, therapy-guiding biomarker assays based upon immunohistochemistry (IHC) have a critical role in cancer care in that they inform about the expression status of a protein target. Here, we describe the validation procedures for four clinical IHC biomarker assays-PTEN, RB, MLH1, and MSH2-for use as integral biomarkers in the nationwide NCI-Molecular Analysis for Therapy Choice (NCI-MATCH) EAY131 clinical trial. Validation procedures were developed through an iterative process based on collective experience and adaptation of broad guidelines from the FDA. The steps included primary antibody selection; assay optimization; development of assay interpretation criteria incorporating biological considerations; and expected staining patterns, including indeterminate results, orthogonal validation, and tissue validation. Following assay lockdown, patient samples and cell lines were used for analytic and clinical validation. The assays were then approved as laboratory-developed tests and used for clinical trial decisions for treatment selection. Calculations of sensitivity and specificity were undertaken using various definitions of gold-standard references, and external validation was required for the PTEN IHC assay. In conclusion, validation of IHC biomarker assays critical for guiding therapy in clinical trials is feasible using comprehensive preanalytic, analytic, and postanalytic steps. Implementation of standardized guidelines provides a useful framework for validating IHC biomarker assays that allow for reproducibility across institutions for routine clinical use. Clin Cancer Res; 24(3); 521-31. ©2017 AACR . ©2017 American Association for Cancer Research.
Preliminary validation of the Review of Musculoskeletal System (ROMS) questionnaire.
Bershadsky, Boris; Kane, Robert L; Wuerz, Thomas; Jones, Morgan; Brighton, Brian; Stitzlein, Russell; Parker, Richard; Iannotti, Joseph P
2015-04-01
Measurement of clinical outcomes is necessary to define best practice. It requires a validated tool that can be easily applied as part of clinical practice. We present the preliminary validation of a brief self-reported Review of Musculoskeletal System (ROMS) questionnaire that captures functional limitations due to musculoskeletal problems and other medical and emotional conditions. Data were derived from a clinical outcomes database (Orthopaedic Minimal Data Set [OrthoMiDaS]) that combines patient-reported data collected as part of routine care and secondary data extracted from electronic medical records. The study utilized 82,873 encounters collected from 24,116 consecutive patients with problems in the upper and lower extremities. In addition to the ROMS, the study used version 2 of the Short Form-12 (SF-12v2), the Penn Shoulder Score (PSS), the Hip disability and Osteoarthritis Outcome Score (HOOS), and the Knee injury and Osteoarthritis Outcome Score (KOOS) questionnaires. Fifteen cross-sectional samples were used to evaluate the floor and ceiling effects as well as the construct and content validity. Five longitudinal cohorts were used to measure test-retest reliability and responsiveness. Standard statistical tests were applied. The floor and ceiling effects of the ROMS questionnaire in patients with shoulder, hip, and knee problems ranged from 1.3% to 8.5%. Construct-validity tests confirmed convergent and divergent validity of the ROMS. The tests also justified its additional value when the ROMS was used with joint-specific tools. When measuring test-retest reliability of the ROMS scales, intraclass correlation ranged from 0.80 to 0.90 at approximately one week and from 0.71 to 0.87 at approximately four weeks. Responsiveness of the ROMS was greater than that of the SF-12 and less than that of the joint-specific questionnaires. The ROMS is compatible with routine clinical process and has good psychometric properties in patients with shoulder, hip, and knee disorders. It can be used as a primary outcome tool for large observational studies and can supplement more specific tools in controlled studies. The ROMS was developed as a tool to measure and monitor the clinical status of the musculoskeletal system in a population of patients during and after treatment as well as over time. Copyright © 2015 by The Journal of Bone and Joint Surgery, Incorporated.
Reveles, Kelly R; Mortensen, Eric M; Koeller, Jim M; Lawson, Kenneth A; Pugh, Mary Jo V; Rumbellow, Sarah A; Argamany, Jacqueline R; Frei, Christopher R
2018-03-01
Prior studies have identified risk factors for recurrent Clostridium difficile infection (CDI), but few studies have integrated these factors into a clinical prediction rule that can aid clinical decision-making. The objectives of this study were to derive and validate a CDI recurrence prediction rule to identify patients at risk for first recurrence in a national cohort of veterans. Retrospective cohort study. Veterans Affairs Informatics and Computing Infrastructure. A total of 22,615 adult Veterans Health Administration beneficiaries with first-episode CDI between October 1, 2002, and September 30, 2014; of these patients, 7538 were assigned to the derivation cohort and 15,077 to the validation cohort. A 60-day CDI recurrence prediction rule was created in a derivation cohort using backward logistic regression. Those variables significant at p<0.01 were assigned an integer score proportional to the regression coefficient. The model was then validated in the derivation cohort and a separate validation cohort. Patients were then split into three risk categories, and rates of recurrence were described for each category. The CDI recurrence prediction rule included the following predictor variables with their respective point values: prior third- and fourth-generation cephalosporins (1 point), prior proton pump inhibitors (1 point), prior antidiarrheals (1 point), nonsevere CDI (2 points), and community-onset CDI (3 points). In the derivation cohort, the 60-day CDI recurrence risk for each score ranged from 7.5% (0 points) to 57.9% (8 points). The risk score was strongly correlated with recurrence (R 2 = 0.94). Patients were split into low-risk (0-2 points), medium-risk (3-5 points), and high-risk (6-8 points) classes and had the following recurrence rates: 8.9%, 20.2%, and 35.0%, respectively. Findings were similar in the validation cohort. Several CDI and patient-specific factors were independently associated with 60-day CDI recurrence risk. When integrated into a clinical prediction rule, higher risk scores and risk classes were strongly correlated with CDI recurrence. This clinical prediction rule can be used by providers to identify patients at high risk for CDI recurrence and help guide preventive strategy decisions, while accounting for clinical judgment. © 2018 Pharmacotherapy Publications, Inc.
Itani, Leila; Calugi, Simona; Kreidieh, Dima; El Kassas, Germine; El Masri, Dana; Tannir, Hana; Dalle Grave, Riccardo; Harfoush, Aya; El Ghoch, Marwan
2018-01-10
No specific questionnaire that evaluates Health-Related Quality Of Life (HRQOL) in individuals with obesity is available in the Arabic language. The aim of this study was therefore to propose and examine the validity and reliability of an Arabic language version of the ORWELL 97, a validated obesity-related HRQOL questionnaire. The ORWELL 97 questionnaire was translated from English to Arabic language and administered to 318 Arabic-speaking participants (106 from clinical and 212 from community samples), and underwent internal consistency, test-retest reliability, construct and discriminative validity analysis. Internal consistency and the test-retest reliability were excellent for ORWELL 97 global scores in the clinical sample. Participants with obesity displayed significantly higher ORWELL 97 scores than participants from the community sample, confirming the good discriminant validity of the questionnaire. Confirmatory factor analysis in the clinical sample revealed a good fit for a modified two-factor structure. Overall, the Arabic version of the ORWELL 97 can be considered validated in Arabic adult patients with obesity, paving the way to further assessment of its responsiveness in measuring changes in health-related quality of life associated with obesity treatment. Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.org.
Wallston, Kenneth A.; Wilkins, Consuelo H.; Hull, Pamela C.; Miller, Stephania T.
2015-01-01
Abstract Objective This study describes the development and psychometric evaluation of HPV Clinical Trial Survey for Parents with Children Aged 9 to 15 (CTSP‐HPV) using traditional instrument development methods and community engagement principles. Methods An expert panel and parental input informed survey content and parents recommended study design changes (e.g., flyer wording). A convenience sample of 256 parents completed the final survey measuring parental willingness to consent to HPV clinical trial (CT) participation and other factors hypothesized to influence willingness (e.g., HPV vaccine benefits). Cronbach's a, Spearman correlations, and multiple linear regression were used to estimate internal consistency, convergent and discriminant validity, and predictively validity, respectively. Results Internal reliability was confirmed for all scales (a ≥ 0.70.). Parental willingness was positively associated (p < 0.05) with trust in medical researchers, adolescent CT knowledge, HPV vaccine benefits, advantages of adolescent CTs (r range 0.33–0.42), supporting convergent validity. Moderate discriminant construct validity was also demonstrated. Regression results indicate reasonable predictive validity with the six scales accounting for 31% of the variance in parents’ willingness. Conclusions This instrument can inform interventions based on factors that influence parental willingness, which may lead to the eventual increase in trial participation. Further psychometric testing is warranted. PMID:26530324
Rienstra, Anne; Groot, Paul F C; Spaan, Pauline E J; Majoie, Charles B L M; Nederveen, Aart J; Walstra, Gerard J M; de Jonghe, Jos F M; van Gool, Willem A; Olabarriaga, Silvia D; Korkhov, Vladimir V; Schmand, Ben
2013-01-01
Patients with mild cognitive impairment (MCI) do not always convert to dementia. In such cases, abnormal neuropsychological test results may not validly reflect cognitive symptoms due to brain disease, and the usual brain-behavior relationships may be absent. This study examined symptom validity in a memory clinic sample and its effect on the associations between hippocampal volume and memory performance. Eleven of 170 consecutive patients (6.5%; 13% of patients younger than 65 years) referred to memory clinics showed noncredible performance on symptom validity tests (SVTs, viz. Word Memory Test and Test of Memory Malingering). They were compared to a demographically matched group (n = 57) selected from the remaining patients. Hippocampal volume, measured by an automated volumetric method (Freesurfer), was correlated with scores on six verbal memory tests. The median correlation was r = .49 in the matched group. However, the relation was absent (median r = -.11) in patients who failed SVTs. Memory clinic samples may include patients who show noncredible performance, which invalidates their MCI diagnosis. This underscores the importance of applying SVTs in evaluating patients with cognitive complaints that may signify a predementia stage, especially when these patients are relatively young.
Rostad, Hanne Marie; Utne, Inger; Grov, Ellen Karine; Puts, Martine; Halvorsrud, Liv
2017-11-02
The Doloplus-2 is a pain assessment scale for assessing pain in older adults with cognitive impairment. It is used in clinical practice and research. However, evidence for its measurement properties, feasibility and clinical utility remain incomplete. This systematic review synthesizes previous research on the measurement properties, feasibility and clinical utility of the scale. We conducted a systematic search in three databases (CINAHL, Medline and PsycINFO) for studies published in English, French, German, Dutch/Flemish or a Scandinavian language between 1990 and April 2017. We also reviewed the Doloplus-2 homepage and reference lists of included studies to supplement our search. Two reviewers independently reviewed titles and abstracts and performed the quality assessment and data abstraction. A total of 24 studies were included in this systematic review. The quality of the studies varied, but many lacked sufficient detail about the samples and response rates. The Doloplus-2 has been studied using diverse samples in a variety of settings; most study participants were in long-term care and in people with dementia. Sixteen studies addressed various aspects of the scale's feasibility and clinical utility, but their results are limited and inconsistent across settings and samples. Support for the scale's reliability, validity and responsiveness varied widely across the studies. Generally, the reliability coefficients reached acceptable benchmarks, but the evidence for different aspects of the scale's validity and responsiveness was incomplete. Additional high-quality studies are warranted to determine in which populations of older adults with cognitive impairment the Doloplus-2 is reliable, valid and feasible. The ability of the Doloplus-2 to meaningfully quantify pain, measure treatment response and improve patient outcomes also needs further investigation. PROSPERO reg. no.: CRD42016049697 registered 20. Oct. 2016.
Wongpakaran, Tinakon; Wongpakaran, Nahathai
2012-01-01
This study seeks to investigate the psychometric properties of the short version of the revised 'Experience of Close Relationships' questionnaire, comparing non-clinical and clinical samples. In total 702 subjects participated in this study, of whom 531 were non-clinical participants and 171 were psychiatric patients. They completed the short version of the revised 'Experience of Close Relationships' questionnaire (ECR-R-18), the Perceived Stress Scale-10(PSS-10), the Rosenberg Self-Esteem Scale (RSES) and the UCLA Loneliness scale. A retest of the ECR-R-18 was then performed at four-week intervals. Then, confirmatory factor analyses were performed to test the validity of the new scale. The ECR-R-18 showed a fair to good internal consistency (α 0.77 to 0.87) for both samples, and the test-retest reliability was found to be satisfactory (ICC = 0.75). The anxiety sub-scale demonstrated concurrent validity with PSS-10 and RSES, while the avoidance sub-scale showed concurrent validity with the UCLA Loneliness Scale. Confirmatory factor analysis using method factors yielded two factors with an acceptable model fit for both groups. An invariance test revealed that the ECR-R-18 when used on the clinical group differed from when used with the non-clinical group. The ECR-R-18 questionnaire revealed an overall better level of fit than the original 36 item questionnaire, indicating its suitability for use with a broader group of samples, including clinical samples. The reliability of the ECR-R- 18 might be increased if a modified scoring system is used and if our suggestions with regard to future studies are followed up.
Wong, H M; Chow, L Y
2011-06-01
Borderline personality disorder is an important but under-recognised clinical entity, for which there are only a few available diagnostic instruments in the Chinese language. None has been tested for its psychometric properties in the Cantonese-speaking population in Hong Kong. The present study aimed to assess the validity of the Chinese version of the Borderline Personality Disorder subscale of the Structured Clinical Interview for the Diagnostic and Statistical Manual of Mental Disorders Axis II Personality Disorders (SCID-II) in Cantonese-speaking Hong Kong Chinese. A convenience sampling method was used. The subjects were seen by a multidisciplinary clinical team, who arrived at a best-estimate diagnosis and then by application of the SCID-II rater using the Chinese version of the Borderline Personality Disorder subscale. The study was carried out at the psychiatric clinic of the Prince of Wales Hospital in Hong Kong. A total of 87 patients of Chinese ethnicity aged 18 to 64 years who attended the clinic in April 2007 were recruited. The aforementioned patient parameters were used to examine the internal consistency, best-estimate clinical diagnosis-SCID diagnosis agreement, sensitivity, and specificity of the Chinese version of the subscale. The Borderline Personality Disorder subscale (Chinese version) of SCID-II had an internal consistency of 0.82 (Cronbach's alpha coefficient), best-estimate clinical diagnosis-SCID diagnosis agreement of 0.82 (kappa), sensitivity of 0.92, and specificity of 0.94. The Borderline Personality Disorder subscale (Chinese version) of the SCID-II rater had reasonable validity when applied to Cantonese-speaking Chinese subjects in Hong Kong.
Hijazi, Ziad; Oldgren, Jonas; Lindbäck, Johan; Alexander, John H; Connolly, Stuart J; Eikelboom, John W; Ezekowitz, Michael D; Held, Claes; Hylek, Elaine M; Lopes, Renato D; Yusuf, Salim; Granger, Christopher B; Siegbahn, Agneta; Wallentin, Lars
2018-01-01
Abstract Aims In atrial fibrillation (AF), mortality remains high despite effective anticoagulation. A model predicting the risk of death in these patients is currently not available. We developed and validated a risk score for death in anticoagulated patients with AF including both clinical information and biomarkers. Methods and results The new risk score was developed and internally validated in 14 611 patients with AF randomized to apixaban vs. warfarin for a median of 1.9 years. External validation was performed in 8548 patients with AF randomized to dabigatran vs. warfarin for 2.0 years. Biomarker samples were obtained at study entry. Variables significantly contributing to the prediction of all-cause mortality were assessed by Cox-regression. Each variable obtained a weight proportional to the model coefficients. There were 1047 all-cause deaths in the derivation and 594 in the validation cohort. The most important predictors of death were N-terminal pro B-type natriuretic peptide, troponin-T, growth differentiation factor-15, age, and heart failure, and these were included in the ABC (Age, Biomarkers, Clinical history)-death risk score. The score was well-calibrated and yielded higher c-indices than a model based on all clinical variables in both the derivation (0.74 vs. 0.68) and validation cohorts (0.74 vs. 0.67). The reduction in mortality with apixaban was most pronounced in patients with a high ABC-death score. Conclusion A new biomarker-based score for predicting risk of death in anticoagulated AF patients was developed, internally and externally validated, and well-calibrated in two large cohorts. The ABC-death risk score performed well and may contribute to overall risk assessment in AF. ClinicalTrials.gov identifier NCT00412984 and NCT00262600 PMID:29069359
Badia, X; Mascaró, J M; Lozano, R
1999-10-01
The aim of this study was to assess the feasibility, validity, reliability and sensitivity to change of a Spanish version of the Dermatology Life Quality Index (DLQI) in patients with mild to moderate eczema and psoriasis who were treated with topical corticosteroids. The final study sample comprised 237 patients (48% eczema). Discriminant validity was tested by comparing patients' scores with those of a random sample of the general population (n = 100), and convergent validity by analysing correlations between DLQI scores, measures of clinical severity, and domain scores on the Nottingham Health Profile (NHP). Internal consistency and test-retest reliability were tested in clinically stable patients (n = 94), and responsiveness in a clinically unstable group (n = 143) initiating treatment with topical corticosteroids. Patient scores were significantly higher than general population scores (4.3 vs. 0. 27, P < 0.001). Correlations with NHP domains ranged from 0.12 to 0. 32, and there was significant correlation with clinical measures (r = 0.26, P < 0.001). Reliability was good (Cronbach's alpha = 0.83; intraclass correlation coefficient = 0.88), and the instrument proved responsive to change (effect size for the total group of de novo patients = 0.70), though the great majority of changes occurred in items 1 and 2. The NHP Emotional Reactions and Mobility domains were more responsive than some DLQI domains. In clinical trials of treatments for mild to moderate eczema and psoriasis, it is likely that only items 1 and 2 of the DLQI will be needed, and it is probably advisable to include generic instruments alongside the DLQI.
Pietsch, Torsten; Schmidt, Rene; Remke, Marc; Korshunov, Andrey; Hovestadt, Volker; Jones, David TW; Felsberg, Jörg; Kaulich, Kerstin; Goschzik, Tobias; Kool, Marcel; Northcott, Paul A.; von Hoff, Katja; von Bueren, André O.; Friedrich, Carsten; Skladny, Heyko; Fleischhack, Gudrun; Taylor, Michael D.; Cremer, Friedrich; Lichter, Peter; Faldum, Andreas; Reifenberger, Guido; Rutkowski, Stefan; Pfister, Stefan M.
2014-01-01
BACKGROUND: This study aimed to prospectively evaluate clinical, histopathological and molecular variables for outcome prediction in medulloblastoma patients. METHODS: Patients from the HIT2000 cooperative clinical trial were prospectively enrolled based on the availability of sufficient tumor material and complete clinical information. This revealed a cohort of 184 patients (median age 7.6 years), which was randomly split at a 2:1 ratio into a training (n = 127), and a validation (n = 57) dataset. All samples were subjected to thorough histopathological investigation, CTNNB1 mutation analysis, quantitative PCR, MLPA and FISH analyses for cytogenetic variables, and methylome analysis. RESULTS: By univariable analysis, clinical factors (M-stage), histopathological variables (large cell component, endothelial proliferation, synaptophysin pattern), and molecular features (chromosome 6q status, MYC amplification, TOP2A copy-number, subgrouping) were found to be prognostic. Molecular consensus subgrouping (WNT, SHH, Group 3, Group 4) was validated as an independent feature to stratify patients into different risk groups. When comparing methods for the identification of WNT-driven medulloblastoma, this study identified CTNNB1 sequencing and methylation profiling to most reliably identify these patients. After removing patients with particularly favorable (CTNNB1 mutation, extensive nodularity) or unfavorable (MYC amplification) markers, a risk score for the remaining “intermediate molecular risk” population dependent on age, M-stage, pattern of synaptophysin expression, and MYCN copy-number status was identified and validated, with speckled synaptophysin expression indicating worse outcome. CONCLUSIONS: Methylation subgrouping and CTNNB1 mutation status represent robust tools for the risk-stratification of medulloblastoma. A simple clinico-pathological risk score for “intermediate molecular risk” patients was identified, which deserves further validation. SECONDARY CATEGORY: Pediatrics.
Vasli, Parvaneh; Dehghan-Nayeri, Nahid; Khosravi, Laleh
2018-01-01
Despite the emphasis placed on the implementation of continuing professional education programs in Iran, researchers or practitioners have not developed an instrument for assessing the factors that affect the knowledge transfer from such programs to clinical practice. The aim of this study was to design and validate such instrument for the Iranian context. The research used a three-stage mix method. In the first stage, in-depth interviews with nurses and content analysis were conducted, after which themes were extracted from the data. In the second stage, the findings of the content analysis and literature review were examined, and preliminary instrument options were developed. In the third stage, qualitative content validity, face validity, content validity ratio, content validity index, and construct validity using exploratory factor analysis was conducted. The reliability of the instrument was measured before and after the determination of construct validity. Primary tool instrument initially comprised 53 items, and its content validity index was 0.86. In the multi-stage factor analysis, eight questions were excluded, thereby reducing 11 factors to five and finally, to four. The final instrument with 43 items consists of the following dimensions: structure and organizational climate, personal characteristics, nature and status of professionals, and nature of educational programs. Managers can use the Iranian instrument to identify factors affecting knowledge transfer of continuing professional education to clinical practice. Copyright © 2017. Published by Elsevier Ltd.
Sertdemir, Y; Burgut, R
2009-01-01
In recent years the use of surrogate end points (S) has become an interesting issue. In clinical trials, it is important to get treatment outcomes as early as possible. For this reason there is a need for surrogate endpoints (S) which are measured earlier than the true endpoint (T). However, before a surrogate endpoint can be used it must be validated. For a candidate surrogate endpoint, for example time to recurrence, the validation result may change dramatically between clinical trials. The aim of this study is to show how the validation criterion (R(2)(trial)) proposed by Buyse et al. are influenced by the magnitude of treatment effect with an application using real data. The criterion R(2)(trial) proposed by Buyse et al. (2000) is applied to the four data sets from colon cancer clinical trials (C-01, C-02, C-03 and C-04). Each clinical trial is analyzed separately for treatment effect on survival (true endpoint) and recurrence free survival (surrogate endpoint) and this analysis is done also for each center in each trial. Results are used for standard validation analysis. The centers were grouped by the Wald statistic in 3 equal groups. Validation criteria R(2)(trial) were 0.641 95% CI (0.432-0.782), 0.223 95% CI (0.008-0.503), 0.761 95% CI (0.550-0.872) and 0.560 95% CI (0.404-0.687) for C-01, C-02, C-03 and C-04 respectively. The R(2)(trial) criteria changed by the Wald statistics observed for the centers used in the validation process. Higher the Wald statistic groups are higher the R(2)(trial) values observed. The recurrence free survival is not a good surrogate for overall survival in clinical trials with non significant treatment effects and moderate for significant treatment effects. This shows that the level of significance of treatment effect should be taken into account in validation process of surrogate endpoints.
Odegaard, Justin I; Vincent, John J; Mortimer, Stefanie; Vowles, James V; Ulrich, Bryan C; Banks, Kimberly C; Fairclough, Stephen R; Zill, Oliver A; Sikora, Marcin; Mokhtari, Reza; Abdueva, Diana; Nagy, Rebecca J; Lee, Christine E; Kiedrowski, Lesli A; Paweletz, Cloud P; Eltoukhy, Helmy; Lanman, Richard B; Chudova, Darya I; Talasaz, AmirAli
2018-04-24
Purpose: To analytically and clinically validate a circulating cell-free tumor DNA sequencing test for comprehensive tumor genotyping and demonstrate its clinical feasibility. Experimental Design: Analytic validation was conducted according to established principles and guidelines. Blood-to-blood clinical validation comprised blinded external comparison with clinical droplet digital PCR across 222 consecutive biomarker-positive clinical samples. Blood-to-tissue clinical validation comprised comparison of digital sequencing calls to those documented in the medical record of 543 consecutive lung cancer patients. Clinical experience was reported from 10,593 consecutive clinical samples. Results: Digital sequencing technology enabled variant detection down to 0.02% to 0.04% allelic fraction/2.12 copies with ≤0.3%/2.24-2.76 copies 95% limits of detection while maintaining high specificity [prevalence-adjusted positive predictive values (PPV) >98%]. Clinical validation using orthogonal plasma- and tissue-based clinical genotyping across >750 patients demonstrated high accuracy and specificity [positive percent agreement (PPAs) and negative percent agreement (NPAs) >99% and PPVs 92%-100%]. Clinical use in 10,593 advanced adult solid tumor patients demonstrated high feasibility (>99.6% technical success rate) and clinical sensitivity (85.9%), with high potential actionability (16.7% with FDA-approved on-label treatment options; 72.0% with treatment or trial recommendations), particularly in non-small cell lung cancer, where 34.5% of patient samples comprised a directly targetable standard-of-care biomarker. Conclusions: High concordance with orthogonal clinical plasma- and tissue-based genotyping methods supports the clinical accuracy of digital sequencing across all four types of targetable genomic alterations. Digital sequencing's clinical applicability is further supported by high rates of technical success and biomarker target discovery. Clin Cancer Res; 1-11. ©2018 AACR. ©2018 American Association for Cancer Research.
Psallidas, Ioannis; Kanellakis, Nikolaos I; Gerry, Stephen; Thézénas, Marie Laëtitia; Charles, Philip D; Samsonova, Anastasia; Schiller, Herbert B; Fischer, Roman; Asciak, Rachelle; Hallifax, Robert J; Mercer, Rachel; Dobson, Melissa; Dong, Tao; Pavord, Ian D; Collins, Gary S; Kessler, Benedikt M; Pass, Harvey I; Maskell, Nick; Stathopoulos, Georgios T; Rahman, Najib M
2018-06-13
The prevalence of malignant pleural effusion is increasing worldwide, but prognostic biomarkers to plan treatment and to understand the underlying mechanisms of disease progression remain unidentified. The PROMISE study was designed with the objectives to discover, validate, and prospectively assess biomarkers of survival and pleurodesis response in malignant pleural effusion and build a score that predicts survival. In this multicohort study, we used five separate and independent datasets from randomised controlled trials to investigate potential biomarkers of survival and pleurodesis. Mass spectrometry-based discovery was used to investigate pleural fluid samples for differential protein expression in patients from the discovery group with different survival and pleurodesis outcomes. Clinical, radiological, and biological variables were entered into least absolute shrinkage and selection operator regression to build a model that predicts 3-month mortality. We evaluated the model using internal and external validation. 17 biomarker candidates of survival and seven of pleurodesis were identified in the discovery dataset. Three independent datasets (n=502) were used for biomarker validation. All pleurodesis biomarkers failed, and gelsolin, macrophage migration inhibitory factor, versican, and tissue inhibitor of metalloproteinases 1 (TIMP1) emerged as accurate predictors of survival. Eight variables (haemoglobin, C-reactive protein, white blood cell count, Eastern Cooperative Oncology Group performance status, cancer type, pleural fluid TIMP1 concentrations, and previous chemotherapy or radiotherapy) were validated and used to develop a survival score. Internal validation with bootstrap resampling and external validation with 162 patients from two independent datasets showed good discrimination (C statistic values of 0·78 [95% CI 0·72-0·83] for internal validation and 0·89 [0·84-0·93] for external validation of the clinical PROMISE score). To our knowledge, the PROMISE score is the first prospectively validated prognostic model for malignant pleural effusion that combines biological and clinical parameters to accurately estimate 3-month mortality. It is a robust, clinically relevant prognostic score that can be applied immediately, provide important information on patient prognosis, and guide the selection of appropriate management strategies. European Respiratory Society, Medical Research Funding-University of Oxford, Slater & Gordon Research Fund, and Oxfordshire Health Services Research Committee Research Grants. Copyright © 2018 Elsevier Ltd. All rights reserved.
Critical thinking of student nurses during clinical accompaniment.
Uys, B Y; Meyer, S M
2005-08-01
The purpose of this study was to investigate the methods of clinical accompaniment used by clinical facilitators in practice. The findings of the study also reflected facilitators' perceptions regarding critical thinking and the facilitation thereof. A quantitative research design was used. A literature study was conducted to identify the methods of accompaniment that facilitate critical thinking. Data was collected by means of a questionnaire developed for that purpose. Making a content-related validity judgment, and involving seven clinical facilitators in an academic institution, ensured the validity of the questionnaire. The results of the study indicated that various clinical methods of accompaniment were used. To a large extent, these methods correlated with those discussed in the literature review. The researcher further concluded that the concepts 'critical thinking' and 'facilitation' were not interpreted correctly by the respondents, and would therefore not be implemented in a proper manner in nursing practice. Furthermore, it seemed evident that tutor-driven learning realised more often than student-driven learning. In this regard, the requirement of outcomes-based education was not satisfied. The researcher is therefore of the opinion that a practical programme for the development of critical thinking skills during clinical accompaniment must be developed within the framework of outcomes-based education.
McDonald, Craig M; Henricson, Erik K; Abresch, R Ted; Florence, Julaine; Eagle, Michelle; Gappmaier, Eduard; Glanzman, Allan M; Spiegel, Robert; Barth, Jay; Elfring, Gary; Reha, Allen; Peltz, Stuart W
2013-01-01
Introduction: An international clinical trial enrolled 174 ambulatory males ≥5 years old with nonsense mutation Duchenne muscular dystrophy (nmDMD). Pretreatment data provide insight into reliability, concurrent validity, and minimal clinically important differences (MCIDs) of the 6-minute walk test (6MWT) and other endpoints. Methods: Screening and baseline evaluations included the 6-minute walk distance (6MWD), timed function tests (TFTs), quantitative strength by myometry, the PedsQL, heart rate–determined energy expenditure index, and other exploratory endpoints. Results: The 6MWT proved feasible and reliable in a multicenter context. Concurrent validity with other endpoints was excellent. The MCID for 6MWD was 28.5 and 31.7 meters based on 2 statistical distribution methods. Conclusions: The ratio of MCID to baseline mean is lower for 6MWD than for other endpoints. The 6MWD is an optimal primary endpoint for Duchenne muscular dystrophy (DMD) clinical trials that are focused therapeutically on preservation of ambulation and slowing of disease progression. Muscle Nerve 48: 357–368, 2013 PMID:23674289
MacDonald, James; Wilson, Julie; Young, Julie; Duerson, Drew; Swisher, Gail; Collins, Christy L; Meehan, William P
2015-01-01
A common sequela of concussions is impaired reaction time. Computerized neurocognitive tests commonly measure reaction time. A simple clinical test for reaction time has been studied previously in college athletes; whether this test is valid and reliable when assessing younger athletes remains unknown. Our study examines the reliability and validity of this test in a population of high school athletes. Cross-sectional study. Two American High Schools. High school athletes (N = 448) participating in American football or soccer during the academic years 2011 to 2012 and 2012 to 2013. All study participants completed a computerized baseline neurocognitive assessment that included a measure of reaction time (RT comp), in addition to a clinical measure of reaction time that assessed how far a standard measuring device would fall prior to the athlete catching it (RT clin). Validity was assessed by determining the correlation between RT clin and RT comp. Reliability was assessed by measuring the intraclass correlation coefficients (ICCs) between the repeated measures of RT clin and RT comp taken 1 year apart. In the first year of study, RT clin and RT comp were positively but weakly correlated (rs = 0.229, P < 0.001). In the second year, there was no significant correlation between RT clin and RT comp (rs = 0.084, P = 0.084). Both RT clin [ICC = 0.608; 95% confidence interval (CI), 0.434-0.728] and RT comp (ICC = 0.691; 95% CI, 0.554-0.786) had marginal reliability. In a population of high school athletes, RT clin had poor validity when compared with RT comp as a standard. Both RT clin and RT comp had marginal test-retest reliability. Before considering the clinical use of RT clin in the assessment of sport-related concussions sustained by high school athletes, the factors affecting reliability and validity should be investigated further. Reaction time impairment commonly results from concussion and is among the most clinically important measures of the condition. The device evaluated in this study has previously been investigated as a reaction time measure in college athletes. This study investigates the clinical generalizability of the device in a younger population. A video abstract showing how the RT clin device is used in practice is available as Supplemental Digital Content 1, http://links.lww.com/JSM/A43.
Faurholt-Jepsen, Maria; Munkholm, Klaus; Frost, Mads; Bardram, Jakob E; Kessing, Lars Vedel
2016-01-15
Various paper-based mood charting instruments are used in the monitoring of symptoms in bipolar disorder. During recent years an increasing number of electronic self-monitoring tools have been developed. The objectives of this systematic review were 1) to evaluate the validity of electronic self-monitoring tools as a method of evaluating mood compared to clinical rating scales for depression and mania and 2) to investigate the effect of electronic self-monitoring tools on clinically relevant outcomes in bipolar disorder. A systematic review of the scientific literature, reported according to the Preferred Reporting items for Systematic Reviews and Meta-Analysis (PRISMA) guidelines was conducted. MEDLINE, Embase, PsycINFO and The Cochrane Library were searched and supplemented by hand search of reference lists. Databases were searched for 1) studies on electronic self-monitoring tools in patients with bipolar disorder reporting on validity of electronically self-reported mood ratings compared to clinical rating scales for depression and mania and 2) randomized controlled trials (RCT) evaluating electronic mood self-monitoring tools in patients with bipolar disorder. A total of 13 published articles were included. Seven articles were RCTs and six were longitudinal studies. Electronic self-monitoring of mood was considered valid compared to clinical rating scales for depression in six out of six studies, and in two out of seven studies compared to clinical rating scales for mania. The included RCTs primarily investigated the effect of heterogeneous electronically delivered interventions; none of the RCTs investigated the sole effect of electronic mood self-monitoring tools. Methodological issues with risk of bias at different levels limited the evidence in the majority of studies. Electronic self-monitoring of mood in depression appears to be a valid measure of mood in contrast to self-monitoring of mood in mania. There are yet few studies on the effect of electronic self-monitoring of mood in bipolar disorder. The evidence of electronic self-monitoring is limited by methodological issues and by a lack of RCTs. Although the idea of electronic self-monitoring of mood seems appealing, studies using rigorous methodology investigating the beneficial as well as possible harmful effects of electronic self-monitoring are needed.
Sturt, Jackie; Taylor, Hafrun; Docherty, Andrea; Dale, Jeremy; Louise, Taylor
2006-01-01
Background The objectives of this study were twofold (i) to develop the Diabetes Manual, a self-management educational intervention aimed at improving biomedical and psychosocial outcomes (ii) to produce early phase evidence relating to validity and clinical feasibility to inform future research and systematic reviews. Methods Using the UK Medical Research Council's complex intervention framework, the Diabetes Manual and associated self management interventions were developed through pre-clinical, and phase I evaluation phases guided by adult-learning and self-efficacy theories, clinical feasibility and health policy protocols. A qualitative needs assessment and an RCT contributed data to the pre-clinical phase. Phase I incorporated intervention development informed by the pre-clinical phase and a feasibility survey. Results The pre-clinical and phase I studies resulted in the production in the Diabetes Manual programme for trial evaluation as delivered within routine primary care consultations. Conclusion This complex intervention shows early feasibility and face validity for both diabetes health professionals and people with diabetes. Randomised trial will determine effectiveness against clinical and psychological outcomes. Further study of some component parts, delivered in alternative combinations, is recommended. PMID:17129376
Block, Darci R; Algeciras-Schimnich, Alicia
2013-01-01
Requests for testing various analytes in serous fluids (e.g., pleural, peritoneal, pericardial effusions) are submitted daily to clinical laboratories. Testing of these fluids deviates from assay manufacturers' specifications, as most laboratory assays are optimized for testing blood or urine specimens. These requests add a burden to clinical laboratories, which need to validate assay performance characteristics in these fluids to exclude matrix interferences (given the different composition of body fluids) while maintaining regulatory compliance. Body fluid testing for a number of analytes has been reported in the literature; however, understanding the clinical utility of these analytes is critical because laboratories must address the analytic and clinical validation requirements, while educating clinicians on proper test utilization. In this article, we review the published data to evaluate the clinical utility of testing for numerous analytes in body fluid specimens. We also highlight the pre-analytic and analytic variables that need to be considered when reviewing published studies in body fluid testing. Finally, we provide guidance on how published studies might (or might not) guide interpretation of test results in today's clinical laboratories.
Assessing clinical competency in the health sciences
NASA Astrophysics Data System (ADS)
Panzarella, Karen Joanne
To test the success of integrated curricula in schools of health sciences, meaningful measurements of student performance are required to assess clinical competency. This research project analyzed a new performance assessment tool, the Integrated Standardized Patient Examination (ISPE), for assessing clinical competency: specifically, to assess Doctor of Physical Therapy (DPT) students' clinical competence as the ability to integrate basic science knowledge with clinical communication skills. Thirty-four DPT students performed two ISPE cases, one of a patient who sustained a stroke and the other a patient with a herniated lumbar disc. Cases were portrayed by standardized patients (SPs) in a simulated clinical setting. Each case was scored by an expert evaluator in the exam room and then by one investigator and the students themselves via videotape. The SPs scored each student on an overall encounter rubric. Written feedback was obtained from all participants in the study. Acceptable reliability was demonstrated via inter-rater agreement as well as inter-rater correlations on items that used a dichotomous scale, whereas the items requiring the use of the 4-point rubric were somewhat less reliable. For the entire scale both cases had a significant correlation between the Expert-Investigator pair of raters, for the CVA case r = .547, p < .05 and for the HD case r = .700, p < .01. The SPs scored students higher than the other raters. Students' self-assessments were most closely aligned with the investigator. Effects were apparent due to case. Content validity was gathered in the process of developing cases and patient scenarios that were used in this study. Construct validity was obtained from the survey results analyzed from the experts and students. Future studies should examine the effect of rater training upon the reliability. Criterion or predictive validity could be further studied by comparing students' performances on the ISPE with other independent estimates of students' competence. The unique integration questions of the ISPE were judged to have good content validity from experts and students, suggestive that integration, a most crucial element of clinical competence, while done in the mind of the student, can be practiced, learned and assessed.
Argilés, Josep M.; Betancourt, Angelica; Guàrdia-Olmos, Joan; Peró-Cebollero, Maribel; López-Soriano, Francisco J.; Madeddu, Clelia; Serpe, Roberto; Busquets, Sílvia
2017-01-01
The CAchexia SCOre (CASCO) was described as a tool for the staging of cachectic cancer patients. The aim of this study is to show the metric properties of CASCO in order to classify cachectic cancer patients into three different groups, which are associated with a numerical scoring. The final aim was to clinically validate CASCO for its use in the classification of cachectic cancer patients in clinical practice. We carried out a case -control study that enrolled prospectively 186 cancer patients and 95 age-matched controls. The score includes five components: (1) body weight loss and composition, (2) inflammation/metabolic disturbances/immunosuppression, (3) physical performance, (4) anorexia, and (5) quality of life. The present study provides clinical validation for the use of the score. In order to show the metric properties of CASCO, three different groups of cachectic cancer patients were established according to the results obtained with the statistical approach used: mild cachexia (15 ≤ × ≤ 28), moderate cachexia (29 ≤ × ≤ 46), and severe cachexia (47 ≤ × ≤ 100). In addition, a simplified version of CASCO, MiniCASCO (MCASCO), was also presented and it contributes as a valid and easy-to-use tool for cachexia staging. Significant statistically correlations were found between CASCO and other validated indexes such as Eastern Cooperative Oncology Group (ECOG) and the subjective diagnosis of cachexia by specialized oncologists. A very significant estimated correlation between CASCO and MCASCO was found that suggests that MCASCO might constitute an easy and valid tool for the staging of the cachectic cancer patients. CASCO and MCASCO provide a new tool for the quantitative staging of cachectic cancer patients with a clear advantage over previous classifications. PMID:28261113
Argilés, Josep M; Betancourt, Angelica; Guàrdia-Olmos, Joan; Peró-Cebollero, Maribel; López-Soriano, Francisco J; Madeddu, Clelia; Serpe, Roberto; Busquets, Sílvia
2017-01-01
The CAchexia SCOre (CASCO) was described as a tool for the staging of cachectic cancer patients. The aim of this study is to show the metric properties of CASCO in order to classify cachectic cancer patients into three different groups, which are associated with a numerical scoring. The final aim was to clinically validate CASCO for its use in the classification of cachectic cancer patients in clinical practice. We carried out a case -control study that enrolled prospectively 186 cancer patients and 95 age-matched controls. The score includes five components: (1) body weight loss and composition, (2) inflammation/metabolic disturbances/immunosuppression, (3) physical performance, (4) anorexia, and (5) quality of life. The present study provides clinical validation for the use of the score. In order to show the metric properties of CASCO, three different groups of cachectic cancer patients were established according to the results obtained with the statistical approach used: mild cachexia (15 ≤ × ≤ 28), moderate cachexia (29 ≤ × ≤ 46), and severe cachexia (47 ≤ × ≤ 100). In addition, a simplified version of CASCO, MiniCASCO (MCASCO), was also presented and it contributes as a valid and easy-to-use tool for cachexia staging. Significant statistically correlations were found between CASCO and other validated indexes such as Eastern Cooperative Oncology Group (ECOG) and the subjective diagnosis of cachexia by specialized oncologists. A very significant estimated correlation between CASCO and MCASCO was found that suggests that MCASCO might constitute an easy and valid tool for the staging of the cachectic cancer patients. CASCO and MCASCO provide a new tool for the quantitative staging of cachectic cancer patients with a clear advantage over previous classifications.
Bennett, R J; Jayakody, D M P; Eikelboom, R H; Taljaard, D S; Atlas, M D
2016-02-01
To investigate the ability of cochlear implant (CI) recipients to physically handle and care for their hearing implant device(s) and to identify factors that may influence skills. To assess device management skills, a clinical survey was developed and validated on a clinical cohort of CI recipients. Survey development and validation. A prospective convenience cohort design study. Specialist hearing implant clinic. Forty-nine post-lingually deafened, adult CI recipients, at least 12 months postoperative. Survey test-retest reliability, interobserver reliability and responsiveness. Correlations between management skills and participant demographic, audiometric, clinical outcomes and device factors. The Cochlear Implant Management Skills survey was developed, demonstrating high test-retest reliability (0.878), interobserver reliability (0.972) and responsiveness to intervention (skills training) [t(20) = -3.913, P = 0.001]. Cochlear Implant Management Skills survey scores range from 54.69% to 100% (mean: 83.45%, sd: 12.47). No associations were found between handling skills and participant factors. This is the first study to demonstrate a range in cochlear implant device handling skills in CI recipients and offers clinicians and researchers a tool to systematically and objectively identify shortcomings in CI recipients' device handling skills. © 2015 John Wiley & Sons Ltd.
Cropp, Carola; Salzer, Simone; Häusser, Leonard F; Streeck-Fischer, Annette
2013-01-01
The axis structure of the Operationalized Psychodynamic Diagnostics in childhood and adolescence (OPD-CA) has proven to be a reliable and valid diagnostic tool under research conditions. However, corresponding data regarding the integration of OPD-CA axis structure into clinical practice is still lacking. Hence, this aspect was examined as part of a randomized controlled clinical trial realized at Asklepios Fachklinikum Tiefenbrunn. Here, the OPD-CA axis structure has been applied to assess the structural level of 42 adolescent patients (15-19 years). In contrast to previous studies, the assessment was not carried out by independent raters using a videotaped OPD-CA interview, but the rating was part of clinical routine procedures. Also under these conditions, inter-rater reliability was high, in particular regarding the four subscales of the OPD-CA axis structure. With respect to construct validity, the results of our study supported a two-factor solution, which is in accordance with the findings of two previous works. One factor corresponded to the dimension "self-regulation" while the other factor included both the dimension "self-perception and object perception" as well as the dimension "communication skills". Implications of the findings for research and practice are discussed.
A systematic review of clinical assessment for undergraduate nursing students.
Wu, Xi Vivien; Enskär, Karin; Lee, Cindy Ching Siang; Wang, Wenru
2015-02-01
Consolidated clinical practicum prepares pre-registration nursing students to function as beginning practitioners. The clinical competencies of final-year nursing students provide a key indication of professional standards of practice and patient safety. Thus, clinical assessment of nursing students is a crucial issue for educators and administrators. The aim of this systematic review was to explore the clinical competency assessment for undergraduate nursing students. PubMed, CINAHL, ScienceDirect, Web of Science, and EBSCO were systematically searched from January 2000 to December 2013. The systematic review was in line with the Preferred Reporting Items for Systematic Reviews and Meta-Analyses guidelines. Published quantitative and qualitative studies that examined clinical assessment practices and tools used in clinical nursing education were retrieved. Quality assessment, data extraction, and analysis were completed on all included studies. This review screened 2073 titles, abstracts and full-text records, resulting in 33 included studies. Two reviewers assessed the quality of the included studies. Fourteen quantitative and qualitative studies were identified for this evaluation. The evidence was ordered into emergent themes; the overarching themes were current practices in clinical assessment, issues of learning and assessment, development of assessment tools, and reliability and validity of assessment tools. There is a need to develop a holistic clinical assessment tool with reasonable level of validity and reliability. Clinical assessment is a robust activity and requires collaboration between clinical partners and academia to enhance the clinical experiences of students, the professional development of preceptors, and the clinical credibility of academics. Copyright © 2014 Elsevier Ltd. All rights reserved.
Validity of juvenile idiopathic arthritis diagnoses using administrative health data.
Stringer, Elizabeth; Bernatsky, Sasha
2015-03-01
Administrative health databases are valuable sources of data for conducting research including disease surveillance, outcomes research, and processes of health care at the population level. There has been limited use of administrative data to conduct studies of pediatric rheumatic conditions and no studies validating case definitions in Canada. We report a validation study of incident cases of juvenile idiopathic arthritis in the Canadian province of Nova Scotia. Cases identified through administrative data algorithms were compared to diagnoses in a clinical database. The sensitivity of algorithms that included pediatric rheumatology specialist claims was 81-86%. However, 35-48% of cases that were identified could not be verified in the clinical database depending on the algorithm used. Our case definitions would likely lead to overestimates of disease burden. Our findings may be related to issues pertaining to the non-fee-for-service remuneration model in Nova Scotia, in particular, systematic issues related to the process of submitting claims.
Cavalcante, Tahissa Frota; de Araújo, Thelma Leite; Moreira, Rafaella Pessoa; Guedes, Nirla Gomes; Lopes, Marcos Venicios de Oliveira; da Silva, Viviane Martins
2013-01-01
the study's objective was the clinical validation of the nursing diagnosis Risk for Aspiration among patients who experienced cerebrovascular accidents (CVA). a prospective cohort study was conducted with 24 patients hospitalized due to a CVA. The instrument used to collect the data addressed the risk factors for respiratory aspiration, validated by concept analysis and by experts. the most frequent risk factors for respiratory aspiration were: dysphagia (54.2%) and impaired physical mobility (41.7%). The prevalence of the nursing diagnosis Risk for Aspiration was 58.3% and the prevalence of respiratory aspiration over the span of 48 hours (monitoring period) was 37.5%. Risk factors for dysphagia and impaired physical mobility were significantly associated with respiratory aspiration. the risk factors dysphagia and impaired physical mobility are good predictors of the nursing diagnosis Risk for Aspiration. This study contributed to improving the NANDA-I Taxonomy and the systematization of the nursing process.
Lange, Rael T; Brickell, Tracey A; Lippa, Sara M; French, Louis M
2015-01-01
The purpose of this study was to examine the clinical utility of three recently developed validity scales (Validity-10, NIM5, and LOW6) designed to screen for symptom exaggeration using the Neurobehavioral Symptom Inventory (NSI). Participants were 272 U.S. military service members who sustained a mild, moderate, severe, or penetrating traumatic brain injury (TBI) and who were evaluated by the neuropsychology service at Walter Reed Army Medical Center within 199 weeks post injury. Participants were divided into two groups based on the Negative Impression Management scale of the Personality Assessment Inventory: (a) those who failed symptom validity testing (SVT-fail; n = 27) and (b) those who passed symptom validity testing (SVT-pass; n = 245). Participants in the SVT-fail group had significantly higher scores (p<.001) on the Validity-10, NIM5, LOW6, NSI total, and Personality Assessment Inventory (PAI) clinical scales (range: d = 0.76 to 2.34). Similarly high sensitivity, specificity, positive predictive power (PPP), and negative predictive (NPP) values were found when using all three validity scales to differentiate SVT-fail versus SVT-pass groups. However, the Validity-10 scale consistently had the highest overall values. The optimal cutoff score for the Validity-10 scale to identify possible symptom exaggeration was ≥19 (sensitivity = .59, specificity = .89, PPP = .74, NPP = .80). For the majority of people, these findings provide support for the use of the Validity-10 scale as a screening tool for possible symptom exaggeration. When scores on the Validity-10 exceed the cutoff score, it is recommended that (a) researchers and clinicians do not interpret responses on the NSI, and (b) clinicians follow up with a more detailed evaluation, using well-validated symptom validity measures (e.g., Minnesota Multiphasic Personality Inventory-2 Restructured Form, MMPI-2-RF, validity scales), to seek confirmatory evidence to support an hypothesis of symptom exaggeration.
Garcia, Antonio F.; Acosta, Melina; Pirani, Saifa; Edwards, Daniel; Osman, Augustine
2017-01-01
We describe 2 studies designed to evaluate scores on the Multidimensional Shame-related Response Inventory-21 (MSRI-21), a recently developed instrument that measures affective and behavioral responses to shame. The inventory assesses shame-related responses in 3 categories: negative self-evaluation, fear of social consequences, and maladaptive behavior tendency. For Study 1, (N = 743) undergraduates completed the MSRI-21. Confirmatory factor analysis supported the validity of the MSRI-21 3-factor structure. Latent variable modeling of coefficient-α provided strong evidence for the internal consistency of scores on each scale. In Study 2, (N = 540) undergraduates completed the instrument along with 5 concurrent measures chosen for clinical significance. Achievement of factorial invariance supported the use of MSRI-21 scale scores to make valid mean comparisons across gender. In addition, MSRI-21 scale scores were associated as expected with scores on measures of self-harm, suicide, and other risk factors. Taken together, results of 2 studies support the internal consistency reliability, factorial validity, factorial invariance, and convergent validity of scores on the MSRI-21. Further work is needed to assess the temporal stability of the MSRI-21 scale scores, invariance across clinical status and other groupings, item-level measurement properties, and viability in highly symptomatic samples. PMID:28182490
2014-01-01
Biomarker research is continuously expanding in the field of clinical proteomics. A combination of different proteomic–based methodologies can be applied depending on the specific clinical context of use. Moreover, current advancements in proteomic analytical platforms are leading to an expansion of biomarker candidates that can be identified. Specifically, mass spectrometric techniques could provide highly valuable tools for biomarker research. Ideally, these advances could provide with biomarkers that are clinically applicable for disease diagnosis and/ or prognosis. Unfortunately, in general the biomarker candidates fail to be implemented in clinical decision making. To improve on this current situation, a well-defined study design has to be established driven by a clear clinical need, while several checkpoints between the different phases of discovery, verification and validation have to be passed in order to increase the probability of establishing valid biomarkers. In this review, we summarize the technical proteomic platforms that are available along the different stages in the biomarker discovery pipeline, exemplified by clinical applications in the field of bladder cancer biomarker research. PMID:24679154
Atypical Speech and Language Development: A Consensus Study on Clinical Signs in the Netherlands
ERIC Educational Resources Information Center
Visser-Bochane, Margot I.; Gerrits, Ellen; van der Schans, Cees P.; Reijneveld, Sijmen A.; Luinge, Margreet R.
2017-01-01
Background: Atypical speech and language development is one of the most common developmental difficulties in young children. However, which clinical signs characterize atypical speech-language development at what age is not clear. Aim: To achieve a national and valid consensus on clinical signs and red flags (i.e. most urgent clinical signs) for…
ERIC Educational Resources Information Center
Gupta, Akriti; Singh, Satendra; Khaliq, Farah; Dhaliwal, Upreet; Madhu, S. V.
2018-01-01
In the country presently, preclinical medical students are not routinely exposed to real patients. Thus, when they start clinical postings, they are found to have poor clinical reasoning skills. Simulated virtual patients (SVPs) can improve clinical skills without endangering real patients. This pilot study describes the development of two SVPs in…
2017-01-01
Background The Information Assessment Method (IAM) allows clinicians to report the cognitive impact, clinical relevance, intention to use, and expected patient health benefits associated with clinical information received by email. More than 15,000 Canadian physicians and pharmacists use the IAM in continuing education programs. In addition, information providers can use IAM ratings and feedback comments from clinicians to improve their products. Objective Our general objective was to validate the IAM questionnaire for the delivery of educational material (ecological and logical content validity). Our specific objectives were to measure the relevance and evaluate the representativeness of IAM items for assessing information received by email. Methods A 3-part mixed methods study was conducted (convergent design). In part 1 (quantitative longitudinal study), the relevance of IAM items was measured. Participants were 5596 physician members of the Canadian Medical Association who used the IAM. A total of 234,196 ratings were collected in 2012. The relevance of IAM items with respect to their main construct was calculated using descriptive statistics (relevance ratio R). In part 2 (qualitative descriptive study), the representativeness of IAM items was evaluated. A total of 15 family physicians completed semistructured face-to-face interviews. For each construct, we evaluated the representativeness of IAM items using a deductive-inductive thematic qualitative data analysis. In part 3 (mixing quantitative and qualitative parts), results from quantitative and qualitative analyses were reviewed, juxtaposed in a table, discussed with experts, and integrated. Thus, our final results are derived from the views of users (ecological content validation) and experts (logical content validation). Results Of the 23 IAM items, 21 were validated for content, while 2 were removed. In part 1 (quantitative results), 21 items were deemed relevant, while 2 items were deemed not relevant (R=4.86% [N=234,196] and R=3.04% [n=45,394], respectively). In part 2 (qualitative results), 22 items were deemed representative, while 1 item was not representative. In part 3 (mixing quantitative and qualitative results), the content validity of 21 items was confirmed, and the 2 nonrelevant items were excluded. A fully validated version was generated (IAM-v2014). Conclusions This study produced a content validated IAM questionnaire that is used by clinicians and information providers to assess the clinical information delivered in continuing education programs. PMID:28292738
Validation of the diagnostic score for acute lower abdominal pain in women of reproductive age.
Jearwattanakanok, Kijja; Yamada, Sirikan; Suntornlimsiri, Watcharin; Smuthtai, Waratsuda; Patumanond, Jayanton
2014-01-01
Background. The differential diagnoses of acute appendicitis obstetrics, and gynecological conditions (OB-GYNc) or nonspecific abdominal pain in young adult females with lower abdominal pain are clinically challenging. The present study aimed to validate the recently developed clinical score for the diagnosis of acute lower abdominal pain in female of reproductive age. Method. Medical records of reproductive age women (15-50 years) who were admitted for acute lower abdominal pain were collected. Validation data were obtained from patients admitted during a different period from the development data. Result. There were 302 patients in the validation cohort. For appendicitis, the score had a sensitivity of 91.9%, a specificity of 79.0%, and a positive likelihood ratio of 4.39. The sensitivity, specificity, and positive likelihood ratio in diagnosis of OB-GYNc were 73.0%, 91.6%, and 8.73, respectively. The areas under the receiver operating curves (ROC), the positive likelihood ratios, for appendicitis and OB-GYNc in the validation data were not significantly different from the development data, implying similar performances. Conclusion. The clinical score developed for the diagnosis of acute lower abdominal pain in female of reproductive age may be applied to guide differential diagnoses in these patients.
A Farahani, Mansoureh; Emamzadeh Ghasemi, Hormat Sadat; Nikpaima, Nasrin; Fereidooni, Zhila; Rasoli, Maryam
2014-10-29
Evaluation of nursing instructors' clinical teaching performance is a prerequisite to the quality assurance of nursing education. One of the most common procedures for this purpose is using student evaluations. This study was to develop and evaluate the psychometric properties of Nursing Instructors' Clinical Teaching Performance Inventory (NICTPI). The primary items of the inventory were generated by reviewing the published literature and the existing questionnaires as well as consulting with the members of the Faculties Evaluation Committee of the study setting. Psychometric properties were assessed by calculating its content validity ratio and index, and test-retest correlation coefficient as well as conducting an exploratory factor analysis and an internal consistency assessment. The content validity ratios and indices of the items were respectively higher than 0.85 and 0.79. The final version of the inventory consisted of 25 items, and in the exploratory factor analysis, items were loaded on three factors which jointly accounting for 72.85% of the total variance. The test-retest correlation coefficient and the Cronbach's alpha of the inventory were 0.93 and 0.973, respectively. The results revealed that the developed inventory is an appropriate, valid, and reliable instrument for evaluating nursing instructors' clinical teaching performance.
The Arabic Version of the Mayo-Portland Adaptability Inventory 4: A Validation Study
ERIC Educational Resources Information Center
Hamed, Razan; Tariah, Hashem Abu; Malkawi, Somaya; Holm, Margo B.
2012-01-01
The Mayo-Portland Adaptability Inventory 4 (MPAI-4) is a valid and reliable assessment tool to detect clinical impairments in patients with acquired brain injury. The tool is widely used by rehabilitation therapists worldwide, given its good psychometric properties and its availability in several languages. The purpose of this study was to…
Validation of the Parenting Stress Index--Short Form with Minority Caregivers
ERIC Educational Resources Information Center
Lee, Sang Jung; Gopalan, Geetha; Harrington, Donna
2016-01-01
Objectives: There has been little examination of the structural validity of the Parenting Stress Index--Short Form (PSI-SF) for minority populations in clinical contexts in the Unites States. This study aimed to test prespecified factor structures (one-factor, two-factor, and three-factor models) of the PSI-SF. Methods: This study used…
Discovery and Validation of Biomarkers to Guide Clinical Management of Pneumonia in African Children
Huang, Honglei; Ideh, Readon C.; Gitau, Evelyn; Thézénas, Marie L.; Jallow, Muminatou; Ebruke, Bernard; Chimah, Osaretin; Oluwalana, Claire; Karanja, Henri; Mackenzie, Grant; Adegbola, Richard A.; Kwiatkowski, Dominic; Kessler, Benedikt M.; Berkley, James A.; Howie, Stephen R. C.; Casals-Pascual, Climent
2014-01-01
Background. Pneumonia is the leading cause of death in children globally. Clinical algorithms remain suboptimal for distinguishing severe pneumonia from other causes of respiratory distress such as malaria or distinguishing bacterial pneumonia and pneumonia from others causes, such as viruses. Molecular tools could improve diagnosis and management. Methods. We conducted a mass spectrometry–based proteomic study to identify and validate markers of severity in 390 Gambian children with pneumonia (n = 204) and age-, sex-, and neighborhood-matched controls (n = 186). Independent validation was conducted in 293 Kenyan children with respiratory distress (238 with pneumonia, 41 with Plasmodium falciparum malaria, and 14 with both). Predictive value was estimated by the area under the receiver operating characteristic curve (AUC). Results. Lipocalin 2 (Lpc-2) was the best protein biomarker of severe pneumonia (AUC, 0.71 [95% confidence interval, .64–.79]) and highly predictive of bacteremia (78% [64%–92%]), pneumococcal bacteremia (84% [71%–98%]), and “probable bacterial etiology” (91% [84%–98%]). These results were validated in Kenyan children with severe malaria and respiratory distress who also met the World Health Organization definition of pneumonia. The combination of Lpc-2 and haptoglobin distinguished bacterial versus malaria origin of respiratory distress with high sensitivity and specificity in Gambian children (AUC, 99% [95% confidence interval, 99%–100%]) and Kenyan children (82% [74%–91%]). Conclusions. Lpc-2 and haptoglobin can help discriminate the etiology of clinically defined pneumonia and could be used to improve clinical management. These biomarkers should be further evaluated in prospective clinical studies. PMID:24696240
Storrs, Mark J; Alexander, Heather; Sun, Jing; Kroon, Jeroen; Evans, Jane L
2015-03-01
Previous research on interprofessional education (IPE) assessment has shown the need to evaluate the influence of team-based processes on the quality of clinical education. This study aimed to develop a valid and reliable instrument to evaluate the effectiveness of interprofessional team-based treatment planning (TBTP) on the quality of clinical education at the Griffith University School of Dentistry and Oral Health, Queensland, Australia. A scale was developed and evaluated to measure interprofessional student team processes and their effect on the quality of clinical education for dental, oral health therapy, and dental technology students (known more frequently as intraprofessional education). A face validity analysis by IPE experts confirmed that items on the scale reflected the meaning of relevant concepts. After piloting, 158 students (61% response rate) involved with TBTP participated in a survey. An exploratory factor analysis using the principal component method retained 23 items with a total variance of 64.6%, suggesting high content validity. Three subscales accounted for 45.7%, 11.4%, and 7.5% of the variance. Internal consistency of the scale (α=0.943) and subscales 1 (α=0.953), 2 (α=0.897), and 3 (α=0.813) was high. A reliability analysis yielded moderate (rs=0.43) to high correlations (0.81) with the remaining scale items. Confirmatory factor analyses verified convergent validity and confirmed that this structure had a good model fit. This study suggests that the instrument might be useful in evaluating interprofessional or intraprofessional team-based processes and their influence on the quality of clinical education in academic dental institutions.
Skates, Steven J.; Gillette, Michael A.; LaBaer, Joshua; Carr, Steven A.; Anderson, N. Leigh; Liebler, Daniel C.; Ransohoff, David; Rifai, Nader; Kondratovich, Marina; Težak, Živana; Mansfield, Elizabeth; Oberg, Ann L.; Wright, Ian; Barnes, Grady; Gail, Mitchell; Mesri, Mehdi; Kinsinger, Christopher R.; Rodriguez, Henry; Boja, Emily S.
2014-01-01
Protein biomarkers are needed to deepen our understanding of cancer biology and to improve our ability to diagnose, monitor and treat cancers. Important analytical and clinical hurdles must be overcome to allow the most promising protein biomarker candidates to advance into clinical validation studies. Although contemporary proteomics technologies support the measurement of large numbers of proteins in individual clinical specimens, sample throughput remains comparatively low. This problem is amplified in typical clinical proteomics research studies, which routinely suffer from a lack of proper experimental design, resulting in analysis of too few biospecimens to achieve adequate statistical power at each stage of a biomarker pipeline. To address this critical shortcoming, a joint workshop was held by the National Cancer Institute (NCI), National Heart, Lung and Blood Institute (NHLBI), and American Association for Clinical Chemistry (AACC), with participation from the U.S. Food and Drug Administration (FDA). An important output from the workshop was a statistical framework for the design of biomarker discovery and verification studies. Herein, we describe the use of quantitative clinical judgments to set statistical criteria for clinical relevance, and the development of an approach to calculate biospecimen sample size for proteomic studies in discovery and verification stages prior to clinical validation stage. This represents a first step towards building a consensus on quantitative criteria for statistical design of proteomics biomarker discovery and verification research. PMID:24063748
Skates, Steven J; Gillette, Michael A; LaBaer, Joshua; Carr, Steven A; Anderson, Leigh; Liebler, Daniel C; Ransohoff, David; Rifai, Nader; Kondratovich, Marina; Težak, Živana; Mansfield, Elizabeth; Oberg, Ann L; Wright, Ian; Barnes, Grady; Gail, Mitchell; Mesri, Mehdi; Kinsinger, Christopher R; Rodriguez, Henry; Boja, Emily S
2013-12-06
Protein biomarkers are needed to deepen our understanding of cancer biology and to improve our ability to diagnose, monitor, and treat cancers. Important analytical and clinical hurdles must be overcome to allow the most promising protein biomarker candidates to advance into clinical validation studies. Although contemporary proteomics technologies support the measurement of large numbers of proteins in individual clinical specimens, sample throughput remains comparatively low. This problem is amplified in typical clinical proteomics research studies, which routinely suffer from a lack of proper experimental design, resulting in analysis of too few biospecimens to achieve adequate statistical power at each stage of a biomarker pipeline. To address this critical shortcoming, a joint workshop was held by the National Cancer Institute (NCI), National Heart, Lung, and Blood Institute (NHLBI), and American Association for Clinical Chemistry (AACC) with participation from the U.S. Food and Drug Administration (FDA). An important output from the workshop was a statistical framework for the design of biomarker discovery and verification studies. Herein, we describe the use of quantitative clinical judgments to set statistical criteria for clinical relevance and the development of an approach to calculate biospecimen sample size for proteomic studies in discovery and verification stages prior to clinical validation stage. This represents a first step toward building a consensus on quantitative criteria for statistical design of proteomics biomarker discovery and verification research.
Legitimating Clinical Research in the Study of Organizational Culture.
ERIC Educational Resources Information Center
Schein, Edgar H.
1993-01-01
Argues that traditional research model used in industrial-organizational psychology is not useful in understanding deeper dynamics of organizations, especially those phenomena labeled as "cultural." Contends that use of data obtained during clinical and consulting work should be legitimated as valid research data. Spells out clinical model and…
Gorgon, Edward James R; Lazaro, Rolando T
2016-01-01
The Upright Motor Control Test (UMCT) has been used in clinical practice and research to assess functional strength of the hemiparetic lower limb in adults with stroke. It is unclear if evidence is sufficient to warrant its use. The purpose of this systematic review was to synthesize available evidence on the measurement properties of the UMCT for stroke rehabilitation. Electronic databases that indexed biomedical literature were systematically searched from inception until October 2015 (week 4): Embase, PubMed, Web of Science, CINAHL, PEDro, Cochrane Library, Scopus, ScienceDirect, SPORTDiscus, LILACS, DOAJ, and Google Scholar. All studies that had used the UMCT in the time period covered underwent hand searching for any additional study. Observational studies involving adults with stroke that explored any measurement property of the UMCT were included. The COnsensus-based Standards for the selection of health Measurement INstruments was used to assess the methodological quality of included studies. The CanChild Outcome Measures Rating Form was used for extracting data on measurement properties and clinical utility. The search yielded three methodologic studies that addressed criterion-related validity and contruct validity. Two studies of fair methodological quality demonstrated moderate-level evidence that Knee Extension and Knee Flexion subtest scores were predictive of community-level and household-level ambulation. One study of fair methodological quality provided limited-level evidence for the correlation of Knee Extension subtest scores with a laboratory measure of ground reaction forces. No published studies formally assessed reliability, responsiveness, or clinical utility. Limited information on responsiveness and clinical utility dimensions could be inferred from the included studies. The UMCT is a practical assessment tool for voluntary control or functional strength of the hemiparetic lower limb in standing in adults with stroke. Although different levels of evidence suggest that the Knee Extension and Knee Flexion subtests may possess criterion and construct validity, the lack of published literature examining content validity, reliability, and responsiveness raises questions regarding the use of the UMCT in routine clinical practice. These key findings highlight the need to further investigate the UMCT's measurement properties toward enhancing its standardization.
Validity Issues in Clinical Assessment.
ERIC Educational Resources Information Center
Foster, Sharon L.; Cone, John D.
1995-01-01
Validation issues that arise with measures of constructs and behavior are addressed with reference to general reasons for using assessment procedures in clinical psychology. A distinction is made between the representational phase of validity assessment and the elaborative validity phase in which the meaning and utility of scores are examined.…
Pauling, L; Herman, Z S
1989-01-01
With the assumption of the validity of the Hardin Jones principle that the death rate of members of a homogeneous cohort of cancer patients is constant, three criteria for the validity of clinical trials of cancer treatments are formulated. These criteria are satisfied by most published clinical trials, but one trial was found to violate all three, rendering the validity of its reported results uncertain. PMID:2780542
Validity of FAA-approved color vision tests for class II and class III aeromedical screening.
DOT National Transportation Integrated Search
1993-09-01
All clinical color vision tests currently used in the medical examination of pilots were studied regarding validity for prediction of performance on practical tests of ability to discriminate the aviation signal colors, red, green, and white given un...
Counselors as Caregivers: The Validation of the Counselor Caregiving Questionnaire (CCQ)
ERIC Educational Resources Information Center
Fitch, Jenelle C.
2008-01-01
This research is a validation study of the Counselor Caregiving Questionnaire (CCQ). Doctoral-level students (N = 188) in clinical and counseling psychology training programs completed the following questionnaires: (a) Counselor Caregiving Questionnaire (Fitch & Pistole, 2006), (b) Relationship Questionnaire (Bartholomew & Horowitz, 1991),…
Construct Validity of Three Clerkship Performance Assessments
ERIC Educational Resources Information Center
Lee, Ming; Wimmers, Paul F.
2010-01-01
This study examined construct validity of three commonly used clerkship performance assessments: preceptors' evaluations, OSCE-type clinical performance measures, and the NBME [National Board of Medical Examiners] medicine subject examination. Six hundred and eighty-six students taking the inpatient medicine clerkship from 2003 to 2007…
The development and psychometric properties of the selective mutism questionnaire.
Bergman, R Lindsey; Keller, Melody L; Piacentini, John; Bergman, Andrea J
2008-04-01
Research on selective mutism (SM) has been limited by the absence of standardized, psychometrically sound assessment measures. The purpose of our investigation was to present two studies that examined the factor structure and initial reliability and validity of the Selective Mutism Questionnaire (SMQ), a 17-item parent report measure of failure to speak related to SM. Study 1 (N = 589) utilized an Internet sample of parents of children ages 3 to 11 to demonstrate that the SMQ has a theoretically and clinically meaningful factor structure accounting for a significant portion of variance in responses with good internal consistency. Study 2 (N = 66) supported the validity of the SMQ in that scores discriminated clinic-referred children with SM from children with other anxiety disorders. Scores on the SMQ were correlated with measures of several theoretically and clinically important dimensions.
Discovery and validation of cell cycle arrest biomarkers in human acute kidney injury
2013-01-01
Introduction Acute kidney injury (AKI) can evolve quickly and clinical measures of function often fail to detect AKI at a time when interventions are likely to provide benefit. Identifying early markers of kidney damage has been difficult due to the complex nature of human AKI, in which multiple etiologies exist. The objective of this study was to identify and validate novel biomarkers of AKI. Methods We performed two multicenter observational studies in critically ill patients at risk for AKI - discovery and validation. The top two markers from discovery were validated in a second study (Sapphire) and compared to a number of previously described biomarkers. In the discovery phase, we enrolled 522 adults in three distinct cohorts including patients with sepsis, shock, major surgery, and trauma and examined over 300 markers. In the Sapphire validation study, we enrolled 744 adult subjects with critical illness and without evidence of AKI at enrollment; the final analysis cohort was a heterogeneous sample of 728 critically ill patients. The primary endpoint was moderate to severe AKI (KDIGO stage 2 to 3) within 12 hours of sample collection. Results Moderate to severe AKI occurred in 14% of Sapphire subjects. The two top biomarkers from discovery were validated. Urine insulin-like growth factor-binding protein 7 (IGFBP7) and tissue inhibitor of metalloproteinases-2 (TIMP-2), both inducers of G1 cell cycle arrest, a key mechanism implicated in AKI, together demonstrated an AUC of 0.80 (0.76 and 0.79 alone). Urine [TIMP-2]·[IGFBP7] was significantly superior to all previously described markers of AKI (P <0.002), none of which achieved an AUC >0.72. Furthermore, [TIMP-2]·[IGFBP7] significantly improved risk stratification when added to a nine-variable clinical model when analyzed using Cox proportional hazards model, generalized estimating equation, integrated discrimination improvement or net reclassification improvement. Finally, in sensitivity analyses [TIMP-2]·[IGFBP7] remained significant and superior to all other markers regardless of changes in reference creatinine method. Conclusions Two novel markers for AKI have been identified and validated in independent multicenter cohorts. Both markers are superior to existing markers, provide additional information over clinical variables and add mechanistic insight into AKI. Trial registration ClinicalTrials.gov number NCT01209169. PMID:23388612
Hobden, Breanne; Schwandt, Melanie L; Carey, Mariko; Lee, Mary R; Farokhnia, Mehdi; Bouhlal, Sofia; Oldmeadow, Christopher; Leggio, Lorenzo
2017-06-01
The Montgomery-Asberg Depression Rating Scale (MADRS) is commonly used to examine depressive symptoms in clinical settings, including facilities treating patients for alcohol addiction. No studies have examined the validity of the MADRS compared to an established clinical diagnostic tool of depression in this population. This study aimed to examine the following: (i) the validity of the MADRS compared to a clinical diagnosis of a depressive disorder (using the Structured Clinical Interview for DSM-IV-TR [SCID-IV-TR]) in patients seeking treatment for alcohol dependence (AD); (ii) whether the validity of the MADRS differs by type of SCID-IV-TR-based diagnosis of depression; and (iii) which items contribute to the optimal predictive model of the MADRS compared to a SCID-IV-TR diagnosis of a depressive disorder. Individuals seeking treatment for AD and admitted to an inpatient unit were administered the MADRS at day 2 of their detoxification program. Clinical diagnoses of AD and depression were made via the SCID-IV-TR at the beginning of treatment. In total, 803 participants were included in the study. The MADRS demonstrated low overall accuracy relative to the clinical diagnosis of depression with an area under the receiver operating characteristic curve of 0.68. The optimal threshold for balancing sensitivity and specificity identified by the Euclidean distance was >14. This cut-point demonstrated a sensitivity of 66%, a specificity of 60%, a positive predictive value of 50%, and a negative predictive value of 75%. The MADRS performed slightly better for major depressive disorders compared to alcohol-induced depression. Items related to lassitude, concentration, and appetite slightly decreased the accuracy of the MADRS. The MADRS does not appear to be an appropriate substitute for a diagnostic tool among alcohol-dependent patients. The MADRS may, however, still be a useful screening tool assuming careful consideration of cut-points. Copyright © 2017 by the Research Society on Alcoholism.
ERIC Educational Resources Information Center
Ay, Ismail; Secer, Ismail; Simsek, Mustafa Kerim
2017-01-01
Purpose of this study is to adapt anxiety and depression questionnaire for children into Turkish culture and to analyze the psychometric characteristics of it on clinical and nonclinical samples separately. The study is a descriptive survey research. The study was conducted on two different sample groups, clinical and nonclinical. The clinical…
Geraets, Daan; Cuzick, Jack; Cadman, Louise; Moore, Catherine; Vanden Broeck, Davy; Padalko, Elisaveta; Quint, Wim; Arbyn, Marc
2016-01-01
The Validation of Human Papillomavirus (HPV) Genotyping Tests (VALGENT) studies offer an opportunity to clinically validate HPV assays for use in primary screening for cervical cancer and also provide a framework for the comparison of analytical and type-specific performance. Through VALGENT, we assessed the performance of the cartridge-based Xpert HPV assay (Xpert HPV), which detects 14 high-risk (HR) types and resolves HPV16 and HPV18/45. Samples from women attending the United Kingdom cervical screening program enriched with cytologically abnormal samples were collated. All had been previously tested by a clinically validated standard comparator test (SCT), the GP5+/6+ enzyme immunoassay (EIA). The clinical sensitivity and specificity of the Xpert HPV for the detection of cervical intraepithelial neoplasia grade 2 or higher (CIN2+) and CIN3+ relative to those of the SCT were assessed as were the inter- and intralaboratory reproducibilities according to international criteria for test validation. Type concordance for HPV16 and HPV18/45 between the Xpert HPV and the SCT was also analyzed. The Xpert HPV detected 94% of CIN2+ and 98% of CIN3+ lesions among all screened women and 90% of CIN2+ and 96% of CIN3+ lesions in women 30 years and older. The specificity for CIN1 or less (≤CIN1) was 83% (95% confidence interval [CI], 80 to 85%) in all women and 88% (95% CI, 86 to 91%) in women 30 years and older. Inter- and intralaboratory agreements for the Xpert HPV were 98% and 97%, respectively. The kappa agreements for HPV16 and HPV18/45 between the clinically validated reference test (GP5+/6+ LMNX) and the Xpert HPV were 0.92 and 0.91, respectively. The clinical performance and reproducibility of the Xpert HPV are comparable to those of well-established HPV assays and fulfill the criteria for use in primary cervical cancer screening. PMID:27385707
Revised NEO Personality Inventory profiles of male and female U.S. Air Force pilots.
Callister, J D; King, R E; Retzlaff, P D; Marsh, R W
1999-12-01
The study of pilot personality characteristics has a long and controversial history. Personality characteristics seem to be fairly poor predictors of training outcome; however, valid personality assessment is essential to clinical psychological evaluations. Therefore, the personality characteristics of pilots must be studied to ensure valid clinical assessment. This paper describes normative personality characteristics of U.S. Air Force pilots based on the Revised NEO Personality Inventory profiles of 1,301 U.S. Air Force student pilots. Compared with male adult norms, male student pilots had higher levels of extraversion and lower levels of agreeableness. Compared with female adult norms, female student pilots had higher levels of extraversion and openness and lower levels of agreeableness. Descriptive statistics and percentile tables for the five domain scores and 30 facet scores are provided for clinical use, and a case vignette is provided as an example of the clinical utility of these U.S. Air Force norms.
Green, Michael F.
2013-01-01
Social cognitive impairment is prominent in schizophrenia, and it is closely related to functional outcome. Partly for these reasons, it has rapidly become a target for both training and psychopharmacological interventions. However, there is a paucity of reliable and valid social cognitive endpoints that can be used to evaluate treatment response in clinical trials. Also, clinical studies in schizophrenia have benefited rather little from the surge of activity and knowledge in nonclinical social neuroscience. The National Institute of Mental Health-sponsored study, “Social Cognition and Functioning in Schizophrenia” (SCAF), attempted to address this translational challenge by selecting paradigms from social neuroscience that could be adapted for use in schizophrenia. The project also evaluated the psychometric properties and external validity of the tasks to determine their suitability for multisite clinical trials. This first article in the theme section presents the goals, conceptual background, and rationale for the SCAF project. PMID:24072811
Development of a pharmacogenetic-guided warfarin dosing algorithm for Puerto Rican patients.
Ramos, Alga S; Seip, Richard L; Rivera-Miranda, Giselle; Felici-Giovanini, Marcos E; Garcia-Berdecia, Rafael; Alejandro-Cowan, Yirelia; Kocherla, Mohan; Cruz, Iadelisse; Feliu, Juan F; Cadilla, Carmen L; Renta, Jessica Y; Gorowski, Krystyna; Vergara, Cunegundo; Ruaño, Gualberto; Duconge, Jorge
2012-12-01
This study was aimed at developing a pharmacogenetic-driven warfarin-dosing algorithm in 163 admixed Puerto Rican patients on stable warfarin therapy. A multiple linear-regression analysis was performed using log-transformed effective warfarin dose as the dependent variable, and combining CYP2C9 and VKORC1 genotyping with other relevant nongenetic clinical and demographic factors as independent predictors. The model explained more than two-thirds of the observed variance in the warfarin dose among Puerto Ricans, and also produced significantly better 'ideal dose' estimates than two pharmacogenetic models and clinical algorithms published previously, with the greatest benefit seen in patients ultimately requiring <7 mg/day. We also assessed the clinical validity of the model using an independent validation cohort of 55 Puerto Rican patients from Hartford, CT, USA (R(2) = 51%). Our findings provide the basis for planning prospective pharmacogenetic studies to demonstrate the clinical utility of genotyping warfarin-treated Puerto Rican patients.
Meltzer, Eli O; Hadley, James; Blaiss, Michael; Benninger, Michael; Kimel, Miriam; Kleinman, Leah; Dupclay, Leon; Garcia, Jorge; Leahy, Michael; Georges, George
2005-02-01
To develop a questionnaire to evaluate preferences for attributes of intranasal corticosteroids (INSs) in clinical trials with allergic rhinitis (AR) patients. Established questionnaire development practices were used, including performance of a literature review and use of patient and physician focus groups, cognitive debriefing interviews, and pilot testing before validation. Findings from patient and physician focus groups suggest that sensory attributes are relevant to AR patients when choosing INSs. Physician focus groups identified the need for 2 distinct preference instruments, a clinical trial patient preference questionnaire (CTPPQ) and a clinical practice preference questionnaire (CPPPQ). A pilot study suggests that the CTPPQ is capable of discriminating between 2 INSs in the clinical trial setting. Initial findings suggest that items in the CTPPQ and CPPPQ are easy to understand and relevant to patients. Further validation studies with larger sample sizes are needed to assess the psychometric properties of both questionnaires. B-20.
Cross-cultural Adaption and Validation of the Danish Voice Handicap Index.
Sorensen, Jesper Roed; Printz, Trine; Mehlum, Camilla Slot; Heidemann, Christian Hamilton; Groentved, Aagot Moeller; Godballe, Christian
2018-02-02
We aimed to assess psychometric properties, including internal consistency, reliability, and clinical validity of the Danish version of the Voice Handicap Index (VHI). A cross-sectional survey study was carried out. For validation, the existing nonvalidated Danish version of the VHI was used. Data from 208 patients with voice disorders of different etiology (neurogenic, functional, and structural) and a control group of 85 vocally healthy individuals were included. A test-retest reliability analysis of 42 patients and 45 control persons was performed. The internal consistency, test-retest reliability, and clinical validity of the questionnaire were assessed. Internal consistency was high with a Cronbach α >0.90 for both the patient and control group. Test-retest reliability measured as intraclass correlation coefficient was good with 0.93 (95% confidence interval [95% confidence interval]: 0.87-0.96) for patients and 0.78 (95% confidence interval: 0.63-0.87) for the control group which indicates sufficient reliability of the questionnaire. The Danish VHI has good clinical validity as it has a strong correlation between patient's perception of the severity of their voice disorder and the VHI score from the Spearman correlation of 0.69. The existing Danish version of the VHI has been thoroughly validated and found to be in line with the original VHI from Jacobsen et al. It showed good internal consistency, test-retest reliability, and clinical validity. It is suitable for use in daily practice and in research projects as it is able to assess patients' perception of their voice disorder severity. Copyright © 2018 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
van der Put, Claudia E; Bouwmeester-Landweer, Merian B R; Landsmeer-Beker, Eleonore A; Wit, Jan M; Dekker, Friedo W; Kousemaker, N Pieter J; Baartman, Herman E M
2017-08-01
For preventive purposes it is important to be able to identify families with a high risk of child maltreatment at an early stage. Therefore we developed an actuarial instrument for screening families with a newborn baby, the Instrument for identification of Parents At Risk for child Abuse and Neglect (IPARAN). The aim of this study was to assess the predictive validity of the IPARAN and to examine whether combining actuarial and clinical methods leads to an improvement of the predictive validity. We examined the predictive validity by calculating several performance indicators (i.e., sensitivity, specificity and the Area Under the receiver operating characteristic Curve [AUC]) in a sample of 4692 Dutch families with newborns. The outcome measure was a report of child maltreatment at Child Protection Services during a follow-up of 3 years. For 17 children (.4%) a report of maltreatment was registered. The predictive validity of the IPARAN was significantly better than chance (AUC=.700, 95% CI [.567-.832]), in contrast to a low value for clinical judgement of nurses of the Youth Health Care Centers (AUC=.591, 95% CI [.422-.759]). The combination of the IPARAN and clinical judgement resulted in the highest predictive validity (AUC=.720, 95% CI [.593-.847]), however, the difference between the methods did not reach statistical significance. The good predictive validity of the IPARAN in combination with clinical judgment of the nurse enables professionals to assess risks at an early stage and to make referrals to early intervention programs. Copyright © 2017 Elsevier Ltd. All rights reserved.
The Growing Need for Validated Biomarkers and Endpoints for Dry Eye Clinical Research.
Roy, Neeta S; Wei, Yi; Kuklinski, Eric; Asbell, Penny A
2017-05-01
Biomarkers with minimally invasive and reproducible objective metrics provide the key to future paradigm shifts in understanding of the underlying causes of dry eye disease (DED) and approaches to treatment of DED. We review biomarkers and their validity in providing objective metrics for DED clinical research and patient care. The English-language literature in PubMed primarily over the last decade was surveyed for studies related to identification of biomarkers of DED: (1) inflammation, (2) point-of-care, (3) ocular imaging, and (4) genetics. Relevant studies in each group were individually evaluated for (1) methodological and analytical details, (2) data and concordance with other similar studies, and (3) potential to serve as validated biomarkers with objective metrics. Significant work has been done to identify biomarkers for DED clinical trials and for patient care. Interstudy variation among studies dealing with the same biomarker type was high. This could be attributed to biologic variations and/or differences in processing, and data analysis. Correlation with other signs and symptoms of DED was not always clear or present. Many of the biomarkers reviewed show the potential to serve as validated and objective metrics for clinical research and patient care in DED. Interstudy variation for a given biomarker emphasizes the need for detailed reporting of study methodology, including information on subject characteristics, quality control, processing, and analysis methods to optimize development of nonsubjective metrics. Biomarker development offers a rich opportunity to significantly move forward clinical research and patient care in DED. DED is an unmet medical need - a chronic pain syndrome associated with variable vision that affects quality of life, is common with advancing age, interferes with the comfortable use of contact lenses, and can diminish results of eye surgeries, such as cataract extraction, LASIK, and glaucoma procedures. It is a worldwide medical challenge with a prevalence rate ranging from 8% to 50%. Many clinicians and researchers across the globe are searching for better answers to understand the mechanisms related to the development and chronicity of DED. Though there have been many clinical trials for DED, few new treatments have emerged over the last decade. Biomarkers may provide the needed breakthrough to propel our understanding of DED to the next level and the potential to realize our goal of truly personalized medicine based on scientific evidence. Clinical trials and research on DED have suffered from the lack of validated biomarkers and less than objective and reproducible endpoints. Current work on biomarkers has provided the groundwork to move forward. This review highlights primarily ocular biomarkers that have been investigated for use in DED, discusses the methodologic outcomes in providing objective metrics for clinical research, and suggests recommendations for further work.
Gobbi, Erica; Elliot, Catherine; Varnier, Maurizio; Carraro, Attilio
2016-01-01
The purpose of this research was to assess an Italian version of the Physical Activity Questionnaire for Older Children (PAQ-C-It). Three separate studies were conducted, whereby testing general psychometric properties, construct validity, concurrent validity and the factor structure of the PAQ-C-It among general and clinical pediatric population. Study 1 (n = 1170) examined the psychometric properties, internal consistency, factor structure (exploratory factor analysis, EFA) and construct validity with enjoyment perception during physical activity. Study 2 (n = 59) reported on reliability, construct validity with enjoyment and BMI, and on cross-sectional concurrent validity with objectively measured MVPA (tri-axial accelerometry) over the span of seven consecutive days. Study 3 (n = 58) examined the PAQ-C-It reliability, construct validity with BMI and VO2max as the objective measurement among a population of children with congenital heart defects (CHD). In study 2 and 3, the factor structure of the PAQ-C-It was then re-examined with an EFA. The PAQ-C-It showed acceptable to good reliability (alpha .70 to .83). Results on construct validity showed moderate but significant association with enjoyment perception (r = .30 and .36), with BMI (r = -.30 and -.79 for CHD simple form), and with the VO2max (r = .55 for CHD simple form). Significant concurrent validity with the objectively measured MVPA was reported (rho = .30, p < .05). Findings of the EFA suggested a two-factor structure for the PAQ-C-It, with items 2, 3, and 4 contributing little to the total score. This study supports the PAQ-C-It as an appropriate instrument to assess the MVPA levels of Italian children, including children with simple forms of CHD. Support is given to the possible instrument effectiveness on a large international perspective in order to level out data gathering across the globe.
Gobbi, Erica; Elliot, Catherine; Varnier, Maurizio; Carraro, Attilio
2016-01-01
The purpose of this research was to assess an Italian version of the Physical Activity Questionnaire for Older Children (PAQ-C-It). Three separate studies were conducted, whereby testing general psychometric properties, construct validity, concurrent validity and the factor structure of the PAQ-C-It among general and clinical pediatric population. Study 1 (n = 1170) examined the psychometric properties, internal consistency, factor structure (exploratory factor analysis, EFA) and construct validity with enjoyment perception during physical activity. Study 2 (n = 59) reported on reliability, construct validity with enjoyment and BMI, and on cross-sectional concurrent validity with objectively measured MVPA (tri-axial accelerometry) over the span of seven consecutive days. Study 3 (n = 58) examined the PAQ-C-It reliability, construct validity with BMI and VO2max as the objective measurement among a population of children with congenital heart defects (CHD). In study 2 and 3, the factor structure of the PAQ-C-It was then re-examined with an EFA. The PAQ-C-It showed acceptable to good reliability (alpha .70 to .83). Results on construct validity showed moderate but significant association with enjoyment perception (r = .30 and .36), with BMI (r = -.30 and -.79 for CHD simple form), and with the VO2max (r = .55 for CHD simple form). Significant concurrent validity with the objectively measured MVPA was reported (rho = .30, p < .05). Findings of the EFA suggested a two-factor structure for the PAQ-C-It, with items 2, 3, and 4 contributing little to the total score. This study supports the PAQ-C-It as an appropriate instrument to assess the MVPA levels of Italian children, including children with simple forms of CHD. Support is given to the possible instrument effectiveness on a large international perspective in order to level out data gathering across the globe. PMID:27228050
Roze, S; Liens, D; Palmer, A; Berger, W; Tucker, D; Renaudin, C
2006-12-01
The aim of this study was to describe a health economic model developed to project lifetime clinical and cost outcomes of lipid-modifying interventions in patients not reaching target lipid levels and to assess the validity of the model. The internet-based, computer simulation model is made up of two decision analytic sub-models, the first utilizing Monte Carlo simulation, and the second applying Markov modeling techniques. Monte Carlo simulation generates a baseline cohort for long-term simulation by assigning an individual lipid profile to each patient, and applying the treatment effects of interventions under investigation. The Markov model then estimates the long-term clinical (coronary heart disease events, life expectancy, and quality-adjusted life expectancy) and cost outcomes up to a lifetime horizon, based on risk equations from the Framingham study. Internal and external validation analyses were performed. The results of the model validation analyses, plotted against corresponding real-life values from Framingham, 4S, AFCAPS/TexCAPS, and a meta-analysis by Gordon et al., showed that the majority of values were close to the y = x line, which indicates a perfect fit. The R2 value was 0.9575 and the gradient of the regression line was 0.9329, both very close to the perfect fit (= 1). Validation analyses of the computer simulation model suggest the model is able to recreate the outcomes from published clinical studies and would be a valuable tool for the evaluation of new and existing therapy options for patients with persistent dyslipidemia.
Dougados, Maxime; Jousse-Joulin, Sandrine; Mistretta, Frederic; d'Agostino, Maria-Antonietta; Backhaus, Marina; Bentin, Jacques; Chalès, Gérard; Chary-Valckenaere, Isabelle; Conaghan, Philip; Etchepare, Fabien; Gaudin, Philippe; Grassi, Walter; van der Heijde, Désirée; Sellam, Jérémie; Naredo, Esperanza; Szkudlarek, Marcin; Wakefield, Richard; Saraux, Alain
2010-05-01
To evaluate different global ultrasonographic (US) synovitis scoring systems as potential outcome measures of rheumatoid arthritis (RA) according to the Outcome Measures in Rheumatoid Arthritis Clinical Trials (OMERACT) filter. To study selected global scoring systems, for the clinical, B mode and power Doppler techniques, the following joints were evaluated: 28 joints (28-joint Disease Activity Score (DAS28)), 20 joints (metacarpophalangeals (MCPs) + metatarsophalangeals (MTPs)) and 38 joints (28 joints + MTPs) using either a binary (yes/no) or a 0-3 grade. The study was a prospective, 4-month duration follow-up of 76 patients with RA requiring anti-tumour necrosis factor (TNF) therapy (complete follow-up data: 66 patients). Intraobserver reliability was evaluated using the intraclass correlation coefficient (ICC), construct validity was evaluated using the Cronbach alpha test and external validity was evaluated using level of correlation between scoring system and C reactive protein (CRP). Sensitivity to change was evaluated using the standardised response mean. Discriminating capacity was evaluated using the standardised mean differences in patients considered by the doctor as significantly improved or not at the end of the study. Different clinimetric properties of various US scoring systems were at least as good as the clinical scores with, for example, intraobserver reliability ranging from 0.61 to 0.97 versus from 0.53 to 0.82, construct validity ranging from 0.76 to 0.89 versus from 0.76 to 0.88, correlation with CRP ranging from 0.28 to 0.34 versus from 0.28 to 0.35 and sensitivity to change ranging from 0.60 to 1.21 versus from 0.96 to 1.36 for US versus clinical scoring systems, respectively. This study suggests that US evaluation of synovitis is an outcome measure at least as relevant as physical examination. Further studies are required in order to achieve optimal US scoring systems for monitoring patients with RA in clinical trials and in clinical practice.
Weiss, Nicole H.; Gratz, Kim L.; Lavender, Jason M.
2015-01-01
Emotion regulation difficulties are a transdiagnostic construct relevant to numerous clinical difficulties. Although the Difficulties in Emotion Regulation Scale (Gratz & Roemer, 2004) is a multidimensional measure of maladaptive ways of responding to emotions, it focuses on difficulties with the regulation of negative emotions and does not assess emotion dysregulation in the form of problematic responding to positive emotions. The aim of this study was to develop and validate a measure of clinically-relevant difficulties in the regulation of positive emotions (DERS-Positive). Findings revealed a 3-factor structure and supported the internal consistency and construct validity of the total and subscale scores. PMID:25576185
Weiss, Nicole H; Gratz, Kim L; Lavender, Jason M
2015-05-01
Emotion regulation difficulties are a transdiagnostic construct relevant to numerous clinical difficulties. Although the Difficulties in Emotion Regulation Scale (DERS) is a multidimensional measure of maladaptive ways of responding to emotions, it focuses on difficulties with the regulation of negative emotions and does not assess emotion dysregulation in the form of problematic responding to positive emotions. The aim of this study was to develop and validate a measure of clinically relevant difficulties in the regulation of positive emotions (DERS-Positive). Findings revealed a three-factor structure and supported the internal consistency and construct validity of the total and subscale scores. © The Author(s) 2015.
Daniel, David G; Revicki, Dennis A; Canuso, Carla M; Turkoz, Ibrahim; Fu, Dong-Jing; Alphs, Larry; Ishak, K. Jack; Bartko, John J; Lindenmayer, Jean-Pierre
2012-01-01
Objective: The Clinical Global Impression for Schizoaffective Disorder scale is a new rating scale adapted from the Clinical Global Impression scale for use in patients with schizoaffective disorder. The psychometric characteristics of the Clinical Global Impression for Schizoaffective Disorder are described. Design: Content validity was assessed using an investigator questionnaire. Inter-rater reliability was determined with 12 sets of videotaped interviews rated independently by two trained individuals. Test-retest reliability was assessed using 30 randomly selected raters from clinical trials who evaluated the same videos on separate occasions two weeks apart. Convergent and divergent validity and effect size were evaluated by comparing scores between the Clinical Global Impression for Schizoaffective Disorder and the Positive and Negative Syndrome Scale, 21-item Hamilton Rating Scale for Depression, and Young Mania Rating Scale scales using pooled patient data from two clinical trials. Clinical Global Impression for Schizoaffective Disorder scores were then linked to corresponding Positive and Negative Syndrome Scale scores. Results: Content validity was strong. Inter-rater agreement was good to excellent for most scales and subscales (intra-class correlation coefficient ≥0.50). Test-retest showed good reproducibility, with intraclass correlation coefficients ranging from 0.444 to 0.898. Spearman correlations between Clinical Global Impression for Schizoaffective Disorder domains and corresponding symptom scales were 0.60 or greater, and effect sizes for Clinical Global Impression for Schizoaffective Disorder overall and domain scores were similar to Positive and Negative Syndrome Scale Young Mania Rating Scale, and 21-item Hamilton Rating Scale for Depression scores. Raters anticipated that the scale might be less effective in distinguishing negative from depressive symptoms, and, in fact, the results here may reflect that clinical reality. Conclusion: Multiple lines of evidence support the reliability and validity of the Clinical Global Impression for Schizoaffective Disorder for studies in schizoaffective disorder. PMID:22347687
Seizures in the elderly: development and validation of a diagnostic algorithm.
Dupont, Sophie; Verny, Marc; Harston, Sandrine; Cartz-Piver, Leslie; Schück, Stéphane; Martin, Jennifer; Puisieux, François; Alecu, Cosmin; Vespignani, Hervé; Marchal, Cécile; Derambure, Philippe
2010-05-01
Seizures are frequent in the elderly, but their diagnosis can be challenging. The objective of this work was to develop and validate an expert-based algorithm for the diagnosis of seizures in elderly people. A multidisciplinary group of neurologists and geriatricians developed a diagnostic algorithm using a combination of selected clinical, electroencephalographical and radiological criteria. The algorithm was validated by multicentre retrospective analysis of data of patients referred for specific symptoms and classified by the experts as epileptic patients or not. The algorithm was applied to all the patients, and the diagnosis provided by the algorithm was compared to the clinical diagnosis of the experts. Twenty-nine clinical, electroencephalographical and radiological criteria were selected for the algorithm. According to criteria combination, seizures were classified in four levels of diagnosis: certain, highly probable, possible or improbable. To validate the algorithm, the medical records of 269 elderly patients were analyzed (138 with epileptic seizures, 131 with non-epileptic manifestations). Patients were mainly referred for a transient focal deficit (40%), confusion (38%), unconsciousness (27%). The algorithm best classified certain and probable seizures versus possible and improbable seizures, with 86.2% sensitivity and 67.2% specificity. Using logistical regression, 2 simplified models were developed, the first with 13 criteria (Se 85.5%, Sp 90.1%), and the second with 7 criteria only (Se 84.8%, Sp 88.6%). In conclusion, the present study validated the use of a revised diagnostic algorithm to help diagnosis epileptic seizures in the elderly. A prospective study is planned to further validate this algorithm. Copyright 2010 Elsevier B.V. All rights reserved.
Owolabi, Mayowa O
2014-01-01
Teacher's attitude domain, a pivotal aspect of clinical teaching, is missing in the Stanford Faculty Development Program Questionnaire (SFDPQ), the most widely used student-based assessment method of clinical teaching skills. This study was conducted to develop and validate the teacher's attitude domain and evaluate the validity and internal consistency reliability of the augmented SFDPQ. Items generated for the new domain included teacher's enthusiasm, sobriety, humility, thoroughness, empathy, and accessibility. The study involved 20 resident doctors assessed once by 64 medical students using the augmented SFDPQ. Construct validity was explored using correlation among the different domains and a global rating scale. Factor analysis was performed. The response rate was 94%. The new domain had a Cronbach's alpha of 0.89, with 1-factor solution explaining 57.1% of its variance. It showed the strongest correlation to the global rating scale (rho = 0.71). The augmented SFDPQ, which had a Cronbach's alpha of 0.93, correlated better (rho = 0.72, p < 0.00001) to the global rating scale than the original SFDPQ (rho = 0.67, p < 0.00001). The new teacher's attitude domain exhibited good internal consistency and construct and factorial validity. It enhanced the content and construct validity of the SFDPQ. The validated construct of the augmented SFDPQ is recommended for design and evaluation of basic and continuing clinical teaching programs. © 2014 The Alliance for Continuing Education in the Health Professions, the Society for Academic Continuing Medical Education, and the Council on Continuing Medical Education, Association for Hospital Medical Education.
Validity of a novel computerized screening test system for mild cognitive impairment.
Park, Jin-Hyuck; Jung, Minye; Kim, Jongbae; Park, Hae Yean; Kim, Jung-Ran; Park, Ji-Hyuk
2018-06-20
ABSTRACTBackground:The mobile screening test system for screening mild cognitive impairment (mSTS-MCI) was developed for clinical use. However, the clinical usefulness of mSTS-MCI to detect elderly with MCI from those who are cognitively healthy has yet to be validated. Moreover, the comparability between this system and traditional screening tests for MCI has not been evaluated. The purpose of this study was to examine the validity and reliability of the mSTS-MCI and confirm the cut-off scores to detect MCI. The data were collected from 107 healthy elderly people and 74 elderly people with MCI. Concurrent validity was examined using the Korean version of Montreal Cognitive Assessment (MoCA-K) as a gold standard test, and test-retest reliability was investigated using 30 of the study participants at four-week intervals. The sensitivity, specificity, positive predictive value, and negative predictive value (NPV) were confirmed through Receiver Operating Characteristic (ROC) analysis, and the cut-off scores for elderly people with MCI were identified. Concurrent validity showed statistically significant correlations between the mSTS-MCI and MoCA-K and test-rests reliability indicated high correlation. As a result of screening predictability, the mSTS-MCI had a higher NPV than the MoCA-K. The mSTS-MCI was identified as a system with a high degree of validity and reliability. In addition, the mSTS-MCI showed high screening predictability, indicating it can be used in the clinical field as a screening test system for mild cognitive impairment.
Reporting Guidelines for Clinical Pharmacokinetic Studies: The ClinPK Statement.
Kanji, Salmaan; Hayes, Meghan; Ling, Adam; Shamseer, Larissa; Chant, Clarence; Edwards, David J; Edwards, Scott; Ensom, Mary H H; Foster, David R; Hardy, Brian; Kiser, Tyree H; la Porte, Charles; Roberts, Jason A; Shulman, Rob; Walker, Scott; Zelenitsky, Sheryl; Moher, David
2015-07-01
Transparent reporting of all research is essential for assessing the validity of any study. Reporting guidelines are available and endorsed for many types of research but are lacking for clinical pharmacokinetic studies. Such tools promote the consistent reporting of a minimal set of information for end users, and facilitate knowledge translation of research. The objective of this study was to create a guideline to assist in the transparent and complete reporting of clinical pharmacokinetic studies. Preliminary content to be considered was identified from a systematic search of the literature and regulatory documents. Stakeholders were identified to participate in a modified Delphi exercise and a virtual meeting to generate consensus for items considered essential in the reporting of clinical pharmacokinetic studies. The proposed checklist was pilot tested on 100 recently published clinical pharmacokinetic studies. Overall and itemized compliance with the proposed guidance was determined for each study. Sixty-eight stakeholders from nine countries consented to participate. Four rounds of a modified Delphi survey and a series of small virtual meetings were required to generate consensus for a 24-item checklist considered to be essential to the reporting of clinical pharmacokinetic studies. When applied to the 100 most recently published clinical pharmacokinetic studies, 45 were determined to be compliant with at least 80 % of the checklist items. Explanatory text was prepared using examples of compliant reporting from these and other relevant studies. The reader's ability to judge the validity of pharmacokinetic research can be greatly compromised by the incomplete reporting of study information. Using consensus methods, we have developed a tool to guide transparent and accurate reporting of clinical pharmacokinetic studies. Endorsement and implementation of these guidelines by researchers, clinicians and journals would promote more consistent reporting of these studies and allow for better assessment of utility for clinical applications.
Kim, SungHwan; Lin, Chien-Wei; Tseng, George C
2016-07-01
Supervised machine learning is widely applied to transcriptomic data to predict disease diagnosis, prognosis or survival. Robust and interpretable classifiers with high accuracy are usually favored for their clinical and translational potential. The top scoring pair (TSP) algorithm is an example that applies a simple rank-based algorithm to identify rank-altered gene pairs for classifier construction. Although many classification methods perform well in cross-validation of single expression profile, the performance usually greatly reduces in cross-study validation (i.e. the prediction model is established in the training study and applied to an independent test study) for all machine learning methods, including TSP. The failure of cross-study validation has largely diminished the potential translational and clinical values of the models. The purpose of this article is to develop a meta-analytic top scoring pair (MetaKTSP) framework that combines multiple transcriptomic studies and generates a robust prediction model applicable to independent test studies. We proposed two frameworks, by averaging TSP scores or by combining P-values from individual studies, to select the top gene pairs for model construction. We applied the proposed methods in simulated data sets and three large-scale real applications in breast cancer, idiopathic pulmonary fibrosis and pan-cancer methylation. The result showed superior performance of cross-study validation accuracy and biomarker selection for the new meta-analytic framework. In conclusion, combining multiple omics data sets in the public domain increases robustness and accuracy of the classification model that will ultimately improve disease understanding and clinical treatment decisions to benefit patients. An R package MetaKTSP is available online. (http://tsenglab.biostat.pitt.edu/software.htm). ctseng@pitt.edu Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Guise, Brian J; Thompson, Matthew D; Greve, Kevin W; Bianchini, Kevin J; West, Laura
2014-03-01
The current study assessed performance validity on the Stroop Color and Word Test (Stroop) in mild traumatic brain injury (TBI) using criterion-groups validation. The sample consisted of 77 patients with a reported history of mild TBI. Data from 42 moderate-severe TBI and 75 non-head-injured patients with other clinical diagnoses were also examined. TBI patients were categorized on the basis of Slick, Sherman, and Iverson (1999) criteria for malingered neurocognitive dysfunction (MND). Classification accuracy is reported for three indicators (Word, Color, and Color-Word residual raw scores) from the Stroop across a range of injury severities. With false-positive rates set at approximately 5%, sensitivity was as high as 29%. The clinical implications of these findings are discussed. © 2012 The British Psychological Society.
Assessment of Semi-Structured Clinical Interview for Mobile Phone Addiction Disorder
Alavi, Seyyed Salman; Jannatifard, Fereshteh; Mohammadi Kalhori, Soroush; Sepahbodi, Ghazal; BabaReisi, Mohammad; Sajedi, Sahar; Farshchi, Mojtaba; KhodaKarami, Rasul; Hatami Kasvaee, Vahid
2016-01-01
Objective: The Diagnostic and Statistical Manual of Mental Disorders, Fourth Edition, Text Revision (DSM-IV-TR) classified mobile phone addiction disorder under “impulse control disorder not elsewhere classified”. This study surveyed the diagnostic criteria of DSM-IV-TR for the diagnosis of mobile phone addiction in correspondence with Iranian society and culture. Method: Two hundred fifty students of Tehran universities were entered into this descriptive-analytical and cross-sectional study. Quota sampling method was used. At first, semi- structured clinical interview (based on DSM-IV-TR) was performed for all the cases, and another specialist reevaluated the interviews. Data were analyzed using content validity, inter-scorer reliability (Kappa coefficient) and test-retest via SPSS18 software. Results: The content validity of the semi- structured clinical interview matched the DSM–IV-TR criteria for behavioral addiction. Moreover, their content was appropriate, and two items, including “SMS pathological use” and “High monthly cost of using the mobile phone” were added to promote its validity. Internal reliability (Kappa) and test–retest reliability were 0.55 and r = 0.4 (p<0. 01) respectively. Conclusion: The results of this study revealed that semi- structured diagnostic criteria of DSM-IV-TR are valid and reliable for diagnosing mobile phone addiction, and this instrument is an effective tool to diagnose this disorder. PMID:27437008
Development and psychometric validation of a cystic fibrosis knowledge scale.
Balfour, Louise; Armstrong, Michael; Holly, Crystal; Gaudet, Ena; Aaron, Shawn; Tasca, George; Cameron, William; Pakhale, Smita
2014-11-01
Well-developed and validated measures of cystic fibrosis (CF) knowledge are scarce. The purpose of the present study is to develop and validate a CF knowledge scale that is brief, easy to use, self-administered and demonstrates clinical utility. A comprehensive literature search generated a pool of scale items; an expert panel of CF team members reviewed and provided recommendations for item inclusion. A focus group of CF patients and family members (n = 12) then reviewed the items for face validity and reading clarity. To evaluate the validity and reliability of the newly developed CF knowledge scale, it was administered to several different samples including CF patients (n = 45), respirology patients (n = 100), health-care providers (n = 74) and university student samples (psychology students, n = 71; medical students, n = 36). Internal consistency of the scale was high, with an alpha coefficient for the overall sample of .95 (n = 326). The scale also demonstrated excellent construct validity. This study is an important first step in a line of research that aims to develop and empirically validate a psycho-educational adherence intervention for improving quality of life and treatment outcomes among adult CF patients. The CF knowledge scale has potential applications as a clinical teaching tool with patients and health-care providers and could be used as an outcome measure in CF educational intervention studies aimed at optimizing CF treatment knowledge, adherence and quality of life among CF patients. © 2014 Asian Pacific Society of Respirology.
Kleikers, Pamela W M; Hooijmans, Carlijn; Göb, Eva; Langhauser, Friederike; Rewell, Sarah S J; Radermacher, Kim; Ritskes-Hoitinga, Merel; Howells, David W; Kleinschnitz, Christoph; Schmidt, Harald H H W
2015-08-27
Biomedical research suffers from a dramatically poor translational success. For example, in ischemic stroke, a condition with a high medical need, over a thousand experimental drug targets were unsuccessful. Here, we adopt methods from clinical research for a late-stage pre-clinical meta-analysis (MA) and randomized confirmatory trial (pRCT) approach. A profound body of literature suggests NOX2 to be a major therapeutic target in stroke. Systematic review and MA of all available NOX2(-/y) studies revealed a positive publication bias and lack of statistical power to detect a relevant reduction in infarct size. A fully powered multi-center pRCT rejects NOX2 as a target to improve neurofunctional outcomes or achieve a translationally relevant infarct size reduction. Thus stringent statistical thresholds, reporting negative data and a MA-pRCT approach can ensure biomedical data validity and overcome risks of bias.
Masucci, Giuseppe V; Cesano, Alessandra; Hawtin, Rachael; Janetzki, Sylvia; Zhang, Jenny; Kirsch, Ilan; Dobbin, Kevin K; Alvarez, John; Robbins, Paul B; Selvan, Senthamil R; Streicher, Howard Z; Butterfield, Lisa H; Thurin, Magdalena
2016-01-01
Immunotherapies have emerged as one of the most promising approaches to treat patients with cancer. Recently, there have been many clinical successes using checkpoint receptor blockade, including T cell inhibitory receptors such as cytotoxic T-lymphocyte-associated antigen 4 (CTLA-4) and programmed cell death-1 (PD-1). Despite demonstrated successes in a variety of malignancies, responses only typically occur in a minority of patients in any given histology. Additionally, treatment is associated with inflammatory toxicity and high cost. Therefore, determining which patients would derive clinical benefit from immunotherapy is a compelling clinical question. Although numerous candidate biomarkers have been described, there are currently three FDA-approved assays based on PD-1 ligand expression (PD-L1) that have been clinically validated to identify patients who are more likely to benefit from a single-agent anti-PD-1/PD-L1 therapy. Because of the complexity of the immune response and tumor biology, it is unlikely that a single biomarker will be sufficient to predict clinical outcomes in response to immune-targeted therapy. Rather, the integration of multiple tumor and immune response parameters, such as protein expression, genomics, and transcriptomics, may be necessary for accurate prediction of clinical benefit. Before a candidate biomarker and/or new technology can be used in a clinical setting, several steps are necessary to demonstrate its clinical validity. Although regulatory guidelines provide general roadmaps for the validation process, their applicability to biomarkers in the cancer immunotherapy field is somewhat limited. Thus, Working Group 1 (WG1) of the Society for Immunotherapy of Cancer (SITC) Immune Biomarkers Task Force convened to address this need. In this two volume series, we discuss pre-analytical and analytical (Volume I) as well as clinical and regulatory (Volume II) aspects of the validation process as applied to predictive biomarkers for cancer immunotherapy. To illustrate the requirements for validation, we discuss examples of biomarker assays that have shown preliminary evidence of an association with clinical benefit from immunotherapeutic interventions. The scope includes only those assays and technologies that have established a certain level of validation for clinical use (fit-for-purpose). Recommendations to meet challenges and strategies to guide the choice of analytical and clinical validation design for specific assays are also provided.
Recommendations for the Definition of Clinical Responder in Insulin Preservation Studies
Gitelman, Stephen E.; Palmer, Jerry P.
2014-01-01
Clinical responder studies should contribute to the translation of effective treatments and interventions to the clinic. Since ultimately this translation will involve regulatory approval, we recommend that clinical trials prespecify a responder definition that can be assessed against the requirements and suggestions of regulatory agencies. In this article, we propose a clinical responder definition to specifically assist researchers and regulatory agencies in interpreting the clinical importance of statistically significant findings for studies of interventions intended to preserve β-cell function in newly diagnosed type 1 diabetes. We focus on studies of 6-month β-cell preservation in type 1 diabetes as measured by 2-h–stimulated C-peptide. We introduce criteria (bias, reliability, and external validity) for the assessment of responder definitions to ensure they meet U.S. Food and Drug Administration and European Medicines Agency guidelines. Using data from several published TrialNet studies, we evaluate our definition (no decrease in C-peptide) against published alternatives and determine that our definition has minimum bias with external validity. We observe that reliability could be improved by using changes in C-peptide later than 6 months beyond baseline. In sum, to support efficacy claims of β-cell preservation therapies in type 1 diabetes submitted to U.S. and European regulatory agencies, we recommend use of our definition. PMID:24722251
Validation of a next-generation sequencing assay for clinical molecular oncology.
Cottrell, Catherine E; Al-Kateb, Hussam; Bredemeyer, Andrew J; Duncavage, Eric J; Spencer, David H; Abel, Haley J; Lockwood, Christina M; Hagemann, Ian S; O'Guin, Stephanie M; Burcea, Lauren C; Sawyer, Christopher S; Oschwald, Dayna M; Stratman, Jennifer L; Sher, Dorie A; Johnson, Mark R; Brown, Justin T; Cliften, Paul F; George, Bijoy; McIntosh, Leslie D; Shrivastava, Savita; Nguyen, Tudung T; Payton, Jacqueline E; Watson, Mark A; Crosby, Seth D; Head, Richard D; Mitra, Robi D; Nagarajan, Rakesh; Kulkarni, Shashikant; Seibert, Karen; Virgin, Herbert W; Milbrandt, Jeffrey; Pfeifer, John D
2014-01-01
Currently, oncology testing includes molecular studies and cytogenetic analysis to detect genetic aberrations of clinical significance. Next-generation sequencing (NGS) allows rapid analysis of multiple genes for clinically actionable somatic variants. The WUCaMP assay uses targeted capture for NGS analysis of 25 cancer-associated genes to detect mutations at actionable loci. We present clinical validation of the assay and a detailed framework for design and validation of similar clinical assays. Deep sequencing of 78 tumor specimens (≥ 1000× average unique coverage across the capture region) achieved high sensitivity for detecting somatic variants at low allele fraction (AF). Validation revealed sensitivities and specificities of 100% for detection of single-nucleotide variants (SNVs) within coding regions, compared with SNP array sequence data (95% CI = 83.4-100.0 for sensitivity and 94.2-100.0 for specificity) or whole-genome sequencing (95% CI = 89.1-100.0 for sensitivity and 99.9-100.0 for specificity) of HapMap samples. Sensitivity for detecting variants at an observed 10% AF was 100% (95% CI = 93.2-100.0) in HapMap mixes. Analysis of 15 masked specimens harboring clinically reported variants yielded concordant calls for 13/13 variants at AF of ≥ 15%. The WUCaMP assay is a robust and sensitive method to detect somatic variants of clinical significance in molecular oncology laboratories, with reduced time and cost of genetic analysis allowing for strategic patient management. Copyright © 2014 American Society for Investigative Pathology and the Association for Molecular Pathology. Published by Elsevier Inc. All rights reserved.
Arujuna, Aruna V; Housden, R James; Ma, Yingliang; Rajani, Ronak; Gao, Gang; Nijhof, Niels; Cathier, Pascal; Bullens, Roland; Gijsbers, Geert; Parish, Victoria; Kapetanakis, Stamatis; Hancock, Jane; Rinaldi, C Aldo; Cooklin, Michael; Gill, Jaswinder; Thomas, Martyn; O'neill, Mark D; Razavi, Reza; Rhode, Kawal S
2014-01-01
Real-time imaging is required to guide minimally invasive catheter-based cardiac interventions. While transesophageal echocardiography allows for high-quality visualization of cardiac anatomy, X-ray fluoroscopy provides excellent visualization of devices. We have developed a novel image fusion system that allows real-time integration of 3-D echocardiography and the X-ray fluoroscopy. The system was validated in the following two stages: 1) preclinical to determine function and validate accuracy; and 2) in the clinical setting to assess clinical workflow feasibility and determine overall system accuracy. In the preclinical phase, the system was assessed using both phantom and porcine experimental studies. Median 2-D projection errors of 4.5 and 3.3 mm were found for the phantom and porcine studies, respectively. The clinical phase focused on extending the use of the system to interventions in patients undergoing either atrial fibrillation catheter ablation (CA) or transcatheter aortic valve implantation (TAVI). Eleven patients were studied with nine in the CA group and two in the TAVI group. Successful real-time view synchronization was achieved in all cases with a calculated median distance error of 2.2 mm in the CA group and 3.4 mm in the TAVI group. A standard clinical workflow was established using the image fusion system. These pilot data confirm the technical feasibility of accurate real-time echo-fluoroscopic image overlay in clinical practice, which may be a useful adjunct for real-time guidance during interventional cardiac procedures.
Teaching method validation in the clinical laboratory science curriculum.
Moon, Tara C; Legrys, Vicky A
2008-01-01
With the Clinical Laboratory Improvement Amendment's (CLIA) final rule, the ability of the Clinical Laboratory Scientist (CLS) to perform method validation has become increasingly important. Knowledge of the statistical methods and procedures used in method validation is imperative for clinical laboratory scientists. However, incorporating these concepts in a CLS curriculum can be challenging, especially at a time of limited resources. This paper provides an outline of one approach to addressing these topics in lecture courses and integrating them in the student laboratory and the clinical practicum for direct application.
Genomic markers for decision making: what is preventing us from using markers?
Coyle, Vicky M; Johnston, Patrick G
2010-02-01
The advent of novel genomic technologies that enable the evaluation of genomic alterations on a genome-wide scale has significantly altered the field of genomic marker research in solid tumors. Researchers have moved away from the traditional model of identifying a particular genomic alteration and evaluating the association between this finding and a clinical outcome measure to a new approach involving the identification and measurement of multiple genomic markers simultaneously within clinical studies. This in turn has presented additional challenges in considering the use of genomic markers in oncology, such as clinical study design, reproducibility and interpretation and reporting of results. This Review will explore these challenges, focusing on microarray-based gene-expression profiling, and highlights some common failings in study design that have impacted on the use of putative genomic markers in the clinic. Despite these rapid technological advances there is still a paucity of genomic markers in routine clinical use at present. A rational and focused approach to the evaluation and validation of genomic markers is needed, whereby analytically validated markers are investigated in clinical studies that are adequately powered and have pre-defined patient populations and study endpoints. Furthermore, novel adaptive clinical trial designs, incorporating putative genomic markers into prospective clinical trials, will enable the evaluation of these markers in a rigorous and timely fashion. Such approaches have the potential to facilitate the implementation of such markers into routine clinical practice and consequently enable the rational and tailored use of cancer therapies for individual patients.
Chamberlain, Samuel R; Grant, Jon E
2018-07-01
Disorders of impulsivity are common, functionally impairing, and highly relevant across different clinical and research settings. Few structured clinical interviews for the identification and diagnosis of impulse control disorders exist, and none have been validated in a community sample in terms of psychometric properties. The Minnesota Impulse control disorders Interview (MIDI v2.0) was administered to an enriched sample of 293 non-treatment seeking adults aged 18-35 years, recruited using media advertisements in two large US cities. In addition to the MIDI, participants undertook extended clinical interview for other mental disorders, the Barratt impulsiveness questionnaire, and the Padua obsessive-compulsive inventory. The psychometric properties of the MIDI were characterized. In logistic regression, the MIDI showed good concurrent validity against the reference measures (versus gambling disorder interview, p < 0.001; Barratt impulsiveness attentional and non-planning scores p < 0.05), and good discriminant validity versus primarily non-impulsive symptoms, including against anxiety, depression, and obsessive-compulsive symptoms (all p > 0.05). Test re-test reliability was excellent (0.95). The MIDI has good psychometric properties and thus may be a valuable interview tool for clinical and research studies involving impulse control disorders. Further research is needed to better understanding the optimal diagnostic classification and neurobiology of these neglected disorders. Crown Copyright © 2018. Published by Elsevier B.V. All rights reserved.
Hall, Emily A; Docherty, Carrie L
2017-07-01
To determine the concurrent validity of standard clinical outcome measures compared to laboratory outcome measure while performing the weight-bearing lunge test (WBLT). Cross-sectional study. Fifty participants performed the WBLT to determine dorsiflexion ROM using four different measurement techniques: dorsiflexion angle with digital inclinometer at 15cm distal to the tibial tuberosity (°), dorsiflexion angle with inclinometer at tibial tuberosity (°), maximum lunge distance (cm), and dorsiflexion angle using a 2D motion capture system (°). Outcome measures were recorded concurrently during each trial. To establish concurrent validity, Pearson product-moment correlation coefficients (r) were conducted, comparing each dependent variable to the 2D motion capture analysis (identified as the reference standard). A higher correlation indicates strong concurrent validity. There was a high correlation between each measurement technique and the reference standard. Specifically the correlation between the inclinometer placement at 15cm below the tibial tuberosity (44.9°±5.5°) and the motion capture angle (27.0°±6.0°) was r=0.76 (p=0.001), between the inclinometer placement at the tibial tuberosity angle (39.0°±4.6°) and the motion capture angle was r=0.71 (p=0.001), and between the distance from the wall clinical measure (10.3±3.0cm) to the motion capture angle was r=0.74 (p=0.001). This study determined that the clinical measures used during the WBLT have a high correlation with the reference standard for assessing dorsiflexion range of motion. Therefore, obtaining maximum lunge distance and inclinometer angles are both valid assessments during the weight-bearing lunge test. Copyright © 2016 Sports Medicine Australia. Published by Elsevier Ltd. All rights reserved.
Steffan, Jean; Olivry, Thierry; Forster, Sophie L; Seewald, Wolfgang
2012-10-01
Hypersensitivity (allergic) dermatitis (HD) is commonly seen in cats, causing pruritus and various patterns of skin lesions, including at least one of the following: head and neck excoriations, self-induced alopecia, eosinophilic plaques and miliary dermatitis. Few studies have evaluated the efficacy of therapeutic interventions for feline HD, and although various scales have been considered, none has been formally validated for the assessment of disease severity and its response to therapy. To design and validate a novel scale (SCORing Feline Allergic Dermatitis; SCORFAD) to assess the value of different criteria used as outcome measures for the treatment of feline HD and to set minimal thresholds for defining the clinical success of tested interventions. One hundred client-owned cats. The SCORFAD scale was designed to include the four most frequently identified lesion types in feline HD (eosinophilic plaque, head and neck excoriations, self-induced alopecia and miliary dermatitis) across 10 body regions. The extent and severity of each lesion type were graded prior to inclusion and after 3 and 6 weeks in a clinical study to compare the efficacy of two doses of ciclosporin with placebo. The SCORFAD scale was found to exhibit satisfactory content, construct, criterion and sensitivity to change. The percentage reduction in SCORFAD from baseline was determined to be the most valid assessment of clinical response. Inter- and intra-observer reliability was not assessed. The SCORFAD scale is proposed for use as a validated tool for the assessment of disease severity and response to therapeutic interventions in clinical trials for feline HD. © 2012 The Authors. Veterinary Dermatology © 2012 ESVD and ACVD.
Fahey, Marion; Rudd, Anthony; Béjot, Yannick; Wolfe, Charles; Douiri, Abdel
2017-01-01
Introduction Stroke is a leading cause of adult disability and death worldwide. The neurological impairments associated with stroke prevent patients from performing basic daily activities and have enormous impact on families and caregivers. Practical and accurate tools to assist in predicting outcome after stroke at patient level can provide significant aid for patient management. Furthermore, prediction models of this kind can be useful for clinical research, health economics, policymaking and clinical decision support. Methods 2869 patients with first-ever stroke from South London Stroke Register (SLSR) (1995–2004) will be included in the development cohort. We will use information captured after baseline to construct multilevel models and a Cox proportional hazard model to predict cognitive impairment, functional outcome and mortality up to 5 years after stroke. Repeated random subsampling validation (Monte Carlo cross-validation) will be evaluated in model development. Data from participants recruited to the stroke register (2005–2014) will be used for temporal validation of the models. Data from participants recruited to the Dijon Stroke Register (1985–2015) will be used for external validation. Discrimination, calibration and clinical utility of the models will be presented. Ethics Patients, or for patients who cannot consent their relatives, gave written informed consent to participate in stroke-related studies within the SLSR. The SLSR design was approved by the ethics committees of Guy’s and St Thomas’ NHS Foundation Trust, Kings College Hospital, Queens Square and Westminster Hospitals (London). The Dijon Stroke Registry was approved by the Comité National des Registres and the InVS and has authorisation of the Commission Nationale de l’Informatique et des Libertés. PMID:28821511
van Rooij, Antonius J; Schoenmakers, Tim M; van de Mheen, Dike
2017-01-01
Clinicians struggle with the identification of video gaming problems. To address this issue, a clinical assessment tool (C-VAT 2.0) was developed and tested in a clinical setting. The instrument allows exploration of the validity of the DSM-5 proposal for 'internet gaming disorder'. Using C-VAT 2.0, the current study provides a sensitivity analysis of the proposed DSM-5 criteria in a clinical youth sample (13-23years old) in treatment for video gaming disorder (N=32). The study also explores the clinical characteristics of these patients. The patients were all male and reported spending extensive amounts of time on video games. At least half of the patients reported playing online games (n=15). Comorbid problems were common (n=22) and included (social) anxiety disorders, PDD NOS, ADHD/ADD, Parent-Child relationship problem, and various types of depressive mood problems. The sensitivity of the test was good: results further show that the C-VAT correctly identified 91% of the sample at the proposed cut-off score of at least 5 out of 9 of the criteria. As our study did not include healthy, extreme gamers, we could not assess the specificity of the tool: future research should make this a priority. Using the proposed DSM-5 cut-off score, the C-VAT 2.0 shows preliminary validity in a sample of gamers in treatment for gaming disorder, but the discriminating value of the instrument should be studied further. In the meantime, it is crucial that therapists try to avoid false positives by using expert judgment of functional impairment in each case. Copyright © 2015 Elsevier Ltd. All rights reserved.
QNOTE: an instrument for measuring the quality of EHR clinical notes.
Burke, Harry B; Hoang, Albert; Becher, Dorothy; Fontelo, Paul; Liu, Fang; Stephens, Mark; Pangaro, Louis N; Sessums, Laura L; O'Malley, Patrick; Baxi, Nancy S; Bunt, Christopher W; Capaldi, Vincent F; Chen, Julie M; Cooper, Barbara A; Djuric, David A; Hodge, Joshua A; Kane, Shawn; Magee, Charles; Makary, Zizette R; Mallory, Renee M; Miller, Thomas; Saperstein, Adam; Servey, Jessica; Gimbel, Ronald W
2014-01-01
The outpatient clinical note documents the clinician's information collection, problem assessment, and patient management, yet there is currently no validated instrument to measure the quality of the electronic clinical note. This study evaluated the validity of the QNOTE instrument, which assesses 12 elements in the clinical note, for measuring the quality of clinical notes. It also compared its performance with a global instrument that assesses the clinical note as a whole. Retrospective multicenter blinded study of the clinical notes of 100 outpatients with type 2 diabetes mellitus who had been seen in clinic on at least three occasions. The 300 notes were rated by eight general internal medicine and eight family medicine practicing physicians. The QNOTE instrument scored the quality of the note as the sum of a set of 12 note element scores, and its inter-rater agreement was measured by the intraclass correlation coefficient. The Global instrument scored the note in its entirety, and its inter-rater agreement was measured by the Fleiss κ. The overall QNOTE inter-rater agreement was 0.82 (CI 0.80 to 0.84), and its note quality score was 65 (CI 64 to 66). The Global inter-rater agreement was 0.24 (CI 0.19 to 0.29), and its note quality score was 52 (CI 49 to 55). The QNOTE quality scores were consistent, and the overall QNOTE score was significantly higher than the overall Global score (p=0.04). We found the QNOTE to be a valid instrument for evaluating the quality of electronic clinical notes, and its performance was superior to that of the Global instrument. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions.
Wressle, Ewa; Eriksson, Lennart; Fahlander, Amie; Rasmusson, Ing-Marie; Tedemalm, Ulla; Tängmark, Karin
2006-06-01
The aim was to develop and test a questionnaire for use in telephone interviews concerning patient evaluation of geriatric care and rehabilitation. Instrument development was performed comprising qualitative interviews, construction of items, content validation, pilot study and data collection for evaluation of care and rehabilitation, clinical utility, reliability and construct validity. Qualitative interviews were performed with 12 elderly participants. The qualitative interviews formed the basis for the construction of 45 items. An expert panel performed a content validation of the questionnaire resulting in a revised version. A pilot study comprised 29 participants recently discharged from geriatric wards and the main data collection comprised 221 participants. Inclusion criteria were being able to perform a telephone interview and willingness to participate. Clinical utility was examined through questions to the interviewers, answered in writing. Cronbach's alpha coefficient was 0.79. According to a factor analysis and the evaluation of clinical utility, the underlying dimensions of the final revised questionnaire concern 'Respect and safety', 'Information and participation' and 'Rehabilitation interventions', scored in 18 items. In addition, one global item concerns satisfaction with care, resulting in 19 items in total. The revised questionnaire was named PaPeR, Patient Perspective on care and Rehabilitation. The questionnaire is considered valid, reliable and judged to have good clinical utility. The time consumption for the telephone interview is about 10-20 minutes. The questionnaire is useful in defining areas for potential quality improvement in geriatric wards.
Fallatah, Hind I; Tekian, Ara; Park, Yoon Soo; Al Shawa, Lana
2015-02-01
Exams are essential components of medical students' knowledge and skill assessment during their clinical years of study. The paper provides a retrospective analysis of validity evidence for the internal medicine component of the written and clinical exams administered in 2012 and 2013 at King Abdulaziz University's Faculty of Medicine. Students' scores for the clinical and written exams were obtained. Four faculty members (two senior members and two junior members) were asked to rate the exam questions, including MCQs and OSCEs, for evidence of content validity using a rating scale of 1-5 for each item. Cronbach's alpha was used to measure the internal consistency reliability. Correlations were used to examine the associations between different forms of assessment and groups of students. A total of 824 students completed the internal medicine course and took the exam. The numbers of rated questions were 320 and 46 for the MCQ and OSCE, respectively. Significant correlations were found between the MCQ section, the OSCE section, and the continuous assessment marks, which include 20 long-case presentations during the course; participation in daily rounds, clinical sessions and tutorials; the performance of simple procedures, such as IV cannulation and ABG extraction; and the student log book. Although the OSCE exam was reliable for the two groups that had taken the final clinical OSCE, the clinical long- and short-case exams were not reliable across the two groups that had taken the oral clinical exams. The correlation analysis showed a significant linear association between the raters with respect to evidence of content validity for both the MCQ and OSCE, r = .219 P < .001 and r = .678 P < .001, respectively, and r = .241 P < .001 and r = .368 P = .023 for the internal structure validity, respectively. Reliability measured using Cronbach's alpha was greater for assessments administered in 2013. The pattern of relationships between the MCQ and OSCE scores provides evidence of the validity of these measures for use in the evaluation of knowledge and clinical skills in internal medicine. The OSCE exam is more reliable than the short- and long-case clinical exams and requires less effort on the part of examiners and patients.
Merker, Jason D; Oxnard, Geoffrey R; Compton, Carolyn; Diehn, Maximilian; Hurley, Patricia; Lazar, Alexander J; Lindeman, Neal; Lockwood, Christina M; Rai, Alex J; Schilsky, Richard L; Tsimberidou, Apostolia M; Vasalos, Patricia; Billman, Brooke L; Oliver, Thomas K; Bruinooge, Suanna S; Hayes, Daniel F; Turner, Nicholas C
2018-06-01
Purpose Clinical use of analytical tests to assess genomic variants in circulating tumor DNA (ctDNA) is increasing. This joint review from ASCO and the College of American Pathologists summarizes current information about clinical ctDNA assays and provides a framework for future research. Methods An Expert Panel conducted a literature review on the use of ctDNA assays for solid tumors, including pre-analytical variables, analytical validity, interpretation and reporting, and clinical validity and utility. Results The literature search identified 1,338 references. Of those, 390, plus 31 references supplied by the Expert Panel, were selected for full-text review. There were 77 articles selected for inclusion. Conclusion The evidence indicates that testing for ctDNA is optimally performed on plasma collected in cell stabilization or EDTA tubes, with EDTA tubes processed within 6 hours of collection. Some ctDNA assays have demonstrated clinical validity and utility with certain types of advanced cancer; however, there is insufficient evidence of clinical validity and utility for the majority of ctDNA assays in advanced cancer. Evidence shows discordance between the results of ctDNA assays and genotyping tumor specimens and supports tumor tissue genotyping to confirm undetected results from ctDNA tests. There is no evidence of clinical utility and little evidence of clinical validity of ctDNA assays in early-stage cancer, treatment monitoring, or residual disease detection. There is no evidence of clinical validity and clinical utility to suggest that ctDNA assays are useful for cancer screening, outside of a clinical trial. Given the rapid pace of research, re-evaluation of the literature will shortly be required, along with the development of tools and guidance for clinical practice.
Candida Pneumonia in Intensive Care Unit?
Schnabel, Ronny M.; Linssen, Catharina F.; Guion, Nele; van Mook, Walther N.; Bergmans, Dennis C.
2014-01-01
It has been questioned if Candida pneumonia exists as a clinical entity. Only histopathology can establish the definite diagnosis. Less invasive diagnostic strategies lack specificity and have been insufficiently validated. Scarcity of this pathomechanism and nonspecific clinical presentation make validation and the development of a clinical algorithm difficult. In the present study, we analyze whether Candida pneumonia exists in our critical care population. We used a bronchoalveolar lavage (BAL) specimen database that we have built in a structural diagnostic approach to ventilator-associated pneumonia for more than a decade consisting of 832 samples. Microbiological data were linked to clinical information and available autopsy data. We searched for critically ill patients with respiratory failure with no other microbiological or clinical explanation than exclusive presence of Candida species in BAL fluid. Five cases could be identified with Candida as the likely cause of pneumonia. PMID:25734099
Dugosh, Karen Leggett; Festinger, David S.; Croft, Jason R.; Marlowe, Douglas B.
2011-01-01
Despite many efforts aimed to ensure that research participation is autonomous and not coerced, there exists no reliable and valid measure of perceived coercion for the doubly vulnerable population of substance-abusing offenders. The current study describes the development and initial validation of an instrument measuring perceived coercion to participate in research among substance-abusing offenders. The results indicated that a substantial number of individuals report feeling coerced to participate in the study. In addition, the instrument has adequate levels of internal consistency, a one-dimensional factor structure, and evidence of discriminative validity. This study provides initial support for the instrument’s validity and clinical utility. PMID:20235867
ERIC Educational Resources Information Center
Chen, Chia-ling; Shen, I-hsuan; Chen, Chung-yao; Wu, Ching-yi; Liu, Wen-Yu; Chung, Chia-ying
2013-01-01
This study examined criterion-related validity and clinimetric properties of the pediatric balance scale ("PBS") in children with cerebral palsy (CP). Forty-five children with CP (age range: 19-77 months) and their parents participated in this study. At baseline and at follow up, Pearson correlation coefficients were used to determine…
ERIC Educational Resources Information Center
Japuntich, Sandra J.; Piper, Megan E.; Schlam, Tanya R.; Bolt, Daniel M.; Baker, Timothy B.
2009-01-01
Few studies have examined whether nicotine dependence self-report questionnaires can predict specific behaviors and symptoms at specific points in time. The present study used data from a randomized clinical trial (N = 608; M. E. Piper et al., 2007) to assess the construct validity of scales and items from 3 nicotine dependence measures: the…
ERIC Educational Resources Information Center
Maerlender, Arthur
2010-01-01
Auditory processing disorders (APDs) are of interest to educators and clinicians, as they impact school functioning. Little work has been completed to demonstrate how children with APDs perform on clinical tests. In a series of studies, standard clinical (psychometric) tests from the Wechsler Intelligence Scale for Children, Fourth Edition…
Rodriguez-Roisin, Roberto; Tetzlaff, Kay; Watz, Henrik; Wouters, Emiel FM; Disse, Bernd; Finnigan, Helen; Magnussen, Helgo; Calverley, Peter MA
2016-01-01
The WISDOM study (NCT00975195) reported a change in lung function following withdrawal of fluticasone propionate in patients with severe to very severe COPD treated with tiotropium and salmeterol. However, little is known about the validity of home-based spirometry measurements of lung function in COPD. Therefore, as part of this study, following suitable training, patients recorded daily home-based spirometry measurements in addition to undergoing periodic in-clinic spirometric testing throughout the study duration. We subsequently determined the validity of home-based spirometry for detecting changes in lung function by comparing in-clinic and home-based forced expiratory volume in 1 second in patients who underwent stepwise fluticasone propionate withdrawal over 12 weeks versus patients remaining on fluticasone propionate for 52 weeks. Bland–Altman analysis of these data confirmed good agreement between in-clinic and home-based measurements, both across all visits and at the individual visits at study weeks 6, 12, 18, and 52. There was a measurable difference between the forced expiratory volume in 1 second values recorded at home and in the clinic (mean difference of −0.05 L), which may be due to suboptimal patient effort in performing unsupervised recordings. However, this difference remained consistent over time. Overall, these data demonstrate that home-based and in-clinic spirometric measurements were equally valid and reliable for assessing lung function in patients with COPD, and suggest that home-based spirometry may be a useful tool to facilitate analysis of changes in lung function on a day-to-day basis. PMID:27578972
Fluorescence In Situ Hybridization Probe Validation for Clinical Use.
Gu, Jun; Smith, Janice L; Dowling, Patricia K
2017-01-01
In this chapter, we provide a systematic overview of the published guidelines and validation procedures for fluorescence in situ hybridization (FISH) probes for clinical diagnostic use. FISH probes-which are classified as molecular probes or analyte-specific reagents (ASRs)-have been extensively used in vitro for both clinical diagnosis and research. Most commercially available FISH probes in the United States are strictly regulated by the U.S. Food and Drug Administration (FDA), the Centers for Disease Control and Prevention (CDC), the Centers for Medicare & Medicaid Services (CMS) the Clinical Laboratory Improvement Amendments (CLIA), and the College of American Pathologists (CAP). Although home-brewed FISH probes-defined as probes made in-house or acquired from a source that does not supply them to other laboratories-are not regulated by these agencies, they too must undergo the same individual validation process prior to clinical use as their commercial counterparts. Validation of a FISH probe involves initial validation and ongoing verification of the test system. Initial validation includes assessment of a probe's technical specifications, establishment of its standard operational procedure (SOP), determination of its clinical sensitivity and specificity, development of its cutoff, baseline, and normal reference ranges, gathering of analytics, confirmation of its applicability to a specific research or clinical setting, testing of samples with or without the abnormalities that the probe is meant to detect, staff training, and report building. Ongoing verification of the test system involves testing additional normal and abnormal samples using the same method employed during the initial validation of the probe.
Khoury, Nabil T.; Hossain, Md Monir; Wootton, Susan H.; Salazar, Lucrecia; Hasbun, Rodrigo
2012-01-01
Objective To derive and validate a risk score for an adverse clinical outcome in adults with meningitis and a negative cerebrospinal fluid (CSF) Gram stain. Patients and Methods We conducted a retrospective study of 567 adults from Houston, Texas, with meningitis evaluated between January 1, 2005, and January 1, 2010. The patients were divided into derivation (N=292) and validation (N=275) cohorts. An adverse clinical outcome was defined as a Glasgow Outcome Scale score of 4 or less. Results Of the 567 patients, 62 (11%) had an adverse clinical outcome. A predictive model was created using 3 baseline variables that were independently associated with an adverse clinical outcome (P<.05): age greater than 60 years, abnormal findings on neurologic examination (altered mental status, focal neurologic deficits, or seizures), and CSF glucose level of less than 2.4975 mmol/L (to convert CSF glucose to mmol/L, multiply by 0.05551). The model classified patients into 2 categories of risk for an adverse clinical outcome—derivation sample: low risk, 0.6% and high risk, 32.8%; P<.001; and validation sample: low risk, 0.5% and high risk, 21.1%; P<.001. Conclusion Adults with meningitis and a negative CSF Gram stain can be accurately stratified for the risk of an adverse clinical outcome using clinical variables available at presentation. PMID:23218086
Petersen, John R; Stevenson, Heather L; Kasturi, Krishna S; Naniwadekar, Ashutosh; Parkes, Julie; Cross, Richard; Rosenberg, William M; Xiao, Shu-Yuan; Snyder, Ned
2014-04-01
The assessment of liver fibrosis in chronic hepatitis C patients is important for prognosis and making decisions regarding antiviral treatment. Although liver biopsy is considered the reference standard for assessing hepatic fibrosis in patients with chronic hepatitis C, it is invasive and associated with sampling and interobserver variability. Serum fibrosis markers have been utilized as surrogates for a liver biopsy. We completed a prospective study of 191 patients in which blood draws and liver biopsies were performed on the same visit. Using liver biopsies the sensitivity, specificity, and negative and positive predictive values for both aspartate aminotransferase/platelet ratio index (APRI) and enhanced liver fibrosis (ELF) were determined. The patients were divided into training and validation patient sets to develop and validate a clinically useful algorithm for differentiating mild and significant fibrosis. The area under the ROC curve for the APRI and ELF tests for the training set was 0.865 and 0.880, respectively. The clinical sensitivity in separating mild (F0-F1) from significant fibrosis (F2-F4) was 80% and 86.0% with a clinical specificity of 86.7% and 77.8%, respectively. For the validation sets the area under the ROC curve for the APRI and ELF tests was, 0.855 and 0.780, respectively. The clinical sensitivity of the APRI and ELF tests in separating mild (F0-F1) from significant (F2-F4) fibrosis for the validation set was 90.0% and 70.0% with a clinical specificity of 73.3% and 86.7%, respectively. There were no differences between the APRI and ELF tests in distinguishing mild from significant fibrosis for either the training or validation sets (P=0.61 and 0.20, respectively). Using APRI as the primary test followed by ELF for patients in the intermediate zone, would have decreased the number of liver biopsies needed by 40% for the validation set. Overall, use of our algorithm would have decreased the number of patients who needed a liver biopsy from 95 to 24-a 74.7% reduction. This study has shown that the APRI and ELF tests are equally accurate in distinguishing mild from significant liver fibrosis, and combining them into a validated algorithm improves their performance in distinguishing mild from significant fibrosis.
Validation of the Economic and Health Outcomes Model of Type 2 Diabetes Mellitus (ECHO-T2DM).
Willis, Michael; Johansen, Pierre; Nilsson, Andreas; Asseburg, Christian
2017-03-01
The Economic and Health Outcomes Model of Type 2 Diabetes Mellitus (ECHO-T2DM) was developed to address study questions pertaining to the cost-effectiveness of treatment alternatives in the care of patients with type 2 diabetes mellitus (T2DM). Naturally, the usefulness of a model is determined by the accuracy of its predictions. A previous version of ECHO-T2DM was validated against actual trial outcomes and the model predictions were generally accurate. However, there have been recent upgrades to the model, which modify model predictions and necessitate an update of the validation exercises. The objectives of this study were to extend the methods available for evaluating model validity, to conduct a formal model validation of ECHO-T2DM (version 2.3.0) in accordance with the principles espoused by the International Society for Pharmacoeconomics and Outcomes Research (ISPOR) and the Society for Medical Decision Making (SMDM), and secondarily to evaluate the relative accuracy of four sets of macrovascular risk equations included in ECHO-T2DM. We followed the ISPOR/SMDM guidelines on model validation, evaluating face validity, verification, cross-validation, and external validation. Model verification involved 297 'stress tests', in which specific model inputs were modified systematically to ascertain correct model implementation. Cross-validation consisted of a comparison between ECHO-T2DM predictions and those of the seminal National Institutes of Health model. In external validation, study characteristics were entered into ECHO-T2DM to replicate the clinical results of 12 studies (including 17 patient populations), and model predictions were compared to observed values using established statistical techniques as well as measures of average prediction error, separately for the four sets of macrovascular risk equations supported in ECHO-T2DM. Sub-group analyses were conducted for dependent vs. independent outcomes and for microvascular vs. macrovascular vs. mortality endpoints. All stress tests were passed. ECHO-T2DM replicated the National Institutes of Health cost-effectiveness application with numerically similar results. In external validation of ECHO-T2DM, model predictions agreed well with observed clinical outcomes. For all sets of macrovascular risk equations, the results were close to the intercept and slope coefficients corresponding to a perfect match, resulting in high R 2 and failure to reject concordance using an F test. The results were similar for sub-groups of dependent and independent validation, with some degree of under-prediction of macrovascular events. ECHO-T2DM continues to match health outcomes in clinical trials in T2DM, with prediction accuracy similar to other leading models of T2DM.
Measuring Practitioner Attitudes toward Evidence-Based Treatments: A Validation Study
ERIC Educational Resources Information Center
Ashcraft, Rindee G. P.; Foster, Sharon L.; Lowery, Amy E.; Henggeler, Scott W.; Chapman, Jason E.; Rowland, Melisa D.
2011-01-01
A better understanding of clinicians' attitudes toward evidence-based treatments (EBT) will presumably enhance the transfer of EBTs for substance-abusing adolescents from research to clinical application. The reliability and validity of two measures of therapist attitudes toward EBT were examined: the Evidence-Based Practice Attitude Scale…
Grades of Severity and the Validation of an Atopic Dermatitis Assessment Measure (ADAM).
ERIC Educational Resources Information Center
Charman, Denise P.; Varigos, George A.
1999-01-01
Studied the validity of the Atopic Dermatitis Assessment Measure (D. Charman and others, 1999) with 171 pediatric patients in Australia using partial credit analyses to produce clinically relevant "word pictures" of grades of severity for atopic dermatitis. Discusses implications for measurement in medicine. (SLD)
The Predictive Validity of Projective Measures.
ERIC Educational Resources Information Center
Suinn, Richard M.; Oskamp, Stuart
Written for use by clinical practitioners as well as psychological researchers, this book surveys recent literature (1950-1965) on projective test validity by reviewing and critically evaluating studies which shed light on what may reliably be predicted from projective test results. Two major instruments are covered: the Rorschach and the Thematic…
Feasibility and validity of computerized ambulatory monitoring in stroke patients.
Johnson, E I; Sibon, I; Renou, P; Rouanet, F; Allard, M; Swendsen, J
2009-11-10
Computerized ambulatory monitoring provides real-time assessments of clinical outcomes in natural contexts, and it has been increasingly applied in recent years to investigate symptom expression in a wide range of disorders. The purpose of this study was to examine the feasibility and validity of this data collection strategy with adult stroke patients. Forty-eight individuals (75% of the contacted sample) agreed to participate in the current study and were instructed to complete electronic interviews using a personal digital assistant 5 times per day over a 1-week period. More than 80% of programmed assessments were completed by the sample, and no evidence was found for fatigue effects. Expected patterns of associations were observed among daily life variables, and data collected through ambulatory monitoring were significantly correlated with standard clinic-based measures of similar constructs. Support was found for the feasibility and validity of computerized ambulatory monitoring with stroke patients. The application of these novel methods with stroke patients should provide complementary information that is inaccessible to standard hospital-based assessments and permit increased understanding of the significance of clinical results and test scores for daily life experience.
Crawford, Thomas N; Cohen, Patricia; Johnson, Jeffrey G; Kasen, Stephanie; First, Michael B; Gordon, Kathy; Brook, Judith S
2005-02-01
Approximately 800 youths from the Children in the Community Study (Cohen & Cohen, 1996) have been assessed prospectively for over 20 years to study personality disorders (PDs) in adolescents and young adults. In this article we evaluate the Children in the Community Self-Report (CIC-SR) Scales, which were designed to assess DSM-IV PDs using self-reported prospective data from this longitudinal sample. To evaluate convergent validity, we assessed concordance between the CIC-SR Scales and the Structured Clinical Interview for DSM-IV Personality Disorders (SCID-II; First, Gibbon, Spitzer, Williams, & Benjamin, 1995) in 644 participants at mean age 33. To assess predictive validity, we used CIC-SR Scales at mean age 22 to predict subsequent CIC-SR and SCID-II Personality Questionnaire scores at mean age 33. In these analyses the CIC-SR Scales matched or exceeded benchmarks established in previous comparisons between self-report instruments and structured clinical interviews. Unlike other self-report scales, the CIC-SR did not appear to overestimate diagnoses when compared with SCID-II clinical diagnoses.
Penny, Katherine; Barron, Alex; Higgins, Ann-Marie; Gee, Susan; Croucher, Matthew; Cheung, Gary
2016-09-19
Depression Rating Scale (DRS) is one of the clinical outcome measures of the International Resident Assessment Instrument (interRAI) assessment. The primary aim of this study is to investigate the diagnostic accuracy and concurrent validity of the 3-day assessment window version of the DRS. The performance of DRS was compared with a gold standard clinical diagnosis of depression in 92 patients (age ≥65) who had interRAI version 9.1 Home Care assessment completed within 30 days of discharge from psychogeriatric inpatient care or memory clinic assessment. The DRS had poor diagnostic accuracy for depression diagnosis with an area under the curve of 0.68 (95% confidence interval [CI] = 0.57-0.77). The DRS score had a poor to moderate correlation with the Health of the Nation Outcome Scale 65+ depression item score (r s = 0.30, 95% CI = 0.09-0.48, P = .006). This study and the existing literature raise concerns that the DRS is not an adequate measure of depression. © The Author(s) 2016.
Educating psychotherapy supervisors.
Watkins, C Edward
2012-01-01
What do we know clinically and empirically about the education of psychotherapy supervisors? In this paper, I attempt to address that question by: (1) reviewing briefly current thinking about psychotherapy supervisor training; and (2) examining the available research where supervisor training and supervision have been studied. The importance of such matters as training format and methods, supervision topics for study, supervisor development, and supervisor competencies are considered, and some prototypical, competency-based supervisor training programs that hold educational promise are identified and described. Twenty supervisor training studies are critiqued, and their implications for practice and research are examined. Based on this review of training programs and research, the following conclusions are drawn: (1) the clinical validity of supervisor education appears to be strong, solid, and sound, (2) although research suggests that supervisor training can have value in stimulating the development of supervisor trainees and better preparing them for the supervisory role, any such base of empirical support or validity should be regarded as tentative at best; and (3) the most formidable challenge for psychotherapy supervisor education may well be correcting the imbalance that currently exists between clinical and empirical validity and "raising the bar" on the rigor, relevance, and replicability of future supervisor training research.
Crona, D J; Ramirez, J; Qiao, W; de Graan, A-J; Ratain, M J; van Schaik, R H N; Mathijssen, R H J; Rosner, G L; Innocenti, F
2016-02-01
The overall goal of this study was to provide evidence for the clinical validity of nine genetic variants in five genes previously associated with irinotecan neutropenia and pharmacokinetics. Variants associated with absolute neutrophil count (ANC) nadir and/or irinotecan pharmacokinetics in a discovery cohort of cancer patients were genotyped in an independent replication cohort of 108 cancer patients. Patients received single-agent irinotecan every 3 weeks. For ANC nadir, we replicated UGT1A1*28, UGT1A1*93 and SLCO1B1*1b in univariate analyses. For irinotecan area under the concentration-time curve (AUC0-24), we replicated ABCC2 -24C>T; however, ABCC2 -24C>T only predicted a small fraction of the variance. For SN-38 AUC0-24 and the glucuronidation ratio, we replicated UGT1A1*28 and UGT1A1*93. In addition to UGT1A1*28, this study independently validated UGT1A1*93 and SLCO1B1*1b as new predictors of irinotecan neutropenia. Further demonstration of their clinical utility will optimize irinotecan therapy in cancer patients.
Validity and repeatability of inertial measurement units for measuring gait parameters.
Washabaugh, Edward P; Kalyanaraman, Tarun; Adamczyk, Peter G; Claflin, Edward S; Krishnan, Chandramouli
2017-06-01
Inertial measurement units (IMUs) are small wearable sensors that have tremendous potential to be applied to clinical gait analysis. They allow objective evaluation of gait and movement disorders outside the clinic and research laboratory, and permit evaluation on large numbers of steps. However, repeatability and validity data of these systems are sparse for gait metrics. The purpose of this study was to determine the validity and between-day repeatability of spatiotemporal metrics (gait speed, stance percent, swing percent, gait cycle time, stride length, cadence, and step duration) as measured with the APDM Opal IMUs and Mobility Lab system. We collected data on 39 healthy subjects. Subjects were tested over two days while walking on a standard treadmill, split-belt treadmill, or overground, with IMUs placed in two locations: both feet and both ankles. The spatiotemporal measurements taken with the IMU system were validated against data from an instrumented treadmill, or using standard clinical procedures. Repeatability and minimally detectable change (MDC) of the system was calculated between days. IMUs displayed high to moderate validity when measuring most of the gait metrics tested. Additionally, these measurements appear to be repeatable when used on the treadmill and overground. The foot configuration of the IMUs appeared to better measure gait parameters; however, both the foot and ankle configurations demonstrated good repeatability. In conclusion, the IMU system in this study appears to be both accurate and repeatable for measuring spatiotemporal gait parameters in healthy young adults. Copyright © 2017 Elsevier B.V. All rights reserved.
Dobbin, Kevin K; Cesano, Alessandra; Alvarez, John; Hawtin, Rachael; Janetzki, Sylvia; Kirsch, Ilan; Masucci, Giuseppe V; Robbins, Paul B; Selvan, Senthamil R; Streicher, Howard Z; Zhang, Jenny; Butterfield, Lisa H; Thurin, Magdalena
2016-01-01
There is growing recognition that immunotherapy is likely to significantly improve health outcomes for cancer patients in the coming years. Currently, while a subset of patients experience substantial clinical benefit in response to different immunotherapeutic approaches, the majority of patients do not but are still exposed to the significant drug toxicities. Therefore, a growing need for the development and clinical use of predictive biomarkers exists in the field of cancer immunotherapy. Predictive cancer biomarkers can be used to identify the patients who are or who are not likely to derive benefit from specific therapeutic approaches. In order to be applicable in a clinical setting, predictive biomarkers must be carefully shepherded through a step-wise, highly regulated developmental process. Volume I of this two-volume document focused on the pre-analytical and analytical phases of the biomarker development process, by providing background, examples and "good practice" recommendations. In the current Volume II, the focus is on the clinical validation, validation of clinical utility and regulatory considerations for biomarker development. Together, this two volume series is meant to provide guidance on the entire biomarker development process, with a particular focus on the unique aspects of developing immune-based biomarkers. Specifically, knowledge about the challenges to clinical validation of predictive biomarkers, which has been gained from numerous successes and failures in other contexts, will be reviewed together with statistical methodological issues related to bias and overfitting. The different trial designs used for the clinical validation of biomarkers will also be discussed, as the selection of clinical metrics and endpoints becomes critical to establish the clinical utility of the biomarker during the clinical validation phase of the biomarker development. Finally, the regulatory aspects of submission of biomarker assays to the U.S. Food and Drug Administration as well as regulatory considerations in the European Union will be covered.
Hemphälä, Malin; Gustavsson, J Petter; Tengström, Anders
2013-01-01
The aim was to study the validity of 2 personality instruments, the Health-Relevant Personality Inventory (HP5i) and the Junior Temperament and Character Inventory (JTCI), among adolescents with a substance use problem. Clinical interviews were completed with 180 adolescents and followed up after 12 months. Discriminant validity was demonstrated in the lack of correlation to intelligence in both instruments' scales. Two findings were in support of convergent validity: Negative affectivity (HP5i) and harm avoidance (JTCI) were correlated to internalizing symptoms, and impulsivity (HP5i) and novelty seeking (JTCI) were correlated to externalizing symptoms. The predictive validity of JTCI was partly supported. When psychiatric symptoms at baseline were controlled for, cooperativeness predicted conduct disorder after 12 months. Summarizing, both instruments can be used in adolescent clinical samples to tailor treatment efforts, although some scales need further investigation. It is important to include personality assessment when evaluating psychiatric problems in adolescents.
Validation of the Communication Skills Questionnaire (CSQ) in people with schizophrenia.
Prat, Gemma; Casas-Anguera, Emma; Garcia-Franco, Mar; Escandell, Maria José; Martin, José Ramón; Vilamala, Sonia; Villalta-Gil, Victoria; Gimenez-Salinas, Jordi; Hernández-Rambla, Carla; Ochoa, Susana
2014-12-15
This present study describes the validation of the Communication Skills Questionnaire (CSQ) in people with schizophrenia. A total of 125 clinically stable people in rehabilitation treatment who were diagnosed with schizophrenia were included. For convergent and discriminant validity the following tests were administered; the Gambrill and Richie (GR) Assertiveness Inventory, the Social Functioning Scale (SFS), Life Skills Profile (LSP), Clinical Global Impression scale for schizophrenia (CGI-S) and the Global Assessment of Functioning (GAF) scale. Internal consistency of the CSQ had a Cronbach׳s alpha of 0.96. Test-retest reliability showed coefficients between 0.60 and 0.70. Convergent validity showed significant relations at p<0.0001 for all instruments assessed. None of the subscales used for assessing discriminant validity showed a significant correlation with the CSQ except for the CGI-S depression subscale. The instrument shows good psychometric properties and demonstrates that it is a useful instrument for evaluating communication skills in people with schizophrenia. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
Riecher-Rössler, A; Aston, J; Ventura, J; Merlo, M; Borgwardt, S; Gschwandtner, U; Stieglitz, R-D
2008-04-01
Early detection of psychosis is of growing clinical importance. So far there is, however, no screening instrument for detecting individuals with beginning psychosis in the atypical early stages of the disease with sufficient validity. We have therefore developed the Basel Screening Instrument for Psychosis (BSIP) and tested its feasibility, interrater-reliability and validity. Aim of this paper is to describe the development and structure of the instrument, as well as to report the results of the studies on reliability and validity. The instrument was developed based on a comprehensive search of literature on the most important risk factors and early signs of schizophrenic psychoses. The interraterreliability study was conducted on 24 psychiatric cases. Validity was tested based on 206 individuals referred to our early detection clinic from 3/1/2000 until 2/28/2003. We identified seven categories of relevance for early detection of psychosis and used them to construct a semistructured interview. Interrater-reliability for high risk individuals was high (Kappa .87). Predictive validity was comparable to other, more comprehensive instruments: 16 (32 %) of 50 individuals classified as being at risk for psychosis by the BSIP have in fact developed frank psychosis within an follow-up period of two to five years. The BSIP is the first screening instrument for the early detection of psychosis which has been validated based on transition to psychosis. The BSIP is easy to use by experienced psychiatrists and has a very good interrater-reliability and predictive validity.
Bidell, Markus P
2017-01-01
These three studies provide initial evidence for the development, factor structure, reliability, and validity of the Lesbian, Gay, Bisexual, and Transgender Development of Clinical Skills Scale (LGBT-DOCSS), a new interdisciplinary LGBT clinical self-assessment for health and mental health providers. Research participants were voluntarily recruited in the United States and United Kingdom and included trainees, clinicians, and educators from applied psychology, counseling, psychotherapy, and primary care medicine. Study 1 (N = 602) used exploratory and confirmatory factor analytic techniques, revealing an 18-item three-factor structure (Clinical Preparedness, Attitudinal Awareness, and Basic Knowledge). Study 2 established internal consistency for the overall LGBT-DOCSS (α = .86) and for each of the three subscales (Clinical Preparedness = .88, Attitudinal Awareness = .80, and Basic Knowledge = .83) and 2-week test-retest reliability (.87). In study 3 (N = 564), participant criteria (sexual orientation and education level) and four established scales that measured LGBT prejudice, assessment skills, and social desirability were used to support initial content and discriminant validity. Psychometric properties, limitations, and recommendations are discussed.
Self-efficacy in weight management.
Clark, M M; Abrams, D B; Niaura, R S; Eaton, C A; Rossi, J S
1991-10-01
Self-efficacy is an important mediating mechanism in advancing understanding of the treatment of obesity. This study developed and validated the Weight Efficacy Life-Style Questionnaire (WEL), improving on previous studies by the use of clinical populations, cross-validation of the initial factor analysis, exploration of the best fitting theoretical model of self-efficacy, and examination of change in treatment. The resulting 20-item WEL consists of five situational factors: Negative Emotions, Availability, Social Pressure, Physical Discomfort, and Positive Activities. A hierarchical model was found to provide the best fit to the data. Results from two separate clinical treatment studies (total N = 382) show that the WEL is sensitive to changes in global scores as well as to a subset of the five situational factor scores. Treatment programs may be incomplete if they change only a subset of the situational dimensions of self-efficacy. Theoretical and clinical implications are discussed.
Gismervik, Sigmund Ø; Drogset, Jon O; Granviken, Fredrik; Rø, Magne; Leivseth, Gunnar
2017-01-25
Physical examination tests of the shoulder (PETS) are clinical examination maneuvers designed to aid the assessment of shoulder complaints. Despite more than 180 PETS described in the literature, evidence of their validity and usefulness in diagnosing the shoulder is questioned. This meta-analysis aims to use diagnostic odds ratio (DOR) to evaluate how much PETS shift overall probability and to rank the test performance of single PETS in order to aid the clinician's choice of which tests to use. This study adheres to the principles outlined in the Cochrane guidelines and the PRISMA statement. A fixed effect model was used to assess the overall diagnostic validity of PETS by pooling DOR for different PETS with similar biomechanical rationale when possible. Single PETS were assessed and ranked by DOR. Clinical performance was assessed by sensitivity, specificity, accuracy and likelihood ratio. Six thousand nine-hundred abstracts and 202 full-text articles were assessed for eligibility; 20 articles were eligible and data from 11 articles could be included in the meta-analysis. All PETS for SLAP (superior labral anterior posterior) lesions pooled gave a DOR of 1.38 [1.13, 1.69]. The Supraspinatus test for any full thickness rotator cuff tear obtained the highest DOR of 9.24 (sensitivity was 0.74, specificity 0.77). Compression-Rotation test obtained the highest DOR (6.36) among single PETS for SLAP lesions (sensitivity 0.43, specificity 0.89) and Hawkins test obtained the highest DOR (2.86) for impingement syndrome (sensitivity 0.58, specificity 0.67). No single PETS showed superior clinical test performance. The clinical performance of single PETS is limited. However, when the different PETS for SLAP lesions were pooled, we found a statistical significant change in post-test probability indicating an overall statistical validity. We suggest that clinicians choose their PETS among those with the highest pooled DOR and to assess validity to their own specific clinical settings, review the inclusion criteria of the included primary studies. We further propose that future studies on the validity of PETS use randomized research designs rather than the accuracy design relying less on well-established gold standard reference tests and efficient treatment options.
Risk score to predict gastrointestinal bleeding after acute ischemic stroke.
Ji, Ruijun; Shen, Haipeng; Pan, Yuesong; Wang, Penglian; Liu, Gaifen; Wang, Yilong; Li, Hao; Singhal, Aneesh B; Wang, Yongjun
2014-07-25
Gastrointestinal bleeding (GIB) is a common and often serious complication after stroke. Although several risk factors for post-stroke GIB have been identified, no reliable or validated scoring system is currently available to predict GIB after acute stroke in routine clinical practice or clinical trials. In the present study, we aimed to develop and validate a risk model (acute ischemic stroke associated gastrointestinal bleeding score, the AIS-GIB score) to predict in-hospital GIB after acute ischemic stroke. The AIS-GIB score was developed from data in the China National Stroke Registry (CNSR). Eligible patients in the CNSR were randomly divided into derivation (60%) and internal validation (40%) cohorts. External validation was performed using data from the prospective Chinese Intracranial Atherosclerosis Study (CICAS). Independent predictors of in-hospital GIB were obtained using multivariable logistic regression in the derivation cohort, and β-coefficients were used to generate point scoring system for the AIS-GIB. The area under the receiver operating characteristic curve (AUROC) and the Hosmer-Lemeshow goodness-of-fit test were used to assess model discrimination and calibration, respectively. A total of 8,820, 5,882, and 2,938 patients were enrolled in the derivation, internal validation and external validation cohorts. The overall in-hospital GIB after AIS was 2.6%, 2.3%, and 1.5% in the derivation, internal, and external validation cohort, respectively. An 18-point AIS-GIB score was developed from the set of independent predictors of GIB including age, gender, history of hypertension, hepatic cirrhosis, peptic ulcer or previous GIB, pre-stroke dependence, admission National Institutes of Health stroke scale score, Glasgow Coma Scale score and stroke subtype (Oxfordshire). The AIS-GIB score showed good discrimination in the derivation (0.79; 95% CI, 0.764-0.825), internal (0.78; 95% CI, 0.74-0.82) and external (0.76; 95% CI, 0.71-0.82) validation cohorts. The AIS-GIB score was well calibrated in the derivation (P = 0.42), internal (P = 0.45) and external (P = 0.86) validation cohorts. The AIS-GIB score is a valid clinical grading scale to predict in-hospital GIB after AIS. Further studies on the effect of the AIS-GIB score on reducing GIB and improving outcome after AIS are warranted.
The development of an instrument to match individuals with disabilities and service animals.
Zapf, S A; Rough, R B
There has been an increase in the use of service animals assisting persons with disabilities in the past decade. However many of the service dog agencies do not utilize an assessment that is designed to match the person to the animal in the rehabilitation and psycho-social domains. The purpose of this study was to develop the Service Animal Adaptive Intervention Assessment (SAAIA) and to measure the content validity, inter-rater reliability and clinical utility of the assessment. Two subject groups were used. Subject group one had 43 subjects who measured the content validity and clinical utility of the SAAIA Survey. Subject group two had 12 subjects who measured the inter-rater reliability by completing the SAAIA using information obtained through a video-taped client case scenario. Content validity results indicated a good to high percentage of agreement and a fair percentage of agreement for clinical utility. Inter-rater reliability results indicate good to high agreement on six of the eight variables of the SAAIA. However, the Kappa score indicates low inter-rater reliability. Results indicate the SAAIA has good content validity and inter-rater reliability and fair clinical utility based on percent agreement. However, further research is needed on the reliability of the SAAIA.
Inertial Measurement Units for Clinical Movement Analysis: Reliability and Concurrent Validity
Nicholas, Kevin; Sparkes, Valerie; Sheeran, Liba; Davies, Jennifer L
2018-01-01
The aim of this study was to investigate the reliability and concurrent validity of a commercially available Xsens MVN BIOMECH inertial-sensor-based motion capture system during clinically relevant functional activities. A clinician with no prior experience of motion capture technologies and an experienced clinical movement scientist each assessed 26 healthy participants within each of two sessions using a camera-based motion capture system and the MVN BIOMECH system. Participants performed overground walking, squatting, and jumping. Sessions were separated by 4 ± 3 days. Reliability was evaluated using intraclass correlation coefficient and standard error of measurement, and validity was evaluated using the coefficient of multiple correlation and the linear fit method. Day-to-day reliability was generally fair-to-excellent in all three planes for hip, knee, and ankle joint angles in all three tasks. Within-day (between-rater) reliability was fair-to-excellent in all three planes during walking and squatting, and poor-to-high during jumping. Validity was excellent in the sagittal plane for hip, knee, and ankle joint angles in all three tasks and acceptable in frontal and transverse planes in squat and jump activity across joints. Our results suggest that the MVN BIOMECH system can be used by a clinician to quantify lower-limb joint angles in clinically relevant movements. PMID:29495600
Ersoy, Mehmet Akif; Varan, Azmi
2007-01-01
The aim of this study was to evaluate the reliability and validity of the Turkish version of the Internalized Stigma of Mental Illness Scale (ISMI) in patients with psychiatric disorders. The study included 203 patients diagnosed with various psychiatric disorders in a psychiatry outpatient clinic of a university hospital. The reliability of the scale was assessed by investigation of its internal consistency and split-half reliability. The convergent validity of the scale was demonstrated by the relationship between the Turkish form of the ISMI and various criteria scales. Cronbach's alpha value was 0.93 for the entire scale and ranged between 0.63 and 0.87 for the 5 subscales of the ISMI. In terms of convergent validity, the total score of the Turkish ISMI significantly correlated with the Beck Depression Inventory, Rosenberg Self-Esteem Scale, Sociotropy-Autonomy Scale, Brief Symptom Inventory, Multidimensional Scale of Perceived Social Support, Clinical Global Impression Scale, and Global Assessment of Functioning Scale scores. All values were in the expected direction. In the light of the findings, it was concluded that the Turkish version of ISMI could be used as a reliable and valid tool in assessing internalized stigma of the Turkish psychiatric patients.
MacRae, Jayden; Love, Tom; Baker, Michael G; Dowell, Anthony; Carnachan, Matthew; Stubbe, Maria; McBain, Lynn
2015-10-06
We designed and validated a rule-based expert system to identify influenza like illness (ILI) from routinely recorded general practice clinical narrative to aid a larger retrospective research study into the impact of the 2009 influenza pandemic in New Zealand. Rules were assessed using pattern matching heuristics on routine clinical narrative. The system was trained using data from 623 clinical encounters and validated using a clinical expert as a gold standard against a mutually exclusive set of 901 records. We calculated a 98.2 % specificity and 90.2 % sensitivity across an ILI incidence of 12.4 % measured against clinical expert classification. Peak problem list identification of ILI by clinical coding in any month was 9.2 % of all detected ILI presentations. Our system addressed an unusual problem domain for clinical narrative classification; using notational, unstructured, clinician entered information in a community care setting. It performed well compared with other approaches and domains. It has potential applications in real-time surveillance of disease, and in assisted problem list coding for clinicians. Our system identified ILI presentation with sufficient accuracy for use at a population level in the wider research study. The peak coding of 9.2 % illustrated the need for automated coding of unstructured narrative in our study.
Esbensen, A J; Hoffman, E K; Stansberry, E; Shaffer, R
2018-04-01
There is a need for rigorous measures of sleep in children with Down syndrome as sleep is a substantial problem in this population and there are barriers to obtaining the gold standard polysomnography (PSG). PSG is cost-prohibitive when measuring treatment effects in some clinical trials, and children with Down syndrome may not cooperate with undergoing a PSG. Minimal information is available on the validity of alternative methods of assessing sleep in children with Down syndrome, such as actigraphy and parent ratings. Our study examined the concurrent and convergent validity of different measures of sleep, including PSG, actigraphy and parent reports of sleep among children with Down syndrome. A clinic (n = 27) and a community (n = 47) sample of children with Down syndrome were examined. In clinic, children with Down syndrome wore an actigraph watch during a routine PSG. In the community, children with Down syndrome wore an actigraph watch for a week at home at night as part of a larger study on sleep and behaviour. Their parent completed ratings of the child's sleep during that same week. Actigraph watches demonstrated convergent validity with PSG when measuring a child with Down syndrome's total amount of sleep time, total wake time after sleep onset and sleep period efficiency. In contrast, actigraph watches demonstrated poor correlations with parent reports of sleep, and with PSG when measuring the total time in bed and total wake episodes. Actigraphy, PSG and parent ratings of sleep demonstrated poor concurrent validity with clinical diagnosis of obstructive sleep apnoea. Our current data suggest that actigraph watches demonstrate convergent validity and are sensitive to measuring certain sleep constructs (duration, efficiency) in children with Down syndrome. However, parent reports, such as the Children's Sleep Habits Questionnaire, may be measuring other sleep constructs. These findings highlight the importance of selecting measures of sleep related to target concerns. © 2018 MENCAP and International Association of the Scientific Study of Intellectual and Developmental Disabilities and John Wiley & Sons Ltd.
Validation of a clinical leadership qualities framework for managers in aged care: a Delphi study.
Jeon, Yun-Hee; Conway, Jane; Chenoweth, Lynn; Weise, Janelle; Thomas, Tamsin Ht; Williams, Anna
2015-04-01
To establish validity of a clinical leadership framework for aged care middle managers (The Aged care Clinical Leadership Qualities Framework). Middle managers in aged care have responsibility not only for organisational governance also and operational management but also quality service delivery. There is a need to better define clinical leadership abilities in aged care middle managers, in order to optimise their positional authority to lead others to achieve quality outcomes. A Delphi method. Sixty-nine experts in aged care were recruited, representing rural, remote and metropolitan community and residential aged care settings. Panellists were asked to rate the proposed framework in terms of the relevance and importance of each leadership quality using four-point Likert scales, and to provide comments. Three rounds of consultation were conducted. The number and corresponding percentage of the relevance and importance rating for each quality was calculated for each consultation round, as well as mean scores. Consensus was determined to be reached when a percentage score reached 70% or greater. Twenty-three panellists completed all three rounds of consultation. Following the three rounds of consultation, the acceptability and face validity of the framework was confirmed. The study confirmed the framework as useful in identifying leadership requirements for middle managers in Australian aged care settings. The framework is the first validated framework of clinical leadership attributes for middle managers in aged care and offers an initial step forward in clarifying the aged care middle manager role. The framework provides clarity in the breadth of role expectations for the middle managers and can be used to inform an aged care specific leadership program development, individuals' and organisations' performance and development processes; and policy and guidelines about the types of activities required of middle managers in aged care. © 2014 John Wiley & Sons Ltd.
Hooshmand, Elaheh; Tourani, Sogand; Ravaghi, Hamid; Vafaee Najar, Ali; Meraji, Marziye; Ebrahimipour, Hossein
2015-04-08
The purpose of implementing a system such as Clinical Governance (CG) is to integrate, establish and globalize distinct policies in order to improve quality through increasing professional knowledge and the accountability of healthcare professional toward providing clinical excellence. Since CG is related to change, and change requires money and time, CG implementation has to be focused on priority areas that are in more dire need of change. The purpose of the present study was to validate and determine the significance of items used for evaluating CG implementation. The present study was descriptive-quantitative in method and design. Items used for evaluating CG implementation were first validated by the Delphi method and then compared with one another and ranked based on the Analytical Hierarchy Process (AHP) model. The items that were validated for evaluating CG implementation in Iran include performance evaluation, training and development, personnel motivation, clinical audit, clinical effectiveness, risk management, resource allocation, policies and strategies, external audit, information system management, research and development, CG structure, implementation prerequisites, the management of patients' non-medical needs, complaints and patients' participation in the treatment process. The most important items based on their degree of significance were training and development, performance evaluation, and risk management. The least important items included the management of patients' non-medical needs, patients' participation in the treatment process and research and development. The fundamental requirements of CG implementation included having an effective policy at national level, avoiding perfectionism, using the expertise and potentials of the entire country and the coordination of this model with other models of quality improvement such as accreditation and patient safety. © 2015 by Kerman University of Medical Sciences.
Ito, Masaya; Bentley, Kate H.; Oe, Yuki; Nakajima, Shun; Fujisato, Hiroko; Kato, Noriko; Miyamae, Mitsuhiro; Kanie, Ayako; Horikoshi, Masaru; Barlow, David H.
2015-01-01
Background The Overall Depression Severity and Impairment Scale (ODSIS) is a brief, five-item measure for assessing the frequency and intensity of depressive symptoms, as well as functional impairments in pleasurable activities, work or school, and interpersonal relationships due to depression. Although this scale is expected to be useful in various psychiatric and mental health settings, the reliability, validity, and interpretability have not yet been fully examined. This study was designed to examine the reliability, factorial, convergent, and discriminant validity of a Japanese version of the ODSIS, as well as its ability to distinguish between individuals with and without a major depressive disorder diagnosis. Methods From a pool of registrants at an internet survey company, 2830 non-clinical and clinical participants were selected randomly (619 with major depressive disorder, 619 with panic disorder, 576 with social anxiety disorder, 645 with obsessive–compulsive disorder, and 371 non-clinical panelists). Participants were asked to respond to the ODSIS and conventional measures of depression, functional impairment, anxiety, neuroticism, satisfaction with life, and emotion regulation. Results Exploratory and confirmatory factor analysis of three split subsamples indicated the unidimensional factor structure of ODSIS. Multi-group confirmatory factor analysis showed invariance of factor loadings between non-clinical and clinical subsamples. The ODSIS also showed excellent internal consistency and test–retest intraclass correlation coefficients. Convergence and discriminance of the ODSIS with various measures were in line with our expectations. Receiver operating characteristic curve analyses showed that the ODSIS was able to detect a major depressive syndrome accurately. Conclusions This study supports the reliability and validity of ODSIS in a non-western population, which can be interpreted as demonstrating cross-cultural validity. PMID:25874558
NIKE: a new clinical tool for establishing levels of indications for cataract surgery.
Lundström, Mats; Albrecht, Susanne; Håkansson, Ingemar; Lorefors, Ragnhild; Ohlsson, Sven; Polland, Werner; Schmid, Andrea; Svensson, Göran; Wendel, Eva
2006-08-01
The purpose of this study was to construct a new clinical tool for establishing levels of indications for cataract surgery, and to validate this tool. Teams from nine eye clinics reached an agreement about the need to develop a clinical tool for setting levels of indications for cataract surgery and about the items that should be included in the tool. The tool was to be called 'NIKE' (Nationell Indikationsmodell för Kataraktextraktion). The Canadian Cataract Priority Criteria Tool served as a model for the NIKE tool, which was modified for Swedish conditions. Items included in the tool were visual acuity of both eyes, patients' perceived difficulties in day-to-day life, cataract symptoms, the ability to live independently, and medical/ophthalmic reasons for surgery. The tool was validated and tested in 343 cataract surgery patients. Validity, stability and reliability were tested and the outcome of surgery was studied in relation to the indication setting. Four indication groups (IGs) were suggested. The group with the greatest indications for surgery was named group 1 and that with the lowest, group 4. Validity was proved to be good. Surgery had the greatest impact on the group with the highest indications for surgery. Test-retest reliability test and interexaminer tests of indication settings showed statistically significant intraclass correlations (intraclass correlation coefficients [ICCs] 0.526 and 0.923, respectively). A new clinical tool for indication setting in cataract surgery is presented. This tool, the NIKE, takes into account both visual acuity and the patient's perceived problems in day-to-day life because of cataract. The tool seems to be stable and reliable and neutral towards different examiners.
Therapeutic Misconception in Research Subjects: Development and Validation of a Measure
Appelbaum, Paul S.; Anatchkova, Milena; Albert, Karen; Dunn, Laura B.; Lidz, Charles W.
2013-01-01
Background Therapeutic misconception (TM), which occurs when research subjects fail to appreciate the distinction between the imperatives of clinical research and ordinary treatment, may undercut the process of obtaining meaningful consent to clinical research participation. Previous studies have found TM is widespread, but progress in addressing TM has been stymied by the absence of a validated method for assessing its presence. Purpose The goal of this study was to develop and validate a theoretically grounded measure of TM, assess its diagnostic accuracy, and test previous findings regarding its prevalence. Methods 220 participants were recruited from clinical trials at 4 academic medical centers in the U.S. Participants completed a 28-item Likert-type questionnaire to assess the presence of beliefs associated with TM, and a semi-structured TM interview designed to elicit their perceptions of the nature of the clinical trial in which they were participating. Data from the questionnaires were subjected to factor analysis and items with poor factor loadings were excluded. This resulted in a 10-item scale, with 3 strongly correlated factors and excellent internal consistency; the fit indices of the model across 10 training sets were consistent with the original results, suggesting a stable factor solution. Results The scale was validated against the TM interview, with significantly higher scores among subjects coded as displaying evidence of TM. ROC analysis based on a 10-fold internal cross-validation yielded AUC=.682 for any evidence of TM. When sensitivity (0.72) and specificity (0.61) were both optimized, Positive Predictive Value was 0.65 and Negative Predictive Value was 0.68, with a Positive Likelihood Ratio of 1.89, and a Negative Likelihood Ratio of 0.47. 50.5% (n=101) of participants manifested evidence of TM on the TM interview, a somewhat lower rate than in most previous studies. Limitations The predictive value of the scale compared with the “gold standard” clinical interview is modest, although similar to other instruments based on self-report assessing states of mind rather than discrete symptoms. Thus, although the scale can offer evidence of which subjects are at risk for distortions in their decisions and to what degree, it will not allow researchers to conclude definitively that TM is present in a given subject. Conclusions The development of a reliable and valid TM scale, even with modest predictive power, should permit investigators in clinical trials to identify subjects with tendencies to misinterpret the nature of the situation and to provide additional information to them. It should also stimulate research on how best to decrease TM and facilitate meaningful informed consent to clinical research. PMID:22942217
Keil, Lukas G; Platts-Mills, Timothy F; Jones, Christopher W
2015-10-01
Publication bias compromises the validity of systematic reviews. This problem can be addressed in part through searching clinical trials registries to identify unpublished studies. This study aims to determine how often systematic reviews published in emergency medicine journals include clinical trials registry searches. We identified all systematic reviews published in the 6 highest-impact emergency medicine journals between January 1 and December 31, 2013. Systematic reviews that assessed the effects of an intervention were further examined to determine whether the authors described searching a clinical trials registry and whether this search identified relevant unpublished studies. Of 191 articles identified through PubMed search, 80 were confirmed to be systematic reviews. Our sample consisted of 41 systematic reviews that assessed a specific intervention. Eight of these 41 (20%) searched a clinical trials registry. For 4 of these 8 reviews, the registry search identified at least 1 relevant unpublished study. Systematic reviews published in emergency medicine journals do not routinely include searches of clinical trials registries. By helping authors identify unpublished trial data, the addition of registry searches may improve the validity of systematic reviews. Copyright © 2014 American College of Emergency Physicians. Published by Elsevier Inc. All rights reserved.
Villafañe, Jorge H; Gobbo, Massimiliano; Peranzoni, Matteo; Naik, Ganesh; Imperio, Grace; Cleland, Joshua A; Negrini, Stefano
2016-09-01
This systematic literature review aimed at examining the validity and applicability in everyday clinical rehabilitation practise of methods for the assessment of back muscle fatiguability in patients with chronic non-specific low back pain (CNSLBP). Extensive research was performed in MEDLINE, Cumulative Index of Nursing and Allied Health Literature (CINAHL), Embase, Physiotherapy Evidence Database (PEDro) and Cochrane Central Register of Controlled Trials (CENTRAL) databases from their inception to September 2014. Potentially relevant articles were also manually looked for in the reference lists of the identified publications. Studies examining lumbar muscle fatigue in people with CNSLBP were selected. Two reviewers independently selected the articles, carried out the study quality assessment and extracted the results. A modified Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) scale was used to evaluate the scientific rigour of the selected works. Twenty-four studies fulfilled the selection criteria and were included in the systematic review. We found conflicting data regarding the validity of methods used to examine back muscle fatigue. The Biering-Sorensen test, performed in conjunction with surface electromyography spectral analysis, turned out to be the most widely used and comparatively, the most optimal modality currently available to assess objective back muscle fatigue in daily clinical practise, even though critical limitations are discussed. Future research should address the identification of an advanced method for lower back fatigue assessment in patients with CNSLBP which, eventually, might provide physical therapists with an objective and reliable test usable in everyday clinical practise. Implications for Rehabilitation Despite its limitations, the Biering-Sorensen test is currently the most used, convenient and easily available fatiguing test for lumbar muscles. To increase validity and reliability of the Biering-Sorensen test, concomitant activation of synergistic muscles should be taken into account. Pooled mean frequency and half-width of the spectrum are currently the most valid electromyographic parameters to assess fatigue in chronic non-specific low back pain. Body mass index, grading of pain and level of disability of the study population should be reported to enhance research quality.
Berkhof, Farida F; Metzemaekers, Leola; Uil, Steven M; Kerstjens, Huib AM; van den Berg, Jan WK
2014-01-01
Background Chronic obstructive pulmonary disease (COPD) and heart failure (HF) are both common diseases that coexist frequently. Patients with both diseases have worse stable state health status when compared with patients with one of these diseases. In many outpatient clinics, health status is monitored routinely in COPD patients using the Clinical COPD Questionnaire (CCQ) and in HF patients with the Minnesota Living with Heart Failure Questionnaire (MLHF-Q). This study validated and compared which questionnaire, ie, the CCQ or the MLHF-Q, is suited best for patients with coexistent COPD and HF. Methods Patients with both COPD and HF and aged ≥40 years were included. Construct validity, internal consistency, test–retest reliability, and agreement were determined. The Short-Form 36 was used as the external criterion. All questionnaires were completed at baseline. The CCQ and MLHF-Q were repeated after 2 weeks, together with a global rating of change. Results Fifty-eight patients were included, of whom 50 completed the study. Construct validity was acceptable. Internal consistency was adequate for CCQ and MLHF-Q total and domain scores, with a Cronbach’s alpha ≥0.70. Reliability was adequate for MLHF-Q and CCQ total and domain scores, and intraclass correlation coefficients were 0.70–0.90, except for the CCQ symptom score (intraclass correlation coefficient 0.42). The standard error of measurement on the group level was smaller than the minimal clinical important difference for both questionnaires. However, the standard error of measurement on the individual level was larger than the minimal clinical important difference. Agreement was acceptable on the group level and limited on the individual level. Conclusion CCQ and MLHF-Q were both valid and reliable questionnaires for assessment of health status in patients with coexistent COPD and HF on the group level, and hence for research. However, in clinical practice, on the individual level, the characteristics of both questionnaires were not as good. There is room for a questionnaire with good evaluative properties on the individual level, preferably tested in a setting of patients with COPD or HF, or both. PMID:25285000
Park, Yoon Soo; Hyderi, Abbas; Heine, Nancy; May, Win; Nevins, Andrew; Lee, Ming; Bordage, Georges; Yudkowsky, Rachel
2017-11-01
To examine validity evidence of local graduation competency examination scores from seven medical schools using shared cases and to provide rater training protocols and guidelines for scoring patient notes (PNs). Between May and August 2016, clinical cases were developed, shared, and administered across seven medical schools (990 students participated). Raters were calibrated using training protocols, and guidelines were developed collaboratively across sites to standardize scoring. Data included scores from standardized patient encounters for history taking, physical examination, and PNs. Descriptive statistics were used to examine scores from the different assessment components. Generalizability studies (G-studies) using variance components were conducted to estimate reliability for composite scores. Validity evidence was collected for response process (rater perception), internal structure (variance components, reliability), relations to other variables (interassessment correlations), and consequences (composite score). Student performance varied by case and task. In the PNs, justification of differential diagnosis was the most discriminating task. G-studies showed that schools accounted for less than 1% of total variance; however, for the PNs, there were differences in scores for varying cases and tasks across schools, indicating a school effect. Composite score reliability was maximized when the PN was weighted between 30% and 40%. Raters preferred using case-specific scoring guidelines with clear point-scoring systems. This multisite study presents validity evidence for PN scores based on scoring rubric and case-specific scoring guidelines that offer rigor and feedback for learners. Variability in PN scores across participating sites may signal different approaches to teaching clinical reasoning among medical schools.
Cacho-Martínez, Pilar; García-Muñoz, Ángel; Ruiz-Cantero, María Teresa
2013-01-01
Purpose To analyze the diagnostic criteria used in the scientific literature published in the past 25 years for accommodative and nonstrabismic binocular dysfunctions and to explore if the epidemiological analysis of diagnostic validity has been used to propose which clinical criteria should be used for diagnostic purposes. Methods We carried out a systematic review of papers on accommodative and non-strabic binocular disorders published from 1986 to 2012 analysing the MEDLINE, CINAHL, PsycINFO and FRANCIS databases. We admitted original articles about diagnosis of these anomalies in any population. We identified 839 articles and 12 studies were included. The quality of included articles was assessed using the QUADAS-2 tool. Results The review shows a wide range of clinical signs and cut-off points between authors. Only 3 studies (regarding accommodative anomalies) assessed diagnostic accuracy of clinical signs. Their results suggest using the accommodative amplitude and monocular accommodative facility for diagnosing accommodative insufficiency and a high positive relative accommodation for accommodative excess. The remaining 9 articles did not analyze diagnostic accuracy, assessing a diagnosis with the criteria the authors considered. We also found differences between studies in the way of considering patients’ symptomatology. 3 studies of 12 analyzed, performed a validation of a symptom survey used for convergence insufficiency. Conclusions Scientific literature reveals differences between authors according to diagnostic criteria for accommodative and nonstrabismic binocular dysfunctions. Diagnostic accuracy studies show that there is only certain evidence for accommodative conditions. For binocular anomalies there is only evidence about a validated questionnaire for convergence insufficiency with no data of diagnostic accuracy. PMID:24646897
Schieffelin, John; Moses, Lina M; Shaffer, Jeffrey; Goba, Augustine; Grant, Donald S
2015-01-01
The current Ebola outbreak in West Africa has affected more people than all previous outbreaks combined. The current diagnostic method of choice, quantitative polymerase chain reaction, requires specialized conditions as well as specially trained technicians. Insufficient testing capacity has extended the time from sample collection to results. These delays have led to further delays in the transfer and treatment to Ebola Treatment Units. A sensitive and specific point-of-care device that could be used reliably in low resource settings by healthcare workers with minimal training would increase the efficiency of triage and appropriate transfer of care. This article describes a study designed to validate the sensitivity and specificity of the ReEBOVTM RDT using venous whole blood and capillary blood obtained via fingerprick. We present the scientific and clinical rationale for the decisions made in the design of a diagnostic validation study to be conducted in an outbreak setting. The multi-site strategy greatly complicated implementation. In addition, a decrease in cases in one geographic area along with a concomitant increase in other areas made site selection challenging. Initiation of clinical trials during rapidly evolving outbreaks requires significant cooperation on a national level between research teams implementing studies and clinical care providers. Coordination and streamlining of approval process is essential if trials are to be implemented in a timely fashion. PMID:26768566
Psychometric evaluation of the HIV symptom distress scale
Marc, Linda G.; Wang, Ming-Mei; Testa, Marcia A.
2012-01-01
The objective of this paper is to psychometrically validate the HIV Symptom Distress Scale (SDS), an instrument that can be used to measure overall HIV symptom distress or clinically relevant groups of HIV symptoms. A secondary data analysis was conducted using the Collaborations in HIV Outcomes Research U.S. Cohort (CHORUS). Inclusion criteria required study participants (N=5,521) to have a valid baseline measure of the AIDS Clinical Trial Group Symptom Distress Module, with an SF-12 or SF-36 completed on the same day. Psychometric testing assessed unidimensionality, internal consistency and factor structure using exploratory and confirmatory factor analysis, and structural equation modeling (SEM). Construct validity examined whether the new measure discriminates across clinical significance (CD4 and HIV viral load). Findings show that the SDS has high reliability (α=0.92), and SEM supports a correlated second-order factor model (physical and mental distress) with acceptable fit (GFI=0.88, AGFI=0.85, NFI=0.99, NNFI=0.99; RMSEA=0.06, [90% CI 0.06 – 0.06]; Satorra Bentler Scaled, C2 =3274.20; p=0.0). Construct validity shows significant differences across categories for HIV-1 viral load (p< 0.001) and CD4 (p< 0.001). Differences in mean SDS scores exist across gender (p< 0.001), race/ethnicity (p< 0.05) and educational attainment (p < 0.001). Hence, the HIV Symptom Distress Scale is a reliable and valid instrument, which measures overall HIV symptoms or clinically relevant groups of symptoms. PMID:22409246
Development and validation of a measure of pediatric oral health-related quality of life: the POQL
Huntington, Noelle L; Spetter, Dante; Jones, Judith A.; Rich, Sharon E.; Garcia, Raul I.; Spiro, Avron
2011-01-01
Objective To develop a brief measure of oral health-related quality of life in children and demonstrate its reliability and validity in a diverse population. Methods We administered the initial 20-item POQL to children (Child Self-Report) and parents (Parent Report on Child) from diverse populations in both school-based and clinic-based settings. Clinical oral health status was measured on a subset of children. We used factor analysis to determine the underlying scales and then reduced the measure to 10 items based on several considerations. Multitrait analysis on the resulting 10-item POQL was used to reaffirm the discrimination of scales and assess the measure’s internal consistency and interscale correlations. We established discriminant and convergent validity with clinical status, perceived oral health and responses on the PedsQL and determined sensitivity to change with children undergoing ECC surgical repair. Results Factor analysis returned a four-scale solution for the initial items – Physical Functioning, Role Functioning, Social Functioning and Emotional Functioning. The reduced items represented the same four scales – two each on Physical and Role and three each on Social and Emotional. Good reliability and validity were shown for the POQL as a whole and for each of the scales. Conclusions The POQL is a valid and reliable measure of oral health-related quality of life for use in pre-school and school-aged children, with high utility for both clinical assessments and large-scale population studies. PMID:21972458
Greenslade, Kathryn J; Coggins, Truman E
2014-01-01
Identifying what a communication partner is looking at (referential intention) and why (social intention) is essential to successful social communication, and may be challenging for children with social communication deficits. This study explores a clinical task that assesses these intention-reading abilities within an authentic context. To gather evidence of the task's reliability and validity, and to discuss its clinical utility. The intention-reading task was administered to twenty 4-7-year-olds with typical development (TD) and ten with autism spectrum disorder (ASD). Task items were embedded in an authentic activity, and they targeted the child's ability to identify the examiner's referential and social intentions, which were communicated through joint attention behaviours. Reliability and construct validity evidence were addressed using established psychometric methods. Reliability and validity evidence supported the use of task scores for identifying children whose intention-reading warranted concern. Evidence supported the reliability of task administration and coding, and item-level codes were highly consistent with overall task performance. Supporting task validity, group differences aligned with predictions, with children with ASD exhibiting poorer and more variable task scores than children with TD. Also, as predicted, task scores correlated significantly with verbal mental age and ratings of parental concerns regarding social communication abilities. The evidence provides preliminary support for the reliability and validity of the clinical task's scores in assessing young children's real-time intention-reading abilities, which are essential for successful interactions in school and beyond. © 2014 Royal College of Speech and Language Therapists.
Development and validation of a measure of pediatric oral health-related quality of life: the POQL.
Huntington, Noelle L; Spetter, Dante; Jones, Judith A; Rich, Sharron E; Garcia, Raul I; Spiro, Avron
2011-01-01
To develop a brief measure of oral health-related quality of life (OHQL) in children and demonstrate its reliability and validity in a diverse population. We administered the initial 20-item Pediatric Oral Health-Related Quality of Life (POQL) to children (Child Self-Report) and parents (Parent Report on Child) from diverse populations in both school-based and clinic-based settings. Clinical oral health status was measured on a subset of children. We used factor analysis to determine the underlying scales and then reduced the measure to 10 items based on several considerations. Multitrait analysis on the resulting 10-item POQL was used to reaffirm the discrimination of scales and assess the measure's internal consistency and interscale correlations. We established discriminant and convergent validity with clinical status, perceived oral health and responses on the PedsQL, and determined sensitivity to change with children undergoing ECC surgical repair. Factor analysis returned a four-scale solution for the initial items--Physical Functioning, Role Functioning, Social Functioning, and Emotional Functioning. The reduced items represented the same four scales--two each on Physical and Role and three each on Social and Emotional. Good reliability and validity were shown for the POQL as a whole and for each of the scales. The POQL is a valid and reliable measure of OHQL for use in preschool and school-aged children, with high utility for both clinical assessments and large-scale population studies.
Validation of the Physician-Pharmacist Collaborative Index for physicians in Malaysia.
Sellappans, Renukha; Ng, Chirk Jenn; Lai, Pauline Siew Mei
2015-12-01
Establishing a collaborative working relationship between doctors and pharmacists is essential for the effective provision of pharmaceutical care. The Physician-Pharmacist Collaborative Index (PPCI) was developed to assess the professional exchanges between doctors and pharmacists. Two versions of the PPCI was developed: one for physicians and one for pharmacists. However, these instruments have not been validated in Malaysia. To determine the validity and reliability of the PPCI for physicians in Malaysia. An urban tertiary hospital in Malaysia. This prospective study was conducted from June to August 2014. Doctors were grouped as either a "collaborator" or a "non-collaborator". Collaborators were doctors who regularly worked with one particular clinical pharmacist in their ward, while non-collaborators were doctors who interacted with any random pharmacist who answered the general pharmacy telephone line whenever they required assistance on medication-related enquiries, as they did not have a clinical pharmacist in their ward. Collaborators were firstly identified by the clinical pharmacist he/she worked with, then invited to participate in this study through email, as it was difficult to locate and approach them personally. Non-collaborators were sampled conveniently by approaching them in person as these doctors could be easily sampled from any wards without a clinical pharmacist. The PPCI for physicians was administered at baseline and 2 weeks later. Validity (face validity, factor analysis and discriminative validity) and reliability (internal consistency and test-retest) of the PPCI for physicians. A total of 116 doctors (18 collaborators and 98 non-collaborators) were recruited. Confirmatory factor analysis confirmed that the PPCI for physicians was a 3-factor model. The correlation of the mean domain scores ranged from 0.711 to 0.787. "Collaborators" had significantly higher scores compared to "non-collaborators" (81.4 ± 10.1 vs. 69.3 ± 12.1, p < 0.001). The Cronbach alpha for the overall PPCI for physicians was 0.949, while the Cronbach alpha values for the individual domains ranged from 0.877 to 0.926. Kappa values at test-retest ranged from 0.553 to 0.752. The PPCI for physicians was a valid and reliable measure in determining doctors' views about collaborative working relationship with pharmacists, in Malaysia.
Impact of hormone therapy on quality of life after menopause.
Utian, Wulf H; Woods, Nancy Fugate
2013-10-01
Given the complexity of the literature on quality of life (QOL) and hormone therapy (HT) among women in the menopausal transition and postmenopause, the purposes of this integrative review were to (1) define QOL as a multidimensional construct; (2) review validated instruments for measurement of QOL; (3) review results of HT and QOL clinical trials that have used validated instruments; and (4) assess the effectiveness of HT on QOL, including health-related QOL (HRQOL), menopause-specific QOL (MSQOL), and global QOL (GQOL). The literature on HT and QOL was searched for definitions of QOL and validated instruments for measuring QOL, and the results were summarized. The purposes of this integrative review were to evaluate the effects of HT on HRQOL, differentiating the effects of HT on GQOL, HRQOL, and MSQOL. As a basis for this review, we searched for published controlled clinical trials in which the effects of HT on QOL were studied using validated QOL instruments, in particular menopause-specific validated instruments. Clear definitions are elucidated. Validated instruments for the measurements of HRQOL, GQOL, and MSQOL are summarized, and the necessity of their incorporation into future research and clinical practice is emphasized. The published effects on QOL of estrogens and progestogens administered to symptomatic and nonsymptomatic women in the menopausal transition and beyond are reviewed. The impact of various health state-related symptoms on HRQOL and GQOL is now an integral component of contemporary health care. Effects of HT include GQOL and HRQOL and should be menopause-specific. There is clearly a need for further studies on menopause and menopause-related therapies using appropriate and validated instruments. Literature review shows that HT provides a significant benefit for MSQOL in midlife women, mainly through relief of symptoms, but treatment also may result in a global increase in sense of well-being (GQOL). HRQOL benefits are contingent on symptom status, as are MSQOL outcomes. Women who are severely symptomatic experience a significant improvement in HRQOL and MSQOL, although this improvement is not significant among women without severe symptoms at baseline measures in clinical trials.
Stevens, Adam; Murray, Philip; Wojcik, Jerome; Raelson, John; Koledova, Ekaterina; Chatelain, Pierre
2016-01-01
Objective Single-nucleotide polymorphisms (SNPs) associated with the response to recombinant human growth hormone (r-hGH) have previously been identified in growth hormone deficiency (GHD) and Turner syndrome (TS) children in the PREDICT long-term follow-up (LTFU) study (Nbib699855). Here, we describe the PREDICT validation (VAL) study (Nbib1419249), which aimed to confirm these genetic associations. Design and methods Children with GHD (n = 293) or TS (n = 132) were recruited retrospectively from 29 sites in nine countries. All children had completed 1 year of r-hGH therapy. 48 SNPs previously identified as associated with first year growth response to r-hGH were genotyped. Regression analysis was used to assess the association between genotype and growth response using clinical/auxological variables as covariates. Further analysis was undertaken using random forest classification. Results The children were younger, and the growth response was higher in VAL study. Direct genotype analysis did not replicate what was found in the LTFU study. However, using exploratory regression models with covariates, a consistent relationship with growth response in both VAL and LTFU was shown for four genes – SOS1 and INPPL1 in GHD and ESR1 and PTPN1 in TS. The random forest analysis demonstrated that only clinical covariates were important in the prediction of growth response in mild GHD (>4 to <10 μg/L on GH stimulation test), however, in severe GHD (≤4 μg/L) several SNPs contributed (in IGF2, GRB10, FOS, IGFBP3 and GHRHR). Conclusions The PREDICT validation study supports, in an independent cohort, the association of four of 48 genetic markers with growth response to r-hGH treatment in both pre-pubertal GHD and TS children after controlling for clinical/auxological covariates. However, the contribution of these SNPs in a prediction model of first-year response is not sufficient for routine clinical use. PMID:27651465
Stevens, Adam; Murray, Philip; Wojcik, Jerome; Raelson, John; Koledova, Ekaterina; Chatelain, Pierre; Clayton, Peter
2016-12-01
Single-nucleotide polymorphisms (SNPs) associated with the response to recombinant human growth hormone (r-hGH) have previously been identified in growth hormone deficiency (GHD) and Turner syndrome (TS) children in the PREDICT long-term follow-up (LTFU) study (Nbib699855). Here, we describe the PREDICT validation (VAL) study (Nbib1419249), which aimed to confirm these genetic associations. Children with GHD (n = 293) or TS (n = 132) were recruited retrospectively from 29 sites in nine countries. All children had completed 1 year of r-hGH therapy. 48 SNPs previously identified as associated with first year growth response to r-hGH were genotyped. Regression analysis was used to assess the association between genotype and growth response using clinical/auxological variables as covariates. Further analysis was undertaken using random forest classification. The children were younger, and the growth response was higher in VAL study. Direct genotype analysis did not replicate what was found in the LTFU study. However, using exploratory regression models with covariates, a consistent relationship with growth response in both VAL and LTFU was shown for four genes - SOS1 and INPPL1 in GHD and ESR1 and PTPN1 in TS. The random forest analysis demonstrated that only clinical covariates were important in the prediction of growth response in mild GHD (>4 to <10 μg/L on GH stimulation test), however, in severe GHD (≤4 μg/L) several SNPs contributed (in IGF2, GRB10, FOS, IGFBP3 and GHRHR). The PREDICT validation study supports, in an independent cohort, the association of four of 48 genetic markers with growth response to r-hGH treatment in both pre-pubertal GHD and TS children after controlling for clinical/auxological covariates. However, the contribution of these SNPs in a prediction model of first-year response is not sufficient for routine clinical use. © 2016 European Society of Endocrinology.
Surrogate endpoints in randomized cardiovascular clinical trials.
Domanski, Michael; Pocock, Stuart; Bernaud, Corine; Borer, Jeffrey; Geller, Nancy; Revkin, James; Zannad, Faiez
2011-08-01
Surrogate endpoints predict the occurrence and timing of a clinical endpoint of interest (CEI). Substitution of a surrogate endpoint for a CEI can dramatically reduce the time and cost necessary to complete a Phase III clinical trial. However, assurance that use of a surrogate endpoint will result in a correct conclusion regarding treatment effect on a CEI requires prior rigorous validation of the surrogate. Surrogate endpoints can also be of substantial use in Phase I and II studies to assess whether the intended therapeutic pathway is operative, thus providing assurance regarding the reasonableness of proceeding to a Phase III trial. This paper discusses the uses and validation of surrogate endpoints. © 2010 The Authors Fundamental and Clinical Pharmacology © 2010 Société Française de Pharmacologie et de Thérapeutique.
Current advances in esophageal cancer proteomics.
Uemura, Norihisa; Kondo, Tadashi
2015-06-01
We review the current status of proteomics for esophageal cancer (EC) from a clinician's viewpoint. The ultimate goal of cancer proteomics is the improvement of clinical outcome. The proteome as a functional translation of the genome is a straightforward representation of genomic mechanisms that trigger carcinogenesis. Cancer proteomics has identified the mechanisms of carcinogenesis and tumor progression, detected biomarker candidates for early diagnosis, and provided novel therapeutic targets for personalized treatments. Our review focuses on three major topics in EC proteomics: diagnostics, treatment, and molecular mechanisms. We discuss the major histological differences between EC types, i.e., esophageal squamous cell carcinoma and adenocarcinoma, and evaluate the clinical significance of published proteomics studies, including promising diagnostic biomarkers and novel therapeutic targets, which should be further validated prior to launching clinical trials. Multi-disciplinary collaborations between basic scientists, clinicians, and pathologists should be established for inter-institutional validation. In conclusion, EC proteomics has provided significant results, which after thorough validation, should lead to the development of novel clinical tools and improvement of the clinical outcome for esophageal cancer patients. This article is part of a Special Issue entitled: Medical Proteomics. Copyright © 2014 Elsevier B.V. All rights reserved.
Stalmeijer, Renée E; Dolmans, Diana H J M; Wolfhagen, Ineke H A P; Muijtjens, Arno M M; Scherpbier, Albert J J A
2008-01-01
Research indicates that the quality of supervision strongly influences the learning of medical students in clinical practice. Clinical teachers need feedback to improve their supervisory skills. The available instruments either lack a clear theoretical framework or are not suitable for providing feedback to individual teachers. We developed an evaluation instrument based on the 'cognitive apprenticeship model'. The aim was to estimate the content validity of the developed instrument. Item relevance was rated on a five-point scale (1 = highly irrelevant, 5 = highly relevant) by three groups of stakeholders in undergraduate clinical teaching: educationalists (N = 12), doctors (N = 16) and students (N = 12). Additionally, stakeholders commented on content, wording and omission of items. The items were generally rated as very relevant (Mean = 4.3, SD = 0.38, response = 95%) and any differences between the stakeholder groups were small. The results led to elimination of 4 items, rewording of 13 items and addition of 1 item. The cognitive apprenticeship model appears to offer a useful framework for the development of an evaluation instrument aimed at providing feedback to individual clinical teachers on the quality of student supervision. Further studies in larger populations will have to establish the instrument's statistical validity and generalizability.
Aguilar-Navarro, Sara Gloria; Fuentes-Cantú, Alejandro; Avila-Funes, José Alberto; García-Mayo, Emilio José
2007-01-01
To assess the validity and reliability of a geriatric depression questionnaire used in the Mexican Health and Age Study (MHAS). The study was conducted at the Instituto Nacional de Ciencias Médicas y Nutrición Salvador Zubirán (INCMNSZ) clinic from May 2005 to March 2006. This depression screening nine-item questionnaire was validated using the Diagnostic and Statistical Manual of Mental Disorders (DSM-IV-TR) (fourth revised version) and Yesavage's 15-item Geriatric Depression Scale (GDS-15) criteria. The instrument belongs to the MHAS, a prospective panel study of health and aging in Mexico. A total of 199 subjects 65 years of age and older participated in the validation process (median age= 79.5 years). MHAS questionnaire result was significantly correlated to the clinical depression diagnosis (p<0.001) and to the GDS-15 score (p<0.001). Internal consistency was adequate (alpha coefficient: 0.74). The cutoff point > or = 5/9 points yielded an 80.7% and 68.7% sensitivity and specificity respectively. The fidelity for the test retest was excellent (intra-class correlation coefficient= 0.933). Finally, the Bland and Altman agreement points indicated a difference 0.22 percent points between test retest. The MHAS questionnaire is valid and trustworthy, and allows screening in the research field for the presence of depression in the elderly.
Oncology biomarkers: discovery, validation, and clinical use.
Heckman-Stoddard, Brandy M
2012-05-01
To discuss the discovery, validation, and clinical use of multiple types of biomarkers. Medical literature and published guidelines. Formal validation of biomarkers should include both retrospective analyses of well-characterized samples as well as a prospective clinical trial in which the biomarker is tested for its ability to predict the presence of disease or the efficacy of a cancer therapy. Biomarker development is complicated, with very few biomarker discoveries leading to clinically useful tests. Nurses should understand how a biomarker was developed, including the sensitivity and specificity before applying new biomarkers in the clinical setting. Copyright © 2012. Published by Elsevier Inc.
Kim, Eun-Mi; Kim, Sun-Aee; Lee, Ju-Ry; Burlison, Jonathan D; Oh, Eui Geum
2018-02-13
"Second victims" are defined as healthcare professionals whose wellness is influenced by adverse clinical events. The Second Victim Experience and Support Tool (SVEST) was used to measure the second-victim experience and quality of support resources. Although the reliability and validity of the original SVEST have been validated, those for the Korean tool have not been validated. The aim of the study was to evaluate the psychometric properties of the Korean version of the SVEST. The study included 305 clinical nurses as participants. The SVEST was translated into Korean via back translation. Content validity was assessed by seven experts, and test-retest reliability was evaluated by 30 clinicians. Internal consistency and construct validity were assessed via confirmatory factor analysis. The analyses were performed using SPSS 23.0 and STATA 13.0 software. The content validity index value demonstrated validity; item- and scale-level content validity index values were both 0.95. Test-retest reliability and internal consistency reliability were satisfactory: the intraclass consistent coefficient was 0.71, and Cronbach α values ranged from 0.59 to 0.87. The CFA showed a significantly good fit for an eight-factor structure (χ = 578.21, df = 303, comparative fit index = 0.92, Tucker-Lewis index = 0.90, root mean square error of approximation = 0.05). The K-SVEST demonstrated good psychometric properties and adequate validity and reliability. The results showed that the Korean version of SVEST demonstrated the extent of second victimhood and support resources in Korean healthcare workers and could aid in the development of support programs and evaluation of their effectiveness.
ERIC Educational Resources Information Center
Young, Meredith E.; Cruess, Sylvia R.; Cruess, Richard L.; Steinert, Yvonne
2014-01-01
Physicians function as clinicians, teachers, and role models within the clinical environment. Negative learning environments have been shown to be due to many factors, including the presence of unprofessional behaviors among clinical teachers. Reliable and valid assessments of clinical teacher performance, including professional behaviors, may…
Simonds, Elise C; Handel, Richard W; Archer, Robert P
2008-03-01
This study evaluated the incremental validity of scores from the Minnesota Multiphasic Personality Inventory-2 (MMPI-2) and the Symptom Checklist-90-Revised (SCL-90-R) in a sample of mental health inpatients originally published by Archer, Griffin, and Aiduk (1995). The incremental validity of scores from the SCL-90-R primary symptom dimensions and MMPI-2 Clinical, Content, and Restructured Clinical scales was assessed in a sample of 544 mental health inpatients using conceptually related items from the Brief Psychiatric Rating Scale (BPRS) as criteria. A series of hierarchical multiple regressions indicated that scores from the SCL-90-R primary symptom dimensions exhibited limited incremental validity (Mdn DeltaR(2) = .01, range = 0-.01), whereas scores from MMPI-2 scales contributed additional information in the prediction of ratings on all but one BPRS item (Mdn DeltaR( 2) = .08, range = .04-.12).
Construction and validation of clinical contents for development of learning objects.
Hortense, Flávia Tatiana Pedrolo; Bergerot, Cristiane Decat; Domenico, Edvane Birelo Lopes de
2018-01-01
to describe the process of construction and validation of clinical contents for health learning objects, aimed at patients in the treatment of head and neck cancer. descriptive, methodological study. The development of the script and the storyboard were based on scientific evidence and submitted to the appreciation of specialists for validation of content. The agreement index was checked quantitatively and the suggestions were qualitatively evaluated. The items described in the roadmap were approved by 99% of expert experts. The suggestions for adjustments were inserted in their entirety in the final version. The free-marginal kappa statistical test, for multiple evaluators, presented value equal to 0.68%, granting a substantial agreement. The steps taken in the construction and validation of the content for the production of educational material for patients with head and neck cancer were adequate, relevant and suitable for use in other subjects.
aMAP is a validated pipeline for registration and segmentation of high-resolution mouse brain data
Niedworok, Christian J.; Brown, Alexander P. Y.; Jorge Cardoso, M.; Osten, Pavel; Ourselin, Sebastien; Modat, Marc; Margrie, Troy W.
2016-01-01
The validation of automated image registration and segmentation is crucial for accurate and reliable mapping of brain connectivity and function in three-dimensional (3D) data sets. While validation standards are necessarily high and routinely met in the clinical arena, they have to date been lacking for high-resolution microscopy data sets obtained from the rodent brain. Here we present a tool for optimized automated mouse atlas propagation (aMAP) based on clinical registration software (NiftyReg) for anatomical segmentation of high-resolution 3D fluorescence images of the adult mouse brain. We empirically evaluate aMAP as a method for registration and subsequent segmentation by validating it against the performance of expert human raters. This study therefore establishes a benchmark standard for mapping the molecular function and cellular connectivity of the rodent brain. PMID:27384127
Inter-rater agreement of comorbid DSM-IV personality disorders in substance abusers.
Hesse, Morten; Thylstrup, Birgitte
2008-05-17
Little is known about the inter-rater agreement of personality disorders in clinical settings. Clinicians rated 75 patients with substance use disorders on the DSM-IV criteria of personality disorders in random order, and on rating scales representing the severity of each. Convergent validity agreement was moderate (range for r = 0.55, 0.67) for cluster B disorders rated with DSM-IV criteria, and discriminant validity was moderate for eight of the ten personality disorders. Convergent validity of the rating scales was only moderate for antisocial and narcissistic personality disorder. Dimensional ratings may be used in research studies and clinical practice with some caution, and may be collected as one of several sources of information to describe the personality of a patient.
Topping, Alice; Kappel, Franz; Thijssen, Stephan; Kotanko, Peter
2018-01-01
In silico approaches have been proposed as a novel strategy to increase the repertoire of clinical trial designs. Realistic simulations of clinical trials can provide valuable information regarding safety and limitations of treatment protocols and have been shown to assist in the cost‐effective planning of clinical studies. In this report, we present a blueprint for the stepwise integration of internal, external, and ecological validity considerations in virtual clinical trials (VCTs). We exemplify this approach in the context of a model‐based in silico clinical trial aimed at anemia treatment in patients undergoing hemodialysis (HD). Hemoglobin levels and subsequent anemia treatment were simulated on a per patient level over the course of a year and compared to real‐life clinical data of 79,426 patients undergoing HD. The novel strategies presented here, aimed to improve external and ecological validity of a VCT, significantly increased the predictive power of the discussed in silico trial. PMID:29368434
Assessing Psychodynamic Conflict.
Simmonds, Joshua; Constantinides, Prometheas; Perry, J Christopher; Drapeau, Martin; Sheptycki, Amanda R
2015-09-01
Psychodynamic psychotherapies suggest that symptomatic relief is provided, in part, with the resolution of psychic conflicts. Clinical researchers have used innovative methods to investigate such phenomenon. This article aims to review the literature on quantitative psychodynamic conflict rating scales. An electronic search of the literature was conducted to retrieve quantitative observer-rated scales used to assess conflict noting each measure's theoretical model, information source, and training and clinical experience required. Scales were also examined for levels of reliability and validity. Five quantitative observer-rated conflict scales were identified. Reliability varied from poor to excellent with each measure demonstrating good validity. However a small number of studies and limited links to current conflict theory suggest further clinical research is needed.
"Bed Side" Human Milk Analysis in the Neonatal Intensive Care Unit: A Systematic Review.
Fusch, Gerhard; Kwan, Celia; Kotrri, Gynter; Fusch, Christoph
2017-03-01
Human milk analyzers can measure macronutrient content in native breast milk to tailor adequate supplementation with fortifiers. This article reviews all studies using milk analyzers, including (i) evaluation of devices, (ii) the impact of different conditions on the macronutrient analysis of human milk, and (iii) clinical trials to improve growth. Results lack consistency, potentially due to systematic errors in the validation of the device, or pre-analytical sample preparation errors like homogenization. It is crucial to introduce good laboratory and clinical practice when using these devices; otherwise a non-validated clinical usage can severely affect growth outcomes of infants. Copyright © 2016 Elsevier Inc. All rights reserved.
Iranian Effective Clinical Nurse Instructor evaluation tool: Development and psychometric testing
Shahsavari, Hooman; Yekta, Zohreh Parsa; Zare, Zahra; Sigaroodi, Abdolhossain Emami
2014-01-01
Background: Clinical education is the heart of the nursing education program. Effective nursing clinical instructors are needed for graduating the future qualified nurses. There is a well-developed body of knowledge about the effectiveness of clinical teaching and the instructors. However, translating this knowledge into a context-based evaluation tool for measuring the effectiveness of Iranian clinical nursing instructors remains a deficiency. The purpose of this study is to describe the development and psychometric testing process of an instrument to evaluate the characteristics of Iranian effective clinical nurse instructor. Materials and Methods: Following a precise review of Iranian literatures and expert consultation, 83 statements about the characteristics that make clinical nurse instructors effective were extracted. In the next phase, the psychometric properties of the instrument were established by looking at the content validity, face validity, and internal consistency. Content validity of the instrument was assessed based on the comments of an expert panel including 10 nursing faculty members. During this phase, 30 items of the instrument were omitted or merged. Face validity of the instrument was assured based on the advices of 10 nursing students and 10 nursing faculty members. Finally, in the pilot test, the data of 168 filled questionnaires were gathered and analyzed by an exploratory factor analysis to reduce the items and identify the factor structure of the instrument. Results: Through subsequent analyses, of the 83 items, 31 items were merged or omitted. At last, 52 retained items were divided into four subscales including student-centric behaviors, clinical performances, planning ability, and personality traits. The Cronbach's alpha level of the inventory was 0.96, with the value for each domain ranging from 0.87 to 0.94. Conclusions: Iranian Effective Clinical Nurse Instructor evaluation tool has acceptable psychometric properties and can be used in evaluating the effectiveness of clinical nursing instructors. PMID:24834081
Rater reliability and construct validity of a mobile application for posture analysis
Szucs, Kimberly A.; Brown, Elena V. Donoso
2018-01-01
[Purpose] Measurement of posture is important for those with a clinical diagnosis as well as researchers aiming to understand the impact of faulty postures on the development of musculoskeletal disorders. A reliable, cost-effective and low tech posture measure may be beneficial for research and clinical applications. The purpose of this study was to determine rater reliability and construct validity of a posture screening mobile application in healthy young adults. [Subjects and Methods] Pictures of subjects were taken in three standing positions. Two raters independently digitized the static standing posture image twice. The app calculated posture variables, including sagittal and coronal plane translations and angulations. Intra- and inter-rater reliability were calculated using the appropriate ICC models for complete agreement. Construct validity was determined through comparison of known groups using repeated measures ANOVA. [Results] Intra-rater reliability ranged from 0.71 to 0.99. Inter-rater reliability was good to excellent for all translations. ICCs were stronger for translations versus angulations. The construct validity analysis found that the app was able to detect the change in the four variables selected. [Conclusion] The posture mobile application has demonstrated strong rater reliability and preliminary evidence of construct validity. This application may have utility in clinical and research settings. PMID:29410561
Kawata, Ariane K; Wilson, Hilary; Ong, Siew Hwa; Kulich, Karoly; Coyne, Karin
2016-10-01
The aim of this study was to evaluate the factor structure and psychometric characteristics of the Hypoglycemia Perspectives Questionnaire (HPQ) assessing experience and perceptions of hypoglycemia in patients with type 2 diabetes mellitus (T2DM). HPQ was administered to adults with T2DM in a clinical sample from Cyprus (HYPO-Cyprus, n = 500) and a community sample in the United States (US, n = 1257) from the 2011 US National Health and Wellness Survey. Demographic and clinical data were collected. Analysis of HPQ data from two convenience samples examined item performance, factor structure, and HPQ measurement properties (reliability, convergent validity, known-groups validity). Analyses supported three HPQ domains: symptom concern (six items), compensatory behavior (five items), and worry (five items). Internal consistency was high for all three domains (all ≥0.75), supporting reliability. Convergent validity was supported by moderate Spearman correlations between HPQ domain scores and the Audit of Diabetes-Dependent Quality of Life (ADDQoL-19) total score. Patients with recent hypoglycemia events had significantly higher HPQ scores, supporting known-group validity. HPQ may be a valid and reliable measure capturing the experience and impact of hypoglycemia and useful in clinical trials and community-based settings.
Rater reliability and construct validity of a mobile application for posture analysis.
Szucs, Kimberly A; Brown, Elena V Donoso
2018-01-01
[Purpose] Measurement of posture is important for those with a clinical diagnosis as well as researchers aiming to understand the impact of faulty postures on the development of musculoskeletal disorders. A reliable, cost-effective and low tech posture measure may be beneficial for research and clinical applications. The purpose of this study was to determine rater reliability and construct validity of a posture screening mobile application in healthy young adults. [Subjects and Methods] Pictures of subjects were taken in three standing positions. Two raters independently digitized the static standing posture image twice. The app calculated posture variables, including sagittal and coronal plane translations and angulations. Intra- and inter-rater reliability were calculated using the appropriate ICC models for complete agreement. Construct validity was determined through comparison of known groups using repeated measures ANOVA. [Results] Intra-rater reliability ranged from 0.71 to 0.99. Inter-rater reliability was good to excellent for all translations. ICCs were stronger for translations versus angulations. The construct validity analysis found that the app was able to detect the change in the four variables selected. [Conclusion] The posture mobile application has demonstrated strong rater reliability and preliminary evidence of construct validity. This application may have utility in clinical and research settings.
The druggable genome and support for target identification and validation in drug development.
Finan, Chris; Gaulton, Anna; Kruger, Felix A; Lumbers, R Thomas; Shah, Tina; Engmann, Jorgen; Galver, Luana; Kelley, Ryan; Karlsson, Anneli; Santos, Rita; Overington, John P; Hingorani, Aroon D; Casas, Juan P
2017-03-29
Target identification (determining the correct drug targets for a disease) and target validation (demonstrating an effect of target perturbation on disease biomarkers and disease end points) are important steps in drug development. Clinically relevant associations of variants in genes encoding drug targets model the effect of modifying the same targets pharmacologically. To delineate drug development (including repurposing) opportunities arising from this paradigm, we connected complex disease- and biomarker-associated loci from genome-wide association studies to an updated set of genes encoding druggable human proteins, to agents with bioactivity against these targets, and, where there were licensed drugs, to clinical indications. We used this set of genes to inform the design of a new genotyping array, which will enable association studies of druggable genes for drug target selection and validation in human disease. Copyright © 2017, American Association for the Advancement of Science.
Žvanut, Boštjan; Lovrić, Robert; Kolnik, Tamara Štemberger; Šavle, Majda; Pucer, Patrik
2018-05-01
Nursing clinical learning environments are particularly important for the achievement of good practice in clinical training of student nurses, and thus, for the nursing competence development. Hence, it is important to have an instrument consisting of reliable and valid criteria for assessing the clinical learning environment, applicable in different contexts, and translated in the respondents mother tongue. The goal of the present research was to test the reliability and validity of the Slovenian version of the "Clinical Learning Environment, Supervision and Nurse Teacher evaluation scale", and to compare it with the Croatian version. The data was collected between 10 March and 10 June 2015 at four Slovenian institutions, where nursing BSc study programmes are performed. The final sample consisted of 232 students (response rate 68.8%): 81.9% were females and 18.1% males, average age was 23. The translated instrument in Slovenian language resulted as reliable and valid, it reflects the expected five factors of the original version despite some minor problems in the factor structure and in test-retest. The most important difference between the Slovenian and Croatian version is in the factor structure regarding the implementation of roles in clinical learning environment. Copyright © 2018 Elsevier Ltd. All rights reserved.