O'Neil, Margaret E; Fragala-Pinkham, Maria; Lennon, Nancy; George, Ameeka; Forman, Jeffrey; Trost, Stewart G
2016-01-01
Physical therapy for youth with cerebral palsy (CP) who are ambulatory includes interventions to increase functional mobility and participation in physical activity (PA). Thus, reliable and valid measures are needed to document PA in youth with CP. The purpose of this study was to evaluate the inter-instrument reliability and concurrent validity of 3 accelerometer-based motion sensors with indirect calorimetry as the criterion for measuring PA intensity in youth with CP. Fifty-seven youth with CP (mean age=12.5 years, SD=3.3; 51% female; 49.1% with spastic hemiplegia) participated. Inclusion criteria were: aged 6 to 20 years, ambulatory, Gross Motor Function Classification System (GMFCS) levels I through III, able to follow directions, and able to complete the full PA protocol. Protocol activities included standardized activity trials with increasing PA intensity (resting, writing, household chores, active video games, and walking at 3 self-selected speeds), as measured by weight-relative oxygen uptake (in mL/kg/min). During each trial, participants wore bilateral accelerometers on the upper arms, waist/hip, and ankle and a portable indirect calorimeter. Intraclass coefficient correlations (ICCs) were calculated to evaluate inter-instrument reliability (left-to-right accelerometer placement). Spearman correlations were used to examine concurrent validity between accelerometer output (activity and step counts) and indirect calorimetry. Friedman analyses of variance with post hoc pair-wise analyses were conducted to examine the validity of accelerometers to discriminate PA intensity across activity trials. All accelerometers exhibited excellent inter-instrument reliability (ICC=.94-.99) and good concurrent validity (rho=.70-.85). All accelerometers discriminated PA intensity across most activity trials. This PA protocol consisted of controlled activity trials. Accelerometers provide valid and reliable measures of PA intensity among youth with CP. © 2016 American Physical Therapy Association.
A Validation Study of Early Adolescents' Pubertal Self-Assessments
ERIC Educational Resources Information Center
Schmitz, Katharine E.; Hovell, Melbourne F.; Nichols, Jeanne F.; Irvin, Veronica L.; Keating, Kristen; Simon, Gayle M.; Gehrman, Christine; Jones, Kenneth Lee
2004-01-01
This study aimed to determine whether self-assessed puberty is sufficiently reliable and valid to substitute for physician examination when feasibility of physician examination is low (e.g., behavioral research). Adolescents (convenience sample N = 178 endocrinology patients and N = 125 from educational trial; mean age 12.7 and 11.3 years,…
McAuliff, Bradley D; Kovera, Margaret Bull; Nunez, Gabriel
2009-06-01
This study examined the ability of jury-eligible community members (N = 248) to detect internal validity threats in psychological science presented during a trial. Participants read a case summary in which an expert testified about a study that varied in internal validity (valid, missing control group, confound, and experimenter bias) and ecological validity (high, low). Ratings of expert evidence quality and expert credibility were higher for the valid versus missing control group versions only. Internal validity did not influence verdict or ratings of plaintiff credibility and no differences emerged as a function of ecological validity. Expert evidence quality, expert credibility, and plaintiff credibility were positively correlated with verdict. Implications for the scientific reasoning literature and for trials containing psychological science are discussed.
First-in-human Phase 1 CRISPR Gene Editing Cancer Trials: Are We Ready?
Baylis, Francoise; McLeod, Marcus
2017-01-01
A prospective first-in-human Phase 1 CRISPR gene editing trial in the United States for patients with melanoma, synovial sarcoma, and multiple myeloma offers hope that gene editing tools may usefully treat human disease. An overarching ethical challenge with first-in-human Phase 1 clinical trials, however, is knowing when it is ethically acceptable to initiate such trials on the basis of safety and efficacy data obtained from pre-clinical studies. If the pre-clinical studies that inform trial design are themselves poorly designed - as a result of which the quality of pre-clinical evidence is deficient - then the ethical requirement of scientific validity for clinical research may not be satisfied. In turn, this could mean that the Phase 1 clinical trial will be unsafe and that trial participants will be exposed to risk for no potential benefit. To assist sponsors, researchers, clinical investigators and reviewers in deciding when it is ethically acceptable to initiate first-in-human Phase 1 CRISPR gene editing clinical trials, structured processes have been developed to assess and minimize translational distance between pre-clinical and clinical research. These processes draw attention to various features of internal validity, construct validity, and external validity. As well, the credibility of supporting evidence is to be critically assessed with particular attention to optimism bias, financial conflicts of interest and publication bias. We critically examine the pre-clinical evidence used to justify the first-inhuman Phase 1 CRISPR gene editing cancer trial in the United States using these tools. We conclude that the proposed trial cannot satisfy the ethical requirement of scientific validity because the supporting pre-clinical evidence used to inform trial design is deficient. Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.org.
Enhancing the Validity of a Quality of Life Measure for Autistic People
ERIC Educational Resources Information Center
McConachie, Helen; Mason, David; Parr, Jeremy R.; Garland, Deborah; Wilson, Colin; Rodgers, Jacqui
2018-01-01
Accurate measurement of quality of life (QoL) is important for evaluation of autism services and trials of interventions. We undertook psychometric validation of the World Health Organisation measure--WHOQoL-BREF, examined construct validity of the WHO Disabilities module and developed nine additional autism-specific items (ASQoL) from extensive…
Goldberg, Melissa C; Mostow, Allison J; Vecera, Shaun P; Larson, Jennifer C Gidley; Mostofsky, Stewart H; Mahone, E Mark; Denckla, Martha B
2008-09-01
We examined the ability to use static line drawings of eye gaze cues to orient visual-spatial attention in children with high functioning autism (HFA) compared to typically developing children (TD). The task was organized such that on valid trials, gaze cues were directed toward the same spatial location as the appearance of an upcoming target, while on invalid trials gaze cues were directed to an opposite location. Unlike TD children, children with HFA showed no advantage in reaction time (RT) on valid trials compared to invalid trials (i.e., no significant validity effect). The two stimulus onset asynchronies (200 ms, 700 ms) did not differentially affect these findings. The results suggest that children with HFA show impairments in utilizing static line drawings of gaze cues to orient visual-spatial attention.
Smith, Shannon M; Paillard, Florence; McKeown, Andrew; Burke, Laurie B; Edwards, Robert R; Katz, Nathaniel P; Papadopoulos, Elektra J; Rappaport, Bob A; Slagle, Ashley; Strain, Eric C; Wasan, Ajay D; Turk, Dennis C; Dworkin, Robert H
2015-05-01
Measurement of inappropriate medication use events (eg, abuse or misuse) in clinical trials is important in characterizing a medication's abuse potential. However, no gold standard assessment of inappropriate use events in clinical trials has been identified. In this systematic review, we examine the measurement properties (ie, content validity, cross-sectional reliability and construct validity, longitudinal construct validity, ability to detect change, and responder definitions) of instruments assessing inappropriate use of opioid and nonopioid prescription medications to identify any that meet U.S. and European regulatory agencies' rigorous standards for outcome measures in clinical trials. Sixteen published instruments were identified, most of which were not designed for the selected concept of interest and context of use. For this reason, many instruments were found to lack adequate content validity (or documentation of content validity) to evaluate current inappropriate medication use events; for example, evaluating inappropriate use across the life span rather than current use, including items that did not directly assess inappropriate use (eg, questions about anger), or failing to capture information pertinent to inappropriate use events (eg, intention and route of administration). In addition, the psychometric data across all instruments were generally limited in scope. A further limitation is the heterogeneous, nonstandardized use of inappropriate medication use terminology. These observations suggest that available instruments are not well suited for assessing current inappropriate medication use within the specific context of clinical trials. Further effort is needed to develop reliable and valid instruments to measure current inappropriate medication use events in clinical trials. This systematic review evaluates the measurement properties of inappropriate medication use (eg, abuse or misuse) instruments to determine whether any meet regulatory standards for clinical trial outcome measures to assess abuse potential. Copyright © 2015 American Pain Society. All rights reserved.
Smith, Shannon M.; Paillard, Florence; McKeown, Andrew; Burke, Laurie B.; Edwards, Robert R.; Katz, Nathaniel P.; Papadopoulos, Elektra J.; Rappaport, Bob A.; Slagle, Ashley; Strain, Eric C.; Wasan, Ajay D.; Turk, Dennis C.; Dworkin, Robert H.
2017-01-01
Measurement of inappropriate medication use events (e.g., abuse, misuse) in clinical trials is important in characterizing a medication’s abuse potential. However, no “gold standard” assessment of inappropriate use events in clinical trials has been identified. In this systematic review, we examine the measurement properties (i.e., content validity, cross-sectional reliability and construct validity, longitudinal construct validity, ability to detect change, and responder definitions) of instruments assessing inappropriate use of opioid and non-opioid prescription medications to identify any that meet U.S. and European regulatory agencies’ rigorous standards for outcome measures in clinical trials. Sixteen published instruments were identified, most of which were not designed for the selected concept of interest and context of use. For this reason, many instruments were found to lack adequate content validity (or documentation of content validity) to evaluate current inappropriate medication use events; for example, evaluating inappropriate use across the lifespan rather than current use, including items that did not directly assess inappropriate use (e.g., questions about anger), or failing to capture information pertinent to inappropriate use events (e.g., intention, route of administration). In addition, the psychometric data across all instruments were generally limited in scope. A further limitation is the heterogeneous, non-standardized use of inappropriate medication use terminology. These observations suggest that available instruments are not well suited for assessing current inappropriate medication use within the specific context of clinical trials. Further effort is needed to develop reliable and valid instruments to measure current inappropriate medication use events in clinical trials. PMID:25660826
ERIC Educational Resources Information Center
Wollersheim Shervey, Sarah; Sandilos, Lia E.; DiPerna, James C.; Lei, Pui-Wa
2017-01-01
The purpose of this study was to examine the social validity of the Social Skills Improvement System--Classwide Intervention Program (SSIS-CIP) for teachers in the primary grades. Participants included 45 first and second grade teachers who completed a 16-item social validity questionnaire during each year of the SSIS-CIP efficacy trial. Findings…
ERIC Educational Resources Information Center
Wollersheim Shervey, Sarah; Sandilos, Lia E.; DiPerna, James C.; Lei, Pui-Wa
2017-01-01
The purpose of this study was to examine the social validity of the Social Skills Improvement System-Classwide Intervention Program (SSIS-CIP) for teachers in the primary grades. Participants included 45 first and second grade teachers who completed a 16-item social validity questionnaire during each year of the SSIS-CIP efficacy trial. Findings…
ERIC Educational Resources Information Center
Gleason, Mary Margaret; Fox, Nathan A.; Drury, Stacy; Smyke, Anna; Egger, Helen L.; Nelson, Charles A., III; Gregas, Matthew C.; Zeanah, Charles H.
2011-01-01
Objective: This study examined the validity of criteria for indiscriminately social/disinhibited and emotionally withdrawn/inhibited reactive attachment disorder (RAD). Method: As part of a longitudinal intervention trial of previously institutionalized children, caregiver interviews and direct observational measurements provided continuous and…
Big Data in Designing Clinical Trials: Opportunities and Challenges
Mayo, Charles S.; Matuszak, Martha M.; Schipper, Matthew J.; Jolly, Shruti; Hayman, James A.; Ten Haken, Randall K.
2017-01-01
Emergence of big data analytics resource systems (BDARSs) as a part of routine practice in Radiation Oncology is on the horizon. Gradually, individual researchers, vendors, and professional societies are leading initiatives to create and demonstrate use of automated systems. What are the implications for design of clinical trials, as these systems emerge? Gold standard, randomized controlled trials (RCTs) have high internal validity for the patients and settings fitting constraints of the trial, but also have limitations including: reproducibility, generalizability to routine practice, infrequent external validation, selection bias, characterization of confounding factors, ethics, and use for rare events. BDARS present opportunities to augment and extend RCTs. Preliminary modeling using single- and muti-institutional BDARS may lead to better design and less cost. Standardizations in data elements, clinical processes, and nomenclatures used to decrease variability and increase veracity needed for automation and multi-institutional data pooling in BDARS also support ability to add clinical validation phases to clinical trial design and increase participation. However, volume and variety in BDARS present other technical, policy, and conceptual challenges including applicable statistical concepts, cloud-based technologies. In this summary, we will examine both the opportunities and the challenges for use of big data in design of clinical trials. PMID:28913177
Big Data in Designing Clinical Trials: Opportunities and Challenges.
Mayo, Charles S; Matuszak, Martha M; Schipper, Matthew J; Jolly, Shruti; Hayman, James A; Ten Haken, Randall K
2017-01-01
Emergence of big data analytics resource systems (BDARSs) as a part of routine practice in Radiation Oncology is on the horizon. Gradually, individual researchers, vendors, and professional societies are leading initiatives to create and demonstrate use of automated systems. What are the implications for design of clinical trials, as these systems emerge? Gold standard, randomized controlled trials (RCTs) have high internal validity for the patients and settings fitting constraints of the trial, but also have limitations including: reproducibility, generalizability to routine practice, infrequent external validation, selection bias, characterization of confounding factors, ethics, and use for rare events. BDARS present opportunities to augment and extend RCTs. Preliminary modeling using single- and muti-institutional BDARS may lead to better design and less cost. Standardizations in data elements, clinical processes, and nomenclatures used to decrease variability and increase veracity needed for automation and multi-institutional data pooling in BDARS also support ability to add clinical validation phases to clinical trial design and increase participation. However, volume and variety in BDARS present other technical, policy, and conceptual challenges including applicable statistical concepts, cloud-based technologies. In this summary, we will examine both the opportunities and the challenges for use of big data in design of clinical trials.
Tolley, Elizabeth E; Guthrie, Kate Morrow; Zissette, Seth; Fava, Joseph L; Gill, Katherine; Louw, Cheryl E; Kotze, Philip; Reddy, Krishnaveni; MacQueen, Kathleen
2018-01-01
Low adherence in recent HIV prevention clinical trials highlights the need to better understand, measure, and support product use within clinical trials. Conventional self-reported adherence instruments within HIV prevention trials, often relying on single-item questions, have proven ineffective. While objective adherence measures are desirable, none currently exist that apply to both active and placebo arms. Scales are composed of multiple items in the form of questions or statements that, when combined, measure a more complex construct that may not be directly observable. When psychometrically validated, such measures may better assess the multiple factors contributing to adherence/non-adherence. This study aimed to develop and psychometrically evaluate tools to screen and monitor trial participants' adherence to HIV prevention products within the context of clinical trial research. Based on an extensive literature review and conceptual framework, we identified and refined 86 items assessing potential predictors of adherence and 48 items assessing adherence experience. A structured survey, including adherence items and other variables, was administered to former ASPIRE and Ring Study participants and similar non-trial participants (n = 709). We conducted exploratory factor analyses (EFA) to identify a reduced set of constructs and items that could be used at screening to predict potential adherence, and at follow-up to monitor and intervene on adherence. We examined associations with other variables to assess content and construct validity. The EFA of screener items resulted in a 6-factor solution with acceptable to very good internal reliability (α: .62-.84). Similar to our conceptual framework, factors represent trial-related commitment (Distrust of Research and Commitment to Research); alignment with trial requirements (Visit Adherence and Trial Incompatibility); Belief in Trial Benefits and Partner Disclosure. The EFA on monitoring items resulted in 4 Product-specific factors that represent Vaginal Ring Doubts, Vaginal Ring Benefits, Ring Removal, and Side Effects with good to very good internal reliability (α = .71-.82). Evidence of content and construct validity was found; relationship to social desirability bias was examined. These scales are easy and inexpensive to administer, available in several languages, and are applicable regardless of randomization. Once validated prospectively, they could (1) screen for propensity to adhere, (2) target adherence support/counselling, and (3) complement biomarker measures in determining true efficacy of the experimental product.
Irsik, Vanessa C; Vanden Bosch der Nederlanden, Christina M; Snyder, Joel S
2016-11-01
Attention and other processing constraints limit the perception of objects in complex scenes, which has been studied extensively in the visual sense. We used a change deafness paradigm to examine how attention to particular objects helps and hurts the ability to notice changes within complex auditory scenes. In a counterbalanced design, we examined how cueing attention to particular objects affected performance in an auditory change-detection task through the use of valid or invalid cues and trials without cues (Experiment 1). We further examined how successful encoding predicted change-detection performance using an object-encoding task and we addressed whether performing the object-encoding task along with the change-detection task affected performance overall (Experiment 2). Participants had more error for invalid compared to valid and uncued trials, but this effect was reduced in Experiment 2 compared to Experiment 1. When the object-encoding task was present, listeners who completed the uncued condition first had less overall error than those who completed the cued condition first. All participants showed less change deafness when they successfully encoded change-relevant compared to irrelevant objects during valid and uncued trials. However, only participants who completed the uncued condition first also showed this effect during invalid cue trials, suggesting a broader scope of attention. These findings provide converging evidence that attention to change-relevant objects is crucial for successful detection of acoustic changes and that encouraging broad attention to multiple objects is the best way to reduce change deafness. (PsycINFO Database Record (c) 2016 APA, all rights reserved).
Panayidou, Klea; Gsteiger, Sandro; Egger, Matthias; Kilcher, Gablu; Carreras, Máximo; Efthimiou, Orestis; Debray, Thomas P A; Trelle, Sven; Hummel, Noemi
2016-09-01
The performance of a drug in a clinical trial setting often does not reflect its effect in daily clinical practice. In this third of three reviews, we examine the applications that have been used in the literature to predict real-world effectiveness from randomized controlled trial efficacy data. We searched MEDLINE, EMBASE from inception to March 2014, the Cochrane Methodology Register, and websites of key journals and organisations and reference lists. We extracted data on the type of model and predictions, data sources, validation and sensitivity analyses, disease area and software. We identified 12 articles in which four approaches were used: multi-state models, discrete event simulation models, physiology-based models and survival and generalized linear models. Studies predicted outcomes over longer time periods in different patient populations, including patients with lower levels of adherence or persistence to treatment or examined doses not tested in trials. Eight studies included individual patient data. Seven examined cardiovascular and metabolic diseases and three neurological conditions. Most studies included sensitivity analyses, but external validation was performed in only three studies. We conclude that mathematical modelling to predict real-world effectiveness of drug interventions is not widely used at present and not well validated. © 2016 The Authors Research Synthesis Methods Published by John Wiley & Sons Ltd. © 2016 The Authors Research Synthesis Methods Published by John Wiley & Sons Ltd.
Validation of Geriatric Depression Scale--5 Scores among Sedentary Older Adults
ERIC Educational Resources Information Center
Marquez, David X.; McAuley, Edward; Motl, Robert W.; Elavsky, Steriani; Konopack, James F.; Jerome, Gerald J.; Kramer, Arthur F.
2006-01-01
This study examined the validity of Geriatric Depression Scale--5 (GDS-5) scores among older sedentary adults based on its structural properties and relationship with external criteria. Participants from two samples (Ns = 185 and 93; M ages = 66 and 67 years) completed baseline assessments as part of randomized controlled exercise trials.…
ERIC Educational Resources Information Center
Rapee, Ronald M.; Abbott, Maree J.; Lyneham, Heidi J.
2006-01-01
The current trial examined the value of modifying empirically validated treatment for childhood anxiety for application via written materials for parents of anxious children. Two hundred sixty-seven clinically anxious children ages 6-12 years and their parents were randomly allocated to standard group treatment, wait list, or a bibliotherapy…
Therapeutic touch for anxiety disorders.
Robinson, J; Biley, F C; Dolk, H
2007-07-18
Anxiety disorders are a common occurrence in today's society. There is interest from the community in the use of complementary therapies for anxiety disorders. This review examined the currently available evidence supporting the use of therapeutic touch in treating anxiety disorders. To examine the efficacy and adverse effects of therapeutic touch for anxiety disorders. We searched the Cochrane Collaboration Depression, Anxiety and Neurosis Controlled Trials Registers (CCDANCTR-Studies and CCDANCTR-References) (search date 13/01/06), the Controlled Trials website and Dissertation Abstracts International. Searches of reference lists of retrieved papers were also carried out and experts in the field were contacted. Inclusion criteria included all published and unpublished randomised and quasi-randomised controlled trials comparing therapeutic touch with sham (mimic) TT, pharmacological therapy, psychological treatment, other treatment or no treatment /waiting list. The participants included adults with an anxiety disorder defined by the Diagnostic and Statistical Manual (DSM-IV),the International Classification of Diseases (ICD-10), validated diagnostic instruments, or other validated clinician or self-report instruments. Two review authors independently applied inclusion criteria. Further information was sought from trialists where papers contained insufficient information to make a decision about eligibility. No randomised or quasi-randomised controlled trials of therapeutic touch for anxiety disorders were identified. Given the high prevalence of anxiety disorders and the current paucity of evidence on therapeutic touch in this population, there is a need for well conducted randomised controlled trials to examine the effectiveness of therapeutic touch for anxiety disorders.
Saunders, Gabrielle H; Biswas, Kousick; Serpi, Tracey; McGovern, Stephanie; Groer, Shirley; Stock, Eileen M; Magruder, Kathryn M; Storzbach, Daniel; Skelton, Kelly; Abrams, Thad; McCranie, Mark; Richerson, Joan; Dorn, Patricia A; Huang, Grant D; Fallon, Michael T
2017-11-01
Posttraumatic stress disorder (PTSD) is a leading cause of impairments in quality of life and functioning among Veterans. Service dogs have been promoted as an effective adjunctive intervention for PTSD, however published research is limited and design and implementation flaws in published studies limit validated conclusions. This paper describes the rationale for the study design, a detailed methodological description, and implementation challenges of a multisite randomized clinical trial examining the impact of service dogs on the on the functioning and quality of life of Veterans with PTSD. Trial design considerations prioritized participant and intervention (dog) safety, selection of an intervention comparison group that would optimize enrollment in all treatment arms, pragmatic methods to ensure healthy well-trained dogs, and the selection of outcomes for achieving scientific and clinical validity in a Veteran PTSD population. Since there is no blueprint for conducting a randomized clinical trial examining the impact of dogs on PTSD of this size and scope, it is our primary intent that the successful completion of this trial will set a benchmark for future trial design and scientific rigor, as well as guiding researchers aiming to better understand the role that dogs can have in the management of Veterans experiencing mental health conditions such as PTSD. Published by Elsevier Inc.
Yamato, Tie Parma; Maher, Chris; Koes, Bart; Moseley, Anne
2017-06-01
The Physiotherapy Evidence Database (PEDro) scale has been widely used to investigate methodological quality in physiotherapy randomized controlled trials; however, its validity has not been tested for pharmaceutical trials. The aim of this study was to investigate the validity and interrater reliability of the PEDro scale for pharmaceutical trials. The reliability was also examined for the Cochrane Back and Neck (CBN) Group risk of bias tool. This is a secondary analysis of data from a previous study. We considered randomized placebo controlled trials evaluating any pain medication for chronic spinal pain or osteoarthritis. Convergent validity was evaluated by correlating the PEDro score with the summary score of the CBN risk of bias tool. The construct validity was tested using a linear regression analysis to determine the degree to which the total PEDro score is associated with treatment effect sizes, journal impact factor, and the summary score for the CBN risk of bias tool. The interrater reliability was estimated using the Prevalence and Bias Adjusted Kappa coefficient and 95% confidence interval (CI) for the PEDro scale and CBN risk of bias tool. Fifty-three trials were included, with 91 treatment effect sizes included in the analyses. The correlation between PEDro scale and CBN risk of bias tool was 0.83 (95% CI 0.76-0.88) after adjusting for reliability, indicating strong convergence. The PEDro score was inversely associated with effect sizes, significantly associated with the summary score for the CBN risk of bias tool, and not associated with the journal impact factor. The interrater reliability for each item of the PEDro scale and CBN risk of bias tool was at least substantial for most items (>0.60). The intraclass correlation coefficient for the PEDro score was 0.80 (95% CI 0.68-0.88), and for the CBN, risk of bias tool was 0.81 (95% CI 0.69-0.88). There was evidence for the convergent and construct validity for the PEDro scale when used to evaluate methodological quality of pharmacological trials. Both risk of bias tools have acceptably high interrater reliability. Copyright © 2017 Elsevier Inc. All rights reserved.
ERIC Educational Resources Information Center
Japuntich, Sandra J.; Piper, Megan E.; Schlam, Tanya R.; Bolt, Daniel M.; Baker, Timothy B.
2009-01-01
Few studies have examined whether nicotine dependence self-report questionnaires can predict specific behaviors and symptoms at specific points in time. The present study used data from a randomized clinical trial (N = 608; M. E. Piper et al., 2007) to assess the construct validity of scales and items from 3 nicotine dependence measures: the…
Kuklinski, Margaret R; Hawkins, J David; Plotnick, Robert D; Abbott, Robert D; Reid, Carolina K
2013-06-01
This study examined implications of the economic downturn that began in December 2007 for the Community Youth Development Study (CYDS), a longitudinal randomized controlled trial of the Communities That Care (CTC) prevention system. The downturn had the potential to affect the internal validity of the CYDS research design and implementation of science-based prevention in study communities. We used archival economic indicators and community key leader reports of economic conditions to assess the extent of the economic downturn in CYDS communities and potential internal validity threats. We also examined whether stronger economic downturn effects were associated with a decline in science-based prevention implementation. Economic indicators suggested the downturn affected CYDS communities to different degrees. We found no evidence of systematic differences in downturn effects in CTC compared to control communities that would threaten internal validity of the randomized trial. The Community Economic Problems scale was a reliable measure of community economic conditions, and it showed criterion validity in relation to several objective economic indicators. CTC coalitions continued to implement science-based prevention to a significantly greater degree than control coalitions 2 years after the downturn began. However, CTC implementation levels declined to some extent as unemployment, the percentage of students qualifying for free lunch, and community economic problems worsened. Control coalition implementation levels were not related to economic conditions before or after the downturn, but mean implementation levels of science-based prevention were also relatively low in both periods.
Kuklinski, Margaret R.; Hawkins, J. David; Plotnick, Robert D.; Abbott, Robert D.; Reid, Carolina K.
2013-01-01
This study examined implications of the economic downturn that began in December 2007 for the Community Youth Development Study (CYDS), a longitudinal randomized controlled trial of the Communities That Care (CTC) prevention system. The downturn had the potential to affect the internal validity of the CYDS research design and implementation of science-based prevention in study communities. We used archival economic indicators and community key leader reports of economic conditions to assess the extent of the economic downturn in CYDS communities and potential internal validity threats. We also examined whether stronger economic downturn effects were associated with a decline in science-based prevention implementation. Economic indicators suggested the downturn affected CYDS communities to different degrees. We found no evidence of systematic differences in downturn effects in CTC compared to control communities that would threaten internal validity of the randomized trial. The Community Economic Problems scale was a reliable measure of community economic conditions, and it showed criterion validity in relation to several objective economic indicators. CTC coalitions continued to implement science-based prevention to a significantly greater degree than control coalitions 2 years after the downturn began. However, CTC implementation levels declined to some extent as unemployment, the percentage of students qualifying for free lunch, and community economic problems worsened. Control coalition implementation levels were not related to economic conditions before or after the downturn, but mean implementation levels of science-based prevention were also relatively low in both periods. PMID:23054169
I Spy with My Little Eye: Jurors' Detection of Internal Validity Threats in Expert Evidence
McAuliff, Bradley D.; Duckworth, Tejah D.
2010-01-01
This experiment examined whether jury-eligible community members (N = 223) were able to detect internally invalid psychological science presented at trial. Participants read a simulated child sexual abuse case in which the defense expert described a study he had conducted on witness memory and suggestibility. We varied the study's internal validity (valid, missing control group, confound, and experimenter bias) and publication status (published, unpublished). Expert evidence quality ratings were higher for the valid versus missing control group version only. Publication increased ratings of defendant guilt when the study was missing a control group. Variations in internal validity did not influence perceptions of child victim credibility or police interview quality. Participants' limited detection of internal validity threats underscores the need to examine the effectiveness of traditional legal safeguards against junk science in court and improve the scientific reasoning ability of lay people and legal professionals. PMID:20162342
2012-01-01
Background A method for assessing the model validity of randomised controlled trials of homeopathy is needed. To date, only conventional standards for assessing intrinsic bias (internal validity) of trials have been invoked, with little recognition of the special characteristics of homeopathy. We aimed to identify relevant judgmental domains to use in assessing the model validity of homeopathic treatment (MVHT). We define MVHT as the extent to which a homeopathic intervention and the main measure of its outcome, as implemented in a randomised controlled trial (RCT), reflect 'state-of-the-art' homeopathic practice. Methods Using an iterative process, an international group of experts developed a set of six judgmental domains, with associated descriptive criteria. The domains address: (I) the rationale for the choice of the particular homeopathic intervention; (II) the homeopathic principles reflected in the intervention; (III) the extent of homeopathic practitioner input; (IV) the nature of the main outcome measure; (V) the capability of the main outcome measure to detect change; (VI) the length of follow-up to the endpoint of the study. Six papers reporting RCTs of homeopathy of varying design were randomly selected from the literature. A standard form was used to record each assessor's independent response per domain, using the optional verdicts 'Yes', 'Unclear', 'No'. Concordance among the eight verdicts per domain, across all six papers, was evaluated using the kappa (κ) statistic. Results The six judgmental domains enabled MVHT to be assessed with 'fair' to 'almost perfect' concordance in each case. For the six RCTs examined, the method allowed MVHT to be classified overall as 'acceptable' in three, 'unclear' in two, and 'inadequate' in one. Conclusion Future systematic reviews of RCTs in homeopathy should adopt the MVHT method as part of a complete appraisal of trial validity. PMID:22510227
Allmark, P; Mason, S
2006-08-01
To assess whether continuous consent, a process in which information is given to research participants at different stages in a trial, and clinician training in that process were effective when used by clinicians while gaining consent to the Total Body Hypothermia (TOBY) trial. The TOBY trial is a randomised controlled trial (RCT) investigating the use of whole-body cooling for neonates with evidence of perinatal asphyxia. Obtaining valid informed consent for the TOBY trial is difficult, but is a good test of the effectiveness of continuous consent. Semistructured interviews were conducted with 30 sets of parents who consented to the TOBY trial and with 10 clinicians who sought it by the continuous consent process. Analysis was focused on the validity of parental consent based on the consent components of competence, information, understanding and voluntariness. No marked problems with consent validity at the point of signature were observed in 19 of 27 (70%) couples. Problems were found mainly to lie with the competence and understanding of the parents: mothers, particularly, had problems with competence in the early stages of consent. Problems in understanding were primarily to do with side effects. Problems in both competence and understanding were observed to reduce markedly, particularly for mothers, in the post-signature phase, when further discussion took place. Randomisation was generally understood but unpopular. Information was not always given by clinicians in stages during the short period available before parents gave consent. Most clinicians, however, were able to give follow-up information. Consent validity was found to compare favourably with similar trials examined in the Euricon study. Adopting the elements of the continuous consent process and clinician training in RCTs should be considered by researchers, particularly when they have concerns about the quality of consent they are likely to obtain by using a conventional process.
Poor reporting of scientific leadership information in clinical trial registers.
Sekeres, Melanie; Gold, Jennifer L; Chan, An-Wen; Lexchin, Joel; Moher, David; Van Laethem, Marleen L P; Maskalyk, James; Ferris, Lorraine; Taback, Nathan; Rochon, Paula A
2008-02-20
In September 2004, the International Committee of Medical Journal Editors (ICMJE) issued a Statement requiring that all clinical trials be registered at inception in a public register in order to be considered for publication. The World Health Organization (WHO) and ICMJE have identified 20 items that should be provided before a trial is considered registered, including contact information. Identifying those scientifically responsible for trial conduct increases accountability. The objective is to examine the proportion of registered clinical trials providing valid scientific leadership information. We reviewed clinical trial entries listing Canadian investigators in the two largest international and public trial registers, the International Standard Randomized Controlled Trial Number (ISRCTN) register, and ClinicalTrials.gov. The main outcome measures were the proportion of clinical trials reporting valid contact information for the trials' Principal Investigator (PI)/Co-ordinating Investigator/Study Chair/Site PI, and trial e-mail contact address, stratified by funding source, recruiting status, and register. A total of 1388 entries (142 from ISRCTN and 1246 from ClinicalTrials.gov) comprised our sample. We found non-compliance with mandatory registration requirements regarding scientific leadership and trial contact information. Non-industry and partial industry funded trials were significantly more likely to identify the individual responsible for scientific leadership (OR = 259, 95% CI: 95-701) and to provide a contact e-mail address (OR = 9.6, 95% CI: 6.6-14) than were solely industry funded trials. Despite the requirements set by WHO and ICMJE, data on scientific leadership and contact e-mail addresses are frequently omitted from clinical trials registered in the two leading public clinical trial registers. To promote accountability and transparency in clinical trials research, public clinical trials registers should ensure adequate monitoring of trial registration to ensure completion of mandatory contact information fields identifying scientific leadership.
Hassett, Afton L; Li, Tracy; Buyske, Steven; Savage, Shantal V; Gignac, Monique A M
2008-05-01
To consider the feasibility of assessing multiple facets of independence in rheumatoid arthritis (RA) using a measure developed from existing items and examining its face validity, construct validity and responsiveness to change. The ATTAIN (Abatacept Trial in Treatment of Anti-tumor necrosis factor [TNF] Inadequate responders) database was used. Patients with RA were randomized 2:1, abatacept (n = 258) and placebo (n = 133). A multi-faceted scale to measure physical and psychosocial independence was constructed using items from the Health Assessment Questionnaire (HAQ) and Short Form 36 Health Survey (SF-36). Questions assessing activity limitations and need for outside caregiver help were also examined. Interviews with 20 RA patients assessed face validity. Item Response Theory analysis yielded two traits - 'Psychosocial Independence', derived from the number of days with activity limitations plus the Role Emotional, Social Functioning and Role Physical subscale items from the SF-36; and 'Physical Independence', derived from 15 HAQ items assessing need for help from another. The two traits showed no significant differential item functioning for age or gender and demonstrated good face validity. Changes over 169 days on Psychosocial Independence were greater (mean 0.46 units, 95% confidence interval [CI]: 0.17-0.75) for the abatacept group than for placebo (p = 0.002). Changes in Physical Independence were greater (mean 0.59 units, 95% CI: 0.35-0.82) for the abatacept group than for placebo (p < 0.001). The multi-faceted assessment of independence in RA based on items from commonly used instruments is feasible suggesting promise for evaluating independence in future clinical trials. This approach demonstrated good face and construct validity and responsiveness in RA patients who had previously failed anti-TNF therapy. However, we caution against an interpretation that these data suggest that abatacept improves independence because the component parts of this assessment came from instruments used in the ATTAIN trial where data had been previously analyzed.
Bauer, Lyndsey; O'Bryant, Sid E; Lynch, Julie K; McCaffrey, Robert J; Fisher, Jerid M
2007-09-01
Assessing effort level during neuropsychological evaluations is critical to support the accuracy of cognitive test scores. Many instruments are designed to measure effort, yet they are not routinely administered in neuropsychological assessments. The Test of Memory Malingering (TOMM) and the Word Memory Test (WMT) are commonly administered symptom validity tests with sound psychometric properties. This study examines the use of the TOMM Trial 1 and the WMT Immediate Recognition (IR) trial scores as brief screening tools for insufficient effort through an archival analysis of a combined sample of mild head-injury litigants ( N = 105) who were assessed in forensic private practices. Results show that both demonstrate impressive diagnostic accuracy and calculations of positive and negative predictive power are presented for a range of base rates. These results support the utility of Trial 1 of the TOMM and the WMT IR trial as screening methods for the assessment of insufficient effort in neuropsychological assessments.
Challenges in translating endpoints from trials to observational cohort studies in oncology
Ording, Anne Gulbech; Cronin-Fenton, Deirdre; Ehrenstein, Vera; Lash, Timothy L; Acquavella, John; Rørth, Mikael; Sørensen, Henrik Toft
2016-01-01
Clinical trials are considered the gold standard for examining drug efficacy and for approval of new drugs. Medical databases and population surveillance registries are valuable resources for post-approval observational research, which are increasingly used in studies of benefits and risk of new cancer drugs. Here, we address the challenges in translating endpoints from oncology trials to observational studies. Registry-based cohort studies can investigate real-world safety issues – including previously unrecognized concerns – by examining rare endpoints or multiple endpoints at once. In contrast to clinical trials, observational cohort studies typically do not exclude real-world patients from clinical practice, such as old and frail patients with comorbidity. The observational cohort study complements the clinical trial by examining the effectiveness of interventions applied in clinical practice and by providing evidence on long-term clinical outcomes, which are often not feasible to study in a clinical trial. Various endpoints can be included in clinical trials, such as hard endpoints, soft endpoints, surrogate endpoints, and patient-reported endpoints. Each endpoint has it strengths and limitations for use in research studies. Endpoints used in oncology trials are often not applicable in observational cohort studies which are limited by the setting of standard clinical practice and by non-standardized endpoint determination. Observational studies can be more helpful moving research forward if they restrict focus to appropriate and valid endpoints. PMID:27354827
Conflict processing in the anterior cingulate cortex constrains response priming.
Pastötter, Bernhard; Hanslmayr, Simon; Bäuml, Karl-Heinz T
2010-05-01
A prominent function of the anterior cingulate cortex (ACC) is to process conflict between competing response options. In this study, we investigated the role of conflict processing in a response-priming task in which manual responses were either validly or invalidly cued. Examining electrophysiological measurements of oscillatory brain activity on the source level, we found response priming to be related to a beta power decrease in the premotor cortex and conflict processing to be linked to a theta power increase in the ACC. In particular, correlation of oscillatory brain activities in the ACC and the premotor cortex showed that conflict processing reduces response priming by slowing response time in valid trials and lowering response errors in invalid trials. This relationship emerged on a between subjects level as well as within subjects, on a single trial level. These findings suggest that conflict processing in the ACC constrains the automatic priming process. 2010 Elsevier Inc. All rights reserved.
Study design and "evidence" in patient-oriented research.
Concato, John
2013-06-01
Individual studies in patient-oriented research, whether described as "comparative effectiveness" or using other terms, are based on underlying methodological designs. A simple taxonomy of study designs includes randomized controlled trials on the one hand, and observational studies (such as case series, cohort studies, and case-control studies) on the other. A rigid hierarchy of these design types is a fairly recent phenomenon, promoted as a tenet of "evidence-based medicine," with randomized controlled trials receiving gold-standard status in terms of producing valid results. Although randomized trials have many strengths, and contribute substantially to the evidence base in clinical care, making presumptions about the quality of a study based solely on category of research design is unscientific. Both the limitations of randomized trials as well as the strengths of observational studies tend to be overlooked when a priori assumptions are made. This essay presents an argument in support of a more balanced approach to evaluating evidence, and discusses representative examples from the general medical as well as pulmonary and critical care literature. The simultaneous consideration of validity (whether results are correct "internally") and generalizability (how well results apply to "external" populations) is warranted in assessing whether a study's results are accurate for patients likely to receive the intervention-examining the intersection of clinical and methodological issues in what can be called a medicine-based evidence approach. Examination of cause-effect associations in patient-oriented research should recognize both the strengths and limitations of randomized trials as well as observational studies.
Keil, Lukas G; Platts-Mills, Timothy F; Jones, Christopher W
2015-10-01
Publication bias compromises the validity of systematic reviews. This problem can be addressed in part through searching clinical trials registries to identify unpublished studies. This study aims to determine how often systematic reviews published in emergency medicine journals include clinical trials registry searches. We identified all systematic reviews published in the 6 highest-impact emergency medicine journals between January 1 and December 31, 2013. Systematic reviews that assessed the effects of an intervention were further examined to determine whether the authors described searching a clinical trials registry and whether this search identified relevant unpublished studies. Of 191 articles identified through PubMed search, 80 were confirmed to be systematic reviews. Our sample consisted of 41 systematic reviews that assessed a specific intervention. Eight of these 41 (20%) searched a clinical trials registry. For 4 of these 8 reviews, the registry search identified at least 1 relevant unpublished study. Systematic reviews published in emergency medicine journals do not routinely include searches of clinical trials registries. By helping authors identify unpublished trial data, the addition of registry searches may improve the validity of systematic reviews. Copyright © 2014 American College of Emergency Physicians. Published by Elsevier Inc. All rights reserved.
Validity, Reliability, and Inertia of Four Different Temperature Capsule Systems.
Bongers, Coen C W G; Daanen, Hein A M; Bogerd, Cornelis P; Hopman, Maria T E; Eijsvogels, Thijs M H
2018-01-01
Telemetric temperature capsule systems are wireless, relatively noninvasive, and easily applicable in field conditions and have therefore great advantages for monitoring core body temperature. However, the accuracy and responsiveness of available capsule systems have not been compared previously. Therefore, the aim of this study was to examine the validity, reliability, and inertia characteristics of four ingestible temperature capsule systems (i.e., CorTemp, e-Celsius, myTemp, and VitalSense). Ten temperature capsules were examined for each system in a temperature-controlled water bath during three trials. The water bath temperature gradually increased from 33°C to 44°C in trials 1 and 2 to assess the validity and reliability, and from 36°C to 42°C in trial 3 to assess the inertia characteristics of the temperature capsules. A systematic difference between capsule and water bath temperature was found for CorTemp (0.077°C ± 0.040°C), e-Celsius (-0.081°C ± 0.055°C), myTemp (-0.003°C ± 0.006°C), and VitalSense (-0.017°C ± 0.023°C; P < 0.010), with the lowest bias for the myTemp system (P < 0.001). A systematic difference was found between trial 1 and trial 2 for CorTemp (0.017°C ± 0.083°C; P = 0.030) and e-Celsius (-0.007°C ± 0.033°C; P = 0.019), whereas temperature values of myTemp (0.001°C ± 0.008°C) and VitalSense (0.002°C ± 0.014°C) did not differ (P > 0.05). Comparable inertia characteristics were found for CorTemp (25 ± 4 s), e-Celsius (21 ± 13 s), and myTemp (19 ± 2 s), whereas the VitalSense system responded more slowly (39 ± 6 s) to changes in water bath temperature (P < 0.001). Although differences in temperature and inertia were observed between capsule systems, an excellent validity, test-retest reliability, and inertia was found for each system between 36°C and 44°C after removal of outliers.
Validity and reliability of the Omron HJ-303 tri-axial accelerometer-based pedometer.
Steeves, Jeremy A; Tyo, Brian M; Connolly, Christopher P; Gregory, Douglas A; Stark, Nyle A; Bassett, David R
2011-09-01
This study compared the validity of a new Omron HJ-303 piezoelectric pedometer and 2 other pedometers (Sportline Traq and Yamax SW200). To examine the effect of speed, 60 subjects walked on a treadmill at 2, 3, and 4 mph. Twenty subjects also ran at 6, 7, and 8 mph. To test lifestyle activities, 60 subjects performed front-back-side-side stepping, elliptical machine and stair climbing/descending. Twenty others performed ballroom dancing. Sixty participants completed 5 100-step trials while wearing 5 different sets of the devices tested device reliability. Actual steps were determined using a hand tally counter. Significant differences existed among pedometers (P < .05). For walking, the Omron pedometers were the most valid. The Sportline overestimated and the Yamax underestimated steps (P < .05). Worn on the waist or in the backpack, the Omron device and Sportline were valid for running. The Omron was valid for 3 activities (elliptical machine, ascending and descending stairs). The Sportline overestimated all of these activities, and Yamax was only valid for descending stairs. The Omron andYamax were both valid and reliable in the 100-step trials. The Omron HJ-303, worn on the waist, appeared to be the most valid of the 3 pedometers.
An Initial Examination for Verifying Separation Algorithms by Simulation
NASA Technical Reports Server (NTRS)
White, Allan L.; Neogi, Natasha; Herencia-Zapana, Heber
2012-01-01
An open question in algorithms for aircraft is what can be validated by simulation where the simulation shows that the probability of undesirable events is below some given level at some confidence level. The problem is including enough realism to be convincing while retaining enough efficiency to run the large number of trials needed for high confidence. The paper first proposes a goal based on the number of flights per year in several regions. The paper examines the probabilistic interpretation of this goal and computes the number of trials needed to establish it at an equivalent confidence level. Since any simulation is likely to consider the algorithms for only one type of event and there are several types of events, the paper examines under what conditions this separate consideration is valid. This paper is an initial effort, and as such, it considers separation maneuvers, which are elementary but include numerous aspects of aircraft behavior. The scenario includes decisions under uncertainty since the position of each aircraft is only known to the other by broadcasting where GPS believes each aircraft to be (ADS-B). Each aircraft operates under feedback control with perturbations. It is shown that a scenario three or four orders of magnitude more complex is feasible. The question of what can be validated by simulation remains open, but there is reason to be optimistic.
Piazza, Alexander M; Binversie, Emily E; Baker, Lauren A; Nemke, Brett; Sample, Susannah J; Muir, Peter
2017-04-01
OBJECTIVE To determine whether walking at specific ranges of absolute and relative (V*) velocity would aid efficient capture of gait trial data with low ground reaction force (GRF) variance in a heterogeneous sample of dogs. ANIMALS 17 clinically normal dogs of various breeds, ages, and sexes. PROCEDURES Each dog was walked across a force platform at its preferred velocity, with controlled acceleration within 0.5 m/s 2 . Ranges in V* were created for height at the highest point of the shoulders (withers; WHV*). Variance effects from 8 walking absolute velocity ranges and associated WHV* ranges were examined by means of repeated-measures ANCOVA. RESULTS The individual dog effect provided the greatest contribution to variance. Narrow velocity ranges typically resulted in capture of a smaller percentage of valid trials and were not consistently associated with lower variance. The WHV* range of 0.33 to 0.46 allowed capture of valid trials efficiently, with no significant effects on peak vertical force and vertical impulse. CONCLUSIONS AND CLINICAL RELEVANCE Dogs with severe lameness may be unable to trot or may have a decline in mobility with gait trial repetition. Gait analysis involving evaluation of individual dogs at their preferred absolute velocity, such that dogs are evaluated at a similar V*, may facilitate efficient capture of valid trials without significant effects on GRF. Use of individual velocity ranges derived from a WHV* range of 0.33 to 0.46 can account for heterogeneity and appears suitable for use in clinical trials involving dogs at a walking gait.
Marable, Brian R; Maurissen, Jacques P J
2004-01-01
Neurotoxicity regulatory guidelines mandate that automated test systems be validated using chemicals. However, in some cases, chemicals may not necessarily be needed to prove test system validity. To examine this issue, two independent experiments were conducted to validate an automated auditory startle response (ASR) system. In Experiment 1, we used adult (PND 63) and weanling (PND 22) Sprague-Dawley rats (10/sex/dose) to determine the effect of either d-amphetamine (4.0 or 8.0 mg/kg) or clonidine (0.4 or 0.8 mg/kg) on the ASR peak amplitude (ASR PA). The startle response of each rat to a short burst of white noise (120 dB SPL) was recorded over 50 consecutive trials. The ASR PA was significantly decreased (by clonidine) and increased (by d-amphetamine) compared to controls in PND 63 rats. In PND 22 rats, the response to clonidine was similar to adults, but d-amphetamine effects were not significant. Neither drug affected the rate of the decrease in ASR PA over time (habituation). In Experiment 2, PND 31 Sprague-Dawley rats (8/sex) were presented with 150 trials consisting of either white noise bursts of variable intensity (70-120 dB SPL in 10 dB increments, presented in random order) or null (0 dB SPL) trials. Statistically significant sex- and intensity-dependent differences were detected in the ASR PA. These results suggest that in some cases, parametric modulation may be an alternative to using chemicals for test system validation.
Updating of Attentional and Premotor Allocation Resources as function of previous trial outcome
Arjona, Antonio; Escudero, Miguel; Gómez, Carlos M.
2014-01-01
The neural bases of the inter-trial validity/invalidity sequential effects in a visuo-auditory modified version of the Central Cue Posner's Paradigm (CCPP) are analyzed by means of Early Directing Attention Negativity (EDAN), Contingent Negative Variation (CNV) and Lateralized Readiness Potential (LRP). ERPs results indicated an increase in CNV and LRP in trials preceded by valid trials compared to trials preceded by invalid trials. The CNV and LRP pattern would be highly related to the behavioral pattern of lower RTs and higher number of anticipations in trials preceded by valid with respect to trials preceded by invalid trials. This effect was not preceded by a modulation of the EDAN as a result of the previous trial condition. The results suggest that there is a trial-by-trial dynamic modulation of the attentional system as a function of the validity assigned to the cue, in which conditional probabilities between cue and target are continuously updated. PMID:24681570
Wollersheim Shervey, Sarah; Sandilos, Lia E; DiPerna, James C; Lei, Pui-Wa
2017-09-01
The purpose of this study was to examine the social validity of the Social Skills Improvement System-Classwide Intervention Program (SSIS-CIP) for teachers in the primary grades. Participants included 45 first and second grade teachers who completed a 16-item social validity questionnaire during each year of the SSIS-CIP efficacy trial. Findings indicated that teachers generally perceived the SSIS-CIP as a socially valid and feasible intervention for primary grades; however, teachers' ratings regarding ease of implementation and relevance and sequence demonstrated differences across grade levels in the second year of implementation. (PsycINFO Database Record (c) 2017 APA, all rights reserved).
Kirouac, Megan; Stein, Elizabeth R; Pearson, Matthew R; Witkiewitz, Katie
2017-11-01
Quality of life is an outcome often examined in treatment research contexts such as biomedical trials, but has been studied less often in alcohol use disorder (AUD) treatment. The importance of considering QoL in substance use treatment research has recently been voiced, and measures of QoL have been administered in large AUD treatment trials. Yet, the viability of popular QoL measures has never been evaluated in AUD treatment samples. Accordingly, the present manuscript describes a psychometric examination of and prospective changes in the World Health Organization Quality of Life measure (WHOQOL-BREF) in a large sample (N = 1383) of patients with AUD recruited for the COMBINE Study. Specifically, we examined the construct validity (via confirmatory factor analyses), measurement invariance across time, internal consistency reliability, convergent validity, and effect sizes of post-treatment changes in the WHOQOL-BREF. Confirmatory factor analyses of the WHOQOL-BREF provided acceptable fit to the current data and this model was invariant across time. Internal consistency reliability was excellent (α > .9) for the full WHOQOL-BREF for each timepoint; the WHOQOL-BREF had good convergent validity, and medium effect size improvements were found in the full COMBINE sample across time. These findings suggest that the WHOQOL-BREF is an appropriate measure to use in samples with AUD, that the WHOQOL-BREF scores may be examined over time (e.g., from pre- to post-treatment), and the WHOQOL-BREF may be used to assess improvements in quality of life in AUD research.
Pauling, L; Herman, Z S
1989-01-01
With the assumption of the validity of the Hardin Jones principle that the death rate of members of a homogeneous cohort of cancer patients is constant, three criteria for the validity of clinical trials of cancer treatments are formulated. These criteria are satisfied by most published clinical trials, but one trial was found to violate all three, rendering the validity of its reported results uncertain. PMID:2780542
Krumholz, Samuel D; Egilman, David S; Ross, Joseph S
2011-06-27
Seeding trials, clinical studies conducted by pharmaceutical companies for marketing purposes, have rarely been described in detail. We examined all documents relating to the clinical trial Study of Neurontin: Titrate to Effect, Profile of Safety (STEPS) produced during the Neurontin marketing, sales practices, and product liability litigation, including company internal and external correspondence, reports, and presentations, as well as depositions elicited in legal proceedings of Harden Manufacturing vs Pfizer and Franklin vs Warner-Lambert, most which were created between 1990 and 2009. Using a systematic search strategy, we identified and reviewed all documents related to the STEPS trial in order to identify key themes related to the trial's conduct and determine the extent of marketing involvement in its planning and implementation. Documents demonstrated that STEPS was a seeding trial posing as a legitimate scientific study. Documents consistently described the trial itself, not trial results, to be a marketing tactic in the company's marketing plans. Documents demonstrated that at least 2 external sources questioned the validity of the study before execution, and that data quality during the study was often compromised. Furthermore, documents described company analyses examining the impact of participating as a STEPS investigator on rates and dosages of gabapentin prescribing, finding a positive association. None of these findings were reported in 2 published articles. The STEPS trial was a seeding trial, used to promote gabapentin and increase prescribing among investigators, and marketing was extensively involved in its planning and implementation.
Heald, Alison E; Fudman, Edward J; Anklesaria, Pervin; Mease, Philip J
2010-05-01
To assess the validity, responsiveness, and reliability of single-joint outcome measures for determining target joint (TJ) response in patients with inflammatory arthritis. Patient-reported outcomes (PRO), consisting of responses to single questions about TJ global status on a 100-mm visual analog scale (VAS; TJ global score), function on a 100-mm VAS (TJ function score), and pain on a 5-point Likert scale (TJ pain score) were piloted in 66 inflammatory arthritis subjects in a phase 1/2 clinical study of an intraarticular gene transfer agent and compared to physical examination measures (TJ swelling, TJ tenderness) and validated function questionnaires (Disabilities of the Arm, Shoulder and Hand scale, Rheumatoid Arthritis Outcome Score, and the Health Assessment Questionnaire). Construct validity was assessed by evaluating the correlation between the single-joint outcome measures and validated function questionnaires using Spearman's rank correlation. Responsiveness or sensitivity to change was assessed through calculating effect size and standardized response means (SRM). Reliability of physical examination measures was assessed by determining interobserver agreement. The single-joint PRO were highly correlated with each other and correlated well with validated functional measures. The TJ global score exhibited modest effect size and modest SRM that correlated well with the patient's assessment of response on a 100-mm VAS. Physical examination measures exhibited high interrater reliability, but correlated less well with validated functional measures and the patient's assessment of response. Single-joint PRO, particularly the TJ global score, are simple to administer and demonstrate construct validity and responsiveness in patients with inflammatory arthritis. (ClinicalTrials.gov identifier NCT00126724).
What tests should you use to assess small intestinal bacterial overgrowth in systemic sclerosis?
Braun-Moscovici, Yolanda; Braun, Marius; Khanna, Dinesh; Balbir-Gurman, Alexandra; Furst, Daniel E
2015-01-01
Small intestinal bacterial overgrowth (SIBO) plays a major role in the pathogenesis of malabsorption in SSc patients and is a source of great morbidity and even mortality, in those patients. This manuscript reviews which tests are valid and should be used in SSc when evaluating SIBO. We performed systematic literature searches in PubMed, Embase and the Cochrane library from 1966 up to November 2014 for English language, published articles examining bacterial overgrowth in SSc (e.g. malabsorption tests, breath tests, xylose test, etc). Articles obtained from these searches were reviewed for additional references. The validity of the tests was evaluated according to the OMERACT principles of truth, discrimination and feasibility. From a total of 65 titles, 22 articles were reviewed and 20 were ultimately extracted to examine the validity of tests for GI morphology, bacterial overgrowth and malabsorption in SSc. Only 1 test (hydrogen and methane breath tests) is fully validated. Four tests are partially validated, including jejunal cultures, xylose, lactulose tests, and 72 hours fecal fat test. Only 1 of a total of 5 GI tests of bacterial overgrowth (see above) is fully validated in SSc. For clinical trials, fully validated tests are preferred, although some investigators use partially validated tests (4 tests). Further validation of GI tests in SSc is needed.
ERIC Educational Resources Information Center
Asmus, Jennifer M.; Carter, Erik W.; Moss, Colleen K.; Biggs, Elizabeth E.; Bolt, Daniel M.; Born, Tiffany L.; Bottema-Beutel, Kristen; Brock, Matthew E.; Cattey, Gillian N.; Cooney, Molly; Fesperman, Ethan S.; Hochman, Julia M.; Huber, Heartley B.; Lequia, Jenna L.; Lyons, Gregory L.; Vincent, Lori B.; Weir, Katie
2017-01-01
This randomized controlled trial examined the efficacy of peer network interventions to improve the social connections of 47 high school students with severe disabilities. School staff invited, trained, and supported 192 peers without disabilities to participate in individualized social groups that met throughout one semester. Compared to…
ClinicalTrials.gov and Drugs@FDA: A comparison of results reporting for new drug approval trials
Schwartz, Lisa M.; Woloshin, Steven; Zheng, Eugene; Tse, Tony; Zarin, Deborah A.
2016-01-01
Background Pharmaceutical companies and other trial sponsors must submit certain trial results to ClinicalTrials.gov. The validity of these results is unclear. Purpose To validate results posted on ClinicalTrials.gov against publicly-available FDA reviews on Drugs@FDA. Data sources ClinicalTrials.gov (registry and results database) and Drugs@FDA (medical/statistical reviews). Study selection 100 parallel-group, randomized trials for new drug approvals (1/2013 – 7/2014) with results posted on ClinicalTrials.gov (3/15/2015). Data extraction Two assessors systematically extracted, and another verified, trial design, primary/secondary outcomes, adverse events, and deaths. Results The 100 trials were mostly phase 3 (90%) double-blind (92%), placebo-controlled (73%), representing 32 drugs from 24 companies. Of 137 primary outcomes from ClinicalTrials.gov, 134 (98%) had corresponding data in Drugs@FDA, 130 (95%) had concordant definitions, and 107 (78%) had concordant results; most differences were nominal (i.e. relative difference < 10%). Of 100 trials, primary outcome results in 14 could not be validated . Of 1,927 secondary outcomes from ClinicalTrials.gov, 1,061 (55%) definitions could be validated and 367 (19%) had results. Of 96 trials with ≥ 1 serious adverse event in either source, 14 could be compared and 7 were discordant. Of 62 trials with ≥ 1 death in either source, 25 could be compared and 17 were discordant. Limitations Unknown generalizability to uncontrolled or crossover trial results. Conclusion Primary outcome definitions and results were largely concordant between ClinicalTrials.gov and Drugs@FDA. Half of secondary outcomes could not be validated because Drugs@FDA only includes “key outcomes” for regulatory decision-making; nor could serious adverse events and deaths because Drugs@FDA frequently only includes results aggregated across multiple trials. PMID:27294570
Psychometric evaluation of ADAS-Cog and NTB for measuring drug response.
Karin, A; Hannesdottir, K; Jaeger, J; Annas, P; Segerdahl, M; Karlsson, P; Sjögren, N; von Rosen, T; Miller, F
2014-02-01
To conduct a psychometric analysis to determine the adequacy of instruments that measure cognition in Alzheimer's disease trials. Both the Alzheimer's Disease Assessment Scale - Cognition (ADAS-Cog) and the Neuropsychological Test Battery (NTB) are validated outcome measures for clinical trials in Alzheimer's disease and are approved also for regulatory purposes. However, it is not clear how comparable they are in measuring cognitive function. In fact, many recent trials in Alzheimer's disease patients have failed and it has been questioned if ADAS-Cog still is a sensitive measure. The present paper examines the psychometric properties of ADAS-Cog and NTB, based on a post hoc analysis of data from a clinical trial (NCT01024660), which was conducted by AstraZeneca, in mild-to-moderate Alzheimer's disease (AD) patients, with a Mini Mental State Examination (MMSE) Total score 16-24. Acceptability, reliability, different types of validity and ability to detect change were assessed using relevant statistical methods. Total scores of both tests, as well as separate domains of both tests, including the Wechsler Memory Scale (WMS), Rey Auditory Verbal Learning Test (RAVLT) and Delis-Kaplan Executive Function System (D-KEFS) Verbal Fluency Condition, were analyzed. Overall, NTB performed well, with acceptable reliability and ability to detect change, while ADAS-Cog had insufficient psychometric properties, including ceiling effects in 8 out of a total of 11 ADAS-Cog items in mild AD patients, as well as low test-retest reliability in some of the items. Based on a direct comparison on the same patient sample, we see advantages of the NTB compared with the ADAS-Cog for the evaluation of cognitive function in the population of mild-to-moderate AD patients. The results suggest that not all of ADAS-Cog items are relevant for both mild and moderate AD population. This validation study demonstrates satisfactory psychometric properties of the NTB, while ADAS-Cog was found to be psychometrically inadequate. © 2013 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Sertdemir, Y; Burgut, R
2009-01-01
In recent years the use of surrogate end points (S) has become an interesting issue. In clinical trials, it is important to get treatment outcomes as early as possible. For this reason there is a need for surrogate endpoints (S) which are measured earlier than the true endpoint (T). However, before a surrogate endpoint can be used it must be validated. For a candidate surrogate endpoint, for example time to recurrence, the validation result may change dramatically between clinical trials. The aim of this study is to show how the validation criterion (R(2)(trial)) proposed by Buyse et al. are influenced by the magnitude of treatment effect with an application using real data. The criterion R(2)(trial) proposed by Buyse et al. (2000) is applied to the four data sets from colon cancer clinical trials (C-01, C-02, C-03 and C-04). Each clinical trial is analyzed separately for treatment effect on survival (true endpoint) and recurrence free survival (surrogate endpoint) and this analysis is done also for each center in each trial. Results are used for standard validation analysis. The centers were grouped by the Wald statistic in 3 equal groups. Validation criteria R(2)(trial) were 0.641 95% CI (0.432-0.782), 0.223 95% CI (0.008-0.503), 0.761 95% CI (0.550-0.872) and 0.560 95% CI (0.404-0.687) for C-01, C-02, C-03 and C-04 respectively. The R(2)(trial) criteria changed by the Wald statistics observed for the centers used in the validation process. Higher the Wald statistic groups are higher the R(2)(trial) values observed. The recurrence free survival is not a good surrogate for overall survival in clinical trials with non significant treatment effects and moderate for significant treatment effects. This shows that the level of significance of treatment effect should be taken into account in validation process of surrogate endpoints.
Clinical trial designs for testing biomarker-based personalized therapies
Lai, Tze Leung; Lavori, Philip W; Shih, Mei-Chiung I; Sikic, Branimir I
2014-01-01
Background Advances in molecular therapeutics in the past decade have opened up new possibilities for treating cancer patients with personalized therapies, using biomarkers to determine which treatments are most likely to benefit them, but there are difficulties and unresolved issues in the development and validation of biomarker-based personalized therapies. We develop a new clinical trial design to address some of these issues. The goal is to capture the strengths of the frequentist and Bayesian approaches to address this problem in the recent literature and to circumvent their limitations. Methods We use generalized likelihood ratio tests of the intersection null and enriched strategy null hypotheses to derive a novel clinical trial design for the problem of advancing promising biomarker-guided strategies toward eventual validation. We also investigate the usefulness of adaptive randomization (AR) and futility stopping proposed in the recent literature. Results Simulation studies demonstrate the advantages of testing both the narrowly focused enriched strategy null hypothesis related to validating a proposed strategy and the intersection null hypothesis that can accommodate to a potentially successful strategy. AR and early termination of ineffective treatments offer increased probability of receiving the preferred treatment and better response rates for patients in the trial, at the expense of more complicated inference under small-to-moderate total sample sizes and some reduction in power. Limitations The binary response used in the development phase may not be a reliable indicator of treatment benefit on long-term clinical outcomes. In the proposed design, the biomarker-guided strategy (BGS) is not compared to ‘standard of care’, such as physician’s choice that may be informed by patient characteristics. Therefore, a positive result does not imply superiority of the BGS to ‘standard of care’. The proposed design and tests are valid asymptotically. Simulations are used to examine small-to-moderate sample properties. Conclusion Innovative clinical trial designs are needed to address the difficulties and issues in the development and validation of biomarker-based personalized therapies. The article shows the advantages of using likelihood inference and interim analysis to meet the challenges in the sample size needed and in the constantly evolving biomarker landscape and genomic and proteomic technologies. PMID:22397801
A New MI-Based Visualization Aided Validation Index for Mining Big Longitudinal Web Trial Data
Zhang, Zhaoyang; Fang, Hua; Wang, Honggang
2016-01-01
Web-delivered clinical trials generate big complex data. To help untangle the heterogeneity of treatment effects, unsupervised learning methods have been widely applied. However, identifying valid patterns is a priority but challenging issue for these methods. This paper, built upon our previous research on multiple imputation (MI)-based fuzzy clustering and validation, proposes a new MI-based Visualization-aided validation index (MIVOOS) to determine the optimal number of clusters for big incomplete longitudinal Web-trial data with inflated zeros. Different from a recently developed fuzzy clustering validation index, MIVOOS uses a more suitable overlap and separation measures for Web-trial data but does not depend on the choice of fuzzifiers as the widely used Xie and Beni (XB) index. Through optimizing the view angles of 3-D projections using Sammon mapping, the optimal 2-D projection-guided MIVOOS is obtained to better visualize and verify the patterns in conjunction with trajectory patterns. Compared with XB and VOS, our newly proposed MIVOOS shows its robustness in validating big Web-trial data under different missing data mechanisms using real and simulated Web-trial data. PMID:27482473
STEPS: A NARRATIVE ACCOUNT OF A GABAPENTIN SEEDING TRIAL
Krumholz, Samuel D.; Egilman, David S.; Ross, Joseph S.
2012-01-01
Background Seeding trials, clinical studies conducted by pharmaceutical companies for marketing purposes, have rarely been described in detail. Methods We examined all documents relating to the clinical trial Study of Neurontin: Titrate to Effect, Profile of Safety (STEPS) produced during the Neurontin marketing, sales practices and product liability litigation, including company internal and external correspondence, reports, and presentations, as well as depositions elicited in legal proceedings of Harden Manufacturing v. Pfizer and Franklin v. Warner-Lambert, the majority of which were created between 1990 and 2009. Using a systematic search strategy, we identified and reviewed all documents related to the STEPS trial, in order to identify key themes related to the trial’s conduct and determine the extent of marketing involvement in its planning and implementation. Results Documents demonstrated that STEPS was a seeding trial posing as a legitimate scientific study. Documents consistently described the trial itself, not trial results, to be a marketing tactic in the company’s marketing plans. Documents demonstrated that several external sources questioned the validity of the study before execution, and that data quality during the study was often compromised. Furthermore, documents described company analyses examining the impact of participating as a STEPS investigator on rates and dosages of gabapentin prescribing, finding a positive association. None of these findings were reported in two published papers. Conclusions The STEPS trial was a seeding trial, used to promote gabapentin and increase prescribing among investigators, and marketing was extensively involved in its planning and implementation. PMID:21709111
Paterson, Charlotte; Karatzias, Thanos; Dickson, Adele; Harper, Sean; Dougall, Nadine; Hutton, Paul
2018-04-16
The effectiveness of psychological therapies for those receiving acute adult mental health inpatient care remains unclear, partly because of the difficulty in conducting randomized controlled trials (RCTs) in this setting. The aim of this meta-analysis was to synthesize evidence from all controlled trials of psychological therapy carried out with this group, to estimate its effects on a number of important outcomes and examine whether the presence of randomization and rater blinding moderated these estimates. A systematic review and meta-analysis of all controlled trials of psychological therapy delivered in acute inpatient settings was conducted, with a focus on psychotic symptoms, readmissions or emotional distress (anxiety and depression). Studies were identified through ASSIA, EMBASE, CINAHL, Cochrane, MEDLINE, and PsycINFO using a combination of the key terms 'inpatient', 'psychological therapy', and 'acute'. No restriction was placed on diagnosis. The moderating effect of the use of assessor-blind RCT methodology was examined via subgroup and sensitivity analyses. Overall, psychological therapy was associated with small-to-moderate improvements in psychotic symptoms at end of therapy but the effect was smaller and not significant at follow-up. Psychological therapy was also associated with reduced readmissions, depression, and anxiety. The use of single-blind randomized controlled trial methodology was associated with significantly reduced benefits on psychotic symptoms and was also associated with reduced benefits on readmission and depression; however, these reductions were not statistically significant. The provision of psychological therapy to acute psychiatric inpatients is associated with improvements; however, the use of single-blind RCT methodology was associated with reduced therapy-attributable improvements. Whether this is a consequence of increased internal validity or reduced external validity is unclear. Trials with both high internal and external validity are now required to establish what type, format, and intensity of brief psychological therapy is required to achieve sustained benefits. Clinical implications: This review provides the first meta-analytical synthesis of brief psychological therapy delivered in acute psychiatric inpatient settings. This review suggests that brief psychological therapy may be associated with reduced emotional distress and readmissions. The evidence in this review is of limited quality. The type, format, and intensity of brief psychological therapy required to achieve sustained benefits are yet to be established. © 2018 The British Psychological Society.
Loudon, Kirsty; Zwarenstein, Merrick; Sullivan, Frank; Donnan, Peter; Treweek, Shaun
2013-04-27
If you want to know which of two or more healthcare interventions is most effective, the randomised controlled trial is the design of choice. Randomisation, however, does not itself promote the applicability of the results to situations other than the one in which the trial was done. A tool published in 2009, PRECIS (PRagmatic Explanatory Continuum Indicator Summaries) aimed to help trialists design trials that produced results matched to the aim of the trial, be that supporting clinical decision-making, or increasing knowledge of how an intervention works. Though generally positive, groups evaluating the tool have also found weaknesses, mainly that its inter-rater reliability is not clear, that it needs a scoring system and that some new domains might be needed. The aim of the study is to: Produce an improved and validated version of the PRECIS tool. Use this tool to compare the internal validity of, and effect estimates from, a set of explanatory and pragmatic trials matched by intervention. The study has four phases. Phase 1 involves brainstorming and a two-round Delphi survey of authors who cited PRECIS. In Phase 2, the Delphi results will then be discussed and alternative versions of PRECIS-2 developed and user-tested by experienced trialists. Phase 3 will evaluate the validity and reliability of the most promising PRECIS-2 candidate using a sample of 15 to 20 trials rated by 15 international trialists. We will assess inter-rater reliability, and raters' subjective global ratings of pragmatism compared to PRECIS-2 to assess convergent and face validity. Phase 4, to determine if pragmatic trials sacrifice internal validity in order to achieve applicability, will compare the internal validity and effect estimates of matched explanatory and pragmatic trials of the same intervention, condition and participants. Effect sizes for the trials will then be compared in a meta-regression. The Cochrane Risk of Bias scores will be compared with the PRECIS-2 scores of pragmatism. We have concrete suggestions for improving PRECIS and a growing list of enthusiastic individuals interested in contributing to this work. By early 2014 we expect to have a validated PRECIS-2.
ERIC Educational Resources Information Center
Martini, Jay R.
2017-01-01
The purpose of this study was to conduct a psychometric evaluation the "Sound Beginning" phonological awareness progress monitoring tool. This assessment was used to track emergent literacy skills of preschoolers with autism spectrum disorder who were participating in a randomized trial studying early literacy interventions. Research…
Gitlin, Laura N.; Parisi, Jeanine; Huang, Jin; Winter, Laraine; Roth, David L.
2016-01-01
Purpose of study: Examine psychometric properties of Lawton’s Valuation of Life (VOL) scale, a measure of an older adults’ assessment of the perceived value of their lives; and whether ratings differ by race (White, Black/African American) and sex. Design and Methods: The 13-item VOL scale was administered at baseline in 2 separate randomized trials (Advancing Better Living for Elders, ABLE; Get Busy Get Better, GBGB) for a total of 527 older adults. Principal component analyses were applied to a subset of ABLE data (subsample 1) and confirmatory factor analyses were conducted on remaining data (subsample 2 and GBGB). Once the factor structure was identified and confirmed, 2 subscales were created, corresponding to optimism and engagement. Convergent validity of total and subscale scores were examined using measures of depressive symptoms, social support, control-oriented strategies, mastery, and behavioral activation. For discriminant validity, indices of health status, physical function, financial strain, cognitive status, and number of falls were examined. Results: Trial samples (ABLE vs. GBGB) differed by age, race, marital status, education, and employment. Principal component analysis on ABLE subsample 1 (n = 156) yielded two factors subsequently confirmed in confirmatory factor analyses on ABLE subsample 2 (n = 163) and GBGB sample (N = 208) separately. Adequate fit was found for the 2-factor model. Correlational analyses supported strong convergent and discriminant validity. Some statistically significant race and sex differences in subscale scores were found. Implications: VOL measures subjective appraisals of perceived value of life. Consisting of two interrelated subscales, it offers an efficient approach to ascertain personal attributions. PMID:26874189
Strauss, Gregory P; Keller, William R; Buchanan, Robert W; Gold, James M; Fischer, Bernard A; McMahon, Robert P; Catalano, Lauren T; Culbreth, Adam J; Carpenter, William T; Kirkpatrick, Brian
2012-12-01
The current study examined the psychometric properties of the Brief Negative Symptom Scale (BNSS), a next-generation rating instrument developed in response to the NIMH sponsored consensus development conference on negative symptoms. Participants included 100 individuals with a DSM-IV diagnosis of schizophrenia or schizoaffective disorder who completed a clinical interview designed to assess negative, positive, disorganized, and general psychiatric symptoms, as well as functional outcome. A battery of anhedonia questionnaires and neuropsychological tests were also administered. Results indicated that the BNSS has excellent internal consistency and temporal stability, as well as good convergent and discriminant validity in its relationships with other symptom rating scales, functional outcome, self-reported anhedonia, and neuropsychological test scores. Given its brevity (13-items, 15-minute interview) and good psychometric characteristics, the BNSS can be considered a promising new instrument for use in clinical trials. Copyright © 2012 Elsevier B.V. All rights reserved.
Bayesian Adaptive Trial Design for a Newly Validated Surrogate Endpoint
Renfro, Lindsay A.; Carlin, Bradley P.; Sargent, Daniel J.
2011-01-01
Summary The evaluation of surrogate endpoints for primary use in future clinical trials is an increasingly important research area, due to demands for more efficient trials coupled with recent regulatory acceptance of some surrogates as ‘valid.’ However, little consideration has been given to how a trial which utilizes a newly-validated surrogate endpoint as its primary endpoint might be appropriately designed. We propose a novel Bayesian adaptive trial design that allows the new surrogate endpoint to play a dominant role in assessing the effect of an intervention, while remaining realistically cautious about its use. By incorporating multi-trial historical information on the validated relationship between the surrogate and clinical endpoints, then subsequently evaluating accumulating data against this relationship as the new trial progresses, we adaptively guard against an erroneous assessment of treatment based upon a truly invalid surrogate. When the joint outcomes in the new trial seem plausible given similar historical trials, we proceed with the surrogate endpoint as the primary endpoint, and do so adaptively–perhaps stopping the trial for early success or inferiority of the experimental treatment, or for futility. Otherwise, we discard the surrogate and switch adaptive determinations to the original primary endpoint. We use simulation to test the operating characteristics of this new design compared to a standard O’Brien-Fleming approach, as well as the ability of our design to discriminate trustworthy from untrustworthy surrogates in hypothetical future trials. Furthermore, we investigate possible benefits using patient-level data from 18 adjuvant therapy trials in colon cancer, where disease-free survival is considered a newly-validated surrogate endpoint for overall survival. PMID:21838811
Meinck, Franziska; Boyes, Mark E; Cluver, Lucie; Ward, Catherine L; Schmidt, Peter; DeStone, Sachin; Dunne, Michael P
2018-05-31
Child abuse prevention research has been hampered by a lack of validated multi-dimensional non-proprietary instruments, sensitive enough to measure change in abuse victimization or behavior. This study aimed to adapt the ICAST child abuse self-report measure (parent and child) for use in intervention studies and to investigate the psychometric properties of this substantially modified tool in a South African sample. First, cross-cultural and sensitivity adaptation of the original ICAST tools resulted in two preliminary measures (ICAST-Trial adolescents: 27 items, ICAST-Trial caregivers: 19 items). Second, ICAST-Trial data from a cluster randomized trial of a parenting intervention for families with adolescents (N = 1104, 552 caregiver-adolescent dyads) was analyzed. Confirmatory factor analysis established the hypothesized 6-factor (adolescents) and 4-factor (caregivers) structure. Removal of two items for adolescents and five for caregivers resulted in adequate model fit. Concurrent criterion validity analysis confirmed hypothesized relationships between child abuse and adolescent and caregiver mental health, adolescent behavior, discipline techniques and caregiver childhood abuse history. The resulting ICAST-Trial measures have 25 (adolescent) and 14 (caregiver) items respectively and measure physical, emotional and contact sexual abuse, neglect (both versions), and witnessing intimate partner violence and sexual harassment (adolescent version). The study established that both tools are sensitive to measuring change over time in response to a parenting intervention. The ICAST-Trial should have utility for evaluating the effectiveness of child abuse prevention efforts in similar socioeconomic contexts. Further research is needed to replicate these findings and examine cultural appropriateness, barriers for disclosure, and willingness to engage in child abuse research. Copyright © 2018 The Authors. Published by Elsevier Ltd.. All rights reserved.
Factor structure and validation of the Attentional Control Scale.
Judah, Matt R; Grant, DeMond M; Mills, Adam C; Lechner, William V
2014-04-01
The Attentional Control Scale (ACS; Derryberry & Reed, 2002) has been used to assess executive control over attention in numerous studies, but no published data have examined the factor structure of the English version. The current studies addressed this need and tested the predictive and convergent validity of the ACS subscales. In Study 1, exploratory factor analysis yielded a two-factor model with Focusing and Shifting subscales. In Study 2, confirmatory factor analysis supported this model and suggested superior fit compared to the factor structure of the Icelandic version (Ólafsson et al., 2011). Study 3 examined correlations between the ACS subscales and measures of working memory, anxiety, and cognitive control. Study 4 examined correlations between the subscales and reaction times on a mixed-antisaccade task, revealing positive correlations for antisaccade performance and prosaccade latency with Focusing scores and between switch trial performance and Shifting scores. Additionally, the findings partially supported unique relationships between Focusing and trait anxiety and between Shifting and depression that have been noted in recent research. Although the results generally support the validity of the ACS, additional research using performance-based tasks is needed.
Calabro, Miguel A; Welk, Gregory J; Carriquiry, Alicia L; Nusser, Sarah M; Beyler, Nicholas K; Mathews, Charles E
2009-03-01
The purpose of this study was to examine the validity of a computerized 24-hour physical activity recall instrument (24PAR). Participants (n=20) wore 2 pattern-recognition activity monitors (an IDEEA and a SenseWear Pro Armband) for a 24-hour period and then completed the 24PAR the following morning. Participants completed 2 trials, 1 while maintaining a prospective diary of their activities and 1 without a diary. The trials were counterbalanced and completed within a week from each other. Estimates of energy expenditure (EE) and minutes of moderate-to-vigorous physical activity (MVPA) were compared with the criterion measures using 3-way (method by gender by trial) mixed-model ANOVA analyses. For EE, pairwise correlations were high (r>.88), and there were no differences in estimates across methods. Estimates of MVPA were more variable, but correlations were still in the moderate to high range (r>.57). Average activity levels were significantly higher on the logging trial, but there was no significant difference in the accuracy of self-report on days with and without logging. The results of this study support the overall utility of the 24PAR for group-level estimates of daily EE and MVPA.
Measurement properties of the Haem-A-QoL in haemophilia clinical trials.
von Mackensen, S; Eldar-Lissai, A; Auguste, P; Krishnan, S; von Maltzahn, R; Yu, R; Wyrwich, K W
2017-05-01
Patients with haemophilia on long-acting prophylactic treatment may experience an improvement in health-related quality of life (HRQoL) through reductions in breakthrough bleeds and associated complications, including long-term joint damage, compared with episodic treatment. This analysis examined clinical trial data to understand the psychometric characteristics (reliability, validity and sensitivity to change over time) of the Haem-A-QoL Questionnaire in adult males with haemophilia. Two recent, multinational, Phase 3 clinical trials of new, long-acting factor concentrates (A-LONG: rFVIIIFc; B-LONG: rFIXFc) assessed HRQoL in adolescent and adult males with severe haemophilia A or B respectively. The adults' baseline assessments, via the 46-item Haem-A-QoL Questionnaire, and change over time at the 6-month assessment were used in the psychometric analyses. Internal consistency reliability was adequate (Cronbach's alpha > 0.70) for nine of the 10 Haem-A-QoL domains and for 'Total Score' in both trials at baseline (A-LONG, n = 133; B-LONG, n = 73). At baseline, several Haem-A-QoL domains and 'Total Score' demonstrated known-groups and convergent validity when compared with other trial measures, including the EQ-5D (items and total scores) and joint impairment. Change score correlations (baseline to 28 weeks) between the EQ-5D and the Haem-A-QoL 'Total Score', and 'Physical Health' and 'Feelings' domains were moderate in magnitude (│r│ ≥ 0.33; P < 0.03), demonstrating sensitivity to change for these outcome measures in A-LONG. These psychometric analyses provide evidence of the reliability, validity and ability to detect change of the Haem-A-QoL to assess the HRQoL of adult males with severe haemophilia A and B in longitudinal clinical trials. © 2016 The Authors. Haemophilia Published by John Wiley & Sons Ltd.
2013-01-01
Background If you want to know which of two or more healthcare interventions is most effective, the randomised controlled trial is the design of choice. Randomisation, however, does not itself promote the applicability of the results to situations other than the one in which the trial was done. A tool published in 2009, PRECIS (PRagmatic Explanatory Continuum Indicator Summaries) aimed to help trialists design trials that produced results matched to the aim of the trial, be that supporting clinical decision-making, or increasing knowledge of how an intervention works. Though generally positive, groups evaluating the tool have also found weaknesses, mainly that its inter-rater reliability is not clear, that it needs a scoring system and that some new domains might be needed. The aim of the study is to: Produce an improved and validated version of the PRECIS tool. Use this tool to compare the internal validity of, and effect estimates from, a set of explanatory and pragmatic trials matched by intervention. Methods The study has four phases. Phase 1 involves brainstorming and a two-round Delphi survey of authors who cited PRECIS. In Phase 2, the Delphi results will then be discussed and alternative versions of PRECIS-2 developed and user-tested by experienced trialists. Phase 3 will evaluate the validity and reliability of the most promising PRECIS-2 candidate using a sample of 15 to 20 trials rated by 15 international trialists. We will assess inter-rater reliability, and raters’ subjective global ratings of pragmatism compared to PRECIS-2 to assess convergent and face validity. Phase 4, to determine if pragmatic trials sacrifice internal validity in order to achieve applicability, will compare the internal validity and effect estimates of matched explanatory and pragmatic trials of the same intervention, condition and participants. Effect sizes for the trials will then be compared in a meta-regression. The Cochrane Risk of Bias scores will be compared with the PRECIS-2 scores of pragmatism. Discussion We have concrete suggestions for improving PRECIS and a growing list of enthusiastic individuals interested in contributing to this work. By early 2014 we expect to have a validated PRECIS-2. PMID:23782862
ERIC Educational Resources Information Center
Cohen, Dale; Tracy, Ryan; Cohen, Jon
2017-01-01
This study examined the effectiveness and influence on validity of a computer-based pop-up English glossary accommodation for English learners (ELs) in grades 3 and 7. In a randomized controlled trial, we administered pop-up English glossaries with audio to students taking a statewide accountability English language arts (ELA) and mathematics…
Williams, Mark R; McKeown, Andrew; Dexter, Franklin; Miner, James R; Sessler, Daniel I; Vargo, John; Turk, Dennis C; Dworkin, Robert H
2016-01-01
Successful procedural sedation represents a spectrum of patient- and clinician-related goals. The absence of a gold-standard measure of the efficacy of procedural sedation has led to a variety of outcomes being used in clinical trials, with the consequent lack of consistency among measures, making comparisons among trials and meta-analyses challenging. We evaluated which existing measures have undergone psychometric analysis in a procedural sedation setting and whether the validity of any of these measures support their use across the range of procedures for which sedation is indicated. Numerous measures were found to have been used in clinical research on procedural sedation across a wide range of procedures. However, reliability and validity have been evaluated for only a limited number of sedation scales, observer-rated pain/discomfort scales, and satisfaction measures in only a few categories of procedures. Typically, studies only examined 1 or 2 aspects of scale validity. The results are likely unique to the specific clinical settings they were tested in. Certain scales, for example, those requiring motor stimulation, are unsuitable to evaluate sedation for procedures where movement is prohibited (e.g., magnetic resonance imaging scans). Further work is required to evaluate existing measures for procedures for which they were not developed. Depending on the outcomes of these efforts, it might ultimately be necessary to consider measures of sedation efficacy to be procedure specific.
Uljevic, Ognjen; Spasic, Miodrag; Sekulic, Damir
2013-01-01
Sport-specific motor fitness tests are not often examined in water polo. In this study we examined the reliability, factorial and discriminative validity of 10 water-polo-specific motor-fitness tests, namely: three tests of in-water jumps (thrusts), two characteristic swimming sprints (10 and 20 metres from the water start), three ball-throws (shoots), one test of passing precision (accuracy), and a test of the dynamometric force produced while using the eggbeater kick. The sample of subjects consisted of 54 young male water polo players (15 to 17 years of age; 1.86 ± 0.07 m, and 83.1 ± 9.9 kg). All tests were applied over three testing trials. Reliability analyses included Cronbach Alpha coefficients (CA), inter-item- correlations (IIR) and coefficients of the variation (CV), while an analysis of variance was used to define any systematic bias between the testing trials. All tests except the test of accuracy (precision) were found to be reliable (CA ranged from 0.83 to 0.97; IIR from 0.62 to 0.91; CV from 2% to 21%); with small and irregular biases between the testing trials. Factor analysis revealed that jumping capacities as well as throwing and sprinting capacities should be observed as a relatively independent latent dimensions among young water polo players. Discriminative validity of the applied tests is partially proven since the playing positions significantly (p < 0.05) differed in some of the applied tests, with the points being superior in their fitness capacities in comparison to their teammates. This study included players from one of the world’s best junior National leagues, and reported values could be used as fitness standards for such an age. Further studies are needed to examine the applicability of the proposed test procedures to older subjects and females. Key Points Here presented and validated sport specific water polo motor fitness tests are found to be reliable in the sample of young male water polo players. Factor analysis revealed existence of three inde-pendent latent motor dimensions, namely, in-water jumping capacity, throwing ability, and sprint swimming capacity. Points are found to be most advanced in their fitness capacities which are mainly related to their game duties which allowed them to develop variety of fit-ness components. PMID:24421723
Mansour, J K; Beaudry, J L; Lindsay, R C L
2017-12-01
Eyewitness identification experiments typically involve a single trial: A participant views an event and subsequently makes a lineup decision. As compared to this single-trial paradigm, multiple-trial designs are more efficient, but significantly reduce ecological validity and may affect the strategies that participants use to make lineup decisions. We examined the effects of a number of forensically relevant variables (i.e., memory strength, type of disguise, degree of disguise, and lineup type) on eyewitness accuracy, choosing, and confidence across 12 target-present and 12 target-absent lineup trials (N = 349; 8,376 lineup decisions). The rates of correct rejections and choosing (across both target-present and target-absent lineups) did not vary across the 24 trials, as reflected by main effects or interactions with trial number. Trial number had a significant but trivial quadratic effect on correct identifications (OR = 0.99) and interacted significantly, but again trivially, with disguise type (OR = 1.00). Trial number did not significantly influence participants' confidence in correct identifications, confidence in correct rejections, or confidence in target-absent selections. Thus, multiple-trial designs appear to have minimal effects on eyewitness accuracy, choosing, and confidence. Researchers should thus consider using multiple-trial designs for conducting eyewitness identification experiments.
Carfoot, Sue; Williamson, Paula R; Dickson, Rumona
2003-06-01
to examine the effects of early skin-to-skin contact between mother and baby on the initiation and duration of breast feeding. electronic databases--the Cochrane Library, MEDLINE,CINAHL and EMBASE. References of studies were examined to identify additional trials and contact was made with researchers in the field. Study selection criteria: randomised or quasi-randomised controlled trials in any language in which skin-to-skin contact between mothers and their healthy full-term newborn babies was compared to routine contact. Primary outcomes were success of first breast feed and duration of breast feeding. Secondary outcomes included, baby temperature and behaviour. STUDY-QUALITY ASSESSMENT: validity of included studies was assessed using criteria defined by the Cochrane Collaboration. Application of inclusion criteria, validity assessment and data extraction were carried out independently by two reviewers with a third reviewer to resolve differences. seven randomised controlled trials were identified. Five studies assessed duration of breast feeding with mixed results. None of the studies assessed the success of the first breast-feeding experience. Study quality was variable with methods of randomisation and blinding of assessment unclear in four of the five studies providing relevant results. the findings of this systematic review fail to support the current initiatives to implement changes in clinical practice to include skin-to-skin contact. Methodological flaws within the included studies prohibit firm conclusions being reached with regard to the effect of skin-to-skin contact on the duration of breast feeding, timing of first breast feed or baby physiological factors. The review highlights the need for further primary research to assess the effect of skin-to-skin contact on the breast-feeding experience.
Koblitz, Amber R.; Persoskie, Alexander; Ferrer, Rebecca A.; Klein, William M. P.; Dwyer, Laura A.; Park, Elyse R.
2016-01-01
Introduction: Absolute and comparative risk perceptions, worry, perceived severity, perceived benefits, and self-efficacy are important theoretical determinants of tobacco use, but no measures have been validated to ensure the discriminant validity as well as test-retest reliability of these measures in the tobacco context. The purpose of the current study is to examine the reliability and factor structure of a measure assessing smoking-related health cognitions and emotions in a national sample of current and former heavy smokers in the National Lung Screening Trial. Methods: A sub-study of the National Lung Screening Trial assessed current and former smokers’ (age 55–74; N = 4379) self-reported health cognitions and emotions at trial enrollment and at 12-month follow-up. Items were derived from the Health Belief Model and Self-Regulation Model. Results: An exploratory factor analysis of baseline responses revealed a five-factor structure for former smokers (risk perceptions, worry, perceived severity, perceived benefits, and self-efficacy) and a six-factor structure for current smokers, such that absolute risk and comparative risk perceptions emerged as separate factors. A confirmatory factor analysis of 12-month follow-up responses revealed a good fit for the five latent constructs for former smokers and six latent constructs for current smokers. Longitudinal stability of these constructs was also demonstrated. Conclusions: This is the first study to examine tobacco-related health cognition and emotional constructs over time in current and former heavy smokers undergoing lung screening. This study found that the theoretical constructs were stable across time and that the factor structure differed based on smoking status (current vs. former). PMID:25964503
Adherence in single-parent households in a long-term asthma clinical trial.
Spicher, Mary; Bollers, Nancy; Chinn, Tamara; Hall, Anita; Plunkett, Anne; Rodgers, Denise; Sundström, D A; Wilson, Laura
2012-01-01
Adherence of participants in a long-term clinical trial is necessary to assure validity of findings. This article examines adherence differences between single-parent and two-parent families in the Childhood Asthma Management Program (CAMP). Adherence was defined as the percentage of completed daily diary cards and scheduled study visits during the course of the trial. Logistic regression and ordinal logistic regression analyses were used. Children from single-parent families had a lower percentage of completed diary cards (72% vs. 84%) than two-parent families. Single-parent families were also more likely to reschedule visits (62% vs. 45%) and miss more clinic visits (23% vs. 17%) than two-parent families. Suggestions are given for study coordinators to assist participants in completing a long-term clinical trial. Many suggestions may be adapted for nurses in inpatient or outpatient settings for assisting parents of patients with chronic diseases.
Polaron dynamics with a multitude of Davydov D{sub 2} trial states
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zhou, Nengji; Department of Physics, Hangzhou Normal University, Hangzhou 310046; Huang, Zhongkai
2015-07-07
We propose an extension to the Davydov D{sub 2} Ansatz in the dynamics study of the Holstein molecular crystal model with diagonal and off-diagonal exciton-phonon coupling using the Dirac-Frenkel time-dependent variational principle. The new trial state by the name of the “multi-D{sub 2} Ansatz” is a linear combination of Davydov D{sub 2} trial states, and its validity is carefully examined by quantifying how faithfully it follows the Schrödinger equation. Considerable improvements in accuracy have been demonstrated in comparison with the usual Davydov trial states, i.e., the single D{sub 1} and D{sub 2} Ansätze. With an increase in the number ofmore » the Davydov D{sub 2} trial states in the multi-D{sub 2} Ansatz, deviation from the exact Schrödinger dynamics is gradually diminished, leading to a numerically exact solution to the Schrödinger equation.« less
ERIC Educational Resources Information Center
Jonsson, Ulf; Olsson, Nora Choque; Bölte, Sven
2016-01-01
Systematic reviews have traditionally focused on internal validity, while external validity often has been overlooked. In this study, we systematically reviewed determinants of external validity in the accumulated randomized controlled trials of social skills group interventions for children and adolescents with autism spectrum disorder. We…
Scott, Whitney; McCracken, Lance M
2015-06-01
The Patient Global Impression of Change (PGIC) measure has frequently been used as an indicator of meaningful change in treatments for chronic pain. However, limited research has examined the validity of PGIC items despite their wide adoption in clinical trials for pain. Additionally, research has not yet examined predictors of PGIC ratings following psychologically based treatment for pain. The purpose of the present study was to examine the validity, factor structure, and predictors of PGIC ratings following an interdisciplinary psychologically based treatment for chronic pain. Patients with chronic pain (N = 476) completed standard assessments of pain, daily functioning, and depression before and after a 4-week treatment program based on the principles of acceptance and commitment therapy. Following the program, patients rated 1 item assessing their impression of change overall and several items assessing their impression of more specific changes: physical and social functioning, work-related activities, mood, and pain. Results indicated that the global and specific impression of change items represent a single component. In the context of the acceptance and commitment therapy-based treatment studied here, overall PGIC ratings appeared to be influenced to a greater degree by patients' experienced improvements in physical activities and mood than by improvements in pain. The findings suggest that in addition to a single overall PGIC rating, domain-specific items may be relevant for some treatment trials. This article reports on the validity and predictors of patients' impression of change ratings following interdisciplinary psychologically based treatment for pain. In addition to a single overall PGIC rating, domain-specific items may be important for clinicians and researchers to consider depending on the focus of treatment. Copyright © 2015 American Pain Society. Published by Elsevier Inc. All rights reserved.
Ratanawongsa, Neda; Karter, Andrew J.; Quan, Judy; Parker, Melissa M.; Handley, Margaret; Sarkar, Urmimala; Schmittdiel, Julie A.; Schillinger, Dean
2015-01-01
Background With the expansion of Medicaid and low-cost health insurance plans among diverse patient populations, objective measures of medication adherence using pharmacy claims could advance clinical care and translational research for safety net care. However, safety net patients may experience fluctuating prescription drug coverage, affecting the performance of adherence measures. Objective To evaluate the performance of continuous medication gap (CMG) for diverse, low-income managed care members with diabetes. Methods We conducted this cross-sectional analysis using administrative and clinical data for 680 members eligible for a self-management support trial at a non-profit, government-sponsored managed care plan. We applied CMG methodology to cardiometabolic medication claims for English-, Cantonese-, or Spanish-speaking members with diabetes. We examined inclusiveness (the proportion with calculable CMG) and selectivity (sociodemographic and medical differences from members without CMG). To examine validity, we examined unadjusted associations of suboptimal adherence (CMG>20%) with suboptimal cardiometabolic control. Results 429 members (63%) had calculable CMG. Compared to members without CMG, members with CMG were younger; more likely employed; and had poorer glycemic control, but better blood pressure and lipid control. Suboptimal adherence occurred more frequently among members with poor cardiometabolic control than among members with optimal control (28% vs. 12%, p=0.02). Conclusions CMG demonstrated acceptable inclusiveness and validity in a diverse, low-income safety net population, comparable to its performance in studies among other insured populations. CMG may provide a useful tool to measure adherence among increasingly diverse Medicaid populations, complemented by other strategies to reach those not captured by CMG. Trial Registration NCT00683020 PMID:26233541
Quantitative methods in assessment of neurologic function.
Potvin, A R; Tourtellotte, W W; Syndulko, K; Potvin, J
1981-01-01
Traditionally, neurologists have emphasized qualitative techniques for assessing results of clinical trials. However, in recent years qualitative evaluations have been increasingly augmented by quantitative tests for measuring neurologic functions pertaining to mental state, strength, steadiness, reactions, speed, coordination, sensation, fatigue, gait, station, and simulated activities of daily living. Quantitative tests have long been used by psychologists for evaluating asymptomatic function, assessing human information processing, and predicting proficiency in skilled tasks; however, their methodology has never been directly assessed for validity in a clinical environment. In this report, relevant contributions from the literature on asymptomatic human performance and that on clinical quantitative neurologic function are reviewed and assessed. While emphasis is focused on tests appropriate for evaluating clinical neurologic trials, evaluations of tests for reproducibility, reliability, validity, and examiner training procedures, and for effects of motivation, learning, handedness, age, and sex are also reported and interpreted. Examples of statistical strategies for data analysis, scoring systems, data reduction methods, and data display concepts are presented. Although investigative work still remains to be done, it appears that carefully selected and evaluated tests of sensory and motor function should be an essential factor for evaluating clinical trials in an objective manner.
Assessing the generalizability of randomized trial results to target populations.
Stuart, Elizabeth A; Bradshaw, Catherine P; Leaf, Philip J
2015-04-01
Recent years have seen increasing interest in and attention to evidence-based practices, where the "evidence" generally comes from well-conducted randomized trials. However, while those trials yield accurate estimates of the effect of the intervention for the participants in the trial (known as "internal validity"), they do not always yield relevant information about the effects in a particular target population (known as "external validity"). This may be due to a lack of specification of a target population when designing the trial, difficulties recruiting a sample that is representative of a prespecified target population, or to interest in considering a target population somewhat different from the population directly targeted by the trial. This paper first provides an overview of existing design and analysis methods for assessing and enhancing the ability of a randomized trial to estimate treatment effects in a target population. It then provides a case study using one particular method, which weights the subjects in a randomized trial to match the population on a set of observed characteristics. The case study uses data from a randomized trial of school-wide positive behavioral interventions and supports (PBIS); our interest is in generalizing the results to the state of Maryland. In the case of PBIS, after weighting, estimated effects in the target population were similar to those observed in the randomized trial. The paper illustrates that statistical methods can be used to assess and enhance the external validity of randomized trials, making the results more applicable to policy and clinical questions. However, there are also many open research questions; future research should focus on questions of treatment effect heterogeneity and further developing these methods for enhancing external validity. Researchers should think carefully about the external validity of randomized trials and be cautious about extrapolating results to specific populations unless they are confident of the similarity between the trial sample and that target population.
Mente, Andrew; de Koning, Lawrence; Shannon, Harry S; Anand, Sonia S
2009-04-13
Although a wealth of literature links dietary factors and coronary heart disease (CHD), the strength of the evidence supporting valid associations has not been evaluated systematically in a single investigation. We conducted a systematic search of MEDLINE for prospective cohort studies or randomized trials investigating dietary exposures in relation to CHD. We used the Bradford Hill guidelines to derive a causation score based on 4 criteria (strength, consistency, temporality, and coherence) for each dietary exposure in cohort studies and examined for consistency with the findings of randomized trials. Strong evidence supports valid associations (4 criteria satisfied) of protective factors, including intake of vegetables, nuts, and "Mediterranean" and high-quality dietary patterns with CHD, and associations of harmful factors, including intake of trans-fatty acids and foods with a high glycemic index or load. Among studies of higher methodologic quality, there was also strong evidence for monounsaturated fatty acids and "prudent" and "western" dietary patterns. Moderate evidence (3 criteria) of associations exists for intake of fish, marine omega-3 fatty acids, folate, whole grains, dietary vitamins E and C, beta carotene, alcohol, fruit, and fiber. Insufficient evidence (< or =2 criteria) of association is present for intake of supplementary vitamin E and ascorbic acid (vitamin C); saturated and polyunsaturated fatty acids; total fat; alpha-linolenic acid; meat; eggs; and milk. Among the dietary exposures with strong evidence of causation from cohort studies, only a Mediterranean dietary pattern is related to CHD in randomized trials. The evidence supports a valid association of a limited number of dietary factors and dietary patterns with CHD. Future evaluation of dietary patterns, including their nutrient and food components, in cohort studies and randomized trials is recommended.
de Blieck, Elisabeth A.; Augustine, Erika F.; Marshall, Frederick J.; Adams, Heather; Cialone, Jennifer; Dure, Leon; Kwon, Jennifer M.; Newhouse, Nicole; Rose, Katherine; Rothberg, Paul G.; Vierhile, Amy; Mink, Jonathan W.
2013-01-01
Introduction Juvenile neuronal ceroid lipofuscinosis (JNCL; Batten disease) is a rare, inherited, fatal lysosomal storage childhood disorder. True for many rare diseases, there are no treatments that impact the course of JNCL. The University of Rochester Batten Center’s (URBC) mission is to find treatments to slow, halt, or prevent JNCL. Objectives Our initial objective was to develop clinical research infrastructure preparatory to clinical trials, establish a JNCL research cohort, construct a disease-specific clinical outcome measure, and validate a non-invasive diagnostic sampling method. The long-term objective is to design and implement JNCL clinical trials. Methods The Unified Batten Disease Rating Scale (UBDRS) was developed. The Batten Disease Support and Research Association (BDSRA) referred participants; annual BDSRA meetings provided a mobile research setting for registry enrollment and UBDRS piloting. Neuropsychological examinations were performed, enabling external validation of the UBDRS. Buccal epithelial cell collection for genotyping was introduced. Telemedicine for remote UBDRS assessment was piloted. Results The registry enrolled 198 families representing 237 children with NCL. The UBDRS was piloted, validated and has been used to collect natural history data from 120 subjects. Funding and regulatory approval were obtained for a recently launched phase II clinical trial. Several additional lines of inquiry were reported. Conclusion The registry and BDSRA collaboration have enabled development of a clinical rating scale, natural history and neuropsychological studies, and genetic studies for disease confirmation. This work highlights an approach for preparatory natural history research and infrastructure development needed to facilitate efficient implementation of clinical trials in rare diseases. PMID:23628560
Stein, R E; Bauman, L J; Ireys, H T
1991-08-01
Internal and external validity problems permeate all intervention studies but are accentuated in primary preventive intervention research, particularly when studies target or recruit individuals based on their risk for psychopathology. Since many people who are at risk do not yet experience distress, they may not perceive the need for intervention. Recruitment tactics based on explaining extent of risk are unlikely to be persuasive and may have negative consequences. If respondents are not motivated to participate, a small or biased subset of the target population will participate in the intervention. Bias is of special concern when those enrolled represent only part of the continuum of risk. Selective enrollment may compromise both internal validity (the interpretation of the research results) and external validity (the generalizability of the findings) of intervention trials in primary prevention. This article discusses the effects of partial enrollment and the resultant bias. It suggests several strategies for increasing the enrollment of the target population and examines some of their ethical ramifications. It also stresses the importance of collecting systematic data documenting how the participants in the intervention differ from the target group as a whole.
Zhang, Wei; Tang, Xiaoxiang; He, Xianyou; Lai, Shuxian
2018-01-01
Substantial evidence suggests that beauty is associated with the survival and reproduction of organisms. Landscape architecture is composed of a series of natural elements that have significant evolutionary implications. The present study used one pilot material ratings and three experiments to examine the mechanisms of aesthetic appraisals of landscape architecture. The results confirmed that landscape architecture elicited a sense of beauty and captured visual attention more easily than other types of architecture during explicit aesthetic rating task (Experiment 1) and implicit aesthetic perception task (dot-probe paradigm, Experiment 2). Furthermore, the spatial cueing paradigm revealed that response latencies were significantly faster for landscape architecture than non-landscape architecture on valid trials, but there was no significant difference in this contrast on invalid trials at 150-ms stimulus onset asynchrony (SOA, Experiment 3a). At 500-ms SOA (Experiment 3b), participants responded significantly faster for landscape architecture on valid trials, but reacted significantly slower for landscape architecture on invalid trials. The findings indicated that the beauty of landscape architecture can be perceived implicitly, and only faster orienting of attention, but not delayed disengagement of attention was generated at early stages of the processing of landscape architecture. However, the attentional bias at later stages of attentional processes may be resulted from both faster orienting of attention and delayed disengagement of attention from landscape architecture photographs. PMID:29467696
Zhang, Wei; Tang, Xiaoxiang; He, Xianyou; Lai, Shuxian
2018-01-01
Substantial evidence suggests that beauty is associated with the survival and reproduction of organisms. Landscape architecture is composed of a series of natural elements that have significant evolutionary implications. The present study used one pilot material ratings and three experiments to examine the mechanisms of aesthetic appraisals of landscape architecture. The results confirmed that landscape architecture elicited a sense of beauty and captured visual attention more easily than other types of architecture during explicit aesthetic rating task (Experiment 1) and implicit aesthetic perception task (dot-probe paradigm, Experiment 2). Furthermore, the spatial cueing paradigm revealed that response latencies were significantly faster for landscape architecture than non-landscape architecture on valid trials, but there was no significant difference in this contrast on invalid trials at 150-ms stimulus onset asynchrony (SOA, Experiment 3a). At 500-ms SOA (Experiment 3b), participants responded significantly faster for landscape architecture on valid trials, but reacted significantly slower for landscape architecture on invalid trials. The findings indicated that the beauty of landscape architecture can be perceived implicitly, and only faster orienting of attention, but not delayed disengagement of attention was generated at early stages of the processing of landscape architecture. However, the attentional bias at later stages of attentional processes may be resulted from both faster orienting of attention and delayed disengagement of attention from landscape architecture photographs.
ClinicalTrials.gov and Drugs@FDA: A Comparison of Results Reporting for New Drug Approval Trials.
Schwartz, Lisa M; Woloshin, Steven; Zheng, Eugene; Tse, Tony; Zarin, Deborah A
2016-09-20
Pharmaceutical companies and other trial sponsors must submit certain trial results to ClinicalTrials.gov. The validity of these results is unclear. To validate results posted on ClinicalTrials.gov against publicly available U.S. Food and Drug Administration (FDA) reviews on Drugs@FDA. ClinicalTrials.gov (registry and results database) and Drugs@FDA (medical and statistical reviews). 100 parallel-group, randomized trials for new drug approvals (January 2013 to July 2014) with results posted on ClinicalTrials.gov (15 March 2015). 2 assessors extracted, and another verified, the trial design, primary and secondary outcomes, adverse events, and deaths. Most trials were phase 3 (90%), double-blind (92%), and placebo-controlled (73%) and involved 32 drugs from 24 companies. Of 137 primary outcomes identified from ClinicalTrials.gov, 134 (98%) had corresponding data at Drugs@FDA, 130 (95%) had concordant definitions, and 107 (78%) had concordant results. Most differences were nominal (that is, relative difference <10%). Primary outcome results in 14 trials could not be validated. Of 1927 secondary outcomes from ClinicalTrials.gov, Drugs@FDA mentioned 1061 (55%) and included results data for 367 (19%). Of 96 trials with 1 or more serious adverse events in either source, 14 could be compared and 7 had discordant numbers of persons experiencing the adverse events. Of 62 trials with 1 or more deaths in either source, 25 could be compared and 17 were discordant. Unknown generalizability to uncontrolled or crossover trial results. Primary outcome definitions and results were largely concordant between ClinicalTrials.gov and Drugs@FDA. Half the secondary outcomes, as well as serious events and deaths, could not be validated because Drugs@FDA includes only "key outcomes" for regulatory decision making and frequently includes only adverse event results aggregated across multiple trials. National Library of Medicine.
Clark, Ross A; Paterson, Kade; Ritchie, Callan; Blundell, Simon; Bryant, Adam L
2011-03-01
Commercial timing light systems (CTLS) provide precise measurement of athletes running velocity, however they are often expensive and difficult to transport. In this study an inexpensive, wireless and portable timing light system was created using the infrared camera in Nintendo Wii hand controllers (NWHC). System creation with gold-standard validation. A Windows-based software program using NWHC to replicate a dual-beam timing gate was created. Firstly, data collected during 2m walking and running trials were validated against a 3D kinematic system. Secondly, data recorded during 5m running trials at various intensities from standing or flying starts were compared to a single beam CTLS and the independent and average scores of three handheld stopwatch (HS) operators. Intraclass correlation coefficient and Bland-Altman plots were used to assess validity. Absolute error quartiles and percentage of trials in absolute error threshold ranges were used to determine accuracy. The NWHC system was valid when compared against the 3D kinematic system (ICC=0.99, median absolute error (MAR)=2.95%). For the flying 5m trials the NWHC system possessed excellent validity and precision (ICC=0.97, MAR<3%) when compared with the CTLS. In contrast, the NWHC system and the HS values during standing start trials possessed only modest validity (ICC<0.75) and accuracy (MAR>8%). A NWHC timing light system is inexpensive, portable and valid for assessing running velocity. Errors in the 5m standing start trials may have been due to erroneous event detection by either the commercial or NWHC-based timing light systems. Copyright © 2010 Sports Medicine Australia. Published by Elsevier Ltd. All rights reserved.
Update on simulation-based surgical training and assessment in ophthalmology: a systematic review.
Thomsen, Ann Sofia S; Subhi, Yousif; Kiilgaard, Jens Folke; la Cour, Morten; Konge, Lars
2015-06-01
This study reviews the evidence behind simulation-based surgical training of ophthalmologists to determine (1) the validity of the reported models and (2) the ability to transfer skills to the operating room. Simulation-based training is established widely within ophthalmology, although it often lacks a scientific basis for implementation. We conducted a systematic review of trials involving simulation-based training or assessment of ophthalmic surgical skills among health professionals. The search included 5 databases (PubMed, EMBASE, PsycINFO, Cochrane Library, and Web of Science) and was completed on March 1, 2014. Overall, the included trials were divided into animal, cadaver, inanimate, and virtual-reality models. Risk of bias was assessed using the Cochrane Collaboration's tool. Validity evidence was evaluated using a modern validity framework (Messick's). We screened 1368 reports for eligibility and included 118 trials. The most common surgery simulated was cataract surgery. Most validity trials investigated only 1 or 2 of 5 sources of validity (87%). Only 2 trials (48 participants) investigated transfer of skills to the operating room; 4 trials (65 participants) evaluated the effect of simulation-based training on patient-related outcomes. Because of heterogeneity of the studies, it was not possible to conduct a quantitative analysis. The methodologic rigor of trials investigating simulation-based surgical training in ophthalmology is inadequate. To ensure effective implementation of training models, evidence-based knowledge of validity and efficacy is needed. We provide a useful tool for implementation and evaluation of research in simulation-based training. Copyright © 2015 American Academy of Ophthalmology. Published by Elsevier Inc. All rights reserved.
The Reliability and Validity of the Computerized Double Inclinometer in Measuring Lumbar Mobility
MacDermid, Joy Christine; Arumugam, Vanitha; Vincent, Joshua Israel; Carroll, Krista L
2014-01-01
Study Design : Repeated measures reliability/validity study. Objectives : To determine the concurrent validity, test-retest, inter-rater and intra-rater reliability of lumbar flexion and extension measurements using the Tracker M.E. computerized dual inclinometer (CDI) in comparison to the modified-modified Schober (MMS) Summary of Background : Numerous studies have evaluated the reliability and validity of the various methods of measuring spinal motion, but the results are inconsistent. Differences in equipment and techniques make it difficult to correlate results. Methods : Twenty subjects with back pain and twenty without back pain were selected through convenience sampling. Two examiners measured sagittal plane lumbar range of motion for each subject. Two separate tests with the CDI and one test with the MMS were conducted. Each test consisted of three trials. Instrument and examiner order was randomly assigned. Intra-class correlations (ICCs 2, 2 and 2, 2) and Pearson correlation coefficients (r) were used to calculate reliability and concurrent validity respectively. Results : Intra-trial reliability was high to very high for both the CDI (ICCs 0.85 - 0.96) and MMS (ICCs 0.84 - 0.98). However, the reliability was poor to moderate, when the CDI unit had to be repositioned either by the same rate (ICCs 0.16 - 0.59) or a different rater (ICCs 0.45 - 0.52). Inter-rater reliability for the MMS was moderate to high (ICCs 0.75 - 0.82) which bettered the moderate correlation obtained for the CDI (ICCs 0.45 - 0.52). Correlations between the CDI and MMS were poor for flexion (0.32; p<0.05) and poor to moderate (-0.42 - -0.51; p<0.05) for extension measurements. Conclusion : When using the CDI, an average of subsequent tests is required to obtain moderate reliability. The MMS was highly reliable than the CDI. The MMS and the CDI measure lumbar movement on a different metric that are not highly related to each other. PMID:25352928
MERKEL, PETER A.; SILLIMAN, NANCY P.; DENTON, CHRISTOPHER P.; FURST, DANIEL E.; KHANNA, DINESH; EMERY, PAUL; HSU, VIVIEN M.; STREISAND, JAMES B.; POLISSON, RICHARD P.; ÅKESSON, ANITA; COPPOCK, JOHN; van den HOOGEN, FRANK; HERRICK, ARIANE; MAYES, MAUREEN D.; VEALE, DOUGLAS; SEIBOLD, JAMES R.; BLACK, CAROL M.; KORN, JOSEPH H.
2013-01-01
Objective To determine the validity, reliability, and feasibility of durometer measurements of skin hardness as an outcome measure in clinical trials of scleroderma. Methods Skin hardness was measured during a multicenter treatment trial for scleroderma using handheld digital durometers with a continuous scale. Skin thickness was measured by modified Rodnan skin score (MRSS). Other outcome data collected included the Scleroderma Health Assessment Questionnaire. In a reliability exercise in advance of the trial, 9 investigators examined the same 5 scleroderma patients by MRSS and durometry. Results Forty-three patients with early diffuse cutaneous systemic sclerosis were studied at 11 international centers (mean age 49 years [range 24–76], median disease duration 6.4 months [range 0.3–23], and median baseline MRSS 22 [range 11–38]). The reliability of durometer measurements was excellent, with high interobserver intraclass correlation coefficients (ICCs) (0.82–0.92), and each result was greater than the corresponding skin site ICCs for MRSS (0.54–0.85). Baseline durometer scores correlated well with MRSS (r = 0.69, P < 0.0001), patient self-assessments of skin disease (r = 0.69, P < 0.0001), and Health Assessment Questionnaire (HAQ) disability scores (r = 0.34, P = 0.03). Change in durometer scores correlated with change in MRSS (r = 0.70, P < 0.0001), change in patient self-assessments of skin disease (r = 0.52, P = 0.003), and change in HAQ disability scores (r = 0.42, P = 0.017). The effect size was greater for durometry than for MRSS or patient self-assessment. Conclusion Durometer measurements of skin hardness in patients with scleroderma are reliable, simple, accurate, demonstrate good sensitivity to change compared with traditional skin scoring, and reflect patients' self-assessments of their disease. Durometer measurements are valid, objective, and scalable, and should be considered for use as a complementary outcome measure to skin scoring in clinical trials of scleroderma. PMID:18438905
Quittner, Alexandra L; O'Donnell, Anne E; Salathe, Matthias A; Lewis, Sandra A; Li, Xiaoming; Montgomery, A Bruce; O'Riordan, Thomas G; Barker, Alan F
2015-01-01
The Quality of Life-Bronchiectasis (QOL-B), a self-administered, patient-reported outcome measure assessing symptoms, functioning and health-related quality of life for patients with non-cystic fibrosis (CF) bronchiectasis, contains 37 items on 8 scales (Respiratory Symptoms, Physical, Role, Emotional and Social Functioning, Vitality, Health Perceptions and Treatment Burden). Psychometric analyses of QOL-B V.3.0 used data from two double-blind, multicentre, randomised, placebo-controlled, phase III trials of aztreonam for inhalation solution (AZLI) in 542 patients with non-CF bronchiectasis and Gram-negative endobronchial infection. Excellent internal consistency (Cronbach's α ≥0.70) and 2-week test-retest reliability (intraclass correlation coefficients ≥0.72) were demonstrated for each scale. Convergent validity with 6 min walk test was observed for Physical and Role Functioning scores. No floor or ceiling effects (baseline scores of 0 or 100) were found for the Respiratory Symptoms scale (primary endpoint of trials). Baseline Respiratory Symptoms scores discriminated between patients based on baseline FEV₁% predicted in only one trial. The minimal important difference score for the Respiratory Symptoms scale was 8.0 points. AZLI did not show efficacy in the two phase III trials. QOL-B responsivity to treatment was assessed by examining changes from baseline QOL-B scores at study visits at which protocol-defined pulmonary exacerbations were reported. Mean Respiratory Symptoms scores decreased 14.0 and 14.2 points from baseline for placebo-treated and AZLI-treated patients with exacerbations, indicating that worsening respiratory symptoms were reflected in clinically meaningful changes in QOL-B scores. Previously established content validity, reliability and responsivity of the QOL-B are confirmed by this final validation study. The QOL-B is available for use in clinical trials and routine clinical practice. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions.
The PRECIS-2 tool has good interrater reliability and modest discriminant validity.
Loudon, Kirsty; Zwarenstein, Merrick; Sullivan, Frank M; Donnan, Peter T; Gágyor, Ildikó; Hobbelen, Hans J S M; Althabe, Fernando; Krishnan, Jerry A; Treweek, Shaun
2017-08-01
PRagmatic Explanatory Continuum Indicator Summary (PRECIS)-2 is a tool that could improve design insight for trialists. Our aim was to validate the PRECIS-2 tool, unlike its predecessor, testing the discriminant validity and interrater reliability. Over 80 international trialists, methodologists, clinicians, and policymakers created PRECIS-2 helping to ensure face validity and content validity. The interrater reliability of PRECIS-2 was measured using 19 experienced trialists who used PRECIS-2 to score a diverse sample of 15 randomized controlled trial protocols. Discriminant validity was tested with two raters to independently determine if the trial protocols were more pragmatic or more explanatory, with scores from the 19 raters for the 15 trials as predictors of pragmatism. Interrater reliability was generally good, with seven of nine domains having an intraclass correlation coefficient over 0.65. Flexibility (adherence) and recruitment had wide confidence intervals, but raters found these difficult to rate and wanted more information. Each of the nine PRECIS-2 domains could be used to differentiate between trials taking more pragmatic or more explanatory approaches with better than chance discrimination for all domains. We have assessed the validity and reliability of PRECIS-2. An elaboration study and web site provide guidance to help future users of the tool which is continuing to be tested by trial teams, systematic reviewers, and funders. Copyright © 2017 Elsevier Inc. All rights reserved.
Strategies for Dealing with Missing Accelerometer Data.
Stephens, Samantha; Beyene, Joseph; Tremblay, Mark S; Faulkner, Guy; Pullnayegum, Eleanor; Feldman, Brian M
2018-05-01
Missing data is a universal research problem that can affect studies examining the relationship between physical activity measured with accelerometers and health outcomes. Statistical techniques are available to deal with missing data; however, available techniques have not been synthesized. A scoping review was conducted to summarize the advantages and disadvantages of identified methods of dealing with missing data from accelerometers. Missing data poses a threat to the validity and interpretation of trials using physical activity data from accelerometry. Imputation using multiple imputation techniques is recommended to deal with missing data and improve the validity and interpretation of studies using accelerometry. Copyright © 2018 Elsevier Inc. All rights reserved.
Public titles of clinical trials should have ethics review.
Saenz, Carla; Reveiz, Ludovic; Tisdale, John F
2015-09-01
A key aspect to guarantee that research with human subjects is ethical is being overlooked. Ethics review committees invest great effort examining the informed consent documents of research protocols to ensure that potential participants can provide consent validly and are not deluded into thinking that the experimental intervention they may sign up for is already known to be therapeutic. However, these efforts to avoid what is called the "therapeutic misconception" might be in vain if the title with which the studies are being introduced to the potential participants escapes ethics review. Research participants might be deceived by clinical trials entitled "novel therapy" when the point of the trial is precisely to find out whether the intervention at stake is therapeutic or not. Providing potential research participants with such misleading information hampers their ability to make informed decisions. The well-established scrutiny that ethics review committees exercise with regard to consent forms is limited if the registration of clinical trials, for which a public title is chosen, constitutes a process that is independent from the ethics review. In this article, we examine this problem, assess recent measures to integrate clinical trial registration with ethics review processes, and provide specific recommendations to solve the problem and ultimately enhance the accountability, transparency, and ethics of research with human subjects. Copyright © 2015 Pan American Health Organization. Published by Elsevier Inc. All rights reserved.
Karvinen, Kristina H; Esposito, David; Raedeke, Thomas D; Vick, Joshua; Walker, Paul R
2014-01-01
Chemotherapy for lung cancer can have a detrimental effect on white blood cell (WBC) and red blood cell (RBC) counts. Physical exercise may have a role in improving WBCs and RBCs, although few studies have examined cancer patients receiving adjuvant therapies. The purpose of this pilot trial was to examine the effects of an exercise intervention utilizing resistance bands on WBCs and RBCs in lung cancer patients receiving curative intent chemotherapy. A sample of lung cancer patients scheduled for curative intent chemotherapy was randomly assigned to the exercise intervention (EX) condition or usual care (UC) condition. The EX condition participated in a three times weekly exercise program using resistance bands for the duration of chemotherapy. A total of 14 lung cancer patients completed the trial. EX condition participants completed 79% of planned exercise sessions. The EX condition was able to maintain WBCs over the course of the intervention compared to declines in the UC condition (p = .008; d = 1.68). There were no significant differences in change scores in RBCs. Exercise with resistance bands may help attenuate declines in WBCs in lung cancer patients receiving curative intent chemotherapy. Larger trials are warranted to validate these findings. Ultimately these findings could be informative for the development of supportive care strategies for lung cancer patients receiving chemotherapy. Clinical Trials Registration #: NCT01130714.
Validation of a Behavioral Approach for Measuring Saccades in Parkinson's Disease.
Turner, Travis H; Renfroe, Jenna B; Duppstadt-Delambo, Amy; Hinson, Vanessa K
2017-01-01
Speed and control of saccades are related to disease progression and cognitive functioning in Parkinson's disease (PD). Traditional eye-tracking complexities encumber application for individual evaluations and clinical trials. The authors examined psychometric properties of standalone tasks for reflexive prosaccade latency, volitional saccade initiation, and saccade inhibition (antisaccade) in a heterogeneous sample of 65 PD patients. Demographics had minimal impact on task performance. Thirty-day test-retest reliability estimates for behavioral tasks were acceptable and similar to traditional eye tracking. Behavioral tasks demonstrated concurrent validity with traditional eye-tracking measures; discriminant validity was less clear. Saccade initiation and inhibition discriminated PD patients with cognitive impairment. The present findings support further development and use of the behavioral tasks for assessing latency and control of saccades in PD.
Zhang, Zhaoyang; Fang, Hua; Wang, Honggang
2016-06-01
Web-delivered trials are an important component in eHealth services. These trials, mostly behavior-based, generate big heterogeneous data that are longitudinal, high dimensional with missing values. Unsupervised learning methods have been widely applied in this area, however, validating the optimal number of clusters has been challenging. Built upon our multiple imputation (MI) based fuzzy clustering, MIfuzzy, we proposed a new multiple imputation based validation (MIV) framework and corresponding MIV algorithms for clustering big longitudinal eHealth data with missing values, more generally for fuzzy-logic based clustering methods. Specifically, we detect the optimal number of clusters by auto-searching and -synthesizing a suite of MI-based validation methods and indices, including conventional (bootstrap or cross-validation based) and emerging (modularity-based) validation indices for general clustering methods as well as the specific one (Xie and Beni) for fuzzy clustering. The MIV performance was demonstrated on a big longitudinal dataset from a real web-delivered trial and using simulation. The results indicate MI-based Xie and Beni index for fuzzy-clustering are more appropriate for detecting the optimal number of clusters for such complex data. The MIV concept and algorithms could be easily adapted to different types of clustering that could process big incomplete longitudinal trial data in eHealth services.
Zhang, Zhaoyang; Wang, Honggang
2016-01-01
Web-delivered trials are an important component in eHealth services. These trials, mostly behavior-based, generate big heterogeneous data that are longitudinal, high dimensional with missing values. Unsupervised learning methods have been widely applied in this area, however, validating the optimal number of clusters has been challenging. Built upon our multiple imputation (MI) based fuzzy clustering, MIfuzzy, we proposed a new multiple imputation based validation (MIV) framework and corresponding MIV algorithms for clustering big longitudinal eHealth data with missing values, more generally for fuzzy-logic based clustering methods. Specifically, we detect the optimal number of clusters by auto-searching and -synthesizing a suite of MI-based validation methods and indices, including conventional (bootstrap or cross-validation based) and emerging (modularity-based) validation indices for general clustering methods as well as the specific one (Xie and Beni) for fuzzy clustering. The MIV performance was demonstrated on a big longitudinal dataset from a real web-delivered trial and using simulation. The results indicate MI-based Xie and Beni index for fuzzy-clustering is more appropriate for detecting the optimal number of clusters for such complex data. The MIV concept and algorithms could be easily adapted to different types of clustering that could process big incomplete longitudinal trial data in eHealth services. PMID:27126063
van der Heijden, G. J.; van der Windt, D. A.; de Winter, A. F.
1997-01-01
OBJECTIVE: To assess the effectiveness of physiotherapy for patients with soft tissue shoulder disorders. DESIGN: A systematic computerised literature search of Medline and Embase, supplemented with citation tracking, for relevant trials with random allocation published before 1996. SUBJECTS: Patients treated with physiotherapy for disorders of soft tissue of the shoulder. MAIN OUTCOME MEASURES: Success rates, mobility, pain, functional status. RESULTS: Six of the 20 assessed trials satisfied at least five of eight validity criteria. Assessment of methods was often hampered by insufficient information on various validity criteria, and trials were often flawed by lack of blinding, high proportions of withdrawals from treatment, and high proportions of missing values. Trial sizes were small: only six trials included intervention groups of more than 25 patients. Ultrasound therapy, evaluated in six trials, was not shown to be effective. Four other trials favoured physiotherapy (laser therapy or manipulation), but the validity of their methods was unsatisfactory. CONCLUSIONS: There is evidence that ultrasound therapy is ineffective in the treatment of soft tissue shoulder disorders. Due to small trial sizes and unsatisfactory methods, evidence for the effectiveness of other methods of physiotherapy is inconclusive. For all methods of treatment, trials were too heterogeneous with respect to included patients, index and reference treatments, and follow up to merit valid statistical pooling. Future studies should show whether physiotherapy is superior to treatment with drugs, steroid injections, or a wait and see policy. PMID:9233322
Validity of Various Methods for Determining Velocity, Force, and Power in the Back Squat.
Banyard, Harry G; Nosaka, Ken; Sato, Kimitake; Haff, G Gregory
2017-10-01
To examine the validity of 2 kinematic systems for assessing mean velocity (MV), peak velocity (PV), mean force (MF), peak force (PF), mean power (MP), and peak power (PP) during the full-depth free-weight back squat performed with maximal concentric effort. Ten strength-trained men (26.1 ± 3.0 y, 1.81 ± 0.07 m, 82.0 ± 10.6 kg) performed three 1-repetition-maximum (1RM) trials on 3 separate days, encompassing lifts performed at 6 relative intensities including 20%, 40%, 60%, 80%, 90%, and 100% of 1RM. Each repetition was simultaneously recorded by a PUSH band and commercial linear position transducer (LPT) (GymAware [GYM]) and compared with measurements collected by a laboratory-based testing device consisting of 4 LPTs and a force plate. Trials 2 and 3 were used for validity analyses. Combining all 120 repetitions indicated that the GYM was highly valid for assessing all criterion variables while the PUSH was only highly valid for estimations of PF (r = .94, CV = 5.4%, ES = 0.28, SEE = 135.5 N). At each relative intensity, the GYM was highly valid for assessing all criterion variables except for PP at 20% (ES = 0.81) and 40% (ES = 0.67) of 1RM. Moreover, the PUSH was only able to accurately estimate PF across all relative intensities (r = .92-.98, CV = 4.0-8.3%, ES = 0.04-0.26, SEE = 79.8-213.1 N). PUSH accuracy for determining MV, PV, MF, MP, and PP across all 6 relative intensities was questionable for the back squat, yet the GYM was highly valid at assessing all criterion variables, with some caution given to estimations of MP and PP performed at lighter loads.
Komro, Kelli A; Livingston, Melvin D; Kominsky, Terrence K; Livingston, Bethany J; Garrett, Brady A; Molina, Mildred Maldonado; Boyd, Misty L
2015-01-01
Objective: American Indians (AIs) suffer from significant alcohol-related health disparities, and increased risk begins early. This study examined the reliability and validity of measures to be used in a preventive intervention trial. Reliability and validity across racial/ethnic subgroups are crucial to evaluate intervention effectiveness and promote culturally appropriate evidence-based practice. Method: To assess reliability and validity, we used three baseline surveys of high school students participating in a preventive intervention trial within the jurisdictional service area of the Cherokee Nation in northeastern Oklahoma. The 15-minute alcohol risk survey included 16 multi-item scales and one composite score measuring key proximal, primary, and moderating variables. Forty-four percent of the students indicated that they were AI (of whom 82% were Cherokee), including 23% who reported being AI only (n = 435) and 18% both AI and White (n = 352). Forty-seven percent reported being White only (n = 901). Results: Scales were adequately reliable for the full sample and across race/ethnicity defined by AI, AI/White, and White subgroups. Among the full sample, all scales had acceptable internal consistency, with minor variation across race/ethnicity. All scales had extensive to exemplary test–retest reliability and showed minimal variation across race/ethnicity. The eight proximal and two primary outcome scales were each significantly associated with the frequency of alcohol use during the past month in both the cross-sectional and the longitudinal models, providing support for both criterion validity and predictive validity. For most scales, interpretation of the strength of association and statistical significance did not differ between the racial/ethnic subgroups. Conclusions: The results support the reliability and validity of scales of a brief questionnaire measuring risk and protective factors for alcohol use among AI adolescents, primarily members of the Cherokee Nation. PMID:25486402
Sport-specific endurance plank test for evaluation of global core muscle function.
Tong, Tom K; Wu, Shing; Nie, Jinlei
2014-02-01
To examine the validity and reliability of a sports-specific endurance plank test for the evaluation of global core muscle function. Repeated-measures study. Laboratory environment. Twenty-eight male and eight female young athletes. Surface electromyography (sEMG) of selected trunk flexors and extensors, and an intervention of pre-fatigue core workout were applied for test validation. Intraclass correlation coefficient (ICC), coefficient of variation (CV), and the measurement bias ratio */÷ ratio limits of agreement (LOA) were calculated to assess reliability and measurement error. Test validity was shown by the sEMG of selected core muscles, which indicated >50% increase in muscle activation during the test; and the definite discrimination of the ∼30% reduction in global core muscle endurance subsequent to a pre-fatigue core workout. For test-retest reliability, when the first attempt of three repeated trials was considered as familiarisation, the ICC was 0.99 (95% CI: 0.98-0.99), CV was 2.0 ± 1.56% and the measurement bias ratio */÷ ratio LOA was 0.99 */÷ 1.07. The findings suggest that the sport-specific endurance plank test is a valid, reliable and practical method for assessing global core muscle endurance in athletes given that at least one familiarisation trial takes place prior to measurement. Copyright © 2013 Elsevier Ltd. All rights reserved.
Chiaravalloti, Nancy D; Dobryakova, Ekaterina; Wylie, Glenn R; DeLuca, John
2015-01-01
New learning and memory deficits are common following traumatic brain injury (TBI). Yet few studies have examined the efficacy of memory retraining in TBI through the most methodologically vigorous randomized clinical trial. Our previous research has demonstrated that the modified Story Memory Technique (mSMT) significantly improves new learning and memory in multiple sclerosis. The present double-blind, placebo-controlled, randomized clinical trial examined changes in cerebral activation on functional magnetic resonance imaging following mSMT treatment in persons with TBI. Eighteen individuals with TBI were randomly assigned to treatment (n = 9) or placebo (n = 9) groups. Baseline and follow-up functional magnetic resonance imaging was collected during a list-learning task. Significant differences in cerebral activation from before to after treatment were noted in regions belonging to the default mode network and executive control network in the treatment group only. Results are interpreted in light of these networks. Activation differences between the groups likely reflect increased use of strategies taught during treatment. This study demonstrates a significant change in cerebral activation resulting from the mSMT in a TBI sample. Findings are consistent with previous work in multiple sclerosis. Behavioral interventions can show significant changes in the brain, validating clinical utility.
Nochioka, Kotaro; Biering-Sørensen, Tor; Hansen, Kim Wadt; Sørensen, Rikke; Pedersen, Sune; Jørgensen, Peter Godsk; Iversen, Allan; Shimokawa, Hiroaki; Jeger, Raban; Kaiser, Christoph; Pfisterer, Matthias; Galatius, Søren
2017-12-01
Rheumatologic disorders are characterised by inflammation and an increased risk of coronary artery disease (CAD). However, the association between rheumatologic disorders and long-term prognosis in CAD patients undergoing percutaneous coronary intervention (PCI) is unknown. Thus, we aimed to examine the association between rheumatologic disorders and long-term prognosis in CAD patients undergoing PCI. A post-hoc analysis was performed in 4605 patients (age: 63.3 ± 11.0 years; male: 76.6%) with ST-segment elevation myocardial infarction (STEMI; n = 1396), non-STEMI ( n = 1541), and stable CAD ( n = 1668) from the all-comer stent trials, the BAsel Stent Kosten-Effektivitäts Trial-PROspective Validation Examination (BASKET-PROVE) I and II trials. We evaluated the association between rheumatologic disorders and 2-year major adverse cardiac events (MACEs; cardiac death, nonfatal myocardial infarction (MI), and target vessel revascularisation (TVR)) by Cox regression analysis. Patients with rheumatologic disorders ( n = 197) were older, more often female, had a higher prevalence of renal disease, multi-vessel coronary disease, and bifurcation lesions, and had longer total stent lengths. During the 2-year follow-up, the MACE rate was 8.6% in the total cohort. After adjustment for potential confounders, rheumatologic disorders were associated with MACEs in the total cohort (adjusted hazard ratio: 1.55; 95% confidence interval (CI): 1.04-2.31) driven by the STEMI subgroup (adjusted hazard ratio: 2.38; 95% CI: 1.26-4.51). In all patients, rheumatologic disorders were associated with all-cause death (adjusted hazard ratio: 2.05; 95% CI: 1.14-3.70), cardiac death (adjusted hazard ratio: 2.63; 95% CI: 1.27-5.43), and non-fatal MI (adjusted hazard ratio: 2.64; 95% CI: 1.36-5.13), but not with TVR (adjusted hazard ratio: 0.81; 95% CI: 0.41-1.58). The presence of rheumatologic disorders appears to be independently associated with worse outcome in CAD patients undergoing PCI. This calls for further studies and focus on this high-risk group of patients following PCI.
Ellingson, Benjamin M; Abrey, Lauren E; Nelson, Sarah J; Kaufmann, Timothy J; Garcia, Josep; Chinot, Olivier; Saran, Frank; Nishikawa, Ryo; Henriksson, Roger; Mason, Warren P; Wick, Wolfgang; Butowski, Nicholas; Ligon, Keith L; Gerstner, Elizabeth R; Colman, Howard; de Groot, John; Chang, Susan; Mellinghoff, Ingo; Young, Robert J; Alexander, Brian M; Colen, Rivka; Taylor, Jennie W; Arrillaga-Romany, Isabel; Mehta, Arnav; Huang, Raymond Y; Pope, Whitney B; Reardon, David; Batchelor, Tracy; Prados, Michael; Galanis, Evanthia; Wen, Patrick Y; Cloughesy, Timothy F
2018-04-05
In the current study, we pooled imaging data in newly diagnosed GBM patients from international multicenter clinical trials, single institution databases, and multicenter clinical trial consortiums to identify the relationship between post-operative residual enhancing tumor volume and overall survival (OS). Data from 1,511 newly diagnosed GBM patients from 5 data sources were included in the current study: 1) a single institution database from UCLA (N=398; Discovery); 2) patients from the Ben and Cathy Ivy Foundation for Early Phase Clinical Trials Network Radiogenomics Database (N=262 from 8 centers; Confirmation); 3) the chemoradiation placebo arm from an international phase III trial (AVAglio; N=394 from 120 locations in 23 countries; Validation); 4) the experimental arm from AVAglio examining chemoradiation plus bevacizumab (N=404 from 120 locations in 23 countries; Exploratory Set 1); and 5) an Alliance (N0874) Phase I/II trial of vorinostat plus chemoradiation (N=53; Exploratory Set 2). Post-surgical, residual enhancing disease was quantified using T1 subtraction maps. Multivariate Cox regression models were used to determine influence of clinical variables, MGMT status, and residual tumor volume on OS. A log-linear relationship was observed between post-operative, residual enhancing tumor volume and OS in newly diagnosed GBM treated with standard chemoradiation. Post-operative tumor volume is a prognostic factor for OS (P<0.01), regardless of therapy, age, and MGMT promoter methylation status. Post-surgical, residual contrast-enhancing disease significantly negatively influences survival in patients with newly diagnosed glioblastoma treated with chemoradiation with or without concomitant experimental therapy.
Vandelanotte, Corneel; Maher, Carol A
2015-01-01
Despite their popularity and potential to promote health in large populations, the effectiveness of online social networks (e.g., Facebook) to improve health behaviors has been somewhat disappointing. Most of the research examining the effectiveness of such interventions has used randomized controlled trials (RCTs). It is asserted that the modest outcomes may be due to characteristics specific to both online social networks and RCTs. The highly controlled nature of RCTs stifles the dynamic nature of online social networks. Alternative and ecologically valid research designs that evaluate online social networks in real-life conditions are needed to advance the science in this area.
Pase, Matthew P; Kean, James; Sarris, Jerome; Neale, Chris; Scholey, Andrew B; Stough, Con
2012-07-01
Traditional knowledge suggests that Bacopa monnieri enhances cognitive performance. Such traditional beliefs have now been scientifically tested through a handful of randomized, controlled human clinical trials. The current systematic review aimed to examine the scientific evidence as to whether Bacopa can enhance cognitive performance in humans. A systematic review of randomized controlled trials is presented. Multiple databases were systematically searched by multiple authors. Relevant trials were objectively assessed for methodological quality. The subjects studied were adult humans without dementia or significant cognitive impairment. B. monnieri, including Bacopa extracts, were administered over long-term supplementation periods. Any validated cognitive test, whether a primary or secondary outcome. Six (6) studies met the final inclusion criteria and were included in review. Trials were all conducted over 12 weeks. Across trials, three different Bacopa extracts were used at dosages of 300-450 mg extract per day. All reviewed trials examined the effects of Bacopa on memory, while other cognitive domains were less well studied. There were no cognitive tests in the areas of auditory perceptual abilities or idea production and only a paucity of research in the domains of reasoning, number facility, and language behavior. Across studies, Bacopa improved performance on 9 of 17 tests in the domain of memory free recall. There was little evidence of enhancement in any other cognitive domains. There is some evidence to suggest that Bacopa improves memory free recall with evidence for enhancement in other cognitive abilities currently lacking perhaps due to inconsistent measures employed by studies across these cognitive domains. Research into the nootropic effects of Bacopa is in its infancy, with research still yet to investigate the effects of Bacopa across all human cognitive abilities. Similarly, future research should examine the nootropic effects of Bacopa at varied dosages and across different extracts.
Carter, Michael J; Smith, Victoria; Ste-Marie, Diane M
2016-02-01
Studies have consistently shown that prospective metacognitive judgments of learning are often inaccurate because humans mistakenly interpret current performance levels as valid indices of learning. These metacognitive discrepancies are strongly related to conditions of practice. Here, we examined how the type of feedback (after good versus poor trials) received during practice and awareness (aware versus unaware) of this manipulation affected judgments of learning and actual learning. After each six-trial block, participants received feedback on their three best trials or three worst trials and half of the participants were made explicitly aware of the type of feedback they received while the other half were unaware. Judgments of learning were made at the end of each six-trial block and before the 24-h retention test. Results indicated no motor performance differences between groups in practice or retention; however, receiving feedback on relatively good compared to relatively poor trials resulted in significantly higher judgments of learning in practice and retention, irrespective of awareness. These results suggest that KR on relatively good versus relatively poor trials can have dissociable effects on judgments of learning in the absence of actual learning differences, even when participants are made aware of their feedback manipulation. Copyright © 2015 Elsevier B.V. All rights reserved.
Drug designs fulfilling the requirements of clinical trials aiming at personalizing medicine
Mandrekar, Sumithra J.; Sargent, Daniel J.
2014-01-01
In the current era of stratified medicine and biomarker-driven therapies, the focus has shifted from predictions based on the traditional anatomic staging systems to guide the choice of treatment for an individual patient to an integrated approach using the genetic makeup of the tumor and the genotype of the patient. The clinical trial designs utilized in the developmental pathway for biomarkers and biomarker-directed therapies from discovery to clinical practice are rapidly evolving. While several issues need careful consideration, two critical issues that surround the validation of biomarkers are the choice of the clinical trial design (which is based on the strength of the preliminary evidence and marker prevalence), and biomarker assay related issues surrounding the marker assessment methods such as the reliability and reproducibility of the assay. In this review, we focus on trial designs aiming at personalized medicine in the context of early phase trials for initial marker validation, as well as in the context of larger definitive trials. Designs for biomarker validation are broadly classified as retrospective (i.e., using data from previously well-conducted randomized controlled trials (RCTs) versus prospective (enrichment, all-comers, hybrid or adaptive). We believe that the systematic evaluation and implementation of these design strategies are essential to accelerate the clinical validation of biomarker guided therapy. PMID:25414851
Clarke, Diana E.; Wilcox, Holly C.; Miller, Leslie; Cullen, Bernadette; Gerring, Joan; Greiner, Lisa H.; Newcomer, Alison; Mckitty, Mellisha V.; Regier, Darrel A.; Narrow, William E.
2014-01-01
The Diagnostic and Statistical Manual of Mental Disorders, Fifth Edition (DSM-5) contains criteria for psychiatric diagnoses that reflect advances in the science and conceptualization of mental disorders and address the needs of clinicians. DSM-5 also recommends research on dimensional measures of cross-cutting symptoms and diagnostic severity, which are expected to better capture patients’ experiences with mental disorders. Prior to its May 2013 release, the American Psychiatric Association (APA) conducted field trials to examine the feasibility, clinical utility, reliability, and where possible, the validity of proposed DSM-5 diagnostic criteria and dimensional measures. The methods and measures proposed for the DSM-5 field trials were pilot tested in adult and child/adolescent clinical samples, with the goal to identify and correct design and procedural problems with the proposed methods before resources were expended for the larger DSM-5 Field Trials. Results allowed for the refinement of the protocols, procedures, and measures, which facilitated recruitment, implementation, and completion of the DSM-5 Field Trials. These results highlight the benefits of pilot studies in planning large multisite studies. PMID:24615761
Mahmood, Abda; Roberts, Ian; Shakur, Haleema
2017-07-17
Tranexamic acid prevents blood clots from breaking down and reduces bleeding. However, it is uncertain whether tranexamic acid is effective in traumatic brain injury. The CRASH-3 trial is a randomised controlled trial that will examine the effect of tranexamic acid (versus placebo) on death and disability in 13,000 patients with traumatic brain injury. The CRASH-3 trial hypothesizes that tranexamic acid will reduce intracranial haemorrhage, which will reduce the risk of death. Although it is possible that tranexamic acid will reduce intracranial bleeding, there is also a potential for harm. In particular, tranexamic acid may increase the risk of cerebral thrombosis and ischaemia. The protocol detailed here is for a mechanistic sub-study nested within the CRASH-3 trial. This mechanistic sub-study aims to examine the effect of tranexamic acid (versus placebo) on intracranial bleeding and cerebral ischaemia. The CRASH-3 Intracranial Bleeding Mechanistic Sub-Study (CRASH-3 IBMS) is nested within a prospective, double-blind, multi-centre, parallel-arm randomised trial called the CRASH-3 trial. The CRASH-3 IBMS will be conducted in a cohort of approximately 1000 isolated traumatic brain injury patients enrolled in the CRASH-3 trial. In the CRASH-3 IBMS, brain scans acquired before and after randomisation are examined, using validated methods, for evidence of intracranial bleeding and cerebral ischaemia. The primary outcome is the total volume of intracranial bleeding measured on computed tomography after randomisation, adjusting for baseline bleeding volume. Secondary outcomes include progression of intracranial haemorrhage (from pre- to post-randomisation scans), new intracranial haemorrhage (seen on post- but not pre-randomisation scans), intracranial haemorrhage following neurosurgery, and new focal ischaemic lesions (seen on post-but not pre-randomisation scans). A linear regression model will examine whether receipt of the trial treatment can predict haemorrhage volume. Bleeding volumes and new ischaemic lesions will be compared across treatment groups using relative risks and 95% confidence intervals. The CRASH-3 IBMS will provide an insight into the mechanism of action of tranexamic acid in traumatic brain injury, as well as information about the risks and benefits. Evidence from this trial could inform the management of patients with traumatic brain injury. The CRASH-3 trial was prospectively registered and the CRASH-3 IBMS is an addition to the original protocol registered at the International Standard Randomised Controlled Trials registry ( ISRCTN15088122 ) 19 July 2011, and ClinicalTrials.gov on 25 July 2011 (NCT01402882).
Macefield, Rhiannon C.; Blencowe, Natalie S.; Brookes, Sara T.; Blazeby, Jane M.
2016-01-01
Purpose The CONSORT extension for patient reported outcomes (PROs) aims to improve reporting, but guidance on the optimal integration with clinical data is lacking. This study examines in detail the reporting of PROs and clinical data from randomized controlled trials (RCTs) in gastro-intestinal cancer to inform design and reporting of combined PRO and clinical data from trials to improve the ‘take home’ message for clinicians to use in practice. Materials and Methods The case study was undertaken in gastro-intestinal cancer trials. Well-conducted RCTs reporting PROs with validated instruments were identified and categorized into those combining PRO and clinical data in a single paper, or those separating data into linked primary and supplemental papers. Qualitative methods were developed to examine reporting of the critical interpretation of the trial results (trial exegesis) in the papers in relation of the PRO and clinical outcomes and applied to each publication category. Results were used to inform recommendations for practice. Results From 1917 screened abstracts, 49 high quality RCTs were identified reported in 36 combined and 15 linked primary and supplemental papers. In-depth analysis of manuscript text identified three categories for understanding trial exegesis: where authors reported a “detailed”, “general”, or absent PRO rationale and integrated interpretation of clinical and PRO results. A total of 11 (30%) and 6 (16%) combined papers reported “detailed” PRO rationale and integrated interpretation of results although only 2 (14%) and 1 (7%) primary papers achieved the same standard respectively. Supplemental papers provide better information with 11 (73%) and 3 (20%) achieving “detailed” rationale and integrated interpretation of results. Supplemental papers, however, were published a median of 20 months after the primary RCT data in lower impact factor journals (median 16.8 versus 5.2). Conclusion It is recommended that single papers, with detailed PRO rationale and integrated PRO and clinical data are published to optimize trial exegesis. Further work to examine whether this improves the use of PRO data to inform practice is needed. PMID:27571514
Biomarkers and surrogate endpoints in glaucoma clinical trials
Medeiros, Felipe A
2015-01-01
Surrogate endpoints are often used as replacements for true clinically relevant endpoints in several areas of medicine, as they enable faster and less expensive clinical trials. However, without proper validation, the use of surrogates may lead to incorrect conclusions about the efficacy and safety of treatments. This article reviews the general requirements for validating surrogate endpoints and provides a critical assessment of the use of intraocular pressure (IOP), visual fields, and structural measurements of the optic nerve as surrogate endpoints in glaucoma clinical trials. A valid surrogate endpoint must be able to predict the clinically relevant endpoint and fully capture the effect of an intervention on that endpoint. Despite its widespread use in clinical trials, no proper validation of IOP as a surrogate endpoint has ever been conducted for any class of IOP-lowering treatments. Evidence has accumulated with regard to the role of imaging measurements of optic nerve damage as surrogate endpoints in glaucoma. These measurements are predictive of functional losses in the disease and may explain, at least in part, treatment effects on clinically relevant endpoints. The use of composite endpoints in glaucoma trials may overcome weaknesses of the use of structural or functional endpoints in isolation. Unless research is dedicated to fully develop and validate suitable endpoints that can be used in glaucoma clinical trials, we run the risk of inappropriate judgments about the value of new therapies. PMID:25034049
Zhang, Xin; Wu, Yuxia; Ren, Pengwei; Liu, Xueting; Kang, Deying
2015-10-30
To explore the relationship between the external validity and the internal validity of hypertension RCTs conducted in China. Comprehensive literature searches were performed in Medline, Embase, Cochrane Central Register of Controlled Trials (CCTR), CBMdisc (Chinese biomedical literature database), CNKI (China National Knowledge Infrastructure/China Academic Journals Full-text Database) and VIP (Chinese scientific journals database) as well as advanced search strategies were used to locate hypertension RCTs. The risk of bias in RCTs was assessed by a modified scale, Jadad scale respectively, and then studies with 3 or more grading scores were included for the purpose of evaluating of external validity. A data extract form including 4 domains and 25 items was used to explore relationship of the external validity and the internal validity. Statistic analyses were performed by using SPSS software, version 21.0 (SPSS, Chicago, IL). 226 hypertension RCTs were included for final analysis. RCTs conducted in university affiliated hospitals (P < 0.001) or secondary/tertiary hospitals (P < 0.001) were scored at higher internal validity. Multi-center studies (median = 4.0, IQR = 2.0) were scored higher internal validity score than single-center studies (median = 3.0, IQR = 1.0) (P < 0.001). Funding-supported trials had better methodological quality (P < 0.001). In addition, the reporting of inclusion criteria also leads to better internal validity (P = 0.004). Multivariate regression indicated sample size, industry-funding, quality of life (QOL) taken as measure and the university affiliated hospital as trial setting had statistical significance (P < 0.001, P < 0.001, P = 0.001, P = 0.006 respectively). Several components relate to the external validity of RCTs do associate with the internal validity, that do not stand in an easy relationship to each other. Regarding the poor reporting, other possible links between two variables need to trace in the future methodological researches.
Yamanaka, Ashley; Fialkowski, Marie Kainoa; Wilkens, Lynne; Li, Fenfang; Ettienne, Reynolette; Fleming, Travis; Power, Julianne; Deenik, Jonathan; Coleman, Patricia; Leon Guerrero, Rachael; Novotny, Rachel
2016-09-02
Quality assurance plays an important role in research by assuring data integrity, and thus, valid study results. We aim to describe and share the results of the quality assurance process used to guide the data collection process in a multi-site childhood obesity prevalence study and intervention trial across the US Affiliated Pacific Region. Quality assurance assessments following a standardized protocol were conducted by one assessor in every participating site. Results were summarized to examine and align the implementation of protocol procedures across diverse settings. Data collection protocols focused on food and physical activity were adhered to closely; however, protocols for handling completed forms and ensuring data security showed more variability. Quality assurance protocols are common in the clinical literature but are limited in multi-site community-based studies, especially in underserved populations. The reduction in the number of QA problems found in the second as compared to the first data collection periods for the intervention study attest to the value of this assessment. This paper can serve as a reference for similar studies wishing to implement quality assurance protocols of the data collection process to preserve data integrity and enhance the validity of study findings. NIH clinical trial #NCT01881373.
Treister, Roi; Trudeau, Jeremiah J; Van Inwegen, Richard; Jones, Judith K; Katz, Nathaniel P
2016-12-01
Inappropriate use of analgesic drugs has become increasingly pervasive over the past decade. Currently, drug abuse potential is primarily assessed post-marketing; no validated tools are available to assess this potential in phase II and III clinical trials. This paper describes the development and feasibility testing of a Misuse, Abuse, and Diversion Drug Event Reporting System (MADDERS), which aims to identify potentially abuse-related events and classify them according to a recently developed classification scheme, allowing the quantification of these events in clinical trials. The system was initially conceived and designed with input from experts and patients, followed by field-testing to assess its feasibility and content validity in both completed and ongoing clinical trials. The results suggest that MADDERS is a feasible system with initial validity. It showed higher rates of the triggering events in subjects taking medications with known abuse potential than in patients taking medications without abuse potential. Additionally, experts agreed on the classification of most abuse-related events in MADDERS. MADDERS is a new systematic approach to collect information on potentially abuse-related events in clinical trials and classify them. The system has demonstrated feasibility for implementation. Additional research is ongoing to further evaluate its validity. Currently, there are no validated tools to assess drug abuse potential during clinical trials. Because of its ease of implementation, its systematic approach, and its preliminary validation results, MADDERS could provide such a tool for clinical trials. (Am J Addict 2016;25:641-651). © 2016 American Academy of Addiction Psychiatry.
Knowledge and skills of cancer clinical trials nurses in Australia.
Scott, Kathleen; White, Kate; Johnson, Catherine; Roydhouse, Jessica K
2012-05-01
This paper is a report of the development and testing of a questionnaire measuring knowledge and skills of cancer clinical trials nurse in Australia. The role of cancer clinical trials nurse, widely acknowledged as an integral member of the clinical research team, has evolved in recent years. Elements of the clinical trials nurse role in cancer have previously been described. To evaluate specific cancer clinical trials nurse educational and training needs, the development of a valid and reliable tool is required. In 2009, a study was conducted in three stages. Stage I: questionnaire development and pilot testing; stage II: focus group; stage III: national survey. Internal consistency reliability testing and multi-trait analysis of item convergent/divergent validity were employed. Regression analysis was used to identify predictors of clinical trials nurse knowledge and skills. The national survey was a 48-item questionnaire, measuring six clinical trial knowledge and seven skills sub-scales. Of 61 respondents, 90% were women, with mean age 43 years, 19 years as a Registered Nurse and 5 years as a cancer clinical trials nurse. Self-reported knowledge and skills were satisfactory to good. Internal consistency reliability was high (Cronbach's alpha: knowledge = 0·98; skills = 0·90). Criteria for item convergent/divergent validity were met. Number of years as cancer clinical trials nurse was positively related to self-reported knowledge and skills. Preliminary data suggest that the national survey is reliable and valid. Data have contributed to better understanding the knowledge and skills of cancer clinical trials nurse in Australia and development of a postgraduate course in clinical trials. © 2011 Blackwell Publishing Ltd.
Atkinson, Thomas M; Mendoza, Tito R; Sit, Laura; Passik, Steven; Scher, Howard I; Cleeland, Charles; Basch, Ethan
2010-03-01
In 2006, the United States Food and Drug Administration (FDA) released a draft Guidance for Industry on the use of patient-reported outcomes (PRO) Measures in Medical Product Development to Support Labeling Claims. This draft guidance outlines psychometric aspects that should be considered when designing a PRO measure, including conceptual framework, content validity, construct validity, reliability, and the ability to detect clinically meaningful score changes. When finalized, it may provide a blueprint for evaluations of PRO measures that can be considered by sponsors and investigators involved in PRO research and drug registration trials. In this review we examine the short form of the Brief Pain Inventory (BPI) and particularly the "pain at its worst in the last 24 hours" item in the context of the FDA draft guidance, to assess its utility in clinical trials that include pain as a PRO endpoint. After a systematic evaluation of the psychometric aspects of the BPI, we conclude that the BPI and its "pain at its worst in the last 24 hours" item generically satisfy most key recommendations outlined in the draft guidance for assessing a pain-reduction treatment effect. Nonetheless, when the BPI is being considered for assessment of pain endpoints in a registration trial, sponsors and investigators should consult with the appropriate FDA division early during research design to discuss whether there is sufficient precedent to use the instrument in the population of interest or whether additional evaluations of measurement properties are advisable.
Validity and reliability of a new ankle dorsiflexion measurement device.
Gatt, Alfred; Chockalingam, Nachiappan
2013-08-01
The assessment of the maximum ankle dorsiflexion angle is an important clinical examination procedure. Evidence shows that the traditional goniometer is highly unreliable, and various designs of goniometers to measure the maximum ankle dorsiflexion angle rely on the application of a known force to obtain reliable results. Hence, an innovative ankle dorsiflexion measurement device was designed to make this measurement more reliable by holding the foot in a selected posture without the application of a known moment. To report on the comprehensive validity and reliability testing carried out on the new device. Following validity testing, four different trials to test reliability of the ankle dorsiflexion measurement device were performed. These trials included inter-rater and intra-rater testings with a controlled moment, intra-rater reliability testing with knees flexed and extended without a controlled moment, intra-rater testing with a patient population, and inter-rater reliability testing between four raters of varying experience without controlling moment. All raters were blinded. A series of trials to test intra-rater and inter-rater reliabilities. Intra-rater reliability intraclass correlation coefficient was 0.98 and inter-rater reliability intraclass correlation coefficient (2,1) was 0.953 with a controlled moment. With uncontrolled moment, very high reliability for intra-tester was also achieved (intraclass correlation coefficient = 0.94 with knees extended and intraclass correlation coefficient = 0.95 with knees flexed). For the trial investigating test-retest reliability with actual patients, intraclass correlation coefficient of 0.99 was obtained. In the trial investigating four different raters with uncontrolled moment, intraclass correlation coefficient of 0.91 was achieved. The new ankle dorsiflexion measurement device is a valid and reliable device for measuring ankle dorsiflexion in both healthy subjects and patients, with both controlled and uncontrolled moments, even by multiple raters of varying experience when the foot is dorsiflexed to its end of range of motion. An ankle dorsiflexion measuring device has been designed to increase the reliability of ankle dorsiflexion measurement and replace the traditional goniometer. While the majority of similar devices rely on application of a known moment to perform this measurement, it has been shown that this is not required with the new ankle dorsiflexion measurement device and, rather, foot posture should be taken into consideration as this affects the maximum ankle dorsiflexion angle.
Merghati-Khoei, E; Maasoumi, R; Rahdari, F; Bayat, A; Hajmirzaei, S; Lotfi, S; Hajiaghababaei, M; Emami-Razavi, SH; Korte, JE; Atoof, F
2016-01-01
Study design This is a cross-sectional study Objectives The objective of this study was to examine the psychometric properties of the Sexual Adjustment Questionnaire (SAQ) for Iranian people with spinal cord injury Setting This study was conducted in the brain and Spinal Cord Injury Research Center, Tehran University of Medical Sciences, Tehran, Iran Methods We assessed the psychometric properties of the SAQ, with 200 participants (men = 146, women = 54) completing the scale. An evaluation of its test–retest reliability was performed over a 2-weeks period, on a subsample of 30 patients recruited from the overall group. Cronbach’s α-coefficient was computed for assessment of internal consistency reliability. In addition, content and face validity were examined by an expert committee. Construct validity was assessed by examining convergent and discriminant validity. Finally, exploratory factor analysis was used to extract the factor structure of the questionnaire. Results The Cronbach’s α and intraclass correlation coefficient were 0.77 and 0.72 retrospectively. With regard to construct validity, there was a significant (P = 0.009) negative correlation (r = − 0.28) between the SAQ score and age. Those with lower levels of educations scored significantly lower on the SAQ (P = 0.04). The exploratory factor analysis indicated a four-factor structure for the questionnaire, accounting for 68.9% of the observed variance. The expert committee approved the face and content validity of the developed measure. Conclusion The SAQ is a valid measure for assessing sexual adjustment in people with spinal cord injury. The evaluation of sexual well-being may be useful in clinical trials and practical settings. PMID:25917953
Mentiplay, Benjamin F; Perraton, Luke G; Bower, Kelly J; Pua, Yong-Hao; McGaw, Rebekah; Heywood, Sophie; Clark, Ross A
2015-07-16
The revised Xbox One Kinect, also known as the Microsoft Kinect V2 for Windows, includes enhanced hardware which may improve its utility as a gait assessment tool. This study examined the concurrent validity and inter-day reliability of spatiotemporal and kinematic gait parameters estimated using the Kinect V2 automated body tracking system and a criterion reference three-dimensional motion analysis (3DMA) marker-based camera system. Thirty healthy adults performed two testing sessions consisting of comfortable and fast paced walking trials. Spatiotemporal outcome measures related to gait speed, speed variability, step length, width and time, foot swing velocity and medial-lateral and vertical pelvis displacement were examined. Kinematic outcome measures including ankle flexion, knee flexion and adduction and hip flexion were examined. To assess the agreement between Kinect and 3DMA systems, Bland-Altman plots, relative agreement (Pearson's correlation) and overall agreement (concordance correlation coefficients) were determined. Reliability was assessed using intraclass correlation coefficients, Cronbach's alpha and standard error of measurement. The spatiotemporal measurements had consistently excellent (r≥0.75) concurrent validity, with the exception of modest validity for medial-lateral pelvis sway (r=0.45-0.46) and fast paced gait speed variability (r=0.73). In contrast kinematic validity was consistently poor to modest, with all associations between the systems weak (r<0.50). In those measures with acceptable validity, the inter-day reliability was similar between systems. In conclusion, while the Kinect V2 body tracking may not accurately obtain lower body kinematic data, it shows great potential as a tool for measuring spatiotemporal aspects of gait. Copyright © 2015 Elsevier Ltd. All rights reserved.
Perlis, Roy H
2013-07-01
Early identification of depressed individuals at high risk for treatment resistance could be helpful in selecting optimal setting and intensity of care. At present, validated tools to facilitate this risk stratification are rarely used in psychiatric practice. Data were drawn from the first two treatment levels of a multicenter antidepressant effectiveness study in major depressive disorder, the STAR*D (Sequenced Treatment Alternatives to Relieve Depression) cohort. This cohort was divided into training, testing, and validation subsets. Only clinical or sociodemographic variables available by or readily amenable to self-report were considered. Multivariate models were developed to discriminate individuals reaching remission with a first or second pharmacological treatment trial from those not reaching remission despite two trials. A logistic regression model achieved an area under the receiver operating characteristic curve exceeding .71 in training, testing, and validation cohorts and maintained good calibration across cohorts. Performance of three alternative models with machine learning approaches--a naïve Bayes classifier and a support vector machine, and a random forest model--was less consistent. Similar performance was observed between more and less severe depression, men and women, and primary versus specialty care sites. A web-based calculator was developed that implements this tool and provides graphical estimates of risk. Risk for treatment resistance among outpatients with major depressive disorder can be estimated with a simple model incorporating baseline sociodemographic and clinical features. Future studies should examine the performance of this model in other clinical populations and its utility in treatment selection or clinical trial design. Copyright © 2013 Society of Biological Psychiatry. Published by Elsevier Inc. All rights reserved.
Bellack, Alan S.; Green, Michael F.; Cook, Judith A.; Fenton, Wayne; Harvey, Philip D.; Heaton, Robert K.; Laughren, Thomas; Leon, Andrew C.; Mayo, Donna J.; Patrick, Donald L.; Patterson, Thomas L.; Rose, Andrew; Stover, Ellen; Wykes, Til
2007-01-01
People with schizophrenia frequently have significant problems in community functioning. Progress in developing effective interventions to ameliorate these problems has been slowed by the absence of reliable and valid measures that are suitable for use in clinical trials. The National Institute of Mental Health convened a workgroup in September 2005 to examine this issue and make recommendations to the field that would foster research in this area. This article reports on issues raised at the meeting. Many instruments have been developed to assess community functioning, but overall insufficient attention has been paid to psychometric issues and many instruments are not suitable for use in clinical trials. Consumer self-report, informant report, ratings by clinicians and trained raters, and behavioral assessment all can provide useful and valid information in some circumstances and may be practical for use in clinical trials. However, insufficient attention has been paid to when and how different forms of assessment and sources of information are useful or how to understand inconsistencies. A major limiting factor in development of reliable and valid instruments is failure to develop a suitable model of functioning and its primary mediators and moderators. Several examples that can guide thinking are presented. Finally, the field is limited by the absence of an objective gold standard of community functioning. Hence, outcomes must be evaluated in part by “clinical significance.” This criterion is problematic because different observers and constituencies often have different opinions about what types of change are clinically important and how much change is significant. PMID:16931542
Murray, Nicholas P.; Hunfalvay, Melissa; Bolte, Takumi
2017-01-01
Purpose The purpose of this study was to determine the reliability of interpupillary distance (IPD) and pupil diameter (PD) measures using an infrared eye tracker and central point stimuli. Validity of the test compared to known clinical tools was determined, and normative data was established against which individuals can measure themselves. Methods Participants (416) across various demographics were examined for normative data. Of these, 50 were examined for reliability and validity. Validity for IPD measured the test (RightEye IPD/PD) against the PL850 Pupilometer and the Essilor Digital CRP. For PD, the test was measured against the Rosenbaum Pocket Vision Screener (RPVS). Reliability was analyzed with intraclass correlation coefficients (ICC) between trials with Cronbach's alpha (CA) and the standard error of measurement for each ICC. Convergent validity was investigated by calculating the bivariate correlation coefficient. Results Reliability results were strong (CA > 0.7) for all measures. High positive significant correlations were found between the RightEye IPD test and the PL850 Pupilometer (P < 0.001) and Essilor Digital CRP (P < 0.001) and for the RightEye PD test and the RPVS (P < 0.001). Conclusions Using infrared eye tracking and the RightEye IPD/PD test stimuli, reliable and accurate measures of IPD and PD were found. Results from normative data showed an adequate comparison for people with normal vision development. Translational Relevance Results revealed a central point of fixation may remove variability in examining PD reliably using infrared eye tracking when consistent environmental and experimental procedures are conducted. PMID:28685104
Smith, Chris; Edwards, Phil; Free, Caroline
2018-03-15
A variety of different approaches to measuring contraceptive use have been used or proposed, either to assess current use or adherence over time, using subjective or objective measures. This paper reports an overview of approaches to measuring adherence to the oral contraceptive, intra-uterine device, sub-dermal implant, and injectable and describes how we assessed contraception use in the MObile Technology for Improved Family Planning (MOTIF) trial in Cambodia. We summarise and discuss advantages and disadvantages of different subjective and objective approaches to measuring adherence to the oral contraceptive, intra-uterine device, sub-dermal implant, and injectable such as self-reports, clinic records, electronic monitoring devices, clinical examination and biomarkers. For the MOTIF trial, we did not consider it feasible to measure objective contraception use as many participants lived a long distance from the clinic and we were concerned whether it was appropriate to ask women to return to clinic for a physical examination simply to verify self-report information already provided. We aimed to assess the validity of the four-month data with 50 participants, calculating the sensitivity and specificity of self-reported data compared with objective measurement. For the 46 valid measurements obtained, the sensitivity and specificity was 100% for self-reported contraception use compared to objective measurement but this study had some limitations. To assess reliability of self-report data we compared calendar data collected on effective contraception use at months 1-4 post-abortion, collected separately at four and 12 months. Agreement ranged from 80 to 84% with a kappa statistic ranging from 0·59 to 0·67 indicating fair to good agreement. There is no perfect method of assessing contraception use and researchers designing future studies should give consideration of what to measure, for example current use or detailed patterns of use over time, and remain mindful of what will be feasible and acceptable to the study population. Although self-reported data on contraception use are considered less reliable, and prone to social desirability bias, it is often the standard approach for contraception research and provides data comparable to previous studies. ClinicalTrials.gov Identifier: NCT01823861 . Registered: March 30, 2013.
Faria, Ana Lúcia; Andrade, Andreia; Soares, Luísa; I Badia, Sergi Bermúdez
2016-11-02
Stroke is one of the most common causes of acquired disability, leaving numerous adults with cognitive and motor impairments, and affecting patients' capability to live independently. There is substancial evidence on post-stroke cognitive rehabilitation benefits, but its implementation is generally limited by the use of paper-and-pencil methods, insufficient personalization, and suboptimal intensity. Virtual reality tools have shown potential for improving cognitive rehabilitation by supporting carefully personalized, ecologically valid tasks through accessible technologies. Notwithstanding important progress in VR-based cognitive rehabilitation systems, specially with Activities of Daily Living (ADL's) simulations, there is still a need of more clinical trials for its validation. In this work we present a one-month randomized controlled trial with 18 stroke in and outpatients from two rehabilitation units: 9 performing a VR-based intervention and 9 performing conventional rehabilitation. The VR-based intervention involved a virtual simulation of a city - Reh@City - where memory, attention, visuo-spatial abilities and executive functions tasks are integrated in the performance of several daily routines. The intervention had levels of difficulty progression through a method of fading cues. There was a pre and post-intervention assessment in both groups with the Addenbrooke Cognitive Examination (primary outcome) and the Trail Making Test A and B, Picture Arrangement from WAIS III and Stroke Impact Scale 3.0 (secondary outcomes). A within groups analysis revealed significant improvements in global cognitive functioning, attention, memory, visuo-spatial abilities, executive functions, emotion and overall recovery in the VR group. The control group only improved in self-reported memory and social participation. A between groups analysis, showed significantly greater improvements in global cognitive functioning, attention and executive functions when comparing VR to conventional therapy. Our results suggest that cognitive rehabilitation through the Reh@City, an ecologically valid VR system for the training of ADL's, has more impact than conventional methods. This trial was not registered because it is a small sample study that evaluates the clinical validity of a prototype virtual reality system.
Wagman, Jennifer A.; Paul, Amy; Namatovu, Fredinah; Ssekubugu, Robert; Nalugoda, Fred
2016-01-01
Objective We identify complexities encountered, including unanticipated crossover between trial arms and inadequate ‘standard of care’ violence services, during a cluster randomized trial (CRT) of a community-level intimate partner violence (IPV) and HIV prevention intervention in Uganda. Methods Concepts in public health ethics - beneficence, social value of research, fairness, standard of care, and researcher responsibilities for post-trial benefits - are used to critically reflect on lessons learned and guide discussion on practical and ethical challenges of violence intervention CRTs. Results Existing ethical guidelines provide incomplete guidance for responding to unexpected crossover in CRTs providing IPV services. We struggled to balance duty of care with upholding trial integrity, and identifying and providing appropriate standard of care. While we ultimately offered short-term IPV services to controls, we faced additional challenges related to sustaining services beyond the ‘short-term’ and post-trial. Conclusion Studies evaluating community-level violence interventions, including those combined with HIV reduction strategies, are limited yet critical for developing evidence-based approaches for effectively preventing IPV. Although CRTs are a promising design, further guidance is needed to implement trials that avoid introducing tensions between validity of findings, researchers’ responsibilities to protect participants, and equitable distribution of CRT benefits. PMID:27453794
Mossman, Douglas; Wygant, Dustin B; Gervais, Roger O; Hart, Kathleen J
2018-01-01
This study examines the accuracy of the Test of Memory Malingering (TOMM), a frequently administered measure for evaluating effort during neurocognitive testing. In the last few years, several authors have suggested that the initial recognition trial of the TOMM (Trial 1) might be a more useful index for detecting feigned or exaggerated impairment than Trial 2, which is the source for inference recommended by the original instruction manual (Tombaugh, 1996). We used latent class modeling (LCM) implemented in a Bayesian framework to evaluate archival Trial 1 and Trial 2 data collected from 1,198 adults who had undergone outpatient forensic evaluations. All subjects were tested with 2 other performance validity tests (the Word Memory Test and the Computerized Assessment of Response Bias), and for 70% of the subjects, data from the California Verbal Learning Test-Second Edition Forced Choice trial were also available. Our results suggest that not even a perfect score on Trial 1 or Trial 2 justifies saying that an evaluee is definitely responding genuinely, although such scores imply a lower-than-base-rate probability of feigning. If one uses a Trial 2 cut-off higher than the manual's recommendation, Trial 2 does better than Trial 1 at identifying individuals who are almost certainly feigning while maintaining a negligible false positive rate. Using scores from both trials, one can identify a group of definitely feigning and very likely feigning subjects who comprise about 2 thirds of all feigners; only 1% of the members of this group would not be feigning. (PsycINFO Database Record (c) 2018 APA, all rights reserved).
Daftary-Kapur, Tarika; Penrod, Steven D; O'Connor, Maureen; Wallace, Brian
2014-10-01
The purpose of this study was to examine the influence of pretrial publicity (PTP) on mock juror decision making. Specifically, we examined the influence of quantity and slant of the PTP (proprosecution vs. prodefense), the persistence of PTP effects over time, and whether the PTP effects demonstrated in research laboratories would also occur in more naturalistic settings (generalizability). Using a shadow jury paradigm we examined these effects using a real trial as stimulus. Mock jurors included 115 jury-eligible community members who were naturally exposed to PTP in the venue in which the actual case occurred and 156 who were experimentally exposed. We found mock jurors were significantly influenced by both the slant and quantity of the PTP to which they were exposed, such that those exposed to proprosecution or prodefense PTP tended to render decision in support of the party favored in the PTP, and those exposed to greater quantities of PTP tended to be more biased. Additionally, PTP effects persisted throughout the course of the trial and continued to influence judgments in face of trial evidence and arguments. A finding of no significant difference in the effect of exposure slant between the naturally exposed and experimentally exposed samples provides support for the external validity of laboratory studies examining PTP effects. This research helps address some of the concerns raised by courts with regard to the durability of PTP effects and the application of laboratory findings to real world settings. PsycINFO Database Record (c) 2014 APA, all rights reserved.
Roberts, Paul
2016-01-01
Adopting the interpretative/hermeneutical method typical of much legal scholarship, this article considers two sets of issues pertaining to LTDNA profiles as evidence in criminal proceedings. The section titled Expert Evidence as Forensic Epistemic Warrant addresses some rather large questions about the epistemic status and probative value of expert testimony in general. It sketches a theoretical model of expert evidence, highlighting five essential criteria: (1) expert competence; (2) disciplinary domain; (3) methodological validity; (4) materiality; and (5) legal admissibility. This generic model of expert authority, highlighting law's fundamentally normative character, applies to all modern forms of criminal adjudication, across Europe and farther afield. The section titled LTDNA Evidence in UK Criminal Trials then examines English and Northern Irish courts' attempts to get to grips with LTDNA evidence in recent cases. Better appreciating the ways in which UK courts have addressed the challenges of LTDNA evidence may offer some insights into parallel developments in other legal systems. Appellate court rulings follow a predictable judicial logic, which might usefully be studied and reflected upon by any forensic scientist or statistician seeking to operate effectively in criminal proceedings. Whilst each legal jurisdiction has its own unique blend of jurisprudence, institutions, cultures and historical traditions, there is considerable scope for comparative analysis and cross-jurisdictional borrowing and instruction. In the spirit of promoting more nuanced and sophisticated international interdisciplinary dialogue, this article examines UK judicial approaches to LTDNA evidence and begins to elucidate their underlying institutional logic. Legal argument and broader policy debates are not confined to considerations of scientific validity, contamination risks and evidential integrity, or associated judgments of legal admissibility or exclusion. They also crucially concern the manner in which LTDNA profiling results are presented and explained to factfinders in criminal trials. PMID:27826316
Roberts, Paul
2016-01-01
Adopting the interpretative/hermeneutical method typical of much legal scholarship, this article considers two sets of issues pertaining to LTDNA profiles as evidence in criminal proceedings. The section titled Expert Evidence as Forensic Epistemic Warrant addresses some rather large questions about the epistemic status and probative value of expert testimony in general. It sketches a theoretical model of expert evidence, highlighting five essential criteria: (1) expert competence; (2) disciplinary domain; (3) methodological validity; (4) materiality; and (5) legal admissibility. This generic model of expert authority, highlighting law's fundamentally normative character, applies to all modern forms of criminal adjudication, across Europe and farther afield. The section titled LTDNA Evidence in UK Criminal Trials then examines English and Northern Irish courts' attempts to get to grips with LTDNA evidence in recent cases. Better appreciating the ways in which UK courts have addressed the challenges of LTDNA evidence may offer some insights into parallel developments in other legal systems. Appellate court rulings follow a predictable judicial logic, which might usefully be studied and reflected upon by any forensic scientist or statistician seeking to operate effectively in criminal proceedings. Whilst each legal jurisdiction has its own unique blend of jurisprudence, institutions, cultures and historical traditions, there is considerable scope for comparative analysis and cross-jurisdictional borrowing and instruction. In the spirit of promoting more nuanced and sophisticated international interdisciplinary dialogue, this article examines UK judicial approaches to LTDNA evidence and begins to elucidate their underlying institutional logic. Legal argument and broader policy debates are not confined to considerations of scientific validity, contamination risks and evidential integrity, or associated judgments of legal admissibility or exclusion. They also crucially concern the manner in which LTDNA profiling results are presented and explained to factfinders in criminal trials.
Lurie, Jon D.; Tosteson, Anna N.A.; Deyo, Richard A.; Tosteson, Tor; Weinstein, James; Mirza, Sohail K.
2014-01-01
Study Design Retrospective analysis of Medicare claims linked to a multi-center clinical trial. Objective The Spine Patient Outcomes Research Trial (SPORT) provided a unique opportunity to examine the validity of a claims-based algorithm for grouping patients by surgical indication. SPORT enrolled patients for lumbar disc herniation, spinal stenosis, and degenerative spondylolisthesis. We compared the surgical indication derived from Medicare claims to that provided by SPORT surgeons, the “gold standard”. Summary of Background Data Administrative data are frequently used to report procedure rates, surgical safety outcomes, and costs in the management of spinal surgery. However, the accuracy of using diagnosis codes to classify patients by surgical indication has not been examined. Methods Medicare claims were link to beneficiaries enrolled in SPORT. The sensitivity and specificity of three claims-based approaches to group patients based on surgical indications were examined: 1) using the first listed diagnosis; 2) using all diagnoses independently; and 3) using a diagnosis hierarchy based on the support for fusion surgery. Results Medicare claims were obtained from 376 SPORT participants, including 21 with disc herniation, 183 with spinal stenosis, and 172 with degenerative spondylolisthesis. The hierarchical coding algorithm was the most accurate approach for classifying patients by surgical indication, with sensitivities of 76.2%, 88.1%, and 84.3% for disc herniation, spinal stenosis, and degenerative spondylolisthesis cohorts, respectively. The specificity was 98.3% for disc herniation, 83.2% for spinal stenosis, and 90.7% for degenerative spondylolisthesis. Misclassifications were primarily due to codes attributing more complex pathology to the case. Conclusion Standardized approaches for using claims data to accurately group patients by surgical indications has widespread interest. We found that a hierarchical coding approach correctly classified over 90% of spine patients into their respective SPORT cohorts. Therefore, claims data appears to be a reasonably valid approach to classifying patients by surgical indication. PMID:24525995
Biomarkers and Surrogate Endpoints: Lessons Learned From Glaucoma
Medeiros, Felipe A.
2017-01-01
With the recent progress in imaging technologies for assessment of structural damage in glaucoma, a debate has emerged on whether these measurements can be used as valid surrogate endpoints in clinical trials evaluating new therapies for the disease. A discussion of surrogates should be grounded on knowledge acquired from their use in other areas of medicine as well as regulatory requirements. This article reviews the conditions for valid surrogacy in the context of glaucoma clinical trials and critically evaluates the role of biomarkers such as IOP and imaging measurements as potential surrogates for clinically relevant outcomes. Valid surrogate endpoints must be able to predict a clinically relevant endpoint, such as loss of vision or decrease in quality of life. In addition, the effect of a proposed treatment on the surrogate must capture the effect of the treatment on the clinically relevant endpoint. Despite its widespread use in clinical trials, no proper validation of IOP as a surrogate endpoint has yet been conducted for any class of IOP-lowering treatments. Although strong evidence has accumulated about imaging measurements as predictors of relevant functional outcomes in glaucoma, there is still insufficient evidence to support their use as valid surrogate endpoints. However, imaging biomarkers could potentially be used as part of composite endpoints in glaucoma trials, overcoming weaknesses of the use of structural or functional endpoints in isolation. Efforts should be taken to properly design and conduct studies that can provide proper validation of potential biomarkers in glaucoma clinical trials. PMID:28475699
Biomarkers and Surrogate Endpoints: Lessons Learned From Glaucoma.
Medeiros, Felipe A
2017-05-01
With the recent progress in imaging technologies for assessment of structural damage in glaucoma, a debate has emerged on whether these measurements can be used as valid surrogate endpoints in clinical trials evaluating new therapies for the disease. A discussion of surrogates should be grounded on knowledge acquired from their use in other areas of medicine as well as regulatory requirements. This article reviews the conditions for valid surrogacy in the context of glaucoma clinical trials and critically evaluates the role of biomarkers such as IOP and imaging measurements as potential surrogates for clinically relevant outcomes. Valid surrogate endpoints must be able to predict a clinically relevant endpoint, such as loss of vision or decrease in quality of life. In addition, the effect of a proposed treatment on the surrogate must capture the effect of the treatment on the clinically relevant endpoint. Despite its widespread use in clinical trials, no proper validation of IOP as a surrogate endpoint has yet been conducted for any class of IOP-lowering treatments. Although strong evidence has accumulated about imaging measurements as predictors of relevant functional outcomes in glaucoma, there is still insufficient evidence to support their use as valid surrogate endpoints. However, imaging biomarkers could potentially be used as part of composite endpoints in glaucoma trials, overcoming weaknesses of the use of structural or functional endpoints in isolation. Efforts should be taken to properly design and conduct studies that can provide proper validation of potential biomarkers in glaucoma clinical trials.
Jensen, Garrett; Tang, Chad; Hess, Kenneth R; Bishop, Andrew J; Pan, Hubert Y; Li, Jing; Yang, James N; Tannir, Nizar M; Amini, Behrang; Tatsui, Claudio; Rhines, Laurence; Brown, Paul D; Ghia, Amol J
2017-01-01
We sought to validate the Prognostic Index for Spinal Metastases (PRISM), a scoring system that stratifies patients into subgroups by overall survival.Methods and materials: The PRISM was previously created from multivariate Cox regression with patients enrolled in prospective single institution trials of stereotactic spine radiosurgery (SSRS) for spinal metastasis. We assess model calibration and discrimination within a validation cohort of patients treated off-trial with SSRS for metastatic disease at the same institution. The training and validation cohorts consisted of 205 and 249 patients respectively. Similar survival trends were shown in the 4 PRISM. Survival was significantly different between PRISM subgroups (P<0.0001). C-index for the validation cohort was 0.68 after stratification into subgroups. We internally validated the PRISM with patients treated off-protocol, demonstrating that it can distinguish subgroups by survival, which will be useful for individualizing treatment of spinal metastases and stratifying patients for clinical trials.
Development of the Therapist Empathy Scale.
Decker, Suzanne E; Nich, Charla; Carroll, Kathleen M; Martino, Steve
2014-05-01
Few measures exist to examine therapist empathy as it occurs in session. A 9-item observer rating scale, called the Therapist Empathy Scale (TES), was developed based on Watson's (1999) work to assess affective, cognitive, attitudinal, and attunement aspects of therapist empathy. The aim of this study was to evaluate the inter-rater reliability, internal consistency, and construct and criterion validity of the TES. Raters evaluated therapist empathy in 315 client sessions conducted by 91 therapists, using data from a multi-site therapist training trial (Martino et al., 2010) in Motivational Interviewing (MI). Inter-rater reliability (ICC = .87 to .91) and internal consistency (Cronbach's alpha = .94) were high. Confirmatory factor analyses indicated some support for single-factor fit. Convergent validity was supported by correlations between TES scores and MI fundamental adherence (r range .50 to .67) and competence scores (r range .56 to .69). Discriminant validity was indicated by negative or nonsignificant correlations between TES and MI-inconsistent behavior (r range .05 to -.33). The TES demonstrates excellent inter-rater reliability and internal consistency. RESULTS indicate some support for a single-factor solution and convergent and discriminant validity. Future studies should examine the use of the TES to evaluate therapist empathy in different psychotherapy approaches and to determine the impact of therapist empathy on client outcome.
Angeles, Arkhjamil; Hung, Wayne; Cheung, Winson Y
2018-06-23
The CORRECT trial demonstrated survival benefits with regorafenib monotherapy in patients with treatment-refractory, metastatic colorectal cancer (mCRC). However, the trial's stringent eligibility criteria for regorafenib may limit its external validity. We aimed to examine treatment attrition rates and eligibility for regorafenib in routine practice. We identified patients at the British Columbia Cancer Agency diagnosed with mCRC who demonstrated disease progression or intolerable toxicity on 2 or more lines of systemic therapy. During the study timeframe, panitumumab and cetuximab were only used in the chemo-refractory setting. Data on clinicopathologic variables and patient outcomes were ascertained and analyzed. Eligibility was determined using the CORRECT trial criteria. A total of 391 patients were identified, among whom only 39% were eligible for regorafenib: 35% in the panitumumab group and 51% in the cetuximab group. The main reasons for ineligibility in all patients were Eastern Cooperative Oncology Group Performance Status (ECOG PS) > 1 (69%), an elevated total bilirubin (21%), and thromboembolic events in the past 6 months (10%). No difference in eligibility for regorafenib was observed between patients previously receiving panitumumab or cetuximab (P = 0.914; 95% CI 0.550-1.951). Kaplan-Meier analyses showed that regorafenib-eligible compared to regorafenib-ineligible patients had an increased median overall survival of 5.3 versus 2.1 months, respectively (P < 0.001). However, Cox proportional hazard analyses showed that only ECOG PS rather than trial eligibility was correlated with outcomes. The strict eligibility criteria disqualify most patients with treatment-refractory mCRC for regorafenib therapy. Future trials should broaden the eligibility criteria to improve external validity.
Shao, Hui; Fonseca, Vivian; Stoecker, Charles; Liu, Shuqian; Shi, Lizheng
2018-05-03
There is an urgent need to update diabetes prediction, which has relied on the United Kingdom Prospective Diabetes Study (UKPDS) that dates back to 1970 s' European populations. The objective of this study was to develop a risk engine with multiple risk equations using a recent patient cohort with type 2 diabetes mellitus reflective of the US population. A total of 17 risk equations for predicting diabetes-related microvascular and macrovascular events, hypoglycemia, mortality, and progression of diabetes risk factors were estimated using the data from the Action to Control Cardiovascular Risk in Diabetes (ACCORD) trial (n = 10,251). Internal and external validation processes were used to assess performance of the Building, Relating, Assessing, and Validating Outcomes (BRAVO) risk engine. One-way sensitivity analysis was conducted to examine the impact of risk factors on mortality at the population level. The BRAVO risk engine added several risk factors including severe hypoglycemia and common US racial/ethnicity categories compared with the UKPDS risk engine. The BRAVO risk engine also modeled mortality escalation associated with intensive glycemic control (i.e., glycosylated hemoglobin < 6.5%). External validation showed a good prediction power on 28 endpoints observed from other clinical trials (slope = 1.071, R 2 = 0.86). The BRAVO risk engine for the US diabetes cohort provides an alternative to the UKPDS risk engine. It can be applied to assist clinical and policy decision making such as cost-effective resource allocation in USA.
Generalizability of Clinical Trial Results for Adolescent Major Depressive Disorder.
Blanco, Carlos; Hoertel, Nicolas; Franco, Silvia; Olfson, Mark; He, Jian-Ping; López, Saioa; González-Pinto, Ana; Limosin, Frédéric; Merikangas, Kathleen R
2017-12-01
Although there have been a number of clinical trials evaluating treatments for adolescents with major depressive disorder (MDD), the generalizability of those trials to samples of depressed adolescents who present for routine clinical care is unknown. Examining the generalizability of clinical trials of pharmacological and psychotherapy interventions for adolescent depression can help administrators and frontline practitioners determine the relevance of these studies for their patients and may also guide eligibility criteria for future clinical trials in this clinical population. Data on nationally representative adolescents were derived from the National Comorbidity Survey: Adolescent Supplement. To assess the generalizability of adolescent clinical trials for MDD, we applied a standard set of eligibility criteria representative of clinical trials to all adolescents in the National Comorbidity Survey: Adolescent Supplement with a Diagnostic and Statistical Manual of Mental Disorders, Fourth Edition diagnosis of MDD ( N = 592). From the overall MDD sample, 61.9% would have been excluded from a typical pharmacological trial, whereas 42.2% would have been excluded from a psychotherapy trial. Among those who sought treatment ( n = 412), the corresponding exclusion rates were 72.7% for a pharmacological trial and 52.2% for a psychotherapy trial. The criterion leading to the largest number of exclusions was "significant risk of suicide" in both pharmacological and psychotherapy trials. Pharmacological and, to a lesser extent, psychotherapy clinical trials likely exclude most adolescents with MDD. Careful consideration should be given to balancing eligibility criteria and internal validity with applicability in routine clinical care while ensuring patient safety. Copyright © 2017 by the American Academy of Pediatrics.
Müller, Daniel J.; Ng, Chee H.; Byron, Keith; Berk, Michael; Singh, Ajeet B.
2017-01-01
Background Pharmacogenetic-based dosing support tools have been developed to personalize antidepressant-prescribing practice. However, the clinical validity of these tools has not been adequately tested, particularly for specific antidepressants. Objective To examine the concordance between the actual dose and a polygene pharmacogenetic predicted dose of desvenlafaxine needed to achieve symptom remission. Materials and methods A 10-week, open-label, prospective trial of desvenlafaxine among Caucasian adults with major depressive disorder (n=119) was conducted. Dose was clinically adjusted and at the completion of the trial, the clinical dose needed to achieve remission was compared with the predicted dose needed to achieve remission. Results Among remitters (n=95), there was a strong concordance (Kendall’s τ-b=0.84, P=0.0001; Cohen’s κ=0.82, P=0.0001) between the actual and the predicted dose need to achieve symptom remission, showing high sensitivity (≥85%), specificity (≥86%), and accuracy (≥89%) of the tool. Conclusion Findings provide initial evidence for the clinical validity of a polygene pharmacogenetic-based tool for desvenlafaxine dosing. PMID:27779571
Suitable RF spectrum in ISM band for 2-way advanced metering network in India
NASA Astrophysics Data System (ADS)
Mishra, A.; Khan, M. A.; Gaur, M. S.
2013-01-01
The ISM (Industrial Scientific and Medical) bands in the radio frequency space in India offer two alternative spectra to implement wireless network for advanced metering infrastructure (AMI). These bands lie in the range of 2.4GHz and sub-GHz frequencies 865 to 867 MHz This paper aims to examine the suitability of both options by designing and executing experiments in laboratory as well as carrying out field trials on electricity meters to validate the selected option. A parameter, communication effectiveness index (CEI2) is defined to measure the effectiveness of 2 way data communication (packet exchange) between two points under different scenarios of buildings and free space. Both 2.4 GHz and Sub-GHz designs were implemented to compare the results. The experiments were conducted across 3 floors of a building. Validation of the selected option was carried out by conducting a field trial by integrating the selected radio frequency (RF) modem into the single phase electricity meters and installing these meters across three floors of the building. The methodology, implementation details, observations and resulting analytical conclusion are described in the paper.
Exploration of Force Myography and surface Electromyography in hand gesture classification.
Jiang, Xianta; Merhi, Lukas-Karim; Xiao, Zhen Gang; Menon, Carlo
2017-03-01
Whereas pressure sensors increasingly have received attention as a non-invasive interface for hand gesture recognition, their performance has not been comprehensively evaluated. This work examined the performance of hand gesture classification using Force Myography (FMG) and surface Electromyography (sEMG) technologies by performing 3 sets of 48 hand gestures using a prototyped FMG band and an array of commercial sEMG sensors worn both on the wrist and forearm simultaneously. The results show that the FMG band achieved classification accuracies as good as the high quality, commercially available, sEMG system on both wrist and forearm positions; specifically, by only using 8 Force Sensitive Resisters (FSRs), the FMG band achieved accuracies of 91.2% and 83.5% in classifying the 48 hand gestures in cross-validation and cross-trial evaluations, which were higher than those of sEMG (84.6% and 79.1%). By using all 16 FSRs on the band, our device achieved high accuracies of 96.7% and 89.4% in cross-validation and cross-trial evaluations. Copyright © 2017 IPEM. Published by Elsevier Ltd. All rights reserved.
Bousman, Chad A; Müller, Daniel J; Ng, Chee H; Byron, Keith; Berk, Michael; Singh, Ajeet B
2017-01-01
Pharmacogenetic-based dosing support tools have been developed to personalize antidepressant-prescribing practice. However, the clinical validity of these tools has not been adequately tested, particularly for specific antidepressants. To examine the concordance between the actual dose and a polygene pharmacogenetic predicted dose of desvenlafaxine needed to achieve symptom remission. A 10-week, open-label, prospective trial of desvenlafaxine among Caucasian adults with major depressive disorder (n=119) was conducted. Dose was clinically adjusted and at the completion of the trial, the clinical dose needed to achieve remission was compared with the predicted dose needed to achieve remission. Among remitters (n=95), there was a strong concordance (Kendall's τ-b=0.84, P=0.0001; Cohen's κ=0.82, P=0.0001) between the actual and the predicted dose need to achieve symptom remission, showing high sensitivity (≥85%), specificity (≥86%), and accuracy (≥89%) of the tool. Findings provide initial evidence for the clinical validity of a polygene pharmacogenetic-based tool for desvenlafaxine dosing.
Barker, David H; Swenson, Rebecca R; Brown, Larry K; Stanton, Bonita F; Vanable, Peter A; Carey, Michael P; Valois, Robert F; Diclemente, Ralph J; Salazar, Laura F; Romer, Daniel
2012-04-01
HIV-related stigma has been shown to impede HIV-antibody testing and safer sexual practices in adults. Less is known about its effects on prevention programs among at-risk youth. This study examined the longitudinal relationships between HIV-stigma and HIV-knowledge following completion of a validated group-based intervention. Data were provided by 1,654 African-American adolescents who participated in a large multi-city prevention trial (Project iMPACCS). Participants were randomly assigned to an empirically-validated skill-based intervention or a general health promotion control group. Both stigma and knowledge were assessed at baseline and post-intervention. Results suggested that adolescents participating in the intervention showed improvements in knowledge and decreases in stigma when compared to controls. Improvements in stigma appeared to be partly driven by improvements in knowledge. Higher baseline stigma was shown to reduce gains in knowledge in both the treatment and control groups. Results suggest that HIV-stigma can interfere with how youth identify with and internalize messages from group-based prevention trials.
Fernandez-Hermida, Jose Ramon; Calafat, Amador; Becoña, Elisardo; Tsertsvadze, Alexander; Foxcroft, David R
2012-09-01
To assess external validity characteristics of studies from two Cochrane Systematic Reviews of the effectiveness of universal family-based prevention of alcohol misuse in young people. Two reviewers used an a priori developed external validity rating form and independently assessed three external validity dimensions of generalizability, applicability and predictability (GAP) in randomized controlled trials. The majority (69%) of the included 29 studies were rated 'unclear' on the reporting of sufficient information for judging generalizability from sample to study population. Ten studies (35%) were rated 'unclear' on the reporting of sufficient information for judging applicability to other populations and settings. No study provided an assessment of the validity of the trial end-point measures for subsequent mortality, morbidity, quality of life or other economic or social outcomes. Similarly, no study reported on the validity of surrogate measures using established criteria for assessing surrogate end-points. Studies evaluating the benefits of family-based prevention of alcohol misuse in young people are generally inadequate at reporting information relevant to generalizability of the findings or implications for health or social outcomes. Researchers, study authors, peer reviewers, journal editors and scientific societies should take steps to improve the reporting of information relevant to external validity in prevention trials. © 2012 The Authors. Addiction © 2012 Society for the Study of Addiction.
Towards automatic patient selection for chemotherapy in colorectal cancer trials
NASA Astrophysics Data System (ADS)
Wright, Alexander; Magee, Derek; Quirke, Philip; Treanor, Darren E.
2014-03-01
A key factor in the prognosis of colorectal cancer, and its response to chemoradiotherapy, is the ratio of cancer cells to surrounding tissue (the so called tumour:stroma ratio). Currently tumour:stroma ratio is calculated manually, by examining H&E stained slides and counting the proportion of area of each. Virtual slides facilitate this analysis by allowing pathologists to annotate areas of tumour on a given digital slide image, and in-house developed stereometry tools mark random, systematic points on the slide, known as spots. These spots are examined and classified by the pathologist. Typical analyses require a pathologist to score at least 300 spots per tumour. This is a time consuming (10- 60 minutes per case) and laborious task for the pathologist and automating this process is highly desirable. Using an existing dataset of expert-classified spots from one colorectal cancer clinical trial, an automated tumour:stroma detection algorithm has been trained and validated. Each spot is extracted as an image patch, and then processed for feature extraction, identifying colour, texture, stain intensity and object characteristics. These features are used as training data for a random forest classification algorithm, and validated against unseen image patches. This process was repeated for multiple patch sizes. Over 82,000 such patches have been used, and results show an accuracy of 79%, depending on image patch size. A second study examining contextual requirements for pathologist scoring was conducted and indicates that further analysis of structures within each image patch is required in order to improve algorithm accuracy.
Interactive spaced-education to teach the physical examination: a randomized controlled trial.
Kerfoot, B Price; Armstrong, Elizabeth G; O'Sullivan, Patricia N
2008-07-01
Several studies have documented that physical examination knowledge and skills are limited among medical trainees. The objective of the study is to investigate the efficacy and acceptability of a novel online educational methodology termed 'interactive spaced-education' (ISE) as a method to teach the physical examination. The design of the study is randomized controlled trial. All 170 second-year students in the physical examination course at Harvard Medical School were eligible to enroll. Spaced-education items (questions and explanations) were developed on core physical examination topics and were content-validated by two experts. Based on pilot-test data, 36 items were selected for inclusion. Students were randomized to start the 18-week program in November 2006 or 12 weeks later. Students were sent 6 spaced-education e-mails each week for 6 weeks (cycle 1) which were then repeated in two subsequent 6-week cycles (cycles 2 and 3). Students submitted answers to the questions online and received immediate feedback. An online end-of-program survey was administered. One-hundred twenty students enrolled in the trial. Cycles 1, 2, and 3 were completed by 88%, 76%, and 71% of students, respectively. Under an intent-to-treat analysis, cycle 3 scores for cohort A students [mean 74.0 (SD 13.5)] were significantly higher than cycle 1 scores for cohort B students [controls; mean 59.0 (SD 10.5); P < .001], corresponding to a Cohen's effect size of 1.43. Eighty-five percent of participants (102 of 120) recommended the ISE program for students the following year. ISE can generate significant improvements in knowledge of the physical examination and is very well-accepted by students.
Khoury, Joseph D; Wang, Wei-Lien; Prieto, Victor G; Medeiros, L Jeffrey; Kalhor, Neda; Hameed, Meera; Broaddus, Russell; Hamilton, Stanley R
2018-02-01
Biomarkers that guide therapy selection are gaining unprecedented importance as targeted therapy options increase in scope and complexity. In conjunction with high-throughput molecular techniques, therapy-guiding biomarker assays based upon immunohistochemistry (IHC) have a critical role in cancer care in that they inform about the expression status of a protein target. Here, we describe the validation procedures for four clinical IHC biomarker assays-PTEN, RB, MLH1, and MSH2-for use as integral biomarkers in the nationwide NCI-Molecular Analysis for Therapy Choice (NCI-MATCH) EAY131 clinical trial. Validation procedures were developed through an iterative process based on collective experience and adaptation of broad guidelines from the FDA. The steps included primary antibody selection; assay optimization; development of assay interpretation criteria incorporating biological considerations; and expected staining patterns, including indeterminate results, orthogonal validation, and tissue validation. Following assay lockdown, patient samples and cell lines were used for analytic and clinical validation. The assays were then approved as laboratory-developed tests and used for clinical trial decisions for treatment selection. Calculations of sensitivity and specificity were undertaken using various definitions of gold-standard references, and external validation was required for the PTEN IHC assay. In conclusion, validation of IHC biomarker assays critical for guiding therapy in clinical trials is feasible using comprehensive preanalytic, analytic, and postanalytic steps. Implementation of standardized guidelines provides a useful framework for validating IHC biomarker assays that allow for reproducibility across institutions for routine clinical use. Clin Cancer Res; 24(3); 521-31. ©2017 AACR . ©2017 American Association for Cancer Research.
Kerfoot, B Price; Baker, Harley
2012-03-01
While games are frequently used in resident education, there is little evidence supporting their efficacy. We investigated whether a spaced-education (SE) game can be both a reliable and valid method of assessing residents' knowledge and an effective means of teaching core content. The SE game consisted of 100 validated multiple-choice questions and explanations on core urology content. Residents were sent 2 questions each day via email. Adaptive game mechanics re-sent the questions in 2 or 6 weeks if answered incorrectly and correctly, respectively. Questions expired if not answered on time (appointment dynamic). Residents retired questions by answering each correctly twice in a row (progression dynamic). Competition was fostered by posting relative performance among residents. Main outcomes measures were baseline scores (percentage of questions answered correctly on initial presentation) and completion scores (percentage of questions retired). Nine hundred thirty-one US and Canadian residents enrolled in the 45-week trial. Cronbach alpha reliability for the SE baseline scores was 0.87. Baseline scores (median 62%, interquartile range [IQR] 17%) correlated with scores on the 2008 American Urological Association in-service examination (ISE08), 2009 American Board of Urology qualifying examination (QE09), and ISE09 (r = 0.76, 0.46, and 0.64, respectively; all p < 0.001). Baseline scores varied by sex, country, medical degree, and year of training (all p ≤ 0.001). Completion scores (median 100%, IQR 2%) correlated with ISE08 and ISE09 scores (r = 0.35, p < 0.001 for both). Seventy-two percent of enrollees (667 of 931) requested to participate in future SE games. An SE game is a reliable and valid means to assess residents' knowledge and is a well-accepted method by which residents can master core content. Published by Elsevier Inc.
The influence of FMRI lie detection evidence on juror decision-making.
McCabe, David P; Castel, Alan D; Rhodes, Matthew G
2011-01-01
In the current study, we report on an experiment examining whether functional magnetic resonance imaging (fMRI) lie detection evidence would influence potential jurors' assessment of guilt in a criminal trial. Potential jurors (N = 330) read a vignette summarizing a trial, with some versions of the vignette including lie detection evidence indicating that the defendant was lying about having committed the crime. Lie detector evidence was based on evidence from the polygraph, fMRI (functional brain imaging), or thermal facial imaging. Results showed that fMRI lie detection evidence led to more guilty verdicts than lie detection evidence based on polygraph evidence, thermal facial imaging, or a control condition that did not include lie detection evidence. However, when the validity of the fMRI lie detection evidence was called into question on cross-examination, guilty verdicts were reduced to the level of the control condition. These results provide important information about the influence of lie detection evidence in legal settings. Copyright © 2011 John Wiley & Sons, Ltd.
Purves, Murray; Parkes, David
2016-05-01
Three atmospheric dispersion models--DIFFAL, HPAC, and HotSpot--of differing complexities have been validated against the witness plate deposition dataset taken during the Full-Scale Radiological Dispersal Device (FSRDD) Field Trials. The small-scale nature of these trials in comparison to many other historical radiological dispersion trials provides a unique opportunity to evaluate the near-field performance of the models considered. This paper performs validation of these models using two graphical methods of comparison: deposition contour plots and hotline profile graphs. All of the models tested are assessed to perform well, especially considering that previous model developments and validations have been focused on larger-scale scenarios. Of the models, HPAC generally produced the most accurate results, especially at locations within ∼100 m of GZ. Features present within the observed data, such as hot spots, were not well modeled by any of the codes considered. Additionally, it was found that an increase in the complexity of the meteorological data input to the models did not necessarily lead to an improvement in model accuracy; this is potentially due to the small-scale nature of the trials.
Costa, Marlene Coelho da; Eckert, Gabriela Unchalo; Fortes, Barbara Gastal Borges; Fortes Filho, João Borges; Silveira, Rita C; Procianoy, Renato S
2013-01-01
Ophthalmologic examination for retinopathy of prematurity is a painful procedure. Pharmacological and non-pharmacological interventions have been proposed to reduce pain during eye examinations. This study aims to evaluate the analgesic effect of 25% glucose using a validated pain scale during the first eye examination for retinopathy of prematurity in preterm infants with birth weight <1,500 g and/or gestational age <32 weeks. A masked, randomized clinical trial for one dose of 1 ml of oral 25% glucose solution 2 minutes before the first ophthalmologic examination for retinopathy of prematurity was conducted between March 2008 and April 2010. The results were compared to those of a control group that did not receive oral glucose solution. Pain was evaluated using a Neonatal Infant Pain Scale immediately before and immediately after the ophthalmologic examination in both groups. Clinicaltrials.gov: NCT00648687 One hundred and twenty-four patients who were examined for the first time for retinopathy of prematurity were included. Seventy were included in the intervention group and 54 in the control group. The number of patients with pain immediately before the procedure was similar in both groups. The number of patients with pain after ophthalmologic examination was 15.7% in the intervention group and 68.5% in the control group (p<0.001). One ml of oral 25% glucose solution given 2 minutes before an ophthalmologic examination for retinopathy of prematurity was an effective measure for pain relief.
Developing scale for colleague solidarity among nurses in Turkey.
Uslusoy, Esin Cetinkaya; Alpar, Sule Ecevit
2013-02-01
There is a need for an appropriate instrument to measure colleague solidarity among nurses. This study was carried out to develop a Colleague Solidarity of Nurses' Scale (CSNS). This study was planned to be descriptive and methodological. The CSNS examined content validity, construct validity, test-retest reliability and internal consistency reliability. The trial form of the CSNS, which was composed of 44 items, was given to 200 nurses, followed by validity and reliability analyses. Following the analyses, 21 items were excluded from the scale, leaving an attitude scale made up of 23 items. Factor analysis of the data showed that the scale has a three sub-factor structure: emotional solidarity, academic solidarity and negative opinions about solidarity. The Cronbach's alpha reliability of the whole scale was 0.80. This study provides evidence that the CSNS possesses robust solidarity among nurses. © 2013 Wiley Publishing Asia Pty Ltd.
Mohammadifard, Noushin; Sajjadi, Firouzeh; Maghroun, Maryam; Alikhasi, Hassan; Nilforoushzadeh, Farzaneh; Sarrafzadegan, Nizal
2015-03-01
Dietary assessment is the first step of dietary modification in community-based interventional programs. This study was performed to validate a simple food frequency questionnaire (SFFQ) for assessment of selected food items in epidemiological studies with a large sample size as well as community trails. This validation study was carried out on 264 healthy adults aged ≥ 41 years old living in 3 district central of Iran, including Isfahan, Najafabad, and Arak. Selected food intakes were assessed using a 48-item food frequency questionnaire (FFQ). The FFQ was interviewer-administered, which was completed twice; at the beginning of the study and 2 weeks thereafter. The validity of this SFFQ was examined compared to estimated amount by single 24 h dietary recall and 2 days dietary record. Validation of the FFQ was determined using Spearman correlation coefficients between daily frequency consumption of food groups as assessed by the FFQ and the qualitative amount of daily food groups intake accessed by dietary reference method was applied to evaluate validity. Intraclass correlation coefficients (ICC) were used to determine the reproducibility. Spearman correlation coefficient between the estimated amount of food groups intake by examined and reference methods ranged from 0.105 (P = 0.378) in pickles to 0.48 (P < 0.001) in plant protein. ICC for reproducibility of FFQ were between 0.47-0.69 in different food groups (P < 0.001). The designed SFFQ has a good relative validity and reproducibility for assessment of selected food groups intake. Thus, it can serve as a valid tool in epidemiological studies and clinical trial with large participants.
Clarke, Diana E; Wilcox, Holly C; Miller, Leslie; Cullen, Bernadette; Gerring, Joan; Greiner, Lisa H; Newcomer, Alison; McKitty, Mellisha V; Regier, Darrel A; Narrow, William E
2014-06-01
The Diagnostic and Statistical Manual of Mental Disorders, Fifth Edition (DSM-5) contains criteria for psychiatric diagnoses that reflect advances in the science and conceptualization of mental disorders and address the needs of clinicians. DSM-5 also recommends research on dimensional measures of cross-cutting symptoms and diagnostic severity, which are expected to better capture patients' experiences with mental disorders. Prior to its May 2013 release, the American Psychiatric Association (APA) conducted field trials to examine the feasibility, clinical utility, reliability, and where possible, the validity of proposed DSM-5 diagnostic criteria and dimensional measures. The methods and measures proposed for the DSM-5 field trials were pilot tested in adult and child/adolescent clinical samples, with the goal to identify and correct design and procedural problems with the proposed methods before resources were expended for the larger DSM-5 Field Trials. Results allowed for the refinement of the protocols, procedures, and measures, which facilitated recruitment, implementation, and completion of the DSM-5 Field Trials. These results highlight the benefits of pilot studies in planning large multisite studies. Copyright © 2013, American Psychiatric Association. All rights reserved.
Forbess, L J; Bresee, C; Wallace, D J; Weisman, M H
2017-08-01
Background Our primary goal was to create an outcome change score index similar to a standard rheumatoid arthritis (RA) model utilizing real-world data in systemic lupus erythematosus (SLE) patients that occurred during their phase 3 trials with a Food and Drug Administration-approved drug. Methods We utilized raw data from trials of belimumab for the treatment of SLE. Data were split 80/20 into training/validation sets. Index variables present in a majority of patients and with face validity were selected. Variables were scored for each patient as percentage improvement from baseline after one year. The percentage of placebo- and drug-treated patients considered improved after the application of various criteria was ascertained. Logistic regression was employed to determine the ability of the new index to predict treatment assignment. Results A total of 1693 subjects had data for analyses. Eight variables were chosen: arthritis, rash, physician global assessment, fatigue, anti-double stranded DNA antibodies, C3, C4 and C-reactive protein. In the training dataset, ≥20% improvement in ≥4 of eight variables produced the largest difference between placebo- and drug-treated patients (22.1%) with an acceptable rate of improved placebo-treated patients (25%). This resulted in an odds ratio for belimumab (10 mg/kg) vs placebo of 2.7 (95% CI: 2.0-3.6; p < 0.001). However, in the validate dataset the odds ratio was not significant at 1.3 (95% CI: 0.8-2.2; p = 0.863). Conclusions The index created from training data did not achieve statistical significance when tested in the validation set. We have speculated why this happened. Is the lack of success of therapeutics for SLE caused by ineffective medications, study design and outcome instruments that fail to inform us, or is the heterogeneity of the disease too daunting? The lessons learned here can help direct future endeavors intended to improve SLE outcome instruments.
Deputy, Nicholas P; Nguyen, Phuong H; Pham, Hoa; Nguyen, Son; Neufeld, Lynnette; Martorell, Reynaldo; Ramakrishnan, Usha
2017-01-11
Accurate estimation of gestational age is important for both clinical and public health purposes. Estimates of gestational age using fetal ultrasound measurements are considered most accurate but are frequently unavailable in low- and middle-income countries. The objective of this study was to assess the validity of last menstrual period and Farr neonatal examination estimates of gestational age, compared to ultrasound estimates, in a large cohort of women in Vietnam. Data for this analysis come from a randomized, placebo-controlled micronutrient supplementation trial in Vietnam. We analyzed 912 women with ultrasound and prospectively-collected last menstrual period estimates of gestational age and 685 women with ultrasound and Farr estimates of gestational age. We used the Wilcoxon signed rank sum test to assess differences in gestational age estimated by last menstrual period or Farr examination compared to ultrasound and computed the intraclass correlation coefficient (ICC) and concordance correlation coefficient (CCC) to quantify agreement between methods. We computed the Kappa coefficient (κ) to quantify agreement in preterm, term and post-term classification. The median gestational age estimated by ultrasound was 273.9 days. Gestational age was slightly overestimated by last menstrual period (median 276.0 days, P < 0.001) and more greatly overestimated by Farr examination (median 286.7 days, P < 0.001). Gestational age estimates by last menstrual period and ultrasound were moderately correlated (ICC = 0.78) and concordant (CCC = 0.63), whereas gestational age estimates by Farr examination and ultrasound were weakly correlated (ICC = 0.26) and concordant (CCC = 0.05). Last menstrual period and ultrasound estimates of gestational age were within ± 14 days for 88.4% of women; Farr and ultrasound estimates were within ± 14 days for 55.8% of women. Last menstrual period and ultrasound estimates of gestational age had higher agreement in term classification (κ = 0.41) than Farr and ultrasound (κ = 0.05). In this study of women in Vietnam, we found last menstrual period provided a more accurate estimate of gestational age than the Farr examination when compared to ultrasound. These findings provide useful information about the utility and accuracy of different methods to estimate gestational age and suggest last menstrual period may be preferred over Farr examination in settings where ultrasound is unavailable. The trial was registered at ClinicalTrials.Gov as NCT01665378 on August 13, 2012.
Biomarkers and surrogate endpoints in glaucoma clinical trials.
Medeiros, Felipe A
2015-05-01
Surrogate endpoints are often used as replacements for true clinically relevant endpoints in several areas of medicine, as they enable faster and less expensive clinical trials. However, without proper validation, the use of surrogates may lead to incorrect conclusions about the efficacy and safety of treatments. This article reviews the general requirements for validating surrogate endpoints and provides a critical assessment of the use of intraocular pressure (IOP), visual fields, and structural measurements of the optic nerve as surrogate endpoints in glaucoma clinical trials. A valid surrogate endpoint must be able to predict the clinically relevant endpoint and fully capture the effect of an intervention on that endpoint. Despite its widespread use in clinical trials, no proper validation of IOP as a surrogate endpoint has ever been conducted for any class of IOP-lowering treatments. Evidence has accumulated with regard to the role of imaging measurements of optic nerve damage as surrogate endpoints in glaucoma. These measurements are predictive of functional losses in the disease and may explain, at least in part, treatment effects on clinically relevant endpoints. The use of composite endpoints in glaucoma trials may overcome weaknesses of the use of structural or functional endpoints in isolation. Unless research is dedicated to fully develop and validate suitable endpoints that can be used in glaucoma clinical trials, we run the risk of inappropriate judgments about the value of new therapies. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions.
Statistically Valid Planting Trials
C. B. Briscoe
1961-01-01
More than 100 million tree seedlings are planted each year in Latin America, and at least ten time'that many should be planted Rational control and development of a program of such magnitude require establishing and interpreting carefully planned trial plantings which will yield statistically valid answers to real and important questions. Unfortunately, many...
Ethical, legal, and social issues related to genomics and cancer research: the impending crisis.
Ellerin, Bruce E; Schneider, Robert J; Stern, Arnold; Toniolo, Paolo G; Formenti, Silvia C
2005-11-01
Cancer research is a multibillion-dollar enterprise validated by the clinical trial process and increasingly defined by genomics. The continued success of the endeavor depends on the smooth functioning of the clinical trial system, which in turn depends on human subject participation. Yet human subject participation can exist only in an atmosphere of trust between research participants and research sponsors, and the advent of genomics has raised a multitude of ethical, legal, and social issues that threaten this trust. The authors examine 6 of these issues: (1) informed consent; (2) privacy, confidentiality, and family disclosure dilemmas; (3) property rights in genomic discoveries; (4) individual and institutional conflicts of interest; (5) insurance and employment issues; and (6) litigation under the federal False Claims Act. The authors conclude that failure to resolve these issues may lead to a sufficient impairment of trust in genomics-based clinical trials on the part of potential research participants that the clinical trial system may implode for lack of willing participants, thus threatening the future of cancer research.
Long, Blaine C; Jutte, Lisa S; Knight, Kenneth L
2010-01-01
Thermocouples and electrothermometers are used in therapeutic modality research. Until recently, researchers assumed that these instruments were valid and reliable. To examine 3 different thermocouple types in 5 degrees C, 15 degrees C, 18.4 degrees C, 25 degrees C, and 35 degrees C water baths. Randomized controlled trial. Therapeutic modality laboratory. Eighteen thermocouple leads were inserted through the wall of a foamed polystyrene cooler. The cooler was filled with water. Six thermocouples (2 of each model) were plugged into the 6 channels of the Datalogger and 6 randomly selected channels in the 2 Iso-Thermexes. A mercury thermometer was immersed into the water and was read every 10 seconds for 4 minutes during each of 6 trials. The entire process was repeated for each of 5 water bath temperatures (5 degrees C, 15 degrees C, 18.4 degrees C, 25 degrees C, 35 degrees C). Temperature and absolute temperature differences among 3 thermocouple types (IT-21, IT-18, PT-6) and 3 electrothermometers (Datalogger, Iso-Thermex calibrated from -50 degrees C to 50 degrees C, Iso-Thermex calibrated from -20 degrees C to 80 degrees C). Validity and reliability were dependent on thermocouple type, electrothermometer, and water bath temperature (P < .001; modified Levene P < .05). Statistically, the IT-18 and PT-6 thermocouples were not reliable in each electrothermometer; however, these differences were not practically different from each other. The PT-6 thermocouples were more valid than the IT-18s, and both thermocouple types were more valid than the IT-21s, regardless of water bath temperature (P < .001). The validity and reliability of thermocouples interfaced to an electrothermometer under experimental conditions should be tested before data collection. We also recommend that investigators report the validity, the reliability, and the calculated uncertainty (validity + reliability) of their temperature measurements for therapeutic modalities research. With this information, investigators and clinicians will be better able to interpret and compare results and conclusions.
Ziconotide Monotherapy: A Systematic Review of Randomised Controlled Trials
Brookes, Morag E.; Eldabe, Sam; Batterham, Alan
2017-01-01
INTRODUCTION Chronic neuropathic pain is difficult to treat and is often refractory to most modalities of treatment. Ziconotide is a novel, potent, non-opioid, calcium channel blocking agent which has been shown in clinical trials to be effective in treating chronic neuropathic pain. METHODS EMBASE, MEDLINE, CINAHL Plus and Web of Science electronic databases were searched for English language studies. Reference sections of articles were examined for further papers and the manufacturer of ziconotide was contacted for further unpublished data. Three randomised controlled trials in ziconotide monotherapy were included and subjected to a random effects meta-analysis. RESULTS All three studies used the similar main outcome measure (visual analogue scale of pain intensity; VASPI) and were therefore comparable. A Jadad score was performed for each paper. Frequent serious adverse events (SAEs) were observed which resulted in two of the studies revising the protocol. The meta-analysis revealed a pooled odds ratio (responders on ziconotide vs. placebo) of 2.77 (95% CI, 1.37 to 5.59). DISCUSSION The results suggest that ziconotide is beneficial for pain reduction in chronic neuropathic pain. However, there remain some methodological issues that may call into question the validity of the results. It is evident that more work needs to be conducted to further validate the efficacy of ziconotide and to discover new areas of use. PMID:26861472
Stamenovic, Milorad; Dobraca, Amra; Smajlovic, Mersiha
2018-01-01
The aim of this paper is to present the marketing strategy and the application of management (marketing management) and advertising in order to increase the efficiency of innovative approach in clinical trials that include and involve the use of new technologies and transfer of technologies. This paper has a descriptive character and represents a narrative review of the literature and new model implementation. Marketing models are primarily used to improve the inclusion of a larger (and appropriate) number of patients, but they can be credited for the stay and monitoring of patients in the trial. Regulatory mechanisms play an important role in the application of various marketing strategies within clinical trials. The value for the patient as the most important stakeholder is defined in the field of clinical trials according to Kotler's value model for the consumer. In order to achieve the best results it is important to adequately examine all the elements of clinical trials and apply this knowledge in creation of a marketing plan that will be made in accordance with the legal regulations defined globally and locally. In this paper, two challenges have been highlighted for the adequate application of marketing tools in the field of clinical trials, namely: defining business elements in order to provide an adequate marketing approach for clinical trials and technology transfer and ensuring uniformity and regulatory affirmation of marketing attitudes in clinical trials in all regions in which they are carried out in accordance with ICH-GCP and valid regulations.
Kim, Youl-Ri; Tyrer, Peter; Lee, Hong-Seock; Kim, Sung-Gon; Hwang, Soon-Taek; Lee, Gi Young; Mulder, Roger
2015-11-01
This field trial examines the discriminant validity of five trait domains of the originally proposed research algorithm for diagnosing International Classification of Diseases (ICD)-11 personality disorders. This trial was carried out in South Korea where a total of 124 patients with personality disorder participated in the study. Participants were assessed using originally proposed monothetic trait domains of asocial-schizoid, antisocial-dissocial, anxious-dependent, emotionally unstable and anankastic-obsessional groups of the research algorithm in ICD-11. Their assessments were compared to those from the Personality Assessment Schedule interview, and the five-factor model (FFM). A total of 48.4% of patients were found to have pathology in two or more domains. In the discriminant analysis, 64.2% of the grouped cases of the originally proposed ICD-11 domains were correctly classified by the five domain categories using the Personality Assessment Schedule, with the highest accuracy in the anankastic-obsessional domain and the lowest accuracy in the emotionally unstable domain. In comparison, the asocial-schizoid, anxious-dependent and the emotionally unstable domains were moderately correlated with the FFM, whereas the anankastic-obsessional or antisocial-dissocial domains were not significantly correlated with the FFM. In this field trial, we demonstrated the limited discriminant and the convergent validities of the originally proposed trait domains of the research algorithm for diagnosing ICD-11 personality disorder. The results suggest that the anankastic, asocial and dissocial domains show good discrimination, whereas the anxious-dependent and emotionally unstable ones overlap too much and have been subsequently revised. Copyright © 2015 John Wiley & Sons, Ltd.
Francis, Jillian J; Wileman, Samantha M; Bekker, Hilary; Barton, Garry R; Ramsay, Craig R
2009-12-01
Within a trial of medical and surgical treatments for gastro-esophageal reflux disease (GORD), involving randomised arms and preference arms, we tested the applicability of the Beliefs about Medicines Questionnaire (BMQ) and developed and tested the validity of a new Beliefs about Surgery Questionnaire (BSQ). Patients with GORD (N = 43) were interviewed to elicit their beliefs about medical and surgical treatments. These contributed to the development of BSQ items. The BMQ and BSQ were completed by trial participants at baseline (randomised trial: N = 325; preference trial: N = 414). Factor analysis and discriminant function analysis were used to assess validity. Principal components analysis (PCA) largely replicated the four-factor BMQ structure. PCA of the combined BMQ/BSQ yielded six factors explaining 54.5% variance. BSQ items loaded onto distinct factors, demonstrating divergence from BMQ. As predicted, BMQ/BSQ scores enabled correct classification of 78.5% of participants to medication and surgery groups in the preference trial (chi(2)(6) = 205.9, p < 0.001) but only 54.5% (no better than chance) in the randomised trial (chi(2)(6) = 9.4, p = 0.154). The BSQ is a valid measure of perceptions about surgical treatments for GORD. With the BMQ, it provides information that may guide patients' choices about treatment. This measure may be applicable to other conditions.
[Defining an ethics for preventive trials].
Chippaux, J-P
2008-04-01
Preventive trials (to prevent from infection) or prophylaxis trials (to avoid consequences of the disease) differ from other clinical trials as they apply to healthy subjects or subjects considering themselves as such: the latter do not ask for intervention even less for trial. Moreover, it is generally an experiment which aims at validating a public health intervention, the individual character of which could appear as secondary regarding the collective interest. It concerns many tools or methods: preventive or prophylactic vaccines and drugs, condoms, impregnated bed nets, etc. The field of implementation of preventive trials is large and covers routine immunization (EPI), large-scale control or eradication of endemic diseases or epidemics, for which the concept of individual risk is generally better understood. Preventive trials imply ethical obligations (high individual or collective benefits and absence of risks as there is no immediate therapeutic compensation), methodological adaptations (because the number of subjects is considerably larger than for therapeutic trials) and a sensitive valorization towards a large population who is not asking for the recommended intervention. As regard the benefits, it is also necessary to consider the costs in comparison with the expected efficacy The methodological constraints are important because the demonstration of both safety and efficacy requires a very large number of subjects to validate the product. It is often necessary to use indirect or substitutive markers and indicators (title of protective antibodies rather than definite clinical protection) which need a preliminary validation. Before carrying out a preventive or prophylactic trial, it is advisable to specify the objectives in order to assess the real profits and absence of risks during the trial and after the implementation of the tested product. Preventive trials require a phase of technological transfer to guarantee the application of the validated tools for the benefit of the population at stake. In this respect, if trials for prevention are now well codified both on ethical and methodological aspects, trials for prophylaxis (filariasis with ivermectin, schistosomiasis with praziquantel, malaria with intermittent "preventive" treatment or HIV with antiretroviral treatment, for example) still remain a difficult issue at both ethical and methodological levels.
Statistical validity of using ratio variables in human kinetics research.
Liu, Yuanlong; Schutz, Robert W
2003-09-01
The purposes of this study were to investigate the validity of the simple ratio and three alternative deflation models and examine how the variation of the numerator and denominator variables affects the reliability of a ratio variable. A simple ratio and three alternative deflation models were fitted to four empirical data sets, and common criteria were applied to determine the best model for deflation. Intraclass correlation was used to examine the component effect on the reliability of a ratio variable. The results indicate that the validity, of a deflation model depends on the statistical characteristics of the particular component variables used, and an optimal deflation model for all ratio variables may not exist. Therefore, it is recommended that different models be fitted to each empirical data set to determine the best deflation model. It was found that the reliability of a simple ratio is affected by the coefficients of variation and the within- and between-trial correlations between the numerator and denominator variables. It was recommended that researchers should compute the reliability of the derived ratio scores and not assume that strong reliabilities in the numerator and denominator measures automatically lead to high reliability in the ratio measures.
Subtyping adolescents with bulimia nervosa.
Chen, Eunice Y; Le Grange, Daniel
2007-12-01
Cluster analyses of eating disorder patients have yielded a "dietary-depressive" subtype, typified by greater negative affect, and a "dietary" subtype, typified by dietary restraint. This study aimed to replicate these findings in an adolescent sample with bulimia nervosa (BN) from a randomized controlled trial and to examine the validity and reliability of this methodology. In the sample of BN adolescents (N=80), cluster analysis revealed a "dietary-depressive" subtype (37.5%) and a "dietary" subtype (62.5%) using the Beck Depression Inventory, Rosenberg Self-Esteem Scale and Eating Disorder Examination Restraint subscale. The "dietary-depressive" subtype compared to the "dietary" subtype was significantly more likely to: (1) report co-occurring disorders, (2) greater eating and weight concerns, and (3) less vomiting abstinence at post-treatment (all p's<.05). The cluster analysis based on "dietary" and "dietary-depressive" subtypes appeared to have concurrent validity, yielding more distinct groups than subtyping by vomiting frequency. In order to assess the reliability of the subtyping scheme, a larger sample of adolescents with mixed eating and weight disorders in an outpatient eating disorder clinic (N=149) was subtyped, yielding similar subtypes. These results support the validity and reliability of the subtyping strategy in two adolescent samples.
Further Validation of the Pathways Housing First Fidelity Scale.
Goering, Paula; Veldhuizen, Scott; Nelson, Geoffrey B; Stefancic, Ana; Tsemberis, Sam; Adair, Carol E; Distasio, Jino; Aubry, Tim; Stergiopoulos, Vicky; Streiner, David L
2016-01-01
This study examined whether Housing First fidelity ratings correspond to program operation descriptions from administrative data and predict client outcomes. A multisite, randomized controlled trial (At Home/Chez Soi) in five Canadian cities included two assessments of 12 programs over two years. Outcomes for 1,158 clients were measured every six months. Associations between fidelity ratings and administrative data (Spearman correlations) and participant outcomes (mixed-effects modeling) were examined. Fidelity ratings were generally good (mean ± SD=136.6 ± 10.3 out of a possible range of 38-152; 87% of maximum value). Fidelity was significantly associated with three of four measures of program operation, with correlations between .55 and .60. Greater program fidelity was associated with improvement in housing stability, community functioning, and quality of life. Variation in program fidelity was associated with operations and outcomes, supporting scale validity and intervention effectiveness. These findings reinforced the value of using fidelity monitoring to conduct quality assurance and technical assistance activities.
Model Validation Against The Modelers’ Data Archive
2014-08-01
completion of the planned Jack Rabbit 2 field trials. The relevant task for the effort addressed here is Task 4 of the current Interagency Agreement, as...readily simulates the Prairie Grass sulfur dioxide plumes. Also, Jack Rabbit II field trials are set to be completed during FY16. Once these data are...available, they will also be used to validate the combined models. This validation may prove to be more useful, as the Jack Rabbit II will release
The effects of anxiety upon attention allocation to affective stimuli.
Waters, Allison M; Nitz, Ashley B; Craske, Michelle G; Johnson, Chris
2007-04-01
Pictures of emotionally aversive, neutral, and pleasant scenes were presented for 500 ms, followed by a probe presented in the same location (valid trials) or an alternate location (invalid trials) as the picture. Response-times to the probes were recorded in low (N=20) and high (N=27) trait anxious participants. Results revealed an overall negative cue validity effect of shorter reaction times (RTs) on invalid than valid trials, suggestive of an inhibition of return effect. Moreover, high trait anxious females showed a reduced negative cue validity effect for aversive pictures in comparison with neutral and pleasant pictures, suggestive of selective interference by the unpleasant material. By contrast, low trait anxious females showed an enhanced negative cue validity effect for aversive pictures relative to neutral and pleasant pictures, suggestive of attentional avoidance of the aversive content. The emotional content of picture cues did not significantly affect RTs in males, regardless of anxiety status. The results suggest that biased attention processes for aversive stimuli may contribute to the greater female propensity for anxiety disorders.
Arjona, Antonio; Gómez, Carlos M.
2011-01-01
Background Preparatory activity based on a priori probabilities generated in previous trials and subjective expectancies would produce an attentional bias. However, preparation can be correct (valid) or incorrect (invalid) depending on the actual target stimulus. The alternation effect refers to the subjective expectancy that a target will not be repeated in the same position, causing RTs to increase if the target location is repeated. The present experiment, using the Posner's central cue paradigm, tries to demonstrate that not only the credibility of the cue, but also the expectancy about the next position of the target are changedin a trial by trial basis. Sequences of trials were analyzed. Results The results indicated an increase in RT benefits when sequences of two and three valid trials occurred. The analysis of errors indicated an increase in anticipatory behavior which grows as the number of valid trials is increased. On the other hand, there was also an RT benefit when a trial was preceded by trials in which the position of the target changed with respect to the current trial (alternation effect). Sequences of two alternations or two repetitions were faster than sequences of trials in which a pattern of repetition or alternation is broken. Conclusions Taken together, these results suggest that in Posner's central cue paradigm, and with regard to the anticipatory activity, the credibility of the external cue and of the endogenously anticipated patterns of target location are constantly updated. The results suggest that Bayesian rules are operating in the generation of anticipatory activity as a function of the previous trial's outcome, but also on biases or prior beliefs like the “gambler fallacy”. PMID:21698164
Perel, P; Prieto-Merino, D; Shakur, H; Roberts, I
2013-06-01
Severe bleeding accounts for about one-third of in-hospital trauma deaths. Patients with a high baseline risk of death have the most to gain from the use of life-saving treatments. An accurate and user-friendly prognostic model to predict mortality in bleeding trauma patients could assist doctors and paramedics in pre-hospital triage and could shorten the time to diagnostic and life-saving procedures such as surgery and tranexamic acid (TXA). The aim of the study was to develop and validate a prognostic model for early mortality in patients with traumatic bleeding and to examine whether or not the effect of TXA on the risk of death and thrombotic events in bleeding adult trauma patients varies according to baseline risk. Multivariable logistic regression and risk-stratified analysis of a large international cohort of trauma patients. Two hundred and seventy-four hospitals in 40 high-, medium- and low-income countries. We derived prognostic models in a large placebo-controlled trial of the effects of early administration of a short course of TXA [Clinical Randomisation of an Antifibrinolytic in Significant Haemorrhage (CRASH-2) trial]. The trial included 20,127 trauma patients with, or at risk of, significant bleeding, within 8 hours of injury. We externally validated the model on 14,220 selected trauma patients from the Trauma Audit and Research Network (TARN), which included mainly patients from the UK. We examined the effect of TXA on all-cause mortality, death due to bleeding and thrombotic events (fatal and non-fatal myocardial infarction, stroke, deep-vein thrombosis and pulmonary embolism) within risk strata in the CRASH-2 trial data set and we estimated the proportion of premature deaths averted by applying the odds ratio (OR) from the CRASH-2 trial to each of the risk strata in TARN. For the stratified analysis according baseline risk we considered the intervention TXA (1 g over 10 minutes followed by 1 g over 8 hours) or matching placebo. For the prognostic models we included predictors for death in hospital within 4 weeks of injury. For the stratified analysis we reported ORs for all causes of death, death due to bleeding, and fatal and non-fatal thrombotic events associated with the use of TXA according to baseline risk. A total of 3076 (15%) patients died in the CRASH-2 trial and 1705 (12%) in the TARN data set. Glasgow Coma Scale score, age and systolic blood pressure were the strongest predictors of mortality. Discrimination and calibration were satisfactory, with C-statistics > 0.80 in both CRASH-2 trial and TARN data sets. A simple chart was constructed to readily provide the probability of death at the point of care, while a web-based calculator is available for a more detailed risk assessment. TXA reduced all-cause mortality and death due to bleeding in each stratum of baseline risk. There was no evidence of heterogeneity in the effect of TXA on all-cause mortality (p-value for interaction = 0.96) or death due to bleeding (p= 0.98). There was a significant reduction in the odds of fatal and non-fatal thrombotic events with TXA (OR = 0.69, 95% confidence interval 0.53 to 0.89; p= 0.005). There was no evidence of heterogeneity in the effect of TXA on the risk of thrombotic events (p= 0.74). This prognostic model can be used to obtain valid predictions of mortality in patients with traumatic bleeding. TXA can be administered safely to a wide spectrum of bleeding trauma patients and should not be restricted to the most severely injured. Future research should evaluate whether or not the use of this prognostic model in clinical practice has an impact on the management and outcomes of trauma patients.
ERIC Educational Resources Information Center
Spillane, James P.; Pareja, Amber Stitziel; Dorner, Lisa; Barnes, Carol; May, Henry; Huff, Jason; Camburn, Eric
2010-01-01
In this paper we described how we mixed research approaches in a Randomized Control Trial (RCT) of a school principal professional development program. Using examples from our study we illustrate how combining qualitative and quantitative data can address some key challenges from validating instruments and measures of mediator variables to…
Ethnographic methods for process evaluations of complex health behaviour interventions.
Morgan-Trimmer, Sarah; Wood, Fiona
2016-05-04
This article outlines the contribution that ethnography could make to process evaluations for trials of complex health-behaviour interventions. Process evaluations are increasingly used to examine how health-behaviour interventions operate to produce outcomes and often employ qualitative methods to do this. Ethnography shares commonalities with the qualitative methods currently used in health-behaviour evaluations but has a distinctive approach over and above these methods. It is an overlooked methodology in trials of complex health-behaviour interventions that has much to contribute to the understanding of how interventions work. These benefits are discussed here with respect to three strengths of ethnographic methodology: (1) producing valid data, (2) understanding data within social contexts, and (3) building theory productively. The limitations of ethnography within the context of process evaluations are also discussed.
Are You "Tilting at Windmills" or Undertaking a Valid Clinical Trial?
Zariffa, Jose; Kramer, John L.K.
2011-01-01
In this review, several aspects surrounding the choice of a therapeutic intervention and the conduct of clinical trials are discussed. Some of the background for why human studies have evolved to their current state is also included. Specifically, the following questions have been addressed: 1) What criteria should be used to determine whether a scientific discovery or invention is worthy of translation to human application? 2) What recent scientific advance warrants a deeper understanding of clinical trials by everyone? 3) What are the different types and phases of a clinical trial? 4) What characteristics of a human disorder should be noted, tracked, or stratified for a clinical trial and what inclusion /exclusion criteria are important to enrolling appropriate trial subjects? 5) What are the different study designs that can be used in a clinical trial program? 6) What confounding factors can alter the accurate interpretation of clinical trial outcomes? 7) What are the success rates of clinical trials and what can we learn from previous clinical trials? 8) What are the essential principles for the conduct of valid clinical trials? PMID:21786433
A Monte Carlo analysis of breast screening randomized trials.
Zamora, Luis I; Forastero, Cristina; Guirado, Damián; Lallena, Antonio M
2016-12-01
To analyze breast screening randomized trials with a Monte Carlo simulation tool. A simulation tool previously developed to simulate breast screening programmes was adapted for that purpose. The history of women participating in the trials was simulated, including a model for survival after local treatment of invasive cancers. Distributions of time gained due to screening detection against symptomatic detection and the overall screening sensitivity were used as inputs. Several randomized controlled trials were simulated. Except for the age range of women involved, all simulations used the same population characteristics and this permitted to analyze their external validity. The relative risks obtained were compared to those quoted for the trials, whose internal validity was addressed by further investigating the reasons of the disagreements observed. The Monte Carlo simulations produce results that are in good agreement with most of the randomized trials analyzed, thus indicating their methodological quality and external validity. A reduction of the breast cancer mortality around 20% appears to be a reasonable value according to the results of the trials that are methodologically correct. Discrepancies observed with Canada I and II trials may be attributed to a low mammography quality and some methodological problems. Kopparberg trial appears to show a low methodological quality. Monte Carlo simulations are a powerful tool to investigate breast screening controlled randomized trials, helping to establish those whose results are reliable enough to be extrapolated to other populations and to design the trial strategies and, eventually, adapting them during their development. Copyright © 2016 Associazione Italiana di Fisica Medica. Published by Elsevier Ltd. All rights reserved.
Bessing, Barnabas; Bauld, Linda; Sinclair, Lesley; Mackay, Daniel F; Spence, William; Tappin, David M
2016-08-26
The limited representativeness of trial samples may restrict external validity. The aim of this study was to ascertain the representativeness of the population enrolled in the Cessation in Pregnancy Incentives Trial (CPIT), a therapeutic exploratory study to examine the effectiveness of financial incentives for smoking cessation during pregnancy. CPIT participants (n = 492) were compared with all self-reported smokers at maternity booking who did not participate in the trial (n = 1982). Both groups were drawn from the National Health Service (NHS) Greater Glasgow and Clyde area over a 1-year trial enrolment period. Variables used for comparison were age, area-based deprivation index, body mass index, gestation, and carbon monoxide (CO) breath test level. Chi-square and Mann-Whitney U tests were used to compare groups. From January to December 2012, 2474/13,945 (17.7 %) women, who booked for maternity care, self-reported as current smokers (at least one cigarette in the last week). Seven hundred and fifty-two were ineligible for trial participation because of a CO breath test level of less than 7 parts per million (ppm) used as a biochemical cut-off to corroborate self-report of current smoking. At telephone consent 301 could not be contacted, 11 had miscarried, 16 did not give consent and 3 opted out after randomisation, leaving 492 participants for analysis. There were no differences in demographic or clinical characteristics between trial participants, and self-reported smokers not enrolled in the trial in terms of CO breath test (as a measure of smoking level for those with a CO level of 7 ppm or higher), material deprivation (using an area-based measure), maternal age and maternal body mass index. Gestation at booking was statistically significantly lower for participants. To ensure that all trial participants were smokers, biochemical validation excluded self-reported smokers with a CO level of less than 7 ppm from taking part in the trial, which excluded 30 % of self-reported smokers who were 'lighter' smokers. The efficacy of financial incentives would not have been likely to decrease if 'lighter' smokers had been included in the trial population. Trial participants were slightly earlier in their pregnancy at maternity booking, but this difference would not clinically affect the provision of financial incentives if provided routinely. Overall, the trial population was representative of all self-reported smokers with regard to available routinely collected data. Appropriate comparison of trial and target populations, with detailed reporting of exclusion criteria would contribute to the understanding of the wider applicability of trial results. Current Controlled Trials ISRCTN87508788 . Registered/Assigned on 1 September 2011.
MacNeill, Virginia; Foley, Marian; Quirk, Alan; McCambridge, Jim
2016-01-29
The sequence of events in a behaviour change trial involves interactions between research participants and the trial process. Taking part in such a study has the potential to influence the behaviour of the participant, and if it does, this can engender bias in trial outcomes. Since participants' experience has received scant attention, the aim of this study is thus to generate hypotheses about which aspects of the conduct of behaviour change trials might matter most to participants, and thus have potential to alter subsequent behaviours and bias trial outcomes Twenty participants were opportunistically screened for a health compromising behaviour (unhealthy diet, lack of exercise, smoking or alcohol consumption) and recruited if eligible. Semi structured face to face interviews were conducted, after going through the usual processes involved in trial recruitment, baseline assessment and randomisation. Participants were given information on the contents of an intervention or control condition in a behaviour change trial, which was not actually implemented. Three months later they returned to reflect on these experiences and whether they had any effect on their behaviour during the intervening period. Data from the latter interview were analysed thematically using a modified grounded theory approach. The early processes of trial participation raised awareness of unhealthy behaviours, although most reported having had only fleeting intentions to change their behaviour as a result of taking part in this study, in the absence of interventions. However, careful examination of the accounts revealed evidence of subtle research participation effects, which varied according to the health behaviour, and its perceived social acceptability. Participants' relationships with the research study were viewed as somewhat important in stimulating thinking about whether and how to make lifestyle changes. These participants described no dramatic impacts attributable to taking part in this study. This study demonstrates the likely value of well conducted qualitative studies of subtle research participation effects, which may be particularly important to explore for alcohol. Separating unintended influences in trial participation from the effects of behaviour change interventions being evaluated therein is necessary for valid estimates of intervention effects.
Extending the validity of the Feeding Practices and Structure Questionnaire.
Jansen, Elena; Mallan, Kimberley M; Daniels, Lynne A
2015-06-30
Feeding practices are commonly examined as potentially modifiable determinants of children's eating behaviours and weight status. Although a variety of questionnaires exist to assess different feeding aspects, many lack thorough reliability and validity testing. The Feeding Practices and Structure Questionnaire (FPSQ) is a tool designed to measure early feeding practices related to non-responsive feeding and structure of the meal environment. Face validity, factorial validity, internal reliability and cross-sectional correlations with children's eating behaviours have been established in mothers with 2-year-old children. The aim of the present study was to further extend the validity of the FPSQ by examining factorial, construct and predictive validity, and stability. Participants were from the NOURISH randomised controlled trial which evaluated an intervention with first-time mothers designed to promote protective feeding practices. Maternal feeding practices (FP) and child eating behaviours were assessed when children were aged 2 years and 3.7 years (n = 388). Confirmatory Factor analysis, group differences, predictive relationships, and stability were tested. The original 9-factor structure was confirmed when children were aged 3.7 ± 0.3 years. Cronbach's alpha was above the recommended 0.70 cut-off for all factors except Structured Meal Timing, Over Restriction and Distrust in Appetite which were 0.58, 0.67 and 0.66 respectively. Allocated group differences reflected behaviour consistent with intervention content and all feeding practices were stable across both time points (range of r = 0.45-0.70). There was some evidence for the predictive validity of factors with 2 FP showing expected relationships, 2 FP showing expected and unexpected relationships and 5 FP showing no relationship. Reliability and validity was demonstrated for most subscales of the FPSQ. Future validation is warranted with culturally diverse samples and with fathers and other caregivers. The use of additional outcomes to further explore predictive validity is recommended as well as testing test-retest reliability of the questionnaire.
Lee, Eugene; Rosner, Richard; Harmon, Ronnie
2014-07-01
Fitness to Stand Trial is a critical concept in the adjudication of justice-involved persons. A retrospective study was conducted to examine criminal defendants' specific psychiatric symptoms and those symptoms' associations with expert opinions on Competence to Stand Trial. One hundred charts were reviewed: 50 Cases (opined as Not Fit) were compared against 50 Controls (opined as Fit) with respect to ratings on the Brief Psychiatric Rating Scale (BPRS). A significance level of 0.001 was selected a priori. Statistically significant differences were found in seven of the eighteen BPRS symptom constructs (with the highest differences in Conceptual Disorganization and Unusual Thought Content) and two of the four BPRS higher-order syndrome factors (Thinking Disorder and Hostile-Suspiciousness). Consistent with previous reports, psychotic symptoms are found in this study to be inversely associated with Fitness. Validity, reliability, and limitations of this study, as well as directions for future research, are discussed herein. © 2014 American Academy of Forensic Sciences.
Pearson, Matthew R.; Kirouac, Megan; Witkiewitz, Katie
2015-01-01
Background and Aims The terms “binge drinking” and “heavy drinking” are both typically operationalized as 4+/5+ standard drinks per occasion for women/men and are commonly used as a proxy for non-problematic (<4/<5) versus problematic (4+/5+) drinking in multiple research contexts. The Food and Drug Administration in the United States (US) recently proposed the 4+/5+ criterion as a primary efficacy endpoint in their guidance for trials examining new medications for alcohol use disorders (AUDs). Internationally, similar cut-offs have been proposed, with the European Medicines Agency having identified reductions in the number of heavy drinking days (defined as 40/60g pure alcohol in women/men) as a primary endpoint for efficacy trials with a harm reduction goal. Analysis and Evidence We question the validity of the 4+/5+ cutoff (and other similar cutoffs) on multiple accounts. The 4+/5+ cutoff has not been shown to have unique predictive validity or clinical utility. The cutoff has been created based on retrospective self-reports and its use demonstrates ecological bias. Given strong evidence that the relationship between alcohol consumption and problems related to drinking is at least monotonic, if not linear, there is little existing evidence to support the 4+/5+ cutoff as a valid marker of problematic alcohol use. Conclusions There is little empirical evidence for the 4+/5+ units per occasion threshold for “binge” or “heavy” drinking in indexing treatment efficacy. Further consideration of an appropriate threshold seems to be warranted. PMID:27605077
Kashikar-Zuck, Susmita; Flowers, Stacy R.; Claar, Robyn Lewis; Guite, Jessica W.; Logan, Deirdre E.; Lynch-Jordan, Anne M; Palermo, Tonya M.; Wilson, Anna C.
2011-01-01
The Functional Disability Inventory (FDI) is a well-established and commonly used measure of physical functioning and disability in youth with chronic pain. Further validation of the measure has been called for, in particular, examination of the clinical utility and factor structure of the measure. To address this need, we utilized a large multicenter dataset of pediatric patients with chronic pain who had completed the FDI and other measures assessing pain and emotional functioning. Clinical reference points to allow for interpretation of raw scores were developed to enhance clinical utility of the measure and exploratory factor analysis was performed to examine its factor structure. Participants included 1300 youth ages 8 to 18 years (M=14.2 years; 76% female) with chronic pain. Examination of the distribution of FDI scores and validation with measures of depressive symptoms and pain intensity yielded three distinct categories of disability: No/Minimal Disability, Moderate Disability and Severe Disability. Factor analysis of FDI scores revealed a two-factor solution representing vigorous Physical Activities and non-physically strenuous Daily Activities. The three-level classification system and factor structure were further explored via comparison across the four most commonly encountered pain conditions in clinical settings (head, back, abdominal and widespread pain). Our findings provide important new information regarding the clinical utility and validity of the FDI. This will greatly enhance the interpretability of scores for research and clinical use in a wide range of pediatric pain conditions. In particular these findings will facilitate use of the FDI as an outcome measure in future clinical trials. PMID:21458162
Systematic review and meta-analyses of randomized controlled trials examining tinnitus management
Hoare, Derek J; Kowalkowski, Victoria L; Kang, Sujin; Hall, Deborah A
2011-01-01
Objectives/Hypothesis To evaluate the existing level of evidence for tinnitus management strategies identified in the UK Department of Health's Good Practice Guideline. Study Design Systematic review of peer-reviewed literature and meta-analyses. Methods Searches were conducted in PubMed, Cambridge Scientific Abstracts, Web of Science, and EMBASE (earliest to August 2010), supplemented by hand searches in October 2010. Only randomized controlled trials that used validated questionnaire measures of symptoms (i.e., measures of tinnitus distress, anxiety, depression) were included. Results Twenty-eight randomized controlled trials met our inclusion criteria, most of which provide moderate levels of evidence for the effects they reported. Levels of evidence were generally limited by the lack of blinding, lack of power calculations, and incomplete data reporting in these studies. Only studies examining cognitive behavioral therapy were numerous and similar enough to perform meta-analysis, from which the efficacy of cognitive behavioral therapy (moderate effect size) appears to be reasonably established. Antidepressants were the only drug class to show any evidence of potential benefit. Conclusions The efficacy of most interventions for tinnitus benefit remains to be demonstrated conclusively. In particular, high-level assessment of the benefit derived from those interventions most commonly used in practice, namely hearing aids, maskers, and tinnitus retraining therapy needs to be performed. PMID:21671234
Cunningham, Jennifer; Wallston, Kenneth A; Wilkins, Consuelo H; Hull, Pamela C; Miller, Stephania T
2015-12-01
This study describes the development and psychometric evaluation of HPV Clinical Trial Survey for Parents with Children Aged 9 to 15 (CTSP-HPV) using traditional instrument development methods and community engagement principles. An expert panel and parental input informed survey content and parents recommended study design changes (e.g., flyer wording). A convenience sample of 256 parents completed the final survey measuring parental willingness to consent to HPV clinical trial (CT) participation and other factors hypothesized to influence willingness (e.g., HPV vaccine benefits). Cronbach's a, Spearman correlations, and multiple linear regression were used to estimate internal consistency, convergent and discriminant validity, and predictively validity, respectively. Internal reliability was confirmed for all scales (a ≥ 0.70.). Parental willingness was positively associated (p < 0.05) with trust in medical researchers, adolescent CT knowledge, HPV vaccine benefits, advantages of adolescent CTs (r range 0.33-0.42), supporting convergent validity. Moderate discriminant construct validity was also demonstrated. Regression results indicate reasonable predictive validity with the six scales accounting for 31% of the variance in parents' willingness. This instrument can inform interventions based on factors that influence parental willingness, which may lead to the eventual increase in trial participation. Further psychometric testing is warranted. © 2015 Wiley Periodicals, Inc.
Predicting Performance in Higher Education Using Proximal Predictors.
Niessen, A Susan M; Meijer, Rob R; Tendeiro, Jorge N
2016-01-01
We studied the validity of two methods for predicting academic performance and student-program fit that were proximal to important study criteria. Applicants to an undergraduate psychology program participated in a selection procedure containing a trial-studying test based on a work sample approach, and specific skills tests in English and math. Test scores were used to predict academic achievement and progress after the first year, achievement in specific course types, enrollment, and dropout after the first year. All tests showed positive significant correlations with the criteria. The trial-studying test was consistently the best predictor in the admission procedure. We found no significant differences between the predictive validity of the trial-studying test and prior educational performance, and substantial shared explained variance between the two predictors. Only applicants with lower trial-studying scores were significantly less likely to enroll in the program. In conclusion, the trial-studying test yielded predictive validities similar to that of prior educational performance and possibly enabled self-selection. In admissions aimed at student-program fit, or in admissions in which past educational performance is difficult to use, a trial-studying test is a good instrument to predict academic performance.
Practical Aspects of Designing and Conducting Validation Studies Involving Multi-study Trials.
Coecke, Sandra; Bernasconi, Camilla; Bowe, Gerard; Bostroem, Ann-Charlotte; Burton, Julien; Cole, Thomas; Fortaner, Salvador; Gouliarmou, Varvara; Gray, Andrew; Griesinger, Claudius; Louhimies, Susanna; Gyves, Emilio Mendoza-de; Joossens, Elisabeth; Prinz, Maurits-Jan; Milcamps, Anne; Parissis, Nicholaos; Wilk-Zasadna, Iwona; Barroso, João; Desprez, Bertrand; Langezaal, Ingrid; Liska, Roman; Morath, Siegfried; Reina, Vittorio; Zorzoli, Chiara; Zuang, Valérie
This chapter focuses on practical aspects of conducting prospective in vitro validation studies, and in particular, by laboratories that are members of the European Union Network of Laboratories for the Validation of Alternative Methods (EU-NETVAL) that is coordinated by the EU Reference Laboratory for Alternatives to Animal Testing (EURL ECVAM). Prospective validation studies involving EU-NETVAL, comprising a multi-study trial involving several laboratories or "test facilities", typically consist of two main steps: (1) the design of the validation study by EURL ECVAM and (2) the execution of the multi-study trial by a number of qualified laboratories within EU-NETVAL, coordinated and supported by EURL ECVAM. The approach adopted in the conduct of these validation studies adheres to the principles described in the OECD Guidance Document on the Validation and International Acceptance of new or updated test methods for Hazard Assessment No. 34 (OECD 2005). The context and scope of conducting prospective in vitro validation studies is dealt with in Chap. 4 . Here we focus mainly on the processes followed to carry out a prospective validation of in vitro methods involving different laboratories with the ultimate aim of generating a dataset that can support a decision in relation to the possible development of an international test guideline (e.g. by the OECD) or the establishment of performance standards.
Cronin-Fenton, Deirdre P; Mooney, Margaret M; Clegg, Limin X; Harlan, Linda C
2008-01-01
AIM: To examine the extent of use of specific therapies in clinical practice, and their relationship to therapies validated in clinical trials. METHODS: The US National Cancer Institutes’ Patterns of Care study was used to examine therapies and survival of patients diagnosed in 2001 with histologically-confirmed gastroesophageal adenocarcinoma (n = 1356). The study re-abstracted data and verified therapy with treating physicians for a population-based stratified random sample. RESULTS: Approximately 62% of patients had stomach adenocarcinoma (SAC), while 22% had gastric-cardia adenocarcinoma (GCA), and 16% lower esophageal adenocarcinoma (EAC). Stage IV/unstaged esophageal cancer patients were most likely and stage I-III stomach cancer patients least likely to receive chemotherapy as all or part of their therapy; gastric-cardia patients received chemotherapy at a rate between these two. In multivariable analysis by anatomic site, patients 70 years and older were significantly less likely than younger patients to receive chemotherapy alone or chemoradiation for all three anatomic sites. Among esophageal and stomach cancer patients, receipt of chemotherapy was associated with lower mortality; but no association was found among gastric-cardia patients. CONCLUSION: This study highlights the relatively low use of clinical trials-validated anti-cancer therapies in community practice. Use of chemotherapy-based treatment was associated with lower mortality, dependent on anatomic site. Findings suggest that physicians treat lower esophageal and SAC as two distinct entities, while gastric-cardia patients receive a mix of the treatment strategies employed for the two other sites. PMID:18506920
Dimeff, Linda A; Harned, Melanie S; Woodcock, Eric A; Skutch, Julie M; Koerner, Kelly; Linehan, Marsha M
2015-05-01
The present study examined the efficacy of online training (OLT), instructor-led training (ILT), and a treatment manual (TM) in training mental health clinicians in two core strategies of Dialectical Behavior Therapy (DBT): chain analysis and validation. A randomized controlled trial compared OLT, ILT, and TM among clinicians naïve to DBT (N=172) who were assessed at baseline, post-training, and 30, 60, and 90 days following training. Primary outcomes included satisfaction, self-efficacy, motivation, knowledge, clinical proficiency, and clinical use. Overall, ILT outperformed OLT and TM in satisfaction, self-efficacy, and motivation, whereas OLT was the most effective method for increasing knowledge. The conditions did not differ in observer-rated clinical proficiency or self-reported clinical use, which both increased to moderate levels after training. In addition, ILT was particularly effective at improving motivation to use chain analysis, whereas OLT was particularly effective at increasing knowledge of validation strategies. These findings suggest that these types of brief, didactic trainings may be effective methods of increasing knowledge of new treatment strategies, but may not be sufficient to enable clinicians to achieve a high level of clinical use or proficiency. Additional research examining the possible advantages of matching training methods to types of treatment strategies may help to determine a tailored, more effective approach to training clinicians in empirically supported treatments. Copyright © 2015. Published by Elsevier Ltd.
Reddy, L. Felice; Barch, Deanna M.; Buchanan, Robert W.; Dunayevich, Eduardo; Gold, James M.; Marder, Steven R.; Wynn, Jonathan K.; Young, Jared W.; Green, Michael F.
2015-01-01
Effort-based decision making has strong conceptual links to the motivational disturbances that define a key subdomain of negative symptoms. However, the extent to which effort-based decision-making performance relates to negative symptoms, and other clinical and functionally important variables has yet to be systematically investigated. In 94 clinically stable outpatients with schizophrenia, we examined the external validity of 5 effort-based paradigms, including the Effort Expenditure for Rewards, Balloon Effort, Grip Strength Effort, Deck Choice Effort, and Perceptual Effort tasks. These tasks covered 3 types of effort: physical, cognitive, and perceptual. Correlations between effort related performance and 6 classes of variables were examined, including: (1) negative symptoms, (2) clinically rated motivation and community role functioning, (3) self-reported motivational traits, (4) neurocognition, (5) other psychiatric symptoms and clinical/demographic characteristics, and (6) subjective valuation of monetary rewards. Effort paradigms showed small to medium relationships to clinical ratings of negative symptoms, motivation, and functioning, with the pattern more consistent for some measures than others. They also showed small to medium relations with neurocognitive functioning, but were generally unrelated to other psychiatric symptoms, self-reported traits, antipsychotic medications, side effects, and subjective valuation of money. There were relatively strong interrelationships among the effort measures. In conjunction with findings from a companion psychometric article, all the paradigms warrant further consideration and development, and 2 show the strongest potential for clinical trial use at this juncture. PMID:26209546
Explanatory Versus Pragmatic Trials: An Essential Concept in Study Design and Interpretation.
Merali, Zamir; Wilson, Jefferson R
2017-11-01
Randomized clinical trials often represent the highest level of clinical evidence available to evaluate the efficacy of an intervention in clinical medicine. Although the process of randomization serves to maximize internal validity, the external validity, or generalizability, of such studies depends on several factors determined at the design phase of the trial including eligibility criteria, study setting, and outcomes of interest. In general, explanatory trials are optimized to demonstrate the efficacy of an intervention in a highly selected patient group; however, findings from these studies may not be generalizable to the larger clinical problem. In contrast, pragmatic trials attempt to understand the real-world benefit of an intervention by incorporating design elements that allow for greater generalizability and clinical applicability of study results. In this article we describe the explanatory-pragmatic continuum for clinical trials in greater detail. Further, a well-accepted tool for grading trials on this continuum is described, and applied, to 2 recently published trials pertaining to the surgical management of lumbar degenerative spondylolisthesis.
Person mobility in the design and analysis of cluster-randomized cohort prevention trials.
Vuchinich, Sam; Flay, Brian R; Aber, Lawrence; Bickman, Leonard
2012-06-01
Person mobility is an inescapable fact of life for most cluster-randomized (e.g., schools, hospitals, clinic, cities, state) cohort prevention trials. Mobility rates are an important substantive consideration in estimating the effects of an intervention. In cluster-randomized trials, mobility rates are often correlated with ethnicity, poverty and other variables associated with disparity. This raises the possibility that estimated intervention effects may generalize to only the least mobile segments of a population and, thus, create a threat to external validity. Such mobility can also create threats to the internal validity of conclusions from randomized trials. Researchers must decide how to deal with persons who leave study clusters during a trial (dropouts), persons and clusters that do not comply with an assigned intervention, and persons who enter clusters during a trial (late entrants), in addition to the persons who remain for the duration of a trial (stayers). Statistical techniques alone cannot solve the key issues of internal and external validity raised by the phenomenon of person mobility. This commentary presents a systematic, Campbellian-type analysis of person mobility in cluster-randomized cohort prevention trials. It describes four approaches for dealing with dropouts, late entrants and stayers with respect to data collection, analysis and generalizability. The questions at issue are: 1) From whom should data be collected at each wave of data collection? 2) Which cases should be included in the analyses of an intervention effect? and 3) To what populations can trial results be generalized? The conclusions lead to recommendations for the design and analysis of future cluster-randomized cohort prevention trials.
Dodge, Hiroko H; Mattek, Nora; Gregor, Mattie; Bowman, Molly; Seelye, Adriana; Ybarra, Oscar; Asgari, Meysam; Kaye, Jeffrey A
2015-01-01
Detecting early signs of Alzheimer's disease (AD) and mild cognitive impairment (MCI) during the pre-symptomatic phase is becoming increasingly important for costeffective clinical trials and also for deriving maximum benefit from currently available treatment strategies. However, distinguishing early signs of MCI from normal cognitive aging is difficult. Biomarkers have been extensively examined as early indicators of the pathological process for AD, but assessing these biomarkers is expensive and challenging to apply widely among pre-symptomatic community dwelling older adults. Here we propose assessment of social markers, which could provide an alternative or complementary and ecologically valid strategy for identifying the pre-symptomatic phase leading to MCI and AD. The data came from a larger randomized controlled clinical trial (RCT), where we examined whether daily conversational interactions using remote video telecommunications software could improve cognitive functions of older adult participants. We assessed the proportion of words generated by participants out of total words produced by both participants and staff interviewers using transcribed conversations during the intervention trial as an indicator of how two people (participants and interviewers) interact with each other in one-on-one conversations. We examined whether the proportion differed between those with intact cognition and MCI, using first, generalized estimating equations with the proportion as outcome, and second, logistic regression models with cognitive status as outcome in order to estimate the area under ROC curve (ROC AUC). Compared to those with normal cognitive function, MCI participants generated a greater proportion of words out of the total number of words during the timed conversation sessions (p=0.01). This difference remained after controlling for participant age, gender, interviewer and time of assessment (p=0.03). The logistic regression models showed the ROC AUC of identifying MCI (vs. normals) was 0.71 (95% Confidence Interval: 0.54 - 0.89) when average proportion of word counts spoken by subjects was included univariately into the model. An ecologically valid social marker such as the proportion of spoken words produced during spontaneous conversations may be sensitive to transitions from normal cognition to MCI.
Coyne, James C; Thombs, Brett D; Hagedoorn, Mariet
2010-03-01
We examined four meta-analyses of behavioral interventions for adults (Dixon, Keefe, Scipio, Perri, & Abernethy, 2007; Hoffman, Papas, Chatkoff, & Kerns, 2007; Irwin, Cole, & Nicassio, 2006; and Jacobsen, Donovan, Vadaparampil, & Small, 2007) that have appeared in the Evidence Based Treatment Reviews section of Health Psychology. Narrative review. We applied the following criteria to each meta-analysis: (1) whether each meta-analysis was described accurately, adequately, and transparently in the article; (2) whether there was an adequate attempt to deal with methodological quality of the original trials; (3) the extent to which the meta-analysis depended on small, underpowered studies; and (4) the extent to which the meta-analysis provided valid and useful evidence-based recommendations. Across the four meta-analyses, we identified substantial problems with the transparency and completeness with which these meta-analyses were reported, as well as a dependence on small, underpowered trials of generally poor quality. Results of our exercise raise questions about the clinical validity and utility of the conclusions of these meta-analyses. Results should serve as a wake up call to prospective authors, reviewers, and end-users of meta-analyses now appearing in the literature. Copyright 2010 APA, all rights reserved.
Buckley, Nicholas A; Isbister, Geoffrey K; Stokes, Barrie; Juurlink, David N
2005-01-01
Poisoning with carbon monoxide (CO) is an important cause of unintentional and intentional injury worldwide. Hyperbaric oxygen (HBO) enhances CO elimination and has been postulated to reduce the incidence of neurological sequelae. These observations have led some clinicians to use HBO for selected patients with CO poisoning, although there is considerable variability in clinical practice. This article assesses the effectiveness of HBO compared with normobaric oxygen (NBO) for the prevention of neurological sequelae in patients with acute CO poisoning. The following databases were searched: MEDLINE (1966 to present), EMBASE (1980 to present), and the Controlled Trials Register of the Cochrane Collaboration, supplemented by a manual review of bibliographies of identified articles and discussion with recognised content experts. All randomised controlled trials involving people acutely poisoned with CO, regardless of severity, were examined. The primary analysis included all trials from which data could be extracted. Sensitivity analysis examined trials with better validity (defined using the validated instrument of Jadad) and those enrolling more severely poisoned patients. Two reviewers independently extracted from each trial, including information on the number of randomised patients, types of participants, the dose and duration of the intervention, and the prevalence of neurological sequelae at follow-up. A pooled odds ratio (OR) for the presence of neurological symptoms at 1-month follow-up was calculated using a random effects model. Bayesian models were also investigated to illustrate the degree of certainty about clinical effectiveness. Eight randomised controlled trials were identified. Two had no evaluable data and were excluded. The remaining trials were of varying quality and two have been published only as abstracts. The severity of CO poisoning varied among trials. At 1-month follow-up after treatment, sequelae possibly related to CO poisoning were present in 242 of 761 patients (36.1%) treated with NBO, compared with 259 of 718 patients (31.8%) treated with HBO. Restricting the analysis to the trials with the highest quality scores or those that enrolled all patients regardless of severity did not change the lack of statistical significance in the outcome of the pooled analysis. We found empiric evidence of multiple biases that operated to inflate the benefit of HBO in two positive trials. In contrast, the interpretation of negative trials was hampered by low rates of follow-up, unusual interventions for control patients and inclusion of less severely poisoned patients. Collectively, these limitations may have led negative trials to overlook a real and substantial benefit of HBO (type II error). There is conflicting evidence regarding the efficacy of HBO treatment for patients with CO poisoning. Methodological shortcomings are evident in all published trials, with empiric evidence of bias in some, particularly those that suggest a benefit of HBO. Bayesian analysis further illustrates the uncertainty about a meaningful clinical benefit. Consequently, firm guidelines regarding the use of HBO for patients with CO poisoning cannot be established. Further research is needed to better define the role of HBO, if any, in the treatment of CO poisoning. Such research should not exclude patients with severe poisoning, have a primary outcome that is clinically meaningful and have oversight from an independent data monitoring and ethics committee.
Veerus, Piret; Fischer, Krista; Hakama, Matti; Hemminki, Elina
2012-04-04
The Estonian Postmenopausal Hormone Therapy (EPHT) Trial assigned 4170 potential participants prior to recruitment to blind or non-blind hormone therapy (HT), with placebo or non-treatment the respective alternatives. Before having to decide on participation, women were told whether they had been randomised to the blind or non-blind trial. Eligible women who were still willing to join the trial were recruited. After recruitment participants in the non-blind trial (N = 1001) received open-label HT or no treatment, participants in the blind trial (N = 777) remained blinded until the end of the trial. The aim of this paper is to analyse the effect of blinding on internal and external validity of trial outcomes. Effect of blinding was calculated as the hazard ratio of selected chronic diseases, total mortality and all outcomes. For analysing the effect of blinding on external validity, the hazard ratios from women recruited to the placebo arm and to the non-treatment arm were compared with those not recruited; for analysing the effect of blinding on internal validity, the hazard ratios from the blind trial were compared with those from the non-blind trial. The women recruited to the placebo arm had less cerebrovascular disease events (HR 0.43; 95% CI: 0.26-0.71) and all outcomes combined (HR 0.76; 95% CI: 0.63-0.91) than those who were not recruited. Among women recruited or not recruited to the non-treatment arm, no differences were observed for any of the outcomes studied.Among women recruited to the trial, the risk for coronary heart disease events (HR 0.77; 95% CI: 0.64-0.93), cerebrovascular disease events (HR 0.66; 95%CI: 0.47-0.92), and all outcomes combined (HR 0.82; 95% CI: 0.72-0.94) was smaller among participants in the blind trial than in the non-blind trial. There was no difference between the blind and the non-blind trial for total cancer (HR 0.95; 95% CI: 0.64-1.42), bone fractures (0.93; 95% CI: 0.74-1.16), and total mortality (HR 1.03; 95% CI: 0.53-1.98). The results from blind and non-blind trials may differ, even if the target population is the same. Blinding may influence both internal and external validity. The effect of blinding may vary for different outcome events. [ISRCTN35338757].
Matlasz, Tatiana M; Brylski, Jamie L; Leidenfrost, Corey M; Scalco, Matt; Sinclair, Samuel J; Schoelerman, Ronald M; Tsang, Valerie; Antonius, Daniel
Cognitive impairment among seriously mentally ill offenders has implications for legal matters (e.g., competency to stand trial), as well as clinical treatment and care. Thus, being able to identify potential cognitive concerns early in the adjudication process can be important when deciding on further interventions. In this study, we examined the validity scales of the Personality Assessment Inventory (PAI), scores on the Wechsler Adult Intelligence Scale-IV (WAIS-IV), and competency findings in male inmates (n=61) diagnosed with a serious mental illness. Lower scores on the WAIS-IV significantly (p=0.001) predicted invalid, versus valid, PAI profiles, with working memory impairment being the most significant (p=0.004) predictor of an invalid profile. Ancillary analyses on a smaller sample (n=18) indicate that those with invalid PAI profiles were more likely to be deemed legally incompetent (p=0.03). These findings suggest that the PAI validity scales may be informative in detecting cognitive concerns and help clinicians make determinations about competency restoration and treatment. Copyright © 2016 Elsevier Ltd. All rights reserved.
Martí-Margarit, Anna; Manresa, Josep M; Herdman, Mike; Pujol, Ramon; Serra, Consol; Flyvholm, Mary-Ann; Giménez-Arnau, Ana M
2015-04-01
Hand eczema is an impacting cutaneous disease. Globally valid tools that help to diagnose hand and forearm eczema are required. To validate the questions to detect hand and/or forearm eczema included in the "Nordic Occupational Skin Questionnaire" (NOSQ-2002) in the Spanish language. A prospective pilot study was conducted with 80 employees of a cleaning company and a retrospective one involving 2,546 individuals. The responses were analysed for sensitivity, specificity and positive and negative predictive values. The final diagnosis according to the patients' hospital records, the specialty care records and the physical examination was taken as gold standard. The Dermatology Life Quality Index (DLQI) was also evaluated. Sensitivity and specificity, in a worst case scenario (WC) combining both questions, were 96.5% and 66.7%, respectively, and in a per protocol (PP) analysis, were 96.5% and 75.2%. The questions validated detected eczema effectively, making this tool suitable for use e.g. in multicentre epidemiological studies or clinical trials.
Stamenovic, Milorad; Dobraca, Amra; Smajlovic, Mersiha
2018-01-01
Introduction: The aim of this paper is to present the marketing strategy and the application of management (marketing management) and advertising in order to increase the efficiency of innovative approach in clinical trials that include and involve the use of new technologies and transfer of technologies. Material and Methods: This paper has a descriptive character and represents a narrative review of the literature and new model implementation. Results: Marketing models are primarily used to improve the inclusion of a larger (and appropriate) number of patients, but they can be credited for the stay and monitoring of patients in the trial. Regulatory mechanisms play an important role in the application of various marketing strategies within clinical trials. The value for the patient as the most important stakeholder is defined in the field of clinical trials according to Kotler’s value model for the consumer. Conclusion: In order to achieve the best results it is important to adequately examine all the elements of clinical trials and apply this knowledge in creation of a marketing plan that will be made in accordance with the legal regulations defined globally and locally. In this paper, two challenges have been highlighted for the adequate application of marketing tools in the field of clinical trials, namely: defining business elements in order to provide an adequate marketing approach for clinical trials and technology transfer and ensuring uniformity and regulatory affirmation of marketing attitudes in clinical trials in all regions in which they are carried out in accordance with ICH-GCP and valid regulations. PMID:29719318
Clinical efficacy of composite versus ceramic inlays and onlays: a systematic review.
Fron Chabouis, Hélène; Smail Faugeron, Violaine; Attal, Jean-Pierre
2013-12-01
Large tooth substance losses are frequent in posterior teeth because of primary caries or aging restorations. Inlays and onlays are often the minimal invasive solution in such cases, but the efficacy of the composite and ceramic materials used is unknown. We performed a systematic review of randomized controlled trials comparing the efficacy of composite and ceramic inlays or onlays. MEDLINE, Embase and the Cochrane Central Register of Controlled Trials were searched without any restriction on date or language, as were references of eligible studies and ClinicalTrials.gov. Eligible studies were randomized trials comparing the clinical efficacy of composite to ceramic inlays or onlays in adults with any clinical outcome for at least 6 months. From 172 records identified, we examined reports of 2 randomized controlled trials involving 138 inlays (no onlays evaluated) in 80 patients and exhibiting a high-risk of bias. Outcomes were clinical scores and major failures. The 3-year overall failure risk ratio was 2 [0.38-10.55] in favor of ceramic inlays although not statistically significant. The reported clinical scores (United States Public Health Services and Californian Dental Association) showed considerable heterogeneity between trials and could not be combined. We have very limited evidence that ceramics perform better than composite material for inlays in the short term. However, this result may not be valid in the long term, and other trials are needed. Trials should follow Fédération dentaire internationale recommendations and enhance their methodology. Trials comparing composite and ceramic onlays are needed. Copyright © 2013 Academy of Dental Materials. Published by Elsevier Ltd. All rights reserved.
Gaus, Wilhelm; Muche, Rainer
2013-05-01
Clinical studies provide formalised experience for evidence-based medicine (EBM). Many people consider a controlled randomised trial (CRT, identical to a randomised controlled trial RCT) to be the non-plus-ultra design. However, CRTs also have limitations. The problem is not randomisation itself but informed consent for randomisation and masking of therapies according to today's legal and ethical standards. We do not want to de-rate CRTs, but we would like to contribute to the discussion on clinical research methodology. Informed consent to a CRT and masking of therapies plainly select patients. The excellent internal validity of CRTs can be counterbalanced by poor external validity, because internal and external validity act as antagonists. In a CRT, patients may feel like guinea pigs, this can decrease compliance, cause protocol violations, reduce self-healing properties, suppress unspecific therapeutic effects and possibly even modify specific efficacy. A control group (comparative study) is most important for the degree of evidence achieved by a trial. Study control by detailed protocol and good clinical practice (controlled study) is second in importance and randomisation and masking is third (thus the sequence CRT instead of RCT). Controlled non-randomised trials are just as ambitious and detailed as CRTs. We recommend clinicians and biometricians to take high quality controlled non-randomised trials into consideration more often. They combine good internal and external validity, better suit daily medical practice, show better patient compliance and fewer protocol violations, deliver estimators unbiased by alienated patients, and perhaps provide a clearer explanation of the achieved success. Copyright © 2013 Elsevier Inc. All rights reserved.
Epidural analgesia in labour and risk of caesarean delivery.
Bannister-Tyrrell, Melanie; Ford, Jane B; Morris, Jonathan M; Roberts, Christine L
2014-09-01
A Cochrane Systematic Review of randomised controlled trials of epidural analgesia compared with other or no analgesia in labour reported no overall increased risk of caesarean delivery. However, many trials were affected by substantial non-compliance, and there are concerns about the external validity of some trials for contemporary maternity populations. We aimed to explore the association between epidural analgesia in labour and caesarean delivery in clinical practice and compare with findings from randomised controlled trials. Population-based cohort of pregnant women (n = 210 708) without major obstetrical complications who delivered a singleton live infant in hospitals in New South Wales, Australia, 2007-10. Data were obtained from linked, validated population-based data collections. Propensity score matching was used to examine the association between epidural analgesia in labour and caesarean delivery. Epidural analgesia in labour was used by a third (31.5%, n = 66 317) of the women, and 9.8% (n = 20 531) had a caesarean delivery. Epidural analgesia in labour was associated with increased risk of caesarean delivery {risk ratio [RR] 2.5, [95% confidence interval (CI) 2.5, 2.6]}. The association with epidural analgesia in labour was higher for caesarean delivery for failure to progress {RR 3.0, [95% CI 2.9, 3.0]} than for caesarean delivery for fetal distress {RR 1.9, [95% CI 1.8, 2.0]}. Epidural analgesia in labour is associated with caesarean delivery in a large maternity population. Population-based studies contribute important data about obstetrical care, when research settings and participants may not represent the clinical settings or broader population in which obstetrical interventions in labour are applied. © 2014 John Wiley & Sons Ltd.
Villar, Jesús; Pérez-Méndez, Lina; Basaldúa, Santiago; Blanco, Jesús; Aguilar, Gerardo; Toral, Darío; Zavala, Elizabeth; Romera, Miguel A; González-Díaz, Gumersindo; Nogal, Frutos Del; Santos-Bouza, Antonio; Ramos, Luís; Macías, Santiago; Kacmarek, Robert M
2011-04-01
Predicting mortality has become a necessary step for selecting patients for clinical trials and defining outcomes. We examined whether stratification by tertiles of respiratory and ventilatory variables at the onset of acute respiratory distress syndrome (ARDS) identifies patients with different risks of death in the intensive care unit. We performed a secondary analysis of data from 220 patients included in 2 multicenter prospective independent trials of ARDS patients mechanically ventilated with a lung-protective strategy. Using demographic, pulmonary, and ventilation data collected at ARDS onset, we derived and validated a simple prediction model based on a population-based stratification of variable values into low, middle, and high tertiles. The derivation cohort included 170 patients (all from one trial) and the validation cohort included 50 patients (all from a second trial). Tertile distribution for age, plateau airway pressure (P(plat)), and P(aO(2))/F(IO(2)) at ARDS onset identified subgroups with different mortalities, particularly for the highest-risk tertiles: age (> 62 years), P(plat) (> 29 cm H(2)O), and P(aO(2))/F(IO(2)) (< 112 mm Hg). Risk was defined by the number of coexisting high-risk tertiles: patients with no high-risk tertiles had a mortality of 12%, whereas patients with 3 high-risk tertiles had 90% mortality (P < .001). A prediction model based on tertiles of patient age, P(plat), and P(aO(2))/F(IO(2)) at the time the patient meets ARDS criteria identifies patients with the lowest and highest risk of intensive care unit death.
Xiao, Lan; Lv, Nan; Rosas, Lisa G; Au, David; Ma, Jun
2017-02-01
To validate clinic weights in electronic health records against researcher-measured weights for outcome assessment in weight loss trials. Clinic and researcher-measured weights from a published trial (BE WELL) were compared using Lin's concordance correlation coefficient, Bland and Altman's limits of agreement, and polynomial regression model. Changes in clinic and researcher-measured weights in BE WELL and another trial, E-LITE, were analyzed using growth curve modeling. Among BE WELL (n = 330) and E-LITE (n = 241) participants, 96% and 90% had clinic weights (mean [SD] of 5.8 [6.1] and 3.7 [3.9] records) over 12 and 15 months of follow-up, respectively. The concordance correlation coefficient was 0.99, and limits of agreement plots showed no pattern between or within treatment groups, suggesting overall good agreement between researcher-measured and nearest-in-time clinic weights up to 3 months. The 95% confidence intervals for predicted percent differences fell within ±3% for clinic weights within 3 months of the researcher-measured weights. Furthermore, the growth curve slopes for clinic and researcher-measured weights by treatment group did not differ significantly, suggesting similar inferences about treatment effects over time, in both trials. Compared with researcher-measured weights, close-in-time clinic weights showed high agreement and inference validity. Clinic weights could be a valid pragmatic outcome measure in weight loss studies. © 2017 The Obesity Society.
Evidence base for multimodal therapy in cachexia.
Solheim, Tora S; Laird, Barry J A
2012-12-01
The lack of success of unimodal treatment studies in cachexia and the growing awareness that multiple components are responsible for the development of cachexia have led to the view that cachexia intervention should include multimodal treatment. The aim of this article is to examine the evidence for multimodal treatment in the management of cancer cachexia. There are some studies involving multimodal treatment that indicate significant effects on cachexia outcomes. There are, however, no randomized controlled trials to date that incorporate fully a structured exercise program, nutrition, good symptom treatment as well as drug treatment, to counteract the effects of altered metabolism. The effectiveness of any drug intervention for cancer cachexia probably will only be maximized if incorporated into multimodal treatment. Further, cachexia treatment trials should also aim to include patients at an early phase in their cachexia trajectory and use validated outcome measures.
Koontz, Alicia M; Lin, Yen-Sheng; Kankipati, Padmaja; Boninger, Michael L; Cooper, Rory A
2011-01-01
This study describes a new custom measurement system designed to investigate the biomechanics of sitting-pivot wheelchair transfers and assesses the reliability of selected biomechanical variables. Variables assessed include horizontal and vertical reaction forces underneath both hands and three-dimensional trunk, shoulder, and elbow range of motion. We examined the reliability of these measures between 5 consecutive transfer trials for 5 subjects with spinal cord injury and 12 nondisabled subjects while they performed a self-selected sitting pivot transfer from a wheelchair to a level bench. A majority of the biomechanical variables demonstrated moderate to excellent reliability (r > 0.6). The transfer measurement system recorded reliable and valid biomechanical data for future studies of sitting-pivot wheelchair transfers.We recommend a minimum of five transfer trials to obtain a reliable measure of transfer technique for future studies.
Fermont, Jilles M; Blazeby, Jane M; Rogers, Chris A; Wordsworth, Sarah
2017-01-01
Bariatric surgery is considered an effective treatment for individuals with severe and complex obesity. Besides reducing weight and improving obesity related comorbidities such as diabetes, bariatric surgery could improve patients' health-related quality of life. However, the frequently used instrument to measure quality of life, the EQ-5D has not been validated for use in bariatric surgery, which is a major limitation to its use in this clinical context. Our study undertook a psychometric validation of the 5 level EQ-5D (EQ-5D-5L) using clinical trial data to measure health-related quality of life in patients with severe and complex obesity undergoing bariatric surgery. Health-related quality of life was assessed at baseline (before randomisation) and six months later in 189 patients in a randomised controlled trial of bariatric surgery. Patients completed two generic health-related quality of life instruments, the EQ-5D-5L and SF-12, which were used together for the validation using data from all patients in the trial as the trial is ongoing. Psychometric analyses included construct and criterion validity and responsiveness to change. Of the 189 validation patients, 141 (75%) were female, the median age was 49 years old (range 23-70 years) and body mass index ranged from 33-70 kg/m2. For construct validity, there were significant improvements in the distribution of responses in all EQ-5D dimensions between baseline and 6 months after randomisation. For criterion validity, the highest degree of correlation was between the EQ-5D pain/discomfort and SF-12 bodily pain domain. For responsiveness the EQ-5D and SF-12 showed statistically significant improvements in health-related quality of life between baseline and 6 months after randomisation. The EQ-5D-5L is a valid generic measure for measuring health-related quality of life in bariatric surgery patients.
Community engagement in diverse populations for Alzheimer disease prevention trials.
Romero, Heather R; Welsh-Bohmer, Kathleen A; Gwyther, Lisa P; Edmonds, Henry L; Plassman, Brenda L; Germain, Cassandra M; McCart, Michelle; Hayden, Kathleen M; Pieper, Carl; Roses, Allen D
2014-01-01
The recruitment of asymptomatic volunteers has been identified as a critical factor that is delaying the development and validation of preventive therapies for Alzheimer disease (AD). Typical recruitment strategies involve the use of convenience samples or soliciting participation of older adults with a family history of AD from clinics and outreach efforts. However, high-risk groups, such as ethnic/racial minorities, are traditionally less likely to be recruited for AD prevention studies, thus limiting the ability to generalize findings for a significant proportion of the aging population. A community-engagement approach was used to create a registry of 2311 research-ready, healthy adult volunteers who reflect the ethnically diverse local community. Furthermore, the registry's actual commitment to research was examined, through demonstrated participation rates in a clinical study. The approach had varying levels of success in establishing a large, diverse pool of individuals who are interested in participating in pharmacological prevention trials and meet the criteria for primary prevention research trials designed to delay the onset of AD. Our efforts suggest that entry criteria for the clinical trials need to be carefully considered to be inclusive of African Americans, and that sustained effort is needed to engage African Americans in pharmacological prevention approaches.
Wallston, Kenneth A.; Wilkins, Consuelo H.; Hull, Pamela C.; Miller, Stephania T.
2015-01-01
Abstract Objective This study describes the development and psychometric evaluation of HPV Clinical Trial Survey for Parents with Children Aged 9 to 15 (CTSP‐HPV) using traditional instrument development methods and community engagement principles. Methods An expert panel and parental input informed survey content and parents recommended study design changes (e.g., flyer wording). A convenience sample of 256 parents completed the final survey measuring parental willingness to consent to HPV clinical trial (CT) participation and other factors hypothesized to influence willingness (e.g., HPV vaccine benefits). Cronbach's a, Spearman correlations, and multiple linear regression were used to estimate internal consistency, convergent and discriminant validity, and predictively validity, respectively. Results Internal reliability was confirmed for all scales (a ≥ 0.70.). Parental willingness was positively associated (p < 0.05) with trust in medical researchers, adolescent CT knowledge, HPV vaccine benefits, advantages of adolescent CTs (r range 0.33–0.42), supporting convergent validity. Moderate discriminant construct validity was also demonstrated. Regression results indicate reasonable predictive validity with the six scales accounting for 31% of the variance in parents’ willingness. Conclusions This instrument can inform interventions based on factors that influence parental willingness, which may lead to the eventual increase in trial participation. Further psychometric testing is warranted. PMID:26530324
Topping, Alice; Kappel, Franz; Thijssen, Stephan; Kotanko, Peter
2018-01-01
In silico approaches have been proposed as a novel strategy to increase the repertoire of clinical trial designs. Realistic simulations of clinical trials can provide valuable information regarding safety and limitations of treatment protocols and have been shown to assist in the cost‐effective planning of clinical studies. In this report, we present a blueprint for the stepwise integration of internal, external, and ecological validity considerations in virtual clinical trials (VCTs). We exemplify this approach in the context of a model‐based in silico clinical trial aimed at anemia treatment in patients undergoing hemodialysis (HD). Hemoglobin levels and subsequent anemia treatment were simulated on a per patient level over the course of a year and compared to real‐life clinical data of 79,426 patients undergoing HD. The novel strategies presented here, aimed to improve external and ecological validity of a VCT, significantly increased the predictive power of the discussed in silico trial. PMID:29368434
Intelligence as the efficiency of cue-driven retrieval from secondary memory.
Liesefeld, Heinrich René; Hoffmann, Eugenia; Wentura, Dirk
2016-01-01
Complex-span (working-memory-capacity) tasks are among the most successful predictors of intelligence. One important contributor to this relationship is the ability to efficiently employ cues for the retrieval from secondary memory. Presumably, intelligent individuals can considerably restrict their memory search sets by using such cues and can thereby improve recall performance. We here test this assumption by experimentally manipulating the validity of retrieval cues. When memoranda are drawn from the same semantic category on two successive trials of a verbal complex-span task, the category is a very strong retrieval cue on its first occurrence (strong-cue trial) but loses some of its validity on its second occurrence (weak-cue trial). If intelligent individuals make better use of semantic categories as retrieval cues, their recall accuracy suffers more from this loss of cue validity. Accordingly, our results show that less variance in intelligence is explained by recall accuracy on weak-cue compared with strong-cue trials.
Wilman, E; Megone, C; Oliver, S; Duley, L; Gyte, G; Wright, J M
2015-11-04
Conducting clinical trials with pre-term or sick infants is important if care for this population is to be underpinned by sound evidence. Yet approaching parents at this difficult time raises challenges for the obtaining of valid informed consent to such research. This study asked: what light does the empirical literature cast on an ethically defensible approach to the obtaining of informed consent in perinatal clinical trials? A systematic search identified 49 studies. Analysis began by applying philosophical frameworks which were then refined in light of the concepts emerging from empirical studies to present a coherent picture of a broad literature. Between them, studies addressed the attitudes of both parents and clinicians concerning consent in neonatal trials; the validity of the consent process in the neonatal research context; and different possible methods of obtaining consent. Despite a variety of opinions among parents and clinicians there is a strongly and widely held view that it is important that parents do give or decline consent for neonatal participation in trials. However, none of the range of existing consent processes reviewed by the research is satisfactory. A significant gap is evaluation of the widespread practice of emergency 'assent', in which parents assent or refuse their baby's participation as best they can during the emergency and later give full consent to ongoing participation and follow-up. Emergency assent has not been evaluated for its acceptability, how such a process would deal with bad outcomes such as neonatal death between assent and consent, or the extent to which late parental refusal might bias results. This review of a large number of empirical papers, while not making fundamental changes, has refined and developed the conceptual framework from philosophy for examining informed consent in this context.
Ma, Ning; Yu, Angela J
2015-01-01
Response time (RT) is an oft-reported behavioral measure in psychological and neurocognitive experiments, but the high level of observed trial-to-trial variability in this measure has often limited its usefulness. Here, we combine computational modeling and psychophysics to examine the hypothesis that fluctuations in this noisy measure reflect dynamic computations in human statistical learning and corresponding cognitive adjustments. We present data from the stop-signal task (SST), in which subjects respond to a go stimulus on each trial, unless instructed not to by a subsequent, infrequently presented stop signal. We model across-trial learning of stop signal frequency, P(stop), and stop-signal onset time, SSD (stop-signal delay), with a Bayesian hidden Markov model, and within-trial decision-making with an optimal stochastic control model. The combined model predicts that RT should increase with both expected P(stop) and SSD. The human behavioral data (n = 20) bear out this prediction, showing P(stop) and SSD both to be significant, independent predictors of RT, with P(stop) being a more prominent predictor in 75% of the subjects, and SSD being more prominent in the remaining 25%. The results demonstrate that humans indeed readily internalize environmental statistics and adjust their cognitive/behavioral strategy accordingly, and that subtle patterns in RT variability can serve as a valuable tool for validating models of statistical learning and decision-making. More broadly, the modeling tools presented in this work can be generalized to a large body of behavioral paradigms, in order to extract insights about cognitive and neural processing from apparently quite noisy behavioral measures. We also discuss how this behaviorally validated model can then be used to conduct model-based analysis of neural data, in order to help identify specific brain areas for representing and encoding key computational quantities in learning and decision-making.
Assessing the Generalizability of Randomized Trial Results to Target Populations
Stuart, Elizabeth A.; Bradshaw, Catherine P.; Leaf, Philip J.
2014-01-01
Recent years have seen increasing interest in and attention to evidence-based practices, where the “evidence” generally comes from well-conducted randomized trials. However, while those trials yield accurate estimates of the effect of the intervention for the participants in the trial (known as “internal validity”), they do not always yield relevant information about the effects in a particular target population (known as “external validity”). This may be due to a lack of specification of a target population when designing the trial, difficulties recruiting a sample that is representative of a pre-specified target population, or to interest in considering a target population somewhat different from the population directly targeted by the trial. This paper first provides an overview of existing design and analysis methods for assessing and enhancing the ability of a randomized trial to estimate treatment effects in a target population. It then provides a case study using one particular method, which weights the subjects in a randomized trial to match the population on a set of observed characteristics. The case study uses data from a randomized trial of School-wide Positive Behavioral Interventions and Supports (PBIS); our interest is in generalizing the results to the state of Maryland. In the case of PBIS, after weighting, estimated effects in the target population were similar to those observed in the randomized trial. The paper illustrates that statistical methods can be used to assess and enhance the external validity of randomized trials, making the results more applicable to policy and clinical questions. However, there are also many open research questions; future research should focus on questions of treatment effect heterogeneity and further developing these methods for enhancing external validity. Researchers should think carefully about the external validity of randomized trials and be cautious about extrapolating results to specific populations unless they are confident of the similarity between the trial sample and that target population. PMID:25307417
Validity of smartphone pedometer applications.
Orr, Krystn; Howe, Holly S; Omran, Janine; Smith, Kristina A; Palmateer, Tess M; Ma, Alvin E; Faulkner, Guy
2015-11-30
Given the widespread use of smartphone pedometer applications and the relatively limited number of published validity tests, this study examined the validity of three popular commercial smartphone pedometer applications (i.e., Accupedo, Moves, and Runtastic Pedometer). Convenience samples of males and females were recruited for laboratory tests [n = 11; mean: aged 24.18 years (±3.06)] and a free-living test [n = 18; mean: aged 28.78 years (±9.52)]. Five conditions were assessed: (a) 20-step test, (b) 40-step stair climbing, (c) treadmill walking and running at different speeds, (d) driving, and (e) 3-day free-living. The Yamax SW-200 pedometer and observed step counts were used as criterion measures. Analyses identified an unacceptable error percentage in all of the applications compared to the pedometer. Given the inaccuracy of these applications, caution is required in their promotion to the public for self-monitoring physical activity and in their use as tools for assessing physical activity in research trials.
Exploring threats to generalisability in a large international rehabilitation trial (AVERT)
Bernhardt, Julie; Raffelt, Audrey; Churilov, Leonid; Lindley, Richard I; Speare, Sally; Ancliffe, Jacqueline; Katijjahbe, Md Ali; Hameed, Shahul; Lennon, Sheila; McRae, Anna; Tan, Dawn; Quiney, Jan; Williamson, Hannah C; Collier, Janice; Dewey, Helen M; Donnan, Geoffrey A; Langhorne, Peter; Thrift, Amanda G
2015-01-01
Objective The purpose of this paper is to examine potential threats to generalisability of the results of a multicentre randomised controlled trial using data from A Very Early Rehabilitation Trial (AVERT). Design AVERT is a prospective, parallel group, assessor-blinded randomised clinical trial. This paper presents data assessing the generalisability of AVERT. Setting Acute stroke units at 44 hospitals in 8 countries. Participants The first 20 000 patients screened for AVERT, of whom 1158 were recruited and randomised. Model We use the Proximal Similarity Model, which considers the person, place, and setting and practice, as a framework for considering generalisability. As well as comparing the recruited patients with the target population, we also performed an exploratory analysis of the demographic, clinical, site and process factors associated with recruitment. Results The demographics and stroke characteristics of the included patients in the trial were broadly similar to population-based norms, with the exception that AVERT had a greater proportion of men. The most common reason for non-recruitment was late arrival to hospital (ie, >24 h). Overall, being older and female reduced the odds of recruitment to the trial. More women than men were excluded for most of the reasons, including refusal. The odds of exclusion due to early deterioration were particularly high for those with severe stroke (OR=10.4, p<0.001, 95% CI 9.27 to 11.65). Conclusions A model which explores person, place, and setting and practice factors can provide important information about the external validity of a trial, and could be applied to other clinical trials. Trial registration number Australian New Zealand Clinical Trials Registry (ACTRN12606000185561) and Clinicaltrials.gov (NCT01846247). PMID:26283667
Jones, Matthew; Lewis, Sarah; Parrott, Steve; Wormall, Stephen; Coleman, Tim
2016-06-01
In pregnant smoking cessation trial participants, to estimate (1) among women abstinent at the end of pregnancy, the proportion who re-start smoking at time-points afterwards (primary analysis) and (2) among all trial participants, the proportion smoking at the end of pregnancy and at selected time-points during the postpartum period (secondary analysis). Trials identified from two Cochrane reviews plus searches of Medline and EMBASE. Twenty-seven trials were included. The included trials were randomized or quasi-randomized trials of within-pregnancy cessation interventions given to smokers who reported abstinence both at end of pregnancy and at one or more defined time-points after birth. Outcomes were validated biochemically and self-reported continuous abstinence from smoking and 7-day point prevalence abstinence. The primary random-effects meta-analysis used longitudinal data to estimate mean pooled proportions of re-starting smoking; a secondary analysis used cross-sectional data to estimate the mean proportions smoking at different postpartum time-points. Subgroup analyses were performed on biochemically validated abstinence. The pooled mean proportion re-starting at 6 months postpartum was 43% [95% confidence interval (CI) = 16-72%, I(2) = 96.7%] (11 trials, 571 abstinent women). The pooled mean proportion smoking at the end of pregnancy was 87% (95% CI = 84-90%, I(2) = 93.2%) and 94% (95% CI = 92-96%, I(2) = 88%) at 6 months postpartum (23 trials, 9262 trial participants). Findings were similar when using biochemically validated abstinence. In clinical trials of smoking cessation interventions during pregnancy only 13% are abstinent at term. Of these, 43% re-start by 6 months postpartum. © 2016 The Authors. Addiction published by John Wiley & Sons Ltd on behalf of Society for the Study of Addiction.
2014-01-01
Background Thresholds for statistical significance are insufficiently demonstrated by 95% confidence intervals or P-values when assessing results from randomised clinical trials. First, a P-value only shows the probability of getting a result assuming that the null hypothesis is true and does not reflect the probability of getting a result assuming an alternative hypothesis to the null hypothesis is true. Second, a confidence interval or a P-value showing significance may be caused by multiplicity. Third, statistical significance does not necessarily result in clinical significance. Therefore, assessment of intervention effects in randomised clinical trials deserves more rigour in order to become more valid. Methods Several methodologies for assessing the statistical and clinical significance of intervention effects in randomised clinical trials were considered. Balancing simplicity and comprehensiveness, a simple five-step procedure was developed. Results For a more valid assessment of results from a randomised clinical trial we propose the following five-steps: (1) report the confidence intervals and the exact P-values; (2) report Bayes factor for the primary outcome, being the ratio of the probability that a given trial result is compatible with a ‘null’ effect (corresponding to the P-value) divided by the probability that the trial result is compatible with the intervention effect hypothesised in the sample size calculation; (3) adjust the confidence intervals and the statistical significance threshold if the trial is stopped early or if interim analyses have been conducted; (4) adjust the confidence intervals and the P-values for multiplicity due to number of outcome comparisons; and (5) assess clinical significance of the trial results. Conclusions If the proposed five-step procedure is followed, this may increase the validity of assessments of intervention effects in randomised clinical trials. PMID:24588900
Jeger, Raban; Pfisterer, Matthias; Alber, Hannes; Eberli, Franz; Galatius, Søren; Naber, Christoph; Pedrazzini, Giovanni; Rickli, Hans; Jensen, Jan Skov; Vuilliomenet, André; Gilgen, Nicole; Kaiser, Christoph
2012-02-01
In the BAsel Stent Kosten Effektivitäts Trial PROspective Validation Examination (BASKET-PROVE), drug-eluting stents (DESs) had similar 2-year rates of death and myocardial infarction but lower rates of target vessel revascularization and major adverse cardiac events compared with bare-metal stents (BMSs). However, comparative clinical effects of newest-generation DES with biodegradable polymers vs second-generation DES or newest-generation BMS with biocompatible coatings, all combined with a prasugrel-based antiplatelet therapy, on 2-year outcomes are not known. In BASKET-PROVE II, 2,400 patients with de novo lesions in native vessels ≥3 mm in diameter are randomized 1:1:1 to receive a conventional DES, a DES with a biodegradable polymer, or a BMS with biocompatible coating. In addition to aspirin, stable patients with BMS will receive prasugrel for 1 month, whereas all others will receive prasugrel for 12 months. The primary end point will be combined cardiac death, nonfatal myocardial infarction, and target vessel revascularization up to 2 years. Secondary end points include stent thrombosis and major bleeding. The primary aim is to test (1) the noninferiority of a biodegradable-polymer DES to a conventional DES and (2) the superiority of both DESs to BMS. A secondary aim is to compare the outcomes with those of BASKET-PROVE regarding the effects of prasugrel-based vs clopidogrel-based antiplatelet therapy. By the end of 2010, 878 patients (37% of those planned) were enrolled. This study will test the comparative long-term safety and efficacy of newest-generation stents on the background of contemporary antiplatelet therapy in a large all-comer population undergoing large native coronary artery stenting. Copyright © 2012 Mosby, Inc. All rights reserved.
Ball, Susan G; Lipsius, Sarah; Escobar, Rodrigo
2015-09-01
The Generalized Anxiety Inventory (GAI) has been developed for use in the assessment of anxiety symptoms in older adults (≥ 65 years), but previous validation work has not examined the psychometric qualities of the instrument in relation to treatment. The objective of this study was to examine the performance of the GAI for its internal reliability, convergent and divergent validity, and its sensitivity to treatment. Elderly patients with generalized anxiety disorder (GAD) participated in a 10-week double-blind study of duloxetine treatment for patients with GAD. Anxiety symptoms were assessed with the Hamilton Anxiety Rating Scale (HAMA), the Hospital Anxiety and Depression Scale (HADS) anxiety and depression subscales, and the GAI. Internal reliability of the GAI was assessed with Cronbach's α. Correlations among the HAMA, HADS, and GAI scores were analyzed to determine convergent and divergent validity. Patients were also compared on remission status using recommended cut-off scores for the GAI. Patients with GAD, who were at least 65 years of age, were randomly assigned to double-blind treatment with either duloxetine (N = 151) or placebo (N = 140) for 10 weeks acute therapy. The mean change on the GAI was significantly greater with duloxetine compared with placebo treatment (-8.36 vs. -4.96, respectively, p ≤ 0.001). The GAI demonstrated good internal consistency, good convergent and divergent validity, but suggested cut-off values for caseness with the GAI did not correspond to remission status as measured by the HAMA. Within an elderly patient population with GAD, the GAI demonstrated sound psychometric qualities and sensitivity to change with treatment.
Woods, David L.; Wyma, John M.; Herron, Timothy J.; Yund, E. William
2017-01-01
Verbal learning tests (VLTs) are widely used to evaluate memory deficits in neuropsychiatric and developmental disorders. However, their validity has been called into question by studies showing significant differences in VLT scores obtained by different examiners. Here we describe the computerized Bay Area Verbal Learning Test (BAVLT), which minimizes inter-examiner differences by incorporating digital list presentation and automated scoring. In the 10-min BAVLT, a 12-word list is presented on three acquisition trials, followed by a distractor list, immediate recall of the first list, and, after a 30-min delay, delayed recall and recognition. In Experiment 1, we analyzed the performance of 195 participants ranging in age from 18 to 82 years. Acquisition trials showed strong primacy and recency effects, with scores improving over repetitions, particularly for mid-list words. Inter-word intervals (IWIs) increased with successive words recalled. Omnibus scores (summed over all trials except recognition) were influenced by age, education, and sex (women outperformed men). In Experiment 2, we examined BAVLT test-retest reliability in 29 participants tested with different word lists at weekly intervals. High intraclass correlation coefficients were seen for omnibus and acquisition scores, IWIs, and a categorization index reflecting semantic reorganization. Experiment 3 examined the performance of Experiment 2 participants when feigning symptoms of traumatic brain injury. Although 37% of simulated malingerers showed abnormal (p < 0.05) omnibus z-scores, z-score cutoffs were ineffective in discriminating abnormal malingerers from control participants with abnormal scores. In contrast, four malingering indices (recognition scores, primacy/recency effects, learning rate across acquisition trials, and IWIs) discriminated the two groups with 80% sensitivity and 80% specificity. Experiment 4 examined the performance of a small group of patients with mild or severe TBI. Overall, both patient groups performed within the normal range, although significant performance deficits were seen in some patients. The BAVLT improves the speed and replicability of verbal learning assessments while providing comprehensive measures of retrieval timing, semantic organization, and primacy/recency effects that clarify the nature of performance. PMID:28127280
Moncrieff, Joanna; Timimi, Sami
2013-01-01
The current paper presents an analysis of the NICE guidelines on depression and attention deficit hyperactivity disorder (ADHD) from the perspective of the philosophy of science, guided particularly by Foucault's notion of the symbiosis of knowledge and power. It examines how data that challenged the orthodox position on the validity and drug treatment of these conditions was managed in the process of guideline development. The depression guideline briefly considered the complexity and heterogeneity of depression, and numerous methodological problems with evaluating treatments, including antidepressants. However, the guideline recommendations made no reference to these issues and ignored evidence that questioned the analysis of antidepressant trials. The guideline on ADHD reviewed validity, but did not consider evidence from the critical literature, and overlooked inconsistencies in the data. The guideline identified that drug trials have shown no long-term benefit in ADHD, but still recommended treatment with stimulant drugs for children with severe symptoms and for all adults claiming consensus for this position. Both guidelines demonstrate how contradictory data are managed so as not to jeopardise the currently predominant view that ADHD and depression are valid and un-contentious medical conditions that should be treated with drugs. The subjective nature of guideline formation that is revealed illustrates Foucault's suggestion that the authority of medicine operates to promote a technological view of the nature of certain human problems, which in turn strengthens medical hegemony over these areas. PMID:23496174
Designing "Real-World" trials to meet the needs of health policy makers at marketing authorization.
Calvert, Melanie; Wood, John; Freemantle, Nick
2011-07-01
There is increasing interest in conducting "Real-World" trials that go beyond traditional assessment of efficacy and safety to examine market access and value for money questions before marketing authorization of a new pharmaceutical product or health technology. This commentary uses practical examples to demonstrate how high-quality evidence of the cost-effectiveness of an intervention may be gained earlier in the development process. Issues surrounding the design and analysis of "Real-World" trials to demonstrate relative cost-effectiveness early in the life of new technologies are discussed. The modification of traditional phase III trial designs, de novo trial designs, the combination of trial-based and epidemiological data, and the use of simulation model-based approaches to address reimbursement questions are described. Modest changes to a phase III trial protocol and case report form may be undertaken at the design stage to provide valid estimates of health care use and the benefits accrued; however, phase III designs often preclude "real-life" practice. Relatively small de novo trials may be used to address adherence to therapy or patient preference, although simply designed studies with active comparators enrolling large numbers of patients may provide evidence on long-term safety and rare adverse events. Practical examples demonstrate that it is possible to provide high-quality evidence of the cost-effectiveness of an intervention earlier in the development process. Payers and decision makers should preferentially adopt treatments with such evidence than treatments for which evidence is lacking or of lower quality. Copyright © 2011 Elsevier Inc. All rights reserved.
Narayanan, Aqilah Leela T; Hamid, Syed Rasul G Syed; Supriyanto, Eko
2016-01-01
Evidence regarding the effectiveness of incentive spirometry (ISy) on postoperative pulmonary outcomes after thoracic, cardiac and abdominal surgery remains inconclusive. This is attributed to various methodological issues inherent in ISy trials. Patient compliance has also been highlighted as a possible confounding factor; however, the status of evidence regarding patient compliance in these trials is unknown. To explore the status of evidence on patient compliance with ISy interventions in randomized controlled trials (RCTs) in the above contexts. A systematic search using MEDLINE, EMBASE and CINAHL databases was conducted to obtain relevant RCTs from 1972 to 2015 using the inclusion criteria. These were examined for specific ISy parameters, methods used for determining compliance and reporting on compliance. Main outcome measures were comparison of ISy parameters prescribed and assessed, and reporting on compliance. Thirty-six relevant RCTs were obtained. Six ISy parameters were identified in ISy prescriptions from these trials. Almost all (97.2%) of the trials had ISy prescriptions with specific parameters. Wilcoxon signed-rank test revealed that the ISy parameters assessed were significantly lower (Z=-5.433; P<0.001) than those prescribed; 66.7% of the trials indicated use of various methods to assess these parameters. Only six (16.7%) trials included reports on compliance; however, these were also incomprehensive. There is a scarcity and inconsistency of evidence regarding ISy compliance. Compliance data should be obtained using reliable and standardized methods to facilitate comparisons between and among trials. These should be reported comprehensively to facilitate valid inferences regarding ISy intervention effectiveness.
Hou, Fang; Huang, Chang-Bing; Lesmes, Luis; Feng, Li-Xia; Tao, Liming; Zhou, Yi-Feng; Lu, Zhong-Lin
2010-01-01
Purpose. The qCSF method is a novel procedure for rapid measurement of spatial contrast sensitivity functions (CSFs). It combines Bayesian adaptive inference with a trial-to-trial information gain strategy, to directly estimate four parameters defining the observer's CSF. In the present study, the suitability of the qCSF method for clinical application was examined. Methods. The qCSF method was applied to rapidly assess spatial CSFs in 10 normal and 8 amblyopic participants. The qCSF was evaluated for accuracy, precision, test–retest reliability, suitability of CSF model assumptions, and accuracy of amblyopia screening. Results. qCSF estimates obtained with as few as 50 trials matched those obtained with 300 Ψ trials. The precision of qCSF estimates obtained with 120 and 130 trials, in normal subjects and amblyopes, matched the precision of 300 Ψ trials. For both groups and both methods, test–retest sensitivity estimates were well matched (all R > 0.94). The qCSF model assumptions were valid for 8 of 10 normal participants and all amblyopic participants. Measures of the area under log CSF (AULCSF) and the cutoff spatial frequency (cutSF) were lower in the amblyopia group; these differences were captured within 50 qCSF trials. Amblyopia was detected at an approximately 80% correct rate in 50 trials, when a logistic regression model was used with AULCSF and cutSF as predictors. Conclusions. The qCSF method is sufficiently rapid, accurate, and precise in measuring CSFs in normal and amblyopic persons. It has great potential for clinical practice. PMID:20484592
Narayanan, Aqilah Leela T; Hamid, Syed Rasul G Syed; Supriyanto, Eko
2016-01-01
BACKGROUND: Evidence regarding the effectiveness of incentive spirometry (ISy) on postoperative pulmonary outcomes after thoracic, cardiac and abdominal surgery remains inconclusive. This is attributed to various methodological issues inherent in ISy trials. Patient compliance has also been highlighted as a possible confounding factor; however, the status of evidence regarding patient compliance in these trials is unknown. OBJECTIVE: To explore the status of evidence on patient compliance with ISy interventions in randomized controlled trials (RCTs) in the above contexts. METHOD: A systematic search using MEDLINE, EMBASE and CINAHL databases was conducted to obtain relevant RCTs from 1972 to 2015 using the inclusion criteria. These were examined for specific ISy parameters, methods used for determining compliance and reporting on compliance. Main outcome measures were comparison of ISy parameters prescribed and assessed, and reporting on compliance. RESULTS: Thirty-six relevant RCTs were obtained. Six ISy parameters were identified in ISy prescriptions from these trials. Almost all (97.2%) of the trials had ISy prescriptions with specific parameters. Wilcoxon signed-rank test revealed that the ISy parameters assessed were significantly lower (Z=−5.433; P<0.001) than those prescribed; 66.7% of the trials indicated use of various methods to assess these parameters. Only six (16.7%) trials included reports on compliance; however, these were also incomprehensive. CONCLUSIONS: There is a scarcity and inconsistency of evidence regarding ISy compliance. Compliance data should be obtained using reliable and standardized methods to facilitate comparisons between and among trials. These should be reported comprehensively to facilitate valid inferences regarding ISy intervention effectiveness. PMID:26909010
Hamm, Michele P; Scott, Shannon D; Klassen, Terry P; Moher, David; Hartling, Lisa
2012-10-18
Pediatric randomized controlled trials (RCTs) are susceptible to a high risk of bias. We examined the barriers and facilitators that pediatric trialists face in the design and conduct of unbiased trials. We used a mixed methods design, with semi-structured interviews building upon the results of a quantitative survey. We surveyed Canadian (n=253) and international (n=600) pediatric trialists regarding their knowledge and awareness of bias and their perceived barriers and facilitators in conducting clinical trials. We then interviewed 13 participants from different subspecialties and geographic locations to gain a more detailed description of how their experiences and attitudes towards research interacted with trial design and conduct. The survey response rate was 23.0% (186/807). 68.1% of respondents agreed that bias is a problem in pediatric RCTs and 72.0% felt that there is sufficient evidence to support changing some aspects of how trials are conducted. Knowledge related to bias was variable, with inconsistent awareness of study design features that may introduce bias into a study. Interview participants highlighted a lack of formal training in research methods, a negative research culture, and the pragmatics of trial conduct as barriers. Facilitators included contact with knowledgeable and supportive colleagues and infrastructure for research. A lack of awareness of bias and negative attitudes towards research present significant barriers in terms of conducting methodologically rigorous pediatric RCTs. Knowledge translation efforts must focus on these issues to ensure the relevance and validity of trial results.
Gonzalez, Araceli; Rozenman, Michelle; Langley, Audra K; Kendall, Philip C; Ginsburg, Golda S; Compton, Scott; Walkup, John T; Birmaher, Boris; Albano, Anne Marie; Piacentini, John
2017-06-01
Anxiety disorders are among the most common mental health problems in youth, and faulty interpretation bias has been positively linked to anxiety severity, even within anxiety-disordered youth. Quick, reliable assessment of interpretation bias may be useful in identifying youth with certain types of anxiety or assessing changes on cognitive bias during intervention. This study examined the factor structure, reliability, and validity of the Self-report of Ambiguous Social Situations for Youth (SASSY) scale, a self-report measure developed to assess interpretation bias in youth. Participants (N=488, age 7 to 17) met diagnostic criteria for Social Phobia, Generalized Anxiety Disorder, and/or Separation Anxiety Disorder. An exploratory factor analysis was performed on baseline data from youth participating in a large randomized clinical trial. Exploratory factor analysis yielded two factors (Accusation/Blame, Social Rejection). The SASSY full scale and Social Rejection factor demonstrated adequate internal consistency, convergent validity with social anxiety, and discriminant validity as evidenced by non-significant correlations with measures of non-social anxiety. Further, the SASSY Social Rejection factor accurately distinguished children and adolescents with Social Phobia from those with other anxiety disorders, supporting its criterion validity, and revealed sensitivity to changes with treatment. Given the relevance to youth with social phobia, pre- and post-intervention data were examined for youth social phobia to test sensitivity to treatment effects; results suggested that SASSY scores reduced for treatment responders. Findings suggest the potential utility of the SASSY Social Rejection factor as a quick, reliable, and efficient way of assessing interpretation bias in anxious youth, particularly as related to social concerns, in research and clinical settings.
Alayli-Goebbels, Adrienne F G; van Exel, Job; Ament, André J H A; de Vries, Nanne K; Bot, Sandra D M; Severens, Johan L
2015-12-01
To use contingent valuation (CV) to derive individual consumer values for both health and broader benefits of a public-health intervention directed at lifestyle behaviour change (LBC) and to examine the feasibility and validity of the method. Participants of a lifestyle intervention trial (n = 515) were invited to complete an online CV survey. Respondents (n = 312) expressed willingness to invest money and time for changes in life expectancy, health-related quality of life (HRQOL) and broader quality of life aspects. Internal validity was tested for by exploring associations between explanatory variables (i.e. income, paid work, experience and risk factors for cardiovascular diseases) and willingness to invest, and by examining ordering effects and respondents' sensitivity to the scope of the benefits. The majority of respondents (94.3%) attached value to benefits of LBC, and 87.4% were willing to invest both money and time. Respondents were willing to invest more for improvements in HRQOL (€42/month; 3 h/week) and broader quality of life aspects (€40/month; 2.6 h/week) than for improvements in life expectancy (€24/month; 2 h/week). Protest answers were limited (3%) and findings regarding internal validity were mixed. The importance of broader quality of life outcomes to consumers suggests that these outcomes are relevant to be considered in the decision making. Our research showed that CV is a feasible method to value both health and broader outcomes of LBC, but generalizability to other areas of public health still needs to be examined. Mixed evidence regarding internal validity pleads for caution to use CV as only the base for decision making. © 2014 John Wiley & Sons Ltd.
2011-01-01
Background Achieving adequate follow-up in clinical trials is essential to establish the validity of the findings. Achieving adequate response rates reduces bias and increases probability that the findings can be generalized to the population of interest. Therefore, the purpose of this study was to determine the influence of attention, demographic, psychological, and health status factors on web-based response rates in the ongoing Prevention of Low Back Pain in the Military (POLM) trial. Methods Twenty companies of Soldiers (n = 4,325) were cluster randomized to complete a traditional exercise program including sit-ups (TEP) with or without a psychosocial educational program (PSEP) or a core stabilization exercise program (CSEP) with or without PSEP. A subgroup of Soldiers (n = 371) was randomized to receive an additional physical and ultrasound imaging (USI) examination of key trunk musculature. As part of the surveillance program, all Soldiers were encouraged to complete monthly surveys via email during the first year. Descriptive statistics of the predictor variables were obtained and compared between responders and non-responders using two sample t-tests or chi-square test, as appropriate. Generalized linear mixed models were subsequently fitted for the dichotomous outcomes to estimate the effects of the predictor variables. The significance level was set at .05 a priori. Results The overall response rate was 18.9% (811 subjects) for the first year. Responders were more likely to be older, Caucasian, have higher levels of education and income, reservist military status, non smoker, lower BMI, and have received individualized attention via the physical/USI examination (p < .05). Age, race/ethnicity, education, military status, smoking history, BMI, and whether a Soldier received the physical/USI examination remained statistically significant (p < .05) when considered in a full multivariate model. Conclusion The overall web based response rate during the first year of the POLM trial was consistent with studies that used similar methodology, but lower when compared to rates expected for standard clinical trials. One year response rate was significantly associated with demographic characteristics, health status, and individualized attention via additional testing. These data may assist for planning of future trials that use web based response systems. Trial Registration This study has been registered at reports at http://clinicaltrials.gov (NCT00373009). PMID:21668961
Robson, Joanna C.; Milman, Nataliya; Tomasson, Gunnar; Dawson, Jill; Cronholm, Peter F.; Kellom, Katherine; Shea, Judy; Ashdown, Susan; Boers, Maarten; Boonen, Annelies; Casey, George C.; Farrar, John T.; Gebhart, Don; Krischer, Jeffrey; Lanier, Georgia; McAlear, Carol A.; Peck, Jacqueline; Sreih, Antoine G.; Tugwell, Peter; Luqmani, Raashid A.; Merkel, Peter A.
2016-01-01
Objective Antineutrophil cytoplasmic antibody (ANCA)-associated vasculitis (AAV) is a group of linked multisystem life- and organ-threatening diseases. The Outcome Measures in Rheumatology (OMERACT) vasculitis working group has been at the forefront of outcome development in the field and has achieved OMERACT endorsement of a core set of outcomes for AAV. Patients with AAV report as important some manifestations of disease not routinely collected through physician-completed outcome tools; and they rate common manifestations differently from investigators. The core set includes the domain of patient-reported outcomes (PRO). However, PRO currently used in clinical trials of AAV do not fully characterize patients’ perspectives on their burden of disease. The OMERACT vasculitis working group is addressing the unmet needs for PRO in AAV. Methods Current activities of the working group include (1) evaluating the feasibility and construct validity of instruments within the PROMIS (Patient-Reported Outcome Measurement Information System) to record components of the disease experience among patients with AAV; (2) creating a disease-specific PRO measure for AAV; and (3) applying The International Classification of Functioning, Disability and Health to examine the scope of outcome measures used in AAV. Results The working group has developed a comprehensive research strategy, organized an investigative team, included patient research partners, obtained peer-reviewed funding, and is using a considerable research infrastructure to complete these interrelated projects to develop evidence-based validated outcome instruments that meet the OMERACT filter of truth, discrimination, and feasibility. Conclusion The OMERACT vasculitis working group is on schedule to achieve its goals of developing validated PRO for use in clinical trials of AAV. (First Release September 1 2015; J Rheumatol 2015;42:2204–9; doi:10.3899/jrheum.141143) PMID:26329344
Validity and reliability of the Hawaii anaerobic run test.
Kimura, Iris F; Stickley, Christopher D; Lentz, Melissa A; Wages, Jennifer J; Yanagi, Kazuhiko; Hetzler, Ronald K
2014-05-01
This study examined the reliability and validity of the Hawaii anaerobic run test (HART) by comparing anaerobic capacity measures obtained to those during the Wingate Anaerobic Test (WAnT). Ninety-six healthy physically active volunteers (age, 22.0 ± 2.8 years; height, 163.9 ± 9.5 cm; body mass, 70.6 ± 14.7 kg; body fat %, 19.29 ± 5.39%) participated in this study. Each participant performed 2 anaerobic capacity tests: the WAnT and the HART by random assignment on separate days. The reliability of the HART was calculated from 2 separate trials of the test and then determined through intraclass correlation coefficients (ICCs). Blood samples were collected, and lactate was analyzed both pretest and posttest for each of the 2 exercise modes. Heart rate and rate of perceived exertion were also measured pre- and post-exercise. Hawaii anaerobic run test peak and mean momentum were calculated as body mass times highest or average split velocity, respectively. Intraclass correlation coefficients between trials of the HART for peak and mean momentum were 0.98 and 0.99, respectively (SEM = 18.8 and 25.7, respectively). Validity of the HART was established through comparison of momentum on the HART with power on the WAnT. High correlations were found between peak power and peak momentum (r = 0.88), as well as mean power and mean momentum (r = 0.94). The HART was considered to be a reliable test of anaerobic power. The HART was also determined to be a valid test of anaerobic power when compared with the WAnT. When testing healthy college-aged individuals, the HART offers an easy and inexpensive alternative maximal effort anaerobic power test to other established tests.
Wong, Carlos K H; Lang, Brian H H; Yu, Hill M S; Lam, Cindy L K
2017-08-01
The aim of this study was to examine the acceptability, validity, and reliability of the EuroQoL Five-Dimension Five-Level (EQ-5D-5L) and Short-Form Six-Dimension (SF-6D) health utility measures in patients with symptomatic benign thyroid nodules. Data from a randomized controlled trial (ClinicalTrials.gov identifier: NCT02398721) of 294 patients with symptomatic benign thyroid nodules were utilized for this psychometric evaluation of health-related quality of life (HR-QOL) measurement. Three HR-QOL questionnaires-the generic 12-item Short Form Health Survey (SF-12v2), EQ-5D-5L, and SF-6D-were interviewer-administered at baseline and 2 weeks afterwards. Responses to SF-6D were transformed to SF-6D utility scores using a Hong Kong population scoring algorithm derived by standard gamble, whereas responses to EQ-5D-5L were mapped onto EQ-5D-3L response via interim mapping algorithms and then converted to EQ-5D-5L utility scores using a Chinese-specific value set. Construct validity was determined by evaluating Spearman correlation between SF-12v2 scores and utility scores. Two-week test-retest reliability was assessed using intra-class correlation coefficient. No significant (>15%) floor and ceiling effects were observed for SF-6D utility scores. The SF-6D utility scores had a moderate Spearman rank correlation with the SF-12v2 domain score providing evidence for adequate construct validity. The SF-6D utility scores showed good test-retest reliability (0.794; range 0.696-0.860). Better reliability was observed in SF-6D utility scores than in EQ-5D-5L utility scores. While the EQ-5D-5L instrument was less reproducible, the SF-6D instrument appeared to be an applicable, valid, and reliable measure in assessing the HR-QOL of Chinese patients with symptomatic benign thyroid nodules. The impact of utility score selection on the effectiveness and cost effectiveness of clinical interventions targeted to these patients needs further exploration. NCT02398721, ClinicalTrials.gov.
Giaddui, Tawfik; Yu, Jialu; Manfredi, Denise; Linnemann, Nancy; Hunter, Joanne; O'Meara, Elizabeth; Galvin, James; Bialecki, Brian; Xiao, Ying
2016-01-01
Transmission of Imaging and Data (TRIAD) is a standard-based system built by the American College of Radiology to provide the seamless exchange of images and data for accreditation of clinical trials and registries. Scripts of structures' names validation profiles created in TRIAD are used in the automated submission process. It is essential for users to understand the logistics of these scripts for successful submission of radiation therapy cases with less iteration. Copyright © 2016 American Society for Radiation Oncology. Published by Elsevier Inc. All rights reserved.
Dougados, Maxime; Jousse-Joulin, Sandrine; Mistretta, Frederic; d'Agostino, Maria-Antonietta; Backhaus, Marina; Bentin, Jacques; Chalès, Gérard; Chary-Valckenaere, Isabelle; Conaghan, Philip; Etchepare, Fabien; Gaudin, Philippe; Grassi, Walter; van der Heijde, Désirée; Sellam, Jérémie; Naredo, Esperanza; Szkudlarek, Marcin; Wakefield, Richard; Saraux, Alain
2010-05-01
To evaluate different global ultrasonographic (US) synovitis scoring systems as potential outcome measures of rheumatoid arthritis (RA) according to the Outcome Measures in Rheumatoid Arthritis Clinical Trials (OMERACT) filter. To study selected global scoring systems, for the clinical, B mode and power Doppler techniques, the following joints were evaluated: 28 joints (28-joint Disease Activity Score (DAS28)), 20 joints (metacarpophalangeals (MCPs) + metatarsophalangeals (MTPs)) and 38 joints (28 joints + MTPs) using either a binary (yes/no) or a 0-3 grade. The study was a prospective, 4-month duration follow-up of 76 patients with RA requiring anti-tumour necrosis factor (TNF) therapy (complete follow-up data: 66 patients). Intraobserver reliability was evaluated using the intraclass correlation coefficient (ICC), construct validity was evaluated using the Cronbach alpha test and external validity was evaluated using level of correlation between scoring system and C reactive protein (CRP). Sensitivity to change was evaluated using the standardised response mean. Discriminating capacity was evaluated using the standardised mean differences in patients considered by the doctor as significantly improved or not at the end of the study. Different clinimetric properties of various US scoring systems were at least as good as the clinical scores with, for example, intraobserver reliability ranging from 0.61 to 0.97 versus from 0.53 to 0.82, construct validity ranging from 0.76 to 0.89 versus from 0.76 to 0.88, correlation with CRP ranging from 0.28 to 0.34 versus from 0.28 to 0.35 and sensitivity to change ranging from 0.60 to 1.21 versus from 0.96 to 1.36 for US versus clinical scoring systems, respectively. This study suggests that US evaluation of synovitis is an outcome measure at least as relevant as physical examination. Further studies are required in order to achieve optimal US scoring systems for monitoring patients with RA in clinical trials and in clinical practice.
Reliability and Validity of the TIMPSI for Infants With Spinal Muscular Atrophy Type I
Krosschell, Kristin J.; Maczulski, Jo Anne; Scott, Charles; King, Wendy; Hartman, Jill T.; Case, Laura E.; Viazzo-Trussell, Donata; Wood, Janine; Roman, Carolyn A.; Hecker, Eva; Meffert, Marianne; Léveillé, Maude; Kienitz, Krista; Swoboda, Kathryn J.
2014-01-01
Purpose This study examined the reliability and validity of the Test of Infant Motor Performance Screening Items (TIMPSI) in infants with type I spinal muscular atrophy (SMA). Methods After training, 12 evaluators scored 4 videos of infants with type I SMA to assess interrater reliability. Intrarater and test-retest reliability was further assessed for 9 evaluators during a SMA type I clinical trial, with 9 evaluators testing a total of 38 infants twice. Relatedness of the TIMPSI score to ability to reach and ventilatory support was also examined. Results Excellent interrater video score reliability was noted (intraclass correlation coefficient, 0.97–0.98). Intrarater reliability was excellent (intraclass correlation coefficient, 0.91–0.98) and test-retest reliability ranged from r = 0.82 to r = 0.95. The TIMPSI score was related to the ability to reach (P ≤ .05). Conclusion The TIMPSI can reliably be used to assess motor function in infants with type I SMA. In addition, the TIMPSI scores are related to the ability to reach, an important functional skill in children with type I SMA. PMID:23542189
Fokas, E; Fietkau, R; Hartmann, A; Hohenberger, W; Grützmann, R; Ghadimi, M; Liersch, T; Ströbel, P; Grabenbauer, G G; Graeven, U; Hofheinz, R-D; Köhne, C-H; Wittekind, C; Sauer, R; Kaufmann, M; Hothorn, T; Rödel, C
2018-04-27
Surrogate endpoints in rectal cancer after preoperative chemoradiation are lacking as their statistical validation poses major challenges, including confirmation based on large phase 3 trials. We examined the prognostic role and individual-level surrogacy of neoadjuvant rectal (NAR) score that incorporates weighted cT, ypT and ypN categories for disease-free survival (DFS) in 1191 patients with rectal carcinoma treated within the CAO/ARO/AIO-04 phase 3 trial. Cox regression models adjusted for treatment arm, resection status, and NAR score were used in multivariable analysis. The four Prentice criteria (PC1-4) were used to assess individual-level surrogacy of NAR for DFS. After a median follow-up of 50 months, the addition of oxaliplatin to fluorouracil-based chemoradiotherapy (CRT) significantly improved 3-year DFS (75.9% [95% CI 72.30-79.50] vs 71.3% [95% CI 67.60-74.90]; P = 0.034; PC 1) and resulted in a shift towards lower NAR groups (P = 0.034, PC 2) compared to fluorouracil-only CRT. The 3-year DFS was 91.7% (95% CI, 88.2 95.2), 81.8% (95% CI, 78.4-85.1) and 58.1 (95% CI 52.4-63.9) for low, intermediate and high NAR score, respectively (P < 0.001; PC 3). NAR score remained an independent prognostic factor for DFS (low vs high NAR: HR 4.670; 95% CI 3.106-7.020; P < 0.001; low vs intermediate NAR: HR 1.971; 95% CI 1.303-2.98; P = 0.001) in multivariable analysis. Notwithstanding the inherent methodological difficulty in interpretation of PC 4 to establish surrogacy, the treatment effect on DFS was captured by NAR, supporting satisfaction of individual-level PC4. Our study validates the prognostic role and individual-level surrogacy of NAR score for DFS within a large randomized phase 3 trial. NAR score could help oncologists to speed up response-adapted therapeutic decision, and further large phase 3 trial datasets should aim to confirm trial-level surrogacy.
Clayson, Peter E; Larson, Michael J
2012-05-01
Psychiatric and neurologic disorders are associated with deficits in the postconflict recruitment of cognitive control. The primary aim of this study was to validate the relationship between cognitive functioning and indices of conflict adaptation. Event-related potentials were obtained from 89 healthy individuals who completed an Eriksen flanker task. Neuropsychological domains tested included memory, verbal fluency, and attention/executive functioning. Behavioral measures and N2 amplitudes showed significant conflict adaptation (i.e., previous-trial congruencies influenced current-trial measures). Higher scores on the attention/executive functioning and verbal fluency domains were associated with larger incongruent-trial N2 conflict adaptation; measures of cognitive functioning were not related to behavioral indices. This study provides initial validation of N2 conflict adaptation effects as cognitive function-related aspects of cognitive control. Copyright © 2012 Society for Psychophysiological Research.
Getting inside acupuncture trials - Exploring intervention theory and rationale
2011-01-01
Background Acupuncture can be described as a complex intervention. In reports of clinical trials the mechanism of acupuncture (that is, the process by which change is effected) is often left unstated or not known. This is problematic in assisting understanding of how acupuncture might work and in drawing together evidence on the potential benefits of acupuncture. Our aim was to aid the identification of the assumed mechanisms underlying the acupuncture interventions in clinical trials by developing an analytical framework to differentiate two contrasting approaches to acupuncture (traditional acupuncture and Western medical acupuncture). Methods Based on the principles of realist review, an analytical framework to differentiate these two contrasting approaches was developed. In order to see how useful the framework was in uncovering the theoretical rationale, it was applied to a set of trials of acupuncture for fatigue and vasomotor symptoms, identified from a wider literature review of acupuncture and early stage breast cancer. Results When examined for the degree to which a study demonstrated adherence to a theoretical model, two of the fourteen selected studies could be considered TA, five MA, with the remaining seven not fitting into any recognisable model. When examined by symptom, five of the nine vasomotor studies, all from one group of researchers, are arguably in the MA category, and two a TA model; in contrast, none of the five fatigue studies could be classed as either MA or TA and all studies had a weak rationale for the chosen treatment for fatigue. Conclusion Our application of the framework to the selected studies suggests that it is a useful tool to help uncover the therapeutic rationale of acupuncture interventions in clinical trials, for distinguishing between TA and MA approaches and for exploring issues of model validity. English language acupuncture trials frequently fail to report enough detail relating to the intervention. We advocate using this framework to aid reporting, along with further testing and refinement of the framework. PMID:21414187
Dewhurst, Tahnee N; Found, Bryan; Ballantyne, Kaye N; Rogers, Doug
2014-03-01
Expertise studies in forensic handwriting examination involve comparisons of Forensic Handwriting Examiners' (FHEs) opinions with lay-persons on blind tests. All published studies of this type have reported real and demonstrable skill differences between the specialist and lay groups. However, critics have proposed that any difference shown may be indicative of a lack of motivation on the part of lay participants, rather than a real difference in skill. It has been suggested that qualified FHEs would be inherently more motivated to succeed in blinded validation trials, as their professional reputations could be at risk, should they perform poorly on the task provided. Furthermore, critics suggest that lay-persons would be unlikely to be highly motivated to succeed, as they would have no fear of negative consequences should they perform badly. In an effort to investigate this concern, a blind signature trial was designed and administered to forty lay-persons. Participants were required to compare known (exemplar) signatures of an individual to questioned signatures and asked to express an opinion regarding whether the writer of the known signatures wrote each of the questioned signatures. The questioned signatures comprised a mixture of genuine, disguised and simulated signatures. The forty participants were divided into two separate groupings. Group 'A' were requested to complete the trial as directed and were advised that for each correct answer they would be financially rewarded, for each incorrect answer they would be financially penalized, and for each inconclusive opinion they would receive neither penalty nor reward. Group 'B' was requested to complete the trial as directed, with no mention of financial recompense or penalty. The results of this study do not support the proposition that motivation rather than skill difference is the source of the statistical difference in opinions between individuals' results in blinded signature proficiency trials. Crown Copyright © 2014. Published by Elsevier Ireland Ltd. All rights reserved.
Horan, William P; Reddy, L Felice; Barch, Deanna M; Buchanan, Robert W; Dunayevich, Eduardo; Gold, James M; Marder, Steven R; Wynn, Jonathan K; Young, Jared W; Green, Michael F
2015-09-01
Effort-based decision making has strong conceptual links to the motivational disturbances that define a key subdomain of negative symptoms. However, the extent to which effort-based decision-making performance relates to negative symptoms, and other clinical and functionally important variables has yet to be systematically investigated. In 94 clinically stable outpatients with schizophrenia, we examined the external validity of 5 effort-based paradigms, including the Effort Expenditure for Rewards, Balloon Effort, Grip Strength Effort, Deck Choice Effort, and Perceptual Effort tasks. These tasks covered 3 types of effort: physical, cognitive, and perceptual. Correlations between effort related performance and 6 classes of variables were examined, including: (1) negative symptoms, (2) clinically rated motivation and community role functioning, (3) self-reported motivational traits, (4) neurocognition, (5) other psychiatric symptoms and clinical/demographic characteristics, and (6) subjective valuation of monetary rewards. Effort paradigms showed small to medium relationships to clinical ratings of negative symptoms, motivation, and functioning, with the pattern more consistent for some measures than others. They also showed small to medium relations with neurocognitive functioning, but were generally unrelated to other psychiatric symptoms, self-reported traits, antipsychotic medications, side effects, and subjective valuation of money. There were relatively strong interrelationships among the effort measures. In conjunction with findings from a companion psychometric article, all the paradigms warrant further consideration and development, and 2 show the strongest potential for clinical trial use at this juncture. © The Author 2015. Published by Oxford University Press on behalf of the Maryland Psychiatric Research Center. All rights reserved. For permissions, please email: journals.permissions@oup.com.
Frost, Rachael; Levati, Sara; McClurg, Doreen; Brady, Marian; Williams, Brian
2017-06-01
To systematically review methods for measuring adherence used in home-based rehabilitation trials and to evaluate their validity, reliability, and acceptability. In phase 1 we searched the CENTRAL database, NHS Economic Evaluation Database, and Health Technology Assessment Database (January 2000 to April 2013) to identify adherence measures used in randomized controlled trials of allied health professional home-based rehabilitation interventions. In phase 2 we searched the databases of MEDLINE, Embase, CINAHL, Allied and Complementary Medicine Database, PsycINFO, CENTRAL, ProQuest Nursing and Allied Health, and Web of Science (inception to April 2015) for measurement property assessments for each measure. Studies assessing the validity, reliability, or acceptability of adherence measures. Two reviewers independently extracted data on participant and measure characteristics, measurement properties evaluated, evaluation methods, and outcome statistics and assessed study quality using the COnsensus-based Standards for the selection of health Measurement INstruments checklist. In phase 1 we included 8 adherence measures (56 trials). In phase 2, from the 222 measurement property assessments identified in 109 studies, 22 high-quality measurement property assessments were narratively synthesized. Low-quality studies were used as supporting data. StepWatch Activity Monitor validly and acceptably measured short-term step count adherence. The Problematic Experiences of Therapy Scale validly and reliably assessed adherence to vestibular rehabilitation exercises. Adherence diaries had moderately high validity and acceptability across limited populations. The Borg 6 to 20 scale, Bassett and Prapavessis scale, and Yamax CW series had insufficient validity. Low-quality evidence supported use of the Joint Protection Behaviour Assessment. Polar A1 series heart monitors were considered acceptable by 1 study. Current rehabilitation adherence measures are limited. Some possess promising validity and acceptability for certain parameters of adherence, situations, and populations and should be used in these situations. Rigorous evaluation of adherence measures in a broader range of populations is needed. Copyright © 2016 American Congress of Rehabilitation Medicine. Published by Elsevier Inc. All rights reserved.
Exploring threats to generalisability in a large international rehabilitation trial (AVERT).
Bernhardt, Julie; Raffelt, Audrey; Churilov, Leonid; Lindley, Richard I; Speare, Sally; Ancliffe, Jacqueline; Katijjahbe, Md Ali; Hameed, Shahul; Lennon, Sheila; McRae, Anna; Tan, Dawn; Quiney, Jan; Williamson, Hannah C; Collier, Janice; Dewey, Helen M; Donnan, Geoffrey A; Langhorne, Peter; Thrift, Amanda G
2015-08-17
The purpose of this paper is to examine potential threats to generalisability of the results of a multicentre randomised controlled trial using data from A Very Early Rehabilitation Trial (AVERT). AVERT is a prospective, parallel group, assessor-blinded randomised clinical trial. This paper presents data assessing the generalisability of AVERT. Acute stroke units at 44 hospitals in 8 countries. The first 20,000 patients screened for AVERT, of whom 1158 were recruited and randomised. We use the Proximal Similarity Model, which considers the person, place, and setting and practice, as a framework for considering generalisability. As well as comparing the recruited patients with the target population, we also performed an exploratory analysis of the demographic, clinical, site and process factors associated with recruitment. The demographics and stroke characteristics of the included patients in the trial were broadly similar to population-based norms, with the exception that AVERT had a greater proportion of men. The most common reason for non-recruitment was late arrival to hospital (ie, >24 h). Overall, being older and female reduced the odds of recruitment to the trial. More women than men were excluded for most of the reasons, including refusal. The odds of exclusion due to early deterioration were particularly high for those with severe stroke (OR=10.4, p<0.001, 95% CI 9.27 to 11.65). A model which explores person, place, and setting and practice factors can provide important information about the external validity of a trial, and could be applied to other clinical trials. Australian New Zealand Clinical Trials Registry (ACTRN12606000185561) and Clinicaltrials.gov (NCT01846247). Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions.
Development and validation of a cancer-specific swallowing assessment tool: MASA-C.
Carnaby, Giselle D; Crary, Michael A
2014-03-01
We present data from a sample of patients receiving radiotherapy for head/neck cancer to define and measure the validity of a new clinical assessment measure for swallowing. Fifty-eight patients undergoing radiotherapy (±chemotherapy) for head/neck cancer (HNC) supported the development of a physiology-based assessment tool of swallowing (Mann Assessment of Swallowing Ability--Cancer: MASA-C) administered at two time points (baseline and following radiotherapy treatment). The new exam was evaluated for internal consistency of items using Cronbach's alpha. Reliability of measurement was evaluated with intraclass correlation (ICC) and the Kappa statistic between two independent raters. Concurrent validity was established through comparison with the original MASA examination and against the referent standard videofluoroscopic swallowing examination (VFE). Sensitivity, specificity, and likelihood ratios along with 95 % confidence intervals (CIs) were derived for comparison of the two evaluation forms (MASA vs. MASA-C). Accuracy of diagnostic precision was displayed using receiver operator characteristic curves. The new MASA-C tool demonstrated superior validity to the original MASA examination applied to a HNC population. In comparison to the VFE referent exam, the MASA-C revealed strong sensitivity and specificity (Se 83, Sp 96), predictive values (positive predictive value (PPV) 0.95, negative predictive value (NPV) 0.86), and likelihood ratios (21.6). In addition, it demonstrated good reliability (ICC = 0.96) between speech-language pathology raters. The MASA-C is a reliable and valid scale that is sensitive to differences in swallowing performance in HNC patients with and without dysphagia. Future longitudinal evaluation of this tool in larger samples is suggested. The development and refinement of this swallowing assessment tool for use in multidisciplinary HNC teams will facilitate earlier identification of patients with swallowing difficulties and enable more efficient allocation of resources to the management of dysphagia in this population. The MASA-C may also prove useful in future clinical HNC rehabilitation trials with this population.
Assessing Risk/Benefit for Trials Using Preclinical Evidence: A Proposal
Kimmelman, Jonathan; Henderson, Valerie C.
2015-01-01
Abstract Moral evaluation of risk/benefit in early phase studies requires assessing the clinical promise of a candidate intervention using preclinical evidence. Yet there is little to guide ethics committees, investigators, sponsors or other stakeholders morally charged with making these assessments (“evaluators”). In what follows, we draw on published guidelines for preclinical study design to develop a structured process for assessing the clinical promise of new interventions. In the first step, evaluators gather all relevant preclinical studies, assess the magnitude of treatment effects, and determine clinical promise in light of various threats to valid clinical inference. In the second step, evaluators adjust assessments of clinical promise from preclinical studies by examining how other agents in the same reference class-and supported by similar evidence- have fared in clinical development. Assessments of clinical promise can then be fed into moral evaluation of risk and benefit in early phase trials. Though our approach has limitations, it offers a systematic and transparent method for assessing risk/benefit in early phase trials of novel interventions. PMID:26463620
Measuring Anxiety as a Treatment Endpoint in Youth with Autism Spectrum Disorder
Lecavalier, Luc; Wood, Jeffrey J.; Halladay, Alycia K.; Jones, Nancy E.; Aman, Michael G.; Cook, Edwin H.; Handen, Benjamin L.; King, Bryan H.; Pearson, Deborah A.; Hallett, Victoria; Sullivan, Katherine Anne; Grondhuis, Sabrina; Bishop, Somer L.; Horrigan, Joseph P.; Dawson, Geraldine; Scahill, Lawrence
2013-01-01
Despite the high rate of anxiety in individuals with autism spectrum disorder (ASD), measuring anxiety in ASD is fraught with uncertainty. This is due, in part, to incomplete consensus on the manifestations of anxiety in this population. Autism Speaks assembled a panel of experts to conduct a systematic review of available measures for anxiety in youth with ASD. To complete the review, the panel held monthly conference calls and two face-to-face meetings over a fourteen-month period. Thirty eight published studies were reviewed and ten assessment measures were examined: four were deemed appropriate for use in clinical trials, although with conditions; three were judged to be potentially appropriate, while three were considered not useful for clinical trials assessing anxiety. Despite recent advances, additional relevant, reliable and valid outcome measures are needed to evaluate treatments for anxiety in ASD. PMID:24158679
Koontz, Alicia M.; Lin, Yen-Sheng; Kankipati, Padmaja; Boninger, Michael L.; Cooper, Rory A.
2017-01-01
This study describes a new custom measurement system designed to investigate the biomechanics of sitting-pivot wheelchair transfers and assesses the reliability of selected biomechanical variables. Variables assessed include horizontal and vertical reaction forces underneath both hands and three-dimensional trunk, shoulder, and elbow range of motion. We examined the reliability of these measures between 5 consecutive transfer trials for 5 subjects with spinal cord injury and 12 non-disabled subjects while they performed a self-selected sitting pivot transfer from a wheelchair to a level bench. A majority of the biomechanical variables demonstrated moderate to excellent reliability (r > 0.6). The transfer measurement system recorded reliable and valid biomechanical data for future studies of sitting-pivot wheelchair transfers. We recommend a minimum of five transfer trials to obtain a reliable measure of transfer technique for future studies. PMID:22068376
Fuertinger, Doris H; Topping, Alice; Kappel, Franz; Thijssen, Stephan; Kotanko, Peter
2018-04-01
In silico approaches have been proposed as a novel strategy to increase the repertoire of clinical trial designs. Realistic simulations of clinical trials can provide valuable information regarding safety and limitations of treatment protocols and have been shown to assist in the cost-effective planning of clinical studies. In this report, we present a blueprint for the stepwise integration of internal, external, and ecological validity considerations in virtual clinical trials (VCTs). We exemplify this approach in the context of a model-based in silico clinical trial aimed at anemia treatment in patients undergoing hemodialysis (HD). Hemoglobin levels and subsequent anemia treatment were simulated on a per patient level over the course of a year and compared to real-life clinical data of 79,426 patients undergoing HD. The novel strategies presented here, aimed to improve external and ecological validity of a VCT, significantly increased the predictive power of the discussed in silico trial. © 2018 The Authors CPT: Pharmacometrics & Systems Pharmacology published by Wiley Periodicals, Inc. on behalf of American Society for Clinical Pharmacology and Therapeutics.
Purmann, Sascha; Pollmann, Stefan
2015-01-01
To process information selectively and to continuously fine-tune selectivity of information processing are important abilities for successful goal-directed behavior. One phenomenon thought to represent this fine-tuning are conflict adaptation effects in interference tasks, i.e., reduction of interference after an incompatible trial and when incompatible trials are frequent. The neurocognitive mechanisms of these effects are currently only partly understood and results from brainimaging studies so far are mixed. In our study we validate and extend recent findings by examining adaption to recent conflict in the classical Stroop task using functional magnetic resonance imaging. Consistent with previous research we found increased activity in a fronto-parietal network comprising the medial prefrontal cortex, ventro-lateral prefrontal cortex, and posterior parietal cortex when contrasting incompatible with compatible trials. These areas have been associated with attentional processes and might reflect increased cognitive conflict and resolution thereof during incompatible trials. While carefully controlling for non-attentional sequential effects we found smaller Stroop interference after an incompatible trial (conflict adaptation effect). These behavioral conflict adaptation effects were accompanied by changes in activity in visual color-selective areas (V4, V4α), while there was no modulation by previous trial compatibility in a visual word-selective area (VWFA). Our results provide further evidence for the notion, that adaptation to recent conflict seems to be based mainly on enhancement of processing of the task-relevant information.
A systematic review of models to predict recruitment to multicentre clinical trials.
Barnard, Katharine D; Dent, Louise; Cook, Andrew
2010-07-06
Less than one third of publicly funded trials managed to recruit according to their original plan often resulting in request for additional funding and/or time extensions. The aim was to identify models which might be useful to a major public funder of randomised controlled trials when estimating likely time requirements for recruiting trial participants. The requirements of a useful model were identified as usability, based on experience, able to reflect time trends, accounting for centre recruitment and contribution to a commissioning decision. A systematic review of English language articles using MEDLINE and EMBASE. Search terms included: randomised controlled trial, patient, accrual, predict, enroll, models, statistical; Bayes Theorem; Decision Theory; Monte Carlo Method and Poisson. Only studies discussing prediction of recruitment to trials using a modelling approach were included. Information was extracted from articles by one author, and checked by a second, using a pre-defined form. Out of 326 identified abstracts, only 8 met all the inclusion criteria. Of these 8 studies examined, there are five major classes of model discussed: the unconditional model, the conditional model, the Poisson model, Bayesian models and Monte Carlo simulation of Markov models. None of these meet all the pre-identified needs of the funder. To meet the needs of a number of research programmes, a new model is required as a matter of importance. Any model chosen should be validated against both retrospective and prospective data, to ensure the predictions it gives are superior to those currently used.
2011-01-01
Background Studies suggests a possible link between vaginal discharge and common mental distress, as well as highlight the implications of the subjective burden of disease and its link with mental health. Methods/Design This is a community-based intervention trial that aims to evaluate the impact of a psycho-social intervention on medically unexplained vaginal discharge (MUVD) in a group of married, low-income Lebanese women, aged 18-49, and suffering from low to moderate levels of anxiety and/or depression. The intervention consisted of 12 sessions of structured social support, problem solving techniques, group discussions and trainer-supervised relaxation exercises (twice per week over six weeks). Women were recruited from Hey el Selloum, a southern disadvantaged suburb of Beirut, Lebanon, during an open recruitment campaign. The primary outcome was self-reported MUVD, upon ruling out reproductive tract infections (RTIs), through lab analysis. Anxiety and/or depression symptoms were the secondary outcomes for this trial. These were assessed using an Arabic validated version of the Hopkins Symptoms Checklist-25 (HSCL-25). Assessments were done at baseline and six months using face-to face interviews, pelvic examinations and laboratory tests. Women were randomized into either intervention or control group. Intent to treat analysis will be used. Discussion The results will indicate whether the proposed psychosocial intervention was effective in reducing MUVD (possibly mediated by common mental distress). Trial Registration The trial is registered at the Wellcome Trust Registry, ISRCTN assigned: ISRCTN: ISRCTN98441241 PMID:21864414
Campos, Nicole G.; Castle, Philip E.; Schiffman, Mark; Kim, Jane J.
2013-01-01
Background Although the randomized controlled trial (RCT) is widely considered the most reliable method for evaluation of health care interventions, challenges to both internal and external validity exist. Thus, the efficacy of an intervention in a trial setting does not necessarily represent the real-world performance that decision makers seek to inform comparative effectiveness studies and economic evaluations. Methods Using data from the ASCUS-LSIL Triage Study (ALTS), we performed a simplified economic evaluation of age-based management strategies to detect cervical intraepithelial neoplasia grade 3 (CIN3) among women who were referred to the study with low-grade squamous intraepithelial lesions (LSIL). We used data from the trial itself to adjust for 1) potential lead time bias and random error that led to variation in the observed prevalence of CIN3 by study arm, and 2) potential ascertainment bias among providers in the most aggressive management arm. Results We found that using unadjusted RCT data may result in counterintuitive cost-effectiveness results when random error and/or bias are present. Following adjustment, the rank order of management strategies changed for two of the three age groups we considered. Conclusion Decision analysts need to examine study design, available trial data and cost-effectiveness results closely in order to detect evidence of potential bias. Adjustment for random error and bias in RCTs may yield different policy conclusions relative to unadjusted trial data. PMID:22147881
Two Decades of Cardiovascular Trials With Primary Surrogate Endpoints: 1990-2011.
Bikdeli, Behnood; Punnanithinont, Natdanai; Akram, Yasir; Lee, Ike; Desai, Nihar R; Ross, Joseph S; Krumholz, Harlan M
2017-03-21
Surrogate endpoint trials test strategies more efficiently but are accompanied by uncertainty about the relationship between changes in surrogate markers and clinical outcomes. We identified cardiovascular trials with primary surrogate endpoints published in the New England Journal of Medicine , Lancet , and JAMA: Journal of the American Medical Association from 1990 to 2011 and determined the trends in publication of surrogate endpoint trials and the success of the trials in meeting their primary endpoints. We tracked for publication of clinical outcome trials on the interventions tested in surrogate trials. We screened 3016 articles and identified 220 surrogate endpoint trials. From the total of 220 surrogate trials, 157 (71.4%) were positive for their primary endpoint. Only 59 (26.8%) surrogate trials had a subsequent clinical outcomes trial. Among these 59 trials, 24 outcomes trial results validated the positive surrogates, whereas 20 subsequent outcome trials were negative following positive results on a surrogate. We identified only 3 examples in which the surrogate trial was negative but a subsequent outcomes trial was conducted and showed benefit. Findings were consistent in a sample cohort of 383 screened articles inclusive of 37 surrogate endpoint trials from 6 other high-impact journals. Although cardiovascular surrogate outcomes trials frequently show superiority of the tested intervention, they are infrequently followed by a prominent outcomes trial. When there was a high-profile clinical outcomes study, nearly half of the positive surrogate trials were not validated. Cardiovascular surrogate outcome trials may be more appropriate for excluding benefit from the patient perspective than for identifying it. © 2017 The Authors. Published on behalf of the American Heart Association, Inc., by Wiley Blackwell.
A systematic review of the quality of homeopathic clinical trials
Jonas, Wayne B; Anderson, Rachel L; Crawford, Cindy C; Lyons, John S
2001-01-01
Background While a number of reviews of homeopathic clinical trials have been done, all have used methods dependent on allopathic diagnostic classifications foreign to homeopathic practice. In addition, no review has used established and validated quality criteria allowing direct comparison of the allopathic and homeopathic literature. Methods In a systematic review, we compared the quality of clinical-trial research in homeopathy to a sample of research on conventional therapies using a validated and system-neutral approach. All clinical trials on homeopathic treatments with parallel treatment groups published between 1945–1995 in English were selected. All were evaluated with an established set of 33 validity criteria previously validated on a broad range of health interventions across differing medical systems. Criteria covered statistical conclusion, internal, construct and external validity. Reliability of criteria application is greater than 0.95. Results 59 studies met the inclusion criteria. Of these, 79% were from peer-reviewed journals, 29% used a placebo control, 51% used random assignment, and 86% failed to consider potentially confounding variables. The main validity problems were in measurement where 96% did not report the proportion of subjects screened, and 64% did not report attrition rate. 17% of subjects dropped out in studies where this was reported. There was practically no replication of or overlap in the conditions studied and most studies were relatively small and done at a single-site. Compared to research on conventional therapies the overall quality of studies in homeopathy was worse and only slightly improved in more recent years. Conclusions Clinical homeopathic research is clearly in its infancy with most studies using poor sampling and measurement techniques, few subjects, single sites and no replication. Many of these problems are correctable even within a "holistic" paradigm given sufficient research expertise, support and methods. PMID:11801202
Kawata, Ariane K; Wilson, Hilary; Ong, Siew Hwa; Kulich, Karoly; Coyne, Karin
2016-10-01
The aim of this study was to evaluate the factor structure and psychometric characteristics of the Hypoglycemia Perspectives Questionnaire (HPQ) assessing experience and perceptions of hypoglycemia in patients with type 2 diabetes mellitus (T2DM). HPQ was administered to adults with T2DM in a clinical sample from Cyprus (HYPO-Cyprus, n = 500) and a community sample in the United States (US, n = 1257) from the 2011 US National Health and Wellness Survey. Demographic and clinical data were collected. Analysis of HPQ data from two convenience samples examined item performance, factor structure, and HPQ measurement properties (reliability, convergent validity, known-groups validity). Analyses supported three HPQ domains: symptom concern (six items), compensatory behavior (five items), and worry (five items). Internal consistency was high for all three domains (all ≥0.75), supporting reliability. Convergent validity was supported by moderate Spearman correlations between HPQ domain scores and the Audit of Diabetes-Dependent Quality of Life (ADDQoL-19) total score. Patients with recent hypoglycemia events had significantly higher HPQ scores, supporting known-group validity. HPQ may be a valid and reliable measure capturing the experience and impact of hypoglycemia and useful in clinical trials and community-based settings.
Glenn, Beth A.; Bastani, Roshan; Maxwell, Annette E.
2013-01-01
Objective Threats to external validity including pretest sensitization and the interaction of selection and an intervention are frequently overlooked by researchers despite their potential to significantly influence study outcomes. The purpose of this investigation was to conduct secondary data analyses to assess the presence of external validity threats in the setting of a randomized trial designed to promote mammography use in a high risk sample of women. Design During the trial, recruitment and intervention implementation took place in three cohorts (with different ethnic composition), utilizing two different designs (pretest-posttest control group design; posttest only control group design). Results Results reveal that the intervention produced different outcomes across cohorts, dependent upon the research design used and the characteristics of the sample. Conclusion These results illustrate the importance of weighing the pros and cons of potential research designs before making a selection and attending more closely to issues of external validity. PMID:23289517
Glenn, Beth A; Bastani, Roshan; Maxwell, Annette E
2013-01-01
Threats to external validity, including pretest sensitisation and the interaction of selection and an intervention, are frequently overlooked by researchers despite their potential to significantly influence study outcomes. The purpose of this investigation was to conduct secondary data analyses to assess the presence of external validity threats in the setting of a randomised trial designed to promote mammography use in a high-risk sample of women. During the trial, recruitment and intervention, implementation took place in three cohorts (with different ethnic composition), utilising two different designs (pretest-posttest control group design and posttest only control group design). Results reveal that the intervention produced different outcomes across cohorts, dependent upon the research design used and the characteristics of the sample. These results illustrate the importance of weighing the pros and cons of potential research designs before making a selection and attending more closely to issues of external validity.
Simulation-Based Abdominal Ultrasound Training - A Systematic Review.
Østergaard, M L; Ewertsen, C; Konge, L; Albrecht-Beste, E; Bachmann Nielsen, M
2016-06-01
The aim is to provide a complete overview of the different simulation-based training options for abdominal ultrasound and to explore the evidence of their effect. This systematic review was performed according to the PRISMA guidelines and Medline, Embase, Web of Science, and the Cochrane Library was searched. Articles were divided into three categories based on study design (randomized controlled trials, before-and-after studies and descriptive studies) and assessed for level of evidence using the Oxford Centre for Evidence Based Medicine (OCEBM) system and for bias using the Cochrane Collaboration risk of bias assessment tool. Seventeen studies were included in the analysis: four randomized controlled trials, eight before-and-after studies with pre- and post-test evaluations, and five descriptive studies. No studies scored the highest level of evidence, and 14 had the lowest level. Bias was high for 11 studies, low for four, and unclear for two. No studies used a test with established evidence of validity or examined the correlation between obtained skills on the simulators and real-life clinical skills. Only one study used blinded assessors. The included studies were heterogeneous in the choice of simulator, study design, participants, and outcome measures, and the level of evidence for effect was inadequate. In all studies simulation training was equally or more beneficial than other instructions or no instructions. Study designs had significant built-in bias and confounding issues; therefore, further research should be based on randomized controlled trials using tests with validity evidence and blinded assessors. © Georg Thieme Verlag KG Stuttgart · New York.
Pflueger, Marlon O; Stieglitz, Rolf-Dieter; Lemoine, Patrick; Leyhe, Thomas
2018-06-07
Since the advent of imaging techniques, the role of the neuropsychological assessment has changed. Questions concerning everyday functionality became primarily important and, thus, ecologically valid neuropsychological assessments are mandatory. Virtual reality (VR) environments might provide a way of implementing immersive cognitive assessments with a higher degree of everyday-life-related cognitive demands. We report on a VR-based episodic memory examination in N = 30 young and N = 18 healthy older adults (HOA) using a kitchen scene. The test procedure was designed to be structurally comparable to clinically used California Verbal Learning Test (CVLT) in terms of repeated learning trials as well as short and long delayed recall measures. The results showed that age-related learning and performance decrements were mainly evident in the CVLT but not in the VR-memory examination. The ecologically valid VR-memory examination might provide a more accurate "age-fair" estimation of everyday-life-related memory demands in HOA than the frequently and clinically used CVLT. We concluded this from our finding of context-related automatic and effortless activations of deeply experience based encoding and retrieval strategies with regard to everyday-life-related objects in the HOA, which might not be paralleled by learning arbitrary word associations. (PsycINFO Database Record (c) 2018 APA, all rights reserved).
2010-01-01
Background Patients with signs of nerve root irritation represent a sub-group of those with low back pain who are at increased risk of persistent symptoms and progression to costly and invasive management strategies including surgery. A period of non-surgical management is recommended for most patients, but there is little evidence to guide non-surgical decision-making. We conducted a preliminary study examining the effectiveness of a treatment protocol of mechanical traction with extension-oriented activities for patients with low back pain and signs of nerve root irritation. The results suggested this approach may be effective, particularly in a more specific sub-group of patients. The aim of this study will be to examine the effectiveness of treatment that includes traction for patients with low back pain and signs of nerve root irritation, and within the pre-defined sub-group. Methods/Design The study will recruit 120 patients with low back pain and signs of nerve root irritation. Patients will be randomized to receive an extension-oriented treatment approach, with or without the addition of mechanical traction. Randomization will be stratified based on the presence of the pre-defined sub-grouping criteria. All patients will receive 12 physical therapy treatment sessions over 6 weeks. Follow-up assessments will occur after 6 weeks, 6 months, and 1 year. The primary outcome will be disability measured with a modified Oswestry questionnaire. Secondary outcomes will include self-reports of low back and leg pain intensity, quality of life, global rating of improvement, additional healthcare utilization, and work absence. Statistical analysis will be based on intention to treat principles and will use linear mixed model analysis to compare treatment groups, and examine the interaction between treatment and sub-grouping status. Discussion This trial will provide a methodologically rigorous evaluation of the effectiveness of using traction for patients with low back pain and signs of nerve root irritation, and will examine the validity of a pre-defined sub-grouping hypothesis. The results will provide evidence to inform non-surgical decision-making for these patients. Trial Registration This trial has been registered with http://ClinicalTrials.gov: NCT00942227 PMID:20433733
Young, Sarah; Touyz, Stephen; Meyer, Caroline; Arcelus, Jon; Rhodes, Paul; Madden, Sloane; Pike, Kathleen; Attia, Evelyn; Crosby, Ross D; Wales, Jackie; Hay, Phillipa
2017-05-01
Compulsive exercise is a prominent feature for the majority of patients with Anorexia Nervosa (AN), but there is a dearth of research evaluating assessment instruments. This study assessed the concurrent validity of the exercise items of the Eating Disorder Examination (EDE) and Eating Disorder Examination-Questionnaire (EDE-Q), with the Compulsive Exercise Test (CET) and other self-report exercise measures in patients with AN. We also aimed to perform validation of the CET in an adult clinical sample. The sample consisted of 78 adults with AN, recruited for the randomized controlled trial "Taking a LEAP forward in the treatment of anorexia nervosa." At baseline, participants completed the EDE, EDE-Q, CET, Reasons for Exercise Inventory (REI), Commitment to Exercise Scale (CES) and Exercise Beliefs Questionnaire (EBQ). Correlational and regression analyses were performed. EDE exercise days and exercise time per day were positively correlated with each other and with all CET subscales (except Lack of exercise enjoyment), CES mean, EBQ total and REI total. Exercise time per day was associated with a higher EDE global score. The CET demonstrated good concurrent validity with the CES, the REI and the EBQ. Of the self-reports, the CET explained the greatest variance in eating disorder psychopathology and demonstrated good to excellent reliability in this sample. The EDE and EDE-Q demonstrated good concurrent validity with the CET. Further research is required to evaluate the CET's factor structure in a large clinical sample. However, the CET has demonstrated strong clinical utility in adult patients with AN. © 2016 Wiley Periodicals, Inc.(Int J Eat Disord 2017; 50:533-541). © 2016 Wiley Periodicals, Inc.
Nicholas, Angela; Bailey, Julia V; Stevenson, Fiona; Murray, Elizabeth
2013-12-12
Incidence of sexually transmitted infections (STIs) among young people in the United Kingdom is increasing. The Internet can be a suitable medium for delivery of sexual health information and sexual health promotion, given its high usage among young people, its potential for creating a sense of anonymity, and ease of access. Online randomized controlled trials (RCTs) are increasingly being used to evaluate online interventions, but while there are many advantages to online methodologies, they can be associated with a number of problems, including poor engagement with online interventions, poor trial retention, and concerns about the validity of data collected through self-report online. We conducted an online feasibility trial that tested the effects of the Sexunzipped website for sexual health compared to an information-only website. This study reports on a qualitative evaluation of the trial procedures, describing participants' experiences and views of the Sexunzipped online trial including methods of recruitment, incentives, methods of contact, and sexual health outcome measurement. Our goal was to determine participants' views of the acceptability and validity of the online trial methodology used in the pilot RCT of the Sexunzipped intervention. We used three qualitative data sources to assess the acceptability and validity of the online pilot RCT methodology: (1) individual interviews with 22 participants from the pilot RCT, (2) 133 emails received by the trial coordinator from trial participants, and (3) 217 free-text comments from the baseline and follow-up questionnaires. Interviews were audio-recorded and transcribed verbatim. An iterative, thematic analysis of all three data sources was conducted to identify common themes related to the acceptability and feasibility of the online trial methodology. Interview participants found the trial design, including online recruitment via Facebook, online registration, email communication with the researchers, and online completion of sexual health questionnaires to be highly acceptable and preferable to traditional methods. Incentives might assist in recruiting those who would not otherwise participate. Participants generally enjoyed taking part in sexual health research online and found the questionnaire itself thought-provoking. Completing the sexual health questionnaires online encouraged honesty in responding that might not be achieved with other methods. The majority of interview participants also thought that receiving and returning a urine sample for chlamydia testing via post was acceptable. These findings provide strong support for the use of online research methods for sexual health research, emphasizing the importance of careful planning and execution of all trial procedures including recruitment, respondent validation, trial related communication, and methods to maximize follow-up. Our findings suggest that sexual health outcome measurement might encourage reflection on current behavior, sometimes leading to behavior change. International Standard Randomized Controlled Trial Number (ISRCTN): 55651027; http://www.controlled-trials.com/isrctn/pf/55651027 (Archived by WebCite at http://www.webcitation.org/6LbkxdPKf).
Effectiveness of the Comalli Stroop Test as a measure of negative response bias.
Arentsen, Timothy J; Boone, Kyle Brauer; Lo, Tracy T Y; Goldberg, Hope E; Cottingham, Maria E; Victor, Tara L; Ziegler, Elizabeth; Zeller, Michelle A
2013-01-01
Practice guidelines recommend the use of multiple performance validity tests (PVTs) to detect noncredible performance during neuropsychological evaluations, and PVTs embedded in standard cognitive tests achieve this goal most efficiently. The present study examined the utility of the Comalli version of the Stroop Test as a measure of response bias in a large sample of "real world" noncredible patients (n = 129) as compared with credible neuropsychology clinic patients (n=233). The credible group performed significantly better than the noncredible group on all trials, but particularly on word-reading (Stroop A) and color-naming (Stroop B); cut-scores for Stroop A and Stroop B trials were associated with moderate sensitivity (49-53%) as compared to the low sensitivity found for the color interference trial (29%). Some types of diagnoses (including learning disability, severe traumatic brain injury, psychosis, and depression), very advanced age (⩾80), and lowered IQ were associated with increased rates of false positive identifications, suggesting the need for some adjustments to cut-offs in these subgroups. Despite some previous reports of an inverted Stroop effect (i.e., color-naming worse than color interference) in noncredible subjects, individual Stroop word reading and color naming trials were much more effective in identifying response bias.
Surrogate endpoints for overall survival in lung cancer trials: a review.
Fiteni, Frédéric; Westeel, Virginie; Bonnetain, Franck
2017-05-01
Intermediate endpoints are often used as primary endpoints instead of overall survival (OS) in lung cancer trials but they are not systematically validated as surrogate endpoints for OS. Areas covered: The aim of the study was to review the studies which assessed potential surrogate endpoints for OS in lung cancer trials. Expert commentary: Twenty studies were identified. In operable non-small cell lung cancer (NSCLC) (adjuvant trials) and locally advanced NSCLC (radiotherapy trials), one individual-patient data meta-analysis found a high correlation of disease-free survival (DFS) and progression-free survival (PFS) with OS at patient and trial level. In trials of adjuvant chemotherapy, correlation between disease-free survival DFS and OS were 0.83 at the individual level (95% CI 0.83-0.83) and 0.92 at trial level (95% CI 0.88-0.95). In locally advanced disease, correlation between PFS and OS was 0.77 to 0.85 at the individual level, and 0.89 to 0.97 at trial level. This study provides a 'proof' of the surrogacy of PFS and DFS on OS according to the IQWiG framework and the surrogacy of PFS and DFS on OS was classified level 2 according to Fleming hierarchy. In all the other setting, no endpoint was judged to be valid surrogate for OS.
Levy, Boaz
2006-10-01
Empirical studies have questioned the validity of the Faces subtest from the WMS-III for detecting impairment in visual memory, particularly among the elderly. A recent examination of the test norms revealed a significant age related floor effect already emerging on Faces I (immediate recall), implying excessive difficulty in the acquisition phase among unimpaired older adults. The current study compared the concurrent validity of the Faces subtest with an alternative measure between 16 Alzheimer's patients and 16 controls. The alternative measure was designed to facilitate acquisition by reducing the sequence of item presentation. Other changes aimed at increasing the retrieval challenge, decreasing error due to guessing and standardizing the administration. Analyses converged to indicate that the alternative measure provided a considerably greater differentiation than the Faces subtest between Alzheimer's patients and controls. Steps for revising the Faces subtest are discussed.
Herniman, Sarah E; Allott, Kelly A; Killackey, Eóin; Hester, Robert; Cotton, Sue M
2017-06-01
Depressive pathology is common in first-episode schizophrenia spectrum disorders (FES), and is frequently assessed using the Center for Epidemiological Studies - Depression Scale (CES-D), an instrument designed for use in community samples. Despite its widespread use, no prior study has examined the psychometric validity of the CES-D in assessing depressive pathology in FES. The aim of this study was to examine the psychometric validity of the CES-D in FES. This study involved secondary analysis of baseline data from a single blind, randomized controlled trial of vocational intervention for individuals with FES (N=91; age range: 15-25 years). Measures used were: CES-D, Brief Psychiatric Rating Scale (BPRS), Scale for the Assessment of Negative Symptoms (SANS), and Structured Clinical Interview for DSM-IV-TR (SCID-I/P). The CES-D strongly correlated with the depression subscale of the BPRS, and with the presence of full-threshold depressive disorder on the SCID-I/P. There was minimal overlap between the CES-D and SANS, with weak correlations emerging for avolition and anhedonia, and not for affective flattening, alogia, and attention. The CES-D cut-off of ≥23 produced high sensitivity and specificity values for determining full-threshold comorbid depressive disorder. Such findings indicate that the CES-D is effective for assessing and measuring depressive pathology in FES. Copyright © 2017 Elsevier Ireland Ltd. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lee, C.
2015-06-15
The US National Lung Screening Trial (NLST) was a multi-center randomized, controlled trial comparing a low-dose CT (LDCT) to posterior-anterior (PA) chest x-ray (CXR) in screening older, current and former heavy smokers for early detection of lung cancer. Recruitment was launched in September 2002 and ended in April 2004 when 53,454 participants had been randomized at 33 screening sites in equal proportions. Funded by the National Cancer Institute this trial demonstrated that LDCT screening reduced lung cancer mortality. The US Preventive Services Task Force (USPSTF) cited NLST findings and conclusions in its deliberations and analysis of lung cancer screening. Undermore » the 2010 Patient Protection and Affordable Care Act, the USPSTF favorable recommendation regarding lung cancer CT screening assisted in obtaining third-party payers coverage for screening. The objective of this session is to provide an introduction to the NLST and the trial findings, in addition to a comprehensive review of the dosimetry investigations and assessments completed using individual NLST participant CT and CXR examinations. Session presentations will review and discuss the findings of two independent assessments, a CXR assessment and the findings of a CT investigation calculating individual organ dosimetry values. The CXR assessment reviewed a total of 73,733 chest x-ray exams that were performed on 92 chest imaging systems of which 66,157 participant examinations were used. The CT organ dosimetry investigation collected scan parameters from 23,773 CT examinations; a subset of the 75,133 CT examinations performed using 97 multi-detector CT scanners. Organ dose conversion coefficients were calculated using a Monte Carlo code. An experimentally-validated CT scanner simulation was coupled with 193 adult hybrid computational phantoms representing the height and weight of the current U.S. population. The dose to selected organs was calculated using the organ dose library and the abstracted scan parameters. This session will review the results and summarize the individualized doses to major organs and the mean effective dose and CTDIvol estimate for 66,157 PA chest and 23,773 CT examinations respectively, using size-dependent computational phantoms coupled with Monte Carlo calculations. Learning Objectives: Review and summarize relevant NLST findings and conclusions. Understand the scope and scale of the NLST specific to participant dosimetry. Provide a comprehensive review of NLST participant dosimetry assessments. Summarize the results of an investigation providing individualized organ dose estimates for NLST participant cohorts.« less
Kersten, Paula; White, Peter J; Tennant, Alan
2014-01-01
Pain visual analogue scales (VAS) are commonly used in clinical trials and are often treated as an interval level scale without evidence that this is appropriate. This paper examines the internal construct validity and responsiveness of the pain VAS using Rasch analysis. Patients (n = 221, mean age 67, 58% female) with chronic stable joint pain (hip 40% or knee 60%) of mechanical origin waiting for joint replacement were included. Pain was scored on seven daily VASs. Rasch analysis was used to examine fit to the Rasch model. Responsiveness (Standardized Response Means, SRM) was examined on the raw ordinal data and the interval data generated from the Rasch analysis. Baseline pain VAS scores fitted the Rasch model, although 15 aberrant cases impacted on unidimensionality. There was some local dependency between items but this did not significantly affect the person estimates of pain. Daily pain (item difficulty) was stable, suggesting that single measures can be used. Overall, the SRMs derived from ordinal data overestimated the true responsiveness by 59%. Changes over time at the lower and higher end of the scale were represented by large jumps in interval equivalent data points; in the middle of the scale the reverse was seen. The pain VAS is a valid tool for measuring pain at one point in time. However, the pain VAS does not behave linearly and SRMs vary along the trait of pain. Consequently, Minimum Clinically Important Differences using raw data, or change scores in general, are invalid as these will either under- or overestimate true change; raw pain VAS data should not be used as a primary outcome measure or to inform parametric-based Randomised Controlled Trial power calculations in research studies; and Rasch analysis should be used to convert ordinal data to interval data prior to data interpretation.
Chen, Yu-Pei; Lv, Jia-Wei; Liu, Xu; Zhang, Yuan; Guo, Ying; Lin, Ai-Hua; Sun, Ying; Mao, Yan-Ping; Ma, Jun
2017-01-01
In the war on cancer marked by personalized medicine, positron emission tomography (PET)-based theranostic strategy is playing an increasingly important role. Well-designed clinical trials are of great significance for validating the PET applications and ensuring evidence-based cancer care. This study aimed to provide a comprehensive landscape of the characteristics of PET clinical trials using the substantial resource of ClinicalTrials.gov database. We identified 25,599 oncology trials registered with ClinicalTrials.gov in the last ten-year period (October 2005-September 2015). They were systematically reviewed to validate classification into 519 PET trials and 25,080 other oncology trials used for comparison. We found that PET trials were predominantly phase 1-2 studies (86.2%) and were more likely to be single-arm (78.9% vs. 57.9%, P <0.001) using non-randomized assignment (90.1% vs. 66.7%, P <0.001) than other oncology trials. Furthermore, PET trials were small in scale, generally enrolling fewer than 100 participants (20.3% vs. 25.7% for other oncology trials, P = 0.014), which might be too small to detect a significant theranostic effect. The funding support from industry or National Institutes of Health shrunk over time (both decreased by about 5%), and PET trials were more likely to be conducted in only one region lacking international collaboration (97.0% vs. 89.3% for other oncology trials, P <0.001). These findings raise concerns that clinical trials evaluating PET imaging in oncology are not receiving the attention or efforts necessary to generate high-quality evidence. Advancing the clinical application of PET imaging will require a concerted effort to improve the quality of trials.
Chen, Yu-Pei; Lv, Jia-Wei; Liu, Xu; Zhang, Yuan; Guo, Ying; Lin, Ai-Hua; Sun, Ying; Mao, Yan-Ping; Ma, Jun
2017-01-01
In the war on cancer marked by personalized medicine, positron emission tomography (PET)-based theranostic strategy is playing an increasingly important role. Well-designed clinical trials are of great significance for validating the PET applications and ensuring evidence-based cancer care. This study aimed to provide a comprehensive landscape of the characteristics of PET clinical trials using the substantial resource of ClinicalTrials.gov database. We identified 25,599 oncology trials registered with ClinicalTrials.gov in the last ten-year period (October 2005-September 2015). They were systematically reviewed to validate classification into 519 PET trials and 25,080 other oncology trials used for comparison. We found that PET trials were predominantly phase 1-2 studies (86.2%) and were more likely to be single-arm (78.9% vs. 57.9%, P <0.001) using non-randomized assignment (90.1% vs. 66.7%, P <0.001) than other oncology trials. Furthermore, PET trials were small in scale, generally enrolling fewer than 100 participants (20.3% vs. 25.7% for other oncology trials, P = 0.014), which might be too small to detect a significant theranostic effect. The funding support from industry or National Institutes of Health shrunk over time (both decreased by about 5%), and PET trials were more likely to be conducted in only one region lacking international collaboration (97.0% vs. 89.3% for other oncology trials, P <0.001). These findings raise concerns that clinical trials evaluating PET imaging in oncology are not receiving the attention or efforts necessary to generate high-quality evidence. Advancing the clinical application of PET imaging will require a concerted effort to improve the quality of trials. PMID:28042342
Vaquera, Alejandro; Villa, Jose G; Morante, Juan C; Thomas, Gavin; Renfree, Andrew J; Peters, Derek M
2016-02-01
The aims of this study were to (a) determine the relationship between performance on the court-based TIVRE-Basket test and peak aerobic power determined from a criterion laboratory-based incremental treadmill test and (b) to examine the test-retest reliability of the TIVRE-Basket test in elite male basketball players. To address aim 1, 36 elite male basketball players (age: 25.2 ± 4.7 years, weight: 94.1 ± 11.4 kg, height: 195.83 ± 9.6 cm) completed a graded treadmill exercise test and the TIVRE-Basket within 72 hours. The mean distance recorded during the TIVRE-Basket test was 4001.8 ± 176.4 m, and mean VO2 peak was 54.7 ± 2.8 ml · kg(-1) · min(-1), and the correlation between the 2 parameters was r = 0.824 (p ≤ 0.001). Linear regression analysis identified TIVRE-Basket distance (in meters) as the only unique predictor of VO2 peak in a single variable plus constant model: VO2 peak = 2.595 + (0.13 × TIVRE-Basket distance [in meters]). Performance on the TIVRE-Basket test accounted for 67.8% of the variance in VO2 peak (t = 8.466, p ≤ 0.001, 95% confidence interval: 0.01-0.016, SEE: 1.61). To address aim 2, 20 male basketball players (age: 26.7 ± 4.2 years, height: 1.94 ± 0.92 cm, weight: 94.0 ± 9.1 kg) performed the TIVRE-Basket test on 2 occasions. There was no significant difference in total distance covered between trial 1 (4138.8 ± 677.3 m) and trial 2 (4188.0 ± 648.8 m; t = 0.5798, p = 0.5688). Mean difference between trials was 49.2 ± 399.5 m, with an intraclass correlation coefficient of 0.85 suggesting a moderate level of reliability. Standardized typical error of measurement was 0.88%, representing a moderate degree of trial-to-trial error, and the Coefficient of Variation (CV) was 6.3%. The TIVRE-Basket test therefore represents a valid and moderately reliable court-based sport-specific test of aerobic power for use with individuals and teams of elite-level male basketball players. Future research is required to ascertain its validity and reliability in other basketball populations, for example, across age groups, at different levels of competition, in females and in different forms of the game, for example, wheelchair basketball.
Till, Cathee; Goodman, Phyllis J.; Chen, Xiaohong; Leach, Robin J.; Johnson-Pais, Teresa L.; Hsing, Ann W.; Hoque, Ashraful; Tangen, Catherine M.; Chu, Lisa; Parnes, Howard L.; Schenk, Jeannette M.; Reichardt, Juergen K. V.; Thompson, Ian M.; Figg, William D.
2015-01-01
Objective In the Prostate Cancer Prevention Trial (PCPT), finasteride reduced the risk of prostate cancer by 25%, even though high-grade prostate cancer was more common in the finasteride group. However, it remains to be determined whether finasteride concentrations may affect prostate cancer risk. In this study, we examined the association between serum finasteride concentrations and the risk of prostate cancer in the treatment arm of the PCPT and determined factors involved in modifying drug concentrations. Methods Data for this nested case-control study are from the PCPT. Cases were drawn from men with biopsy-proven prostate cancer and matched controls. Finasteride concentrations were measured using a liquid chromatography-mass spectrometry validated assay. The association of serum finasteride concentrations with prostate cancer risk was determined by logistic regression. We also examine whether polymorphisms in the enzyme target and metabolism genes of finasteride are related to drug concentrations using linear regression. Results and Conclusions Among men with detectable finasteride concentrations, there was no association between finasteride concentrations and prostate cancer risk, low-grade or high-grade, when finasteride concentration was analyzed as a continuous variable or categorized by cutoff points. Since there was no concentration-dependent effect on prostate cancer, any exposure to finasteride intake may reduce prostate cancer risk. Of the twenty-seven SNPs assessed in the enzyme target and metabolism pathway, five SNPs in two genes, CYP3A4 (rs2242480; rs4646437; rs4986910), and CYP3A5 (rs15524; rs776746) were significantly associated with modifying finasteride concentrations. These results suggest that finasteride exposure may reduce prostate cancer risk and finasteride concentrations are affected by genetic variations in genes responsible for altering its metabolism pathway. Trial Registration ClinicalTrials.gov NCT00288106 PMID:25955319
Yamanaka, Katsuo; Kawano, Yoshiyuki; Noguchi, Dai; Nakaaki, Shutaro; Watanabe, Norio; Amano, Takashi; Spector, Aimee
2013-01-01
Objectives Cognitive stimulation therapy (CST) has shown to have significant benefits in improving the cognitive function and quality of life (QOL) in people with mild-to-moderate dementia in a UK randomized controlled trial (RCT). We developed and examined the Japanese version of group CST (CST-J) in a single-blind, controlled clinical trial. Method CST-J consisting of 14 sessions was administered to a treatment group (n = 26) twice a week for 7 weeks. The treatment group was compared with a control group (n = 30). Based on single-blindness, cognition was evaluated by a researcher, and QOL and mood were rated by the participants themselves. Additionally, QOL and mood of participants were rated by care workers who were not blind but who observed them most directly in their daily life (important for social validity). Results A linear mixed model was used for analyses of cognition and QOL. There were significant improvements in cognition [COGNISTAT (Neurobehavioral Cognitive Status Examination) and MMSE (Mini-Mental State Examination)] for the treatment group compared with the control group (p < 0.01). Regarding QOL, the EQ-5D was significant (p = 0.019) and the QoL-AD (Quality of Life – Alzheimer's Disease) showed a positive trend (p = 0.06) when rated by care workers, although not when rated by the participants themselves. Using a nonparametrical analysis, there were significant improvements in the face scale for mood when rated by both the participants (p < 0.01) and the care workers (p = 0.017). Conclusion The CST-J shows promising improvements in cognition, mood, and aspects of QOL for people with dementia in Japanese care settings. A large RCT is now needed. PMID:23550665
Validating Trial-Based Functional Analyses in Mainstream Primary School Classrooms
ERIC Educational Resources Information Center
Austin, Jennifer L.; Groves, Emily A.; Reynish, Lisa C.; Francis, Laura L.
2015-01-01
There is growing evidence to support the use of trial-based functional analyses, particularly in classroom settings. However, there currently are no evaluations of this procedure with typically developing children. Furthermore, it is possible that refinements may be needed to adapt trial-based analyses to mainstream classrooms. This study was…
2015-03-20
performed a series of bid trials where they reported their willingness-to-pay for each of 30 snack food items; potential trial realization was...willingness-to-pay for each of 30 snack food items; potential trial realization was conducted at the end of the experiment via an auction procedure
Nicholson, Amanda; Mahon, James; Boland, Angela; Beale, Sophie; Dwan, Kerry; Fleeman, Nigel; Hockenhull, Juliet; Dundar, Yenal
2015-10-01
There is no single definitive test to identify prostate cancer in men. Biopsies are commonly used to obtain samples of prostate tissue for histopathological examination. However, this approach frequently misses cases of cancer, meaning that repeat biopsies may be necessary to obtain a diagnosis. The PROGENSA(®) prostate cancer antigen 3 (PCA3) assay (Hologic Gen-Probe, Marlborough, MA, USA) and the Prostate Health Index (phi; Beckman Coulter Inc., Brea, CA, USA) are two new tests (a urine test and a blood test, respectively) that are designed to be used to help clinicians decide whether or not to recommend a repeat biopsy. To evaluate the clinical effectiveness and cost-effectiveness of the PCA3 assay and the phi in the diagnosis of prostate cancer. Multiple publication databases and trial registers were searched in May 2014 (from 2000 to May 2014), including MEDLINE, EMBASE, The Cochrane Library, ISI Web of Science, Medion, Aggressive Research Intelligence Facility database, ClinicalTrials.gov, International Standard Randomised Controlled Trial Number Register and World Health Organization International Clinical Trials Registry Platform. The assessment of clinical effectiveness involved three separate systematic reviews, namely reviews of the analytical validity, the clinical validity of these tests and the clinical utility of these tests. The assessment of cost-effectiveness comprised a systematic review of full economic evaluations and the development of a de novo economic model. The perspective of the evaluation was the NHS in England and Wales. Men suspected of having prostate cancer for whom the results of an initial prostate biopsy were negative or equivocal. The use of the PCA3 score or phi in combination with existing tests (including histopathology results, prostate-specific antigen level and digital rectal examination), multiparametric magnetic resonance imaging and clinical judgement. In addition to documents published by the manufacturers, six studies were identified for inclusion in the analytical validity review. The review identified issues concerning the precision of the PCA3 assay measurements. It also highlighted issues relating to the storage requirements and stability of samples intended for analysis using the phi assay. Fifteen studies met the inclusion criteria for the clinical validity review. These studies reported results for 10 different clinical comparisons. There was insufficient evidence to enable the identification of appropriate test threshold values for use in a clinical setting. In addition, the implications of adding either the PCA3 assay or the phi to clinical assessment were not clear. Furthermore, the addition of the PCA3 assay or the phi to clinical assessment plus magnetic resonance imaging was not found to improve discrimination. No published papers met the inclusion criteria for either the clinical utility review or the cost-effectiveness review. The results from the cost-effectiveness analyses indicated that using either the PCA3 assay or the phi in the NHS was not cost-effective. The main limitations of the systematic review of clinical validity are that the review conclusions are over-reliant on findings from one study, the descriptions of clinical assessment vary widely within reviewed studies and many of the reported results for the clinical validity outcomes do not include either standard errors or confidence intervals. The clinical benefit of using the PCA3 assay or the phi in combination with existing tests, scans and clinical judgement has not yet been confirmed. The results from the cost-effectiveness analyses indicate that the use of these tests in the NHS would not be cost-effective. This study is registered as PROSPERO CRD42014009595. The National Institute for Health Research Health Technology Assessment programme.
Sakaguchi, Hitoshi; Ryan, Cindy; Ovigne, Jean-Marc; Schroeder, Klaus R; Ashikaga, Takao
2010-09-01
Regulatory policies in Europe prohibited the testing of cosmetic ingredients in animals for a number of toxicological endpoints. Currently no validated non-animal test methods exist for skin sensitization. Evaluation of changes in cell surface marker expression in dendritic cell (DC)-surrogate cell lines represents one non-animal approach. The human Cell Line Activation Test (h-CLAT) examines the level of CD86 and CD54 expression on the surface of THP-1 cells, a human monocytic leukemia cell line, following 24h of chemical exposure. To examine protocol transferability, between-lab reproducibility, and predictive capacity, the h-CLAT has been evaluated by five independent laboratories in several ring trials (RTs) coordinated by the European Cosmetics Association (COLIPA). The results of the first and second RTs demonstrated that the protocol was transferable and basically had good between-lab reproducibility and predictivity, but there were some false negative data. To improve performance, protocol and prediction model were modified. Using the modified prediction model in the first and second RT, accuracy was improved. However, about 15% of the outcomes were not correctly identified, which exposes some of the limitations of the assay. For the chemicals evaluated, the limitation may due to chemical being a weak allergen or having low solubility (ex. alpha-hexylcinnamaldehyde). The third RT evaluated the modified prediction model and satisfactory results were obtained. From the RT data, the feasibility of utilizing cell lines as surrogate DC in development of in vitro skin sensitization methods shows promise. The data also support initiating formal pre-validation of the h-CLAT in order to fully understand the capabilities and limitations of the assay. Copyright 2010 Elsevier Ltd. All rights reserved.
Metz, Torri D; Stoddard, Gregory J; Henry, Erick; Jackson, Marc; Holmgren, Calla; Esplin, Sean
2013-09-01
To create a simple tool for predicting the likelihood of successful trial of labor after cesarean delivery (TOLAC) during the pregnancy after a primary cesarean delivery using variables available at the time of admission. Data for all deliveries at 14 regional hospitals over an 8-year period were reviewed. Women with one cesarean delivery and one subsequent delivery were included. Variables associated with successful VBAC were identified using multivariable logistic regression. Points were assigned to these characteristics, with weighting based on the coefficients in the regression model to calculate an integer VBAC score. The VBAC score was correlated with TOLAC success rate and was externally validated in an independent cohort using a logistic regression model. A total of 5,445 women met inclusion criteria. Of those women, 1,170 (21.5%) underwent TOLAC. Of the women who underwent trial of labor, 938 (80%) had a successful VBAC. A VBAC score was generated based on the Bishop score (cervical examination) at the time of admission, with points added for history of vaginal birth, age younger than 35 years, absence of recurrent indication, and body mass index less than 30. Women with a VBAC score less than 10 had a likelihood of TOLAC success less than 50%. Women with a VBAC score more than 16 had a TOLAC success rate more than 85%. The model performed well in an independent cohort with an area under the curve of 0.80 (95% confidence interval 0.76-0.84). Prediction of TOLAC success at the time of admission is highly dependent on the initial cervical examination. This simple VBAC score can be utilized when counseling women considering TOLAC. II.
Topp, Libby; Breen, Courtney; Kaye, Sharlene; Darke, Shane
2004-02-07
Since 1996, the Illicit Drug Reporting System (IDRS), Australia's strategic early warning system for illicit drug trends, has monitored annual trends in the markets for the four main illicit drug classes, cannabis, methamphetamine, cocaine and heroin. In 2000, a 2-year trial was implemented to examine the feasibility of using similar methodology to monitor trends in the markets for 'party drugs'. A triangulation of three data sources was sought: (1) quantitative interviews with a 'sentinel' population of drug users; (2) qualitative interviews with key informants (KIs), or those who have contact with drug users through their work; (3) extant indicator data sources such as the purity of illicit drugs seized by law enforcement agencies. The results suggested that the feasibility of collecting detailed, reliable and valid data about party drug markets is a direct function of the size of those markets. The trial demonstrated that the system would allow the successful monitoring of markets for party drugs that are relatively widely used, such as ecstasy, but would be less sensitive in monitoring markets for party drugs that are used by small proportions of the total population, such as gamma-hydroxy-butyrate (GHB) and ketamine. Methodological issues encountered during the conduct of this trial are discussed, including defining the appropriate sentinel population of drug users, identifying relevant key informants, and the relative absence of extant indicator data sources that could inform our understanding of party drug markets.
Giaddui, Tawfik; Yu, Jialu; Manfredi, Denise; Linnemann, Nancy; Hunter, Joanne; O’Meara, Elizabeth; Galvin, James; Bialecki, Brian; Xiao, Ying
2016-01-01
Transmission of Imaging and Data (TRIAD) is a standard-based system built by the American College of Radiology (ACR) to provide seamless exchange of images and data for accreditation of clinical trials and registries. Scripts of structures’ names validation profiles created in TRIAD are used in the automated submission process. It is essential for users to understand the logistics of these scripts for successful submission of radiotherapy cases with less iteration. PMID:27053498
Fiechtner, Lauren; Puente, Gabriella C; Sharifi, Mona; Block, Jason P; Price, Sarah; Marshall, Richard; Blossom, Jeff; Gerber, Monica W; Taveras, Elsie M
2017-07-06
Novel approaches to health care delivery that leverage community resources could improve outcomes for children at high risk for obesity. We describe the process by which we created an online interactive community resources map for use in the Connect for Health randomized controlled trial. The trial was conducted in the 6 pediatric practices that cared for the highest percentage of children with overweight or obesity within a large multi-specialty group practice in eastern Massachusetts. By using semistructured interviews with parents and community partners and geographic information systems (GIS), we created and validated a community resource map for use in a randomized controlled trial for childhood obesity. We conducted semistructured interviews with 11 parents and received stakeholder feedback from 5 community partners, 2 pediatricians, and 3 obesity-built environment experts to identify community resources that could support behavior change. We used GIS databases to identify the location of resources. After the resources were validated, we created an online, interactive searchable map. We evaluated parent resource empowerment at baseline and follow-up, examined if the participant families went to new locations for physical activity and food shopping, and evaluated how satisfied the families were with the information they received. Parents, community partners, and experts identified several resources to be included in the map, including farmers markets, supermarkets, parks, and fitness centers. Parents expressed the need for affordable activities. Parent resource empowerment increased by 0.25 units (95% confidence interval, 0.21-0.30) over the 1-year intervention period; 76.2% of participants were physically active at new places, 57.1% of participant families shopped at new locations; and 71.8% reported they were very satisfied with the information they received. Parents and community partners identified several community resources that could help support behavior change. Parent resource empowerment and use of community resources increased over the intervention period, suggesting that community resource mapping should inform future interventions.
Peacock, Rachel E; Hosgood, Giselle; Swindells, Katrin L; Smart, Lisa
2015-01-01
To assess for any clinical benefit of intravenous lipid emulsion (ILE) for permethrin toxicosis in cats by comparing the progression of clinical signs of cats before and after treatment with ILE to cats treated with a saline control. To accomplish this objective, a clinical staging system for cats with permethrin toxicosis was developed and validated. Prospective, multicenter, randomized, controlled clinical trial. University veterinary teaching hospital and 12 private veterinary emergency hospitals. Thirty-four client-owned cats with permethrin toxicosis. A clinical staging system was designed based on abnormalities found on physical examination of cats with permethrin toxicosis. The clinical staging system had 6 stages, ranging from Stage A for cats with no abnormalities to Stage F for cats with grand mal seizures. The system was validated for intraviewer and interviewer variability. Cats in the clinical trial were randomized to receive 15 mL/kg of either intravenous 0.9% saline (control) or 20% ILE over 60 minutes. For each cat, a clinical stage was recorded at set time points before and after the randomized treatment was administered. The distribution of clinical stage stratified over time was compared across treatment groups. The clinical staging system showed excellent repeatability (P = 1.0) and reliability (P = 1.0). In the clinical trial, there was a significant difference in the distribution of clinical stages over time (P < 0.001) and from presentation stage to Stage B (P = 0.006), with ILE-treated cats (n = 20) having lower clinical stages earlier than control cats (n = 14). There was no significant difference in signalment, body weight, or supportive treatment between the groups. The clinical staging system was repeatable and reliable. Clinical stages of permethrin toxicosis in ILE-treated cats improved earlier compared to control cats, suggesting ILE may be a useful adjunctive therapy in the treatment of permethrin toxicosis in cats. © Veterinary Emergency and Critical Care Society 2015.
Review of the randomized clinical stroke rehabilitation trials in 2009
Rabadi, Meheroz H.
2011-01-01
Summary Background Recent review of the available evidence on interventions for motor recovery after stroke, showed that improvements in recovery of arm function were seen for constraint-induced movement therapy, electromyographic biofeedback, mental practice with motor imagery, and robotics. Similar improvement in transfer ability or balance were seen with repetitive task training, biofeedback, and training with a moving platform. Walking speed was improved by physical fitness training, high-intensity physiotherapy and repetitive task training. However, most of these trials were small and had design limitations. Material/Methods In this article, randomized control trials (RCT’s) published in 2009 of rehabilitation therapies for acute (≤2 weeks), sub-acute (2 to 12 weeks) and chronic (≥12 weeks) stroke was reviewed. A Medline search was performed to identify all RCT’s in stroke rehabilitation in the year 2009. The search strategy that was used for PubMed is presented in the Appendix 1. The objective was to examine the effectiveness of these treatment modalities in stroke rehabilitation. Results This generated 35 RCT’s under 5 categories which were found and analyzed. The methodological quality was assessed by using the PEDro scale for external and internal validity. Conclusions These trials were primarily efficacy studies. Most of these studies enrolled small numbers of patient which precluded their clinical applicability (limited external validity). However, the constraint induced movement therapy (CIT), regularly used in chronic stroke patients did not improve affected arm-hand function when used in acute stroke patients at ≤4 weeks. Intensive CIT did not lead to motor improvement in arm-hand function. Robotic arm treatment helped decrease motor impairment and improved function in chronic stroke patients only. Therapist provided exercise programs (when self-administered by patients during their off-therapy time in a rehabilitation setting) did improve arm-hand function. Tai Chi exercises helped improve balance and weight bearing. Exercise programs for community dwelling stroke patient helped maintain and even improve their functional state. PMID:21278702
Review of the randomized clinical stroke rehabilitation trials in 2009.
Rabadi, Meheroz H
2011-02-01
Recent review of the available evidence on interventions for motor recovery after stroke, showed that improvements in recovery of arm function were seen for constraint-induced movement therapy, electromyographic biofeedback, mental practice with motor imagery, and robotics. Similar improvement in transfer ability or balance were seen with repetitive task training, biofeedback, and training with a moving platform. Walking speed was improved by physical fitness training, high-intensity physiotherapy and repetitive task training. However, most of these trials were small and had design limitations. In this article, randomized control trials (RCT's) published in 2009 of rehabilitation therapies for acute (≤ 2 weeks), sub-acute (2 to 12 weeks) and chronic (≥ 12 weeks) stroke was reviewed. A Medline search was performed to identify all RCT's in stroke rehabilitation in the year 2009. The search strategy that was used for PubMed is presented in the Appendix 1. The objective was to examine the effectiveness of these treatment modalities in stroke rehabilitation. This generated 35 RCT's under 5 categories which were found and analyzed. The methodological quality was assessed by using the PEDro scale for external and internal validity. These trials were primarily efficacy studies. Most of these studies enrolled small numbers of patient which precluded their clinical applicability (limited external validity). However, the constraint induced movement therapy (CIT), regularly used in chronic stroke patients did not improve affected arm-hand function when used in acute stroke patients at ≤ 4 weeks. Intensive CIT did not lead to motor improvement in arm-hand function. Robotic arm treatment helped decrease motor impairment and improved function in chronic stroke patients only. Therapist provided exercise programs (when self-administered by patients during their off-therapy time in a rehabilitation setting) did improve arm-hand function. Tai Chi exercises helped improve balance and weight bearing. Exercise programs for community dwelling stroke patient helped maintain and even improve their functional state.
Validation of the PASSPORT V2 training environment for arthroscopic skills.
Stunt, J J; Kerkhoffs, G M M J; Horeman, T; van Dijk, C N; Tuijthof, G J M
2016-06-01
Virtual reality simulators used in the education of orthopaedic residents often lack realistic haptic feedback. To solve this, the (Practice Arthroscopic Surgical Skills for Perfect Operative Real-life Treatment) PASSPORT simulator was developed, which was subjected to fundamental changes: improved realism and user interface. The purpose was to demonstrate its face and construct validity. Thirty-one participants were divided into three groups having different levels of arthroscopic experience. Participants answered questions regarding general information and the outer appearance of the simulator for face validity. Construct validity was assessed with one standardized navigation task, which was timed. Face validity, educational value and user-friendliness were determined with two representative exercises and by asking participants to fill out the questionnaire. A value of 7 or greater was considered sufficient. Construct validity was demonstrated between experts and novices. Median task time for the fifth trial was 55 s (range 17-139 s) for the novices, 33 s (range 17-59 s) for the intermediates, and 26 s (range 14-52 s) for the experts. Median task times of three trials were not significantly different between the novices and intermediates, and none of the trials between intermediates and experts. Face validity, educational value and user-friendliness were perceived as sufficient (median >7). The presence of realistic tactile feedback was considered the biggest asset of the simulator. Proper preparation for arthroscopic operations will increase the quality of real-life surgery and patients' safety. The PASSPORT simulator can assist in achieving this, as it showed construct and face validity, and its physical nature offered adequate haptic feedback during training. This indicates that PASSPORT has potential to evolve as a valuable training modality.
Battisti, Nicolò Matteo Luca; Sehovic, Marina; Extermann, Martine
2017-09-01
Non-small-cell lung cancer (NSCLC) is a disease of the elderly, who are under-represented in clinical trials. This challenges the external validity of the evidence base for its management and of current guidelines, that we evaluated in a population of older patients. We retrieved randomized clinical trials (RCTs) supporting the guidelines and identified 18 relevant topics. We matched a cohort of NSCLC patients aged older than 80 years from the Moffitt Cancer Center database with the studies' eligibility criteria to check their qualification for at least 2 studies. Eligibility > 60% was rated full validity, 30% to 60% partial validity, and < 30% limited validity. We obtained data from 760 elderly patients in stage-adjusted groups and collected 244 RCTs from the National Comprehensive Cancer Network (NCCN) and 148 from the European Society for Medical Oncology (ESMO) guidelines. External validity was deemed insufficient for neoadjuvant chemotherapy in stage III disease (27.37% and 25.26% of patients eligible for NCCN and ESMO guidelines, respectively) and use of bevacizumab (13.86% and 16.27% of patients eligible). For ESMO guidelines, it was inadequate regarding double-agent chemotherapy (25.90% of patients eligible), its duration (24.10%) and therapy for Eastern Cooperative Oncology Group performance status 2 patients (17.74%). For NCCN guidelines external validity was lacking for neoadjuvant chemoradiotherapy in stage IIIA disease (25.86% of patients eligible). Our analysis highlighted the effect of RCT eligibility criteria on guidelines' external validity in elderly patients. Eligibility criteria should be carefully considered in trial design and more studies that do not exclude elderly patients should be included in guidelines. Copyright © 2017 Elsevier Inc. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kruger, R.
The US National Lung Screening Trial (NLST) was a multi-center randomized, controlled trial comparing a low-dose CT (LDCT) to posterior-anterior (PA) chest x-ray (CXR) in screening older, current and former heavy smokers for early detection of lung cancer. Recruitment was launched in September 2002 and ended in April 2004 when 53,454 participants had been randomized at 33 screening sites in equal proportions. Funded by the National Cancer Institute this trial demonstrated that LDCT screening reduced lung cancer mortality. The US Preventive Services Task Force (USPSTF) cited NLST findings and conclusions in its deliberations and analysis of lung cancer screening. Undermore » the 2010 Patient Protection and Affordable Care Act, the USPSTF favorable recommendation regarding lung cancer CT screening assisted in obtaining third-party payers coverage for screening. The objective of this session is to provide an introduction to the NLST and the trial findings, in addition to a comprehensive review of the dosimetry investigations and assessments completed using individual NLST participant CT and CXR examinations. Session presentations will review and discuss the findings of two independent assessments, a CXR assessment and the findings of a CT investigation calculating individual organ dosimetry values. The CXR assessment reviewed a total of 73,733 chest x-ray exams that were performed on 92 chest imaging systems of which 66,157 participant examinations were used. The CT organ dosimetry investigation collected scan parameters from 23,773 CT examinations; a subset of the 75,133 CT examinations performed using 97 multi-detector CT scanners. Organ dose conversion coefficients were calculated using a Monte Carlo code. An experimentally-validated CT scanner simulation was coupled with 193 adult hybrid computational phantoms representing the height and weight of the current U.S. population. The dose to selected organs was calculated using the organ dose library and the abstracted scan parameters. This session will review the results and summarize the individualized doses to major organs and the mean effective dose and CTDIvol estimate for 66,157 PA chest and 23,773 CT examinations respectively, using size-dependent computational phantoms coupled with Monte Carlo calculations. Learning Objectives: Review and summarize relevant NLST findings and conclusions. Understand the scope and scale of the NLST specific to participant dosimetry. Provide a comprehensive review of NLST participant dosimetry assessments. Summarize the results of an investigation providing individualized organ dose estimates for NLST participant cohorts.« less
Ensuring Quality in AFRINEST and SATT
2013-01-01
Background: Three randomized open-label clinical trials [Simplified Antibiotic Therapy Trial (SATT) Bangladesh, SATT Pakistan and African Neonatal Sepsis Trial (AFRINEST)] were developed to test the equivalence of simplified antibiotic regimens compared with the standard regimen of 7 days of parenteral antibiotics. These trials were originally conceived and designed separately; subsequently, significant efforts were made to develop and implement a common protocol and approach. Previous articles in this supplement briefly describe the specific quality control methods used in the individual trials; this article presents additional information about the systematic approaches used to minimize threats to validity and ensure quality across the trials. Methods: A critical component of quality control for AFRINEST and SATT was striving to eliminate variation in clinical assessments and decisions regarding eligibility, enrollment and treatment outcomes. Ensuring appropriate and consistent clinical judgment was accomplished through standardized approaches applied across the trials, including training, assessment of clinical skills and refresher training. Standardized monitoring procedures were also applied across the trials, including routine (day-to-day) internal monitoring of performance and adherence to protocols, systematic external monitoring by funding agencies and external monitoring by experienced, independent trial monitors. A group of independent experts (Technical Steering Committee/Technical Advisory Group) provided regular monitoring and technical oversight for the trials. Conclusions: Harmonization of AFRINEST and SATT have helped to ensure consistency and quality of implementation, both internally and across the trials as a whole, thereby minimizing potential threats to the validity of the trials’ results. PMID:23945575
Chevat, Catherine; Viala-Danten, Muriel; Dias-Barbosa, Carla; Nguyen, Van Hung
2009-01-01
Background Influenza is among the most common infectious diseases. The main protection against influenza is vaccination. A self-administered questionnaire was developed and validated for use in clinical trials to assess subjects' perception and acceptance of influenza vaccination and its subsequent injection site reactions (ISR). Methods The VAPI questionnaire was developed based on interviews with vaccinees. The initial version was administered to subjects in international clinical trials comparing intradermal with intramuscular influenza vaccination. Item reduction and scale construction were carried out using principal component and multitrait analyses (n = 549). Psychometric validation of the final version was conducted per country (n = 5,543) and included construct and clinical validity and internal consistency reliability. All subjects gave their written informed consent before being interviewed or included in the clinical studies. Results The final questionnaire comprised 4 dimensions ("bother from ISR"; "arm movement"; "sleep"; "acceptability") grouping 16 items, and 5 individual items (anxiety before vaccination; bother from pain during vaccination; satisfaction with injection system; willingness to be vaccinated next year; anxiety about vaccination next year). Construct validity was confirmed for all scales in most of the countries. Internal consistency reliability was good for all versions (Cronbach's alpha ranging from 0.68 to 0.94), as was clinical validity: scores were positively correlated with the severity of ISR and pain. Conclusion The VAPI questionnaire is a valid and reliable tool, assessing the acceptance of vaccine injection and reactions following vaccination. Trial registration NCT00258934, NCT00383526, NCT00383539. PMID:19261173
Initial Development and Validation of the Global Citizenship Scale
ERIC Educational Resources Information Center
Morais, Duarte B.; Ogden, Anthony C.
2011-01-01
The purpose of this article is to report on the initial development of a theoretically grounded and empirically validated scale to measure global citizenship. The methodology employed is multi-faceted, including two expert face validity trials, extensive exploratory and confirmatory factor analyses with multiple datasets, and a series of three…
When is informed consent required in cluster randomized trials in health research?
2011-01-01
This article is part of a series of papers examining ethical issues in cluster randomized trials (CRTs) in health research. In the introductory paper in this series, we set out six areas of inquiry that must be addressed if the cluster trial is to be set on a firm ethical foundation. This paper addresses the second of the questions posed, namely, from whom, when, and how must informed consent be obtained in CRTs in health research? The ethical principle of respect for persons implies that researchers are generally obligated to obtain the informed consent of research subjects. Aspects of CRT design, including cluster randomization, cluster level interventions, and cluster size, present challenges to obtaining informed consent. Here we address five questions related to consent and CRTs: How can a study proceed if informed consent is not possible? Is consent to randomization always required? What information must be disclosed to potential subjects if their cluster has already been randomized? Is passive consent a valid substitute for informed consent? Do health professionals have a moral obligation to participate as subjects in CRTs designed to improve professional practice? We set out a framework based on the moral foundations of informed consent and international regulatory provisions to address each of these questions. First, when informed consent is not possible, a study may proceed if a research ethics committee is satisfied that conditions for a waiver of consent are satisfied. Second, informed consent to randomization may not be required if it is not possible to approach subjects at the time of randomization. Third, when potential subjects are approached after cluster randomization, they must be provided with a detailed description of the interventions in the trial arm to which their cluster has been randomized; detailed information on interventions in other trial arms need not be provided. Fourth, while passive consent may serve a variety of practical ends, it is not a substitute for valid informed consent. Fifth, while health professionals may have a moral obligation to participate as subjects in research, this does not diminish the necessity of informed consent to study participation. PMID:21906277
Implementation and validation of a conceptual benchmarking framework for patient blood management.
Kastner, Peter; Breznik, Nada; Gombotz, Hans; Hofmann, Axel; Schreier, Günter
2015-01-01
Public health authorities and healthcare professionals are obliged to ensure high quality health service. Because of the high variability of the utilisation of blood and blood components, benchmarking is indicated in transfusion medicine. Implementation and validation of a benchmarking framework for Patient Blood Management (PBM) based on the report from the second Austrian Benchmark trial. Core modules for automatic report generation have been implemented with KNIME (Konstanz Information Miner) and validated by comparing the output with the results of the second Austrian benchmark trial. Delta analysis shows a deviation <0.1% for 95% (max. 1.4%). The framework provides a reliable tool for PBM benchmarking. The next step is technical integration with hospital information systems.
Pantalon, Michael V; Dziura, James; Li, Fang-Yong; Owens, Patricia H; O'Connor, Patrick G; D'Onofrio, Gail
2017-01-01
No psychometrically validated instrument for evaluating the extent to which interventionists correctly implement brief interventions designed to motivate treatment engagement for opioid use disorders has been reported in the literature. The objective of this study was to develop and examine the psychometric properties of the Brief Negotiation Interview (BNI) Adherence Scale for Opioid Use Disorders (BAS-O). In the context of a randomized controlled trial evaluating the efficacy of 3 models of emergency department care for opioid use disorders, the authors developed and subsequently examined the psychometric properties of the BAS-O, a 38-item scale that required raters to answer whether or not ("Yes" or "No") each of the critical actions of the BNI was correctly implemented by the research interventionist. BAS-O items pertained to the BNI's 4 steps: (1) Raise the Subject, (2) Provide Feedback, (3) Enhance Motivation, and (4) Negotiate and Advise. A total of 215 audio-recorded BNI and 88 control encounters were rated by 3 trained raters who were independent of the study team and blind to study hypotheses, treatment, and assignment. The results indicated the BAS-O has fair to excellent psychometric properties, in terms of good internal consistency, excellent interrater reliability, discriminant validity, and construct validity, and fair predictive validity. A 13-item, 2-factor solution accounted for nearly 80% of the variance, where factor 1 addressed "Autonomy and Planning" (7 items) and factor 2 addressed "Motivation and Problems" (6 items). However, predictive validity was found for only one of the BAS-O factor items (i.e., Telling patients that treatment will address a range of issues related to their opioid use disorder). This study suggests that the BAS-O is a psychometrically valid measure of adherence to the specialized BNI for motivating treatment engagement in patients with opioid use disorders, thus providing a brief (13-item), objective method of evaluating BNI skill performance.
Downer, Mary Kathryn; Gea, Alfredo; Stampfer, Meir; Sánchez-Tainta, Ana; Corella, Dolores; Salas-Salvadó, Jordi; Ros, Emilio; Estruch, Ramón; Fitó, Montserrat; Gómez-Gracia, Enrique; Arós, Fernando; Fiol, Miquel; De-la-Corte, Francisco Jose Garcia; Serra-Majem, Lluís; Pinto, Xavier; Basora, Josep; Sorlí, José V; Vinyoles, Ernest; Zazpe, Itziar; Martínez-González, Miguel-Ángel
2016-06-14
Dietary intervention success requires strong participant adherence, but very few studies have examined factors related to both short-term and long-term adherence. A better understanding of predictors of adherence is necessary to improve the design and execution of dietary intervention trials. This study was designed to identify participant characteristics at baseline and study features that predict short-term and long-term adherence with interventions promoting the Mediterranean-type diet (MedDiet) in the PREvención con DIeta MEDiterránea (PREDIMED) randomized trial. Analyses included men and women living in Spain aged 55-80 at high risk for cardiovascular disease. Participants were randomized to the MedDiet supplemented with either complementary extra-virgin olive oil (EVOO) or tree nuts. The control group and participants with insufficient information on adherence were excluded. PREDIMED began in 2003 and ended in 2010. Investigators assessed covariates at baseline and dietary information was updated yearly throughout follow-up. Adherence was measured with a validated 14-point Mediterranean-type diet adherence score. Logistic regression was used to examine associations between baseline characteristics and adherence at one and four years of follow-up. Participants were randomized to the MedDiet supplemented with EVOO (n = 2,543; 1,962 after exclusions) or tree nuts (n = 2,454; 2,236 after exclusions). A higher number of cardiovascular risk factors, larger waist circumference, lower physical activity levels, lower total energy intake, poorer baseline adherence to the 14-point adherence score, and allocation to MedDiet + EVOO each independently predicted poorer adherence. Participants from PREDIMED recruiting centers with a higher total workload (measured as total number of persons-years of follow-up) achieved better adherence. No adverse events or side effects were reported. To maximize dietary adherence in dietary interventions, additional efforts to promote adherence should be used for participants with lower baseline adherence to the intended diet and poorer health status. The design of multicenter nutrition trials should prioritize few large centers with more participants in each, rather than many small centers. This study was registered at controlled-trials.com (http://www.controlled-trials. com/ISRCTN35739639). International Standard Randomized Controlled Trial Number (ISRCTN): 35739639. Registration date: 5 October 2005. parallel randomized trial.
A systematic review of models to predict recruitment to multicentre clinical trials
2010-01-01
Background Less than one third of publicly funded trials managed to recruit according to their original plan often resulting in request for additional funding and/or time extensions. The aim was to identify models which might be useful to a major public funder of randomised controlled trials when estimating likely time requirements for recruiting trial participants. The requirements of a useful model were identified as usability, based on experience, able to reflect time trends, accounting for centre recruitment and contribution to a commissioning decision. Methods A systematic review of English language articles using MEDLINE and EMBASE. Search terms included: randomised controlled trial, patient, accrual, predict, enrol, models, statistical; Bayes Theorem; Decision Theory; Monte Carlo Method and Poisson. Only studies discussing prediction of recruitment to trials using a modelling approach were included. Information was extracted from articles by one author, and checked by a second, using a pre-defined form. Results Out of 326 identified abstracts, only 8 met all the inclusion criteria. Of these 8 studies examined, there are five major classes of model discussed: the unconditional model, the conditional model, the Poisson model, Bayesian models and Monte Carlo simulation of Markov models. None of these meet all the pre-identified needs of the funder. Conclusions To meet the needs of a number of research programmes, a new model is required as a matter of importance. Any model chosen should be validated against both retrospective and prospective data, to ensure the predictions it gives are superior to those currently used. PMID:20604946
MacPherson, Laura; Calvin, Nicholas T.; Richards, Jessica M.; Guller, Leila; Mayes, Linda C.; Crowley, Michael J.; Daughters, Stacey B.; Lejuez, C.W.
2011-01-01
Background A long line of theoretical and empirical evidence implicates negative reinforcement as a process underlying the etiology and maintenance of risky alcohol use behaviors from adolescence through emerging adulthood. However, the bulk of this literature has relied on self-report measures and there is a notable absence of behavioral modes of assessments of negative reinforcement-based alcohol-related risk-taking. To address this clear gap in the literature, the current study presents the first published data on the reliability and validity of the Maryland Resource for the Behavioral Utilization of the Reinforcement of Negative Stimuli (MRBURNS), which is a modified version of the positive reinforcement-based Balloon Analogue Risk Task (BART). Methods Participants included a convenience sample of 116 college freshmen ever regular drinkers (aged 18–19) who completed both behavioral tasks; self-report measures of negative reinforcement/avoidance constructs and of positive reinforcement/appetitive constructs to examine convergent validity and discriminant validity, respectively; and self-report measures of alcohol use, problems, and motives to examine criterion validity. Results The MRBURNS evidenced sound experimental properties and reliability across task trials. In support of convergent validity, risk taking on the MRBURNS correlated significantly with negative urgency, difficulties in emotion regulation and depressive and anxiety-related symptoms. In support of discriminant validity, performance on the MRBURNS was unrelated to risk taking on the BART, sensation seeking, and trait impulsivity. Finally, pertaining to criterion validity, risk taking on the MRBURNS was related to alcohol-related problems but not heavy episodic alcohol use. Notably, risk taking on the MRBURNS was associated with negative reinforcement-based but not with positive reinforcement-based drinking motives. Conclusions Data from this initial investigation suggest the utility of the MRBURNS as a behavioral measure of negative-reinforcement based risk-taking that can provide a useful compliment to existing self-report measures to improve our understanding of the relationship between avoidant reinforcement processes and risky alcohol use. PMID:22309846
2011-07-01
cancer clinical trials, attitudes and knowledge about such trials, and barriers to and facilitators of participation, b) Conduct a self-administered...facilitate or hinder participation in prostate cancer trials by examining patients’ attitudes , physicians’ perceived barriers, characteristics of...cancer trials by examining patients’ attitudes , physicians’ perceived barriers, characteristics of prostate trials and sites, and broader community
Tanaka, Shiro; Matsuyama, Yutaka; Ohashi, Yasuo
2017-08-30
Increasing attention has been focused on the use and validation of surrogate endpoints in cancer clinical trials. Previous literature on validation of surrogate endpoints are classified into four approaches: the proportion explained approach; the indirect effects approach; the meta-analytic approach; and the principal stratification approach. The mainstream in cancer research has seen the application of a meta-analytic approach. However, VanderWeele (2013) showed that all four of these approaches potentially suffer from the surrogate paradox. It was also shown that, if a principal surrogate satisfies additional criteria called one-sided average causal sufficiency, the surrogate cannot exhibit a surrogate paradox. Here, we propose a method for estimating principal effects under a monotonicity assumption. Specifically, we consider cancer clinical trials which compare a binary surrogate endpoint and a time-to-event clinical endpoint under two naturally ordered treatments (e.g. combined therapy vs. monotherapy). Estimation based on a mean score estimating equation will be implemented by the expectation-maximization algorithm. We will also apply the proposed method as well as other surrogacy criteria to evaluate the surrogacy of prostate-specific antigen using data from a phase III advanced prostate cancer trial, clarifying the complementary roles of both the principal stratification and meta-analytic approaches in the evaluation of surrogate endpoints in cancer. Copyright © 2017 John Wiley & Sons, Ltd. Copyright © 2017 John Wiley & Sons, Ltd.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Baumann, Brian C.; He, Jiwei; Hwang, Wei-Ting
Purpose: To inform prospective trials of adjuvant radiation therapy (adj-RT) for bladder cancer after radical cystectomy, a locoregional failure (LF) risk stratification was proposed. This stratification was developed and validated using surgical databases that may not reflect the outcomes expected in prospective trials. Our purpose was to assess sources of bias that may affect the stratification model's validity or alter the LF risk estimates for each subgroup: time bias due to evolving surgical techniques; trial accrual bias due to inclusion of patients who would be ineligible for adj-RT trials because of early disease progression, death, or loss to follow-up shortlymore » after cystectomy; bias due to different statistical methods to estimate LF; and subgrouping bias due to different definitions of the LF subgroups. Methods and Materials: The LF risk stratification was developed using a single-institution cohort (n=442, 1990-2008) and the multi-institutional SWOG 8710 cohort (n=264, 1987-1998) treated with radical cystectomy with or without chemotherapy. We evaluated the sensitivity of the stratification to sources of bias using Fine-Gray regression and Kaplan-Meier analyses. Results: Year of radical cystectomy was not associated with LF risk on univariate or multivariate analysis after controlling for risk group. By use of more stringent inclusion criteria, 26 SWOG patients (10%) and 60 patients from the single-institution cohort (14%) were excluded. Analysis of the remaining patients confirmed 3 subgroups with significantly different LF risks with 3-year rates of 7%, 17%, and 36%, respectively (P<.01), nearly identical to the rates without correcting for trial accrual bias. Kaplan-Meier techniques estimated higher subgroup LF rates than competing risk analysis. The subgroup definitions used in the NRG-GU001 adj-RT trial were validated. Conclusions: These sources of bias did not invalidate the LF risk stratification or substantially change the model's LF estimates.« less
Feedback-Driven Trial-by-Trial Learning in Autism Spectrum Disorders
Solomon, Marjorie; Frank, Michael J.; Ragland, J. Daniel; Smith, Anne C.; Niendam, Tara A.; Lesh, Tyler A.; Grayson, David S.; Beck, Jonathan S.; Matter, John C.; Carter, Cameron S.
2017-01-01
Objective Impairments in learning are central to autism spectrum disorders. The authors investigated the cognitive and neural basis of these deficits in young adults with autism spectrum disorders using a well-characterized probabilistic reinforcement learning paradigm. Method The probabilistic selection task was implemented among matched participants with autism spectrum disorders (N=22) and with typical development (N=25), aged 18–40 years, using rapid event-related functional MRI. Participants were trained to choose the correct stimulus in high-probability (AB), medium-probability (CD), and low-probability (EF) pairs, presented with valid feedback 80%, 70%, and 60% of the time, respectively. Whole-brain voxel-wise and parametric modulator analyses examined early and late learning during the stimulus and feedback epochs of the task. Results The groups exhibited comparable performance on medium- and low-probability pairs. Typically developing persons showed higher accuracy on the high-probability pair, better win-stay performance (selection of the previously rewarded stimulus on the next trial of that type), and more robust recruitment of the anterior and medial prefrontal cortex during the stimulus epoch, suggesting development of an intact reward-based working memory for recent stimulus values. Throughout the feedback epoch, individuals with autism spectrum disorders exhibited greater recruitment of the anterior cingulate and orbito-frontal cortices compared with individuals with typical development, indicating continuing trial-by-trial activity related to feedback processing. Conclusions Individuals with autism spectrum disorders exhibit learning deficits reflecting impaired ability to develop an effective reward-based working memory to guide stimulus selection. Instead, they continue to rely on trial-by-trial feedback processing to support learning dependent upon engagement of the anterior cingulate and orbito-frontal cortices. PMID:25158242
BRISCO, MEREDITH A.; ZILE, MICHAEL R.; HANBERG, JENNIFER S.; WILSON, F. PERRY; PARIKH, CHIRAG R.; COCA, STEVEN G.; TANG, W.H. WILSON; TESTANI, JEFFREY M.
2017-01-01
Background Worsening renal function (WRF) is a common endpoint in decompensated heart failure clinical trials because of associations between WRF and adverse outcomes. However, WRF has not universally been identified as a poor prognostic sign, challenging the validity of WRF as a surrogate endpoint. Our aim was to describe the associations between changes in creatinine and adverse outcomes in a clinical trial of decongestive therapies. Methods and Results We investigated the association between changes in creatinine and the composite endpoint of death, rehospitalization or emergency room visit within 60 days in 301 patients in the Diuretic Optimization Strategies Evaluation (DOSE) trial. WRF was defined as an increase in creatinine >0.3 mg/dL and improvement in renal function (IRF) as a decrease >0.3 mg/dL. When examining linear changes in creatinine from baseline to 72 hours (the coprimary endpoint of DOSE), increasing creatinine was associated with lower risk for the composite outcome (HR = 0.81 per 0.3 mg/dL increase, 95% CI 0.67–0.98, P = .026). Compared with patients with stable renal function (n = 219), WRF (n = 54) was not associated with the composite endpoint (HR = 1.17, 95% CI = 0.77–1.78, P = .47). However, compared with stable renal function, there was a strong relationship between IRF (n = 28) and the composite endpoint (HR = 2.52, 95% CI = 1.57–4.03, P <.001). Conclusion The coprimary endpoint of the DOSE trial, a linear increase in creatinine, was paradoxically associated with improved outcomes. This was driven by absence of risk attributable to WRF and a strong risk associated with IRF. These results argue against using changes in serum creatinine as a surrogate endpoint in trials of decongestive strategies. PMID:27374839
Ponizovsky, Alexander M; Rosca, Paola; Aronovich, Edward; Weizman, Abraham; Grinshpoon, Alexander
2015-05-01
Limited clinical trials and case-reports yielded conflicting results regarding the efficacy of baclofen (a GABAB agonist) in the treatment of alcohol dependence. The aim of this study was to test the efficacy and tolerability of baclofen in alcohol dependent patients in Israel. The study was a double-blind, placebo-controlled, randomized trial comparing 50mg/day of baclofen to placebo over 12 weeks, in addition to a standard psychosocial intervention program, with 26-week and 52-week follow-up observations. The percentages of heavy drinking days and abstinent days were the primary outcome measures, and craving, distress and depression levels; self-efficacy; social support from different sources; and health-related quality of life (HRQL) were secondary outcomes. Tolerability was also examined. Sixty-four patients were randomized; 62% completed the 12-week trial and 37% completed the 52-week follow-up. No between group differences were found in the percentages of heavy drinking and abstinent days. A significant reduction in levels of distress, depression and craving and improved HRQL occurred for both arms, whereas self-efficacy and social support remained unchanged in both groups. No adverse events were observed. Unlike previous positive trials in Italy, and similarly to a negative trial in the USA, we found no evidence of superiority of baclofen over placebo in the treatment of alcohol dependence. However, the high placebo response undermines the validity of this conclusion. Therefore, more placebo-controlled trials are needed to either verify or discard a possible clinical efficacy of baclofen for alcohol dependence. Copyright © 2015 Elsevier Inc. All rights reserved.
Leucht, Stefan; Winter-van Rossum, Inge; Heres, Stephan; Arango, Celso; Fleischhacker, W Wolfgang; Glenthøj, Birte; Leboyer, Marion; Leweke, F Markus; Lewis, Shôn; McGuire, Phillip; Meyer-Lindenberg, Andreas; Rujescu, Dan; Kapur, Shitij; Kahn, René S; Sommer, Iris E
2015-05-01
Most of the 13 542 trials contained in the Cochrane Schizophrenia Group's register just tested the general efficacy of pharmacological or psychosocial interventions. Studies on the subsequent treatment steps, which are essential to guide clinicians, are largely missing. This knowledge gap leaves important questions unanswered. For example, when a first antipsychotic failed, is switching to another drug effective? And when should we use clozapine? The aim of this article is to review the efficacy of switching antipsychotics in case of nonresponse. We also present the European Commission sponsored "Optimization of Treatment and Management of Schizophrenia in Europe" (OPTiMiSE) trial which aims to provide a treatment algorithm for patients with a first episode of schizophrenia. We searched Pubmed (October 29, 2014) for randomized controlled trials (RCTs) that examined switching the drug in nonresponders to another antipsychotic. We described important methodological choices of the OPTiMiSE trial. We found 10 RCTs on switching antipsychotic drugs. No trial was conclusive and none was concerned with first-episode schizophrenia. In OPTiMiSE, 500 first episode patients are treated with amisulpride for 4 weeks, followed by a 6-week double-blind RCT comparing continuation of amisulpride with switching to olanzapine and ultimately a 12-week clozapine treatment in nonremitters. A subsequent 1-year RCT validates psychosocial interventions to enhance adherence. Current literature fails to provide basic guidance for the pharmacological treatment of schizophrenia. The OPTiMiSE trial is expected to provide a basis for clinical guidelines to treat patients with a first episode of schizophrenia. © The Author 2015. Published by Oxford University Press on behalf of the Maryland Psychiatric Research Center. All rights reserved. For permissions, please email: journals.permissions@oup.com.
[Current knowledge on perioperative treatments of non-small cell lung carcinomas].
Brosseau, S; Naltet, C; Nguenang, M; Gounant, V; Mordant, P; Milleron, B; Castier, Y; Zalcman, G
2017-06-01
Surgery is still the main treatment in early-stage of non-small cell lung cancer with 5-year survival of stage IA patients exceeding 80%, but 5-year survival of stage II patients rapidly decreasing with tumor size, N status, and visceral pleura invasion. The major metastatic risk in such patients has supported clinical research assessing systemic or loco-regional perioperative treatments. Modern phase 3 trials clearly validated adjuvant or neo-adjuvant platinum-based chemotherapy in resected stage I-III patients as a standard treatment of which value has been reassessed several independent meta-analyses, showing a 5% benefit in 5y-survival, and a decrease of the relative risk for death around from 12 to 25%. Conversely perioperative treatments were not validated for stage IA and IB patients. In more advanced stage patients, neo-adjuvant radio-chemotherapy has not been validated either. Adjuvant radiotherapy for N2 patients is currently tested in the large international phase 3 trial Lung-ART/IFCT-0503. The development of video-assisted thoracic surgery (VATS) has helped adjuvant chemotherapies for elderly patients. Perioperative targeted treatments in NSCLC with EGFR or ALK molecular alterations is currently assessed in the U.S. ALCHEMIST prospective trial. Finally, the role of immune check-points inhibitors is currently evaluated in a large international phase 3 trial testing adjuvant anti-PD-L1 monoclonal antibody, the BR31/IFCT-1401 trial, while a proof-of principle neo-adjuvant trial IONESCO/IFCT-1601, has just begun by the end of the 2016 year, with survival results of both trials expected in 5 to 7 years. Copyright © 2017 SPLF. Published by Elsevier Masson SAS. All rights reserved.
NASA Astrophysics Data System (ADS)
Rusyati, Lilit; Firman, Harry
2017-05-01
This research was motivated by the importance of multiple-choice questions that indicate the elements and sub-elements of critical thinking and implementation of computer-based test. The method used in this research was descriptive research for profiling the validation of science virtual test to measure students' critical thinking in junior high school. The participant is junior high school students of 8th grade (14 years old) while science teacher and expert as the validators. The instrument that used as a tool to capture the necessary data are sheet of an expert judgment, sheet of legibility test, and science virtual test package in multiple choice form with four possible answers. There are four steps to validate science virtual test to measure students' critical thinking on the theme of "Living Things and Environmental Sustainability" in 7th grade Junior High School. These steps are analysis of core competence and basic competence based on curriculum 2013, expert judgment, legibility test and trial test (limited and large trial test). The test item criterion based on trial test are accepted, accepted but need revision, and rejected. The reliability of the test is α = 0.747 that categorized as `high'. It means the test instruments used is reliable and high consistency. The validity of Rxy = 0.63 means that the validity of the instrument was categorized as `high' according to interpretation value of Rxy (correlation).
Discovery and validation of cell cycle arrest biomarkers in human acute kidney injury
2013-01-01
Introduction Acute kidney injury (AKI) can evolve quickly and clinical measures of function often fail to detect AKI at a time when interventions are likely to provide benefit. Identifying early markers of kidney damage has been difficult due to the complex nature of human AKI, in which multiple etiologies exist. The objective of this study was to identify and validate novel biomarkers of AKI. Methods We performed two multicenter observational studies in critically ill patients at risk for AKI - discovery and validation. The top two markers from discovery were validated in a second study (Sapphire) and compared to a number of previously described biomarkers. In the discovery phase, we enrolled 522 adults in three distinct cohorts including patients with sepsis, shock, major surgery, and trauma and examined over 300 markers. In the Sapphire validation study, we enrolled 744 adult subjects with critical illness and without evidence of AKI at enrollment; the final analysis cohort was a heterogeneous sample of 728 critically ill patients. The primary endpoint was moderate to severe AKI (KDIGO stage 2 to 3) within 12 hours of sample collection. Results Moderate to severe AKI occurred in 14% of Sapphire subjects. The two top biomarkers from discovery were validated. Urine insulin-like growth factor-binding protein 7 (IGFBP7) and tissue inhibitor of metalloproteinases-2 (TIMP-2), both inducers of G1 cell cycle arrest, a key mechanism implicated in AKI, together demonstrated an AUC of 0.80 (0.76 and 0.79 alone). Urine [TIMP-2]·[IGFBP7] was significantly superior to all previously described markers of AKI (P <0.002), none of which achieved an AUC >0.72. Furthermore, [TIMP-2]·[IGFBP7] significantly improved risk stratification when added to a nine-variable clinical model when analyzed using Cox proportional hazards model, generalized estimating equation, integrated discrimination improvement or net reclassification improvement. Finally, in sensitivity analyses [TIMP-2]·[IGFBP7] remained significant and superior to all other markers regardless of changes in reference creatinine method. Conclusions Two novel markers for AKI have been identified and validated in independent multicenter cohorts. Both markers are superior to existing markers, provide additional information over clinical variables and add mechanistic insight into AKI. Trial registration ClinicalTrials.gov number NCT01209169. PMID:23388612
Measuring the validity and reliability of the Apple Watch as a physical activity monitor.
Zhang, Peng; Godin, Steven D; Owens, Matthew V
2018-04-04
This study aimed to investigate the validity and reliability of the energy expenditure (EE) estimation of Apple Watch among college students. Thirty college students completed two sets of three 10-minute treadmill walking and running trials while wearing three Apple Watches and being connected to indirect calorimetry. The walking trials were at speeds of 54, 80, and 107 m•min-1 while the running trials were at 134, 161, 188m•min-1. Energy expenditure comparisons were made using Two-way ANOVA with repeatedmeasures. Reliability was analyzed by Intraclass Correlation. There was no significant device x speed interactions (F (15, 696) = 1.113, p = 0.341) between the indirect calorimetry (criterion) and Apple Watch. The lowest Inter-Class Correlation (ICC) scores were 0.49 (95%CI) at 54 while the highest were 0.72 (95%CI) at 107 and 134 m•min-1. Apple Watch demonstrated a low to moderate validity and reliability on measuring EE.
Communication skills in the training of psychiatrists: A systematic review of current approaches.
Ditton-Phare, Philippa; Loughland, Carmel; Duvivier, Robbert; Kelly, Brian
2017-07-01
A range of communication skills training programmes have been developed targeting trainees in various medical specialties, predominantly in oncology but to a lesser extent in psychiatry. Effective communication is fundamental to the assessment and treatment of psychiatric conditions, but there has been less attention to this in clinical practice for psychiatrists in training. This review examines the outcomes of communication skills training interventions in psychiatric specialty training. The published English-language literature was examined using multiple online databases, grey literature and hand searches. The review was conducted and reported using Preferred Reporting Items for Systematic Reviews and Meta-analyses guidelines. Studies examining the efficacy of communication skills training were included. Randomised controlled trials, pseudo-randomised studies and quasi-experimental studies, as well as observational analytical studies and qualitative studies that met criteria, were selected and critically appraised. No limits were applied for date of publication up until 16 July 2016. Total search results yielded 2574 records. Of these, 12 studies were identified and reviewed. Two were randomised controlled trials and the remaining 10 were one-group pretest/posttest designs or posttest-only designs, including self-report evaluations of communication skills training and objective evaluations of trainee skills. There were no studies with outcomes related to behaviour change or patient outcomes. Two randomised controlled trials reported an improvement in clinician empathy and psychotherapeutic interviewing skills due to specific training protocols focused on those areas. Non-randomised studies showed varying levels of skills gains and self-reported trainee satisfaction ratings with programmes, with the intervention being some form of communication skills training. The heterogeneity of communication skills training is a barrier to evaluating the efficacy of different communication skills training programmes. Further validation studies examining specific models and frameworks would support a stronger evidence base for communication skills training in psychiatry. It remains a challenge to develop research to investigate behaviour change over time in clinical practice or to measure patient outcomes due to the effects of communication skills training.
Measurement of discrete vertical in-shoe stress with piezoelectric transducers.
Gross, T S; Bunch, R P
1988-05-01
The purpose of this investigation was to design and validate a system suitable for non-invasive measurement of discrete in-shoe vertical plantar stress during dynamic activities. Eight transducers were constructed, with small piezoelectric ceramic squares (4.83 x 4.83 x 1.3 mm) used to generate a charge output proportional to vertical plantar stress. The mechanical properties of the transducers included 2.3% linearity and 3.7% hysteresis for stresses up to 2000 kPa and loading times up to 200 ms. System design efficacy was analysed by means of a multiple day, multiple trial data collection. With the transducers placed beneath plantar landmarks, the footstrike of one subject was recorded ten times on each of five days while running at 3.58 m/s on a treadmill. Within-day and between-day proportional error (PE) was used to estimate the error contained in the mean peak stress during foot contact. Within-day PE focused on trial to trial variability associated with the subject and equipment, and averaged 3.1% (range 2.5-4.0%) across transducer location. Between-day PE provided a cumulative estimate of subject, transducer placement, and random equipment variability, but excluded trial to trial variability. It ranged from 4.9 to 15.8%, with a mean of 9.9%. Peak stress, impulse, and sequence of loading data were examined to identify discrete foot function patterns and highlight the value of discrete stress analysis.
The effect of offset cues on saccade programming and covert attention.
Smith, Daniel T; Casteau, Soazig
2018-02-01
Salient peripheral events trigger fast, "exogenous" covert orienting. The influential premotor theory of attention argues that covert orienting of attention depends upon planned but unexecuted eye-movements. One problem with this theory is that salient peripheral events, such as offsets, appear to summon attention when used to measure covert attention (e.g., the Posner cueing task) but appear not to elicit oculomotor preparation in tasks that require overt orienting (e.g., the remote distractor paradigm). Here, we examined the effects of peripheral offsets on covert attention and saccade preparation. Experiment 1 suggested that transient offsets summoned attention in a manual detection task without triggering motor preparation planning in a saccadic localisation task, although there were a high proportion of saccadic capture errors on "no-target" trials, where a cue was presented but no target appeared. In Experiment 2, "no-target" trials were removed. Here, transient offsets produced both attentional facilitation and faster saccadic responses on valid cue trials. A third experiment showed that the permanent disappearance of an object also elicited attentional facilitation and faster saccadic reaction times. These experiments demonstrate that offsets trigger both saccade programming and covert attentional orienting, consistent with the idea that exogenous, covert orienting is tightly coupled with oculomotor activation. The finding that no-go trials attenuates oculomotor priming effects offers a way to reconcile the current findings with previous claims of a dissociation between covert attention and oculomotor control in paradigms that utilise a high proportion of catch trials.
Kim, Gyuri; Bae, Ji Cheol; Yi, Byoung Kee; Hur, Kyu Yeon; Chang, Dong Kyung; Lee, Moon-Kyu; Kim, Jae Hyeon; Jin, Sang-Man
2017-07-18
A Personal Health Record (PHR) is an online application that allows patients to access, manage, and share their health data. PHRs not only enhance shared decision making with healthcare providers, but also enable remote monitoring and at-home-collection of detailed data. The benefits of PHRs can be maximized in insulin dose adjustment for patients starting or intensifying insulin regimens, as frequent self-monitoring of glucose, self-adjustment of insulin dose, and precise at-home data collection during the visit-to-visit period are important for glycemic control. The aim of this study is to examine the efficacy and safety of insulin dose adjustment based on a smartphone PHR application in patients with diabetes mellitus (DM) and to confirm the validity and stability of an information and communication technology (ICT)-based centralized clinical trial monitoring system. This is a 24-week, open-label, randomized, multi-center trial. There are three follow-up measures: baseline, post-intervention at week 12, and at week 24. Subjects diagnosed with type 1 DM, type 2 DM, and/or post-transplant DM who initiate basal insulin or intensify their insulin regimen to a basal-bolus regimen are included. After education on insulin dose titration and prevention for hypoglycemia and a 1-week acclimation period, subjects are randomized in a 1:1 ratio to either an ICT-based intervention group or a conventional intervention group. Subjects in the conventional intervention group will save and send their health information to the server via a PHR application, whereas those in ICT-based intervention group will receive additional algorithm-based feedback messages. The health information includes level of blood glucose, insulin dose, details on hypoglycemia, food diary, and step count. The primary outcome will be the proportion of patients who reach an optimal insulin dose within 12 weeks of study enrollment, without severe hypoglycemia or unscheduled clinic visits. This clinical trial will reveal whether insulin dose adjustment based on a smartphone PHR application can facilitate the optimization of insulin doses in patients with DM. In addition, the process evaluation will provide information about the validity and stability of the ICT-based centralized clinical trial monitoring system in this research field. Clinicaltrials.gov NCT 03112343 . Registered on 12 April 2017.
Chen, A-Di; Wang, Chun-Ling; Qin, Yang; Tian, Liang; Chen, Li-Bin; Yuan, Xiao-Ming; Ma, Lin-Xiu; Wang, Yu-Feng; Sun, Ji-Rong; Wang, Hao-Sen; Dai, Neng
2017-12-20
Lipoprotein-associated phospholipase A 2 (Lp-PLA 2 ), a biomarker of oxidation and inflammation, has been associated with increased coronary artery disease risk. To date, very few studies have examined the Chinese herbal drug Danshen or its extract on Lp-PLA 2 in patients with stable angina pectoris. In this study, we aim to investigate the effect of Danshen extract on Lp-PLA 2 level in patients with stable angina. This is a randomized, single-blind, placebo-controlled, adaptive clinical trial. A total of 156 patients meeting the eligibility criteria will be randomly assigned to either the Danshen extract (DanshenDuofensuanyan injection and Danshen drop spill) group or the placebo group in a 1:1 ratio. Participants will then undergo treatment with DanshenDuofensuanyan injection or placebo (glucose) during hospitalization, followed by open-label Danshen drop spill (30 pills/day) in Danshen extract group for 60 days after discharge. Because this is an adaptive trial, two interim analyses are prospectively planned. These will be performed after one-third and two-thirds of the patients, respectively, have completed the trial. On the basis of the results of these interim analyses, a data monitoring committee will determine how to modify aspects of the study without undermining the validity and integrity of the trial. The primary outcome measure is the serum level of Lp-PLA 2 in the Danshen extract group and the placebo group. The secondary outcomes include the proportion of patients who show a clinically significant change, which is defined as at least a 20-point improvement in angina frequency score on the Seattle Angina Questionnaire and the carotid intima-media thickness, which will be measured using ultrasound. Other secondary efficacy and safety outcomes will also be assessed. This study will provide evidence that Danshen extract is beneficial for stable angina and may establish a possible mechanism of Danshen treatment effects on cardiovascular disease. This study may also validate an objective blood test (LP-PLA 2 level) for assessing the effectiveness of Danshen therapy in patients with stable angina pectoris. ClinicalTrials.gov, NCT02870764 . Registered on 13 August 2016.
Myo-inositol reduces β-catenin activation in colitis
Bradford, Emily M; Thompson, Corey A; Goretsky, Tatiana; Yang, Guang-Yu; Rodriguez, Luz M; Li, Linheng; Barrett, Terrence A
2017-01-01
AIM To assess dietary myo-inositol in reducing stem cell activation in colitis, and validate pβ-cateninS552 as a biomarker of recurrent dysplasia. METHODS We examined the effects of dietary myo-inositol treatment on inflammation, pβ-cateninS552 and pAkt levels by histology and western blot in IL-10-/- and dextran sodium sulfate-treated colitic mice. Additionally, we assessed nuclear pβ-cateninS552 in patients treated with myo-inositol in a clinical trial, and in patients with and without a history of colitis-induced dysplasia. RESULTS In mice, pβ-cateninS552 staining faithfully reported the effects of myo-inositol in reducing inflammation and intestinal stem cell activation. In a pilot clinical trial of myo-inositol administration in patients with a history of low grade dysplasia (LGD), two patients had reduced numbers of intestinal stem cell activation compared to the placebo control patient. In humans, pβ-cateninS552 staining discriminated ulcerative colitis patients with a history of LGD from those with benign disease. CONCLUSION Enumerating crypts with increased numbers of pβ-cateninS552 - positive cells can be utilized as a biomarker in colitis-associated cancer chemoprevention trials. PMID:28811707
Myo-inositol reduces β-catenin activation in colitis.
Bradford, Emily M; Thompson, Corey A; Goretsky, Tatiana; Yang, Guang-Yu; Rodriguez, Luz M; Li, Linheng; Barrett, Terrence A
2017-07-28
To assess dietary myo-inositol in reducing stem cell activation in colitis, and validate pβ-catenin S552 as a biomarker of recurrent dysplasia. We examined the effects of dietary myo-inositol treatment on inflammation, pβ-catenin S552 and pAkt levels by histology and western blot in IL-10 -/- and dextran sodium sulfate-treated colitic mice. Additionally, we assessed nuclear pβ-catenin S552 in patients treated with myo-inositol in a clinical trial, and in patients with and without a history of colitis-induced dysplasia. In mice, pβ-catenin S552 staining faithfully reported the effects of myo-inositol in reducing inflammation and intestinal stem cell activation. In a pilot clinical trial of myo-inositol administration in patients with a history of low grade dysplasia (LGD), two patients had reduced numbers of intestinal stem cell activation compared to the placebo control patient. In humans, pβ-catenin S552 staining discriminated ulcerative colitis patients with a history of LGD from those with benign disease. Enumerating crypts with increased numbers of pβ-catenin S552 - positive cells can be utilized as a biomarker in colitis-associated cancer chemoprevention trials.
Bentsen, H; Landrø, N I
2017-10-16
There is need for more efficient treatment of neurocognitive deficits in schizophrenia. In this 16 weeks randomised, placebo-controlled trial, we examined neurocognitive effects of adding ethyl-eicosapentaenoate 2g/day and/or vitamins E 364mg/day + C 1000mg/day to antipsychotics in 53 patients aged 18-39 years with acute schizophrenia. For the sake of validating neurocognitive tests, healthy subjects, not taking trial drugs, were also included in the study. Ethyl-EPA given alone to patients with low baseline RBC polyunsaturated fatty acids (PUFA), and Vitamins E+C given alone to high PUFA patients, impaired sustained attention (Continuous Performance Test, CPT-IP d prime score), standardised effect sizes d = 0.78 and d = 0.69, respectively. These adverse effects were paralleled by excessive increases in long-chain PUFA and serum alpha-tocopherol, respectively. They were counteracted by combining ethyl-EPA and vitamins, d = 0.80 and d = 0.74 in low and high PUFA patients, respectively. No other neurocognitive tests yielded significant results. Plausible mechanisms of harmful effects are oxidative stress and lipid raft disruption. Copyright © 2017. Published by Elsevier Ltd.
Kozloski, G V; Stefanello, C M; Oliveira, L; Filho, H M N Ribeiro; Klopfenstein, T J
2017-02-01
A data set of individual observations was compiled from digestibility trials to examine the relationship between the duodenal purine bases (PB) flow and urinary purine derivatives (PD) excretion and the validity of different equations for estimating rumen microbial N (Nm) supply based on urinary PD in comparison with estimates based on duodenal PB. Trials (8 trials, = 185) were conducted with male sheep fitted with a duodenal T-type cannula, housed in metabolic cages, and fed forage alone or with supplements. The amount of PD excreted in urine was linearly related to the amount of PB flowing to the duodenum ( < 0.05). The intercept of the linear regression was 0.180 mmol/(d·kg), representing the endogenous excretion of PD, and the slope was lower than 1 ( < 0.05), indicating that only 0.43% of the PB in the duodenum was excreted as PD in urine. The Nm supply estimated by either approach was linearly related ( < 0.05) to the digestible OM intake. However, the Nm supply estimated through either of 3 published PD-based equations probably underestimated the Nm supply in sheep.
A new music therapy engagement scale for persons with dementia.
Tan, Jane; Wee, Shiou-Liang; Yeo, Pei Shi; Choo, Juliet; Ritholz, Michele; Yap, Philip
2018-05-25
ABSTRACTObjectives:To develop and validate a new scale to assess music therapy engagement in persons with dementia (PWDs). A draft scale was derived from literature review and >2 years of qualitative recording of PWDs during music therapy. Content validity was attained through iterative consultations, trial sessions, and revisions. The final five-item Music Therapy Engagement scale for Dementia (MTED) assessed music and non-music related elements. Internal consistency and inter-rater reliability were assessed over 120 music therapy sessions. MTED was validated with the Greater Cincinnati Chapter Well-being Observation Tool, Holden Communication Scale, and Participant Engagement Observation Checklist - Music Sessions. A total of 62 PWDs (83.2 ± 7.7 years, modified version of the mini-mental state examination = 13.2/30 ± 4.1) in an acute hospital dementia unit were involved. The mean MTED score was 13.02/30 ± 4.27; internal consistency (Cronbach's α = 0.87) and inter-rater reliability (intra-class correlation = 0.96) were good. Principal component analysis revealed a one-factor structure with Eigen value > 1 (3.27), which explained 65.4% of the variance. MTED demonstrated good construct validity. The MTED total score correlated strongly with the combined items comprising Pleasure, Interest, Sadness, and Sustained attention of the Greater Cincinnati Chapter Well-being Observation Tool (rs = 0.88, p < 0.001). MTED is a clinically appropriate and psychometrically valid scale to evaluate music therapy engagement in PWDs.
Quality control and assurance for validation of DOS/I measurements
NASA Astrophysics Data System (ADS)
Cerussi, Albert; Durkin, Amanda; Kwong, Richard; Quang, Timothy; Hill, Brian; Tromberg, Bruce J.; MacKinnon, Nick; Mantulin, William W.
2010-02-01
Ongoing multi-center clinical trials are crucial for Biophotonics to gain acceptance in medical imaging. In these trials, quality control (QC) and assurance (QA) are key to success and provide "data insurance". Quality control and assurance deal with standardization, validation, and compliance of procedures, materials and instrumentation. Specifically, QC/QA involves systematic assessment of testing materials, instrumentation performance, standard operating procedures, data logging, analysis, and reporting. QC and QA are important for FDA accreditation and acceptance by the clinical community. Our Biophotonics research in the Network for Translational Research in Optical Imaging (NTROI) program for breast cancer characterization focuses on QA/QC issues primarily related to the broadband Diffuse Optical Spectroscopy and Imaging (DOS/I) instrumentation, because this is an emerging technology with limited standardized QC/QA in place. In the multi-center trial environment, we implement QA/QC procedures: 1. Standardize and validate calibration standards and procedures. (DOS/I technology requires both frequency domain and spectral calibration procedures using tissue simulating phantoms and reflectance standards, respectively.) 2. Standardize and validate data acquisition, processing and visualization (optimize instrument software-EZDOS; centralize data processing) 3. Monitor, catalog and maintain instrument performance (document performance; modularize maintenance; integrate new technology) 4. Standardize and coordinate trial data entry (from individual sites) into centralized database 5. Monitor, audit and communicate all research procedures (database, teleconferences, training sessions) between participants ensuring "calibration". This manuscript describes our ongoing efforts, successes and challenges implementing these strategies.
An experimental validation of genomic selection in octoploid strawberry
Gezan, Salvador A; Osorio, Luis F; Verma, Sujeet; Whitaker, Vance M
2017-01-01
The primary goal of genomic selection is to increase genetic gains for complex traits by predicting performance of individuals for which phenotypic data are not available. The objective of this study was to experimentally evaluate the potential of genomic selection in strawberry breeding and to define a strategy for its implementation. Four clonally replicated field trials, two in each of 2 years comprised of a total of 1628 individuals, were established in 2013–2014 and 2014–2015. Five complex yield and fruit quality traits with moderate to low heritability were assessed in each trial. High-density genotyping was performed with the Affymetrix Axiom IStraw90 single-nucleotide polymorphism array, and 17 479 polymorphic markers were chosen for analysis. Several methods were compared, including Genomic BLUP, Bayes B, Bayes C, Bayesian LASSO Regression, Bayesian Ridge Regression and Reproducing Kernel Hilbert Spaces. Cross-validation within training populations resulted in higher values than for true validations across trials. For true validations, Bayes B gave the highest predictive abilities on average and also the highest selection efficiencies, particularly for yield traits that were the lowest heritability traits. Selection efficiencies using Bayes B for parent selection ranged from 74% for average fruit weight to 34% for early marketable yield. A breeding strategy is proposed in which advanced selection trials are utilized as training populations and in which genomic selection can reduce the breeding cycle from 3 to 2 years for a subset of untested parents based on their predicted genomic breeding values. PMID:28090334
Ploquin, A; Olmos, D; Lacombe, D; A'Hern, R; Duhamel, A; Twelves, C; Marsoni, S; Morales-Barrera, R; Soria, J-C; Verweij, J; Voest, E E; Schöffski, P; Schellens, J H; Kramar, A; Kristeleit, R S; Arkenau, H-T; Kaye, S B; Penel, N
2012-09-25
Selecting patients with 'sufficient life expectancy' for Phase I oncology trials remains challenging. The Royal Marsden Hospital Score (RMS) previously identified high-risk patients as those with ≥ 2 of the following: albumin <35 g l(-1); LDH > upper limit of normal; >2 metastatic sites. This study developed an alternative prognostic model, and compared its performance with that of the RMS. The primary end point was the 90-day mortality rate. The new model was developed from the same database as RMS, but it used Chi-squared Automatic Interaction Detection (CHAID). The ROC characteristics of both methods were then validated in an independent database of 324 patients enrolled in European Organization on Research and Treatment of Cancer Phase I trials of cytotoxic agents between 2000 and 2009. The CHAID method identified high-risk patients as those with albumin <33 g l(-1) or ≥ 33 g l(-1), but platelet counts ≥ 400.000 mm(-3). In the validation data set, the rates of correctly classified patients were 0.79 vs 0.67 for the CHAID model and RMS, respectively. The negative predictive values (NPV) were similar for the CHAID model and RMS. The CHAID model and RMS provided a similarly high level of NPV, but the CHAID model gave a better accuracy in the validation set. Both CHAID model and RMS may improve the screening process in phase I trials.
Quantifying external focus of attention in sailing by means of action sport cameras.
Pluijms, Joost P; Cañal-Bruland, Rouwen; Hoozemans, Marco J M; Van Beek, Morris W; Böcker, Kaj; Savelsbergh, Geert J P
2016-08-01
The aim of the current study was twofold: (1) to validate the use of action sport cameras for quantifying focus of visual attention in sailing and (2) to apply this method to examine whether an external focus of attention is associated with better performance in upwind sailing. To test the validity of this novel quantification method, we first calculated the agreement between gaze location measures and head orientation measures in 13 sailors sailing upwind during training regattas using a head mounted eye tracker. The results confirmed that for measuring visual focus of attention in upwind sailing, the agreement for the two measures was high (intraclass correlation coefficient (ICC) = 0.97) and the 95% limits of agreement were acceptable (between -8.0% and 14.6%). In a next step, we quantified the focus of visual attention in sailing upwind as fast as possible by means of an action sport camera. We captured sailing performance, operationalised as boat speed in the direction of the wind, and environmental conditions using a GPS, compass and wind meter. Four trials, each lasting 1 min, were analysed for 15 sailors each, resulting in a total of 30 upwind speed trials on port tack and 30 upwind speed trials on starboard tack. The results revealed that in sailing - within constantly changing environments - the focus of attention is not a significant predictor for better upwind sailing performances. This implicates that neither external nor internal foci of attention was per se correlated with better performances. Rather, relatively large interindividual differences seem to indicate that different visual attention strategies can lead to similar performance outcomes.
Messinis, Lambros; Nasios, Grigorios; Mougias, Antonios; Politis, Antonis; Zampakis, Petros; Tsiamaki, Eirini; Malefaki, Sonia; Gourzis, Phillipos; Papathanasopoulos, Panagiotis
2016-01-01
Rey's Auditory Verbal Learning Test (RAVLT) is a widely used neuropsychological test to assess episodic memory. In the present study we sought to establish normative and discriminative validity data for the RAVLT in the elderly population using previously adapted learning lists for the Greek adult population. We administered the test to 258 cognitively healthy elderly participants, aged 60-89 years, and two patient groups (192 with amnestic mild cognitive impairment, aMCI, and 65 with Alzheimer's disease, AD). From the statistical analyses, we found that age and education contributed significantly to most trials of the RAVLT, whereas the influence of gender was not significant. Younger elderly participants with higher education outperformed the older elderly with lower education levels. Moreover, both clinical groups performed significantly worse on most RAVLT trials and composite measures than matched cognitively healthy controls. Furthermore, the AD group performed more poorly than the aMCI group on most RAVLT variables. Receiver operating characteristic (ROC) analysis was used to examine the utility of the RAVLT trials to discriminate cognitively healthy controls from aMCI and AD patients. Area under the curve (AUC), an index of effect size, showed that most of the RAVLT measures (individual and composite) included in this study adequately differentiated between the performance of healthy elders and aMCI/AD patients. We also provide cutoff scores in discriminating cognitively healthy controls from aMCI and AD patients, based on the sensitivity and specificity of the prescribed scores. Moreover, we present age- and education-specific normative data for individual and composite scores for the Greek adapted RAVLT in elderly subjects aged between 60 and 89 years for use in clinical and research settings.
2011-01-01
Background Randomised controlled trials (RCTs) are the gold standard of evidence-based practice in medicine but they have had limited influence in the field of intellectual disabilities. Previous literature suggests that participants and professionals have limited tolerance for this type of research methodology. However, it is not known how well service users, carers and other health professionals understand and accept the need for RCTs, and why it is important for individuals with intellectual disabilities to be included in this kind of research. Methods We examined individual perceptions of RCTs in 51 participants (18 carers, 6 service users and 27 professionals) using semi-structured interviews. A framework approach was adopted in the analysis of data. Results We found that participants had concerns about capacity and resource allocation but held positive views towards this type of research methodology. Understanding of the principles behind RCTs was poor amongst service users and a minority of carers, but mediated by previous exposure to research for professionals. Conclusions The social validity of RCTs in intellectual disabilities may be compromised by lack of understanding of the design and the on-going concerns about obtaining informed consent especially in incapacitated adults. However, the overall finding that the need for this form of research was seen in a positive light suggests that there is a turning point in the perceptions of stakeholders working in intellectual disabilities services. We recommend that researchers include on-going education on RCT design during trials, tailoring it to all stakeholders with emphasis on strong service user and care involvement. This could be a pivotal element in improving acceptability of, and recruitment to RCTs. PMID:21658215
Leveraging the power of pooled data for cancer outcomes research.
Hugh-Yeun, Kiara; Cheung, Winson Y
2016-08-02
Clinical trials continue to be the gold standard for determining the efficacy of novel cancer treatments, but they may also expose participants to the potential risks of unpredictable or severe toxicities. The development of validated tools that better inform patients of the benefits and risks associated with clinical trial participation can facilitate the informed consent process. The design and validation of such instruments are strengthened when we leverage the power of pooled data analysis for cancer outcomes research. In a recent study published in the Journal of Clinical Oncology entitled "Determinants of early mortality among 37,568 patients with colon cancer who participated in 25 clinical trials from the adjuvant colon cancer endpoints database," using a large pooled analysis of over 30,000 study participants who were enrolled in clinical trials of adjuvant therapy for early-stage colon cancer, we developed and validated a nomogram depicting the predictors of early cancer mortality. This database of pooled individual-level data allowed for a comprehensive analysis of poor prognostic factors associated with early death; furthermore, it enabled the creation of a nomogram that was able to reliably capture and quantify the benefit-to-risk profile for patients who are considering clinical trial participation. This tool can facilitate treatment decision-making discussions. As China and other Asian countries continue to conduct oncology clinical trials, efforts to collate patient-level information from these studies into a large data repository should be strongly considered since pooled data can increase future capacity for cancer outcomes research, which, in turn, can enhance patient-physician discussions and optimize clinical care.
NIH EXAMINER: conceptualization and development of an executive function battery.
Kramer, Joel H; Mungas, Dan; Possin, Katherine L; Rankin, Katherine P; Boxer, Adam L; Rosen, Howard J; Bostrom, Alan; Sinha, Lena; Berhel, Ashley; Widmeyer, Mary
2014-01-01
Executive functioning is widely targeted when human cognition is assessed, but there is little consensus on how it should be operationalized and measured. Recognizing the difficulties associated with establishing standard operational definitions of executive functioning, the National Institute of Neurological Disorders and Stroke entered into a contract with the University of California-San Francisco to develop psychometrically robust executive measurement tools that would be accepted by the neurology clinical trials and clinical research communities. This effort, entitled Executive Abilities: Measures and Instruments for Neurobehavioral Evaluation and Research (EXAMINER), resulted in a series of tasks targeting working memory, inhibition, set shifting, fluency, insight, planning, social cognition and behavior. We describe battery conceptualization and development, data collection, scale construction based on item response theory, and lay the foundation for studying the battery's utility and validity for specific assessment and research goals.
Tzoulaki, Ioanna; Zgaga, Lina; Ioannidis, John P A
2014-01-01
Objective To evaluate the breadth, validity, and presence of biases of the associations of vitamin D with diverse outcomes. Design Umbrella review of the evidence across systematic reviews and meta-analyses of observational studies of plasma 25-hydroxyvitamin D or 1,25-dihydroxyvitamin D concentrations and randomised controlled trials of vitamin D supplementation. Data sources Medline, Embase, and screening of citations and references. Eligibility criteria Three types of studies were eligible for the umbrella review: systematic reviews and meta-analyses that examined observational associations between circulating vitamin D concentrations and any clinical outcome; and meta-analyses of randomised controlled trials assessing supplementation with vitamin D or active compounds (both established and newer compounds of vitamin D). Results 107 systematic literature reviews and 74 meta-analyses of observational studies of plasma vitamin D concentrations and 87 meta-analyses of randomised controlled trials of vitamin D supplementation were identified. The relation between vitamin D and 137 outcomes has been explored, covering a wide range of skeletal, malignant, cardiovascular, autoimmune, infectious, metabolic, and other diseases. Ten outcomes were examined by both meta-analyses of observational studies and meta-analyses of randomised controlled trials, but the direction of the effect and level of statistical significance was concordant only for birth weight (maternal vitamin D status or supplementation). On the basis of the available evidence, an association between vitamin D concentrations and birth weight, dental caries in children, maternal vitamin D concentrations at term, and parathyroid hormone concentrations in patients with chronic kidney disease requiring dialysis is probable, but further studies and better designed trials are needed to draw firmer conclusions. In contrast to previous reports, evidence does not support the argument that vitamin D only supplementation increases bone mineral density or reduces the risk of fractures or falls in older people. Conclusions Despite a few hundred systematic reviews and meta-analyses, highly convincing evidence of a clear role of vitamin D does not exist for any outcome, but associations with a selection of outcomes are probable. PMID:24690624
OWL model of clinical trial eligibility criteria compatible with partially-known information
2013-01-01
Background Clinical trials are important for patients, for researchers and for companies. One of the major bottlenecks is patient recruitment. This task requires the matching of a large volume of information about the patient with numerous eligibility criteria, in a logically-complex combination. Moreover, some of the patient’s information necessary to determine the status of the eligibility criteria may not be available at the time of pre-screening. Results We showed that the classic approach based on negation as failure over-estimates rejection when confronted with partially-known information about the eligibility criteria because it ignores the distinction between a trial for which patient eligibility should be rejected and trials for which patient eligibility cannot be asserted. We have also shown that 58.64% of the values were unknown in the 286 prostate cancer cases examined during the weekly urology multidisciplinary meetings at Rennes’ university hospital between October 2008 and March 2009. We propose an OWL design pattern for modeling eligibility criteria based on the open world assumption to address the missing information problem. We validate our model on a fictitious clinical trial and evaluate it on two real clinical trials. Our approach successfully distinguished clinical trials for which the patient is eligible, clinical trials for which we know that the patient is not eligible and clinical trials for which the patient may be eligible provided that further pieces of information (which we can identify) can be obtained. Conclusions OWL-based reasoning based on the open world assumption provides an adequate framework for distinguishing those patients who can confidently be rejected from those whose status cannot be determined. The expected benefits are a reduction of the workload of the physicians and a higher efficiency by allowing them to focus on the patients whose eligibility actually require expertise. PMID:24034867
Look--but also listen! ReQuest: A new dimension-oriented GERD symptom scale.
Bardhan, Karna Dev
2004-03-01
The symptom spectrum of gastroesophageal reflux disease (GERD) is much wider than is commonly believed, and in about half the patients endoscopic examination is negative. The role of endoscopy to assess response to treatment is therefore much reduced in GERD. Assessment of symptoms is becoming increasingly important, so different outcome measures are required. The Reflux Questionnaire ReQuest was thus designed as a brief, effective and robust method of tracking and quantifying GERD symptoms during treatment. It comprises seven dimensions: general well-being, acid complaints, upper abdominal/stomach complaints, lower abdominal/digestive complaints, nausea, sleep disturbances and other complaints. In the short version of ReQuest the symptom burden of each dimension is measured by its frequency and intensity (except general well-being, for which only the intensity was determined). The long version also includes 67 symptom descriptions that constitute the dimensions (except general well-being). The rigorous validation process included clinical trial evaluation and statistical assessment of the findings. Important measures of the instrument, such as internal consistency, test/ retest reliability, construct validity and the responsiveness to changes during treatment, among others, all fulfilled or exceeded requirements, thereby demonstrating the accuracy of the instrument. ReQuest meets the criteria set by regulatory authorities and serves as the primary outcome measure for symptom assessment in future clinical trials of current and new treatments. (c) 2004 Prous Science
Mungovan, Sean F; Peralta, Paula J; Gass, Gregory C; Scanlan, Aaron T
2018-04-12
To examine the test-retest reliability and criterion validity of a high-intensity, netball-specific fitness test. Repeated measures, within-subject design. Eighteen female netball players competing in an international competition completed a trial of the Net-Test, which consists of 14 timed netball-specific movements. Players also completed a series of netball-relevant criterion fitness tests. Ten players completed an additional Net-Test trial one week later to assess test-retest reliability using intraclass correlation coefficient (ICC), typical error of measurement (TEM), and coefficient of variation (CV). The typical error of estimate expressed as CV and Pearson correlations were calculated between each criterion test and Net-Test performance to assess criterion validity. Five movements during the Net-Test displayed moderate ICC (0.84-0.90) and two movements displayed high ICC (0.91-0.93). Seven movements and heart rate taken during the Net-Test held low CV (<5%) with values ranging from 1.7 to 9.5% across measures. Total time (41.63±2.05s) during the Net-Test possessed low CV and significant (p<0.05) correlations with 10m sprint time (1.98±0.12s; CV=4.4%, r=0.72), 20m sprint time (3.38±0.19s; CV=3.9%, r=0.79), 505 Change-of-Direction time (2.47±0.08s; CV=2.0%, r=0.80); and maximum oxygen uptake (46.59±2.58 mLkg -1 min -1 ; CV=4.5%, r=-0.66). The Net-Test possesses acceptable reliability for the assessment of netball fitness. Further, the high criterion validity for the Net-Test suggests a range of important netball-specific fitness elements are assessed in combination. Copyright © 2018 Sports Medicine Australia. Published by Elsevier Ltd. All rights reserved.
Hammoudeh, Samer; Ghuloum, Suhaila; Mahfoud, Ziyad; Opler, Mark; Khan, Anzalee; Yehya, Arij; Abdulhakam, Abdulmoneim; Al-Mujalli, Azza; Hani, Yahya; Elsherbiny, Reem; Al-Amin, Hassen
2016-12-07
Patients with schizophrenia are known to have higher rates of mortality and morbidity when compared to the general population. Suicidality is a major contributor to increased mortality. The International Suicide Prevention Trial (InterSePT) Scale for Suicidal Thinking (ISST) is a validated tool to assess current suicidal ideation in patients with schizophrenia. The aims of the study were to culturally adapt the Arabic translation of ISST and to examine the psychometric characteristics of the Arabic version of the ISST among patients with schizophrenia in Qatar. ISST was translated and adapted into formal Arabic using the back translation method. Patients diagnosed with schizophrenia were randomly recruited from the department of Psychiatry, Rumailah Hospital, Doha, Qatar. Healthy controls were randomly recruited from two primary health care centers in Doha, Qatar. The Arabic version of Module B for suicidality in Mini International Neuropsychiatric Interview was used as the gold standard to which the Arabic ISST was compared. The study sample (n = 199) was composed of 100 patients diagnosed with schizophrenia (age 35.30 ± 10.04 years; M/F is 2/1) and 99 controls (age 33.98 ± 8.33 years; M/F is 2/3). The mean score on the ISST was 3.03 ± 4.75 vs. 0.47 ± 1.44 for the schizophrenia and control groups, respectively. Inter-rater reliability coefficient was 0.95, p > 0.001. The overall Cronbach's alpha was 0.92. Principal Component Analysis produced 3 factors explaining a total of 73.8% of variance. This is the first study in the Arab countries to validate the Arabic version of the ISST. The psychometric properties indicate that the Arabic ISST is a valid tool to assess the severity of suicidal ideation in Arabic patients with schizophrenia.
Karim, Quarraisha Abdool; Kharsany, Ayesha B M; Naidoo, Kasavan; Yende, Nonhlanhla; Gengiah, Tanuja; Omar, Zaheen; Arulappan, Natasha; Mlisana, Koleka P; Luthuli, Londiwe R; Karim, Salim S Abdool
2011-05-01
In settings where multiple HIV prevention trials are conducted in close proximity, trial participants may attempt to enroll in more than one trial simultaneously. Co-enrollment impacts on participant's safety and validity of trial results. We describe our experience, remedial action taken, inter-organizational collaboration and lessons learnt following the identification of co-enrolled participants. Between February and April 2008, we identified 185 of the 398 enrolled participants as ineligible. In violation of the study protocol exclusion criteria, there was simultaneous enrollment in another HIV prevention trial (ineligible co-enrolled, n=135), and enrollment of women who had participated in a microbicide trial within the past 12 months (ineligible not co-enrolled, n=50). Following a complete audit of all enrolled participants, ineligible participants were discontinued via study exit visits from trial follow-up. Custom-designed education program on co-enrollment impacting on participants' safety and validity of the trial results was implemented. Shared electronic database between research units was established to enable verification of each volunteer's trial participation and to prevent future co-enrollments. Interviews with ineligible enrolled women revealed that high-quality care, financial incentives, altruistic motives, preference for sex with gel, wanting to increase their likelihood of receiving active gel, perceived low risk of discovery and peer pressure are the reasons for their enrollment in the CAPRISA 004 trial. Instituting education programs based on the reasons reported by women for seeking enrollment in more than one trial and using a shared central database system to identify co-enrollments have effectively prevented further co-enrollments. Copyright © 2011 Elsevier Inc. All rights reserved.
Sainz de Baranda, Pilar; Rodríguez-Iniesta, María; Ayala, Francisco; Santonja, Fernando; Cejudo, Antonio
2014-07-01
To examine the criterion-related validity of the horizontal hip joint angle (H-HJA) test and vertical hip joint angle (V-HJA) test for estimating hamstring flexibility measured through the passive straight-leg raise (PSLR) test using contemporary statistical measures. Validity study. Controlled laboratory environment. One hundred thirty-eight professional trampoline gymnasts (61 women and 77 men). Hamstring flexibility. Each participant performed 2 trials of H-HJA, V-HJA, and PSLR tests in a randomized order. The criterion-related validity of H-HJA and V-HJA tests was measured through the estimation equation, typical error of the estimate (TEEST), validity correlation (β), and their respective confidence limits. The findings from this study suggest that although H-HJA and V-HJA tests showed moderate to high validity scores for estimating hamstring flexibility (standardized TEEST = 0.63; β = 0.80), the TEEST statistic reported for both tests was not narrow enough for clinical purposes (H-HJA = 10.3 degrees; V-HJA = 9.5 degrees). Subsequently, the predicted likely thresholds for the true values that were generated were too wide (H-HJA = predicted value ± 13.2 degrees; V-HJA = predicted value ± 12.2 degrees). The results suggest that although the HJA test showed moderate to high validity scores for estimating hamstring flexibility, the prediction intervals between the HJA and PSLR tests are not strong enough to suggest that clinicians and sport medicine practitioners should use the HJA and PSLR tests interchangeably as gold standard measurement tools to evaluate and detect short hamstring muscle flexibility.
Leveraging molecular datasets for biomarker-based clinical trial design in glioblastoma.
Tanguturi, Shyam K; Trippa, Lorenzo; Ramkissoon, Shakti H; Pelton, Kristine; Knoff, David; Sandak, David; Lindeman, Neal I; Ligon, Azra H; Beroukhim, Rameen; Parmigiani, Giovanni; Wen, Patrick Y; Ligon, Keith L; Alexander, Brian M
2017-07-01
Biomarkers can improve clinical trial efficiency, but designing and interpreting biomarker-driven trials require knowledge of relationships among biomarkers, clinical covariates, and endpoints. We investigated these relationships across genomic subgroups of glioblastoma (GBM) within our institution (DF/BWCC), validated results in The Cancer Genome Atlas (TCGA), and demonstrated potential impacts on clinical trial design and interpretation. We identified genotyped patients at DF/BWCC, and clinical associations across 4 common GBM genomic biomarker groups were compared along with overall survival (OS), progression-free survival (PFS), and survival post-progression (SPP). Significant associations were validated in TCGA. Biomarker-based clinical trials were simulated using various assumptions. Epidermal growth factor receptor (EGFR)(+) and p53(-) subgroups were more likely isocitrate dehydrogenase (IDH) wild-type. Phosphatidylinositol-3 kinase (PI3K)(+) patients were older, and patients with O6-DNA methylguanine-methyltransferase (MGMT)-promoter methylation were more often female. OS, PFS, and SPP were all longer for IDH mutant and MGMT methylated patients, but there was no independent prognostic value for other genomic subgroups. PI3K(+) patients had shorter PFS among IDH wild-type tumors, however, and no DF/BWCC long-term survivors were either EGFR(+) (0% vs 7%, P = .014) or p53(-) (0% vs 10%, P = .005). The degree of biomarker overlap impacted the efficiency of Bayesian-adaptive clinical trials, while PFS and OS distribution variation had less impact. Biomarker frequency was proportionally associated with sample size in all designs. We identified several associations between GBM genomic subgroups and clinical or molecular prognostic covariates and validated known prognostic factors in all survival periods. These results are important for biomarker-based trial design and interpretation of biomarker-only and nonrandomized trials. © The Author(s) 2017. Published by Oxford University Press on behalf of the Society for Neuro-Oncology. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com
Villafañe, Jorge H; Gobbo, Massimiliano; Peranzoni, Matteo; Naik, Ganesh; Imperio, Grace; Cleland, Joshua A; Negrini, Stefano
2016-09-01
This systematic literature review aimed at examining the validity and applicability in everyday clinical rehabilitation practise of methods for the assessment of back muscle fatiguability in patients with chronic non-specific low back pain (CNSLBP). Extensive research was performed in MEDLINE, Cumulative Index of Nursing and Allied Health Literature (CINAHL), Embase, Physiotherapy Evidence Database (PEDro) and Cochrane Central Register of Controlled Trials (CENTRAL) databases from their inception to September 2014. Potentially relevant articles were also manually looked for in the reference lists of the identified publications. Studies examining lumbar muscle fatigue in people with CNSLBP were selected. Two reviewers independently selected the articles, carried out the study quality assessment and extracted the results. A modified Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) scale was used to evaluate the scientific rigour of the selected works. Twenty-four studies fulfilled the selection criteria and were included in the systematic review. We found conflicting data regarding the validity of methods used to examine back muscle fatigue. The Biering-Sorensen test, performed in conjunction with surface electromyography spectral analysis, turned out to be the most widely used and comparatively, the most optimal modality currently available to assess objective back muscle fatigue in daily clinical practise, even though critical limitations are discussed. Future research should address the identification of an advanced method for lower back fatigue assessment in patients with CNSLBP which, eventually, might provide physical therapists with an objective and reliable test usable in everyday clinical practise. Implications for Rehabilitation Despite its limitations, the Biering-Sorensen test is currently the most used, convenient and easily available fatiguing test for lumbar muscles. To increase validity and reliability of the Biering-Sorensen test, concomitant activation of synergistic muscles should be taken into account. Pooled mean frequency and half-width of the spectrum are currently the most valid electromyographic parameters to assess fatigue in chronic non-specific low back pain. Body mass index, grading of pain and level of disability of the study population should be reported to enhance research quality.
Validity and test–retest reliability of a novel simple back extensor muscle strength test
Harding, Amy T; Weeks, Benjamin Kurt; Horan, Sean A; Little, Andrew; Watson, Steven L; Beck, Belinda Ruth
2017-01-01
Objectives: To develop and determine convergent validity and reliability of a simple and inexpensive clinical test to quantify back extensor muscle strength. Methods: Two testing sessions were conducted, 7 days apart. Each session involved three trials of standing maximal isometric back extensor muscle strength using both the novel test and isokinetic dynamometry. Lumbar spine bone mineral density was examined by dual-energy X-ray absorptiometry. Validation was examined with Pearson correlations (r). Test–retest reliability was examined with intraclass correlation coefficients and limits of agreement. Pearson correlations and intraclass correlation coefficients are presented with corresponding 95% confidence intervals. Linear regression was used to examine the ability of peak back extensor muscle strength to predict indices of lumbar spine bone mineral density and strength. Results: A total of 52 healthy adults (26 men, 26 women) aged 46.4 ± 20.4 years were recruited from the community. A strong positive relationship was observed between peak back extensor strength from hand-held and isokinetic dynamometry (r = 0.824, p < 0.001). For the novel back extensor strength test, short- and long-term reliability was excellent (intraclass correlation coefficient = 0.983 (95% confidence interval, 0.971–0.990), p < 0.001 and intraclass correlation coefficient = 0.901 (95% confidence interval, 0.833–0.943), p < 0.001, respectively). Limits of agreement for short-term repeated back extensor strength measures with the novel back extensor strength protocol were −6.63 to 7.70 kg, with a mean bias of +0.71 kg. Back extensor strength predicted 11% of variance in lumbar spine bone mineral density (p < 0.05) and 9% of lumbar spine index of bone structural strength (p < 0.05). Conclusion: Our novel hand-held dynamometer method to determine back extensor muscle strength is quick, relatively inexpensive, and reliable; demonstrates initial convergent validity in a healthy population; and is associated with bone mass at a clinically important site. PMID:28255442
Zbrozek, Arthur; Hebert, Joy; Gogates, Gregory; Thorell, Rod; Dell, Christopher; Molsen, Elizabeth; Craig, Gretchen; Grice, Kenneth; Kern, Scottie; Hines, Sheldon
2013-06-01
Outcomes research literature has many examples of high-quality, reliable patient-reported outcome (PRO) data entered directly by electronic means, ePRO, compared to data entered from original results on paper. Clinical trial managers are increasingly using ePRO data collection for PRO-based end points. Regulatory review dictates the rules to follow with ePRO data collection for medical label claims. A critical component for regulatory compliance is evidence of the validation of these electronic data collection systems. Validation of electronic systems is a process versus a focused activity that finishes at a single point in time. Eight steps need to be described and undertaken to qualify the validation of the data collection software in its target environment: requirements definition, design, coding, testing, tracing, user acceptance testing, installation and configuration, and decommissioning. These elements are consistent with recent regulatory guidance for systems validation. This report was written to explain how the validation process works for sponsors, trial teams, and other users of electronic data collection devices responsible for verifying the quality of the data entered into relational databases from such devices. It is a guide on the requirements and documentation needed from a data collection systems provider to demonstrate systems validation. It is a practical source of information for study teams to ensure that ePRO providers are using system validation and implementation processes that will ensure the systems and services: operate reliably when in practical use; produce accurate and complete data and data files; support management control and comply with any existing regulations. Furthermore, this short report will increase user understanding of the requirements for a technology review leading to more informed and balanced recommendations or decisions on electronic data collection methods. Copyright © 2013 International Society for Pharmacoeconomics and Outcomes Research (ISPOR). Published by Elsevier Inc. All rights reserved.
Chalmers, Joanne; Deckert, Stefanie; Schmitt, Jochen
2015-06-01
This article describes the core outcome set (COS) for atopic eczema trials. COS describe a minimum set of outcomes to be assessed in a defined situation. COS are required to overcome the current situation of different trials using different endpoints with unclear/insufficient measurement properties resulting in incomparable trials. The global multi-stakeholder Harmonising Outcomes Measures for Eczema initiative developed the Harmonising Outcomes Measures for Eczema roadmap as a generic framework for COS development. Following the establishment of a panel representing all stakeholders, a core set of outcome domains need to be selected based on systematic reviews and consensus methods. Outcome measurement instruments to assess these core domains need to be valid, reliable, and feasible. There is broad global consensus that clinical signs, quality of life, symptoms, and long-term control of flares form the COS for atopic eczema trials. The Eczema Area and Severity Index is recommended to assess clinical signs in atopic eczema trials. Systematic reviews to identify adequate outcome measurement instruments for the other core outcome domains are underway. Clinical signs should be assessed in all atopic eczema trials by at least the Eczema Area and Severity Index. Quality of life, symptoms, and flares should also be assessed in all atopic eczema trials by a valid, reliable, and feasible instrument.
A generic minimization random allocation and blinding system on web.
Cai, Hongwei; Xia, Jielai; Xu, Dezhong; Gao, Donghuai; Yan, Yongping
2006-12-01
Minimization is a dynamic randomization method for clinical trials. Although recommended by many researchers, the utilization of minimization has been seldom reported in randomized trials mainly because of the controversy surrounding the validity of conventional analyses and its complexity in implementation. However, both the statistical and clinical validity of minimization were demonstrated in recent studies. Minimization random allocation system integrated with blinding function that could facilitate the implementation of this method in general clinical trials has not been reported. SYSTEM OVERVIEW: The system is a web-based random allocation system using Pocock and Simon minimization method. It also supports multiple treatment arms within a trial, multiple simultaneous trials, and blinding without further programming. This system was constructed with generic database schema design method, Pocock and Simon minimization method and blinding method. It was coded with Microsoft Visual Basic and Active Server Pages (ASP) programming languages. And all dataset were managed with a Microsoft SQL Server database. Some critical programming codes were also provided. SIMULATIONS AND RESULTS: Two clinical trials were simulated simultaneously to test the system's applicability. Not only balanced groups but also blinded allocation results were achieved in both trials. Practical considerations for minimization method, the benefits, general applicability and drawbacks of the technique implemented in this system are discussed. Promising features of the proposed system are also summarized.
Durão, Solange; Kredo, Tamara; Volmink, Jimmy
2015-06-01
To develop, assess, and maximize the sensitivity of a search strategy to identify diet and nutrition trials in PubMed using relative recall. We developed a search strategy to identify diet and nutrition trials in PubMed. We then constructed a gold standard reference set to validate the identified trials using the relative recall method. Relative recall was calculated by dividing the number of references from the gold standard our search strategy identified by the total number of references in the gold standard. Our gold standard comprised 298 trials, derived from 16 included systematic reviews. The initial search strategy identified 242 of 298 references, with a relative recall of 81.2% [95% confidence interval (CI): 76.3%, 85.5%]. We analyzed titles and abstracts of the 56 missed references for possible additional terms. We then modified the search strategy accordingly. The relative recall of the final search strategy was 88.6% (95% CI: 84.4%, 91.9%). We developed a search strategy to identify diet and nutrition trials in PubMed with a high relative recall (sensitivity). This could be useful for establishing a nutrition trials register to support the conduct of future research, including systematic reviews. Copyright © 2015 The Authors. Published by Elsevier Inc. All rights reserved.
Evaluation and comparison of predictive individual-level general surrogates.
Gabriel, Erin E; Sachs, Michael C; Halloran, M Elizabeth
2018-07-01
An intermediate response measure that accurately predicts efficacy in a new setting at the individual level could be used both for prediction and personalized medical decisions. In this article, we define a predictive individual-level general surrogate (PIGS), which is an individual-level intermediate response that can be used to accurately predict individual efficacy in a new setting. While methods for evaluating trial-level general surrogates, which are predictors of trial-level efficacy, have been developed previously, few, if any, methods have been developed to evaluate individual-level general surrogates, and no methods have formalized the use of cross-validation to quantify the expected prediction error. Our proposed method uses existing methods of individual-level surrogate evaluation within a given clinical trial setting in combination with cross-validation over a set of clinical trials to evaluate surrogate quality and to estimate the absolute prediction error that is expected in a new trial setting when using a PIGS. Simulations show that our method performs well across a variety of scenarios. We use our method to evaluate and to compare candidate individual-level general surrogates over a set of multi-national trials of a pentavalent rotavirus vaccine.
Sarri, Grammati; Bhor, Menaka; Abogunrin, Seye; Farmer, Caroline; Nandal, Savita; Halloway, Rashid; Revicki, Dennis A
2018-05-21
Sickle cell disease (SCD) is a chronic condition associated with high mortality and morbidity. It is characterized by acute clinical symptoms such as painful vaso-occlusive crises, which can impair health-related quality of life (HRQL). This study was conducted to identify validated patient-reported outcome (PRO) instruments for use in future trials of potential treatments for SCD. A systematic literature review (SLR) was performed using MEDLINE and EMBASE to identify United States (US)-based studies published in English between 1997 and 2017 that reported on validated PRO instruments used in randomized controlled trials and real-world settings. The COnsensus-based Standards for the selection of health Measurement INstruments (COSMIN) checklist was used to assess the quality of PRO instruments. The SLR included 21 studies assessing the psychometric properties of 24 PRO instruments. Fifteen of those instruments were developed and validated for adults and 10 for children (one instrument was used in both children and young adults aged up to 21 years). Only five of the 15 adult instruments and three of the 10 pediatric instruments were developed specifically for SCD. For most instruments, there were few or no data on validation conducted in SCD development cohorts. Of the 24 PRO instruments identified, 16 had strong internal reliability (Cronbach's α ≥0.80). There was often insufficient information to assess the content validity, construct validity, responsiveness, or test-retest reliability of the instruments identified for both child and adult populations. No validated PRO instruments measuring caregiver burden in SCD were identified. The evidence on the psychometric properties of PRO instruments was limited. However, the results of this SLR provide key information on such tools to help inform the design of future clinical trials for patients with SCD in the US.
Hang, Junjie; Wu, Lixia; Zhu, Lina; Sun, Zhiqiang; Wang, Ge; Pan, Jingjing; Zheng, Suhua; Xu, Kequn; Du, Jiadi; Jiang, Hua
2018-06-01
It is necessary to develop prognostic tools of metastatic pancreatic cancer (MPC) for optimizing therapeutic strategies. Thus, we tried to develop and validate a prognostic nomogram of MPC. Data from 3 clinical trials (NCT00844649, NCT01124786, and NCT00574275) and 133 Chinese MPC patients were used for analysis. The former 2 trials were taken as the training cohort while NCT00574275 was used as the validation cohort. In addition, 133 MPC patients treated in China were taken as the testing cohort. Cox regression model was used to investigate prognostic factors in the training cohort. With these factors, we established a nomogram and verified it by Harrell's concordance index (C-index) and calibration plots. Furthermore, the nomogram was externally validated in the validation cohort and testing cohort. In the training cohort (n = 445), performance status, liver metastasis, Carbohydrate antigen 19-9 (CA19-9) log-value, absolute neutrophil count (ANC), and albumin were independent prognostic factors for overall survival (OS). A nomogram was established with these factors to predict OS and survival probabilities. The nomogram showed an acceptable discrimination ability (C-index: .683) and good calibration, and was further externally validated in the validation cohort (n = 273, C-index: .699) and testing cohort (n = 133, C-index: .653).The nomogram total points (NTP) had the potential to stratify patients into 3-risk groups with median OS of 11.7, 7.0 and 3.7 months (P < .001), respectively. In conclusion, the prognostic nomogram with NTP can predict OS for patients with MPC with considerable accuracy. © 2018 The Authors. Cancer Medicine published by John Wiley & Sons Ltd.
Modeling the clinical and economic implications of obesity using microsimulation.
Su, W; Huang, J; Chen, F; Iacobucci, W; Mocarski, M; Dall, T M; Perreault, L
2015-01-01
The obesity epidemic has raised considerable public health concerns, but there are few validated longitudinal simulation models examining the human and economic cost of obesity. This paper describes a microsimulation model as a comprehensive tool to understand the relationship between body weight, health, and economic outcomes. Patient health and economic outcomes were simulated annually over 10 years using a Markov-based microsimulation model. The obese population examined is nationally representative of obese adults in the US from the 2005-2012 National Health and Nutrition Examination Surveys, while a matched normal weight population was constructed to have similar demographics as the obese population during the same period. Prediction equations for onset of obesity-related comorbidities, medical expenditures, economic outcomes, mortality, and quality-of-life came from published trials and studies supplemented with original research. Model validation followed International Society for Pharmacoeconomics and Outcomes Research practice guidelines. Among surviving adults, relative to a matched normal weight population, obese adults averaged $3900 higher medical expenditures in the initial year, growing to $4600 higher expenditures in year 10. Obese adults had higher initial prevalence and higher simulated onset of comorbidities as they aged. Over 10 years, excess medical expenditures attributed to obesity averaged $4280 annually-ranging from $2820 for obese category I to $5100 for obese category II, and $8710 for obese category III. Each excess kilogram of weight contributed to $140 higher annual costs, on average, ranging from $136 (obese I) to $152 (obese III). Poor health associated with obesity increased work absenteeism and mortality, and lowered employment probability, personal income, and quality-of-life. This validated model helps illustrate why obese adults have higher medical and indirect costs relative to normal weight adults, and shows that medical costs for obese adults rise more rapidly with aging relative to normal weight adults.
Effects of stop-signal probability in the stop-signal paradigm: the N2/P3 complex further validated.
Ramautar, J R; Kok, A; Ridderinkhof, K R
2004-11-01
The aim of this study was to examine the effects of frequency of occurrence of stop signals in the stop-signal paradigm. Presenting stop signals less frequently resulted in faster reaction times to the go stimulus and a lower probability of inhibition. Also, go stimuli elicited larger and somewhat earlier P3 responses when stop signals occurred less frequently. Since the amplitude effect was more pronounced on trials when go signals were followed by fast than slow reactions, it probably reflected a stronger set to produce fast responses. N2 and P3 components to stop signals were observed to be larger and of longer latency when stop signals occurred less frequently. The amplitude enhancement of these N2 and P3 components were more pronounced for unsuccessful than for successful stop-signal trials. Moreover, the successfully inhibited stop trials elicited a frontocentral P3 whereas unsuccessfully inhibited stop trials elicited a more posterior P3 that resembled the classical P3b. P3 amplitude in the unsuccessfully inhibited condition also differed between waveforms synchronized with the stop signal and waveforms synchronized with response onset whereas N2 amplitude did not. Taken together these findings suggest that N2 reflected a greater significance of failed inhibitions after low probability stop signals while P3 reflected continued processing of the erroneous response after response execution.
Liang, Yuanyuan; Ketchum, Norma S; Louden, Christopher; Jimenez-Rios, Miguel A; Thompson, Ian M; Camarena-Reynoso, Hector R
2012-01-01
To perform the first validation study of the finasteride-adjusted Prostate Cancer Prevention Trial Prostate Cancer Risk Calculator (finPCPTRC) in a contemporary referral population in Mexico. 837 patients referred to the Instituto Nacional de Cancerología, Mexico City, Mexico, between 2005 and 2009 were used to validate the finPCPTRC by examining various measures of discrimination and calibration. Net benefit curve analysis was used to gain insight into the use of the finPCPTRC for clinical decisions. Prostate cancer (PCa) incidence (72.8%) was high in this Mexican referral cohort and 45.7% of men who were diagnosed with PCa had high-grade lesions (HGPCa, Gleason score >6). 1.3% of the patients were taking finasteride. The finPCPTRC was a superior diagnostic tool compared to prostate-specific antigen alone when discriminating patients with PCa from those without PCa (AUC = 0.784 vs. AUC = 0.687, p < 0.001) and when discriminating patients with HGPCa from those without HGPCa (AUC = 0.768 vs. AUC = 0.739, p < 0.001). The finPCPTRC underestimated the risk of PCa but overestimated the risk of HGPCa (both p < 0.001). Compared with other strategies to opt for biopsy, the net benefit would be larger with utilization of the finPCPTRC for patients accepting higher risks of HGPCa. Rates of biopsy-detectable PCa and HGPCa were high and 1.3% of this referral cohort in Mexico was taking finasteride. The risks of PCa or HGPCa calculated by the finPCPTRC were not well calibrated for this referral Mexican population and new clinical diagnostic tools are needed. Copyright © 2012 S. Karger AG, Basel.
Cano, Stefan J; Warner, Thomas T; Thompson, Alan J; Bhatia, Kailash P; Fitzpatrick, Ray; Hobart, Jeremy C
2008-01-01
Background The United States Food and Drug Administration (FDA) are currently producing guidelines for the scientific adequacy of patient reported outcome measures (PROMs) in clinical trials, which will have implications for the selection of scales used in future clinical trials. In this study, we examine how the Cervical Dystonia Impact Profile (CDIP-58), a rigorous Rasch measurement developed neurologic PROM, stands up to traditional psychometric criteria for three reasons: 1) provide traditional psychometric evidence for the CDIP-58 in line with proposed FDA guidelines; 2) enable researchers and clinicians to compare it with existing dystonia PROMs; and 3) help researchers and clinicians bridge the knowledge gap between old and new methods of reliability and validity testing. Methods We evaluated traditional psychometric properties of data quality, scaling assumptions, targeting, reliability and validity in a group of 391 people with CD. The main outcome measures used were the CDIP-58, Medical Outcome Study Short Form-36, the 28-item General Health Questionnaire, and Hospital and Anxiety and Depression Scale. Results A total of 391 people returned completed questionnaires (corrected response rate 87%). Analyses showed: 1) data quality was high (low missing data ≤ 4%, subscale scores could be computed for > 96% of the sample); 2) item groupings passed tests for scaling assumptions; 3) good targeting (except for the Sleep subscale, ceiling effect = 27%); 4) good reliability (Cronbach's alpha ≥ 0.92, test-retest intraclass correlations ≥ 0.83); and 5) validity was supported. Conclusion This study has shown that new psychometric methods can produce a PROM that stands up to traditional criteria and supports the clinical advantages of Rasch analysis. PMID:18684327
Cano, Stefan J; Warner, Thomas T; Thompson, Alan J; Bhatia, Kailash P; Fitzpatrick, Ray; Hobart, Jeremy C
2008-08-06
The United States Food and Drug Administration (FDA) are currently producing guidelines for the scientific adequacy of patient reported outcome measures (PROMs) in clinical trials, which will have implications for the selection of scales used in future clinical trials. In this study, we examine how the Cervical Dystonia Impact Profile (CDIP-58), a rigorous Rasch measurement developed neurologic PROM, stands up to traditional psychometric criteria for three reasons: 1) provide traditional psychometric evidence for the CDIP-58 in line with proposed FDA guidelines; 2) enable researchers and clinicians to compare it with existing dystonia PROMs; and 3) help researchers and clinicians bridge the knowledge gap between old and new methods of reliability and validity testing. We evaluated traditional psychometric properties of data quality, scaling assumptions, targeting, reliability and validity in a group of 391 people with CD. The main outcome measures used were the CDIP-58, Medical Outcome Study Short Form-36, the 28-item General Health Questionnaire, and Hospital and Anxiety and Depression Scale. A total of 391 people returned completed questionnaires (corrected response rate 87%). Analyses showed: 1) data quality was high (low missing data < or = 4%, subscale scores could be computed for > 96% of the sample); 2) item groupings passed tests for scaling assumptions; 3) good targeting (except for the Sleep subscale, ceiling effect = 27%); 4) good reliability (Cronbach's alpha > or = 0.92, test-retest intraclass correlations > or = 0.83); and 5) validity was supported. This study has shown that new psychometric methods can produce a PROM that stands up to traditional criteria and supports the clinical advantages of Rasch analysis.
Rutter, Carolyn M; Knudsen, Amy B; Marsh, Tracey L; Doria-Rose, V Paul; Johnson, Eric; Pabiniak, Chester; Kuntz, Karen M; van Ballegooijen, Marjolein; Zauber, Ann G; Lansdorp-Vogelaar, Iris
2016-07-01
Microsimulation models synthesize evidence about disease processes and interventions, providing a method for predicting long-term benefits and harms of prevention, screening, and treatment strategies. Because models often require assumptions about unobservable processes, assessing a model's predictive accuracy is important. We validated 3 colorectal cancer (CRC) microsimulation models against outcomes from the United Kingdom Flexible Sigmoidoscopy Screening (UKFSS) Trial, a randomized controlled trial that examined the effectiveness of one-time flexible sigmoidoscopy screening to reduce CRC mortality. The models incorporate different assumptions about the time from adenoma initiation to development of preclinical and symptomatic CRC. Analyses compare model predictions to study estimates across a range of outcomes to provide insight into the accuracy of model assumptions. All 3 models accurately predicted the relative reduction in CRC mortality 10 years after screening (predicted hazard ratios, with 95% percentile intervals: 0.56 [0.44, 0.71], 0.63 [0.51, 0.75], 0.68 [0.53, 0.83]; estimated with 95% confidence interval: 0.56 [0.45, 0.69]). Two models with longer average preclinical duration accurately predicted the relative reduction in 10-year CRC incidence. Two models with longer mean sojourn time accurately predicted the number of screen-detected cancers. All 3 models predicted too many proximal adenomas among patients referred to colonoscopy. Model accuracy can only be established through external validation. Analyses such as these are therefore essential for any decision model. Results supported the assumptions that the average time from adenoma initiation to development of preclinical cancer is long (up to 25 years), and mean sojourn time is close to 4 years, suggesting the window for early detection and intervention by screening is relatively long. Variation in dwell time remains uncertain and could have important clinical and policy implications. © The Author(s) 2016.
Pienaar, Andries W; Barnard, Justhinus G
2017-04-01
This study describes the development of a new portable muscle testing device, using air pressure as a biofeedback and strength testing tool. For this purpose, a pressure air biofeedback device (PAB ® ) was developed to measure and record the isometric extension strength of the lumbar multifidus muscle in asymptomatic and low back pain (LBP) persons. A total of 42 subjects (age 47.58 years, ±18.58) participated in this study. The validity of PAB ® was assessed by comparing a selected measure, air pressure force in millibar (mb), to a standard criterion; calibrated weights in kilograms (kg) during day-to-day tests. Furthermore, clinical trial-to-trial and day-to-day tests of maximum voluntary isometric contraction (MVIC) of L5 lumbar multifidus were done to compare air pressure force (mb) to electromyography (EMG) in microvolt (μV) and to measure the reliability of PAB ® . A highly significant relationship were found between air pressure output (mb) and calibrated weights (kg). In addition, Pearson correlation calculations showed a significant relationship between PAB ® force (mb) and EMG activity (μV) for all subjects (n = 42) examined, as well as for the asymptomatic group (n = 24). No relationship was detected for the LBP group (n = 18). In terms of lumbar extension strength, we found that asymptomatic subjects were significantly stronger than LBP subjects. The results of the PAB ® test differentiated between LBP and asymptomatic subject's lumbar isometric extension strength without any risk to the subjects and also indicate that the lumbar isometric extension test with the new PAB ® device is reliable and valid.
2012-01-01
Purpose To examine the psychometric properties of the Injection Pen Assessment Questionnaire (IPAQ) including the following: 1) item and scale characteristics (e.g., frequencies, item distributions, and factor structure), 2) reliability, and 3) validity. Methods Focus groups and one-on-one dyad interviews guided the development of the IPAQ. The IPAQ was subsequently tested in 136 parent–child dyads in a Phase 3, 2-month, open-label, multicenter trial for a new Genotropin® disposable pen. Factor analysis was performed to inform the development of a scoring algorithm, and reliability and validity of the IPAQ were evaluated using the data from this two months study. Psychometric analyses were conducted separately for each injection pen. Results Confirmatory factor analysis provides evidence supporting a second order factor solution for four subscales and a total IPAQ score. These factor analysis results support the conceptual framework developed from previous qualitative research in patient dyads using the reusable pen. However, the IPAQ subscales did not consistently meet acceptable internal consistency reliability for some group level comparisons. Cronbach’s alphas for the total IPAQ score for both pens were 0.85, exceeding acceptable levels of reliability for group comparisons. Conclusions The total IPAQ score is a useful measure for evaluating ease of use and preference for injection pens in clinical trials among patient dyads receiving hGH. The psychometric properties of the individual subscales, mainly the lower internal consistency reliability of some of the subscales and the predictive validity findings, do not support the use of subscale scores alone as a primary endpoint. PMID:23046797
Ortelli, Paola; Maestri, Roberto; Zarucchi, Marianna; Cian, Veronica; Urso, Elisa; Giacomello, Francesca; Ferrazzoli, Davide; Frazzitta, Giuseppe
2017-01-01
Quality of life (QoL) is the sense of well-being perceived by people. The improvement of parkinsonian patient's QoL is a crucial goal for clinicians involved in rehabilitative care. In order to provide an appropriate endpoint for the assessment of the effectiveness of rehabilitation treatments on QoL of patients with Parkinson's Disease (PD), in this study we have first translated and then validated the Belastungsfragebogen Parkinson kurzversion (BELA-P-k). This tool allows evaluating separately two crucial aspects: i) the loss of personal autonomy in activities of daily life and ii) the psychological and psychosocial impact of the disease. The BELA-P-k was translated from Dutch into Italian. Subsequently 202 PD patients filled out the questionnaire. Patients were also evaluated by using the Parkinson Disease Questionnaire -39 (PDQ39), the Unified Parkinson's Disease Rating Scale (UPDRS), the Mini Mental State Examination (MMSE) and the Frontal Assessment Battery (FAB). The internal consistency for total of two different scores Bothered by (Bb) and Need for Help (NfH) was excellent ( p = 0.91) for both categories. The correlation between Bb and NfH categories was significant and strong, very-strong, ranging from 0.78 to 0.88 (all p < 0.0001). Finally, the value of Spearman r for the relationship between Bb and NfH items and PDQ 39 values were significant ( p ≤ 0.003). In conclusion, we validated the BELA-P-k and demonstrated that it is an appropriate and potentially useful tool for assessing QoL in the management of PD. This trial was retrospectively registered with ClinicalTrials.gov, NCT03073044.
Kleinman, L; Leidy, N K; Crawley, J; Bonomi, A; Schoenfeld, P
2001-02-01
Although most health-related quality of life questionnaires are self-administered by means of paper and pencil, new technologies for automated computer administration are becoming more readily available. Novel methods of instrument administration must be assessed for score equivalence in addition to consistency in reliability and validity. The present study compared the psychometric characteristics (score equivalence and structure, internal consistency, and reproducibility reliability and construct validity) of the Quality of Life in Reflux And Dyspepsia (QOLRAD) questionnaire when self-administered by means of paper and pencil versus touch-screen computer. The influence of age, education, and prior experience with computers on score equivalence was also examined. This crossover trial randomized 134 patients with gastroesophageal reflux disease to 1 of 2 groups: paper-and-pencil questionnaire administration followed by computer administration or computer administration followed by use of paper and pencil. To minimize learning effects and respondent fatigue, administrations were scheduled 3 days apart. A random sample of 32 patients participated in a 1-week reproducibility evaluation of the computer-administered QOLRAD. QOLRAD scores were equivalent across the 2 methods of administration regardless of subject age, education, and prior computer use. Internal consistency levels were very high (alpha = 0.93-0.99). Interscale correlations were strong and generally consistent across methods (r = 0.7-0.87). Correlations between the QOLRAD and Short Form 36 (SF-36) were high, with no significant differences by method. Test-retest reliability of the computer-administered QOLRAD was also very high (ICC = 0.93-0.96). Results of the present study suggest that the QOLRAD is reliable and valid when self-administered by means of computer touch-screen or paper and pencil.
Lynn, Scott K.; Watkins, Casey M.; Wong, Megan A.; Balfany, Katherine; Feeney, Daniel F.
2018-01-01
The Athos ® wearable system integrates surface electromyography (sEMG ) electrodes into the construction of compression athletic apparel. The Athos system reduces the complexity and increases the portability of collecting EMG data and provides processed data to the end user. The objective of the study was to determine the reliability and validity of Athos as compared with a research grade sEMG system. Twelve healthy subjects performed 7 trials on separate days (1 baseline trial and 6 repeated trials). In each trial subjects wore the wearable sEMG system and had a research grade sEMG system’s electrodes placed just distal on the same muscle, as close as possible to the wearable system’s electrodes. The muscles tested were the vastus lateralis (VL), vastus medialis (VM), and biceps femoris (BF). All testing was done on an isokinetic dynamometer. Baseline testing involved performing isometric 1 repetition maximum tests for the knee extensors and flexors and three repetitions of concentric-concentric knee flexion and extension at MVC for each testing speed: 60, 180, and 300 deg/sec. Repeated trials 2-7 each comprised 9 sets where each set included three repetitions of concentric-concentric knee flexion-extension. Each repeated trial (2-7) comprised one set at each speed and percent MVC (50%, 75%, 100%) combination. The wearable system and research grade sEMG data were processed using the same methods and aligned in time. The amplitude metrics calculated from the sEMG for each repetition were the peak amplitude, sum of the linear envelope, and 95th percentile. Validity results comprise two main findings. First, there is not a significant effect of system (Athos or research grade system) on the repetition amplitude metrics (95%, peak, or sum). Second, the relationship between torque and sEMG is not significantly different between Athos and the research grade system. For reliability testing, the variation across trials and averaged across speeds was 0.8%, 7.3%, and 0.2% higher for Athos from BF, VL and VM, respectively. Also, using the standard deviation of the MVC normalized repetition amplitude, the research grade system showed 10.7% variability while Athos showed 12%. The wearable technology (Athos) provides sEMG measures that are consistent with controlled, research grade technologies and data collection procedures. Key points Surface EMG embedded into athletic garments (Athos) had similar validity and reliability when compared with a research grade system There was no difference in the torque-EMG relationship between the two systems No statistically significant difference in reliability across 6 trials between the two systems The validity and reliability of Athos demonstrates the potential for sEMG to be applied in dynamic rehabilitation and sports settings PMID:29769821
We propose a Phase 2 (large cross-sectional) PRoBE-compliant validation trial of stool-based and serum-based tests for the detection of colorectal neoplasia (1). The trial is powered to detect early stage colorectal adenocarcinoma or high grade dysplasia. This is the most stringent, conservative approach to the early diagnosis of colonic neoplasia and addresses the most important endpoint of identifying individuals with curable, early stage cancer and those with very high risk non-invasive neoplasia (high grade dysplasia).
Lustgarten, M.S.; Fielding, R.A.
2012-01-01
It is estimated that in the next 20 years, the amount of people greater than 65 years of age will rise from 40 to 70 million, and will account for 19% of the total population. Age-related decreases in muscle mass and function, known as sarcopenia, have been shown to be related to functional limitation, frailty and an increased risk of morbidity and mortality. Therefore, with an increasing elderly population, interventions that can improve muscle mass content and/or function are essential. However, analytical techniques used for measurement of muscle mass in young subjects may not be valid for use in the elderly. Therefore, the purpose of this review is to examine the applied specificity and accuracy of methods that are commonly used for measurement of muscle mass in aged subjects, and, to propose specific recommendations for the use of body composition measures in phase II clinical trials of function-promoting anabolic therapies. PMID:21528163
Morgan-Lopez, Antonio A.; Fals-Stewart, William
2015-01-01
Historically, difficulties in analyzing treatment outcome data from open enrollment groups have led to their avoidance in use in federally-funded treatment trials, despite the fact that 79% of treatment programs use open enrollment groups. Recently, latent class pattern mixture models (LCPMM) have shown promise as a defensible approach for making overall (and attendance class-specific) inferences from open enrollment groups with membership turnover. We present a statistical simulation study comparing LCPMMs to longitudinal growth models (LGM) to understand when both frameworks are likely to produce conflicting inferences concerning overall treatment efficacy. LCPMMs performed well under all conditions examined; meanwhile LGMs produced problematic levels of bias and Type I errors under two joint conditions: moderate-to-high dropout (30–50%) and treatment by attendance class interactions exceeding Cohen's d ≈.2. This study highlights key concerns about using LGM for open enrollment data: treatment effect overestimation and advocacy for treatments that may be ineffective in reality. PMID:18513917
Treatment manuals: use in the treatment of bulimia nervosa.
Wallace, Laurel M; von Ranson, Kristin M
2011-11-01
As psychology has moved toward emphasizing evidence-based practice, use of treatment manuals has extended from research trials into clinical practice. Minimal research has directly evaluated use of manuals in clinical practice. This survey of international eating disorder professionals examined use of manuals with 259 clinicians' most recent client with bulimia nervosa. Although evidence-based manuals for bulimia nervosa exist, only 35.9% of clinicians reported using a manual. Clinicians were more likely to use a manual if they were younger; were treating an adult client; were clinical psychologists; were involved in research related to eating disorders; and endorsed a cognitive-behavioral orientation. Clinicians were less likely to use a manual if they provided eclectic psychotherapy that incorporated multiple psychotherapeutic approaches. We conclude that psychotherapy provided in clinical practice often does not align with the specific form validated in research trials, and "eclecticism" is at odds with efforts to disseminate manuals into clinical practice. Copyright © 2011 Elsevier Ltd. All rights reserved.
Chen, Rui; Xie, Liping; Xue, Wei; Ye, Zhangqun; Ma, Lulin; Gao, Xu; Ren, Shancheng; Wang, Fubo; Zhao, Lin; Xu, Chuanliang; Sun, Yinghao
2016-09-01
Substantial differences exist in the relationship of prostate cancer (PCa) detection rate and prostate-specific antigen (PSA) level between Western and Asian populations. Classic Western risk calculators, European Randomized Study for Screening of Prostate Cancer Risk Calculator, and Prostate Cancer Prevention Trial Risk Calculator, were shown to be not applicable in Asian populations. We aimed to develop and validate a risk calculator for predicting the probability of PCa and high-grade PCa (defined as Gleason Score sum 7 or higher) at initial prostate biopsy in Chinese men. Urology outpatients who underwent initial prostate biopsy according to the inclusion criteria were included. The multivariate logistic regression-based Chinese Prostate Cancer Consortium Risk Calculator (CPCC-RC) was constructed with cases from 2 hospitals in Shanghai. Discriminative ability, calibration and decision curve analysis were externally validated in 3 CPCC member hospitals. Of the 1,835 patients involved, PCa was identified in 338/924 (36.6%) and 294/911 (32.3%) men in the development and validation cohort, respectively. Multivariate logistic regression analyses showed that 5 predictors (age, logPSA, logPV, free PSA ratio, and digital rectal examination) were associated with PCa (Model 1) or high-grade PCa (Model 2), respectively. The area under the curve of Model 1 and Model 2 was 0.801 (95% CI: 0.771-0.831) and 0.826 (95% CI: 0.796-0.857), respectively. Both models illustrated good calibration and substantial improvement in decision curve analyses than any single predictors at all threshold probabilities. Higher predicting accuracy, better calibration, and greater clinical benefit were achieved by CPCC-RC, compared with European Randomized Study for Screening of Prostate Cancer Risk Calculator and Prostate Cancer Prevention Trial Risk Calculator in predicting PCa. CPCC-RC performed well in discrimination and calibration and decision curve analysis in external validation compared with Western risk calculators. CPCC-RC may aid in decision-making of prostate biopsy in Chinese or in other Asian populations with similar genetic and environmental backgrounds. Copyright © 2016 Elsevier Inc. All rights reserved.
Ebbeling, Cara B; Wadden, Thomas A; Ludwig, David S
2011-01-01
Background: The circumstances under which the glycemic index (GI) and glycemic load (GL) are derived do not reflect real-world eating behavior. Thus, the ecologic validity of these constructs is incompletely known. Objective: This study examined the relation of dietary intake to glycemic response when foods are consumed under free-living conditions. Design: Participants were 26 overweight or obese adults with type 2 diabetes who participated in a randomized trial of lifestyle modification. The current study includes baseline data, before initiation of the intervention. Participants wore a continuous glucose monitor and simultaneously kept a food diary for 3 d. The dietary variables included GI, GL, and intakes of energy, fat, protein, carbohydrate, sugars, and fiber. The glycemic response variables included AUC, mean and SD of continuous glucose monitoring (CGM) values, percentage of CGM values in euglycemic and hyperglycemic ranges, and mean amplitude of glycemic excursions. Relations between daily dietary intake and glycemic outcomes were examined. Results: Data were available from 41 d of monitoring. Partial correlations, controlled for energy intake, indicated that GI or GL was significantly associated with each glycemic response outcome. In multivariate analyses, dietary GI accounted for 10% to 18% of the variance in each glycemic variable, independent of energy and carbohydrate intakes (P < 0.01). Conclusions: The data support the ecologic validity of the GI and GL constructs in free-living obese adults with type 2 diabetes. GI was the strongest and most consistent independent predictor of glycemic stability and variability. PMID:22071699
Change in knee contact force with simulated change in body weight.
Knarr, Brian A; Higginson, Jill S; Zeni, Joseph A
2016-02-01
The relationship between obesity, weight gain and progression of knee osteoarthritis is well supported, suggesting that excessive joint loading may be a mechanism responsible for cartilage deterioration. Examining the influence of weight gain on joint compressive forces is difficult, as both muscles and ground reaction forces can have a significant impact on the forces experienced during gait. While previous studies have examined the relationship between body weight and knee forces, these studies have used models that were not validated using experimental data. Therefore, the objective of this study was to evaluate the relationship between changes in body weight and changes in knee joint contact forces for an individual's gait pattern using musculoskeletal modeling that is validated against known internal compressive forces. Optimal weighting constants were determined for three subjects to generate valid predictions of knee contact forces (KCFs) using in vivo data collection with instrumented total knee arthroplasty. A total of five simulations per walking trial were generated for each subject, from 80% to 120% body weight in 10% increments, resulting in 50 total simulations. The change in peak KCF with respect to body weight was found to be constant and subject-specific, predominantly determined by the peak force during the baseline condition at 100% body weight. This relationship may be further altered by any change in kinematics or body mass distribution that may occur as a result of a change in body weight or exercise program.
Developing a Cognition Endpoint for Traumatic Brain Injury Clinical Trials
Crane, Paul K.; Dams-O'Connor, Kristen; Holdnack, James; Ivins, Brian J.; Lange, Rael T.; Manley, Geoffrey T.; McCrea, Michael; Iverson, Grant L.
2017-01-01
Abstract Cognitive impairment is a core clinical feature of traumatic brain injury (TBI). After TBI, cognition is a key determinant of post-injury productivity, outcome, and quality of life. As a final common pathway of diverse molecular and microstructural TBI mechanisms, cognition is an ideal endpoint in clinical trials involving many candidate drugs and nonpharmacological interventions. Cognition can be reliably measured with performance-based neuropsychological tests that have greater granularity than crude rating scales, such as the Glasgow Outcome Scale-Extended, which remain the standard for clinical trials. Remarkably, however, there is no well-defined, widely accepted, and validated cognition endpoint for TBI clinical trials. A single cognition endpoint that has excellent measurement precision across a wide functional range and is sensitive to the detection of small improvements (and declines) in cognitive functioning would enhance the power and precision of TBI clinical trials and accelerate drug development research. We outline methodologies for deriving a cognition composite score and a research program for validation. Finally, we discuss regulatory issues and the limitations of a cognition endpoint. PMID:27188248
Dellson, P; Nilbert, M; Bendahl, P-O; Malmström, P; Carlsson, C
2011-07-01
Clinical trials are crucial to improve cancer treatment but recruitment is difficult. Optimised patient information has been recognised as a key issue. In line with the increasing focus on patients' perspectives in health care, we aimed to study patients' opinions about the written information used in three clinical trials for breast cancer. Primary data collection was done in focus group interviews with breast cancer patient advocates. Content analysis identified three major themes: comprehensibility, emotions and associations, and decision making. Based on the advocates' suggestions for improvements, 21 key issues were defined and validated through a questionnaire in an independent group of breast cancer patient advocates. Clear messages, emotionally neutral expressions, careful descriptions of side effects, clear comparisons between different treatment alternatives and information about the possibility to discontinue treatment were perceived as the most important issues. Patients' views of the information in clinical trials provide new insights and identify key issues to consider in optimising future written information and may improve recruitment to clinical cancer trials. © 2010 Blackwell Publishing Ltd.
Berk, Michael; Dodd, Seetal; Dean, Olivia M; Kohlmann, Kristy; Berk, Lesley; Malhi, Gin S
2010-10-01
Berk M, Dodd S, Dean OM, Kohlmann K, Berk L, Malhi GS. The validity and internal structure of the Bipolar Depression Rating Scale: data from a clinical trial of N-acetylcysteine as adjunctive therapy in bipolar disorder. The phenomenology of unipolar and bipolar disorders differ in a number of ways, such as the presence of mixed states and atypical features. Conventional depression rating instruments are designed to capture the characteristics of unipolar depression and have limitations in capturing the breadth of bipolar disorder. The Bipolar Depression Rating Scale (BDRS) was administered together with the Montgomery Asberg Rating Scale (MADRS) and Young Mania Rating Scale (YMRS) in a double-blind randomised placebo-controlled clinical trial of N-acetyl cysteine for bipolar disorder (N = 75). A factor analysis showed a two-factor solution: depression and mixed symptom clusters. The BDRS has strong internal consistency (Cronbach's alpha = 0.917), the depression cluster showed robust correlation with the MADRS (r = 0.865) and the mixed subscale correlated with the YMRS (r = 0.750). The BDRS has good internal validity and inter-rater reliability and is sensitive to change in the context of a clinical trial.
Zhang, Chuanwu; Garrard, Lili; Keighley, John; Carlson, Susan; Gajewski, Byron
2017-01-10
Despite the widely recognized association between the severity of early preterm birth (ePTB) and its related severe diseases, little is known about the potential risk factors of ePTB and the sub-population with high risk of ePTB. Moreover, motivated by a future confirmatory clinical trial to identify whether supplementing pregnant women with docosahexaenoic acid (DHA) has a different effect on the risk subgroup population or not in terms of ePTB prevalence, this study aims to identify potential risk subgroups and risk factors for ePTB, defined as babies born less than 34 weeks of gestation. The analysis data (N = 3,994,872) were obtained from CDC and NCHS' 2014 Natality public data file. The sample was split into independent training and validation cohorts for model generation and model assessment, respectively. Logistic regression and CART models were used to examine potential ePTB risk predictors and their interactions, including mothers' age, nativity, race, Hispanic origin, marital status, education, pre-pregnancy smoking status, pre-pregnancy BMI, pre-pregnancy diabetes status, pre-pregnancy hypertension status, previous preterm birth status, infertility treatment usage status, fertility enhancing drug usage status, and delivery payment source. Both logistic regression models with either 14 or 10 ePTB risk factors produced the same C-index (0.646) based on the training cohort. The C-index of the logistic regression model based on 10 predictors was 0.645 for the validation cohort. Both C-indexes indicated a good discrimination and acceptable model fit. The CART model identified preterm birth history and race as the most important risk factors, and revealed that the subgroup with a preterm birth history and a race designation as Black had the highest risk for ePTB. The c-index and misclassification rate were 0.579 and 0.034 for the training cohort, and 0.578 and 0.034 for the validation cohort, respectively. This study revealed 14 maternal characteristic variables that reliably identified risk for ePTB through either logistic regression model and/or a CART model. Moreover, both models efficiently identify risk subgroups for further enrichment clinical trial design.
Kong, C L; Lee, C C; Ip, Y C; Chow, L P; Leung, C H; Lam, Y C
2016-03-01
The World Health Organization Five Well-Being Index (WHO-5) has been developed to measure psychological wellbeing. Translation and linguistic validation of the WHO-5 into a Cantonese version has been accomplished for local use but it is not yet validated in people with severe mental illness in Hong Kong. This study aimed to examine the applicability of WHO-5 in measuring the psychological wellbeing dimension of people with severe mental illness. A brief and easily administrated tool to measure psychological wellbeing of people with severe mental illness can be used to provide an outcome measure in research studies and clinical trials. Subjects were randomly recruited from the Extended-Care Patient Intensive Treatment, Early Diversion and Rehabilitation Stepping-Stone Project (EXITERS) and the Rehabilitation Activity Centre (RAC) of Kwai Chung Hospital in Hong Kong. They were invited to complete the abbreviated version of Hong Kong Chinese World Health Organization Quality of Life (WHOQOL-BREF [HK]) and WHO-5 (Cantonese version) separately and concurrent validity was examined. A total of 84 subjects were recruited, 42 each from EXITERS and RAC. In all, 49 (58%) were male and 35 (42%) were female. The mean ± standard deviation age was 43.2 ± 9.7 years. Their mean duration of mental illness was 16.4 ± 10.5 years and the mean years of education was 10.17 ± 2.5 years, i.e. about junior secondary school level in Hong Kong. The internal consistency of the WHO-5 was satisfactory (0.86) and was comparable with previous reports. Regarding validity, 1-factor structure with an eigenvalue of 3.24 explained 64.8% of total variance of WHO-5 for people with severe mental illness. Concurrent validity was established with moderate correlation (0.41-0.51) between WHO-5 and 4 domains of the WHOQOL-BREF (HK). The WHO-5 (Cantonese version) is a reliable and valid tool to assess the psychological wellbeing of people with severe mental illness in Hong Kong. It can be used to monitor the effectiveness of psychological intervention aimed at improving the wellbeing of such patients.
Alternative methods to evaluate trial level surrogacy.
Abrahantes, Josè Cortiñas; Shkedy, Ziv; Molenberghs, Geert
2008-01-01
The evaluation and validation of surrogate endpoints have been extensively studied in the last decade. Prentice [1] and Freedman, Graubard and Schatzkin [2] laid the foundations for the evaluation of surrogate endpoints in randomized clinical trials. Later, Buyse et al. [5] proposed a meta-analytic methodology, producing different methods for different settings, which was further studied by Alonso and Molenberghs [9], in their unifying approach based on information theory. In this article, we focus our attention on the trial-level surrogacy and propose alternative procedures to evaluate such surrogacy measure, which do not pre-specify the type of association. A promising correction based on cross-validation is investigated. As well as the construction of confidence intervals for this measure. In order to avoid making assumption about the type of relationship between the treatment effects and its distribution, a collection of alternative methods, based on regression trees, bagging, random forests, and support vector machines, combined with bootstrap-based confidence interval and, should one wish, in conjunction with a cross-validation based correction, will be proposed and applied. We apply the various strategies to data from three clinical studies: in opthalmology, in advanced colorectal cancer, and in schizophrenia. The results obtained for the three case studies are compared; they indicate that using random forest or bagging models produces larger estimated values for the surrogacy measure, which are in general stabler and the confidence interval narrower than linear regression and support vector regression. For the advanced colorectal cancer studies, we even found the trial-level surrogacy is considerably different from what has been reported. In general the alternative methods are more computationally demanding, and specially the calculation of the confidence intervals, require more computational time that the delta-method counterpart. First, more flexible modeling techniques can be used, allowing for other type of association. Second, when no cross-validation-based correction is applied, overly optimistic trial-level surrogacy estimates will be found, thus cross-validation is highly recommendable. Third, the use of the delta method to calculate confidence intervals is not recommendable since it makes assumptions valid only in very large samples. It may also produce range-violating limits. We therefore recommend alternatives: bootstrap methods in general. Also, the information-theoretic approach produces comparable results with the bagging and random forest approaches, when cross-validation correction is applied. It is also important to observe that, even for the case in which the linear model might be a good option too, bagging methods perform well too, and their confidence intervals were more narrow.
2009-01-01
Background Decisions about interim analysis and early stopping of clinical trials, as based on recommendations of Data Monitoring Committees (DMCs), have far reaching consequences for the scientific validity and clinical impact of a trial. Our aim was to evaluate the frequency and quality of the reporting on DMC composition and roles, interim analysis and early termination in pediatric trials. Methods We conducted a systematic review of randomized controlled clinical trials published from 2005 to 2007 in a sample of four general and four pediatric journals. We used full-text databases to identify trials which reported on DMCs, interim analysis or early termination, and included children or adolescents. Information was extracted on general trial characteristics, risk of bias, and a set of parameters regarding DMC composition and roles, interim analysis and early termination. Results 110 of the 648 pediatric trials in this sample (17%) reported on DMC or interim analysis or early stopping, and were included; 68 from general and 42 from pediatric journals. The presence of DMCs was reported in 89 of the 110 included trials (81%); 62 papers, including 46 of the 89 that reported on DMCs (52%), also presented information about interim analysis. No paper adequately reported all DMC parameters, and nine (15%) reported all interim analysis details. Of 32 trials which terminated early, 22 (69%) did not report predefined stopping guidelines and 15 (47%) did not provide information on statistical monitoring methods. Conclusions Reporting on DMC composition and roles, on interim analysis results and on early termination of pediatric trials is incomplete and heterogeneous. We propose a minimal set of reporting parameters that will allow the reader to assess the validity of trial results. PMID:20003383
ERIC Educational Resources Information Center
Maerten-Rivera, Jaime Lynn; Huggins-Manley, Anne Corinne; Adamson, Karen; Lee, Okhee; Llosa, Lorena
2015-01-01
Using data collected from two multiyear teacher professional development projects employing randomized control trials, this study describes the development and validation of a paper-based test of elementary teachers' science content knowledge (SCK). Evidence of construct validity is presented, including evidence on internal structural…
Markers of systemic inflammation predict survival in patients with advanced renal cell cancer.
Fox, P; Hudson, M; Brown, C; Lord, S; Gebski, V; De Souza, P; Lee, C K
2013-07-09
The host inflammatory response has a vital role in carcinogenesis and tumour progression. We examined the prognostic value of inflammatory markers (albumin, white-cell count and its components, and platelets) in pre-treated patients with advanced renal cell carcinoma (RCC). Using data from a randomised trial, multivariable proportional hazards models were generated to examine the impact of inflammatory markers and established prognostic factors (performance status, calcium, and haemoglobin) on overall survival (OS). We evaluated a new prognostic classification incorporating additional information from inflammatory markers. Of the 416 patients, 362 were included in the analysis. Elevated neutrophil counts, elevated platelet counts, and a high neutrophil-lymphocyte ratio were significant independent predictors for shorter OS in a model with established prognostic factors. The addition of inflammatory markers improves the discriminatory value of the prognostic classification as compared with established factors alone (C-statistic 0.673 vs 0.654, P=0.002 for the difference), with 25.8% (P=0.004) of patients more appropriately classified using the new classification. Markers of systemic inflammation contribute significantly to prognostic classification in addition to established factors for pre-treated patients with advanced RCC. Upon validation of these data in independent studies, stratification of patients using these markers in future clinical trials is recommended.
Evaluation Using Sequential Trials Methods.
ERIC Educational Resources Information Center
Cohen, Mark E.; Ralls, Stephen A.
1986-01-01
Although dental school faculty as well as practitioners are interested in evaluating products and procedures used in clinical practice, research design and statistical analysis can sometimes pose problems. Sequential trials methods provide an analytical structure that is both easy to use and statistically valid. (Author/MLW)
Surrogate endpoints in randomized cardiovascular clinical trials.
Domanski, Michael; Pocock, Stuart; Bernaud, Corine; Borer, Jeffrey; Geller, Nancy; Revkin, James; Zannad, Faiez
2011-08-01
Surrogate endpoints predict the occurrence and timing of a clinical endpoint of interest (CEI). Substitution of a surrogate endpoint for a CEI can dramatically reduce the time and cost necessary to complete a Phase III clinical trial. However, assurance that use of a surrogate endpoint will result in a correct conclusion regarding treatment effect on a CEI requires prior rigorous validation of the surrogate. Surrogate endpoints can also be of substantial use in Phase I and II studies to assess whether the intended therapeutic pathway is operative, thus providing assurance regarding the reasonableness of proceeding to a Phase III trial. This paper discusses the uses and validation of surrogate endpoints. © 2010 The Authors Fundamental and Clinical Pharmacology © 2010 Société Française de Pharmacologie et de Thérapeutique.
Lee, Cindy; Vather, Ryash; O'Callaghan, Anne; Robinson, Jackie; McLeod, Briar; Findlay, Michael; Bissett, Ian
2013-12-01
Malignant bowel obstruction (MBO) is common in patients with advanced cancer. To perform a phase II study to assess the feasibility of conducting a phase III trial investigating the therapeutic value of gastrografin in MBO. Randomized double-blinded placebo-controlled feasibility study. Participants received 100 mL of either gastrografin or placebo. Over 8 months, 57 patients were screened and 9 enrolled (15.8% recruitment rate). Of the 9 enrolled, 4 received gastrografin (with 2 completing assessment) and 5 received placebo (with 4 completing assessment). It is not feasible to conduct a phase III trial using the same study protocol. This study validates the use of the phase II feasibility study to assess protocol viability in a palliative population prior to embarking on a larger trial.
Test of memory malingering (TOMM) trial 1 as a screening measure for insufficient effort.
O'Bryant, Sid E; Engel, Lisa R; Kleiner, Jennifer S; Vasterling, Jennifer J; Black, F William
2007-05-01
The identification of insufficient effort is critical to neuropsychological evaluation, and several existing instruments assess effort on neuropsychological tasks. Yet instruments designed to detect insufficient effort are underutilized in standard neuropsychological assessments, perhaps in part because they typically require significant administration time and are, therefore, not ideally suited to screening contexts. The Test of Memory Malingering (TOMM) is a commonly administered, well-validated symptom validity test. This study evaluates the utility of TOMM Trial 1 as a relatively brief screening measure of insufficient effort. Results suggest that TOMM Trial 1 demonstrates high diagnostic accuracy and is a viable option for screening insufficient effort. Diagnostic accuracy estimates are presented for a range of base rates. The need for more comprehensive SVT assessment in most clinical and forensic situation is discussed.
Large Cancer Drug Trial Helps Move Precision Medicine Toward the Mainstream | Poster
A landmark cancer drug trial is helping set the stage for moving precision medicine into the mainstream of clinical practice, according to a new study. The study, reported in the Journal of Molecular Diagnostics, validates a procedure used in the drug trial that identifies the unique genetic mutations in a patient’s tumor, which is then used as the basis for selecting targeted
Timmermans, Ivy; Meine, Matias; Zitron, Edgar; Widdershoven, Jos; Kimman, Geert; Prevot, Sébastien; Rauwolf, Thomas; Anselme, Frédéric; Szendey, Istvan; Romero Roldán, Javier; Mabo, Philippe; Schaer, Beat; Denollet, Johan; Versteeg, Henneke
2017-07-01
Studies have shown that remote patient monitoring (RPM) of implantable cardioverter defibrillators (ICDs) is at least comparable to in-clinic follow-up with regard to clinical outcomes and might be cost-effective, yet RPM is not standard clinical practice within Europe. Better insight into the patient perspective on RPM may aid in its acceptance, implementation, and reimbursement. This narrative review (1) summarizes existing evidence on the impact of RPM on patient-reported outcomes and (2) discusses future directions in examining the patient perspective. Literature review indicated that only five randomized trials on RPM in ICD patients included patient-reported outcomes, with inconclusive results. Observational studies show a trend toward good patient satisfaction and acceptation of RPM. Yet, results should be interpreted with caution due to a number of limitations including a potential selection bias, use of generic/nonvalidated questionnaires, relatively short follow-up durations, and a lack of subgroup identification. Although RPM seems to be safe, effective, timely, and efficient, the patient perspective has received little attention so far. The scarce evidence on patient-reported outcomes in RPM studies seems to be positive, but future trials with a follow-up of ≥12 months and validated patient-reported outcome measures are needed. The REMOTE-CIED study from our group is the first prospective randomized controlled trial primarily designed to examine the patient perspective on RPM, and is powered to identify characteristics associated with RPM satisfaction and benefit. Results are expected in 2018 and will add valuable information to the current evidence. © 2017 Wiley Periodicals, Inc.
Evans, M Blair; McGuckin, Matthew; Gainforth, Heather L; Bruner, Mark W; Côté, Jean
2015-07-01
Although evidence supports the effectiveness of interpersonal Coach Development Programmes (CDPs), which are designed to foster coach-athlete relationships, an intervention's impact is shaped by numerous factors over and above effectiveness. The purpose of this systematic review was to examine the extent that published articles describing interpersonal CDP trials reported on indicators of internal and external validity, as conceptualised in the RE-AIM framework (ie, Reach, Effectiveness, Adoption, Implementation and Maintenance). The search strategy was conducted according to the Preferred Reporting Items for Systematic Reviews and Meta-analyses guidelines, involving a database search and supplemental manual search of key articles and journals. After initial screening, the full-text search strategy involved identifying articles describing CDP trials and then selecting a specific subgroup of articles involving interpersonal CDP trials and excluding ineligible articles. Resulting trials were coded using a 47-item sport coaching adaptation of the RE-AIM coding sheet. 17 published articles met eligibility criteria, representing 10 distinct CDP trials. After attaining coder agreement, global ratings of RE-AIM indicators within interpersonal CDP trials ranged from the low to moderate quality. Whereas indicators of effectiveness and implementation were reported to some extent across all studies, maintenance within sport organisations and a number of specific indicators from across dimensions were rarely reported. These findings inform the future design and evaluation of CDPs that have the potential to be adopted in numerous settings and reach athletes and coaches who can most benefit. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions.
ERIC Educational Resources Information Center
National Academy of Education, Stanford, CA.
This report evaluates the conduct, validity, and uses of the National Assessment of Educational Progress (NAEP) Trial State Assessment (TSA). The report addresses such pressing problems as how participation in NAEP can be maintained and appropriate samples can be achieved; how errors can be minimized in the complex process of scaling and analyzing…
Macklin, Eric A; Blacker, Deborah; Hyman, Bradley T; Betensky, Rebecca A
2013-01-01
Alzheimer's disease (AD) trials initiated during or before the prodrome are costly and lengthy because patients are enrolled long before clinical symptoms are apparent, when disease progression is slow. We hypothesized that design of such trials could be improved by: 1) selecting individuals at moderate near-term risk of progression to AD dementia (the current clinical standard) and 2) by using short-term surrogate endpoints that predict progression to AD dementia. We used a longitudinal cohort of older, initially non-demented, community-dwelling participants (n = 358) to derive selection criteria and surrogate endpoints and tested them in an independent national data set (n = 6,243). To identify a "mid-risk" subgroup, we applied conditional tree-based survival models to Clinical Dementia Rating (CDR) scale scores and common neuropsychological tests. In the validation cohort, a time-to-AD dementia trial applying these mid-risk selection criteria to a pool of all non-demented individuals could achieve equivalent power with 47% fewer participants than enrolling at random from that pool. We evaluated surrogate endpoints measureable over two years of follow-up based on cross-validated concordance between predictions from Cox models and observed time to AD dementia. The best performing surrogate, rate of change in CDR sum-of-boxes, did not reduce the trial duration required for equivalent power using estimates from the validation cohort, but alternative surrogates with better ability to predict time to AD dementia should be able to do so. The approach tested here might improve efficiency of prodromal AD trials using other potential measures and could be generalized to other diseases with long prodromal phases.
Macklin, Eric A.; Blacker, Deborah; Hyman, Bradley T.; Betensky, Rebecca A.
2013-01-01
Summary Alzheimer’s disease (AD) trials initiated during or before the prodrome are costly and lengthy because patients are enrolled long before clinical symptoms are apparent, when disease progression is slow. We hypothesized that design of such trials could be improved by: (1) selecting individuals at moderate near-term risk of progression to AD dementia (the current clinical standard) and (2) by using short-term surrogate endpoints that predict progression to AD dementia. We used a longitudinal cohort of older, initially non-demented, community-dwelling participants (n=358) to derive selection criteria and surrogate endpoints and tested them in an independent national data set (n=6,243). To identify a “mid-risk” subgroup, we applied conditional tree-based survival models to Clinical Dementia Rating (CDR) scale scores and common neuropsychological tests. In the validation cohort, a time-to-AD dementia trial applying these mid-risk selection criteria to a pool of all non-demented individuals could achieve equivalent power with 47% fewer participants than enrolling at random from that pool. We evaluated surrogate endpoints measureable over two years of follow-up based on cross-validated concordance between predictions from Cox models and observed time to AD dementia. The best performing surrogate, rate of change in CDR sum-of-boxes, did not reduce the trial duration required for equivalent power using estimates from the validation cohort, but alternative surrogates with better ability to predict time to AD dementia should be able to do so. The approach tested here might improve efficiency of prodromal AD trials using other potential measures and could be generalized to other diseases with long prodromal phases. PMID:23629586
In Silico Evaluation of Pharmacokinetic Optimization for Antimitogram-Based Clinical Trials.
Haviari, Skerdi; You, Benoît; Tod, Michel
2018-04-01
Antimitograms are prototype in vitro tests for evaluating chemotherapeutic efficacy using patient-derived primary cancer cells. These tests might help optimize treatment from a pharmacodynamic standpoint by guiding treatment selection. However, they are technically challenging and require refinements and trials to demonstrate benefit to be widely used. In this study, we performed simulations aimed at exploring how to validate antimitograms and how to complement them by pharmacokinetic optimization. A generic model of advanced cancer, including pharmacokinetic-pharmacodynamic monitoring, was used to link dosing schedules with progression-free survival (PFS), as built from previously validated modules. This model was used to explore different possible situations in terms of pharmacokinetic variability, pharmacodynamic variability, and antimitogram performance. The model recapitulated tumor dynamics and standalone therapeutic drug monitoring efficacy consistent with published clinical results. Simulations showed that combining pharmacokinetic and pharmacodynamic optimization should increase PFS in a synergistic fashion. Simulated data were then used to compute required clinical trial sizes, which were 30% to 90% smaller when pharmacokinetic optimization was added to pharmacodynamic optimization. This improvement was observed even when pharmacokinetic optimization alone exhibited only modest benefit. Overall, our work illustrates the synergy derived from combining antimitograms with therapeutic drug monitoring, permitting a disproportionate reduction of the trial size required to prove a benefit on PFS. Accordingly, we suggest that strategies with benefits too small for standalone clinical trials could be validated in combination in a similar manner. Significance: This work offers a method to reduce the number of patients needed for a clinical trial to prove the hypothesized benefit of a drug to progression-free survival, possibly easing opportunities to evaluate combinations. Cancer Res; 78(7); 1873-82. ©2018 AACR . ©2018 American Association for Cancer Research.
Rider, Lisa G.; Aggarwal, Rohit; Pistorio, Angela; Bayat, Nastaran; Erman, Brian; Feldman, Brian M.; Huber, Adam M.; Cimaz, Rolando; Cuttica, Rubén J.; de Oliveira, Sheila Knupp; Lindsley, Carol B.; Pilkington, Clarissa A.; Punaro, Marilyn; Ravelli, Angelo; Reed, Ann M.; Rouster-Stevens, Kelly; van Royen, Annet; Dressler, Frank; Magalhaes, Claudia Saad; Constantin, Tamás; Davidson, Joyce E.; Magnusson, Bo; Russo, Ricardo; Villa, Luca; Rinaldi, Mariangela; Rockette, Howard; Lachenbruch, Peter A.; Miller, Frederick W.; Vencovsky, Jiri; Ruperto, Nicolino
2017-01-01
Objective Develop response criteria for juvenile dermatomyositis (JDM). Methods We analyzed the performance of 312 definitions that used core set measures (CSM) from either the International Myositis Assessment and Clinical Studies Group (IMACS) or the Pediatric Rheumatology International Trials Organization (PRINTO) and were derived from natural history data and a conjoint-analysis survey. They were further validated in the PRINTO trial of prednisone alone compared to prednisone with methotrexate or cyclosporine and the Rituximab in Myositis trial. Experts considered 14 top-performing candidate criteria based on their performance characteristics and clinical face validity using nominal group technique at a consensus conference. Results Consensus was reached for a conjoint analysis–based continuous model with a Total Improvement Score of 0-100, using absolute percent change in CSM with thresholds for minimal (≥30 points), moderate (≥45), and major improvement (≥70). The same criteria were chosen for adult dermatomyositis/polymyositis with differing thresholds for improvement. The sensitivity and specificity were 89% and 91-98% for minimal, 92-94% and 94-99% for moderate, and 91-98% and 85-85% for major improvement, respectively, in JDM patient cohorts using the IMACS and PRINTO CSM. These criteria were validated in the PRINTO trial for differentiating between treatment arms for minimal and moderate improvement (P=0.009–0.057) and in the Rituximab trial for significantly differentiating the physician rating of improvement (P<0.006). Conclusion The response criteria for JDM was a conjoint analysis–based model using a continuous improvement score based on absolute percent change in CSM, with thresholds for minimal, moderate, and major improvement. PMID:28382787
Rider, Lisa G.; Aggarwal, Rohit; Pistorio, Angela; Bayat, Nastaran; Erman, Brian; Feldman, Brian M.; Huber, Adam M.; Cimaz, Rolando; Cuttica, Rubén J.; de Oliveira, Sheila Knupp; Lindsley, Carol B.; Pilkington, Clarissa A.; Punaro, Marilyn; Ravelli, Angelo; Reed, Ann M.; Rouster-Stevens, Kelly; van Royen, Annet; Dressler, Frank; Magalhaes, Claudia Saad; Constantin, Tamás; Davidson, Joyce E.; Magnusson, Bo; Russo, Ricardo; Villa, Luca; Rinaldi, Mariangela; Rockette, Howard; Lachenbruch, Peter A.; Miller, Frederick W.; Vencovsky, Jiri; Ruperto, Nicolino
2017-01-01
Objective Develop response criteria for juvenile dermatomyositis (JDM). Methods We analyzed the performance of 312 definitions that used core set measures (CSM) from either the International Myositis Assessment and Clinical Studies Group (IMACS) or the Pediatric Rheumatology International Trials Organization (PRINTO) and were derived from natural history data and a conjoint-analysis survey. They were further validated in the PRINTO trial of prednisone alone compared to prednisone with methotrexate or cyclosporine and the Rituximab in Myositis trial. Experts considered 14 top-performing candidate criteria based on their performance characteristics and clinical face validity using nominal group technique at a consensus conference. Results Consensus was reached for a conjoint analysis–based continuous model with a Total Improvement Score of 0-100, using absolute percent change in CSM with thresholds for minimal (≥30 points), moderate (≥45), and major improvement (≥70). The same criteria were chosen for adult dermatomyositis/polymyositis with differing thresholds for improvement. The sensitivity and specificity were 89% and 91-98% for minimal, 92-94% and 94-99% for moderate, and 91-98% and 85-85% for major improvement, respectively, in JDM patient cohorts using the IMACS and PRINTO CSM. These criteria were validated in the PRINTO trial for differentiating between treatment arms for minimal and moderate improvement (P=0.009–0.057) and in the Rituximab trial for significantly differentiating the physician rating of improvement (P<0.006). Conclusion The response criteria for JDM was a conjoint analysis–based model using a continuous improvement score based on absolute percent change in CSM, with thresholds for minimal, moderate, and major improvement. PMID:28382778
Schwartz, Lawrence H; Seymour, Lesley; Litière, Saskia; Ford, Robert; Gwyther, Stephen; Mandrekar, Sumithra; Shankar, Lalitha; Bogaerts, Jan; Chen, Alice; Dancey, Janet; Hayes, Wendy; Hodi, F Stephen; Hoekstra, Otto S; Huang, Erich P; Lin, Nancy; Liu, Yan; Therasse, Patrick; Wolchok, Jedd D; de Vries, Elisabeth
2016-07-01
Radiologic imaging of disease sites plays a pivotal role in the management of patients with cancer. Response Evaluation Criteria in Solid Tumours (RECIST), introduced in 2000, and modified in 2009, has become the de facto standard for assessment of response in solid tumours in patients on clinical trials. The RECIST Working Group considers the ability of the global oncology community to implement and adopt updates to RECIST in a timely manner to be critical. Updates to RECIST must be tested, validated and implemented in a standardised, methodical manner in response to therapeutic and imaging technology advances as well as experience gained by users. This was the case with the development of RECIST 1.1, where an expanded data warehouse was developed to test and validate modifications. Similar initiatives are ongoing, testing RECIST in the evaluation of response to non-cytotoxic agents, immunotherapies, as well as in specific diseases. The RECIST Working Group has previously outlined the level of evidence considered necessary to formally and fully validate new imaging markers as an appropriate end-point for clinical trials. Achieving the optimal level of evidence desired is a difficult feat for phase III trials; this involves a meta-analysis of multiple prospective, randomised multicentre clinical trials. The rationale for modifications should also be considered; the modifications may be proposed to improve surrogacy, to provide a more mechanistic imaging technique, or be designed to improve reproducibility of the imaging biomarker. Here, we present the commonly described modifications of RECIST, each of which is associated with different levels of evidence and validation. Copyright © 2016. Published by Elsevier Ltd.
2012-01-01
Background Many women try to stop smoking in pregnancy but fail. One difficulty is that there is insufficient evidence that medications for smoking cessation are effective and safe in pregnancy and thus many women prefer to avoid these. Physical activity (PA) interventions may assist cessation; however, trials examining these interventions have been too small to detect or exclude plausible beneficial effects. The London Exercise And Pregnant smokers (LEAP) trial is investigating whether a PA intervention is effective and cost-effective when used for smoking cessation by pregnant women, and will be the largest study of its kind to date. Methods/design The LEAP study is a pragmatic, multi-center, two-arm, randomized, controlled trial that will target pregnant women who smoke at least one cigarette a day (and at least five cigarettes a day before pregnancy), and are between 10 and 24 weeks pregnant. Eligible patients are individually randomized to either usual care (that is, behavioral support for smoking cessation) or usual care plus a intervention (entailing supervised exercise on a treadmill plus PA consultations). The primary outcome of the trial is self-reported and biochemically validated continuous abstinence from smoking between a specified quit date and the end of pregnancy. The secondary outcomes, measured at 1 and 4 weeks after the quit date, and at the end of pregnancy and 6 months after childbirth, are PA levels, depression, self-confidence, and cigarette withdrawal symptoms. Smoking status will also be self-reported at 6 months after childbirth. In addition, perinatal measures will be collected, including antenatal complications, duration of labor, mode of delivery, and birth and placental weight. Outcomes will be analyzed on an intention-to-treat basis, and logistic regression models used to compare treatment effects on the primary outcome. Discussion This trial will assess whether a PA intervention is effective when used for smoking cessation during pregnancy. Trial registration ISRCTN48600346 PMID:23035669
Aparicio-Ting, Fabiola E; Farris, Megan; Courneya, Kerry S; Schiller, Ashley; Friedenreich, Christine M
2015-05-05
Few studies have examined recreational physical activity (RPA) after participating in a structured exercise intervention. More specifically, little is known about the long-term effects of exercise interventions in post-menopausal women. This study had two objectives: 1) To compare RPA in postmenopausal women in the exercise group and the control group 12 months after the end of the Alberta Physical Activity and Breast Cancer Prevention (ALPHA) Trial; and 2) To apply the Theory of Planned Behaviour (TPB) to identify predictors of RPA 12 months post-intervention among women in the exercise group. Self-reported RPA 12-months post-intervention from a validated questionnaire was used to estimate RPA levels for control group (118/160, 74% response) and exercise group participants (126/160, 79% response). Bivariate analysis was used to compare RPA between exercise and control group participants and to identify TPB variables for multivariate analysis. Logistic regression was applied to TPB data collected from self- administered questionnaires at end of trial by exercise group participants (126/160, 79% response) to identify predictors of long-term RPA. At 12 months post-intervention, 62% of women in the exercise group were active compared to 58% of controls (p = 0.52). Of the TPB constructs examined, self-efficacy (OR =2.98 (1.08-8.20)) and behavioural beliefs (OR = 1.46 (1.03-2.06)) were identified as predictors of RPA for exercise group participants. Levels of RPA in the exercise and control groups were comparable 12 months post intervention, indicating that participation in the ALPHA trial was associated with increased physical activity in previously inactive women, regardless of randomization into either the exercise group or in the control group. Exercise interventions that promote self-efficacy and positive behavioural beliefs have the potential to have long-term impacts on physical activity behaviour, although further research is needed to examine additional psychological, social and environmental predictors of long-term RPA in post-menopausal women. ClinicalTrials.gov NCT00522262.
Phase II cancer clinical trials for biomarker-guided treatments.
Jung, Sin-Ho
2018-01-01
The design and analysis of cancer clinical trials with biomarker depend on various factors, such as the phase of trials, the type of biomarker, whether the used biomarker is validated or not, and the study objectives. In this article, we demonstrate the design and analysis of two Phase II cancer clinical trials, one with a predictive biomarker and the other with an imaging prognostic biomarker. Statistical testing methods and their sample size calculation methods are presented for each trial. We assume that the primary endpoint of these trials is a time to event variable, but this concept can be used for any type of endpoint.
Hamilton, Roy H; Stark, Marianna; Coslett, H Branch
2010-01-01
Debate continues regarding the mechanisms underlying covert shifts of visual attention. We examined the relationship between target eccentricity and the speed of covert shifts of attention in normal subjects and patients with brain lesions using a cued-response task in which cues and targets were presented at 2 degrees or 8 degrees lateral to the fixation point. Normal subjects were slower on invalid trials in the 8 degrees as compared to 2 degrees condition. Patients with right-hemisphere stroke with neglect were slower in their responses to left-sided invalid targets compared to valid targets, and demonstrated a significant increase in the effect of target validity as a function of target eccentricity. Additional data from one neglect patient (JM) demonstrated an exaggerated validity x eccentricity x side interaction for contralesional targets on a cued reaction time task with a central (arrow) cue. We frame these results in the context of a continuous 'moving spotlight' model of attention, and also consider the potential role of spatial saliency maps. By either account, we argue that neglect is characterized by an eccentricity-dependent deficit in the allocation of attention.
The validity of ultrasound estimation of muscle volumes.
Infantolino, Benjamin W; Gales, Daniel J; Winter, Samantha L; Challis, John H
2007-08-01
The purpose of this study was to validate ultrasound muscle volume estimation in vivo. To examine validity, vastus lateralis ultrasound images were collected from cadavers before muscle dissection; after dissection, the volumes were determined by hydrostatic weighing. Seven thighs from cadaver specimens were scanned using a 7.5-MHz ultrasound probe (SSD-1000, Aloka, Japan). The perimeter of the vastus lateralis was identified in the ultrasound images and manually digitized. Volumes were then estimated using the Cavalieri principle, by measuring the image areas of sets of parallel two-dimensional slices through the muscles. The muscles were then dissected from the cadavers, and muscle volume was determined via hydrostatic weighing. There was no statistically significant difference between the ultrasound estimation of muscle volume and that estimated using hydrostatic weighing (p > 0.05). The mean percentage error between the two volume estimates was 0.4% +/- 6.9. Three operators all performed four digitizations of all images from one randomly selected muscle; there was no statistical difference between operators or trials and the intraclass correlation was high (>0.8). The results of this study indicate that ultrasound is an accurate method for estimating muscle volumes in vivo.
Measuring episodic memory across the lifespan: NIH Toolbox Picture Sequence Memory Test.
Dikmen, Sureyya S; Bauer, Patricia J; Weintraub, Sandra; Mungas, Dan; Slotkin, Jerry; Beaumont, Jennifer L; Gershon, Richard; Temkin, Nancy R; Heaton, Robert K
2014-07-01
Episodic memory is one of the most important cognitive domains that involves acquiring, storing and recalling new information. In this article, we describe a new measure developed for the NIH Toolbox, called the Picture Sequence Memory Test (PSMT) that is the first to examine episodic memory across the age range from 3 to 85. We describe the development of the measure and present validation data for ages 20 to 85. The PSMT involves presentation of sequences of pictured objects and activities in a fixed order on a computer screen and simultaneously verbally described, that the participant must remember and then reproduce over three learning trials. The results indicate good test-retest reliability and construct validity. Performance is strongly related to well-established "gold standard" measures of episodic memory and, as expected, much less well correlated with those of a measure of vocabulary. It shows clear decline with aging in parallel with a gold standard summary measure and relates to several other demographic factors and to self-reported general health status. The PSMT appears to be a reliable and valid test of episodic memory for adults, a finding similar to those found for the same measure with children.
Generalized worry disorder: a review of DSM-IV generalized anxiety disorder and options for DSM-V.
Andrews, Gavin; Hobbs, Megan J; Borkovec, Thomas D; Beesdo, Katja; Craske, Michelle G; Heimberg, Richard G; Rapee, Ronald M; Ruscio, Ayelet Meron; Stanley, Melinda A
2010-02-01
Generalized anxiety disorder (GAD) has undergone a series of substantial classificatory changes since its first inclusion in DSM-III. The majority of these revisions have been in response to its poor inter-rater reliability and concerns that it may lack diagnostic validity. This article provides options for the revision of the DSM-IV GAD criteria for DSM-V. First, searches were conducted to identify the evidence that previous DSM Work Groups relied upon when revising the DSM-III-R GAD and the overanxious disorder classifications. Second, the literature pertaining to the DSM-IV criteria for GAD was examined. The review presents a number of options to be considered for DSM-V. One option is for GAD to be re-labeled in DSM-V as generalized worry disorder. This would reflect its hallmark feature. Proposed revisions would result in a disorder that is characterized by excessive anxiety and worry generalized to a number of events or activities for 3 months or more. Worry acts as a cognitive coping strategy that manifests in avoidant behaviors. The reliability and validity of the proposed changes could be investigated in DSM-V validity tests and field trials.
Altered orientation of spatial attention in depersonalization disorder.
Adler, Julia; Beutel, Manfred E; Knebel, Achim; Berti, Stefan; Unterrainer, Josef; Michal, Matthias
2014-05-15
Difficulties with concentration are frequent complaints of patients with depersonalization disorder (DPD). Standard neuropsychological tests suggested alterations of the attentional and perceptual systems. To investigate this, the well-validated Spatial Cueing paradigm was used with two different tasks, consisting either in the detection or in the discrimination of visual stimuli. At the start of each trial a cue indicated either the correct (valid) or the incorrect (invalid) position of the upcoming stimulus or was uninformative (neutral). Only under the condition of increased task difficulty (discrimination task) differences between DPD patients and controls were observed. DPD patients showed a smaller total attention directing effect (RT in valid vs. invalid trials) compared to healthy controls only in the discrimination condition. RT costs (i.e., prolonged RT in neutral vs. invalid trials) mainly accounted for this difference. These results indicate that DPD is associated with altered attentional mechanisms, especially with a stronger responsiveness to unexpected events. From an evolutionary perspective this may be advantageous in a dangerous environment, in daily life it may be experienced as high distractibility. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
Sawata, Hiroshi; Ueshima, Kenji; Tsutani, Kiichiro
2011-04-14
Clinical evidence is important for improving the treatment of patients by health care providers. In the study of cardiovascular diseases, large-scale clinical trials involving thousands of participants are required to evaluate the risks of cardiac events and/or death. The problems encountered in conducting the Japanese Acute Myocardial Infarction Prospective (JAMP) study highlighted the difficulties involved in obtaining the financial and infrastructural resources necessary for conducting large-scale clinical trials. The objectives of the current study were: 1) to clarify the current funding and infrastructural environment surrounding large-scale clinical trials in cardiovascular and metabolic diseases in Japan, and 2) to find ways to improve the environment surrounding clinical trials in Japan more generally. We examined clinical trials examining cardiovascular diseases that evaluated true endpoints and involved 300 or more participants using Pub-Med, Ichushi (by the Japan Medical Abstracts Society, a non-profit organization), websites of related medical societies, the University Hospital Medical Information Network (UMIN) Clinical Trials Registry, and clinicaltrials.gov at three points in time: 30 November, 2004, 25 February, 2007 and 25 July, 2009. We found a total of 152 trials that met our criteria for 'large-scale clinical trials' examining cardiovascular diseases in Japan. Of these, 72.4% were randomized controlled trials (RCTs). Of 152 trials, 9.2% of the trials examined more than 10,000 participants, and 42.8% examined between 1,000 and 10,000 participants. The number of large-scale clinical trials markedly increased from 2001 to 2004, but suddenly decreased in 2007, then began to increase again. Ischemic heart disease (39.5%) was the most common target disease. Most of the larger-scale trials were funded by private organizations such as pharmaceutical companies. The designs and results of 13 trials were not disclosed. To improve the quality of clinical trials, all sponsors should register trials and disclose the funding sources before the enrolment of participants, and publish their results after the completion of each study.
Development and validation of a high-fidelity phonomicrosurgical trainer.
Klein, Adam M; Gross, Jennifer
2017-04-01
To validate the use of a high-fidelity phonomicrosurgical trainer. A high-fidelity phonomicrosurgical trainer, based on a previously validated model by Contag et al., 1 was designed with multilayered vocal folds that more closely mimic the consistency of true vocal folds, containing intracordal lesions to practice phonomicrosurgical removal. A training module was developed to simulate the true phonomicrosurgical experience. A validation study with novice and expert surgeons was conducted. Novices and experts were instructed to remove the lesion from the synthetic vocal folds, and novices were given four training trials. Performances were measured by the amount of time spent and tissue injury (microflap, superficial, deep) to the vocal fold. An independent Student t test and Fisher exact tests were used to compare subjects. A matched-paired t test and Wilcoxon signed rank tests were used to compare novice performance on the first and fourth trials and assess for improvement. Experts completed the excision with less total errors than novices (P = .004) and made less injury to the microflap (P = .05) and superficial tissue (P = .003). Novices improved their performance with training, making less total errors (P = .002) and superficial tissue injuries (P = .02) and spending less time for removal (P = .002) after several practice trials. This high-fidelity phonomicrosurgical trainer has been validated for novice surgeons. It can distinguish between experts and novices; and after training, it helped to improve novice performance. N/A. Laryngoscope, 127:888-893, 2017. © 2016 The American Laryngological, Rhinological and Otological Society, Inc.
Mark-Christensen, Troels; Troelsen, Anders; Kallemose, Thomas; Barfod, Kristoffer Weisskirchner
2016-06-01
The optimal treatment for acute Achilles tendon rupture (ATR) is continuously debated. Recent studies have proposed that the choice of either operative or non-operative treatment may not be as important as rehabilitation, suggesting that functional rehabilitation should be preferred over traditional immobilization. The purpose of this meta-analysis of randomized controlled trials (RCTs) was to compare functional rehabilitation to immobilization in the treatment of ATR. This meta-analysis was conducted using the databases: PubMed, EMBASE, Rehabilitation and Sports Medicine Source, AMED, CINAHL, Cochrane Library and PEDro using the search terms: "Achilles tendon," "rupture," "mobilization" and "immobilization". Seven RCTs involving 427 participants were eligible for inclusion, with a total of 211 participants treated with functional rehabilitation and 216 treated with immobilization. Re-rupture rate, other complications, strength, range of motion, duration of sick leave, return to sport and patient satisfaction were examined. There were no statistically significant differences between groups. A trend favoring functional rehabilitation was seen regarding the examined outcomes. Functional rehabilitation after acute Achilles tendon rupture does not increase the rate of re-rupture or other complications. A trend toward earlier return to work and sport, and increased patient satisfaction was found when functional rehabilitation was used. The present literature is of low-to-average quality, and the basic constructs of the examined treatment and study protocols vary considerably. Larger, randomized controlled trials using validated outcome measures are needed to confirm the findings. II.
Methodological Issues in Research on Web-Based Behavioral Interventions
Danaher, Brian G; Seeley, John R
2013-01-01
Background Web-based behavioral intervention research is rapidly growing. Purpose We review methodological issues shared across Web-based intervention research to help inform future research in this area. Methods We examine measures and their interpretation using exemplar studies and our research. Results We report on research designs used to evaluate Web-based interventions and recommend newer, blended designs. We review and critique methodological issues associated with recruitment, engagement, and social validity. Conclusions We suggest that there is value to viewing this burgeoning realm of research from the broader context of behavior change research. We conclude that many studies use blended research designs, that innovative mantling designs such as the Multiphase Optimization Strategy and Sequential Multiple Assignment Randomized Trial methods hold considerable promise and should be used more widely, and that Web-based controls should be used instead of usual care or no-treatment controls in public health research. We recommend topics for future research that address participant recruitment, engagement, and social validity. PMID:19806416
Evidence-Based School Behavior Assessment of Externalizing Behavior in Young Children.
Bagner, Daniel M; Boggs, Stephen R; Eyberg, Sheila M
2010-02-01
This study examined the psychometric properties of the Revised Edition of the School Observation Coding System (REDSOCS). Participants were 68 children ages 3 to 6 who completed parent-child interaction therapy for Oppositional Defiant Disorder as part of a larger efficacy trial. Interobserver reliability on REDSOCS categories was moderate to high, with percent agreement ranging from 47% to 90% (M = 67%) and Cohen's kappa coefficients ranging from .69 to .95 (M = .82). Convergent validity of the REDSOCS categories was supported by significant correlations with the Intensity Scale of the Sutter-Eyberg Student Behavior Inventory-Revised and related subscales of the Conners' Teacher Rating Scale-Revised: Long Version (CTRS-R: L). Divergent validity was indicated by nonsignificant correlations between REDSOCS categories and scales on the CTRS-R: L expected not to relate to disruptive classroom behavior. Treatment sensitivity was demonstrated for two of the three primary REDSOCS categories by significant pre to posttreatment changes. This study provides psychometric support for the designation of REDSOCS as an evidence-based assessment procedure for young children.
Outcome Measures in Spinal Cord Injury
Alexander, Marcalee S.; Anderson, Kim; Biering-Sorensen, Fin; Blight, Andrew R.; Brannon, Ruth; Bryce, Thomas; Creasey, Graham; Catz, Amiram; Curt, Armin; Donovan, William; Ditunno, John; Ellaway, Peter; Finnerup, Nanna B.; Graves, Daniel E.; Haynes, Beth Ann; Heinemann, Allen W.; Jackson, Amie B.; Johnston, Mark; Kalpakjian, Claire Z.; Kleitman, Naomi; Krassioukov, Andrei; Krogh, Klaus; Lammertse, Daniel; Magasi, Susan; Mulcahey, MJ; Schurch, Brigitte; Sherwood, Arthur; Steeves, John D.; Stiens, Steven; Tulsky, David S.; van Hedel, Hubertus J.A.; Whiteneck, Gale
2009-01-01
Study Design review by the Spinal Cord Outcomes Partnership Endeavor (SCOPE), which is a broad-based international consortium of scientists and clinical researchers representing academic institutions, industry, government agencies, not-for-profit organizations and foundations. Objectives assessment of current and evolving tools for evaluating human spinal cord injury (SCI) outcomes for both clinical diagnosis and clinical research studies. Methods a framework for the appraisal of evidence of metric properties was used to examine outcome tools or tests for accuracy, sensitivity, reliability and validity for human SCI. Results imaging, neurological, functional, autonomic, sexual health, bladder/bowel, pain, and psycho-social tools were evaluated. Several specific tools for human SCI studies have or are being developed to allow the more accurate determination for a clinically meaningful benefit (improvement in functional outcome or quality of life) being achieved as a result of a therapeutic intervention. Conclusion significant progress has been made, but further validation studies are required to identify the most appropriate tools for specific targets in a human SCI study or clinical trial. PMID:19381157
Cerebrospinal Fluid Biomarkers for Huntington's Disease.
Byrne, Lauren M; Wild, Edward J
2016-01-01
Cerebrospinal fluid (CSF) is enriched in brain-derived components and represents an accessible and appealing means of interrogating the CNS milieu to study neurodegenerative diseases and identify biomarkers to facilitate the development of novel therapeutics. Many such CSF biomarkers have been proposed for Huntington's disease (HD) but none has been validated for clinical trial use. Across many studies proposing dozens of biomarker candidates, there is a notable lack of statistical power, consistency, rigor and validation. Here we review proposed CSF biomarkers including neurotransmitters, transglutaminase activity, kynurenine pathway metabolites, oxidative stress markers, inflammatory markers, neuroendocrine markers, protein markers of neuronal death, proteomic approaches and mutant huntingtin protein itself. We reflect on the need for large-scale, standardized CSF collections with detailed phenotypic data to validate and qualify much-needed CSF biomarkers for clinical trial use in HD.
Bardhan, Karna D; Berghöfer, Peter
2007-01-01
The ReQuest (Reflux Questionnaire) is a new instrument: a diary developed specifically to allow patients to measure the totality of their gastroesophageal reflux disease (GERD) symptoms and to track changes each day during treatment, an increasingly important need in clinical trials. This paper reviews the background, development, testing, and validation of the instrument and shows how its flexibility allows it to be used in new ways to assess different aspects of GERD. There were four key steps. (1) A full symptom spectrum was gathered from experts, literature and, crucially, from GERD patients, and comprised 67 symptom descriptions. (2) By expert consensus, these symptom descriptions were empirically condensed into six easier-to-handle 'dimensions' (acid complaints, upper abdominal/stomach complaints, lower abdominal/digestive complaints, nausea, sleep disturbances, other complaints), to which the seventh, general well-being, was added, a key feature of the instrument. The symptom burden of each dimension is measured as frequency x intensity (general well-being: intensity only). (3) This prototype instrument was tested in PPI clinical trials involving patients with erosive and non-erosive GERD, while the data generated were used to validate the prototype. (4) Finally, the scale was refined by factor analysis, a statistical process. Detailed statistical testing validating the scale and factor analysis confirmed that empirically condensing the symptom descriptions into dimensions did not affect the measurement properties of the instrument. FURTHER APPLICATION: The structure of ReQuest and its product, which is numerical, makes the instrument highly flexible and allows for its use in other GERD areas. (1) Its subscales, ReQuest-GI and ReQuest-WSO, measure symptoms traditionally associated with reflux and with general well-being, respectively, and permit these to be quantified and tracked independently. (2) Minor degrees of reflux symptoms are common amongst the healthy. The level was determined in a large population without evidence of disease, and a 'GERD symptom threshold' calculated. Reduction below this threshold to 'background levels' is a more realistic end point in clinical trials than the rarely achieved 'complete absence'. (3) ReQuest-GI was re-scaled and integrated with the modified Los Angeles scale used to grade esophageal appearances at endoscopy. The new instrument is a matrix, the ReQuest/LA-classification, which allows both symptoms and endoscopy appearances to be expressed by a single set of numbers in individual patients and in populations. (4) This in turn allows identification of patients who on treatment achieve 'complete remission', i.e. healing and symptom relief. PPI clinical trials confirm that it takes a longer period of treatment to reach this endpoint than healing or symptom relief alone. Thus the conventional 4-8 weeks of PPI therapy may unwittingly result in under-treatment. (5) Studies are in progress to determine the 'minimum clinically important difference' (MCID). This is the minimum change measured on treatment, which is judged as being of clinical benefit, an increasingly important subject. (6) Finally, the ReQuest used in clinical trials has been simplified for application in day-to-day clinical practice. The results obtained with 'ReQuest in Practice', the simplified version, are similar to those observed with the full version, as confirmed in a large-scale study in general practice. The clinical practice version has also been validated. (1) The characteristics of ReQuest make it suitable for use as the primary endpoint in clinical trials assessing the outcome of GERD. (2) The subscales allows for closer examination on the nature of GERD and response to treatment. (3) A simplified version has proven suitable for use in daily practice. Copyright 2007 S. Karger AG, Basel.
Urquhart, Donna M; Wluka, Anita E; Sim, Malcolm R; van Tulder, Maurits; Forbes, Andrew; Gibson, Stephen J; Arnold, Carolyn; Fong, Chris; Anthony, Shane N; Cicuttini, Flavia M
2016-10-22
Low back pain is a major clinical and public health problem, with limited evidence-based treatments. Low-dose antidepressants are commonly used to treat pain in chronic low back pain. However, their efficacy is unproven. The aim of this pragmatic, double-blind, randomised, placebo-controlled trial is to determine whether low-dose amitriptyline (an antidepressant) is more effective than placebo in reducing pain in individuals with chronic low back pain. One hundred and fifty individuals with chronic low back pain will be recruited through hospital and private medical and allied health clinics, advertising in local media and posting of flyers in community locations. They will be randomly allocated to receive either low-dose amitriptyline (25 mg) or an active placebo (benztropine mesylate, 1 mg) for 6 months. The primary outcome measure of pain intensity will be assessed at baseline, 3 and 6 months using validated questionnaires. Secondary measures of self-reported low back disability, work absence and hindrance in the performance of paid/unpaid work will also be examined. Intention-to-treat analyses will be performed. This pragmatic, double-blind, randomised, placebo-controlled trial will provide evidence regarding the effectiveness of low-dose antidepressants compared with placebo in reducing pain, disability, work absenteeism and hindrance in work performance in individuals with chronic low back pain. This trial has major public health and clinical importance as it has the potential to provide an effective approach to the management of chronic low back pain. Australian New Zealand Clinical Trials Registry: ACTRN12612000131853 ; registered on 30 January 2012.
Tomlin, Zelda; deSalis, Isabel; Toerien, Merran; Donovan, Jenny L
2014-10-01
With the routinization of evidence-based medicine and of the randomized-controlled trial (RCT), more patients are becoming 'sites of evidence production' yet, little is known about how they are recruited as participants; there is some evidence that 'substantively valid consent' is difficult to achieve. To explore the views and experiences of nurses recruiting patients to randomized-controlled trials and to examine the extent to which their recruitment practices were patient-centred and patient empowering. Semi-structured in-depth interviews; audio recording of recruitment appointments; thematic interactional analysis (drawing on discourse and conversation analysis). Nurses recruiting patients to five publicly funded RCTs and patients consenting to the recording of their recruitment sessions. The views of recruiting nurses about their recruitment role; the extent to which nurse-patient interactions were patient-centred; the nature of the nurses' interactional strategies and the nature and extent of patient participation in the discussion. The nurses had a keen sense of themselves as clinicians and patient advocates and their perceptions of the trial and its interventions were inextricably linked to those of the patients. However, many of their recruitment practices made it difficult for patients to play an active and informed part in the discussion about trial participation, raising questions over the quality of consent decisions. Nurses working in patient recruitment to RCTs need to reconcile two different worlds with different demands and ethics. Evidence production, a central task in evidence-based medicine, poses a challenge to patient-centred practice and more research and relevant training are needed. © 2012 John Wiley & Sons Ltd.
Rosenblum, Uri; Melzer, Itshak
2017-01-01
About 90% of people with multiple sclerosis (PwMS) have gait instability and 50% fall. Reliable and clinically feasible methods of gait instability assessment are needed. The study investigated the reliability and validity of the Narrow Path Walking Test (NPWT) under single-task (ST) and dual-task (DT) conditions for PwMS. Thirty PwMS performed the NPWT on 2 different occasions, a week apart. Number of Steps, Trial Time, Trial Velocity, Step Length, Number of Step Errors, Number of Cognitive Task Errors, and Number of Balance Losses were measured. Intraclass correlation coefficients (ICC2,1) were calculated from the average values of NPWT parameters. Absolute reliability was quantified from standard error of measurement (SEM) and smallest real difference (SRD). Concurrent validity of NPWT with Functional Reach Test, Four Square Step Test (FSST), 12-item Multiple Sclerosis Walking Scale (MSWS-12), and 2 Minute Walking Test (2MWT) was determined using partial correlations. Intraclass correlation coefficients (ICCs) for most NPWT parameters during ST and DT ranged from 0.46-0.94 and 0.55-0.95, respectively. The highest relative reliability was found for Number of Step Errors (ICC = 0.94 and 0.93, for ST and DT, respectively) and Trial Velocity (ICC = 0.83 and 0.86, for ST and DT, respectively). Absolute reliability was high for Number of Step Errors in ST (SEM % = 19.53%) and DT (SEM % = 18.14%) and low for Trial Velocity in ST (SEM % = 6.88%) and DT (SEM % = 7.29%). Significant correlations for Number of Step Errors and Trial Velocity were found with FSST, MSWS-12, and 2MWT. In persons with PwMS performing the NPWT, Number of Step Errors and Trial Velocity were highly reliable parameters. Based on correlations with other measures of gait instability, Number of Step Errors was the most valid parameter of dynamic balance under the conditions of our test.Video Abstract available for more insights from the authors (see Supplemental Digital Content 1, available at: http://links.lww.com/JNPT/A159).
An Ontology-based Architecture for Integration of Clinical Trials Management Applications
Shankar, Ravi D.; Martins, Susana B.; O’Connor, Martin; Parrish, David B.; Das, Amar K.
2007-01-01
Management of complex clinical trials involves coordinated-use of a myriad of software applications by trial personnel. The applications typically use distinct knowledge representations and generate enormous amount of information during the course of a trial. It becomes vital that the applications exchange trial semantics in order for efficient management of the trials and subsequent analysis of clinical trial data. Existing model-based frameworks do not address the requirements of semantic integration of heterogeneous applications. We have built an ontology-based architecture to support interoperation of clinical trial software applications. Central to our approach is a suite of clinical trial ontologies, which we call Epoch, that define the vocabulary and semantics necessary to represent information on clinical trials. We are continuing to demonstrate and validate our approach with different clinical trials management applications and with growing number of clinical trials. PMID:18693919
NIH EXAMINER: Conceptualization and Development of an Executive Function Battery
Kramer, Joel H.; Mungas, Dan; Possin, Katherine L.; Rankin, Katherine P.; Boxer, Adam L.; Rosen, Howard J.; Bostrom, Alan; Sinha, Lena; Berhel, Ashley; Widmeyer, Mary
2015-01-01
Executive functioning is widely targeted when human cognition is assessed, but there is little consensus on how it should be operationalized and measured. Recognizing the difficulties associated with establishing standard operational definitions of executive functioning, the National Institute of Neurological Disorders and Stroke entered into a contract with the University of California-San Francisco to develop psychometrically robust executive measurement tools that would be accepted by the neurology clinical trials and clinical research communities. This effort, entitled Executive Abilities: Measures and Instruments for Neurobehavioral Evaluation and Research (EXAMINER), resulted in a series of tasks targeting working memory, inhibition, set shifting, fluency, insight, planning, social cognition and behavior. We describe battery conceptualization and development, data collection, scale construction based on item response theory, and lay the foundation for studying the battery’s utility and validity for specific assessment and research goals. PMID:24103232
Differential Scavenging Among Pig, Rabbit, and Human Subjects.
Steadman, Dawnie Wolfe; Dautartas, Angela; Kenyhercz, Michael W; Jantz, Lee M; Mundorff, Amy; Vidoli, Giovanna M
2018-04-12
Different animal species have been used as proxies for human remains in decomposition studies for decades, although few studies have sought to validate their use in research aimed at estimating the postmortem interval. This study examines 45 pig, rabbit, and human subjects placed in three seasonal trials at the Anthropology Research Facility. In an earlier paper, we found that overall decomposition trends did vary between species that could be due to differential insect and scavenger behavior. This study specifically examines if scavenger behavior differs by carrion species. Daily photographs, game camera photographs, written observations, and Total Body Score (TBS) documented scavenging and decomposition changes. Results show that raccoons were the most commonly observed vertebrate scavenger, that scavenging was most extensive in winter, and that certain human subjects were preferred over other humans and all non-human subjects. Finally, scavenging activity greatly reduces the accuracy of postmortem interval estimates based on TBS. © 2018 American Academy of Forensic Sciences.
Review of patient-reported outcome measures in chronic hepatitis C
2012-01-01
Background Chronic hepatitis C (CHC) and its treatment are associated with a variety of patient-reported symptoms and impacts. Some CHC symptoms and impacts may be difficult to evaluate through objective clinical testing, and more easily measured through patient self-report. This literature review identified concepts raised by CHC patients related to symptoms, impacts, and treatment effects, and evaluated integration of these concepts within patient-reported outcome (PRO) measures. The goal of this work was to provide recommendations for incorporation of PRO measurement of concepts that are relevant to the CHC experience into CHC clinical trial design. Methods A three-tiered literature search was conducted. This included searches on concepts of importance, PRO measures used in clinical trials, and existing PRO measures. The PRO Concept Search focused on reviewing issues raised by CHC patients about CHC symptoms, disease impact, and treatment effects. The CHC Trials with PRO Endpoints Search reviewed clinical trials with PRO endpoints to assess differences between treatments over time. The PRO Measure Search reviewed existing PRO measures associated with the concepts of interest. Results This multi-tiered approach identified five key concepts of interest: depression/anxiety, fatigue, flu-like symptoms, cognitive function, insomnia. Comparing these five concepts of interest to the PRO measures in published CHC clinical trials showed that, while treatment of CHC may decrease health-related quality of life in a number of mental and physical domains, the PRO measures that were utilized in published clinical trials inadequately covered the concepts of interest. Further review of 18 existing PRO measures of the concepts of interest showed only four of the 18 were validated in CHC populations. Conclusions This review identified several gaps in the literature regarding assessment of symptoms and outcomes reported as important by CHC patients. Further research is needed to ensure that CHC clinical trials evaluate concepts that are important to patients and include measures that have evidence supporting content validity, reliability, construct validity, and responsiveness. PMID:22871087
Pre-trial inter-laboratory analytical validation of the FOCUS4 personalised therapy trial.
Richman, Susan D; Adams, Richard; Quirke, Phil; Butler, Rachel; Hemmings, Gemma; Chambers, Phil; Roberts, Helen; James, Michelle D; Wozniak, Sue; Bathia, Riya; Pugh, Cheryl; Maughan, Timothy; Jasani, Bharat
2016-01-01
Molecular characterisation of tumours is increasing personalisation of cancer therapy, tailored to an individual and their cancer. FOCUS4 is a molecularly stratified clinical trial for patients with advanced colorectal cancer. During an initial 16-week period of standard first-line chemotherapy, tumour tissue will undergo several molecular assays, with the results used for cohort allocation, then randomisation. Laboratories in Leeds and Cardiff will perform the molecular testing. The results of a rigorous pre-trial inter-laboratory analytical validation are presented and discussed. Wales Cancer Bank supplied FFPE tumour blocks from 97 mCRC patients with consent for use in further research. Both laboratories processed each sample according to an agreed definitive FOCUS4 laboratory protocol, reporting results directly to the MRC Trial Management Group for independent cross-referencing. Pyrosequencing analysis of mutation status at KRAS codons12/13/61/146, NRAS codons12/13/61, BRAF codon600 and PIK3CA codons542/545/546/1047, generated highly concordant results. Two samples gave discrepant results; in one a PIK3CA mutation was detected only in Leeds, and in the other, a PIK3CA mutation was only detected in Cardiff. pTEN and mismatch repair (MMR) protein expression was assessed by immunohistochemistry (IHC) resulting in 6/97 discordant results for pTEN and 5/388 for MMR, resolved upon joint review. Tumour heterogeneity was likely responsible for pyrosequencing discrepancies. The presence of signet-ring cells, necrosis, mucin, edge-effects and over-counterstaining influenced IHC discrepancies. Pre-trial assay analytical validation is essential to ensure appropriate selection of patients for targeted therapies. This is feasible for both mutation testing and immunohistochemical assays and must be built into the workup of such trials. ISRCTN90061564. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://www.bmj.com/company/products-services/rights-and-licensing/
Smith, Shannon M; Dworkin, Robert H; Turk, Dennis C; Baron, Ralf; Polydefkis, Michael; Tracey, Irene; Borsook, David; Edwards, Robert R; Harris, Richard E; Wager, Tor D; Arendt-Nielsen, Lars; Burke, Laurie B; Carr, Daniel B; Chappell, Amy; Farrar, John T; Freeman, Roy; Gilron, Ian; Goli, Veeraindar; Haeussler, Juergen; Jensen, Troels; Katz, Nathaniel P; Kent, Jeffrey; Kopecky, Ernest A; Lee, David A; Maixner, William; Markman, John D; McArthur, Justin C; McDermott, Michael P; Parvathenani, Lav; Raja, Srinivasa N; Rappaport, Bob A; Rice, Andrew S C; Rowbotham, Michael C; Tobias, Jeffrey K; Wasan, Ajay D; Witter, James
2017-07-01
Valid and reliable biomarkers can play an important role in clinical trials as indicators of biological or pathogenic processes or as a signal of treatment response. Currently, there are no biomarkers for pain qualified by the U.S. Food and Drug Administration or the European Medicines Agency for use in clinical trials. This article summarizes an Initiative on Methods, Measurement, and Pain Assessment in Clinical Trials meeting in which 3 potential biomarkers were discussed for use in the development of analgesic treatments: 1) sensory testing, 2) skin punch biopsy, and 3) brain imaging. The empirical evidence supporting the use of these tests is described within the context of the 4 categories of biomarkers: 1) diagnostic, 2) prognostic, 3) predictive, and 4) pharmacodynamic. Although sensory testing, skin punch biopsy, and brain imaging are promising tools for pain in clinical trials, additional evidence is needed to further support and standardize these tests for use as biomarkers in pain clinical trials. The applicability of sensory testing, skin biopsy, and brain imaging as diagnostic, prognostic, predictive, and pharmacodynamic biomarkers for use in analgesic treatment trials is considered. Evidence in support of their use and outlining problems is presented, as well as a call for further standardization and demonstrations of validity and reliability. Copyright © 2017 American Pain Society. All rights reserved.
Breast Cancer Screening by Physical Examination: Randomized Trial in the Phillipines
2005-10-01
J -4327 TITLE: Breast Cancer Screening by Physical Examination: Randomized Trial in the Phillipines...Examination: Randomized Trial in the 5a. CONTRACT NUMBER Phillipines 5b. GRANT NUMBER DAMD17-94- J -4327 5c. PROGRAM ELEMENT NUMBER 6. AUTHOR(S...Grant DAMD17-94- J -4327 3 Table of
NASA Astrophysics Data System (ADS)
Sya’bandari, Y.; Firman, H.; Rusyati, L.
2018-05-01
The method used in this research was descriptive research for profiling the validation of SVT-MH to measure students’ critical thinking on matter and heat topic in junior high school. The subject is junior high school students of 7th grade (13 years old) while science teacher and expert as the validators. The instruments that used as a tool to obtain the data are rubric expert judgment (content, media, education) and rubric of readability test. There are four steps to validate SVT-MH in 7th grade Junior High School. These steps are analysis of core competence and basic competence based on Curriculum 2013, expert judgment (content, media, education), readability test and trial test (limited and larger trial test). The instrument validation resulted 30 items that represent 8 elements and 21 sub-elements to measure students’ critical thinking based on Inch in matter and heat topic. The alpha Cronbach (α) is 0.642 which means that the instrument is sufficient to measure students’ critical thinking matter and heat topic.
Cross-trial prediction of treatment outcome in depression: a machine learning approach.
Chekroud, Adam Mourad; Zotti, Ryan Joseph; Shehzad, Zarrar; Gueorguieva, Ralitza; Johnson, Marcia K; Trivedi, Madhukar H; Cannon, Tyrone D; Krystal, John Harrison; Corlett, Philip Robert
2016-03-01
Antidepressant treatment efficacy is low, but might be improved by matching patients to interventions. At present, clinicians have no empirically validated mechanisms to assess whether a patient with depression will respond to a specific antidepressant. We aimed to develop an algorithm to assess whether patients will achieve symptomatic remission from a 12-week course of citalopram. We used patient-reported data from patients with depression (n=4041, with 1949 completers) from level 1 of the Sequenced Treatment Alternatives to Relieve Depression (STAR*D; ClinicalTrials.gov, number NCT00021528) to identify variables that were most predictive of treatment outcome, and used these variables to train a machine-learning model to predict clinical remission. We externally validated the model in the escitalopram treatment group (n=151) of an independent clinical trial (Combining Medications to Enhance Depression Outcomes [COMED]; ClinicalTrials.gov, number NCT00590863). We identified 25 variables that were most predictive of treatment outcome from 164 patient-reportable variables, and used these to train the model. The model was internally cross-validated, and predicted outcomes in the STAR*D cohort with accuracy significantly above chance (64·6% [SD 3·2]; p<0·0001). The model was externally validated in the escitalopram treatment group (N=151) of COMED (accuracy 59·6%, p=0.043). The model also performed significantly above chance in a combined escitalopram-buproprion treatment group in COMED (n=134; accuracy 59·7%, p=0·023), but not in a combined venlafaxine-mirtazapine group (n=140; accuracy 51·4%, p=0·53), suggesting specificity of the model to underlying mechanisms. Building statistical models by mining existing clinical trial data can enable prospective identification of patients who are likely to respond to a specific antidepressant. Yale University. Copyright © 2016 Elsevier Ltd. All rights reserved.
Dacombe, Peter Jonathan; Amirfeyz, Rouin; Davis, Tim
2016-03-01
Patient-reported outcome measures (PROMs) are important tools for assessing outcomes following injuries to the hand and wrist. Many commonly used PROMs have no evidence of reliability, validity, and responsiveness in a hand and wrist trauma population. This systematic review examines the PROMs used in the assessment of hand and wrist trauma patients, and the evidence for reliability, validity, and responsiveness of each measure in this population. A systematic review of Pubmed, Medline, and CINAHL searching for randomized controlled trials of patients with traumatic injuries to the hand and wrist was carried out to identify the PROMs. For each identified PROM, evidence of reliability, validity, and responsiveness was identified using a further systematic review of the Pubmed, Medline, CINAHL, and reverse citation trail audit procedure. The PROM used most often was the Disabilities of the Arm, Shoulder and Hand (DASH) questionnaire; the Patient-Rated Wrist Evaluation (PRWE), Gartland and Werley score, Michigan Hand Outcomes score, Mayo Wrist Score, and Short Form 36 were also commonly used. Only the DASH and PRWE have evidence of reliability, validity, and responsiveness in patients with traumatic injuries to the hand and wrist; other measures either have incomplete evidence or evidence gathered in a nontraumatic population. The DASH and PRWE both have evidence of reliability, validity, and responsiveness in a hand and wrist trauma population. Other PROMs used to assess hand and wrist trauma patients do not. This should be considered when selecting a PROM for patients with traumatic hand and wrist pathology.
Hales, M; Biros, E; Reznik, J E
2015-01-01
Since 1982, the International Standards for Neurological Classification of Spinal Cord Injury (ISNCSCI) has been used to classify sensation of spinal cord injury (SCI) through pinprick and light touch scores. The absence of proprioception, pain, and temperature within this scale creates questions about its validity and accuracy. To assess whether the sensory component of the ISNCSCI represents a reliable and valid measure of classification of SCI. A systematic review of studies examining the reliability and validity of the sensory component of the ISNCSCI published between 1982 and February 2013 was conducted. The electronic databases MEDLINE via Ovid, CINAHL, PEDro, and Scopus were searched for relevant articles. A secondary search of reference lists was also completed. Chosen articles were assessed according to the Oxford Centre for Evidence-Based Medicine hierarchy of evidence and critically appraised using the McMasters Critical Review Form. A statistical analysis was conducted to investigate the variability of the results given by reliability studies. Twelve studies were identified: 9 reviewed reliability and 3 reviewed validity. All studies demonstrated low levels of evidence and moderate critical appraisal scores. The majority of the articles (~67%; 6/9) assessing the reliability suggested that training was positively associated with better posttest results. The results of the 3 studies that assessed the validity of the ISNCSCI scale were confounding. Due to the low to moderate quality of the current literature, the sensory component of the ISNCSCI requires further revision and investigation if it is to be a useful tool in clinical trials.
Hales, M.; Biros, E.
2015-01-01
Background: Since 1982, the International Standards for Neurological Classification of Spinal Cord Injury (ISNCSCI) has been used to classify sensation of spinal cord injury (SCI) through pinprick and light touch scores. The absence of proprioception, pain, and temperature within this scale creates questions about its validity and accuracy. Objectives: To assess whether the sensory component of the ISNCSCI represents a reliable and valid measure of classification of SCI. Methods: A systematic review of studies examining the reliability and validity of the sensory component of the ISNCSCI published between 1982 and February 2013 was conducted. The electronic databases MEDLINE via Ovid, CINAHL, PEDro, and Scopus were searched for relevant articles. A secondary search of reference lists was also completed. Chosen articles were assessed according to the Oxford Centre for Evidence-Based Medicine hierarchy of evidence and critically appraised using the McMasters Critical Review Form. A statistical analysis was conducted to investigate the variability of the results given by reliability studies. Results: Twelve studies were identified: 9 reviewed reliability and 3 reviewed validity. All studies demonstrated low levels of evidence and moderate critical appraisal scores. The majority of the articles (~67%; 6/9) assessing the reliability suggested that training was positively associated with better posttest results. The results of the 3 studies that assessed the validity of the ISNCSCI scale were confounding. Conclusions: Due to the low to moderate quality of the current literature, the sensory component of the ISNCSCI requires further revision and investigation if it is to be a useful tool in clinical trials. PMID:26363591
Ausserhofer, Dietmar; Anderson, Ruth A; Colón-Emeric, Cathleen; Schwendimann, René
2013-08-01
The Safety Organizing Scale is a valid and reliable measure on safety behaviors and practices in hospitals. This study aimed to explore the psychometric properties of the Safety Organizing Scale-Nursing Home version (SOS-NH). In a cross-sectional analysis of staff survey data, we examined validity and reliability of the 9-item Safety SOS-NH using American Educational Research Association guidelines. This substudy of a larger trial used baseline survey data collected from staff members (n = 627) in a variety of work roles in 13 nursing homes (NHs) in North Carolina and Virginia. Psychometric evaluation of the SOS-NH revealed good response patterns with low average of missing values across all items (3.05%). Analyses of the SOS-NH's internal structure (eg, comparative fit indices = 0.929, standardized root mean square error of approximation = 0.045) and consistency (composite reliability = 0.94) suggested its 1-dimensionality. Significant between-facility variability, intraclass correlations, within-group agreement, and design effect confirmed appropriateness of the SOS-NH for measurement at the NH level, justifying data aggregation. The SOS-NH showed discriminate validity from one related concept: communication openness. Initial evidence regarding validity and reliability of the SOS-NH supports its utility in measuring safety behaviors and practices among a wide range of NH staff members, including those with low literacy. Further psychometric evaluation should focus on testing concurrent and criterion validity, using resident outcome measures (eg, patient fall rates). Copyright © 2013 American Medical Directors Association, Inc. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Rainina, Evguenia I.; McCune, D. E.; Luna, Maria L.
2012-05-31
The goal of this study was to validate the previously observed high biological kill performance of PAEROSOL, a semi-dry, micro-aerosol decontamination technology, against common HAI in a non-human subject trial within a hospital setting of Madigan Army Medical Center (MAMC) on Joint Base Lewis-McChord in Tacoma, Washington. In addition to validating the disinfecting efficacy of PAEROSOL, the objectives of the trial included a demonstration of PAEROSOL environmental safety, (i.e., impact to hospital interior materials and electronic equipment exposed during testing) and PAEROSOL parameters optimization for future deployment.
Biomarker-Guided Adaptive Trial Designs in Phase II and Phase III: A Methodological Review
Antoniou, Miranta; Jorgensen, Andrea L; Kolamunnage-Dona, Ruwanthi
2016-01-01
Background Personalized medicine is a growing area of research which aims to tailor the treatment given to a patient according to one or more personal characteristics. These characteristics can be demographic such as age or gender, or biological such as a genetic or other biomarker. Prior to utilizing a patient’s biomarker information in clinical practice, robust testing in terms of analytical validity, clinical validity and clinical utility is necessary. A number of clinical trial designs have been proposed for testing a biomarker’s clinical utility, including Phase II and Phase III clinical trials which aim to test the effectiveness of a biomarker-guided approach to treatment; these designs can be broadly classified into adaptive and non-adaptive. While adaptive designs allow planned modifications based on accumulating information during a trial, non-adaptive designs are typically simpler but less flexible. Methods and Findings We have undertaken a comprehensive review of biomarker-guided adaptive trial designs proposed in the past decade. We have identified eight distinct biomarker-guided adaptive designs and nine variations from 107 studies. Substantial variability has been observed in terms of how trial designs are described and particularly in the terminology used by different authors. We have graphically displayed the current biomarker-guided adaptive trial designs and summarised the characteristics of each design. Conclusions Our in-depth overview provides future researchers with clarity in definition, methodology and terminology for biomarker-guided adaptive trial designs. PMID:26910238
McLean, Christine; Rohan, Maheswaran; Sisk, Rose; Dobbs, Terry; Nada-Raja, Shyamala; Wilson, Denise; Vandal, Alain C
2016-01-01
Background Automated eHealth Web-based research trials offer people an accessible, confidential opportunity to engage in research that matters to them. eHealth trials may be particularly useful for sensitive issues when seeking health care may be accompanied by shame and mistrust. Yet little is known about people’s early engagement with eHealth trials, from recruitment to preintervention autoregistration processes. A recent randomized controlled trial that tested the effectiveness of an eHealth safety decision aid for New Zealand women in the general population who experienced intimate partner violence (isafe) provided the opportunity to examine recruitment and preintervention participant engagement with a fully automated Web-based registration process. The trial aimed to recruit 340 women within 24 months. Objective The objective of our study was to examine participant preintervention engagement and recruitment efficiency for the isafe trial, and to analyze dropout through the registration pathway, from recruitment to eligibility screening and consent, to completion of baseline measures. Methods In this case study, data collection sources included the trial recruitment log, Google Analytics reports, registration and program metadata, and costs. Analysis included a qualitative narrative of the recruitment experience and descriptive statistics of preintervention participant engagement and dropout rates. A Koyck model investigated the relationship between Web-based online marketing website advertisements (ads) and participant accrual. Results The isafe trial was launched on September 17, 2012. Placement of ads in an online classified advertising platform increased the average number of recruited participants per month from 2 to 25. Over the 23-month recruitment period, the registration website recorded 4176 unique visitors. Among 1003 women meeting eligibility criteria, 51.55% (517) consented to participate; among the 501 women who enrolled (consented, validated, and randomized), 412 (82.2%) were accrued (completed baseline assessments). The majority (n=52, 58%) of the 89 women who dropped out between enrollment and accrual never logged in to the allocated isafe website. Of every 4 accrued women, 3 (314/412, 76.2%) identified the classified ad as their referral source, followed by friends and family (52/412, 12.6%). Women recruited through a friend or relative were more likely to self-identify as indigenous Māori and live in the highest-deprivation areas. Ads increased the accrual rate by a factor of 74 (95% CI 49–112). Conclusions Print advertisements, website links, and networking were costly and inefficient methods for recruiting participants to a Web-based eHealth trial. Researchers are advised to limit their recruitment efforts to Web-based online marketplace and classified advertising platforms, as in the isafe case, or to social media. Online classified advertising in “Jobs–Other–volunteers” successfully recruited a diverse sample of women experiencing intimate partner violence. Preintervention recruitment data provide critical information to inform future research and critical analysis of Web-based eHealth trials. ClinicalTrial Australian New Zealand Clinical Trials Registry (ANZCTR): ACTRN12612000708853; https://www.anzctr.org.au/Trial/Registration/TrialReview.aspx?ACTRN=12612000708853 (Archived by WebCite at http://www.webcitation/6lMGuVXdK) PMID:27780796
Psychometric evaluation of the HIV symptom distress scale
Marc, Linda G.; Wang, Ming-Mei; Testa, Marcia A.
2012-01-01
The objective of this paper is to psychometrically validate the HIV Symptom Distress Scale (SDS), an instrument that can be used to measure overall HIV symptom distress or clinically relevant groups of HIV symptoms. A secondary data analysis was conducted using the Collaborations in HIV Outcomes Research U.S. Cohort (CHORUS). Inclusion criteria required study participants (N=5,521) to have a valid baseline measure of the AIDS Clinical Trial Group Symptom Distress Module, with an SF-12 or SF-36 completed on the same day. Psychometric testing assessed unidimensionality, internal consistency and factor structure using exploratory and confirmatory factor analysis, and structural equation modeling (SEM). Construct validity examined whether the new measure discriminates across clinical significance (CD4 and HIV viral load). Findings show that the SDS has high reliability (α=0.92), and SEM supports a correlated second-order factor model (physical and mental distress) with acceptable fit (GFI=0.88, AGFI=0.85, NFI=0.99, NNFI=0.99; RMSEA=0.06, [90% CI 0.06 – 0.06]; Satorra Bentler Scaled, C2 =3274.20; p=0.0). Construct validity shows significant differences across categories for HIV-1 viral load (p< 0.001) and CD4 (p< 0.001). Differences in mean SDS scores exist across gender (p< 0.001), race/ethnicity (p< 0.05) and educational attainment (p < 0.001). Hence, the HIV Symptom Distress Scale is a reliable and valid instrument, which measures overall HIV symptoms or clinically relevant groups of symptoms. PMID:22409246
Bothelius, Kristoffer; Jernelöv, Susanna; Fredrikson, Mats; McCracken, Lance M; Kaldo, Viktor
2015-11-01
Acceptance may be an important therapeutic process in sleep medicine, but valid psychometric instruments measuring acceptance related to sleep difficulties are lacking. The purpose of this study was to develop a measure of acceptance in insomnia, and to examine its factor structure as well as construct validity. In a cross-sectional design, a principal component analysis for item reduction was conducted on a first sample (A) and a confirmatory factor analysis on a second sample (B). Construct validity was tested on a combined sample (C). Questionnaire items were derived from a measure of acceptance in chronic pain, and data were gathered through screening or available from pretreatment assessments in four insomnia treatment trials, administered online, via bibliotherapy and in primary care. Adults with insomnia: 372 in sample A and 215 in sample B. Sample C (n = 820) included sample A and B with another 233 participants added. Construct validity was assessed through relations with established acceptance and sleep scales. The principal component analysis presented a two-factor solution with eight items, explaining 65.9% of the total variance. The confirmatory factor analysis supported the solution. Acceptance of sleep problems was more closely related to subjective symptoms and consequences of insomnia than to diary description of sleep, or to acceptance of general private events. The Sleep Problem Acceptance Questionnaire (SPAQ), containing the subscales "Activity Engagement" and "Willingness", is a valid tool to assess acceptance of insomnia. © 2015 Associated Professional Sleep Societies, LLC.
Patient-Reported Outcome Measures for Hand and Wrist Trauma
Dacombe, Peter Jonathan; Amirfeyz, Rouin; Davis, Tim
2016-01-01
Background: Patient-reported outcome measures (PROMs) are important tools for assessing outcomes following injuries to the hand and wrist. Many commonly used PROMs have no evidence of reliability, validity, and responsiveness in a hand and wrist trauma population. This systematic review examines the PROMs used in the assessment of hand and wrist trauma patients, and the evidence for reliability, validity, and responsiveness of each measure in this population. Methods: A systematic review of Pubmed, Medline, and CINAHL searching for randomized controlled trials of patients with traumatic injuries to the hand and wrist was carried out to identify the PROMs. For each identified PROM, evidence of reliability, validity, and responsiveness was identified using a further systematic review of the Pubmed, Medline, CINAHL, and reverse citation trail audit procedure. Results: The PROM used most often was the Disabilities of the Arm, Shoulder and Hand (DASH) questionnaire; the Patient-Rated Wrist Evaluation (PRWE), Gartland and Werley score, Michigan Hand Outcomes score, Mayo Wrist Score, and Short Form 36 were also commonly used. Only the DASH and PRWE have evidence of reliability, validity, and responsiveness in patients with traumatic injuries to the hand and wrist; other measures either have incomplete evidence or evidence gathered in a nontraumatic population. Conclusions: The DASH and PRWE both have evidence of reliability, validity, and responsiveness in a hand and wrist trauma population. Other PROMs used to assess hand and wrist trauma patients do not. This should be considered when selecting a PROM for patients with traumatic hand and wrist pathology. PMID:27418884
Randomized Trial of Asprin as Adjuvant Therapy for Node-Positive Breast Cancer
2016-10-01
subject to any penalty for failing to comply with a collection of information if it does not display a currently valid OMB control number. PLEASE DO...within the proposed time frame. 15. SUBJECT TERMS Breast cancer, adjuvant treatment, aspirin, randomized controlled trial 16. SECURITY...propose a randomized controlled trial to test that promise. There is compelling epidemiologic, in-vitro, and in-vivo, evidence of aspirin’s potential
Large Cancer Drug Trial Helps Move Precision Medicine Toward the Mainstream | FNLCR Staging
A landmark cancer drug trial is helping set the stage for moving precision medicine into the mainstream of clinical practice, according to a new study. The study, reported in the Journal of Molecular Diagnostics, validates a procedure used in the dru
WE-B-207-00: CT Lung Cancer Screening Part 1
DOE Office of Scientific and Technical Information (OSTI.GOV)
NONE
2015-06-15
The US National Lung Screening Trial (NLST) was a multi-center randomized, controlled trial comparing a low-dose CT (LDCT) to posterior-anterior (PA) chest x-ray (CXR) in screening older, current and former heavy smokers for early detection of lung cancer. Recruitment was launched in September 2002 and ended in April 2004 when 53,454 participants had been randomized at 33 screening sites in equal proportions. Funded by the National Cancer Institute this trial demonstrated that LDCT screening reduced lung cancer mortality. The US Preventive Services Task Force (USPSTF) cited NLST findings and conclusions in its deliberations and analysis of lung cancer screening. Undermore » the 2010 Patient Protection and Affordable Care Act, the USPSTF favorable recommendation regarding lung cancer CT screening assisted in obtaining third-party payers coverage for screening. The objective of this session is to provide an introduction to the NLST and the trial findings, in addition to a comprehensive review of the dosimetry investigations and assessments completed using individual NLST participant CT and CXR examinations. Session presentations will review and discuss the findings of two independent assessments, a CXR assessment and the findings of a CT investigation calculating individual organ dosimetry values. The CXR assessment reviewed a total of 73,733 chest x-ray exams that were performed on 92 chest imaging systems of which 66,157 participant examinations were used. The CT organ dosimetry investigation collected scan parameters from 23,773 CT examinations; a subset of the 75,133 CT examinations performed using 97 multi-detector CT scanners. Organ dose conversion coefficients were calculated using a Monte Carlo code. An experimentally-validated CT scanner simulation was coupled with 193 adult hybrid computational phantoms representing the height and weight of the current U.S. population. The dose to selected organs was calculated using the organ dose library and the abstracted scan parameters. This session will review the results and summarize the individualized doses to major organs and the mean effective dose and CTDIvol estimate for 66,157 PA chest and 23,773 CT examinations respectively, using size-dependent computational phantoms coupled with Monte Carlo calculations. Learning Objectives: Review and summarize relevant NLST findings and conclusions. Understand the scope and scale of the NLST specific to participant dosimetry. Provide a comprehensive review of NLST participant dosimetry assessments. Summarize the results of an investigation providing individualized organ dose estimates for NLST participant cohorts.« less
Jull, J; Whitehead, M; Petticrew, M; Kristjansson, E; Gough, D; Petkovic, J; Volmink, J; Weijer, C; Taljaard, M; Edwards, S; Mbuagbaw, L; Cookson, R; McGowan, J; Lyddiatt, A; Boyer, Y; Cuervo, L G; Armstrong, R; White, H; Yoganathan, M; Pantoja, T; Shea, B; Pottie, K; Norheim, O; Baird, S; Robberstad, B; Sommerfelt, H; Asada, Y; Wells, G; Tugwell, P; Welch, V
2017-01-01
Background Randomised controlled trials can provide evidence relevant to assessing the equity impact of an intervention, but such information is often poorly reported. We describe a conceptual framework to identify health equity-relevant randomised trials with the aim of improving the design and reporting of such trials. Methods An interdisciplinary and international research team engaged in an iterative consensus building process to develop and refine the conceptual framework via face-to-face meetings, teleconferences and email correspondence, including findings from a validation exercise whereby two independent reviewers used the emerging framework to classify a sample of randomised trials. Results A randomised trial can usefully be classified as ‘health equity relevant’ if it assesses the effects of an intervention on the health or its determinants of either individuals or a population who experience ill health due to disadvantage defined across one or more social determinants of health. Health equity-relevant randomised trials can either exclusively focus on a single population or collect data potentially useful for assessing differential effects of the intervention across multiple populations experiencing different levels or types of social disadvantage. Trials that are not classified as ‘health equity relevant’ may nevertheless provide information that is indirectly relevant to assessing equity impact, including information about individual level variation unrelated to social disadvantage and potentially useful in secondary modelling studies. Conclusion The conceptual framework may be used to design and report randomised trials. The framework could also be used for other study designs to contribute to the evidence base for improved health equity. PMID:28951402
Factors influencing participation of psychiatry inpatients in clinical trials.
Mopuru, Nandeeshwar Reddy; Jose, Sam Padamadan; Viswanath, Biju; Kumar, C Naveen; Math, Suresh Bada; Thirthalli, Jagadisha
2018-02-01
Serious concerns have arisen in recent years regarding the unethical and illegal practices resorted to during clinical trials. Clinical trials in psychiatry are further complicated by issues such as 'validity of consent' and 'decision making capacity' of patients. This study was planned to explore the factors determining patient participation in clinical trials. A random sample of 123 consenting psychiatry inpatients were provided the information and consent-form of a hypothetical clinical drug trial. They were interviewed regarding their decision, the decision maker and factors that led to the decision. Family members tended to be the decision makers when patients were females, had low-income, were from rural background or had severe illnesses. Anticipated side effects and not wanting to interfere with existing treatment were the common reasons for refusal to participate while hope of betterment of the patient and benefit to humanity were cited for consent. The educated, urban, affluent class had more awareness regarding unethical trials and tended to be mistrustful of the medical community leading to higher rates of non-participation. Those who were adherent with ongoing treatment were also unwilling to participate. The lesser educated, low-income patients and rural domicile patients on the other hand had lesser awareness regarding clinical trials, trusted doctors and were more likely to participate. A good doctor-patient relationship, detailed explanations and clarification regarding the study and its conduct, and building awareness regarding clinical trials among vulnerable groups is necessary to ensure a valid consent involving no coercion, removal of prejudices, and ethical conduct of trials. Copyright © 2017 Elsevier B.V. All rights reserved.
Roozenbeek, Bob; Lingsma, Hester F.; Lecky, Fiona E.; Lu, Juan; Weir, James; Butcher, Isabella; McHugh, Gillian S.; Murray, Gordon D.; Perel, Pablo; Maas, Andrew I.R.; Steyerberg, Ewout W.
2012-01-01
Objective The International Mission on Prognosis and Analysis of Clinical Trials (IMPACT) and Corticoid Randomisation After Significant Head injury (CRASH) prognostic models predict outcome after traumatic brain injury (TBI) but have not been compared in large datasets. The objective of this is study is to validate externally and compare the IMPACT and CRASH prognostic models for prediction of outcome after moderate or severe TBI. Design External validation study. Patients We considered 5 new datasets with a total of 9036 patients, comprising three randomized trials and two observational series, containing prospectively collected individual TBI patient data. Measurements Outcomes were mortality and unfavourable outcome, based on the Glasgow Outcome Score (GOS) at six months after injury. To assess performance, we studied the discrimination of the models (by AUCs), and calibration (by comparison of the mean observed to predicted outcomes and calibration slopes). Main Results The highest discrimination was found in the TARN trauma registry (AUCs between 0.83 and 0.87), and the lowest discrimination in the Pharmos trial (AUCs between 0.65 and 0.71). Although differences in predictor effects between development and validation populations were found (calibration slopes varying between 0.58 and 1.53), the differences in discrimination were largely explained by differences in case-mix in the validation studies. Calibration was good, the fraction of observed outcomes generally agreed well with the mean predicted outcome. No meaningful differences were noted in performance between the IMPACT and CRASH models. More complex models discriminated slightly better than simpler variants. Conclusions Since both the IMPACT and the CRASH prognostic models show good generalizability to more recent data, they are valid instruments to quantify prognosis in TBI. PMID:22511138
Imamura, Masaaki; Usui, Tomoko; Johnin, Kazuyoshi; Yoshimura, Koji; Farhat, Walid; Kanematsu, Akihiro; Ogawa, Osamu
2014-07-01
Validated questionnaire for evaluation of pediatric lower urinary tract symptoms (LUTS) is of a great need. We performed cross-cultural validated adaptation of Dysfunctional Voiding Symptom Score (DVSS) to Japanese language, and assessed whether children understand and respond to questionnaire correctly, using cognitive linguistic approach. We translated DVSS into two Japanese versions according to a standard validation methodology: translation, synthesis, back-translation, expert review, and pre-testing. One version was written in adult language for parents, and the other was written in child language for children. Pre-testing was done with 5 to 15-year-old patients visiting us, having normal intelligence. A specialist in cognitive linguistics observed the response by children and parents to DVSS as an interviewer. When a child could not understand a question without adding or paraphrasing the question by the parents, it was defined as 'misidentification'. We performed pretesting with 2 trial versions of DVSS before having the final version. The pre-testing for the first trial version was done for 32 patients (male to female ratio was 19 : 13). The pre-testing for the second trial version was done for 11 patients (male to female ratio was 8 : 3). In DVSS in child language, misidentification was consistently observed for representation of time or frequency. We completed the formal validated translation by amending the problems raised in the pre-testing. The cross-cultural validated adaptation of DVSS to child and adult Japanese was completed. Since temporal perception is not fully developed in children, caution should be taken for using the terms related with time or frequency in the questionnaires for children.
Mertz, Marcel; Sofaer, Neema; Strech, Daniel
2014-09-27
The systematic review of reasons is a new way to obtain comprehensive information about specific ethical topics. One such review was carried out for the question of why post-trial access to trial drugs should or need not be provided. The objective of this study was to empirically validate this review using an author check method. The article also reports on methodological challenges faced by our study. We emailed a questionnaire to the 64 corresponding authors of those papers that were assessed in the review of reasons on post-trial access. The questionnaire consisted of all quotations ("reason mentions") that were identified by the review to represent a reason in a given author's publication, together with a set of codings for the quotations. The authors were asked to rate the correctness of the codings. We received 19 responses, from which only 13 were completed questionnaires. In total, 98 quotations and their related codes in the 13 questionnaires were checked by the addressees. For 77 quotations (79%), all codings were deemed correct, for 21 quotations (21%), some codings were deemed to need correction. Most corrections were minor and did not imply a complete misunderstanding of the citation. This first attempt to validate a review of reasons leads to four crucial methodological questions relevant to the future conduct of such validation studies: 1) How can a description of a reason be deemed incorrect? 2) Do the limited findings of this author check study enable us to determine whether the core results of the analysed SRR are valid? 3) Why did the majority of surveyed authors refrain from commenting on our understanding of their reasoning? 4) How can the method for validating reviews of reasons be improved?
Updating the OMERACT Filter: Implications for imaging and soluble biomarkers
D’Agostino, Maria-Antonietta; Boers, Maarten; Kirwan, John; van der Heijde, Desirée; Østergaard, Mikkel; Schett, Georg; Landewé, Robert B.M.; Maksymowych, Walter P.; Naredo, Esperanza; Dougados, Maxime; Iagnocco, Annamaria; Bingham, Clifton O.; Brooks, Peter; Beaton, Dorcas; Gandjbakhch, Frederique; Gossec, Laure; Guillemin, Francis; Hewlett, Sarah; Kloppenburg, Margreet; March, Lyn; Mease, Philip J; Moller, Ingrid; Simon, Lee S; Singh, Jasvinder A; Strand, Vibeke; Wakefield, Richard J; Wells, George; Tugwell, Peter; Conaghan, Philip G
2014-01-01
Objective The OMERACT Filter provides a framework for the validation of outcome measures for use in rheumatology clinical research. However, imaging and biochemical measures may face additional validation challenges due to their technical nature. The Imaging and Soluble Biomarker Session at OMERACT 11 aimed to provide a guide for the iterative development of an imaging or biochemical measurement instrument so it can be used in therapeutic assessment. Methods A hierarchical structure was proposed, reflecting 3 dimensions needed for validating an imaging or biochemical measurement instrument: outcome domain(s), study setting and performance of the instrument. Movement along the axes in any dimension reflects increasing validation. For a given test instrument, the 3-axis structure assesses the extent to which the instrument is a validated measure for the chosen domain, whether it assesses a patient or disease centred-variable, and whether its technical performance is adequate in the context of its application. Some currently used imaging and soluble biomarkers for rheumatoid arthritis, spondyloarthritis and knee osteoarthritis were then evaluated using the original OMERACT filter and the newly proposed structure. Break-out groups critically reviewed the extent to which the candidate biomarkers complied with the proposed step-wise approach, as a way of examining the utility of the proposed 3 dimensional structure. Results Although there was a broad acceptance of the value of the proposed structure in general, some areas for improvement were suggested including clarification of criteria for achieving a certain level of validation and how to deal with extension of the structure to areas beyond clinical trials. Conclusion General support was obtained for a proposed tri-axis structure to assess validation of imaging and soluble biomarkers; nevertheless, additional work is required to better evaluate its place within the OMERACT Filter 2.0. PMID:24584916
Updating the OMERACT filter: implications for imaging and soluble biomarkers.
D'Agostino, Maria-Antonietta; Boers, Maarten; Kirwan, John; van der Heijde, Désirée; Østergaard, Mikkel; Schett, Georg; Landewé, Robert B; Maksymowych, Walter P; Naredo, Esperanza; Dougados, Maxime; Iagnocco, Annamaria; Bingham, Clifton O; Brooks, Peter M; Beaton, Dorcas E; Gandjbakhch, Frederique; Gossec, Laure; Guillemin, Francis; Hewlett, Sarah E; Kloppenburg, Margreet; March, Lyn; Mease, Philip J; Moller, Ingrid; Simon, Lee S; Singh, Jasvinder A; Strand, Vibeke; Wakefield, Richard J; Wells, George A; Tugwell, Peter; Conaghan, Philip G
2014-05-01
The Outcome Measures in Rheumatology (OMERACT) Filter provides a framework for the validation of outcome measures for use in rheumatology clinical research. However, imaging and biochemical measures may face additional validation challenges because of their technical nature. The Imaging and Soluble Biomarker Session at OMERACT 11 aimed to provide a guide for the iterative development of an imaging or biochemical measurement instrument so it can be used in therapeutic assessment. A hierarchical structure was proposed, reflecting 3 dimensions needed for validating an imaging or biochemical measurement instrument: outcome domain(s), study setting, and performance of the instrument. Movement along the axes in any dimension reflects increasing validation. For a given test instrument, the 3-axis structure assesses the extent to which the instrument is a validated measure for the chosen domain, whether it assesses a patient-centered or disease-centered variable, and whether its technical performance is adequate in the context of its application. Some currently used imaging and soluble biomarkers for rheumatoid arthritis, spondyloarthritis, and knee osteoarthritis were then evaluated using the original OMERACT Filter and the newly proposed structure. Breakout groups critically reviewed the extent to which the candidate biomarkers complied with the proposed stepwise approach, as a way of examining the utility of the proposed 3-dimensional structure. Although there was a broad acceptance of the value of the proposed structure in general, some areas for improvement were suggested including clarification of criteria for achieving a certain level of validation and how to deal with extension of the structure to areas beyond clinical trials. General support was obtained for a proposed tri-axis structure to assess validation of imaging and soluble biomarkers; nevertheless, additional work is required to better evaluate its place within the OMERACT Filter 2.0.
Maxwell, Annette E; Stewart, Susan L; Glenn, Beth A; Wong, Weng Kee; Yasui, Yutaka; Chang, L Cindy; Taylor, Victoria M; Nguyen, Tung T; Chen, Moon S; Bastani, Roshan
2012-01-01
Few studies have examined theoretically informed constructs related to hepatitis B (HBV) testing, and comparisons across studies are challenging due to lack of uniformity in constructs assessed. The present analysis examined relationships among Health Behavior Framework factors across four Asian American groups to advance the development of theory-based interventions for HBV testing in at-risk populations. Data were collected from 2007-2010 as part of baseline surveys during four intervention trials promoting HBV testing among Vietnamese-, Hmong-, Korean- and Cambodian-Americans (n = 1,735). Health Behavior Framework constructs assessed included: awareness of HBV, knowledge of transmission routes, perceived susceptibility, perceived severity, doctor recommendation, stigma of HBV infection, and perceived efficacy of testing. Within each group we assessed associations between our intermediate outcome of knowledge of HBV transmission and other constructs, to assess the concurrent validity of our model and instruments. While the absolute levels for Health Behavior Framework factors varied across groups, relationships between knowledge and other factors were generally consistent. This suggests similarities rather than differences with respect to posited drivers of HBV-related behavior. Our findings indicate that Health Behavior Framework constructs are applicable to diverse ethnic groups and provide preliminary evidence for the construct validity of the Health Behavior Framework.
A new method to identify the location of the kick point during the golf swing.
Joyce, Christopher; Burnett, Angus; Matthews, Miccal
2013-12-01
No method currently exists to determine the location of the kick point during the golf swing. This study consisted of two phases. In the first phase, the static kick point of 10 drivers (having identical grip and head but fitted with shafts of differing mass and stiffness) was determined by two methods: (1) a visual method used by professional club fitters and (2) an algorithm using 3D locations of markers positioned on the golf club. Using level of agreement statistics, we showed the latter technique was a valid method to determine the location of the static kick point. In phase two, the validated method was used to determine the dynamic kick point during the golf swing. Twelve elite male golfers had three shots analyzed for two drivers fitted with stiff shafts of differing mass (56 g and 78 g). Excellent between-trial reliability was found for dynamic kick point location. Differences were found for dynamic kick point location when compared with static kick point location, as well as between-shaft and within-shaft. These findings have implications for future investigations examining the bending behavior of golf clubs, as well as being useful to examine relationships between properties of the shaft and launch parameters.
Maxwell, AE; Stewart, SL; Glenn, BA; Wong, WK; Yasui, Y; Chang, LC; Taylor, VM; Nguyen, TT; Chen, MS; Bastani, R
2012-01-01
Background Few studies have examined theoretically informed constructs related to hepatitis B (HBV) testing, and comparisons across studies is challenging due to lack of uniformity in constructs assessed. This analysis examines relationships among Health Behavior Framework factors across four Asian American groups to advance the development of theory-based interventions for HBV testing in at-risk populations. Methods Data were collected from 2007–2010 as part of baseline surveys during four intervention trials promoting HBV testing among Vietnamese-, Hmong-, Korean- and Cambodian-Americans (n = 1,735). Health Behavior Framework constructs assessed included: awareness of HBV, knowledge of transmission routes, perceived susceptibility, perceived severity, doctor recommendation, stigma of HBV infection, and perceived efficacy of testing. Within each group we assessed associations between our intermediate outcome of knowledge of HBV transmission and other constructs, to assess the concurrent validity of our model and instruments. Results While the absolute levels for Health Behavior Framework factors varied across groups, relationships between knowledge and other factors were generally consistent. This suggests similarities rather than differences with respect to posited drivers of HBV-related behavior. Discussion Our findings indicate that Health Behavior Framework constructs are applicable to diverse ethnic groups and provide preliminary evidence for the construct validity of the Health Behavior Framework. PMID:22799389
Center-Within-Trial Versus Trial-Level Evaluation of Surrogate Endpoints.
Renfro, Lindsay A; Shi, Qian; Xue, Yuan; Li, Junlong; Shang, Hongwei; Sargent, Daniel J
2014-10-01
Evaluation of candidate surrogate endpoints using individual patient data from multiple clinical trials is considered the gold standard approach to validate surrogates at both patient and trial levels. However, this approach assumes the availability of patient-level data from a relatively large collection of similar trials, which may not be possible to achieve for a given disease application. One common solution to the problem of too few similar trials involves performing trial-level surrogacy analyses on trial sub-units (e.g., centers within trials), thereby artificially increasing the trial-level sample size for feasibility of the multi-trial analysis. To date, the practical impact of treating trial sub-units (centers) identically to trials in multi-trial surrogacy analyses remains unexplored, and conditions under which this ad hoc solution may in fact be reasonable have not been identified. We perform a simulation study to identify such conditions, and demonstrate practical implications using a multi-trial dataset of patients with early stage colon cancer.
Center-Within-Trial Versus Trial-Level Evaluation of Surrogate Endpoints
Renfro, Lindsay A.; Shi, Qian; Xue, Yuan; Li, Junlong; Shang, Hongwei; Sargent, Daniel J.
2014-01-01
Evaluation of candidate surrogate endpoints using individual patient data from multiple clinical trials is considered the gold standard approach to validate surrogates at both patient and trial levels. However, this approach assumes the availability of patient-level data from a relatively large collection of similar trials, which may not be possible to achieve for a given disease application. One common solution to the problem of too few similar trials involves performing trial-level surrogacy analyses on trial sub-units (e.g., centers within trials), thereby artificially increasing the trial-level sample size for feasibility of the multi-trial analysis. To date, the practical impact of treating trial sub-units (centers) identically to trials in multi-trial surrogacy analyses remains unexplored, and conditions under which this ad hoc solution may in fact be reasonable have not been identified. We perform a simulation study to identify such conditions, and demonstrate practical implications using a multi-trial dataset of patients with early stage colon cancer. PMID:25061255
Banks, Jamie L; Marotta, Charles A
2007-03-01
The modified Rankin scale (mRS), a clinician-reported measure of global disability, is widely applied for evaluating stroke patient outcomes and as an end point in randomized clinical trials. Extensive evidence on the validity of the mRS exists across a large but fragmented literature. As new treatments for acute ischemic stroke are submitted for agency approval, an appreciation of the mRS's attributes, specifically its relationship to other stroke evaluation scales, would be valuable for decision-makers to properly assess the impact of a new drug on treatment paradigms. The purpose of this report is to assemble and systematically assess the properties of the mRS to provide decision-makers with pertinent evaluative information. A Medline search was conducted to identify reports in the peer-reviewed medical literature (1957-2006) that provide information on the structure, validation, scoring, and psychometric properties of the mRS and its use in clinical trials. The selection of articles was based on defined criteria that included relevance, study design and use of appropriate statistical methods. Of 224 articles identified by the literature search, 50 were selected for detailed assessment. Inter-rater reliability with the mRS is moderate and improves with structured interviews (kappa 0.56 versus 0.78); strong test-re-test reliability (kappa=0.81 to 0.95) has been reported. Numerous studies demonstrate the construct validity of the mRS by its relationships to physiological indicators such as stroke type, lesion size, perfusion and neurological impairment. Convergent validity between the mRS and other disability scales is well documented. Patient comorbidities and socioeconomic factors should be considered in properly applying and interpreting the mRS. Recent analyses suggest that randomized clinical trials of acute stroke treatments may require a smaller sample size if the mRS is used as a primary end point rather than the Barthel Index. Multiple types of evidence attest to the validity and reliability of the mRS. The reported data support the view that the mRS is a valuable instrument for assessing the impact of new stroke treatments.
Abbas, Ismail; Rovira, Joan; Casanovas, Josep
2006-12-01
To develop and validate a model of a clinical trial that evaluates the changes in cholesterol level as a surrogate marker for lipodystrophy in HIV subjects under alternative antiretroviral regimes, i.e., treatment with Protease Inhibitors vs. a combination of nevirapine and other antiretroviral drugs. Five simulation models were developed based on different assumptions, on treatment variability and pattern of cholesterol reduction over time. The last recorded cholesterol level, the difference from the baseline, the average difference from the baseline and level evolution, are the considered endpoints. Specific validation criteria based on a 10% minus or plus standardized distance in means and variances were used to compare the real and the simulated data. The validity criterion was met by all models for considered endpoints. However, only two models met the validity criterion when all endpoints were considered. The model based on the assumption that within-subjects variability of cholesterol levels changes over time is the one that minimizes the validity criterion, standardized distance equal to or less than 1% minus or plus. Simulation is a useful technique for calibration, estimation, and evaluation of models, which allows us to relax the often overly restrictive assumptions regarding parameters required by analytical approaches. The validity criterion can also be used to select the preferred model for design optimization, until additional data are obtained allowing an external validation of the model.
The internal validity of arthroscopic simulators and their effectiveness in arthroscopic education.
Slade Shantz, Jesse Alan; Leiter, Jeff R S; Gottschalk, Tania; MacDonald, Peter Benjamin
2014-01-01
The purpose of this systematic review was to identify standard procedures for the validation of arthroscopic simulators and determine whether simulators improve the surgical skills of users. Arthroscopic simulator validation studies and randomized trials assessing the effectiveness of arthroscopic simulators in education were identified from online databases, as well as, grey literature and reference lists. Only validation studies and randomized trials were included for review. Study heterogeneity was calculated and where appropriate, study results were combined employing a random effects model. Four hundred and thirteen studies were reviewed. Thirteen studies met the inclusion criteria assessing the construct validity of simulators. A pooled analysis of internal validation studies determined that simulators could discriminate between novice and experts, but not between novice and intermediate trainees on time of completion of a simulated task. Only one study assessed the utility of a knee simulator in training arthroscopic skills directly and demonstrated that the skill level of simulator-trained residents was greater than non-simulator-trained residents. Excessive heterogeneity exists in the literature to determine the internal and transfer validity of arthroscopic simulators currently available. Evidence suggests that simulators can discriminate between novice and expert users, but discrimination between novice and intermediate trainees in surgical education should be paramount. International standards for the assessment of arthroscopic simulator validity should be developed to increase the use and effectiveness of simulators in orthopedic surgery.
Schmitt, Jochen; Spuls, Phyllis I; Thomas, Kim S; Simpson, Eric; Furue, Masutaka; Deckert, Stefanie; Dohil, Magdalene; Apfelbacher, Christian; Singh, Jasvinder A; Chalmers, Joanne; Williams, Hywel C
2014-10-01
The lack of core outcome sets for atopic eczema (AE) is a major obstacle for advancing evidence-based treatment. The global Harmonising Outcome Measures for Eczema (HOME) initiative has already defined clinical signs, symptoms, quality of life, and long-term control of flares as core outcome domains for AE trials. This article deals with the standardization of measurement instruments to assess clinical signs of AE. To resolve the current lack of standardization of the assessment of clinical signs of AE, we followed a structured process of systematic reviews and international consensus sessions to identify 1 core outcome measurement instrument for assessment of clinical signs in all future AE trials. Systematic reviews indicated that from 16 different instruments identified to assess clinical signs of AE, only the Eczema Area and Severity Index (EASI) and the objective Scoring Atopic Dermatitis (SCORAD) index were identified as extensively validated. The EASI has adequate validity, responsiveness, internal consistency, and intraobserver reliability. The objective SCORAD index has adequate validity, responsiveness, and interobserver reliability but unclear intraobserver reliability to measure clinical signs of AE. In an international consensus study, patients, physicians, nurses, methodologists, and pharmaceutical industry representatives agreed that the EASI is the preferred core instrument to measure clinical signs in all future AE trials. All stakeholders involved in designing, reporting, and using clinical trials on AE are asked to comply with this consensus to enable better evidence-based decision making, clearer scientific communication, and improved patient care. Copyright © 2014 American Academy of Allergy, Asthma & Immunology. Published by Elsevier Inc. All rights reserved.
van der Aalst, Carlijn M; de Koning, Harry J
2016-04-01
Smoking is the main cause of lung cancer, so data linked to smoking behaviour are important in lung cancer screening trials. However, self-reporting data concerning smoking behaviour are mainly used. The aim of this study was to biochemically determine the validity and reliability of self-reported smoking status among smokers at high risk for developing lung cancer participating in the Dutch-Belgian lung cancer screening (NELSON) trial. For this sub study, a random sample of 475 men was selected who were scheduled for the fourth screening round in the NELSON trial. They were asked to complete a short questionnaire to verify the smoking behaviour for the previous seven days and a blood sample was collected to measure the cotinine level. The validity (sensitivity (Se), specificity (Sp), positive predictive value (PPV), and negative predictive value (NPV)) and reliability (Kappa) of the self-reported smoking status compared to the cotinine level (as golden standard) were determined. Both a completed questionnaire as well as a cotinine level were available for 199 (41.9%) participants. Based on these data, Se and Sp were respectively 98% (95%-Confidence Interval (CI): 91-99) and 98% (95%-CI: 93-100). PPV and NPV were 98% and 96% and Kappa was 0.96. In conclusion, the validity of the self-reported smoking status turned out to be reliable amongst men at high risk for developing lung cancer who participate in the NELSON lung cancer screening trial. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Swigris, Jeffrey J.; Han, Meilan; Vij, Rekha; Noth, Imre; Eisenstein, Eric L.; Anstrom, Kevin J.; Brown, Kevin K.; Fairclough, Diane
2012-01-01
Background Idiopathic pulmonary fibrosis (IPF) is a progressive interstitial lung disease that often causes disabling dyspnea. In IPF and other lung diseases, patient-reported outcomes (PROs)—questionnaires designed to gather information from the patient's perspective—can determine whether therapies affect dyspnea or other outcomes meaningful to patients. Before a PRO can be used confidently as an outcome measure in a longitudinal trial, studies must demonstrate the PRO's ability to capture change over time in the target population. Our goal in this study was to examine whether the UCSD Shortness of Breath Questionnaire does so in patients with IPF. Methods We used data from the Sildenafil Trial of Exercise Performance in Idiopathic Pulmonary Fibrosis (STEP-IPF) to perform analyses that examined associations between UCSD scores and five external measures (anchors) at baseline and over time. Anchors included the Activity domain from St. George's Respiratory Questionnaire (SGRQ-A), the Physical Functioning domain from the SF-36 (SF36-PF), forced vital capacity (FVC), diffusing capacity of the lung for carbon monoxide (DLCO), and distance walked during a timed walk test (6MWD). Linear regression models were used to examine relationships between UCSD scores and anchors over time. Results At baseline, UCSD scores were weakly correlated with percent predicted FVC (−0.21, p=0.005) and percent predicted DLCO (−0.20, p=0.008), moderately correlated with 6MWD (−0.39, p<0.0001) and strongly correlated with SGRQ-A (0.79, p<0.0001) and SF36-PF (−0.72, p<0.0001). Change over time in UCSD scores was associated with change in FVC (estimate=2.54, standard error [SE]=1.23, p=0.04), SGRQ-A (estimate=7.94, SE=1.11, p<0.0001), SF36-PF (estimate=6.00, SE=1.13, p<0.0001), and 6MWD (estimate=4.23, SE=1.18, p=0.0004) but not DLCO (estimate=0.33, SE=1.33, p=0.80). Conclusions These results support the validity of the UCSD to assess change in dyspnea over time in patients with IPF. PMID:22801586
Chase, J Geoffrey; Preiser, Jean-Charles; Dickson, Jennifer L; Pironet, Antoine; Chiew, Yeong Shiong; Pretty, Christopher G; Shaw, Geoffrey M; Benyo, Balazs; Moeller, Knut; Safaei, Soroush; Tawhai, Merryn; Hunter, Peter; Desaive, Thomas
2018-02-20
Critical care, like many healthcare areas, is under a dual assault from significantly increasing demographic and economic pressures. Intensive care unit (ICU) patients are highly variable in response to treatment, and increasingly aging populations mean ICUs are under increasing demand and their cohorts are increasingly ill. Equally, patient expectations are growing, while the economic ability to deliver care to all is declining. Better, more productive care is thus the big challenge. One means to that end is personalised care designed to manage the significant inter- and intra-patient variability that makes the ICU patient difficult. Thus, moving from current "one size fits all" protocolised care to adaptive, model-based "one method fits all" personalised care could deliver the required step change in the quality, and simultaneously the productivity and cost, of care. Computer models of human physiology are a unique tool to personalise care, as they can couple clinical data with mathematical methods to create subject-specific models and virtual patients to design new, personalised and more optimal protocols, as well as to guide care in real-time. They rely on identifying time varying patient-specific parameters in the model that capture inter- and intra-patient variability, the difference between patients and the evolution of patient condition. Properly validated, virtual patients represent the real patients, and can be used in silico to test different protocols or interventions, or in real-time to guide care. Hence, the underlying models and methods create the foundation for next generation care, as well as a tool for safely and rapidly developing personalised treatment protocols over large virtual cohorts using virtual trials. This review examines the models and methods used to create virtual patients. Specifically, it presents the models types and structures used and the data required. It then covers how to validate the resulting virtual patients and trials, and how these virtual trials can help design and optimise clinical trial. Links between these models and higher order, more complex physiome models are also discussed. In each section, it explores the progress reported up to date, especially on core ICU therapies in glycemic, circulatory and mechanical ventilation management, where high cost and frequency of occurrence provide a significant opportunity for model-based methods to have measurable clinical and economic impact. The outcomes are readily generalised to other areas of medical care.
Outcome Measures Used in Clinical Trials for Behçet Syndrome: A Systematic Review
Hatemi, Gulen; Merkel, Peter A.; Hamuryudan, Vedat; Boers, Maarten; Direskeneli, Haner; Aydin, Sibel Z.; Yazici, Hasan
2015-01-01
Behçet syndrome (BS) is a multisystem vasculitis that is most active during young adulthood, causing serious disability and significant impairment in quality of life. Differences in the disease course, severity, and organ involvement between patients, depending on the age at presentation and sex, makes it impossible to determine a single management strategy. The diversity and variability in the outcome measures used in clinical trials in BS makes it difficult to compare the results or inform physicians about the best management strategy for individual patients. There is a large unmet need to determine or develop validated outcome measures for use in clinical trials in BS that are acceptable to researchers and regulatory agencies. We conducted a systematic review to describe the outcomes and outcome measures that have been used in clinical trials in BS. This review revealed the diversity and variability in the outcomes and outcome measures and the lack of standard definitions for most outcomes and rarity of validated outcome tools for disease assessment in BS. This systematic literature review will identify domains and candidate instruments for use in a Delphi exercise, the next step in the development of a core set of outcome measures that are properly validated and widely accepted by the collaboration of researchers from many different regions of the world and from different specialties, including rheumatology, ophthalmology, dermatology, gastroenterology, and neurology. PMID:24488418
Outcome measures used in clinical trials for Behçet syndrome: a systematic review.
Hatemi, Gulen; Merkel, Peter A; Hamuryudan, Vedat; Boers, Maarten; Direskeneli, Haner; Aydin, Sibel Z; Yazici, Hasan
2014-03-01
Behçet syndrome (BS) is a multisystem vasculitis that is most active during young adulthood, causing serious disability and significant impairment in quality of life. Differences in the disease course, severity, and organ involvement between patients, depending on the age at presentation and sex, makes it impossible to determine a single management strategy. The diversity and variability in the outcome measures used in clinical trials in BS makes it difficult to compare the results or inform physicians about the best management strategy for individual patients. There is a large unmet need to determine or develop validated outcome measures for use in clinical trials in BS that are acceptable to researchers and regulatory agencies. We conducted a systematic review to describe the outcomes and outcome measures that have been used in clinical trials in BS. This review revealed the diversity and variability in the outcomes and outcome measures and the lack of standard definitions for most outcomes and rarity of validated outcome tools for disease assessment in BS. This systematic literature review will identify domains and candidate instruments for use in a Delphi exercise, the next step in the development of a core set of outcome measures that are properly validated and widely accepted by the collaboration of researchers from many different regions of the world and from different specialties, including rheumatology, ophthalmology, dermatology, gastroenterology, and neurology.
The Development, Validation, and Utility of the Diabetes Prevention Trial-Type 1 Risk Score (DPTRS)
Sosenko, Jay M.; Skyler, Jay S.; Palmer, Jerry P.
2016-01-01
Summary This report details the development, validation, and utility of the Diabetes Prevention Trial-Type 1 (DPT-1) Risk Score (DPTRS) for type 1 diabetes (T1D). Proportional hazards regression was used to develop the DPTRS model which includes the glucose and C-peptide sums from oral glucose tolerance tests at 30, 60, 90, and 120 minutes, the log fasting C-peptide, age, and the log BMI. The DPTRS was externally validated in the TrialNet Natural History Study cohort (TNNHS). In a study of the application of the DPTRS, the findings showed that it could be used to identify normoglycemic individuals who were at a similar risk for T1D as those with dysglycemia. The DPTRS could also be used to identify lower risk dysglycemic individuals. Risk estimates of individuals deemed to be at higher risk according to DPTRS values did not differ significantly between the DPT-1 and the TNNHS, whereas the risk estimates for those with dysglycemia were significantly higher in DPT-1. Individuals with very high DPTRS values were found to be at such marked risk for T1D that they could reasonably be considered to be in a pre-diabetic state. The findings indicate that the DPTRS has utility in T1D prevention trials and for identifying pre-diabetic individuals. PMID:26077017
The development, validation, and utility of the Diabetes Prevention Trial-Type 1 Risk Score (DPTRS).
Sosenko, Jay M; Skyler, Jay S; Palmer, Jerry P
2015-08-01
This report details the development, validation, and utility of the Diabetes Prevention Trial-Type 1 (DPT-1) Risk Score (DPTRS) for type 1 diabetes (T1D). Proportional hazards regression was used to develop the DPTRS model which includes the glucose and C-peptide sums from oral glucose tolerance tests at 30, 60, 90, and 120 min, the log fasting C-peptide, age, and the log BMI. The DPTRS was externally validated in the TrialNet Natural History Study cohort (TNNHS). In a study of the application of the DPTRS, the findings showed that it could be used to identify normoglycemic individuals who were at a similar risk for T1D as those with dysglycemia. The DPTRS could also be used to identify lower risk dysglycemic individuals. Risk estimates of individuals deemed to be at higher risk according to DPTRS values did not differ significantly between the DPT-1 and the TNNHS; whereas, the risk estimates for those with dysglycemia were significantly higher in DPT-1. Individuals with very high DPTRS values were found to be at such marked risk for T1D that they could reasonably be considered to be in a pre-diabetic state. The findings indicate that the DPTRS has utility in T1D prevention trials and for identifying pre-diabetic individuals.
Ma, Alison; Parkin, Lianne
2015-09-04
To determine whether pharmaceutical advertisement claims targeting health professionals were supported by the randomised controlled trials (RCTs) cited in the advertisements, and to assess the risk of bias in those trials. Pharmaceutical advertisements were obtained from New Zealand Doctor and Pharmacy Today for the period July 2013 to June 2014. All claims made regarding efficacy, safety, and indications were identified and RCTs cited to substantiate these claims were examined. A claim was defined as supported by an RCT if the conclusions drawn in the paper were consistent with the claim. The quality of the RCT was assessed separately, using the Cochrane Risk of Bias Assessment Tool. In 25 (19%) of the 133 instances in which an RCT was cited, the published paper did not support the promotional claim. Moreover, there were only 10 (8%) instances in which the claim was supported by an RCT with a low risk of bias. Of the 78 cited RCTs, only 14% had a low risk of bias, while 49% had an unclear risk and 37% had a high risk. A high proportion of advertisements failed to meet New Zealand regulatory requirements that claims "are valid and have been substantiated".
Telestroke 10 years later--'telestroke 2.0'.
Switzer, Jeffrey A; Levine, Steven R; Hess, David C
2009-01-01
The lack of physicians with specialty stroke training represents a significant challenge to the future of stroke. This deficit limits both quality stroke care and clinical research initiatives. The use of telemedicine for stroke ('telestroke') has been an attempt to overcome this shortage and extend stroke expertise to locations which lack coverage. However, the initial telestroke systems required a point-to-point connection for transmission and only provided videoconferencing which limited their generalizability and usefulness. 'Telestroke 2.0' is the authors' vision of an integrative web-based telestroke system combining high-quality audiovideo transmission, the ability of consults and teleradiology to be carried out from any desktop or laptop computer with web-access, decision and technical support, creation of billable physician documentation and electronic medical record connectivity. These features will facilitate the development of statewide and regional telestroke call networks with an opportunity for physician supply companies to fill in coverage gaps. In addition, telestroke 2.0 may improve acute stroke research by increasing trial efficiency via the addition of non-academic recruitment sites, enhancing trial validity by centralizing neurologic examinations via recorded encounters, and generalizing clinical trial results to community hospital settings. Greater diffusion and long-term sustainability of telestroke systems will be dependent upon improvements in patient and hospital reimbursement for acute stroke and telestroke care. Copyright 2009 S. Karger AG, Basel.
Revisiting sample size: are big trials the answer?
Lurati Buse, Giovanna A L; Botto, Fernando; Devereaux, P J
2012-07-18
The superiority of the evidence generated in randomized controlled trials over observational data is not only conditional to randomization. Randomized controlled trials require proper design and implementation to provide a reliable effect estimate. Adequate random sequence generation, allocation implementation, analyses based on the intention-to-treat principle, and sufficient power are crucial to the quality of a randomized controlled trial. Power, or the probability of the trial to detect a difference when a real difference between treatments exists, strongly depends on sample size. The quality of orthopaedic randomized controlled trials is frequently threatened by a limited sample size. This paper reviews basic concepts and pitfalls in sample-size estimation and focuses on the importance of large trials in the generation of valid evidence.
Research gaps identified during systematic reviews of clinical trials: glass-ionomer cements.
Mickenautsch, Steffen
2012-06-29
To report the results of an audit concerning research gaps in clinical trials that were accepted for appraisal in authored and published systematic reviews regarding the application of glass-ionomer cements (GIC) in dental practice Information concerning research gaps in trial precision was extracted, following a framework that included classification of the research gap reasons: 'imprecision of information (results)', 'biased information', 'inconsistency or unknown consistency' and 'not the right information', as well as research gap characterization using PICOS elements: population (P), intervention (I), comparison (C), outcomes (O) and setting (S). Internal trial validity assessment was based on the understanding that successful control for systematic error cannot be assured on the basis of inclusion of adequate methods alone, but also requires empirical evidence about whether such attempt was successful. A comprehensive and interconnected coverage of GIC-related clinical topics was established. The most common reasons found for gaps in trial precision were lack of sufficient trials and lack of sufficient large sample size. Only a few research gaps were ascribed to 'Lack of information' caused by focus on mainly surrogate trial outcomes. According to the chosen assessment criteria, a lack of adequate randomisation, allocation concealment and blinding/masking in trials covering all reviewed GIC topics was noted (selection- and detection/performance bias risk). Trial results appear to be less affected by loss-to-follow-up (attrition bias risk). This audit represents an adjunct of the systematic review articles it has covered. Its results do not change the systematic review's conclusions but highlight existing research gaps concerning the precision and internal validity of reviewed trials in detail. These gaps should be addressed in future GIC-related clinical research.
Streamlining cardiovascular clinical trials to improve efficiency and generalisability.
Zannad, Faiez; Pfeffer, Marc A; Bhatt, Deepak L; Bonds, Denise E; Borer, Jeffrey S; Calvo-Rojas, Gonzalo; Fiore, Louis; Lund, Lars H; Madigan, David; Maggioni, Aldo Pietro; Meyers, Catherine M; Rosenberg, Yves; Simon, Tabassome; Stough, Wendy Gattis; Zalewski, Andrew; Zariffa, Nevine; Temple, Robert
2017-08-01
Controlled trials provide the most valid determination of the efficacy and safety of an intervention, but large cardiovascular clinical trials have become extremely costly and complex, making it difficult to study many important clinical questions. A critical question, and the main objective of this review, is how trials might be simplified while maintaining randomisation to preserve scientific integrity and unbiased efficacy assessments. Experience with alternative approaches is accumulating, specifically with registry-based randomised controlled trials that make use of data already collected. This approach addresses bias concerns while still capitalising on the benefits and efficiencies of a registry. Several completed or ongoing trials illustrate the feasibility of using registry-based controlled trials to answer important questions relevant to daily clinical practice. Randomised trials within healthcare organisation databases may also represent streamlined solutions for some types of investigations, although data quality (endpoint assessment) is likely to be a greater concern in those settings. These approaches are not without challenges, and issues pertaining to informed consent, blinding, data quality and regulatory standards remain to be fully explored. Collaboration among stakeholders is necessary to achieve standards for data management and analysis, to validate large data sources for use in randomised trials, and to re-evaluate ethical standards to encourage research while also ensuring that patients are protected. The rapidly evolving efforts to streamline cardiovascular clinical trials have the potential to lead to major advances in promoting better care and outcomes for patients with cardiovascular disease. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2017. All rights reserved. No commercial use is permitted unless otherwise expressly granted.
Dunn, Graham; Emsley, Richard; Liu, Hanhua; Landau, Sabine; Green, Jonathan; White, Ian; Pickles, Andrew
2015-11-01
The development of the capability and capacity to evaluate the outcomes of trials of complex interventions is a key priority of the National Institute for Health Research (NIHR) and the Medical Research Council (MRC). The evaluation of complex treatment programmes for mental illness (e.g. cognitive-behavioural therapy for depression or psychosis) not only is a vital component of this research in its own right but also provides a well-established model for the evaluation of complex interventions in other clinical areas. In the context of efficacy and mechanism evaluation (EME) there is a particular need for robust methods for making valid causal inference in explanatory analyses of the mechanisms of treatment-induced change in clinical outcomes in randomised clinical trials. The key objective was to produce statistical methods to enable trial investigators to make valid causal inferences about the mechanisms of treatment-induced change in these clinical outcomes. The primary objective of this report is to disseminate this methodology, aiming specifically at trial practitioners. The three components of the research were (1) the extension of instrumental variable (IV) methods to latent growth curve models and growth mixture models for repeated-measures data; (2) the development of designs and regression methods for parallel trials; and (3) the evaluation of the sensitivity/robustness of findings to the assumptions necessary for model identifiability. We illustrate our methods with applications from psychological and psychosocial intervention trials, keeping the technical details to a minimum, leaving the reporting of the more theoretical and mathematically demanding results for publication in appropriate specialist journals. We show how to estimate treatment effects and introduce methods for EME. We explain the use of IV methods and principal stratification to evaluate the role of putative treatment effect mediators and therapeutic process measures. These results are extended to the analysis of longitudinal data structures. We consider the design of EME trials. We focus on designs to create convincing IVs, bearing in mind assumptions needed to attain model identifiability. A key area of application that has become apparent during this work is the potential role of treatment moderators (predictive markers) in the evaluation of treatment effect mechanisms for personalised therapies (stratified medicine). We consider the role of targeted therapies and multiarm trials and the use of parallel trials to help elucidate the evaluation of mediators working in parallel. In order to demonstrate both efficacy and mechanism, it is necessary to (1) demonstrate a treatment effect on the primary (clinical) outcome, (2) demonstrate a treatment effect on the putative mediator (mechanism) and (3) demonstrate a causal effect from the mediator to the outcome. Appropriate regression models should be applied for (3) or alternative IV procedures, which account for unmeasured confounding, provided that a valid instrument can be identified. Stratified medicine may provide a setting where such instruments can be designed into the trial. This work could be extended by considering improved trial designs, sample size considerations and measurement properties. The project presents independent research funded under the MRC-NIHR Methodology Research Programme (grant reference G0900678).
Idiopathic Pulmonary Fibrosis: Clinically Meaningful Primary Endpoints in Phase 3 Clinical Trials
Collard, Harold R.; Anstrom, Kevin J.; Flaherty, Kevin R.; Fleming, Thomas R.; King, Talmadge E.; Martinez, Fernando J.; Brown, Kevin K.
2012-01-01
Definitive evidence of clinical efficacy in a Phase 3 trial is best shown by a beneficial impact on a clinically meaningful endpoint—that is, an endpoint that directly measures how a patient feels (symptoms), functions (the ability to perform activities in daily life), or survives. In idiopathic pulmonary fibrosis (IPF), we believe the endpoints that best meet these criteria are all-cause mortality and all-cause nonelective hospitalization. There are no validated measures of symptoms or broader constructs such as health status or funtional status in IPF. A surrogate endpoint is defined as an indirect measure that is intended to substitute for a clinically meaningful endpoint. Surrogate endpoints can be appropriate outcome measures if validated. However, validation requires substantial evidence that the effect of an intervention on a clinically meaningful endpoint is reliably predicted by the effect of an intervention on the surrogate endpoint. For patients with IPF, there are currently no validated surrogate endpoints. PMID:22505745
Minimal clinically important difference of the Modified Fatigue Impact Scale in Parkinson's disease.
Kluger, Benzi M; Garimella, Sanjana; Garvan, Cynthia
2017-10-01
Fatigue is a common and debilitating symptom of Parkinson's disease (PD) with no evidence-based treatments. While several fatigue scales are partially validated in PD the minimal clinically important difference (MCID) is unknown for any scale but is an important psychometric value to design and interpret therapeutic trials. We thus sought to determine the MCID for the Modified Fatigue Impact Scale (MFIS). This is a secondary data analysis from 94 PD participants in an acupuncture trial for PD fatigue. Standard psychometric approaches were used to establish validity and an anchor-based approach was used to determine the MCID. The MFIS demonstrated good concurrent validity with other outcome measures and high internal consistency. MCIDs values were found to be 13.8, 6.8 and 6.2 for the MFIS total, MFIS cognitive, and MFIS physical subscores respectively. The MFIS is a valid multidimensional measure of fatigue in PD with demonstrable MCID. Copyright © 2017 Elsevier Ltd. All rights reserved.
Li, Bin; Shin, Hyunjin; Gulbekyan, Georgy; Pustovalova, Olga; Nikolsky, Yuri; Hope, Andrew; Bessarabova, Marina; Schu, Matthew; Kolpakova-Hart, Elona; Merberg, David; Dorner, Andrew; Trepicchio, William L.
2015-01-01
Development of drug responsive biomarkers from pre-clinical data is a critical step in drug discovery, as it enables patient stratification in clinical trial design. Such translational biomarkers can be validated in early clinical trial phases and utilized as a patient inclusion parameter in later stage trials. Here we present a study on building accurate and selective drug sensitivity models for Erlotinib or Sorafenib from pre-clinical in vitro data, followed by validation of individual models on corresponding treatment arms from patient data generated in the BATTLE clinical trial. A Partial Least Squares Regression (PLSR) based modeling framework was designed and implemented, using a special splitting strategy and canonical pathways to capture robust information for model building. Erlotinib and Sorafenib predictive models could be used to identify a sub-group of patients that respond better to the corresponding treatment, and these models are specific to the corresponding drugs. The model derived signature genes reflect each drug’s known mechanism of action. Also, the models predict each drug’s potential cancer indications consistent with clinical trial results from a selection of globally normalized GEO expression datasets. PMID:26107615
Li, Bin; Shin, Hyunjin; Gulbekyan, Georgy; Pustovalova, Olga; Nikolsky, Yuri; Hope, Andrew; Bessarabova, Marina; Schu, Matthew; Kolpakova-Hart, Elona; Merberg, David; Dorner, Andrew; Trepicchio, William L
2015-01-01
Development of drug responsive biomarkers from pre-clinical data is a critical step in drug discovery, as it enables patient stratification in clinical trial design. Such translational biomarkers can be validated in early clinical trial phases and utilized as a patient inclusion parameter in later stage trials. Here we present a study on building accurate and selective drug sensitivity models for Erlotinib or Sorafenib from pre-clinical in vitro data, followed by validation of individual models on corresponding treatment arms from patient data generated in the BATTLE clinical trial. A Partial Least Squares Regression (PLSR) based modeling framework was designed and implemented, using a special splitting strategy and canonical pathways to capture robust information for model building. Erlotinib and Sorafenib predictive models could be used to identify a sub-group of patients that respond better to the corresponding treatment, and these models are specific to the corresponding drugs. The model derived signature genes reflect each drug's known mechanism of action. Also, the models predict each drug's potential cancer indications consistent with clinical trial results from a selection of globally normalized GEO expression datasets.
Kryscio, R J; Abner, E L; Schmitt, F A; Goodman, P J; Mendiondo, M; Caban-Holt, A; Dennis, B C; Mathews, M; Klein, E A; Crowley, J J
2013-01-01
To summarize the ongoing prevention of Alzheimer's disease (AD) by vitamin E and selenium (PREADViSE) trial as an ancillary study to SELECT (a large prostate cancer prevention trial) and to present the blinded results of the first year as an exposure study. PREADViSE was designed as a double blind randomized controlled trial (RCT). SELECT terminated after median of 5.5 years of exposure to supplements due to a futility analysis. Both trials then converted into an exposure study. In the randomized component PREADViSE enrolled 7,547 men age 62 or older (60 if African American). Once the trial terminated 4,246 of these men volunteered for the exposure study. Demographics were similar for both groups with exposure volunteers having baseline mean age 67.3 ± 5.2 years, 15.3 ± 2.4 years of education, 9.8% African Americans, and 22.0% reporting a family history of dementia. In the RCT men were randomly assigned to either daily doses of 400 IU of vitamin E or placebo and 200 µg of selenium or placebo using a 2x2 factorial structure. In the RCT, participants completed the memory impairment screen (MIS), and if they failed, underwent a longer screening (based on an expanded Consortium to Establish a Registry in AD [CERAD] battery). CERAD failure resulted in visits to their clinician for medical examination with records of these examinations forwarded to the PREADViSE center for further review. In the exposure study, men are contacted by telephone and complete the telephone version of the memory impairment screen (MIS-T) screen. If they fail the MIS-T, a modified telephone interview of cognitive status (TICS-M) exam is given. A failed TICS-M exam also leads to a visit to their clinician for an in-depth examination and forwarding of records for a centralized consensus diagnosis by expert clinicians. A subgroup of the men who pass the MIS-T also take the TICS-M exam for validation purposes. While this ancillary trial was open to all 427 SELECT clinical sites, only 130 (30.0%) of the sites chose to participate in PREADViSE. Staff turnover at the sites presented challenges when training persons unfamiliar with cognitive testing procedures to conduct the memory screens. In the RCT few participants (1.6%) failed the MIS screen and among those who passed this screen a significant practice effect was encountered. In the exposure study 3,581 men were reached by phone in year 1, 15.7% could not be reached after 5 calls, and of those contacted 6.0% refused the screen even after consenting to the procedures at their clinical site. Most notable is that the failure rate for the MIS-T increased fourfold to 7.2%. Of the 257 men who took the TICS-M, 84.0% failed and were asked to contact their physicians for a more detailed memory assessment, and approximately half of these had some form of dementia or cognitive impairment. Several of these dementia cases are not AD. Partnering with SELECT led to an AD prevention trial conducted at a very reasonable cost by taking advantage of the experience and efficient clinical trial management found in a cancer cooperative group (Southwest Oncology Group or SWOG). Once unblinded, the RCT and exposure study data have the potential to yield new information on long term exposure to antioxidant supplements under controlled conditions.
Challenges and Innovations in a Community-Based Participatory Randomized Controlled Trial
ERIC Educational Resources Information Center
Goodkind, Jessica R.; Amer, Suha; Christian, Charlisa; Hess, Julia Meredith; Bybee, Deborah; Isakson, Brian L.; Baca, Brandon; Ndayisenga, Martin; Greene, R. Neil; Shantzek, Cece
2017-01-01
Randomized controlled trials (RCTs) are a long-standing and important design for conducting rigorous tests of the effectiveness of health interventions. However, many questions have been raised about the external validity of RCTs, their utility in explicating mechanisms of intervention and participants' intervention experiences, and their…
A landmark cancer drug trial is helping set the stage for moving precision medicine into the mainstream of clinical practice, according to a new study. The study, reported in the Journal of Molecular Diagnostics, validates a procedure used in the dru
Dolatabadi, Elham; Taati, Babak; Mihailidis, Alex
2016-09-01
This paper presents a study to evaluate the concurrent validity of the Microsoft Kinect for Windows v2 for measuring the spatiotemporal parameters of gait. Twenty healthy adults performed several sequences of walks across a GAITRite mat under three different conditions: usual pace, fast pace, and dual task. Each walking sequence was simultaneously captured with two Kinect for Windows v2 and the GAITRite system. An automated algorithm was employed to extract various spatiotemporal features including stance time, step length, step time and gait velocity from the recorded Kinect v2 sequences. Accuracy in terms of reliability, concurrent validity and limits of agreement was examined for each gait feature under different walking conditions. The 95% Bland-Altman limits of agreement were narrow enough for the Kinect v2 to be a valid tool for measuring all reported spatiotemporal parameters of gait in all three conditions. An excellent intraclass correlation coefficient (ICC2, 1) ranging from 0.9 to 0.98 was observed for all gait measures across different walking conditions. The inter trial reliability of all gait parameters were shown to be strong for all walking types (ICC3, 1 > 0.73). The results of this study suggest that the Kinect for Windows v2 has the capacity to measure selected spatiotemporal gait parameters for healthy adults. Copyright © 2016 IPEM. Published by Elsevier Ltd. All rights reserved.
Language-based Measures of Mindfulness: Initial Validity and Clinical Utility
Collins, Susan E.; Chawla, Neharika; Hsu, Sharon H.; Grow, Joel; Otto, Jacqueline M.; Marlatt, G. Alan
2009-01-01
This study examined relationships among language use, mindfulness, and substance-use treatment outcomes in the context of an efficacy trial of mindfulness-based relapse prevention (MBRP) for adults with alcohol and other drug use (AOD) disorders (see Bowen, Chawla, Collins et al., in press). An expert panel generated two categories of mindfulness language (ML) describing the mindfulness state and the more encompassing “mindfulness journey,” which included words describing challenges of developing a mindfulness practice. MBRP participants (n=48) completed baseline sociodemographic and AOD measures, and participated in the 8-week MBRP program. AOD data were collected during the 4-month follow-up. A word count program assessed the frequency of ML and other linguistic markers in participants’ responses to open-ended questions about their postintervention impressions of mindfulness practice and MBRP. Findings supported concurrent validity of ML categories: ML words appeared more frequently in the MBRP manual compared to the 12-step Big Book. Further, ML categories correlated with other linguistic variables related to the mindfulness construct. Finally, predictive validity was supported: greater use of ML predicted fewer AOD use days during the 4-month follow-up. This study provided initial support for ML as a valid, clinically useful mindfulness measure. If future studies replicate these findings, ML could be used in conjunction with self-report to provide a more complete picture of the mindfulness experience. PMID:20025383
Role of learning potential in cognitive remediation: Construct and predictive validity.
Davidson, Charlie A; Johannesen, Jason K; Fiszdon, Joanna M
2016-03-01
The construct, convergent, discriminant, and predictive validity of Learning Potential (LP) was evaluated in a trial of cognitive remediation for adults with schizophrenia-spectrum disorders. LP utilizes a dynamic assessment approach to prospectively estimate an individual's learning capacity if provided the opportunity for specific related learning. LP was assessed in 75 participants at study entry, of whom 41 completed an eight-week cognitive remediation (CR) intervention, and 22 received treatment-as-usual (TAU). LP was assessed in a "test-train-test" verbal learning paradigm. Incremental predictive validity was assessed as the degree to which LP predicted memory skill acquisition above and beyond prediction by static verbal learning ability. Examination of construct validity confirmed that LP scores reflected use of trained semantic clustering strategy. LP scores correlated with executive functioning and education history, but not other demographics or symptom severity. Following the eight-week active phase, TAU evidenced little substantial change in skill acquisition outcomes, which related to static baseline verbal learning ability but not LP. For the CR group, LP significantly predicted skill acquisition in domains of verbal and visuospatial memory, but not auditory working memory. Furthermore, LP predicted skill acquisition incrementally beyond relevant background characteristics, symptoms, and neurocognitive abilities. Results suggest that LP assessment can significantly improve prediction of specific skill acquisition with cognitive training, particularly for the domain assessed, and thereby may prove useful in individualization of treatment. Published by Elsevier B.V.
Afshar, Majid; Press, Valerie G; Robison, Rachel G; Kho, Abel N; Bandi, Sindhura; Biswas, Ashvini; Avila, Pedro C; Kumar, Harsha Vardhan Madan; Yu, Byung; Naureckas, Edward T; Nyenhuis, Sharmilee M; Codispoti, Christopher D
2017-10-13
Comprehensive, rapid, and accurate identification of patients with asthma for clinical care and engagement in research efforts is needed. The original development and validation of a computable phenotype for asthma case identification occurred at a single institution in Chicago and demonstrated excellent test characteristics. However, its application in a diverse payer mix, across different health systems and multiple electronic health record vendors, and in both children and adults was not examined. The objective of this study is to externally validate the computable phenotype across diverse Chicago institutions to accurately identify pediatric and adult patients with asthma. A cohort of 900 asthma and control patients was identified from the electronic health record between January 1, 2012 and November 30, 2014. Two physicians at each site independently reviewed the patient chart to annotate cases. The inter-observer reliability between the physician reviewers had a κ-coefficient of 0.95 (95% CI 0.93-0.97). The accuracy, sensitivity, specificity, negative predictive value, and positive predictive value of the computable phenotype were all above 94% in the full cohort. The excellent positive and negative predictive values in this multi-center external validation study establish a useful tool to identify asthma cases in in the electronic health record for research and care. This computable phenotype could be used in large-scale comparative-effectiveness trials.
Takyar, Varun; Nath, Anand; Beri, Andrea; Gharib, Ahmed M; Rotman, Yaron
2017-09-01
Healthy volunteers are crucial for biomedical research. Inadvertent inclusion of subjects with nonalcoholic fatty liver disease (NAFLD) as controls can compromise study validity and subject safety. Given the rising prevalence of NAFLD in the general population, we sought to identify its prevalence and potential impact in volunteers for clinical trials. We conducted a cross-sectional study of subjects who were classified as healthy volunteers between 2011 and 2015 and had no known liver disease. Subjects were classified as presumed NAFLD (pNF; alanine aminotransferase [ALT] level ≥ 20 for women or ≥ 31 for men and body mass index [BMI] > 25 kg/m 2 ), healthy non-NAFLD controls (normal ALT and BMI), or indeterminate. A total of 3160 subjects participated as healthy volunteers in 149 clinical trials (1-29 trials per subject); 1732 of these subjects (55%) had a BMI > 25 kg/m 2 and 1382 (44%) had abnormal ALT. pNF was present in 881 subjects (27.9%), and these subjects were older than healthy control subjects and had higher triglycerides, low-density lipoprotein cholesterol, and HbA1c and lower high-density lipoprotein cholesterol (P < 0.001 for all). The 149 trials included 101 non-interventional, 33 interventional, and 15 vaccine trials. The impact on study validity of recruiting NAFLD subjects as controls was estimated as likely, probable, and unlikely in 10, 41, and 98 trials, respectively. The proportion of pNF subjects (28%-29%) did not differ by impact. Only 14% of trials used both BMI and ALT for screening. ALT cutoffs for screening were based on local reference values. Grade 3-4 ALT elevations during the study period were rare but more common in pNF subjects than in healthy control subjects (4 versus 1). NAFLD is common and often overlooked in volunteers for clinical trials, despite its potential impact on subject safety and validity of study findings. Increased awareness of NAFLD prevalence and stricter ALT cutoffs may ameliorate this problem. (Hepatology 2017;66:825-833). Published 2017. This article is a U.S. Government work and is in the public domain in the USA.
Litt, Mark D.; Kadden, Ronald M; Tennen, Howard
2012-01-01
The Coping Strategies Scale (CSS) was designed to assess adaptive changes in substance-use specific coping that result from treatment. The present study sought to examine the latent structure of the CSS in the hope that it might shed light on the coping processes of drug users, and guide the development of a brief version of the CSS. Respondents on the CSS were 751 men and women treated in three clinical trials for marijuana dependence. Posttreatment CSS data were analyzed to determine the nature of coping responses in patients who have been trained to use specific strategies to deal with substance use disorders. Exploratory factor analysis yielded two factors, categorized as problem-focused and emotion-focused coping, but confirmatory factor analysis did not support this structure. When infrequently endorsed items were removed, however, confirmatory factor analysis revealed a good fit to the data. Contrary to expectations, practical strategies that often form the basis for coping skills training, such as avoiding those who smoke, were not frequently endorsed. Problem focused items reflected cognitive commitments to change. Emotion-focused items included cognitive reinterpretations of emotions, to help manage emotional reactions. Brief versions of the CSS based on these factors showed good convergent and discriminant validity. The CSS, and the brief versions of the CSS, may prove useful in future treatment trials to evaluate effects of treatment on coping skills acquisition and utilization in substance dependent individuals. PMID:22082345
Heikinheimo, Oskari; Bitzer, Johannes; García Rodríguez, Luis
2017-08-01
In the context of women's health, we examine (1) the role that observational ('real-world') studies have in overcoming limitations of randomised clinical trials, (2) the relative advantages and disadvantages of different study designs, (3) the importance of outcome data from observational studies when making health-economic or clinical decisions, and (4) provide insights into changing perceptions of observational clinical data. PubMed and internet searches were used to identify (i) guidance and expert commentary on designing, conducting, analysing, and reporting clinical trials or observational studies, (ii) supporting evidence of the rapid growth of observational ('real world') studies and publications since the turn of millennium in the fields of contraception, reproductive health, obstetrics or gynaecology. The rapidly growing use and validation of large, computerised medical records and related databases (e.g., health insurance or national registries) have played a major part in changing perceptions of observational data among researchers and clinicians. In the past 10 years, a distinct increase in the number of observational studies published tends to confirm their growing acceptance, appreciation and use. Observational studies can provide information that is impossible or infeasible to obtain otherwise (e.g., impractical, very expensive, or ethically unacceptable). Greater understanding, dissemination, uptake and use of observational data might be expected to drive ongoing evolution of research, data collection, analysis, and validation, in turn improving quality and therefore credibility, utility, and further application by clinicians.
Kardos, Peter; Worsley, Sally; Singh, Dave; Román-Rodríguez, Miguel; Newby, David E; Müllerová, Hana
2016-01-01
Long-acting muscarinic antagonist (LAMA) or long-acting β2-agonist (LABA) bronchodilators and their combination are recommended for the maintenance treatment of chronic obstructive pulmonary disease (COPD). Although the efficacy of LAMAs and LABAs has been well established through randomized controlled trials (RCTs), questions remain regarding their cardiovascular (CV) safety. Furthermore, while the safety of LAMA and LABA monotherapy has been extensively studied, data are lacking for LAMA/LABA combination therapy, and the majority of the studies that have reported on the CV safety of LAMA/LABA combination therapy were not specifically designed to assess this. Evaluation of CV safety for COPD treatments is important because many patients with COPD have underlying CV comorbidities. However, severe CV and other comorbidities are often exclusion criteria for RCTs, contributing to a lack in external validity and generalizability. Real-world observational studies are another important tool to evaluate the effectiveness and safety of COPD therapies in a broader population of patients and can improve upon the external validity limitations of RCTs. We examine what is already known regarding the CV and cerebrovascular safety of LAMA/LABA combination therapy from RCTs and real-world observational studies, and explore the advantages and limitations of data derived from each study type. We also describe an ongoing prospective, observational, comparative post-authorization safety study of a LAMA/LABA combination therapy (umeclidinium/vilanterol) and LAMA monotherapy (umeclidinium) versus tiotropium, with a focus on the relative merits of the study design.
Validation of Shear Wave Elastography in Skeletal Muscle
Eby, Sarah F.; Song, Pengfei; Chen, Shigao; Chen, Qingshan; Greenleaf, James F.; An, Kai-Nan
2013-01-01
Skeletal muscle is a very dynamic tissue, thus accurate quantification of skeletal muscle stiffness throughout its functional range is crucial to improve the physical functioning and independence following pathology. Shear wave elastography (SWE) is an ultrasound-based technique that characterizes tissue mechanical properties based on the propagation of remotely induced shear waves. The objective of this study is to validate SWE throughout the functional range of motion of skeletal muscle for three ultrasound transducer orientations. We hypothesized that combining traditional materials testing (MTS) techniques with SWE measurements will show increased stiffness measures with increasing tensile load, and will correlate well with each other for trials in which the transducer is parallel to underlying muscle fibers. To evaluate this hypothesis, we monitored the deformation throughout tensile loading of four porcine brachialis whole-muscle tissue specimens, while simultaneously making SWE measurements of the same specimen. We used regression to examine the correlation between Young's modulus from MTS and shear modulus from SWE for each of the transducer orientations. We applied a generalized linear model to account for repeated testing. Model parameters were estimated via generalized estimating equations. The regression coefficient was 0.1944, with a 95% confidence interval of (0.1463 – 0.2425) for parallel transducer trials. Shear waves did not propagate well for both the 45° and perpendicular transducer orientations. Both parallel SWE and MTS showed increased stiffness with increasing tensile load. This study provides the necessary first step for additional studies that can evaluate the distribution of stiffness throughout muscle. PMID:23953670
Gernand, Alison D; Paul, Rina Rani; Ullah, Barkat; Taher, Muhammad A; Witter, Frank R; Wu, Lee; Labrique, Alain B; West, Keith P; Christian, Parul
2016-10-21
The best method of gestational age assessment is by ultrasound in the first trimester; however, this method is impractical in large field trials in rural areas. Our objective was to assess the validity of gestational age estimated from prospectively collected date of last menstrual period (LMP) using crown-rump length (CRL) measured in early pregnancy by ultrasound. As part of a large, cluster-randomized, controlled trial in rural Bangladesh, we collected dates of LMP by recall and as marked on a calendar every 5 weeks in women likely to become pregnant. Among those with a urine-test confirmed pregnancy, a subset with gestational age of <15 weeks (n = 353) were enrolled for ultrasound follow-up to measure CRL. We compared interview-assessed LMP with CRL gestational age estimates and classification of preterm, term, and post-term births. LMP-based gestational age was higher than CRL by a mean (SD) of 2.8 (10.7) days; differences varied by maternal education and preterm birth (P < 0.05). Lin's concordance correlation coefficient was good at ultrasound [0.63 (95 % CI 0.56, 0.69)] and at birth [0.77 (95 % CI 0.73, 0.81)]. Validity of classifying preterm birth was high but post-term was lower, with specificity of 96 and 89 % and sensitivity of 86 and 67 %, respectively. Results were similar by parity. Prospectively collected LMP provided a valid estimate of gestational age and preterm birth in a rural, low-income setting and may be a suitable alternative to ultrasound in programmatic settings and large field trials. ClinicalTrials.gov NCT00860470.
Koziol-McLain, Jane; McLean, Christine; Rohan, Maheswaran; Sisk, Rose; Dobbs, Terry; Nada-Raja, Shyamala; Wilson, Denise; Vandal, Alain C
2016-10-25
Automated eHealth Web-based research trials offer people an accessible, confidential opportunity to engage in research that matters to them. eHealth trials may be particularly useful for sensitive issues when seeking health care may be accompanied by shame and mistrust. Yet little is known about people's early engagement with eHealth trials, from recruitment to preintervention autoregistration processes. A recent randomized controlled trial that tested the effectiveness of an eHealth safety decision aid for New Zealand women in the general population who experienced intimate partner violence (isafe) provided the opportunity to examine recruitment and preintervention participant engagement with a fully automated Web-based registration process. The trial aimed to recruit 340 women within 24 months. The objective of our study was to examine participant preintervention engagement and recruitment efficiency for the isafe trial, and to analyze dropout through the registration pathway, from recruitment to eligibility screening and consent, to completion of baseline measures. In this case study, data collection sources included the trial recruitment log, Google Analytics reports, registration and program metadata, and costs. Analysis included a qualitative narrative of the recruitment experience and descriptive statistics of preintervention participant engagement and dropout rates. A Koyck model investigated the relationship between Web-based online marketing website advertisements (ads) and participant accrual. The isafe trial was launched on September 17, 2012. Placement of ads in an online classified advertising platform increased the average number of recruited participants per month from 2 to 25. Over the 23-month recruitment period, the registration website recorded 4176 unique visitors. Among 1003 women meeting eligibility criteria, 51.55% (517) consented to participate; among the 501 women who enrolled (consented, validated, and randomized), 412 (82.2%) were accrued (completed baseline assessments). The majority (n=52, 58%) of the 89 women who dropped out between enrollment and accrual never logged in to the allocated isafe website. Of every 4 accrued women, 3 (314/412, 76.2%) identified the classified ad as their referral source, followed by friends and family (52/412, 12.6%). Women recruited through a friend or relative were more likely to self-identify as indigenous Māori and live in the highest-deprivation areas. Ads increased the accrual rate by a factor of 74 (95% CI 49-112). Print advertisements, website links, and networking were costly and inefficient methods for recruiting participants to a Web-based eHealth trial. Researchers are advised to limit their recruitment efforts to Web-based online marketplace and classified advertising platforms, as in the isafe case, or to social media. Online classified advertising in "Jobs-Other-volunteers" successfully recruited a diverse sample of women experiencing intimate partner violence. Preintervention recruitment data provide critical information to inform future research and critical analysis of Web-based eHealth trials. Australian New Zealand Clinical Trials Registry (ANZCTR): ACTRN12612000708853; https://www.anzctr.org.au/Trial/Registration/TrialReview.aspx?ACTRN=12612000708853 (Archived by WebCite at http://www.webcitation/6lMGuVXdK).
Esbensen, A J; Hoffman, E K; Stansberry, E; Shaffer, R
2018-04-01
There is a need for rigorous measures of sleep in children with Down syndrome as sleep is a substantial problem in this population and there are barriers to obtaining the gold standard polysomnography (PSG). PSG is cost-prohibitive when measuring treatment effects in some clinical trials, and children with Down syndrome may not cooperate with undergoing a PSG. Minimal information is available on the validity of alternative methods of assessing sleep in children with Down syndrome, such as actigraphy and parent ratings. Our study examined the concurrent and convergent validity of different measures of sleep, including PSG, actigraphy and parent reports of sleep among children with Down syndrome. A clinic (n = 27) and a community (n = 47) sample of children with Down syndrome were examined. In clinic, children with Down syndrome wore an actigraph watch during a routine PSG. In the community, children with Down syndrome wore an actigraph watch for a week at home at night as part of a larger study on sleep and behaviour. Their parent completed ratings of the child's sleep during that same week. Actigraph watches demonstrated convergent validity with PSG when measuring a child with Down syndrome's total amount of sleep time, total wake time after sleep onset and sleep period efficiency. In contrast, actigraph watches demonstrated poor correlations with parent reports of sleep, and with PSG when measuring the total time in bed and total wake episodes. Actigraphy, PSG and parent ratings of sleep demonstrated poor concurrent validity with clinical diagnosis of obstructive sleep apnoea. Our current data suggest that actigraph watches demonstrate convergent validity and are sensitive to measuring certain sleep constructs (duration, efficiency) in children with Down syndrome. However, parent reports, such as the Children's Sleep Habits Questionnaire, may be measuring other sleep constructs. These findings highlight the importance of selecting measures of sleep related to target concerns. © 2018 MENCAP and International Association of the Scientific Study of Intellectual and Developmental Disabilities and John Wiley & Sons Ltd.
Jago, Russell; Edwards, Mark J; Sebire, Simon J; Tomkinson, Keeley; Bird, Emma L; Banfield, Kathryn; May, Thomas; Kesten, Joanna M; Cooper, Ashley R; Powell, Jane E; Blair, Peter S
2015-10-06
The aim of this study was to examine the effectiveness and cost of an after-school dance intervention at increasing the physical activity levels of Year 7 girls (age 11-12). A cluster randomised controlled trial was conducted in 18 secondary schools. Participants were Year 7 girls attending a study school. The Bristol Girls Dance Project (BGDP) intervention consisted of up to forty, 75-minute dance sessions delivered in the period immediately after school by experienced dance instructors over 20-weeks. The pre-specified primary outcome was accelerometer assessed mean minutes of weekday moderate to vigorous physical activity (MVPA) at time 2 (52 weeks are T0 baseline assessments). Secondary outcomes included accelerometer assessed mean minutes of weekday MVPA at time 1 (while the intervention was still running) and psychosocial outcomes. Intervention costs were assessed. 571 girls participated. Valid accelerometer data were collected from 549 girls at baseline with 508 girls providing valid accelerometer data at baseline and time 2. There were no differences between the intervention and control group for accelerometer assessed physical activity at either time 1 or time 2. Only one third of the girls in the intervention arm met the pre-set adherence criteria of attending two thirds of the dance sessions that were available to them. Instrumental variable regression analyses using complier average causal effects provided no evidence of a difference between girls who attended the sessions and the control group. The average cost of the intervention was £73 per girl, which was reduced to £63 when dance instructor travel expenses were excluded. This trial showed no evidence that an after-school dance programme can increase the physical activity of Year 7 girls. The trial highlighted the difficulty encountered in maintaining attendance in physical activity programmes delivered in secondary schools. There is a need to find new ways to help adolescent girls to be physically active via identifying ways to support and encourage sustained engagement in physical activity over the life course. ISRCTN52882523.
Tanner, Amy Vivien; Nielsen, Birthe Vejby; Allgrove, Judith
2014-01-01
The aim of this study was to examine the acute response to plasma and salivary cortisol and testosterone to three training protocols. Ten trained endurance athletes participated in three experimental trials, such as interval training (INT), tempo run (TEMP) and bodyweight-only circuit training (CIR), on separate days. Blood and saliva samples were collected pre- and 0, 15, 30 and 60 min post-exercise. Peak post-exercise salivary cortisol was higher than pre-exercise in all trials (P < 0.01). After INT, salivary cortisol remained elevated above pre-exercise than 60 min post-exercise. Salivary testosterone also increased post-exercise in all trials (P < 0.05). Plasma and salivary cortisol were correlated between individuals (r = 0.81, 0.73-0.88) and within individuals (r = 0.81, 0.73-0.87) (P < 0.01). Plasma and salivary testosterone was also correlated between (r = 0.57, 0.43-0.69) and within individuals (r = 0.60, 0.45-0.72), (P < 0.01). Peak cortisol and testosterone levels occurred simultaneously in plasma and saliva, but timing of post-exercise hormone peaks differed between trials and individuals. Further investigation is required to identify the mechanisms eliciting an increase in hormones in response to CIR. Furthermore, saliva is a valid alternative sampling technique for measurement of cortisol, although the complex, individual and situation dependent nature of the hormone response to acute exercise should be considered.
Nighbor, Tyler; Kohn, Carolynn; Normand, Matthew; Schlinger, Henry
2017-01-01
Some research suggests infants display a tendency to judge others' prosocial behavior, and in particular, that infants show a strong preference for prosocial others. For example, data from one frequently cited and well-publicized study showed that, after watching a puppet show with three puppets, 74% of infants chose the puppet that "helped" rather than the puppet that "hindered" a third puppet from attaining its goal. The purpose of the current investigation was to replicate these methods and extend them by including a within-subject measure of infant puppet choice across repeated trials to assess the stability of infants' choice. In the current study, 20 infants viewed a puppet show and chose between two puppets (i.e., helper or hinderer) immediately following the puppet show. Although results were similar to previously published work on the first-choice trial (65% of infants chose the helper puppet on the first trial), infants did not consistently choose the helper across trials; several infants demonstrated a side preference, with 9 infants almost exclusively choosing puppets presented on the right or left side. The current investigation addressed limitations of previous research by including a between-subjects (replication) as well as a within-subjects (extension) repeated measure of choice that allowed for the examination of the stability of the choice measure. Our results, particularly in light of other failed replications, raise questions regarding the robustness of infants' preference for prosocial others and the reliability and validity of the single-choice paradigm.
Murphy, Caitlin C; Vernon, Sally W; Haddock, Nicole M; Anderson, Melissa L; Chubak, Jessica; Green, Beverly B
2014-09-01
Few studies use longitudinal data to identify predictors of colorectal cancer screening (CRCS). We examined predictors of (1) initial CRCS during the first year of a randomized trial, and (2) repeat CRCS during the second year of the trial among those that completed FOBT in Year 1. The sample comprised 1247 participants of the Systems of Support to Increase Colorectal Cancer Screening (SOS) Trial (Group Health Cooperative, August 2008 to November 2011). Potential predictors of CRCS were identified with logistic regression and included sociodemographics, health history, and validated scales of psychosocial constructs. Prior CRCS (OR 2.64, 95% CI 1.99-3.52) and intervention group (Automated: OR 2.06 95% CI 1.43-2.95; Assisted: OR 4.03, 95% CI 2.69-6.03; Navigated: OR 5.64, 95% CI 3.74-8.49) were predictors of CRCS completion at Year 1. For repeat CRCS at Year 2, prior CRCS at baseline (OR 1.97, 95% CI 1.25-3.11), intervention group (Automated: OR 9.27, 95% CI 4.56-18.82; Assisted: OR 11.17, 95% CI 5.44-22.94; Navigated: OR 13.10, 95% CI 6.33-27.08), and self-efficacy (OR 1.32, 95% CI 1.00-1.73) were significant predictors. Self-efficacy and prior CRCS are important predictors of future screening behavior. CRCS completion increased when access barriers were removed through interventions. Copyright © 2014 Elsevier Inc. All rights reserved.
Rapee, Ronald M; Lyneham, Heidi J; Wuthrich, Viviana; Chatterton, Mary Lou; Hudson, Jennifer L; Kangas, Maria; Mihalopoulos, Cathrine
2017-10-01
Stepped care is embraced as an ideal model of service delivery but is minimally evaluated. The aim of this study was to evaluate the efficacy of cognitive-behavioral therapy (CBT) for child anxiety delivered via a stepped-care framework compared against a single, empirically validated program. A total of 281 youth with anxiety disorders (6-17 years of age) were randomly allocated to receive either empirically validated treatment or stepped care involving the following: (1) low intensity; (2) standard CBT; and (3) individually tailored treatment. Therapist qualifications increased at each step. Interventions did not differ significantly on any outcome measures. Total therapist time per child was significantly shorter to deliver stepped care (774 minutes) compared with best practice (897 minutes). Within stepped care, the first 2 steps returned the strongest treatment gains. Stepped care and a single empirically validated program for youth with anxiety produced similar efficacy, but stepped care required slightly less therapist time. Restricting stepped care to only steps 1 and 2 would have led to considerable time saving with modest loss in efficacy. Clinical trial registration information-A Randomised Controlled Trial of Standard Care Versus Stepped Care for Children and Adolescents With Anxiety Disorders; http://anzctr.org.au/; ACTRN12612000351819. Copyright © 2017 American Academy of Child and Adolescent Psychiatry. Published by Elsevier Inc. All rights reserved.
Fun and games in reviewing neonatal emergency care.
Gordon, D W; Brown, H N
1995-04-01
To develop a game-based review instrument for use by newborn caregivers in preparing for emergency situations. One hundred and one test questions covering pathophysiology, resuscitation, and medications were developed. The questions then underwent expert and peer review, psychometric testing for content validity and test-retest reliability, and a game trial. The needs of adult learners are different from those of other learners. The gaming format uses knowledge gained through experience and provides an avenue for validating knowledge and sharing experiences. This format has been found effective for review and reinforcement of facts. Twelve nurses participated in a trial game and completed a written evaluation using a Likert scale. The Neonatal Emergency Trivia Game is an effective tool for reviewing material related to neonatal emergency care decisions. Additional testing with a larger group would strengthen validity and reliability data.
Ussher, Michael; Aveyard, Paul; Manyonda, Isaac; Lewis, Sarah; West, Robert; Lewis, Beth; Marcus, Bess; Taylor, Adrian H; Barton, Pelham; Coleman, Tim
2012-10-04
Many women try to stop smoking in pregnancy but fail. One difficulty is that there is insufficient evidence that medications for smoking cessation are effective and safe in pregnancy and thus many women prefer to avoid these. Physical activity (PA) interventions may assist cessation; however, trials examining these interventions have been too small to detect or exclude plausible beneficial effects. The London Exercise And Pregnant smokers (LEAP) trial is investigating whether a PA intervention is effective and cost-effective when used for smoking cessation by pregnant women, and will be the largest study of its kind to date. The LEAP study is a pragmatic, multi-center, two-arm, randomized, controlled trial that will target pregnant women who smoke at least one cigarette a day (and at least five cigarettes a day before pregnancy), and are between 10 and 24 weeks pregnant. Eligible patients are individually randomized to either usual care (that is, behavioral support for smoking cessation) or usual care plus a intervention (entailing supervised exercise on a treadmill plus PA consultations). The primary outcome of the trial is self-reported and biochemically validated continuous abstinence from smoking between a specified quit date and the end of pregnancy. The secondary outcomes, measured at 1 and 4 weeks after the quit date, and at the end of pregnancy and 6 months after childbirth, are PA levels, depression, self-confidence, and cigarette withdrawal symptoms. Smoking status will also be self-reported at 6 months after childbirth. In addition, perinatal measures will be collected, including antenatal complications, duration of labor, mode of delivery, and birth and placental weight. Outcomes will be analyzed on an intention-to-treat basis, and logistic regression models used to compare treatment effects on the primary outcome. This trial will assess whether a PA intervention is effective when used for smoking cessation during pregnancy. ISRCTN48600346.
Brisco, Meredith A; Zile, Michael R; Hanberg, Jennifer S; Wilson, F Perry; Parikh, Chirag R; Coca, Steven G; Tang, W H Wilson; Testani, Jeffrey M
2016-10-01
Worsening renal function (WRF) is a common endpoint in decompensated heart failure clinical trials because of associations between WRF and adverse outcomes. However, WRF has not universally been identified as a poor prognostic sign, challenging the validity of WRF as a surrogate endpoint. Our aim was to describe the associations between changes in creatinine and adverse outcomes in a clinical trial of decongestive therapies. We investigated the association between changes in creatinine and the composite endpoint of death, rehospitalization or emergency room visit within 60 days in 301 patients in the Diuretic Optimization Strategies Evaluation (DOSE) trial. WRF was defined as an increase in creatinine >0.3 mg/dL and improvement in renal function (IRF) as a decrease >0.3 mg/dL. When examining linear changes in creatinine from baseline to 72 hours (the coprimary endpoint of DOSE), increasing creatinine was associated with lower risk for the composite outcome (HR = 0.81 per 0.3 mg/dL increase, 95% CI 0.67-0.98, P = .026). Compared with patients with stable renal function (n = 219), WRF (n = 54) was not associated with the composite endpoint (HR = 1.17, 95% CI = 0.77-1.78, P = .47). However, compared with stable renal function, there was a strong relationship between IRF (n = 28) and the composite endpoint (HR = 2.52, 95% CI = 1.57-4.03, P < .001). The coprimary endpoint of the DOSE trial, a linear increase in creatinine, was paradoxically associated with improved outcomes. This was driven by absence of risk attributable to WRF and a strong risk associated with IRF. These results argue against using changes in serum creatinine as a surrogate endpoint in trials of decongestive strategies. Copyright © 2016 Elsevier Inc. All rights reserved.
Cuesta-Gragera, Ana; Navarro-Fontestad, Carmen; Mangas-Sanjuan, Victor; González-Álvarez, Isabel; García-Arieta, Alfredo; Trocóniz, Iñaki F; Casabó, Vicente G; Bermejo, Marival
2015-07-10
The objective of this paper is to apply a previously developed semi-physiologic pharmacokinetic model implemented in NONMEM to simulate bioequivalence trials (BE) of acetyl salicylic acid (ASA) in order to validate the model performance against ASA human experimental data. ASA is a drug with first-pass hepatic and intestinal metabolism following Michaelis-Menten kinetics that leads to the formation of two main metabolites in two generations (first and second generation metabolites). The first aim was to adapt the semi-physiological model for ASA in NOMMEN using ASA pharmacokinetic parameters from literature, showing its sequential metabolism. The second aim was to validate this model by comparing the results obtained in NONMEM simulations with published experimental data at a dose of 1000 mg. The validated model was used to simulate bioequivalence trials at 3 dose schemes (100, 1000 and 3000 mg) and with 6 test formulations with decreasing in vivo dissolution rate constants versus the reference formulation (kD 8-0.25 h (-1)). Finally, the third aim was to determine which analyte (parent drug, first generation or second generation metabolite) was more sensitive to changes in formulation performance. The validation results showed that the concentration-time curves obtained with the simulations reproduced closely the published experimental data, confirming model performance. The parent drug (ASA) was the analyte that showed to be more sensitive to the decrease in pharmaceutical quality, with the highest decrease in Cmax and AUC ratio between test and reference formulations. Copyright © 2015 Elsevier B.V. All rights reserved.
Zabelina, Darya; Saporta, Arielle; Beeman, Mark
2016-04-01
Creativity has been putatively linked to distinct forms of attention, but which aspects of creativity and which components of attention remains unclear. Two experiments examined how divergent thinking and creative achievement relate to visual attention. In both experiments, participants identified target letters (S or H) within hierarchical stimuli (global letters made of local letters), after being cued to either the local or global level. In Experiment 1, participants identified the targets more quickly following valid cues (80% of trials) than following invalid cues. However, this smaller validity effect was associated with higher divergent thinking, suggesting that divergent thinking was related to quicker overcoming of invalid cues, and thus to flexible attention. Creative achievement was unrelated to the validity effect. Experiment 2 examined whether divergent thinking (or creative achievement) is related to "leaky attention," so that when cued to one level of a stimulus, some information is still processed, or leaks in, from the non-cued level. In this case, the cued stimulus level always contained a target, and the non-cued level was congruent, neutral, or incongruent with the target. Divergent thinking did not relate to stimulus congruency. In contrast, high creative achievement was related to quicker responses to the congruent than to the incongruent stimuli, suggesting that real-world creative achievement is indeed associated with leaky attention, whereas standard laboratory tests of divergent thinking are not. Together, these results elucidate distinct patterns of attention for different measures of creativity. Specifically, creative achievers may have leaky attention, as suggested by previous literature, whereas divergent thinkers have selective yet flexible attention.
Ratanawongsa, Neda; Karter, Andrew J; Quan, Judy; Parker, Melissa M; Handley, Margaret; Sarkar, Urmimala; Schmittdiel, Julie A; Schillinger, Dean
2015-08-01
With the expansion of Medicaid and low-cost health insurance plans among diverse patient populations, objective measures of medication adherence using pharmacy claims could advance clinical care and translational research for safety net care. However, safety net patients may experience fluctuating prescription drug coverage, affecting the performance of adherence measures. To evaluate the performance of continuous medication gap (CMG) for diverse, low-income managed care members with diabetes. We conducted this cross-sectional analysis using administrative and clinical data for 680 members eligible for a self-management support trial at a nonprofit, government-sponsored managed care plan. We applied CMG methodology to cardiometabolic medication claims for English- , Cantonese- , or Spanish-speaking members with diabetes. We examined inclusiveness (the proportion with calculable CMG) and selectivity (sociodemographic and medical differences from members without CMG). For validity, we examined unadjusted associations of suboptimal adherence (CMG > 20%) with suboptimal cardiometabolic control. 429 members (63%) had calculable CMG. Compared with members without CMG, members with CMG were younger, more likely employed, and had poorer glycemic control but had better blood pressure and lipid control. Suboptimal adherence occurred more frequently among members with poor cardiometabolic control than among members with optimal control (28% vs. 12%, P = 0.02). CMG demonstrated acceptable inclusiveness and validity in a diverse, low-income safety net population, comparable with its performance in studies among other insured populations. CMG may provide a useful tool to measure adherence among increasingly diverse Medicaid populations, complemented by other strategies to reach those not captured by CMG.
PLCO Ovarian Phase III Validation Study — EDRN Public Portal
Our preliminary data indicate that the performance of CA 125 as a screening test for ovarian cancer can be improved upon by additional biomarkers. With completion of one additional validation step, we will be ready to test the performance of a consensus marker panel in a phase III validation study. Given the original aims of the PLCO trial, we believe that the PLCO represents an ideal longitudinal cohort offering specimens for phase III validation of ovarian cancer biomarkers.
Choosing a Control Group in Effectiveness Trials of Behavioral Drug Abuse Treatments
Brigham, Gregory S.; Feaster, Daniel J.; Wakim, Paul G.; Dempsey, Catherine L.
2009-01-01
Effectiveness trials are an important step in the scientific process of developing and evaluating behavioral treatments. The focus on effectiveness research presents a different set of requirements on the research design when compared with efficacy studies. The choice of a control condition has many implications for a clinical trial's internal and external validity. The purpose of this manuscript is to provide a discussion of the issues involved in choosing a control group for effectiveness trials of behavioral interventions in substance abuse treatment. The authors provide a description of four trial designs and a discussion of the advantages and disadvantages of each. PMID:19553062
Ferrante di Ruffano, Lavinia; Dinnes, Jacqueline; Sitch, Alice J; Hyde, Chris; Deeks, Jonathan J
2017-02-24
There is a growing recognition for the need to expand our evidence base for the clinical effectiveness of diagnostic tests. Many international bodies are calling for diagnostic randomized controlled trials to provide the most rigorous evidence of impact to patient health. Although these so-called test-treatment RCTs are very challenging to undertake due to their methodological complexity, they have not been subjected to a systematic appraisal of their methodological quality. The extent to which these trials may be producing biased results therefore remains unknown. We set out to address this issue by conducting a methodological review of published test-treatment trials to determine how often they implement adequate methods to limit bias and safeguard the validity of results. We ascertained all test-treatment RCTs published 2004-2007, indexed in CENTRAL, including RCTs which randomized patients to diagnostic tests and measured patient outcomes after treatment. Tests used for screening, monitoring or prognosis were excluded. We assessed adequacy of sequence generation, allocation concealment and intention-to-treat, appropriateness of primary analyses, blinding and reporting of power calculations, and extracted study characteristics including the primary outcome. One hundred three trials compared 105 control with 119 experimental interventions, and reported 150 primary outcomes. Randomization and allocation concealment were adequate in 57 and 37% of trials. Blinding was uncommon (patients 5%, clinicians 4%, outcome assessors 21%), as was an adequate intention-to-treat analysis (29%). Overall 101 of 103 trials (98%) were at risk of bias, as judged using standard Cochrane criteria. Test-treatment trials are particularly susceptible to attrition and inadequate primary analyses, lack of blinding and under-powering. These weaknesses pose much greater methodological and practical challenges to conducting reliable RCT evaluations of test-treatment strategies than standard treatment interventions. We suggest a cautious approach that first examines whether a test-treatment intervention can accommodate the methodological safeguards necessary to minimize bias, and highlight that test-treatment RCTs require different methods to ensure reliability than standard treatment trials. Please see the companion paper to this article: http://bmcmedresmethodol.biomedcentral.com/articles/10.1186/s12874-016-0286-0 .
Hoogendoorn, Martine; Feenstra, Talitha L; Asukai, Yumi; Briggs, Andrew H; Hansen, Ryan N; Leidl, Reiner; Risebrough, Nancy; Samyshkin, Yevgeniy; Wacker, Margarethe; Rutten-van Mölken, Maureen P M H
2017-03-01
To validate outcomes of presently available chronic obstructive pulmonary disease (COPD) cost-effectiveness models against results of two large COPD trials-the 3-year TOwards a Revolution in COPD Health (TORCH) trial and the 4-year Understanding Potential Long-term Impacts on Function with Tiotropium (UPLIFT) trial. Participating COPD modeling groups simulated the outcomes for the placebo-treated groups of the TORCH and UPLIFT trials using baseline characteristics of the trial populations as input. Groups then simulated treatment effectiveness by using relative reductions in annual decline in lung function and exacerbation frequency observed in the most intensively treated group compared with placebo as input for the models. Main outcomes were (change in) total/severe exacerbations and mortality. Furthermore, the absolute differences in total exacerbations and quality-adjusted life-years (QALYs) were used to approximate the cost per exacerbation avoided and the cost per QALY gained. Of the six participating models, three models reported higher total exacerbation rates than observed in the TORCH trial (1.13/patient-year) (models: 1.22-1.48). Four models reported higher rates than observed in the UPLIFT trial (0.85/patient-year) (models: 1.13-1.52). Two models reported higher mortality rates than in the TORCH trial (15.2%) (models: 20.0% and 30.6%) and the UPLIFT trial (16.3%) (models: 24.8% and 36.0%), whereas one model reported lower rates (9.8% and 12.1%, respectively). Simulation of treatment effectiveness showed that the absolute reduction in total exacerbations, the gain in QALYs, and the cost-effectiveness ratios did not differ from the trials, except for one model. Although most of the participating COPD cost-effectiveness models reported higher total exacerbation rates than observed in the trials, estimates of the absolute treatment effect and cost-effectiveness ratios do not seem different from the trials in most models. Copyright © 2017 International Society for Pharmacoeconomics and Outcomes Research (ISPOR). Published by Elsevier Inc. All rights reserved.
Elsner, Birgit; Pauen, Sabina; Jeschonek, Susanna
2006-11-01
This report investigates the relations between duration of examining and heart rate (HR) across several trials of an object-examination task. A total of N= 20 11-month-olds were familiarized with a sequence of 10 different exemplars from the same global category (animals or furniture) before they received an exemplar from the contrasting category at test. Consistent with previous findings, HR was lower during states of focused attention (i.e. examining) than during states of casual attention (i.e. looking) or non-looking. Over the familiarization trials, examining stayed about the same, while mean HR increased. At test, examining increased and mean HR decreased, indicating that infants focused their attention on the out-of-category object. Psychophysiological and behavioral measures of attention were systematically related over trials, suggesting that heart rate provides a suitable objective measure to study infants' categorization performance in object-examination tasks.
Arora, Samantha; Cheung, Angela C; Tarique, Usman; Agarwal, Arnav; Firdouse, Mohammed; Ailon, Jonathan
2017-09-01
To compare point-of-care ultrasound and physical examination (PEx), each performed by first-year medical students after brief teaching, for assessing ascites and hepatomegaly. Ultrasound and PEx were compared on: (1) reliability, validity and performance, (2) diagnostic confidence, ease of use, utility, and applicability. A single-center, randomized controlled trial was performed at a tertiary centre. First-year medical students were randomized to use ultrasound or PEx to assess for ascites and hepatomegaly. Cohen's kappa and interclass coefficient (ICC) were used to measure interrater reliability between trainee assessments and the reference standard (a same day ultrasound by a radiologist). Sensitivity, specificity, accuracy, positive predictive value (PPV), and negative predictive value (NPV) were compared. A ten-point Likert scale was used to assess trainee diagnostic confidence and perceptions of utility. There were no significant differences in interobserver reliability, sensitivity, specificity, accuracy, PPV, or NPV between the ultrasound and PEx groups. However, students in the ultrasound group provided higher scores for perceived utility (ascites 8.38 ± 1.35 vs 7.08 ± 1.86, p = 0.008; hepatomegaly 7.68 ± 1.52 vs 5.36 ± 2.48, p < 0.001) and likelihood of adoption (ascites 8.67 ± 1.61 vs 7.46 ± 1.79, p = 0.02; hepatomegaly 8.12 ± 1.90 vs 5.92 ± 2.32, p = 0.001). When performed by first-year medical students, the validity and reliability of ultrasound is comparable to PEx, but with greater perceived utility and likelihood of adoption. With similarly brief instruction, point-of-care ultrasonography can be as effectively learned and performed as PEx, with a high degree of interest from trainees.
McManus, Lisa; Fisher, Jill A
2018-04-25
Phase I trials test the safety and tolerability of investigational drugs and often use healthy volunteers as research participants. Adverse events (AEs) are collected in part through participants' self-reports of any symptoms they experience during the trial. In some cases, experiencing AEs can result in trial participation being terminated. Because of the economic incentives underlying their motivation to participate, there is concern that healthy volunteers routinely fail to report AEs and, thereby, jeopardize the validity of the trial results. We interviewed 131 U.S. healthy volunteers about their experiences with AEs, including their rationales for reporting or failing to report symptoms. We found that participants have three primary rationales for their AE reporting behavior: economic, health-oriented, and data integrity. Participants often make decisions about whether to report AEs on a case-by-case basis evaluating what effects reporting or not reporting might have on the compensation they receive from the trial, the risk to their health, and the results of the particular clinical trial. Participants' interpretations of clinic policies, staff behaviors, and personal or vicarious experiences with reporting AEs also shape reporting decisions. Our findings demonstrate that participants' reporting behavior is more complex than previous portraits of healthy volunteers have suggested. Rather than finding participants who were so focused on the financial compensation that they were willing to subvert trial results, our study indicates that participants are willing in most cases to forgo their full compensation if they believe not reporting their symptoms jeopardizes their own safety or the validity of the research.
Measures of outcome for stimulant trials: ACTTION recommendations and research agenda.
Kiluk, Brian D; Carroll, Kathleen M; Duhig, Amy; Falk, Daniel E; Kampman, Kyle; Lai, Shengan; Litten, Raye Z; McCann, David J; Montoya, Ivan D; Preston, Kenzie L; Skolnick, Phil; Weisner, Constance; Woody, George; Chandler, Redonna; Detke, Michael J; Dunn, Kelly; Dworkin, Robert H; Fertig, Joanne; Gewandter, Jennifer; Moeller, F Gerard; Ramey, Tatiana; Ryan, Megan; Silverman, Kenneth; Strain, Eric C
2016-01-01
The development and approval of an efficacious pharmacotherapy for stimulant use disorders has been limited by the lack of a meaningful indicator of treatment success, other than sustained abstinence. In March, 2015, a meeting sponsored by Analgesic, Anesthetic, and Addiction Clinical Trial Translations, Innovations, Opportunities, and Networks (ACTTION) was convened to discuss the current state of the evidence regarding meaningful outcome measures in clinical trials for stimulant use disorders. Attendees included members of academia, funding and regulatory agencies, pharmaceutical companies, and healthcare organizations. The goal was to establish a research agenda for the development of a meaningful outcome measure that may be used as an endpoint in clinical trials for stimulant use disorders. Based on guidelines for the selection of clinical trial endpoints, the lessons learned from prior addiction clinical trials, and the process that led to identification of a meaningful indicator of treatment success for alcohol use disorders, several recommendations for future research were generated. These include a focus on the validation of patient reported outcome measures of functioning, the exploration of patterns of stimulant abstinence that may be associated with physical and/or psychosocial benefits, the role of urine testing for validating self-reported measures of stimulant abstinence, and the operational definitions for reduction-based measures in terms of frequency rather than quantity of stimulant use. These recommendations may be useful for secondary analyses of clinical trial data, and in the design of future clinical trials that may help establish a meaningful indicator of treatment success. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
Wing, Kevin; Williamson, Elizabeth; Carpenter, James R; Wise, Lesley; Schneeweiss, Sebastian; Smeeth, Liam; Quint, Jennifer K; Douglas, Ian
2018-03-25
Chronic obstructive pulmonary disease (COPD) is a progressive disease affecting 3 million people in the UK, in which patients exhibit airflow obstruction that is not fully reversible. COPD treatment guidelines are largely informed by randomised controlled trial results, but it is unclear if these findings apply to large patient populations not studied in trials. Non-interventional studies could be used to study patient groups excluded from trials, but the use of these studies to estimate treatment effectiveness is in its infancy. In this study, we will use individual trial data to validate non-interventional methods for assessing COPD treatment effectiveness, before applying these methods to the analysis of treatment effectiveness within people excluded from, or under-represented in COPD trials. Using individual patient data from the landmark COPD Towards a Revolution in COPD Health (TORCH) trial and validated methods for detecting COPD and exacerbations in routinely collected primary care data, we will assemble a cohort in the UK Clinical Practice Research Datalink (selecting people between 1 January 2004 and 1 January 2017) with similar characteristics to TORCH participants and test whether non-interventional data can generate comparable results to trials, using cohort methodology with propensity score techniques to adjust for potential confounding. We will then use the methodological template we have developed to determine risks and benefits of COPD treatments in people excluded from TORCH. Outcomes are pneumonia, COPD exacerbation, mortality and time to treatment change. Groups to be studied include the elderly (>80 years), people with substantial comorbidity, people with and without underlying cardiovascular disease and people with mild COPD. Ethical approval has been granted by the London School of Hygiene & Tropical Medicine Ethics Committee (Ref: 11997). The study has been approved by the Independent Scientific Advisory Committee of the UK Medicines and Healthcare Products Regulatory Agency (protocol no. 17_114R). An application to use the TORCH trial data made to clinicalstudydatarequest.com has been approved. In addition to scientific publications, dissemination methods will be developed based on discussions with patient groups with COPD. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2018. All rights reserved. No commercial use is permitted unless otherwise expressly granted.
Khanna, Reena; Bouguen, Guillaume; Feagan, Brian G; DʼHaens, Geert; Sandborn, William J; Dubcenco, Elena; Baker, K Adam; Levesque, Barrett G
2014-10-01
Crohn's disease (CD) is a chronic idiopathic inflammatory disorder of the gastrointestinal tract. Recently, mucosal healing has been proposed as a goal of therapy because clinical symptoms are subjective. Evaluative indices that measure endoscopic disease activity are required to define mucosal healing for clinical trials. The primary objective of this systematic review was to assess the existing evaluative indices that measure disease activity in CD and evaluate their role as outcome measures in clinical trials. A systematic literature review was performed using MEDLINE (Ovid), EMBASE (Ovid), PubMed, the Cochrane Library (CENTRAL), and DDW abstracts to identify randomized controlled trials and controlled clinical trials that used a relevant evaluative index from inception to February 2013. The data obtained from these trials were reviewed and summarized. The initial literature searches identified 2300 citations. After duplicates were removed, 1454 studies remained. After application of the apriori inclusion and exclusion criteria, 109 articles were included and 3 were identified with handsearches. In total, 9 evaluative indices for CD were identified and reviewed. The Crohn's Disease Endoscopic Index of Severity (CDEIS) and the Simple Endoscopic Score in Crohn's Disease (SES-CD) are indices with the most extensively described operating properties. Both the endoscopic evaluative instrument selected and the definition chosen for mucosal healing affect the validity of assessing endoscopic disease activity during a clinical trial for CD. Currently, the CDEIS and SES-CD have the most data regarding operating properties; however, further validation is required.
Patient-reported outcomes as end points in clinical trials in rheumatoid arthritis
Gossec, Laure; Dougados, Maxime; Dixon, William
2015-01-01
There is a growing interest in patient-reported outcomes (PROs) in rheumatology, which goes with a global trend for more ‘patient-centred care’. This review considers the use of PROs in trials, including their strengths and limitations. In rheumatoid arthritis (RA) trials, the most frequently used PROs to assess treatments include pain, patient global assessment, assessment of functional status, but also health-related quality of life and less commonly fatigue. Other aspects of importance for patients, such as sleep, psychological well-being or ability to cope, are rarely assessed. PROs as outcome measures in RA trials have strengths as well as limitations. PROs have face validity, they are reproducible and sensitive to change and they bring additional information beyond joint counts or acute phase reactants. However, their predictive validity for later outcomes has been little explored, some PROs show redundancy (they bring similar information) and, due to the apparently moderate link between some PROs such as fatigue and the disease process, the use of some PROs to inform treatment choices has been questioned. We suggest the choice of PROs for trials depends on the study objective and on the viewpoint of the stakeholder. There needs to be agreed prioritisation across all stakeholders about what is most important to collect in a trial, which is why a prioritisation and selection process is necessary. Trials in RA will continue to include PROs and their interpretation will become easier as our knowledge progresses. PMID:26509052
Development, validity, and reliability of a ballet-specific aerobic fitness test.
Twitchett, Emily; Nevill, Alan; Angioi, Manuela; Koutedakis, Yiannis; Wyon, Matthew
2011-09-01
The aim of this study was to develop and assess the reliability and validity of a multi-stage, ballet-specific aerobic fitness test to be used in a dance studio setting. The test consists of five stages, each four minutes long, that increase in intensity. It uses classical ballet movement of an intermediate-level of difficulty, thus emphasizing physiological demand rather than skill. The demand of each stage was determined by calculating the mean oxygen uptake during its final minute using a portable gas analyser. After an initial familiarization period, eight female subjects performed the test twice within seven days. The results showed significant differences in oxygen consumption between stages (p < 0.001), but not between trials. Pearson correlation co-efficients produced a very good linear relationship between trials (r = 0.998, p < 0.001). Bland-Altman reliability analysis revealed the 95% limits of agreement to be ± 6.2 ml·kg(-1)·min(-1), showing good agreement between trials. The oxygen uptake in our subjects equated positively to previous estimates for class and performance, confirming validity. It was concluded that the test is suitable for use among classical ballet dancers, with many possible applications.
Green, Michael F.
2013-01-01
Social cognitive impairment is prominent in schizophrenia, and it is closely related to functional outcome. Partly for these reasons, it has rapidly become a target for both training and psychopharmacological interventions. However, there is a paucity of reliable and valid social cognitive endpoints that can be used to evaluate treatment response in clinical trials. Also, clinical studies in schizophrenia have benefited rather little from the surge of activity and knowledge in nonclinical social neuroscience. The National Institute of Mental Health-sponsored study, “Social Cognition and Functioning in Schizophrenia” (SCAF), attempted to address this translational challenge by selecting paradigms from social neuroscience that could be adapted for use in schizophrenia. The project also evaluated the psychometric properties and external validity of the tasks to determine their suitability for multisite clinical trials. This first article in the theme section presents the goals, conceptual background, and rationale for the SCAF project. PMID:24072811
Moura, Lidia Mvr; Westover, M Brandon; Kwasnik, David; Cole, Andrew J; Hsu, John
2017-01-01
The elderly population faces an increasing number of cases of chronic neurological conditions, such as epilepsy and Alzheimer's disease. Because the elderly with epilepsy are commonly excluded from randomized controlled clinical trials, there are few rigorous studies to guide clinical practice. When the elderly are eligible for trials, they either rarely participate or frequently have poor adherence to therapy, thus limiting both generalizability and validity. In contrast, large observational data sets are increasingly available, but are susceptible to bias when using common analytic approaches. Recent developments in causal inference-analytic approaches also introduce the possibility of emulating randomized controlled trials to yield valid estimates. We provide a practical example of the application of the principles of causal inference to a large observational data set of patients with epilepsy. This review also provides a framework for comparative-effectiveness research in chronic neurological conditions.
Meltzer, Eli O; Hadley, James; Blaiss, Michael; Benninger, Michael; Kimel, Miriam; Kleinman, Leah; Dupclay, Leon; Garcia, Jorge; Leahy, Michael; Georges, George
2005-02-01
To develop a questionnaire to evaluate preferences for attributes of intranasal corticosteroids (INSs) in clinical trials with allergic rhinitis (AR) patients. Established questionnaire development practices were used, including performance of a literature review and use of patient and physician focus groups, cognitive debriefing interviews, and pilot testing before validation. Findings from patient and physician focus groups suggest that sensory attributes are relevant to AR patients when choosing INSs. Physician focus groups identified the need for 2 distinct preference instruments, a clinical trial patient preference questionnaire (CTPPQ) and a clinical practice preference questionnaire (CPPPQ). A pilot study suggests that the CTPPQ is capable of discriminating between 2 INSs in the clinical trial setting. Initial findings suggest that items in the CTPPQ and CPPPQ are easy to understand and relevant to patients. Further validation studies with larger sample sizes are needed to assess the psychometric properties of both questionnaires. B-20.
Jull, J; Whitehead, M; Petticrew, M; Kristjansson, E; Gough, D; Petkovic, J; Volmink, J; Weijer, C; Taljaard, M; Edwards, S; Mbuagbaw, L; Cookson, R; McGowan, J; Lyddiatt, A; Boyer, Y; Cuervo, L G; Armstrong, R; White, H; Yoganathan, M; Pantoja, T; Shea, B; Pottie, K; Norheim, O; Baird, S; Robberstad, B; Sommerfelt, H; Asada, Y; Wells, G; Tugwell, P; Welch, V
2017-09-25
Randomised controlled trials can provide evidence relevant to assessing the equity impact of an intervention, but such information is often poorly reported. We describe a conceptual framework to identify health equity-relevant randomised trials with the aim of improving the design and reporting of such trials. An interdisciplinary and international research team engaged in an iterative consensus building process to develop and refine the conceptual framework via face-to-face meetings, teleconferences and email correspondence, including findings from a validation exercise whereby two independent reviewers used the emerging framework to classify a sample of randomised trials. A randomised trial can usefully be classified as 'health equity relevant' if it assesses the effects of an intervention on the health or its determinants of either individuals or a population who experience ill health due to disadvantage defined across one or more social determinants of health. Health equity-relevant randomised trials can either exclusively focus on a single population or collect data potentially useful for assessing differential effects of the intervention across multiple populations experiencing different levels or types of social disadvantage. Trials that are not classified as 'health equity relevant' may nevertheless provide information that is indirectly relevant to assessing equity impact, including information about individual level variation unrelated to social disadvantage and potentially useful in secondary modelling studies. The conceptual framework may be used to design and report randomised trials. The framework could also be used for other study designs to contribute to the evidence base for improved health equity. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2017. All rights reserved. No commercial use is permitted unless otherwise expressly granted.
Randomised placebo-controlled trials of surgery: ethical analysis and guidelines.
Savulescu, Julian; Wartolowska, Karolina; Carr, Andy
2016-12-01
Use of a placebo control in surgical trials is a divisive issue. We argue that, in principle, placebo controls for surgery are necessary in the same way as for medicine. However, there are important differences between these types of trial, which both increase justification and limit application of surgical studies. We propose that surgical randomised placebo-controlled trials are ethical if certain conditions are fulfilled: (1) the presence of equipoise, defined as a lack of unbiased evidence for efficacy of an intervention; (2) clinically important research question; (3) the risk to patients is minimised and reasonable; (4) there is uncertainty about treatment allocation rather than deception; (5) there is preliminary evidence for efficacy, which justifies a placebo-controlled design; and (6) ideally, the placebo procedure should have some direct benefit to the patient, for example, as a diagnostic tool. Placebo-controlled trials in surgery will most often be justified when surgery is performed to improve function or relieve symptoms and when objective outcomes are not available, while the risk of mortality or significant morbidity is low. In line with medical placebo-controlled trials, the surgical trial (1) should be sufficiently powered and (2) standardised so that its results are valid, (3) consent should be valid, (4) the standard treatment or rescue medication should be provided if possible, and (5) after the trial, the patients should be told which treatment they received and there should be provision for post-trial care if the study may result in long-term negative effects. We comment and contrast our guidelines with those of the American Medical Association. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://www.bmj.com/company/products-services/rights-and-licensing/.
Sun, H; Stockbridge, N; Ariagno, R L; Murphy, D; Nelson, R M; Rodriguez, W
2016-12-01
To identify suitable end points and surrogates for pediatric pulmonary arterial hypertension (PAH) as the lack of developmentally appropriate end point and clinical trials contribute to the unmet medical need. Reviewed the efficacy end points and surrogates for all trials (1995 to 2013) that were submitted to the Food and Drug Administration (FDA) to support the approval of PAH therapy and conducted literature search. An increase in the 6 min walking distance (6MWD) was used as a primary end point in 8/9 adult PAH trials. This end point is not suitable for infants and young children because of performance limitations and lack of control data. One adult PAH trial used time to the first morbidity or mortality event as a primary end point, which could potentially be used in pediatric PAH trials. In the sildenafil pediatric PAH trial, the change in pulmonary vascular resistance index or mean pulmonary artery pressure was used as a surrogate for the 6MWD to assess exercise capacity. However, two deaths and three severe adverse events during the catheterizations made this an unacceptably high-risk surrogate. The INOmax persistent pulmonary hypertension of the newborn trial used a reduction in initiation of extracorporeal membrane oxygenation treatment as a primary end point, which is not feasible for other pediatric PAH trials. A Literature review revealed none of the existing noninvasive markers are fully validated as surrogates to assess PAH efficacy and long-term safety. For pediatric PAH trials, clinical end points are acceptable, and novel validated surrogates would be helpful. FDA seeks collaboration with academia, industry and parents to develop other suitable and possibly more efficient efficacy end points to facilitate pediatric PAH drug development.
External validity of randomized controlled trials in older adults, a systematic review.
van Deudekom, Floor J; Postmus, Iris; van der Ham, Danielle J; Pothof, Alexander B; Broekhuizen, Karen; Blauw, Gerard J; Mooijaart, Simon P
2017-01-01
To critically assess the external validity of randomized controlled trials (RCTs) it is important to know what older adults have been enrolled in the trials. The aim of this systematic review is to study what proportion of trials specifically designed for older patients report on somatic status, physical and mental functioning, social environment and frailty in the patient characteristics. PubMed was searched for articles published in 2012 and only RCTs were included. Articles were further excluded if not conducted with humans or only secondary analyses were reported. A random sample of 10% was drawn. The current review analyzed this random sample and further selected trials when the reported mean age was ≥ 60 years. We extracted geriatric assessments from the population descriptives or the in- and exclusion criteria. In total 1396 trials were analyzed and 300 trials included. The median of the reported mean age was 66 (IQR 63-70) and the median percentage of men in the trials was 60 (IQR 45-72). In 34% of the RCTs specifically designed for older patients somatic status, physical and mental functioning, social environment or frailty were reported in the population descriptives or the in- and exclusion criteria. Physical and mental functioning was reported most frequently (22% and 14%). When selecting RCTs on a mean age of 70 or 80 years the report of geriatric assessments in the patient characteristics was 46% and 85% respectively but represent only 5% and 1% of the trials. Somatic status, physical and mental functioning, social environment and frailty are underreported even in RCTs specifically designed for older patients published in 2012. Therefore, it is unclear for clinicians to which older patients the results can be applied. We recommend systematic to transparently report these relevant characteristics of older participants included in RCTs.
The Robinson Protocol: a treadmill anaerobic performance test.
Robinson, Ellyn M; Graham, Louise B; Headley, Samuel A
2004-08-01
The current investigation was designed to further examine the reliability of the Robinson protocol, which is a run-to-exhaustion treadmill test. Robinson (10) originally examined this protocol with 5 subjects. The significance of the initial exploratory study was the impetus for expanding the study to examine the reliability of the protocol with a larger sample. Fifteen male subjects participated in 3 trial runs on the treadmill. The first trial was a modified McConnell (7) test to determine the aerobic capacity of each subject. The second and third trials were identical Robinson protocols (10). The first trial run mean, in seconds (262.04 +/- 74.50), was not significantly different from the second trial run mean (257.30 +/- 72.65), p = 0.526 (2 tailed). As expected, trial 1 and trial 2 were highly correlated (intraclass) (r = 0.927, p < 0.001). These results provide additional support for the hypothesis that the Robinson protocol with a greater subject pool is a reliable protocol that can be used in research studies interested in examining various physiological interventions or anaerobic training.