Cai, Gaigai; Chen, Xuefeng; Li, Bing; Chen, Baojia; He, Zhengjia
2012-01-01
The reliability of cutting tools is critical to machining precision and production efficiency. The conventional statistic-based reliability assessment method aims at providing a general and overall estimation of reliability for a large population of identical units under given and fixed conditions. However, it has limited effectiveness in depicting the operational characteristics of a cutting tool. To overcome this limitation, this paper proposes an approach to assess the operation reliability of cutting tools. A proportional covariate model is introduced to construct the relationship between operation reliability and condition monitoring information. The wavelet packet transform and an improved distance evaluation technique are used to extract sensitive features from vibration signals, and a covariate function is constructed based on the proportional covariate model. Ultimately, the failure rate function of the cutting tool being assessed is calculated using the baseline covariate function obtained from a small sample of historical data. Experimental results and a comparative study show that the proposed method is effective for assessing the operation reliability of cutting tools. PMID:23201980
Interrater Reliability of the Power Mobility Road Test in the Virtual Reality-Based Simulator-2.
Kamaraj, Deepan C; Dicianno, Brad E; Mahajan, Harshal P; Buhari, Alhaji M; Cooper, Rory A
2016-07-01
To assess interrater reliability of the Power Mobility Road Test (PMRT) when administered through the Virtual Reality-based SIMulator-version 2 (VRSIM-2). Within-subjects repeated-measures design. Participants interacted with VRSIM-2 through 2 display options (desktop monitor vs immersive virtual reality screens) using 2 control interfaces (roller system vs conventional movement-sensing joystick), providing 4 different driving scenarios (driving conditions 1-4). Participants performed 3 virtual driving sessions for each of the 2 display screens and 1 session through a real-world driving course (driving condition 5). The virtual PMRT was conducted in a simulated indoor office space, and an equivalent course was charted in an open space for the real-world assessment. After every change in driving condition, participants completed a self-reported workload assessment questionnaire, the Task Load Index, developed by the National Aeronautics and Space Administration. A convenience sample of electric-powered wheelchair (EPW) athletes (N=21) recruited at the 31st National Veterans Wheelchair Games. Not applicable. Total composite PMRT score. The PMRT had high interrater reliability (intraclass correlation coefficient [ICC]>.75) between the 2 raters in all 5 driving conditions. Post hoc analyses revealed that the reliability analyses had >80% power to detect high ICCs in driving conditions 1 and 4. The PMRT has high interrater reliability in conditions 1 and 4 and could be used to assess EPW driving performance virtually in VRSIM-2. However, further psychometric assessment is necessary to assess the feasibility of administering the PMRT using the different interfaces of VRSIM-2. Copyright © 2016 American Congress of Rehabilitation Medicine. Published by Elsevier Inc. All rights reserved.
Mani, Suresh; Sharma, Shobha; Omar, Baharudin; Paungmali, Aatit; Joseph, Leonard
2017-04-01
Purpose: The purpose of this review is to systematically explore and summarise the validity and reliability of telerehabilitation (TR)-based physiotherapy assessment for musculoskeletal disorders. Method: A comprehensive systematic literature review was conducted using a number of electronic databases (PubMed, EMBASE, PsycINFO, Cochrane Library and CINAHL) for studies published between January 2000 and May 2015. Studies that examined the validity and the inter- and intra-rater reliabilities of TR-based physiotherapy assessment for musculoskeletal conditions were included. Two independent reviewers used the Quality Appraisal Tool for studies of diagnostic Reliability (QAREL) and the Quality Assessment of Diagnostic Accuracy Studies (QUADAS) tool to assess the methodological quality of reliability and validity studies respectively. Results: A total of 898 hits were retrieved, of which 11 articles meeting the inclusion criteria were reviewed. Nine studies explored the concurrent validity, inter- and intra-rater reliabilities, while two studies examined only the concurrent validity. Reviewed studies were moderate to good in methodological quality. The physiotherapy assessments such as pain, swelling, range of motion, muscle strength, balance, gait and functional assessment demonstrated good concurrent validity. However, the reported concurrent validity of lumbar spine posture, special orthopaedic tests, neurodynamic tests and scar assessments ranged from low to moderate. Conclusion: TR-based physiotherapy assessment was technically feasible with overall good concurrent validity and excellent reliability, except for lumbar spine posture, orthopaedic special tests, neurodynamic tests and scar assessment.
Gasq, David; Labrunée, Marc; Amarantini, David; Dupui, Philippe; Montoya, Richard; Marque, Philippe
2014-03-21
Stroke patients have impaired postural balance that increases the risk of falls and impairs their mobility. Assessment of postural balance is commonly carried out by recording centre of pressure (CoP) displacements, but the lack of data concerning reliability of these measures compromises their interpretation. The purpose of this study was to investigate the between-day reliability of six CoP-based variables, in order to provide i) reliability data for monitoring postural sway and weight-bearing asymmetry of stroke patients in clinical practice and ii) a consistent method for assessing measurement error for applications in physical medicine and rehabilitation. Postural balance of 20 stroke patients was assessed in quiet standing on a force platform, in two sessions, 7 days apart. Six CoP-based variables were collected in eyes open and eyes closed conditions: postural sway was assessed with mean and standard deviation of CoP-velocity, CoP-velocity along the mediolateral and anteroposterior axes, and confidence ellipse area (CE(AREA)); weight-bearing asymmetry was assessed with mean CoP position along the mediolateral axis (CoP(ML)). The intraclass correlation coefficient (ICC) was used to determine the level of agreement between test and retest. Small real difference (SRD), corresponding to the smallest change that indicates a real improvement for a single individual, was used to determine the extent of measurement error. ICCs were satisfactory (>0.9) for all CoP-based variables, except for CE(AREA) in eyes open condition and CoP(ML) (<0.8). The SRDs (eyes open/closed conditions) were: 6.1/9.5 mm.s(-1) for mean velocity; 12.3/12.2 mm.s(-1) for standard deviation of CoP-velocity; 3.6/5.5 mm.s(-1) and 4.9/7.3 mm.s(-1) for CoP-velocity in mediolateral and anteroposterior axes, respectively; 17.4/21.4 mm for CoP(ML).
Because CE(AREA) showed heteroscedasticity of measurement error distribution, SRD (eyes open/closed conditions) was expressed as a percentage (121/75%) and a ratio (3.68/2.16) obtained after log-antilog procedure. In clinical practice, the CoP-based velocity variables should be preferred to CE(AREA) to assess and monitor postural sway over time in hemiplegic stroke patients. The poor reliability of CoP(ML) compromises its use to assess weight-bearing asymmetry. The procedure we used could be applied in reliability studies concerning other CoP-based variables or other biological variables in the field of physical medicine and rehabilitation.
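The relationship between ICC, measurement error and SRD described in the abstract above can be illustrated with a minimal sketch. It assumes the commonly used definitions SEM = SD * sqrt(1 - ICC) and SRD = 1.96 * sqrt(2) * SEM; the abstract does not state which formula the authors used, and the function names and input values below are hypothetical.

```python
import math

def sem_from_icc(sd: float, icc: float) -> float:
    """Standard error of measurement from a between-subject SD and an ICC."""
    return sd * math.sqrt(1.0 - icc)

def smallest_real_difference(sem: float, z: float = 1.96) -> float:
    """Smallest change that exceeds measurement error for one individual (95% level)."""
    return z * math.sqrt(2.0) * sem

# Hypothetical inputs, NOT taken from the study.
sd_velocity = 5.0    # mm/s, between-subject standard deviation
icc_velocity = 0.90  # test-retest ICC

sem = sem_from_icc(sd_velocity, icc_velocity)
srd = smallest_real_difference(sem)
print(f"SEM = {sem:.2f} mm/s, SRD = {srd:.2f} mm/s")
```

Under these assumptions, a lower ICC or a larger between-subject spread both enlarge the SRD, so a larger change is needed before it can be read as real for a single patient.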
Animal-Based Measures to Assess the Welfare of Extensively Managed Ewes
Hemsworth, Paul; Doyle, Rebecca
2017-01-01
Simple Summary: The aim of this study was to assess the reliability and practicality of 10 animal-based welfare measures for extensively managed ewes, which were derived from the scientific literature, previous welfare protocols and through consultation with veterinarians and animal welfare scientists. Measures were examined on 100 Merino ewes, which were individually identified and repeatedly examined at mid-pregnancy, mid-lactation and weaning. Body condition score, fleece condition, skin lesions, tail length, dag score and lameness are proposed for on-farm use in welfare assessments of extensive sheep production systems. These six welfare measures, which address the main welfare concerns for extensively managed ewes, can be reliably and feasibly measured in the field. Abstract: The reliability and feasibility of 10 animal-based measures of ewe welfare were examined for use in extensive sheep production systems. Measures were: Body condition score (BCS), rumen fill, fleece cleanliness, fleece condition, skin lesions, tail length, dag score, foot-wall integrity, hoof overgrowth and lameness, and all were examined on 100 Merino ewes (aged 2–4 years) during mid-pregnancy, mid-lactation and weaning by a pool of nine trained observers. The measures of BCS, fleece condition, skin lesions, tail length, dag score and lameness were deemed to be reliable and feasible. All had good observer agreement, as determined by the percentage of agreement, Kendall’s coefficient of concordance (W) and Kappa (k) values. When combined, these nutritional and health measures provide a snapshot of the current welfare status of ewes, as well as evidencing previous or potential welfare issues. PMID:29295551
NASA Astrophysics Data System (ADS)
Flanigan, Katherine A.; Johnson, Nephi R.; Hou, Rui; Ettouney, Mohammed; Lynch, Jerome P.
2017-04-01
The ability to quantitatively assess the condition of railroad bridges facilitates objective evaluation of their robustness in the face of hazard events. Of particular importance is the need to assess the condition of railroad bridges in networks that are exposed to multiple hazards. Data collected from structural health monitoring (SHM) can be used to better maintain a structure by prompting preventative (rather than reactive) maintenance strategies and supplying quantitative information to aid in recovery. To that end, a wireless monitoring system is validated and installed on the Harahan Bridge which is a hundred-year-old long-span railroad truss bridge that crosses the Mississippi River near Memphis, TN. This bridge is exposed to multiple hazards including scour, vehicle/barge impact, seismic activity, and aging. The instrumented sensing system targets non-redundant structural components and areas of the truss and floor system that bridge managers are most concerned about based on previous inspections and structural analysis. This paper details the monitoring system and the analytical method for the assessment of bridge condition based on automated data-driven analyses. Two primary objectives of monitoring the system performance are discussed: 1) monitoring fatigue accumulation in critical tensile truss elements; and 2) monitoring the reliability index values associated with sub-system limit states of these members. Moreover, since the reliability index is a scalar indicator of the safety of components, quantifiable condition assessment can be used as an objective metric so that bridge owners can make informed damage mitigation strategies and optimize resource management on single bridge or network levels.
Wolf, Timothy J; Dahl, Abigail; Auen, Colleen; Doherty, Meghan
2017-07-01
The objective of this study was to evaluate the inter-rater reliability, test-retest reliability, concurrent validity, and discriminant validity of the Complex Task Performance Assessment (CTPA): an ecologically valid performance-based assessment of executive function. Community control participants (n = 20) and individuals with mild stroke (n = 14) participated in this study. All participants completed the CTPA and a battery of cognitive assessments at initial testing. The control participants completed the CTPA at two different times one week apart. The intra-class correlation coefficient (ICC) for inter-rater reliability for the total score on the CTPA was .991. The ICCs for all of the sub-scores of the CTPA were also high (.889-.977). The CTPA total score was significantly correlated with Condition 4 of the DKEFS Color-Word Interference Test (ρ = -.425) and the Wechsler Test of Adult Reading (ρ = -.493). Finally, there were significant differences between control subjects and individuals with mild stroke on the total score of the CTPA (p = .007) and all sub-scores except interpretation failures and total items incorrect. These results are also consistent with other current executive function performance-based assessments and indicate that the CTPA is a reliable and valid performance-based measure of executive function.
Smith, Justin D; Dishion, Thomas J; Brown, Kimbree; Ramos, Karina; Knoble, Naomi B; Shaw, Daniel S; Wilson, Melvin N
2016-01-01
The valid and reliable assessment of fidelity is critical at all stages of intervention research and is particularly germane to interpreting the results of efficacy and implementation trials. Ratings of protocol adherence typically are reliable, but ratings of therapist competence are plagued by low reliability. Because family context and case conceptualization guide the therapist's delivery of interventions, the reliability of fidelity ratings might be improved if the coder is privy to client context in the form of an ecological assessment. We conducted a randomized experiment to test this hypothesis. A subsample of 46 families with 5-year-old children from a multisite randomized trial who participated in the feedback session of the Family Check-Up (FCU) intervention were selected. We randomly assigned FCU feedback sessions to be rated for fidelity to the protocol using the COACH rating system either after the coder reviewed the results of a recent ecological assessment or had not. Inter-rater reliability estimates of fidelity ratings were meaningfully higher for the assessment information condition compared to the no-information condition. Importantly, the reliability of the COACH mean score was found to be statistically significantly higher in the information condition. These findings suggest that the reliability of observational ratings of fidelity, particularly when the competence or quality of delivery is considered, could be improved by providing assessment data to the coders. Our findings might be most applicable to assessment-driven interventions, where assessment data explicitly guides therapist's selection of intervention strategies tailored to the family's context and needs, but they could also apply to other intervention programs and observational coding of context-dependent therapy processes, such as the working alliance.
Smith, Justin D.; Dishion, Thomas J.; Brown, Kimbree; Ramos, Karina; Knoble, Naomi B.; Shaw, Daniel S.; Wilson, Melvin N.
2015-01-01
The valid and reliable assessment of fidelity is critical at all stages of intervention research and is particularly germane to interpreting the results of efficacy and implementation trials. Ratings of protocol adherence typically are reliable, but ratings of therapist competence are plagued by low reliability. Because family context and case conceptualization guide the therapist's delivery of interventions, the reliability of fidelity ratings might be improved if the coder is privy to client context in the form of an ecological assessment. We conducted a randomized experiment to test this hypothesis. A subsample of 46 families with 5-year-old children from a multisite randomized trial who participated in the feedback session of the Family Check-Up (FCU) intervention were selected. We randomly assigned FCU feedback sessions to be rated for fidelity to the protocol using the COACH rating system either after the coder reviewed the results of a recent ecological assessment or had not. Inter-rater reliability estimates of fidelity ratings were meaningfully higher for the assessment information condition compared to the no-information condition. Importantly, the reliability of the COACH mean score was found to be statistically significantly higher in the information condition. These findings suggest that the reliability of observational ratings of fidelity, particularly when the competence or quality of delivery is considered, could be improved by providing assessment data to the coders. Our findings might be most applicable to assessment-driven interventions, where assessment data explicitly guides therapist's selection of intervention strategies tailored to the family's context and needs, but they could also apply to other intervention programs and observational coding of context-dependent therapy processes, such as the working alliance. PMID:26271300
DOT National Transportation Integrated Search
2016-09-01
Travel time and travel-time reliability have been used as performance measures to evaluate traffic system conditions and develop advanced traveler information and traffic management systems. The objectives of this research were to: - Quantify s...
Using generalizability theory to develop clinical assessment protocols.
Preuss, Richard A
2013-04-01
Clinical assessment protocols must produce data that are reliable, with a clinically attainable minimal detectable change (MDC). In a reliability study, generalizability theory has 2 advantages over classical test theory. These advantages provide information that allows assessment protocols to be adjusted to match individual patient profiles. First, generalizability theory allows the user to simultaneously consider multiple sources of measurement error variance (facets). Second, it allows the user to generalize the findings of the main study across the different study facets and to recalculate the reliability and MDC based on different combinations of facet conditions. In doing so, clinical assessment protocols can be chosen based on minimizing the number of measures that must be taken to achieve a realistic MDC, using repeated measures to minimize the MDC, or simply based on the combination that best allows the clinician to monitor an individual patient's progress over a specified period of time.
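The idea of recalculating reliability and MDC for different facet conditions can be sketched for the simplest case. This is an illustration under stated assumptions, not the protocol from the paper: a one-facet design (persons crossed with trials), hypothetical variance components, and the function name `d_study` are all invented for the example.

```python
import math

def d_study(var_person: float, var_residual: float, n_trials: int):
    """Decision study for a persons x trials design.

    Averaging over n_trials divides the residual (error) variance by n_trials.
    Returns (generalizability coefficient, MDC at the 95% level).
    """
    rel_error = var_residual / n_trials
    g_coef = var_person / (var_person + rel_error)
    mdc95 = 1.96 * math.sqrt(2.0) * math.sqrt(rel_error)
    return g_coef, mdc95

# Hypothetical variance components from an imagined generalizability study.
var_person, var_residual = 4.0, 1.0
for n in (1, 2, 4):
    g, mdc = d_study(var_person, var_residual, n)
    print(f"trials={n}: G={g:.3f}, MDC95={mdc:.2f}")
```

In this simplified setting, a clinician could pick the smallest number of trials whose MDC falls below the change they realistically expect to see in a patient.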
NASA Technical Reports Server (NTRS)
Bean, E. E.; Bloomquist, C. E.
1972-01-01
A summary of the KSC program for investigating the reliability aspects of the ground support activities is presented. An analysis of unsatisfactory condition reports (URC) and the generation of reliability assessments of components based on the URC are discussed, along with the design considerations for attaining reliable real-time hardware/software configurations.
Advanced Reactor Passive System Reliability Demonstration Analysis for an External Event
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bucknor, Matthew D.; Grabaskas, David; Brunett, Acacia J.
2016-01-01
Many advanced reactor designs rely on passive systems to fulfill safety functions during accident sequences. These systems depend heavily on boundary conditions to induce a motive force, meaning the system can fail to operate as intended due to deviations in boundary conditions, rather than as the result of physical failures. Furthermore, passive systems may operate in intermediate or degraded modes. These factors make passive system operation difficult to characterize within a traditional probabilistic framework that only recognizes discrete operating modes and does not allow for the explicit consideration of time-dependent boundary conditions. Argonne National Laboratory has been examining various methodologies for assessing passive system reliability within a probabilistic risk assessment for a station blackout event at an advanced small modular reactor. This paper provides an overview of a passive system reliability demonstration analysis for an external event. Centering on an earthquake with the possibility of site flooding, the analysis focuses on the behavior of the passive reactor cavity cooling system following potential physical damage and system flooding. The assessment approach seeks to combine mechanistic and simulation-based methods to leverage the benefits of the simulation-based approach without the need to substantially deviate from conventional probabilistic risk assessment techniques. While this study is presented as only an example analysis, the results appear to demonstrate a high level of reliability for the reactor cavity cooling system (and the reactor system in general) to the postulated transient event.
Advanced Reactor Passive System Reliability Demonstration Analysis for an External Event
Bucknor, Matthew; Grabaskas, David; Brunett, Acacia J.; ...
2017-01-24
We report that many advanced reactor designs rely on passive systems to fulfill safety functions during accident sequences. These systems depend heavily on boundary conditions to induce a motive force, meaning the system can fail to operate as intended because of deviations in boundary conditions, rather than as the result of physical failures. Furthermore, passive systems may operate in intermediate or degraded modes. These factors make passive system operation difficult to characterize within a traditional probabilistic framework that only recognizes discrete operating modes and does not allow for the explicit consideration of time-dependent boundary conditions. Argonne National Laboratory has been examining various methodologies for assessing passive system reliability within a probabilistic risk assessment for a station blackout event at an advanced small modular reactor. This paper provides an overview of a passive system reliability demonstration analysis for an external event. Considering an earthquake with the possibility of site flooding, the analysis focuses on the behavior of the passive Reactor Cavity Cooling System following potential physical damage and system flooding. The assessment approach seeks to combine mechanistic and simulation-based methods to leverage the benefits of the simulation-based approach without the need to substantially deviate from conventional probabilistic risk assessment techniques. Lastly, although this study is presented as only an example analysis, the results appear to demonstrate a high level of reliability of the Reactor Cavity Cooling System (and the reactor system in general) for the postulated transient event.
Guo, Yiting Emily; Togher, Leanne; Power, Emma; Hutomo, Edwin; Yang, Yi-Fei; Tay, Arthur; Yen, Shih-Cheng; Koh, Gerald Choon-Huat
2017-04-01
Access2Aphasia™ is an iPad™-based aphasia assessment application that enables real-time audiovisual communication between people with aphasia (PWA) and speech-language pathologists (SLPs), and the use of supported conversation techniques. This study aimed to establish the reliability of aphasia assessment across the International Classification of Functioning, Disability and Health (ICF) using Access2Aphasia, and compare it with face-to-face (FTF) assessment. Consumer perspectives of Access2Aphasia were also examined. Thirty PWA were randomized into two conditions: online-led and FTF assessment. Participants in the online-led group were assessed remotely using Access2Aphasia™ in their own homes, while an FTF SLP scored silently simultaneously. Participants in the FTF group were assessed FTF using standard administration materials. Assessment included two subtests of the Psycholinguistic Assessment of Language Processing Activities (PALPA) and the Assessment of Living with Aphasia (ALA) to allow for outcomes to be captured across the ICF domains. Consumer perspectives on Access2Aphasia were obtained from both PWA and research SLPs in the online-led group. Kappa statistics indicated moderate to almost perfect agreement between online and FTF SLPs (k = 0.71-1.00). Intrarater and interrater reliability was excellent (ICC = 0.99-1.00) and equivalent for the online-led and FTF conditions. Both PWA and research SLPs in the online-led group reported being satisfied with the experience overall, with suggestions provided by research SLPs to improve Access2Aphasia. This study supports the provision of iPad-based aphasia assessments across the ICF in the online environment, with comparable reliability to FTF assessments. Future research is warranted to support the development of iPad-based aphasia assessment and treatment as an alternative mode of service delivery to PWA.
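Agreement statistics of the kind reported above can be computed along the following lines. This is a minimal sketch assuming unweighted Cohen's kappa for two raters scoring the same items; the abstract does not say which kappa variant was used, and the item scores below are invented.

```python
from collections import Counter

def cohens_kappa(rater_a, rater_b):
    """Unweighted Cohen's kappa for two raters scoring the same items.

    Undefined (division by zero) when chance agreement equals 1,
    i.e. both raters use a single identical category throughout.
    """
    n = len(rater_a)
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    count_a, count_b = Counter(rater_a), Counter(rater_b)
    # Chance agreement: product of each rater's marginal proportions per category.
    expected = sum(count_a[c] * count_b[c] for c in count_a) / (n * n)
    return (observed - expected) / (1.0 - expected)

# Hypothetical item-level scores (1 = correct, 0 = incorrect), not study data.
online_slp = [1, 1, 0, 1, 0, 1, 1, 0]
ftf_slp    = [1, 1, 0, 1, 1, 1, 1, 0]
print(round(cohens_kappa(online_slp, ftf_slp), 3))
```

Kappa corrects raw percentage agreement for the agreement expected by chance, which is why it can be well below the percentage of matching scores when one category dominates.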
Achieving Reliable Communication in Dynamic Emergency Responses
Chipara, Octav; Plymoth, Anders N.; Liu, Fang; Huang, Ricky; Evans, Brian; Johansson, Per; Rao, Ramesh; Griswold, William G.
2011-01-01
Emergency responses require the coordination of first responders to assess the condition of victims, stabilize their condition, and transport them to hospitals based on the severity of their injuries. WIISARD is a system designed to facilitate the collection of medical information and its reliable dissemination during emergency responses. A key challenge in WIISARD is to deliver data with high reliability as first responders move and operate in a dynamic radio environment fraught with frequent network disconnections. The initial WIISARD system employed a client-server architecture and an ad-hoc routing protocol was used to exchange data. The system had low reliability when deployed during emergency drills. In this paper, we identify the underlying causes of unreliability and propose a novel peer-to-peer architecture that in combination with a gossip-based communication protocol achieves high reliability. Empirical studies show that compared to the initial WIISARD system, the redesigned system improves reliability by as much as 37% while reducing the number of transmitted packets by 23%. PMID:22195075
TVA-Based Assessment of Visual Attention Using Line-Drawings of Fruits and Vegetables
Wang, Tianlu; Gillebert, Celine R.
2018-01-01
Visuospatial attention and short-term memory allow us to prioritize, select, and briefly maintain part of the visual information that reaches our senses. These cognitive abilities are quantitatively accounted for by Bundesen’s theory of visual attention (TVA; Bundesen, 1990). Previous studies have suggested that TVA-based assessments are sensitive to inter-individual differences in spatial bias, visual short-term memory capacity, top-down control, and processing speed in healthy volunteers as well as in patients with various neurological and psychiatric conditions. However, most neuropsychological assessments of attention and executive functions, including TVA-based assessment, make use of alphanumeric stimuli and/or are performed verbally, which can pose difficulties for individuals who have trouble processing letters or numbers. Here we examined the reliability of TVA-based assessments when the stimuli are not alphanumeric but are instead line-drawings of fruits and vegetables. We compared five TVA parameters quantifying the aforementioned cognitive abilities, obtained by modeling accuracy data on a whole/partial report paradigm using conventional alphabet stimuli versus the food stimuli. Significant correlations were found for all TVA parameters, indicating a high parallel-form reliability. Split-half correlations assessing internal reliability, and correlations between predicted and observed data assessing goodness-of-fit were both significant. Our results provide an indication that line-drawings of fruits and vegetables can be used for a reliable assessment of attention and short-term memory. PMID:29535660
The Effect of Raters and Rating Conditions on the Reliability of the Missionary Teaching Assessment
ERIC Educational Resources Information Center
Ure, Abigail C.
2011-01-01
This study investigated how 2 different rating conditions, the controlled rating condition (CRC) and the uncontrolled rating condition (URC), affected rater behavior and the reliability of a performance assessment (PA) known as the Missionary Teaching Assessment (MTA). The CRC gives raters the capability to manipulate (pause, rewind, fast-forward)…
NASA Astrophysics Data System (ADS)
Yu, Bo; Ning, Chao-lie; Li, Bing
2017-03-01
A probabilistic framework for durability assessment of concrete structures in marine environments was proposed in terms of reliability and sensitivity analysis, which takes into account the uncertainties under the environmental, material, structural and executional conditions. A time-dependent probabilistic model of chloride ingress was established first to consider the variations in various governing parameters, such as the chloride concentration, chloride diffusion coefficient, and age factor. Then the Nataf transformation was adopted to transform the non-normal random variables from the original physical space into the independent standard Normal space. After that the durability limit state function and its gradient vector with respect to the original physical parameters were derived analytically, based on which the first-order reliability method was adopted to analyze the time-dependent reliability and parametric sensitivity of concrete structures in marine environments. The accuracy of the proposed method was verified by comparing with the second-order reliability method and the Monte Carlo simulation. Finally, the influences of environmental conditions, material properties, structural parameters and execution conditions on the time-dependent reliability of concrete structures in marine environments were also investigated. The proposed probabilistic framework can be implemented in the decision-making algorithm for the maintenance and repair of deteriorating concrete structures in marine environments.
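The abstract above checks its first-order reliability results against Monte Carlo simulation. The sketch below illustrates only that Monte Carlo check, in a deliberately simplified form: it assumes a constant diffusion coefficient (no age factor), independent random variables (so no Nataf transformation is needed), and the error-function solution of Fick's second law. All distributions, parameter values and function names are hypothetical and are not taken from the paper.

```python
import math
import random

def chloride_at_depth(cs: float, d: float, x: float, t: float) -> float:
    """Fick's second law with constant surface concentration:
    C(x, t) = Cs * (1 - erf(x / (2 * sqrt(D * t))))."""
    return cs * (1.0 - math.erf(x / (2.0 * math.sqrt(d * t))))

def failure_probability(t_years: float, n: int = 20000, seed: int = 0) -> float:
    """Probability that chloride at the rebar depth reaches the critical level."""
    rng = random.Random(seed)  # fixed seed so every time point reuses the same samples
    failures = 0
    for _ in range(n):
        cs = rng.lognormvariate(math.log(3.0), 0.3)   # surface chloride (hypothetical)
        d = rng.lognormvariate(math.log(20.0), 0.3)   # diffusion coefficient, mm^2/year
        cover = rng.gauss(50.0, 5.0)                  # concrete cover depth, mm
        c_crit = rng.gauss(0.9, 0.1)                  # critical chloride content
        # Limit state g = c_crit - C(cover, t); failure when g <= 0.
        if chloride_at_depth(cs, d, cover, t_years) >= c_crit:
            failures += 1
    return failures / n

for t in (10, 30, 50):
    print(t, failure_probability(t))
```

Because the same seed is reused, the estimated failure probability cannot decrease as the exposure time grows, which mirrors the time-dependent behaviour the framework is meant to capture.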
ERIC Educational Resources Information Center
Byars, Alvin Gregg
The objectives of this investigation are to develop, describe, assess, and demonstrate procedures for constructing mastery tests to minimize errors of classification and to maximize decision reliability. The guidelines are based on conditions where item exchangeability is a reasonable assumption and the test constructor can control the number of…
Smith, Erin; Walsh, Lorcan; Doyle, Julie; Greene, Barry; Blake, Catherine
2016-01-01
The timed up and go (TUG) test is a commonly used assessment in older people, with variations including the addition of a motor or cognitive dual task; however, in high-functioning older adults it is more difficult to assess change. The quantified TUG (QTUG) uses inertial sensors to detect test and gait parameters during the test. If it is to be used in the longitudinal assessment of older adults, it is important that we know which parameters are reliable and under which conditions. This study aims to examine the relative reliability of the QTUG over five consecutive days under single, motor and cognitive dual-task conditions. Twelve community-dwelling older adults (10 females, mean age 74.17 (3.88)) performed the QTUG under three conditions for five consecutive days. The relative reliability of each of the gait parameters was assessed using intra-class correlation coefficient (ICC 3,1) and standard error of measurement (SEM). Five of the measures demonstrated excellent reliability (ICC>0.70) under all three conditions (time to complete test, walk time, number of gait cycles, number of steps and return from turn time). Measures of variability and turn-derived parameters demonstrated weak reliability under all three conditions (ICC=0.05-0.49). For the most reliable parameters under single-task conditions, the addition of a cognitive task resulted in a reduction in reliability, suggesting caution when interpreting results under these conditions. Certain sensor-derived parameters during the QTUG test may provide an additional resource in the longitudinal assessment of older people and earlier identification of falls risk. Copyright © 2015 Elsevier B.V. All rights reserved.
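As a rough illustration of the two statistics named in this abstract, the sketch below computes ICC(3,1) (two-way mixed effects, consistency, single measurement) from ANOVA mean squares, and the SEM as SD × sqrt(1 − ICC). The function names and the pooled-SD convention for the SEM are assumptions; the study may have used different software or conventions.

```python
from math import sqrt


def icc_3_1(scores):
    """ICC(3,1) for a table with one row per subject and one column per session."""
    n = len(scores)          # subjects
    k = len(scores[0])       # repeated sessions
    grand = sum(sum(row) for row in scores) / (n * k)
    row_means = [sum(row) / k for row in scores]
    col_means = [sum(scores[i][j] for i in range(n)) / n for j in range(k)]

    ss_total = sum((x - grand) ** 2 for row in scores for x in row)
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)   # between subjects
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)   # between sessions
    ss_err = ss_total - ss_rows - ss_cols                    # residual

    ms_rows = ss_rows / (n - 1)
    ms_err = ss_err / ((n - 1) * (k - 1))
    return (ms_rows - ms_err) / (ms_rows + (k - 1) * ms_err)


def standard_error_of_measurement(scores, icc):
    """SEM = SD * sqrt(1 - ICC), using the SD of all pooled scores (an assumption)."""
    values = [x for row in scores for x in row]
    mean = sum(values) / len(values)
    sd = sqrt(sum((x - mean) ** 2 for x in values) / (len(values) - 1))
    return sd * sqrt(max(0.0, 1.0 - icc))  # guard against tiny negative round-off


if __name__ == "__main__":
    # Hypothetical walk times (s) for four subjects over three days.
    demo = [[9.8, 10.1, 9.9], [12.4, 12.0, 12.3], [8.7, 8.9, 8.8], [11.0, 11.4, 11.1]]
    icc = icc_3_1(demo)
    print(icc, standard_error_of_measurement(demo, icc))
```

Because ICC(3,1) measures consistency, a constant shift between sessions (for example, everyone being slightly faster on day two) does not lower the coefficient.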
Invited review: Animal-based indicators for on-farm welfare assessment for dairy goats.
Battini, M; Vieira, A; Barbieri, S; Ajuda, I; Stilwell, G; Mattiello, S
2014-11-01
This paper reviews animal-based welfare indicators to develop a valid, reliable, and feasible on-farm welfare assessment protocol for dairy goats. The indicators were considered in the light of the 4 accepted principles (good feeding, good housing, good health, appropriate behavior) subdivided into 12 criteria developed by the European Welfare Quality program. We will only examine the practical indicators to be used on-farm, excluding those requiring the use of specific instruments or laboratory analysis and those that are recorded at the slaughterhouse. Body condition score, hair coat condition, and queuing at the feed barrier or at the drinker seem the most promising indicators for the assessment of the "good feeding" principle. As to "good housing," some indicators were considered promising for assessing "comfort around resting" (e.g., resting in contact with a wall) or "thermal comfort" (e.g., panting score for the detection of heat stress and shivering score for the detection of cold stress). Several indicators related to "good health," such as lameness, claw overgrowth, presence of external abscesses, and hair coat condition, were identified. As to the "appropriate behavior" principle, different criteria have been identified: agonistic behavior is largely used as the "expression of social behavior" criterion, but it is often not feasible for on-farm assessment. Latency to first contact and the avoidance distance test can be used as criteria for assessing the quality of the human-animal relationship. Qualitative behavior assessment seems to be a promising indicator for addressing the "positive emotional state" criterion. Promising indicators were identified for most of the considered criteria; however, no valid indicator has been identified for "expression of other behaviors." 
Interobserver reliability has rarely been assessed and warrants further attention; in contrast, short-term intraobserver reliability is frequently assessed and some studies consider mid- and long-term reliability. The feasibility of most of the reviewed indicators in commercial farms still needs to be carefully evaluated, as several studies were performed under experimental conditions. Our review highlights some aspects of goat welfare that have been widely studied, but some indicators need to be investigated further and drafted before being included in a valid, reliable, and feasible welfare assessment protocol. The indicators selected and examined may be an invaluable starting point for the development of an on-farm welfare assessment protocol for dairy goats. Copyright © 2014 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
Reliability modelling and analysis of thermal MEMS
NASA Astrophysics Data System (ADS)
Muratet, Sylvaine; Lavu, Srikanth; Fourniols, Jean-Yves; Bell, George; Desmulliez, Marc P. Y.
2006-04-01
This paper presents a MEMS reliability study methodology based on the novel concept of 'virtual prototyping'. This methodology can be used for the development of reliable sensors or actuators and also to characterize their behaviour in specific use conditions and applications. The methodology is demonstrated on the U-shaped micro electro thermal actuator used as test vehicle. To demonstrate this approach, a 'virtual prototype' has been developed with the modeling tools MatLab and VHDL-AMS. A best practice FMEA (Failure Mode and Effect Analysis) is applied on the thermal MEMS to investigate and assess the failure mechanisms. Reliability study is performed by injecting the identified defaults into the 'virtual prototype'. The reliability characterization methodology predicts the evolution of the behavior of these MEMS as a function of the number of cycles of operation and specific operational conditions.
The Infant Motor Profile: A Standardized and Qualitative Method to Assess Motor Behaviour in Infancy
ERIC Educational Resources Information Center
Heineman, Kirsten R.; Bos, Arend F.; Hadders-Algra, Mijna
2008-01-01
A reliable and valid instrument to assess neuromotor condition in infancy is a prerequisite for early detection of developmental motor disorders. We developed a video-based assessment of motor behaviour, the Infant Motor Profile (IMP), to evaluate motor abilities, movement variability, ability to select motor strategies, movement symmetry, and…
Silsupadol, Patima; Teja, Kunlanan; Lugade, Vipul
2017-10-01
The assessment of spatiotemporal gait parameters is a useful clinical indicator of health status. Unfortunately, most assessment tools require controlled laboratory environments which can be expensive and time consuming. As smartphones with embedded sensors are becoming ubiquitous, this technology can provide a cost-effective, easily deployable method for assessing gait. Therefore, the purpose of this study was to assess the reliability and validity of a smartphone-based accelerometer in quantifying spatiotemporal gait parameters when attached to the body or in a bag, belt, hand, and pocket. Thirty-four healthy adults were asked to walk at self-selected comfortable, slow, and fast speeds over a 10-m walkway while carrying a smartphone. Step length, step time, gait velocity, and cadence were computed from smartphone-based accelerometers and validated with GAITRite. Across all walking speeds, smartphone data had excellent reliability (ICC 2,1 ≥0.90) for the body and belt locations, with bag, hand, and pocket locations having good to excellent reliability (ICC 2,1 ≥0.69). Correlations between the smartphone-based and GAITRite-based systems were very high for the body (r=0.89, 0.98, 0.96, and 0.87 for step length, step time, gait velocity, and cadence, respectively). Similarly, Bland-Altman analysis demonstrated that the bias approached zero, particularly in the body, bag, and belt conditions under comfortable and fast speeds. Thus, smartphone-based assessments of gait are most valid when placed on the body, in a bag, or on a belt. The use of a smartphone to assess gait can provide relevant data to clinicians without encumbering the user and allow for data collection in the free-living environment. Copyright © 2017 Elsevier B.V. All rights reserved.
Phythian, C J; Toft, N; Cripps, P J; Michalopoulou, E; Winter, A C; Jones, P H; Grove-White, D; Duncan, J S
2013-07-01
A scientific literature review and consensus of expert opinion used the welfare definitions provided by the Farm Animal Welfare Council (FAWC) Five Freedoms as the framework for selecting a set of animal-based indicators that were sensitive to the current on-farm welfare issues of young lambs (aged ≤ 6 weeks). Ten animal-based indicators assessed by observation (demeanour, response to stimulation, shivering, standing ability, posture, abdominal fill, body condition, lameness, eye condition and salivation) were tested as part of the objective of developing valid, reliable and feasible animal-based measures of lamb welfare. The indicators were independently tested on 966 young lambs from 17 sheep flocks across Northwest England and Wales during December 2008 to April 2009 by four trained observers. Inter-observer reliability was assessed using Fleiss's kappa (κ), and the pair-wise agreement with an experienced observer designated as the 'test standard observer' (TSO) was examined using Cohen's κ. Latent class analysis (LCA) estimated the sensitivity (Se) and specificity (Sp) of each observer without assuming a gold standard and predicted the Se and Sp of randomly selected observers who may apply the indicators in the future. Overall, good levels of inter-observer reliability, and high levels of Sp were identified for demeanour (κ = 0.54, Se ≥ 0.70, Sp ≥ 0.98), stimulation (κ = 0.57, Se = 0.30 to 0.77, Sp ≥ 0.98), shivering (κ = 0.55, Se = 0.37 to 0.85, Sp ≥ 0.99), standing ability (κ = 0.54, Se ≥ 0.80, Sp ≥ 0.99), posture (κ = 0.45, Se ≥ 0.56, Sp = 0.99), abdominal fill (κ = 0.44, Se = 0.39 to 0.98, Sp = 0.99), body condition (κ = 0.72, Se ≥ 0.38 to 0.90, Sp = 0.99), lameness (κ = 0.68, Se > 0.73, Sp = 1.00), and eye condition (κ = 0.72, Se ≥ 0.86, Sp = 0.99). LCA predicted that randomly selected observers had Se > 0.77 (acceptable), and Sp ≥ 0.98 (high) for assessments of demeanour, lameness, abdominal fill, posture, body condition and eye condition. 
The diagnostic performance of some indicators was influenced by the composition of the study population, and it would be useful to test the indicators on lambs with a greater level of outcomes associated with poor welfare. The findings presented in this paper could be applied in the selection of valid, reliable and feasible indicators used for the purposes of on-farm assessments of lamb welfare.
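The pair-wise agreement statistic mentioned in this abstract, Cohen's κ, can be sketched as follows. This is a minimal illustration for two observers scoring the same animals on a categorical indicator; the example ratings are invented, and the sketch covers neither Fleiss's κ nor the latent class analysis.

```python
from collections import Counter


def cohen_kappa(rater_a, rater_b):
    """Cohen's kappa: chance-corrected agreement between two raters."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("ratings must be non-empty and of equal length")
    n = len(rater_a)
    # Observed proportion of items on which the raters agree.
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Agreement expected by chance from each rater's marginal frequencies.
    count_a, count_b = Counter(rater_a), Counter(rater_b)
    categories = set(count_a) | set(count_b)
    expected = sum(count_a[c] * count_b[c] for c in categories) / (n * n)
    if expected == 1.0:
        raise ValueError("kappa is undefined when both raters use one category only")
    return (observed - expected) / (1.0 - expected)


if __name__ == "__main__":
    # Hypothetical lameness scores (1 = lame, 0 = sound) from two observers.
    observer = [0, 0, 1, 0, 1, 0, 0, 1, 0, 0]
    test_standard = [0, 0, 1, 0, 0, 0, 0, 1, 0, 1]
    print(cohen_kappa(observer, test_standard))
```

κ equals 1 for perfect agreement and 0 when agreement is no better than chance, which is why it is preferred over raw percentage agreement for indicators with low prevalence.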
Pruitt, Sandi L; Jeffe, Donna B; Yan, Yan; Schootman, Mario
2012-04-01
Limited psychometric research has examined the reliability of self-reported measures of neighbourhood conditions, the effect of measurement error on associations between neighbourhood conditions and health, and potential differences in the reliabilities between neighbourhood strata (urban vs rural and low vs high poverty). We assessed overall and stratified reliability of self-reported perceived neighbourhood conditions using five scales (social and physical disorder, social control, social cohesion, fear) and four single items (multidimensional neighbouring). We also assessed measurement error-corrected associations of these conditions with self-rated health. Using random-digit dialling, 367 women without breast cancer (matched controls from a larger study) were interviewed twice, 2-3 weeks apart. Test-retest (intraclass correlation coefficients (ICC)/weighted κ) and internal consistency reliability (Cronbach's α) were assessed. Differences in reliability across neighbourhood strata were tested using bootstrap methods. Regression calibration corrected estimates for measurement error. All measures demonstrated satisfactory internal consistency (α ≥ 0.70) and either moderate (ICC/κ=0.41-0.60) or substantial (ICC/κ=0.61-0.80) test-retest reliability in the full sample. Internal consistency did not differ by neighbourhood strata. Test-retest reliability was significantly lower among rural (vs urban) residents for two scales (social control, physical disorder) and two multidimensional neighbouring items; test-retest reliability was higher for physical disorder and lower for one multidimensional neighbouring item among the high (vs low) poverty strata. After measurement error correction, the magnitude of associations between neighbourhood conditions and self-rated health were larger, particularly in the rural population. Research is needed to develop and test reliable measures of perceived neighbourhood conditions relevant to the health of rural populations.
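For readers unfamiliar with the internal consistency statistic reported here, the sketch below computes Cronbach's α from a respondent-by-item table using the standard formula α = k/(k−1) × (1 − Σ item variances / variance of total scores). The function name and example data are hypothetical.

```python
from statistics import variance


def cronbach_alpha(responses):
    """Cronbach's alpha for a table with one row per respondent, one column per item."""
    k = len(responses[0])                       # number of items in the scale
    if k < 2 or len(responses) < 2:
        raise ValueError("need at least two items and two respondents")
    # Sample variance of each item across respondents.
    item_vars = [variance([row[j] for row in responses]) for j in range(k)]
    # Sample variance of each respondent's total scale score.
    total_var = variance([sum(row) for row in responses])
    return (k / (k - 1)) * (1.0 - sum(item_vars) / total_var)


if __name__ == "__main__":
    # Hypothetical 4-item scale answered by five respondents.
    scale = [
        [3, 4, 3, 4],
        [2, 2, 3, 2],
        [5, 4, 5, 5],
        [1, 2, 1, 2],
        [4, 4, 4, 3],
    ]
    print(cronbach_alpha(scale))
```

The α ≥ 0.70 threshold used in the abstract is a common convention for "satisfactory" internal consistency.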
Scharmanski, Sara; Renner, Ilona
2016-12-01
Health professionals in early childhood intervention and prevention make an important contribution by helping burdened families with young children cope with everyday life and child raising issues. A prerequisite for success is the health professionals' ability to tailor their services to the specific needs of families. The "Systematic Exploration and Process Inventory for health professionals in early childhood intervention services (SEVG)" can be used to identify each family's individual resources and needs, enabling a valid, reliable and objective assessment of the conditions and the process of counseling service. The present paper presents the statistical analyses that were used to confirm the reliability of the inventory. Based on the results of the reliability analysis and principal component analysis (PCA), the SEVG seems to be a reliable and objective inventory for assessing families' need for support. It also allows for calculation of average values of each scale. The development of valid and reliable assessments is essential to quality assurance and the professionalization of interventions in early childhood service. Copyright © 2016. Published by Elsevier GmbH.
Bader, Michael D. M.; Mooney, Stephen J.; Lee, Yeon Jin; Sheehan, Daniel; Neckerman, Kathryn M.; Rundle, Andrew G.; Teitler, Julien O.
2014-01-01
Public health research has shown that neighborhood conditions are associated with health behaviors and outcomes. Systematic neighborhood audits have helped researchers measure neighborhood conditions that they deem theoretically relevant but not available in existing administrative data. Systematic audits, however, are expensive to conduct and rarely comparable across geographic regions. We describe the development of an online application, the Computer Assisted Neighborhood Visual Assessment System (CANVAS), that uses Google Street View to conduct virtual audits of neighborhood environments. We use this system to assess the inter-rater reliability of 187 items related to walkability and physical disorder on a national sample of 150 street segments in the United States. We find that many items are reliably measured across auditors using CANVAS and that agreement between auditors appears to be uncorrelated with neighborhood demographic characteristics. Based on our results we conclude that Google Street View and CANVAS offer opportunities to develop greater comparability across neighborhood audit studies. PMID:25545769
Lai, J.-S.; Nowinski, C.J.; Victorson, D.; Peterman, A.; Miller, D.; Bethoux, F.; Heinemann, A.; Rubin, S.; Cavazos, J.E.; Reder, A.T.; Sufit, R.; Simuni, T.; Holmes, G.L.; Siderowf, A.; Wojna, V.; Bode, R.; McKinney, N.; Podrabsky, T.; Wortman, K.; Choi, S.; Gershon, R.; Rothrock, N.; Moy, C.
2012-01-01
Objective: To address the need for brief, reliable, valid, and standardized quality of life (QOL) assessment applicable across neurologic conditions. Methods: Drawing from larger calibrated item banks, we developed short measures (8–9 items each) of 13 different QOL domains across physical, mental, and social health and evaluated their validity and reliability. Three samples were utilized during short form development: general population (Internet-based, n = 2,113); clinical panel (Internet-based, n = 553); and clinical outpatient (clinic-based, n = 581). All short forms are expressed as T scores with a mean of 50 and SD of 10. Results: Internal consistency (Cronbach α) of the 13 short forms ranged from 0.85 to 0.97. Correlations between short form and full-length item bank scores ranged from 0.88 to 0.99 (0.82–0.96 after removing common items from banks). Online respondents were asked whether they had any of 19 different chronic health conditions, and whether or not those reported conditions interfered with ability to function normally. All short forms, across physical, mental, and social health, were able to separate people who reported no health condition from those who reported 1–2 or 3 or more. In addition, scores on all 13 domains were worse for people who acknowledged being limited by the health conditions they reported, compared to those who reported conditions but were not limited by them. Conclusion: These 13 brief measures of self-reported QOL are reliable and show preliminary evidence of concurrent validity inasmuch as they differentiate people based upon number of reported health conditions and whether those reported conditions impede normal function. PMID:22573626
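The T-score metric mentioned in this abstract (mean 50, SD 10) is a linear rescaling of standardized scores. The sketch below shows only that rescaling, assuming a reference mean and SD are available; the scoring behind the published short forms is based on calibrated item banks and is not reproduced here.

```python
from statistics import mean, pstdev


def to_t_scores(raw_scores, ref_mean=None, ref_sd=None):
    """Rescale scores so the reference population has mean 50 and SD 10.

    If no reference values are given, the sample's own mean and population SD
    are used (an assumption made for illustration).
    """
    if ref_mean is None:
        ref_mean = mean(raw_scores)
    if ref_sd is None:
        ref_sd = pstdev(raw_scores)
    if ref_sd == 0:
        raise ValueError("reference SD must be non-zero")
    return [50.0 + 10.0 * (x - ref_mean) / ref_sd for x in raw_scores]


if __name__ == "__main__":
    print(to_t_scores([12, 15, 18, 21, 24]))
```

On this metric a score of 60 lies one standard deviation above the reference mean, which makes domains with different raw ranges directly comparable.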
Temporary threshold shift after impulse-noise during video game play: laboratory data.
Spankovich, C; Griffiths, S K; Lobariñas, E; Morgenstein, K E; de la Calle, S; Ledon, V; Guercio, D; Le Prell, C G
2014-03-01
Prevention of temporary threshold shift (TTS) after laboratory-based exposure to pure-tones, broadband noise, and narrowband noise signals has been achieved, but prevention of TTS under these experimental conditions may not accurately reflect protection against hearing loss following impulse noise. This study used a controlled laboratory-based TTS paradigm that incorporated impulsive stimuli into the exposure protocol; development of this model could provide a novel platform for assessing proposed therapeutics. Participants played a video game that delivered gunfire-like sound through headphones as part of a target practice game. Effects were measured using audiometric threshold evaluations and distortion product otoacoustic emissions (DPOAEs). The sound level and number of impulses presented were sequentially increased throughout the study. Participants were normal-hearing students at the University of Florida who provided written informed consent prior to participation. TTS was not reliably induced by any of the exposure conditions assessed here. However, there was significant individual variability, and a subset of subjects showed TTS under some exposure conditions. A subset of participants demonstrated reliable threshold shifts under some conditions. Additional experiments are needed to better understand and optimize stimulus parameters that influence TTS after simulated impulse noise.
Temporary threshold shift after impulse-noise during video game play: Laboratory data
Spankovich, C.; Griffiths, S. K.; Lobariñas, E.; Morgenstein, K.E.; de la Calle, S.; Ledon, V.; Guercio, D.; Le Prell, C.G.
2015-01-01
Objective: Prevention of temporary threshold shift (TTS) after laboratory-based exposure to pure-tones, broadband noise, and narrow band noise signals has been achieved, but prevention of TTS under these experimental conditions may not accurately reflect protection against hearing loss following impulse noise. This study used a controlled laboratory-based TTS paradigm that incorporated impulsive stimuli into the exposure protocol; development of this model could provide a novel platform for assessing proposed therapeutics. Design: Participants played a video game that delivered gunfire-like sound through headphones as part of a target practice game. Effects were measured using audiometric threshold evaluations and distortion product otoacoustic emissions (DPOAEs). The sound level and number of impulses presented were sequentially increased throughout the study. Study sample: Participants were normal-hearing students at the University of Florida who provided written informed consent prior to participation. Results: TTS was not reliably induced by any of the exposure conditions assessed here. However, there was significant individual variability, and a subset of subjects showed TTS under some exposure conditions. Conclusions: A subset of participants demonstrated reliable threshold shifts under some conditions. Additional experiments are needed to better understand and optimize stimulus parameters that influence TTS after simulated impulse noise. PMID:24564694
The comprehensive care project: measuring physician performance in ambulatory practice.
Holmboe, Eric S; Weng, Weifeng; Arnold, Gerald K; Kaplan, Sherrie H; Normand, Sharon-Lise; Greenfield, Sheldon; Hood, Sarah; Lipner, Rebecca S
2010-12-01
To investigate the feasibility, reliability, and validity of comprehensively assessing physician-level performance in ambulatory practice. Ambulatory-based general internists in 13 states participated in the assessment. We assessed physician-level performance, adjusted for patient factors, on 46 individual measures, an overall composite measure, and composite measures for chronic, acute, and preventive care. Between- versus within-physician variation was quantified by intraclass correlation coefficients (ICC). External validity was assessed by correlating performance on a certification exam. Medical records for 236 physicians were audited for seven chronic and four acute care conditions, and six age- and gender-appropriate preventive services. Performance on the individual and composite measures varied substantially within (range 5-86 percent compliance on 46 measures) and between physicians (ICC range 0.12-0.88). Reliabilities for the composite measures were robust: 0.88 for chronic care and 0.87 for preventive services. Higher certification exam scores were associated with better performance on the overall (r = 0.19; p<.01), chronic care (r = 0.14, p = .04), and preventive services composites (r = 0.17, p = .01). Our results suggest that reliable and valid comprehensive assessment of the quality of chronic and preventive care can be achieved by creating composite measures and by sampling feasible numbers of patients for each condition. © Health Research and Educational Trust.
Reliability of a smartphone-based goniometer for knee joint goniometry.
Ferriero, Giorgio; Vercelli, Stefano; Sartorio, Francesco; Muñoz Lasa, Susana; Ilieva, Elena; Brigatti, Elisa; Ruella, Carolina; Foti, Calogero
2013-06-01
The aim of this study was to assess the reliability of a smartphone-based application developed for photographic-based goniometry, DrGoniometer (DrG), by comparing its measurement of the knee joint angle with that made by a universal goniometer (UG). Joint goniometry is a common mode of clinical assessment used in many disciplines, in particular in rehabilitation. One validated method is photographic-based goniometry, but the procedure is usually complex: the image has to be downloaded from the camera to a computer and then edited using dedicated software. This disadvantage may be overcome by the new generation of mobile phones (smartphones) that have computer-like functionality and an integrated digital camera. This validation study was carried out under two different controlled conditions: (i) with the participant to be measured held in a fixed position and (ii) with a battery of pictures to be assessed. In the first part, four raters performed repeated measurements with DrG and UG at different knee joint angles. Then, 10 other raters measured the knee at different flexion angles ranging from 20° to 145° on a battery of 35 pictures taken in a clinical setting. The results showed that inter-rater and intra-rater correlations were always more than 0.958. Agreement with the UG showed a width of 18.2° [95% limits of agreement (LoA)=-7.5/+10.7°] and 14.1° (LoA=-6.6/+7.5°). In conclusion, DrG seems to be a reliable method for measuring knee joint angle. This mHealth application can be an alternative/additional method of goniometry, easier to use than other photographic-based goniometric assessments. Further studies are required to assess its reliability for the measurement of other joints.
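The 95% limits of agreement quoted in this abstract follow the Bland-Altman approach: the mean difference between two methods (the bias) plus or minus 1.96 standard deviations of the differences. The sketch below is a minimal version with invented angle data; only the reported LoA values in the last helper come from the abstract.

```python
from statistics import mean, stdev


def limits_of_agreement(method_a, method_b, z=1.96):
    """Return (bias, lower, upper) for paired measurements from two methods."""
    if len(method_a) != len(method_b) or len(method_a) < 2:
        raise ValueError("need at least two paired measurements")
    diffs = [a - b for a, b in zip(method_a, method_b)]
    bias = mean(diffs)        # systematic difference between the methods
    spread = stdev(diffs)     # sample SD of the paired differences
    return bias, bias - z * spread, bias + z * spread


def loa_width(lower, upper):
    """Width of the agreement interval, as reported alongside the LoA."""
    return upper - lower


if __name__ == "__main__":
    # Hypothetical knee angles (degrees): app versus universal goniometer.
    app = [22.0, 45.5, 61.0, 90.5, 118.0, 140.0]
    goniometer = [20.0, 47.0, 60.0, 88.0, 120.0, 138.5]
    print(limits_of_agreement(app, goniometer))
    # The widths in the abstract are consistent with its reported limits.
    print(loa_width(-7.5, 10.7), loa_width(-6.6, 7.5))
```

A high correlation alone does not show that two instruments agree, which is why the interval width matters here: a width of 18.2° means individual readings from the two methods may differ by several degrees even when the correlation exceeds 0.95.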
Zijlstra, Agnes; Zijlstra, Wiebren
2013-09-01
Inverted pendulum (IP) models of human walking allow for wearable motion-sensor based estimations of spatio-temporal gait parameters during unconstrained walking in daily-life conditions. At present it is unclear to what extent different IP based estimations yield different results, and reliability and validity have not been investigated in older persons without a specific medical condition. The aim of this study was to compare reliability and validity of four different IP based estimations of mean step length in independent-living older persons. Participants were assessed twice and walked at different speeds while wearing a tri-axial accelerometer at the lower back. For all step-length estimators, test-retest intra-class correlations approached or were above 0.90. Intra-class correlations with reference step length were above 0.92 with a mean error of 0.0 cm when (1) multiplying the estimated center-of-mass displacement during a step by an individual correction factor in a simple IP model, or (2) adding an individual constant for bipedal stance displacement to the estimated displacement during single stance in a 2-phase IP model. When applying generic corrections or constants in all subjects (i.e. multiplication by 1.25, or adding 75% of foot length), correlations were above 0.75 with a mean error of respectively 2.0 and 1.2 cm. Although the results indicate that an individual adjustment of the IP models provides better estimations of mean step length, the ease of a generic adjustment can be favored when merely evaluating intra-individual differences. Further studies should determine the validity of these IP based estimations for assessing gait in daily life. Copyright © 2013 Elsevier B.V. All rights reserved.
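The two generic adjustments described in this abstract can be sketched as below. The geometric relation used for the simple model, step length = 2 × sqrt(2lh − h²) with pendulum (leg) length l and vertical center-of-mass excursion h, is the commonly used inverted pendulum formula and is an assumption here, since the abstract does not state it. The function names and example values are hypothetical.

```python
import math


def ip_step_length(leg_length_m, vertical_displacement_m, correction=1.25):
    """Simple IP model: geometric step length times a correction factor.

    correction=1.25 is the generic multiplier quoted in the abstract; an
    individually calibrated factor can be passed instead.
    """
    l, h = leg_length_m, vertical_displacement_m
    if not 0.0 <= h <= l:
        raise ValueError("vertical displacement must lie between 0 and leg length")
    return correction * 2.0 * math.sqrt(2.0 * l * h - h * h)


def two_phase_step_length(single_stance_displacement_m, foot_length_m,
                          foot_fraction=0.75):
    """2-phase IP model: single-stance displacement plus a double-stance constant.

    foot_fraction=0.75 is the generic constant (75% of foot length) quoted in
    the abstract; an individual constant can be supplied by the caller.
    """
    return single_stance_displacement_m + foot_fraction * foot_length_m


if __name__ == "__main__":
    print(ip_step_length(0.90, 0.03))
    print(two_phase_step_length(0.45, 0.26))
```

Passing an individually calibrated correction factor or constant corresponds to the individual adjustment that gave the lowest mean error in the study; the defaults correspond to the generic adjustment.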
Assessing the Conditional Reliability of State Assessments
ERIC Educational Resources Information Center
May, Henry; Cole, Russell; Haimson, Josh; Perez-Johnson, Irma
2010-01-01
The purpose of this study is to provide empirical benchmarks of the conditional reliabilities of state tests for samples of the student population defined by ability level. Given that many educational interventions are targeted for samples of low performing students, schools, or districts, the primary goal of this research is to determine how…
Development of a reliable method to assess footwear comfort during running.
Mündermann, Anne; Nigg, Benno M; Stefanyshyn, Darren J; Humble, R Neil
2002-08-01
The purposes of this study were: (a) to determine whether subjects are able to distinguish between differences in footwear with respect to footwear comfort; and (b) to determine how reliably footwear comfort can be assessed using a visual analogue scale (VAS) and a protocol including a control condition during running. Intraclass correlation coefficients (ICCs) between comfort ratings for repeated conditions were high (ICC = 0.799). Differences in comfort ratings between the insert conditions were significant. A paired t-test revealed a significant difference in overall comfort ratings for the control insert when tested after the soft insert compared to when tested after the hard insert (P = 0.008). The results of this study showed that VASs provide a reliable measure to assess footwear comfort during running under the conditions that: (a) a control condition is included; and (b) the average comfort rating of sessions 4-6 is used. Copyright 2002 Elsevier Science B.V.
Sensor Selection and Optimization for Health Assessment of Aerospace Systems
NASA Technical Reports Server (NTRS)
Maul, William A.; Kopasakis, George; Santi, Louis M.; Sowers, Thomas S.; Chicatelli, Amy
2007-01-01
Aerospace systems are developed similarly to other large-scale systems through a series of reviews, where designs are modified as system requirements are refined. For space-based systems few are built and placed into service. These research vehicles have limited historical experience to draw from and formidable reliability and safety requirements, due to the remote and severe environment of space. Aeronautical systems have similar reliability and safety requirements, and while these systems may have historical information to access, commercial and military systems require longevity under a range of operational conditions and applied loads. Historically, the design of aerospace systems, particularly the selection of sensors, is based on the requirements for control and performance rather than on health assessment needs. Furthermore, the safety and reliability requirements are met through sensor suite augmentation in an ad hoc, heuristic manner, rather than any systematic approach. A review of the current sensor selection practice within and outside of the aerospace community was conducted and a sensor selection architecture is proposed that will provide a justifiable, dependable sensor suite to address system health assessment requirements.
Sensor Selection and Optimization for Health Assessment of Aerospace Systems
NASA Technical Reports Server (NTRS)
Maul, William A.; Kopasakis, George; Santi, Louis M.; Sowers, Thomas S.; Chicatelli, Amy
2008-01-01
Aerospace systems are developed similarly to other large-scale systems through a series of reviews, where designs are modified as system requirements are refined. For space-based systems, few are built and placed into service; these research vehicles have limited historical experience to draw from and formidable reliability and safety requirements, due to the remote and severe environment of space. Aeronautical systems have similar reliability and safety requirements, and while these systems may have historical information to access, commercial and military systems require longevity under a range of operational conditions and applied loads. Historically, the design of aerospace systems, particularly the selection of sensors, is based on the requirements for control and performance rather than on health assessment needs. Furthermore, the safety and reliability requirements are met through sensor suite augmentation in an ad hoc, heuristic manner, rather than any systematic approach. A review of the current sensor selection practice within and outside of the aerospace community was conducted and a sensor selection architecture is proposed that will provide a justifiable, defendable sensor suite to address system health assessment requirements.
ERIC Educational Resources Information Center
Harlen, Wynne
2005-01-01
This paper summarizes the findings of a systematic review of research on the reliability and validity of teachers' assessment used for summative purposes. In addition to the main question, the review also addressed the question "What conditions affect the reliability and validity of teachers' summative assessment?" The initial search for studies…
Kosse, Nienke M; Caljouw, Simone; Vervoort, Danique; Vuillerme, Nicolas; Lamoth, Claudine J C
2015-08-01
Accelerometer-based assessments can identify elderly people with an increased fall risk and monitor interventions. Smart devices, like the iPod Touch, with built-in accelerometers are promising for clinical gait and posture assessments due to easy use and cost-effectiveness. The aim of the present study was to establish the validity and reliability of the iPod Touch for gait and posture assessment. Sixty healthy participants (aged 18-75 years) were measured with an iPod Touch and stand-alone accelerometer while they walked under single- and dual-task conditions, and while standing in parallel and semi-tandem stance with eyes open, eyes closed and when performing a dual task. Cross-correlation values (CCV) showed high correspondence of anterior-posterior and medio-lateral signal patterns (CCVs ≥ 0.88). Validity of gait parameters (foot contacts, index of harmonicity, and amplitude variability) and standing posture parameters [root mean square of accelerations, median power frequency (MPF) and sway area] as indicated by intra-class correlation (ICC) was high (ICC = 0.85-0.99) and test-retest reliability was good (ICC = 0.81-0.97), except for MPF (ICC = 0.59-0.87). Overall, the iPod Touch obtained valid and reliable measures of gait and postural control in healthy adults of all ages under different conditions. Additionally, smart devices have the potential to be used for clinical gait and posture assessments.
Web-Based Assessment of Mental Well-Being in Early Adolescence: A Reliability Study.
Hamann, Christoph; Schultze-Lutter, Frauke; Tarokh, Leila
2016-06-15
The ever-increasing use of the Internet among adolescents represents an emerging opportunity for researchers to gain access to larger samples, which can be queried over several years longitudinally. Among adolescents, young adolescents (ages 11 to 13 years) are of particular interest to clinicians as this is a transitional stage, during which depressive and anxiety symptoms often emerge. However, it remains unclear whether these youngest adolescents can accurately answer questions about their mental well-being using a Web-based platform. The aim of the study was to examine the accuracy of responses obtained from Web-based questionnaires by comparing Web-based with paper-and-pencil versions of depression and anxiety questionnaires. The primary outcome was the score on the depression and anxiety questionnaires under two conditions: (1) paper-and-pencil and (2) Web-based versions. Twenty-eight adolescents (aged 11-13 years, mean age 12.78 years and SD 0.78; 18 females, 64%) were randomly assigned to complete either the paper-and-pencil or the Web-based questionnaire first. Intraclass correlation coefficients (ICCs) were calculated to measure intrarater reliability. Intraclass correlation coefficients were calculated separately for depression (Children's Depression Inventory, CDI) and anxiety (Spence Children's Anxiety Scale, SCAS) questionnaires. On average, it took participants 17 minutes (SD 6) to answer 116 questions online. Intraclass correlation coefficient analysis revealed high intrarater reliability when comparing Web-based with paper-and-pencil responses for both CDI (ICC=.88; P<.001) and the SCAS (ICC=.95; P<.001). According to published criteria, both of these values are in the "almost perfect" category indicating the highest degree of reliability. The results of the study show an excellent reliability of Web-based assessment in 11- to 13-year-old children as compared with the standard paper-pencil assessment. 
Furthermore, we found that Web-based assessments with young adolescents are highly feasible, with all enrolled participants completing the Web-based form. As early adolescence is a time of remarkable social and behavioral changes, these findings open up new avenues for researchers from diverse fields who are interested in studying large samples of young adolescents over time.
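Several of the records in this listing report intraclass correlation coefficients. As a rough illustration of how such a coefficient can be computed, the sketch below implements one common form, ICC(2,1) (two-way random effects, absolute agreement, single measurement). The abstract does not say which ICC form the authors used, so the choice of ICC(2,1) and the function name `icc_2_1` are assumptions made here for illustration only.

```python
import numpy as np


def icc_2_1(scores):
    """ICC(2,1): two-way random effects, absolute agreement, single measurement.

    scores: array-like of shape (n_subjects, k_conditions), e.g. one column
    for paper-and-pencil scores and one for Web-based scores.
    """
    x = np.asarray(scores, dtype=float)
    n, k = x.shape
    grand = x.mean()

    # Sums of squares from a two-way ANOVA without replication.
    ss_rows = k * np.sum((x.mean(axis=1) - grand) ** 2)  # between subjects
    ss_cols = n * np.sum((x.mean(axis=0) - grand) ** 2)  # between conditions
    ss_err = np.sum((x - grand) ** 2) - ss_rows - ss_cols

    msr = ss_rows / (n - 1)
    msc = ss_cols / (k - 1)
    mse = ss_err / ((n - 1) * (k - 1))

    return (msr - mse) / (msr + (k - 1) * mse + k * (msc - mse) / n)
```

With two columns per participant (one per questionnaire format), a value near 1 would indicate that the two formats give nearly interchangeable scores.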
Quantified Risk Ranking Model for Condition-Based Risk and Reliability Centered Maintenance
NASA Astrophysics Data System (ADS)
Chattopadhyaya, Pradip Kumar; Basu, Sushil Kumar; Majumdar, Manik Chandra
2017-06-01
In the recent past, the risk and reliability centered maintenance (RRCM) framework was introduced, shifting the methodological focus from reliability and probabilities (expected values) to reliability, uncertainty and risk. In this paper, the authors explain a novel methodology for quantifying risk and ranking critical items in order to prioritize maintenance actions on the basis of condition-based risk and reliability centered maintenance (CBRRCM). The critical items are identified through criticality analysis of the RPN values of the items of a system, and the maintenance significant precipitating factors (MSPF) of the items are evaluated. The criticality of risk is assessed using three risk coefficients. The likelihood risk coefficient treats the probability as a fuzzy number. The abstract risk coefficient deduces risk influenced by uncertainty and sensitivity, besides other factors. The third, the hazardous risk coefficient, accounts for anticipated hazards that may occur in the future; its risk is deduced from criteria of consequences for safety, environment, maintenance and economic risks, with the corresponding cost of consequences. The characteristic values of all three risk coefficients are obtained with a particular test. With a few more tests on the system, the values may change significantly within the controlling range of each coefficient; hence 'random number simulation' is used to obtain one distinctive value for each coefficient. The risk coefficients are statistically added to obtain the final risk coefficient of each critical item, and then the final rankings of critical items are estimated. The prioritization in ranking of critical items using the developed mathematical model for risk assessment should be useful in optimizing financial losses and the timing of maintenance actions.
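The first step described above, identifying critical items from their RPN values, can be sketched as follows. This assumes the conventional FMEA definition of the risk priority number (severity × occurrence × detection); the abstract does not spell out its RPN formula, and the paper's three risk coefficients are not reproduced here. The `Item` class, the item names and the ratings are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Item:
    """A maintainable item with hypothetical 1-10 FMEA ratings."""
    name: str
    severity: int
    occurrence: int
    detection: int

    @property
    def rpn(self) -> int:
        # Conventional FMEA risk priority number (assumed, not from the paper).
        return self.severity * self.occurrence * self.detection


def rank_by_rpn(items):
    """Return items ordered from highest to lowest RPN."""
    return sorted(items, key=lambda item: item.rpn, reverse=True)


# Hypothetical example data, invented for illustration only.
items = [
    Item("pump bearing", severity=7, occurrence=4, detection=3),
    Item("control valve", severity=9, occurrence=2, detection=6),
    Item("gearbox seal", severity=5, occurrence=5, detection=2),
]
ranking = [item.name for item in rank_by_rpn(items)]
```

In the method summarized above, such a ranking would only shortlist the critical items; the final ordering comes from the combined risk coefficients.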
Radiation-Tolerance Assessment of a Redundant Wireless Device
NASA Astrophysics Data System (ADS)
Huang, Q.; Jiang, J.
2018-01-01
This paper presents a method to evaluate radiation tolerance without physical tests for a commercial off-the-shelf (COTS)-based monitoring device for high-level radiation fields, such as those found in post-accident conditions in a nuclear power plant (NPP). The paper specifically describes the analysis of the radiation environment in a severe accident, radiation damage in electronics, and the redundant solution used to prolong the life of the system, as well as the evaluation method for radiation protection and the analysis method for system reliability. As a case study, a wireless monitoring device with redundant and diversified channels is evaluated using the developed method. The study results and system assessment data show that, under the given radiation condition, the redundant device is more reliable and more robust than non-redundant devices. The developed redundant wireless monitoring device can therefore be applied in those conditions (up to 10 Mrad (Si)) during a severe accident in an NPP.
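To illustrate why redundancy can raise system reliability, the sketch below uses the textbook formula for channels in parallel, under the assumption that channels fail independently. This is not the evaluation method of the paper, which the abstract does not detail; the function name and the example reliabilities are hypothetical.

```python
from math import prod


def parallel_reliability(channel_reliabilities):
    """Probability that at least one channel survives.

    Assumes channels fail independently; each entry is the probability
    that a single channel is still working at the time of interest.
    """
    return 1.0 - prod(1.0 - r for r in channel_reliabilities)


# Hypothetical numbers: one channel versus two redundant channels.
single = parallel_reliability([0.90])
redundant = parallel_reliability([0.90, 0.90])
```

Under the independence assumption, two channels that each survive with probability 0.90 give a system that survives with probability about 0.99. Diversified channels, as mentioned in the abstract, are one way to make that assumption more plausible.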
Magnan, Morris A; Maklebust, Joann
2008-01-01
To evaluate the effect of Web-based Braden Scale training on the reliability and precision of pressure ulcer risk assessments made by registered nurses (RNs) working in acute care settings. Pretest-posttest, 2-group, quasi-experimental design. Five hundred Braden Scale risk assessments were made on 102 acute care patients deemed to be at various levels of risk for pressure ulceration. Assessments were made by RNs working in acute care hospitals at 3 different medical centers where the Braden Scale was in regular daily use (2 medical centers) or new to the setting (1 medical center). The Braden Scale for Predicting Pressure Sore Risk was used to guide pressure ulcer risk assessments. A Web-based version of the Detroit Medical Center Braden Scale Computerized Training Module was used to teach nurses correct use of the Braden Scale and selection of risk-based pressure ulcer prevention interventions. In the aggregate, RNs generated reliable Braden Scale pressure ulcer risk assessments 65% of the time after training. The effect of Web-based Braden Scale training on reliability and precision of assessments varied according to familiarity with the scale. With training, new users of the scale made reliable assessments 84% of the time and significantly improved the precision of their assessments. The reliability and precision of Braden Scale risk assessments made by its regular users were unaffected by training. Technology-assisted Braden Scale training improved both reliability and precision of risk assessments made by new users of the scale, but had virtually no effect on the reliability or precision of risk assessments made by regular users of the instrument. Further research is needed to determine the best approaches for improving reliability and precision of Braden Scale assessments made by its regular users.
The Comprehensive Care Project: Measuring Physician Performance in Ambulatory Practice
Holmboe, Eric S; Weng, Weifeng; Arnold, Gerald K; Kaplan, Sherrie H; Normand, Sharon-Lise; Greenfield, Sheldon; Hood, Sarah; Lipner, Rebecca S
2010-01-01
Objective To investigate the feasibility, reliability, and validity of comprehensively assessing physician-level performance in ambulatory practice. Data Sources/Study Setting Ambulatory-based general internists in 13 states participated in the assessment. Study Design We assessed physician-level performance, adjusted for patient factors, on 46 individual measures, an overall composite measure, and composite measures for chronic, acute, and preventive care. Between- versus within-physician variation was quantified by intraclass correlation coefficients (ICC). External validity was assessed by correlating performance on a certification exam. Data Collection/Extraction Methods Medical records for 236 physicians were audited for seven chronic and four acute care conditions, and six age- and gender-appropriate preventive services. Principal Findings Performance on the individual and composite measures varied substantially within (range 5–86 percent compliance on 46 measures) and between physicians (ICC range 0.12–0.88). Reliabilities for the composite measures were robust: 0.88 for chronic care and 0.87 for preventive services. Higher certification exam scores were associated with better performance on the overall (r = 0.19; p <.01), chronic care (r = 0.14, p = .04), and preventive services composites (r = 0.17, p = .01). Conclusions Our results suggest that reliable and valid comprehensive assessment of the quality of chronic and preventive care can be achieved by creating composite measures and by sampling feasible numbers of patients for each condition. PMID:20819110
Rong, Hao; Tian, Jin; Zhao, Tingdi
2016-01-01
In traditional approaches to human reliability assessment (HRA), the definition of the error producing conditions (EPCs) and the supporting guidance are such that some conditions (especially organizational or managerial ones) can hardly be included; the analysis is therefore incomplete and does not reflect the temporal trend of human reliability. A method based on system dynamics (SD), which highlights interrelationships among technical and organizational aspects that may contribute to human errors, is presented to facilitate quantitative estimation of the human error probability (HEP) and its related variables as they change over a long period. Taking the Minuteman III missile accident in 2008 as a case, the proposed HRA method is applied to assess HEP during missile operations over 50 years by analyzing the interactions among the variables involved in human-related risks; the critical factors are also determined in terms of the impact that the variables have on risks in different time periods. The results indicate that both technical and organizational aspects should be addressed to minimize human errors in the long run. Copyright © 2015 Elsevier Ltd and The Ergonomics Society. All rights reserved.
Assessment of Technogenic Accident Risk of Industrial Building Structures
NASA Astrophysics Data System (ADS)
Baiburin, D. A.; Baiburin, A. Kh
2017-11-01
A methodology for assessing the risk of an industrial building accident was developed, taking into account the damage caused by collapses at various locations. Before the survey of a facility's technical condition begins, groups of building structures of the same type are selected. The reduction in their load-carrying capacity is then assessed from the strength and stability conditions, taking defects into account. The influence of defects and structural damage on building safety is characterized by the degree of compliance with the standards, expressed as a reliability level. Reliability levels are assigned on the basis of calculations, operating experience, and inspection of a particular type of structure according to formalized rules. The risk of collapse under each scenario is calculated for structures that are capable and incapable of causing a progressive collapse. The technique is illustrated by an analysis of the accident risk at the welding shop “Vysota (Height) 239” of the Chelyabinsk Pipe Rolling Plant.
Validity and Reliability of Baseline Testing in a Standardized Environment.
Higgins, Kathryn L; Caze, Todd; Maerlender, Arthur
2017-08-11
The Immediate Postconcussion Assessment and Cognitive Testing (ImPACT) is a computerized neuropsychological test battery commonly used to determine cognitive recovery from concussion based on comparing post-injury scores to baseline scores. This model is based on the premise that ImPACT baseline test scores are a valid and reliable measure of optimal cognitive function at baseline. Growing evidence suggests that this premise may not be accurate, and a large contributor to invalid and unreliable baseline test scores may be the protocol and environment in which baseline tests are administered. This study examined the effects of a standardized environment and administration protocol on the reliability and performance validity of athletes' baseline test scores on ImPACT by comparing scores obtained in two different group-testing settings. Baseline data from three hundred sixty-one Division 1 cohort-matched collegiate athletes were assessed using a variety of indicators of potential performance invalidity; internal reliability was also examined. Thirty-one to thirty-nine percent of the baseline cases had at least one indicator of low performance validity, but there were no significant differences in validity indicators based on the environment in which the testing was conducted. Internal consistency reliability scores were in the acceptable to good range, with no significant differences between administration conditions. These results suggest that athletes may be reliably performing at levels lower than their best effort would produce. © The Author 2017. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Comprehensive clinical assessment in community setting: applicability of the MDS-HC.
Morris, J N; Fries, B E; Steel, K; Ikegami, N; Bernabei, R; Carpenter, G I; Gilgen, R; Hirdes, J P; Topinková, E
1997-08-01
To describe the results of an international trial of the home care version of the MDS assessment and problem identification system (the MDS-HC), including reliability estimates, a comparison of MDS-HC reliabilities with reliabilities of the same items in the MDS 2.0 nursing home assessment instrument, and an examination of the types of problems found in home care clients using the MDS-HC. Independent, dual assessment of clients of home-care agencies by trained clinicians using a draft of the MDS-HC, with additional descriptive data regarding problem profiles for home care clients. Reliability data from dual assessments of 241 randomly selected clients of home care agencies in five countries, all of whom volunteered to test the MDS-HC. Also included are an expanded sample of 780 home care assessments from these countries and 187 dually assessed residents from 21 nursing homes in the United States. The array of MDS-HC assessment items included measures in the following areas: personal items, cognitive patterns, communication/hearing, vision, mood and behavior, social functioning, informal support services, physical functioning, continence, disease diagnoses, health conditions and preventive health measures, nutrition/hydration, dental status, skin condition, environmental assessment, service utilization, and medications. Forty-seven percent of the functional, health status, social environment, and service items in the MDS-HC were taken from the MDS 2.0 for nursing homes. For this item set, it is estimated that the average weighted Kappa is .74 for the MDS-HC and .75 for the MDS 2.0. Similarly high reliability values were found for items newly introduced in the MDS-HC (weighted Kappa = .70). Descriptive findings also characterize the problems of home care clients, with subanalyses within cognitive performance levels. Findings indicate that the core set of items in the MDS 2.0 works equally well in community and nursing home settings. New items are highly reliable. 
In tandem, these instruments can be used within the international community, assisting and planning care for older adults within a broad spectrum of service settings, including nursing homes and home care programs. With this community-based, second-generation problem and care plan-driven assessment instrument, disability assessment can be performed consistently across the world.
Machine Learning Assessments of Soil Drying
NASA Astrophysics Data System (ADS)
Coopersmith, E. J.; Minsker, B. S.; Wenzel, C.; Gilmore, B. J.
2011-12-01
Agricultural activities require the use of heavy equipment and vehicles on unpaved farmlands. When soil conditions are wet, equipment can cause substantial damage, leaving deep ruts. In extreme cases, implements can sink and become mired, causing considerable delays and expense to extricate the equipment. Farm managers, who are often located remotely, cannot assess sites before allocating equipment, which makes it difficult to assess the conditions of countless sites reliably and frequently. For example, farmers often trace serpentine paths of over one hundred miles each day to assess the overall status of various tracts of land spanning thirty, forty, or fifty miles in each direction. One means of assessing the moisture content of a field lies in the strategic positioning of remotely-monitored in situ sensors. Unfortunately, land owners are often reluctant to place sensors across their properties due to the significant monetary cost and complexity. This work aspires to overcome these limitations by modeling the process of wetting and drying statistically, remotely assessing field readiness using only information that is publicly accessible. Such data include Nexrad radar and state climate network sensors, as well as Twitter-based reports of field conditions for validation. Three algorithms (classification trees, k-nearest-neighbors, and boosted perceptrons) are deployed to deliver statistical field readiness assessments of an agricultural site located in Urbana, IL. Two of the three algorithms performed with 92-94% accuracy, with the majority of misclassifications falling within the calculated margins of error. This demonstrates the feasibility of using a machine learning framework with only public data, knowledge of system memory from previous conditions, and statistical tools to assess "readiness" without the need for real-time, on-site physical observation. 
Future efforts will produce a workflow assimilating Nexrad, climate network, and Twitter data to generate a real-time web-map of estimated readiness conditions.
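Of the three algorithms named above, k-nearest-neighbors is the simplest to sketch. The example below is a minimal illustration of the technique only; it is not the authors' model. The two features (recent rainfall and days since rain), the labels and all data values are hypothetical and were invented for this sketch.

```python
from collections import Counter

import numpy as np


def knn_predict(train_X, train_y, query, k=3):
    """Classify `query` by majority vote among its k nearest training points."""
    train_X = np.asarray(train_X, dtype=float)
    # Euclidean distance from the query to every training sample.
    dists = np.linalg.norm(train_X - np.asarray(query, dtype=float), axis=1)
    nearest = np.argsort(dists)[:k]
    votes = Counter(train_y[i] for i in nearest)
    return votes.most_common(1)[0][0]


# Hypothetical features: [rainfall over the last 3 days (mm), days since rain].
X = [[0, 6], [1, 5], [3, 4], [25, 0], [30, 1], [18, 1]]
y = ["ready", "ready", "ready", "not ready", "not ready", "not ready"]

dry_field = knn_predict(X, y, [2, 5])
wet_field = knn_predict(X, y, [22, 1])
```

In practice the features would need scaling so that no single variable dominates the distance, and k would be chosen by validation against field reports.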
Using Facility Condition Assessments to Identify Actions Related to Infrastructure
NASA Technical Reports Server (NTRS)
Rubert, Kennedy F.
2010-01-01
To support cost effective, quality research it is essential that laboratory and testing facilities are maintained in a continuous and reliable state of availability at all times. NASA Langley Research Center (LaRC) and its maintenance contractor, Jacobs Technology, Inc. Research Operations, Maintenance, and Engineering (ROME) group, are in the process of implementing a combined Facility Condition Assessment (FCA) and Reliability Centered Maintenance (RCM) program to improve asset management and overall reliability of testing equipment in facilities such as wind tunnels. Specific areas are being identified for improvement, the deferred maintenance cost is being estimated, and priority is being assigned against facilities where conditions have been allowed to deteriorate. This assessment serves to assist in determining where to commit available funds on the Center. RCM methodologies are being reviewed and enhanced to assure that appropriate preventive, predictive, and facilities/equipment acceptance techniques are incorporated to prolong lifecycle availability and assure reliability at minimum cost. The results from the program have been favorable, better enabling LaRC to manage assets prudently.
Manor, Brad; Yu, Wanting; Zhu, Hao; Harrison, Rachel; Lo, On-Yee; Lipsitz, Lewis; Travison, Thomas; Pascual-Leone, Alvaro; Zhou, Junhong
2018-01-30
Walking is a complex cognitive motor task that is commonly completed while performing another task such as talking or making decisions. Gait assessments performed under normal and "dual-task" walking conditions thus provide important insights into health. Such assessments, however, are limited primarily to laboratory-based settings. The objective of our study was to create and test a smartphone-based assessment of normal and dual-task walking for use in nonlaboratory settings. We created an iPhone app that used the phone's motion sensors to record movements during walking under normal conditions and while performing a serial-subtraction dual task, with the phone placed in the user's pants pocket. The app provided the user with multimedia instructions before and during the assessment. Acquired data were automatically uploaded to a cloud-based server for offline analyses. A total of 14 healthy adults completed 2 laboratory visits separated by 1 week. On each visit, they used the app to complete three 45-second trials each of normal and dual-task walking. Kinematic data were collected with the app and a gold-standard instrumented GAITRite mat. Participants also used the app to complete normal and dual-task walking trials within their homes on 3 separate days. Within laboratory-based trials, GAITRite-derived heel strikes and toe-offs of the phone-side leg aligned with smartphone acceleration extrema, following filtering and rotation to the earth coordinate system. We derived stride times, a clinically meaningful metric of locomotor control, from GAITRite and app data, for all strides occurring over the GAITRite mat. We calculated stride times and the dual-task cost to the average stride time (ie, percentage change from normal to dual-task conditions) from both measurement devices. We calculated similar metrics from home-based app data. For these trials, periods of potential turning were identified via custom-developed algorithms and omitted from stride-time analyses. 
Across all detected strides in the laboratory, stride times derived from the app and GAITRite mat were highly correlated (P<.001, r²=.98). These correlations were independent of walking condition and pocket tightness. App- and GAITRite-derived stride-time dual-task costs were also highly correlated (P<.001, r²=.95). The error of app-derived stride times (mean 16.9, SD 9.0 ms) was unaffected by the magnitude of stride time, walking condition, or pocket tightness. For both normal and dual-task trials, average stride times derived from app walking trials demonstrated excellent test-retest reliability within and between both laboratory and home-based assessments (intraclass correlation coefficient range .82-.94). The iPhone app we created enabled valid and reliable assessment of stride timing, with the smartphone in the pocket, during both normal and dual-task walking and within both laboratory and nonlaboratory environments. Additional work is warranted to expand the functionality of this tool to older adults and other patient populations. ©Brad Manor, Wanting Yu, Hao Zhu, Rachel Harrison, On-Yee Lo, Lewis Lipsitz, Thomas Travison, Alvaro Pascual-Leone, Junhong Zhou. Originally published in JMIR Mhealth and Uhealth (http://mhealth.jmir.org), 30.01.2018.
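The two quantities at the centre of this abstract can be sketched in a few lines: stride time as the interval between successive heel strikes of the same leg, and dual-task cost as the percentage change in mean stride time from normal to dual-task walking. The sketch assumes heel-strike timestamps are already available; the app's filtering, coordinate rotation and turn detection are not reproduced, and the function names and sample timestamps are hypothetical.

```python
def stride_times(heel_strikes_s):
    """Intervals (s) between successive heel strikes of the same leg."""
    return [later - earlier
            for earlier, later in zip(heel_strikes_s, heel_strikes_s[1:])]


def mean(values):
    return sum(values) / len(values)


def dual_task_cost(normal_mean, dual_mean):
    """Percentage change in mean stride time from normal to dual-task walking."""
    return 100.0 * (dual_mean - normal_mean) / normal_mean


# Hypothetical heel-strike timestamps in seconds, invented for illustration.
normal_strides = stride_times([0.0, 1.0, 2.0, 3.0])
dual_strides = stride_times([0.0, 1.1, 2.2, 3.3])
cost = dual_task_cost(mean(normal_strides), mean(dual_strides))
```

A positive cost means strides took longer while the participant performed the secondary task.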
Yu, Wanting; Zhu, Hao; Harrison, Rachel; Lo, On-Yee; Lipsitz, Lewis; Travison, Thomas; Pascual-Leone, Alvaro; Zhou, Junhong
2018-01-01
Background Walking is a complex cognitive motor task that is commonly completed while performing another task such as talking or making decisions. Gait assessments performed under normal and “dual-task” walking conditions thus provide important insights into health. Such assessments, however, are limited primarily to laboratory-based settings. Objective The objective of our study was to create and test a smartphone-based assessment of normal and dual-task walking for use in nonlaboratory settings. Methods We created an iPhone app that used the phone’s motion sensors to record movements during walking under normal conditions and while performing a serial-subtraction dual task, with the phone placed in the user’s pants pocket. The app provided the user with multimedia instructions before and during the assessment. Acquired data were automatically uploaded to a cloud-based server for offline analyses. A total of 14 healthy adults completed 2 laboratory visits separated by 1 week. On each visit, they used the app to complete three 45-second trials each of normal and dual-task walking. Kinematic data were collected with the app and a gold-standard–instrumented GAITRite mat. Participants also used the app to complete normal and dual-task walking trials within their homes on 3 separate days. Within laboratory-based trials, GAITRite-derived heel strikes and toe-offs of the phone-side leg aligned with smartphone acceleration extrema, following filtering and rotation to the earth coordinate system. We derived stride times—a clinically meaningful metric of locomotor control—from GAITRite and app data, for all strides occurring over the GAITRite mat. We calculated stride times and the dual-task cost to the average stride time (ie, percentage change from normal to dual-task conditions) from both measurement devices. We calculated similar metrics from home-based app data. 
For these trials, periods of potential turning were identified via custom-developed algorithms and omitted from stride-time analyses. Results Across all detected strides in the laboratory, stride times derived from the app and GAITRite mat were highly correlated (P<.001, r²=.98). These correlations were independent of walking condition and pocket tightness. App- and GAITRite-derived stride-time dual-task costs were also highly correlated (P<.001, r²=.95). The error of app-derived stride times (mean 16.9, SD 9.0 ms) was unaffected by the magnitude of stride time, walking condition, or pocket tightness. For both normal and dual-task trials, average stride times derived from app walking trials demonstrated excellent test-retest reliability within and between both laboratory and home-based assessments (intraclass correlation coefficient range .82-.94). Conclusions The iPhone app we created enabled valid and reliable assessment of stride timing, with the smartphone in the pocket, during both normal and dual-task walking and within both laboratory and nonlaboratory environments. Additional work is warranted to expand the functionality of this tool to older adults and other patient populations. PMID:29382625
Heineman, Kirsten R; Bos, Arend F; Hadders-Algra, Mijna
2008-04-01
A reliable and valid instrument to assess neuromotor condition in infancy is a prerequisite for early detection of developmental motor disorders. We developed a video-based assessment of motor behaviour, the Infant Motor Profile (IMP), to evaluate motor abilities, movement variability, ability to select motor strategies, movement symmetry, and fluency. The IMP consists of 80 items and is applicable in children from 3 to 18 months. The present study aimed to test intra- and interobserver reliability and concurrent validity of the IMP with the Alberta Infant Motor Scale (AIMS) and Touwen neurological examination. The study group consisted of 40 low-risk term (median gestational age [GA] 40 wks, range 38-42 wks) and 40 high-risk preterm infants (median GA 29.6 wks, range 26-33 wks) with corrected ages 4 to 18 months (31 females, 49 males). Intra- and interobserver agreement of the IMP were satisfactory (Spearman's rho=0.9). Concurrent validity of IMP and AIMS was good (Spearman's rho=0.8, p<0.005). The IMP was able to differentiate between infants with normal neurological condition, simple minor neurological dysfunction (MND), complex MND, and abnormal neurological condition (p<0.005). This means that the IMP may be a promising tool to evaluate neurological integrity during infancy, a suggestion that needs confirmation by means of assessment of larger groups of infants with heterogeneous neurological conditions.
Beehler, Sarah; Ahern, Jennifer; Balmer, Brandi; Kuhlman, Jennifer
2017-01-01
This pilot study evaluated the validity and reliability of an Experience of Neighborhood (EON) measure developed to assess neighborhood characteristics that shape reintegration opportunities for returning service members and their families. A total of 91 post-9/11 veterans and spouses completed a survey administered at the Minnesota State Fair. Participants self-reported on their reintegration status (veterans), social functioning (spouses), social support, and mental health. EON factor structure, internal consistency reliability, and validity (discriminant, content, criterion) were analyzed. The EON measure showed adequate reliability, discriminant validity, and content validity. More work is needed to assess criterion validity because EON scores were not correlated with scores on a Census-based index used to measure quality of military neighborhoods. The EON may be useful in assessing broad local factors influencing health among returning veterans and spouses. More research is needed to understand geographic variation in neighborhood conditions and how those affect reintegration and mental health for military families. PMID:28936370
Monticone, Marco; Ambrosini, Emilia; Fiorentini, Roberta; Rocca, Barbara; Liquori, Valentina; Pedrocchi, Alessandra; Ferrante, Simona
2014-09-01
To evaluate the reliability and minimum detectable change (MDC) of spatial-temporal gait parameters in subjects with multiple sclerosis (MS) during dual tasking. This cross-sectional study involved 25 healthy subjects (mean age 49.9 ± 15.8 years) and 25 people with MS (mean age 49.2 ± 11.5 years). Gait under motor-cognitive and motor-motor dual tasking conditions was evaluated in two sessions separated by a one-day interval using the GAITRite Walkway System. Test-retest reliability was assessed using intraclass correlation coefficients (ICCs), standard errors of measurement (SEM), and coefficients of variation (CV). MDC scores were computed for the velocity, cadence, step and stride length, step and stride time, double support time, the % of gait cycle for single support and stance phase, and base of support. All of the gait parameters reported good to excellent ICCs under both conditions, with healthy subject values of >0.69 and MS subject values of >0.84. SEM values were always below 18% for both groups of subjects. The gait patterns of the people with MS were slightly more variable than those of the normal controls (CVs: 5.88-41.53% vs 2.84-30.48%). The assessment of quantitative gait parameters in healthy subjects and people with MS is highly reliable under both of the investigated dual tasking conditions. Copyright © 2014 Elsevier B.V. All rights reserved.
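The abstract reports ICC, SEM and MDC values without stating how they were derived. A commonly used convention, assumed here and not confirmed by the source, is SEM = SD × sqrt(1 − ICC) and MDC at the 95% level = 1.96 × sqrt(2) × SEM. The sketch below applies that convention; the function names and the numeric inputs are hypothetical.

```python
from math import sqrt


def standard_error_of_measurement(sd, icc):
    """SEM = SD * sqrt(1 - ICC), a common convention (assumed here)."""
    return sd * sqrt(1.0 - icc)


def minimum_detectable_change(sem, z=1.96):
    """MDC at the confidence level implied by z (1.96 for 95%).

    The sqrt(2) factor accounts for measurement error in both sessions.
    """
    return z * sqrt(2.0) * sem


# Hypothetical inputs: between-subject SD of 10 cm/s for gait velocity
# and a test-retest ICC of 0.84.
sem = standard_error_of_measurement(sd=10.0, icc=0.84)
mdc95 = minimum_detectable_change(sem)
```

Under these assumptions, a change smaller than `mdc95` between two sessions could not be distinguished from measurement error.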
ERIC Educational Resources Information Center
Scanlan, Aaron T.; Richter-Stretton, Gina L.; Madueno, Maria C.; Borges, Nattai R.; Fenning, Andrew S.
2017-01-01
Measurement of plasma osmolality (Posm) remains popular for assessing hydration status in exercise science. However, a controlled reliability assessment of micro-osmometry using small sample volumes to measure Posm remains to be performed. This study aimed to examine the reliability of a cryoscopic micro-osmometer requiring 15-µL…
The development of a screening tool to evaluate gross motor function in HIV-infected infants.
Hilburn, Nicole; Potterton, Joanne; Stewart, Aimee; Becker, Piet
2011-12-01
Neurodevelopmental delay or HIV encephalopathy is a stage four disease indicator for paediatric HIV/AIDS according to the World Health Organisation (WHO), and may be used as a criterion for initiation of highly active antiretroviral therapy (HAART). To date, the only means of prevention of this condition is early initiation of HAART. Studies which have been carried out in South African clinics have revealed the high prevalence of this condition. In developing countries, commencement of HAART is based on declining virologic and immunologic status, as standardised neurodevelopmental assessment tools are not widely available. A standardised developmental screening tool which is suitable for use in a developing country is therefore necessary in order to screen for neurodevelopmental delay to allow for further assessment and referral to rehabilitation services, as well as providing an additional assessment criterion for initiation of HAART. The infant gross motor screening test (IGMST) was developed for this purpose. The standardisation sample of the IGMST consisted of 112 HIV-infected infants between six and 18 months of age. Item selection for the IGMST was based on the Gross Motor scale of the Bayley Scales of Infant Development (BSID)-III. Content validity was assessed by a panel of experts using a nominal group technique (NGT; agreement >80%). Concurrent validity (n=60) of the IGMST was carried out against the BSID-III, and agreement was excellent (K=0.85). The diagnostic properties of the IGMST were evaluated and revealed: sensitivity 97.4%, specificity 85.7%, positive predictive value (PPV) 92.7%, and negative predictive value (NPV) 94.7%. Reliability testing (n=30) revealed inter-rater reliability as: r=1, test-retest reliability: r=0.98 and intra-rater reliability: r=0.98. The results indicate that the statistical properties of the IGMST are excellent, and the tool is suitable for use within the paediatric HIV setting.
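The diagnostic properties reported above all follow from a 2x2 table of screening result against reference standard. A minimal Python sketch of that arithmetic is given here. The cell counts are not reported in the abstract; they are an assumption, chosen only because they sum to the concurrent-validity sample (n=60) and are consistent with the reported percentages.

```python
def diagnostic_properties(tp, fp, fn, tn):
    """Return sensitivity, specificity, PPV and NPV from 2x2 counts."""
    return {
        "sensitivity": tp / (tp + fn),  # true positives among all with the condition
        "specificity": tn / (tn + fp),  # true negatives among all without it
        "ppv": tp / (tp + fp),          # positive screens that are correct
        "npv": tn / (tn + fn),          # negative screens that are correct
    }


# Hypothetical counts (assumption, not taken from the abstract); they sum to 60.
props = diagnostic_properties(tp=38, fp=3, fn=1, tn=18)
for name, value in props.items():
    print(f"{name}: {value:.1%}")
```

With these assumed counts the four values round to 97.4%, 85.7%, 92.7% and 94.7%, matching the figures in the abstract. Other counts could give similar percentages.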
Validation of new psychosocial factors questionnaires: a Colombian national study.
Villalobos, Gloria H; Vargas, Angélica M; Rondón, Martin A; Felknor, Sarah A
2013-01-01
The study of workers' health problems possibly associated with stressful conditions requires valid and reliable tools for monitoring risk factors. The present study validates two questionnaires to assess psychosocial risk factors for stress-related illnesses within a sample of Colombian workers. The validation process was based on a representative sample survey of 2,360 Colombian employees, aged 18-70 years. Worker response rate was 90%; 46% of the responders were women. Internal consistency was calculated, construct validity was tested with factor analysis and concurrent validity was tested with Spearman correlations. The questionnaires demonstrated adequate reliability (0.88-0.95). Factor analysis confirmed the dimensions proposed in the measurement model. Concurrent validity resulted in significant correlations with stress and health symptoms. "Work and Non-work Psychosocial Factors Questionnaires" were found to be valid and reliable for the assessment of workers' psychosocial factors, and they provide information for research and intervention. Copyright © 2012 Wiley Periodicals, Inc.
ERIC Educational Resources Information Center
Ghazali, Nor Hasnida Md
2016-01-01
A valid, reliable and practical instrument is needed to evaluate the implementation of the school-based assessment (SBA) system. The aim of this study is to develop and assess the validity and reliability of an instrument to measure the perception of teachers towards the SBA implementation in schools. The instrument is developed based on a…
Field Demonstration of Condition Assessment Technologies for Wastewater Collection Systems
Reliable information on pipe condition is needed to accurately estimate the remaining service life of wastewater collection system assets. Although inspections with conventional closed-circuit television (CCTV) have been the mainstay of pipeline condition assessment for decades,...
ERIC Educational Resources Information Center
Chang, Chi-Cheng; Wu, Bing-Hong
2012-01-01
This study explored the reliability and validity of teacher assessment under a Web-based portfolio assessment environment (or Web-based teacher portfolio assessment). Participants were 72 eleventh graders taking the "Computer Application" course. The students perform portfolio creation, inspection, self- and peer-assessment using the Web-based…
Reliability analysis of a sensitive and independent stabilometry parameter set
Nagymáté, Gergely; Orlovits, Zsanett
2018-01-01
Recent studies have suggested reduced independent and sensitive parameter sets for stabilometry measurements based on correlation and variance analyses. However, the reliability of these recommended parameter sets has not been studied in the literature or not in every stance type used in stabilometry assessments, for example, single leg stances. The goal of this study is to evaluate the test-retest reliability of different time-based and frequency-based parameters that are calculated from the center of pressure (CoP) during bipedal and single leg stance for 30- and 60-second measurement intervals. Thirty healthy subjects performed repeated standing trials in a bipedal stance with eyes open and eyes closed conditions and in a single leg stance with eyes open for 60 seconds. A force distribution measuring plate was used to record the CoP. The reliability of the CoP parameters was characterized by using the intraclass correlation coefficient (ICC), standard error of measurement (SEM), minimal detectable change (MDC), coefficient of variation (CV) and CV compliance rate (CVCR). Based on the ICC, SEM and MDC results, many parameters yielded fair to good reliability values, while the CoP path length yielded the highest reliability (smallest ICC > 0.67 (0.54–0.79), largest SEM% = 19.2%). Usually, frequency type parameters and extreme value parameters yielded poor reliability values. There were differences in the reliability of the maximum CoP velocity (better with 30 seconds) and mean power frequency (better with 60 seconds) parameters between the different sampling intervals. PMID:29664938
Reliability analysis of a sensitive and independent stabilometry parameter set.
Nagymáté, Gergely; Orlovits, Zsanett; Kiss, Rita M
2018-01-01
Recent studies have suggested reduced independent and sensitive parameter sets for stabilometry measurements based on correlation and variance analyses. However, the reliability of these recommended parameter sets has not been studied in the literature or not in every stance type used in stabilometry assessments, for example, single leg stances. The goal of this study is to evaluate the test-retest reliability of different time-based and frequency-based parameters that are calculated from the center of pressure (CoP) during bipedal and single leg stance for 30- and 60-second measurement intervals. Thirty healthy subjects performed repeated standing trials in a bipedal stance with eyes open and eyes closed conditions and in a single leg stance with eyes open for 60 seconds. A force distribution measuring plate was used to record the CoP. The reliability of the CoP parameters was characterized by using the intraclass correlation coefficient (ICC), standard error of measurement (SEM), minimal detectable change (MDC), coefficient of variation (CV) and CV compliance rate (CVCR). Based on the ICC, SEM and MDC results, many parameters yielded fair to good reliability values, while the CoP path length yielded the highest reliability (smallest ICC > 0.67 (0.54-0.79), largest SEM% = 19.2%). Usually, frequency type parameters and extreme value parameters yielded poor reliability values. There were differences in the reliability of the maximum CoP velocity (better with 30 seconds) and mean power frequency (better with 60 seconds) parameters between the different sampling intervals.
Mehta, Saurabh P; George, Hannah R; Goering, Christian A; Shafer, Danielle R; Koester, Alan; Novotny, Steven
2017-11-01
Clinical measurement study. The push-off test (POT) was recently conceived and found to be reliable and valid for assessing weight bearing through an injured wrist or elbow. However, further research with a larger sample can lend credence to the preliminary findings supporting the use of the POT. This study examined the interrater reliability, construct validity, and measurement error for the POT in patients with wrist conditions. Participants with musculoskeletal (MSK) wrist conditions were recruited. Performance on the POT, grip strength, and isometric strength of the wrist extensors were assessed. The shortened version of the Disabilities of the Arm, Shoulder and Hand and a numeric pain rating scale were completed. The intraclass correlation coefficient assessed interrater reliability of the POT. Pearson correlation coefficients (r) examined the concurrent relationships between the POT and other measures. The standard error of measurement and the minimal detectable change at the 90% confidence level were assessed as measurement error and index of true change for the POT. A total of 50 participants with different elbow or wrist conditions (age: 48.1 ± 16.6 years) were included in this study. The results of this study strongly supported the interrater reliability (intraclass correlation coefficient: 0.96 and 0.93 for the affected and unaffected sides, respectively) of the POT in patients with wrist MSK conditions. The POT showed convergent relationships with the grip strength on the injured side (r = 0.89) and the wrist extensor strength (r = 0.7). The POT showed a small standard error of measurement (1.9 kg). The minimal detectable change at the 90% confidence level for the POT was 4.4 kg for the sample. This study provides additional evidence to support the reliability and validity of the POT. This is the first study that provides the values for the measurement error and true change on the POT scores in patients with wrist MSK conditions. 
Further research should examine the responsiveness and discriminant validity of the POT in patients with wrist conditions. Copyright © 2017 Hanley & Belfus. Published by Elsevier Inc. All rights reserved.
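The standard error of measurement (SEM) and minimal detectable change (MDC) reported above are linked by a simple formula. The sketch below uses the conventional definitions, SEM = SD x sqrt(1 - ICC) and MDC = z x sqrt(2) x SEM. The abstract does not state which formulas the authors applied, so this is an assumption. The function names are illustrative.

```python
import math


def sem_from_icc(sd, icc):
    """Standard error of measurement from between-subject SD and ICC."""
    return sd * math.sqrt(1.0 - icc)


def mdc(sem, z=1.645):
    """Minimal detectable change; z = 1.645 corresponds to the 90% level."""
    return z * math.sqrt(2.0) * sem


# SEM of 1.9 kg as reported in the abstract above.
print(round(mdc(1.9), 1))  # -> 4.4
```

Under these conventional definitions, the reported SEM of 1.9 kg gives an MDC90 of about 4.4 kg, which agrees with the value in the abstract.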
Parts and Components Reliability Assessment: A Cost Effective Approach
NASA Technical Reports Server (NTRS)
Lee, Lydia
2009-01-01
System reliability assessment is a methodology which incorporates reliability analyses performed at parts and components level such as Reliability Prediction, Failure Modes and Effects Analysis (FMEA) and Fault Tree Analysis (FTA) to assess risks, perform design tradeoffs, and therefore, to ensure effective productivity and/or mission success. The system reliability is used to optimize the product design to accommodate today's mandated budget, manpower, and schedule constraints. Standard-based reliability assessment is an effective approach consisting of reliability predictions together with other reliability analyses for electronic, electrical, and electro-mechanical (EEE) complex parts and components of large systems based on failure rate estimates published by the United States (U.S.) military or commercial standards and handbooks. Many of these standards are globally accepted and recognized. The reliability assessment is especially useful during the initial stages when the system design is still in development and hard failure data is not yet available or manufacturers are not contractually obliged by their customers to publish the reliability estimates/predictions for their parts and components. This paper presents a methodology to assess system reliability using parts and components reliability estimates to ensure effective productivity and/or mission success efficiently, at low cost, and on a tight schedule.
Larson, Tomas; Kerekes, Nóra; Selinus, Eva Norén; Lichtenstein, Paul; Gumpert, Clara Hellner; Anckarsäter, Henrik; Nilsson, Thomas; Lundström, Sebastian
2014-02-01
The Autism-Tics, AD/HD, and other Comorbidities (A-TAC) inventory is used in epidemiological research to assess neurodevelopmental problems and coexisting conditions. Although the A-TAC has been applied in various populations, data on retest reliability are limited. The objective of the present study was to present additional reliability data. The A-TAC was administered by lay assessors and was completed on two occasions by parents of 400 individual twins, with an average interval of 70 days between test sessions. Intra- and inter-rater reliability were analysed with intraclass correlations and Cohen's kappa. A-TAC showed excellent test-retest intraclass correlations for both autism spectrum disorder and attention deficit hyperactivity disorder (each at .84). Most modules in the A-TAC had intra- and inter-rater reliability intraclass correlation coefficients of ≥ .60. Cohen's kappa indicated acceptable reliability. The current study provides statistical evidence that the A-TAC yields good test-retest reliability in a population-based cohort of children.
Review of Reliability-Based Design Optimization Approach and Its Integration with Bayesian Method
NASA Astrophysics Data System (ADS)
Zhang, Xiangnan
2018-03-01
Practical engineering involves many uncertain factors, such as the external load environment, material properties, geometric shape, initial conditions, and boundary conditions. Reliability methods measure structural safety and determine the optimal combination of design parameters based on probability theory. Reliability-based design optimization (RBDO), which combines reliability theory with optimization, is the most commonly used approach for minimizing structural cost or another performance measure under uncertain variables. However, it cannot handle various kinds of incomplete information. The Bayesian approach is used to incorporate such incomplete information into the uncertainty quantification. In this paper, the RBDO approach and its integration with the Bayesian method are introduced.
Reliability-Based Life Assessment of Stirling Convertor Heater Head
NASA Technical Reports Server (NTRS)
Shah, Ashwin R.; Halford, Gary R.; Korovaichuk, Igor
2004-01-01
Onboard radioisotope power systems being developed and planned for NASA's deep-space missions require reliable design lifetimes of up to 14 yr. The structurally critical heater head of the high-efficiency Stirling power convertor has undergone extensive computational analysis of operating temperatures, stresses, and creep resistance of the thin-walled Inconel 718 bill of material. A preliminary assessment of the effect of uncertainties in the material behavior was also performed. Creep failure resistance of the thin-walled heater head could show variation due to small deviations in the manufactured thickness and in uncertainties in operating temperature and pressure. Durability prediction and reliability of the heater head are affected by these deviations from nominal design conditions. Therefore, it is important to include the effects of these uncertainties in predicting the probability of survival of the heater head under mission loads. Furthermore, it may be possible for the heater head to experience rare incidences of small temperature excursions of short duration. These rare incidences would affect the creep strain rate and, therefore, the life. This paper addresses the effects of such rare incidences on the reliability. In addition, the sensitivities of variables affecting the reliability are quantified, and guidelines developed to improve the reliability are outlined. Heater head reliability is being quantified with data from NASA Glenn Research Center's accelerated benchmark testing program.
A Passive System Reliability Analysis for a Station Blackout
DOE Office of Scientific and Technical Information (OSTI.GOV)
Brunett, Acacia; Bucknor, Matthew; Grabaskas, David
2015-05-03
The latest iterations of advanced reactor designs have included increased reliance on passive safety systems to maintain plant integrity during unplanned sequences. While these systems are advantageous in reducing the reliance on human intervention and availability of power, the phenomenological foundations on which these systems are built require a novel approach to a reliability assessment. Passive systems possess the unique ability to fail functionally without failing physically, a result of their explicit dependency on existing boundary conditions that drive their operating mode and capacity. Argonne National Laboratory is performing ongoing analyses that demonstrate various methodologies for the characterization of passive system reliability within a probabilistic framework. Two reliability analysis techniques are utilized in this work. The first approach, the Reliability Method for Passive Systems, provides a mechanistic technique employing deterministic models and conventional static event trees. The second approach, a simulation-based technique, utilizes discrete dynamic event trees to treat time-dependent phenomena during scenario evolution. For this demonstration analysis, both reliability assessment techniques are used to analyze an extended station blackout in a pool-type sodium fast reactor (SFR) coupled with a reactor cavity cooling system (RCCS). This work demonstrates the entire process of a passive system reliability analysis, including identification of important parameters and failure metrics, treatment of uncertainties and analysis of results.
Towards early software reliability prediction for computer forensic tools (case study).
Abu Talib, Manar
2016-01-01
Versatility, flexibility and robustness are essential requirements for software forensic tools. Researchers and practitioners need to put more effort into assessing this type of tool. A Markov model is a robust means for analyzing and anticipating the functioning of an advanced component based system. It is used, for instance, to analyze the reliability of the state machines of real time reactive systems. This research extends the architecture-based software reliability prediction model for computer forensic tools, which is based on Markov chains and COSMIC-FFP. Basically, every part of the computer forensic tool is linked to a discrete time Markov chain. If this can be done, then a probabilistic analysis by Markov chains can be performed to analyze the reliability of the components and of the whole tool. The purposes of the proposed reliability assessment method are to evaluate the tool's reliability in the early phases of its development, to improve the reliability assessment process for large computer forensic tools over time, and to compare alternative tool designs. The reliability analysis can assist designers in choosing the most reliable topology for the components, which can maximize the reliability of the tool and meet the expected reliability level specified by the end-user. The approach of assessing component-based tool reliability in the COSMIC-FFP context is illustrated with the Forensic Toolkit Imager case study.
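The architecture-based idea in the abstract above, in which each component is linked to a state of a discrete-time Markov chain, can be illustrated with a small sketch. It assumes a Cheung-style formulation: control passes from component i to component j with probability P[i][j], component i executes correctly with probability R[i], and execution starts at component 0. The abstract does not confirm this exact formulation, and the component layout and numbers below are hypothetical.

```python
def system_reliability(P, R, terminal, max_iter=10_000, tol=1e-12):
    """Probability that a run starting at component 0 completes correctly.

    x[i] is the probability of successful completion given that control
    has just entered component i. It satisfies
        x[terminal] = R[terminal]
        x[i]        = R[i] * sum_j P[i][j] * x[j]   otherwise,
    which is solved here by fixed-point iteration.
    """
    n = len(R)
    x = [0.0] * n
    for _ in range(max_iter):
        new = [
            R[i] if i == terminal
            else R[i] * sum(P[i][j] * x[j] for j in range(n))
            for i in range(n)
        ]
        converged = max(abs(a - b) for a, b in zip(new, x)) < tol
        x = new
        if converged:
            break
    return x[0]


# Hypothetical three-component tool: 0 -> 1 -> 2, with a 20% chance that
# component 1 hands control back to component 0.
R = [0.99, 0.98, 0.97]
P_sequential = [[0, 1, 0], [0, 0, 1], [0, 0, 0]]
P_with_loop = [[0, 1, 0], [0.2, 0, 0.8], [0, 0, 0]]

print(system_reliability(P_sequential, R, terminal=2))
print(system_reliability(P_with_loop, R, terminal=2))
```

For the purely sequential layout the result reduces to the product of the component reliabilities. The loop lowers it because components 0 and 1 may execute more than once. Comparing such layouts is one way to read the abstract's point about choosing the most reliable topology.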
Golden angle based scanning for robust corneal topography with OCT
Wagner, Joerg; Goldblum, David; Cattin, Philippe C.
2017-01-01
Corneal topography allows the assessment of the cornea’s refractive power which is crucial for diagnostics and surgical planning. The use of optical coherence tomography (OCT) for corneal topography is still limited. One limitation is the susceptibility to disturbances like blinking of the eye. This can result in partially corrupted scans that cannot be evaluated using common methods. We present a new scanning method for reliable corneal topography from partial scans. Based on the golden angle, the method features a balanced scan point distribution which refines over measurement time and remains balanced when part of the scan is removed. The performance of the method is assessed numerically and by measurements of test surfaces. The results confirm that the method enables numerically well-conditioned and reliable corneal topography from partially corrupted scans and reduces the need for repeated measurements in case of abrupt disturbances. PMID:28270961
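The property claimed above, a scan distribution that stays balanced when part of the scan is removed, can be illustrated with a short sketch. The abstract does not describe the actual scan pattern, so the sketch assumes radial meridians whose orientations advance by the golden-angle increment for half-circle orientations (pi divided by the golden ratio, about 111.25 degrees). It compares them with meridians acquired in plain sequential order. All names are illustrative.

```python
import math

PHI = (1.0 + math.sqrt(5.0)) / 2.0
STEP = math.pi / PHI  # golden-angle increment for orientations in [0, pi)


def golden_meridians(n):
    """Orientations of n meridians acquired in golden-angle order."""
    return [(k * STEP) % math.pi for k in range(n)]


def sequential_meridians(n):
    """Orientations of n equally spaced meridians acquired in order."""
    return [k * math.pi / n for k in range(n)]


def largest_gap(angles):
    """Largest angular gap between neighbouring orientations (with wrap)."""
    s = sorted(angles)
    gaps = [b - a for a, b in zip(s, s[1:])]
    gaps.append(s[0] + math.pi - s[-1])  # gap across the 0/pi boundary
    return max(gaps)


def drop_block(angles, start, length):
    """Remove a contiguous block of acquisitions, as a blink might."""
    return angles[:start] + angles[start + length:]


n = 200
golden_kept = drop_block(golden_meridians(n), 80, 40)
sequential_kept = drop_block(sequential_meridians(n), 80, 40)
print(largest_gap(golden_kept), largest_gap(sequential_kept))
```

In this sketch the sequential scan is left with one wide unsampled wedge, while the golden-angle scan keeps its largest gap small because consecutive acquisitions are already spread over all orientations.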
Validation of the Oxford Participation and Activities Questionnaire.
Morley, David; Dummett, Sarah; Kelly, Laura; Dawson, Jill; Fitzpatrick, Ray; Jenkinson, Crispin
2016-01-01
There is growing interest in the management of long-term conditions and in keeping people active and participating in the community. Testing the effectiveness of interventions that aim to affect activities and participation can be challenging without a well-developed, valid, and reliable instrument. This study therefore aims to develop a patient-reported outcome measure, the Oxford Participation and Activities Questionnaire (Ox-PAQ), which is theoretically grounded in the World Health Organization's International Classification of Functioning, Disability, and Health (ICF) and fully compliant with current best practice guidelines. Questionnaire items generated from patient interviews and based on the nine chapters of the ICF were administered by postal survey to 386 people with three neurological conditions: motor neuron disease, multiple sclerosis, and Parkinson's disease. Participants also completed the Medical Outcomes Study (MOS) 36-Item Short Form Health Survey (SF-36) and EQ-5D-5L. Thus, 334 participants completed the survey, a response rate of 86.5%. Factor analysis techniques identified three Ox-PAQ domains, consisting of 23 items, accounting for 72.8% of variance. Internal reliability for the three domains was high (Cronbach's α: 0.81-0.96), as was test-retest reliability (intraclass correlation: 0.83-0.92). Concurrent validity was demonstrated through highly significant relationships with relevant domains of the MOS SF-36 and the EQ-5D-5L. Assessment of known-groups validity identified significant differences in Ox-PAQ scores among the three conditions included in the survey. Results suggest that the Ox-PAQ is a valid and reliable measure of participation and activity. The measure will now be validated in a range of further conditions, and additional properties, such as responsiveness, will also be assessed in the next phase of the instrument's development.
Validation of the Oxford Participation and Activities Questionnaire
Morley, David; Dummett, Sarah; Kelly, Laura; Dawson, Jill; Fitzpatrick, Ray; Jenkinson, Crispin
2016-01-01
Purpose There is growing interest in the management of long-term conditions and in keeping people active and participating in the community. Testing the effectiveness of interventions that aim to affect activities and participation can be challenging without a well-developed, valid, and reliable instrument. This study therefore aims to develop a patient-reported outcome measure, the Oxford Participation and Activities Questionnaire (Ox-PAQ), which is theoretically grounded in the World Health Organization’s International Classification of Functioning, Disability, and Health (ICF) and fully compliant with current best practice guidelines. Methods Questionnaire items generated from patient interviews and based on the nine chapters of the ICF were administered by postal survey to 386 people with three neurological conditions: motor neuron disease, multiple sclerosis, and Parkinson’s disease. Participants also completed the Medical Outcomes Study (MOS) 36-Item Short Form Health Survey (SF-36) and EQ-5D-5L. Results Thus, 334 participants completed the survey, a response rate of 86.5%. Factor analysis techniques identified three Ox-PAQ domains, consisting of 23 items, accounting for 72.8% of variance. Internal reliability for the three domains was high (Cronbach’s α: 0.81–0.96), as was test–retest reliability (intraclass correlation: 0.83–0.92). Concurrent validity was demonstrated through highly significant relationships with relevant domains of the MOS SF-36 and the EQ-5D-5L. Assessment of known-groups validity identified significant differences in Ox-PAQ scores among the three conditions included in the survey. Conclusion Results suggest that the Ox-PAQ is a valid and reliable measure of participation and activity. The measure will now be validated in a range of further conditions, and additional properties, such as responsiveness, will also be assessed in the next phase of the instrument’s development. PMID:27366108
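The internal reliability statistic named above, Cronbach's α, can be computed directly from item-level scores. The sketch below uses the standard formula α = k/(k-1) x (1 - sum of item variances / variance of total scores). The response data are hypothetical and are not taken from the Ox-PAQ study.

```python
from statistics import variance


def cronbach_alpha(items):
    """Cronbach's alpha.

    items: list of k lists, one per questionnaire item, each holding the
    scores of the same respondents in the same order.
    """
    k = len(items)
    totals = [sum(scores) for scores in zip(*items)]  # per-respondent totals
    item_variance_sum = sum(variance(scores) for scores in items)
    return k / (k - 1) * (1 - item_variance_sum / variance(totals))


# Hypothetical responses: three items answered by five respondents.
responses = [
    [2, 3, 4, 5, 3],
    [3, 3, 4, 5, 2],
    [2, 4, 4, 4, 3],
]
print(round(cronbach_alpha(responses), 2))
```

Values close to 1 indicate that the items of a domain vary together across respondents. The domain-level values in the abstract (0.81 to 0.96) would each come from applying such a calculation to the items of one domain.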
O'Grady, Michael G; Dusing, Stacey C
2015-01-01
Play is vital for development. Infants and children learn through play. Traditional standardized developmental tests measure whether a child performs individual skills within controlled environments. Play-based assessments can measure skill performance during natural, child-driven play. The purpose of this study was to systematically review reliability, validity, and responsiveness of all play-based assessments that quantify motor and cognitive skills in children from birth to 36 months of age. Studies were identified from a literature search using PubMed, ERIC, CINAHL, and PsycINFO databases and the reference lists of included papers. Included studies investigated reliability, validity, or responsiveness of play-based assessments that measured motor and cognitive skills for children to 36 months of age. Two reviewers independently screened 40 studies for eligibility and inclusion. The reviewers independently extracted reliability, validity, and responsiveness data. They examined measurement properties and methodological quality of the included studies. Four current play-based assessment tools were identified in 8 included studies. Each play-based assessment tool measured motor and cognitive skills in a different way during play. Interrater reliability correlations ranged from .86 to .98 for motor development and from .23 to .90 for cognitive development. Test-retest reliability correlations ranged from .88 to .95 for motor development and from .45 to .91 for cognitive development. Structural validity correlations ranged from .62 to .90 for motor development and from .42 to .93 for cognitive development. One study assessed responsiveness to change in motor development. Most studies had small and poorly described samples. Lack of transparency in data management and statistical analysis was common. Play-based assessments have potential to be reliable and valid tools to assess cognitive and motor skills, but higher-quality research is needed. 
Psychometric properties should be considered for each play-based assessment before it is used in clinical and research practice. © 2015 American Physical Therapy Association.
Wireless sensing system for bridge condition assessment and health monitoring
NASA Astrophysics Data System (ADS)
Gangone, Michael V.; Whelan, Matthew J.; Janoyan, Kerop D.
2009-03-01
Discussed in this paper is the deployment of a universal and low-cost dense wireless sensor system for structural monitoring, load rating and condition assessment of bridges. The wireless sensor system developed is designed specifically for diagnostic bridge monitoring, providing independent conditioning for both accelerometers and strain transducers in addition to high-rate wireless data transmission. The system was field deployed on a three span simply supported bridge superstructure, where strain and acceleration measurements were obtained simultaneously and in realtime at critical locations under several loading conditions, providing reliable quantitative information as to the actual performance level of the bridge. Monitoring was also conducted as the bridge was subjected to various controlled damage scenarios on the final day of testing. Select cases of detected damage using strain and modal based analysis are presented.
The purpose of the field demonstration program is to gather technically reliable cost and performance information on selected condition assessment technologies under defined field conditions. The selected technologies include zoom camera, focused electrode leak location (FELL), ...
Evaluation of Animal-Based Indicators to Be Used in a Welfare Assessment Protocol for Sheep.
Richmond, Susan E; Wemelsfelder, Francoise; de Heredia, Ina Beltran; Ruiz, Roberto; Canali, Elisabetta; Dwyer, Cathy M
2017-01-01
Sheep are managed under a variety of different environments (continually outdoors, partially outdoors with seasonal or diurnal variation, continuously indoors) and for different purposes, which makes assessing welfare challenging. This diversity means that resource-based indicators are not particularly useful and, thus, a welfare assessment scheme for sheep, focusing on animal-based indicators, was developed. We focus specifically on ewes, as the most numerous group of sheep present on farm, although many of the indicators may also have relevance to adult male sheep. Using the Welfare Quality® framework of four Principles and 12 Criteria, we considered the validity, reliability, and feasibility of 46 putative animal-based indicators derived from the literature for these criteria. Where animal-based indicators were potentially unreliable or were not considered feasible, we also considered the resource-based indicators of access to water, stocking density, and floor slipperiness. With the exception of the criterion "Absence of prolonged thirst," we suggest at least one animal-based indicator for each welfare criterion. As a minimum, face validity was available for all indicators; however, for many, we found evidence of convergent validity and discriminant validity (e.g., lameness as measured by gait score, body condition score). The reliability of most of the physical and health measures has been tested in the field and found to be appropriate for use in welfare assessment. However, for the majority of the proposed behavioral indicators (lying synchrony, social withdrawal, postures associated with pain, vocalizations, stereotypy, vigilance, response to surprise, and human approach test), this still needs to be tested. In conclusion, the comprehensive assessment of sheep welfare through largely animal-based measures is supported by the literature through the use of indicators focusing on specific aspects of sheep biology. 
Further work is required for some indicators to ensure that measures are reliable when used in commercial settings.
Evaluation of Animal-Based Indicators to Be Used in a Welfare Assessment Protocol for Sheep
Richmond, Susan E.; Wemelsfelder, Francoise; de Heredia, Ina Beltran; Ruiz, Roberto; Canali, Elisabetta; Dwyer, Cathy M.
2017-01-01
Sheep are managed under a variety of different environments (continually outdoors, partially outdoors with seasonal or diurnal variation, continuously indoors) and for different purposes, which makes assessing welfare challenging. This diversity means that resource-based indicators are not particularly useful and, thus, a welfare assessment scheme for sheep, focusing on animal-based indicators, was developed. We focus specifically on ewes, as the most numerous group of sheep present on farm, although many of the indicators may also have relevance to adult male sheep. Using the Welfare Quality® framework of four Principles and 12 Criteria, we considered the validity, reliability, and feasibility of 46 putative animal-based indicators derived from the literature for these criteria. Where animal-based indicators were potentially unreliable or were not considered feasible, we also considered the resource-based indicators of access to water, stocking density, and floor slipperiness. With the exception of the criterion “Absence of prolonged thirst,” we suggest at least one animal-based indicator for each welfare criterion. As a minimum, face validity was available for all indicators; however, for many, we found evidence of convergent validity and discriminant validity (e.g., lameness as measured by gait score, body condition score). The reliability of most of the physical and health measures has been tested in the field and found to be appropriate for use in welfare assessment. However, for the majority of the proposed behavioral indicators (lying synchrony, social withdrawal, postures associated with pain, vocalizations, stereotypy, vigilance, response to surprise, and human approach test), this still needs to be tested. In conclusion, the comprehensive assessment of sheep welfare through largely animal-based measures is supported by the literature through the use of indicators focusing on specific aspects of sheep biology. 
Further work is required for some indicators to ensure that measures are reliable when used in commercial settings. PMID:29322048
Validity and reliability of self-reported arthritis: Georgia senior centers, 2000-2001.
Bombard, Jennifer M; Powell, Kenneth E; Martin, Linda M; Helmick, Charles G; Wilson, William H
2005-04-01
Arthritis and other rheumatic conditions are a common cause of disability among adults in the United States. Telephone interviews of the state-based Behavioral Risk Factor Surveillance System (BRFSS) are used by states and territories to estimate the prevalence of arthritis. The purpose of this study was to assess the validity and reliability of the 1996-2001 BRFSS arthritis case definition in a senior center population. A total of 487 respondents at selected senior centers in Georgia were surveyed by telephone, evaluated 3 to 4 weeks later by board-certified rheumatologists, and completed a written survey in 2000 to 2001. Using the rheumatologists' summary assessment "Does this person have arthritis or a related condition" as the standard, the sensitivity and specificity of the BRFSS arthritis case definitions were calculated. Reliability for the BRFSS arthritis case definition was also calculated by comparing responses to the telephone survey with responses to a written survey. Sensitivity was 70.8% (95% confidence interval [CI]=65.9-75.6), and the specificity was 70.3% (95% CI=62.9-77.8). The agreement between the telephone and written responses indicating self-reported arthritis was high (kappa=0.68). Analyses were conducted in 2002 to 2004. Self-reported arthritis in the 1996-2001 BRFSS was highly reliable, and moderately sensitive and specific among these senior center participants.
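The validity and agreement statistics named in this abstract (sensitivity, specificity, and kappa) can be sketched as follows. This is a minimal illustration, not the study's analysis code: the function names are hypothetical, the cell counts in the usage note are invented, and the normal-approximation (Wald) interval is an assumption, since the abstract does not say how its confidence intervals were computed.

```python
import math

def proportion_ci(successes, total, z=1.96):
    """Return (estimate, lower, upper) using a Wald interval (assumed method)."""
    p = successes / total
    half_width = z * math.sqrt(p * (1 - p) / total)
    return p, p - half_width, p + half_width

def sensitivity_specificity(tp, fn, tn, fp):
    """Sensitivity and specificity against a reference standard, each with a CI."""
    sensitivity = proportion_ci(tp, tp + fn)   # true positives among reference positives
    specificity = proportion_ci(tn, tn + fp)   # true negatives among reference negatives
    return sensitivity, specificity

def cohen_kappa(both_yes, a_yes_b_no, a_no_b_yes, both_no):
    """Cohen's kappa for two yes/no ratings of the same people."""
    n = both_yes + a_yes_b_no + a_no_b_yes + both_no
    observed = (both_yes + both_no) / n
    a_yes = (both_yes + a_yes_b_no) / n
    b_yes = (both_yes + a_no_b_yes) / n
    # Agreement expected by chance from the two marginal "yes" rates.
    expected = a_yes * b_yes + (1 - a_yes) * (1 - b_yes)
    return (observed - expected) / (1 - expected)
```

With invented counts, `cohen_kappa(40, 10, 10, 40)` compares 80% observed agreement with 50% chance agreement, which is why kappa is lower than raw percent agreement.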
Model Development for Risk Assessment of Driving on Freeway under Rainy Weather Conditions
Cai, Xiaonan; Wang, Chen; Chen, Shengdi; Lu, Jian
2016-01-01
Rainy weather conditions could result in significantly negative impacts on driving on freeways. However, due to lack of enough historical data and monitoring facilities, many regions are not able to establish reliable risk assessment models to identify such impacts. Given the situation, this paper provides an alternative solution where the procedure of risk assessment is developed based on drivers’ subjective questionnaire and its performance is validated by using actual crash data. First, an ordered logit model was developed, based on questionnaire data collected from Freeway G15 in China, to estimate the relationship between drivers’ perceived risk and factors, including vehicle type, rain intensity, traffic volume, and location. Then, weighted driving risk for different conditions was obtained by the model, and further divided into four levels of early warning (specified by colors) using a rank order cluster analysis. After that, a risk matrix was established to determine which warning color should be disseminated to drivers, given a specific condition. Finally, to validate the proposed procedure, actual crash data from Freeway G15 were compared with the safety prediction based on the risk matrix. The results show that the risk matrix obtained in the study is able to predict driving risk consistent with actual safety implications, under rainy weather conditions. PMID:26894434
Pati, Sanghamitra; Hussain, Mohammad Akhtar; Swain, Subhashisa; Salisbury, Chris; Metsemakers, Job F M; Knottnerus, J André; van den Akker, Marjan
2016-01-01
Multimorbidity remains an underexplored domain in Indian primary care. We undertook a study to assess the prevalence, correlates, and outcomes of multimorbidity in primary care settings in India. This paper describes the process of development and validation of our data collection tool "Multimorbidity Assessment Questionnaire for Primary Care (MAQ-PC)." An iterative process comprising desk review, chart review, and expert consultations was undertaken to generate the questionnaire. The MAQ-PC contained items on chronic conditions, health care utilization, health related quality of life, disease severity, and sociodemographics. It was first tested with twelve adults for comprehensibility followed by test-retest reliability with 103 patients from four primary care practices. For interrater reliability, two interviewers separately administered the questionnaire to sixteen patients. MAQ-PC displayed strong internal consistency (Cronbach's alpha: 0.69), interrater reliability (Cohen's Kappa: 0.78-1), and test-retest reliability (ICC: 0.741-0.970). Substantial concordance between self-report and physician diagnosis (Scott Kappa: 0.59-1.0) was observed for listed chronic conditions, indicating strong concurrent validity. Nearly 54% had one chronic condition and 23.3% had multimorbidity. Our findings demonstrate MAQ-PC to be a valid and reliable measure of multimorbidity in primary care practice and suggest its potential utility in multimorbidity research in India.
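Internal consistency of the kind reported here is usually summarised with Cronbach's alpha. The sketch below shows the standard formula under stated assumptions: the function name and the tiny data set in the usage note are hypothetical, and sample variances are used, which may differ from the software the authors relied on.

```python
from statistics import variance

def cronbach_alpha(responses):
    """responses: one list of item scores per respondent (all the same length)."""
    k = len(responses[0])                                # number of items
    item_columns = list(zip(*responses))                 # transpose to per-item scores
    item_variance_sum = sum(variance(col) for col in item_columns)
    total_variance = variance([sum(row) for row in responses])
    # alpha = k/(k-1) * (1 - sum of item variances / variance of total score)
    return k / (k - 1) * (1 - item_variance_sum / total_variance)
```

For example, `cronbach_alpha([[1, 2], [2, 1], [3, 4], [4, 3]])` evaluates two items that track each other only loosely, so alpha falls below 1.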
ERIC Educational Resources Information Center
Chang, Chi-Cheng; Liang, Chaoyun; Chen, Yi-Hui
2013-01-01
This study explored the reliability and validity of Web-based portfolio self-assessment. Participants were 72 senior high school students enrolled in a computer application course. The students created learning portfolios, viewed peers' work, and performed self-assessment on the Web-based portfolio assessment system. The results indicated: 1)…
Krzepota, Justyna; Sadowska, Dorota; Sempolska, Katarzyna; Pelczar, Małgorzata
2017-12-23
The assessment of physical activity during pregnancy is crucial in perinatal care and is an important research topic. Unfortunately, Poland lacks a single commonly accepted questionnaire of physical activity during pregnancy. The aim of this study was to adapt the Pregnancy Physical Activity Questionnaire (PPAQ) to Polish conditions and assess the reliability of its Polish version (PPAQ-PL). The PPAQ was translated from English into Polish and its reliability tested. A total of 64 correctly completed questionnaires (filled in twice, one week apart) qualified for analysis. Test-retest reliability was assessed using the Intraclass Correlation Coefficient (ICC). As a result of the adaptation and psychometric assessment, the number of questions in the Polish version of the questionnaire was reduced from 36 to 35 by removing the question concerning 'mowing lawn while on a riding mower'. The ICC value for total activity was 0.75, which confirms a substantial level of reliability. The ICC values for subscales of intensity ranged from 0.53 (light) to 0.86 (vigorous). For subscales of type, ICC values ranged from 0.59 (transportation) to 0.89 (household/caregiving). The PPAQ-PL can be accepted as a reliable tool for assessing the physical activity of pregnant women in Poland. Information obtained using the questionnaire might be helpful in monitoring health behaviours, preventing obesity, as well as designing and promoting physical activity programmes for pregnant women.
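A test-retest ICC of the kind reported above can be computed from a two-way analysis of variance. The abstract does not state which ICC form was used, so the sketch below assumes the two-way random-effects, absolute-agreement, single-measurement form, often written ICC(2,1); the function name is hypothetical.

```python
def icc_2_1(scores):
    """scores: one list per subject holding k repeated measurements."""
    n = len(scores)           # subjects
    k = len(scores[0])        # sessions (or raters)
    grand = sum(sum(row) for row in scores) / (n * k)
    row_means = [sum(row) / k for row in scores]
    col_means = [sum(row[j] for row in scores) / n for j in range(k)]

    ss_rows = k * sum((m - grand) ** 2 for m in row_means)    # between subjects
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)    # between sessions
    ss_total = sum((x - grand) ** 2 for row in scores for x in row)
    ss_error = ss_total - ss_rows - ss_cols

    ms_rows = ss_rows / (n - 1)
    ms_cols = ss_cols / (k - 1)
    ms_error = ss_error / ((n - 1) * (k - 1))
    return (ms_rows - ms_error) / (
        ms_rows + (k - 1) * ms_error + k * (ms_cols - ms_error) / n
    )
```

Because this form measures absolute agreement, a constant shift between the first and second administration lowers the coefficient even when the ranking of subjects is unchanged.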
Assessment of wheelchair driving performance in a virtual reality-based simulator
Mahajan, Harshal P.; Dicianno, Brad E.; Cooper, Rory A.; Ding, Dan
2013-01-01
Objective To develop a virtual reality (VR)-based simulator that can assist clinicians in performing standardized wheelchair driving assessments. Design A completely within-subjects repeated measures design. Methods Participants drove their wheelchairs along a virtual driving circuit modeled after the Power Mobility Road Test (PMRT) and in a hallway with decreasing width. The virtual simulator was displayed on computer screen and VR screens and participants interacted with it using a set of instrumented rollers and a wheelchair joystick. Driving performances of participants were estimated and compared using quantitative metrics from the simulator. Qualitative ratings from two experienced clinicians were used to estimate intra- and inter-rater reliability. Results Ten regular wheelchair users (seven men, three women; mean age ± SD, 39.5 ± 15.39 years) participated. The virtual PMRT scores from the two clinicians show high inter-rater reliability (78–90%) and high intra-rater reliability (71–90%) for all test conditions. More research is required to explore user preferences and effectiveness of the two control methods (rollers and mathematical model) and the display screens. Conclusions The virtual driving simulator seems to be a promising tool for wheelchair driving assessment that clinicians can use to supplement their real-world evaluations. PMID:23820148
Weir, David R.; Wallace, Robert B.; Langa, Kenneth M.; Plassman, Brenda L.; Wilson, Robert S.; Bennett, David A.; Duara, Ranjan; Loewenstein, David; Ganguli, Mary; Sano, Mary
2011-01-01
Establishing methods for ascertainment of dementia and cognitive impairment that are accurate and also cost effective is a challenging enterprise. Large population-based studies often using administrative data sets offer relatively inexpensive but reliable estimates of severe conditions including moderate to advanced dementia that are useful for public health planning, but they can miss less severe cognitive impairment which may be the most effective point for intervention. Clinical and epidemiological cohorts, intensively assessed, provide more sensitive detection of less severe cognitive impairment but are often costly. Here, several approaches to ascertainment are evaluated for validity, reliability, and cost. In particular, the methods of ascertainment from the Health and Retirement Study (HRS) are described briefly, along with those of the Aging, Demographics, and Memory Study (ADAMS). ADAMS, a resource-intense sub-study of the HRS, was designed to provide diagnostic accuracy among persons with more advanced dementia. A proposal to streamline future ADAMS assessments is offered. Also considered are decision tree, algorithmic, and web-based approaches to diagnosis that can reduce the expense of clinical expertise and, in some contexts, can reduce the extent of data collection. These approaches are intended for intensively assessed epidemiological cohorts. The goal is valid and reliable detection with efficient and cost-effective tools. PMID:21255747
Hoenig, Helen M; Amis, Kristopher; Edmonds, Carol; Morgan, Michelle S; Landerman, Lawrence; Caves, Kevin
2017-01-01
Background There is limited research about the effects of video quality on the accuracy of assessments of physical function. Methods A repeated measures study design was used to assess reliability and validity of the finger-nose test (FNT) and the finger-tapping test (FTT) carried out with 50 veterans who had impairment in gross and/or fine motor coordination. Videos were scored by expert raters under eight differing conditions, including in-person, high definition video with slow motion review and standard speed videos with varying bit rates and frame rates. Results FTT inter-rater reliability was excellent with slow motion video (ICC 0.98-0.99) and good (ICC 0.59) under the normal speed conditions. Inter-rater reliability for FNT 'attempts' was excellent (ICC 0.97-0.99) for all viewing conditions; for FNT 'misses' it was good to excellent (ICC 0.89) with slow motion review but substantially worse (ICC 0.44) on the normal speed videos. FTT criterion validity (i.e. compared to slow motion review) was excellent (β = 0.94) for the in-person rater and good (β = 0.77) on normal speed videos. Criterion validity for FNT 'attempts' was excellent under all conditions (r ≥ 0.97) and for FNT 'misses' it was good to excellent under all conditions (β = 0.61-0.81). Conclusions In general, the inter-rater reliability and validity of the FNT and FTT assessed via video technology is similar to standard clinical practices, but is enhanced with slow motion review and/or higher bit rate.
The purpose of the field demonstration program is to gather technically reliable cost and performance information on selected condition assessment technologies under defined field conditions. The selected technologies include zoom camera, electro-scan (FELL-41), and a multi-sens...
NASA Astrophysics Data System (ADS)
Martowicz, Adam; Uhl, Tadeusz
2012-10-01
The paper discusses the applicability of a reliability- and performance-based multi-criteria robust design optimization technique for micro-electromechanical systems, considering their technological uncertainties. Nowadays, micro-devices are commonly applied systems, especially in the automotive industry, taking advantage of utilizing both the mechanical structure and electronic control circuit on one board. Their frequent use motivates the elaboration of virtual prototyping tools that can be applied in design optimization with the introduction of technological uncertainties and reliability. The authors present a procedure for the optimization of micro-devices, which is based on the theory of reliability-based robust design optimization. This takes into consideration the performance of a micro-device and its reliability assessed by means of uncertainty analysis. The procedure assumes that, for each checked design configuration, the assessment of uncertainty propagation is performed with the meta-modeling technique. The described procedure is illustrated with an example of the optimization carried out for a finite element model of a micro-mirror. The multi-physics approach allowed the introduction of several physical phenomena to correctly model the electrostatic actuation and the squeezing effect present between electrodes. The optimization was preceded by sensitivity analysis to establish the design and uncertain domains. The genetic algorithms fulfilled the defined optimization task effectively. The best discovered individuals are characterized by a minimized value of the multi-criteria objective function, simultaneously satisfying the constraint on material strength. The restriction of the maximum equivalent stresses was introduced with the conditionally formulated objective function with a penalty component. The yielded results were successfully verified with a global uniform search through the input design domain.
Leifker, Feea R.; Patterson, Thomas L.; Bowie, Christopher R.; Mausbach, Brent T.; Harvey, Philip D.
2010-01-01
Performance-based measures of the ability to perform social and everyday living skills are being more widely used to assess functional capacity in people with serious mental illnesses such as schizophrenia and bipolar disorder. Since they are also being used as outcome measures in pharmacological and cognitive remediation studies aimed at cognitive impairments in schizophrenia, understanding their measurement properties and potential sensitivity to change is important. In this study, the test-retest reliability, practice effects, and reliable change indices of two different performance-based functional capacity measures, the UCSD Performance-based skills assessment (UPSA) and Social skills performance assessment (SSPA) were examined over several different retest intervals in two different samples of people with schizophrenia (n’s=238 and 116) and a healthy comparison sample (n=109). These psychometric properties were compared to those of a neuropsychological assessment battery. Test-retest reliabilities of the long form of the UPSA ranged from r=.63 to r=.80 over follow-up periods up to 36 months in people with schizophrenia, while brief UPSA reliabilities ranged from r=.66 to r=.81. Test-retest reliability of the NP performance scores ranged from r=.77 to r=.79. Test-retest reliabilities of the UPSA were lower in healthy controls, while NP performance was slightly more reliable. SSPA test-retest reliability was lower. Practice effect sizes ranged from .05 to .16 for the UPSA and .07 to .19 for the NP assessment in patients, with HC having more practice effects. Reliable change intervals were consistent across NP and both FC measures, indicating equal potential for detection of change. These performance-based measures of functional capacity appear to have similar potential to be sensitive to change compared to NP performance in people with schizophrenia. PMID:20399613
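A reliable change index relates an observed score change to the measurement error implied by test-retest reliability. The sketch below uses the common Jacobson-Truax form with an optional practice-effect correction; this is an assumption, since the abstract does not specify the formula used, and the function and parameter names are hypothetical.

```python
import math

def reliable_change_index(baseline, retest, sd_baseline, test_retest_r,
                          practice_effect=0.0):
    """Change score divided by the standard error of the difference."""
    # Standard error of measurement from the baseline SD and the reliability.
    sem = sd_baseline * math.sqrt(1 - test_retest_r)
    # Two error-prone measurements contribute to a difference score.
    se_difference = math.sqrt(2) * sem
    return (retest - baseline - practice_effect) / se_difference
```

An absolute value above about 1.96 is conventionally read as change beyond measurement error at the 95% level; lower reliability widens the band a change must exceed.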
Do aggressive signals evolve towards higher reliability or lower costs of assessment?
Ręk, P
2014-12-01
It has been suggested that the evolution of signals must be a wasteful process for the signaller, aimed at the maximization of signal honesty. However, the reliability of communication depends not only on the costs paid by signallers but also on the costs paid by receivers during assessment, and less attention has been given to the interaction between these two types of costs during the evolution of signalling systems. A signaller and receiver may accept some level of signal dishonesty by choosing signals that are cheaper in terms of assessment but that are stabilized with less reliable mechanisms. I studied the potential trade-off between signal reliability and the costs of signal assessment in the corncrake (Crex crex). I found that the birds prefer signals that are less costly regarding assessment rather than more reliable. Despite the fact that the fundamental frequency of calls was a strong predictor of male size, it was ignored by receivers unless they could directly compare signal variants. My data revealed a response advantage of costly signals when comparison between calls differing with fundamental frequencies is fast and straightforward, whereas cheap signalling is preferred in natural conditions. These data might improve our understanding of the influence of receivers on signal design because they support the hypothesis that fully honest signalling systems may be prone to dishonesty based on the effects of receiver costs and be replaced by signals that are cheaper in production and reception but more susceptible to cheating. © 2014 European Society For Evolutionary Biology. Journal of Evolutionary Biology © 2014 European Society For Evolutionary Biology.
Assessment of the psychometric properties of the Family Management Measure.
Knafl, Kathleen; Deatrick, Janet A; Gallo, Agatha; Dixon, Jane; Grey, Margaret; Knafl, George; O'Malley, Jean
2011-06-01
This paper reports development of the Family Management Measure (FaMM) of parental perceptions of family management of chronic conditions. By telephone interview, 579 parents of children age 3 to 19 with a chronic condition (349 partnered mothers, 165 partners, 65 single mothers) completed the FaMM and measures of child functional status and behavioral problems and family functioning. Analyses addressed reliability, factor structure, and construct validity. Exploratory factor analysis yielded six scales: Child's Daily Life, Condition Management Ability, Condition Management Effort, Family Life Difficulty, Parental Mutuality, and View of Condition Impact. Internal consistency reliability ranged from .72 to .91, and test-retest reliability from .71 to .94. Construct validity was supported by significant correlations in hypothesized directions between FaMM scales and established measures. Results support FaMM's reliability and validity, indicating it performs in a theoretically meaningful way and taps distinct aspects of family response to childhood chronic conditions.
Koh, Yi Ling Eileen; Lua, Yi Hui Adela; Hong, Liyue; Bong, Huey Shin Shirley; Yeo, Ling Sui Jocelyn; Tsang, Li Ping Marianne; Ong, Kai Zhi; Wong, Sook Wai Samantha; Tan, Ngiap Chuan
2016-03-01
Essential hypertension often requires affected patients to self-manage their condition most of the time. Besides seeking regular medical review of their life-long condition to detect vascular complications, patients have to maintain healthy lifestyles in between physician consultations via diet and physical activity, and to take their medications according to their prescriptions. Their self-management ability is influenced by their self-efficacy capacity, which can be assessed using questionnaire-based tools. The "Hypertension Self-Care Profile" (HTN-SCP) is one such questionnaire assessing self-efficacy in the domains of "behavior," "motivation," and "self-efficacy." This study aims to determine the test-retest reliability of HTN-SCP in an English-literate Asian population using a web-based approach. Multiethnic Asian patients, aged 40 years and older, with essential hypertension were recruited from a typical public primary care clinic in Singapore. The investigators guided the patients to complete the web-based 60-item HTN-SCP in English using a tablet or smartphone on the first visit and to complete it again 2 weeks later for the retest. Internal consistency and test-retest reliability were evaluated using Cronbach's Alpha and intraclass correlation coefficients (ICC), respectively. The t test was used to determine the relationship between the overall HTN-SCP scores of the patients and their self-reported self-management activities. A total of 160 patients completed the HTN-SCP during the initial test, from which 71 test-retest responses were completed. No floor or ceiling effect was found for the scores for the 3 subscales. Cronbach's Alpha coefficients were 0.857, 0.948, and 0.931 for "behavior," "motivation," and "self-efficacy" domains respectively, indicating high internal consistency. The item-total correlation ranges for the 3 scales were from 0.105 to 0.656 for Behavior, 0.401 to 0.808 for Motivation, 0.349 to 0.789 for Self-efficacy. The corresponding ICC scores of 0.671, 0.762, and 0.720 for these respective domains showed good test-retest reliability. The correlations between the HTN-SCP scores and patients' reported self-management measures were significant, except for keeping a food diary. HTN-SCP showed satisfactory internal consistency and test-retest reliability in an English-literate Asian population. A web-based approach is feasible if similar studies are needed to validate translated versions of the tool for wider application in the local multilingual population.
Sharif Nia, Hamid; Pahlevan Sharif, Saeed; Lehto, Rebecca H; Allen, Kelly A; Goudarzian, Amir Hossein; Yaghoobzadeh, Ameneh; Soleimani, Mohammad Ali
2017-07-01
Objective: Limited research has examined the psychometric properties of death depression scales in Persian populations with cardiac disease despite the need for valid assessment tools for evaluating depressive symptoms in patients with life-limiting chronic conditions. The present study aimed at evaluating the reliability and validity of the Persian Version of Death Depression Scale - Revised (DDS-R) in Iranian patients who had recent acute myocardial infarction (AMI). Method: This psychometric study was conducted with a convenience sample of 407 patients with AMI diagnosis who completed the Persian version of the DDS-R. The face, content, and construct validity of the scale were ascertained. Internal consistency, test-retest, and construct reliability (CR) were used to assess reliability of the Persian Version of DDS-R. Results: Based on maximum likelihood exploratory factor analysis and consideration of conceptual meaning, a 4-factor solution was identified, explaining 75.89% of the total variance. Goodness-of-fit indices (GFI), Comparative Fit Index (CFI), Normed Fit Index (NFI), Incremental Fit Index (IFI), and Root Mean Square Error of Approximation (RMSEA) in the final DDS-R structure demonstrated the adequacy of the 4-domain structure. The internal consistency, construct reliability, and Intra-class Correlation Coefficients (ICC) were greater than .70. Conclusion: The DDS-R was found to be a valid and reliable assessment tool for evaluating death depression symptoms in Iranian patients with AMI.
NASA Astrophysics Data System (ADS)
Murrad, Muhamad; Leong, M. Salman
Based on the experiences of the Malaysian Armed Forces (MAF), failure of the main rotor gearbox (MRGB) was one of the major contributing factors to helicopter breakdowns. Even though vibration and oil analysis are effective techniques for monitoring the health of helicopter components, these two techniques were rarely combined to form an effective assessment tool in MAF. Results of the oil analysis were often used only for the oil changing schedule, while assessments of MRGB condition were mainly based on overall vibration readings. A study group was formed and given a mandate to improve the maintenance strategy of the S61-A4 helicopter fleet in the MAF. The improvement consisted of a structured approach to the reassessment/redefinition of suitable maintenance actions that should be taken for the MRGB. Basic and enhanced tools for condition monitoring (CM) are investigated to address the predominant failures of the MRGB. Quantitative accelerated life testing (QALT) was considered in this work with an intent to obtain the required reliability information in a shorter time with tests under normal stress conditions. These tests, when performed correctly, can provide valuable information about MRGB performance under normal operating conditions, which enables maintenance personnel to make decisions more quickly, accurately and economically. The time-to-failure and probability of failure information of the MRGB were generated by applying QALT analysis principles. This study is anticipated to make a dramatic change in the MAF's approach to CM, bringing significant savings and various benefits to MAF.
Chen, Chia Lin; Lo, Chu Ling; Huang, Kai Chu; Huang, Chen Fu
2017-10-01
[Purpose] The aim of this study was to determine the intrarater reliability of using ultrasonography as a measurement tool to assess the patella position in a weight-bearing condition. [Subjects and Methods] Ten healthy adults participated in this study. Ultrasonography was used to assess the patella position during step down with the loading knee in flexion (0° and 20°). The distance between the patella and lateral condyle was measured to represent the patella position on the condylar groove. Two measurements were obtained on the first day and the day after 1 week by the same investigator. [Results] Excellent intrarater reliability, ranging from 0.83 to 0.93, was shown in both conditions. Standard errors of the measurements were 0.5 mm in the straight knee and 0.7 mm in the knee flexion at 20°. Minimal differences in knee flexion at 0° and knee flexion at 20° were 1.5 mm and 1.9 mm, respectively. [Conclusion] Ultrasonography is a reliable assessment tool for evaluating the positional changes of the patella in weight-bearing activities, and it can be easily used by practitioners in the clinical setting.
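The link between a standard error of measurement (SEM) and a minimal detectable difference can be shown with a short worked calculation. The sketch assumes the widely used 95% convention, minimal difference = 1.96 × √2 × SEM; the abstract does not state which formula the authors applied, and the function name is hypothetical.

```python
import math

def minimal_detectable_change(sem, z=1.96):
    """Smallest change exceeding measurement error at the confidence level set by z."""
    # sqrt(2) accounts for error in both the first and the second measurement.
    return z * math.sqrt(2) * sem

# SEM values quoted in the abstract, in millimetres.
for sem_mm in (0.5, 0.7):
    print(f"SEM {sem_mm} mm -> minimal difference {minimal_detectable_change(sem_mm):.2f} mm")
```

Under this assumed convention the rounded SEMs of 0.5 mm and 0.7 mm give roughly 1.4 mm and 1.9 mm, close to but not identical with the reported 1.5 mm and 1.9 mm, so the study may have worked from unrounded values or a slightly different convention.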
49 CFR Appendix E to Part 238 - General Principles of Reliability-Based Maintenance Programs
Code of Federal Regulations, 2010 CFR
2010-10-01
... 49 Transportation 4 2010-10-01 2010-10-01 false General Principles of Reliability-Based... STANDARDS Pt. 238, App. E Appendix E to Part 238—General Principles of Reliability-Based Maintenance... maintenance programs are based on the following general principles. A failure is an unsatisfactory condition...
Inter-rater reliability of an observation-based ergonomics assessment checklist for office workers.
Pereira, Michelle Jessica; Straker, Leon Melville; Comans, Tracy Anne; Johnston, Venerina
2016-12-01
To establish the inter-rater reliability of an observation-based ergonomics assessment checklist for computer workers. A 37-item (38-item if a laptop was part of the workstation) comprehensive observational ergonomics assessment checklist comparable to government guidelines and up to date with empirical evidence was developed. Two trained practitioners assessed full-time office workers performing their usual computer-based work and evaluated the suitability of workstations used. Practitioners assessed each participant consecutively. The order of assessors was randomised, and the second assessor was blinded to the findings of the first. Unadjusted kappa coefficients between the raters were obtained for the overall checklist and subsections that were formed from question-items relevant to specific workstation equipment. Twenty-seven office workers were recruited. The inter-rater reliability between two trained practitioners achieved moderate to good reliability for all except one checklist component. This checklist has mostly moderate to good reliability between two trained practitioners. Practitioner Summary: This reliable ergonomics assessment checklist for computer workers was designed using accessible government guidelines and supplemented with up-to-date evidence. Employers in Queensland (Australia) can fulfil legislative requirements by using this reliable checklist to identify and subsequently address potential risk factors for work-related injury to provide a safe working environment.
Reliability of conditioned pain modulation: a systematic review
Kennedy, Donna L.; Kemp, Harriet I.; Ridout, Deborah; Yarnitsky, David; Rice, Andrew S.C.
2016-01-01
Abstract A systematic literature review was undertaken to determine if conditioned pain modulation (CPM) is reliable. Longitudinal, English language observational studies of the repeatability of a CPM test paradigm in adult humans were included. Two independent reviewers assessed the risk of bias in 6 domains (study participation, study attrition, prognostic factor measurement, outcome measurement, confounding, and analysis) using the Quality in Prognosis Studies (QUIPS) critical assessment tool. Intraclass correlation coefficients (ICCs) less than 0.4 were considered to be poor; between 0.4 and 0.59, fair; between 0.6 and 0.75, good; and greater than 0.75, excellent. Ten studies were included in the final review. Meta-analysis was not appropriate because of differences between studies. The intersession reliability of the CPM effect was investigated in 8 studies and reported as good (ICC = 0.6-0.75) in 3 studies and excellent (ICC > 0.75) in subgroups in 2 of those 3. The assessment of risk of bias demonstrated that reporting is not comprehensive for the description of sample demographics, recruitment strategy, and study attrition. The absence of blinding, a lack of control for confounding factors, and lack of standardisation in statistical analysis are common. Conditioned pain modulation is a reliable measure; however, the degree of reliability is heavily dependent on stimulation parameters and study methodology and this warrants consideration for investigators. The validation of CPM as a robust prognostic factor in experimental and clinical pain studies may be facilitated by improvements in the reporting of CPM reliability studies. PMID:27559835
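The ICC bands used in this review can be written as a small classification rule. The sketch is illustrative only: the function name is hypothetical, and because the stated bands leave values between 0.59 and 0.6 unassigned, it assumes "fair" runs up to but not including 0.6.

```python
def classify_icc(icc):
    """Map an ICC to the review's descriptive bands."""
    if icc < 0.4:
        return "poor"
    if icc < 0.6:          # assumption: closes the gap between 0.59 and 0.6
        return "fair"
    if icc <= 0.75:
        return "good"
    return "excellent"

for value in (0.35, 0.59, 0.6, 0.75, 0.9):
    print(value, classify_icc(value))
```

Other reviews use different cut-offs, so the same coefficient can carry a different label elsewhere.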
NASA Astrophysics Data System (ADS)
Biryuk, V. V.; Tsapkova, A. B.; Larin, E. A.; Livshiz, M. Y.; Sheludko, L. P.
2018-01-01
A set of mathematical models for calculating the reliability indexes (RI) of structurally complex multifunctional combined installations in heat and power supply systems was developed. Reliability of energy supply is considered a required condition for the creation and operation of heat and power supply systems. The optimal value of the power supply system coefficient F is based on an economic assessment of the consumers’ loss caused by the under-supply of electric power and additional system expenses for the creation and operation of an emergency capacity reserve. Rationing of RI of the industrial heat supply is based on the concept of the technological margin of safety of technological processes. The definition of rationed RI values of heat supply of communal consumers is based on the air temperature level inside the heated premises. The complex allows solving a number of practical tasks for providing reliability of heat supply for consumers. A probabilistic model is developed for calculating the reliability indexes of combined multipurpose heat and power plants in heat-and-power supply systems. The complex of models and calculation programs can be used to solve a wide range of specific tasks of optimization of schemes and parameters of combined heat and power plants and systems, as well as determining the efficiency of various redundancy methods to ensure specified reliability of power supply.
Krejsa, Martin; Janas, Petr; Yilmaz, Işık; Marschalko, Marian; Bouchal, Tomas
2013-01-01
The load-carrying system of each construction should fulfill several conditions which represent reliability criteria in the assessment procedure. It is the theory of structural reliability which determines the probability of keeping the required properties of constructions. Using this theory, it is possible to apply probabilistic computations based on probability theory and mathematical statistics. Development of those methods has become more and more popular; they are used, in particular, in designs of load-carrying structures with the required level of reliability when at least some input variables in the design are random. The objective of this paper is to indicate the current scope which might be covered by the new method, Direct Optimized Probabilistic Calculation (DOProC), in assessments of the reliability of load-carrying structures. DOProC uses a purely numerical approach without any simulation techniques. This provides more accurate solutions to probabilistic tasks, and, in some cases, such an approach results in considerably faster completion of computations. DOProC can be used to solve efficiently a number of probabilistic computations. A very good sphere of application for DOProC is the assessment of the bolt reinforcement in underground and mining workings. For the purposes above, a special software application, “Anchor”, has been developed. PMID:23935412
Stirling Convertor Fasteners Reliability Quantification
NASA Technical Reports Server (NTRS)
Shah, Ashwin R.; Korovaichuk, Igor; Kovacevich, Tiodor; Schreiber, Jeffrey G.
2006-01-01
Onboard Radioisotope Power Systems (RPS) being developed for NASA's deep-space science and exploration missions require reliable operation for up to 14 years and beyond. Stirling power conversion is a candidate for use in an RPS because it offers a multifold increase in the conversion efficiency of heat to electric power and reduced inventory of radioactive material. Structural fasteners are responsible for maintaining the structural integrity of the Stirling power convertor, which is critical to ensure reliable performance during the entire mission. Design of fasteners involves variables related to the fabrication, manufacturing, behavior of fasteners and joining parts material, structural geometry of the joining components, size and spacing of fasteners, mission loads, boundary conditions, etc. These variables have inherent uncertainties, which need to be accounted for in the reliability assessment. This paper describes these uncertainties along with a methodology to quantify the reliability, and provides results of the analysis in terms of quantified reliability and sensitivity of Stirling power conversion reliability to the design variables. Quantification of the reliability includes both structural and functional aspects of the joining components. Based on the results, the paper also describes guidelines to improve the reliability and verification testing.
Zaccaro, Heather N; Carbone, Emily C; Dsouza, Nishita; Xu, Michelle R; Byrne, Mary C; Kraemer, John D
2015-12-01
There is a need to develop motorcycle helmet surveillance approaches that are less labour intensive than direct observation (DO), which is the commonly recommended but never formally validated approach, particularly in developing settings. This study sought to assess public traffic camera feeds as an alternative to DO, in addition to the reliability of DO under field conditions. DO had high inter-rater reliability, κ=0.88 and 0.84, respectively, for cycle type and helmet type, which reinforces its use as a gold standard. However, traffic camera-based data collection was found to be unreliable, with κ=0.46 and 0.53 for cycle type and helmet type. When bicycles, motorcycles and scooters were classified based on traffic camera streams, only 68.4% of classifications concurred with those made via DO. Given the current technology, helmet surveillance via traffic camera streams is infeasible, and there remains a need for innovative traffic safety surveillance approaches in low-income urban settings. Published by the BMJ Publishing Group Limited.
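The κ values reported above are Cohen's kappa, which corrects the observed agreement between two raters for the agreement expected by chance. A minimal sketch of the calculation follows; the function name and the toy ratings are illustrative assumptions and are not data from the study.

```python
from collections import Counter


def cohens_kappa(rater_a, rater_b):
    """Cohen's kappa for two raters labelling the same items.

    kappa = (p_o - p_e) / (1 - p_e), where p_o is the observed proportion
    of agreement and p_e the agreement expected from the raters' marginals.
    """
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("both raters must label the same non-empty item set")
    n = len(rater_a)
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    count_a = Counter(rater_a)
    count_b = Counter(rater_b)
    expected = sum(count_a[c] * count_b[c] for c in count_a) / (n * n)
    if expected == 1.0:
        raise ValueError("kappa is undefined when chance agreement is 1")
    return (observed - expected) / (1.0 - expected)


if __name__ == "__main__":
    # Hypothetical helmet observations by two raters (not study data).
    a = ["helmet", "helmet", "none", "none"]
    b = ["helmet", "none", "none", "none"]
    print(cohens_kappa(a, b))  # observed 0.75, expected 0.5 -> 0.5
```

In this toy case the raters agree on 3 of 4 items, but half of that agreement is expected by chance, so kappa is 0.5 rather than 0.75.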
Monthly streamflow forecasting based on hidden Markov model and Gaussian Mixture Regression
NASA Astrophysics Data System (ADS)
Liu, Yongqi; Ye, Lei; Qin, Hui; Hong, Xiaofeng; Ye, Jiajun; Yin, Xingli
2018-06-01
Reliable streamflow forecasts can be highly valuable for water resources planning and management. In this study, we combined a hidden Markov model (HMM) and Gaussian Mixture Regression (GMR) for probabilistic monthly streamflow forecasting. The HMM is initialized using a kernelized K-medoids clustering method, and the Baum-Welch algorithm is then executed to learn the model parameters. GMR derives a conditional probability distribution for the predictand given covariate information, including the antecedent flow at a local station and two surrounding stations. The performance of HMM-GMR was verified based on the mean square error and continuous ranked probability score skill scores. The reliability of the forecasts was assessed by examining the uniformity of the probability integral transform values. The results show that HMM-GMR obtained reasonably high skill scores and the uncertainty spread was appropriate. Different HMM states were assumed to be different climate conditions, which would lead to different types of observed values. We demonstrated that the HMM-GMR approach can handle multimodal and heteroscedastic data.
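The reliability check described above, examining the uniformity of probability integral transform (PIT) values, can be sketched as follows. This is an illustrative sketch and not the authors' code: it assumes each forecast is a one-dimensional Gaussian mixture (the kind of conditional distribution GMR produces), and it summarises uniformity with the Kolmogorov-Smirnov distance from U(0,1), a choice made here for brevity. All function names and numbers are hypothetical.

```python
from statistics import NormalDist


def mixture_pit(y, weights, means, stds):
    """PIT value: the forecast Gaussian-mixture CDF evaluated at observation y."""
    return sum(w * NormalDist(m, s).cdf(y)
               for w, m, s in zip(weights, means, stds))


def ks_distance_from_uniform(pit_values):
    """Largest gap between the empirical CDF of the PIT values and U(0,1).

    Values near 0 suggest well-calibrated (reliable) probabilistic forecasts.
    """
    xs = sorted(pit_values)
    n = len(xs)
    return max(max((i + 1) / n - x, x - i / n) for i, x in enumerate(xs))


if __name__ == "__main__":
    # Hypothetical two-component forecast and observations (not study data).
    weights, means, stds = [0.5, 0.5], [-1.0, 1.0], [1.0, 1.0]
    observations = [-1.8, -0.6, 0.0, 0.7, 2.1]
    pits = [mixture_pit(y, weights, means, stds) for y in observations]
    print(pits)
    print(ks_distance_from_uniform(pits))
```

If the forecasts are reliable, the PIT values of many forecast-observation pairs should look like draws from a uniform distribution; a U-shaped or hump-shaped histogram would indicate a spread that is too narrow or too wide, respectively.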
Structural reliability assessment of the Oman India Pipeline
DOE Office of Scientific and Technical Information (OSTI.GOV)
Al-Sharif, A.M.; Preston, R.
1996-12-31
Reliability techniques are increasingly finding application in design. The special design conditions for the deep water sections of the Oman India Pipeline dictate their use since the experience basis for application of standard deterministic techniques is inadequate. The paper discusses the reliability analysis as applied to the Oman India Pipeline, including selection of a collapse model, characterization of the variability in the parameters that affect pipe resistance to collapse, and implementation of first and second order reliability analyses to assess the probability of pipe failure. The reliability analysis results are used as the basis for establishing the pipe wall thickness requirements for the pipeline.
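To illustrate what a first order reliability analysis computes, the sketch below treats the simplest textbook case: a linear limit state g = R - S with independent, normally distributed resistance R and load S. This is an assumption made for illustration only; the paper's collapse model, parameters, and values are not reproduced here, and the function names and numbers are hypothetical.

```python
import math
from statistics import NormalDist


def reliability_index(mean_r, std_r, mean_s, std_s):
    """Reliability index beta for g = R - S with independent normal R and S."""
    return (mean_r - mean_s) / math.hypot(std_r, std_s)


def failure_probability(beta):
    """Probability of failure P(g < 0) = Phi(-beta) for the linear normal case."""
    return NormalDist().cdf(-beta)


if __name__ == "__main__":
    # Hypothetical collapse resistance and external pressure, arbitrary units.
    beta = reliability_index(mean_r=15.0, std_r=3.0, mean_s=10.0, std_s=4.0)
    print(beta)                       # (15 - 10) / sqrt(3^2 + 4^2) = 1.0
    print(failure_probability(beta))  # Phi(-1), roughly 0.159
```

For nonlinear limit states or non-normal variables, first and second order methods approximate the failure surface at the most probable failure point; the closed form above is exact only for the linear normal case.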
Advanced reliability modeling of fault-tolerant computer-based systems
NASA Technical Reports Server (NTRS)
Bavuso, S. J.
1982-01-01
Two methodologies for the reliability assessment of fault tolerant digital computer based systems are discussed. The computer-aided reliability estimation 3 (CARE 3) and gate logic software simulation (GLOSS) are assessment technologies that were developed to mitigate a serious weakness in the design and evaluation process of ultrareliable digital systems. The weak link is based on the unavailability of a sufficiently powerful modeling technique for comparing the stochastic attributes of one system against others. Some of the more interesting attributes are reliability, system survival, safety, and mission success.
Implementing a Reliability Centered Maintenance Program at NASA's Kennedy Space Center
NASA Technical Reports Server (NTRS)
Tuttle, Raymond E.; Pete, Robert R.
1998-01-01
Maintenance practices have long focused on time-based "preventive maintenance" techniques. Components were changed out and parts replaced based on how long they had been in place instead of what condition they were in. A reliability centered maintenance (RCM) program seeks to offer equal or greater reliability at decreased cost by ensuring that only applicable, effective maintenance is performed and by replacing time-based maintenance, in large part, with condition-based maintenance. A significant portion of this program involved introducing non-intrusive technologies, such as vibration analysis, oil analysis and I/R cameras, to an existing labor force and management team.
NASA Astrophysics Data System (ADS)
Althuwaynee, Omar F.; Pradhan, Biswajeet; Ahmad, Noordin
2014-06-01
This article uses a methodology based on chi-squared automatic interaction detection (CHAID), a multivariate method with an automatic classification capacity, to analyse large numbers of landslide conditioning factors. This new algorithm was developed to overcome the subjectivity of the manual categorization of scale data of landslide conditioning factors, and to predict a rainfall-induced susceptibility map for Kuala Lumpur city and surrounding areas using a geographic information system (GIS). The main objective of this article is to use the CHAID method to find the best classification fit for each conditioning factor and then to combine it with logistic regression (LR). The LR model was used to find the corresponding coefficients of the best-fitting function that assess the optimal terminal nodes. A cluster pattern of landslide locations was extracted in a previous study using the nearest neighbor index (NNI), which was then used to identify the range of clustered landslide locations. Clustered locations were used as model training data with 14 landslide conditioning factors, such as topographically derived parameters, lithology, NDVI, and land use and land cover maps. The Pearson chi-squared value was used to find the best classification fit between the dependent variable and the conditioning factors. Finally, the relationships between conditioning factors were assessed and the landslide susceptibility map (LSM) was produced. The area under the curve (AUC) was used to test the model's reliability and prediction capability with the training and validation landslide locations, respectively. This study demonstrated the efficiency and reliability of the decision tree (DT) model in landslide susceptibility mapping. It also provided a valuable scientific basis for spatial decision making in planning and urban management studies.
Lord, Sarah Peregrine; Can, Doğan; Yi, Michael; Marin, Rebeca; Dunn, Christopher W.; Imel, Zac E.; Georgiou, Panayiotis; Narayanan, Shrikanth; Steyvers, Mark; Atkins, David C.
2014-01-01
The current paper presents novel methods for collecting MISC data and accurately assessing reliability of behavior codes at the level of the utterance. The MISC 2.1 was used to rate MI interviews from five randomized trials targeting alcohol and drug use. Sessions were coded at the utterance-level. Utterance-based coding reliability was estimated using three methods and compared to traditional reliability estimates of session tallies. Session-level reliability was generally higher compared to reliability using utterance-based codes, suggesting that typical methods for MISC reliability may be biased. These novel methods in MI fidelity data collection and reliability assessment provided rich data for therapist feedback and further analyses. Beyond implications for fidelity coding, utterance-level coding schemes may elucidate important elements in the counselor-client interaction that could inform theories of change and the practice of MI. PMID:25242192
Lord, Sarah Peregrine; Can, Doğan; Yi, Michael; Marin, Rebeca; Dunn, Christopher W; Imel, Zac E; Georgiou, Panayiotis; Narayanan, Shrikanth; Steyvers, Mark; Atkins, David C
2015-02-01
The current paper presents novel methods for collecting MISC data and accurately assessing reliability of behavior codes at the level of the utterance. The MISC 2.1 was used to rate MI interviews from five randomized trials targeting alcohol and drug use. Sessions were coded at the utterance-level. Utterance-based coding reliability was estimated using three methods and compared to traditional reliability estimates of session tallies. Session-level reliability was generally higher compared to reliability using utterance-based codes, suggesting that typical methods for MISC reliability may be biased. These novel methods in MI fidelity data collection and reliability assessment provided rich data for therapist feedback and further analyses. Beyond implications for fidelity coding, utterance-level coding schemes may elucidate important elements in the counselor-client interaction that could inform theories of change and the practice of MI. Copyright © 2015 Elsevier Inc. All rights reserved.
NASA Astrophysics Data System (ADS)
Abdenov, A. Zh; Trushin, V. A.; Abdenova, G. A.
2018-01-01
The paper considers the problem of populating the relevant SIEM nodes on the basis of calculated objective assessments in order to improve the reliability of subjective expert assessments. The proposed methodology is needed for the most accurate security risk assessment of information systems. The technique is also intended to establish real-time operational information protection in enterprise information systems. Risk calculations are based on objective estimates of the probabilities that adverse events will occur and on predictions of the magnitude of damage from information security violations. Calculations of objective assessments are necessary to increase the reliability of the proposed expert assessments.
Development of Internet-Based Tasks for the Executive Function Performance Test.
Rand, Debbie; Lee Ben-Haim, Keren; Malka, Rachel; Portnoy, Sigal
The Executive Function Performance Test (EFPT) is a reliable and valid performance-based tool to assess executive functions (EFs). This study's objective was to develop and verify two Internet-based tasks for the EFPT. A cross-sectional study assessed the alternate-form reliability of the Internet-based bill-paying and telephone-use tasks in healthy adults and people with subacute stroke (Study 1). It also sought to establish the tasks' criterion reliability for assessing EF deficits by correlating performance with that on the Trail Making Test in five groups: healthy young adults, healthy older adults, people with subacute stroke, people with chronic stroke, and young adults with attention deficit hyperactivity disorder (Study 2). The alternate-form reliability and initial construct validity for the Internet-based bill-paying task were verified. Criterion validity was established for both tasks. The Internet-based tasks are comparable to the original EFPT tasks and can be used for assessment of EF deficits. Copyright © 2018 by the American Occupational Therapy Association, Inc.
NASA Astrophysics Data System (ADS)
Hardikar, Kedar Y.; Liu, Bill J. J.; Bheemreddy, Venkata
2016-09-01
Gaining an understanding of degradation mechanisms and their characterization are critical in developing relevant accelerated tests to ensure PV module performance warranty over a typical lifetime of 25 years. As newer technologies are adapted for PV, including new PV cell technologies, new packaging materials, and newer product designs, the availability of field data over extended periods of time for product performance assessment cannot be expected within the typical timeframe for business decisions. In this work, to enable product design decisions and product performance assessment for PV modules utilizing newer technologies, Simulation and Mechanism based Accelerated Reliability Testing (SMART) methodology and empirical approaches to predict field performance from accelerated test results are presented. The method is demonstrated for field life assessment of flexible PV modules based on degradation mechanisms observed in two accelerated tests, namely, Damp Heat and Thermal Cycling. The method is based on design of accelerated testing scheme with the intent to develop relevant acceleration factor models. The acceleration factor model is validated by extensive reliability testing under different conditions going beyond the established certification standards. Once the acceleration factor model is validated for the test matrix a modeling scheme is developed to predict field performance from results of accelerated testing for particular failure modes of interest. Further refinement of the model can continue as more field data becomes available. While the demonstration of the method in this work is for thin film flexible PV modules, the framework and methodology can be adapted to other PV products.
Infant polysomnography: reliability and validity of infant arousal assessment.
Crowell, David H; Kulp, Thomas D; Kapuniai, Linda E; Hunt, Carl E; Brooks, Lee J; Weese-Mayer, Debra E; Silvestri, Jean; Ward, Sally Davidson; Corwin, Michael; Tinsley, Larry; Peucker, Mark
2002-10-01
Infant arousal scoring based on the Atlas Task Force definition of transient EEG arousal was evaluated to determine (1) whether transient arousals can be identified and assessed reliably in infants and (2) whether arousal and no-arousal epochs scored previously by trained raters can be validated reliably by independent sleep experts. Phase I for inter- and intrarater reliability scoring was based on two datasets of sleep epochs selected randomly from nocturnal polysomnograms of healthy full-term, preterm, idiopathic apparent life-threatening event cases, and siblings of Sudden Infant Death Syndrome infants of 35 to 64 weeks postconceptional age. After training, test set 1 reliability was assessed and discrepancies identified. After retraining, test set 2 was scored by the same raters to determine interrater reliability. Later, three raters from the trained group rescored test set 2 to assess inter- and intrarater reliabilities. Interrater and intrarater reliability kappas, with 95% confidence intervals, ranged from substantial to almost perfect levels of agreement. Interrater reliabilities for spontaneous arousals were initially moderate and then substantial. During the validation phase, 315 previously scored epochs were presented to four sleep experts to rate as containing arousal or no-arousal events. Interrater expert agreements were diverse and considered as noninterpretable. Concordance in sleep experts' agreements, based on identification of the previously sampled arousal and no-arousal epochs, was used as a secondary evaluative technique. Results showed agreement by two or more experts on 86% of the Collaborative Home Infant Monitoring Evaluation Study arousal scored events. Conversely, only 1% of the Collaborative Home Infant Monitoring Evaluation Study-scored no-arousal epochs were rated as an arousal.
In summary, this study presents an empirically tested model with procedures and criteria for attaining improved reliability in transient EEG arousal assessments in infants using the modified Atlas Task Force standards. With training based on specific criteria, substantial inter- and intrarater agreement in identifying infant arousals was demonstrated. Corroborative validation results were too disparate for meaningful interpretation. Alternate evaluation based on concordance agreements supports reliance on infant EEG criteria for assessment. Results mandate additional confirmatory validation studies with specific training on infant EEG arousal assessment criteria.
Time-Tagged Risk/Reliability Assessment Program for Development and Operation of Space System
NASA Astrophysics Data System (ADS)
Kubota, Yuki; Takegahara, Haruki; Aoyagi, Junichiro
We have investigated a new method of risk/reliability assessment for the development and operation of space systems. It is difficult to evaluate the risk of spacecraft because of their long operating times, maintenance-free operation, and the difficulty of testing under ground conditions. Conventional methods include FMECA, FTA, and ETA, among others. These are not sufficient for assessing chronological anomalies, and sharing information during R&D is problematic. A new method of risk and reliability assessment, T-TRAP (Time-tagged Risk/Reliability Assessment Program), is proposed as a management tool for the development and operation of space systems. T-TRAP, consisting of time-resolved Fault Tree and Criticality Analyses, helps the responsible personnel to quickly identify the cause of failure and decide on corrective actions when an anomaly occurs in the system. This paper describes the T-TRAP method and its availability.
Reliability Issues and Solutions in Flexible Electronics Under Mechanical Fatigue
NASA Astrophysics Data System (ADS)
Yi, Seol-Min; Choi, In-Suk; Kim, Byoung-Joon; Joo, Young-Chang
2018-07-01
Flexible devices are of significant interest due to their potential expansion of the application of smart devices into various fields, such as energy harvesting, biological applications and consumer electronics. Due to the mechanically dynamic operations of flexible electronics, their mechanical reliability must be thoroughly investigated to understand their failure mechanisms and lifetimes. The reliability issue caused by bending fatigue, one of the typical operational limitations of flexible electronics, has been studied using various test methodologies; however, electromechanical evaluations, which are essential to assess the reliability of electronic devices for flexible applications, had not been investigated because the testing method was not established. By employing the in situ bending fatigue test, we have studied the failure mechanism for various conditions and parameters, such as bending strain, fatigue area, film thickness, and lateral dimensions. Moreover, various methods for improving the bending reliability have been developed based on the failure mechanism. Nanostructures such as holes, pores, wires and composites of nanoparticles and nanotubes have been suggested for better reliability. Flexible devices were also investigated to find the potential failures initiated by complex structures under bending fatigue strain. In this review, the recent advances in test methodology, mechanism studies, and practical applications are introduced. Additionally, perspectives including the future advance to stretchable electronics are discussed based on the current achievements in research.
Reliability Issues and Solutions in Flexible Electronics Under Mechanical Fatigue
NASA Astrophysics Data System (ADS)
Yi, Seol-Min; Choi, In-Suk; Kim, Byoung-Joon; Joo, Young-Chang
2018-03-01
Flexible devices are of significant interest due to their potential expansion of the application of smart devices into various fields, such as energy harvesting, biological applications and consumer electronics. Due to the mechanically dynamic operations of flexible electronics, their mechanical reliability must be thoroughly investigated to understand their failure mechanisms and lifetimes. The reliability issue caused by bending fatigue, one of the typical operational limitations of flexible electronics, has been studied using various test methodologies; however, electromechanical evaluations, which are essential to assess the reliability of electronic devices for flexible applications, had not been investigated because the testing method was not established. By employing the in situ bending fatigue test, we have studied the failure mechanism for various conditions and parameters, such as bending strain, fatigue area, film thickness, and lateral dimensions. Moreover, various methods for improving the bending reliability have been developed based on the failure mechanism. Nanostructures such as holes, pores, wires and composites of nanoparticles and nanotubes have been suggested for better reliability. Flexible devices were also investigated to find the potential failures initiated by complex structures under bending fatigue strain. In this review, the recent advances in test methodology, mechanism studies, and practical applications are introduced. Additionally, perspectives including the future advance to stretchable electronics are discussed based on the current achievements in research.
Prognostics-based qualification of high-power white LEDs using Lévy process approach
NASA Astrophysics Data System (ADS)
Yung, Kam-Chuen; Sun, Bo; Jiang, Xiaopeng
2017-01-01
Due to their versatility in a variety of applications and the growing market demand, high-power white light-emitting diodes (LEDs) have attracted considerable attention. Reliability qualification testing is an essential part of the product development process to ensure the reliability of a new LED product before its release. However, the widely used IES-TM-21 method does not provide comprehensive reliability information. For more accurate and effective qualification, this paper presents a novel method based on prognostics techniques. Prognostics is an engineering technology predicting the future reliability or determining the remaining useful lifetime (RUL) of a product by assessing the extent of deviation or degradation from its expected normal operating conditions. A Lévy subordinator of a mixed Gamma and compound Poisson process is used to describe the actual degradation process of LEDs characterized by random sporadic small jumps of degradation degree, and the reliability function is derived for qualification with different distribution forms of jump sizes. The IES LM-80 test results reported by different LED vendors are used to develop and validate the qualification methodology. This study will be helpful for LED manufacturers to reduce the total test time and cost required to qualify the reliability of an LED product.
Nicholson, Patricia; Griffin, Patrick; Gillis, Shelley; Wu, Margaret; Dunning, Trisha
2013-09-01
Concern about the process of identifying underlying competencies that contribute to effective nursing performance has been debated with a lack of consensus surrounding an approved measurement instrument for assessing clinical performance. Although a number of methodologies are noted in the development of competency-based assessment measures, these studies are not without criticism. The primary aim of the study was to develop and validate a Performance Based Scoring Rubric, which included both analytical and holistic scales. The aim included examining the validity and reliability of the rubric, which was designed to measure clinical competencies in the operating theatre. The fieldwork observations of 32 nurse educators and preceptors assessing the performance of 95 instrument nurses in the operating theatre were used in the calibration of the rubric. The Rasch model, a particular model among Item Response Models, was used in the calibration of each item in the rubric in an attempt to improve the measurement properties of the scale. This is done by establishing the 'fit' of the data to the conditions demanded by the Rasch model. Acceptable reliability estimates, specifically a high Cronbach's alpha reliability coefficient (0.940), as well as empirical support for construct and criterion validity for the rubric were achieved. Calibration of the Performance Based Scoring Rubric using the Rasch model revealed that the fit statistics for most items were acceptable. The use of the Rasch model offers a number of features in developing and refining healthcare competency-based assessments, improving confidence in measuring clinical performance. The Rasch model was shown to be useful in developing and validating a competency-based assessment for measuring the competence of the instrument nurse in the operating theatre, with implications for use in other areas of nursing practice. Crown Copyright © 2012. Published by Elsevier Ltd. All rights reserved.
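The Cronbach's alpha coefficient cited above measures the internal consistency of a multi-item scale. A minimal sketch of the standard formula follows, assuming a complete score matrix with one row per assessed person and one column per rubric item; the function name and the toy scores are hypothetical and are not the study's data.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of rows (persons) by columns (items).

    alpha = k / (k - 1) * (1 - sum(item variances) / variance(total score)),
    using sample variances throughout.
    """
    if len(scores) < 2 or len(scores[0]) < 2:
        raise ValueError("need at least two persons and two items")
    k = len(scores[0])
    item_variances = [variance(column) for column in zip(*scores)]
    total_variance = variance([sum(row) for row in scores])
    if total_variance == 0:
        raise ValueError("alpha is undefined when total scores do not vary")
    return k / (k - 1) * (1 - sum(item_variances) / total_variance)


if __name__ == "__main__":
    # Hypothetical ratings of four nurses on three rubric items.
    ratings = [
        [3, 4, 3],
        [2, 2, 3],
        [4, 5, 5],
        [1, 2, 1],
    ]
    print(cronbach_alpha(ratings))
```

Alpha approaches 1 when items rise and fall together across persons; it says nothing on its own about whether the items fit the Rasch model, which the study assessed separately through fit statistics.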
NASA Astrophysics Data System (ADS)
Buczyński, P.
2018-05-01
This article presents a new approach to reliability assessment of the road structure in which the base layer will be constructed in the process of cold deep recycling with foamed bitumen. In order to properly assess the reliability of the structure with the recycled base, it is necessary to determine the distribution of stress and strain in typical pavement layer systems. The true stress and strain values were established for particular structural layers using the complex modulus (E*) determined based on the master curves. The complex modulus was determined by the direct tension-compression test on cylindrical specimens (DTC-CY) at five temperatures (-7°C, 5°C, 13°C, 25°C, 40°C) and six loading times (0.1 Hz, 0.3 Hz, 1 Hz, 3 Hz, 10 Hz, 20 Hz) in accordance with EN 12697-26 in the linear viscoelasticity (LVE) range for small strains ranging from 25 to 50 με. The master curves of the complex modulus were constructed using the Richards model for the mixtures typically incorporated in structural layers, i.e., SMA11, AC16W, AC22P and MCAS. The values of the modulus characterizing particular layers were determined with temperature distribution in the structure taken into account, when the surface temperature was 40°C. The stress distribution was established for those calculation models. The stress values were used to evaluate the fatigue life under controlled stress conditions (IT-FT). This evaluation, with the controlled stress corresponding to that in the structure, facilitated the quality assessment of the rehabilitated recycled base course. Results showed that the recycled base mixtures having the indirect tensile strength (ITSDRY) similar to the stress in the structure under analysis needed an additional fatigue life evaluation in the indirect tensile test ITT. This approach to the recycled base quality assessment will allow eliminating the damage induced by overloading.
Absolute and Relative Reliability of Percentage of Syllables Stuttered and Severity Rating Scales
ERIC Educational Resources Information Center
Karimi, Hamid; O'Brian, Sue; Onslow, Mark; Jones, Mark
2014-01-01
Purpose: Percentage of syllables stuttered (%SS) and severity rating (SR) scales are measures in common use to quantify stuttering severity and its changes during basic and clinical research conditions. However, their reliability has not been assessed with indices measuring both relative and absolute reliability. This study was designed to provide…
Michaelsen, Stella M; Rocha, André S; Knabben, Rodrigo J; Rodrigues, Luciano P; Fernandes, Claudia G C
2011-01-01
Recently, the reliability of the Brazilian version of the Fugl-Meyer Assessment (FMA) was assessed through the scoring given according to observations made by a single evaluator who applied the test. When different raters apply the scale, the reliability may depend on the interpretation given to the assessment sheet. In such cases, a clear administration manual is essential for ensuring homogeneity of application. To translate and adapt the French Canadian version of the FMA administration manual into Brazilian Portuguese and to evaluate the inter-rater reliability when different evaluators apply the FMA on the basis of the information contained in the manual. Eighteen adults (59±10 years) with chronic hemiparesis (38±35 months after a stroke) took part in this study. Eight patients participated in the first part of the study and 10 in the second part. Based on analyzing the results from part 1, an adapted version was developed, in which information and photos were added to illustrate the positions of the patient and evaluator. The inter-rater reliability was assessed using the intraclass correlation coefficient (ICC). The reliability of the FMA based on the adapted version of the manual was excellent for the total motor scores for the upper limbs (ICC=0.98) and lower limbs (ICC=0.90), as well as for movement sense (ICC=0.98) and upper and lower-limb passive range of motion (ICC=0.84 and 0.90, respectively). The reliability was moderate for tactile sensitivity (0.75). The joint pain assessment presented low reliability. The results showed that, except for pain assessment, application of the FMA based on the adapted version of the application manual for Brazilian Portuguese presented adequate inter-rater reliability.
CPM Test-Retest Reliability: "Standard" vs "Single Test-Stimulus" Protocols.
Granovsky, Yelena; Miller-Barmak, Adi; Goldstein, Oren; Sprecher, Elliot; Yarnitsky, David
2016-03-01
Assessment of pain inhibitory mechanisms using conditioned pain modulation (CPM) is relevant clinically in prediction of pain and analgesic efficacy. Our objective is to provide necessary estimates of intersession CPM reliability, to enable transformation of the CPM paradigm into a clinical tool. Two cohorts of young healthy subjects (N = 65) participated in two dual-session studies. In Study I, a Bath-Thermode CPM protocol was used, with hot water immersion and contact heat as conditioning- and test-stimuli, respectively, in a classical parallel CPM design introducing test-stimulus first, and then the conditioning- and repeated test-stimuli in parallel. Study II consisted of two CPM protocols: 1) Two-Thermodes, one for each of the stimuli, in the same parallel design as above, and 2) single test-stimulus (STS) protocol with a single administration of a contact heat test-stimulus, partially overlapped in time by a remote shorter contact heat as conditioning stimulus. Test-retest reliability was assessed within 3-7 days. The STS-CPM had superior reliability intraclass correlation (ICC(2,1) = 0.59) over Bath-Thermode (ICC(2,1) = 0.34) or Two-Thermodes (ICC(2,1) = 0.21) protocols. The hand immersion conditioning pain had higher reliability than thermode pain (ICC(2,1) = 0.76 vs ICC(2,1) = 0.16). Conditioned test-stimulus pain scores were of good (ICC(2,1) = 0.62) or fair (ICC(2,1) = 0.43) reliability for the Bath-Thermode and the STS, respectively, but not for the Two-Thermodes protocol (ICC(2,1) = 0.20). The newly developed STS-CPM paradigm was more reliable than other CPM protocols tested here, and should be further investigated for its clinical relevance. It appears that large contact size of the conditioning-stimulus and use of single rather than dual test-stimulus pain contribute to augmentation of CPM reliability. © 2015 American Academy of Pain Medicine. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
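The two-way random-effects, absolute-agreement, single-measurement intraclass correlation, usually written ICC(2,1), can be computed from the mean squares of a subjects-by-sessions table. The sketch below follows the standard formula and is not the authors' analysis code; the function name and the toy scores are hypothetical.

```python
def icc_2_1(scores):
    """ICC(2,1) for a list of rows (subjects) by columns (sessions or raters).

    ICC(2,1) = (MSR - MSE) / (MSR + (k - 1) * MSE + k * (MSC - MSE) / n),
    with MSR, MSC, MSE the between-subject, between-session and residual
    mean squares of a two-way layout without replication.
    """
    n = len(scores)
    k = len(scores[0]) if n else 0
    if n < 2 or k < 2 or any(len(row) != k for row in scores):
        raise ValueError("need a complete table with >= 2 rows and columns")
    grand = sum(sum(row) for row in scores) / (n * k)
    row_means = [sum(row) / k for row in scores]
    col_means = [sum(col) / n for col in zip(*scores)]
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for row in scores for x in row)
    ss_error = ss_total - ss_rows - ss_cols
    ms_rows = ss_rows / (n - 1)
    ms_cols = ss_cols / (k - 1)
    ms_error = ss_error / ((n - 1) * (k - 1))
    return (ms_rows - ms_error) / (
        ms_rows + (k - 1) * ms_error + k * (ms_cols - ms_error) / n
    )


if __name__ == "__main__":
    # Hypothetical CPM scores: session 2 is session 1 shifted up by one unit.
    print(icc_2_1([[1, 2], [2, 3], [3, 4]]))  # 2/3: the shift lowers agreement
```

Because ICC(2,1) measures absolute agreement, a systematic shift between sessions lowers the coefficient even when subjects keep exactly the same rank order, as the toy example shows.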
Reliability of Radioisotope Stirling Convertor Linear Alternator
NASA Technical Reports Server (NTRS)
Shah, Ashwin; Korovaichuk, Igor; Geng, Steven M.; Schreiber, Jeffrey G.
2006-01-01
Onboard radioisotope power systems being developed and planned for NASA's deep-space missions would require reliable design lifetimes of up to 14 years. Critical components and materials of Stirling convertors have been undergoing extensive testing and evaluation in support of a reliable performance for the specified life span. Of significant importance to the successful development of the Stirling convertor is the design of a lightweight and highly efficient linear alternator. Alternator performance could vary due to small deviations in the permanent magnet properties, operating temperature, and component geometries. Durability prediction and reliability of the alternator may be affected by these deviations from nominal design conditions. Therefore, it is important to evaluate the effect of these uncertainties in predicting the reliability of the linear alternator performance. This paper presents a study in which a reliability-based methodology is used to assess alternator performance. The response surface characterizing the induced open-circuit voltage performance is constructed using 3-D finite element magnetic analysis. A fast probability integration method is used to determine the probability of the desired performance and its sensitivity to the alternator design parameters.
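As a rough illustration of the idea of propagating input uncertainty through a response surface, the sketch below handles only the simplest case: a linear surface with independent, normally distributed inputs, where the probability of meeting a required output has a closed form. This is a simplification swapped in for illustration, not the paper's fast probability integration method or its finite-element-based surface. All coefficients, input names, and values are hypothetical.

```python
from math import sqrt
from statistics import NormalDist


def linear_surface_reliability(intercept, coeffs, means, stds, required):
    """Probability that g(x) = intercept + sum(c_i * x_i) >= required,
    assuming independent normal inputs x_i ~ N(mean_i, std_i**2)."""
    mean_g = intercept + sum(c * m for c, m in zip(coeffs, means))
    std_g = sqrt(sum((c * s) ** 2 for c, s in zip(coeffs, stds)))
    beta = (mean_g - required) / std_g            # reliability index
    probability = NormalDist().cdf(beta)          # P(g >= required)
    # Sensitivity of beta to each input mean: d(beta)/d(mean_i) = c_i / std_g
    sensitivities = [c / std_g for c in coeffs]
    return beta, probability, sensitivities


# Hypothetical surface: voltage as a function of magnet remanence and temperature
beta, prob, sens = linear_surface_reliability(
    intercept=0.0,
    coeffs=[3.0, 4.0],
    means=[1.0, 1.0],
    stds=[1.0, 1.0],
    required=2.0,
)
print(beta, prob, sens)
```

For a nonlinear surface the closed form no longer holds, which is where approximate probability integration or sampling methods come in.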
Reliability and discriminatory power of methods for dental plaque quantification
RAGGIO, Daniela Prócida; BRAGA, Mariana Minatel; RODRIGUES, Jonas Almeida; FREITAS, Patrícia Moreira; IMPARATO, José Carlos Pettorossi; MENDES, Fausto Medeiros
2010-01-01
Objective This in situ study evaluated the discriminatory power and reliability of methods of dental plaque quantification and the relationship between visual indices (VI) and fluorescence camera (FC) to detect plaque. Material and Methods Six volunteers used palatal appliances with six bovine enamel blocks presenting different stages of plaque accumulation. The presence of plaque with and without disclosing was assessed using VI. Images were obtained with FC and digital camera in both conditions. The area covered by plaque was assessed. Examinations were done by two independent examiners. Data were analyzed by Kruskal-Wallis and Kappa tests to compare different conditions of samples and to assess the inter-examiner reproducibility. Results Some methods presented adequate reproducibility. The Turesky index and the assessment of area covered by disclosed plaque in the FC images presented the highest discriminatory powers. Conclusions The Turesky index and images with FC with disclosing present good reliability and discriminatory power in quantifying dental plaque. PMID:20485931
Composite Reliability of a Workplace-Based Assessment Toolbox for Postgraduate Medical Education
ERIC Educational Resources Information Center
Moonen-van Loon, J. M. W.; Overeem, K.; Donkers, H. H. L. M.; van der Vleuten, C. P. M.; Driessen, E. W.
2013-01-01
In recent years, postgraduate assessment programmes around the world have embraced workplace-based assessment (WBA) and its related tools. Despite their widespread use, results of studies on the validity and reliability of these tools have been variable. Although in many countries decisions about residents' continuation of training and…
Assessing segment- and corridor-based travel-time reliability on urban freeways : final report.
DOT National Transportation Integrated Search
2016-09-01
Travel time and its reliability are intuitive performance measures for freeway traffic operations. The objective of this project was to quantify segment-based and corridor-based travel time reliability measures on urban freeways. To achieve this obje...
Painter, J; Trevithick, L; Hastings, R P; Ingham, B; Roy, A
2016-12-01
In meeting the needs of individuals with intellectual disabilities (ID) who access health services, a brief, holistic assessment of need is useful. This study outlines the development and testing of the Learning Disabilities Needs Assessment Tool (LDNAT), a tool intended for this purpose. An existing mental health (MH) tool was extended by a multidisciplinary group of ID practitioners. Additional scales were drafted to capture needs across six ID treatment domains that the group identified. LDNAT ratings were analysed for the following: item redundancy, relevance, construct validity and internal consistency (n = 1692); test-retest reliability (n = 27); and concurrent validity (n = 160). All LDNAT scales were deemed clinically relevant with little redundancy apparent. Principal component analysis indicated three components (developmental needs, challenging behaviour, MH and well-being). Internal consistency was good (Cronbach alpha 0.80). Individual item test-retest reliability was substantial to near perfect for 20 scales and slight to fair for three scales. Overall reliability was near perfect (intra-class correlation = 0.91). There were significant associations with five of six condition-specific measures, i.e. the Waisman Activities of Daily Living Scale (general ability/disability), Threshold Assessment Grid (risk), Behaviour Problems Inventory for Individuals with Intellectual Disabilities-Short Form (challenging behaviour), Social Communication Questionnaire (autism) and a bespoke physical health questionnaire. Additionally, the statistically significant correlations between these tools and the LDNAT components made sense clinically. There were no statistically significant correlations with the Psychiatric Assessment Schedules for Adults with Developmental Disabilities (a measure of MH symptoms in people with ID). The LDNAT had clinical utility when rating the needs of people with ID prior to condition-specific assessment(s).
Analyses of internal and external validity were promising. Further evaluation of its sensitivity to changes in needs is now required. © 2016 MENCAP and International Association of the Scientific Study of Intellectual and Developmental Disabilities and John Wiley & Sons Ltd.
Arifin, Nooranida; Abu Osman, Noor Azuan; Wan Abas, Wan Abu Bakar
2014-04-01
The measurements of postural balance often involve measurement error, which affects the analysis and interpretation of the outcomes. In most of the existing clinical rehabilitation research, the ability to produce reliable measures is a prerequisite for an accurate assessment of an intervention after a period of time. Although clinical balance assessment has been performed in previous studies, none has determined the intrarater test-retest reliability of static and dynamic stability indexes during dominant single stance. In this study, one rater examined 20 healthy university students (female=12, male=8) in two sessions separated by a 7-day interval. Three stability indexes--the overall stability index (OSI), anterior/posterior stability index (APSI), and medial/lateral stability index (MLSI) in static and dynamic conditions--were measured during single dominant stance. Intraclass correlation coefficient (ICC), standard error of measurement (SEM) and 95% confidence interval (95% CI) were calculated. Test-retest ICCs for OSI, APSI, and MLSI were 0.85, 0.78, and 0.84 during static condition and were 0.77, 0.77, and 0.65 during dynamic condition, respectively. We concluded that the postural stability assessment using Biodex stability system demonstrates good-to-excellent test-retest reliability over a 1 week time interval.
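The standard error of measurement mentioned above is commonly derived from the ICC and the between-subject standard deviation as SEM = SD x sqrt(1 - ICC). The sketch below assumes that formula, since the abstract does not state which one the authors applied. The stability-index scores are hypothetical.

```python
from math import sqrt
from statistics import stdev


def standard_error_of_measurement(scores, icc):
    """SEM = SD * sqrt(1 - ICC), using the sample SD of the observed scores."""
    return stdev(scores) * sqrt(1.0 - icc)


def score_interval(observed, sem, z=1.96):
    """Approximate 95% interval around one observed score (z = 1.96)."""
    return observed - z * sem, observed + z * sem


# Hypothetical overall stability index scores and an assumed ICC of 0.84
scores = [1.0, 2.0, 3.0, 4.0, 5.0]
sem = standard_error_of_measurement(scores, icc=0.84)
print(sem, score_interval(2.0, sem))
```

A higher ICC shrinks the SEM, so the same observed score carries a narrower uncertainty band.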
Management of the aging of critical safety-related concrete structures in light-water reactor plants
DOE Office of Scientific and Technical Information (OSTI.GOV)
Naus, D.J.; Oland, C.B.; Arndt, E.G.
1990-01-01
The Structural Aging Program has the overall objective of providing the USNRC with an improved basis for evaluating nuclear power plant safety-related structures for continued service. The program consists of a management task and three technical tasks: materials property data base, structural component assessment/repair technology, and quantitative methodology for continued-service determinations. Objectives, accomplishments, and planned activities under each of these tasks are presented. Major program accomplishments include development of a materials property data base for structural materials as well as an aging assessment methodology for concrete structures in nuclear power plants. Furthermore, a review and assessment of inservice inspection techniques for concrete materials and structures has been completed, and work on development of a methodology which can be used for performing current as well as reliability-based future condition assessment of concrete structures is well under way. 43 refs., 3 tabs.
Murphy, Douglas J; Bruce, David A; Mercer, Stewart W; Eva, Kevin W
2009-05-01
To investigate the reliability and feasibility of six potential workplace-based assessment methods in general practice training: criterion audit, multi-source feedback from clinical and non-clinical colleagues, patient feedback (the CARE Measure), referral letters, significant event analysis, and video analysis of consultations. Performance of GP registrars (trainees) was evaluated with each tool to assess the reliabilities of the tools and feasibility, given raters and number of assessments needed. Participant experience of process determined by questionnaire. 171 GP registrars and their trainers, drawn from nine deaneries (representing all four countries in the UK), participated. The ability of each tool to differentiate between doctors (reliability) was assessed using generalisability theory. Decision studies were then conducted to determine the number of observations required to achieve an acceptably high reliability for "high-stakes assessment" using each instrument. Finally, descriptive statistics were used to summarise participants' ratings of their experience using these tools. Multi-source feedback from colleagues and patient feedback on consultations emerged as the two methods most likely to offer a reliable and feasible opinion of workplace performance. Reliability co-efficients of 0.8 were attainable with 41 CARE Measure patient questionnaires and six clinical and/or five non-clinical colleagues per doctor when assessed on two occasions. For the other four methods tested, 10 or more assessors were required per doctor in order to achieve a reliable assessment, making the feasibility of their use in high-stakes assessment extremely low. Participant feedback did not raise any major concerns regarding the acceptability, feasibility, or educational impact of the tools. 
The combination of patient and colleague views of doctors' performance, coupled with reliable competence measures, may offer a suitable evidence-base on which to monitor progress and completion of doctors' training in general practice.
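The decision studies described above ask how many observations per doctor are needed to reach a target reliability. The sketch below covers only the simplest single-facet case, in which the generalisability coefficient is G(n) = var_person / (var_person + var_error / n). The study's actual designs involve more facets (raters, occasions), and the variance components used here are hypothetical.

```python
def g_coefficient(var_person, var_error, n_obs):
    """Generalisability coefficient for the mean of n_obs observations."""
    return var_person / (var_person + var_error / n_obs)


def observations_needed(var_person, var_error, target=0.8, max_n=10000):
    """Smallest n whose G coefficient reaches the target, or None if
    the target is not reached within max_n observations."""
    for n in range(1, max_n + 1):
        if g_coefficient(var_person, var_error, n) >= target:
            return n
    return None


# Hypothetical variance components from a generalisability study
print(observations_needed(var_person=0.5, var_error=4.1, target=0.8))
```

When the error variance is large relative to the true between-doctor variance, the required number of observations grows quickly, which is the feasibility problem the abstract describes.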
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bucknor, Matthew; Grabaskas, David; Brunett, Acacia
2015-04-26
Advanced small modular reactor designs include many advantageous design features such as passively driven safety systems that are arguably more reliable and cost effective relative to conventional active systems. Despite their attractiveness, a reliability assessment of passive systems can be difficult using conventional reliability methods due to the nature of passive systems. Simple deviations in boundary conditions can induce functional failures in a passive system, and intermediate or unexpected operating modes can also occur. As part of an ongoing project, Argonne National Laboratory is investigating various methodologies to address passive system reliability. The Reliability Method for Passive Systems (RMPS), a systematic approach for examining reliability, is one technique chosen for this analysis. This methodology is combined with the Risk-Informed Safety Margin Characterization (RISMC) approach to assess the reliability of a passive system and the impact of its associated uncertainties. For this demonstration problem, an integrated plant model of an advanced small modular pool-type sodium fast reactor with a passive reactor cavity cooling system is subjected to a station blackout using RELAP5-3D. This paper discusses important aspects of the reliability assessment, including deployment of the methodology, the uncertainty identification and quantification process, and identification of key risk metrics.
Mash, Bob; Derese, Anselme
2013-01-01
Abstract Background Competency-based education and the validity and reliability of workplace-based assessment of postgraduate trainees have received increasing attention worldwide. Family medicine was recognised as a speciality in South Africa six years ago and a satisfactory portfolio of learning is a prerequisite to sit the national exit exam. A massive scaling up of the number of family physicians is needed in order to meet the health needs of the country. Aim The aim of this study was to develop a reliable, robust and feasible portfolio assessment tool (PAT) for South Africa. Methods Six raters each rated nine portfolios from the Stellenbosch University programme, using the PAT, to test for inter-rater reliability. This rating was repeated three months later to determine test–retest reliability. Following initial analysis and feedback the PAT was modified and the inter-rater reliability again assessed on nine new portfolios. An acceptable intra-class correlation was considered to be > 0.80. Results The total score was found to be reliable, with a coefficient of 0.92. For test–retest reliability, the difference in mean total score was 1.7%, which was not statistically significant. Amongst the subsections, only assessment of the educational meetings and the logbook showed reliability coefficients > 0.80. Conclusion This was the first attempt to develop a reliable, robust and feasible national portfolio assessment tool to assess postgraduate family medicine training in the South African context. The tool was reliable for the total score, but the low reliability of several sections in the PAT helped us to develop 12 recommendations regarding the use of the portfolio, the design of the PAT and the training of raters.
Reliability Assessment of a Robust Design Under Uncertainty for a 3-D Flexible Wing
NASA Technical Reports Server (NTRS)
Gumbert, Clyde R.; Hou, Gene J. -W.; Newman, Perry A.
2003-01-01
The paper presents reliability assessment results for the robust designs under uncertainty of a 3-D flexible wing previously reported by the authors. Reliability assessments (additional optimization problems) of the active constraints at the various probabilistic robust design points are obtained and compared with the constraint values or target constraint probabilities specified in the robust design. In addition, reliability-based sensitivity derivatives with respect to design variable mean values are also obtained and shown to agree with finite difference values. These derivatives allow one to perform reliability based design without having to obtain second-order sensitivity derivatives. However, an inner-loop optimization problem must be solved for each active constraint to find the most probable point on that constraint failure surface.
Vanwolleghem, Griet; Van Dyck, Delfien; Ducheyne, Fabian; De Bourdeaudhuij, Ilse; Cardon, Greet
2014-06-10
Google Street View provides a valuable and efficient alternative to observe the physical environment compared to on-site fieldwork. However, studies on the use, reliability and validity of Google Street View in a cycling-to-school context are lacking. We aimed to study the intra- and inter-rater reliability and criterion validity of EGA-Cycling (Environmental Google Street View Based Audit - Cycling to school), a newly developed audit using Google Street View to assess the physical environment along cycling routes to school. Parents (n = 52) of 11-to-12-year old Flemish children, who mostly cycled to school, completed a questionnaire and identified their child's cycling route to school on a street map. Fifty cycling routes of 11-to-12-year olds were identified and physical environmental characteristics along the identified routes were rated with EGA-Cycling (5 subscales; 37 items), based on Google Street View. To assess reliability, two researchers performed the audit. Criterion validity of the audit was examined by comparing the ratings based on Google Street View with ratings through on-site assessments. Intra-rater reliability was high (kappa range 0.47-1.00). Large variations in the inter-rater reliability (kappa range -0.03-1.00) and criterion validity scores (kappa range -0.06-1.00) were reported, with acceptable inter-rater reliability values for 43% of all items and acceptable criterion validity for 54% of all items. EGA-Cycling can be used to assess physical environmental characteristics along cycling routes to school. However, to assess the micro-environment specifically related to cycling, on-site assessments have to be added.
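The kappa values above compare observed agreement between two ratings with the agreement expected by chance, which is why a value such as -0.03 is possible: it indicates agreement slightly below chance. The sketch below computes unweighted Cohen's kappa for two raters. The abstract does not say whether weighted or unweighted kappa was used, and the item ratings are hypothetical.

```python
from collections import Counter


def cohens_kappa(rater_a, rater_b):
    """Unweighted Cohen's kappa for two equal-length lists of category labels."""
    n = len(rater_a)
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    freq_a = Counter(rater_a)
    freq_b = Counter(rater_b)
    # Chance agreement from the product of each rater's marginal proportions
    expected = sum(freq_a[c] * freq_b[c] for c in freq_a) / (n * n)
    if expected == 1.0:
        raise ValueError("kappa is undefined when both raters use one category")
    return (observed - expected) / (1.0 - expected)


# Hypothetical presence (1) / absence (0) ratings of one audit item on four routes
street_view = [1, 1, 0, 0]
on_site = [1, 0, 0, 0]
print(cohens_kappa(street_view, on_site))
```

The same function serves for inter-rater reliability (two auditors) and for criterion validity (Street View rating against on-site rating), since both are agreement between two sets of labels.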
The Validation of a Case-Based, Cumulative Assessment and Progressions Examination
Coker, Adeola O.; Copeland, Jeffrey T.; Gottlieb, Helmut B.; Horlen, Cheryl; Smith, Helen E.; Urteaga, Elizabeth M.; Ramsinghani, Sushma; Zertuche, Alejandra; Maize, David
2016-01-01
Objective. To assess content and criterion validity, as well as reliability of an internally developed, case-based, cumulative, high-stakes third-year Annual Student Assessment and Progression Examination (P3 ASAP Exam). Methods. Content validity was assessed through the writing-reviewing process. Criterion validity was assessed by comparing student scores on the P3 ASAP Exam with the nationally validated Pharmacy Curriculum Outcomes Assessment (PCOA). Reliability was assessed with psychometric analysis comparing student performance over four years. Results. The P3 ASAP Exam showed content validity through representation of didactic courses and professional outcomes. Similar scores on the P3 ASAP Exam and PCOA with Pearson correlation coefficient established criterion validity. Consistent student performance using Kuder-Richardson coefficient (KR-20) since 2012 reflected reliability of the examination. Conclusion. Pharmacy schools can implement internally developed, high-stakes, cumulative progression examinations that are valid and reliable using a robust writing-reviewing process and psychometric analyses. PMID:26941435
Fernandez-Mayoralas, Gloria; Giraldez-Garcia, Carolina; Forjaz, Maria João; Rojo-Perez, Fermina; Martinez-Martin, Pablo; Prieto-Flores, Maria-Eugenia
2012-03-01
The survey "Quality of life in older adults-Spain" (CadeViMa-Spain) was designed to obtain information about objective and subjective determinants of Quality of Life (QoL) in old age, from a multidimensional perspective. This paper presents the overall description, methodology, sample characteristics and reliability of the measures used. A cross-sectional survey was carried out in a representative sample of 1106 community-dwelling adults aged 60 years and over in Spain. The sample was obtained by a geodemographically-based proportional multistage stratified sampling. A home-based questionnaire included validated scales and questions about sociodemographic characteristics, global QoL, health, family and social networks, financial means and retirement, leisure and social participation, residential environment, and satisfaction with those issues. Face-to-face semi-structured interviews were conducted. Cronbach's α coefficients were used to assess internal consistency of the scales. This nationally representative survey furnishes information about global QoL, health-related QoL, resources availability, living conditions, and satisfaction with the assessed aspects, including life domains most valued by this group. In general, community-dwelling older adults reported positive assessments of health, living conditions, and high levels of satisfaction with the different aspects of QoL. The reliability of the measures in this population was good. This survey provides comprehensive and useful information, based on the view of older people themselves, with potential to contribute to health and social policies towards promoting active aging. The database is available for in-depth comparisons.
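Cronbach's alpha, used above to assess internal consistency, compares the sum of the item variances with the variance of the total score. A minimal sketch follows; the questionnaire responses are hypothetical and the function name is not taken from the survey.

```python
from statistics import variance


def cronbach_alpha(responses):
    """Cronbach's alpha.

    responses: list of respondents, each a list of k item scores.
    """
    k = len(responses[0])
    item_vars = [variance([row[j] for row in responses]) for j in range(k)]
    total_var = variance([sum(row) for row in responses])
    return (k / (k - 1)) * (1.0 - sum(item_vars) / total_var)


# Hypothetical two-item satisfaction scale answered by three respondents
answers = [[1, 1], [2, 3], [3, 2]]
print(cronbach_alpha(answers))
```

Alpha rises when items move together across respondents, because the total-score variance then exceeds the sum of the individual item variances.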
Software reliability through fault-avoidance and fault-tolerance
NASA Technical Reports Server (NTRS)
Vouk, Mladen A.; Mcallister, David F.
1993-01-01
Strategies and tools for the testing, risk assessment and risk control of dependable software-based systems were developed. Part of this project consists of studies to enable the transfer of technology to industry, for example the risk management techniques for safety-conscious systems. Theoretical investigations of Boolean and Relational Operator (BRO) testing strategy were conducted for condition-based testing. The Basic Graph Generation and Analysis tool (BGG) was extended to fully incorporate several variants of the BRO metric. Single- and multi-phase risk, coverage and time-based models are being developed to provide additional theoretical and empirical basis for estimation of the reliability and availability of large, highly dependable software. A model for software process and risk management was developed. The use of cause-effect graphing for software specification and validation was investigated. Lastly, advanced software fault-tolerance models were studied to provide alternatives and improvements in situations where simple software fault-tolerance strategies break down.
Evidence-based assessment in pediatric psychology: family measures.
Alderfer, Melissa A; Fiese, Barbara H; Gold, Jeffrey I; Cutuli, J J; Holmbeck, Grayson N; Goldbeck, Lutz; Chambers, Christine T; Abad, Mona; Spetter, Dante; Patterson, Joän
2008-10-01
To provide a review of the evidence base of family measures relevant to pediatric psychology. Twenty-nine family measures were selected based upon endorsement by Division 54 listserv members, expert judgment, and literature review. Spanning observational and self-report methods, the measures fell into three broad assessment categories: Family functioning, Dyadic family relationships, and Family functioning in the context of childhood chronic health conditions. Measures were categorized as: "Well-established", "Approaching well-established", or "Promising." Nineteen measures met "well-established" criteria and the remaining ten were "approaching well-established." "Well-established" measures were documented for each of the broad assessment categories named above. Many measures deemed "well-established" in the general population are proving to be reliable and useful in pediatric samples. More evidence of the validity of family measures is needed in this context. This review should prove helpful to clinicians and researchers as they strive to make evidence-based decisions regarding family measures.
Harris, Joshua D; Erickson, Brandon J; Cvetanovich, Gregory L; Abrams, Geoffrey D; McCormick, Frank M; Gupta, Anil K; Verma, Nikhil N; Bach, Bernard R; Cole, Brian J
2014-02-01
Condition-specific questionnaires are important components in evaluation of outcomes of surgical interventions. No condition-specific study methodological quality questionnaire exists for evaluation of outcomes of articular cartilage surgery in the knee. To develop a reliable and valid knee articular cartilage-specific study methodological quality questionnaire. Cross-sectional study. A stepwise, a priori-designed framework was created for development of a novel questionnaire. Relevant items to the topic were identified and extracted from a recent systematic review of 194 investigations of knee articular cartilage surgery. In addition, relevant items from existing generic study methodological quality questionnaires were identified. Items for a preliminary questionnaire were generated. Redundant and irrelevant items were eliminated, and acceptable items modified. The instrument was pretested and items weighed. The instrument, the MARK score (Methodological quality of ARticular cartilage studies of the Knee), was tested for validity (criterion validity) and reliability (inter- and intraobserver). A 19-item, 3-domain MARK score was developed. The 100-point scale score demonstrated face validity (focus group of 8 orthopaedic surgeons) and criterion validity (strong correlation to Cochrane Quality Assessment score and Modified Coleman Methodology Score). Interobserver reliability for the overall score was good (intraclass correlation coefficient [ICC], 0.842), and for all individual items of the MARK score, acceptable to perfect (ICC, 0.70-1.000). Intraobserver reliability ICC assessed over a 3-week interval was strong for 2 reviewers (≥0.90). The MARK score is a valid and reliable knee articular cartilage condition-specific study methodological quality instrument. This condition-specific questionnaire may be used to evaluate the quality of studies reporting outcomes of articular cartilage surgery in the knee.
Schiffman, Eric L.; Truelove, Edmond L.; Ohrbach, Richard; Anderson, Gary C.; John, Mike T.; List, Thomas; Look, John O.
2011-01-01
AIMS The purpose of the Research Diagnostic Criteria for Temporomandibular Disorders (RDC/TMD) Validation Project was to assess the diagnostic validity of this examination protocol. An overview is presented, including Axis I and II methodology and descriptive statistics for the study participant sample. This paper details the development of reliable methods to establish the reference standards for assessing criterion validity of the Axis I RDC/TMD diagnoses. Validity testing for the Axis II biobehavioral instruments was based on previously validated reference standards. METHODS The Axis I reference standards were based on the consensus of 2 criterion examiners independently performing a comprehensive history, clinical examination, and evaluation of imaging. Intersite reliability was assessed annually for criterion examiners and radiologists. Criterion exam reliability was also assessed within study sites. RESULTS Study participant demographics were comparable to those of participants in previous studies using the RDC/TMD. Diagnostic agreement of the criterion examiners with each other and with the consensus-based reference standards was excellent with all kappas ≥ 0.81, except for osteoarthrosis (moderate agreement, k = 0.53). Intrasite criterion exam agreement with reference standards was excellent (k ≥ 0.95). Intersite reliability of the radiologists for detecting computed tomography-disclosed osteoarthrosis and magnetic resonance imaging-disclosed disc displacement was good to excellent (k = 0.71 and 0.84, respectively). CONCLUSION The Validation Project study population was appropriate for assessing the reliability and validity of the RDC/TMD Axis I and II. The reference standards used to assess the validity of Axis I TMD were based on reliable and clinically credible methods. PMID:20213028
A USEPA-sponsored field demonstration program was conducted to gather technically reliable cost and performance information on the electro-scan (FELL-41) pipeline condition assessment technology. Electro-scan technology can be used to estimate the magnitude and location of pote...
Assessment of a specifically developed bullet casing gun for the stunning of water buffaloes.
Meichtry, Carmen; Glauser, Urs; Glardon, Matthieu; Ross, Steffen G; Lechner, Isabel; Kneubuehl, Beat P; Gascho, Dominic; Spadavecchia, Claudia; von Rotz, Alois; Stojiljkovic, Ana; Stoffel, Michael H
2018-01-01
Water buffaloes and cattle differ considerably with respect to the anatomy of the head. As a result, captive bolt stunners often fail to reliably produce adequate loss of consciousness in water buffaloes and, thus, do not fulfill animal welfare requirements. The goal of the present study was to assess and validate a new stunning device for water buffaloes meeting animal welfare and occupational safety requirements. The newly designed bullet casing gun uses .357Mag/10.2g hollow point bullets and has additional safety features. Its effectiveness and usability were assessed under practical conditions in an abattoir based on widely accepted criteria. Stunning resulted in deep unconsciousness in 19 out of 20 water buffaloes. One 9-year-old male did not immediately collapse. Except for very old bulls, the device presented here provides a means to stun water buffaloes of both sexes effectively and reliably while keeping occupational hazards to a minimum. Copyright © 2017 Elsevier Ltd. All rights reserved.
Hiller, W; Goebel, G
1999-01-01
The development and course of chronic tinnitus are determined by both biological and psychological factors. To combine these different sources of data, we developed a standardized interview to assess tinnitus history, summarize audiological findings, screen for etiological conditions, and explore tinnitus-related psychological complaints (Structured Tinnitus Interview). The results of a test-retest study with 65 tinnitus inpatients show that most of these components can be assessed with acceptable or high reliability. Further data based on 166 patients demonstrate that tinnitus annoyance was to some extent different from patterns of general psychological complaints, although there were medium intercorrelations with depression. Significant predictors of tinnitus annoyance were (a) continuous tinnitus without intervals, (b) hearing loss, (c) increasing tinnitus loudness over time, (d) poor maskability, (e) history of sudden hearing loss, and (f) associated craniomandibular disorder. Psychological distress was not significantly increased in patients whose tinnitus was associated with vascular disorder, cervical spine dysfunction, acoustic trauma, Ménière's disease, or neurological disorder.
Anguera, J A; Brandes-Aitken, A N; Rolle, C E; Skinner, S N; Desai, S S; Bower, J D; Martucci, W E; Chung, W K; Sherr, E H; Marco, E J
2016-09-20
Assessing cognitive abilities in children is challenging for two primary reasons: lack of testing engagement can lead to low testing sensitivity and inherent performance variability. Here we sought to explore whether an engaging, adaptive digital cognitive platform built to look and feel like a video game would reliably measure attention-based abilities in children with and without neurodevelopmental disabilities related to a known genetic condition, 16p11.2 deletion. We assessed 20 children with 16p11.2 deletion, a genetic variation implicated in attention deficit/hyperactivity disorder and autism, as well as 16 siblings without the deletion and 75 neurotypical age-matched children. Deletion carriers showed significantly slower response times and greater response variability when compared with all non-carriers; by comparison, traditional non-adaptive selective attention assessments were unable to discriminate group differences. This phenotypic characterization highlights the potential power of administering tools that integrate adaptive psychophysical mechanics into video-game-style mechanics to achieve robust, reliable measurements.
Reliability of pulse waveform separation analysis: effects of posture and fasting.
Stoner, Lee; Credeur, Daniel; Fryer, Simon; Faulkner, James; Lambrick, Danielle; Gibbs, Bethany Barone
2017-03-01
Oscillometric pulse wave analysis devices enable, with relative simplicity and objectivity, the measurement of central hemodynamic parameters. The important parameters are central blood pressures and indices of arterial wave reflection, including wave separation analysis (backward pressure component Pb and reflection magnitude). This study sought to determine whether the measurement precision (between-day reliability) of Pb and reflection magnitude (1) exceeds the criterion for acceptable reliability and (2) is affected by posture (supine, seated) and fasting state. Twenty healthy adults (50% female, 27.9 years, 24.2 kg/m²) were tested on six different mornings: 3 days in a fasted condition and 3 days in a nonfasted condition. On each occasion, participants were tested in supine and seated postures. Oscillometric pressure waveforms were recorded on the left upper arm. The criterion intra-class correlation coefficient value of 0.75 was exceeded for Pb (0.76) and reflection magnitude (0.77) when participants were assessed under the combined supine-fasted condition. The intra-class correlation coefficient was lowest for Pb in the seated-nonfasted condition (0.57), and lowest for reflection magnitude in the seated-fasted condition (0.56). For Pb, the smallest detectable change that must be exceeded in order for a significant change to occur in an individual was 2.5 mmHg, and for reflection magnitude, the smallest detectable change was 8.5%. Assessments of Pb and reflection magnitude (1) exceed the criterion for acceptable reliability and (2) are most reliable when participants are fasted and in a supine position. The demonstrated reliability suggests sufficient precision to detect clinically meaningful changes in reflection magnitude and Pb.
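The smallest detectable change reported above is often derived from the standard error of measurement as SDC = 1.96 x sqrt(2) x SEM, where sqrt(2) accounts for error in both the baseline and the follow-up measurement. The sketch below assumes that convention, since the abstract does not give the formula. The SEM value is hypothetical and is not taken from the study.

```python
from math import sqrt


def smallest_detectable_change(sem, z=1.96):
    """SDC at the individual level: z * sqrt(2) * SEM."""
    return z * sqrt(2.0) * sem


def is_real_change(baseline, follow_up, sem, z=1.96):
    """True when the observed difference exceeds the smallest detectable change."""
    return abs(follow_up - baseline) > smallest_detectable_change(sem, z)


# Hypothetical SEM of 1.0 mmHg for the backward pressure component
print(smallest_detectable_change(1.0))
print(is_real_change(baseline=14.0, follow_up=17.5, sem=1.0))
```

A change smaller than the SDC cannot be distinguished from measurement noise in a single individual, even if it would be meaningful at the group level.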
Do downscaled general circulation models reliably simulate historical climatic conditions?
Bock, Andrew R.; Hay, Lauren E.; McCabe, Gregory J.; Markstrom, Steven L.; Atkinson, R. Dwight
2018-01-01
The accuracy of statistically downscaled (SD) general circulation model (GCM) simulations of monthly surface climate for historical conditions (1950–2005) was assessed for the conterminous United States (CONUS). The SD monthly precipitation (PPT) and temperature (TAVE) from 95 GCMs from phases 3 and 5 of the Coupled Model Intercomparison Project (CMIP3 and CMIP5) were used as inputs to a monthly water balance model (MWBM). Distributions of MWBM input (PPT and TAVE) and output [runoff (RUN)] variables derived from gridded station data (GSD) and historical SD climate were compared using the Kolmogorov–Smirnov (KS) test. For all three variables considered, the KS test results showed that variables simulated using CMIP5 generally are more reliable than those derived from CMIP3, likely due to improvements in PPT simulations. At most locations across the CONUS, the largest differences between GSD and SD PPT and RUN occurred in the lowest part of the distributions (i.e., low-flow RUN and low-magnitude PPT). Results indicate that for the majority of the CONUS, there are downscaled GCMs that can reliably simulate historical climatic conditions. But, in some geographic locations, none of the SD GCMs replicated historical conditions for two of the three variables (PPT and RUN) based on the KS test, with a significance level of 0.05. In these locations, improved GCM simulations of PPT are needed to more reliably estimate components of the hydrologic cycle. Simple metrics and statistical tests, such as those described here, can provide an initial set of criteria to help simplify GCM selection.
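As a rough illustration of the distribution comparison described above, the two-sample KS statistic can be sketched in a few lines of Python. This is not the authors' code: the function names are hypothetical, and the critical value uses the standard large-sample approximation, which is an assumption that may be too loose for short records.

```python
import math
from bisect import bisect_right


def ks_statistic(sample_a, sample_b):
    """Two-sample KS statistic: largest gap between the two empirical CDFs."""
    a = sorted(sample_a)
    b = sorted(sample_b)
    d = 0.0
    # The largest gap always occurs at one of the observed values.
    for x in a + b:
        cdf_a = bisect_right(a, x) / len(a)
        cdf_b = bisect_right(b, x) / len(b)
        d = max(d, abs(cdf_a - cdf_b))
    return d


def ks_critical_value(n_a, n_b, alpha=0.05):
    """Large-sample critical value for the two-sample KS test (approximation)."""
    c_alpha = math.sqrt(-0.5 * math.log(alpha / 2.0))
    return c_alpha * math.sqrt((n_a + n_b) / (n_a * n_b))


def distributions_differ(sample_a, sample_b, alpha=0.05):
    """True when the KS statistic exceeds the approximate critical value."""
    d = ks_statistic(sample_a, sample_b)
    return d > ks_critical_value(len(sample_a), len(sample_b), alpha)
```

Under this sketch, a downscaled series would count as replicating the observed distribution at a location when `distributions_differ` returns False at the 0.05 level.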
The long-term reliability of static and dynamic quantitative sensory testing in healthy individuals.
Marcuzzi, Anna; Wrigley, Paul J; Dean, Catherine M; Adams, Roger; Hush, Julia M
2017-07-01
Quantitative sensory tests (QSTs) have been increasingly used to investigate alterations in somatosensory function in a wide range of painful conditions. The interpretation of these findings is based on the assumption that the measures are stable and reproducible. To date, reliability of QST has been investigated for short test-retest intervals. The aim of this study was to investigate the long-term reliability of a multimodal QST assessment in healthy people, with testing conducted on 3 occasions over 4 months. Forty-two healthy people were enrolled in the study. Static and dynamic tests were performed, including cold and heat pain threshold (CPT, HPT), mechanical wind-up [wind-up ratio (WUR)], pressure pain threshold (PPT), 2-point discrimination (TPD), and conditioned pain modulation (CPM). Systematic bias, relative reliability and agreement were analysed using repeated measure analysis of variance, intraclass correlation coefficients (ICC[3,1]) and SE of the measurement (SEM), respectively. Static QST (CPT, HPT, PPT, and TPD) showed good-to-excellent reliability (ICCs: 0.68-0.90). Dynamic QST (WUR and CPM) showed poor-to-good reliability (ICCs: 0.35-0.61). A significant linear decrease over time was observed for mechanical QST at the back (PPT and TPD) and for CPM (P < 0.01). Static QST were stable over a period of 4 months; however, a small systematic decrease over time has been observed for mechanical QST. Dynamic QST showed considerable variability over time; in particular, CPM using PPT as the test stimulus did not show adequate reliability, suggesting that this test paradigm may be less useful for monitoring individuals over time.
Body postures and patterns as amplifiers of physical condition.
Taylor, P W; Hasson, O; Clark, D L
2000-01-01
The question of why receivers accept a selfish signaller's message as reliable or 'honest' has fuelled ample controversy in discussions of communication. The handicap mechanism is now widely accepted as a potent constraint on cheating. Handicap signals are deemed reliable by their costs: signallers must choose between investing in the signal or in other aspects of fitness. Accordingly, resources allocated to the signal come to reflect the signaller's fitness budget and, on average, cheating is uneconomic. However, that signals may also be deemed reliable by their design, regardless of costs, is not widely appreciated. Here we briefly describe indices and amplifiers, reliable signals that may be essentially cost free. Indices are reliable because they bear a direct association with the signalled quality rather than costs. Amplifiers do not directly provide information about signaller quality, but they facilitate assessment by increasing the apparency of pre-existing cues and signals that are associated with quality. We present results of experiments involving a jumping spider (Plexippus paykulli) to illustrate how amplifiers can facilitate assessment of cues associated with physical condition without invoking the costs required for handicap signalling. PMID:10853735
NASA Astrophysics Data System (ADS)
Li, Lin; Zeng, Li; Lin, Zi-Jing; Cazzell, Mary; Liu, Hanli
2015-05-01
Test-retest reliability of neuroimaging measurements is an important concern in the investigation of cognitive functions in the human brain. To date, intraclass correlation coefficients (ICCs), originally used in inter-rater reliability studies in behavioral sciences, have become commonly used metrics in reliability studies on neuroimaging and functional near-infrared spectroscopy (fNIRS). However, as there are six popular forms of ICC, how well one understands them affects how appropriately one can select, use, and interpret ICCs in a reliability study. We first offer a brief review and tutorial on the statistical rationale of ICCs, including their underlying analysis of variance models and technical definitions, in the context of assessment on intertest reliability. Second, we provide general guidelines on the selection and interpretation of ICCs. Third, we illustrate the proposed approach by using an actual research study to assess intertest reliability of fNIRS-based, volumetric diffuse optical tomography of brain activities stimulated by a risk decision-making protocol. Last, special issues that may arise in reliability assessment using ICCs are discussed and solutions are suggested.
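To make the analysis-of-variance basis of ICCs concrete, the sketch below computes two of the six common forms, the single-measure ICC(2,1) and ICC(3,1), from a subjects-by-sessions score table using the usual two-way mean squares. It is an illustrative sketch rather than the tutorial's own procedure; the function name and data layout are assumptions.

```python
def icc_single_measure(scores):
    """Return ICC(2,1) and ICC(3,1) for a table of scores.

    scores: list of rows, one per subject, each holding k measurements
    (one per rater or test session). Requires at least 2 rows and 2 columns.
    """
    n = len(scores)
    k = len(scores[0])
    grand = sum(sum(row) for row in scores) / (n * k)
    row_means = [sum(row) / k for row in scores]
    col_means = [sum(row[j] for row in scores) / n for j in range(k)]

    # Two-way ANOVA sums of squares (no replication).
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for row in scores for x in row)
    ss_error = ss_total - ss_rows - ss_cols

    ms_rows = ss_rows / (n - 1)              # between-subjects mean square
    ms_cols = ss_cols / (k - 1)              # between-sessions mean square
    ms_error = ss_error / ((n - 1) * (k - 1))

    # ICC(3,1): consistency, sessions treated as fixed.
    icc31 = (ms_rows - ms_error) / (ms_rows + (k - 1) * ms_error)
    # ICC(2,1): absolute agreement, sessions treated as random.
    icc21 = (ms_rows - ms_error) / (
        ms_rows + (k - 1) * ms_error + k * (ms_cols - ms_error) / n
    )
    return {"ICC(2,1)": icc21, "ICC(3,1)": icc31}
```

A constant offset between two sessions leaves ICC(3,1) at 1 but lowers ICC(2,1), which is one reason the choice of form matters when interpreting a reported value.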
Kergoat, Marie-Jeanne; Leclerc, Bernard-Simon; Leduc, Nicole; Latour, Judith; Berg, Katherine; Bolduc, Aline
2009-07-29
The number of elderly people requiring hospital care is growing, so quality and assessment of care for elders are emerging and complex areas of research. Very few validated and reliable instruments exist for the assessment of quality of acute care in this field. This study's objective was to create such a tool for Geriatric Evaluation and Management Units (GEMUs). The methodology involved a reliability and feasibility study of a retrospective chart review on 934 older inpatients admitted in 49 GEMUs during the year 2002-2003 for fall-related trauma as a tracer condition. Pertinent indicators for a chart abstraction tool, the Geriatric Care Tool (GCT), were developed and validated according to five dimensions: access to care, comprehensiveness, continuity of care, patient-centred care and appropriateness. Consensus methods were used to develop the content. Participants were experts representing eight main health care professions involved in GEMUs from 19 different sites. Items associated with high quality of care at each step of the multidisciplinary management of patients admitted due to falls were identified. The GCT was tested for intra- and inter-rater reliability using 30 medical charts reviewed by each of three independent and blinded trained nurses. Kappa and agreement measures between pairs of chart reviewers were computed on an item-by-item basis. Three quarters of 169 items identifying the process of care, from the case history to discharge planning, demonstrated good agreement (kappa greater than 0.40 and agreement over 70%). Indicators for the appropriateness of care showed less reliability. Content validity and reliability results, as well as the feasibility of the process, suggest that the chart abstraction tool can gather standardized and pertinent clinical information for further evaluating quality of care in GEMU using admission due to falls as a tracer condition.
However, the GCT should be evaluated in other models of acute geriatric units and new strategies should be developed to improve reliability of peer assessments in characterizing the quality of care for elderly patients with complex conditions.
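The item-by-item kappa and agreement figures reported for pairs of chart reviewers can be illustrated with a small Python sketch. It assumes Cohen's kappa for two raters, which the abstract does not state explicitly, and the function names are hypothetical.

```python
from collections import Counter


def percent_agreement(rater_a, rater_b):
    """Share of charts on which the two reviewers gave the same rating."""
    return sum(x == y for x, y in zip(rater_a, rater_b)) / len(rater_a)


def cohens_kappa(rater_a, rater_b):
    """Cohen's kappa: agreement corrected for chance, for two raters.

    Undefined (division by zero) when chance agreement equals 1, i.e. when
    both raters use a single identical category for every chart.
    """
    n = len(rater_a)
    observed = percent_agreement(rater_a, rater_b)
    count_a = Counter(rater_a)
    count_b = Counter(rater_b)
    # Chance agreement from each rater's marginal category frequencies.
    expected = sum(count_a[c] * count_b[c] for c in count_a) / (n * n)
    return (observed - expected) / (1 - expected)


def meets_threshold(rater_a, rater_b, min_kappa=0.40, min_agreement=0.70):
    """Apply the abstract's criterion: kappa > 0.40 and agreement > 70%."""
    return (
        cohens_kappa(rater_a, rater_b) > min_kappa
        and percent_agreement(rater_a, rater_b) > min_agreement
    )
```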
Peterson, Eleanor B; Calhoun, Aaron W; Rider, Elizabeth A
2014-09-01
With increased recognition of the importance of sound communication skills and communication skills education, reliable assessment tools are essential. This study reports on the psychometric properties of an assessment tool based on the Kalamazoo Consensus Statement Essential Elements Communication Checklist. The Gap-Kalamazoo Communication Skills Assessment Form (GKCSAF), a modified version of an existing communication skills assessment tool, the Kalamazoo Essential Elements Communication Checklist-Adapted, was used to assess learners in a multidisciplinary, simulation-based communication skills educational program using multiple raters. 118 simulated conversations were available for analysis. Internal consistency and inter-rater reliability were determined by calculating a Cronbach's alpha score and intra-class correlation coefficients (ICC), respectively. The GKCSAF demonstrated high internal consistency with a Cronbach's alpha score of 0.844 (faculty raters) and 0.880 (peer observer raters), and high inter-rater reliability with an ICC of 0.830 (faculty raters) and 0.89 (peer observer raters). The Gap-Kalamazoo Communication Skills Assessment Form is a reliable method of assessing the communication skills of multidisciplinary learners using multi-rater methods within the learning environment. The Gap-Kalamazoo Communication Skills Assessment Form can be used by educational programs that wish to implement a reliable assessment and feedback system for a variety of learners. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
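For readers unfamiliar with the internal-consistency statistic quoted above, Cronbach's alpha can be computed from item-level scores as in the following sketch. The data layout (one list per checklist item) and the function name are assumptions made for illustration, not details taken from the study.

```python
from statistics import variance


def cronbach_alpha(item_scores):
    """Cronbach's alpha for a set of items.

    item_scores: one list per item, each holding that item's score for
    every rated conversation, in the same order across items.
    """
    k = len(item_scores)
    # Total score per conversation, summed across items.
    totals = [sum(values) for values in zip(*item_scores)]
    sum_item_variances = sum(variance(item) for item in item_scores)
    return k / (k - 1) * (1 - sum_item_variances / variance(totals))
```

Alpha approaches 1 when items rise and fall together across conversations and drops as items vary independently of one another.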
Clark, Ross A; Pua, Yong-Hao; Oliveira, Cristino C; Bower, Kelly J; Thilarajah, Shamala; McGaw, Rebekah; Hasanki, Ksaniel; Mentiplay, Benjamin F
2015-07-01
The Microsoft Kinect V2 for Windows, also known as the Xbox One Kinect, includes new and potentially far improved depth and image sensors which may increase its accuracy for assessing postural control and balance. The aim of this study was to assess the concurrent validity and reliability of kinematic data recorded using a marker-based three dimensional motion analysis (3DMA) system and the Kinect V2 during a variety of static and dynamic balance assessments. Thirty healthy adults performed two sessions, separated by one week, consisting of static standing balance tests under different visual (eyes open vs. closed) and supportive (single limb vs. double limb) conditions, and dynamic balance tests consisting of forward and lateral reach and an assessment of limits of stability. Marker coordinate and joint angle data were concurrently recorded using the Kinect V2 skeletal tracking algorithm and the 3DMA system. Task-specific outcome measures from each system on Day 1 and 2 were compared. Concurrent validity of trunk angle data during the dynamic tasks and anterior-posterior range and path length in the static balance tasks was excellent (Pearson's r>0.75). In contrast, concurrent validity for medial-lateral range and path length was poor to modest for all trials except single leg eyes closed balance. Within device test-retest reliability was variable; however, the results were generally comparable between devices. In conclusion, the Kinect V2 has the potential to be used as a reliable and valid tool for the assessment of some aspects of balance performance. Copyright © 2015 Elsevier B.V. All rights reserved.
Sutherland, Rebecca; Trembath, David; Hodge, Antoinette; Drevensek, Suzi; Lee, Sabrena; Silove, Natalie; Roberts, Jacqueline
2017-01-01
Introduction Telehealth can be an effective way to provide speech pathology intervention to children with speech and language impairments. However, the provision of reliable and feasible standardised language assessments via telehealth to establish children's needs for intervention and to monitor progress has not yet been well established. Further, there is limited information about children's reactions to telehealth. This study aimed to examine the reliability and feasibility of conducting standardised language assessment with school-aged children with known or suspected language impairment via a telehealth application using consumer grade computer equipment within a public school setting. Method Twenty-three children (aged 8-12 years) participated. Each child was assessed using a standardised language assessment comprising six subtests. Two subtests were administered by a speech pathologist face-to-face (local clinician) and four subtests were administered via telehealth. All subtests were completed within a single visit to the clinic service, with a break between the face to face and telehealth sessions. The face-to-face clinician completed behaviour observation checklists in the telehealth and face to face conditions and provided feedback on the audio and video quality of the application from the child's point of view. Parent feedback about their child's experience was elicited via survey. Results There was strong inter-rater reliability in the telehealth and face-to-face conditions (correlation coefficients ranged from r = 0.96-1.0 across the subtests) and good agreement on all measures. Similar levels of attention, distractibility and anxiety were observed in the two conditions. Clinicians rated only one session of 23 as having poor audio quality and no sessions were rated as having poor visual quality. Parent and child reactions to the use of telehealth were largely positive and supportive of using telehealth to assess rural children. 
Discussion The findings support the use of telehealth in the language assessment of school-aged children using a web application and commercially available computer equipment. This reliable and innovative service delivery model has the potential to be used by speech pathologists to provide assessments to children in remote communities.
Probabilistic sizing of laminates with uncertainties
NASA Technical Reports Server (NTRS)
Shah, A. R.; Liaw, D. G.; Chamis, C. C.
1993-01-01
A reliability based design methodology for laminate sizing and configuration for a special case of composite structures is described. The methodology combines probabilistic composite mechanics with probabilistic structural analysis. The uncertainties of constituent materials (fiber and matrix) to predict macroscopic behavior are simulated using probabilistic theory. Uncertainties in the degradation of composite material properties are included in this design methodology. A multi-factor interaction equation is used to evaluate load and environment dependent degradation of the composite material properties at the micromechanics level. The methodology is integrated into a computer code IPACS (Integrated Probabilistic Assessment of Composite Structures). Versatility of this design approach is demonstrated by performing a multi-level probabilistic analysis to size the laminates for design structural reliability of random type structures. The results show that laminate configurations can be selected to improve the structural reliability from three failures in 1000, to no failures in one million. Results also show that the laminates with the highest reliability are the least sensitive to the loading conditions.
Probabilistic Assessment of National Wind Tunnel
NASA Technical Reports Server (NTRS)
Shah, A. R.; Shiao, M.; Chamis, C. C.
1996-01-01
A preliminary probabilistic structural assessment of the critical section of National Wind Tunnel (NWT) is performed using NESSUS (Numerical Evaluation of Stochastic Structures Under Stress) computer code. Thereby, the capabilities of NESSUS code have been demonstrated to address reliability issues of the NWT. Uncertainties in the geometry, material properties, loads and stiffener location on the NWT are considered to perform the reliability assessment. Probabilistic stress, frequency, buckling, fatigue and proof load analyses are performed. These analyses cover the major global and some local design requirements. Based on the assumed uncertainties, the results reveal the assurance of minimum 0.999 reliability for the NWT. Preliminary life prediction analysis results show that the life of the NWT is governed by the fatigue of welds. Also, reliability based proof test assessment is performed.
Inter-rater agreement in evaluation of disability: systematic review of reproducibility studies
Barth, Jürgen; de Boer, Wout E L; Busse, Jason W; Hoving, Jan L; Kedzia, Sarah; Couban, Rachel; Fischer, Katrin; von Allmen, David Y; Spanjer, Jerry
2017-01-01
Objectives To explore agreement among healthcare professionals assessing eligibility for work disability benefits. Design Systematic review and narrative synthesis of reproducibility studies. Data sources Medline, Embase, and PsycINFO searched up to 16 March 2016, without language restrictions, and review of bibliographies of included studies. Eligibility criteria Observational studies investigating reproducibility among healthcare professionals performing disability evaluations using a global rating of working capacity and reporting inter-rater reliability by a statistical measure or descriptively. Studies could be conducted in insurance settings, where decisions on ability to work include normative judgments based on legal considerations, or in research settings, where decisions on ability to work disregard normative considerations. Teams of paired reviewers identified eligible studies, appraised their methodological quality and generalisability, and abstracted results with pretested forms. As heterogeneity of research designs and findings impeded a quantitative analysis, a descriptive synthesis stratified by setting (insurance or research) was performed. Results From 4562 references, 101 full text articles were reviewed. Of these, 16 studies conducted in an insurance setting and seven in a research setting, performed in 12 countries, met the inclusion criteria. Studies in the insurance setting were conducted with medical experts assessing claimants who were actual disability claimants or played by actors, hypothetical cases, or short written scenarios. Conditions were mental (n=6, 38%), musculoskeletal (n=4, 25%), or mixed (n=6, 38%). Applicability of findings from studies conducted in an insurance setting to real life evaluations ranged from generalisable (n=7, 44%) and probably generalisable (n=3, 19%) to probably not generalisable (n=6, 37%). Median inter-rater reliability among experts was 0.45 (range intraclass correlation coefficient 0.86 to κ−0.10).
Inter-rater reliability was poor in six studies (37%) and excellent in only two (13%). This contrasts with studies conducted in the research setting, where the median inter-rater reliability was 0.76 (range 0.91-0.53), and 71% (5/7) studies achieved excellent inter-rater reliability. Reliability between assessing professionals was higher when the evaluation was guided by a standardised instrument (23 studies, P=0.006). No such association was detected for subjective or chronic health conditions or the studies’ generalisability to real world evaluation of disability (P=0.46, 0.45, and 0.65, respectively). Conclusions Despite their common use and far reaching consequences for workers claiming disabling injury or illness, research on the reliability of medical evaluations of disability for work is limited and indicates high variation in judgments among assessing professionals. Standardising the evaluation process could improve reliability. Development and testing of instruments and structured approaches to improve reliability in evaluation of disability are urgently needed. PMID:28122727
Huo, Tianyao; Guo, Yi; Shenkman, Elizabeth; Muller, Keith
2018-02-13
Although Short Form (SF)-12v2® has been extensively studied and used as a valid measure of health-related quality of life in a variety of population groups, no systematic studies have described the reliability of the measure in patients with behavioral conditions or serious mental illness (SMI). We assessed the internal consistency, split-half reliability and annual test-retest correlations in a sample of 1587 participants with either a combination of physical and behavioral conditions or SMI. The Mosier's alpha was 0.70 for the Physical Composite Scale (PCS) and 0.69 for the Mental Health Composite Scale (MCS), indicating good internal consistency. We observed strong correlations between physical functioning, physical role and body pain scales (r = 0.55-0.56), and between social functioning, emotional role, and mental health (r = 0.53-0.58). We calculated split-half reliabilities to be 0.74 for physical functioning, 0.75 for physical role, 0.73 for emotional role and 0.65 for mental health, respectively. We assessed the annual test-retest correlation using intraclass correlation (ICC) and found an ICC of 0.61 for PCS and 0.57 for MCS composite scores, adjusting for age, sex, race/ethnicity, and CRG. We found no decline in the correlations between baseline and the following study years until year 3. Our results encourage using SF-12v2® to assess health-related quality of life in the Medicaid population with combined physical and behavioral conditions or similar cohorts. The WIN study was registered with clinicaltrials.gov on April 22, 2015. NCT02440906. Retrospectively registered.
Schiffman, Eric L; Truelove, Edmond L; Ohrbach, Richard; Anderson, Gary C; John, Mike T; List, Thomas; Look, John O
2010-01-01
The purpose of the Research Diagnostic Criteria for Temporomandibular Disorders (RDC/TMD) Validation Project was to assess the diagnostic validity of this examination protocol. The aim of this article is to provide an overview of the project's methodology, descriptive statistics, and data for the study participant sample. This article also details the development of reliable methods to establish the reference standards for assessing criterion validity of the Axis I RDC/TMD diagnoses. The Axis I reference standards were based on the consensus of two criterion examiners independently performing a comprehensive history, clinical examination, and evaluation of imaging. Intersite reliability was assessed annually for criterion examiners and radiologists. Criterion examination reliability was also assessed within study sites. Study participant demographics were comparable to those of participants in previous studies using the RDC/TMD. Diagnostic agreement of the criterion examiners with each other and with the consensus-based reference standards was excellent with all kappas ≥ 0.81, except for osteoarthrosis (moderate agreement, κ = 0.53). Intrasite criterion examiner agreement with reference standards was excellent (κ ≥ 0.95). Intersite reliability of the radiologists for detecting computed tomography-disclosed osteoarthrosis and magnetic resonance imaging-disclosed disc displacement was good to excellent (κ = 0.71 and 0.84, respectively). The Validation Project study population was appropriate for assessing the reliability and validity of the RDC/TMD Axis I and II. The reference standards used to assess the validity of Axis I TMD were based on reliable and clinically credible methods.
RELIABILITY AND VALIDITY OF SUBJECTIVE ASSESSMENT OF LUMBAR LORDOSIS IN CONVENTIONAL RADIOGRAPHY.
Ruhinda, E; Byanyima, R K; Mugerwa, H
2014-10-01
Reliability and validity studies of different lumbar curvature analysis and measurement techniques have been documented; however, there is limited literature on the reliability and validity of subjective visual analysis. Radiological assessment of the lumbar lordotic curve aids in early diagnosis of conditions even before neurologic changes set in. To ascertain the level of reliability and validity of subjective assessment of lumbar lordosis in conventional radiography. A blinded, repeated-measures diagnostic test was carried out on lumbar spine x-ray radiographs. Radiology Department at Joint Clinical Research Centre (JCRC), Mengo-Kampala-Uganda. Seventy (70) lateral lumbar x-ray films were used for this study and were obtained from the archive of JCRC radiology department at Butikiro house, Mengo-Kampala. Poor observer agreement, both inter- and intra-observer, with kappa values of 0.16 was found. Inter-observer agreement was poorer than intra-observer agreement. Kappa values significantly rose when the lumbar lordosis was clustered into four categories without grading each abnormality. The results confirm that subjective assessment of lumbar lordosis has low reliability and validity. Film quality has limited influence on the observer reliability. This study further shows that fewer scale categories of lordosis abnormalities produce better observer reliability.
McPherson, Amy C; Gofine, Miriam L; Stinson, Jennifer
2014-01-01
The numbers of children and young people with chronic conditions are increasing. While their information needs may vary, providing health care information can have considerable benefits, including better emotional health, less distress during treatments, and greater satisfaction with medical care. The Internet is increasingly being used to communicate health-related information to children about a range of chronic conditions. However, the quality of such websites is underexplored. Thus, the objectives of this study were to evaluate the reliability and quality of online information for children about chronic conditions using a standardized evaluation tool, and to explore children's and young people's perceptions of quality and trustworthiness regarding online health information. The study consisted of two phases. In Phase 1, websites about common pediatric chronic conditions aimed at children and containing treatment or management options were identified and the quality assessment tool DISCERN was completed. Test-retest and interrater reliability were calculated. In Phase 2, two focus groups with laptop computers were conducted with children and young people with a chronic condition to explore their perceptions of trustworthiness of online health information. In Phase 1, 165 websites were identified and 100 met the criteria and were assessed. The mean DISCERN score of all sites was 48.16 out of 75 (SD = 7.97, range 28-71, min 15 to max 75). Quality scores varied widely across the sample. The internal consistency and interrater reliability scores were both lower than previously reported in studies using the DISCERN to assess information for adults. In Phase 2, two focus groups with a total of six participants aged 11-23 years revealed a relative lack of concern about the quality and trustworthiness of online health information. 
Older participants reported judging the source and authorship of websites, but other participants did not question the source of the information they found online. Although personal websites were perceived to be less reliable than those from well-known medical institutions, they were still valued by many of the participants. Overall, there were relatively few websites about pediatric chronic conditions aimed at children, with variable quality and reliability. However, DISCERN's use with pediatric websites across a broad spectrum of developmental stages is limited. Children and young people demonstrated a need for more guidance around assessing trustworthiness of online information. A more appropriate quality assessment tool is warranted, which could usefully be employed by health care professionals, children, and parents.
User-independent assessment of conditioning pain modulation by cuff pressure algometry.
Graven-Nielsen, T; Izumi, M; Petersen, K K; Arendt-Nielsen, L
2017-03-01
The use of conditioning pain modulation (CPM) is hampered by poor reproducibility and lack of user-independent paradigms. This study refined the CPM paradigm by applying user-independent cuff algometry. In 20 subjects, the CPM effect of conditioning with cuff stimulation on the arm was investigated by pain test stimuli on the contralateral leg before and in parallel with different cuff conditionings (10, 30, 60 kPa/60 s; 30, 60 kPa/10 s). As test stimulus, another cuff was inflated (1 kPa/s) until the subjects detected the pain tolerance threshold (PTT) during which the pain detection threshold (PDT) and the pressure at a pain intensity of 6 cm on a 10-cm visual analogue scale (PVAS6) were extracted. For comparison, pressure pain thresholds (PPTs) as test stimuli were recorded by the user-dependent handheld pressure algometry. Combinations of cuff locations for conditioning (pain intensity standardized) and contralateral test stimuli were additionally evaluated (leg-arm, leg-leg, arm-thigh). The test-retest reliability in two sessions 1 month apart was assessed in five CPM protocols. In all protocols, the PDT, PVAS6 and PTT increased during conditioning compared with baseline (p < 0.05). The CPM effect (i.e. conditioning minus baseline) for PVAS6, PTT and PPT increased for increasing conditioning intensities (p < 0.05). The CPM effects were not significantly different for changes in conditioning durations or conditioning/test stimulus locations. In two sessions, the CPM effects for PVAS6 and PTT assessed after 60 s of conditioning on the leg/thigh showed the highest intra-class correlations (0.47-0.73), where they were 0.04-0.6 for PPTs. The user-independent cuff algometry is reliable for CPM assessment and for supra-pain threshold test stimuli better than the user-dependent technology. A user-independent CPM technique where the conditioning is controlled by one cuff stimulation, and the test-stimulus is provided by another cuff stimulation. 
This study shows that cuff algometry is reliable for CPM assessment. © 2016 European Pain Federation - EFIC®.
Reliability of engineering methods of assessment the critical buckling load of steel beams
NASA Astrophysics Data System (ADS)
Rzeszut, Katarzyna; Folta, Wiktor; Garstecki, Andrzej
2018-01-01
In this paper the reliability assessment of the buckling resistance of steel beams is presented. A number of parameters, such as the boundary conditions, the section height-to-width ratio, the thickness and the span, are considered. The examples are solved using FEM procedures and formulas proposed in the literature and standards. In the case of the numerical models the following parameters are investigated: support conditions, mesh size, load conditions, steel grade. The numerical results are compared with approximate solutions calculated according to the standard formulas. It was observed that for high-slenderness sections the deformation of the cross-section had to be described by the following modes: longitudinal and transverse displacement, warping, rotation and distortion of the cross-section shape. In this case we face an interactive buckling problem. Unfortunately, neither the EN Standards nor the subject literature give closed-form formulas to solve these problems. For this reason the reliability of the critical bending moment calculations is discussed.
NASA Astrophysics Data System (ADS)
Chen, Fan; Huang, Shaoxiong; Ding, Jinjin; Ding, Jinjin; Gao, Bo; Xie, Yuguang; Wang, Xiaoming
2018-01-01
This paper proposes a fast reliability assessment method for distribution grids with distributed renewable energy generation. First, the Weibull distribution and the Beta distribution are used to describe the probability distributions of wind speed and solar irradiance, respectively, and models of the wind farm, solar park and local load are built for reliability assessment. Then, based on power system production cost simulation, probability discretization and linearized power flow, an optimal power flow problem whose objective is the minimum cost of conventional power generation is solved. In this way the reliability assessment of the distribution grid is carried out quickly and accurately. The Loss Of Load Probability (LOLP) and Expected Energy Not Supplied (EENS) are selected as the reliability indices. A simulation of the IEEE RBTS BUS6 system in MATLAB indicates that the proposed method calculates the reliability indices much faster than the Monte Carlo method while maintaining accuracy.
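The Monte Carlo baseline that the abstract compares against can be sketched as follows. This is not the paper's fast method and not the IEEE RBTS BUS6 system: every capacity, load level and distribution parameter below is a hypothetical placeholder, the wind power curve is a simple linear ramp, and network constraints are ignored.

```python
import random


def monte_carlo_reliability(n_samples=20000, seed=0):
    """Estimate LOLP and EENS (MWh/year) by sampling wind and solar output."""
    rng = random.Random(seed)

    # Hypothetical system data, chosen only so the sketch runs.
    conventional_mw = 6.0
    wind_rated_mw = 4.0
    cut_in, rated_speed, cut_out = 3.0, 12.0, 25.0   # m/s
    solar_rated_mw = 3.0
    load_mw = 9.0
    hours_per_year = 8760

    shortfall_count = 0
    shortfall_sum_mw = 0.0
    for _ in range(n_samples):
        # Wind speed from a Weibull distribution (scale 8 m/s, shape 2).
        speed = rng.weibullvariate(8.0, 2.0)
        if speed < cut_in or speed >= cut_out:
            wind_mw = 0.0
        elif speed < rated_speed:
            wind_mw = wind_rated_mw * (speed - cut_in) / (rated_speed - cut_in)
        else:
            wind_mw = wind_rated_mw

        # Normalised solar irradiance from a Beta distribution.
        solar_mw = solar_rated_mw * rng.betavariate(2.0, 3.0)

        shortfall = load_mw - (conventional_mw + wind_mw + solar_mw)
        if shortfall > 0:
            shortfall_count += 1
            shortfall_sum_mw += shortfall

    lolp = shortfall_count / n_samples
    eens = shortfall_sum_mw / n_samples * hours_per_year
    return lolp, eens
```

The cost of this approach grows with the number of samples needed for a stable estimate, which is the motivation the abstract gives for replacing sampling with discretized probability distributions.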
Bayard, Sophie; Lebrun, Cindy; Maudarbocus, Khaalid Hassan; Schellaert, Vanessa; Joffre, Alicia; Ferrante, Esther; Le Louedec, Marie; Cournoulat, Alice; Gely-Nargeot, Marie-Christine; Luik, Annemarie I
2017-12-01
Insomnia disorder is frequent in the population, yet there is no French screening instrument available that is based on the updated DSM-5 criteria. We evaluated the validity and reliability of the French version of an insomnia screening instrument based on DSM-5 criteria, the Sleep Condition Indicator, in a population-based sample of adults. A total of 366 community-dwelling participants completed a face-to-face clinical interview to determine insomnia disorder against DSM-5 criteria and several questionnaires including the French Sleep Condition Indicator version. Three-hundred and twenty-nine participants completed the Sleep Condition Indicator again after 1 month. Statistical analyses were performed to determine the reliability, construct validity, divergent validity and temporal stability of the French translation of the Sleep Condition Indicator. In addition, an explanatory factor analysis was performed to assess the underlying structure. The internal consistency (α = 0.87) and temporal stability (r = 0.86, P < 0.001) of the French Sleep Condition Indicator were high. When using the previously defined cut-off value of ≤ 16, the area under the receiver operating characteristic curve was 0.93 with a sensitivity of 95% and a specificity of 75%. Additionally, good construct and divergent validity were demonstrated. The factor analyses showed a two-factor structure with a focus on sleep and daytime effects. The French version of the Sleep Condition Indicator demonstrates satisfactory psychometric properties while being a useful instrument in detecting cases of insomnia disorder, consistent with features of DSM-5, in the general population. © 2017 European Sleep Research Society.
Applications of computerized adaptive testing (CAT) to the assessment of headache impact.
Ware, John E; Kosinski, Mark; Bjorner, Jakob B; Bayliss, Martha S; Batenhorst, Alice; Dahlöf, Carl G H; Tepper, Stewart; Dowson, Andrew
2003-12-01
To evaluate the feasibility of computerized adaptive testing (CAT) and the reliability and validity of CAT-based estimates of headache impact scores in comparison with 'static' surveys. Responses to the 54-item Headache Impact Test (HIT) were re-analyzed for recent headache sufferers (n = 1016) who completed telephone interviews during the National Survey of Headache Impact (NSHI). Item response theory (IRT) calibrations and the computerized dynamic health assessment (DYNHA) software were used to simulate CAT assessments by selecting the most informative items for each person and estimating impact scores according to pre-set precision standards (CAT-HIT). Results were compared with IRT estimates based on all items (total-HIT), computerized 6-item dynamic estimates (CAT-HIT-6), and a developmental version of a 'static' 6-item form (HIT-6-D). Analyses focused on: respondent burden (survey length and administration time), score distributions ('ceiling' and 'floor' effects), reliability and standard errors, and clinical validity (diagnosis, level of severity). A random sample (n = 245) was re-assessed to test responsiveness. A second study (n = 1103) compared actual CAT surveys and an improved 'static' HIT-6 among current headache sufferers sampled on the Internet. Respondents completed measures from the first study and the generic SF-8 Health Survey; some (n = 540) were re-tested on the Internet after 2 weeks. In the first study, simulated CAT-HIT and total-HIT scores were highly correlated (r = 0.92) without 'ceiling' or 'floor' effects and with a substantial reduction (90.8%) in respondent burden. Six of the 54 items accounted for the great majority of item administrations (3603/5028, 77.6%). 
CAT-HIT reliability estimates were very high (0.975-0.992) in the range where 95% of respondents scored, and relative validity (RV) coefficients were high for diagnosis (RV = 0.87) and severity (RV = 0.89); patient-level classifications were accurate 91.3% for a diagnosis of migraine. For all three criteria of change, CAT-HIT scores were more responsive than all other measures. In the second study, estimates of respondent burden, item usage, reliability and clinical validity were replicated. The test-retest reliability of CAT-HIT was 0.79 and alternate forms coefficients ranged from 0.85 to 0.91. All correlations with the generic SF-8 were negative. CAT-based administrations of headache impact items achieved very large reductions in respondent burden without compromising validity for purposes of patient screening or monitoring changes in headache impact over time. IRT models and CAT-based dynamic health assessments warrant testing among patients with other conditions.
Spaceflight tracking and data network operational reliability assessment for Skylab
NASA Technical Reports Server (NTRS)
Seneca, V. I.; Mlynarczyk, R. H.
1974-01-01
Data on the spaceflight communications equipment status during the Skylab mission were subjected to an operational reliability assessment. Reliability models were revised to reflect pertinent equipment changes accomplished prior to the beginning of the Skylab missions. Appropriate adjustments were made to fit the data to the models. The availabilities are based on the failure events resulting in the station's inability to support a function or functions, and the MTBFs are based on all events, including 'can support' and 'cannot support'. Data were received from eleven land-based stations and one ship.
Becker, Anne E.; Roberts, Andrea L.; Perloe, Alexandra; Bainivualiku, Asenaca; Richards, Lauren K.; Gilman, Stephen E.; Striegel-Moore, Ruth H.
2010-01-01
Objective: The Global School-based Student Health Survey (GSHS) is an assessment for adolescent health risk behaviors and exposures, supported by the World Health Organization. Although already widely implemented, and intended for youth assessment across diverse ethnic and national contexts, no reliability data have yet been reported for GSHS-based assessment in any ethnicity or country-specific population. This study reports test-retest reliability for GSHS content adapted for a female adolescent ethnic Fijian study sample in Fiji. Design: We adapted and translated GSHS content to assess health risk behaviors as part of a larger study investigating the impact of social transition on ethnic Fijian secondary schoolgirls in Fiji. In order to evaluate the performance of this measure for our ethnic Fijian study sample (n=523), we examined its test-retest reliability with kappa coefficients, % agreement, and prevalence estimates in a sub-sample (n=81). Reliability among strata defined by topic, age, and language was also examined. Results: Average agreement between test and retest was 77%, and average Cohen's kappa was 0.47. Mean kappas for questions from core modules about alcohol use, tobacco use, and sexual behavior were substantial, and higher than those for modules relating to other risk behaviors. Conclusions: Although test-retest reliability of responses within this country-specific version of GSHS content was substantial in several topical domains for this ethnic Fijian sample, only fair reliability for the module assessing dietary behaviors and other individual items suggests that population-specific psychometric evaluation is essential to interpreting language and country-specific GSHS data. PMID:20234961
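The two agreement statistics named in this abstract, percent agreement and Cohen's kappa, can be computed for a single item as in the minimal sketch below. The function name and the example responses are hypothetical; the study's own data and stratified analysis are not reproduced here.

```python
from collections import Counter


def agreement_and_kappa(test, retest):
    """Return (observed agreement, Cohen's kappa) for paired categorical answers."""
    if len(test) != len(retest) or not test:
        raise ValueError("test and retest must be non-empty and of equal length")
    n = len(test)
    # Observed agreement: share of respondents giving the same answer twice.
    p_o = sum(a == b for a, b in zip(test, retest)) / n
    # Chance agreement from the marginal category frequencies of each occasion.
    c1, c2 = Counter(test), Counter(retest)
    p_e = sum(c1[c] * c2[c] for c in set(c1) | set(c2)) / (n * n)
    if p_e == 1.0:
        # Only one category was ever used; kappa is undefined in that case.
        return p_o, float("nan")
    return p_o, (p_o - p_e) / (1.0 - p_e)


if __name__ == "__main__":
    # Hypothetical yes/no (1/0) answers from ten respondents.
    test = [1, 1, 0, 0, 1, 0, 1, 1, 0, 0]
    retest = [1, 0, 0, 0, 1, 0, 1, 1, 1, 0]
    p_o, kappa = agreement_and_kappa(test, retest)
    print(f"agreement = {p_o:.2f}, kappa = {kappa:.2f}")
```

Because kappa discounts chance agreement, an item can show high percent agreement and still have a modest kappa when one answer dominates, which is one reason to report both.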
Thermal fatigue life evaluation of SnAgCu solder joints in a multi-chip power module
NASA Astrophysics Data System (ADS)
Barbagallo, C.; Malgioglio, G. L.; Petrone, G.; Cammarata, G.
2017-05-01
For power devices, reliability under thermal fatigue induced by thermal cycling has become an important concern. The main target of this work is to apply a numerical procedure to assess the fatigue life of lead-free solder joints, which represent, in general, the weakest part of electronic modules. Starting from a real multi-chip power module, FE-based models were built by considering different conditions in model implementation in order to simulate, on the one hand, the worst working condition for the module and, on the other, the module placed in a climatic test chamber undergoing thermal cycles. Simulations were carried out in both steady and transient conditions in order to estimate the module thermal maps, the stress-strain distributions and the effective plastic strain distributions, and finally to assess the number of cycles to failure of the constitutive solder layers.
Comparing Interrater reliability between eye examination and eye self-examination
de Lima, Maria Alzete; Pagliuca, Lorita Marlena Freitag; do Nascimento, Jennara Cândido; Caetano, Joselany Áfio
2017-01-01
Objective: to compare interrater reliability concerning two eye assessment methods. Method: quasi-experimental study conducted with 324 college students including eye self-examination and eye assessment performed by the researchers in a public university. Kappa coefficient was used to verify agreement. Results: reliability coefficients between raters ranged from 0.85 to 0.95, with statistical significance at 0.05. The exams to check for near acuity and peripheral vision presented a reasonable kappa >0.2. The remaining coefficients were higher, ranging from very to totally reliable. Conclusion: comparatively, the results of both methods were similar. The virtual manual on eye self-examination can be used to screen for eye conditions. PMID:29069269
A 2-D process-based model for suspended sediment dynamics: A first step towards ecological modeling
Achete, F. M.; van der Wegen, M.; Roelvink, D.; Jaffe, B.
2015-01-01
In estuaries suspended sediment concentration (SSC) is one of the most important contributors to turbidity, which influences habitat conditions and ecological functions of the system. Sediment dynamics differs depending on sediment supply and hydrodynamic forcing conditions that vary over space and over time. A robust sediment transport model is a first step in developing a chain of models enabling simulations of contaminants, phytoplankton and habitat conditions. This work aims to determine turbidity levels in the complex-geometry delta of the San Francisco estuary using a process-based approach (Delft3D Flexible Mesh software). Our approach includes a detailed calibration against measured SSC levels, a sensitivity analysis on model parameters and the determination of a yearly sediment budget as well as an assessment of model results in terms of turbidity levels for a single year, water year (WY) 2011. Model results show that our process-based approach is a valuable tool in assessing sediment dynamics and their related ecological parameters over a range of spatial and temporal scales. The model may act as the base model for a chain of ecological models assessing the impact of climate change and management scenarios. Here we present a modeling approach that, with limited data, produces reliable predictions and can be useful for estuaries without a large amount of process data.
A 2-D process-based model for suspended sediment dynamics: a first step towards ecological modeling
NASA Astrophysics Data System (ADS)
Achete, F. M.; van der Wegen, M.; Roelvink, D.; Jaffe, B.
2015-06-01
In estuaries suspended sediment concentration (SSC) is one of the most important contributors to turbidity, which influences habitat conditions and ecological functions of the system. Sediment dynamics differs depending on sediment supply and hydrodynamic forcing conditions that vary over space and over time. A robust sediment transport model is a first step in developing a chain of models enabling simulations of contaminants, phytoplankton and habitat conditions. This work aims to determine turbidity levels in the complex-geometry delta of the San Francisco estuary using a process-based approach (Delft3D Flexible Mesh software). Our approach includes a detailed calibration against measured SSC levels, a sensitivity analysis on model parameters and the determination of a yearly sediment budget as well as an assessment of model results in terms of turbidity levels for a single year, water year (WY) 2011. Model results show that our process-based approach is a valuable tool in assessing sediment dynamics and their related ecological parameters over a range of spatial and temporal scales. The model may act as the base model for a chain of ecological models assessing the impact of climate change and management scenarios. Here we present a modeling approach that, with limited data, produces reliable predictions and can be useful for estuaries without a large amount of process data.
Managing unexpected events in the manufacturing of biologic medicines.
Grampp, Gustavo; Ramanan, Sundar
2013-08-01
The manufacturing of biologic medicines (biologics) requires robust process and facility design, rigorous regulatory compliance, and a well-trained workforce. Because of the complex attributes of biologics and their sensitivity to production and handling conditions, manufacturing of these medicines also requires a high-reliability manufacturing organization. As required by regulators, such an organization must monitor the state-of-control for the manufacturing process. A high-reliability organization also invests in an experienced and fully engaged technical support staff and fosters a management culture that rewards in-depth analysis of unexpected results, robust risk assessments, and timely and effective implementation of mitigation measures. Such a combination of infrastructure, technology, human capital, management, and a science-based operations culture does not occur without a strong organizational and financial commitment. These attributes of a high-reliability biologics manufacturer are difficult to achieve and may be differentiating factors as the supply of biologics diversifies in future years.
Gonzalez, Javier T; Veasey, Rachel C; Rumbold, Penny L S; Stevenson, Emma J
2012-10-01
The present study aimed to investigate the reliability of metabolic and subjective appetite responses under fasted conditions and following consumption of a cereal-based breakfast. Twelve healthy, physically active males completed two postabsorption (PA) and two postprandial (PP) trials in a randomised order. In PP trials a cereal-based breakfast providing 1859 kJ of energy was consumed. Expired gas samples were used to estimate energy expenditure and fat oxidation, and 100 mm visual analogue scales were used to determine appetite sensations at baseline and every 30 min for 120 min. Reliability was assessed using limits of agreement, coefficient of variation (CV), intraclass coefficient of correlation and 95% confidence limits of typical error. The limits of agreement and typical error were 292.0 and 105.5 kJ for total energy expenditure, 9.3 and 3.4 g for total fat oxidation and 22.9 and 8.3 mm for time-averaged AUC for hunger sensations, respectively, over the 120 min period in the PP trial. The reliability of energy expenditure and appetite in the 2 h response to a cereal-based breakfast would suggest that an intervention requires a 211 kJ and 16.6 mm difference in total postprandial energy expenditure and time-averaged hunger AUC, respectively, to be meaningful; fat oxidation would require a 6.7 g difference, which may not be sensitive to most meal manipulations. Copyright © 2012 Elsevier Ltd. All rights reserved.
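The abstract reports limits of agreement and typical error without stating how they were computed. A common convention, assumed here, takes the standard deviation of the between-trial differences, divides it by the square root of 2 for the typical error, and multiplies it by 1.96 for the half-width of the 95% limits of agreement. The reported pairs (for example 292.0 and 105.5 kJ) are roughly consistent with that convention, but the study's exact procedure is not given in the abstract. The function name and data below are hypothetical.

```python
import math
import statistics


def reliability_stats(trial1, trial2):
    """Bias, typical error and 95% limits of agreement for two repeated trials."""
    if len(trial1) != len(trial2) or len(trial1) < 2:
        raise ValueError("need two equally long trials with at least 2 subjects")
    diffs = [b - a for a, b in zip(trial1, trial2)]
    bias = statistics.mean(diffs)      # systematic change between trials
    sd_diff = statistics.stdev(diffs)  # sample SD of the differences
    typical_error = sd_diff / math.sqrt(2)
    half_width = 1.96 * sd_diff
    return {
        "bias": bias,
        "typical_error": typical_error,
        "loa": (bias - half_width, bias + half_width),
    }


if __name__ == "__main__":
    # Hypothetical energy expenditure (kJ) for four participants, two trials each.
    trial1 = [1010.0, 1180.0, 950.0, 1230.0]
    trial2 = [1060.0, 1150.0, 1020.0, 1210.0]
    print(reliability_stats(trial1, trial2))
```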
Antecedent wetness conditions based on ERS scatterometer data
NASA Astrophysics Data System (ADS)
Brocca, L.; Melone, F.; Moramarco, T.; Morbidelli, R.
2009-01-01
Summary: Soil moisture is widely recognized as a key parameter in environmental processes, mainly for its role in partitioning rainfall into runoff and infiltration. Therefore, for storm rainfall-runoff modeling the estimation of the antecedent wetness conditions (AWC) is one of the most important aspects. In this context, this study investigates the potential of the scatterometer on board the ERS satellites for the assessment of wetness conditions in three Tiber sub-catchments (Central Italy), one of which includes an experimental area for soil moisture monitoring. The satellite soil moisture data are taken from the ERS/METOP soil moisture archive. First, the scatterometer-derived soil wetness index (SWI) data are compared with two on-site soil moisture data sets acquired by different methodologies on areas of different extent, ranging from 0.01 km² to ~60 km². Moreover, the reliability of SWI to estimate the AWC at the catchment scale is investigated considering the relationship between SWI and the soil potential maximum retention parameter, S, of the Soil Conservation Service-Curve Number (SCS-CN) method for abstraction. Several flood events that occurred from 1992 to 2005 are selected for this purpose. Specifically, the performance of the SWI for S estimation is compared with two antecedent precipitation indices (API) and one base flow index (BFI). The S values obtained through the observed direct runoff volume and rainfall depth are used as the benchmark. Results show the great reliability of the SWI for the estimation of wetness conditions at both the plot and catchment scale despite the complex orography of the investigated areas. As far as the comparison with the on-site soil moisture data sets is concerned, the SWI is found to be quite reliable in representing the soil moisture at a layer depth of 15 cm, with a mean correlation coefficient equal to 0.81. 
Variations in the characteristic time length parameter, as expected, depend on soil type, with values in accordance with previous studies. In terms of AWC assessment at the catchment scale, based on selected flood events, the SWI is found to be highly correlated with the observed maximum potential retention of the SCS-CN method, with a correlation coefficient R equal to -0.90. In addition, in representing the AWC of the three investigated catchments, the SWI outperformed both API indices, which were poorly representative of AWC, and the BFI. Finally, the classical SCS-CN method applied for direct runoff depth estimation, where S is assessed by SWI, provided good performance, with a percentage error not exceeding ~25% for 80% of the investigated rainfall-runoff events.
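For readers unfamiliar with the SCS-CN method referred to above, the following sketch shows the standard runoff relation and its inverse, which is how a benchmark S can be derived from observed rainfall depth and direct runoff. The initial abstraction ratio of 0.2 is the conventional default and is an assumption here, since the abstract does not state the value used; the function names and the numbers in the example are hypothetical.

```python
import math


def scs_cn_runoff(p_mm, s_mm, lam=0.2):
    """Direct runoff depth Q (mm) from rainfall P and potential retention S."""
    ia = lam * s_mm  # initial abstraction
    if p_mm <= ia:
        return 0.0
    return (p_mm - ia) ** 2 / (p_mm - ia + s_mm)


def retention_from_event(p_mm, q_mm):
    """Invert the relation for S, valid only for lam = 0.2 and 0 < Q < P."""
    if not 0.0 < q_mm < p_mm:
        raise ValueError("requires 0 < Q < P")
    return 5.0 * (p_mm + 2.0 * q_mm - math.sqrt(4.0 * q_mm ** 2 + 5.0 * p_mm * q_mm))


if __name__ == "__main__":
    p, s = 50.0, 100.0  # hypothetical event rainfall and retention, in mm
    q = scs_cn_runoff(p, s)
    print(f"Q = {q:.2f} mm, recovered S = {retention_from_event(p, q):.1f} mm")
```

A wetter catchment has a smaller S, so a negative correlation between a wetness index and S, such as the R of -0.90 reported above, is the expected direction.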
Validity and reliability of the Ergomopro powermeter.
Kirkland, A; Coleman, D; Wiles, J D; Hopker, J
2008-11-01
The aim of this investigation was to assess the validity and reliability of the Ergomopro powermeter. Nine participants completed trials on a Monark ergometer fitted with Ergomopro and SRM powermeters simultaneously recording power output. Each participant completed multiple trials at power outputs ranging from 50 to 450 W. The work stages recorded were 60 s in duration and were repeated three times. Participants also completed a single trial on a cycle ergometer designed to assess bilateral contributions to work output (Lode Excaliber Sport PFM). The power output during the trials was significantly different between all three systems (p < 0.01): 231.2 ± 114.2 W, 233.0 ± 112.4 W and 227.8 ± 108.8 W for the Monark, SRM and Ergomopro systems, respectively. When the bilateral contributions were factored into the analysis, there were no significant differences between the powermeters (p = 0.58). The reliability of the Ergomopro system (CV%) was 2.31% (95% CI 2.13-2.52%) compared to 1.59% (95% CI 1.47-1.74%) for the Monark, and 1.37% (95% CI 1.26-1.50%) for the SRM powermeter. These results indicate that the Ergomopro system has acceptable accuracy under these conditions. However, based on the reliability data, the increased variability of the Ergomopro system and bilateral balance issues have to be considered when using this device.
Interrater Reliability of the Supports Intensity Scale (SIS)
ERIC Educational Resources Information Center
Thompson, James R.; Tasse, Marc J.; McLaughlin, Colleen A.
2008-01-01
The interrater reliability of the Supports Intensity Scale (SIS) was investigated under the condition that interviewers had to have been trained and/or experienced in its administration and scoring. Both corrected and noncorrected Pearson's product-moment coefficients were generated to assess interinterviewer, interrespondent, and mixed interrater…
Costa, Y M; Morita-Neto, O; de Araújo-Júnior, E N S; Sampaio, F A; Conti, P C R; Bonjardim, L R
2017-03-01
Assessing the reliability of medical measurements is a crucial step towards the elaboration of an applicable clinical instrument. There are few studies that evaluate the reliability of somatosensory assessment and pain modulation of masticatory structures. This study estimated the test-retest reliability, that is, reliability over time, of the mechanical somatosensory assessment of anterior temporalis, masseter and temporomandibular joint (TMJ) and the conditioned pain modulation (CPM) using the anterior temporalis as the test site. Twenty healthy women were evaluated in two sessions (1 week apart) by the same examiner. Mechanical detection threshold (MDT), mechanical pain threshold (MPT), wind-up ratio (WUR) and pressure pain threshold (PPT) were assessed on the skin overlying the anterior temporalis, masseter and TMJ of the dominant side. CPM was tested by comparing PPT before and during the hand immersion in a hot water bath. ANOVA and intra-class correlation coefficients (ICCs) were applied to the data (α = 5%). The overall ICCs showed acceptable values for the test-retest reliability of mechanical somatosensory assessment of masticatory structures. The ICC values of 75% of all quantitative sensory measurements were considered fair to excellent (fair = 8·4%, good = 33·3% and excellent = 33·3%). However, the CPM paradigm presented poor reliability (ICC = 0·25). The mechanical somatosensory assessment of the masticatory structures, but not the proposed CPM protocol, can be considered sufficiently reliable over time to evaluate the trigeminal sensory function. © 2016 John Wiley & Sons Ltd.
Schneebeli, Alessandro; Del Grande, Filippo; Vincenzo, Gabriele; Cescon, Corrado; Clijsen, Ron; Biordi, Fulvio; Barbero, Marco
2016-08-01
To establish the test-retest reliability of sonoelastography (SE) on healthy Achilles tendons in contracted and relaxed states using an external reference system. Forty-eight Achilles tendons from 24 healthy volunteers were assessed using ultrasound and real-time SE with an external reference material. Tendons were analyzed under relaxed and contracted conditions. Strain ratios between the tendons and the reference material were calculated. The intraclass correlation coefficient (ICC2.k) and Bland-Altman plot were used to assess test-retest reliability. The reliability of SE measurements under relaxed conditions ranged from high to very high, with an ICC2.k of 0.84 (95 % CI: 0.64-0.92) for reference material, 0.91 (95 % CI: 0.83-0.95) for Achilles tendons and 0.95 (95 % CI: 0.91-0.97) for Kager fat pads (KFP). The ICC2.k value for skin was 0.30 (95 % CI: -0.26 to 0.61). Reliability for measurements in the contracted state ranged from high to very high, with an ICC2.k of 0.93 (95 % CI: 0.87-0.96) for reference material, 0.72 (95 % CI: 0.50-0.84) for skin, 0.93 (95 % CI: 0.87-0.96) for Achilles tendons, and 0.81 (95 % CI: 0.66-0.89) for KFP. Reliability of the strain ratio (tendon/reference) under relaxed conditions was high with an ICC2.k of 0.87 (95 % CI: 0.75-0.93), and in the contracted state, it was very high with an ICC2.k of 0.94 (95 % CI: 0.90-0.97). Sonoelastography using an external reference material is a reliable and simple technique for the assessment of the elasticity of healthy Achilles tendons. The use of an external material as a reference, along with strain ratios, could provide a quantitative measure of elasticity.
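The ICC2.k statistic used in this abstract corresponds, in the Shrout and Fleiss notation, to the two-way random-effects, absolute-agreement, average-measures coefficient. A minimal sketch of its computation from the ANOVA mean squares is given below. The function name is hypothetical, the input is assumed to be a complete subjects-by-sessions table, and the confidence intervals reported in the abstract are not computed.

```python
def icc_2k(data):
    """ICC(2,k) for a complete table: rows are subjects, columns are sessions."""
    n = len(data)
    k = len(data[0])
    if n < 2 or k < 2 or any(len(row) != k for row in data):
        raise ValueError("need a complete table with at least 2 rows and 2 columns")
    grand = sum(sum(row) for row in data) / (n * k)
    row_means = [sum(row) / k for row in data]
    col_means = [sum(row[j] for row in data) / n for j in range(k)]
    # Sums of squares of the two-way ANOVA without replication.
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for row in data for x in row)
    ss_err = ss_total - ss_rows - ss_cols
    ms_rows = ss_rows / (n - 1)              # between-subjects mean square
    ms_cols = ss_cols / (k - 1)              # between-sessions mean square
    ms_err = ss_err / ((n - 1) * (k - 1))    # residual mean square
    return (ms_rows - ms_err) / (ms_rows + (ms_cols - ms_err) / n)


if __name__ == "__main__":
    # Hypothetical strain ratios for five tendons measured in two sessions.
    table = [[1.2, 1.3], [0.8, 0.9], [1.5, 1.4], [1.0, 1.1], [0.7, 0.7]]
    print(f"ICC(2,k) = {icc_2k(table):.3f}")
```

Because session means enter the denominator, a systematic shift between sessions lowers this coefficient, unlike consistency-type ICCs.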
DOT National Transportation Integrated Search
2016-06-01
Load and Resistance Factor Rating (LRFR) is a reliability-based rating procedure complementary to Load and Resistance Factor Design (LRFD). The intent of LRFR is to provide consistent reliability for all bridges regardless of in-situ condition. The p...
Integrated performance and reliability specification for digital avionics systems
NASA Technical Reports Server (NTRS)
Brehm, Eric W.; Goettge, Robert T.
1995-01-01
This paper describes an automated tool for performance and reliability assessment of digital avionics systems, called the Automated Design Tool Set (ADTS). ADTS is based on an integrated approach to design assessment that unifies traditional performance and reliability views of system designs, and that addresses interdependencies between performance and reliability behavior via exchange of parameters and results between mathematical models of each type. A multi-layer tool set architecture has been developed for ADTS that separates the concerns of system specification, model generation, and model solution. Performance and reliability models are generated automatically as a function of candidate system designs, and model results are expressed within the system specification. The layered approach helps deal with the inherent complexity of the design assessment process, and preserves long-term flexibility to accommodate a wide range of models and solution techniques within the tool set structure. ADTS research and development to date has focused on development of a language for specification of system designs as a basis for performance and reliability evaluation. A model generation and solution framework has also been developed for ADTS that will ultimately encompass an integrated set of analytic and simulation-based techniques for performance, reliability, and combined design assessment.
Reliability, Compliance, and Security in Web-Based Course Assessments
ERIC Educational Resources Information Center
Bonham, Scott
2008-01-01
Pre- and postcourse assessment has become a very important tool for education research in physics and other areas. The web offers an attractive alternative to in-class paper administration, but concerns about web-based administration include reliability due to changes in medium, student compliance rates, and test security, both question leakage…
Kumar, A; Bridgham, R; Potts, M; Gushurst, C; Hamp, M; Passal, D
2001-01-01
To determine consistency of assessment in a new paper case-based structured oral examination in a multi-community pediatrics clerkship, and to identify correctable problems in the administration of examination and assessment process. Nine paper case-based oral examinations were audio-taped. From audio-tapes five community coordinators scored examiner behaviors and graded student performance. Correlations among examiner behaviors scores were examined. Graphs identified grading patterns of evaluators. The effect of exam-giving on evaluators was assessed by t-test. Reliability of grades was calculated and the effect of reducing assessment problems was modeled. Exam-givers differed most in their "teaching-guiding" behavior, and this negatively correlated with student grades. Exam reliability was lowered mainly by evaluator differences in leniency and grading pattern; less important was absence of standardization in cases. While grade reliability was low in early use of the paper case-based oral examination, modeling of plausible effects of training and monitoring for greater uniformity in administration of the examination and assigning scores suggests that more adequate reliabilities can be attained.
Kvistgaard Olsen, Jack; Fener, Dilay Kesgin; Waehrens, Eva Elisabet; Wulf Christensen, Anton; Jespersen, Anders; Danneskiold-Samsøe, Bente; Bartels, Else Marie
2017-07-01
Computerized pneumatic cuff pressure algometry (CPA) using the DoloCuff is a new method for pain assessment. Intra- and inter-rater reliabilities have not yet been established. Our aim was to examine the inter- and intrarater reliabilities of DoloCuff measures in healthy subjects. Twenty healthy subjects (ages 20 to 29 years) were assessed three times at 24-hour intervals by two trained raters. Inter-rater reliability was established based on the first and second assessments, whereas intrarater reliability was based on the second and third assessments. Subjects were randomized 1:1 to first assessment at either rater 1 or rater 2. The variables of interest were pressure pain threshold (PT), pressure pain tolerance (PTol), and temporal summation index (TSI). Reliability was estimated by a two-way mixed intraclass correlation coefficient (ICC) absolute agreement analysis. Reliability was considered excellent if ICC > 0.75, fair to good if 0.4 < ICC < 0.75, and poor if ICC < 0.4. Bias and random errors between raters and assessments were evaluated using 95% confidence interval (CI) and Bland-Altman plots. Inter-rater reliability for PT, PTol, and TSI was 0.88 (95% CI: 0.69 to 0.95), 0.86 (95% CI: 0.65 to 0.95), and 0.81 (95% CI: 0.42 to 0.94), respectively. The intrarater reliability for PT, PTol, and TSI was 0.81 (95% CI: 0.53 to 0.92), 0.89 (95% CI: 0.74 to 0.96), and 0.75 (95% CI: 0.28 to 0.91), respectively. Inter-rater reliability was excellent for PT, PTol, and TSI. Similarly, the intrarater reliability for PT and PTol was excellent, while borderline excellent/good for TSI. Therefore, the DoloCuff can be used to obtain reliable measures of pressure pain parameters in healthy subjects. © 2016 World Institute of Pain.
Use of Internal Consistency Coefficients for Estimating Reliability of Experimental Tasks Scores
Green, Samuel B.; Yang, Yanyun; Alt, Mary; Brinkley, Shara; Gray, Shelley; Hogan, Tiffany; Cowan, Nelson
2017-01-01
Reliabilities of scores for experimental tasks are likely to differ from one study to another to the extent that the task stimuli change, the number of trials varies, the type of individuals taking the task changes, the administration conditions are altered, or the focal task variable differs. Given reliabilities vary as a function of the design of these tasks and the characteristics of the individuals taking them, making inferences about the reliability of scores in an ongoing study based on reliability estimates from prior studies is precarious. Thus, it would be advantageous to estimate reliability based on data from the ongoing study. We argue that internal consistency estimates of reliability are underutilized for experimental task data and in many applications could provide this information using a single administration of a task. We discuss different methods for computing internal consistency estimates with a generalized coefficient alpha and the conditions under which these estimates are accurate. We illustrate use of these coefficients using data for three different tasks. PMID:26546100
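As a point of reference for the internal consistency estimates discussed above, the sketch below computes the classical coefficient alpha from a single administration, treating each trial (or block of trials) as an item. This is the standard form, not the generalized coefficient alpha the authors describe, and the function name and data are hypothetical.

```python
import statistics


def coefficient_alpha(scores):
    """Classical coefficient alpha; rows are participants, columns are items."""
    k = len(scores[0])
    if k < 2 or len(scores) < 2 or any(len(row) != k for row in scores):
        raise ValueError("need at least 2 participants and 2 items, no missing data")
    # Variance of each item across participants.
    item_vars = [statistics.variance([row[j] for row in scores]) for j in range(k)]
    # Variance of each participant's total score.
    total_var = statistics.variance([sum(row) for row in scores])
    if total_var == 0:
        raise ValueError("total scores have zero variance; alpha is undefined")
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


if __name__ == "__main__":
    # Hypothetical accuracy scores for four participants on two trial blocks.
    scores = [[1, 2], [2, 1], [3, 4], [4, 3]]
    print(f"alpha = {coefficient_alpha(scores):.2f}")
```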
Salamone, Francesco; Belussi, Lorenzo; Currò, Cristian; Danza, Ludovico; Ghellere, Matteo; Guazzi, Giulia; Lenzi, Bruno; Megale, Valentino; Meroni, Italo
2018-05-17
Thermal comfort has become a topical issue in building performance assessment as well as energy efficiency. Three methods are mainly recognized for its assessment. Two of them, based on standardized methodologies, approach the problem by considering the indoor environment in steady-state conditions (PMV and PPD) and users as active subjects whose thermal perception is influenced by outdoor climatic conditions (adaptive approach). The latter method is the starting point to investigate thermal comfort from an overall perspective by considering endogenous variables besides the traditional physical and environmental ones. Following this perspective, the paper describes the results of an in-field investigation of thermal conditions through the use of nearable and wearable solutions, parametric models and machine learning techniques. The aim of the research is to explore the reliability of IoT-based solutions combined with advanced algorithms, in order to create a replicable framework for the assessment and improvement of user thermal satisfaction. For this purpose, an experimental test in real offices was carried out involving eight workers. Parametric models are applied for the assessment of thermal comfort; IoT solutions are used to monitor the environmental variables and the users' parameters; the machine learning CART method makes it possible to predict the users' profile and the thermal comfort perception with respect to the indoor environment.
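The abstract names PMV and PPD without giving the relation between them. As a minimal sketch, the snippet below uses the standard relation from ISO 7730, in which the Predicted Percentage of Dissatisfied is a fixed function of the Predicted Mean Vote. Computing PMV itself requires air temperature, radiant temperature, air speed, humidity, clothing and metabolic rate, and is not attempted here. The function name is hypothetical and nothing below is taken from the paper's own models.

```python
import math


def ppd_from_pmv(pmv):
    """Predicted Percentage of Dissatisfied (%) for a given PMV (ISO 7730 relation)."""
    return 100.0 - 95.0 * math.exp(-0.03353 * pmv ** 4 - 0.2179 * pmv ** 2)


if __name__ == "__main__":
    for pmv in (-1.0, -0.5, 0.0, 0.5, 1.0):
        print(f"PMV = {pmv:+.1f} -> PPD = {ppd_from_pmv(pmv):.1f}%")
```

The curve is symmetric about PMV = 0, where it bottoms out at 5%, reflecting the assumption that some occupants remain dissatisfied even in thermally neutral conditions.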
Currò, Cristian; Danza, Ludovico; Ghellere, Matteo; Guazzi, Giulia; Lenzi, Bruno; Megale, Valentino; Meroni, Italo
2018-01-01
Thermal comfort has become a topic issue in building performance assessment as well as energy efficiency. Three methods are mainly recognized for its assessment. Two of them based on standardized methodologies, face the problem by considering the indoor environment in steady-state conditions (PMV and PPD) and users as active subjects whose thermal perception is influenced by outdoor climatic conditions (adaptive approach). The latter method is the starting point to investigate thermal comfort from an overall perspective by considering endogenous variables besides the traditional physical and environmental ones. Following this perspective, the paper describes the results of an in-field investigation of thermal conditions through the use of nearable and wearable solutions, parametric models and machine learning techniques. The aim of the research is the exploration of the reliability of IoT-based solutions combined with advanced algorithms, in order to create a replicable framework for the assessment and improvement of user thermal satisfaction. For this purpose, an experimental test in real offices was carried out involving eight workers. Parametric models are applied for the assessment of thermal comfort; IoT solutions are used to monitor the environmental variables and the users’ parameters; the machine learning CART method allows to predict the users’ profile and the thermal comfort perception respect to the indoor environment. PMID:29772818
Reliability-based trajectory optimization using nonintrusive polynomial chaos for Mars entry mission
NASA Astrophysics Data System (ADS)
Huang, Yuechen; Li, Haiyang
2018-06-01
This paper presents the reliability-based sequential optimization (RBSO) method to settle the trajectory optimization problem with parametric uncertainties in entry dynamics for Mars entry mission. First, the deterministic entry trajectory optimization model is reviewed, and then the reliability-based optimization model is formulated. In addition, the modified sequential optimization method, in which the nonintrusive polynomial chaos expansion (PCE) method and the most probable point (MPP) searching method are employed, is proposed to solve the reliability-based optimization problem efficiently. The nonintrusive PCE method contributes to the transformation between the stochastic optimization (SO) and the deterministic optimization (DO) and to the approximation of trajectory solution efficiently. The MPP method, which is used for assessing the reliability of constraints satisfaction only up to the necessary level, is employed to further improve the computational efficiency. The cycle including SO, reliability assessment and constraints update is repeated in the RBSO until the reliability requirements of constraints satisfaction are satisfied. Finally, the RBSO is compared with the traditional DO and the traditional sequential optimization based on Monte Carlo (MC) simulation in a specific Mars entry mission to demonstrate the effectiveness and the efficiency of the proposed method.
Centralized vs decentralized lunar power system study
NASA Astrophysics Data System (ADS)
Metcalf, Kenneth; Harty, Richard B.; Perronne, Gerald E.
1991-09-01
Three power-system options are considered with respect to utilization on a lunar base: the fully centralized option, the fully decentralized option, and a hybrid comprising features of the first two options. Power source, power conditioning, and power transmission are considered separately, and each architecture option is examined with ac and dc distribution, high and low voltage transmission, and buried and suspended cables. Assessments are made on the basis of mass, technological complexity, cost, reliability, and installation complexity; however, a preferred power-system architecture is not proposed. Preferred options include transmission based on ac and transmission voltages of 2000-7000 V, with buried high-voltage lines and suspended low-voltage lines. Assessments of the total cost associated with the installations are required to determine the most suitable power system.
Reliable Assessment with CyberTutor, a Web-Based Homework Tutor.
ERIC Educational Resources Information Center
Pritchard, David E.; Morote, Elsa-Sofia
This paper demonstrates that an electronic tutoring program can collect data that enables a far more reliable assessment of students' skills than a standard examination. The Socratic electronic homework tutor, CyberTutor, can effectively integrate instruction and assessment. CyberTutor assessment has about 62 times less variance due to random test…
Going DEEP: guidelines for building simulation-based team assessments.
Grand, James A; Pearce, Marina; Rench, Tara A; Chao, Georgia T; Fernandez, Rosemarie; Kozlowski, Steve W J
2013-05-01
Whether for team training, research or evaluation, making effective use of simulation-based technologies requires robust, reliable and accurate assessment tools. Extant literature on simulation-based assessment practices has primarily focused on scenario and instructional design; however, relatively little direct guidance has been provided regarding the challenging decisions and fundamental principles related to assessment development and implementation. The objective of this manuscript is to introduce a generalisable assessment framework supplemented by specific guidance on how to construct and ensure valid and reliable simulation-based team assessment tools. The recommendations reflect best practices in assessment and are designed to empower healthcare educators, professionals and researchers with the knowledge to design and employ valid and reliable simulation-based team assessments. Information and actionable recommendations associated with creating assessments of team processes (non-technical 'teamwork' activities) and performance (demonstration of technical proficiency) are presented which provide direct guidance on how to Distinguish the underlying competencies one aims to assess, Elaborate the measures used to capture team member behaviours during simulation activities, Establish the content validity of these measures and Proceduralise the measurement tools in a way that is systematically aligned with the goals of the simulation activity while maintaining methodological rigour (DEEP). The DEEP framework targets fundamental principles and critical activities that are important for effective assessment, and should benefit healthcare educators, professionals and researchers seeking to design or enhance any simulation-based assessment effort.
NASA Astrophysics Data System (ADS)
Serevina, V.; Muliyati, D.
2018-05-01
This research aims to develop a student performance assessment instrument, based on the scientific approach, that is valid and reliable for assessing student performance in the basic physics lab on Simple Harmonic Motion (SHM). The study uses the ADDIE model, consisting of the stages Analyze, Design, Development, Implementation, and Evaluation. The student performance assessment developed can be used to measure students’ skills in observing, asking, conducting experiments, associating, and communicating experimental results, which are the ‘5M’ stages of the scientific approach. Each assessment item in the instrument was validated by the instrument and evaluation experts, and all items were judged eligible for use, with a 100% eligibility percentage. The instrument was then tested for quality of construction, material, and language by a panel (lecturers), with the following results: construction aspect 85% (very good), material aspect 87.5% (very good), and language aspect 83% (very good). The small-group trial yielded an instrument reliability of 0.878, in the high category, where r-table is 0.707. The large-group trial yielded an instrument reliability of 0.889, in the high category, where r-table is 0.320. The instrument was declared valid and reliable at the 5% significance level. Based on these results, it can be concluded that the student performance assessment instrument developed on the basis of the scientific approach is valid and reliable for assessing student skills in SHM experimental activities.
Everett, Tobias C; Ng, Elaine; Power, Daniel; Marsh, Christopher; Tolchard, Stephen; Shadrina, Anna; Bould, Matthew D
2013-12-01
The use of simulation-based assessments for high-stakes physician examinations remains controversial. The Managing Emergencies in Paediatric Anaesthesia course uses simulation to teach evidence-based management of anesthesia crises to trainee anesthetists in the United Kingdom (UK) and Canada. In this study, we investigated the feasibility and reliability of custom-designed scenario-specific performance checklists and a global rating scale (GRS) assessing readiness for independent practice. After research ethics board approval, subjects were videoed managing simulated pediatric anesthesia crises in a single Canadian teaching hospital. Each subject was randomized to two of six different scenarios. All 60 scenarios were subsequently rated by four blinded raters (two in the UK, two in Canada) using the checklists and GRS. The actual and predicted reliability of the tools was calculated for different numbers of raters using the intraclass correlation coefficient (ICC) and the Spearman-Brown prophecy formula. Average measures ICCs ranged from 'substantial' to 'near perfect' (P ≤ 0.001). The reliability of the checklists and the GRS was similar. Single measures ICCs showed more variability than average measures ICC. At least two raters would be required to achieve acceptable reliability. We have established the reliability of a GRS to assess the management of simulated crisis scenarios in pediatric anesthesia, and this tool is feasible within the setting of a research study. The global rating scale allows raters to make a judgement regarding a participant's readiness for independent practice. These tools may be used in the future research examining simulation-based assessment. © 2013 John Wiley & Sons Ltd.
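The Spearman-Brown prophecy formula named in this abstract predicts the reliability of the average of k raters from the reliability of a single rater. A minimal sketch follows; the function names and the input values are illustrative assumptions, not figures from the study.

```python
def spearman_brown(r_single, k):
    """Predicted reliability of the mean of k raters (k may be fractional)."""
    if not 0 < r_single < 1:
        raise ValueError("r_single must lie strictly between 0 and 1")
    return k * r_single / (1 + (k - 1) * r_single)


def raters_factor_needed(r_single, r_target):
    """Number of raters (as a real number) needed to reach r_target.

    Round up in practice; the rounding is left to the caller to avoid
    floating-point surprises at exact boundaries.
    """
    if not (0 < r_single < 1 and 0 < r_target < 1):
        raise ValueError("reliabilities must lie strictly between 0 and 1")
    return r_target * (1 - r_single) / (r_single * (1 - r_target))


# Hypothetical single-rater reliability of 0.5
predicted_two_raters = spearman_brown(0.5, 2)
needed_for_080 = raters_factor_needed(0.5, 0.8)
```

This is the usual way a single-measures ICC is projected to an average-measures value for a different number of raters.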
Test-retest reliability of sensor-based sit-to-stand measures in young and older adults.
Regterschot, G Ruben H; Zhang, Wei; Baldus, Heribert; Stevens, Martin; Zijlstra, Wiebren
2014-01-01
This study investigated test-retest reliability of sensor-based sit-to-stand (STS) peak power and other STS measures in young and older adults. In addition, test-retest reliability of the sensor method was compared to test-retest reliability of the Timed Up and Go Test (TUGT) and Five-Times-Sit-to-Stand Test (FTSST) in older adults. Ten healthy young female adults (20-23 years) and 31 older adults (21 females; 73-94 years) participated in two assessment sessions separated by 3-8 days. Vertical peak power was assessed during three (young adults) and five (older adults) normal and fast STS trials with a hybrid motion sensor worn on the hip. Older adults also performed the FTSST and TUGT. The average sensor-based STS peak power of the normal STS trials and the average sensor-based STS peak power of the fast STS trials showed excellent test-retest reliability in young adults (intra-class correlation (ICC)≥0.90; zero in 95% confidence interval of mean difference between test and retest (95%CI of D); standard error of measurement (SEM)≤6.7% of mean peak power) and older adults (ICC≥0.91; zero in 95%CI of D; SEM≤9.9%). Test-retest reliability of sensor-based STS peak power and TUGT (ICC=0.98; zero in 95%CI of D; SEM=8.5%) was comparable in older adults, test-retest reliability of the FTSST was lower (ICC=0.73; zero outside 95%CI of D; SEM=14.4%). Sensor-based STS peak power demonstrated excellent test-retest reliability and may therefore be useful for clinical assessment of functional status and fall risk. Copyright © 2014 Elsevier B.V. All rights reserved.
Iversen, Jonas Vestergård; Bartels, Else Marie; Langberg, Henning
2012-02-01
Achilles tendinopathy (AT) is a common pathology and the aetiology is unknown. For valid and reliable assessment The Victorian Institute of Sports Assessment has designed a self-administered Achilles questionnaire, the VISA-A. The aim of the present study was to evaluate VISA-A as an outcome measure in patients with AT. A systematic search of the literature was conducted using MEDLINE, EMBASE, CINAHL, PEDro, Web of Science, and Cochrane Controlled trials to identify trials using VISA-A for patients with AT. This was followed by data mining and analysis of the obtained data. Twenty-six clinical trials containing 1336 individuals were included. Overall mean VISA-A scores ranged from 24 (severe AT) to 100 (healthy). Mean VISA-A scores in patients with AT ranged from 24 to 96.6. Healthy subjects scored a minimum of 96. Only two groups of participants from two different studies had a post-VISA-A score as high as healthy individuals, indicating full recovery of the AT. A VISA-A score lower than 24 is rarely attained in AT. Only few patients with AT reach an equivalent VISA-A score compared to uninjured healthy subjects following treatment. The VISA-A is a reliable tool when assessing AT patients, providing a good assessment of the actual condition from very poor, (score around 24) to excellent (a score of 90), which based on this systematic review and previous studies could be considered full recovery from AT.
ERIC Educational Resources Information Center
Barbu, Otilia C.; Levine-Donnerstein, Deborah; Marx, Ronald W.; Yaden, David B., Jr.
2013-01-01
This study examined reliability and validity of the Devereux Early Childhood Assessment (DECA), based on samples of parents and teachers' ratings of 1,145 entering kindergartners in the Southwest. Confirmatory factor analysis showed that DECA presented good reliability and validity for manifest variables, corroborating previous findings. Three…
Moewis, P; Boeth, H; Heller, M O; Yntema, C; Jung, T; Doyscher, R; Ehrig, R M; Zhong, Y; Taylor, W R
2014-07-01
The in vivo quantification of rotational laxity of the knee joint is of importance for monitoring changes in joint stability or the outcome of therapies. While invasive assessments have been used to study rotational laxity, non-invasive methods are attractive particularly for assessing young cohorts. This study aimed to determine the conditions under which tibio-femoral rotational laxity can be assessed reliably and accurately in a non-invasive manner. The reliability and error of non-invasive examinations of rotational joint laxity were determined by comparing the artefact associated with surface mounted markers against simultaneous measurements using fluoroscopy in five knees including healthy and ACL deficient joints. The knees were examined at 0°, 30°, 60° and 90° flexion using a device that allows manual axial rotation of the joint. With a mean RMS error of 9.6°, the largest inaccuracy using non-invasive assessment was present at 0° knee flexion, whereas at 90° knee flexion, a smaller RMS error of 5.7° was found. A Bland and Altman assessment indicated that a proportional bias exists between the non-invasive and fluoroscopic approaches, with limits of agreement that exceeded 20°. Correction using average linear regression functions resulted in a reduction of the RMS error to below 1° and limits of agreement to less than ±1° across all knees and flexion angles. Given the excellent reliability and the fact that a correction of the surface mounted marker based rotation values can be achieved, non-invasive evaluation of tibio-femoral rotation could offer opportunities for simplified devices for use in clinical settings in cases where invasive assessments are not justified. Although surface mounted marker based measurements tend to overestimate joint rotation, and therefore joint laxity, our results indicate that it is possible to correct for this error. Copyright © 2014 IPEM. Published by Elsevier Ltd. All rights reserved.
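The two analysis steps mentioned here, a Bland and Altman comparison and a linear-regression correction of the marker-based values, can be sketched as follows. This is only an illustration under simplifying assumptions: the data are invented, a single ordinary least-squares line is fitted, and the study's own averaging of regression functions across knees is not reproduced.

```python
from math import sqrt
from statistics import mean, stdev


def bland_altman(a, b):
    """Return (bias, lower limit, upper limit) using bias +/- 1.96 SD."""
    diffs = [x - y for x, y in zip(a, b)]
    bias = mean(diffs)
    half_width = 1.96 * stdev(diffs)
    return bias, bias - half_width, bias + half_width


def fit_line(x, y):
    """Ordinary least-squares fit y = slope * x + intercept."""
    mx, my = mean(x), mean(y)
    sxx = sum([(xi - mx) ** 2 for xi in x])
    sxy = sum([(xi - mx) * (yi - my) for xi, yi in zip(x, y)])
    slope = sxy / sxx
    return slope, my - slope * mx


def rms_error(a, b):
    return sqrt(mean([(x - y) ** 2 for x, y in zip(a, b)]))


# Hypothetical rotations in degrees: markers overestimate fluoroscopy
marker_deg = [10.0, 20.0, 30.0, 40.0]
fluoro_deg = [6.0, 11.0, 16.0, 21.0]

bias, lower, upper = bland_altman(marker_deg, fluoro_deg)
slope, intercept = fit_line(marker_deg, fluoro_deg)
corrected_deg = [slope * m + intercept for m in marker_deg]
rms_before = rms_error(marker_deg, fluoro_deg)
rms_after = rms_error(corrected_deg, fluoro_deg)
```

In this toy data the difference grows with the measured rotation, which is the pattern a proportional bias produces and the reason a linear correction removes most of the error.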
Borotikar, Bhushan; Lempereur, Mathieu; Lelievre, Mathieu; Burdin, Valérie; Ben Salem, Douraied; Brochard, Sylvain
2017-01-01
To report evidence for the concurrent validity and reliability of dynamic MRI techniques to evaluate in vivo joint and muscle mechanics, and to propose recommendations for their use in the assessment of normal and impaired musculoskeletal function. The search was conducted on articles published in Web of science, PubMed, Scopus, Academic search Premier, and Cochrane Library between 1990 and August 2017. Studies that reported the concurrent validity and/or reliability of dynamic MRI techniques for in vivo evaluation of joint or muscle mechanics were included after assessment by two independent reviewers. Selected articles were assessed using an adapted quality assessment tool and a data extraction process. Results for concurrent validity and reliability were categorized as poor, moderate, or excellent. Twenty articles fulfilled the inclusion criteria with a mean quality assessment score of 66% (±10.4%). Concurrent validity and/or reliability of eight dynamic MRI techniques were reported, with the knee being the most evaluated joint (seven studies). Moderate to excellent concurrent validity and reliability were reported for seven out of eight dynamic MRI techniques. Cine phase contrast and real-time MRI appeared to be the most valid and reliable techniques to evaluate joint motion, and spin tag for muscle motion. Dynamic MRI techniques are promising for the in vivo evaluation of musculoskeletal mechanics; however results should be evaluated with caution since validity and reliability have not been determined for all joints and muscles, nor for many pathological conditions.
Nagle, Elizabeth F; Sanders, Mary E; Gibbs, Bethany B; Franklin, Barry A; Nagle, Jacquelyn A; Prins, Philip J; Johnson, Caleb D; Robertson, Robert J
2017-06-01
A standardized fitness assessment is critical for the development of an individualized exercise prescription. Although the benefits of aquatic exercise have been well established, there remains the need for a standardized nonswimming protocol to accurately assess cardiorespiratory fitness (CRF) in shallow water. The present investigation was designed to assess (a) the reliability of a standardized shallow water run (SWR) test of CRF and (b) the accuracy of a standardized SWR compared with a land-based treadmill (LTM) test. Twenty-three healthy women (20 ± 3 years), with body mass index (23.5 ± 3 kg·m⁻²), performed 2 shallow water peak oxygen consumption (V̇O2peak) running tests (SWRa and SWRb), and 1 V̇O2max LTM. Intraclass correlation coefficients indicated moderately strong reliability for V̇O2peak (ml·kg⁻¹·min⁻¹) (r = 0.73, p < 0.01), HRpeak (b·min⁻¹) (r = 0.82; p < 0.01), and O2pulse (V̇O2 [ml·kg⁻¹·min⁻¹]·HR [b·min⁻¹]⁻¹) (r = 0.77, p < 0.01). Using paired t-tests and Pearson's correlations, SWR V̇O2peak and HRpeak were significantly lower than during LTM (p ≤ 0.05) and showed moderate correlations of 0.60 and 0.58 (p < 0.001) to LTM. O2pulse was similar (p > 0.05) for the SWR and LTM tests with a moderate correlation of 0.63. A standardized SWR test as a measure of CRF is a reliable, and to some degree, valid alternative to conventional protocols and may be used by strength and conditioning professionals to measure program outcomes and monitor training progress. Furthermore, this protocol provides a water-based option for CRF assessment among healthy women and offers insight toward the development of an effective protocol that can accommodate individuals with limited mobility, or those seeking less musculoskeletal impact from traditional land-based types of training.
An Evaluation of Test Speededness in an Assessment for Third-Grade Gifted Students
ERIC Educational Resources Information Center
Hailey, Emily; Callahan, Carolyn M.; Azano, Amy; Moon, Tonya R.
2012-01-01
Reliability and validity are integral concepts in assessment design. Test speededness, the influence of time constraints on test taker performance, is often an overlooked threat to reliability and validity, especially in classroom-based testing. The purpose of this study is to evaluate the degree of test speededness of classroom-based assessments…
Foerster, Rebecca M.; Poth, Christian H.; Behler, Christian; Botsch, Mario; Schneider, Werner X.
2016-01-01
Neuropsychological assessment of human visual processing capabilities strongly depends on visual testing conditions including room lighting, stimuli, and viewing-distance. This limits standardization, threatens reliability, and prevents the assessment of core visual functions such as visual processing speed. Increasingly available virtual reality devices make it possible to address these problems. One such device is the portable, light-weight, and easy-to-use Oculus Rift. It is head-mounted and covers the entire visual field, thereby shielding and standardizing the visual stimulation. A fundamental prerequisite for using Oculus Rift for neuropsychological assessment is sufficient test-retest reliability. Here, we compare the test-retest reliabilities of Bundesen’s visual processing components (visual processing speed, threshold of conscious perception, capacity of visual working memory) as measured with Oculus Rift and a standard CRT computer screen. Our results show that Oculus Rift makes it possible to measure the processing components as reliably as the standard CRT. This means that Oculus Rift is applicable for standardized and reliable assessment and diagnosis of elementary cognitive functions in laboratory and clinical settings. Oculus Rift thus provides the opportunity to compare visual processing components between individuals and institutions and to establish statistical norm distributions. PMID:27869220
Hulteen, Ryan M; Lander, Natalie J; Morgan, Philip J; Barnett, Lisa M; Robertson, Samuel J; Lubans, David R
2015-10-01
It has been suggested that young people should develop competence in a variety of 'lifelong physical activities' to ensure that they can be active across the lifespan. The primary aim of this systematic review is to report the methodological properties, validity, reliability, and test duration of field-based measures that assess movement skill competency in lifelong physical activities. A secondary aim was to clearly define those characteristics unique to lifelong physical activities. A search of four electronic databases (Scopus, SPORTDiscus, ProQuest, and PubMed) was conducted between June 2014 and April 2015 with no date restrictions. Studies addressing the validity and/or reliability of lifelong physical activity tests were reviewed. Included articles were required to assess lifelong physical activities using process-oriented measures, as well as report either one type of validity or reliability. Assessment criteria for methodological quality were adapted from a checklist used in a previous review of sport skill outcome assessments. Movement skill assessments for eight different lifelong physical activities (badminton, cycling, dance, golf, racquetball, resistance training, swimming, and tennis) in 17 studies were identified for inclusion. Methodological quality, validity, reliability, and test duration (time to assess a single participant), for each article were assessed. Moderate to excellent reliability results were found in 16 of 17 studies, with 71% reporting inter-rater reliability and 41% reporting intra-rater reliability. Only four studies in this review reported test-retest reliability. Ten studies reported validity results; content validity was cited in 41% of these studies. Construct validity was reported in 24% of studies, while criterion validity was only reported in 12% of studies. Numerous assessments for lifelong physical activities may exist, yet only assessments for eight lifelong physical activities were included in this review. 
Generalizability of results may be more applicable if more heterogeneous samples are used in future research. Moderate to excellent levels of inter- and intra-rater reliability were reported in the majority of studies. However, future work should look to establish test-retest reliability. Validity was less commonly reported than reliability, and further types of validity other than content validity need to be established in future research. Specifically, predictive validity of 'lifelong physical activity' movement skill competency is needed to support the assertion that such activities provide the foundation for a lifetime of activity.
ERIC Educational Resources Information Center
Lee, Guemin; Park, In-Yong
2012-01-01
Previous assessments of the reliability of test scores for testlet-composed tests have indicated that item-based estimation methods overestimate reliability. This study was designed to address issues related to the extent to which item-based estimation methods overestimate the reliability of test scores composed of testlets and to compare several…
Wang, Min; Wang, Qinglian; Zhang, Baohong
2013-11-01
Reference genes are critical for normalization of the gene expression level of target genes. The widely used housekeeping genes may change their expression levels in different tissues under different treatment or stress conditions. Therefore, systematic evaluation of the housekeeping genes is required for gene expression analysis. To date, no work has been performed to evaluate the housekeeping genes in cotton under stress treatment. In this study, we chose 10 housekeeping genes to systematically assess their expression levels in two different tissues (leaves and roots) under two different abiotic stresses (salt and drought) with three different concentrations. Our results show that there is no best reference gene for all tissues under all stress conditions. The reliable reference gene should be selected based on a specific condition. For example, under salt stress, UBQ7, GAPDH and EF1A8 were better reference genes in leaves; TUA10, UBQ7, CYP1, GAPDH and EF1A8 were better in roots. Under drought stress, UBQ7, EF1A8, TUA10, and GAPDH showed less variation in expression level in leaves and roots. Thus, it is better to identify reliable reference genes first before performing any gene expression analysis. However, using a combination of housekeeping genes as the reference may provide a new strategy for normalization of gene expression. In this study, we found that a combination of four housekeeping genes worked well as reference genes under all the stress conditions. © 2013.
Evidence-based Assessment in Pediatric Psychology: Family Measures
Fiese, Barbara H.; Gold, Jeffrey I.; Cutuli, J. J.; Holmbeck, Grayson N.; Goldbeck, Lutz; Chambers, Christine T.; Abad, Mona; Spetter, Dante; Patterson, Joän
2008-01-01
Objective: To provide a review of the evidence base of family measures relevant to pediatric psychology. Method: Twenty-nine family measures were selected based upon endorsement by Division 54 listserv members, expert judgment, and literature review. Spanning observational and self-report methods, the measures fell into three broad assessment categories: Family functioning, Dyadic family relationships, and Family functioning in the context of childhood chronic health conditions. Measures were categorized as: “Well-established”, “Approaching well-established”, or “Promising.” Results: Nineteen measures met “well-established” criteria and the remaining ten were “approaching well-established.” “Well-established” measures were documented for each of the broad assessment categories named above. Conclusions: Many measures deemed “well-established” in the general population are proving to be reliable and useful in pediatric samples. More evidence of the validity of family measures is needed in this context. This review should prove helpful to clinicians and researchers as they strive to make evidence-based decisions regarding family measures. PMID:17905801
Konge, L; Vilmann, P; Clementsen, P; Annema, J T; Ringsted, C
2012-10-01
Fine-needle aspiration (FNA) guided by endoscopic ultrasonography (EUS) is important in mediastinal staging of non-small cell lung cancer (NSCLC). Training standards and implementation strategies of this technique are currently under discussion. The aim of this study was to explore the reliability and validity of a newly developed EUS Assessment Tool (EUSAT) designed to measure competence in EUS - FNA for mediastinal staging of NSCLC. A total of 30 patients with proven or suspected NSCLC underwent EUS - FNA for mediastinal staging by three trainees and three experienced physicians. Their performances were assessed prospectively by three experts in EUS under direct observation and again 2 months later in a blinded fashion using digital video-recordings. Based on the assessments, intra-rater reliability, inter-rater reliability, and construct validity were explored. The intra-rater reliability was good (Cronbach's α = 0.80), but comparison of results based on direct observations and blinded video-recordings indicated a significant bias favoring consultants (P = 0.022). Inter-rater reliability was very good (Cronbach's α = 0.93). However, one rater assessing five procedures or two raters each assessing four procedures were necessary to secure a generalizability coefficient of 0.80. The assessment tool demonstrated construct validity by discriminating between trainees and experienced physicians (P = 0.034). Competency in mediastinal staging of NSCLC using EUS and EUS - FNA can be assessed in a reliable and valid way using the EUSAT assessment tool. Measuring and defining competency and training requirements could improve EUS quality and benefit patient care. © Georg Thieme Verlag KG Stuttgart · New York.
Birgfeld, Craig B; Heike, Carrie L; Saltzman, Babette S; Leroux, Brian G; Evans, Kelly N; Luquetti, Daniela V
2016-03-31
Craniofacial microsomia is a common congenital condition for which children receive longitudinal, multidisciplinary team care. However, little is known about the etiology of craniofacial microsomia and few outcome studies have been published. In order to facilitate large, multicenter studies in craniofacial microsomia, we assessed the reliability of phenotypic classification based on photographs by comparison with direct physical examination. Thirty-nine children with craniofacial microsomia underwent a physical examination and photographs according to a standardized protocol. Three clinicians completed ratings during the physical examination and, at least a month later, using respective photographs for each participant. We used descriptive statistics for participant characteristics and intraclass correlation coefficients (ICCs) to assess reliability. The agreement between ratings on photographs and physical exam was greater than 80 % for all 15 categories included in the analysis. The ICC estimates were higher than 0.6 for most features. Features with the highest ICC included: presence of epibulbar dermoids, ear abnormalities, and colobomas (ICC 0.85, 0.81, and 0.80, respectively). Orbital size, presence of pits, tongue abnormalities, and strabismus had the lowest ICC values (0.17 or less). There was not a strong tendency for either type of rating, physical exam or photograph, to be more likely to designate a feature as abnormal. The agreement between photographs and physical exam regarding the presence of a prior surgery was greater than 90 % for most features. Our results suggest that categorization of facial phenotype in children with CFM based on photographs is reliable relative to physical examination for most facial features.
Reliability Generalization of the Alcohol Use Disorder Identification Test.
ERIC Educational Resources Information Center
Shields, Alan L.; Caruso, John C.
2002-01-01
Evaluated the reliability of scores from the Alcohol Use Disorders Identification Test (AUDIT; J. Saunders and others, 1993) in a reliability generalization study based on 17 empirical journal articles. Results show AUDIT scores to be generally reliable for basic assessment. (SLD)
Fagbeja, Mofoluso A; Hill, Jennifer L; Chatterton, Tim J; Longhurst, James W S
2015-02-01
An assessment of the reliability of the Scanning Imaging Absorption Spectrometer for Atmospheric Cartography (SCIAMACHY) satellite sensor measurements to interpolate tropospheric concentrations of carbon monoxide considering the low-latitude climate of the Niger Delta region in Nigeria was conducted. Monthly SCIAMACHY carbon monoxide (CO) column measurements from January 2003 to December 2005 were interpolated using the ordinary kriging technique. The spatio-temporal variations observed in the reliability were based on proximity to the Atlantic Ocean, seasonal variations in the intensities of rainfall and relative humidity, the presence of dust particles from the Sahara desert, industrialization in Southwest Nigeria and biomass burning during the dry season in Northern Nigeria. Spatial reliabilities of 74 and 42 % are observed for the inland and coastal areas, respectively. Temporally, average reliabilities of 61 and 55 % occur during the dry and wet seasons, respectively. Reliability in the inland and coastal areas was 72 and 38 % during the wet season, and 75 and 46 % during the dry season, respectively. Based on the results, the WFM-DOAS SCIAMACHY CO data product used for this study is therefore relevant in the assessment of CO concentrations in developing countries within the low latitudes that cannot afford monitoring infrastructure because of its high cost. Although the SCIAMACHY sensor is no longer available, it provided cost-effective, reliable and accessible data that could support air quality assessment in developing countries.
Palmer, Kara K.
2017-01-01
Assessing children’s perceptions of their movement abilities (i.e., perceived competence) is traditionally done using picture scales—Pictorial Scale of Perceived Competence and Acceptance for Young Children or Pictorial Scale of Perceived Movement Skill Competence. Pictures fail to capture the temporal components of movement. To address this limitation, we created a digital-based instrument to assess perceived motor competence: the Digital Scale of Perceived Motor Competence. The purpose of this study was to determine the validity, reliability, and internal consistency of the Digital-based Scale of Perceived Motor Skill Competence. The Digital-based Scale of Perceived Motor Skill Competence is based on the twelve fundamental motor skills from the Test of Gross Motor Development-2nd Edition with a similar layout and item structure as the Pictorial Scale of Perceived Movement Skill Competence. Face Validity of the instrument was examined in Phase I (n = 56; Mage = 8.6 ± 0.7 years, 26 girls). Test-retest reliability and internal consistency were assessed in Phase II (n = 54, Mage = 8.7 years ± 0.5 years, 26 girls). Intra-class correlations (ICC) and Cronbach’s alpha were conducted to determine test-retest reliability and internal consistency for all twelve skills along with locomotor and object control subscales. The Digital Scale of Perceived Motor Competence demonstrates excellent test-retest reliability (ICC = 0.83, total; ICC = 0.77, locomotor; ICC = 0.79, object control) and acceptable/good internal consistency (α = 0.62, total; α = 0.57, locomotor; α = 0.49, object control). Findings provide evidence of the reliability of the three level digital-based instrument of perceived motor competence for older children. PMID:29910408
Anguera, J A; Brandes-Aitken, A N; Rolle, C E; Skinner, S N; Desai, S S; Bower, J D; Martucci, W E; Chung, W K; Sherr, E H; Marco, E J
2016-01-01
Assessing cognitive abilities in children is challenging for two primary reasons: lack of testing engagement can lead to low testing sensitivity and inherent performance variability. Here we sought to explore whether an engaging, adaptive digital cognitive platform built to look and feel like a video game would reliably measure attention-based abilities in children with and without neurodevelopmental disabilities related to a known genetic condition, 16p11.2 deletion. We assessed 20 children with 16p11.2 deletion, a genetic variation implicated in attention deficit/hyperactivity disorder and autism, as well as 16 siblings without the deletion and 75 neurotypical age-matched children. Deletion carriers showed significantly slower response times and greater response variability when compared with all non-carriers; by comparison, traditional non-adaptive selective attention assessments were unable to discriminate group differences. This phenotypic characterization highlights the potential power of administering tools that integrate adaptive psychophysical mechanics into video-game-style mechanics to achieve robust, reliable measurements. PMID:27648915
Inter-rater agreement in evaluation of disability: systematic review of reproducibility studies.
Barth, Jürgen; de Boer, Wout E L; Busse, Jason W; Hoving, Jan L; Kedzia, Sarah; Couban, Rachel; Fischer, Katrin; von Allmen, David Y; Spanjer, Jerry; Kunz, Regina
2017-01-25
To explore agreement among healthcare professionals assessing eligibility for work disability benefits. Systematic review and narrative synthesis of reproducibility studies. Medline, Embase, and PsycINFO searched up to 16 March 2016, without language restrictions, and review of bibliographies of included studies. Observational studies investigating reproducibility among healthcare professionals performing disability evaluations using a global rating of working capacity and reporting inter-rater reliability by a statistical measure or descriptively. Studies could be conducted in insurance settings, where decisions on ability to work include normative judgments based on legal considerations, or in research settings, where decisions on ability to work disregard normative considerations. Teams of paired reviewers identified eligible studies, appraised their methodological quality and generalisability, and abstracted results with pretested forms. As heterogeneity of research designs and findings impeded a quantitative analysis, a descriptive synthesis stratified by setting (insurance or research) was performed. From 4562 references, 101 full text articles were reviewed. Of these, 16 studies conducted in an insurance setting and seven in a research setting, performed in 12 countries, met the inclusion criteria. Studies in the insurance setting were conducted with medical experts assessing claimants who were actual disability claimants or played by actors, hypothetical cases, or short written scenarios. Conditions were mental (n=6, 38%), musculoskeletal (n=4, 25%), or mixed (n=6, 38%). Applicability of findings from studies conducted in an insurance setting to real life evaluations ranged from generalisable (n=7, 44%) and probably generalisable (n=3, 19%) to probably not generalisable (n=6, 37%). Median inter-rater reliability among experts was 0.45 (range intraclass correlation coefficient 0.86 to κ −0.10). 
Inter-rater reliability was poor in six studies (37%) and excellent in only two (13%). This contrasts with studies conducted in the research setting, where the median inter-rater reliability was 0.76 (range 0.91-0.53), and 71% (5/7) studies achieved excellent inter-rater reliability. Reliability between assessing professionals was higher when the evaluation was guided by a standardised instrument (23 studies, P=0.006). No such association was detected for subjective or chronic health conditions or the studies' generalisability to real world evaluation of disability (P=0.46, 0.45, and 0.65, respectively). Despite their common use and far reaching consequences for workers claiming disabling injury or illness, research on the reliability of medical evaluations of disability for work is limited and indicates high variation in judgments among assessing professionals. Standardising the evaluation process could improve reliability. Development and testing of instruments and structured approaches to improve reliability in evaluation of disability are urgently needed. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions.
NASA Astrophysics Data System (ADS)
Lanni, Cristiano; Mazzorana, Bruno; Volcan, Claudio; Bertagnolli, Rudi
2015-04-01
Flood hazard is generally assessed by assuming the return period of the rainfall as a proxy for the return period of the discharge and the related hydrograph. Frequently this deterministic view is extended also to the straightforward application of hydrodynamic models. However, the climate (i.e. precipitation), the catchment (i.e. geology, soil and antecedent soil-moisture condition) and the anthropogenic (i.e. drainage system and its regulation) systems interact in a complex way, and the occurrence probability of a flood inundation event can significantly differ from the occurrence probability of the triggering event (i.e. rainfall). In order to reliably determine the spatial patterns of flood intensities and probabilities, the rigorous determination of flood event scenarios is beneficial because it provides a clear, rational method to recognize and unveil the inherent stochastic behavior of natural processes. Therefore, a multi-scenario approach for hazard assessment should be applied and should consider the possible events taking place in the area potentially subject to flooding (i.e. floodplains). Here, we apply a multi-scenario approach for the assessment of the flood hazard around the Idro lake (Italy). We consider and estimate the probability of occurrence of several scenarios related to the initial (i.e. initial water level in the lake) and boundary (i.e. shape of the hydrograph, downslope drainage, spillway opening operations) conditions characterizing the lake. Finally, we discuss the advantages and issues of the presented methodological procedure compared to traditional (and essentially deterministic) approaches.
Validation of a method for assessing resident physicians' quality improvement proposals.
Leenstra, James L; Beckman, Thomas J; Reed, Darcy A; Mundell, William C; Thomas, Kris G; Krajicek, Bryan J; Cha, Stephen S; Kolars, Joseph C; McDonald, Furman S
2007-09-01
Residency programs involve trainees in quality improvement (QI) projects to evaluate competency in systems-based practice and practice-based learning and improvement. Valid approaches to assess QI proposals are lacking. We developed an instrument for assessing resident QI proposals, the Quality Improvement Proposal Assessment Tool (QIPAT-7), and determined its validity and reliability. QIPAT-7 content was initially obtained from a national panel of QI experts. Through an iterative process, the instrument was refined, pilot-tested, and revised. Seven raters used the instrument to assess 45 resident QI proposals. Principal factor analysis was used to explore the dimensionality of instrument scores. Cronbach's alpha and intraclass correlations were calculated to determine internal consistency and interrater reliability, respectively. QIPAT-7 items comprised a single factor (eigenvalue = 3.4) suggesting a single assessment dimension. Interrater reliability for each item (range 0.79 to 0.93) and internal consistency reliability among the items (Cronbach's alpha = 0.87) were high. This method for assessing resident physician QI proposals is supported by content and internal structure validity evidence. QIPAT-7 is a useful tool for assessing resident QI proposals. Future research should determine the reliability of QIPAT-7 scores in other residency and fellowship training programs. Correlations should also be made between assessment scores and criteria for QI proposal success such as implementation of QI proposals, resident scholarly productivity, and improved patient outcomes.
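Cronbach's alpha, used above as the internal consistency measure, can be computed from item variances and the variance of the total score. The following is a minimal sketch; the function name `cronbach_alpha` and the scores in the usage note are hypothetical and are not QIPAT-7 data.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a score matrix.

    scores: list of rows, one row per assessed proposal, one column per item.
    Uses sample variances (n - 1 in the denominator) throughout.
    """
    n_items = len(scores[0])
    if n_items < 2:
        raise ValueError("need at least two items")

    # Variance of each item (column) across proposals.
    item_vars = [variance([row[j] for row in scores]) for j in range(n_items)]
    # Variance of the summed score per proposal.
    total_var = variance([sum(row) for row in scores])
    if total_var == 0:
        raise ValueError("total scores have zero variance")

    return n_items / (n_items - 1) * (1 - sum(item_vars) / total_var)
```

As a usage illustration, `cronbach_alpha([[1, 2], [2, 4], [3, 3]])` evaluates two items over three proposals. Alpha rises as items covary more strongly, because the variance of the total then exceeds the sum of the item variances by a wider margin.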
What is the best method for assessing lower limb force-velocity relationship?
Giroux, C; Rabita, G; Chollet, D; Guilhem, G
2015-02-01
This study determined the concurrent validity and reliability of force, velocity and power measurements provided by accelerometry, linear position transducer and Samozino's methods, during loaded squat jumps. 17 subjects performed squat jumps on 2 separate occasions in 7 loading conditions (0-60% of the maximal concentric load). Force, velocity and power patterns were averaged over the push-off phase using accelerometry, linear position transducer and a method based on key positions measurements during squat jump, and compared to force plate measurements. Concurrent validity analyses indicated very good agreement with the reference method (CV=6.4-14.5%). Force, velocity and power patterns comparison confirmed the agreement with slight differences for high-velocity movements. The validity of measurements was equivalent for all tested methods (r=0.87-0.98). Bland-Altman plots showed a lower agreement for velocity and power compared to force. Mean force, velocity and power were reliable for all methods (ICC=0.84-0.99), especially for Samozino's method (CV=2.7-8.6%). Our findings showed that present methods are valid and reliable in different loading conditions and permit between-session comparisons and characterization of training-induced effects. While linear position transducer and accelerometer allow for examining the whole time-course of kinetic patterns, Samozino's method benefits from a better reliability and ease of processing. © Georg Thieme Verlag KG Stuttgart · New York.
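The Bland-Altman comparison mentioned above summarises agreement between a test method and a reference method by the mean difference (bias) and the 95% limits of agreement. This is a minimal sketch under the usual assumption of approximately normal differences; the function name `bland_altman` and the values in the usage note are hypothetical, not the study's force plate data.

```python
from statistics import mean, stdev


def bland_altman(method, reference):
    """Return (bias, lower_limit, upper_limit) for paired measurements.

    Limits of agreement are bias +/- 1.96 * SD of the paired differences,
    which assumes the differences are roughly normally distributed.
    """
    if len(method) != len(reference):
        raise ValueError("paired measurements must have equal length")
    if len(method) < 2:
        raise ValueError("need at least two pairs")

    diffs = [m - r for m, r in zip(method, reference)]
    bias = mean(diffs)
    sd = stdev(diffs)  # sample standard deviation of the differences
    return bias, bias - 1.96 * sd, bias + 1.96 * sd
```

A call such as `bland_altman([10, 12, 14], [9, 12, 13])` returns the bias and the two limits. Wider limits for velocity and power than for force would correspond to the lower agreement the abstract reports for those variables.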
A prospective study assessing agreement and reliability of a geriatric evaluation.
Locatelli, Isabella; Monod, Stéfanie; Cornuz, Jacques; Büla, Christophe J; Senn, Nicolas
2017-07-19
The present study takes place within a geriatric program, aiming at improving the diagnosis and management of geriatric syndromes in primary care. Within this program it was of prime importance to be able to rely on a robust and reproducible geriatric consultation to use as a gold standard for evaluating a primary care brief assessment tool. The specific objective of the present study was thus to assess the agreement and reliability of a comprehensive geriatric consultation. The study was conducted at the outpatient clinic of the Service of Geriatric Medicine, University of Lausanne, Switzerland. All community-dwelling older persons aged 70 years and above were eligible. Patients were excluded if they had no primary care physician, were unable to speak French, or had already been assessed by a geriatrician within the last 12 months. A set of 9 geriatricians evaluated 20 patients. Each patient was assessed twice within a 2-month delay. Geriatric consultations were based on a structured evaluation process, leading to rating the following geriatric conditions: functional, cognitive, visual, and hearing impairment, mood disorders, risk of fall, osteoporosis, malnutrition, and urinary incontinence. Reliability and agreement estimates on each of these items were obtained using a three-way Intraclass Correlation and a three-way Observed Disagreement index. The latter allowed a decomposition of overall disagreement into disagreements due to each source of error variability (visit, rater and random). Agreement ranged between 0.62 and 0.85. For most domains, geriatrician-related error variability explained an important proportion of disagreement. Reliability ranged between 0 and 0.8. It was poor/moderate for visual impairment, malnutrition and risk of fall, and good/excellent for functional/cognitive/hearing impairment, osteoporosis, incontinence and mood disorders. 
Six out of nine items of the geriatric consultation described in this study (functional/cognitive/hearing impairment, osteoporosis, incontinence and mood disorders) present a good to excellent reliability and can safely be used as a reference (gold standard) to evaluate the diagnostic performance of a primary care brief assessment tool. More objective/significant measures are needed to improve reliability of malnutrition, visual impairment, and risk of fall assessment before they can serve as a safe gold standard of a primary care tool.
Brouillette, Robert M; Foil, Heather; Fontenot, Stephanie; Correro, Anthony; Allen, Ray; Martin, Corby K; Bruce-Keller, Annadora J; Keller, Jeffrey N
2013-01-01
While considerable knowledge has been gained through the use of established cognitive and motor assessment tools, there is a considerable interest and need for the development of a battery of reliable and validated assessment tools that provide real-time and remote analysis of cognitive and motor function in the elderly. Smartphones appear to be an obvious choice for the development of these "next-generation" assessment tools for geriatric research, although to date no studies have reported on the use of smartphone-based applications for the study of cognition in the elderly. The primary focus of the current study was to assess the feasibility, reliability, and validity of a smartphone-based application for the assessment of cognitive function in the elderly. A total of 57 non-demented elderly individuals were administered a newly developed smartphone application-based Color-Shape Test (CST) in order to determine its utility in measuring cognitive processing speed in the elderly. Validity of this novel cognitive task was assessed by correlating performance on the CST with scores on widely accepted assessments of cognitive function. Scores on the CST were significantly correlated with global cognition (Mini-Mental State Exam: r = 0.515, p<0.0001) and multiple measures of processing speed and attention (Digit Span: r = 0.427, p<0.0001; Trail Making Test: r = -0.651, p<0.00001; Digit Symbol Test: r = 0.508, p<0.0001). The CST was not correlated with naming and verbal fluency tasks (Boston Naming Test, Vegetable/Animal Naming) or memory tasks (Logical Memory Test). Test re-test reliability was observed to be significant (r = 0.726; p = 0.02). Together, these data are the first to demonstrate the feasibility, reliability, and validity of using a smartphone-based application for the purpose of assessing cognitive function in the elderly. 
The importance of these findings for the establishment of smartphone-based assessment batteries of cognitive and motor function in the elderly is discussed.
Update on MTTF figures for linear and rotary coolers of Thales Cryogenics
NASA Astrophysics Data System (ADS)
van de Groep, W.; van der Weijden, H.; van Leeuwen, R.; Benschop, T.; Cauquil, J. M.; Griot, R.
2012-06-01
Thales Cryogenics has an extensive background in delivering linear and rotary coolers for military, civil and space programs. In recent years several technical improvements have increased the lifetime of all Thales coolers, resulting in significantly higher Mean Time To Failure (MTTF) figures. This paper presents updated MTTF values for most of the products in our portfolio and explains the methodology used to arrive at these reliability figures. The differences between rotary and linear coolers will be highlighted, including the different failure modes influencing the lifetime under operational conditions. These updated reliability figures are based on extensive test results for both rotary and linear coolers as well as Weibull analysis, failure mode identification, various types of lifetime testing and field results of operational coolers. The impact of the cooler selection for typical applications will be outlined. This updated reliability approach will enable an improved tradeoff for cooler selection in applications where MTTF and a correct reliability assessment are key. Improved cooler selection and increased insight into cooler reliability will result in higher uptime and operability of equipment, lower risk of unexpected failures and lower cost of ownership.
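The link between a Weibull analysis and an MTTF figure can be sketched as follows. For a two-parameter Weibull lifetime distribution with scale η and shape β, the mean lifetime is η·Γ(1 + 1/β). The function names and the parameter values in the usage note are hypothetical and are not Thales cooler data; the abstract does not state which Weibull parameterisation was used.

```python
import math


def weibull_mttf(scale_hours, shape):
    """Mean time to failure of a two-parameter Weibull distribution.

    scale_hours: characteristic life eta (time by which about 63.2% fail).
    shape: beta; below 1 indicates early failures, above 1 indicates wear-out.
    """
    if scale_hours <= 0 or shape <= 0:
        raise ValueError("scale and shape must be positive")
    return scale_hours * math.gamma(1.0 + 1.0 / shape)


def weibull_reliability(t_hours, scale_hours, shape):
    """Probability of surviving beyond t_hours under the same model."""
    if t_hours < 0:
        raise ValueError("time must be non-negative")
    return math.exp(-((t_hours / scale_hours) ** shape))
```

With an illustrative scale of 1000 hours and shape 2, `weibull_mttf(1000.0, 2.0)` gives the mean life and `weibull_reliability(1000.0, 1000.0, 2.0)` gives the fraction still running at the characteristic life. The MTTF alone hides the shape parameter, which is one reason failure mode identification matters alongside the headline figure.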
ERIC Educational Resources Information Center
McGill, D. A.; van der Vleuten, C. P. M.; Clarke, M. J.
2011-01-01
Even though rater-based judgements of clinical competence are widely used, they are context sensitive and vary between individuals and institutions. To deal adequately with rater-judgement unreliability, evaluating the reliability of workplace rater-based assessments in the local context is essential. Using such an approach, the primary intention…
The Reliability and Sources of Error of Using Rubrics-Based Assessment for Student Projects
ERIC Educational Resources Information Center
Menéndez-Varela, José-Luis; Gregori-Giralt, Eva
2018-01-01
Rubrics are widely used in higher education to assess performance in project-based learning environments. To date, the sources of error that may affect their reliability have not been studied in depth. Using generalisability theory as its starting-point, this article analyses the influence of the assessors and the criteria of the rubrics on the…
Sasaki, Hatoko; Kakee, Naoko; Morisaki, Naho; Mori, Rintaro; Ravens-Sieberer, Ulrike; Bullinger, Monika
2018-05-02
This study examined the reliability and validity of the Japanese versions of the DISABKIDS-37 generic modules, a tool for assessing the health-related quality of life (HRQOL) of children and adolescents with a chronic condition. The study was conducted using a sample of 123 children/adolescents with a chronic medical condition, aged 8-18 years, and their parents. Focus interviews were performed to ensure content validity after translation. The classical psychometric tests were used to assess reliability and scale intercorrelations. The factor structure was examined with confirmatory factor analysis (CFA). Convergent validity was assessed by the correlation between the total score and the sub-scales of DISABKIDS-37 as well as the total score of KIDSCREEN-10. Both the children/adolescent and parent versions of the score showed good to high internal consistency, and the test-retest reliability correlations were r = 0.91 or above. The CFA revealed that the modified models for all domains were better fit than the original 37 item scale model for both self-report and proxy-report. Moderate to high positive correlations were found for the associations within DISABKIDS-37 sub-scales and between the subscales and total score, except for the treatment sub-scale, which correlated weakly with the remaining sub-scales. The total score of the child-reported version of KIDSCREEN-10 correlated significantly and positively with the total score and all the sub-scales of the child-reported version of DISABKIDS-37 except the Treatment sub-scale in adolescents. The modified models of Japanese version of DISABKIDS generic module were psychometrically robust enough to assess the HRQOL of children with a chronic condition.
NASA Astrophysics Data System (ADS)
Sun, Kai; Wu, Hongfei; Cai, Yan; Xing, Yan
2014-06-01
A thermoelectric generator (TEG) is a very important kind of power supply for spacecraft, especially for deep-space missions, due to its long lifetime and high reliability. To develop a practical TEG power supply for spacecraft, a power conditioning stage is indispensable, being employed to convert the varying output voltage of the TEG modules to a definite voltage for feeding batteries or loads. To enhance the system reliability, a power conditioning stage based on analog-circuit maximum-power-point tracking (MPPT) control and a superbuck converter is proposed in this paper. The input of this power conditioning stage is connected to the output of the TEG modules, and the output of this stage is connected to the battery and loads. The superbuck converter is employed as the main circuit, featuring low input current ripples and high conversion efficiency. Since for spacecraft power systems reliable operation is the key target for control circuits, a reset-set flip-flop-based analog circuit is used as the basic control circuit to implement MPPT, being much simpler than digital control circuits and offering higher reliability. Experiments have verified the feasibility and effectiveness of the proposed power conditioning stage. The results show the advantages of the proposed stage, such as maximum utilization of TEG power, small input ripples, and good stability.
Akram, A J; Ireland, A J; Postlethwaite, K C; Sandy, J R; Jerreat, A S
2013-11-01
This article describes the process of validity and reliability testing of a condition-specific quality-of-life measure for patients with hypodontia presenting for orthodontic treatment. The development of the instrument is described in a previous article. Royal Devon and Exeter NHS Foundation Trust & Musgrove Park Hospital, Taunton. The child perception questionnaire was used as a standard against which to test criterion validity. The Bland and Altman method was used to check agreement between the two questionnaires. Construct validity was tested using principal component analysis on the four sections of the questionnaire. Test-retest reliability was tested using intraclass correlation coefficient and Bland and Altman method. Cronbach's alpha was used to test internal consistency reliability. Overall the questionnaire showed good reliability, criterion and construct validity. This together with previous evidence of good face and content validity suggests that the instrument may prove useful in clinical practice and further research. This study has demonstrated that the newly developed condition-specific quality-of-life questionnaire is both valid and reliable for use in young patients with hypodontia. © 2013 John Wiley & Sons A/S. Published by Blackwell Publishing Ltd.
Evans, Travis C; Britton, Jennifer C
2018-09-01
Abnormal threat-related attention in anxiety disorders is most commonly assessed and modified using the dot-probe paradigm; however, poor psychometric properties of reaction-time measures may contribute to inconsistencies across studies. Typically, standard attention measures are derived using average reaction-times obtained in experimentally-defined conditions. However, current approaches based on experimentally-defined conditions are limited. In this study, the psychometric properties of a novel response-based computation approach to analyze dot-probe data are compared to standard measures of attention. 148 adults (19.19 ± 1.42 years, 84 women) completed a standardized dot-probe task including threatening and neutral faces. We generated both standard and response-based measures of attention bias, attentional orientation, and attentional disengagement. We compared overall internal consistency, number of trials necessary to reach internal consistency, test-retest reliability (n = 72), and criterion validity obtained using each approach. Compared to standard attention measures, response-based measures demonstrated uniformly high levels of internal consistency with relatively few trials and varying improvements in test-retest reliability. Additionally, response-based measures demonstrated specific evidence of anxiety-related associations above and beyond both standard attention measures and other confounds. Future studies are necessary to validate this approach in clinical samples. Response-based attention measures demonstrate superior psychometric properties compared to standard attention measures, which may improve the detection of anxiety-related associations and treatment-related changes in clinical samples. Copyright © 2018 Elsevier Ltd. All rights reserved.
Chen, J D; Sun, H L
1999-04-01
Objective. To dynamically assess and predict the reliability of equipment by making full use of the various test information gathered during product development. Method. A new reliability growth assessment method based on the Army Materiel Systems Analysis Activity (AMSAA) model was developed. The method is composed of the AMSAA model and test data conversion technology. Result. The assessment and prediction results for a space-borne equipment item conform to expectations. Conclusion. It is suggested that this method should be further researched and popularized.
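The AMSAA model referred to above treats failures during development as a power-law process with expected cumulative failures λ·t^β. A minimal sketch of the standard maximum-likelihood fit for a time-terminated test follows. It covers only the basic model, not the test data conversion step the abstract describes, and the function names and failure times in the usage note are hypothetical.

```python
import math


def amsaa_fit(failure_times, total_time):
    """Maximum-likelihood estimates (lam, beta) for a time-terminated test.

    failure_times: cumulative test times at which failures occurred.
    total_time: cumulative test time T at which testing stopped.
    """
    n = len(failure_times)
    if n == 0:
        raise ValueError("need at least one failure")
    if any(t <= 0 or t > total_time for t in failure_times):
        raise ValueError("failure times must lie in (0, total_time]")

    log_sum = sum(math.log(total_time / t) for t in failure_times)
    if log_sum == 0:
        raise ValueError("all failures at total_time; beta is undefined")

    beta = n / log_sum               # beta < 1 indicates reliability growth
    lam = n / total_time ** beta
    return lam, beta


def instantaneous_mtbf(lam, beta, t):
    """Reciprocal of the failure intensity lam * beta * t**(beta - 1)."""
    return 1.0 / (lam * beta * t ** (beta - 1.0))
```

For instance, `amsaa_fit([10.0, 100.0], 1000.0)` fits two failures observed early in a 1000-hour test, and `instantaneous_mtbf(lam, beta, 1000.0)` then gives the demonstrated MTBF at the end of the test. A shape estimate below 1 means failures are arriving more slowly as testing proceeds.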
Reliability and risk assessment of structures
NASA Technical Reports Server (NTRS)
Chamis, C. C.
1991-01-01
Development of reliability and risk assessment of structural components and structures is a major activity at Lewis Research Center. It consists of five program elements: (1) probabilistic loads; (2) probabilistic finite element analysis; (3) probabilistic material behavior; (4) assessment of reliability and risk; and (5) probabilistic structural performance evaluation. Recent progress includes: (1) the evaluation of the various uncertainties in terms of cumulative distribution functions for various structural response variables based on known or assumed uncertainties in primitive structural variables; (2) evaluation of the failure probability; (3) reliability and risk-cost assessment; and (4) an outline of an emerging approach for eventual certification of man-rated structures by computational methods. Collectively, the results demonstrate that the structural durability/reliability of man-rated structural components and structures can be effectively evaluated by using formal probabilistic methods.
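The idea of evaluating a failure probability from uncertainties in primitive variables can be illustrated with a generic Monte Carlo sketch. This is not the Lewis Research Center methodology: it assumes a single limit state (load exceeds resistance) with independent normal variables, and all names and numbers are hypothetical.

```python
import random
from statistics import NormalDist


def failure_probability_mc(mean_r, sd_r, mean_s, sd_s, n_samples=200_000, seed=0):
    """Estimate P(load > resistance) by sampling independent normal variables."""
    rng = random.Random(seed)  # fixed seed so the estimate is reproducible
    failures = 0
    for _ in range(n_samples):
        resistance = rng.gauss(mean_r, sd_r)
        load = rng.gauss(mean_s, sd_s)
        if load > resistance:
            failures += 1
    return failures / n_samples


def failure_probability_exact(mean_r, sd_r, mean_s, sd_s):
    """Closed form for the same case: the margin R - S is itself normal."""
    margin = NormalDist(mean_r - mean_s, (sd_r ** 2 + sd_s ** 2) ** 0.5)
    return margin.cdf(0.0)  # probability that the safety margin is negative
```

Comparing `failure_probability_mc(10.0, 1.0, 7.0, 1.0)` with `failure_probability_exact(10.0, 1.0, 7.0, 1.0)` shows the sampling estimate approaching the closed form. Sampling is the option that remains available when the response comes from a finite element model rather than a simple formula.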
Perceptual attraction in tool use: evidence for a reliability-based weighting mechanism.
Debats, Nienke B; Ernst, Marc O; Heuer, Herbert
2017-04-01
Humans are well able to operate tools whereby their hand movement is linked, via a kinematic transformation, to a spatially distant object moving in a separate plane of motion. An everyday example is controlling a cursor on a computer monitor. Despite these separate reference frames, the perceived positions of the hand and the object were found to be biased toward each other. We propose that this perceptual attraction is based on the principles by which the brain integrates redundant sensory information of single objects or events, known as optimal multisensory integration. That is, 1) sensory information about the hand and the tool are weighted according to their relative reliability (i.e., inverse variances), and 2) the unisensory reliabilities sum up in the integrated estimate. We assessed whether perceptual attraction is consistent with optimal multisensory integration model predictions. We used a cursor-control tool-use task in which we manipulated the relative reliability of the unisensory hand and cursor position estimates. The perceptual biases shifted according to these relative reliabilities, with an additional bias due to contextual factors that were present in experiment 1 but not in experiment 2. The biased position judgments' variances were, however, systematically larger than the predicted optimal variances. Our findings suggest that the perceptual attraction in tool use results from a reliability-based weighting mechanism similar to optimal multisensory integration, but that certain boundary conditions for optimality might not be satisfied. NEW & NOTEWORTHY Kinematic tool use is associated with a perceptual attraction between the spatially separated hand and the effective part of the tool. We provide a formal account for this phenomenon, thereby showing that the process behind it is similar to optimal integration of sensory information relating to single objects. Copyright © 2017 the American Physiological Society.
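The two principles of optimal multisensory integration stated above (weights proportional to inverse variance, and reliabilities that add) can be written out as a short sketch. The function name and the example numbers are hypothetical. This is the textbook optimal model, whereas the study found observed variances larger than this prediction.

```python
def integrate_estimates(hand_pos, hand_var, cursor_pos, cursor_var):
    """Reliability-weighted combination of two position estimates.

    Reliability is the inverse of the variance. Returns the integrated
    position and its predicted (optimal) variance.
    """
    if hand_var <= 0 or cursor_var <= 0:
        raise ValueError("variances must be positive")

    r_hand = 1.0 / hand_var
    r_cursor = 1.0 / cursor_var

    # 1) Each cue is weighted by its relative reliability.
    w_hand = r_hand / (r_hand + r_cursor)
    w_cursor = 1.0 - w_hand
    position = w_hand * hand_pos + w_cursor * cursor_pos

    # 2) Reliabilities sum, so the integrated variance is the inverse of the sum.
    variance = 1.0 / (r_hand + r_cursor)
    return position, variance
```

For example, `integrate_estimates(0.0, 4.0, 10.0, 1.0)` combines a noisy hand estimate with a more reliable cursor estimate. The result lies closer to the cursor position, and its predicted variance is smaller than either input variance.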
Petscher, Yaacov; Mitchell, Alison M; Foorman, Barbara R
2015-01-01
A growing body of literature suggests that response latency, the amount of time it takes an individual to respond to an item, may be an important factor to consider when using assessment data to estimate the ability of an individual. Considering that tests of passage and list fluency are being adapted to a computer administration format, it is possible that accounting for individual differences in response times may be an increasingly feasible option to strengthen the precision of individual scores. The present research evaluated the differential reliability of scores when using classical test theory and item response theory as compared to a conditional item response model which includes response time as an item parameter. Results indicated that the precision of student ability scores increased by an average of 5 % when using the conditional item response model, with greater improvements for those who were average or high ability. Implications for measurement models of speeded assessments are discussed.
Petscher, Yaacov; Mitchell, Alison M.; Foorman, Barbara R.
2016-01-01
A growing body of literature suggests that response latency, the amount of time it takes an individual to respond to an item, may be an important factor to consider when using assessment data to estimate the ability of an individual. Considering that tests of passage and list fluency are being adapted to a computer administration format, it is possible that accounting for individual differences in response times may be an increasingly feasible option to strengthen the precision of individual scores. The present research evaluated the differential reliability of scores when using classical test theory and item response theory as compared to a conditional item response model which includes response time as an item parameter. Results indicated that the precision of student ability scores increased by an average of 5% when using the conditional item response model, with greater improvements for those who were average or high ability. Implications for measurement models of speeded assessments are discussed. PMID:27721568
ERIC Educational Resources Information Center
Williams, Harriet G.; Pfeiffer, Karin A.; Dowda, Marsha; Jeter, Chevy; Jones, Shaverra; Pate, Russell R.
2009-01-01
The purpose of this study was to develop a valid and reliable tool for use in assessing motor skills in preschool children in field-based settings. The development of the Children's Activity and Movement in Preschool Study Motor Skills Protocol included evidence of its reliability and validity for use in field-based environments as part of large…
NASA Astrophysics Data System (ADS)
Rahman, Abdul Ghaffar Abdul; Noroozi, Siamak; Dupac, Mihai; Mahathir Syed Mohd Al-Attas, Syed; Vinney, John E.
2013-03-01
Complex rotating machinery requires regular condition monitoring inspections to assess its running condition and structural integrity and to prevent catastrophic failures. Machine failures can be divided into two categories. The first is wear and tear during operation, ranging from bearing defects, gear damage, misalignment and imbalance to mechanical looseness, for which simple condition-based maintenance techniques can easily detect the root cause and trigger remedial action. The second is caused by inherent design faults, which usually arise for reasons such as improper installation, poor servicing, bad workmanship and deficient structural dynamics design. In fact, individual machine components are generally dynamically well designed and rigorously tested. However, when these machines are assembled on site and linked together, their dynamic characteristics change, causing unexpected behaviour of the system. Since nondestructive evaluation provides an excellent alternative to classical monitoring and has proved attractive because it allows reliable assessment of all types of machinery, the novel dynamic design verification procedure proposed here, based on a combination of in-service operating deflection shape measurement, experimental modal analysis and iterative inverse finite element analysis, allows quick identification of structural weakness and helps to provide and verify solutions.
NASA Astrophysics Data System (ADS)
Wallace, Jon Michael
2003-10-01
Reliability prediction of components operating in complex systems has historically been conducted in a statistically isolated manner. Current physics-based, i.e. mechanistic, component reliability approaches focus more on component-specific attributes and mathematical algorithms and not enough on the influence of the system. The result is that significant error can be introduced into the component reliability assessment process. The objective of this study is the development of a framework that infuses the needs and influence of the system into the process of conducting mechanistic-based component reliability assessments. The formulated framework consists of six primary steps. The first three steps, identification, decomposition, and synthesis, are primarily qualitative in nature and employ system reliability and safety engineering principles to construct an appropriate starting point for the component reliability assessment. The following two steps are the most unique. They involve a step to efficiently characterize and quantify the system-driven local parameter space and a subsequent step using this information to guide the reduction of the component parameter space. The local statistical space quantification step is accomplished using two proposed multivariate probability models: Multi-Response First Order Second Moment and Taylor-Based Inverse Transformation. Where existing joint probability models require preliminary distribution and correlation information of the responses, these models combine statistical information of the input parameters with an efficient sampling of the response analyses to produce the multi-response joint probability distribution. Parameter space reduction is accomplished using Approximate Canonical Correlation Analysis (ACCA) employed as a multi-response screening technique. 
The novelty of this approach is that each individual local parameter and even subsets of parameters representing entire contributing analyses can now be rank ordered with respect to their contribution to not just one response, but the entire vector of component responses simultaneously. The final step of the framework is the actual probabilistic assessment of the component. Although the same multivariate probability tools employed in the characterization step can be used for the component probability assessment, variations of this final step are given to allow for the utilization of existing probabilistic methods such as response surface Monte Carlo and Fast Probability Integration. The overall framework developed in this study is implemented to assess the finite-element based reliability prediction of a gas turbine airfoil involving several failure responses. Results of this implementation are compared to results generated using the conventional 'isolated' approach as well as a validation approach conducted through large sample Monte Carlo simulations. The framework resulted in a considerable improvement to the accuracy of the part reliability assessment and an improved understanding of the component failure behavior. Considerable statistical complexity in the form of joint non-normal behavior was found and accounted for using the framework. Future applications of the framework elements are discussed.
Tang, D Y Y; Liu, A C Y; Leung, M H T; Siu, B W M
2013-06-01
OBJECTIVE. Antisocial personality disorder (ASPD) is a risk factor for violence and is associated with poor treatment response when it is a co-morbid condition with substance abuse. It is an under-recognised clinical entity in the local Hong Kong setting, for which there are only a few available Chinese-language diagnostic instruments. None has been tested for its psychometric properties in the Cantonese-speaking population in Hong Kong. This study therefore aimed to assess the reliability and validity of the Chinese version of the ASPD subscale of the Structured Clinical Interview for the DSM-IV Axis II Disorders (SCID-II) in Hong Kong Chinese. METHODS. This assessment tool was modified according to dialectal differences between Mainland China and Hong Kong. Inpatients in Castle Peak Hospital, Hong Kong, who were designated for priority follow-up based on their assessed propensity for violence and who fulfilled the inclusion criteria for the study, were recruited. To assess the level of agreement, best-estimate diagnosis made by a multidisciplinary team was compared with diagnostic status determined by the SCID-II ASPD subscale. The internal consistency, sensitivity, and specificity of the subscale were also calculated. RESULTS. The internal consistency of the subscale was acceptable at 0.79, whereas the test-retest reliability and inter-rater reliability showed an excellent and good agreement of 0.90 and 0.86, respectively. Best-estimate clinical diagnosis-SCID diagnosis agreement was acceptable at 0.76. The sensitivity, specificity, positive and negative predictive values were 0.91, 0.86, 0.83, and 0.93, respectively. CONCLUSION. The Chinese version of the SCID-II ASPD subscale is reliable and valid for diagnosing ASPD in a Cantonese-speaking clinical population.
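The four diagnostic accuracy figures quoted in the abstract above all derive from a 2x2 table of instrument result against reference diagnosis. The sketch below shows the standard definitions; the function name and the counts in the usage line are hypothetical, since the abstract does not report the underlying cell counts.

```python
def diagnostic_accuracy(tp, fp, fn, tn):
    """Standard accuracy measures from a 2x2 table.

    tp/fn: reference-positive cases the instrument did / did not detect.
    tn/fp: reference-negative cases the instrument did / did not clear.
    """
    return {
        "sensitivity": tp / (tp + fn),  # detected among true cases
        "specificity": tn / (tn + fp),  # cleared among true non-cases
        "ppv": tp / (tp + fp),          # true cases among positive results
        "npv": tn / (tn + fn),          # true non-cases among negative results
    }


# Hypothetical counts for illustration only (not the study's data).
measures = diagnostic_accuracy(tp=10, fp=2, fn=1, tn=18)
```

Note that the two predictive values depend on how common the disorder is in the sample, so they would not carry over unchanged to a population with a different prevalence.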
Abanto, Jenny; Albites, Ursula; Bönecker, Marcelo; Paiva, Saul M; Castillo, Jorge L; Aguilar-Gálvez, Denisse
2015-12-01
The lack of a Family Impact Scale (FIS) in Spanish language limits its use as an indicator in Spanish-speaking countries and precludes comparisons with data from other cultural and ethnic groups. The purpose of this study was therefore to adapt the FIS cross-culturally to the Peruvian Spanish language and assess its reliability and validity. In order to translate and adapt the FIS cross-culturally, it was answered by 60 parents in two pilot tests, after which it was tested on 200 parents of children aged 11 to 14 years who were clinically examined for dental caries experience and malocclusions. Internal consistency was assessed by Cronbach's alpha coefficient while repeat administration of the FIS on the same 200 parents enabled the test-retest reliability to be assessed via intraclass correlation coefficient (ICC). Construct and discriminant validity were based on associations of the FIS with global ratings of oral health and clinical groups, respectively. Mean (standard deviation) FIS total score was 5.20 (5.86). Internal consistency was confirmed by Cronbach's alpha 0.84. Test-retest reliability revealed excellent reproducibility (ICC = 0.96). Construct validity was good, demonstrating statistically significant associations between total FIS score and global ratings of oral health (p=0.007) and overall wellbeing (p=0.002), as well as for the subscale scores (p<0.05) with exception of the financial burden subscale. The FIS was also able to discriminate between children with and without dental caries experience and malocclusions (p<0.05). Satisfactory psychometric results for the Peruvian Spanish FIS confirm it as a reliable, valid instrument for assessing the impact on the family caused by children's oral conditions. Sociedad Argentina de Investigación Odontológica.
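Internal consistency in the abstract above is summarised by Cronbach's alpha. A minimal sketch of the usual formula follows, assuming complete data arranged as one row per respondent and one column per item; the function name and the toy scores are hypothetical and unrelated to the FIS data.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondents, each a list of item scores.

    alpha = k / (k - 1) * (1 - sum(item variances) / variance(total score))
    """
    k = len(scores[0])                                   # number of items
    items = list(zip(*scores))                           # one tuple per item
    item_var_sum = sum(variance(item) for item in items)
    total_var = variance([sum(row) for row in scores])   # variance of totals
    return k / (k - 1) * (1.0 - item_var_sum / total_var)


# Toy data: two items that agree perfectly across four respondents.
alpha = cronbach_alpha([[1, 1], [2, 2], [3, 3], [4, 4]])
```

With perfectly agreeing items the coefficient reaches its maximum of 1; real questionnaires, such as the one reported above at 0.84, fall below that.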
Classification in childhood disability: focusing on function in the 21st century.
Rosenbaum, Peter; Eliasson, Ann-Christin; Hidecker, Mary Jo Cooley; Palisano, Robert J
2014-08-01
Classification systems in health care are usually based on current understanding of the condition. They are often derived empirically and adopted applying sound principles of measurement science to assess whether they are reliable (consistent) and valid (true) for the purposes to which they are applied. In the past 15 years, the authors have developed and validated classification systems for specific aspects of everyday function in people with cerebral palsy--gross motor function, manual abilities, and communicative function. This article describes the approaches used to conceptualize each aspect of function, develop the tools, and assess their reliability and validity. We report on the utility of each system with respect to clinical applicability, use of these tools for research, and the uptake and impact that they have had around the world. We hope that readers will find these accounts interesting, relevant, and applicable to their daily work with children and youth with disabilities. © The Author(s) 2014.
Reliability Evaluation of Machine Center Components Based on Cascading Failure Analysis
NASA Astrophysics Data System (ADS)
Zhang, Ying-Zhi; Liu, Jin-Tong; Shen, Gui-Xiang; Long, Zhe; Sun, Shu-Guang
2017-07-01
Traditional reliability evaluation of machine center components overlooks failure propagation, so the component reliability model deviates and the evaluated reliability is too low. To rectify these problems, a new reliability evaluation method based on cascading failure analysis and failure influenced degree assessment is proposed. A directed graph model of cascading failure among components is established according to cascading failure mechanism analysis and graph theory. The failure influenced degrees of the system components are assessed from the adjacency matrix and its transpose, combined with the PageRank algorithm. Based on the comprehensive failure probability function and the total probability formula, the inherent failure probability function is determined to realize the reliability evaluation of the system components. Finally, the method is applied to a machine center, and the results show the following: 1) the reliability evaluation values of the proposed method are at least 2.5% higher than those of the traditional method; 2) the difference between the comprehensive and inherent reliability of a system component is positively correlated with the failure influenced degree of that component, which provides a theoretical basis for reliability allocation of the machine center system.
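The abstract above ranks components by running PageRank on the adjacency matrix of a failure-propagation graph. The sketch below is a generic power-iteration PageRank in plain Python, not the authors' implementation; the function name, the damping value of 0.85 and the three-component example graph are assumptions made for illustration.

```python
def pagerank(adj, damping=0.85, iters=100):
    """Generic PageRank by power iteration.

    adj[i][j] > 0 means a failure of component i can propagate to component j.
    Returns one score per component; the scores sum to 1.
    """
    n = len(adj)
    rank = [1.0 / n] * n
    for _ in range(iters):
        new = [(1.0 - damping) / n] * n
        for i in range(n):
            out = sum(adj[i])
            if out == 0:
                # Component with no outgoing edges: spread its score evenly.
                for j in range(n):
                    new[j] += damping * rank[i] / n
            else:
                for j in range(n):
                    if adj[i][j]:
                        new[j] += damping * rank[i] * adj[i][j] / out
        rank = new
    return rank


# Hypothetical chain: component 0 affects 1, and 1 affects 2.
chain = [[0, 1, 0],
         [0, 0, 1],
         [0, 0, 0]]
influenced = pagerank(chain)
```

In this sketch a high score marks a component that many failures propagate into. Running the same function on the transposed matrix would instead score how strongly a component influences others, which is one plausible reading of why the abstract mentions both the matrix and its transpose.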
Travaglini, Davide; Fattorini, Lorenzo; Barbati, Anna; Bottalico, Francesca; Corona, Piermaria; Ferretti, Marco; Chirici, Gherardo
2013-04-01
A correct characterization of the status and trend of forest condition is essential to support reporting processes at national and international level. An international forest condition monitoring has been implemented in Europe since 1987 under the auspices of the International Co-operative Programme on Assessment and Monitoring of Air Pollution Effects on Forests (ICP Forests). The monitoring is based on harmonized methodologies, with individual countries being responsible for its implementation. Due to inconsistencies and problems in sampling design, however, the ICP Forests network is not able to produce reliable quantitative estimates of forest condition at European and sometimes at country level. This paper proposes (1) a set of requirements for status and change assessment and (2) a harmonized sampling strategy able to provide unbiased and consistent estimators of forest condition parameters and of their changes at both country and European level. Under the assumption that a common definition of forest holds among European countries, monitoring objectives, parameters of concern and accuracy indexes are stated. On the basis of fixed-area plot sampling performed independently in each country, an unbiased and consistent estimator of forest defoliation indexes is obtained at both country and European level, together with conservative estimators of their sampling variance and power in the detection of changes. The strategy adopts a probabilistic sampling scheme based on fixed-area plots selected by means of systematic or stratified schemes. Operative guidelines for its application are provided.
Lempereur, Mathieu; Lelievre, Mathieu; Burdin, Valérie; Ben Salem, Douraied; Brochard, Sylvain
2017-01-01
Purpose: To report evidence for the concurrent validity and reliability of dynamic MRI techniques to evaluate in vivo joint and muscle mechanics, and to propose recommendations for their use in the assessment of normal and impaired musculoskeletal function. Materials and methods: The search was conducted on articles published in Web of Science, PubMed, Scopus, Academic Search Premier, and Cochrane Library between 1990 and August 2017. Studies that reported the concurrent validity and/or reliability of dynamic MRI techniques for in vivo evaluation of joint or muscle mechanics were included after assessment by two independent reviewers. Selected articles were assessed using an adapted quality assessment tool and a data extraction process. Results for concurrent validity and reliability were categorized as poor, moderate, or excellent. Results: Twenty articles fulfilled the inclusion criteria with a mean quality assessment score of 66% (±10.4%). Concurrent validity and/or reliability of eight dynamic MRI techniques were reported, with the knee being the most evaluated joint (seven studies). Moderate to excellent concurrent validity and reliability were reported for seven out of eight dynamic MRI techniques. Cine phase contrast and real-time MRI appeared to be the most valid and reliable techniques to evaluate joint motion, and spin tag for muscle motion. Conclusion: Dynamic MRI techniques are promising for the in vivo evaluation of musculoskeletal mechanics; however, results should be evaluated with caution since validity and reliability have not been determined for all joints and muscles, nor for many pathological conditions. PMID:29232401
Retest reliability of force-time variables of neck muscles under isometric conditions.
Almosnino, Sivan; Pelland, Lucie; Stevenson, Joan M
2010-01-01
Proper conditioning of the neck muscles may play a role in reducing the risk of neck injury and, possibly, concussions in contact sports. However, the ability to reliably measure the force-time-based variables that might be relevant for this purpose has not been addressed. To assess the between-days reliability of discrete force-time-based variables of neck muscles during maximal voluntary isometric contractions in 5 directions. Cohort study. University research center. Twenty-six highly physically active men (age = 21.6 ± 2.1 years, height = 1.85 ± 0.09 m, mass = 81.6 ± 9.9 kg, head circumference = 0.58 ± 0.01 m, neck circumference = 0.39 ± 0.02 m). We used a custom-built testing apparatus to measure maximal voluntary isometric contractions of the neck muscles in 5 directions (extension, flexion, protraction, left lateral bending, and right lateral bending) on 2 separate occasions separated by 7 to 8 days. Variables measured were peak force (PF), rate of force development (RFD), and time to 50% of PF (T(50)PF). Reliability indices calculated for each variable comprised the difference in scores between the testing sessions, with corresponding 95% confidence intervals, the coefficient of variation of the typical error of measurement (CV(TE)), and intraclass correlation coefficients (ICC [3,3]). No evidence of systematic bias was detected for the dependent measures across any movement direction; retest differences in measurements were between 1.8% and 2.7%, with corresponding 95% confidence interval ranges of less than 10% and overlapping zero. The CV(TE) was lowest for PF (range, 2.4%-6.3%) across all testing directions, followed by RFD (range, 4.8%-9.0%) and T(50)PF (range, 7.1%-9.3%). The ICC score range for all dependent measures was 0.90 to 0.99. Discrete variables representative of the force-generating capacity of neck muscles under isometric conditions can be measured with an acceptable degree of reliability. 
This finding has possible applications for investigating the role of neck muscle strength-training programs in reducing the risk of injuries in sport settings.
Leddy, Abigail L; Crowner, Beth E; Earhart, Gammon M
2011-01-01
Gait impairments, balance impairments, and falls are prevalent in individuals with Parkinson disease (PD). Although the Berg Balance Scale (BBS) can be considered the reference standard for the determination of fall risk, it has a noted ceiling effect. Development of ceiling-free measures that can assess balance and are good at discriminating "fallers" from "nonfallers" is needed. The purpose of this study was to compare the Functional Gait Assessment (FGA) and the Balance Evaluation Systems Test (BESTest) with the BBS among individuals with PD and evaluate the tests' reliability, validity, and discriminatory sensitivity and specificity for fallers versus nonfallers. This was an observational study of community-dwelling individuals with idiopathic PD. The BBS, FGA, and BESTest were administered to 80 individuals with PD. Interrater reliability (n=15) was assessed by 3 raters. Test-retest reliability was based on 2 tests of participants (n=24), 2 weeks apart. Intraclass correlation coefficients (2,1) were used to calculate reliability, and Spearman correlation coefficients were used to assess validity. Cutoff points, sensitivity, and specificity were based on receiver operating characteristic plots. Test-retest reliability was .80 for the BBS, .91 for the FGA, and .88 for the BESTest. Interrater reliability was greater than .93 for all 3 tests. The FGA and BESTest were correlated with the BBS (r=.78 and r=.87, respectively). Cutoff scores to identify fallers were 47/56 for the BBS, 15/30 for the FGA, and 69% for the BESTest. The overall accuracy (area under the curve) for the BBS, FGA, and BESTest was .79, .80, and .85, respectively. Fall reports were retrospective. Both the FGA and the BESTest have reliability and validity for assessing balance in individuals with PD. The BESTest is most sensitive for identifying fallers.
Reliable and valid assessment of Lichtenstein hernia repair skills.
Carlsen, C G; Lindorff-Larsen, K; Funch-Jensen, P; Lund, L; Charles, P; Konge, L
2014-08-01
Lichtenstein hernia repair is a common surgical procedure and one of the first procedures performed by a surgical trainee. However, formal assessment tools developed for this procedure are few and sparsely validated. The aim of this study was to determine the reliability and validity of an assessment tool designed to measure surgical skills in Lichtenstein hernia repair. Key issues were identified through a focus group interview. On this basis, an assessment tool with eight items was designed. Ten surgeons and surgical trainees were video recorded while performing Lichtenstein hernia repair (four experts, three intermediates, and three novices). The videos were blindly and individually assessed by three raters (surgical consultants) using the assessment tool. Based on these assessments, validity and reliability were explored. The internal consistency of the items was high (Cronbach's alpha = 0.97). The inter-rater reliability was very good with an intra-class correlation coefficient (ICC) = 0.93. Generalizability analysis showed a coefficient above 0.8 even with one rater. The coefficient improved to 0.92 if three raters were used. One-way analysis of variance found a significant difference between the three groups, which indicates construct validity (p < 0.001). Lichtenstein hernia repair skills can be assessed blindly by a single rater in a reliable and valid fashion with the new procedure-specific assessment tool. We recommend this tool for future assessment of trainees performing Lichtenstein hernia repair to ensure that the objectives of competency-based surgical training are met.
Kuehnapfel, Andreas; Ahnert, Peter; Loeffler, Markus; Scholz, Markus
2017-02-01
Body surface area is a physiological quantity relevant for many medical applications. In clinical practice, it is determined by empirical formulae. 3D laser-based anthropometry provides an easy and effective way to measure body surface area but is not ubiquitously available. We used data from laser-based anthropometry from a population-based study to assess validity of published and commonly used empirical formulae. We performed a large population-based study on adults collecting classical anthropometric measurements and 3D body surface assessments (N = 1435). We determined reliability of the 3D body surface assessment and validity of 18 different empirical formulae proposed in the literature. The performance of these formulae is studied in subsets of sex and BMI. Finally, improvements of parameter settings of formulae and adjustments for sex and BMI were considered. 3D body surface measurements show excellent intra- and inter-rater reliability of 0.998 (overall concordance correlation coefficient, OCCC was used as measure of agreement). Empirical formulae of Fujimoto and Watanabe, Shuter and Aslani and Sendroy and Cecchini performed best with excellent concordance with OCCC > 0.949 even in subgroups of sex and BMI. Re-parametrization of formulae and adjustment for sex and BMI slightly improved results. In adults, 3D laser-based body surface assessment is a reliable alternative to estimation by empirical formulae. However, there are empirical formulae showing excellent results even in subgroups of sex and BMI with only little room for improvement.
Cramer, Emily
2016-01-01
Hospital performance reports often include rankings of unit pressure ulcer rates. Differentiating among units on the basis of quality requires reliable measurement. Our objectives were to describe and apply methods for assessing reliability of hospital‐acquired pressure ulcer rates and evaluate a standard signal‐noise reliability measure as an indicator of precision of differentiation among units. Quarterly pressure ulcer data from 8,199 critical care, step‐down, medical, surgical, and medical‐surgical nursing units from 1,299 US hospitals were analyzed. Using beta‐binomial models, we estimated between‐unit variability (signal) and within‐unit variability (noise) in annual unit pressure ulcer rates. Signal‐noise reliability was computed as the ratio of between‐unit variability to the total of between‐ and within‐unit variability. To assess precision of differentiation among units based on ranked pressure ulcer rates, we simulated data to estimate the probabilities of a unit's observed pressure ulcer rate rank in a given sample falling within five and ten percentiles of its true rank, and the probabilities of units with ulcer rates in the highest quartile and highest decile being identified as such. We assessed the signal‐noise measure as an indicator of differentiation precision by computing its correlations with these probabilities. Pressure ulcer rates based on a single year of quarterly or weekly prevalence surveys were too susceptible to noise to allow for precise differentiation among units, and signal‐noise reliability was a poor indicator of precision of differentiation. To ensure precise differentiation on the basis of true differences, alternative methods of assessing reliability should be applied to measures purported to differentiate among providers or units based on quality. © 2016 The Authors. Research in Nursing & Health published by Wiley Periodicals, Inc. PMID:27223598
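The signal-noise reliability described in the abstract above is a simple variance ratio. The sketch below writes that ratio out; the function name and the variance values in the usage line are hypothetical, and the beta-binomial estimation of the two variance components that the study performs is not reproduced here.

```python
def signal_noise_reliability(between_var, within_var):
    """Share of total variability that reflects true between-unit differences.

    between_var: between-unit variance (signal).
    within_var:  within-unit variance (noise).
    """
    return between_var / (between_var + within_var)


# Hypothetical variances: noise is three times the signal.
reliability = signal_noise_reliability(between_var=1.0, within_var=3.0)
```

The ratio lies between 0 and 1 and rises as noise shrinks. As the abstract cautions, a given value does not by itself indicate how precisely units can be ranked.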
Fast determination of royal jelly freshness by a chromogenic reaction.
Zheng, Huo-Qing; Wei, Wen-Ting; Wu, Li-Ming; Hu, Fu-Liang; Dietemann, Vincent
2012-06-01
Royal jelly is one of the most important products of honeybees. Given its role in development of bee brood into fertile individuals of the royal caste it is also used in health products for human consumption. Royal jelly spoils and loses its health-promoting properties depending on storage duration and conditions. To ensure product quality before selling, it is therefore necessary to assess royal jelly freshness. Many indexes of freshness have been suggested, but they all lack reliability or require complex and time-consuming analyses. Here we describe a method to detect royal jelly freshness based on a chromogenic reaction between royal jelly and HCl. We demonstrate that analyses based on color parameters allow for the discrimination of royal jelly samples based on the duration of their storage. Color parameters of royal jelly stored at -18 and 4 °C for 28 d remained comparable to that of fresh samples, which supports the reliability of the method. The method of freshness determination described is practical, cheap, and fast and can thus be used in real-time when trading royal jelly. The method developed can be used to assess royal jelly freshness. It is practical, cheap, and fast and can thus be used in real-time when trading royal jelly. © 2012 Institute of Food Technologists®
Palm, Peter; Josephson, Malin; Mathiassen, Svend Erik; Kjellberg, Katarina
2016-06-01
We evaluated the intra- and inter-observer reliability and criterion validity of an observation protocol, developed in an iterative process involving practicing ergonomists, for assessment of working technique during cash register work for the purpose of preventing upper extremity symptoms. Two ergonomists independently assessed 17 15-min videos of cash register work on two occasions each, as a basis for examining reliability. Criterion validity was assessed by comparing these assessments with meticulous video-based analyses by researchers. Intra-observer reliability was acceptable (i.e. proportional agreement >0.7 and kappa >0.4) for 10/10 questions. Inter-observer reliability was acceptable for only 3/10 questions. An acceptable inter-observer reliability combined with an acceptable criterion validity was obtained only for one working technique aspect, 'Quality of movements'. Thus, major elements of the cashiers' working technique could not be assessed with an acceptable accuracy from short periods of observations by one observer, such as often desired by practitioners. Practitioner Summary: We examined an observation protocol for assessing working technique in cash register work. It was feasible in use, but inter-observer reliability and criterion validity were generally not acceptable when working technique aspects were assessed from short periods of work. We recommend the protocol to be used for educational purposes only.
Reliability considerations of a fuel cell backup power system for telecom applications
NASA Astrophysics Data System (ADS)
Serincan, Mustafa Fazil
2016-03-01
A commercial fuel cell backup power unit is tested under real-life operating conditions at a base station of a Turkish telecom operator. The fuel cell system responds successfully to 256 of 260 electric power outages, providing the required power to the base station. Reliability of the fuel cell backup power unit is found to be 98.5% at the system level. In addition, a qualitative reliability analysis at the component level is carried out. Implications of the power management algorithm for reliability are discussed. Moreover, integration of the backup power unit into the base station ecosystem is reviewed in the context of reliability. The impact of inverter design on the stability of the output power is outlined. Significant current harmonics are encountered when a generic inverter is used. However, ripples are attenuated significantly when a custom-designed inverter is used. Further, fault conditions are considered for real-world case studies such as running out of hydrogen, a malfunction in the system, or an unprecedented operating scheme. Some design guidelines are suggested for hybridization of the backup power unit for uninterrupted operation.
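The system-level figure in the abstract above follows directly from the outage counts it reports, if reliability is read as the fraction of power outages that the unit handled successfully. The sketch below reproduces that arithmetic; the function and variable names are illustrative, and the reading of reliability as a success fraction is an assumption.

```python
def demand_reliability(successes, demands):
    """Fraction of demands (here: power outages) that were served successfully."""
    return successes / demands


# Counts taken from the abstract: 256 successful responses to 260 outages.
reliability = demand_reliability(256, 260)
print(f"{reliability:.1%}")  # prints "98.5%"
```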
Reliability, Validity and Usefulness of 30–15 Intermittent Fitness Test in Female Soccer Players
Čović, Nedim; Jelešković, Eldin; Alić, Haris; Rađo, Izet; Kafedžić, Erduan; Sporiš, Goran; McMaster, Daniel T.; Milanović, Zoran
2016-01-01
PURPOSE: The aim of this study was to examine the reliability, validity and usefulness of the 30–15IFT in competitive female soccer players. METHODS: Seventeen elite female soccer players participated in the study. A within subject test-retest study design was utilized to assess the reliability of the 30–15 intermittent fitness test (IFT). Seven days prior to 30–15IFT, subjects performed a continuous aerobic running test (CT) under laboratory conditions to assess the criterion validity of the 30–15IFT. End running velocity (VCT and VIFT), peak heart rate (HRpeak) and maximal oxygen consumption (VO2max) were collected and/or estimated for both tests. RESULTS: VIFT (ICC = 0.91; CV = 1.8%), HRpeak (ICC = 0.94; CV = 1.2%), and VO2max (ICC = 0.94; CV = 1.6%) obtained from the 30–15IFT were all deemed highly reliable (p > 0.05). Pearson product moment correlations between the CT and 30–15IFT for VO2max, HRpeak and end running velocity were large (r = 0.67, p = 0.013), very large (r = 0.77, p = 0.02) and large (r = 0.57, p = 0.042), respectively. CONCLUSION: Current findings suggest that the 30–15IFT is a valid and reliable intermittent aerobic fitness test of elite female soccer players. The findings have also provided practitioners with evidence to support the accurate detection of meaningful individual changes in VIFT of 0.5 km/h (1 stage) and HRpeak of 2 bpm. This information may assist coaches in monitoring “real” aerobic fitness changes to better inform training of female intermittent team sport athletes. Lastly, coaches could use the 30–15IFT as a practical alternative to laboratory based assessments to assess and monitor intermittent aerobic fitness changes in their athletes. PMID:27909408
Camera-based measurement of respiratory rates is reliable.
Becker, Christoph; Achermann, Stefan; Rocque, Mukul; Kirenko, Ihor; Schlack, Andreas; Dreher-Hummel, Thomas; Zumbrunn, Thomas; Bingisser, Roland; Nickel, Christian H
2017-06-01
Respiratory rate (RR) is one of the most important vital signs used to detect whether a patient is in critical condition. It is part of many risk scores and its measurement is essential for triage of patients in emergency departments. It is often not recorded as measurement is cumbersome and time-consuming. We intended to evaluate the accuracy of camera-based measurements as an alternative measurement to the current practice of manual counting. We monitored the RR of healthy male volunteers with a camera-based prototype application and simultaneously by manual counting and by capnography, which was considered the gold standard. The four assessors were mutually blinded. We simulated normoventilation, hypoventilation and hyperventilation as well as deep, normal and superficial breathing depths to assess potential clinical settings. The volunteers were assessed while being undressed, wearing a T-shirt or a winter coat. In total, 20 volunteers were included. The results of camera-based measurements of RRs and capnography were in close agreement throughout all clothing styles and respiratory patterns (Pearson's correlation coefficient, r=0.90-1.00, except for one scenario, in which the volunteer breathed slowly dressed in a winter coat r=0.84). In the winter-coat scenarios, the camera-based prototype application was superior to human counters. In our pilot study, we found that camera-based measurements delivered accurate and reliable results. Future studies need to show that camera-based measurements are a secure alternative for measuring RRs in clinical settings as well.
Validity and reliability assessment of a peer evaluation method in team-based learning classes.
Yoon, Hyun Bae; Park, Wan Beom; Myung, Sun-Jung; Moon, Sang Hui; Park, Jun-Bean
2018-03-01
Team-based learning (TBL) is increasingly employed in medical education because of its potential to promote active group learning. In TBL, learners are usually asked to assess the contributions of peers within their group to ensure accountability. The purpose of this study is to assess the validity and reliability of a peer evaluation instrument that was used in TBL classes in a single medical school. A total of 141 students were divided into 18 groups in 11 TBL classes. The students were asked to evaluate their peers in the group based on evaluation criteria that were provided to them. We analyzed the comments that were written for the highest and lowest achievers to assess the validity of the peer evaluation instrument. The reliability of the instrument was assessed by examining the agreement among peer ratings within each group of students via intraclass correlation coefficient (ICC) analysis. Most of the students provided reasonable and understandable comments for the high and low achievers within their group, and most of those comments were compatible with the evaluation criteria. The average ICC of each group ranged from 0.390 to 0.863, and the overall average was 0.659. There was no significant difference in inter-rater reliability according to the number of members in the group or the timing of the evaluation within the course. The peer evaluation instrument that was used in the TBL classes was valid and reliable. Providing evaluation criteria and rules seemed to improve the validity and reliability of the instrument.
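Agreement among peer ratings of the kind described above can be sketched as a one-way random-effects intraclass correlation. The abstract does not say which ICC form the authors used, so ICC(1,1) is an assumption here, and the scores are hypothetical.

```python
def icc_oneway(ratings):
    """ICC(1,1) from a one-way random-effects ANOVA.

    ratings: list of rows, one row per rated person, each row holding
    the k scores that person received from k raters.
    """
    n = len(ratings)
    k = len(ratings[0])
    grand = sum(sum(row) for row in ratings) / (n * k)
    row_means = [sum(row) / k for row in ratings]
    # Mean square between rated persons.
    ms_between = k * sum((m - grand) ** 2 for m in row_means) / (n - 1)
    # Mean square within rated persons (rater disagreement).
    ms_within = sum(
        (x - m) ** 2 for row, m in zip(ratings, row_means) for x in row
    ) / (n * (k - 1))
    return (ms_between - ms_within) / (ms_between + (k - 1) * ms_within)

# Hypothetical scores: 4 students, each rated by 3 peers.
scores = [[8, 9, 8], [5, 6, 5], [9, 9, 10], [4, 5, 3]]
print(round(icc_oneway(scores), 3))
```

An ICC near 1 means raters rank the same students high and low; values like the 0.390 reported for the weakest group indicate that much of the score variance comes from rater disagreement.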
Qin, D L; Jin, X N; Wang, S J; Wang, J J; Mamat, N; Wang, F J; Wang, Y; Shen, Z A; Sheng, L G; Forsman, M; Yang, L Y; Wang, S; Zhang, Z B; He, L H
2018-06-18
To develop a new assessment method that comprehensively evaluates workers' dynamic and static postural workload during the work process, to analyze its reliability and validity, and to study the relation between workers' postural workload and work-related musculoskeletal disorders (WMSDs). In the study, 844 workers from electronic and railway vehicle manufacturing factories were selected as subjects and surveyed using the China Musculoskeletal Questionnaire (CMQ) to form the postural workload comprehensive assessment method. Cronbach's α, cluster analysis and factor analysis were used to assess the reliability and validity of the new assessment method. Non-conditional Logistic regression was used to analyze the relation between workers' postural workload and WMSDs. Reliability of the assessment method for postural workload: internal consistency analysis results showed that Cronbach's α was 0.934 and the results of split-half reliability indicated that the Spearman-Brown coefficient was 0.881 and the correlation coefficient between the first part and the second was 0.787. Validity of the assessment method for postural workload: the results of cluster analysis indicated that the squared Euclidean distance between dynamic and static postural workload assessment in the same part or work posture was the shortest. The results of factor analysis showed that 2 components were extracted and the cumulative percentage of variance reached 65.604%. The postural workload scores of the different occupational groups differed significantly (P<0.05) by covariance analysis. The results of non-conditional Logistic regression indicated that alcohol intake (OR=2.141, 95%CI 1.337-3.428) and obesity (OR=3.408, 95%CI 1.629-7.130) were risk factors for WMSDs. The risk for WMSDs would rise as workers' postural workload rose (OR=1.035, 95%CI 1.022-1.048). 
The risk for WMSDs differed significantly among groups of workers distinguished by work type, gender and age. Female workers exhibited a higher prevalence of WMSDs (OR=2.626, 95%CI 1.414-4.879), as did workers aged 30-40 years compared with those under 30 (OR=1.909, 95%CI 1.237-2.946). This method for comprehensively assessing postural workload is reliable and effective when used with assembly workers, and there is a certain relation between postural workload and WMSDs.
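The split-half figures reported above are mutually consistent: applying the Spearman-Brown correction to the half-test correlation of 0.787 gives 2(0.787)/(1 + 0.787) ≈ 0.881. A minimal sketch of that check, together with the usual Cronbach's α formula, is given below. The function names are hypothetical and no questionnaire data from the study are reproduced.

```python
from statistics import variance

def spearman_brown(r_half):
    """Full-length reliability predicted from a split-half correlation."""
    return 2 * r_half / (1 + r_half)

def cronbach_alpha(rows):
    """Cronbach's alpha; rows = one list of item scores per respondent."""
    k = len(rows[0])
    item_vars = [variance([row[j] for row in rows]) for j in range(k)]
    total_var = variance([sum(row) for row in rows])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)

# Half-test correlation taken from the abstract.
print(round(spearman_brown(0.787), 3))  # prints 0.881
```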
APPLICATION OF TRAVEL TIME RELIABILITY FOR PERFORMANCE ORIENTED OPERATIONAL PLANNING OF EXPRESSWAYS
NASA Astrophysics Data System (ADS)
Mehran, Babak; Nakamura, Hideki
Evaluation of impacts of congestion improvement schemes on travel time reliability is very significant for road authorities since travel time reliability represents operational performance of expressway segments. In this paper, a methodology is presented to estimate travel time reliability prior to implementation of congestion relief schemes based on travel time variation modeling as a function of demand, capacity, weather conditions and road accidents. For subject expressway segments, traffic conditions are modeled over a whole year considering demand and capacity as random variables. Patterns of demand and capacity are generated for each five minute interval by applying Monte-Carlo simulation technique, and accidents are randomly generated based on a model that links accident rate to traffic conditions. A whole year analysis is performed by comparing demand and available capacity for each scenario and queue length is estimated through shockwave analysis for each time interval. Travel times are estimated from refined speed-flow relationships developed for intercity expressways and buffer time index is estimated consequently as a measure of travel time reliability. For validation, estimated reliability indices are compared with measured values from empirical data, and it is shown that the proposed method is suitable for operational evaluation and planning purposes.
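The pipeline described above (random demand and capacity per five-minute interval, travel time per interval, then a buffer time index) can be sketched in a few lines of Python. This is a rough illustration under stated assumptions, not the authors' model: a BPR-type volume-delay function stands in for their refined speed-flow relationships, the shockwave queue analysis and accident model are omitted, and all distribution parameters are hypothetical. The buffer time index is taken here as (95th-percentile travel time − mean travel time) / mean travel time.

```python
import random

def buffer_time_index(travel_times):
    """(95th-percentile travel time - mean) / mean, nearest-rank percentile."""
    ordered = sorted(travel_times)
    n = len(ordered)
    rank = (95 * n + 99) // 100  # ceil(0.95 * n) in integer arithmetic
    mean = sum(ordered) / n
    return (ordered[rank - 1] - mean) / mean

def simulate_travel_times(n_intervals, free_flow_min=10.0, seed=0):
    """Monte Carlo travel times; BPR-type delay function is an assumption."""
    rng = random.Random(seed)
    times = []
    for _ in range(n_intervals):
        demand = rng.gauss(1600, 300)    # veh/h, hypothetical
        capacity = rng.gauss(2000, 150)  # veh/h, hypothetical
        ratio = max(demand, 0.0) / max(capacity, 1.0)
        times.append(free_flow_min * (1 + 0.15 * ratio ** 4))
    return times

# One year of five-minute intervals: 12 per hour * 24 hours * 365 days.
times = simulate_travel_times(12 * 24 * 365)
print(round(buffer_time_index(times), 3))
```

Rerunning the simulation with a higher mean capacity would mimic a congestion relief scheme and show how the index responds before anything is built.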
Uncertainty and Intelligence in Computational Stochastic Mechanics
NASA Technical Reports Server (NTRS)
Ayyub, Bilal M.
1996-01-01
Classical structural reliability assessment techniques are based on precise and crisp (sharp) definitions of failure and non-failure (survival) of a structure in meeting a set of strength, function and serviceability criteria. These definitions are provided in the form of performance functions and limit state equations. Thus, the criteria provide a dichotomous definition of what real physical situations represent, in the form of abrupt change from structural survival to failure. However, based on observing the failure and survival of real structures according to the serviceability and strength criteria, the transition from a survival state to a failure state and from serviceability criteria to strength criteria are continuous and gradual rather than crisp and abrupt. That is, an entire spectrum of damage or failure levels (grades) is observed during the transition to total collapse. In the process, serviceability criteria are gradually violated with monotonically increasing level of violation, and progressively lead into the strength criteria violation. Classical structural reliability methods correctly and adequately include the ambiguity sources of uncertainty (physical randomness, statistical and modeling uncertainty) by varying amounts. However, they are unable to adequately incorporate the presence of a damage spectrum, and do not consider in their mathematical framework any sources of uncertainty of the vagueness type. Vagueness can be attributed to sources of fuzziness, unclearness, indistinctiveness, sharplessness and grayness; whereas ambiguity can be attributed to nonspecificity, one-to-many relations, variety, generality, diversity and divergence. Using the nomenclature of structural reliability, vagueness and ambiguity can be accounted for in the form of realistic delineation of structural damage based on subjective judgment of engineers. 
For situations that require decisions under uncertainty with cost/benefit objectives, the risk of failure should depend on the underlying level of damage and the uncertainties associated with its definition. A mathematical model for structural reliability assessment that includes both ambiguity and vagueness types of uncertainty was suggested to result in the likelihood of failure over a damage spectrum. The resulting structural reliability estimates properly represent the continuous transition from serviceability to strength limit states over the ultimate time exposure of the structure. In this section, a structural reliability assessment method based on a fuzzy definition of failure is suggested to meet these practical needs. A failure definition can be developed to indicate the relationship between failure level and structural response. In this fuzzy model, a subjective index is introduced to represent all levels of damage (or failure). This index can be interpreted as either a measure of failure level or a measure of a degree of belief in the occurrence of some performance condition (e.g., failure). The index allows expressing the transition state between complete survival and complete failure for some structural response based on subjective evaluation and judgment.
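The idea of a graded failure index can be illustrated with a small Python sketch. It is not the model from the report: the linear membership function, the normal distribution of the response, and both limit values are assumptions chosen for illustration. The probability of the fuzzy failure event is taken as the expected membership value, which is one common definition.

```python
import random

def failure_membership(response, serviceability_limit, strength_limit):
    """Degree of failure in [0, 1]; linear ramp between the two limits."""
    if response <= serviceability_limit:
        return 0.0
    if response >= strength_limit:
        return 1.0
    return (response - serviceability_limit) / (strength_limit - serviceability_limit)

def fuzzy_failure_probability(samples, serviceability_limit, strength_limit):
    """Expected membership of the fuzzy failure event over the samples."""
    total = sum(
        failure_membership(x, serviceability_limit, strength_limit) for x in samples
    )
    return total / len(samples)

rng = random.Random(1)
# Hypothetical normalized structural response (e.g. a deflection ratio).
responses = [rng.gauss(1.0, 0.3) for _ in range(20000)]
p_fuzzy = fuzzy_failure_probability(responses, 1.2, 1.8)
p_crisp_strength = sum(x >= 1.8 for x in responses) / len(responses)
p_crisp_service = sum(x > 1.2 for x in responses) / len(responses)
print(p_crisp_strength, p_fuzzy, p_crisp_service)
```

The fuzzy estimate always falls between the two crisp estimates, which reflects the continuous transition from serviceability violation to strength failure described in the abstract.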
Reliability of a single objective measure in assessing sleepiness.
Sunwoo, Bernie Y; Jackson, Nicholas; Maislin, Greg; Gurubhagavatula, Indira; George, Charles F; Pack, Allan I
2012-01-01
To evaluate reliability of single objective tests in assessing sleepiness. Subjects who completed polysomnography underwent a 4-nap multiple sleep latency test (MSLT) the following day. Prior to each nap opportunity on MSLT, subjects performed the psychomotor vigilance test (PVT) and divided attention driving task (DADT). Results of single versus multiple test administrations were compared using the intraclass correlation coefficient (ICC) and adjusted for test administration order effects to explore time of day effects. Measures were explored as continuous and binary (i.e., impaired or not impaired). Community-based sample evaluated at a tertiary, university-based sleep center. 372 adult commercial vehicle operators oversampled for increased obstructive sleep apnea risk. N/A. As continuous measures, ICC were as follows: MSLT 0.45, PVT median response time 0.69, PVT number of lapses 0.51, 10-min DADT tracking error 0.87, 20-min DADT tracking error 0.90. Based on binary outcomes, ICC were: MSLT 0.63, PVT number of lapses 0.85, 10-min DADT 0.95, 20-min DADT 0.96. Statistically significant time of day effects were seen in both the MSLT and PVT but not the DADT. Correlation between ESS and different objective tests was strongest for MSLT, range [-0.270 to -0.195] and persisted across all time points. Single DADT and PVT administrations are reliable measures of sleepiness. A single MSLT administration can reasonably discriminate individuals with MSL < 8 minutes. These results support the use of a single administration of some objective tests of sleepiness when performed under controlled conditions in routine clinical care.
The reliability of the Extra Load Index as a measure of relative load carriage economy.
Hudson, Sean; Cooke, Carlton; Lloyd, Ray
2017-09-01
The aim of this study was to measure the reliability of the extra load index (ELI) as a method for assessing relative load carriage economy. Seventeen volunteers (12 males, 5 females) performed walking trials at 3 km·h⁻¹, 6 km·h⁻¹ and a self-selected speed. Trial conditions were repeated 7 days later to assess test-retest reliability. Trials involved four 4-minute periods of walking, each separated by 5 min of rest. The initial stage was performed unloaded followed in a randomised order by a second unloaded period and walking with backpacks of 7 and 20 kg. Results show ELI values did not differ significantly between trials for any of the speeds (p = 0.46) with either of the additional loads (p = 0.297). The systematic bias, limits of agreement and coefficients of variation were small in all trial conditions. We conclude the ELI appears to be a reliable measure of relative load carriage economy. Practitioner Summary: This paper demonstrates that the ELI is a reliable measure of load carriage economy at a range of walking speeds with both a light and heavy load. The ELI, therefore, represents a useful tool for comparing the relative economy associated with different load carriage systems.
The Reliability of Encounter Cards to Assess the CanMEDs Roles
ERIC Educational Resources Information Center
Sherbino, Jonathan; Kulasegaram, Kulamakan; Worster, Andrew; Norman, Geoffrey R.
2013-01-01
The purpose of this study was to determine the reliability of a computer-based encounter card (EC) to assess medical students during an emergency medicine rotation. From April 2011 to March 2012, multiple physicians assessed an entire medical school class during their emergency medicine rotation using the CanMEDS framework. At the end of an…
Investigating the Validity and Reliability of the Vanderbilt Assessment of Leadership in Education
ERIC Educational Resources Information Center
Porter, Andrew C.; Polikoff, Morgan S.; Goldring, Ellen B.; Murphy, Joseph; Elliott, Stephen N.; May, Henry
2010-01-01
The Vanderbilt Assessment of Leadership in Education (VAL-ED) is a multirater assessment of principals' learning-centered leadership. The instrument was developed based on the Standards for Educational and Psychological Testing. In this article, we report on the validity and reliability evidence for the VAL-ED accumulated in a national field…
Can a Two-Question Test Be Reliable and Valid for Predicting Academic Outcomes?
ERIC Educational Resources Information Center
Bridgeman, Brent
2016-01-01
Scores on essay-based assessments that are part of standardized admissions tests are typically given relatively little weight in admissions decisions compared to the weight given to scores from multiple-choice assessments. Evidence is presented to suggest that more weight should be given to these assessments. The reliability of the writing scores…
Brown, Laura J E; Adlam, Tim; Hwang, Faustina; Khadra, Hassan; Maclean, Linda M; Rudd, Bridey; Smith, Tom; Timon, Claire; Williams, Elizabeth A; Astell, Arlene J
2016-08-01
Patterns of cognitive change over micro-longitudinal timescales (i.e., ranging from hours to days) are associated with a wide range of age-related health and functional outcomes. However, practical issues of conducting high-frequency assessments make investigations of micro-longitudinal cognition costly and burdensome to run. One way of addressing this is to develop cognitive assessments that can be performed by older adults, in their own homes, without a researcher being present. Here, we address the question of whether reliable and valid cognitive data can be collected over micro-longitudinal timescales using unsupervised cognitive tests. In study 1, 48 older adults completed two touchscreen cognitive tests, on three occasions, in controlled conditions, alongside a battery of standard tests of cognitive functions. In study 2, 40 older adults completed the same two computerized tasks on multiple occasions, over three separate week-long periods, in their own homes, without a researcher present. Here, the tasks were incorporated into a wider touchscreen system (Novel Assessment of Nutrition and Ageing (NANA)) developed to assess multiple domains of health and behavior. Standard tests of cognitive function were also administered prior to participants using the NANA system. Performance on the two "NANA" cognitive tasks showed convergent validity with, and similar levels of reliability to, the standard cognitive battery in both studies. Completion and accuracy rates were also very high. These results show that reliable and valid cognitive data can be collected from older adults using unsupervised computerized tests, thus affording new opportunities for the investigation of cognitive.
Bartels, Else Marie; Langberg, Henning
2012-01-01
Background: Achilles tendinopathy (AT) is a common pathology and the aetiology is unknown. For valid and reliable assessment The Victorian Institute of Sports Assessment has designed a self-administered Achilles questionnaire, the VISA-A. The aim of the present study was to evaluate VISA-A as an outcome measure in patients with AT. Methods: A systematic search of the literature was conducted using MEDLINE, EMBASE, CINAHL, PEDro, Web of Science, and Cochrane Controlled trials to identify trials using VISA-A for patients with AT. This was followed by data mining and analysis of the obtained data. Results: Twenty-six clinical trials containing 1336 individuals were included. Overall mean VISA-A scores ranged from 24 (severe AT) to 100 (healthy). Mean VISA-A scores in patients with AT ranged from 24 to 96.6. Healthy subjects scored a minimum of 96. Only two groups of participants from two different studies had a post-VISA-A score as high as healthy individuals, indicating full recovery of the AT. Conclusions: A VISA-A score lower than 24 is rarely attained in AT. Only few patients with AT reach an equivalent VISA-A score compared to uninjured healthy subjects following treatment. The VISA-A is a reliable tool when assessing AT patients, providing a good assessment of the actual condition from very poor, (score around 24) to excellent (a score of 90), which based on this systematic review and previous studies could be considered full recovery from AT. PMID:22319681
Study of complete interconnect reliability for a GaAs MMIC power amplifier
NASA Astrophysics Data System (ADS)
Lin, Qian; Wu, Haifeng; Chen, Shan-ji; Jia, Guoqing; Jiang, Wei; Chen, Chao
2018-05-01
By combining finite element analysis (FEA) and the artificial neural network (ANN) technique, a complete prediction of interconnect reliability for a monolithic microwave integrated circuit (MMIC) power amplifier (PA) under both direct current (DC) and alternating current (AC) operating conditions is achieved in this article. As an example, an MMIC PA is modelled to study the electromigration failure of its interconnect. This is the first study of interconnect reliability for an MMIC PA under DC and AC operating conditions simultaneously. By training on the data from FEA, a high accuracy ANN model for PA reliability is constructed. The reliability database obtained from the ANN model can then give important guidance for improving the reliability design of ICs.
NASA Astrophysics Data System (ADS)
Reinert, K. A.
The use of linear decision rules (LDR) and chance constrained programming (CCP) to optimize the performance of wind energy conversion clusters coupled to storage systems is described. Storage is modelled by LDR and output by CCP. The linear allocation rule and linear release rule prescribe the size and optimize a storage facility with a bypass. Chance constraints are introduced to explicitly treat reliability in terms of an appropriate value from an inverse cumulative distribution function. Details of deterministic programming structure and a sample problem involving a 500 kW and a 1.5 MW WECS are provided, considering an installed cost of $1/kW. Four demand patterns and three levels of reliability are analyzed for optimizing the generator choice and the storage configuration for base load and peak operating conditions. Deficiencies in ability to predict reliability and to account for serial correlations are noted in the model, which is concluded useful for narrowing WECS design options.
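The step from a chance constraint to a value of an inverse cumulative distribution function can be made concrete with a short sketch. Assuming the available output W is continuous with CDF F, the constraint P(W ≥ x) ≥ α is equivalent to x ≤ F⁻¹(1 − α). The normal distribution, the mean and standard deviation, and the three reliability levels below are hypothetical; the abstract does not state them.

```python
from statistics import NormalDist

def firm_output(mean_kw, sd_kw, reliability):
    """Largest committed output x such that P(available >= x) >= reliability.

    Assumes available output is normally distributed (an illustrative
    assumption), so the deterministic equivalent of the chance
    constraint is x <= F^{-1}(1 - reliability).
    """
    return NormalDist(mean_kw, sd_kw).inv_cdf(1.0 - reliability)

# Three hypothetical reliability levels.
for alpha in (0.80, 0.90, 0.95):
    print(alpha, round(firm_output(300.0, 80.0, alpha), 1))
```

Raising the required reliability lowers the output that can be committed, which is the trade-off that storage sizing is meant to relax.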
Reliability of human-supervised formant-trajectory measurement for forensic voice comparison.
Zhang, Cuiling; Morrison, Geoffrey Stewart; Ochoa, Felipe; Enzinger, Ewald
2013-01-01
Acoustic-phonetic approaches to forensic voice comparison often include human-supervised measurement of vowel formants, but the reliability of such measurements is a matter of concern. This study assesses the within- and between-supervisor variability of three sets of formant-trajectory measurements made by each of four human supervisors. It also assesses the validity and reliability of forensic-voice-comparison systems based on these measurements. Each supervisor's formant-trajectory system was fused with a baseline mel-frequency cepstral-coefficient system, and performance was assessed relative to the baseline system. Substantial improvements in validity were found for all supervisors' systems, but some supervisors' systems were more reliable than others.
Textile Organic Electrochemical Transistors as a Platform for Wearable Biosensors
NASA Astrophysics Data System (ADS)
Gualandi, I.; Marzocchi, M.; Achilli, A.; Cavedale, D.; Bonfiglio, A.; Fraboni, B.
2016-09-01
The development of wearable chemical sensors is receiving a great deal of attention in view of non-invasive and continuous monitoring of physiological parameters in healthcare applications. This paper describes the development of a fully textile, wearable chemical sensor based on an organic electrochemical transistor (OECT) entirely made of conductive polymer (PEDOT:PSS). The active polymer patterns are deposited into the fabric by screen printing processes, thus allowing the device to actually “disappear” into it. We demonstrate the reliability of the proposed textile OECTs as a platform for developing chemical sensors capable of detecting in real time various redox active molecules (adrenaline, dopamine and ascorbic acid), by assessing their performance in two different experimental contexts: i) ideal operation conditions (i.e. totally dipped in an electrolyte solution); ii) real-life operation conditions (i.e. by sequentially adding a few drops of electrolyte solution onto only one side of the textile sensor). The OECT response has also been measured in artificial sweat, assessing how these sensors can be reliably used for the detection of biomarkers in body fluids. Finally, the very low operating potentials (<1 V) and absorbed power (~10⁻⁴ W) make the textile OECTs described here very appealing for portable and wearable applications.
Textile Organic Electrochemical Transistors as a Platform for Wearable Biosensors
Gualandi, I.; Marzocchi, M.; Achilli, A.; Cavedale, D.; Bonfiglio, A.; Fraboni, B.
2016-01-01
The development of wearable chemical sensors is receiving a great deal of attention in view of non-invasive and continuous monitoring of physiological parameters in healthcare applications. This paper describes the development of a fully textile, wearable chemical sensor based on an organic electrochemical transistor (OECT) entirely made of conductive polymer (PEDOT:PSS). The active polymer patterns are deposited into the fabric by screen printing processes, thus allowing the device to actually “disappear” into it. We demonstrate the reliability of the proposed textile OECTs as a platform for developing chemical sensors capable of detecting in real time various redox active molecules (adrenaline, dopamine and ascorbic acid), by assessing their performance in two different experimental contexts: i) ideal operation conditions (i.e. totally dipped in an electrolyte solution); ii) real-life operation conditions (i.e. by sequentially adding a few drops of electrolyte solution onto only one side of the textile sensor). The OECT response has also been measured in artificial sweat, assessing how these sensors can be reliably used for the detection of biomarkers in body fluids. Finally, the very low operating potentials (<1 V) and absorbed power (~10⁻⁴ W) make the textile OECTs described here very appealing for portable and wearable applications. PMID:27667396
Web-Based Assessment of Visual and Visuospatial Symptoms in Parkinson's Disease
Amick, Melissa M.; Miller, Ivy N.; Neargarder, Sandy; Cronin-Golomb, Alice
2012-01-01
Visual and visuospatial dysfunction is prevalent in Parkinson's disease (PD). To promote assessment of these often overlooked symptoms, we adapted the PD Vision Questionnaire for Internet administration. The questionnaire evaluates visual and visuospatial symptoms, impairments in activities of daily living (ADLs), and motor symptoms. PD participants of mild to moderate motor severity (n = 24) and healthy control participants (HC, n = 23) completed the questionnaire in paper and web-based formats. Reliability was assessed by comparing responses across formats. Construct validity was evaluated by reference to performance on measures of vision, visuospatial cognition, ADLs, and motor symptoms. The web-based format showed excellent reliability with respect to the paper format for both groups (all P′s < 0.001; HC completing the visual and visuospatial section only). Demonstrating the construct validity of the web-based questionnaire, self-rated ADL and visual and visuospatial functioning were significantly associated with performance on objective measures of these abilities (all P′s < 0.01). The findings indicate that web-based administration may be a reliable and valid method of assessing visual and visuospatial and ADL functioning in PD. PMID:22530162
Reliability modelling and analysis of a multi-state element based on a dynamic Bayesian network
NASA Astrophysics Data System (ADS)
Li, Zhiqiang; Xu, Tingxue; Gu, Junyuan; Dong, Qi; Fu, Linyu
2018-04-01
This paper presents a quantitative reliability modelling and analysis method for multi-state elements based on a combination of the Markov process and a dynamic Bayesian network (DBN), taking perfect repair, imperfect repair and condition-based maintenance (CBM) into consideration. The Markov models of elements without repair and under CBM are established, and an absorbing set is introduced to determine the reliability of the repairable element. According to the state-transition relations between the states determined by the Markov process, a DBN model is built. In addition, its parameters for series and parallel systems, namely, conditional probability tables, can be calculated by referring to the conditional degradation probabilities. Finally, the power of a control unit in a failure model is used as an example. A dynamic fault tree (DFT) is translated into a Bayesian network model, and subsequently extended to a DBN. The results show the state probabilities of an element and the system without repair, with perfect and imperfect repair, and under CBM, with an absorbing set plotted by differential equations and verified. Through referring forward, the reliability value of the control unit is determined in different kinds of modes. Finally, weak nodes are noted in the control unit.
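The role of an absorbing set in defining reliability can be shown with a small multi-state sketch. This is a discrete-time simplification for the no-repair case, not the paper's continuous-time Markov/DBN model, and the three states and transition probabilities are hypothetical. Reliability at each step is the probability of not yet having entered the absorbing (failed) set.

```python
def step(dist, matrix):
    """One transition: row vector of state probabilities times the matrix."""
    n = len(dist)
    return [sum(dist[i] * matrix[i][j] for i in range(n)) for j in range(n)]

def reliability_curve(matrix, initial, absorbing, n_steps):
    """Probability of being outside the absorbing set after each step."""
    dist = list(initial)
    curve = []
    for _ in range(n_steps):
        dist = step(dist, matrix)
        curve.append(1.0 - sum(dist[s] for s in absorbing))
    return curve

# Hypothetical states: 0 = good, 1 = degraded, 2 = failed (absorbing).
P = [
    [0.90, 0.08, 0.02],
    [0.00, 0.85, 0.15],
    [0.00, 0.00, 1.00],
]
curve = reliability_curve(P, [1.0, 0.0, 0.0], absorbing={2}, n_steps=10)
print([round(r, 3) for r in curve])
```

Adding a repair transition (for example, a nonzero probability from state 1 back to state 0) would model maintenance while leaving the absorbing failed state, and hence the reliability definition, unchanged.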
NASA Astrophysics Data System (ADS)
Shi, Yu-Fang; Ma, Yi-Yi; Song, Ping-Ping
2018-03-01
System reliability theory has been a research hotspot of management science and system engineering in recent years, and construction reliability is useful for quantitative evaluation of project management level. The definition of construction reliability is derived from reliability theory and the target system of engineering project management. Based on fuzzy mathematics theory and language operators, the value space of construction reliability is divided into seven fuzzy subsets; correspondingly, seven membership functions and fuzzy evaluation intervals are obtained through the language operators, which provides the method and parameters for evaluating construction reliability. The method is shown to be scientific and reasonable for construction conditions and is a useful attempt at theory and method research on engineering project system reliability.
Eliasson, Kristina; Palm, Peter; Nyman, Teresia; Forsman, Mikael
2017-07-01
A common way to conduct practical risk assessments is to observe a job and report the observed long term risks for musculoskeletal disorders. The aim of this study was to evaluate the inter- and intra-observer reliability of ergonomists' risk assessments without the support of an explicit risk assessment method. Twenty-one experienced ergonomists assessed the risk level (low, moderate, high risk) of eight upper body regions, as well as the global risk of 10 video recorded work tasks. Intra-observer reliability was assessed by having nine of the ergonomists repeat the procedure at least three weeks after the first assessment. The ergonomists made their risk assessments based on their own experience and knowledge. The statistical parameters of reliability included agreement in %, kappa, linearly weighted kappa, intraclass correlation and Kendall's coefficient of concordance. The average inter-observer agreement of the global risk was 53% and the corresponding weighted kappa (Kw) was 0.32, indicating fair reliability. The intra-observer agreement was 61%, with a weighted kappa (Kw) of 0.41. This study indicates that risk assessments of the upper body, without the use of an explicit observational method, have non-acceptable reliability. It is therefore recommended to use systematic risk assessment methods to a higher degree. Copyright © 2017 The Authors. Published by Elsevier Ltd. All rights reserved.
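For readers unfamiliar with the linearly weighted kappa used above, a minimal sketch for two observers rating on a three-level ordinal scale follows. The ratings are hypothetical and do not reproduce the study data. Disagreements are weighted by how far apart the two categories are, so a low/high disagreement counts more than a low/moderate one.

```python
def linear_weighted_kappa(rater_a, rater_b, n_levels):
    """Cohen's kappa with linear weights w_ij = 1 - |i - j| / (n_levels - 1)."""
    n = len(rater_a)
    observed = [[0.0] * n_levels for _ in range(n_levels)]
    for a, b in zip(rater_a, rater_b):
        observed[a][b] += 1.0 / n
    marg_a = [sum(row) for row in observed]
    marg_b = [sum(observed[i][j] for i in range(n_levels)) for j in range(n_levels)]
    po = pe = 0.0
    for i in range(n_levels):
        for j in range(n_levels):
            w = 1.0 - abs(i - j) / (n_levels - 1)
            po += w * observed[i][j]           # weighted observed agreement
            pe += w * marg_a[i] * marg_b[j]    # weighted chance agreement
    return (po - pe) / (1.0 - pe)

# Hypothetical ratings of 10 work tasks: 0 = low, 1 = moderate, 2 = high risk.
ergonomist_1 = [0, 1, 2, 1, 0, 2, 1, 1, 0, 2]
ergonomist_2 = [0, 2, 2, 1, 1, 1, 1, 0, 0, 2]
print(round(linear_weighted_kappa(ergonomist_1, ergonomist_2, 3), 3))
```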
NASA Astrophysics Data System (ADS)
Xia, Quan; Wang, Zili; Ren, Yi; Sun, Bo; Yang, Dezhen; Feng, Qiang
2018-05-01
With the rapid development of lithium-ion battery technology in the electric vehicle (EV) industry, the lifetime of the battery cell increases substantially; however, the reliability of the battery pack is still inadequate. Because of the complexity of the battery pack, a reliability design method for a lithium-ion battery pack considering the thermal disequilibrium is proposed in this paper based on cell redundancy. Based on this method, a three-dimensional electric-thermal-flow-coupled model, a stochastic degradation model of cells under field dynamic conditions and a multi-state system reliability model of a battery pack are established. The relationships between the multi-physics coupling model, the degradation model and the system reliability model are first constructed to analyze the reliability of the battery pack, followed by analysis examples with different redundancy strategies. By comparing the reliability of battery packs with different numbers and configurations of redundant cells, several conclusions for the redundancy strategy are obtained. More notably, the reliability does not monotonically increase with the number of redundant cells because of thermal disequilibrium effects. In this work, a 6 × 5 parallel-series configuration is the optimal system structure in terms of reliability. In addition, the effects of the cell arrangement and cooling conditions are investigated.
The 20 GHz solid state transmitter design, impatt diode development and reliability assessment
NASA Technical Reports Server (NTRS)
Picone, S.; Cho, Y.; Asmus, J. R.
1984-01-01
A single drift gallium arsenide (GaAs) Schottky barrier IMPATT diode and related components were developed. The IMPATT diode reliability was assessed. A proof of concept solid state transmitter design and a technology assessment study were performed. The transmitter design utilizes technology which, upon implementation, will demonstrate readiness for development of a POC model within the 1982 time frame and will provide an information base for flight hardware capable of deployment in a 1985 to 1990 demonstrational 30/20 GHz satellite communication system. Life test data for Schottky barrier GaAs diodes and grown junction GaAs diodes are described. The results demonstrate the viability of GaAs IMPATTs as high performance, reliable RF power sources which, based on the recommendation made herein, will surpass device reliability requirements consistent with a ten year spaceborne solid state power amplifier mission.
Evaluation of high fidelity patient simulator in assessment of performance of anaesthetists.
Weller, J M; Bloch, M; Young, S; Maze, M; Oyesola, S; Wyner, J; Dob, D; Haire, K; Durbridge, J; Walker, T; Newble, D
2003-01-01
There is increasing emphasis on performance-based assessment of clinical competence. The High Fidelity Patient Simulator (HPS) may be useful for assessment of clinical practice in anaesthesia, but needs formal evaluation of validity, reliability, feasibility and effect on learning. We set out to assess the reliability of a global rating scale for scoring simulator performance in crisis management. Using a global rating scale, three judges independently rated videotapes of anaesthetists in simulated crises in the operating theatre. Five anaesthetists then independently rated subsets of these videotapes. There was good agreement between raters for medical management, behavioural attributes and overall performance. Agreement was high for both the initial judges and the five additional raters. Using a global scale to assess simulator performance, we found good inter-rater reliability for scoring performance in a crisis. We estimate that two judges should provide a reliable assessment. High fidelity simulation should be studied further for assessing clinical performance.
NDE detectability of fatigue type cracks in high strength alloys
NASA Technical Reports Server (NTRS)
Christner, B. K.; Rummel, W. D.
1983-01-01
Specimens were produced that are suitable for investigating the reliability of production nondestructive evaluation (NDE) in detecting tightly closed fatigue cracks in high strength alloys representative of the materials used in spacecraft engine/booster construction. Inconel 718 was selected as representative of nickel base alloys and Haynes 188 was selected as representative of cobalt base alloys used in this application. Cleaning procedures were developed to ensure the reusability of the test specimens. A flaw detection reliability assessment of the fluorescent penetrant inspection method was performed using the test specimens, both to characterize their use for future reliability assessments and to provide additional NDE flaw detection reliability data for high strength alloys. Statistical analysis of the fluorescent penetrant inspection data was performed to determine the detection reliabilities for each inspection at a 90% probability/95% confidence level.
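The abstract above does not state how the 90% probability/95% confidence values were computed. As an illustration only, the sketch below covers the simplest binomial case, in which every flaw in a set of independent trials is detected; the function names are hypothetical.

```python
def pod_lower_bound_all_detected(n_trials, confidence=0.95):
    """One-sided lower confidence bound on probability of detection (POD)
    when all n_trials flaws were found (binomial model, zero misses)."""
    if n_trials < 1:
        raise ValueError("n_trials must be at least 1")
    alpha = 1.0 - confidence
    # With zero misses, the bound p solves p ** n_trials = alpha.
    return alpha ** (1.0 / n_trials)


def min_trials_for_demonstration(pod_target=0.90, confidence=0.95):
    """Smallest number of consecutive detections that demonstrates pod_target."""
    n = 1
    while pod_lower_bound_all_detected(n, confidence) < pod_target:
        n += 1
    return n


print(min_trials_for_demonstration())      # 29
print(pod_lower_bound_all_detected(29))    # about 0.902
```

Under this assumption, 29 detections out of 29 flaws is the smallest sample that supports a 90/95 claim; any miss requires a larger sample or a general binomial bound.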
Okundamiya, Michael S.; Emagbetere, Joy O.; Ogujor, Emmanuel A.
2014-01-01
The rapid growth of the mobile telecommunication sectors of many emerging countries creates a number of problems such as network congestion and poor service delivery for network operators. This results primarily from the lack of a reliable and cost-effective power solution within such regions. This study presents a comprehensive review of the underlying principles of renewable energy technology (RET) with the objective of ensuring a reliable and cost-effective energy solution for sustainable development in the emerging world. A grid-connected hybrid renewable energy system incorporating a power conversion and battery storage unit has been proposed based on the availability, dynamism, and technoeconomic viability of energy resources within the region. The proposed system's performance was validated with a simulation model developed in MATLAB, using practical load data for different locations with varying climatic conditions in Nigeria. Results indicate that, apart from being environmentally friendly, the increase in the overall energy throughput of about 4 kWh/$ of the proposed system would not only improve the quality of mobile services, by making the operations of GSM base stations more reliable and cost effective, but also better the living standards of the host communities. PMID:24578673
DOE Office of Scientific and Technical Information (OSTI.GOV)
Boring, Ronald; Mandelli, Diego; Rasmussen, Martin
2016-06-01
This report presents an application of a computation-based human reliability analysis (HRA) framework called the Human Unimodel for Nuclear Technology to Enhance Reliability (HUNTER). HUNTER has been developed not as a standalone HRA method but rather as a framework that ties together different HRA methods to model dynamic risk of human activities as part of an overall probabilistic risk assessment (PRA). While we have adopted particular methods to build an initial model, the HUNTER framework is meant to be intrinsically flexible to new pieces that achieve particular modeling goals. In the present report, the HUNTER implementation has the following goals:
• Integration with a high fidelity thermal-hydraulic model capable of modeling nuclear power plant behaviors and transients
• Consideration of a PRA context
• Incorporation of a solid psychological basis for operator performance
• Demonstration of a functional dynamic model of a plant upset condition and appropriate operator response
This report outlines these efforts and presents the case study of a station blackout scenario to demonstrate the various modules developed to date under the HUNTER research umbrella.
Fault Diagnosis Strategies for SOFC-Based Power Generation Plants
Costamagna, Paola; De Giorgi, Andrea; Gotelli, Alberto; Magistri, Loredana; Moser, Gabriele; Sciaccaluga, Emanuele; Trucco, Andrea
2016-01-01
The success of distributed power generation by plants based on solid oxide fuel cells (SOFCs) is hindered by reliability problems that can be mitigated through an effective fault detection and isolation (FDI) system. However, the numerous operating conditions under which such plants can operate and the random size of the possible faults make identifying damaged plant components starting from the physical variables measured in the plant very difficult. In this context, we assess two classical FDI strategies (model-based with fault signature matrix and data-driven with statistical classification) and the combination of them. For this assessment, a quantitative model of the SOFC-based plant, which is able to simulate regular and faulty conditions, is used. Moreover, a hybrid approach based on the random forest (RF) classification method is introduced to address the discrimination of regular and faulty situations due to its practical advantages. Working with a common dataset, the FDI performances obtained using the aforementioned strategies, with different sets of monitored variables, are observed and compared. We conclude that the hybrid FDI strategy, realized by combining a model-based scheme with a statistical classifier, outperforms the other strategies. In addition, the inclusion of two physical variables that should be measured inside the SOFCs can significantly improve the FDI performance, despite the actual difficulty in performing such measurements. PMID:27556472
Baker, Elizabeth A; Ledford, Cynthia H; Fogg, Louis; Way, David P; Park, Yoon Soo
2015-01-01
Construct: Clinical skills are used in the care of patients, including reporting, diagnostic reasoning, and decision-making skills. Written comprehensive new patient admission notes (H&Ps) are a ubiquitous part of student education but are underutilized in the assessment of clinical skills. The interpretive summary, differential diagnosis, explanation of reasoning, and alternatives (IDEA) assessment tool was developed to assess students' clinical skills using written comprehensive new patient admission notes. The validity evidence for assessment of clinical skills using clinical documentation following authentic patient encounters has not been well documented. Diagnostic justification tools and postencounter notes are described in the literature (1,2) but are based on standardized patient encounters. To our knowledge, the IDEA assessment tool is the first published tool that uses medical students' H&Ps to rate students' clinical skills. The IDEA assessment tool is a 15-item instrument that asks evaluators to rate students' reporting, diagnostic reasoning, and decision-making skills based on medical students' new patient admission notes. This study presents validity evidence in support of the IDEA assessment tool using Messick's unified framework, including content (theoretical framework), response process (interrater reliability), internal structure (factor analysis and internal-consistency reliability), and relationship to other variables. Validity evidence is based on results from four studies conducted between 2010 and 2013. First, the factor analysis (2010, n = 216) yielded a three-factor solution, measuring patient story, IDEA, and completeness, with reliabilities of .79, .88, and .79, respectively. Second, an initial interrater reliability study (2010) involving two raters demonstrated fair to moderate consensus (κ = .21-.56, ρ =.42-.79). 
Third, a second interrater reliability study (2011) with 22 trained raters also demonstrated fair to moderate agreement (intraclass correlations [ICCs] = .29-.67). There was moderate reliability for all three skill domains, including reporting skills (ICC = .53), diagnostic reasoning skills (ICC = .64), and decision-making skills (ICC = .63). Fourth, there was a significant correlation between IDEA rating scores (2010-2013) and final Internal Medicine clerkship grades (r = .24), 95% confidence interval (CI) [.15, .33]. The IDEA assessment tool is a novel tool with validity evidence to support its use in the assessment of students' reporting, diagnostic reasoning, and decision-making skills. The moderate reliability achieved supports formative or lower stakes summative uses rather than high-stakes summative judgments.
FY11 Facility Assessment Study for Aeronautics Test Program
NASA Technical Reports Server (NTRS)
Loboda, John A.; Sydnor, George H.
2013-01-01
This paper presents the approach and results for the Aeronautics Test Program (ATP) FY11 Facility Assessment Project. ATP commissioned assessments in FY07 and FY11 to aid in the understanding of the current condition and reliability of its facilities and their ability to meet current and future (five year horizon) test requirements. The principal output of the assessment was a database of facility-unique, prioritized investment projects with budgetary cost estimates. This database was also used to identify trends for the condition of facility systems.
Ferris, M; Cohen, S; Haberman, C; Javalkar, K; Massengill, S; Mahan, J D; Kim, S; Bickford, K; Cantu, G; Medeiros, M; Phillips, A; Ferris, M T; Hooper, S R
2015-01-01
The Self-Management and Transition to Adulthood with Rx=Treatment (STARx) Questionnaire was developed to collect information on self-management and health care transition (HCT) skills, via self-report, in a broad population of adolescents and young adults (AYAs) with chronic conditions. Over several iterations, the STARx questionnaire was created with AYA, family, and health provider input. The development and pilot testing of the STARx Questionnaire took place with the assistance of 1219 AYAs with different chronic health conditions, in multiple institutions and settings over three phases: item development, pilot testing, reliability and factor structuring. The three development phases resulted in a final version of the STARx Questionnaire. The exploratory factor analysis of the third version of the 18-item STARx identified six factors that accounted for about 65% of the variance: Medication management, Provider communication, Engagement during appointments, Disease knowledge, Adult health responsibilities, and Resource utilization. Reliability estimates revealed good internal consistency and temporal stability, with the alpha coefficient for the overall scale being .80. The STARx was developmentally sensitive, with older patients scoring significantly higher on nearly every factor than younger patients. The STARx Questionnaire is a reliable, self-report tool with adequate internal consistency, temporal stability, and a strong, multidimensional factor structure. It provides another assessment strategy to measure self-management and transition skills in AYAs with chronic conditions. Copyright © 2015 Elsevier Inc. All rights reserved.
Simulation-based Assessment to Reliably Identify Key Resident Performance Attributes.
Blum, Richard H; Muret-Wagstaff, Sharon L; Boulet, John R; Cooper, Jeffrey B; Petrusa, Emil R; Baker, Keith H; Davidyuk, Galina; Dearden, Jennifer L; Feinstein, David M; Jones, Stephanie B; Kimball, William R; Mitchell, John D; Nadelberg, Robert L; Wiser, Sarah H; Albrecht, Meredith A; Anastasi, Amanda K; Bose, Ruma R; Chang, Laura Y; Culley, Deborah J; Fisher, Lauren J; Grover, Meera; Klainer, Suzanne B; Kveraga, Rikante; Martel, Jeffrey P; McKenna, Shannon S; Minehart, Rebecca D; Mitchell, John D; Mountjoy, Jeremi R; Pawlowski, John B; Pilon, Robert N; Shook, Douglas C; Silver, David A; Warfield, Carol A; Zaleski, Katherine L
2018-04-01
Obtaining reliable and valid information on resident performance is critical to patient safety and training program improvement. The goals were to characterize important anesthesia resident performance gaps that are not typically evaluated, and to further validate scores from a multiscenario simulation-based assessment. Seven high-fidelity scenarios reflecting core anesthesiology skills were administered to 51 first-year residents (CA-1s) and 16 third-year residents (CA-3s) from three residency programs. Twenty trained attending anesthesiologists rated resident performances using a seven-point behaviorally anchored rating scale for five domains: (1) formulate a clear plan, (2) modify the plan under changing conditions, (3) communicate effectively, (4) identify performance improvement opportunities, and (5) recognize limits. A second rater assessed 10% of encounters. Scores and variances for each domain, each scenario, and the total were compared. Low domain ratings (1, 2) were examined in detail. Interrater agreement was 0.76; reliability of the seven-scenario assessment was r = 0.70. CA-3s had a significantly higher average total score (4.9 ± 1.1 vs. 4.6 ± 1.1, P = 0.01, effect size = 0.33). CA-3s significantly outscored CA-1s for five of seven scenarios and domains 1, 2, and 3. CA-1s had a significantly higher proportion of worrisome ratings than CA-3s (chi-square = 24.1, P < 0.01, effect size = 1.50). Ninety-eight percent of residents rated the simulations more educational than an average day in the operating room. Sensitivity of the assessment to CA-1 versus CA-3 performance differences for most scenarios and domains supports validity. No differences, by experience level, were detected for two domains associated with reflective practice. Smaller score variances for CA-3s likely reflect a training effect; however, worrisome performance scores for both CA-1s and CA-3s suggest room for improvement.
Validity and Reliability of the 8-Item Work Limitations Questionnaire.
Walker, Timothy J; Tullar, Jessica M; Diamond, Pamela M; Kohl, Harold W; Amick, Benjamin C
2017-12-01
Purpose To evaluate factorial validity, scale reliability, test-retest reliability, convergent validity, and discriminant validity of the 8-item Work Limitations Questionnaire (WLQ) among employees from a public university system. Methods A secondary analysis using de-identified data from employees who completed an annual Health Assessment between the years 2009-2015 tested research aims. Confirmatory factor analysis (CFA) (n = 10,165) tested the latent structure of the 8-item WLQ. Scale reliability was determined using a CFA-based approach while test-retest reliability was determined using the intraclass correlation coefficient. Convergent/discriminant validity was tested by evaluating relations between the 8-item WLQ with health/performance variables for convergent validity (health-related work performance, number of chronic conditions, and general health) and demographic variables for discriminant validity (gender and institution type). Results A 1-factor model with three correlated residuals demonstrated excellent model fit (CFI = 0.99, TLI = 0.99, RMSEA = 0.03, and SRMR = 0.01). The scale reliability was acceptable (0.69, 95% CI 0.68-0.70) and the test-retest reliability was very good (ICC = 0.78). Low-to-moderate associations were observed between the 8-item WLQ and the health/performance variables while weak associations were observed between the demographic variables. Conclusions The 8-item WLQ demonstrated sufficient reliability and validity among employees from a public university system. Results suggest the 8-item WLQ is a usable alternative for studies when the more comprehensive 25-item WLQ is not available.
Roth, Ralf; Donath, Lars; Kurz, Eduard; Zahner, Lukas; Faude, Oliver
2017-03-01
The present study aimed to assess the between day reliability of isokinetic and isometric peak torque (PT) during trunk measurement on an isokinetic device (IsoMed 2000). Test-retest-protocol on five separate days. Fifteen healthy sport students (8 female and 7 male) aged 21 to 26. PT was assessed in isometric back extension and flexion as well as right and left rotation. Isokinetic strength was captured at a speed of 60°/s and 150°/s for all tasks. For none of the assessed parameters a meaningful variation in PT during test days was observed. Relative reliability (ICC = 0.85-0.96) was excellent for all tasks. Estimates of absolute reliability as Coefficient of Variation (CoV) and Standard Error of Measurement (SEM in Nm/kg lean body mass) remained stable for isometric (6.9% < CoV < 9.4%; 0.15 < SEM < 0.23) and isokinetic mode (60°/s: 3.7% < CoV < 8.6%; 0.08 < SEM < 0.24; 150°/s: 6.9% < CoV < 12.4%; 0.10 < SEM < 0.31). In contrast, reliability between familiarization day and day 1 was lower (6.6% < CoV < 26.2%; 0.10 < SEM < 0.65). Trunk strength measurement in flexion and extension or trunk rotation in either isometric or isokinetic condition is highly reliable. Therefore, it seems possible to elucidate changes which are smaller than 10% due to intervention programs when a preceding familiarization condition was applied. Copyright © 2016 Elsevier Ltd. All rights reserved.
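The absolute reliability measures in the abstract above (CoV and SEM) can be sketched as follows. The abstract does not give the exact formulas used, so this assumes the common definitions: SEM as the between-subject standard deviation times the square root of (1 - ICC), and CoV as the mean within-subject standard deviation divided by the subject mean. All names and numbers are illustrative.

```python
import math
import statistics


def sem_from_icc(scores, icc):
    """Standard error of measurement from between-subject SD and an ICC."""
    return statistics.stdev(scores) * math.sqrt(1.0 - icc)


def minimal_detectable_change(sem, z=1.96):
    """Smallest change between two measurements that exceeds measurement noise
    at the confidence level implied by z (1.96 for 95%)."""
    return z * math.sqrt(2.0) * sem


def within_subject_cov(trials):
    """Mean within-subject coefficient of variation in percent.
    `trials` holds one list of repeated measurements per subject."""
    covs = [100.0 * statistics.stdev(t) / statistics.mean(t) for t in trials]
    return statistics.mean(covs)


# Hypothetical peak torque values (Nm/kg lean body mass).
sem = sem_from_icc([2.0, 4.0, 6.0], icc=0.75)
print(sem, minimal_detectable_change(sem))
print(within_subject_cov([[9.0, 11.0], [10.0, 10.5]]))
```

The minimal detectable change is what links these statistics to the abstract's conclusion: an intervention effect can only be attributed to training if it exceeds this noise threshold.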
Reliability based design optimization: Formulations and methodologies
NASA Astrophysics Data System (ADS)
Agarwal, Harish
Modern products ranging from simple components to complex systems should be designed to be optimal and reliable. The challenge of modern engineering is to ensure that manufacturing costs are reduced and design cycle times are minimized while achieving requirements for performance and reliability. If the market for the product is competitive, improved quality and reliability can generate very strong competitive advantages. Simulation based design plays an important role in designing almost any kind of automotive, aerospace, and consumer products under these competitive conditions. Single discipline simulations used for analysis are being coupled together to create complex coupled simulation tools. This investigation focuses on the development of efficient and robust methodologies for reliability based design optimization in a simulation based design environment. Original contributions of this research are the development of a novel efficient and robust unilevel methodology for reliability based design optimization, the development of an innovative decoupled reliability based design optimization methodology, the application of homotopy techniques in unilevel reliability based design optimization methodology, and the development of a new framework for reliability based design optimization under epistemic uncertainty. The unilevel methodology for reliability based design optimization is shown to be mathematically equivalent to the traditional nested formulation. Numerical test problems show that the unilevel methodology can reduce computational cost by at least 50% as compared to the nested approach. The decoupled reliability based design optimization methodology is an approximate technique to obtain consistent reliable designs at lesser computational expense. Test problems show that the methodology is computationally efficient compared to the nested approach. 
A framework for performing reliability based design optimization under epistemic uncertainty is also developed. A trust region managed sequential approximate optimization methodology is employed for this purpose. Results from numerical test studies indicate that the methodology can be used for performing design optimization under severe uncertainty.
Issues in developing valid assessments of speech pathology students' performance in the workplace.
McAllister, Sue; Lincoln, Michelle; Ferguson, Alison; McAllister, Lindy
2010-01-01
Workplace-based learning is a critical component of professional preparation in speech pathology. A validated assessment of this learning is seen to be 'the gold standard', but it is difficult to develop because of design and validation issues. These issues include the role and nature of judgement in assessment, challenges in measuring quality, and the relationship between assessment and learning. Valid assessment of workplace-based performance needs to capture the development of competence over time and account for both occupation specific and generic competencies. This paper reviews important conceptual issues in the design of valid and reliable workplace-based assessments of competence including assessment content, process, impact on learning, measurement issues, and validation strategies. It then goes on to share what has been learned about quality assessment and validation of a workplace-based performance assessment using competency-based ratings. The outcomes of a four-year national development and validation of an assessment tool are described. A literature review of issues in conceptualizing, designing, and validating workplace-based assessments was conducted. Key factors to consider in the design of a new tool were identified and built into the cycle of design, trialling, and data analysis in the validation stages of the development process. This paper provides an accessible overview of factors to consider in the design and validation of workplace-based assessment tools. It presents strategies used in the development and national validation of a tool, COMPASS, used in every speech pathology programme in Australia, New Zealand, and Singapore. The paper also describes Rasch analysis, a model-based statistical approach which is useful for establishing validity and reliability of assessment tools. 
Through careful attention to conceptual and design issues in the development and trialling of workplace-based assessments, it has been possible to develop the world's first valid and reliable national assessment tool for the assessment of performance in speech pathology.
Yamada, Minoru; Aoyama, Tomoki; Nakamura, Masatoshi; Tanaka, Buichi; Nagai, Koutatsu; Tatematsu, Noriatsu; Uemura, Kazuki; Nakamura, Takashi; Tsuboyama, Tadao; Ichihashi, Noriaki
2011-01-01
The purpose of this study was to examine whether the Nintendo Wii Fit program could be used for fall risk assessment in healthy, community-dwelling older adults. Forty-five community-dwelling older women participated in this study. The "Basic Step" and "Ski Slalom" modules were selected from the Wii Fit game program. The following 5 physical performance tests were performed: the 10-m walk test under single- and dual-task conditions, the Timed Up and Go test under single- and dual-task conditions, and the Functional Reach test. Compared with the faller group, the nonfaller group showed a significant difference in the Basic Step (P < .001) and a nonsignificant difference in the Ski Slalom (P = .453). The discriminating criterion between the 2 groups was a score of 111 points on the Basic Step (P < .001). The Basic Step showed statistically significant, moderate correlations between the dual-task lag of walking (r = -.547) and the dual-task lag of the Timed Up and Go test (r = -.688). These results suggest that game-based fall risk assessment using the Basic Step has a high generality and is useful in community-dwelling older adults. Copyright © 2011 Mosby, Inc. All rights reserved.
A Comprehensive Critique and Review of Published Measures of Acne Severity
Furber, Gareth; Leach, Matthew; Segal, Leonie
2016-01-01
Objective: Acne vulgaris is a dynamic, complex condition that is notoriously difficult to evaluate. The authors set out to critically evaluate currently available measures of acne severity, particularly in terms of suitability for use in clinical trials. Design: A systematic review was conducted to identify methods used to measure acne severity, using MEDLINE, CINAHL, Scopus, and Wiley Online. Each method was critically reviewed and given a score out of 13 based on eight quality criteria under two broad groupings of psychometric testing and suitability for research and evaluation. Results: Twenty-four methods for assessing acne severity were identified. Four scales received a quality score of zero, and 11 scored ≤3. The highest rated scales achieved a total score of 6. Six scales reported strong inter-rater reliability (ICC>0.75), and four reported strong intra-rater reliability (ICC>0.75). The poor overall performance of most scales, largely characterized by the absence of reliability testing or evidence for independent assessment and validation indicates that generally, their application in clinical trials is not supported. Conclusion: This review and appraisal of instruments for measuring acne severity supports previously identified concerns regarding the quality of published measures. It highlights the need for a valid and reliable acne severity scale, especially for use in research and evaluation. The ideal scale would demonstrate adequate validation and reliability and be easily implemented for third-party analysis. The development of such a scale is critical to interpreting results of trials and facilitating the pooling of results for systematic reviews and meta-analyses. PMID:27672410
Assessing Reliability of Cold Spray Sputter Targets in Photovoltaic Manufacturing
NASA Astrophysics Data System (ADS)
Hardikar, Kedar; Vlcek, Johannes; Bheemreddy, Venkata; Juliano, Daniel
2017-10-01
Cold spray has been used to manufacture more than 800 Cu-In-Ga (CIG) sputter targets for deposition of high-efficiency photovoltaic thin films. It is a preferred technique since it enables high deposit purity and transfer of non-equilibrium alloy states to the target material. In this work, an integrated approach to reliability assessment of such targets with deposit weight in excess of 50 lb. is undertaken, involving thermal-mechanical characterization of the material in as-deposited condition, characterization of the interface adhesion on cylindrical substrate in as-deposited condition, and developing means to assess target integrity under thermal-mechanical loads during the physical vapor deposition (PVD) sputtering process. Mechanical characterization of cold spray deposited CIG alloy is accomplished through the use of indentation testing and adaptation of Brazilian disk test. A custom lever test was developed to characterize adhesion along the cylindrical interface between the CIG deposit and cylindrical substrate, overcoming limitations of current standards. A cohesive zone model for crack initiation and propagation at the deposit interface is developed and validated using the lever test and later used to simulate the potential catastrophic target failure in the PVD process. It is shown that this approach enables reliability assessment of sputter targets and improves robustness.
A rainwater harvesting system reliability model based on nonparametric stochastic rainfall generator
NASA Astrophysics Data System (ADS)
Basinger, Matt; Montalto, Franco; Lall, Upmanu
2010-10-01
Summary: The reliability with which harvested rainwater can be used as a means of flushing toilets, irrigating gardens, and topping off air conditioners serving multifamily residential buildings in New York City is assessed using a new rainwater harvesting (RWH) system reliability model. Although demonstrated with a specific case study, the model is portable because it is based on a nonparametric rainfall generation procedure utilizing a bootstrapped Markov chain. Precipitation occurrence is simulated using transition probabilities derived for each day of the year based on the historical probability of wet and dry day state changes. Precipitation amounts are selected from a matrix of historical values within a moving 15 day window that is centered on the target day. RWH system reliability is determined for user-specified catchment area and tank volume ranges using precipitation ensembles generated using the described stochastic procedure. The reliability with which NYC backyard gardens can be irrigated and air conditioning units supplied with water harvested from local roofs exceeds 80% and 90%, respectively, for the entire range of catchment areas and tank volumes considered in the analysis. For RWH systems installed on the most commonly occurring rooftop catchment areas found in NYC (51-75 m²), toilet flushing demand can be met with 7-40% reliability, with the lower end of the range representing buildings with high flow toilets and no storage elements, and the upper end representing buildings that feature low flow fixtures and storage tanks of up to 5 m³. When the reliability curves developed are used to size RWH systems to flush the low flow toilets of all multifamily buildings found in a typical residential neighborhood in the Bronx, rooftop runoff inputs to the sewer system are reduced by approximately 28% over an average rainfall year, and potable water demand is reduced by approximately 53%.
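The generation procedure described in the abstract above can be sketched roughly as follows. This is a simplified illustration, not the authors' model: the function names, the runoff coefficient, the daily water balance, and the handling of the first day of the year are all assumptions. `history` is a list of years, each a list of daily rainfall depths in mm.

```python
import random


def wet_probability(history, day, prev_wet):
    """Historical P(wet on `day` | state of the previous day)."""
    matches = wet = 0
    for year in history:
        # Simplification: for day 0 the "previous day" is the same year's last day.
        if (year[day - 1] > 0) == prev_wet:
            matches += 1
            wet += year[day] > 0
    if matches == 0:  # state never observed: fall back to the unconditional frequency
        return sum(y[day] > 0 for y in history) / len(history)
    return wet / matches


def generate_year(history, rng, half_width=7):
    """One synthetic year: Markov-chain occurrence, bootstrapped amounts
    drawn from a window of 2 * half_width + 1 days centered on the target day."""
    n_days = len(history[0])
    prev_wet = rng.choice(history)[-1] > 0
    series = []
    for day in range(n_days):
        depth = 0.0
        if rng.random() < wet_probability(history, day, prev_wet):
            pool = [year[(day + off) % n_days]
                    for year in history
                    for off in range(-half_width, half_width + 1)
                    if year[(day + off) % n_days] > 0]
            depth = rng.choice(pool) if pool else 0.0
        series.append(depth)
        prev_wet = depth > 0
    return series


def rwh_reliability(rain_mm, area_m2, tank_m3, demand_m3, runoff_coeff=0.9):
    """Fraction of days on which the daily demand is fully met."""
    storage, met = 0.0, 0
    for depth in rain_mm:
        available = storage + depth / 1000.0 * area_m2 * runoff_coeff
        if available >= demand_m3:
            met += 1
        # Supply first, then store what fits in the tank; the rest spills.
        storage = min(max(available - demand_m3, 0.0), tank_m3)
    return met / len(rain_mm)


def ensemble_reliability(history, area_m2, tank_m3, demand_m3, n_years=100, seed=0):
    """Mean reliability over an ensemble of synthetic rainfall years."""
    rng = random.Random(seed)
    runs = [rwh_reliability(generate_year(history, rng), area_m2, tank_m3, demand_m3)
            for _ in range(n_years)]
    return sum(runs) / len(runs)
```

Sweeping `area_m2` and `tank_m3` over a grid and plotting `ensemble_reliability` would give reliability curves of the kind the abstract uses for sizing, under the assumptions stated above.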
Intersession reliability of fMRI activation for heat pain and motor tasks
Quiton, Raimi L.; Keaser, Michael L.; Zhuo, Jiachen; Gullapalli, Rao P.; Greenspan, Joel D.
2014-01-01
As the practice of conducting longitudinal fMRI studies to assess mechanisms of pain-reducing interventions becomes more common, there is a great need to assess the test–retest reliability of the pain-related BOLD fMRI signal across repeated sessions. This study quantitatively evaluated the reliability of heat pain-related BOLD fMRI brain responses in healthy volunteers across 3 sessions conducted on separate days using two measures: (1) intraclass correlation coefficients (ICC) calculated based on signal amplitude and (2) spatial overlap. The ICC analysis of pain-related BOLD fMRI responses showed fair-to-moderate intersession reliability in brain areas regarded as part of the cortical pain network. Areas with the highest intersession reliability based on the ICC analysis included the anterior midcingulate cortex, anterior insula, and second somatosensory cortex. Areas with the lowest intersession reliability based on the ICC analysis also showed low spatial reliability; these regions included pregenual anterior cingulate cortex, primary somatosensory cortex, and posterior insula. Thus, this study found regional differences in pain-related BOLD fMRI response reliability, which may provide useful information to guide longitudinal pain studies. A simple motor task (finger-thumb opposition) was performed by the same subjects in the same sessions as the painful heat stimuli were delivered. Intersession reliability of fMRI activation in cortical motor areas was comparable to previously published findings for both spatial overlap and ICC measures, providing support for the validity of the analytical approach used to assess intersession reliability of pain-related fMRI activation. 
A secondary finding of this study is that the use of standard ICC alone as a measure of reliability may not be sufficient, as the underlying variance structure of an fMRI dataset can result in inappropriately high ICC values; a method to eliminate these false positive results was used in this study and is recommended for future studies of test–retest reliability. PMID:25161897
Reliability Analysis of the Adult Mentoring Assessment for Extension Professionals
ERIC Educational Resources Information Center
Denny, Marina D'Abreau
2017-01-01
The Adult Mentoring Assessment for Extension Professionals will help mentors develop an accurate profile of their mentoring style with adult learners and identify areas of proficiency and deficiency based on six constructs--relationship, information, facilitation, confrontation, modeling, and vision. This article reports on the reliability of this…
Inter- and intra-rater reliability of nasal auscultation in daycare children.
Santos, Rita; Silva Alexandrino, Ana; Tomé, David; Melo, Cristina; Mesquita Montes, António; Costa, Daniel; Pinto Ferreira, João
2018-02-01
The aim of this study was to assess nasal auscultation's intra- and inter-rater reliability and to analyze ear and respiratory clinical condition according to nasal auscultation. Cross-sectional study performed in 125 children aged up to 3 years old attending daycare centers. Nasal auscultation, tympanometry and Paediatric Respiratory Severity Score (PRSS) were applied to all children. Nasal sounds were classified by an expert panel in order to determine nasal auscultation's intra- and inter-rater reliability. The classification of nasal sounds was assessed against tympanometric and PRSS values. Nasal auscultation revealed substantial inter-rater (K=0.75) and intra-rater (K=0.69; K=0.61 and K=0.72) reliability. Children with a "non-obstructed" classification revealed a lower peak pressure (t=-3.599, P<0.001 in left ear; t=-2.258, P=0.026 in right ear) and a higher compliance (t=-2.728, P=0.007 in left ear; t=-3.830, P<0.001 in right ear) in both ears. There was an association between the classification of sounds and tympanogram types in both ears (X=11.437, P=0.003 in left ear; X=13.535, P=0.001 in right ear). Children with a "non-obstructed" classification had a healthier respiratory condition. Nasal auscultation revealed substantial intra- and inter-rater reliability. Nasal auscultation exhibited important differences according to ear and respiratory clinical conditions. Nasal auscultation in pediatrics seems to be an original topic as well as a simple method that can be used to identify early signs of nasopharyngeal obstruction.
Validation of the oesophageal hypervigilance and anxiety scale for chronic oesophageal disease.
Taft, T H; Triggs, J R; Carlson, D A; Guadagnoli, L; Tomasino, K N; Keefer, L; Pandolfino, J E
2018-05-01
Oesophageal hypervigilance and anxiety can drive symptom experience in chronic oesophageal conditions, including gastro-oesophageal reflux disease, achalasia and functional oesophageal disorders. To date, no validated self-report measure exists to evaluate oesophageal hypervigilance and anxiety. This study aims to develop a brief and reliable questionnaire assessing these constructs, the oesophageal hypervigilance and anxiety scale (EHAS). Questions for the EHAS were drawn from 4 existing validated measures that assessed hypervigilance and anxiety adapted for the oesophagus. Patients who previously underwent high-resolution manometry testing at a university-based oesophageal motility clinic were retrospectively identified. Patients were included in the analysis if they completed the EHAS as well as questionnaires assessing symptom severity and health-related quality of life at the time of the high-resolution manometry. Nine hundred and eighty-two patients aged 18-85 completed the study. The EHAS demonstrates excellent internal consistency (α = 0.93) and split-half reliability (Guttman = 0.87). Inter-item correlations indicated multicollinearity was not achieved; thus, no items were removed from the original 15-item scale. Principal components factor analysis revealed two subscales measuring symptom-specific anxiety and symptom-specific hypervigilance. Construct validity for total and subscale scores was supported by positive correlations with symptom severity and negative correlations with health-related quality of life. The EHAS is a 15-item scale assessing oesophageal hypervigilance and symptom-specific anxiety. The EHAS could be useful in evaluating the role of these constructs in several oesophageal conditions in which hypersensitivity, hypervigilance and anxiety may contribute to symptoms and impact treatment outcomes. © 2018 John Wiley & Sons Ltd.
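Internal consistency of the kind reported here (Cronbach's α) is usually computed from the item variances and the variance of the summed score. The sketch below shows the standard formula on made-up data; the function name and the toy responses are illustrative assumptions and have no connection to the EHAS dataset.

```python
from statistics import variance

def cronbach_alpha(responses):
    """Cronbach's alpha for a list of respondents, each a list of k item scores.

    alpha = k / (k - 1) * (1 - sum(item variances) / variance(total score)),
    using sample variances throughout.
    """
    k = len(responses[0])
    item_vars = [variance([row[j] for row in responses]) for j in range(k)]
    total_var = variance([sum(row) for row in responses])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)

# Toy data: four respondents answering a two-item scale (hypothetical values).
toy = [[1, 2], [2, 1], [3, 3], [4, 4]]
alpha = cronbach_alpha(toy)
```

The formula is undefined when every respondent has the same total score, so real use would need a guard for zero total variance.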
Economos, Christina D; Sacheck, Jennifer M; Kwan Ho Chui, Kenneth; Irizarry, Laura; Irizzary, Laura; Guillemont, Juliette; Collins, Jessica J; Hyatt, Raymond R
2008-04-01
Interventions aiming to modify the dietary and physical activity behaviors of young children require precise and accurate measurement tools. As part of a larger community-based project, three school-based questionnaires were developed to assess (a) fruit and vegetable intake, (b) physical activity and television (TV) viewing, and (c) perceived parental support for diet and physical activity. Test-retest reliability was performed on all questionnaires and validity was measured for fruit and vegetable intake, physical activity, and TV viewing. Eighty-four school children (8.3±1.1 years) were studied. Test-retest reliability was performed by administering questionnaires twice, 1 to 2 hours apart. Validity of the fruit and vegetable questionnaire was measured by direct observation, while the physical activity and TV questionnaire was validated by a parent phone interview. All three questionnaires yielded excellent test-retest reliability (P<0.001). The majority of fruit and vegetable questions and the questions regarding specific physical activities and TV viewing were valid. Low validity scores were found for questions on watching TV during breakfast or dinner. These questionnaires are reliable and valid tools to assess fruit and vegetable intake, physical activity, and TV viewing behaviors in early elementary school-aged children. Methods for assessment of children's TV viewing during meals should be further investigated because of parent-child discrepancies.
Methodology Series Module 9: Designing Questionnaires and Clinical Record Forms – Part II
Setia, Maninder Singh
2017-01-01
This article is a continuation of the previous module on designing questionnaires and clinical record form in which we have discussed some basic points about designing the questionnaire and clinical record forms. In this section, we will discuss the reliability and validity of questionnaires. The different types of validity are face validity, content validity, criterion validity, and construct validity. The different types of reliability are test-retest reliability, inter-rater reliability, and intra-rater reliability. Some of these parameters are assessed by subject area experts. However, statistical tests should be used for evaluation of other parameters. Once the questionnaire has been designed, the researcher should pilot test the questionnaire. The items in the questionnaire should be changed based on the feedback from the pilot study participants and the researcher's experience. After the basic structure of the questionnaire has been finalized, the researcher should assess the validity and reliability of the questionnaire or the scale. If an existing standard questionnaire is translated in the local language, the researcher should assess the reliability and validity of the translated questionnaire, and these values should be presented in the manuscript. The decision to use a self- or interviewer-administered, paper- or computer-based questionnaire depends on the nature of the questions, literacy levels of the target population, and resources. PMID:28584367
ERIC Educational Resources Information Center
Colbert, Peta; Wyatt-Smith, Claire; Klenowski, Val
2012-01-01
This article considers the conditions that are necessary at system and local levels for teacher assessment to be valid, reliable and rigorous. With sustainable assessment cultures as a goal, the article examines how education systems can support local-level efforts for quality learning and dependable teacher assessment. This is achieved through…
NASA Technical Reports Server (NTRS)
Chatzimavroudis, George P.; Spirka, Thomas A.; Setser, Randolph M.; Myers, Jerry G.
2004-01-01
One of NASA's objectives is to be able to perform a complete, pre-flight, evaluation of cardiovascular changes in astronauts scheduled for prolonged space missions. Computational fluid dynamics (CFD) has shown promise as a method for estimating cardiovascular function during reduced gravity conditions. For this purpose, MRI can provide geometrical information, to reconstruct vessel geometries, and measure all spatial velocity components, providing location specific boundary conditions. The objective of this study was to investigate the reliability of MRI-based model reconstruction and measured boundary conditions for CFD simulations. An aortic arch model and a carotid bifurcation model were scanned in a 1.5T Siemens MRI scanner. Axial MRI acquisitions provided images for geometry reconstruction (slice thickness 3 and 5 mm; pixel size 1x1 and 0.5x0.5 square millimeters). Velocity acquisitions provided measured inlet boundary conditions and localized three-directional steady-flow velocity data (0.7-3.0 L/min). The vessel walls were isolated using NIH provided software (ImageJ) and lofted to form the geometric surface. Constructed and idealized geometries were imported into a commercial CFD code for meshing and simulation. Contour and vector plots of the velocity showed identical features between the MRI velocity data, the MRI-based CFD data, and the idealized-geometry CFD data, with less than 10% differences in the local velocity values. CFD results on models reconstructed from different MRI resolution settings showed insignificant differences (less than 5%). This study illustrated, quantitatively, that reliable CFD simulations can be performed with MRI reconstructed models and gives evidence that a future, subject-specific, computational evaluation of the cardiovascular system alteration during space travel is feasible.
Klinck, Mary P.; Rialland, Pascale; Guillot, Martin; Moreau, Maxim; Frank, Diane; Troncy, Eric
2015-01-01
Simple Summary: Feline osteoarthritis (OA) is challenging to diagnose. A pain scale was developed for use by veterinarians, in association with their physical examination, and tested for reliability and validity. The scale items were: Interaction with the examiner, Exploration of the room, Body Posture, Gait, Body Condition, condition of Coat and Claws, and abnormal Findings or Cat Reaction upon joint Palpation. Expert review supported the scale content. Two studies using laboratory-housed cats found the most promising results for Gait and Body Posture, in terms of distinguishing between OA and non-OA cats, repeatability of results, and correlations with objectively measured kinetics (weight-bearing). Abstract: Subtle signs and conflicting physical and radiographic findings make feline osteoarthritis (OA) challenging to diagnose. A physical examination-based assessment was developed, consisting of eight items: Interaction, Exploration, Posture, Gait, Body Condition, Coat and Claws, (joint) Palpation–Findings, and Palpation–Cat Reaction. Content (experts) and face (veterinary students) validity were excellent. Construct validity, internal consistency, and intra- and inter-rater reliability were assessed via a pilot and main study, using laboratory-housed cats with and without OA. Gait distinguished OA status in the pilot (p = 0.05) study. In the main study, no scale item achieved statistically significant OA detection. Forelimb peak vertical ground reaction force (PVF) correlated inversely with Gait (Rhos = −0.38 (p = 0.03) to −0.41 (p = 0.02)). Body Posture correlated with Gait, and inversely with forelimb PVF at two of three time points (Rhos = −0.38 (p = 0.03) to −0.43 (p = 0.01)). Palpation (Findings, Cat Reaction) did not distinguish OA from non-OA cats. Palpation–Cat Reaction (Forelimbs) correlated inversely with forelimb PVF at two time points (Rhos = −0.41 (p = 0.02) to −0.41 (p = 0.01)), but scores were highly variable, and poorly reliable. Gait and Posture require improved sensitivity, and Palpation should be interpreted cautiously, in diagnosing feline OA. PMID:26633524
Janssen, Ellen M; Marshall, Deborah A; Hauber, A Brett; Bridges, John F P
2017-12-01
The recent endorsement of discrete-choice experiments (DCEs) and other stated-preference methods by regulatory and health technology assessment (HTA) agencies has placed a greater focus on demonstrating the validity and reliability of preference results. Areas covered: We present a practical overview of tests of validity and reliability that have been applied in the health DCE literature and explore other study qualities of DCEs. From the published literature, we identify a variety of methods to assess the validity and reliability of DCEs. We conceptualize these methods to create a conceptual model with four domains: measurement validity, measurement reliability, choice validity, and choice reliability. Each domain consists of three categories that can be assessed using one to four procedures (for a total of 24 tests). We present how these tests have been applied in the literature and direct readers to applications of these tests in the health DCE literature. Based on a stakeholder engagement exercise, we consider the importance of study characteristics beyond traditional concepts of validity and reliability. Expert commentary: We discuss study design considerations to assess the validity and reliability of a DCE, consider limitations to the current application of tests, and discuss future work to consider the quality of DCEs in healthcare.
Development and validation of the Survey of Organizational Research Climate (SORC).
Martinson, Brian C; Thrush, Carol R; Lauren Crain, A
2013-09-01
Development and targeting efforts by academic organizations to effectively promote research integrity can be enhanced if they are able to collect reliable data to benchmark baseline conditions, to assess areas needing improvement, and to subsequently assess the impact of specific initiatives. To date, no standardized and validated tool has existed to serve this need. A web- and mail-based survey was administered in the second half of 2009 to 2,837 randomly selected biomedical and social science faculty and postdoctoral fellows at 40 academic health centers in top-tier research universities in the United States. Measures included the Survey of Organizational Research Climate (SORC) as well as measures of perceptions of organizational justice. Exploratory and confirmatory factor analyses yielded seven subscales of organizational research climate, all of which demonstrated acceptable internal consistency (Cronbach's α ranging from 0.81 to 0.87) and adequate test-retest reliability (Pearson r ranging from 0.72 to 0.83). A broad range of correlations between the seven subscales and five measures of organizational justice (unadjusted regression coefficients ranging from 0.13 to 0.95) document both construct and discriminant validity of the instrument. The SORC demonstrates good internal (alpha) and external reliability (test-retest) as well as both construct and discriminant validity.
Development and Validation of the Survey of Organizational Research Climate (SORC)
Martinson, Brian C.; Thrush, Carol R.; Crain, A. Lauren
2012-01-01
Background: Development and targeting efforts by academic organizations to effectively promote research integrity can be enhanced if they are able to collect reliable data to benchmark baseline conditions, to assess areas needing improvement, and to subsequently assess the impact of specific initiatives. To date, no standardized and validated tool has existed to serve this need. Methods: A web- and mail-based survey was administered in the second half of 2009 to 2,837 randomly selected biomedical and social science faculty and postdoctoral fellows at 40 academic health centers in top-tier research universities in the United States. Measures included the Survey of Organizational Research Climate (SORC) as well as measures of perceptions of organizational justice. Results: Exploratory and confirmatory factor analyses yielded seven subscales of organizational research climate, all of which demonstrated acceptable internal consistency (Cronbach’s α ranging from 0.81 to 0.87) and adequate test-retest reliability (Pearson r ranging from 0.72 to 0.83). A broad range of correlations between the seven subscales and five measures of organizational justice (unadjusted regression coefficients ranging from .13 to .95) document both construct and discriminant validity of the instrument. Conclusions: The SORC demonstrates good internal (alpha) and external reliability (test-retest) as well as both construct and discriminant validity. PMID:23096775
Reliable and valid assessment of point-of-care ultrasonography.
Todsen, Tobias; Tolsgaard, Martin Grønnebæk; Olsen, Beth Härstedt; Henriksen, Birthe Merete; Hillingsø, Jens Georg; Konge, Lars; Jensen, Morten Lind; Ringsted, Charlotte
2015-02-01
To explore the reliability and validity of the Objective Structured Assessment of Ultrasound Skills (OSAUS) scale for point-of-care ultrasonography (POC US) performance. POC US is increasingly used by clinicians and is an essential part of the management of acute surgical conditions. However, the quality of performance is highly operator-dependent. Therefore, reliable and valid assessment of trainees' ultrasonography competence is needed to ensure patient safety. Twenty-four physicians, representing novices, intermediates, and experts in POC US, scanned 4 different surgical patient cases in a controlled set-up. All ultrasound examinations were video-recorded and assessed by 2 blinded radiologists using OSAUS. Reliability was examined using generalizability theory. Construct validity was examined by comparing performance scores between the groups and by correlating physicians' OSAUS scores with diagnostic accuracy. The generalizability coefficient was high (0.81) and a D-study demonstrated that 1 assessor and 5 cases would result in similar reliability. The construct validity of the OSAUS scale was supported by a significant difference in the mean scores between the novice group (17.0; SD 8.4) and the intermediate group (30.0; SD 10.1), P = 0.007, as well as between the intermediate group and the expert group (72.9; SD 4.4), P = 0.04, and by a high correlation between OSAUS scores and diagnostic accuracy (Spearman ρ correlation coefficient = 0.76; P < 0.001). This study demonstrates high reliability as well as evidence of construct validity of the OSAUS scale for assessment of POC US competence. Hence, the OSAUS scale may be suitable for both in-training as well as end-of-training assessment.
NASA Astrophysics Data System (ADS)
Streuber, Gregg Mitchell
Environmental and economic factors motivate the pursuit of more fuel-efficient aircraft designs. Aerodynamic shape optimization is a powerful tool in this effort, but is hampered by the presence of multimodality in many design spaces. Gradient-based multistart optimization uses a sampling algorithm and multiple parallel optimizations to reliably apply fast gradient-based optimization to moderately multimodal problems. Ensuring that the sampled geometries remain physically realizable requires manually developing specialized linear constraints for each class of problem. Utilizing free-form deformation geometry control allows these linear constraints to be written in a geometry-independent fashion, greatly easing the process of applying the algorithm to new problems. This algorithm was used to assess the presence of multimodality when optimizing a wing in subsonic and transonic flows, under inviscid and viscous conditions, and a blended wing-body under transonic, viscous conditions. Multimodality was present in every wing case, while the blended wing-body was found to be generally unimodal.
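The multistart idea in this abstract (run a fast gradient-based optimizer from many sampled starting points and count the distinct optima reached) can be illustrated on a toy problem. The sketch below is only an analogy: it uses evenly spaced starts and plain gradient descent on a one-dimensional quartic, all of which are assumptions chosen for brevity and unrelated to the sampling algorithm or aerodynamic solver used in the thesis.

```python
def grad_descent(grad, x0, lr=0.01, steps=5000):
    """Plain fixed-step gradient descent from a single starting point."""
    x = x0
    for _ in range(steps):
        x -= lr * grad(x)
    return x

def multistart(grad, starts, tol=1e-3):
    """Run one local optimization per start and keep the distinct optima."""
    optima = []
    for x0 in starts:
        x = grad_descent(grad, x0)
        # Treat results closer than tol as the same local optimum.
        if all(abs(x - y) > tol for y in optima):
            optima.append(x)
    return sorted(optima)

def grad_f(x):
    """Derivative of the toy objective f(x) = x**4 - 2*x**2 + 0.3*x."""
    return 4 * x**3 - 4 * x + 0.3

# Nine evenly spaced starting points on [-2, 2] (a stand-in for a sampler).
starts = [-2 + 0.5 * i for i in range(9)]
optima = multistart(grad_f, starts)
```

Finding more than one entry in `optima` indicates that this toy design space is multimodal; a single entry would suggest a unimodal one, at least for the region sampled.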
A Laboratory Study on the Reliability Estimations of the Mini-CEX
ERIC Educational Resources Information Center
de Lima, Alberto Alves; Conde, Diego; Costabel, Juan; Corso, Juan; Van der Vleuten, Cees
2013-01-01
Reliability estimations of workplace-based assessments with the mini-CEX are typically based on real-life data. Estimations are based on the assumption of local independence: the object of the measurement should not be influenced by the measurement itself and samples should be completely independent. This is difficult to achieve. Furthermore, the…
Reliable aerial thermography for energy conservation
NASA Technical Reports Server (NTRS)
Jack, J. R.; Bowman, R. L.
1981-01-01
A method for energy conservation, the aerial thermography survey, is discussed. It locates sources of energy losses and wasteful energy management practices. An operational map is presented for clear sky conditions. The map outlines the key environmental conditions conducive to obtaining reliable aerial thermography. The map is developed from defined visual and heat loss discrimination criteria which are quantized based on flat roof heat transfer calculations.
Lau, Nathan; Jamieson, Greg A; Skraaning, Gyrd
2016-03-01
The Process Overview Measure is a query-based measure developed to assess operator situation awareness (SA) from monitoring process plants. A companion paper describes how the measure has been developed according to process plant properties and operator cognitive work. The Process Overview Measure demonstrated practicality, sensitivity, validity and reliability in two full-scope simulator experiments investigating dramatically different operational concepts. Practicality was assessed based on qualitative feedback of participants and researchers. The Process Overview Measure demonstrated sensitivity and validity by revealing significant effects of experimental manipulations that corroborated with other empirical results. The measure also demonstrated adequate inter-rater reliability and practicality for measuring SA in full-scope simulator settings based on data collected on process experts. Thus, full-scope simulator studies can employ the Process Overview Measure to reveal the impact of new control room technology and operational concepts on monitoring process plants. Practitioner Summary: The Process Overview Measure is a query-based measure that demonstrated practicality, sensitivity, validity and reliability for assessing operator situation awareness (SA) from monitoring process plants in representative settings.
NASA Astrophysics Data System (ADS)
Akhmetova, I. G.; Chichirova, N. D.
2016-12-01
Heat supply is the most energy-consuming sector of the economy. Approximately 30% of all primary fuel-and-energy resources used are spent on municipal heat-supply needs. One of the key indicators of the activity of heat-supply organizations is the reliability of an energy facility. The reliability index of a heat-supply organization is of interest to potential investors for assessing risks when investing in projects. The reliability indices established by federal legislation are in practice reduced to a single numerical factor, which depends on the number of heat-supply outages caused by disturbances in the operation of heat networks and on the volume of their resource recovery in the calculation year. This factor is rather subjective and may vary over a wide range across several years. A technique is proposed for evaluating the reliability of heat-supply organizations using the simple additive weighting (SAW) method. The technique for determining the integrated index satisfies the following conditions: the reliability level of the evaluated heat-supply system is represented as fully and objectively as possible; and the information used for evaluating the reliability index is easily available (it is published on the Internet in accordance with data-disclosure standards). For reliability estimation of heat-supply organizations, the following indicators were selected: the wear of equipment of thermal energy sources, the wear of heat networks, the number of outages in the supply of thermal energy (heat carrier) due to technological disturbances on heat networks per 1 km of heat networks, the number of outages in the supply of thermal energy (heat carrier) due to technological disturbances on thermal energy sources per 1 Gcal/h of installed power, the share of expenditures in the cost of thermal energy aimed at recovery of the resource (renewal of fixed assets), the coefficient of renewal of fixed assets, and the coefficient of fixed-asset retirement.
A versatile program was developed, and the analysis of heat-supply organizations was performed using the Republic of Tatarstan as an example. The assessment system is based on constructing comparative ratings of heat-supply organizations. A rating is an assessment of the reliability of an organization; it is characterized by a numerical value and makes it possible to compare organizations engaged in the same kind of activity with one another.
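A simple additive weighting rating of the kind described can be sketched as a weighted sum of normalized indicators. The example below is a generic illustration: min-max normalization, the equal weights, the organization names and the indicator values are all assumptions for demonstration, not the normalization, weights or data used in the study.

```python
def saw_scores(table, weights, benefit):
    """Simple additive weighting (SAW) scores.

    table:   {organization: [indicator values]}
    weights: one weight per indicator, summing to 1
    benefit: per indicator, True if larger values are better, False if smaller
    Each indicator is min-max normalized to [0, 1] so that 1 is always best.
    """
    n = len(weights)
    columns = [[row[j] for row in table.values()] for j in range(n)]
    scores = {}
    for org, row in table.items():
        total = 0.0
        for j in range(n):
            lo, hi = min(columns[j]), max(columns[j])
            if hi == lo:
                norm = 1.0  # indicator does not discriminate between organizations
            elif benefit[j]:
                norm = (row[j] - lo) / (hi - lo)
            else:
                norm = (hi - row[j]) / (hi - lo)
            total += weights[j] * norm
        scores[org] = total
    return scores

# Hypothetical data: [network wear in %, outages per km of network].
organizations = {"Org A": [20.0, 0.1], "Org B": [60.0, 0.5]}
ratings = saw_scores(organizations, weights=[0.5, 0.5], benefit=[False, False])
ranking = sorted(ratings, key=ratings.get, reverse=True)
```

Sorting by score gives a comparative rating; here lower wear and fewer outages are treated as better, so the first organization ranks highest.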
Rosen, Jules; Mulsant, Benoit H; Marino, Patricia; Groening, Christopher; Young, Robert C; Fox, Debra
2008-10-30
Despite the importance of establishing shared scoring conventions and assessing interrater reliability in clinical trials in psychiatry, these elements are often overlooked. Obstacles to rater training and reliability testing include logistic difficulties in providing live training sessions, or mailing videotapes of patients to multiple sites and collecting the data for analysis. To address some of these obstacles, a web-based interactive video system was developed. It uses actors of diverse ages, gender and race to train raters how to score the Hamilton Depression Rating Scale and to assess interrater reliability. This system was tested with a group of experienced and novice raters within a single site. It was subsequently used to train raters of a federally funded multi-center clinical trial on scoring conventions and to test their interrater reliability. The advantages and limitations of using interactive video technology to improve the quality of clinical trials are discussed.
Bhattacharyya, Agnihotri; Mukherjee, Shuvankar; Chatterjee, Chitra; Dasgupta, Samir
2013-04-01
Integrated Management of Neonatal and Childhood Illness (IMNCI) is already operational in many states of India, but only a limited number of studies in the Indian setting compare its validity and reliability with the decisions of pediatricians. Aims: To assess the validity and reliability of the IMNCI algorithm against the provisional diagnosis of senior pediatricians for each IMNCI classification. The present study included all young infants aged 0-2 months who presented during the study period with a fresh episode of illness, in order to test the validity and reliability of the algorithm in comparison with the provisional diagnoses of senior pediatricians. The study was done in a tertiary care hospital. Validity characteristics such as sensitivity, specificity, positive predictive value, and negative predictive value, and reliability characteristics such as percent agreement and Kappa, were assessed for individual IMNCI classifications. The sensitivities of possible serious bacterial infection, local bacterial infection, jaundice, no dehydration, and possible serious bacterial infection (not able to feed) were 88.89, 14.29, 66.67, 25 and 44.44%, respectively. The specificities for the same conditions were 71.72, 99.09, 99.07, 94.50 and 86.87%. Percent agreements for the same conditions were 74, 94, 97, 90 and 80%, respectively, and the Kappa values were 0.38, 0.20, 0.73, 0.19 and 0.29, respectively. It could be concluded that IMNCI is quite a sensitive strategy and could identify severe illnesses of young infants requiring referral to a higher facility. Further studies, particularly in primary health care settings, are required.
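The validity and reliability measures named in this abstract all follow from a 2x2 table comparing the algorithm with the reference diagnosis. The sketch below computes them for one classification; the function name and the counts are hypothetical and are not taken from the study.

```python
def diagnostic_metrics(tp, fp, fn, tn):
    """Validity and agreement measures from a 2x2 table.

    tp/fp/fn/tn: counts of true positives, false positives, false negatives
    and true negatives of the test against the reference diagnosis.
    """
    n = tp + fp + fn + tn
    agreement = (tp + tn) / n
    # Agreement expected by chance, from the marginal totals (Cohen's kappa).
    expected = ((tp + fp) / n) * ((tp + fn) / n) + ((fn + tn) / n) * ((fp + tn) / n)
    return {
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
        "ppv": tp / (tp + fp),
        "npv": tn / (tn + fn),
        "percent_agreement": 100 * agreement,
        "kappa": (agreement - expected) / (1 - expected),
    }

# Hypothetical counts for a single classification.
metrics = diagnostic_metrics(tp=40, fp=10, fn=10, tn=40)
```

Because kappa discounts chance agreement, a classification can show high percent agreement and still have a low kappa when one category dominates, which is consistent with the pattern of values reported above.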
González-Ortiz, Ailema Janeth; Arce-Santander, Celene Viridiana; Vega-Vega, Olynka; Correa-Rotter, Ricardo; Espinosa-Cuevas, María de Los Angeles
2014-10-04
The protein-energy wasting syndrome (PEW) is a condition of malnutrition, inflammation, anorexia and wasting of body reserves resulting from inflammatory and non-inflammatory conditions in patients with chronic kidney disease (CKD). One way of assessing PEW, extensively described in the literature, is using the Malnutrition Inflammation Score (MIS). To assess the reliability and consistency of MIS for diagnosis of PEW in Mexican adults with CKD on hemodialysis (HD). Study of diagnostic tests. A sample of 45 adults with CKD on HD were analyzed during the period June-July 2014. The instrument was applied on 2 occasions; the test-retest reliability was calculated using the Intraclass Correlation Coefficient (ICC); the internal consistency of the questionnaire was analyzed using Cronbach's α coefficient. A weighted Kappa test was used to estimate the validity of the instrument; the result was subsequently compared with the Bilbrey nutritional index (BNI). The reliability of the questionnaires, evaluated in the patient sample, was ICC = 0.829. The agreement between MIS observations was considered adequate, k = 0.585 (p < 0.001); when comparing it with BNI, a value of k = 0.114 was obtained (p < 0.001). In order to estimate the tendency, a correlation test was performed. The r² correlation coefficient was 0.488 (p < 0.001). MIS has adequate reliability and validity for diagnosing PEW in the population with chronic kidney disease on HD. Copyright AULA MEDICA EDICIONES 2014. Published by AULA MEDICA. All rights reserved.
Validity and Reliability of Accelerometers in Patients With COPD: A SYSTEMATIC REVIEW.
Gore, Shweta; Blackwood, Jennifer; Guyette, Mary; Alsalaheen, Bara
2018-05-01
Reduced physical activity is associated with poor prognosis in chronic obstructive pulmonary disease (COPD). Accelerometers have greatly improved quantification of physical activity by providing information on step counts, body positions, energy expenditure, and magnitude of force. The purpose of this systematic review was to compare the validity and reliability of accelerometers used in patients with COPD. An electronic database search of MEDLINE and CINAHL was performed. Study quality was assessed with the Strengthening the Reporting of Observational Studies in Epidemiology checklist while methodological quality was assessed using the modified Quality Appraisal Tool for Reliability Studies. The search yielded 5392 studies; 25 met inclusion criteria. The SenseWear Pro armband reported high criterion validity under controlled conditions (r = 0.75-0.93) and high reliability (ICC = 0.84-0.86) for step counts. The DynaPort MiniMod demonstrated highest concurrent validity for step count using both video and manual methods. Validity of the SenseWear Pro armband varied between studies especially in free-living conditions, slower walking speeds, and with addition of weights during gait. A high degree of variability was found in the outcomes used and statistical analyses performed between studies, indicating a need for further studies to measure reliability and validity of accelerometers in COPD. The SenseWear Pro armband is the most commonly used accelerometer in COPD, but measurement properties are limited by gait speed variability and assistive device use. DynaPort MiniMod and Stepwatch accelerometers demonstrated high validity in patients with COPD but lack reliability data.
Rollover risk prediction of heavy vehicles by reliability index and empirical modelling
NASA Astrophysics Data System (ADS)
Sellami, Yamine; Imine, Hocine; Boubezoul, Abderrahmane; Cadiou, Jean-Charles
2018-03-01
This paper focuses on a combination of a reliability-based approach and an empirical modelling approach for rollover risk assessment of heavy vehicles. A reliability-based warning system is developed to alert the driver to a potential rollover before entering a bend. The idea behind the proposed methodology is to estimate the rollover risk by the probability that the vehicle load transfer ratio (LTR) exceeds a critical threshold. Accordingly, a so-called reliability index may be used as a measure to assess the safe functioning of the vehicle. In the reliability method, computing the maximum of the LTR requires predicting the vehicle dynamics over the bend, which can in some cases be intractable or time-consuming. With the aim of improving the reliability computation time, an empirical model is developed to substitute for the vehicle dynamics and rollover models. This is done by using the SVM (Support Vector Machines) algorithm. The preliminary results demonstrate the effectiveness of the proposed approach.
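The link between a reliability index and the probability that the LTR exceeds its threshold can be illustrated with a first-order sketch. It assumes, purely for illustration, that the maximum LTR over the bend is normally distributed and that the LTR is the usual normalised difference of left and right vertical tyre loads; the abstract states neither, and the function names and numbers below are hypothetical.

```python
from statistics import NormalDist


def load_transfer_ratio(fz_left: float, fz_right: float) -> float:
    """Assumed LTR definition: 0 for equal loads, +/-1 at wheel lift-off."""
    return (fz_right - fz_left) / (fz_right + fz_left)


def reliability_index(mean_ltr: float, std_ltr: float, critical_ltr: float) -> float:
    """Distance of the mean LTR from the threshold, in standard deviations."""
    return (critical_ltr - mean_ltr) / std_ltr


def rollover_probability(beta: float) -> float:
    """P(LTR > threshold) under the normality assumption: Phi(-beta)."""
    return NormalDist().cdf(-beta)


# Hypothetical values: predicted maximum LTR of 0.6 +/- 0.1, threshold 0.9.
beta = reliability_index(mean_ltr=0.6, std_ltr=0.1, critical_ltr=0.9)
print(f"beta = {beta:.2f}, P(rollover) = {rollover_probability(beta):.5f}")
```

A larger index means the predicted LTR sits further below the threshold, so a warning system could trigger whenever the index drops below a chosen value. Predicting the mean and spread of the maximum LTR is the expensive step that the abstract proposes to replace with an SVM model.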
Graph-based inductive reasoning.
Boumans, Marcel
2016-10-01
This article discusses methods of inductive inference that are methods of visualization designed in such a way that the "eye" can be employed as a reliable tool for judgment. The term "eye" is used as a stand-in for visual cognition and perceptual processing. In this paper "meaningfulness" has a particular meaning, namely accuracy, which is closeness to truth. Accuracy consists of precision and unbiasedness. Precision is dealt with by statistical methods, but for unbiasedness one needs expert judgment. The common view at the beginning of the twentieth century was to make the most efficient use of this kind of judgment by representing the data in shapes and forms in such a way that the "eye" can function as a reliable judge to reduce bias. The judgment of the "eye" is even more necessary when the background conditions of the observations are heterogeneous. Statistical procedures require a certain minimal level of homogeneity, but the "eye" does not. The "eye" is an adequate tool for assessing topological similarities when, due to heterogeneity of the data, metric assessment is not possible. In fact, graphical assessment precedes measurement, or to put it more forcefully, the graphic method is a necessary prerequisite for measurement. Copyright © 2016 Elsevier Ltd. All rights reserved.
A low-cost, tablet-based option for prehospital neurologic assessment: The iTREAT Study.
Chapman Smith, Sherita N; Govindarajan, Prasanthi; Padrick, Matthew M; Lippman, Jason M; McMurry, Timothy L; Resler, Brian L; Keenan, Kevin; Gunnell, Brian S; Mehndiratta, Prachi; Chee, Christina Y; Cahill, Elizabeth A; Dietiker, Cameron; Cattell-Gordon, David C; Smith, Wade S; Perina, Debra G; Solenski, Nina J; Worrall, Bradford B; Southerland, Andrew M
2016-07-05
In this 2-center study, we assessed the technical feasibility and reliability of a low cost, tablet-based mobile telestroke option for ambulance transport and hypothesized that the NIH Stroke Scale (NIHSS) could be performed with similar reliability between remote and bedside examinations. We piloted our mobile telemedicine system in 2 geographic regions, central Virginia and the San Francisco Bay Area, utilizing commercial cellular networks for videoconferencing transmission. Standardized patients portrayed scripted stroke scenarios during ambulance transport and were evaluated by independent raters comparing bedside to remote mobile telestroke assessments. We used a mixed-effects regression model to determine intraclass correlation of the NIHSS between bedside and remote examinations (95% confidence interval). We conducted 27 ambulance runs at both sites and successfully completed the NIHSS for all prehospital assessments without prohibitive technical interruption. The mean difference between bedside (face-to-face) and remote (video) NIHSS scores was 0.25 (1.00 to -0.50). Overall, correlation of the NIHSS between bedside and mobile telestroke assessments was 0.96 (0.92-0.98). In the mixed-effects regression model, there were no statistically significant differences accounting for method of evaluation or differences between sites. Utilizing a low-cost, tablet-based platform and commercial cellular networks, we can reliably perform prehospital neurologic assessments in both rural and urban settings. Further research is needed to establish the reliability and validity of prehospital mobile telestroke assessment in live patients presenting with acute neurologic symptoms. © 2016 American Academy of Neurology.
A low-cost, tablet-based option for prehospital neurologic assessment
Chapman Smith, Sherita N.; Govindarajan, Prasanthi; Padrick, Matthew M.; Lippman, Jason M.; McMurry, Timothy L.; Resler, Brian L.; Keenan, Kevin; Gunnell, Brian S.; Mehndiratta, Prachi; Chee, Christina Y.; Cahill, Elizabeth A.; Dietiker, Cameron; Cattell-Gordon, David C.; Smith, Wade S.; Perina, Debra G.; Solenski, Nina J.; Worrall, Bradford B.
2016-01-01
Objectives: In this 2-center study, we assessed the technical feasibility and reliability of a low cost, tablet-based mobile telestroke option for ambulance transport and hypothesized that the NIH Stroke Scale (NIHSS) could be performed with similar reliability between remote and bedside examinations. Methods: We piloted our mobile telemedicine system in 2 geographic regions, central Virginia and the San Francisco Bay Area, utilizing commercial cellular networks for videoconferencing transmission. Standardized patients portrayed scripted stroke scenarios during ambulance transport and were evaluated by independent raters comparing bedside to remote mobile telestroke assessments. We used a mixed-effects regression model to determine intraclass correlation of the NIHSS between bedside and remote examinations (95% confidence interval). Results: We conducted 27 ambulance runs at both sites and successfully completed the NIHSS for all prehospital assessments without prohibitive technical interruption. The mean difference between bedside (face-to-face) and remote (video) NIHSS scores was 0.25 (1.00 to −0.50). Overall, correlation of the NIHSS between bedside and mobile telestroke assessments was 0.96 (0.92–0.98). In the mixed-effects regression model, there were no statistically significant differences accounting for method of evaluation or differences between sites. Conclusions: Utilizing a low-cost, tablet-based platform and commercial cellular networks, we can reliably perform prehospital neurologic assessments in both rural and urban settings. Further research is needed to establish the reliability and validity of prehospital mobile telestroke assessment in live patients presenting with acute neurologic symptoms. PMID:27281534
Electronic Quality of Life Assessment Using Computer-Adaptive Testing
2016-01-01
Background: Quality of life (QoL) questionnaires are desirable for clinical practice but can be time-consuming to administer and interpret, making their widespread adoption difficult. Objective: Our aim was to assess the performance of the World Health Organization Quality of Life (WHOQOL)-100 questionnaire as four item banks to facilitate adaptive testing using simulated computer adaptive tests (CATs) for physical, psychological, social, and environmental QoL. Methods: We used data from the UK WHOQOL-100 questionnaire (N=320) to calibrate item banks using item response theory, which included psychometric assessments of differential item functioning, local dependency, unidimensionality, and reliability. We simulated CATs to assess the number of items administered before prespecified levels of reliability were met. Results: The item banks (40 items) all displayed good model fit (P>.01) and were unidimensional (fewer than 5% of t tests significant), reliable (Person Separation Index>.70), and free from differential item functioning (no significant analysis of variance interaction) or local dependency (residual correlations < +.20). When matched for reliability, the item banks were between 45% and 75% shorter than paper-based WHOQOL measures. Across the four domains, a high standard of reliability (alpha>.90) could be gained with a median of 9 items. Conclusions: Using CAT, simulated assessments were as reliable as paper-based forms of the WHOQOL with a fraction of the number of items. These properties suggest that these item banks are suitable for computerized adaptive assessment. These item banks have the potential for international development using existing alternative language versions of the WHOQOL items. PMID:27694100
Hu, Kun; Peng, C K; Czosnyka, Marek; Zhao, Peng; Novak, Vera
2008-03-01
Cerebral autoregulation (CA) is a most important mechanism responsible for the relatively constant blood flow supply to the brain when cerebral perfusion pressure varies. Its assessment in nonacute cases has relied on the quantification of the relationship between noninvasive beat-to-beat blood pressure (BP) and blood flow velocity (BFV). To overcome the nonstationary nature of physiological signals such as BP and BFV, a computational method called multimodal pressure-flow (MMPF) analysis was recently developed to study the nonlinear BP-BFV relationship during the Valsalva maneuver (VM). The present study aimed to determine (i) whether this method can estimate autoregulation from spontaneous BP and BFV fluctuations during baseline rest conditions; (ii) whether there is any difference between the MMPF measures of autoregulation based on intra-arterial BP (ABP) and based on cerebral perfusion pressure (CPP); and (iii) whether the MMPF method provides a reproducible and reliable measure for noninvasive assessment of autoregulation. To achieve these aims, we analyzed data from existing databases including: (i) ABP and BFV of 12 healthy control, 10 hypertensive, and 10 stroke subjects during baseline resting conditions and during the Valsalva maneuver, and (ii) ABP, CPP, and BFV of 30 patients with traumatic brain injury (TBI) who were being paralyzed, sedated, and ventilated. We showed that autoregulation in healthy control subjects can be characterized by specific phase shifts between BP and BFV oscillations during the Valsalva maneuver, and the BP-BFV phase shifts were reduced in hypertensive and stroke subjects (P < 0.01), indicating impaired autoregulation. Similar results were found during baseline condition from spontaneous BP and BFV oscillations. The BP-BFV phase shifts obtained during baseline and during VM were highly correlated (R > 0.8, P < 0.0001), showing no statistical difference (paired t test P > 0.47).
In TBI patients there were strong correlations between phases of ABP and CPP oscillations (R = 0.99, P < 0.0001) and, thus, between ABP-BFV and CPP-BFV phase shifts (P < 0.0001, R = 0.76). By repeating the MMPF 4 times on data of TBI subjects, each time on a selected cycle of spontaneous BP and BFV oscillations, we showed that MMPF had better reproducibility than the traditional autoregulation index. These results indicate that the MMPF method, based on instantaneous phase relationships between cerebral blood flow velocity and peripheral blood pressure, has better performance than the traditional standard method, and can reliably assess cerebral autoregulation dynamics from ambulatory blood pressure and cerebral blood flow during supine rest conditions.
Peng, C. K.; Czosnyka, Marek; Zhao, Peng
2009-01-01
Cerebral autoregulation (CA) is a most important mechanism responsible for the relatively constant blood flow supply to the brain when cerebral perfusion pressure varies. Its assessment in nonacute cases has relied on the quantification of the relationship between noninvasive beat-to-beat blood pressure (BP) and blood flow velocity (BFV). To overcome the nonstationary nature of physiological signals such as BP and BFV, a computational method called multimodal pressure-flow (MMPF) analysis was recently developed to study the nonlinear BP–BFV relationship during the Valsalva maneuver (VM). The present study aimed to determine (i) whether this method can estimate autoregulation from spontaneous BP and BFV fluctuations during baseline rest conditions; (ii) whether there is any difference between the MMPF measures of autoregulation based on intra-arterial BP (ABP) and based on cerebral perfusion pressure (CPP); and (iii) whether the MMPF method provides a reproducible and reliable measure for noninvasive assessment of autoregulation. To achieve these aims, we analyzed data from existing databases including: (i) ABP and BFV of 12 healthy control, 10 hypertensive, and 10 stroke subjects during baseline resting conditions and during the Valsalva maneuver, and (ii) ABP, CPP, and BFV of 30 patients with traumatic brain injury (TBI) who were being paralyzed, sedated, and ventilated. We showed that autoregulation in healthy control subjects can be characterized by specific phase shifts between BP and BFV oscillations during the Valsalva maneuver, and the BP–BFV phase shifts were reduced in hypertensive and stroke subjects (P < 0.01), indicating impaired autoregulation. Similar results were found during baseline condition from spontaneous BP and BFV oscillations. The BP–BFV phase shifts obtained during baseline and during VM were highly correlated (R > 0.8, P < 0.0001), showing no statistical difference (paired t test P > 0.47).
In TBI patients there were strong correlations between phases of ABP and CPP oscillations (R = 0.99, P < 0.0001) and, thus, between ABP–BFV and CPP–BFV phase shifts (P < 0.0001, R = 0.76). By repeating the MMPF 4 times on data of TBI subjects, each time on a selected cycle of spontaneous BP and BFV oscillations, we showed that MMPF had better reproducibility than the traditional autoregulation index. These results indicate that the MMPF method, based on instantaneous phase relationships between cerebral blood flow velocity and peripheral blood pressure, has better performance than the traditional standard method, and can reliably assess cerebral autoregulation dynamics from ambulatory blood pressure and cerebral blood flow during supine rest conditions. PMID:18080758
Pac, A; Oruba, Z; Olszewska-Czyż, I; Chomyszyn-Gajewska, M
2014-03-01
The individual evaluation of patients' motivation should be introduced to the protocol of periodontal treatment, as it could impact positively on effective treatment planning and treatment outcomes. However, a standardised tool measuring the extent of periodontal patients' motivation has not yet been proposed in the literature. Thus, the objective of the present study was to determine the validity and reliability of the Zychlińscy motivation scale adjusted to the needs of periodontology. Cross sectional study. Department of Periodontology and Oral Medicine, Dental University Clinic, Jagiellonian University, Krakow, Poland. 199 adult periodontal patients, aged 20-78. 14-item questionnaire. The items were adopted from the original Zychlińscy motivation assessment scale. Validity and reliability of the proposed motivation assessment instrument. The assessed Cronbach's alpha of 0.79 indicates the scale is a reliable tool. Principal component analysis revealed a model with three factors, which explained half of the total variance. Those factors represented: the patient's attitude towards treatment and oral hygiene practice; previous experiences during treatment; and the influence of external conditions on the patient's attitude towards treatment. The proposed scale proved to be a reliable and accurate tool for the evaluation of periodontal patients' motivation.
Alkon, Abbey; Rose, Roberta; Wolff, Mimi; Kotch, Jonathan B; Aronson, Susan S
2016-01-01
The project aims were to (1) develop an observational Health and Safety Checklist to assess health and safety practices and conditions in early care and education (ECE) programs using Stepping Stones To Caring For Our Children, 3rd Edition national standards, (2) pilot test the Checklist, completed by nurse child care health consultants, to assess feasibility, ease of completion, objectivity, validity, and reliability, and (3) revise the Checklist based on the qualitative and quantitative results of the pilot study. The observable national health and safety standards were identified and then rated by health, safety, and child care experts using a Delphi technique to validate the standards as essential to prevent harm and promote health. Then, child care health consultants recruited ECE centers and pilot tested the 124-item Checklist. The pilot study was conducted in Arizona, California and North Carolina. The psychometric properties of the Checklist were assessed. The 37 participating ECE centers had 2627 children from ethnically-diverse backgrounds and primarily low-income families. The child care health consultants found the Checklist easy to complete, objective, and useful for planning health and safety interventions. The Checklist had content and face validity, inter-rater reliability, internal consistency, and concurrent validity. Based on the child care health consultant feedback and psychometric properties of the Checklist, the Checklist was revised and re-written at an 8th grade literacy level. The Health and Safety Checklist provides a standardized instrument of observable, selected national standards to assess the quality of health and safety in ECE centers.
Comparison of Knee and Ankle Dynamometry between NASA's X1 Exoskeleton and Biodex System 4
NASA Technical Reports Server (NTRS)
English, K. L.; Newby, N. J.; Hackney, K. J.; DeWitt, J. K.; Beck, C. E.; Rovekamp, R. N.; Rea, R. L.; Ploutz-Snyder, L. L.
2014-01-01
Pre- and post-flight dynamometry is performed on International Space Station crewmembers to characterize microgravity-induced strength changes. Strength is not assessed in flight due to hardware limitations and there is poor understanding of the time course of in-flight changes. PURPOSE: To assess the reliability of a prototype dynamometer, the X1 Exoskeleton (EXO) and its agreement with a Biodex System 4 (BIO). METHODS: Eight subjects (4 M/4 F) completed 2 counterbalanced testing sessions of knee extension/flexion (KE/KF), 1 with BIO and 1 with EXO, with repeated measures within each session in normal gravity. Test-retest reliability (test 1 and 2) and device agreement (BIO vs. EXO) were evaluated. Later, to assess device agreement for ankle plantarflexion (PF), 10 subjects (4 M/6 F) completed 3 test conditions (BIO, EXO, and BIOEXO); BIOEXO was a hybrid condition comprised of the Biodex dynamometer motor and the X1 footplate and ankle frame. Ankle comparisons were: BIO vs. BIOEXO (footplate differences), BIOEXO vs. EXO (motor differences), and BIO vs. EXO (all differences). Reliability for KE/KF was determined by intraclass correlation (ICC). Device agreement was assessed with: 1) repeated measures ANOVA, 2) a measure of concordance (rho), and 3) average difference. RESULTS: ICCs for KE/KF were 0.99 for BIO and 0.96 to 0.99 for EXO. Agreement was high for KE (concordance: 0.86 to 0.95; average differences: -7 to +9 Nm) and low to moderate for KF (concordance: 0.64 to 0.78; average differences: -4 to -29 Nm, P<0.05). BIO vs. BIOEXO PF concordance ranged from 0.89 to 0.92 and mean differences ranged from -9 to +3 Nm (BIO < BIOEXO). BIOEXO vs. EXO PF concordance ranged from 0.73 to 0.80 while mean differences were -18 to -36 Nm (BIOEXO < EXO, P<0.05). PF concordance for BIO vs. EXO was slightly lower (0.61 to 0.84) and mean differences were greater (-27 to -33 Nm; BIO < EXO, P<0.05). CONCLUSION: BIO and EXO were similarly reliable for KE and KF. 
KE measures produced high agreement between devices; KF did not. For ankle PF, torque differences due to the two footplates were small. However, the X1 motor reports greater torques than the Biodex motor during PF. This first prototype provides proof of concept for a reliable, robotic-based exoskeleton to perform portable dynamometry for large muscle groups of the lower body.
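The abstract reports device agreement as "a measure of concordance (rho)" without naming the statistic. One common choice for agreement between two measurement devices is Lin's concordance correlation coefficient, which penalises both scatter and systematic offset. The sketch below assumes that statistic for illustration only; the function name and the torque values are hypothetical.

```python
from statistics import fmean


def concordance_ccc(x, y):
    """Lin's concordance correlation coefficient (population moments)."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must be paired and have at least 2 values")
    mx, my = fmean(x), fmean(y)
    n = len(x)
    var_x = sum((a - mx) ** 2 for a in x) / n
    var_y = sum((b - my) ** 2 for b in y) / n
    cov_xy = sum((a - mx) * (b - my) for a, b in zip(x, y)) / n
    # The squared mean difference lowers the coefficient when one device
    # reads systematically higher, even if the readings correlate perfectly.
    return 2 * cov_xy / (var_x + var_y + (mx - my) ** 2)


# Hypothetical peak torques in Nm from two devices (not study data).
device_a = [120.0, 135.0, 150.0, 162.0, 178.0]
device_b = [131.0, 149.0, 160.0, 175.0, 190.0]
print(round(concordance_ccc(device_a, device_b), 3))
```

Unlike a Pearson correlation, this coefficient drops below 1 whenever one device reads consistently higher than the other, which matches the pattern the abstract describes for the X1 motor during plantarflexion.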
How accelerated biological aging can affect solar reflective polymeric based building materials
NASA Astrophysics Data System (ADS)
Ferrari, C.; Santunione, G.; Libbra, A.; Muscio, A.; Sgarbi, E.
2017-11-01
Among the main issues concerning building materials, in particular outdoor ones, one can identify the colonization by microorganisms referred to as biological aggression. This can affect not only the aesthetic appearance but also the thermal performance of solar reflective materials. In order to improve the reliability of tests aimed at assessing the resistance to biological aggression and at the same time reduce the test duration, an accelerated test method has been developed. It is based on a lab-reproducible setup where specific and controlled environmental and boundary conditions are imposed to accelerate biological growth on building materials as much as possible. Due to their widespread use, polymeric materials have been selected for the present analysis, with the aim of reaching an advanced bio-aged level in a relatively short time (8 weeks or less) and at the same time comparatively evaluating different materials under a given set of ageing conditions. Surface properties before, during and after ageing have been investigated by surface, microstructural and chemical analyses, as well as by examination of time-progressive images to assess bacterial and algal growth rate.
What should students learn about complementary and alternative medicine?
Gaster, Barak; Unterborn, John N; Scott, Richard B; Schneeweiss, Ronald
2007-10-01
With thousands of complementary and alternative medicine (CAM) treatments currently being used in the United States today, it is challenging to design a concise body of CAM content which will fit into already overly full curricula for health care students. The purpose of this article is to outline key principles which 15 National Center for Complementary and Alternative Medicine-funded education programs found useful when developing CAM course-work and selecting CAM content. Three key guiding principles are discussed: teach foundational CAM competencies to give students a framework for learning about CAM; choose specific content on the basis of evidence, demographics and condition (what conditions are most appropriate for CAM therapies?); and finally, provide students with skills for future learning, including where to find reliable information about CAM and how to search the scientific literature and assess the results of CAM research. Most of the programs developed evidence-based guides to help students find reliable CAM resources. The cumulative experiences of the 15 programs have been compiled, and an annotated table outlining the most highly recommended resources about CAM is presented.
Gimmon, Yoav; Jacob, Grinshpon; Lenoble-Hoskovec, Constanze; Büla, Christophe; Melzer, Itshak
2013-01-01
Decline in gait stability has been associated with increased fall risk in older adults. Reliable and clinically feasible methods of gait instability assessment are needed. This study evaluated the relative and absolute reliability and concurrent validity of the testing procedure of the clinical version of the Narrow Path Walking Test (NPWT) under single task (ST) and dual task (DT) conditions. Thirty independent community-dwelling older adults (65-87 years) were tested twice. Participants were instructed to walk within the 6-m narrow path without stepping out. Trial time, number of steps, trial velocity, number of step errors, and number of cognitive task errors were determined. Intraclass correlation coefficients (ICCs) were calculated as indices of agreement, and a graphic approach called "mountain plot" was applied to help interpret the direction and magnitude of disagreements between testing procedures. Smallest detectable change and smallest real difference (SRD) were computed to determine clinically relevant improvement at group and individual levels, respectively. Concurrent validity was assessed using Performance Oriented Mobility Assessment Tool (POMA) and the Short Physical Performance Battery (SPPB). Test-retest agreement (ICC1,2) varied from 0.77 to 0.92 in ST and from 0.78 to 0.92 in DT conditions, with no apparent systematic differences between testing procedures demonstrated by the mountain plot graphs. Smallest detectable change and smallest real change were small for motor task performance and larger for cognitive errors. Significant correlations were observed for trial velocity and trial time with POMA and SPPB. The present results indicate that the NPWT testing procedure is highly reliable and reproducible. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.
Hunter, Susan W; Frengopoulos, Courtney; Holmes, Jeff; Viana, Ricardo; Payne, Michael W
2018-04-01
To determine the relative and absolute reliability of a dual-task functional mobility assessment. Cross-sectional study. Academic rehabilitation hospital. Individuals (N=60) with lower extremity amputation attending an outpatient amputee clinic (mean age, 58.21±12.59y; 18, 80% male) who were stratified into 3 groups: (1) transtibial amputation of vascular etiology (n=20); (2) transtibial amputation of nonvascular etiology (n=20); and (3) transfemoral or bilateral amputation of any etiology (n=20). Not applicable. Time to complete the L Test measured functional mobility under single- and dual-task conditions. The addition of a cognitive task (serial subtractions by 3's) created dual-task conditions. Single-task performance on the cognitive task was also reported. Intraclass correlation coefficients (ICCs) measured relative reliability; SEM and minimal detectable change with a 95% confidence interval (MDC95) measured absolute reliability. Bland-Altman plots measured agreement between assessments. Relative reliability results were excellent for all 3 groups. Values for the dual-task L Test for those with transtibial amputation of vascular etiology (n=20; mean age, 60.36±7.84y; 19, 90% men) were ICC=.98 (95% confidence interval [CI], .94-.99), SEM=1.36 seconds, and MDC95=3.76 seconds; for those with transtibial amputation of nonvascular etiology (n=20; mean age, 55.85±14.08y; 17, 85% men), values were ICC=.93 (95% CI, .80-.98), SEM=1.34 seconds, and MDC95=3.71 seconds; and for those with transfemoral or bilateral amputation (n=20; mean age, 58.21±14.88y; 13, 65% men), values were ICC=.998 (95% CI, .996-.999), SEM=1.03 seconds, and MDC95=2.85 seconds. Bland-Altman plots indicated that assessments did not vary systematically for each group. This dual-task assessment protocol achieved approved levels of relative reliability values for the 3 groups tested. 
This protocol may be used clinically or in research settings to assess the interaction between cognition and functional mobility in the population with lower extremity amputation. Copyright © 2018 American Congress of Rehabilitation Medicine. Published by Elsevier Inc. All rights reserved.
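The SEM and MDC95 values reported above are linked by a standard relationship: MDC95 = 1.96 × √2 × SEM, where √2 accounts for measurement error in both the test and the retest. The sketch below applies that formula to the three SEM values in the abstract. The function names are hypothetical, and the SEM-from-ICC formula is the conventional one, which the abstract does not state explicitly.

```python
import math


def sem_from_icc(sd: float, icc: float) -> float:
    """Conventional standard error of measurement: SD * sqrt(1 - ICC)."""
    return sd * math.sqrt(1.0 - icc)


def mdc95(sem: float) -> float:
    """Minimal detectable change at 95% confidence for a test-retest design."""
    return 1.96 * math.sqrt(2.0) * sem


# SEM values (seconds) as reported in the abstract for the three groups.
for group, sem in [("transtibial, vascular", 1.36),
                   ("transtibial, nonvascular", 1.34),
                   ("transfemoral or bilateral", 1.03)]:
    print(f"{group}: MDC95 = {mdc95(sem):.2f} s")
```

The results agree with the reported MDC95 values of 3.76, 3.71 and 2.85 seconds to within about 0.01 seconds, a gap that comes from the SEM values being rounded to two decimals. In practice, a change in dual-task L Test time smaller than the MDC95 cannot be distinguished from measurement error for an individual patient.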
NASA Astrophysics Data System (ADS)
Lo, Men-Tzung; Hu, Kun; Liu, Yanhui; Peng, C.-K.; Novak, Vera
2008-12-01
Quantification of nonlinear interactions between two nonstationary signals presents a computational challenge in different research fields, especially for assessments of physiological systems. Traditional approaches that are based on theories of stationary signals cannot resolve nonstationarity-related issues and, thus, cannot reliably assess nonlinear interactions in physiological systems. In this review we discuss a new technique called multimodal pressure flow (MMPF) method that utilizes Hilbert-Huang transformation to quantify interaction between nonstationary cerebral blood flow velocity (BFV) and blood pressure (BP) for the assessment of dynamic cerebral autoregulation (CA). CA is an important mechanism responsible for controlling cerebral blood flow in responses to fluctuations in systemic BP within a few heart-beats. The MMPF analysis decomposes BP and BFV signals into multiple empirical modes adaptively so that the fluctuations caused by a specific physiologic process can be represented in a corresponding empirical mode. Using this technique, we showed that dynamic CA can be characterized by specific phase delays between the decomposed BP and BFV oscillations, and that the phase shifts are significantly reduced in hypertensive, diabetics and stroke subjects with impaired CA. Additionally, the new technique can reliably assess CA using both induced BP/BFV oscillations during clinical tests and spontaneous BP/BFV fluctuations during resting conditions.
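The phase-shift measure at the heart of this method can be illustrated with the Hilbert step alone. The sketch below is not the MMPF implementation: it omits the empirical mode decomposition that isolates the relevant oscillation, uses a naive O(n²) discrete Fourier transform to stay within the standard library, and runs on synthetic sinusoids. All function names and signal parameters are hypothetical.

```python
import cmath
import math


def analytic_signal(x):
    """Analytic signal of a real sequence via a naive DFT (Hilbert transform)."""
    n = len(x)
    spectrum = [
        sum(x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        for k in range(n)
    ]
    # Keep DC (and Nyquist for even n), double positive frequencies,
    # and zero out negative frequencies.
    for k in range(1, n):
        if n % 2 == 0 and k == n // 2:
            continue
        spectrum[k] = 2 * spectrum[k] if k < n / 2 else 0
    return [
        sum(spectrum[k] * cmath.exp(2j * math.pi * k * t / n) for k in range(n)) / n
        for t in range(n)
    ]


def mean_phase_shift(bp, bfv):
    """Mean instantaneous phase of bfv minus that of bp, in radians."""
    z_bp = analytic_signal(bp)
    z_bfv = analytic_signal(bfv)
    # Amplitude-weighted circular mean, which avoids phase wrap-around issues.
    return cmath.phase(sum(b * a.conjugate() for a, b in zip(z_bp, z_bfv)))


# Synthetic example: BFV leads BP by 45 degrees over 4 full cycles.
n, cycles, lead = 64, 4, math.pi / 4
bp = [math.cos(2 * math.pi * cycles * t / n) for t in range(n)]
bfv = [math.cos(2 * math.pi * cycles * t / n + lead) for t in range(n)]
print(round(math.degrees(mean_phase_shift(bp, bfv)), 1))
```

On real recordings the two signals would first be decomposed into empirical modes, and the phase shift would be computed only on the mode that carries the oscillation of interest, as the abstract describes.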
Reliability Quantification of Advanced Stirling Convertor (ASC) Components
NASA Technical Reports Server (NTRS)
Shah, Ashwin R.; Korovaichuk, Igor; Zampino, Edward
2010-01-01
The Advanced Stirling Convertor (ASC) is intended to provide power for an unmanned planetary spacecraft and has an operational life requirement of 17 years. Over this 17-year mission, the ASC must provide power with desired performance and efficiency and require no corrective maintenance. Reliability demonstration testing for the ASC was found to be very limited due to schedule and resource constraints. Reliability demonstration must involve the application of analysis, system and component level testing, and simulation models, taken collectively. Therefore, computer simulation with limited test data verification is a viable approach to assess the reliability of ASC components. This approach is based on physics-of-failure mechanisms and involves the relationship among the design variables based on physics, mechanics, material behavior models, interaction of different components and their respective disciplines such as structures, materials, fluid, thermal, mechanical, electrical, etc. In addition, these models are based on the available test data, which can be updated, and analysis refined as more data and information become available. The failure mechanisms and causes of failure are included in the analysis, especially in light of the new information, in order to develop guidelines to improve design reliability and better operating controls to reduce the probability of failure. Quantified reliability assessment based on fundamental physical behavior of components and their relationship with other components has demonstrated itself to be a superior technique to conventional reliability approaches based on utilizing failure rates derived from similar equipment or simply expert judgment.
ERIC Educational Resources Information Center
Wang, Tianyou; And Others
M. J. Kolen, B. A. Hanson, and R. L. Brennan (1992) presented a procedure for assessing the conditional standard error of measurement (CSEM) of scale scores using a strong true-score model. They also investigated the ways of using nonlinear transformation from number-correct raw score to scale score to equalize the conditional standard error along…
Reliability, Validity and Utility of a Multiple Intelligences Assessment for Career Planning.
ERIC Educational Resources Information Center
Shearer, C. Branton
"The Multiple Intelligences Developmental Assessment Scales" (MIDAS) is a self- (or other-) completed instrument which is based upon the theory of multiple intelligences. The validity, reliability, and utility data regarding the MIDAS are reported here. The measure consists of 7 main scales and 24 subscales which summarize a person's intellectual…
NASA Applications and Lessons Learned in Reliability Engineering
NASA Technical Reports Server (NTRS)
Safie, Fayssal M.; Fuller, Raymond P.
2011-01-01
Since the Shuttle Challenger accident in 1986, communities across NASA have been developing and extensively using quantitative reliability and risk assessment methods in their decision-making process. This paper discusses several reliability engineering applications that NASA has used over the years to support the design, development, and operation of critical space flight hardware. Specifically, the paper discusses several reliability engineering applications used by NASA in areas such as risk management, inspection policies, component upgrades, reliability growth, integrated failure analysis, and physics-based probabilistic engineering analysis. In each of these areas, the paper provides a brief discussion of a case study to demonstrate the value added and the criticality of reliability engineering in supporting NASA project and program decisions to fly safely. The case studies discussed include reliability-based life limit extension of Space Shuttle Main Engine (SSME) hardware, reliability-based inspection policies for the Auxiliary Power Unit (APU) turbine disc, probabilistic structural engineering analysis for reliability prediction of the SSME alternate turbo-pump development, the impact of ET foam reliability on the Space Shuttle System risk, and reliability-based Space Shuttle upgrade for safety. Special attention is given in this paper to the physics-based probabilistic engineering analysis applications and their critical role in evaluating the reliability of NASA development hardware, including their potential use in a research and technology development environment.
Intersession reliability of self-selected and narrow stance balance testing in older adults.
Riemann, Bryan L; Piersol, Kelsey
2017-10-01
Despite the common practice of using force platforms to assess balance of older adults, few investigations have examined the reliability of postural screening tests in this population. We sought to determine the test-retest reliability of self-selected and narrow stance balance testing with eyes open and eyes closed in healthy older adults. Thirty older adults (>65 years) completed 45 s trials of eyes open and eyes closed stability tests using self-selected and narrow stances on two separate days (1.9 ± .7 days). Average medial-lateral center of pressure velocity was computed. The ICC results ranged from .74 to .86, and no significant systematic changes (P < .05) occurred between the testing sessions for any of the tests. The standard error of measurement ranged from 15.9 to 23.6%. Reliability estimates were similar between the two stances and visual conditions assessed. Slightly higher coefficients were identified for the self-selected stances compared to the narrow stances under both visual conditions; however, there were negligible differences between the sessions. The within subject session-to-session variability provides a basis for further research to consider differences between fallers and non-fallers. Reliability for eyes open and closed balance testing using self-selected and narrow stances in older adults was established which should provide a foundation for the development of fall risk screening tests.
NASA Astrophysics Data System (ADS)
Aghakouchak, Amir; Tourian, Mohammad J.
2015-04-01
Development of reliable drought monitoring, prediction and recovery assessment tools is fundamental to water resources management. This presentation focuses on how gravimetry information can improve drought assessment. First, we provide an overview of the Global Integrated Drought Monitoring and Prediction System (GIDMaPS) which offers near real-time drought information using remote sensing observations and model simulations. Then, we present a framework for integration of satellite gravimetry information for improving drought prediction and recovery assessment. The input data include satellite-based and model-based precipitation, soil moisture estimates and equivalent water height. Previous studies show that drought assessment based on one single indicator may not be sufficient. For this reason, GIDMaPS provides drought information based on multiple drought indicators including the Standardized Precipitation Index (SPI), the Standardized Soil Moisture Index (SSI) and the Multivariate Standardized Drought Index (MSDI), which combines SPI and SSI probabilistically. MSDI incorporates the meteorological and agricultural drought conditions and provides composite multi-index drought information for overall characterization of droughts. GIDMaPS includes a seasonal prediction component based on a statistical persistence-based approach. The prediction component of GIDMaPS provides the empirical probability of drought for different severity levels. In this presentation we present a new component in which the drought prediction information based on SPI, SSI and MSDI is conditioned on equivalent water height obtained from the Gravity Recovery and Climate Experiment (GRACE). Using a Bayesian approach, GRACE information is used to evaluate persistence of drought. Finally, the deficit equivalent water height based on GRACE is used for assessing drought recovery.
In this presentation, both monitoring and prediction components of GIDMaPS will be discussed, and the results from the 2014 California drought will be presented. Further Reading: Hao Z., AghaKouchak A., Nakhjiri N., Farahmand A., 2014, Global Integrated Drought Monitoring and Prediction System, Scientific Data, 1:140001, 1-10, doi: 10.1038/sdata.2014.1.
Reliability modelling and analysis of a multi-state element based on a dynamic Bayesian network
Xu, Tingxue; Gu, Junyuan; Dong, Qi; Fu, Linyu
2018-01-01
This paper presents a quantitative reliability modelling and analysis method for multi-state elements based on a combination of the Markov process and a dynamic Bayesian network (DBN), taking perfect repair, imperfect repair and condition-based maintenance (CBM) into consideration. The Markov models of elements without repair and under CBM are established, and an absorbing set is introduced to determine the reliability of the repairable element. According to the state-transition relations between the states determined by the Markov process, a DBN model is built. In addition, its parameters for series and parallel systems, namely, conditional probability tables, can be calculated by referring to the conditional degradation probabilities. Finally, the power of a control unit in a failure model is used as an example. A dynamic fault tree (DFT) is translated into a Bayesian network model, and subsequently extended to a DBN. The results show the state probabilities of an element and the system without repair, with perfect and imperfect repair, and under CBM, with an absorbing set plotted by differential equations and verified. Through referring forward, the reliability value of the control unit is determined in different kinds of modes. Finally, weak nodes are noted in the control unit. PMID:29765629
NASA Astrophysics Data System (ADS)
Stamenkovic, Dragan D.; Popovic, Vladimir M.
2015-02-01
Warranty is a powerful marketing tool, but it always involves additional costs to the manufacturer. In order to reduce these costs and make use of warranty's marketing potential, the manufacturer needs to master the techniques for warranty cost prediction according to the reliability characteristics of the product. In this paper a combined free replacement and pro rata warranty policy is analysed as a warranty model for one type of light bulb. Since operating conditions have a great impact on product reliability, they need to be considered in such an analysis. A neural network model is used to predict light bulb reliability characteristics based on the data from the tests of light bulbs in various operating conditions. Compared with a linear regression model used in the literature for similar tasks, the neural network model proved to be a more accurate method for such prediction. Reliability parameters obtained in this way are later used in Monte Carlo simulation for the prediction of times to failure needed for warranty cost calculation. The results of the analysis make it possible for the manufacturer to choose the optimal warranty policy based on expected product operating conditions. In this way, the manufacturer can lower costs and increase profit.
Adapting forest health assessments to changing perspectives on threats--a case example from Sweden.
Wulff, Sören; Lindelöw, Åke; Lundin, Lars; Hansson, Per; Axelsson, Anna-Lena; Barklund, Pia; Wijk, Sture; Ståhl, Göran
2012-04-01
A revised Swedish forest health assessment system is presented. The assessment system is composed of several interacting components which target information needs for strategic and operational decision making and accommodate a continuously expanding knowledge base. The main motivation for separating information for strategic and operational decision making is that major damage outbreaks are often scattered throughout the landscape. Generally, large-scale inventories (such as national forest inventories) cannot provide adequate information for mitigation measures. In addition to broad monitoring programs that provide time-series information on known damaging agents and their effects, there is also a need for local and regional inventories adapted to specific damage events. While information for decision making is the major focus of the health assessment system, the system also contributes to expanding the knowledge base of forest conditions. For example, the integrated monitoring programs provide a better understanding of ecological processes linked to forest health. The new health assessment system should be able to respond to the need for quick and reliable information and thus will be an important part of the future monitoring of Swedish forests.
Automated Portable Test System (APTS) - A performance envelope assessment tool
NASA Technical Reports Server (NTRS)
Kennedy, R. S.; Dunlap, W. P.; Jones, M. B.; Wilkes, R. L.; Bittner, A. C., Jr.
1985-01-01
The reliability and stability of microcomputer-based psychological tests are evaluated. The hardware, test programs, and system control of the Automated Portable Test System, which assesses human performance and subjective status, are described. Subjects were administered 11 pen-and-pencil and microcomputer-based tests for 10 sessions. The data reveal that nine of the 10 tests stabilized by the third administration; inertial correlations were high and consistent. It is noted that the microcomputer-based tests display good psychometric properties in terms of differential stability and reliability.
Staggs, Vincent S; Cramer, Emily
2016-08-01
Hospital performance reports often include rankings of unit pressure ulcer rates. Differentiating among units on the basis of quality requires reliable measurement. Our objectives were to describe and apply methods for assessing reliability of hospital-acquired pressure ulcer rates and evaluate a standard signal-noise reliability measure as an indicator of precision of differentiation among units. Quarterly pressure ulcer data from 8,199 critical care, step-down, medical, surgical, and medical-surgical nursing units from 1,299 US hospitals were analyzed. Using beta-binomial models, we estimated between-unit variability (signal) and within-unit variability (noise) in annual unit pressure ulcer rates. Signal-noise reliability was computed as the ratio of between-unit variability to the total of between- and within-unit variability. To assess precision of differentiation among units based on ranked pressure ulcer rates, we simulated data to estimate the probabilities of a unit's observed pressure ulcer rate rank in a given sample falling within five and ten percentiles of its true rank, and the probabilities of units with ulcer rates in the highest quartile and highest decile being identified as such. We assessed the signal-noise measure as an indicator of differentiation precision by computing its correlations with these probabilities. Pressure ulcer rates based on a single year of quarterly or weekly prevalence surveys were too susceptible to noise to allow for precise differentiation among units, and signal-noise reliability was a poor indicator of precision of differentiation. To ensure precise differentiation on the basis of true differences, alternative methods of assessing reliability should be applied to measures purported to differentiate among providers or units based on quality. © 2016 The Authors. Research in Nursing & Health published by Wiley Periodicals, Inc.
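The signal-noise reliability described in this abstract is the ratio of between-unit variability to the sum of between- and within-unit variability. A minimal Python sketch of that ratio follows; the function name and the example variances are hypothetical, and the sketch does not reproduce the beta-binomial estimation the authors used to obtain the two variance components.

```python
def signal_noise_reliability(between_var: float, within_var: float) -> float:
    """Return between-unit variance (signal) divided by total variance.

    Both inputs are assumed to be already-estimated variance components;
    how they are estimated (e.g. from a beta-binomial model) is out of scope.
    """
    if between_var < 0 or within_var < 0:
        raise ValueError("variance components must be non-negative")
    total = between_var + within_var
    if total == 0:
        raise ValueError("total variance must be positive")
    return between_var / total


if __name__ == "__main__":
    # Hypothetical variance components, for illustration only.
    print(signal_noise_reliability(between_var=1.0, within_var=3.0))
```

A value near 1 means most of the observed variation reflects true differences between units; the abstract's caution is that even so, this ratio was a poor indicator of how precisely units can be ranked.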
Berger, Aaron J; Momeni, Arash; Ladd, Amy L
2014-04-01
Trapeziometacarpal, or thumb carpometacarpal (CMC), arthritis is a common problem with a variety of treatment options. Although widely used, the Eaton radiographic staging system for CMC arthritis is of questionable clinical utility, as disease severity does not predictably correlate with symptoms or treatment recommendations. A possible reason for this is that the classification itself may not be reliable, but the literature on this has not, to our knowledge, been systematically reviewed. We therefore performed a systematic review to determine the intra- and interobserver reliability of the Eaton staging system. We systematically reviewed English-language studies published between 1973 and 2013 to assess the degree of intra- and interobserver reliability of the Eaton classification for determining the stage of trapeziometacarpal joint arthritis and pantrapezial arthritis based on plain radiographic imaging. Search engines included: PubMed, Scopus(®), and CINAHL. Four studies, which included a total of 163 patients, met our inclusion criteria and were evaluated. The level of evidence of the studies included in this analysis was determined using the Oxford Centre for Evidence Based Medicine Levels of Evidence Classification by two independent observers. A limited number of studies have been performed to assess intra- and interobserver reliability of the Eaton classification system. The four studies included were determined to be Level 3b. These studies collectively indicate that the Eaton classification demonstrates poor to fair interobserver reliability (kappa values: 0.11-0.56) and fair to moderate intraobserver reliability (kappa values: 0.54-0.657). Review of the literature demonstrates that radiographs assist in the assessment of CMC joint disease, but there is not a reliable system for classification of disease severity. 
Currently, diagnosis and treatment of thumb CMC arthritis are based on the surgeon's qualitative assessment combining history, physical examination, and radiographic evaluation. Inconsistent agreement using the current common radiographic classification system suggests a need for better radiographic tools to quantify disease severity.
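The kappa values quoted in this abstract measure agreement beyond chance. As a rough illustration, the unweighted two-rater Cohen's kappa, (p_o - p_e) / (1 - p_e), can be sketched in Python as below. The function name and the example stage labels are hypothetical, and the reviewed studies may have used weighted or multi-rater variants instead.

```python
from collections import Counter


def cohens_kappa(rater_a, rater_b):
    """Unweighted Cohen's kappa for two raters labelling the same items."""
    if len(rater_a) != len(rater_b) or len(rater_a) == 0:
        raise ValueError("ratings must be non-empty and of equal length")
    n = len(rater_a)
    # Observed proportion of items on which the raters agree.
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Agreement expected by chance from each rater's marginal frequencies.
    counts_a = Counter(rater_a)
    counts_b = Counter(rater_b)
    expected = sum(counts_a[k] * counts_b[k] for k in counts_a) / (n * n)
    if expected == 1.0:
        raise ValueError("kappa is undefined when chance agreement is 1")
    return (observed - expected) / (1.0 - expected)


if __name__ == "__main__":
    # Hypothetical stage assignments by two observers, for illustration only.
    print(cohens_kappa([1, 1, 2, 2], [1, 2, 2, 2]))
```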
Measures of Self-Care Independence for Children with Osteochondrodysplasia: A Clinimetric Review
ERIC Educational Resources Information Center
Ireland, Penelope; Johnston, Leanne M.
2012-01-01
This systematic review evaluates the validity, reliability, and clinical utility of outcome measures used to assess self-care skills among children with congenital musculoskeletal conditions and assesses the applicability of these measures for children with osteochondrodysplasia aged 0-12 years. Electronic databases were searched to identify…
Grimm, Annegret; Gruber, Bernd; Henle, Klaus
2014-01-01
Reliable estimates of population size are fundamental in many ecological studies and biodiversity conservation. Selecting appropriate methods to estimate abundance is often very difficult, especially if data are scarce. Most studies concerning the reliability of different estimators used simulation data based on assumptions about capture variability that do not necessarily reflect conditions in natural populations. Here, we used data from an intensively studied closed population of the arboreal gecko Gehyra variegata to construct reference population sizes for assessing twelve different population size estimators in terms of bias, precision, accuracy, and their 95%-confidence intervals. Two of the reference populations reflect natural biological entities, whereas the other reference populations reflect artificial subsets of the population. Since individual heterogeneity was assumed, we tested modifications of the Lincoln-Petersen estimator, a set of models in programs MARK and CARE-2, and a truncated geometric distribution. Ranking of methods was similar across criteria. Models accounting for individual heterogeneity performed best in all assessment criteria. For populations from heterogeneous habitats without obvious covariates explaining individual heterogeneity, we recommend using the moment estimator or the interpolated jackknife estimator (both implemented in CAPTURE/MARK). If data for capture frequencies are substantial, we recommend the sample coverage or the estimating equation (both models implemented in CARE-2). Depending on the distribution of catchabilities, our proposed multiple Lincoln-Petersen and a truncated geometric distribution obtained comparably good results. The former usually resulted in a minimum population size and the latter can be recommended when there is a long tail of low capture probabilities. Models with covariates and mixture models performed poorly. 
Our approach identified suitable methods and extended options to evaluate the performance of mark-recapture population size estimators under field conditions, which is essential for selecting an appropriate method and obtaining reliable results in ecology and conservation biology, and thus for sound management. PMID:24896260
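For orientation, the classical two-sample Lincoln-Petersen estimator that the abstract's modifications build on can be sketched in Python as follows, together with Chapman's bias-corrected form. Both are standard textbook formulas and are not the authors' proposed multiple Lincoln-Petersen estimator; the function names and example counts are hypothetical.

```python
def lincoln_petersen(n1: int, n2: int, m2: int) -> float:
    """Classical estimate N = n1 * n2 / m2.

    n1: animals marked in the first session
    n2: animals caught in the second session
    m2: marked animals among the second-session catch
    """
    if m2 <= 0:
        raise ValueError("at least one recapture is required")
    return n1 * n2 / m2


def chapman(n1: int, n2: int, m2: int) -> float:
    """Chapman's bias-corrected form; defined even when m2 == 0."""
    return (n1 + 1) * (n2 + 1) / (m2 + 1) - 1


if __name__ == "__main__":
    # Hypothetical capture counts, for illustration only.
    print(lincoln_petersen(50, 40, 10))
    print(chapman(50, 40, 10))
```

Both forms assume a closed population and equal catchability, which is exactly the assumption that individual heterogeneity violates and that the heterogeneity models in the abstract relax.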
Weyers, Simone; Jemi, Iman; Karger, André; Raski, Bianca; Rotthoff, Thomas; Pentzek, Michael; Mortsiefer, Achim
2016-01-01
Background: Imparting communication skills has been given great importance in medical curricula. In addition to standardized assessments, students should communicate with real patients in actual clinical situations during workplace-based assessments and receive structured feedback on their performance. The aim of this project was to pilot a formative testing method for workplace-based assessment. Our investigation centered in particular on whether or not physicians view the method as feasible and how high acceptance is among students. In addition, we assessed the reliability of the method. Method: As part of the project, 16 students held two consultations each with chronically ill patients at the medical practice where they were completing GP training. These consultations were video-recorded. The trained mentoring physician rated the student’s performance and provided feedback immediately following the consultations using the Berlin Global Rating scale (BGR). Two impartial, trained raters also evaluated the videos using BGR. For qualitative and quantitative analysis, information on how physicians and students viewed feasibility and their levels of acceptance was collected in written form in a partially standardized manner. To test for reliability, the test-retest reliability was calculated for both of the overall evaluations given by each rater. The inter-rater reliability was determined for the three evaluations of each individual consultation. Results: The formative assessment method was rated positively by both physicians and students. It is relatively easy to integrate into daily routines. Its significant value lies in the personal, structured and recurring feedback. The two overall scores for each patient consultation given by the two impartial raters correlate moderately. The degree of uniformity among the three raters in respect to the individual consultations is low. 
Discussion: Within the scope of this pilot project, only a small sample of physicians and students could be surveyed to a limited extent. There are indications that the assessment can be improved by integrating more information on medical context and student self-assessments. Despite the current limitations regarding test criteria, it is clear that workplace-based assessment of communication skills in the clinical setting is a valuable addition to the communication curricula of medical schools. PMID:27990466
Full-Scale Crash Tests and Analyses of Three High-Wing Single
NASA Technical Reports Server (NTRS)
Annett, Martin S.; Littell, Justin D.; Stimson, Chad M.; Jackson, Karen E.; Mason, Brian H.
2015-01-01
The NASA Emergency Locator Transmitter Survivability and Reliability (ELTSAR) project was initiated in 2014 to assess the crash performance standards for the next generation of ELT systems. Three Cessna 172 aircraft have been acquired to conduct crash testing at NASA Langley Research Center's Landing and Impact Research Facility. Testing is scheduled for the summer of 2015 and will simulate three crash conditions: a flare to stall during an emergency landing, and two controlled-flight-into-terrain scenarios. Instrumentation and video coverage, both onboard and external, will also provide valuable data on airframe response. Full-scale finite element analyses will be performed using two separate commercial explicit solvers. Calibration and validation of the models will be based on the airframe response under these varying crash conditions.
Paleophysiology: From Fossils to the Future.
Vermeij, Geerat J
2015-10-01
Future environments may resemble conditions that have not existed for millions of years. To assess the adaptive options available to organisms evolving under such circumstances, it is instructive to probe paleophysiology, the ways in which ancient life coped with its physical and chemical surroundings. To do this, we need reliable proxies that are based on fundamental principles, quantitatively verified in living species, and observable in fossil remains. Insights have already come from vertebrates and plants, and others will likely emerge for marine animals if promising proxies are validated. Many questions remain about the circumstances for the evolution of environmental tolerances, metabolic rates, biomineralization, and physiological responses to interacting species, and about how living organisms will perform under exceptional conditions. Copyright © 2015 Elsevier Ltd. All rights reserved.
2013-10-21
depend on the quality of allocating resources. This work uses a reliability model of system and environmental covariates incorporating information at...state space. Further, the use of condition variables allows for the direct modeling of maintenance impact with the assumption that a nominal value ... value ), the model in the application of aviation maintenance can provide a useful estimation of reliability at multiple levels. Adjusted survival
NASA Technical Reports Server (NTRS)
Seneca, V. I.; Mlynarczyk, R. H.
1974-01-01
Tables of data are provided to show the availability of Skylab data to selected ground stations during the phases of Skylab preflight, Skylab unmanned condition, and Skylab manned condition. The mean time between failure (MTBF) of the same Skylab functions is tabulated for the selected ground stations. All reliability data are based on a 90 percent confidence interval.
Structural reliability analysis under evidence theory using the active learning kriging model
NASA Astrophysics Data System (ADS)
Yang, Xufeng; Liu, Yongshou; Ma, Panke
2017-11-01
Structural reliability analysis under evidence theory is investigated. It is rigorously proved that a surrogate model providing only correct sign prediction of the performance function can meet the accuracy requirement of evidence-theory-based reliability analysis. Accordingly, a method based on the active learning kriging model which only correctly predicts the sign of the performance function is proposed. Interval Monte Carlo simulation and a modified optimization method based on Karush-Kuhn-Tucker conditions are introduced to make the method more efficient in estimating the bounds of failure probability based on the kriging model. Four examples are investigated to demonstrate the efficiency and accuracy of the proposed method.
Scaled CMOS Technology Reliability Users Guide
NASA Technical Reports Server (NTRS)
White, Mark
2010-01-01
The desire to assess the reliability of emerging scaled microelectronics technologies through faster reliability trials and more accurate acceleration models is the precursor for further research and experimentation in this relevant field. The effect of semiconductor scaling on microelectronics product reliability is an important aspect to the high reliability application user. From the perspective of a customer or user, who in many cases must deal with very limited, if any, manufacturer's reliability data to assess the product for a highly-reliable application, product-level testing is critical in the characterization and reliability assessment of advanced nanometer semiconductor scaling effects on microelectronics reliability. A methodology on how to accomplish this and techniques for deriving the expected product-level reliability on commercial memory products are provided. Competing mechanism theory and the multiple failure mechanism model are applied to the experimental results of scaled SDRAM products. Accelerated stress testing at multiple conditions is applied at the product level of several scaled memory products to assess the performance degradation and product reliability. Acceleration models are derived for each case. For several scaled SDRAM products, retention time degradation is studied and two distinct soft error populations are observed with each technology generation: early breakdown, characterized by randomly distributed weak bits with Weibull slope (beta) = 1, and a main population breakdown with an increasing failure rate. Retention time soft error rates are calculated and a multiple failure mechanism acceleration model with parameters is derived for each technology. Defect densities are calculated and reflect a decreasing trend in the percentage of random defective bits for each successive product generation.
A normalized soft error failure rate of the memory data retention time in FIT/Gb and FIT/cm2 for several scaled SDRAM generations is presented revealing a power relationship. General models describing the soft error rates across scaled product generations are presented. The analysis methodology may be applied to other scaled microelectronic products and their key parameters.
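The abstract distinguishes a population with Weibull slope beta = 1 from one with an increasing failure rate. A brief Python sketch of the two-parameter Weibull hazard function, h(t) = (beta / eta) * (t / eta)^(beta - 1), shows why: at beta = 1 the hazard is constant, and for beta > 1 it grows with time. The function name, the scale parameter eta, and the example values are assumptions for illustration and are not taken from the report.

```python
def weibull_hazard(t: float, beta: float, eta: float) -> float:
    """Instantaneous failure rate of a two-parameter Weibull distribution.

    beta is the shape (slope) parameter and eta the scale parameter.
    """
    if t <= 0 or beta <= 0 or eta <= 0:
        raise ValueError("t, beta and eta must be positive")
    return (beta / eta) * (t / eta) ** (beta - 1)


if __name__ == "__main__":
    # Hypothetical scale parameter, for illustration only.
    eta = 100.0
    for t in (10.0, 50.0, 200.0):
        constant = weibull_hazard(t, beta=1.0, eta=eta)  # same at every t
        wearout = weibull_hazard(t, beta=2.0, eta=eta)   # grows with t
        print(t, constant, wearout)
```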
Lyon, Aaron R; Pullmann, Michael D; Dorsey, Shannon; Martin, Prerna; Grigore, Alexandra A; Becker, Emily M; Jensen-Doss, Amanda
2018-05-11
Measurement-based care (MBC) is an increasingly popular, evidence-based practice, but there are no tools with established psychometrics to evaluate clinician use of MBC practices in mental health service delivery. The current study evaluated the reliability, validity, and factor structure of scores generated from a brief, standardized tool to measure MBC practices, the Current Assessment Practice Evaluation-Revised (CAPER). Survey data from a national sample of 479 mental health clinicians were used to conduct exploratory and confirmatory factor analyses, as well as reliability and validity analyses (e.g., relationships between CAPER subscales and clinician MBC attitudes). Analyses revealed competing two- and three-factor models. Regardless of the model used, scores from CAPER subscales demonstrated good reliability and convergent and divergent validity with MBC attitudes in the expected directions. The CAPER appears to be a psychometrically sound tool for assessing clinician MBC practices. Future directions for development and application of the tool are discussed.
[The willingness to pay for new drugs is based on ethical principles].
Liliemark, Jan; Lööf, Lars; Befrits, Gustaf; Back, Stefan; Sandman, Lars
2016-10-18
The County Council's board for new therapies (the NT Council) provides recommendations on the use of new drugs based on the ethical platform of priorities established by the Swedish parliament. The Council has formulated a policy that interprets the parliamentary ethical platform and operationalizes its need and solidarity principle and its cost-effectiveness principle in four dimensions. The NT Council weighs the health economic evaluation of the drug and the four dimensions (the severity of the condition, the rarity of the condition, the effect size and the data reliability) to determine the willingness-to-pay level and whether the platform allows a recommendation for use of the drug. The severity of the condition has a greater impact than the other dimensions. In the assessment of severity there is also a trade-off between prevention and treatment of manifest diseases, and in prevention, the size of the risk of falling ill is of importance. A slightly higher willingness-to-pay level is reasonable for treatment of very rare conditions, but it is important that identified patients are not given priority over anonymous patient groups with equally strong needs.
Development and evaluation of consensus-based sediment quality guidelines for freshwater ecosystems
MacDonald, D.D.; Ingersoll, C.G.; Berger, T.A.
2000-01-01
Numerical sediment quality guidelines (SQGs) for freshwater ecosystems have previously been developed using a variety of approaches. Each approach has certain advantages and limitations which influence its application in the sediment quality assessment process. In an effort to focus on the agreement among these various published SQGs, consensus-based SQGs were developed for 28 chemicals of concern in freshwater sediments (i.e., metals, polycyclic aromatic hydrocarbons, polychlorinated biphenyls, and pesticides). For each contaminant of concern, two SQGs were developed from the published SQGs, including a threshold effect concentration (TEC) and a probable effect concentration (PEC). The resultant SQGs for each chemical were evaluated for reliability using matching sediment chemistry and toxicity data from field studies conducted throughout the United States. The results of this evaluation indicated that most of the TECs (i.e., 21 of 28) provide an accurate basis for predicting the absence of sediment toxicity. Similarly, most of the PECs (i.e., 16 of 28) provide an accurate basis for predicting sediment toxicity. Mean PEC quotients were calculated to evaluate the combined effects of multiple contaminants in sediment. Results of the evaluation indicate that the incidence of toxicity is highly correlated to the mean PEC quotient (R² = 0.98 for 347 samples). It was concluded that the consensus-based SQGs provide a reliable basis for assessing sediment quality conditions in freshwater ecosystems.
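As an illustration of the mean PEC quotient mentioned in the abstract above, the following is a minimal sketch, assuming the quotient is the simple average of each measured concentration divided by its PEC. The function name, the chemical labels and the numeric values are hypothetical, and the abstract does not specify details such as how chemical classes are grouped, so the published procedure may differ.

```python
from typing import Dict


def mean_pec_quotient(concentrations: Dict[str, float],
                      pecs: Dict[str, float]) -> float:
    """Average of concentration / PEC over chemicals that have a PEC.

    Assumption: a plain arithmetic mean over all shared chemicals.
    """
    shared = [chem for chem in concentrations if chem in pecs]
    if not shared:
        raise ValueError("no chemical in the sample has a PEC")
    return sum(concentrations[c] / pecs[c] for c in shared) / len(shared)


# Hypothetical sample and hypothetical PEC values (same units per chemical).
sample = {"chem_a": 50.0, "chem_b": 10.0}
pec_values = {"chem_a": 100.0, "chem_b": 5.0}

# Quotients are 0.5 and 2.0, so the mean is 1.25.
print(mean_pec_quotient(sample, pec_values))
```

A single summary value of this kind is what allows many contaminants to be related to one toxicity incidence figure per sample.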
Rahman, Mohd Nasrull Abdol; Mohamad, Siti Shafika
2017-01-01
Computer work is associated with musculoskeletal disorders (MSDs). Several methods have been developed to assess computer work risk factors related to MSDs. This review aims to give an overview of current pen-and-paper-based observational methods for assessing ergonomic risk factors of computer work. We searched an electronic database for materials from 1992 until 2015. The selected methods focused on computer work, pen-and-paper observational methods, office risk factors and musculoskeletal disorders. This review was developed to assess the risk factors, reliability and validity of pen-and-paper observational methods associated with computer work. Two evaluators independently carried out this review. Seven observational methods used to assess exposure to office risk factors for work-related musculoskeletal disorders were identified. The risk factors covered by current pen-and-paper-based observational tools were postures, office components, force and repetition. Of the seven methods, only five had been tested for reliability. They were proven to be reliable and were rated as moderate to good. For validity, only four of the seven methods were tested, and the results were moderate. Many observational tools already exist, but no single tool appears to cover all of the risk factors, including working posture, office component, force, repetition and office environment, at office workstations and in computer work. Although the most important factor in developing a tool is proper validation of exposure assessment techniques, not all of the existing observational methods were tested for reliability and validity. Furthermore, this review could provide researchers with ways to improve pen-and-paper-based observational methods for assessing ergonomic risk factors of computer work.
Intraday and Interday Reliability of Ultra-Short-Term Heart Rate Variability in Rugby Union Players.
Nakamura, Fábio Y; Pereira, Lucas A; Esco, Michael R; Flatt, Andrew A; Moraes, José E; Cal Abad, Cesar C; Loturco, Irineu
2017-02-01
Nakamura, FY, Pereira, LA, Esco, MR, Flatt, AA, Moraes, JE, Cal Abad, CC, and Loturco, I. Intraday and interday reliability of ultra-short-term heart rate variability in rugby union players. J Strength Cond Res 31(2): 548-551, 2017. The aim of this study was to examine the intraday and interday reliability of ultra-short-term vagal-related heart rate variability (HRV) in elite rugby union players. Forty players from the Brazilian National Rugby Team volunteered to participate in this study. The natural log of the root mean square of successive RR interval differences (lnRMSSD) assessments were performed on 4 different days. The HRV was assessed twice (intraday reliability) on the first day and once per day on the following 3 days (interday reliability). The RR interval recordings were obtained from 2-minute recordings using a portable heart rate monitor. The relative reliability of intraday and interday lnRMSSD measures was analyzed using the intraclass correlation coefficient (ICC). The typical error of measurement (absolute reliability) of intraday and interday lnRMSSD assessments was analyzed using the coefficient of variation (CV). Both intraday (ICC = 0.96; CV = 3.99%) and interday (ICC = 0.90; CV = 7.65%) measures were highly reliable. The ultra-short-term lnRMSSD is a consistent measure for evaluating elite rugby union players, in both intraday and interday settings. This study provides further validity to using this shortened method in practical field conditions with highly trained team sports athletes.
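The lnRMSSD quantity named in the abstract above follows directly from its definition (natural log of the root mean square of successive RR interval differences). The sketch below is illustrative only: the function names and RR values are hypothetical, and the coefficient of variation is computed here as the sample standard deviation divided by the mean, which is an assumption and may not match how the study derived its typical error.

```python
import math
from statistics import mean, stdev


def ln_rmssd(rr_ms):
    """Natural log of the RMS of successive RR interval differences (ms)."""
    diffs = [later - earlier for earlier, later in zip(rr_ms, rr_ms[1:])]
    if not diffs:
        raise ValueError("need at least two RR intervals")
    rmssd = math.sqrt(sum(d * d for d in diffs) / len(diffs))
    return math.log(rmssd)  # raises ValueError if all intervals are identical


def coefficient_of_variation(values):
    """Sample SD as a percentage of the mean (assumed CV definition)."""
    return 100.0 * stdev(values) / mean(values)


# Hypothetical RR intervals in milliseconds, not data from the study.
rr_intervals = [800, 810, 790, 805]
print(ln_rmssd(rr_intervals))

# Hypothetical repeated lnRMSSD readings for one player.
print(coefficient_of_variation([4.1, 4.3, 4.0, 4.2]))
```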
NASA Astrophysics Data System (ADS)
Kostyukov, V. N.; Naumenko, A. P.
2017-08-01
The paper addresses pressing issues in evaluating the impact of actions by operators of complex technological systems on safe operation, considering the application of condition monitoring systems for elements and sub-systems of petrochemical production facilities. The main task of the research is to identify factors and criteria for describing monitoring system properties that would allow evaluation of the impact of personnel errors on the operation of real-time condition monitoring and diagnostic systems for machinery of petrochemical facilities, and to find an objective criterion for the monitoring system class that takes the human factor into account. On the basis of the real-time condition monitoring concepts of sudden failure skipping risk, static and dynamic error, and monitoring systems, one may evaluate the impact that personnel qualification has on monitoring system operation in terms of errors in the actions of personnel or operators while receiving information from monitoring systems and operating a technological system. The operator is considered part of the technological system. Personnel behavior is usually a combination of the following parameters: input signal (information perception), reaction (decision making), and response (decision implementation). Based on several studies of the behavior of nuclear power station operators in the USA, Italy and other countries, as well as on research conducted by Russian scientists, the required data on operator reliability were selected for analysis of operator behavior with diagnostics and monitoring systems at technological facilities. 
The calculations revealed that, for the monitoring system selected as an example, the failure skipping risk for the set values of static (less than 0.01) and dynamic (less than 0.001) errors, considering all related factors of data on reliability of information perception, decision-making, and reaction fulfilled, is 0.037; when all the facilities and the error probability are under control, it is not more than 0.027. When only pump and compressor units are under control, the failure skipping risk is not more than 0.022, when the probability of error in the operator's action is not more than 0.011. The results show that an assessment of operator reliability can be made for almost any kind of production, but considering only technological capabilities, since operators' psychological and general training varies considerably between production industries. Using the latest technologies of engineering psychology and the design of data support systems, situation assessment systems, and decision-making and response systems, as well as achievements in condition monitoring in various production industries, one can evaluate the probability of hazardous condition skipping risk considering static and dynamic errors and the human factor.
A Protocol for Advanced Psychometric Assessment of Surveys
Squires, Janet E.; Hayduk, Leslie; Hutchinson, Alison M.; Cranley, Lisa A.; Gierl, Mark; Cummings, Greta G.; Norton, Peter G.; Estabrooks, Carole A.
2013-01-01
Background and Purpose. In this paper, we present a protocol for advanced psychometric assessments of surveys based on the Standards for Educational and Psychological Testing. We use the Alberta Context Tool (ACT) as an exemplar survey to which this protocol can be applied. Methods. Data mapping, acceptability, reliability, and validity are addressed. Acceptability is assessed with missing data frequencies and the time required to complete the survey. Reliability is assessed with internal consistency coefficients and information functions. A unitary approach to validity consisting of accumulating evidence based on instrument content, response processes, internal structure, and relations to other variables is taken. We also address assessing performance of survey data when aggregated to higher levels (e.g., nursing unit). Discussion. In this paper we present a protocol for advanced psychometric assessment of survey data using the Alberta Context Tool (ACT) as an exemplar survey; application of the protocol to the ACT survey is underway. Psychometric assessment of any survey is essential to obtaining reliable and valid research findings. This protocol can be adapted for use with any nursing survey. PMID:23401759
Nimon, Kim; Zientek, Linda Reichwein; Henson, Robin K.
2012-01-01
The purpose of this article is to help researchers avoid common pitfalls associated with reliability including incorrectly assuming that (a) measurement error always attenuates observed score correlations, (b) different sources of measurement error originate from the same source, and (c) reliability is a function of instrumentation. To accomplish our purpose, we first describe what reliability is and why researchers should care about it with focus on its impact on effect sizes. Second, we review how reliability is assessed with comment on the consequences of cumulative measurement error. Third, we consider how researchers can use reliability generalization as a prescriptive method when designing their research studies to form hypotheses about whether or not reliability estimates will be acceptable given their sample and testing conditions. Finally, we discuss options that researchers may consider when faced with analyzing unreliable data. PMID:22518107
NASA Technical Reports Server (NTRS)
Kramer, Arthur F.; Sirevaag, Erik J.; Braune, Rolf
1986-01-01
This study explores the relationship between the P300 component of the event-related brain potential (ERP) and the processing demands of a complex real-world task. Seven male volunteers enrolled in an Instrument Flight Rule (IFR) aviation course flew a series of missions in a single engine fixed-based simulator. In dual task conditions subjects were also required to discriminate between two tones differing in frequency. ERPs time-locked to the tones, subjective effort ratings and overt performance measures were collected during two 45 min flights differing in difficulty (manipulated by varying both atmospheric conditions and instrument reliability). The more difficult flight was associated with poorer performance, increased subjective effort ratings, and smaller secondary task P300s. Within each flight, P300 amplitude was negatively correlated with deviations from command headings indicating that P300 amplitude was a sensitive workload metric both between and within the flight missions.
Development of ecological indicator guilds for land management
Krzysik, A.J.; Balbach, H.E.; Duda, J.J.; Emlen, J.M.; Freeman, D.C.; Graham, J.H.; Kovacic, D.A.; Smith, L.M.; Zak, J.C.
2005-01-01
Agency land-use must be efficiently and cost-effectively monitored to assess conditions and trends in ecosystem processes and natural resources relevant to mission requirements and legal mandates. Ecological Indicators represent important land management tools for tracking ecological changes and preventing irreversible environmental damage in disturbed landscapes. The overall objective of the research was to develop both individual and integrated sets (i.e., statistically derived guilds) of Ecological Indicators to: quantify habitat conditions and trends, track and monitor ecological changes, provide early warning or threshold detection, and provide guidance for land managers. The derivation of Ecological Indicators was based on statistical criteria, ecosystem relevance, reliability and robustness, economy and ease of use for land managers, multi-scale performance, and stress response criteria. The basis for the development of statistically based Ecological Indicators was the identification of ecosystem metrics that analytically tracked a landscape disturbance gradient.
Vibration Sensor Monitoring of Nickel-Titanium Alloy Turning for Machinability Evaluation.
Segreto, Tiziana; Caggiano, Alessandra; Karam, Sara; Teti, Roberto
2017-12-12
Nickel-Titanium (Ni-Ti) alloys are very difficult-to-machine materials causing notable manufacturing problems due to their unique mechanical properties, including superelasticity, high ductility, and severe strain-hardening. In this framework, the aim of this paper is to assess the machinability of Ni-Ti alloys with reference to turning processes in order to realize a reliable and robust in-process identification of machinability conditions. An on-line sensor monitoring procedure based on the acquisition of vibration signals was implemented during the experimental turning tests. The detected vibration sensorial data were processed through an advanced signal processing method in time-frequency domain based on wavelet packet transform (WPT). The extracted sensorial features were used to construct WPT pattern feature vectors to send as input to suitably configured neural networks (NNs) for cognitive pattern recognition in order to evaluate the correlation between input sensorial information and output machinability conditions.
Vibration Sensor Monitoring of Nickel-Titanium Alloy Turning for Machinability Evaluation
Segreto, Tiziana; Karam, Sara; Teti, Roberto
2017-01-01
Nickel-Titanium (Ni-Ti) alloys are very difficult-to-machine materials causing notable manufacturing problems due to their unique mechanical properties, including superelasticity, high ductility, and severe strain-hardening. In this framework, the aim of this paper is to assess the machinability of Ni-Ti alloys with reference to turning processes in order to realize a reliable and robust in-process identification of machinability conditions. An on-line sensor monitoring procedure based on the acquisition of vibration signals was implemented during the experimental turning tests. The detected vibration sensorial data were processed through an advanced signal processing method in time-frequency domain based on wavelet packet transform (WPT). The extracted sensorial features were used to construct WPT pattern feature vectors to send as input to suitably configured neural networks (NNs) for cognitive pattern recognition in order to evaluate the correlation between input sensorial information and output machinability conditions. PMID:29231864
La Padula, Simone; Hersant, Barbara; SidAhmed, Mounia; Niddam, Jeremy; Meningaud, Jean Paul
2016-07-01
Most patients requesting aesthetic rejuvenation treatment expect to look healthier and younger. Some scales for ageing assessment have been proposed, but none is focused on patient age prediction. The aim of this study was to develop and validate a new facial rating scale assessing facial ageing sign severity. One thousand Caucasian patients were included and assessed. The Rasch model was used as part of the validation process. A score was attributed to each patient, based on the scales we developed. The correlation between the real age and scores obtained, the inter-rater reliability and test-retest reliability were analysed. The objective was to develop a tool enabling the assigning of a patient to a specific age range based on the calculated score. All scales exceeded criteria for acceptability, reliability and validity. The real age strongly correlated with the total facial score in both sex groups. The test-retest reliability confirmed this strong correlation. We developed a facial ageing scale which could be a useful tool to assess patients before and after rejuvenation treatment and an important new metric to be used in facial rejuvenation and regenerative clinical research. Copyright © 2016 European Association for Cranio-Maxillo-Facial Surgery. Published by Elsevier Ltd. All rights reserved.
Morilla-Herrera, J C; Morales-Asencio, J M; Fernández-Gallego, M C; Cobos, E Berrobianco; Romero, A Delgado
2011-01-01
Self-care and management of therapeutic regime (drug adherence, preventive behaviours and development of healthy life-styles) are key components for managing chronic diseases. Nursing has standardized languages which describe many of these situations, such as the diagnosis "Ineffective Self Health Management" (ISHM) or many of the Nursing Outcomes Classification (NOC) indicators. The aims of this study were to determine the interobserver reliability of a NOC-based instrument for assessment and aid in diagnosis of ISHM in patients with chronic conditions in Primary Health Care, to determine its diagnostic validity and to describe the prevalence of patients with this problem. Cross-sectional validation study developed in the provinces of Málaga, Cádiz and Almería from 2006 to 2009. Each patient was assessed by 3 independent observers: the first two observers scored the NOC indicators and the third one acted as the "gold standard". Two hundred and twenty-eight patients were included, 37.7% of them with more than one chronic condition. NOC indicators showed high interobserver reliability (ICC > 0.70) and internal consistency (Cronbach's alpha: 0.81). With a cut-point of 10.5, sensitivity was 61% and specificity 85%, and the area under the curve was 0.81 (95% CI: 0.77 to 0.85). The prevalence of patients with ISHM was 36% (95% CI: 34 to 40). The use of NOC indicators allows evaluation of management of the therapeutic regime in people with chronic conditions with a satisfactory validity and it provides new approaches for dealing with this problem.
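To make the cut-point figures in the abstract above concrete, here is a minimal sketch of how sensitivity and specificity follow from a score threshold compared against a gold standard. The function name, the scores and the labels are hypothetical. The abstract does not say whether scores below or above 10.5 indicate ISHM, so the direction is exposed as a parameter and its default is an assumption.

```python
def sensitivity_specificity(scores, has_condition, cut_point,
                            positive_if_below=True):
    """Return (sensitivity, specificity) for a score threshold.

    Assumption: by default a score below the cut-point counts as a
    positive screen; the source abstract does not state the direction.
    """
    tp = fp = tn = fn = 0
    for score, truth in zip(scores, has_condition):
        predicted = score < cut_point if positive_if_below else score > cut_point
        if predicted and truth:
            tp += 1
        elif predicted and not truth:
            fp += 1
        elif not predicted and truth:
            fn += 1
        else:
            tn += 1
    # Both classes must be present, otherwise a denominator is zero.
    return tp / (tp + fn), tn / (tn + fp)


# Hypothetical instrument scores and gold-standard diagnoses.
scores = [8, 12, 9, 11, 10, 13]
gold = [True, True, False, False, True, False]
print(sensitivity_specificity(scores, gold, cut_point=10.5))
```

Sweeping the cut-point over all observed scores and plotting sensitivity against one minus specificity gives the curve whose area the abstract reports.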
Ramadan, Ahmed; Cholewicki, Jacek; Radcliffe, Clark J; Popovich, John M; Reeves, N Peter; Choi, Jongeun
2017-11-07
This study evaluated the within- and between-visit reliability of a seated balance test for quantifying trunk motor control using input-output data. Thirty healthy subjects performed a seated balance test under three conditions: eyes open (EO), eyes closed (EC), and eyes closed with vibration to the lumbar muscles (VIB). Each subject performed three trials of each condition on three different visits. The seated balance test utilized a torque-controlled robotic seat, which together with a sitting subject resulted in a physical human-robot interaction (pHRI) (two degrees-of-freedom with upper and lower body rotations). Subjects balanced the pHRI by controlling trunk rotation in response to pseudorandom torque perturbations applied to the seat in the coronal plane. Performance error was expressed as the root mean square (RMSE) of deviations from the upright position in the time domain and as the mean bandpass signal energy (E_mb) in the frequency domain. Intra-class correlation coefficients (ICC) quantified the between-visit reliability of both RMSE and E_mb. The empirical transfer function estimates (ETFE) from the perturbation input to each of the two rotational outputs were calculated. Coefficients of multiple correlation (CMC) quantified the within- and between-visit reliability of the averaged ETFE. ICCs of RMSE and E_mb for all conditions were ≥0.84. The mean within- and between-visit CMCs were all ≥0.96 for the lower body rotation and ≥0.89 for the upper body rotation. Therefore, our seated balance test consisting of pHRI to assess coronal plane trunk motor control is reliable. Copyright © 2017 Elsevier Ltd. All rights reserved.
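The time-domain performance error described in the abstract above (root mean square of deviations from the upright position) can be sketched in a few lines. The function name, the sample angles and the choice of zero as the upright reference are assumptions for illustration; the frequency-domain energy measure is not reproduced here because the abstract does not define its passband.

```python
import math


def rms_deviation(angles, upright=0.0):
    """Root mean square of deviations from the upright reference angle."""
    if not angles:
        raise ValueError("need at least one sample")
    return math.sqrt(sum((a - upright) ** 2 for a in angles) / len(angles))


# Hypothetical trunk rotation samples in degrees, not data from the study.
trial = [1.0, -1.0, 1.0, -1.0]
print(rms_deviation(trial))  # each squared deviation is 1, so the result is 1.0
```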
Reliability evaluation of microgrid considering incentive-based demand response
NASA Astrophysics Data System (ADS)
Huang, Ting-Cheng; Zhang, Yong-Jun
2017-07-01
Incentive-based demand response (IBDR) can guide customers to adjust their electricity consumption behaviour and actively curtail load. Meanwhile, distributed generation (DG) and energy storage systems (ESS) can provide time for the implementation of IBDR. The paper focuses on the reliability evaluation of a microgrid considering IBDR. Firstly, the mechanism of IBDR and its impact on power supply reliability are analysed. Secondly, the IBDR dispatch model considering the customer's comprehensive assessment and the customer response model are developed. Thirdly, a reliability evaluation method considering IBDR, based on Monte Carlo simulation, is proposed. Finally, the validity of the above models and method is studied through numerical tests on the modified RBTS Bus6 test system. Simulation results demonstrate that IBDR can improve the reliability of the microgrid.
Training and Maintaining System-Wide Reliability in Outcome Management.
Barwick, Melanie A; Urajnik, Diana J; Moore, Julia E
2014-01-01
The Child and Adolescent Functional Assessment Scale (CAFAS) is widely used for outcome management, for providing real time client and program level data, and for the monitoring of evidence-based practices. Methods of reliability training and the assessment of rater drift are critical for service decision-making within organizations and systems of care. We assessed two approaches for CAFAS training: external technical assistance and internal technical assistance. To this end, we sampled 315 practitioners trained by the external technical assistance approach from 2,344 Ontario practitioners who had achieved reliability on the CAFAS. To assess the internal technical assistance approach as a reliable alternative training method, 140 practitioners trained internally were selected from the same pool of certified raters. Reliabilities were high for practitioners trained by both the external and the internal technical assistance approaches (.909-.995 and .915-.997, respectively). One- and 3-year estimates showed some drift on several scales. High and consistent reliabilities over time and training method have implications for CAFAS training of behavioral health care practitioners, and the maintenance of CAFAS as a global outcome management tool in systems of care.
Snow, Nicholas J; Peters, Sue; Borich, Michael R; Shirzad, Navid; Auriat, Angela M; Hayward, Kathryn S; Boyd, Lara A
2016-01-15
Diffusion-weighted magnetic resonance imaging (DW-MRI) is commonly used to assess white matter properties after stroke. Novel work is utilizing constrained spherical deconvolution (CSD) to estimate complex intra-voxel fiber architecture unaccounted for with tensor-based fiber tractography. However, the reliability of CSD-based tractography has not been established in people with chronic stroke. Establishing the reliability of CSD-based DW-MRI in chronic stroke. High-resolution DW-MRI was performed in ten adults with chronic stroke during two separate sessions. Deterministic region of interest-based fiber tractography using CSD was performed by two raters. Mean fractional anisotropy (FA), apparent diffusion coefficient (ADC), tract number, and tract volume were extracted from reconstructed fiber pathways in the corticospinal tract (CST) and superior longitudinal fasciculus (SLF). Callosal fiber pathways connecting the primary motor cortices were also evaluated. Inter-rater and test-retest reliability were determined by intra-class correlation coefficients (ICCs). ICCs revealed excellent reliability for FA and ADC in ipsilesional (0.86-1.00; p<0.05) and contralesional hemispheres (0.94-1.00; p<0.0001), for CST and SLF fibers; and excellent reliability for all metrics in callosal fibers (0.85-1.00; p<0.05). ICC ranged from poor to excellent for tract number and tract volume in ipsilesional (-0.11 to 0.92; p≤0.57) and contralesional hemispheres (-0.27 to 0.93; p≤0.64), for CST and SLF fibers. Like other select DW-MRI approaches, CSD-based tractography is a reliable approach to evaluate FA and ADC in major white matter pathways, in chronic stroke. Future work should address the reproducibility and utility of CSD-based metrics of tract number and tract volume. Copyright © 2015 Elsevier B.V. All rights reserved.
ERIC Educational Resources Information Center
Raymond, Mark R.; Clauser, Brian E.; Furman, Gail E.
2010-01-01
The use of standardized patients to assess communication skills is now an essential part of assessing a physician's readiness for practice. To improve the reliability of communication scores, it has become increasingly common in recent years to use statistical models to adjust ratings provided by standardized patients. This study employed ordinary…
Potthoff, Annegret; Oelschlägel, Kathrin; Schmitt-Jansen, Mechthild; Rummel, Christoph Daniel; Kühnel, Dana
2017-05-01
The presence of microplastic (MP) in the aquatic environment is recognized as a global-scale pollution issue. Secondary MP particles result from an ongoing fragmentation process governed by various biotic and abiotic factors. For a reliable risk assessment of these MP particles, knowledge about interactions with biota is needed. However, extensive testing with standard organisms under reproducible laboratory conditions with well-characterized MP suspensions is not available yet. As MP in the environment represents a mixture of particles differing in properties (e.g., size, color, polymer type, surface characteristics), it is likely that only specific particle fractions pose a threat towards organisms. In order to assign hazardous effects to specific particle properties, these characteristics need to be analyzed. As shown by the testing of particles (e.g. nanoparticles), characteristics other than chemical properties are important for the emergence of toxicity in organisms, and parameters such as surface area or size distribution need consideration. Therefore, the use of "well-defined" particles for ecotoxicological testing (i.e., standard particles) facilitates the establishment of causal links between physical-chemical properties of MP particles and toxic effects in organisms. However, the benefits of well-defined particles under laboratory conditions are offset by the disadvantage of the unknown comparability with MP in the environment. Therefore, weathering effects caused by biological, chemical, physical or mechanical processes have to be considered. To date, the characterization of the progression of MP weathering based on powder and suspension characterization methods is in its infancy. 
The aim of this commentary is to illustrate the prerequisites for testing MP in the laboratory from 3 perspectives: (i) knowledge of particle properties; (ii) behavior of MP in test setups involving ecotoxicological test organisms; and (iii) accordingly, test conditions that may need adjustment. Only under those prerequisites will reliable hazard assessment of MP be feasible. Integr Environ Assess Manag 2017;13:500-504. © 2017 SETAC.
A Comparison of a Brain-Based Adaptive System and a Manual Adaptable System for Invoking Automation
NASA Technical Reports Server (NTRS)
Bailey, Nathan R.; Scerbo, Mark W.; Freeman, Frederick G.; Mikulka, Peter J.; Scott, Lorissa A.
2004-01-01
Two experiments are presented that examine alternative methods for invoking automation. In each experiment, participants were asked to perform simultaneously a monitoring task and a resource management task as well as a tracking task that changed between automatic and manual modes. The monitoring task required participants to detect failures of an automated system to correct aberrant conditions under either high or low system reliability. Performance on each task was assessed as well as situation awareness and subjective workload. In the first experiment, half of the participants worked with a brain-based system that used their EEG signals to switch the tracking task between automatic and manual modes. The remaining participants were yoked to participants from the adaptive condition and received the same schedule of mode switches, but their EEG had no effect on the automation. Within each group, half of the participants were assigned to either the low or high reliability monitoring task. In addition, within each combination of automation invocation and system reliability, participants were separated into high and low complacency potential groups. The results revealed no significant effects of automation invocation on the performance measures; however, the high complacency individuals demonstrated better situation awareness when working with the adaptive automation system. The second experiment was the same as the first with one important exception. Automation was invoked manually. Thus, half of the participants pressed a button to invoke automation for 10 s. The remaining participants were yoked to participants from the adaptable condition and received the same schedule of mode switches, but they had no control over the automation. The results showed that participants who could invoke automation performed more poorly on the resource management task and reported higher levels of subjective workload. 
Further, those who invoked automation more frequently performed more poorly on the tracking task and reported higher levels of subjective workload. A comparison of the adaptive condition in the first experiment and the adaptable condition in the second experiment revealed only one significant difference: the subjective workload was higher in the adaptable condition. Overall, the results show that a brain-based, adaptive automation system may facilitate situation awareness for those individuals who are more complacent toward automation. By contrast, requiring operators to invoke automation manually may have some detrimental impact on performance but does appear to increase subjective workload relative to an adaptive system.
Measurement issues in the sonographic assessment of tennis elbow.
Poltawski, Leon; Jayaram, Vijay; Watson, Tim
2010-05-01
Sonography is increasingly being used for assessment in tennis elbow research and clinical practice, but there is a lack of data regarding its validity, reliability, and responsiveness to change for this application. Studies using the modality were reviewed to establish current levels of evidence for these measurement properties. There is reasonable evidence regarding its validity for identifying tennis elbow tendinopathy, but a lack of data addressing its reliability and responsiveness. Practical issues affecting image quality are discussed, and recommendations for further investigation are suggested, to enhance the credible use of sonography with this debilitating condition.
Human Reliability and the Cost of Doing Business
NASA Technical Reports Server (NTRS)
DeMott, Diana
2014-01-01
Most businesses recognize that people will make mistakes and assume errors are just part of the cost of doing business, but do they need to be? Companies with high risk, or major consequences, should consider the effect of human error. In a variety of industries, human errors have caused costly failures and workplace injuries. Airline mishaps, medical malpractice, medication administration errors, and major oil spills have all been blamed on human error. A technique to mitigate or even eliminate some of these costly human errors is the use of Human Reliability Analysis (HRA). Various methodologies are available to perform Human Reliability Assessments, ranging from identifying the most likely areas for concern to detailed assessments with human error failure probabilities calculated. Which methodology to use depends on a variety of factors, including: 1) how people react and act in different industries, and differing expectations based on industry standards; 2) factors that influence how human errors could occur, such as tasks, tools, environment, workplace, support, training and procedure; 3) type and availability of data; and 4) how the industry views risk and reliability influences (types of emergencies, contingencies and routine tasks versus cost-based concerns). Human Reliability Assessments should be the first step to reduce, mitigate or eliminate costly mistakes or catastrophic failures. Using Human Reliability techniques to identify and classify human error risks allows a company more opportunities to mitigate or eliminate these risks and prevent costly failures.
Evidence-based Assessment of Adherence to Medical Treatments in Pediatric Psychology
Modi, Avani C.; Lemanek, Kathleen L.; Ievers-Landis, Carolyn E.; Rapoff, Michael A.
2008-01-01
Objectives Adherence to medical regimens for children and adolescents with chronic conditions is generally below 50% and is considered the single greatest cause of treatment failure. As the prevalence of chronic illnesses in pediatric populations increases and awareness of the negative consequences of poor adherence becomes clearer, the need for reliable and valid measures of adherence has grown. Methods This review evaluated empirical evidence for 18 measures utilizing three assessment methods: (a) self-report or structured interviews, (b) daily diary methods, and (c) electronic monitors. Results Ten measures met the “well-established” evidence-based (EBA) criteria. Conclusions Several recommendations for improving adherence assessment were made. In particular, consideration should be given to the use of innovative technologies that provide a window into the “real time” behaviors of patients and families. Providing written treatment plans, identifying barriers to good adherence, and examining racial and ethnic differences in attitudes, beliefs and behaviors affecting adherence were strongly recommended. PMID:17846042
Socioeconomic Drought in a Changing Climate: Modeling and Management
NASA Astrophysics Data System (ADS)
AghaKouchak, Amir; Mehran, Ali; Mazdiyasni, Omid
2016-04-01
Drought is typically defined based on meteorological, hydrological and land surface conditions. However, in many parts of the world, anthropogenic changes and water management practices have significantly altered local water availability. Socioeconomic drought refers to conditions whereby the available water supply cannot satisfy the human and environmental water needs. Surface water reservoirs provide resilience against local climate variability (e.g., droughts), and play a major role in regional water management. This presentation focuses on a framework for describing socioeconomic drought based on both water supply and demand information. We present a multivariate approach as a measure of socioeconomic drought, termed Multivariate Standardized Reliability and Resilience Index (MSRRI; Mehran et al., 2015). This model links the information on inflow and surface reservoir storage to water demand. MSRRI integrates a "top-down" and a "bottom-up" approach for describing socioeconomic drought. The "top-down" component describes processes that cannot be simply controlled or altered by local decision-makers and managers (e.g., precipitation, climate variability, climate change), whereas the "bottom-up" component focuses on the local resilience, and societal capacity to respond to droughts. The two components (termed Inflow-Demand Reliability (IDR) indicator and Water Storage Resilience (WSR) indicator) are integrated using a nonparametric multivariate approach. We use this framework to assess the socioeconomic drought during the Australian Millennium Drought (1998-2010) and the 2011-2014 California Droughts. MSRRI provides additional information on socioeconomic drought onset, development and termination based on local resilience and human demand that cannot be obtained from the commonly used drought indicators. We show that MSRRI can be used for water management scenario analysis (e.g., local water availability based on different human water demand scenarios).
Finally, we provide examples of using the proposed modeling framework for analyzing water availability in a changing climate considering local conditions. Reference: Mehran A., Mazdiyasni O., AghaKouchak A., 2015, A Hybrid Framework for Assessing Socioeconomic Drought: Linking Climate Variability, Local Resilience, and Demand, Journal of Geophysical Research, 120 (15), 7520-7533, doi: 10.1002/2015JD023147
Test-Retest Reliability of the Short-Form Survivor Unmet Needs Survey.
Taylor, Karen; Bulsara, Max; Monterosso, Leanne
2018-01-01
Reliable and valid needs assessment measures are important assessment tools in cancer survivorship care. A new 30-item short-form version of the Survivor Unmet Needs Survey (SF-SUNS) was developed and validated with cancer survivors, including hematology cancer survivors; however, test-retest reliability has not been established. The objective of this study was to assess the test-retest reliability of the SF-SUNS with a cohort of lymphoma survivors (n = 40). Test-retest reliability of the SF-SUNS was conducted at two time points: baseline (time 1) and 5 days later (time 2). Test-retest data were collected from lymphoma cancer survivors (n = 40) in a large tertiary cancer center in Western Australia. Intraclass correlation analyses compared data at time 1 (baseline) and time 2 (5 days later). Cronbach's alpha analyses were performed to assess the internal consistency at both time points. The majority (23/30, 77%) of items achieved test-retest reliability scores of 0.45-0.74 (fair to good). A high degree of overall internal consistency was demonstrated (time 1 = 0.92, time 2 = 0.95), with scores of 0.65-0.94 across subscales for both time points. Mixed test-retest reliability of the SF-SUNS was established. Our results indicate the SF-SUNS is responsive to the changing needs of lymphoma cancer survivors. Routine use of cancer survivorship specific needs-based assessments is required in oncology care today. Nurses are well placed to administer these assessments and provide tailored information and resources. Further assessment of test-retest reliability in hematology and other cancer cohorts is warranted.
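Editorial illustration: the internal-consistency statistic named in the abstract above, Cronbach's alpha, can be sketched in a few lines of Python. This is a minimal sketch under stated assumptions, not the study's analysis: the function name and the toy scores are hypothetical, and sample variances (n - 1 denominator) are assumed.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondents, each a list of k item scores.

    alpha = k / (k - 1) * (1 - sum(item variances) / variance(total scores))
    """
    k = len(scores[0])
    # Transpose so that each tuple holds one item's scores across respondents.
    item_columns = list(zip(*scores))
    item_variances = [variance(column) for column in item_columns]
    total_variance = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - sum(item_variances) / total_variance)


# Hypothetical toy data: 3 respondents answering 2 items.
toy_scores = [[1, 2], [2, 1], [3, 3]]
print(round(cronbach_alpha(toy_scores), 3))
```

When every item orders the respondents identically, the total-score variance is dominated by shared variance and alpha approaches 1; weakly related items pull it toward 0.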
Larsen, Camilla Marie; Juul-Kristensen, Birgit; Lund, Hans; Søgaard, Karen
2014-10-01
The aims were to compile a schematic overview of clinical scapular assessment methods and critically appraise the methodological quality of the involved studies. A systematic, computer-assisted literature search using Medline, CINAHL, SportDiscus and EMBASE was performed from inception to October 2013. Reference lists in articles were also screened for publications. From 50 articles, 54 method names were identified and categorized into three groups: (1) Static positioning assessment (n = 19); (2) Semi-dynamic (n = 13); and (3) Dynamic functional assessment (n = 22). Fifteen studies were excluded from evaluation due to no/few clinimetric results, leaving 35 studies for evaluation. Graded according to the COnsensus-based Standards for the selection of health Measurement INstruments (COSMIN checklist), the methodological quality in the reliability and validity domains was "fair" (57%) to "poor" (43%), with only one study rated as "good". The reliability domain was most often investigated. Few of the assessment methods in the included studies that had "fair" or "good" measurement property ratings demonstrated acceptable results for both reliability and validity. We found a substantially larger number of clinical scapular assessment methods than previously reported. Using the COSMIN checklist, the methodological quality of the included measurement properties in the reliability and validity domains was in general "fair" to "poor". None were examined for all three domains: (1) reliability; (2) validity; and (3) responsiveness. Observational evaluation systems and assessment of scapular upward rotation seem suitably evidence-based for clinical use. Future studies should test and improve the clinimetric properties, and especially diagnostic accuracy and responsiveness, to increase utility for clinical practice.
Duff, Kevin
2012-01-01
Repeated assessments are a relatively common occurrence in clinical neuropsychology. The current paper will review some of the relevant concepts (e.g., reliability, practice effects, alternate forms) and methods (e.g., reliable change index, standardized regression-based approaches) that are used in repeated neuropsychological evaluations. The focus will be on the understanding and application of these concepts and methods in the evaluation of the individual patient through examples. Finally, some future directions for assessing change will be described. PMID:22382384
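Editorial illustration: the reliable change index mentioned above is commonly computed as the retest difference divided by the standard error of the difference. The sketch below assumes the widely used Jacobson-Truax form, with an optional practice-effect correction; the function name, parameter names, and example numbers are hypothetical and are not taken from the paper.

```python
import math


def reliable_change_index(baseline, retest, sd_baseline, test_retest_r,
                          practice_effect=0.0):
    """Reliable change index (assumed Jacobson-Truax form).

    SEM     = SD * sqrt(1 - r)      standard error of measurement
    SE_diff = sqrt(2) * SEM         standard error of the difference
    RCI     = (retest - baseline - practice_effect) / SE_diff
    """
    sem = sd_baseline * math.sqrt(1.0 - test_retest_r)
    se_diff = math.sqrt(2.0) * sem
    return (retest - baseline - practice_effect) / se_diff


# Hypothetical patient: score rises from 100 to 115 on a test whose
# normative SD is 10 and whose test-retest reliability is 0.80.
rci = reliable_change_index(100, 115, sd_baseline=10, test_retest_r=0.80)
print(round(rci, 2), "reliable change" if abs(rci) > 1.96 else "within error")
```

An absolute value above 1.96 is conventionally read as change beyond measurement error at the 95% level; subtracting a mean practice effect makes the index more conservative for tests on which scores usually improve at retest.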
Ultra-short heart rate variability recording reliability: The effect of controlled paced breathing.
Melo, Hiago M; Martins, Thiago C; Nascimento, Lucas M; Hoeller, Alexandre A; Walz, Roger; Takase, Emílio
2018-06-04
Recent studies have reported that Heart Rate Variability (HRV) indices remain reliable even during recordings shorter than 5 min, suggesting the ultra-short recording method as a valuable tool for autonomic assessment. However, the minimum time-epoch to obtain a reliable record for all HRV domains (time, frequency, and Poincaré geometric measures), as well as the effect of respiratory rate on the reliability of these indices, remains unknown. Twenty volunteers had their HRV recorded in a seated position during spontaneous and controlled respiratory rhythms. HRV intervals of 1, 2, and 3 min were correlated with the gold standard period (6-min duration) and the mean values of all indices were compared in the two respiratory rhythm conditions. rMSSD and SD1 were more reliable for recordings with ultra-short duration at all time intervals (r values from 0.764 to 0.950, p < 0.05) for the spontaneous breathing condition, whereas the other indices require longer recording time to obtain reliable values. The controlled breathing rhythm evokes stronger r values for time domain indices (r values from 0.83 to 0.99, p < 0.05 for rMSSD), but impairs the replicability of mean values across most time intervals. Although the use of standardized breathing increases the correlation coefficients, all HRV indices showed an increase in mean values (t values from 3.79 to 14.94, p < 0.001) except RR and HF, which decreased (t = 4.14 and 5.96, p < 0.0001). Our results indicate that a proper ultra-short-term recording method can provide a quick and reliable source of cardiac autonomic nervous system assessment. © 2018 Wiley Periodicals, Inc.
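Editorial illustration: the two indices singled out above, rMSSD and the Poincaré descriptor SD1, are both functions of successive RR-interval differences. The sketch below uses the standard definitions (rMSSD as the root mean square of successive differences, SD1 as the standard deviation of successive differences divided by the square root of 2); the function names and RR values are hypothetical, and the sample standard deviation is an assumed convention.

```python
import math
from statistics import stdev


def successive_differences(rr_ms):
    """Differences between consecutive RR intervals (milliseconds)."""
    return [b - a for a, b in zip(rr_ms, rr_ms[1:])]


def rmssd(rr_ms):
    """Root mean square of successive RR differences."""
    diffs = successive_differences(rr_ms)
    return math.sqrt(sum(d * d for d in diffs) / len(diffs))


def sd1(rr_ms):
    """Poincaré SD1: SD of successive differences divided by sqrt(2)."""
    return stdev(successive_differences(rr_ms)) / math.sqrt(2.0)


# Hypothetical RR series in milliseconds.
rr = [800, 810, 790, 805, 795]
print(round(rmssd(rr), 2), round(sd1(rr), 2))
```

Because both indices depend only on beat-to-beat differences, they are closely related (SD1 is approximately rMSSD divided by the square root of 2 when the mean difference is near zero), which is consistent with the two behaving alike in short recordings.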
Sun, Yi; Arning, Martin; Bochmann, Frank; Börger, Jutta; Heitmann, Thomas
2018-06-01
The Occupational Safety and Health Monitoring and Assessment Tool (OSH-MAT) is a practical instrument that is currently used in the German woodworking and metalworking industries to monitor safety conditions at workplaces. The 12-item scoring system has three subscales rating technical, organizational, and personnel-related conditions in a company. Each item has a rating value ranging from 1 to 9, with higher values indicating a higher standard of safety conditions. The reliability of this instrument was evaluated in a cross-sectional survey among 128 companies and its validity among 30,514 companies. The inter-rater reliability of the instrument was examined independently and simultaneously by two well-trained safety engineers. Agreement between the double ratings was quantified by the intraclass correlation coefficient and absolute agreement of the rating values. The content validity of the OSH-MAT was evaluated by quantifying the association between OSH-MAT values and 5-year average injury rates by Poisson regression analysis adjusted for the size of the companies and industrial sectors. The construct validity of OSH-MAT was examined by principal component factor analysis. Our analysis indicated good to very good inter-rater reliability (intraclass correlation coefficient = 0.64-0.74) of OSH-MAT values with an absolute agreement of between 72% and 81%. Factor analysis identified three component subscales that exactly matched the theoretical structure of this instrument. The Poisson regression analysis demonstrated a statistically significant exposure-response relationship between OSH-MAT values and the 5-year average injury rates. These analyses indicate that OSH-MAT is a valid and reliable instrument that can be used effectively to monitor safety conditions at workplaces.
Optimizing the early phase development of new analgesics by human pain biomarkers.
Arendt-Nielsen, Lars; Hoeck, Hans Christian
2011-11-01
Human pain biomarkers are based on standardized acute activation of pain pathways/mechanisms and quantitative assessment of the evoked responses. This approach can be applied to healthy volunteers, to pain patients, and before and after pharmacological interventions to help understand and profile the mode of action (proof-of-concept) of new and existing analgesic compounds. Standardized stimuli of different modalities can be applied to different tissues (multimodal and multi-tissue) for profiling analgesic compounds with respect to modulation of pain transduction, transmission, specific mechanisms and processing. This approach substantiates which specific compounds may work in particular clinical pain conditions. Human pain biomarkers can be translational and may bridge animal findings in clinical pain conditions, which in turn can provide new possibilities for designing more successful clinical trials. Biomarker-based proof-of-concept drug studies in either volunteers or selected patient populations provide inexpensive, fast and reliable mechanism-based information about dose-efficacy relationships. This is important information in the early drug development phase and for designing large expensive clinical trials.
Prognostics Applied to Electric Propulsion UAV
NASA Technical Reports Server (NTRS)
Goebel, Kai; Saha, Bhaskar
2013-01-01
Health management plays an important role in the operation of UAVs. If equipment malfunctions on critical components, safe operation of the UAV might be compromised. A technology with particular promise in this arena is equipment prognostics. This technology provides a state assessment of the health of components of interest and, if a degraded state has been found, it estimates how long it will take before the equipment will reach a failure threshold, conditional on assumptions about future operating conditions and future environmental conditions. This chapter explores the technical underpinnings of how to perform prognostics and shows an implementation on the propulsion of an electric UAV. A particle filter is shown as the method of choice in performing state assessment and predicting future degradation. The method is then applied to the batteries that provide power to the propeller motors. An accurate run-time battery life prediction algorithm is of critical importance to ensure the safe operation of the vehicle if one wants to maximize in-air time. Current reliability-based techniques turn out to be insufficient to manage the use of such batteries where loads vary frequently in uncertain environments.
NASA Astrophysics Data System (ADS)
Bordoni, M.; Bittelli, M.; Valentino, R.; Chersich, S.; Meisina, C.
2017-09-01
In this work, Soil Water Characteristic Curves (SWCCs) were reconstructed through simultaneous field measurements of soil pore water pressure and water content. The objective was to evaluate whether field-based monitoring can improve the accuracy of SWCC estimation with respect to the use of laboratory techniques. Moreover, field assessment of SWCCs made it possible to: a) quantify the hydrological hysteresis affecting SWCCs through field data; b) analyze the effect of different temporal resolutions of field measurements; c) highlight the differences in SWCCs reconstructed for a particular soil during different hydrological years; d) evaluate the reliability of field reconstructed SWCCs, by the comparison between assessed and measured trends of a component of the soil water balance. These aspects were fundamental for assessing the reliability of the field reconstructed SWCCs. Field data at two Italian test-sites were measured. These test-sites were used to evaluate the goodness of field reconstructed SWCCs for soils characterized by different geomorphological, geological, physical and pedological features. Field measured or laboratory measured SWCC data of 5 soil horizons (3 in a predominantly silty soil, 2 in a predominantly clayey one) were fitted with the Van Genuchten model. Different field drying and wetting periods were identified, based on monthly meteorological conditions, in terms of rainfall and evapotranspiration amounts, of different cycles. This method allowed for a correct discrimination of the main drying and the main wetting paths from field data and for a more reliable quantification of soil hydrological properties with respect to laboratory methodologies. Particular patterns of changes in SWCC forms along depth could also be identified. Field SWCC estimation is not affected by the temporal resolution of the acquisition (hours or days), as testified by similar values of the Van Genuchten equation fitting parameters.
Instead, hourly data may offer a clearer view of the drying and wetting paths, owing to the larger number of experimental data points. Moreover, in temperate climates such as those of the test-sites, the main drying curves and main wetting curves of a particular soil were substantially similar even for different hydrological cycles with peculiar meteorological conditions. SWCC parameters were implemented in a numerical code (HYDRUS-1D) to simulate soil water storage for different soil horizons. Field reconstructed SWCCs allowed these trends to be simulated with higher precision, confirming the reliability of the reconstructed field curves from a quantitative point of view. Moreover, the best results were obtained when hysteresis was considered in the modeling.
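Editorial illustration: the Van Genuchten model referred to above expresses volumetric water content as a function of pressure head. The sketch below uses the standard closed form with the usual constraint m = 1 - 1/n; the function name and all parameter values are hypothetical and are not the fitted values from the study. Hysteresis is represented only in the simplest way, by giving the wetting branch a larger alpha than the drying branch, which is an assumption made for illustration.

```python
def van_genuchten_theta(h_cm, theta_r, theta_s, alpha, n):
    """Volumetric water content from pressure head h (cm, negative when unsaturated).

    theta(h) = theta_r + (theta_s - theta_r) / (1 + (alpha * |h|)**n)**m,
    with m = 1 - 1/n; theta = theta_s for h >= 0 (saturated).
    """
    if h_cm >= 0:
        return theta_s
    m = 1.0 - 1.0 / n
    effective_saturation = (1.0 + (alpha * abs(h_cm)) ** n) ** (-m)
    return theta_r + (theta_s - theta_r) * effective_saturation


# Hypothetical parameter sets for one horizon (alpha in 1/cm).
drying = dict(theta_r=0.05, theta_s=0.45, alpha=0.02, n=1.5)
wetting = dict(theta_r=0.05, theta_s=0.45, alpha=0.04, n=1.5)

for head in (-10, -50, -200, -1000):
    print(head,
          round(van_genuchten_theta(head, **drying), 3),
          round(van_genuchten_theta(head, **wetting), 3))
```

At any given suction the wetting branch holds less water than the drying branch, which is the kind of gap that a hysteretic simulation has to account for.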
Øhre, Beate; Saltnes, Hege; von Tetzchner, Stephen; Falkum, Erik
2014-05-22
There is a need for psychiatric assessment instruments that enable reliable diagnoses in persons with hearing loss who have sign language as their primary language. The objective of this study was to assess the validity of the Norwegian Sign Language (NSL) version of the Mini International Neuropsychiatric Interview (MINI). The MINI was translated into NSL. Forty-one signing patients consecutively referred to two specialised psychiatric units were assessed with a diagnostic interview by clinical experts and with the MINI. Inter-rater reliability was assessed with Cohen's kappa and "observed agreement". There was 65% agreement between MINI diagnoses and clinical expert diagnoses. Kappa values indicated fair to moderate agreement, and observed agreement was above 76% for all diagnoses. The MINI diagnosed more co-morbid conditions than did the clinical expert interview (mean diagnoses: 1.9 versus 1.2). Kappa values indicated moderate to substantial agreement, and "observed agreement" was above 88%. The NSL version performs similarly to other MINI versions and demonstrates adequate reliability and validity as a diagnostic instrument for assessing mental disorders in persons who have sign language as their primary and preferred language.
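Editorial illustration: the two agreement statistics used above, observed agreement and Cohen's kappa, differ in that kappa discounts the agreement expected by chance from each rater's marginal frequencies. A minimal sketch follows; the function name and the ten example ratings are hypothetical and unrelated to the study data.

```python
from collections import Counter


def observed_agreement_and_kappa(rater_a, rater_b):
    """Return (observed agreement, Cohen's kappa) for two equal-length rating lists.

    kappa = (p_o - p_e) / (1 - p_e), where p_e is the chance agreement
    implied by the two raters' marginal category frequencies.
    """
    n = len(rater_a)
    p_o = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    counts_a, counts_b = Counter(rater_a), Counter(rater_b)
    categories = set(counts_a) | set(counts_b)
    p_e = sum(counts_a[c] * counts_b[c] for c in categories) / (n * n)
    return p_o, (p_o - p_e) / (1.0 - p_e)


# Hypothetical ratings: 1 = diagnosis present, 0 = absent, for 10 patients.
interview = [1, 1, 1, 0, 0, 0, 1, 0, 1, 0]
expert = [1, 1, 0, 0, 0, 0, 1, 1, 1, 0]
p_o, kappa = observed_agreement_and_kappa(interview, expert)
print(p_o, round(kappa, 2))
```

This is why a high observed agreement can coexist with a more modest kappa, especially for diagnoses that are rare in the sample.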
The use of remote sensing and GIS data in the NW Forest Plan aquatic assessment
NASA Astrophysics Data System (ADS)
Hirsch, C.; Gordon, S. N.; Eldred, P.
2017-12-01
The NW Forest Plan is an integrated set of standards and guidelines covering 24 million acres of federal lands on the west side of Washington, Oregon, and northern California. The associated Aquatic and Riparian Effectiveness Monitoring Program (AREMP) assesses watershed condition and trend on these lands at two levels: upslope/riparian, based on GIS and remote sensing data, and in-channel, based on field data collection. AREMP uses a multicriteria evaluation approach similar to WCF, but upslope attributes are restricted to roads and vegetation because these are the only ones which can be reliably backcasted to estimate trend since the start of the Plan in 1994. AREMP has developed a context-sensitive road evaluation system in response to studies showing that a large percentage of road sediment delivery to streams comes from a small percentage of the overall road network, and we are currently testing integration with the GRAIP-lite road sediment tool. AREMP's evaluation of vegetation data, based on Landsat imagery, also uses context derived from the distributions of least-human-disturbed conditions by 16 major forest types. A major question remains in how to evaluate vegetation given the importance of natural disturbances in renewing aquatic habitat. A common challenge across all of our data sources is how to assess uncertainty in data values and locational accuracy.
Clark, Cynthia M; Sattler, Victoria P; Barbosa-Leiker, Celestina
2016-10-01
Fostering healthy work environments that enhance job satisfaction and reflect high levels of employee engagement and productivity is imperative for all organizations. This is especially true for health care organizations where unhealthy work conditions can lead to poor patient outcomes. A convenience sample of 520 nursing faculty and practice-based nurses in the United States participated in a study to test the psychometric properties of the Healthy Work Environment Inventory (HWEI). A factor analysis and other reliability analyses support the use of the HWEI as a valid and reliable instrument to measure perceptions of work environment health. The HWEI is a 20-item psychometrically sound instrument to measure perceptions of the health of the work environment. It may be completed either as an individual exercise or by all members of a team to compare perceptions of work environment health, to determine areas of strength and improvement, and to form the basis for interviewing. [J Nurs Educ. 2016;55(10):555-562.]. Copyright 2016, SLACK Incorporated.
The effect of Web-based Braden Scale training on the reliability of Braden subscale ratings.
Magnan, Morris A; Maklebust, JoAnn
2009-01-01
The primary purpose of this study was to evaluate the effect of Web-based Braden Scale training on the reliability of Braden Scale subscale ratings made by nurses working in acute care hospitals. A secondary purpose was to describe the distribution of reliable Braden subscale ratings before and after Web-based Braden Scale training. Secondary analysis of data from a recently completed quasi-experimental, pretest-posttest, interrater reliability study. A convenience sample of RNs working at 3 Michigan medical centers voluntarily participated in the study. RN participants included nurses who used the Braden Scale regularly at their place of employment ("regular users") as well as nurses who did not use the Braden Scale at their place of employment ("new users"). Using a pretest-posttest, quasi-experimental design, pretest interrater reliability data were collected to identify the percentage of nurses making reliable Braden subscale assessments. Nurses then completed a Web-based Braden Scale training module after which posttest interrater reliability data were collected. The reliability of nurses' Braden subscale ratings was determined by examining the level of agreement/disagreement between ratings made by an RN and an "expert" rating the same patient. In total, 381 RN-to-expert dyads were available for analysis. During both the pretest and posttest periods, the percentage of reliable subscale ratings was highest for the activity subscale, lowest for the moisture subscale, and second lowest for the nutrition subscale. With Web-based Braden Scale training, the percentage of reliable Braden subscale ratings made by new users increased for all 6 subscales with statistically significant improvements in the percentage of reliable assessments made on 3 subscales: sensory-perception, moisture, and mobility. Training had virtually no effect on the percentage of reliable subscale ratings made by regular users of the Braden Scale. 
With Web-based Braden Scale training the percentage of nurses making reliable ratings increased for all 6 subscales, but this was true for new users only. Additional research is needed to identify educational approaches that effectively improve and sustain the reliability of subscale ratings among regular users of the Braden Scale. Moreover, special attention needs to be given to ensuring that all nurses working with the Braden Scale have a clear understanding of the intended meanings and correct approaches to rating moisture and nutrition subscales.
Valle, Susanne Collier; Støen, Ragnhild; Sæther, Rannei; Jensenius, Alexander Refsum; Adde, Lars
2015-10-01
A computer-based video analysis has recently been presented for quantitative assessment of general movements (GMs). This method's test-retest reliability, however, has not yet been evaluated. The aim of the current study was to evaluate the test-retest reliability of computer-based video analysis of GMs, and to explore the association between computer-based video analysis and the temporal organization of fidgety movements (FMs). Test-retest reliability study. 75 healthy, term-born infants were recorded twice the same day during the FMs period using a standardized video set-up. The computer-based movement variables "quantity of motion mean" (Qmean), "quantity of motion standard deviation" (QSD) and "centroid of motion standard deviation" (CSD) were analyzed, reflecting the amount of motion and the variability of the spatial center of motion of the infant, respectively. In addition, the association between the variable CSD and the temporal organization of FMs was explored. Intraclass correlation coefficients (ICC 1.1 and ICC 3.1) were calculated to assess test-retest reliability. The ICC values for the variables CSD, Qmean and QSD were 0.80, 0.80 and 0.86 for ICC (1.1), respectively; and 0.80, 0.86 and 0.90 for ICC (3.1), respectively. There were significantly lower CSD values in the recordings with continual FMs compared to the recordings with intermittent FMs (p<0.05). This study showed high test-retest reliability of computer-based video analysis of GMs, and a significant association between our computer-based video analysis and the temporal organization of FMs. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
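Editorial illustration: the two intraclass correlation forms reported above (written ICC 1.1 and ICC 3.1 in the abstract, usually written ICC(1,1) and ICC(3,1)) can both be derived from the mean squares of a subjects-by-sessions table. The sketch below uses the standard single-measurement formulas; the function name and the small ratings table are illustrative only and have no connection to the infant recordings.

```python
def icc_1_1_and_3_1(table):
    """Return (ICC(1,1), ICC(3,1)) for a table of n subjects x k measurements.

    ICC(1,1) = (MS_rows - MS_within) / (MS_rows + (k - 1) * MS_within)
    ICC(3,1) = (MS_rows - MS_error)  / (MS_rows + (k - 1) * MS_error)
    """
    n, k = len(table), len(table[0])
    grand_mean = sum(sum(row) for row in table) / (n * k)
    row_means = [sum(row) / k for row in table]
    col_means = [sum(row[j] for row in table) / n for j in range(k)]

    ss_total = sum((x - grand_mean) ** 2 for row in table for x in row)
    ss_rows = k * sum((m - grand_mean) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand_mean) ** 2 for m in col_means)

    ms_rows = ss_rows / (n - 1)                       # between-subject mean square
    ms_within = (ss_total - ss_rows) / (n * (k - 1))  # one-way within-subject mean square
    ms_error = (ss_total - ss_rows - ss_cols) / ((n - 1) * (k - 1))  # two-way residual

    icc_1_1 = (ms_rows - ms_within) / (ms_rows + (k - 1) * ms_within)
    icc_3_1 = (ms_rows - ms_error) / (ms_rows + (k - 1) * ms_error)
    return icc_1_1, icc_3_1


# Illustrative table: 6 subjects (rows) measured on 4 occasions (columns).
ratings = [
    [9, 2, 5, 8],
    [6, 1, 3, 2],
    [8, 4, 6, 8],
    [7, 1, 2, 6],
    [10, 5, 6, 9],
    [6, 2, 4, 7],
]
one_way, consistency = icc_1_1_and_3_1(ratings)
print(round(one_way, 2), round(consistency, 2))
```

ICC(3,1) removes systematic differences between occasions from the error term, so it is at least as large as ICC(1,1) whenever the occasion means differ, which matches the pattern of values reported in the abstract.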
Bernhardt, Jay M; Stellefson, Michael; Weiler, Robert M; Anderson-Lewis, Charkarra; Miller, M David; MacInnes, Jann
2015-01-01
Background Social media can promote healthy behaviors by facilitating engagement and collaboration among health professionals and the public. Thus, social media is quickly becoming a vital tool for health promotion. While guidelines and trainings exist for public health professionals, there are currently no standardized measures to assess individual social media competency among Certified Health Education Specialists (CHES) and Master Certified Health Education Specialists (MCHES). Objective The aim of this study was to design, develop, and test the Social Media Competency Inventory (SMCI) for CHES and MCHES. Methods The SMCI was designed in three sequential phases: (1) Conceptualization and Domain Specifications, (2) Item Development, and (3) Inventory Testing and Finalization. Phase 1 consisted of a literature review, concept operationalization, and expert reviews. Phase 2 involved an expert panel (n=4) review, think-aloud sessions with a small representative sample of CHES/MCHES (n=10), a pilot test (n=36), and classical test theory analyses to develop the initial version of the SMCI. Phase 3 included a field test of the SMCI with a random sample of CHES and MCHES (n=353), factor and Rasch analyses, and development of SMCI administration and interpretation guidelines. Results Six constructs adapted from the unified theory of acceptance and use of technology and the integrated behavioral model were identified for assessing social media competency: (1) Social Media Self-Efficacy, (2) Social Media Experience, (3) Effort Expectancy, (4) Performance Expectancy, (5) Facilitating Conditions, and (6) Social Influence. The initial item pool included 148 items. After the pilot test, 16 items were removed or revised because of low item discrimination (r<.30), high interitem correlations (Ρ>.90), or based on feedback received from pilot participants. 
During the psychometric analysis of the field test data, 52 items were removed due to low discrimination, evidence of content redundancy, low R-squared value, or poor item infit or outfit. Psychometric analyses of the data revealed acceptable reliability evidence for the following scales: Social Media Self-Efficacy (alpha=.98, item reliability=.98, item separation=6.76), Social Media Experience (alpha=.98, item reliability=.98, item separation=6.24), Effort Expectancy (alpha=.74, item reliability=.95, item separation=4.15), Performance Expectancy (alpha=.81, item reliability=.99, item separation=10.09), Facilitating Conditions (alpha=.66, item reliability=.99, item separation=16.04), and Social Influence (alpha=.66, item reliability=.93, item separation=3.77). There was some evidence of local dependence among the scales, with several observed residual correlations above |.20|. Conclusions Through the multistage instrument-development process, sufficient reliability and validity evidence was collected in support of the purpose and intended use of the SMCI. The SMCI can be used to assess the readiness of health education specialists to effectively use social media for health promotion research and practice. Future research should explore associations across constructs within the SMCI and evaluate the ability of SMCI scores to predict social media use and performance among CHES and MCHES. PMID:26399428
Alber, Julia M; Bernhardt, Jay M; Stellefson, Michael; Weiler, Robert M; Anderson-Lewis, Charkarra; Miller, M David; MacInnes, Jann
2015-09-23
Social media can promote healthy behaviors by facilitating engagement and collaboration among health professionals and the public. Thus, social media is quickly becoming a vital tool for health promotion. While guidelines and trainings exist for public health professionals, there are currently no standardized measures to assess individual social media competency among Certified Health Education Specialists (CHES) and Master Certified Health Education Specialists (MCHES). The aim of this study was to design, develop, and test the Social Media Competency Inventory (SMCI) for CHES and MCHES. The SMCI was designed in three sequential phases: (1) Conceptualization and Domain Specifications, (2) Item Development, and (3) Inventory Testing and Finalization. Phase 1 consisted of a literature review, concept operationalization, and expert reviews. Phase 2 involved an expert panel (n=4) review, think-aloud sessions with a small representative sample of CHES/MCHES (n=10), a pilot test (n=36), and classical test theory analyses to develop the initial version of the SMCI. Phase 3 included a field test of the SMCI with a random sample of CHES and MCHES (n=353), factor and Rasch analyses, and development of SMCI administration and interpretation guidelines. Six constructs adapted from the unified theory of acceptance and use of technology and the integrated behavioral model were identified for assessing social media competency: (1) Social Media Self-Efficacy, (2) Social Media Experience, (3) Effort Expectancy, (4) Performance Expectancy, (5) Facilitating Conditions, and (6) Social Influence. The initial item pool included 148 items. After the pilot test, 16 items were removed or revised because of low item discrimination (r<.30), high interitem correlations (Ρ>.90), or based on feedback received from pilot participants. 
During the psychometric analysis of the field test data, 52 items were removed due to low discrimination, evidence of content redundancy, low R-squared value, or poor item infit or outfit. Psychometric analyses of the data revealed acceptable reliability evidence for the following scales: Social Media Self-Efficacy (alpha=.98, item reliability=.98, item separation=6.76), Social Media Experience (alpha=.98, item reliability=.98, item separation=6.24), Effort Expectancy (alpha=.74, item reliability=.95, item separation=4.15), Performance Expectancy (alpha=.81, item reliability=.99, item separation=10.09), Facilitating Conditions (alpha=.66, item reliability=.99, item separation=16.04), and Social Influence (alpha=.66, item reliability=.93, item separation=3.77). There was some evidence of local dependence among the scales, with several observed residual correlations above |.20|. Through the multistage instrument-development process, sufficient reliability and validity evidence was collected in support of the purpose and intended use of the SMCI. The SMCI can be used to assess the readiness of health education specialists to effectively use social media for health promotion research and practice. Future research should explore associations across constructs within the SMCI and evaluate the ability of SMCI scores to predict social media use and performance among CHES and MCHES.
TENI: A comprehensive battery for cognitive assessment based on games and technology.
Delgado, Marcela Tenorio; Uribe, Paulina Arango; Alonso, Andrés Aparicio; Díaz, Ricardo Rosas
2016-01-01
TENI (Test de Evaluación Neuropsicológica Infantil) is an instrument developed to assess cognitive abilities in children between 3 and 9 years of age. It is based on a model that incorporates games and technology as tools to improve the assessment of children's capacities. The test was standardized with two Chilean samples of 524 and 82 children living in urban zones. Evidence of reliability and validity based on current standards is presented. Data show good levels of reliability for all subtests. Some evidence of validity in terms of content, test structure, and association with other variables is presented. This instrument represents a novel approach and a new frontier in cognitive assessment. Further studies with clinical, rural, and cross-cultural populations are required.
DOT National Transportation Integrated Search
2012-09-01
Capacity, demand, and vehicle-based emissions reduction strategies are compared for several pollutants employing aggregate US congestion and vehicle fleet condition data. We find that congestion mitigation does not inevitably lead to reduced emissi...
The Reliability, Impact, and Cost-Effectiveness of Value-Added Teacher Assessment Methods
ERIC Educational Resources Information Center
Yeh, Stuart S.
2012-01-01
This article reviews evidence regarding the intertemporal reliability of teacher rankings based on value-added methods. Value-added methods exhibit low reliability, yet are broadly supported by prominent educational researchers and are increasingly being used to evaluate and fire teachers. The article then presents a cost-effectiveness analysis…
A study on the real-time reliability of on-board equipment of train control system
NASA Astrophysics Data System (ADS)
Zhang, Yong; Li, Shiwei
2018-05-01
Real-time reliability evaluation is conducive to establishing a condition-based maintenance system that guarantees continuous train operation. Based on the inherent characteristics of the on-board equipment, this paper defines what reliability evaluation of on-board equipment means and provides an evaluation index for real-time reliability. The real-time reliability of the on-board equipment is discussed in detail from the perspectives of methodology and practical application, and a method based on the Hidden Markov Model (HMM) is proposed for evaluating the real-time reliability of on-board equipment at the component level. The method uses performance degradation data directly to accurately perceive the hidden state transition process of the on-board equipment, which gives a better description of the equipment's real-time reliability.
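The abstract does not give the model details, so the following is only a rough sketch of how an HMM can turn a stream of degradation observations into a real-time reliability estimate. The three states, the transition and emission probabilities, and the function names are all hypothetical; reliability is taken here to be the filtered probability that the component is not in the failed state.

```python
def forward_filter(observations, init, trans, emit):
    """Return P(hidden state | observations so far) via the HMM forward recursion."""
    n = len(init)
    belief = [init[s] * emit[s][observations[0]] for s in range(n)]
    total = sum(belief)
    belief = [b / total for b in belief]
    for obs in observations[1:]:
        # Predict: propagate the belief through the transition matrix.
        predicted = [sum(belief[i] * trans[i][j] for i in range(n)) for j in range(n)]
        # Update: weight by the likelihood of the new observation, then normalize.
        belief = [predicted[j] * emit[j][obs] for j in range(n)]
        total = sum(belief)
        belief = [b / total for b in belief]
    return belief


# Hypothetical model: states 0 = healthy, 1 = degraded, 2 = failed.
INIT = [1.0, 0.0, 0.0]
TRANS = [
    [0.90, 0.09, 0.01],
    [0.00, 0.80, 0.20],
    [0.00, 0.00, 1.00],
]
# Observation symbols: 0 = normal reading, 1 = abnormal reading.
EMIT = [
    [0.9, 0.1],
    [0.5, 0.5],
    [0.1, 0.9],
]


def realtime_reliability(observations):
    """Probability that the component has not yet reached the failed state."""
    return 1.0 - forward_filter(observations, INIT, TRANS, EMIT)[2]
```

With these assumed numbers, a run of abnormal readings lowers the estimate, while a run of normal readings keeps it close to 1.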
Franke, Konstantin H; Krumkamp, Ralf; Mohammed, Aliyu; Sarpong, Nimako; Owusu-Dabo, Ellis; Brinkel, Johanna; Fobil, Julius N; Marinovic, Axel Bonacic; Asihene, Philip; Boots, Mark; May, Jürgen; Kreuels, Benno
2018-03-27
The aim of this study was the development and evaluation of an algorithm-based diagnosis-tool, applicable on mobile phones, to support guardians in providing appropriate care to sick children. The algorithm was developed on the basis of the Integrated Management of Childhood Illness (IMCI) guidelines and evaluated at a hospital in Ghana. Two hundred and thirty-seven guardians applied the tool to assess their child's symptoms. Data recorded by the tool and health records completed by a physician were compared in terms of symptom detection, disease assessment and treatment recommendation. To compare both assessments, Kappa statistics and predictive values were calculated. The tool detected the symptoms of cough, fever, diarrhoea and vomiting with good agreement to the physicians' findings (kappa = 0.64; 0.59; 0.57 and 0.42 respectively). The disease assessment barely coincided with the physicians' findings. The tool's treatment recommendation correlated with the physicians' assessments in 93 out of 237 cases (39.2% agreement, kappa = 0.11), but underestimated a child's condition in only seven cases (3.0%). The algorithm-based tool achieved reliable symptom detection and treatment recommendations were administered conformably to the physicians' assessment. Testing in domestic environment is envisaged.
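For readers unfamiliar with the kappa statistic used above, here is a minimal sketch of Cohen's kappa computed from an agreement table. The function name and the 2x2 counts are illustrative assumptions; only the 93/237 and 7/237 proportions come from the abstract.

```python
def cohen_kappa(table):
    """Cohen's kappa from a square agreement table (rows: rater A, columns: rater B)."""
    n = sum(sum(row) for row in table)
    p_observed = sum(table[i][i] for i in range(len(table))) / n
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    # Agreement expected by chance from the marginal totals.
    p_expected = sum(r * c for r, c in zip(row_totals, col_totals)) / n ** 2
    return (p_observed - p_expected) / (1 - p_expected)


# Hypothetical table: tool vs physician on "symptom present / absent".
hypothetical_table = [
    [20, 5],
    [10, 15],
]
hypothetical_kappa = cohen_kappa(hypothetical_table)

# Percentages reported in the abstract, recomputed from the stated counts.
agreement_pct = round(93 / 237 * 100, 1)
underestimated_pct = round(7 / 237 * 100, 1)
```

Kappa discounts agreement that would occur by chance, which is why raw agreement of 39.2% can correspond to a kappa as low as 0.11.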
Optimising mobility outcome measures in Huntington's disease.
Busse, Monica; Quinn, Lori; Khalil, Hanan; McEwan, Kirsten
2014-01-01
Many of the performance-based mobility measures that are currently used in Huntington's disease (HD) were developed for assessment in other neurological conditions such as stroke. We aimed to assess the individual item-response of commonly used performance-based mobility measures, with a view to optimizing the scales for specific application in Huntington's Disease (HD). Data from a larger multicentre, observational study were used. Seventy-five people with HD (11 pre-manifest & 64 manifest) were assessed on the Six-Minute Walk Test, 10-Meter Walk Test, Timed "Up & Go" Test (TUG), Berg Balance Scale (BBS), Physical Performance Test (PPT), Four Square Step Test, and Tinetti Mobility Test (TMT). The Unified Huntington's Disease Rating Scale (UHDRS) Total Motor Score, Functional Assessment Scale and Total Functional Capacity scores were recorded, alongside cognitive measures. Standard regression analysis was used to assess predictive validity. Individual item responses were investigated using a sequence of approaches to allow for gradual removal of items and the subsequent creation of shortened versions. Psychometric properties (reliability and discriminant ability) of the shortened scales were assessed. TUG (β 0.46, CI 0.20-3.47), BBS (β -0.35, CI -2.10-0.14), and TMT (β -0.45, CI -3.14-0.64) were good disease-specific mobility measures. PPT was the best measure of functional performance (β 0.42, CI 0.00-0.43 for TFC & β 0.57 CI 0.15-0.81 for FAS). Shortened versions of BBS and TMT were developed based on item analysis. The resultant BBS and TMT shortened scales were reliable for use in manifest HD. ROC analysis showed that shortened scales were able to discriminate between manifest and pre-manifest disease states. Our data suggests that the PPT is appropriate as a general measure of function in individuals with HD, and we have identified shortened versions of the BBS and TMT that measure the unique gait and balance impairments in HD. 
These scales, alongside the TUG, may therefore be important measures to consider in future clinical trials.
Assessment and preliminary design of an energy buffer for regenerative braking in electric vehicles
NASA Technical Reports Server (NTRS)
Buchholz, R.; Mathur, A. K.
1979-01-01
Energy buffer systems, capable of storing the vehicle energy during braking and reusing this stored energy during acceleration, were examined. Some of these buffer systems when incorporated in an electric vehicle would result in an improvement in the performance and range under stop and go driving conditions. Buffer systems considered included flywheels, hydropneumatic, pneumatic, spring, and regenerative braking. Buffer ranking and rating criteria were established. Buffer systems were rated based on predicted range improvements, consumer acceptance, driveability, safety, reliability and durability, and initial and life cycle costs. A hydropneumatic buffer system was selected.
2013-01-01
Background: The Parent-Infant Relationship Global Assessment Scale (PIR-GAS) signifies a conceptually relevant development in the multi-axial, developmentally sensitive classification system DC:0-3R for preschool children. However, information about the reliability and validity of the PIR-GAS is scarce. A review of the available empirical studies suggests that in research, PIR-GAS ratings can be based on a ten-minute videotaped interaction sequence. The qualification of raters may be very heterogeneous across studies. Methods: To test whether the use of the PIR-GAS still allows for a reliable assessment of the parent-infant relationship, our study compared PIR-GAS ratings based on a full-information procedure across multiple settings with ratings based on a ten-minute video by two doctoral candidates of medicine. For each mother-child dyad at a family day hospital (N = 48), we obtained two video ratings and one full-information rating at admission to therapy and at discharge. This pre-post design allowed for a replication of our findings across the two measurement points. We focused on the inter-rater reliability between the video coders, as well as between the video and full-information procedure, including mean differences and correlations between the raters. Additionally, we examined aspects of the validity of video and full-information ratings based on their correlation with measures of child and maternal psychopathology. Results: Our results showed that ten-minute video and full-information PIR-GAS ratings were not interchangeable. Most results at admission could be replicated by the data obtained at discharge. We concluded that a higher degree of standardization of the assessment procedure should increase the reliability of the PIR-GAS, and a more thorough theoretical foundation of the manual should increase its validity. PMID:23705962
NASA Astrophysics Data System (ADS)
Varlataya, S. K.; Evdokimov, V. E.; Urzov, A. Y.
2017-11-01
This article describes a process for calculating the reliability of a complex information security system (CISS), using the technospheric security management model as an example, and for determining its maintenance frequency from the system reliability parameter, which allows one to assess man-made risks and to forecast natural and man-made emergencies. The relevance of this article is explained by the fact that CISS reliability is closely related to information security (IS) risks. Since reliability (or resiliency) is a probabilistic characteristic of the system showing the possibility of its failure (and, as a consequence, the emergence of threats to the protected information assets), it is seen as a component of the overall IS risk in the system. As is known, experts assign a certain acceptable level of IS risk to a particular information system; when reliability is a risk-forming factor, an acceptable risk level should be maintained by routine analysis of the condition of the CISS and its elements and by their timely service. The article presents a reliability parameter calculation for a CISS with a mixed type of element connection and gives a formula for the dynamics of such a system's reliability. The chart of CISS reliability change is an S-shaped curve that can be divided into 3 periods: an almost invariable high level of reliability, a uniform reduction in reliability, and an almost invariable low level of reliability. Given a minimum acceptable level of reliability, the graph (or formula) can be used to determine the period of time during which the system would meet requirements. Ideally, this period should not be longer than the first period of the graph. Thus, the proposed method of calculating the CISS maintenance frequency helps to solve a voluminous and critical task of risk management for information assets.
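The article's own formula is not reproduced in the abstract. As a hedged illustration of the general idea, the sketch below combines series and parallel elements and searches for the longest interval over which system reliability stays above a chosen threshold. The exponential element model, the failure rate, the layout (two redundant pairs in series), and all function names are assumptions made for this example.

```python
import math


def series(reliabilities):
    """A series block works only if every element works."""
    return math.prod(reliabilities)


def parallel(reliabilities):
    """A parallel (redundant) block fails only if every element fails."""
    return 1.0 - math.prod(1.0 - r for r in reliabilities)


def system_reliability(t, rate=0.01):
    """Assumed mixed layout: two redundant pairs connected in series."""
    r = math.exp(-rate * t)  # assumed exponential reliability of one element
    return series([parallel([r, r]), parallel([r, r])])


def maintenance_interval(threshold, step=1.0, horizon=10_000.0):
    """Largest multiple of `step` up to which reliability stays >= threshold."""
    t = 0.0
    while t + step <= horizon and system_reliability(t + step) >= threshold:
        t += step
    return t


interval = maintenance_interval(0.9)
```

With the assumed rate of 0.01 per time unit and a threshold of 0.9, the search stops at 25 time units; a real CISS would need element reliabilities estimated from its own data.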
Moore, Amy Lawson; Miller, Terissa M
2018-01-01
The purpose of the current study is to evaluate the validity and reliability of the revised Gibson Test of Cognitive Skills, a computer-based battery of tests measuring short-term memory, long-term memory, processing speed, logic and reasoning, visual processing, as well as auditory processing and word attack skills. This study included 2,737 participants aged 5-85 years. A series of studies was conducted to examine the validity and reliability using the test performance of the entire norming group and several subgroups. The evaluation of the technical properties of the test battery included content validation by subject matter experts, item analysis and coefficient alpha, test-retest reliability, split-half reliability, and analysis of concurrent validity with the Woodcock Johnson III Tests of Cognitive Abilities and Tests of Achievement. Results indicated strong sources of evidence of validity and reliability for the test, including internal consistency reliability coefficients ranging from 0.87 to 0.98, test-retest reliability coefficients ranging from 0.69 to 0.91, split-half reliability coefficients ranging from 0.87 to 0.91, and concurrent validity coefficients ranging from 0.53 to 0.93. The Gibson Test of Cognitive Skills-2 is a reliable and valid tool for assessing cognition in the general population across the lifespan.
Smelling is Telling: Human Olfactory Cues Influence Social Judgments in Semi-Realistic Interactions.
Gaby, Jessica M; Zayas, Vivian
2017-06-01
How does a person's smell affect others' impressions of them? Most body odor research asks perceivers to make social judgments based on armpit sweat without perfume or deodorant, presented on t-shirts. Yet, in real life, perceivers encounter fragranced body odor, on whole bodies. Our "raters" wore blindfolds and earplugs and repeatedly smelled same-sex "donors" in live interactions. In one condition, donors wore their normal deodorant and perfume ("diplomatic" odor) while in the other condition, donors were asked to avoid all outside fragrance influences ("natural" odor). We assessed the reliability of social judgments based on such live interactions, and the relationships between live judgments and traditional t-shirt based judgments, and between natural- and diplomatic odor-based judgments. Raters' repeated live social judgments (e.g., friendliness, likeability) were highly consistent for both diplomatic and natural odor, and converged with judgments based on t-shirts. However, social judgments based on natural odor did not consistently predict social judgments based on diplomatic odor, suggesting that natural and diplomatic body odor may convey different types of social information. Our results provide evidence that individuals can perceive reliable, meaningful social olfactory signals from whole bodies, at social distances, regardless of the presence or absence of perfume. Importantly, however, the social value of these signals is modified by the addition of exogenous fragrances. Further, our focus on judgments in same-sex dyads suggests that these olfactory cues hold social value in non-mating contexts. We suggest that future research employ more ecologically relevant methods. © The Author 2017. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Validity and inter-observer reliability of subjective hand-arm vibration assessments.
Coenen, Pieter; Formanoy, Margriet; Douwes, Marjolein; Bosch, Tim; de Kraker, Heleen
2014-07-01
Exposure to mechanical vibrations at work (e.g., due to handling powered tools) is a potential occupational risk as it may cause upper extremity complaints. However, reliable and valid assessment methods for vibration exposure at work are lacking. Measuring hand-arm vibration objectively is often difficult and expensive, while the frequently used information provided by manufacturers lacks detail. Therefore, a subjective hand-arm vibration assessment method was tested for validity and inter-observer reliability. In an experimental protocol, sixteen tasks handling powered tools were executed by two workers. Hand-arm vibration was assessed subjectively by 16 observers according to the proposed subjective assessment method. As a gold standard reference, hand-arm vibration was measured objectively using a vibration measurement device. Weighted κ's were calculated to assess validity, and intra-class correlation coefficients (ICCs) were calculated to assess inter-observer reliability. Inter-observer reliability of the subjective assessments, depicting the agreement among observers, can be expressed by an ICC of 0.708 (0.511-0.873). The validity of the subjective assessments as compared to the gold-standard reference can be expressed by a weighted κ of 0.535 (0.285-0.785). In addition, the percentage of exact agreement of the subjective assessment compared to the objective measurement was relatively low (i.e., 52% of all tasks). This study shows that subjectively assessed hand-arm vibrations are fairly reliable among observers and moderately valid. This assessment method is a first attempt to use subjective risk assessments of hand-arm vibration. Although this assessment method can benefit from future improvement, it can be of use in future studies and in field-based ergonomic assessments. Copyright © 2014 Elsevier Ltd and The Ergonomics Society. All rights reserved.
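The abstract does not say which ICC form was used. Assuming a two-way random-effects, absolute-agreement, single-rater ICC (often written ICC(2,1)), a minimal sketch of the calculation from a tasks-by-observers rating matrix might look as follows; the function name and the ratings are illustrative, not study data.

```python
def icc_2_1(ratings):
    """ICC(2,1) from a matrix with one row per rated target and one column per rater."""
    n = len(ratings)          # targets (e.g., tasks)
    k = len(ratings[0])       # raters (e.g., observers)
    grand = sum(sum(row) for row in ratings) / (n * k)
    row_means = [sum(row) / k for row in ratings]
    col_means = [sum(col) / n for col in zip(*ratings)]

    # Two-way ANOVA sums of squares.
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for row in ratings for x in row)
    ss_error = ss_total - ss_rows - ss_cols

    ms_rows = ss_rows / (n - 1)
    ms_cols = ss_cols / (k - 1)
    ms_error = ss_error / ((n - 1) * (k - 1))
    return (ms_rows - ms_error) / (
        ms_rows + (k - 1) * ms_error + k * (ms_cols - ms_error) / n
    )


# Illustrative ratings: 6 targets scored by 4 raters.
illustrative_ratings = [
    [9, 2, 5, 8],
    [6, 1, 3, 2],
    [8, 4, 6, 8],
    [7, 1, 2, 6],
    [10, 5, 6, 9],
    [6, 2, 4, 7],
]
illustrative_icc = icc_2_1(illustrative_ratings)
```

In this illustrative matrix the raters rank the targets similarly but differ systematically in level, so the absolute-agreement ICC comes out low (about 0.29) even though the ratings are strongly related.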
Fell, Matthew; Meirte, Jill; Anthonissen, Mieke; Maertens, Koen; Pleat, Jonathon; Moortgat, Peter
2016-03-01
Objective scar assessment tools were designed to help identify problematic scars and direct clinical management. Their use has been restricted by their measurement of a single scar property and the bulky size of equipment. The Scarbase Duo(®) was designed to assess both trans-epidermal water loss (TEWL) and colour of a burn scar whilst being compact and easy to use. Twenty patients with a burn scar were recruited and measurements taken using the Scarbase Duo(®) by two observers. The Scarbase Duo(®) measures TEWL via an open-chamber system and undertakes colorimetry via narrow-band spectrophotometry, producing values for relative erythema and melanin pigmentation. Validity was assessed by comparing the Scarbase Duo(®) against the Dermalab(®) and the Minolta Chromameter(®) respectively for TEWL and colorimetry measurements. The intra-class correlation coefficient (ICC) was used to assess reliability with standard error of measurement (SEM) used to assess reproducibility of measurements. The Pearson correlation coefficient (r) was used to assess the convergent validity. The Scarbase Duo(®) TEWL mode had excellent reliability when used on scars for both intra- (ICC=0.95) and inter-rater (ICC=0.96) measurements with moderate SEM values. The erythema component of the colorimetry mode showed good reliability for use on scars for both intra-(ICC=0.81) and inter-rater (ICC=0.83) measurements with low SEM values. Pigmentation values showed excellent reliability on scar tissue for both intra- (ICC=0.97) and inter-rater (ICC=0.97) with moderate SEM values. The Scarbase Duo(®) TEWL function had excellent correlation with the Dermalab(®) (r=0.93) whilst the colorimetry erythema value had moderate correlation with the Minolta Chromameter (r=0.72). The Scarbase Duo(®) is a reliable and objective scar assessment tool, which is specifically designed for burn scars. However, for clinical use, standardised measurement conditions are recommended. Copyright © 2015 Elsevier Ltd and ISBI. 
All rights reserved.
Bridge reliability assessment based on the PDF of long-term monitored extreme strains
NASA Astrophysics Data System (ADS)
Jiao, Meiju; Sun, Limin
2011-04-01
Structural health monitoring (SHM) systems can provide valuable information for the evaluation of bridge performance. With the development and implementation of SHM technology in recent years, the mining and use of monitoring data have received increasing attention in civil engineering. Based on the principles of probability and statistics, a reliability approach provides a rational basis for analyzing the randomness in loads and their effects on structures. This paper presents a novel approach that combines SHM systems with reliability methods to evaluate the reliability of a cable-stayed bridge instrumented with SHM systems. In this study, the reliability of the steel girder of the cable-stayed bridge was expressed directly by the failure probability instead of the commonly used reliability index. Under the assumption that the probability distribution of the resistance is independent of the structural responses, a formulation of the failure probability was derived. Then, as a main factor in the formulation, the probability density function (PDF) of the strain at sensor locations was evaluated from the monitoring data and verified. Donghai Bridge was then taken as an example application of the proposed approach. In the case study, 4 years of monitoring data recorded since the SHM systems began operating were processed, and the reliability assessment results were discussed. Finally, the sensitivity and accuracy of the novel approach were compared with those of the first-order reliability method (FORM).
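The paper's exact formulation is not given in the abstract. For orientation, the standard textbook expression for the failure probability of a component with independent resistance and load effect is shown below, together with its relation to the reliability index; the symbols R, S, F_R, and f_S are notation chosen here, not necessarily the authors'.

```latex
% R: resistance, with cumulative distribution function F_R
% S: load effect (e.g. monitored extreme strain), with PDF f_S
% R and S are assumed statistically independent
P_f = P(R \le S) = \int_{-\infty}^{\infty} F_R(s)\, f_S(s)\, \mathrm{d}s ,
\qquad
\beta = -\Phi^{-1}(P_f)
```

Here \Phi is the standard normal cumulative distribution function, so reporting P_f directly and reporting the reliability index \beta carry the same information.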
Test-Retest Reliability of the Preschool Age Psychiatric Assessment (PAPA)
ERIC Educational Resources Information Center
Egger, Helen Link; Erkanli, Alaattin; Keeler, Gordon; Potts, Edward; Walter, Barbara Keith; Angold, Adrian
2006-01-01
Objective: To examine the test-retest reliability of a new interviewer-based psychiatric diagnostic measure (the Preschool Age Psychiatric Assessment) for use with parents of preschoolers 2 to 5 years old. Method: A total of 1,073 parents of children attending a large pediatric clinic completed the Child Behavior Checklist 1 1/2-5. For 18 months,…
NASA Astrophysics Data System (ADS)
Fagbeja, M. A.; Hill, J. L.; Chatterton, T. J.; Longhurst, J. W.; Akinyede, J. O.
2011-12-01
Space-based satellite sensor technology may provide important tools in the study and assessment of national, regional and local air pollution. However, the application of optical satellite sensor observation of atmospheric trace gases, including those considered to be 'air pollutants', within the lower latitudes is limited due to prevailing climatic conditions. The lack of appropriate air pollution ground monitoring stations within the tropical belt reduces the ability to verify and calibrate space-based measurements. This paper considers the suitability of satellite remotely sensed data in estimating concentrations of atmospheric trace gases in view of the prevailing climate over the Niger Delta region. The methodological approach involved identifying suitable satellite data products and using the ArcGIS Geostatistical Analyst kriging interpolation technique to generate surface concentrations from satellite column measurements. The observed results are considered in the context of the climate of the study area. Using data from January 2001 to December 2005, an assessment of the suitability of satellite sensor data to interpolate column concentrations of trace gases over the Niger Delta has been undertaken and indicates varying degrees of reliability. The level of reliability of the interpolated surfaces is predicated on the number and spatial distributions of column measurements. Accounting for the two climatic seasons in the region, the interpolation of total column concentrations of CO and CO2 from SCIAMACHY produced both reliable and unreliable results over inland parts of the region during the dry season, while mainly unreliable results are observed over the coastal parts especially during the rainy season due to inadequate column measurements. The interpolation of tropospheric measurements of NO2 and O3 from GOME and OMI respectively produced reliable results all year. 
This is thought to be due to the spatial distribution of available column measurements, which were more regularly distributed over the region than the total column measurements of CO and CO2. Observations also indicated higher concentrations during the dry season than during the wet season. The observed trend in the concentration of tropospheric O3 was as expected, considering the observed concentrations of the precursor gases CO and NO2. Whilst satellites currently play a significant role in the assessment of global air pollution and the long-range transport of air pollutants, the technology is faced with limitations in assessing ground level concentrations of pollutants. These limitations restrict the extent to which both pollution emissions and impacts on receptors can be accurately assessed. Further research is required to improve the capability of satellite sensors to observe atmospheric pollutants within the lower troposphere, where pollution has the most direct impacts on humans and ecosystems.
Bartels, Meike; Cath, Danielle C.; Boomsma, Dorret I.
2008-01-01
The factor structure of the Dutch translation of the Autism-Spectrum Quotient (AQ; a continuous, quantitative measure of autistic traits) was evaluated with confirmatory factor analyses in a large general population and student sample. The criterion validity of the AQ was examined in three matched patient groups (autism spectrum conditions (ASC), social anxiety disorder, and obsessive–compulsive disorder). A two factor model, consisting of a “Social interaction” factor and “Attention to detail” factor could be identified. The internal consistency and test–retest reliability of the AQ were satisfactory. High total AQ and factor scores were specific to ASC patients. Men scored higher than women and science students higher than non-science students. The Dutch translation of the AQ is a reliable instrument to assess autism spectrum conditions. PMID:18302013
Loi, Samantha M; Wanasinghage, Sangeeth; Goh, Anita; Lautenschlager, Nicola T; Darby, David G; Velakoulis, Dennis
2018-04-01
Improving and minimizing challenging behaviors seen in psychiatric conditions, including behavioral and psychological symptoms of dementia are important in the care of people with these conditions. Yet there is a lack of systematic evaluation of these as a part of routine clinical care. The Neuropsychiatric Inventory is a validated and reliable tool for rating the severity and disruptiveness of challenging behaviors. We report on the evaluation of a Web-based symptom assessment manager (SAM), designed to address the limitation of previous tools using some of the Neuropsychiatric Inventory functions, to monitor behaviors by staff caring for people with dementia and other psychiatric conditions in inpatient and residential care settings. The SAM was piloted in an 8-bed inpatient neuropsychiatry unit over 5 months. Eleven nurses and 4 clinicians were trained in usage of SAM. Primary outcomes were usage of SAM and perceived usability, utility, and acceptance of SAM. Secondary outcomes were the frequencies of documented behavior. Usage data were analyzed using chi-square and logistic regression analyses. The SAM was used for all admitted patients regardless of diagnosis, with a usage rate of 64% for nurses regularly employed in the unit. Staff provided positive feedback regarding the utility of SAM. The SAM appeared to offer individualized behavior assessment by providing a quick, structured, and standardized platform for assessing behavior in a real-world setting. Further research would involve trialing SAM with more staff in alternative settings such as in home or residential care settings. Copyright © 2017 John Wiley & Sons, Ltd.
Condition assessment of timber bridges. 2, Evaluation of several stress-wave tools
Brian K. Brashaw; Robert J. Vatalaro; James P. Wacker; Robert J. Ross
2005-01-01
This study was conducted to evaluate the accuracy and reliability of several stress-wave devices widely used for locating deteriorated areas in timber bridge members. Bridge components containing different levels of natural decay were tested using various devices. The specimens were then sawn (along their length) into slabs to expose their interior condition. The...
Development and evaluation of an instrument for assessing brief behavioral change interventions.
Strayer, Scott M; Martindale, James R; Pelletier, Sandra L; Rais, Salehin; Powell, Jon; Schorling, John B
2011-04-01
To develop an observational coding instrument for evaluating the fidelity and quality of brief behavioral change interventions based on the behavioral theories of the 5 A's, Stages of Change and Motivational Interviewing. Content and face validity were assessed prior to an intervention where psychometric properties were evaluated with a prospective cohort of 116 medical students. Properties assessed included the inter-rater reliability of the instrument, internal consistency of the full scale and sub-scales and descriptive statistics of the instrument. Construct validity was assessed based on student's scores. Inter-rater reliability for the instrument was 0.82 (intraclass correlation). Internal consistency for the full scale was 0.70 (KR20). Internal consistencies for the sub-scales were as follows: MI intervention component (KR20=.7); stage-appropriate MI-based intervention (KR20=.55); MI spirit (KR20=.5); appropriate assessment (KR20=.45) and appropriate assisting (KR20=.56). The instrument demonstrated good inter-rater reliability and moderate overall internal consistency when used to assess performing brief behavioral change interventions by medical students. This practical instrument can be used with minimal training and demonstrates promising psychometric properties when evaluated with medical students counseling standardized patients. Further testing is required to evaluate its usefulness in clinical settings. Copyright © 2010 Elsevier Ireland Ltd. All rights reserved.
Klinck, Mary P; Rialland, Pascale; Guillot, Martin; Moreau, Maxim; Frank, Diane; Troncy, Eric
2015-12-02
Subtle signs and conflicting physical and radiographic findings make feline osteoarthritis (OA) challenging to diagnose. A physical examination-based assessment was developed, consisting of eight items: Interaction, Exploration, Posture, Gait, Body Condition, Coat and Claws, (joint) Palpation-Findings, and Palpation-Cat Reaction. Content (experts) and face (veterinary students) validity were excellent. Construct validity, internal consistency, and intra- and inter-rater reliability were assessed via a pilot and main study, using laboratory-housed cats with and without OA. Gait distinguished OA status in the pilot study (p = 0.05). In the main study, no scale item achieved statistically significant OA detection. Forelimb peak vertical ground reaction force (PVF) correlated inversely with Gait (Rho_s = -0.38 (p = 0.03) to -0.41 (p = 0.02)). Body Posture correlated with Gait, and inversely with forelimb PVF at two of three time points (Rho_s = -0.38 (p = 0.03) to -0.43 (p = 0.01)). Palpation (Findings, Cat Reaction) did not distinguish OA from non-OA cats. Palpation-Cat Reaction (Forelimbs) correlated inversely with forelimb PVF at two time points (Rho_s = -0.41 (p = 0.02) to -0.41 (p = 0.01)), but scores were highly variable and poorly reliable. Gait and Posture require improved sensitivity, and Palpation should be interpreted cautiously, in diagnosing feline OA.
NASA Astrophysics Data System (ADS)
Ndiritu, John; Ilemobade, Adesola; Kagoda, Paulo
2018-06-01
As water demand increases, rainwater harvesting (RWH) systems are increasingly being installed for water supply, but comprehensive hydrologic design guidelines for RWH do not exist in many parts of the world. The objective of this study was to develop guidelines for the hydrologic design and assessment of RWH systems in the City of Johannesburg, South Africa. The data for developing the guidelines were mainly obtained from multiple daily simulations of potential RWH systems in the city. The simulations used daily rainfall from 8 stations and demands based on the probable non-potable uses of RWH systems: toilet flushing, air conditioning and irrigation. The guidelines were confined to systems that would typically fill up in the wet season and empty towards the end of the dry season of the same year. Therefore, supply-to-demand ratios ranging from 0.1 to 0.9 were applied. Two generalized design charts of dimensionless relationships were developed. One relates the yield ratio to the supply-to-demand ratio and reliability, while the other relates the yield ratio to the storage-to-demand ratio and reliability. Reliability was defined as the probability of exceedance of annual yield in order to incorporate the large inter-annual variability of rainfall experienced in the region. The analysis and design of an example RWH system are used to illustrate the application of the design charts.
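The simulation procedure is not detailed in the abstract. The sketch below shows one common way to run a daily tank water balance and to express reliability as an exceedance probability of annual yield; the spill-before-supply ordering, the runoff coefficient, the function names, and every input value are assumptions for illustration.

```python
def simulate_yield_ratio(rain_mm, roof_m2, tank_l, demand_l, runoff_coeff=0.85):
    """Daily tank water balance; returns supplied volume / demanded volume."""
    storage = 0.0
    supplied = 0.0
    for rain in rain_mm:
        inflow = rain * roof_m2 * runoff_coeff   # 1 mm of rain on 1 m2 is 1 litre
        storage = min(storage + inflow, tank_l)  # excess above capacity spills
        draw = min(demand_l, storage)            # supply what the tank can give
        storage -= draw
        supplied += draw
    return supplied / (demand_l * len(rain_mm))


def exceedance_reliability(annual_yield_ratios, target):
    """Fraction of simulated years whose yield ratio meets or exceeds the target."""
    met = sum(1 for y in annual_yield_ratios if y >= target)
    return met / len(annual_yield_ratios)


# Hypothetical four-day run: one 10 mm storm, 100 m2 roof, 500 L tank, 200 L/day demand.
short_run = simulate_yield_ratio([10, 0, 0, 0], roof_m2=100, tank_l=500,
                                 demand_l=200, runoff_coeff=1.0)
```

Running such a simulation once per year of record gives a set of annual yield ratios, and `exceedance_reliability` then reports how often a candidate design meets a target yield.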
Automated MRI Cerebellar Size Measurements Using Active Appearance Modeling
Price, Mathew; Cardenas, Valerie A.; Fein, George
2014-01-01
Although the human cerebellum has been increasingly identified as an important hub that shows potential for helping in the diagnosis of a large spectrum of disorders, such as alcoholism, autism, and fetal alcohol spectrum disorder, the high costs associated with manual segmentation, and low availability of reliable automated cerebellar segmentation tools, has resulted in a limited focus on cerebellar measurement in human neuroimaging studies. We present here the CATK (Cerebellar Analysis Toolkit), which is based on the Bayesian framework implemented in FMRIB’s FIRST. This approach involves training Active Appearance Models (AAM) using hand-delineated examples. CATK can currently delineate the cerebellar hemispheres and three vermal groups (lobules I–V, VI–VII, and VIII–X). Linear registration with the low-resolution MNI152 template is used to provide initial alignment, and Point Distribution Models (PDM) are parameterized using stellar sampling. The Bayesian approach models the relationship between shape and texture through computation of conditionals in the training set. Our method varies from the FIRST framework in that initial fitting is driven by 1D intensity profile matching, and the conditional likelihood function is subsequently used to refine fitting. The method was developed using T1-weighted images from 63 subjects that were imaged and manually labeled: 43 subjects were scanned once and were used for training models, and 20 subjects were imaged twice (with manual labeling applied to both runs) and used to assess reliability and validity. Intraclass correlation analysis shows that CATK is highly reliable (average test-retest ICCs of 0.96), and offers excellent agreement with the gold standard (average validity ICC of 0.87 against manual labels). 
Comparisons against an alternative atlas-based approach, SUIT (Spatially Unbiased Infratentorial Template), that registers images with a high-resolution template of the cerebellum, show that our AAM approach offers superior reliability and validity. Extensions of CATK to cerebellar hemisphere parcels are envisioned. PMID:25192657
Reliability Analysis/Assessment of Advanced Technologies
1990-05-01
34, Reliability Physics 1980, IEEE, p. 165. 25. RADC-TR-83-244. 26. Towner, Janet M., et al., "Aluminum Electromigration Under Pulsed D.C. Conditions...Duvvury, Redwine, Kitagawa, Haas, Chuang, Beydler, Hyslop, "Impact of Hot Carriers On DRAM circuits", 1987 IEEE/IRPS. 58. Cahoon, Thornewell, Tsai...et al., "Substrate for Large Silicon Chip and Full Wafer Packaging", Semiconductor International, pp. 149-156, April 1980. 5. T.E. Lewis and D.L
Karstad, Kristina; Rugulies, Reiner; Skotte, Jørgen; Munch, Pernille Kold; Greiner, Birgit A; Burdorf, Alex; Søgaard, Karen; Holtermann, Andreas
2018-05-01
The aim of the study was to develop and evaluate the reliability of the "Danish observational study of eldercare work and musculoskeletal disorders" (DOSES) observation instrument to assess physical and psychosocial risk factors for musculoskeletal disorders (MSD) in eldercare work. Over 1.5 years, sixteen raters conducted 117 inter-rater observations in 11 nursing homes. Reliability was evaluated using percent agreement and Gwet's AC1 coefficient. Of the 18 examined items, inter-rater reliability was excellent for 7 items (AC1 > 0.75), fair to good for 7 items (AC1 0.40-0.75), and poor for 2 items (AC1 0-0.40). For 2 items there was no agreement between the raters (AC1 < 0). The reliability did not differ between the first and second half of the data collection period, and the inter-rater observations were representative regarding occurrence of events in eldercare work. The instrument is appropriate for assessing physical and psychosocial risk factors for MSD among eldercare workers. Copyright © 2018 The Authors. Published by Elsevier Ltd. All rights reserved.
Aminoff, Bechor Z; Purits, Elena; Noy, Shlomo; Adunsky, Abraham
2004-01-01
Assessment of suffering is extremely important in dying end-stage dementia patients (ESDP). We have developed and examined the reliability and validity of the Mini-Suffering State Examination (MSSE), in 103 consecutive bedridden ESDP. Main outcome measures included inter-observer reliability and concurrent validity. Reliability of the MSSE questionnaire was satisfactory, with Cronbach alpha values of 0.735 and 0.718 for the two physicians (Ph-1, Ph-2), respectively. The kappa agreement coefficient was 0.791. There was a high agreement for seven items (kappa 0.882-0.972) and a substantial agreement for the other three items (kappa 0.621-0.682) of the MSSE. MSSE was validated versus the comfort assessment in dying with dementia (CAD-EOLD) scale and resulted in a significant Pearson correlation (r=-0.796, P<0.001). We conclude that the MSSE scale is a reliable and valid clinical tool, recommended for evaluating the severity of the patient's condition and the level of suffering of ESDP. Use of MSSE may improve medical management and facilitate communication between patients and caregivers.
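As an illustration of the kappa agreement coefficient reported above, the following minimal sketch computes Cohen's kappa for two raters scoring the same patients on a categorical item. That the study used Cohen's (unweighted) kappa is an assumption, and the function name and sample ratings are hypothetical.

```python
from collections import Counter


def cohens_kappa(rater_a, rater_b):
    """Cohen's kappa: (observed - expected) / (1 - expected) agreement."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("ratings must be non-empty and of equal length")
    n = len(rater_a)
    # Proportion of subjects on which the two raters agree.
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Agreement expected by chance from each rater's marginal frequencies.
    count_a = Counter(rater_a)
    count_b = Counter(rater_b)
    expected = sum(count_a[c] * count_b[c] for c in count_a) / (n * n)
    if expected == 1.0:
        raise ValueError("kappa is undefined when both raters use one category")
    return (observed - expected) / (1.0 - expected)


# Hypothetical yes/no ratings (1 = item present) from two physicians.
physician_1 = [1, 1, 0, 0]
physician_2 = [1, 0, 0, 0]
kappa = cohens_kappa(physician_1, physician_2)
```

Here the raters agree on 3 of 4 patients (0.75) while chance agreement is 0.5, so kappa is 0.5.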
Vinco, L J; Giacomelli, S; Campana, L; Chiari, M; Vitale, N; Lombardi, G; Veldkamp, T; Hocking, P M
2018-02-01
1. An experiment was conducted to compare 5 different methods for the evaluation of litter moisture. 2. For litter collection and assessment, 55 farms were selected, one shed from each farm was inspected and 9 points were identified within each shed. 3. For each device used for the evaluation of litter moisture, the mean and standard deviation of wetness measures per collection point were assessed. 4. The reliability and overall consistency between the 5 instruments used to measure wetness were high (α = 0.72). 5. Measurement of three out of the 9 collection points was sufficient to provide a reliable assessment of litter moisture throughout the shed. 6. Based on the direct correlation between litter moisture and footpad lesions, litter moisture measurement can be used as a resource-based on-farm animal welfare indicator. 7. Among the 5 methods analysed, visual scoring is the simplest and most practical, and therefore the best candidate to be used on-farm for animal welfare assessment.
A reliability analysis of the revised competitiveness index.
Harris, Paul B; Houston, John M
2010-06-01
This study examined the reliability of the Revised Competitiveness Index by investigating the test-retest reliability, interitem reliability, and factor structure of the measure based on a sample of 280 undergraduates (200 women, 80 men) ranging in age from 18 to 28 years (M = 20.1, SD = 2.1). The findings indicate that the Revised Competitiveness Index has high test-retest reliability, high inter-item reliability, and a stable factor structure. The results support the assertion that the Revised Competitiveness Index assesses competitiveness as a stable trait rather than a dynamic state.
The risk of bias in systematic reviews tool showed fair reliability and good construct validity.
Bühn, Stefanie; Mathes, Tim; Prengel, Peggy; Wegewitz, Uta; Ostermann, Thomas; Robens, Sibylle; Pieper, Dawid
2017-11-01
There is a movement from generic quality checklists toward a more domain-based approach in critical appraisal tools. This study aimed to report on a first experience with the newly developed risk of bias in systematic reviews (ROBIS) tool and to compare it with A Measurement Tool to Assess Systematic Reviews (AMSTAR), the most commonly used tool to assess methodological quality of systematic reviews, while assessing validity, reliability, and applicability. Validation study with four reviewers based on 16 systematic reviews in the field of occupational health. Interrater reliability (IRR) of all four raters was highest for domain 2 (Fleiss' kappa κ = 0.56) and lowest for domain 4 (κ = 0.04). For ROBIS, median IRR was κ = 0.52 (range 0.13-0.88) for the experienced pair of raters compared to κ = 0.32 (range 0.12-0.76) for the less experienced pair of raters. The percentage of "yes" scores of each review of ROBIS ratings was strongly correlated with the AMSTAR ratings (r_s = 0.76; P = 0.01). ROBIS has fair reliability and good construct validity to assess the risk of bias in systematic reviews. More validation studies are needed to investigate reliability and applicability, in particular. Copyright © 2017 Elsevier Inc. All rights reserved.
Frost, Rachael; Levati, Sara; McClurg, Doreen; Brady, Marian; Williams, Brian
2017-06-01
To systematically review methods for measuring adherence used in home-based rehabilitation trials and to evaluate their validity, reliability, and acceptability. In phase 1 we searched the CENTRAL database, NHS Economic Evaluation Database, and Health Technology Assessment Database (January 2000 to April 2013) to identify adherence measures used in randomized controlled trials of allied health professional home-based rehabilitation interventions. In phase 2 we searched the databases of MEDLINE, Embase, CINAHL, Allied and Complementary Medicine Database, PsycINFO, CENTRAL, ProQuest Nursing and Allied Health, and Web of Science (inception to April 2015) for measurement property assessments for each measure. Studies assessing the validity, reliability, or acceptability of adherence measures. Two reviewers independently extracted data on participant and measure characteristics, measurement properties evaluated, evaluation methods, and outcome statistics and assessed study quality using the COnsensus-based Standards for the selection of health Measurement INstruments checklist. In phase 1 we included 8 adherence measures (56 trials). In phase 2, from the 222 measurement property assessments identified in 109 studies, 22 high-quality measurement property assessments were narratively synthesized. Low-quality studies were used as supporting data. StepWatch Activity Monitor validly and acceptably measured short-term step count adherence. The Problematic Experiences of Therapy Scale validly and reliably assessed adherence to vestibular rehabilitation exercises. Adherence diaries had moderately high validity and acceptability across limited populations. The Borg 6 to 20 scale, Bassett and Prapavessis scale, and Yamax CW series had insufficient validity. Low-quality evidence supported use of the Joint Protection Behaviour Assessment. Polar A1 series heart monitors were considered acceptable by 1 study. Current rehabilitation adherence measures are limited. 
Some possess promising validity and acceptability for certain parameters of adherence, situations, and populations and should be used in these situations. Rigorous evaluation of adherence measures in a broader range of populations is needed. Copyright © 2016 American Congress of Rehabilitation Medicine. Published by Elsevier Inc. All rights reserved.
Robot-Assisted Arm Assessments in Spinal Cord Injured Patients: A Consideration of Concept Study
Albisser, Urs; Rudhe, Claudia; Curt, Armin; Riener, Robert; Klamroth-Marganska, Verena
2015-01-01
Robotic assistance is increasingly used in neurological rehabilitation for enhanced training. Furthermore, therapy robots have the potential for accurate assessment of motor function in order to diagnose the patient status, to measure therapy progress or to feedback the movement performance to the patient and therapist in real time. We investigated whether a set of robot-based assessments that encompasses kinematic, kinetic and timing metrics is applicable, safe, reliable and comparable to clinical metrics for measurement of arm motor function. Twenty-four healthy subjects and five patients after spinal cord injury underwent robot-based assessments using the exoskeleton robot ARMin. Five different tasks were performed with aid of a visual display. Ten kinematic, kinetic and timing assessment parameters were extracted on joint- and end-effector level (active and passive range of motion, cubic reaching volume, movement time, distance-path ratio, precision, smoothness, reaction time, joint torques and joint stiffness). For cubic volume, joint torques and the range of motion for most joints, good inter- and intra-rater reliability were found whereas precision, movement time, distance-path ratio and smoothness showed weak to moderate reliability. A comparison with clinical scores revealed good correlations between robot-based joint torques and the Manual Muscle Test. Reaction time and distance-path ratio showed good correlation with the “Graded and Redefined Assessment of Strength, Sensibility and Prehension” (GRASSP) and the Van Lieshout Test (VLT) for movements towards a predefined position in the center of the frontal plane. In conclusion, the therapy robot ARMin provides a comprehensive set of assessments that are applicable and safe. The first results with spinal cord injured patients and healthy subjects suggest that the measurements are widely reliable and comparable to clinical scales for arm motor function. 
The methods applied and results can serve as a basis for the future development of end-effector and exoskeleton-based robotic assessments. PMID:25996374
Human Reliability Assessments: Using the Past (Shuttle) to Predict the Future (Orion)
NASA Technical Reports Server (NTRS)
DeMott, Diana L.; Bigler, Mark A.
2017-01-01
NASA (National Aeronautics and Space Administration) Johnson Space Center (JSC) Safety and Mission Assurance (S&MA) uses two human reliability analysis (HRA) methodologies. The first is a simplified method which is based on how much time is available to complete the action, with consideration included for environmental and personal factors that could influence the human's reliability. This method is expected to provide a conservative value or placeholder as a preliminary estimate. This preliminary estimate or screening value is used to determine which placeholder needs a more detailed assessment. The second methodology is used to develop a more detailed human reliability assessment on the performance of critical human actions. This assessment needs to consider more than the time available; it includes factors such as the importance of the action, the context, environmental factors, potential human stresses, previous experience, training, physical design interfaces, available procedures/checklists and internal human stresses. The more detailed assessment is expected to be more realistic than that based primarily on time available. When performing an HRA on a system or process that has an operational history, we have information specific to the task based on this history and experience. In the case of a Probabilistic Risk Assessment (PRA) that is based on a new design and has no operational history, providing a "reasonable" assessment of potential crew actions becomes more challenging. To determine what is expected of future operational parameters, the experience from individuals who had relevant experience and were familiar with the system and process previously implemented by NASA was used to provide the "best" available data. Personnel from Flight Operations, Flight Directors, Launch Test Directors, Control Room Console Operators, and Astronauts were all interviewed to provide a comprehensive picture of previous NASA operations.
Verification of the assumptions and expectations expressed in the assessments will be needed when the procedures, flight rules, and operational requirements are developed and then finalized.
Human Reliability Assessments: Using the Past (Shuttle) to Predict the Future (Orion)
NASA Technical Reports Server (NTRS)
DeMott, Diana; Bigler, Mark
2016-01-01
NASA (National Aeronautics and Space Administration) Johnson Space Center (JSC) Safety and Mission Assurance (S&MA) uses two human reliability analysis (HRA) methodologies. The first is a simplified method which is based on how much time is available to complete the action, with consideration included for environmental and personal factors that could influence the human's reliability. This method is expected to provide a conservative value or placeholder as a preliminary estimate. This preliminary estimate or screening value is used to determine which placeholder needs a more detailed assessment. The second methodology is used to develop a more detailed human reliability assessment on the performance of critical human actions. This assessment needs to consider more than the time available; it includes factors such as the importance of the action, the context, environmental factors, potential human stresses, previous experience, training, physical design interfaces, available procedures/checklists and internal human stresses. The more detailed assessment is expected to be more realistic than that based primarily on time available. When performing an HRA on a system or process that has an operational history, we have information specific to the task based on this history and experience. In the case of a Probabilistic Risk Assessment (PRA) that is based on a new design and has no operational history, providing a "reasonable" assessment of potential crew actions becomes more challenging. In order to determine what is expected of future operational parameters, the experience from individuals who had relevant experience and were familiar with the system and process previously implemented by NASA was used to provide the "best" available data. Personnel from Flight Operations, Flight Directors, Launch Test Directors, Control Room Console Operators and Astronauts were all interviewed to provide a comprehensive picture of previous NASA operations.
Verification of the assumptions and expectations expressed in the assessments will be needed when the procedures, flight rules and operational requirements are developed and then finalized.
Reliability and Probabilistic Risk Assessment - How They Play Together
NASA Technical Reports Server (NTRS)
Safie, Fayssal; Stutts, Richard; Huang, Zhaofeng
2015-01-01
Since the Space Shuttle Challenger accident in 1986, NASA has extensively used probabilistic analysis methods to assess, understand, and communicate the risk of space launch vehicles. Probabilistic Risk Assessment (PRA), used in the nuclear industry, is one of the probabilistic analysis methods NASA utilizes to assess Loss of Mission (LOM) and Loss of Crew (LOC) risk for launch vehicles. PRA is a system scenario based risk assessment that uses a combination of fault trees, event trees, event sequence diagrams, and probability distributions to analyze the risk of a system, a process, or an activity. It is a process designed to answer three basic questions: 1) what can go wrong that would lead to loss or degraded performance (i.e., scenarios involving undesired consequences of interest), 2) how likely is it (probabilities), and 3) what is the severity of the degradation (consequences). Since the Challenger accident, PRA has been used in supporting decisions regarding safety upgrades for launch vehicles. Another area that was given a lot of emphasis at NASA after the Challenger accident is reliability engineering. Reliability engineering has been a critical design function at NASA since the early Apollo days. However, after the Challenger accident, quantitative reliability analysis and reliability predictions were given more scrutiny because of their importance in understanding failure mechanism and quantifying the probability of failure, which are key elements in resolving technical issues, performing design trades, and implementing design improvements. Although PRA and reliability are both probabilistic in nature and, in some cases, use the same tools, they are two different activities. Specifically, reliability engineering is a broad design discipline that deals with loss of function and helps understand failure mechanism and improve component and system design. 
PRA is a system scenario based risk assessment process intended to assess the risk scenarios that could lead to a major/top undesirable system event, and to identify those scenarios that are high-risk drivers. PRA output is critical to support risk informed decisions concerning system design. This paper describes the PRA process and the reliability engineering discipline in detail. It discusses their differences and similarities and how they work together as complementary analyses to support the design and risk assessment processes. Lessons learned, applications, and case studies in both areas are also discussed in the paper to demonstrate and explain these differences and similarities.
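The fault-tree part of a PRA, as described above, can be sketched in a few lines. This is a minimal illustration that assumes statistically independent basic events; the tree layout, event names, and probabilities are hypothetical and are not taken from the paper.

```python
from math import prod


def and_gate(probabilities):
    """Output event occurs only if every input event occurs."""
    return prod(probabilities)


def or_gate(probabilities):
    """Output event occurs if at least one input event occurs."""
    return 1.0 - prod(1.0 - p for p in probabilities)


# Hypothetical tree: loss of cooling occurs if both redundant pumps fail
# (AND gate) or if the single shared valve fails (OR gate at the top).
p_pump_a = 0.1
p_pump_b = 0.1
p_valve = 0.01

p_both_pumps = and_gate([p_pump_a, p_pump_b])
p_top_event = or_gate([p_both_pumps, p_valve])
```

In this toy tree the shared valve contributes as much to the top event as the redundant pump pair, which is the kind of risk-driver ranking a PRA is meant to expose.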
DOE Office of Scientific and Technical Information (OSTI.GOV)
Glass, Samuel W.; Fifield, Leonard S.; Jones, Anthony M.
Cable insulation polymers are among the more susceptible materials to age-related degradation within a nuclear power plant. This is recognized by both regulators and utilities, so all plants have developed cable aging management programs to detect damage before critical component failure in compliance with regulatory guidelines. Although a wide range of tools are available to evaluate cables and cable systems, cable aging management programs vary in how condition monitoring and NDE is conducted as utilities search for the most reliable and cost-effective ways to assess cable system condition. Frequency domain reflectometry (FDR) is emerging as one valuable tool to locate and assess damaged portions of a cable system with minimal cost and only requires access in most cases to one of the cable terminal ends. This work examines a physics-based model of a cable system and relates it to FDR measurements for a better understanding of specific damage influences on defect detectability.
NASA Technical Reports Server (NTRS)
Kleinhammer, Roger K.; Graber, Robert R.; DeMott, D. L.
2016-01-01
Reliability practitioners advocate getting reliability involved early in a product development process. However, when assigned to estimate or assess the (potential) reliability of a product or system early in the design and development phase, they are faced with a lack of reasonable models or methods for useful reliability estimation. Developing specific data is costly and time consuming. Instead, analysts rely on available data to assess reliability. Finding data relevant to the specific use and environment for any project is difficult, if not impossible. Instead, analysts attempt to develop the "best" or composite analog data to support the assessments. Industries, consortia and vendors across many areas have spent decades collecting, analyzing and tabulating fielded item and component reliability performance in terms of observed failures and operational use. This data resource provides a huge compendium of information for potential use, but can also be compartmented by industry, difficult to find out about, access, or manipulate. One method used incorporates processes for reviewing these existing data sources and identifying the available information based on similar equipment, then using that generic data to derive an analog composite. Dissimilarities in equipment descriptions, environment of intended use, quality and even failure modes impact the "best" data incorporated in an analog composite. Once developed, this composite analog data provides a "better" representation of the reliability of the equipment or component. It can be used to support early risk or reliability trade studies, or analytical models to establish the predicted reliability data points. It also establishes a baseline prior that may be updated based on test data or observed operational constraints and failures, i.e., using Bayesian techniques.
This tutorial presents a descriptive compilation of historical data sources across numerous industries and disciplines, along with examples of contents and data characteristics. It then presents methods for combining failure information from different sources and mathematical use of this data in early reliability estimation and analyses.
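The Bayesian updating step mentioned in this abstract can be sketched with a conjugate gamma-Poisson model: a gamma prior on a constant failure rate, updated with observed failures over an exposure time. The choice of this model, the function name, and all numbers are assumptions made for illustration; the tutorial itself may use other techniques.

```python
def update_failure_rate(prior_shape, prior_rate, failures, exposure_hours):
    """Gamma-Poisson conjugate update of a constant failure rate.

    The prior Gamma(shape, rate) has mean shape / rate (failures per hour).
    Observing `failures` events in `exposure_hours` of operation gives the
    posterior Gamma(shape + failures, rate + exposure_hours).
    Returns (posterior_shape, posterior_rate, posterior_mean).
    """
    post_shape = prior_shape + failures
    post_rate = prior_rate + exposure_hours
    return post_shape, post_rate, post_shape / post_rate


# Hypothetical analog-composite prior: mean 2e-3 failures per hour,
# weighted as if it came from 1000 hours of generic field experience.
prior_shape, prior_rate = 2.0, 1000.0

# Hypothetical test evidence: 1 failure observed in 4000 operating hours.
shape, rate, mean_rate = update_failure_rate(prior_shape, prior_rate, 1, 4000.0)
```

The prior rate parameter acts as a pseudo-exposure, so a smaller value lets the project's own test data dominate the generic data sooner.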
NASA Astrophysics Data System (ADS)
McPhee, J.; William, Y. W.
2005-12-01
This work presents a methodology for pumping test design based on the reliability requirements of a groundwater model. Reliability requirements take into consideration the application of the model results in groundwater management, expressed in this case as a multiobjective management model. The pumping test design is formulated as a mixed-integer nonlinear programming (MINLP) problem and solved using a combination of genetic algorithm (GA) and gradient-based optimization. Bayesian decision theory provides a formal framework for assessing the influence of parameter uncertainty on the reliability of the proposed pumping test. The proposed methodology is useful for selecting a robust design that will outperform all other candidate designs under most potential 'true' states of the system.
A Comparison of Laser and Video Techniques for Determining Displacement and Velocity during Running
ERIC Educational Resources Information Center
Harrison, Andrew J.; Jensen, Randall L.; Donoghue, Orna
2005-01-01
The reliability of a laser system was compared with the reliability of a video-based kinematic analysis in measuring displacement and velocity during running. Validity and reliability of the laser on static measures was also assessed at distances between 10 m and 70 m by evaluating the coefficient of variation and intraclass correlation…
Kolodziejczyk, Julia K; Norman, Gregory J; Rock, Cheryl L; Arredondo, Elva M; Roesch, Scott C; Madanat, Hala; Patrick, Kevin
2016-01-01
This study evaluates the reliability and validity of the strategies for weight management (SWM) measure, a questionnaire that assesses weight management strategies for adults. The SWM includes 20 items that are categorized within the following subscales: (1) energy intake, (2) energy expenditure, (3) self-monitoring, and (4) self-regulation. Baseline and 6-month data were collected from 404 overweight/obese adults (mean age=22±3.8 years, 68% ethnic minority) enrolled in a randomized controlled trial aiming to reduce weight by improving diet and physical activity behaviours. Reliability and validity were assessed for each subscale separately. Cronbach alpha was conducted to assess reliability. Concurrent, construct I (sensitivity to the study treatment condition), and construct II (relationship to the outcomes) validity were assessed using linear regressions with the following outcome measures: weight, self-reported diet, and weekly energy expenditure. All subscales showed strong internal consistency. The strength of the validity evidence depended on subscale and validity type. The strongest validity evidence was concurrent validity of the energy intake and energy expenditure subscales; construct I validity of the energy intake and self-monitoring subscales; and construct II validity of the energy intake, energy expenditure, and self-regulation subscales. Results indicate that the SWM can be used to assess weight management strategies among an ethnically diverse sample of adults as each subscale showed evidence of reliability and select types of validity. As validity is an accumulation of evidence over multiple studies, this study provides initial reliability and validity evidence in one population segment. Copyright © 2015 Asia Oceania Association for the Study of Obesity. Published by Elsevier Ltd. All rights reserved.
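The internal-consistency statistic named in this abstract, Cronbach's alpha, can be computed per subscale as in the minimal sketch below. The data layout (one row of item scores per respondent), the function name, and the sample scores are hypothetical.

```python
from statistics import variance


def cronbach_alpha(item_scores):
    """Cronbach's alpha for a list of respondents' item-score rows.

    alpha = k / (k - 1) * (1 - sum(item variances) / variance(total score)),
    using sample variances and k items per respondent.
    """
    if len(item_scores) < 2:
        raise ValueError("need at least two respondents")
    k = len(item_scores[0])
    if k < 2 or any(len(row) != k for row in item_scores):
        raise ValueError("each respondent needs the same k >= 2 items")
    # Transpose rows (respondents) into columns (items).
    columns = list(zip(*item_scores))
    item_variance_sum = sum(variance(col) for col in columns)
    total_variance = variance([sum(row) for row in item_scores])
    return k / (k - 1) * (1 - item_variance_sum / total_variance)


# Hypothetical two-item subscale answered by three respondents; the items
# move in lockstep, so alpha reaches its maximum.
scores = [[1, 2], [2, 3], [3, 4]]
alpha = cronbach_alpha(scores)
```

Applied to the SWM, the function would be called once per subscale with only that subscale's items in each row.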
Stoner, Lee; Geoffron, Morgane; Cornwall, Jon; Chinn, Victoria; Gram, Martin; Credeur, Daniel; Fryer, Simon
2016-12-01
Recently, it was reported that intra-abdominal thickness (IAT) assessments using ultrasound are most reliable if measured from the linea alba to the anterior vertebral column. These 2 anatomical sites can be simultaneously visualized using a linear array transducer. Linear array transducers have different operational characteristics when compared with conventional curved array transducers and are more reliable for some ultrasound-derived measures such as abdominal subcutaneous fat thickness. However, it is unknown whether linear array transducers facilitate more reliable IAT measurements than curved array transducers. The purpose of the current study was to (1) compare the reliability of linear and curved array transducer assessments of IAT and maximal abdominal ratio (MAR) and (2) use the findings to update central adiposity measurement guidelines. Fifteen healthy adults (mean [SD], 27 [10] years; 60% female) with a range of somatotypes (body mass index: mean [SD], 24 [4]; range, 19-33 kg/m²; waist circumference: mean [SD], 75 [11]; range, 61-96 cm) were tested on 3 mornings under standardized conditions. Intra-abdominal thickness was assessed 2 cm above the umbilicus (transverse plane), measuring from linea alba to the anterior vertebral column. Maximal abdominal ratio was defined as the ratio of IAT to abdominal subcutaneous fat thickness. The IAT range was 25 to 87 mm, and the MAR range was 0.15 to 0.77. Between-day intraclass correlation coefficient values for IAT measurements were comparable (0.96-0.97) for both transducers, as were MAR values (0.95). In conclusion, while both transducers provided equally reliable measurement of IAT, the use of a single linear array transducer simplifies the assessment of central adiposity.
Reliability Prediction Approaches For Domestic Intelligent Electric Energy Meter Based on IEC62380
NASA Astrophysics Data System (ADS)
Li, Ning; Tong, Guanghua; Yang, Jincheng; Sun, Guodong; Han, Dongjun; Wang, Guixian
2018-01-01
The reliability of intelligent electric energy meters is a crucial issue, considering their large-scale application and the safety of the national intelligent grid. This paper develops a procedure for reliability prediction of domestic intelligent electric energy meters according to IEC62380, with emphasis on determining the model parameters under domestic working conditions. A case study is provided to show the effectiveness and validity of the procedure.
Enhanced ultrasonic inspection of steel bridge pin components.
DOT National Transportation Integrated Search
1998-01-01
This report describes the development of a technique for obtaining a reliable assessment of the condition of steel bridge pins already determined by ultrasound to contain imperfections. The details of a technique for performing high-definition ultras...
Virtual prototyping of drop test using explicit analysis
NASA Astrophysics Data System (ADS)
Todorov, Georgi; Kamberov, Konstantin
2017-12-01
Increased requirements for reliability and safety, included in contemporary standards and norms, have a high impact on new product development. New numerical techniques based on virtual prototyping technology facilitate improving the product development cycle, resulting in reduced time and money spent on this stage as well as increased knowledge about certain failure mechanisms. The so-called "drop test" has become nearly a "must" step in the development of any human-operated product. This study aims to demonstrate dynamic behaviour assessment of a structure under impact loads, based on virtual prototyping using a typical nonlinear analysis: explicit dynamics. An example is presented, based on a plastic container that is used as a cartridge for a dispenser machine exposed to various work conditions. Different drop orientations were analyzed, and critical load cases and design weaknesses were found. Several design modifications have been proposed, based on a detailed review of the analysis results.
Tsze, Daniel S; von Baeyer, Carl L; Pahalyants, Vartan; Dayan, Peter S
2018-06-01
The Verbal Numerical Rating Scale is the most commonly used self-report measure of pain intensity. It is unclear how the validity and reliability of the scale scores vary across children's ages. We aimed to determine the validity and reliability of the scale for children presenting to the emergency department across a comprehensive spectrum of age. This was a cross-sectional study of children aged 4 to 17 years. Children self-reported their pain intensity, using the Verbal Numerical Rating Scale and Faces Pain Scale-Revised at 2 serial assessments. We evaluated convergent validity (strong validity defined as correlation coefficient ≥0.60), agreement (difference between concurrent Verbal Numerical Rating Scale and Faces Pain Scale-Revised scores), known-groups validity (difference in score between children with painful versus nonpainful conditions), responsivity (decrease in score after analgesic administration), and reliability (test-retest at 2 serial assessments) in the total sample and subgroups based on age. We enrolled 760 children; 27 did not understand the Verbal Numerical Rating Scale and were removed. Of the remainder, Pearson correlations were strong to very strong (0.62 to 0.96) in all years of age except 4 and 5 years, and agreement was strong for children aged 8 and older. Known-groups validity and responsivity were strong in all years of age. Reliability was strong in all age subgroups, including each year of age from 4 to 7 years. Convergent validity, known-groups validity, responsivity, and reliability of the Verbal Numerical Rating Scale were strong for children aged 6 to 17 years. Convergent validity was not strong for children aged 4 and 5 years. Our findings support the use of the Verbal Numerical Rating Scale for most children aged 6 years and older, but not for those aged 4 and 5 years. Copyright © 2017 American College of Emergency Physicians. Published by Elsevier Inc. All rights reserved.
NASA Astrophysics Data System (ADS)
Tamura, Yoshinobu; Yamada, Shigeru
Open source software (OSS) systems, which serve as key components of critical infrastructure in everyday life, continue to expand. Embedded OSS systems in particular, e.g., Android, BusyBox, and TRON, have been gaining a lot of attention in the embedded system area. However, poor handling of quality problems and customer support hinders the progress of embedded OSS. It is also difficult for developers to assess the reliability and portability of embedded OSS on a single-board computer. In this paper, we propose a method of software reliability assessment based on flexible hazard rates for embedded OSS. We also analyze actual data of software failure-occurrence time intervals to show numerical examples of software reliability assessment for embedded OSS. Moreover, we compare the proposed hazard rate model for embedded OSS with typical conventional hazard rate models using goodness-of-fit comparison criteria. Furthermore, we discuss the optimal software release problem for the porting phase based on the total expected software maintenance cost.
Pennathur, Arunkumar; Magham, Rohini; Contreras, Luis Rene; Dowling, Winifred
2004-01-01
The objective of the work reported in this paper is to assess test-retest reliability of Yale Physical Activity Survey Total Time, Estimated Energy Expenditure, Activity Dimension Indices, and Activities Check-list in older Mexican American men and women. A convenience-based healthy sample of 49 (42 women and 7 men) older Mexican American adults recruited from senior recreation centers aged 68 to 80 years volunteered to participate in this pilot study. Forty-nine older Mexican American adults filled out the Yale Physical Activity Survey for this study. Fifteen (12 women and 3 men) of the 49 volunteers responded twice to the Yale Physical Activity Survey after a 2-week period, and helped assess the test-retest reliability of the Yale Physical Activity Survey. Results indicate that based on a 2-week test-retest administration, the Yale Physical Activity Survey was found to have moderate (rhoI= .424, p < .05) to good reliability (rs = .789, p < .01) for physical activity assessment in older Mexican American adults who responded.
Molander, Linda; Hanberg, Annika; Rudén, Christina; Ågerstrand, Marlene; Beronius, Anna
2017-03-01
Different tools have been developed that facilitate systematic and transparent evaluation and handling of toxicity data in the risk assessment process. The present paper sets out to explore the combined use of two web-based tools for study evaluation and identification of reliable data relevant to health risk assessment. For this purpose, a case study was performed using in vivo toxicity studies investigating low-dose effects of bisphenol A on mammary gland development. The reliability of the mammary gland studies was evaluated using the Science in Risk Assessment and Policy (SciRAP) criteria for toxicity studies. The Health Assessment Workspace Collaborative (HAWC) was used for characterizing and visualizing the mammary gland data in terms of type of effects investigated and reported, and the distribution of these effects within the dose interval. It was then investigated whether there was any relationship between study reliability and the type of effects reported and/or their distribution in the dose interval. The combination of the SciRAP and HAWC tools allowed for transparent evaluation and visualization of the studies investigating developmental effects of BPA on the mammary gland. The use of these tools showed that there were no apparent differences in the type of effects and their distribution in the dose interval between the five studies assessed as most reliable and the whole data set. Combining the SciRAP and HAWC tools was found to be a useful approach for evaluating in vivo toxicity studies and identifying reliable and sensitive information relevant to regulatory risk assessment of chemicals. Copyright © 2016 John Wiley & Sons, Ltd.
Clements, Julie; Sanchez, Jessica N
2015-11-01
This research aims to validate a novel, visual body scoring system created for the Magellanic penguin (Spheniscus magellanicus) suitable for the zoo practitioner. Magellanics go through marked seasonal fluctuations in body mass gains and losses. A standardized multi-variable visual body condition guide may provide a more sensitive and objective assessment tool compared to the previously used single-variable method. Accurate body condition scores paired with seasonal weight variation measurements give veterinary and keeper staff a clearer understanding of an individual's nutritional status. San Francisco Zoo staff previously used a nine-point body condition scale based on the classic bird standard of a single point of keel palpation with the bird restrained in hand, with no standard measure of reference assigned to each scoring category. We created a novel, visual body condition scoring system that does not require restraint to assess subcutaneous fat and muscle at seven body landmarks using illustrations and descriptive terms. The scores range from one, the least robust or under-conditioned, to five, the most robust or over-conditioned. The ratio of body weight to wing length was used as a "gold standard" index of body condition and compared to both the novel multi-variable and previously used single-variable body condition scores. The novel multi-variable scale showed improved agreement with weight:wing ratio compared to the single-variable scale, demonstrating greater accuracy and reliability when a trained assessor uses the multi-variable body condition scoring system. Zoo staff may use this tool to manage both the colony and the individual to assist in seasonally appropriate Magellanic penguin nutrition assessment. © 2015 Wiley Periodicals, Inc.
Methods for assessing the preventability of adverse drug events: a systematic review.
Hakkarainen, Katja Marja; Andersson Sundell, Karolina; Petzold, Max; Hägg, Staffan
2012-02-01
Preventable adverse drug events (ADEs) are common in both outpatient and inpatient settings. However, the proportion of preventable ADEs varies considerably in different studies, even when conducted in the same setting, and methods for assessing the preventability of ADEs are diverse. The aim of this article is to identify and systematically evaluate methods for assessing the preventability of ADEs. Seven databases (Cochrane, CINAHL, EMBASE, IPA, MEDLINE, PsycINFO and Web of Science) were searched in September 2010 utilizing the databases' index terms and other common terminology on preventable ADEs. No limits for the years of publication were set. Reference lists of included original articles and relevant review articles were also screened. After applying predetermined inclusion and exclusion criteria on 4161 unique citations, 142 (3.4%) original research articles were included in the review. One additional article was included from reference lists. Outcome measures of included studies had to include the frequency of ADEs and the assessment of their preventability. Studies were excluded if they focused on individuals with one specific type of treatment, medical condition, medical procedure or ADE. Measurement instruments for determining the preventability of ADEs in each article were extracted and unique instruments were compared. The process of assessing the preventability of ADEs was described based on reported actions taken to standardize and conduct the assessment, and on information about the reliability and validity of the assessment. Eighteen unique instruments for determining the preventability of ADEs were identified. 
They fell under the following four groups: (i) instruments using a definition of preventability only (n = 3); (ii) instruments with a definition of preventability and an assessment scale for determining preventability (n = 5); (iii) instruments with specific criteria for each preventability category (n = 3); and (iv) instruments with an algorithm for determining preventability (n = 7). Of actions to standardize the assessment process, performing a pilot study was reported in 21 (15%), and use of a standardized protocol was reported in 18 (13%), of the included 143 articles. Preventability was assessed by physicians in 86 (60%) articles and by pharmacists in 41 (29%) articles. In 29 (20%) articles, persons conducting the assessment were described as trained for or experienced in preventability assessment. In 94 (66%) articles, more than one person assessed the preventability of each case. Among these 94 articles, assessment was done independently in 73 (51%) articles. Procedures for managing conflicting assessments were diverse. The reliability of the preventability assessment was tested in 39 (27%) articles, and 16 (11%) articles referred to a previous reliability assessment. Reliability ranged from poor to excellent (kappa 0.19-0.98; overall agreement 26-97%). Four (3%) articles mentioned assessing validity, but no sensitivity or specificity analyses or negative or positive predictive values were presented. Instruments for assessing the preventability of ADEs vary from implicit instruments to explicit algorithms. There is limited evidence for the validity of the identified instruments, and instrument reliability varied significantly. The process of assessing the preventability of ADEs is also commonly imprecisely described, which hinders the interpretation and comparison of studies. 
For measuring the preventability of ADEs more accurately and precisely in future, we believe that existing instruments should be further studied and developed, or that one or more new instruments should be developed, and the validity and reliability of the existing and new instruments be established.
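The kappa values quoted in the review above are chance-corrected agreement statistics. As a minimal sketch of how such a value is obtained for two assessors, assuming Cohen's unweighted kappa (the review does not say which kappa variant each included study used) and entirely hypothetical ratings:

```python
from collections import Counter


def cohen_kappa(rater_a, rater_b):
    """Cohen's unweighted kappa for two raters labelling the same cases."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("both raters must label the same non-empty set of cases")
    n = len(rater_a)
    # Observed proportion of cases on which the two raters agree.
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Agreement expected by chance from each rater's marginal label frequencies.
    counts_a = Counter(rater_a)
    counts_b = Counter(rater_b)
    expected = sum(counts_a[label] * counts_b[label] for label in counts_a) / (n * n)
    if expected == 1.0:
        raise ValueError("kappa is undefined when chance agreement equals 1")
    return (observed - expected) / (1.0 - expected)


# Hypothetical preventability judgements ("p" = preventable, "n" = not preventable).
assessor_1 = ["p", "p", "n", "n", "p", "n", "p", "n", "p", "p"]
assessor_2 = ["p", "n", "n", "n", "p", "n", "p", "p", "p", "p"]
kappa_example = cohen_kappa(assessor_1, assessor_2)
```

In this made-up example the assessors agree on 8 of 10 cases, yet kappa is noticeably lower than 0.8 because part of that agreement is expected by chance.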
Interrater reliability of the mind map assessment rubric in a cohort of medical students.
D'Antoni, Anthony V; Zipp, Genevieve Pinto; Olson, Valerie G
2009-04-28
Learning strategies are thinking tools that students can use to actively acquire information. Examples of learning strategies include mnemonics, charts, and maps. One strategy that may help students master the tsunami of information presented in medical school is the mind map learning strategy. Currently, there is no valid and reliable rubric to grade mind maps and this may contribute to their underutilization in medicine. Because concept maps and mind maps engage learners similarly at a metacognitive level, a valid and reliable concept map assessment scoring system was adapted to form the mind map assessment rubric (MMAR). The MMAR can assess mind map depth based upon concept-links, cross-links, hierarchies, examples, pictures, and colors. The purpose of this study was to examine interrater reliability of the MMAR. This exploratory study was conducted at a US medical school as part of a larger investigation on learning strategies. Sixty-six (N = 66) first-year medical students were given a 394-word text passage followed by a 30-minute presentation on mind mapping. After the presentation, subjects were again given the text passage and instructed to create mind maps based upon the passage. The mind maps were collected and independently scored using the MMAR by 3 examiners. Interrater reliability was measured using the intraclass correlation coefficient (ICC) statistic. Statistics were calculated using SPSS version 12.0 (Chicago, IL). Analysis of the mind maps revealed the following: concept-links ICC = .05 (95% CI, -.42 to .38), cross-links ICC = .58 (95% CI, .37 to .73), hierarchies ICC = .23 (95% CI, -.15 to .50), examples ICC = .53 (95% CI, .29 to .69), pictures ICC = .86 (95% CI, .79 to .91), colors ICC = .73 (95% CI, .59 to .82), and total score ICC = .86 (95% CI, .79 to .91). The high ICC value for total mind map score indicates strong MMAR interrater reliability. Pictures and colors demonstrated moderate to strong interrater reliability. 
We conclude that the MMAR may be a valid and reliable tool to assess mind maps in medicine. However, further research on the validity and reliability of the MMAR is necessary.
Interrater reliability of the mind map assessment rubric in a cohort of medical students
D'Antoni, Anthony V; Zipp, Genevieve Pinto; Olson, Valerie G
2009-01-01
Background Learning strategies are thinking tools that students can use to actively acquire information. Examples of learning strategies include mnemonics, charts, and maps. One strategy that may help students master the tsunami of information presented in medical school is the mind map learning strategy. Currently, there is no valid and reliable rubric to grade mind maps and this may contribute to their underutilization in medicine. Because concept maps and mind maps engage learners similarly at a metacognitive level, a valid and reliable concept map assessment scoring system was adapted to form the mind map assessment rubric (MMAR). The MMAR can assess mind map depth based upon concept-links, cross-links, hierarchies, examples, pictures, and colors. The purpose of this study was to examine interrater reliability of the MMAR. Methods This exploratory study was conducted at a US medical school as part of a larger investigation on learning strategies. Sixty-six (N = 66) first-year medical students were given a 394-word text passage followed by a 30-minute presentation on mind mapping. After the presentation, subjects were again given the text passage and instructed to create mind maps based upon the passage. The mind maps were collected and independently scored using the MMAR by 3 examiners. Interrater reliability was measured using the intraclass correlation coefficient (ICC) statistic. Statistics were calculated using SPSS version 12.0 (Chicago, IL). Results Analysis of the mind maps revealed the following: concept-links ICC = .05 (95% CI, -.42 to .38), cross-links ICC = .58 (95% CI, .37 to .73), hierarchies ICC = .23 (95% CI, -.15 to .50), examples ICC = .53 (95% CI, .29 to .69), pictures ICC = .86 (95% CI, .79 to .91), colors ICC = .73 (95% CI, .59 to .82), and total score ICC = .86 (95% CI, .79 to .91). Conclusion The high ICC value for total mind map score indicates strong MMAR interrater reliability. 
Pictures and colors demonstrated moderate to strong interrater reliability. We conclude that the MMAR may be a valid and reliable tool to assess mind maps in medicine. However, further research on the validity and reliability of the MMAR is necessary. PMID:19400964
Assessing the Reliability of Curriculum-Based Measurement: An Application of Latent Growth Modeling
ERIC Educational Resources Information Center
Yeo, Seungsoo; Kim, Dong-Il; Branum-Martin, Lee; Wayman, Miya Miura; Espin, Christine A.
2012-01-01
The purpose of this study was to demonstrate the use of Latent Growth Modeling (LGM) as a method for estimating reliability of Curriculum-Based Measurement (CBM) progress-monitoring data. The LGM approach permits the error associated with each measure to differ at each time point, thus providing an alternative method for examining of the…
Two Prophecy Formulas for Assessing the Reliability of Item Response Theory-Based Ability Estimates
ERIC Educational Resources Information Center
Raju, Nambury S.; Oshima, T.C.
2005-01-01
Two new prophecy formulas for estimating item response theory (IRT)-based reliability of a shortened or lengthened test are proposed. Some of the relationships between the two formulas, one of which is identical to the well-known Spearman-Brown prophecy formula, are examined and illustrated. The major assumptions underlying these formulas are…
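For orientation, the classical Spearman-Brown prophecy formula mentioned in the abstract above predicts the reliability of a test whose length is changed by a factor k as k·r / (1 + (k − 1)·r). The sketch below covers only this classical formula; the second, IRT-specific formula proposed by the authors is not described in the truncated abstract and is not reproduced here. The function name and example values are illustrative.

```python
def spearman_brown(reliability: float, length_factor: float) -> float:
    """Predicted reliability after multiplying test length by length_factor.

    length_factor > 1 lengthens the test, length_factor < 1 shortens it.
    """
    if length_factor <= 0:
        raise ValueError("length_factor must be positive")
    return (length_factor * reliability) / (1.0 + (length_factor - 1.0) * reliability)


# Illustrative values: a test with reliability 0.80, doubled and halved in length.
doubled = spearman_brown(0.80, 2.0)
halved = spearman_brown(0.80, 0.5)
```

The formula assumes that added or removed items are parallel to the existing ones, which is one of the assumptions such prophecy formulas rest on.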
Testing the reliability of the Fall Risk Screening Tool in an elderly ambulatory population.
Fielding, Susan J; McKay, Michael; Hyrkas, Kristiina
2013-11-01
To identify and test the reliability of a fall risk screening tool in an ambulatory outpatient clinic. The Fall Risk Screening Tool (Albert Lea Medical Center, MN, USA) was scripted for an interview format. Two interviewers separately screened a convenience sample of 111 patients (age ≥ 65 years) in an ambulatory outpatient clinic in a northeastern US city. The interviewers' scoring of fall risk categories was similar. There was good internal consistency (Cronbach's α = 0.834-0.889) and inter-rater reliability [intra-class correlation coefficients (ICC) = 0.824-0.881] for total, Risk Factor and Client's Health Status subscales. The Physical Environment scores indicated acceptable internal consistency (Cronbach's α = 0.742) and adequate reliability (ICC = 0.688). Two Physical Environment items (furniture and medical equipment condition) had low reliabilities [Kappa (K) = 0.323, P = 0.08; K = -0.078, P = 0.648], respectively. The scripted Fall Risk Screening Tool demonstrated good reliability in this sample. Rewording two Physical Environment items will be considered. A reliable instrument such as the scripted Fall Risk Screening Tool provides a standardised assessment for identifying high fall risk patients. This tool is especially useful because it assesses personal, behavioural and environmental factors specific to community-dwelling patients; the interview format also facilitates patient-provider interaction. © 2013 John Wiley & Sons Ltd.
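Internal consistency figures such as the Cronbach's α values reported above are computed from the item variances and the variance of the total score. A minimal sketch, assuming the standard formula α = k/(k − 1) · (1 − Σ item variances / total-score variance) and using hypothetical item scores (not data from the study):

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha; scores is a list of respondents, each a list of item scores."""
    n_items = len(scores[0])
    if len(scores) < 2 or n_items < 2:
        raise ValueError("need at least two respondents and two items")
    # Sample variance of each item across respondents.
    item_variances = [variance([row[i] for row in scores]) for i in range(n_items)]
    # Sample variance of each respondent's total score.
    total_variance = variance([sum(row) for row in scores])
    return (n_items / (n_items - 1)) * (1.0 - sum(item_variances) / total_variance)


# Hypothetical responses: five respondents, three items on a 1-5 scale.
responses = [
    [2, 3, 3],
    [4, 4, 5],
    [1, 2, 2],
    [5, 5, 4],
    [3, 3, 4],
]
alpha_example = cronbach_alpha(responses)
```

The sample variance (n − 1 denominator) is used for both the items and the total; using the population variance throughout gives the same α because the factor cancels in the ratio.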
Test-retest reliability of cognitive EEG
NASA Technical Reports Server (NTRS)
McEvoy, L. K.; Smith, M. E.; Gevins, A.
2000-01-01
OBJECTIVE: Task-related EEG is sensitive to changes in cognitive state produced by increased task difficulty and by transient impairment. If task-related EEG has high test-retest reliability, it could be used as part of a clinical test to assess changes in cognitive function. The aim of this study was to determine the reliability of the EEG recorded during the performance of a working memory (WM) task and a psychomotor vigilance task (PVT). METHODS: EEG was recorded while subjects rested quietly and while they performed the tasks. Within session (test-retest interval of approximately 1 h) and between session (test-retest interval of approximately 7 days) reliability was calculated for four EEG components: frontal midline theta at Fz, posterior theta at Pz, and slow and fast alpha at Pz. RESULTS: Task-related EEG was highly reliable within and between sessions (r > 0.9 for all components in WM task, and r > 0.8 for all components in the PVT). Resting EEG also showed high reliability, although the magnitude of the correlation was somewhat smaller than that of the task-related EEG (r > 0.7 for all 4 components). CONCLUSIONS: These results suggest that under appropriate conditions, task-related EEG has sufficient retest reliability for use in assessing clinical changes in cognitive status.
de Witte, Annemarie M H; Hoozemans, Marco J M; Berger, Monique A M; van der Slikke, Rienk M A; van der Woude, Lucas H V; Veeger, Dirkjan H E J
2018-01-01
The aim of this study was to develop and describe a wheelchair mobility performance test in wheelchair basketball and to assess its construct validity and reliability. To mimic mobility performance of wheelchair basketball matches in a standardised manner, a test was designed based on observation of wheelchair basketball matches and expert judgement. Forty-six players performed the test to determine its validity and 23 players performed the test twice for reliability. Independent-samples t-tests were used to assess whether the times needed to complete the test were different for classifications, playing standards and sex. Intraclass correlation coefficients (ICC) were calculated to quantify reliability of performance times. Males performed better than females (P < 0.001, effect size [ES] = -1.26) and international men performed better than national men (P < 0.001, ES = -1.62). Performance time of low (≤2.5) and high (≥3.0) classification players was borderline not significant with a moderate ES (P = 0.06, ES = 0.58). The reliability was excellent for overall performance time (ICC = 0.95). These results show that the test can be used as a standardised mobility performance test to validly and reliably assess the capacity in mobility performance of elite wheelchair basketball athletes. Furthermore, the described methodology of development is recommended for use in other sports to develop sport-specific tests.
ASSESSING AND COMBINING RELIABILITY OF PROTEIN INTERACTION SOURCES
LEACH, SONIA; GABOW, AARON; HUNTER, LAWRENCE; GOLDBERG, DEBRA S.
2008-01-01
Integrating diverse sources of interaction information to create protein networks requires strategies sensitive to differences in accuracy and coverage of each source. Previous integration approaches calculate reliabilities of protein interaction information sources based on congruity to a designated ‘gold standard.’ In this paper, we provide a comparison of the two most popular existing approaches and propose a novel alternative for assessing reliabilities which does not require a gold standard. We identify a new method for combining the resultant reliabilities and compare it against an existing method. Further, we propose an extrinsic approach to evaluation of reliability estimates, considering their influence on the downstream tasks of inferring protein function and learning regulatory networks from expression data. Results using this evaluation method show 1) our method for reliability estimation is an attractive alternative to those requiring a gold standard and 2) the new method for combining reliabilities is less sensitive to noise in reliability assignments than the similar existing technique. PMID:17990508
A probability-based approach for assessment of roadway safety hardware.
DOT National Transportation Integrated Search
2017-03-14
This report presents a general probability-based approach for assessment of roadway safety hardware (RSH). It was achieved using a reliability analysis method and computational techniques. With the development of high-fidelity finite element (FE) m...
Asgari, Fatemeh; Haghdoost, Faraidoon; Masjedi, Samaneh Sadat; Manouchehri, Navid; Banihashemi, Mahboobeh; Ghorbani, Abbas; Najafi, Mohammad Reza; Saadatnia, Mohammad; Lipton, Richard B.
2014-01-01
Introduction. MIDAS is a valid and reliable short questionnaire for assessment of headache-related disability. Linguistic validation of the Persian MIDAS and assessment of its psychometric properties in tension-type headache (TTH) and migraine were the aims of this study. Methods. Patients with migraine or TTH were included. At the first visit, we administered a headache symptom questionnaire, MIDAS, and SF-36. Patients filled out MIDAS at the second and third visits, within three and eight weeks after the baseline visit. Internal consistency (Cronbach α) and test-retest reproducibility (Spearman correlation coefficient) were used to assess reliability. Convergent validity and MIDAS capability to differentiate between chronic and episodic headaches (migraine and TTH) were also assessed. Results. The 267 participants had episodic migraine (EM-64%), chronic migraine (CM-13.5%), episodic TTH (ETTH-13.5%), and chronic TTH (CTTH-9%). Internal consistency reliability was 0.8 for the entire sample, 0.72 for TTH, and 0.82 for migraine. Test-retest reliability for all questions between visit 1 and visit 2 varied from 0.54 to 0.71. Convergent validity was assessed using SF-36 as an external referent. Patients with episodic headaches (EM and ETTH) had significantly lower MIDAS scores than those with chronic headaches (CM and CTTH). Conclusion. Persian MIDAS is a valid and reliable questionnaire for migraine and TTH that can differentiate between episodic headache and chronic headache. PMID:24527462
Malisova, Olga; Bountziouka, Vassiliki; Panagiotakos, Demosthenes B; Zampelas, Antonis; Kapsokefalou, Maria
2012-03-01
There is a need to develop a questionnaire as a research tool for the evaluation of water balance in the general population. The water balance questionnaire (WBQ) was designed to evaluate water intake from fluid and solid foods and drinking water, and water loss from urine, faeces and sweat under sedentary conditions and during physical activity. For validation purposes, the WBQ was administered to 40 apparently healthy participants aged 22-57 years (37.5% males). Hydration indices in urine (24 h volume, osmolality, specific gravity, pH, colour) were measured through established procedures. Furthermore, the questionnaire was administered twice to 175 subjects to evaluate its reliability. Kendall's τ-b and the Bland and Altman method were used to assess the questionnaire's validity and reliability. The proposed WBQ to assess water balance in healthy individuals was found to be valid and reliable, and it could thus be a useful tool in future projects that aim to evaluate water balance.
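The Bland and Altman method named in the abstract above summarises agreement between two sets of paired measurements by the mean difference (bias) and the 95% limits of agreement, bias ± 1.96 × SD of the differences. A minimal sketch with hypothetical paired values (the abstract reports no raw data, and the variable names are illustrative):

```python
from statistics import mean, stdev


def bland_altman(first, second):
    """Return (bias, lower limit, upper limit) for paired measurements."""
    if len(first) != len(second) or len(first) < 2:
        raise ValueError("need at least two paired measurements")
    differences = [a - b for a, b in zip(first, second)]
    bias = mean(differences)
    spread = stdev(differences)  # sample SD of the paired differences
    return bias, bias - 1.96 * spread, bias + 1.96 * spread


# Hypothetical paired scores from two administrations of a questionnaire.
administration_1 = [10, 12, 14, 16]
administration_2 = [9, 13, 13, 17]
bias, lower_limit, upper_limit = bland_altman(administration_1, administration_2)
```

In practice the differences are also plotted against the pairwise means to check whether disagreement grows with the magnitude of the measurement.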
Reliability and validity of the Wolfram Unified Rating Scale (WURS)
2012-01-01
Background Wolfram syndrome (WFS) is a rare, neurodegenerative disease that typically presents with childhood onset insulin dependent diabetes mellitus, followed by optic atrophy, diabetes insipidus, deafness, and neurological and psychiatric dysfunction. There is no cure for the disease, but recent advances in research have improved understanding of the disease course. Measuring disease severity and progression with reliable and validated tools is a prerequisite for clinical trials of any new intervention for neurodegenerative conditions. To this end, we developed the Wolfram Unified Rating Scale (WURS) to measure the severity and individual variability of WFS symptoms. The aim of this study is to develop and test the reliability and validity of the Wolfram Unified Rating Scale (WURS). Methods A rating scale of disease severity in WFS was developed by modifying a standardized assessment for another neurodegenerative condition (Batten disease). WFS experts scored the representativeness of WURS items for the disease. The WURS was administered to 13 individuals with WFS (6-25 years of age). Motor, balance, mood and quality of life were also evaluated with standard instruments. Inter-rater reliability, internal consistency reliability, concurrent, predictive and content validity of the WURS were calculated. Results The WURS had high inter-rater reliability (ICCs>.93), moderate to high internal consistency reliability (Cronbach’s α = 0.78-0.91) and demonstrated good concurrent and predictive validity. There were significant correlations between the WURS Physical Assessment and motor and balance tests (rs>.67, p<.03), between the WURS Behavioral Scale and reports of mood and behavior (rs>.76, p<.04) and between WURS Total scores and quality of life (rs=-.86, p=.001). The WURS demonstrated acceptable content validity (Scale-Content Validity Index=0.83). 
Conclusions These preliminary findings demonstrate that the WURS has acceptable reliability and validity and captures individual differences in disease severity in children and young adults with WFS. PMID:23148655
Bryant, Elizabeth; Murtagh, Shemane; Finucane, Laura; McCrum, Carol; Mercer, Christopher; Smith, Toby; Canby, Guy; Rowe, David A; Moore, Ann P
2018-05-11
In response to the need for a freely available, stand-alone, validated outcome measure for use within musculoskeletal (MSK) physiotherapy practice that is sensitive enough to measure clinical effectiveness, we developed an MSK patient-reported outcome measure. This study examined the validity and reliability of the newly developed Brighton musculoskeletal Patient-Reported Outcome Measure (BmPROM) within physiotherapy outpatient settings. Two hundred twenty-four patients attending physiotherapy outpatient departments in South East England with an MSK condition participated in this study. The BmPROM was assessed for user friendliness (rated feedback, N = 224), reliability (internal consistency and test-retest reliability, n = 42), validity (internal and external construct validity, N = 224), and responsiveness (internal, n = 25). Exploratory factor analysis indicated that a two-factor model provides a good fit to the data. Factors were representative of "Functionality" and "Wellbeing". Correlations observed between the BmPROM and SF-36 domains provided evidence of convergent validity. Reliability results indicated that both subscales were internally consistent with alphas above the acceptable limits for both "Functionality" (α = .85, 95% CI [.81, .88]) and "Wellbeing" (α = .80, 95% CI [.75, .84]). Test-retest analyses (n = 42) demonstrated a high degree of reliability for both "Functionality" (ICC = .84; 95% CI [.72, .91]) and "Wellbeing" scores (ICC = .84; 95% CI [.72, .91]). Further examination of test-retest reliability through the Bland-Altman analysis demonstrated that the difference between test scores for "Functionality" and "Wellbeing" did not vary as a function of absolute test score. Large treatment effect sizes were found for both subscales (Functionality d = 1.10; Wellbeing d = 1.03). The BmPROM is a reliable and valid outcome measure for use in evaluating physiotherapy treatment of MSK conditions. Copyright © 2018 John Wiley & Sons, Ltd.
Service life assessment of timber highway bridges in USA climate zones
James P. Wacker; Brian K. Brashaw; Thomas G. Williamson; P. David Jones; Matthew S. Smith; Travis K. Hosteng; David L. Strahl; Lola E. Coombe; V.J. Gopu
2014-01-01
As engineers begin to estimate life-cycle costs and sustainable design approaches for timber bridges, there is a need for more reliable data about their durability and expected service life. This paper summarizes a comprehensive effort to assess the current condition of more than one hundred timber highway bridge superstructures throughout the United States. This...
Discharge reliability in ablative pulsed plasma thrusters
NASA Astrophysics Data System (ADS)
Wu, Zhiwen; Sun, Guorui; Yuan, Shiyue; Huang, Tiankun; Liu, Xiangyang; Xie, Kan; Wang, Ningfei
2017-08-01
Discharge reliability is typically neglected in low-ignition-cycle ablative pulsed plasma thrusters (APPTs). In this study, the discharge reliability of an APPT is assessed analytically and experimentally. The goals of this study are to better understand the ignition characteristics and to assess the accuracy of the analytical method. For each of six sets of operating conditions, 500 tests of a parallel-plate APPT with a coaxial semiconductor spark plug are conducted. The discharge voltage and current are measured with a high-voltage probe and a Rogowski coil, respectively, to determine whether the discharge is successful. Generally, the discharge success rate increases as the discharge voltage increases, and it decreases as the electrode gap and the number of ignitions increase. The theoretical analysis and the experimental results are reasonably consistent. This approach provides a reference for designing APPTs and improving their stability.
Reliable change on the Boston naming test.
Sachs, Bonnie C; Lucas, John A; Smith, Glenn E; Ivnik, Robert J; Petersen, Ronald C; Graff-Radford, Neill R; Pedraza, Otto
2012-03-01
Serial assessments are commonplace in neuropsychological practice and used to document cognitive trajectory for many clinical conditions. However, true change scores may be distorted by measurement error, repeated exposure to the assessment instrument, or person variables. The present study provides reliable change indices (RCI) for the Boston Naming Test, derived from a sample of 844 cognitively normal adults aged 56 years and older. All participants were retested between 9 and 24 months after their baseline exam. Results showed that a 4-point decline during a 9-15 month retest period or a 6-point decline during a 16-24 month retest period represents reliable change. These cutoff values were further characterized as a function of a person's age and family history of dementia. These findings may help clinicians and researchers to characterize with greater precision the temporal changes in confrontation naming ability.
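A reliable change index expresses an observed retest difference in units of the standard error of the difference, so that changes larger than measurement error alone would plausibly produce can be flagged. The sketch below assumes the basic Jacobson-Truax form, RCI = (retest − baseline) / (√2 · SD · √(1 − r)); the abstract does not state which RCI variant the authors used (for example, whether practice effects were adjusted for), and every number here is hypothetical rather than taken from the study.

```python
import math


def reliable_change_index(baseline, retest, sd_baseline, test_retest_r):
    """Jacobson-Truax reliable change index for a single examinee."""
    if not 0.0 <= test_retest_r < 1.0:
        raise ValueError("test_retest_r must be in [0, 1)")
    # Standard error of measurement from the baseline SD and retest reliability.
    sem = sd_baseline * math.sqrt(1.0 - test_retest_r)
    # Standard error of the difference between two scores with equal SEM.
    se_difference = math.sqrt(2.0) * sem
    return (retest - baseline) / se_difference


# Hypothetical examinee: score drops from 56 to 50; assumed SD = 3.0, r = 0.75.
rci_example = reliable_change_index(56, 50, 3.0, 0.75)
# Assumed criterion: |RCI| > 1.645 corresponds to a two-sided 90% interval.
is_reliable_decline = rci_example < -1.645
```

Published cutoffs such as the point declines reported above are the raw-score equivalents of a chosen RCI criterion for a given sample and retest interval.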
Chew, Taariq; Ho, Kerrie-Anne; Loo, Colleen K
2015-01-01
Translation of transcranial direct current stimulation (tDCS) from research to clinical practice is hindered by a lack of consensus on optimal stimulation parameters, significant inter-individual variability in response, and insufficient intra-individual reliability data. Inter-individual differences in response to anodal tDCS at a range of current intensities were explored. Intra-individual reliability in response to anodal tDCS across two identical sessions was also investigated. Twenty-nine subjects participated in a crossover study. Anodal tDCS using four different current intensities (0.2, 0.5, 1 and 2 mA), with an anode size of 16 cm², was tested. The 0.5 mA condition was repeated to assess intra-individual variability. TMS was used to elicit 40 motor-evoked potentials (MEPs) before 10 min of tDCS, and 20 MEPs at four time-points over 30 min following tDCS. ANOVA revealed no main effect of TIME for all conditions except the first 0.5 mA condition, and no differences in response between the four current intensities. Cluster analysis identified two clusters for the 0.2 and 2 mA conditions only. Frequency distributions based on individual subject responses (excitatory, inhibitory or no response) to each condition indicate possible differential responses between individuals to different current intensities. Test-retest reliability was negligible (ICC(2,1) = -0.50). Significant inter-individual variability in response to tDCS across a range of current intensities was found. 2 mA and 0.2 mA tDCS were most effective at inducing a distinct response. Significant intra-individual variability in response to tDCS was also found. This has implications for interpreting results of single-session tDCS experiments. Crown Copyright © 2015. Published by Elsevier Inc. All rights reserved.
Thomas, G V; Jolley, R P
1998-05-01
Although consideration of children's art work (usually drawings) in clinical investigations of children referred to psychologists is fairly common, there is little evidence for the reliability and validity of such assessments. We consider a variety of possible mechanisms which could operate to influence the characteristics of children's drawings, and review the evidence that such mechanisms operate to allow meaningful psychological evaluations of children from their drawings. The problem for making a reliable interpretation of the significance of a drawing is that a given feature could plausibly support several very different interpretations, depending on which of many possible processes was active or dominant in the production of the drawing. Evidence from studies of clinical populations and experimental studies with non-selected samples is reviewed in the light of these possibilities. The review indicates that drawings are inaccurate and unreliable as personality or state assessments but can be influenced by children's emotional attitudes towards the topics depicted. The form of that expression, however, may be personal and idiosyncratic. Analogue studies of these effects undertaken with non-clinical samples under controlled conditions have produced mixed results. At best the reported effects are small. Children's drawings on their own are too complexly determined and inherently ambiguous to be reliable sole indicators of the emotional experiences of the children who drew them. Further research is needed to establish the extent to which such drawings can usefully facilitate assessment of children by other means or provide useful support as one of several converging lines of evidence.
Reliability and Responsiveness of NutriQoL® Questionnaire.
Cuerda, Maria Cristina; Apezetxea, Antonio; Carrillo, Lourdes; Casanueva, Felipe; Cuesta, Federico; Irles, Jose Antonio; Virgili, Maria Nuria; Layola, Miquel; Lizán, Luis
2016-10-01
NutriQoL® (Nestlé Health Science, Vevey, Switzerland) is a questionnaire developed to assess the health-related quality of life (HRQoL) of patients with home enteral nutrition (HEN) irrespective of their underlying condition and route of administration. The aim of this work was to assess the questionnaire's reliability and responsiveness to change. Two cohorts of patients with HEN and their primary caregivers were enrolled to assess reliability and responsiveness, respectively. All participants had to be 18 years of age or older, without mental deterioration (≤3 or 4 errors in Pfeiffer's test) and with sufficient functional status (>40 points on Karnofsky's performance status scale). When the patients' ability to respond to the questionnaire was impaired due to underlying disease, their caregivers answered on their behalf. NutriQoL was administered in two and three visits to the reliability and responsiveness cohorts, respectively. Test-retest reliability and internal consistency were assessed by the intra-class correlation coefficient (ICC) and Cronbach's α, respectively. Responsiveness was evaluated by standardized effect size and standardized response mean between the basal visit and the third visit. Finally, the minimal clinically important difference (MCID) was estimated. A total of 54 and 86 participants were recruited to the reliability and responsiveness cohorts, respectively. Thirty-five caregivers were selected to assess the inter-observer reliability. ICC values confirmed the good reproducibility level (ICC >0.75) of the questionnaire in both the "physical functioning and activities of daily living" and "social life" domains and in the total score. Both domains of the questionnaire showed good internal consistency in visit 2. ICC showed an excellent agreement level between caregiver and patient in the global NutriQoL score. Finally, patients classified as having a minimal change in their health reported a mean (standard deviation) MCID in NutriQoL score of 0.63 (11.51). NutriQoL is a reliable and unique instrument to measure the HRQoL in HEN patients. NutriQoL detects changes in the health status of the patient. Nevertheless, further research is needed to determine the full extent of the questionnaire's responsiveness.
Psychometric properties of the WHODASII in rehabilitation patients.
Pösl, Miriam; Cieza, Alarcos; Stucki, Gerold
2007-11-01
To evaluate function and disability, the WHO has developed the WHO Disability Assessment Schedule II (WHODASII), an instrument arising from the same conceptual basis as the International Classification of Functioning, Disability, and Health (ICF). The general objective of this study was to investigate whether the WHODASII (German version) is a valid instrument to measure functioning and disability across various conditions. Specific aims were (1) to assess its psychometric properties (reliability, validity, and sensitivity to change) based on the traditional test theory and (2) to compare its sensitivity to change after a rehabilitative intervention to that of the Short Form 36 (SF-36). This was a multi-center study with convenience samples of patients with different chronic conditions undergoing rehabilitation. Patients completed the WHODASII and the SF-36 before and after a rehabilitation treatment. Health professionals, in cooperation with the patients, rated the patients' pain based on the ICF category "sensation of pain." A total of 904 patients were included in the study. Cronbach's α ranged from 0.70 to 0.97 for the different subscales of the WHODASII. With the exception of the subscale Activities, the exploratory factor structure of the WHODASII corresponds highly with the original structure. The effect size (ES) of the WHODASII total score ranged from 0.16 to 0.69 depending on the subgroup. The ES of the SF-36 summary scores ranged from 0.03 to 1.40. The WHODASII (German version) is a useful instrument for measuring functioning and disability in patients with musculoskeletal diseases, internal diseases, stroke, breast cancer, and depressive disorder. The results of this study support the reliability, validity, dimensionality, and responsiveness of the German version of the WHODASII. However, the reproducibility in test-retest samples of stable patients, as well as the question to what extent a summary score can be constructed, requires further investigation.
Martin, William; Palazzo, Clémence; Poiraudeau, Serge
2017-11-01
To develop and validate a self-reporting questionnaire assessing the burden of exercise therapy for patients with chronic conditions. Measurement properties of an instrument. Outpatient clinics and tertiary care hospital. Patients (N=201) with at least 1 chronic condition and performing exercise therapy. Not applicable. The dimensional structure of the questionnaire was assessed by principal component analysis. Construct validity of the instrument was assessed by exploring convergent validity with the Treatment Burden Questionnaire (TBQ) and divergent validity with pain, self-efficacy, treatment satisfaction, and health state. Reliability was assessed with the Cronbach α coefficient, a test-retest method using the intraclass correlation coefficient (ICC), and Bland-Altman plotting. A preliminary list of items was developed from semistructured interviews with 28 patients and reviewed by 2 expert physicians. Items obtained were reduced. Then a sample of 163 patients was used to measure the psychometrics of the Exercise Therapy Burden Questionnaire (ETBQ), consisting of 10 items. Principal component analysis extracted 1 dimension. The Cronbach α was .86 (.82-.89). Test-retest reliability (n=24 patients) was good with an ICC of .93 (.85-.97), and Bland-Altman analysis did not reveal a systematic trend. The ETBQ showed expected convergent validity with the TBQ (ρ=.52) and expected divergent validity with pain (ρ=.37), self-efficacy (ρ=-.34), treatment satisfaction (ρ=-.49), and perceived health state (ρ=-.28). The ETBQ is the first questionnaire assessing exercise therapy burden in patients with chronic conditions. Its psychometric properties are promising. Copyright © 2017 American Congress of Rehabilitation Medicine. Published by Elsevier Inc. All rights reserved.
Reliability-based structural optimization: A proposed analytical-experimental study
NASA Technical Reports Server (NTRS)
Stroud, W. Jefferson; Nikolaidis, Efstratios
1993-01-01
An analytical and experimental study for assessing the potential of reliability-based structural optimization is proposed and described. In the study, competing designs obtained by deterministic and reliability-based optimization are compared. The experimental portion of the study is practical because the structure selected is a modular, actively and passively controlled truss that consists of many identical members, and because the competing designs are compared in terms of their dynamic performance and are not destroyed if failure occurs. The analytical portion of this study is illustrated on a 10-bar truss example. In the illustrative example, it is shown that reliability-based optimization can yield a design that is superior to an alternative design obtained by deterministic optimization. These analytical results provide motivation for the proposed study, which is underway.
Evaluation of reliability and validity of three dental color-matching devices.
Tsiliagkou, Aikaterini; Diamantopoulou, Sofia; Papazoglou, Efstratios; Kakaboura, Afrodite
2016-01-01
To assess the repeatability and accuracy of three dental color-matching devices under standardized and freehand measurement conditions. Two shade guides (Vita Classical A1-D4, Vita; and Vita Toothguide 3D-Master, Vita), and three color-matching devices (Easyshade, Vita; SpectroShade, MHT Optic Research; and ShadeVision, X-Rite) were used. Five shade tabs were selected from the Vita Classical A1-D4 (A2, A3.5, B1, C4, D3), and five from the Vita Toothguide 3D-Master (1M1, 2R1.5, 3M2, 4L2.5, 5M3) shade guides. Each shade tab was recorded 15 continuous, repeated times with each device under two different measurement conditions (standardized, and freehand). Both qualitative (color shade) and quantitative (L, a, and b) color characteristics were recorded. The color difference (ΔE) of each recorded value with the known values of the shade tab was calculated. The repeatability of each device was evaluated by the coefficient of variance. The accuracy of each device was determined by comparing the recorded values with the known values of the reference shade tab (one sample t test; α = 0.05). The agreement between the recorded shade and the reference shade tab was calculated. The influence of the parameters (devices and conditions) on the parameter ΔE was investigated (two-way ANOVA). Comparison of the devices was performed with Bonferroni pairwise post-hoc analysis. Under standardized conditions, repeatability of all three devices was very good, except for ShadeVision with Vita Classical A1-D4. Accuracy ranged from good to fair, depending on the device and the shade guide. Under freehand conditions, repeatability and accuracy for Easyshade and ShadeVision were negatively influenced, but not for SpectroShade, regardless of the shade guide. Based on the total of the color parameters assessed per device, SpectroShade was the most reliable of the three color-matching devices studied.
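The color difference between a recorded reading and the known shade-tab values can be sketched as a Euclidean distance in L, a, b space. This assumes the CIE76 definition of ΔE, which the abstract does not name explicitly, and the coordinates below are hypothetical, not measurements from the study.

```python
import math

def delta_e_cie76(lab_measured, lab_reference):
    """CIE76 color difference: Euclidean distance between two (L, a, b) triples."""
    return math.sqrt(sum((m - r) ** 2 for m, r in zip(lab_measured, lab_reference)))

# Hypothetical device reading and reference shade-tab values (illustration only).
measured = (72.0, 1.5, 18.0)
reference = (70.0, 0.5, 16.0)

delta_e = delta_e_cie76(measured, reference)
print(delta_e)  # 3.0
```

A smaller ΔE against the reference indicates better accuracy; the spread of ΔE over repeated readings relates to repeatability.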
Kirchner, M K; Schulze Westerath, H; Knierim, U; Tessitore, E; Cozzi, G; Winckler, C
2014-03-01
Consistency over time of (on-farm) animal welfare assessment systems forms part of reliability, meaning that results of the assessment should be representative of the longer-term welfare state of the farm as long as the housing and management conditions have not changed considerably. This is especially important if assessments are to be used for certification purposes. It was the aim of the present study to investigate consistency over time of the Welfare Quality(®) (WQ(®)) assessment system for fattening cattle at single measure level, aggregated criterion and principle scores, and overall classification across short-term (1 month) and longer-term periods (6 months). We hypothesized that consistency over time of aggregated criterion and principle scores is higher than that of single measures. Consistency was also expected to be lower with longer intervals between assessments. Data were obtained using the WQ(®) protocol for fattening cattle during three visits (months 0, 1 and 7) on 63 beef farms in Austria, Germany and Italy. Only data from farms where no major changes in housing and management had taken place were considered for analysis. At the single measure level, Spearman rank correlations between visits were >0.7 and variance was lower within farms than between farms for six and two of 19 measures after 1 month and 6 months, respectively. After aggregation of single measures into criterion and principle scores, five and two of 10 criteria and three and one of four principles were found reliable after 1 and 6 months, respectively. At the WQ(®) principle level, this was the case for three and one of four principles. Seventy-nine per cent and 75% of the farms were allocated to the same overall welfare category after 1 month and 6 months. 
Possible reasons for a lack of consistency are seasonal effects or short-term fluctuations that occur under normal farm conditions, low prevalence of clinical measures and probably insufficient sample size, whereas poor inter-observer agreement leading to inflation of correlation can be ruled out. At the criterion and principle level, aggregation of information into scores appears to partly smoothen undirected variation at the single measure level without losing sensitivity in terms of welfare evaluation. Reliable on-farm animal welfare assessments should therefore be based on repeated assessments. Further long-term studies are recommended to better understand the factors influencing consistency over time.
Choosing a reliability inspection plan for interval censored data
Lu, Lu; Anderson-Cook, Christine Michaela
2017-04-19
Reliability test plans are important for producing precise and accurate assessment of reliability characteristics. This paper explores different strategies for choosing between possible inspection plans for interval censored data given a fixed testing timeframe and budget. A new general cost structure is proposed for guiding precise quantification of total cost in inspection test plan. Multiple summaries of reliability are considered and compared as the criteria for choosing the best plans using an easily adapted method. Different cost structures and representative true underlying reliability curves demonstrate how to assess different strategies given the logistical constraints and nature of the problem. Results show several general patterns exist across a wide variety of scenarios. Given the fixed total cost, plans that inspect more units with less frequency based on equally spaced time points are favored due to the ease of implementation and consistent good performance across a large number of case study scenarios. Plans with inspection times chosen based on equally spaced probabilities offer improved reliability estimates for the shape of the distribution, mean lifetime, and failure time for a small fraction of population only for applications with high infant mortality rates. The paper uses a Monte Carlo simulation based approach in addition to the common evaluation based on the asymptotic variance and offers comparison and recommendation for different applications with different objectives. Additionally, the paper outlines a variety of different reliability metrics to use as criteria for optimization, presents a general method for evaluating different alternatives, as well as provides case study results for different common scenarios.
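The two inspection-time strategies contrasted in this abstract can be sketched as follows. The Weibull lifetime model, its shape and scale values, and the function names are assumptions made for illustration; the abstract does not specify the distributions or parameters the authors used.

```python
import math

def equally_spaced_times(t_end, n):
    """n inspection times at equal time intervals over (0, t_end]."""
    return [t_end * i / n for i in range(1, n + 1)]

def equal_probability_times(t_end, n, shape, scale):
    """n inspection times at equal steps of Weibull failure probability, ending at t_end."""
    # Failure probability accumulated by the end of the test window.
    p_end = 1.0 - math.exp(-((t_end / scale) ** shape))
    times = []
    for i in range(1, n + 1):
        p = p_end * i / n
        # Invert the Weibull CDF to find the time at which probability p is reached.
        times.append(scale * (-math.log(1.0 - p)) ** (1.0 / shape))
    return times

# Hypothetical plan: 4 inspections over 100 time units, with a shape < 1
# standing in for a high infant mortality rate.
by_time = equally_spaced_times(100.0, 4)
by_prob = equal_probability_times(100.0, 4, shape=0.5, scale=100.0)
print(by_time)
print([round(t, 2) for t in by_prob])
```

With a shape parameter below 1, the equal-probability plan places its first inspections much earlier than the equal-time plan, which is consistent with the abstract's remark that such plans help mainly when early failures dominate.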
Choosing a reliability inspection plan for interval censored data
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lu, Lu; Anderson-Cook, Christine Michaela
Reliability test plans are important for producing precise and accurate assessment of reliability characteristics. This paper explores different strategies for choosing between possible inspection plans for interval censored data given a fixed testing timeframe and budget. A new general cost structure is proposed for guiding precise quantification of total cost in inspection test plan. Multiple summaries of reliability are considered and compared as the criteria for choosing the best plans using an easily adapted method. Different cost structures and representative true underlying reliability curves demonstrate how to assess different strategies given the logistical constraints and nature of the problem. Results show several general patterns exist across a wide variety of scenarios. Given the fixed total cost, plans that inspect more units with less frequency based on equally spaced time points are favored due to the ease of implementation and consistent good performance across a large number of case study scenarios. Plans with inspection times chosen based on equally spaced probabilities offer improved reliability estimates for the shape of the distribution, mean lifetime, and failure time for a small fraction of population only for applications with high infant mortality rates. The paper uses a Monte Carlo simulation based approach in addition to the common evaluation based on the asymptotic variance and offers comparison and recommendation for different applications with different objectives. Additionally, the paper outlines a variety of different reliability metrics to use as criteria for optimization, presents a general method for evaluating different alternatives, as well as provides case study results for different common scenarios.
Kim, Kwang-Yon; Shin, Seong Eun; No, Kyoung Tai
2015-01-01
Objectives: For successful adoption of legislation controlling registration and assessment of chemical substances, it is important to obtain sufficient toxicological experimental evidence and other related information. It is also essential to obtain a sufficient number of predicted risk and toxicity results. In particular, methods for predicting the toxicities of chemical substances during acquisition of the required data ultimately become an economical approach for dealing with new substances in the future. Although the need for such methods is gradually increasing, the required information about their reliability and applicability range has not been systematically provided. Methods: There are various representative environmental and human toxicity models based on quantitative structure-activity relationships (QSAR). Here, we secured 10 representative QSAR-based prediction models, together with their associated information, that can make predictions about substances that are expected to be regulated. We used models that predict and confirm usability of the information expected to be collected and submitted according to the legislation. After collecting and evaluating each predictive model and relevant data, we prepared methods quantifying the scientific validity and reliability, which are essential conditions for using predictive models. Results: We calculated predicted values for the models. Furthermore, we deduced and compared adequacies of the models using the Alternative non-testing method assessed for Registration, Evaluation, Authorization, and Restriction of Chemicals Substances scoring system, and deduced the applicability domains for each model. Additionally, we calculated and compared inclusion rates of substances expected to be regulated, to confirm the applicability. Conclusions: We evaluated and compared the data, adequacy, and applicability of our selected QSAR-based toxicity prediction models, and included them in a database. Based on these data, we aimed to construct a system that can be used with predicted toxicity results. Furthermore, by presenting the suitability of individual predicted results, we aimed to provide a foundation that could be used in actual assessments and regulations. PMID:26206368
An Investigation of the Impact of Guessing on Coefficient α and Reliability
2014-01-01
Guessing is known to influence the test reliability of multiple-choice tests. Although there are many studies that have examined the impact of guessing, they used rather restrictive assumptions (e.g., parallel test assumptions, homogeneous inter-item correlations, homogeneous item difficulty, and homogeneous guessing levels across items) to evaluate the relation between guessing and test reliability. Based on the item response theory (IRT) framework, this study investigated the extent of the impact of guessing on reliability under more realistic conditions where item difficulty, item discrimination, and guessing levels actually vary across items with three different test lengths (TL). By accommodating multiple item characteristics simultaneously, this study also focused on examining interaction effects between guessing and other variables entered in the simulation to be more realistic. The simulation of the more realistic conditions and calculations of reliability and classical test theory (CTT) item statistics were facilitated by expressing CTT item statistics, coefficient α, and reliability in terms of IRT model parameters. In addition to the general negative impact of guessing on reliability, results showed interaction effects between TL and guessing and between guessing and test difficulty.
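For orientation, coefficient α can be computed directly from a matrix of item scores. The sketch below uses the standard sample formula rather than the IRT-parameter expressions developed in the study, and the two small response matrices are hypothetical: the second simply flips a few answers in the first to mimic the inconsistency that guessing can introduce.

```python
from statistics import variance

def cronbach_alpha(scores):
    """Coefficient alpha for a list of respondents, each a list of item scores."""
    k = len(scores[0])  # number of items
    item_variances = [variance([row[j] for row in scores]) for j in range(k)]
    total_variance = variance([sum(row) for row in scores])
    return k / (k - 1) * (1.0 - sum(item_variances) / total_variance)

# Hypothetical 0/1 responses: 4 respondents by 3 items.
consistent = [[1, 1, 1], [0, 0, 0], [1, 1, 1], [0, 0, 0]]
noisy = [[1, 1, 0], [0, 1, 0], [1, 1, 1], [0, 0, 0]]

alpha_consistent = cronbach_alpha(consistent)
alpha_noisy = cronbach_alpha(noisy)
print(round(alpha_consistent, 2), round(alpha_noisy, 2))  # 1.0 0.75
```

The drop from 1.0 to 0.75 in this toy case only illustrates the direction of the effect; the size of the effect under realistic item parameters is what the simulation study addresses.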
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hamachi La Commare, Kristina
Metrics for reliability, such as the frequency and duration of power interruptions, have been reported by electric utilities for many years. This study examines current utility practices for collecting and reporting electricity reliability information and discusses challenges that arise in assessing reliability because of differences among these practices. The study is based on reliability information for the year 2006 reported by 123 utilities in 37 states representing over 60 percent of total U.S. electricity sales. We quantify the effects that inconsistencies among current utility reporting practices have on comparisons of System Average Interruption Duration Index (SAIDI) and System Average Interruption Frequency Index (SAIFI) reported by utilities. We recommend immediate adoption of IEEE Std. 1366-2003 as a consistent method for measuring and reporting reliability statistics.
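The two indices named in this abstract follow the usual definitions: SAIFI is total customer interruptions divided by customers served, and SAIDI is total customer interruption duration divided by customers served. A minimal sketch, using hypothetical outage records and function names that are not drawn from the study:

```python
def compute_saifi(events, customers_served):
    """Average number of interruptions per customer served."""
    return sum(customers for customers, _ in events) / customers_served

def compute_saidi(events, customers_served):
    """Average interruption duration (minutes) per customer served."""
    return sum(customers * minutes for customers, minutes in events) / customers_served

# Hypothetical year of outages: (customers interrupted, outage duration in minutes).
events = [(1000, 90), (500, 30), (2000, 45)]
customers_served = 10000

saifi = compute_saifi(events, customers_served)
saidi = compute_saidi(events, customers_served)
print(saifi, saidi)  # 0.35 19.5
```

Differences in which events a utility counts (for example, whether major storms are included) change the `events` list and therefore both indices, which is the kind of inconsistency the study quantifies.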
The Future Value of Serious Games for Assessment: Where Do We Go Now?
ERIC Educational Resources Information Center
de Klerk, Sebastiaan; Kato, Pamela M.
2017-01-01
Game-based assessments will most likely be an increasing part of testing programs in future generations because they provide promising possibilities for more valid and reliable measurement of students' skills as compared to the traditional methods of assessment like paper-and-pencil tests or performance-based assessments. The current status of…
Rozental, Alexander; Forsell, Erik; Svensson, Andreas; Forsström, David; Andersson, Gerhard; Carlbring, Per
2014-01-01
Procrastination is a prevalent self-regulatory failure associated with stress and anxiety, decreased well-being, and poorer performance in school as well as work. One-fifth of the adult population and half of the student population describe themselves as chronic and severe procrastinators. However, despite the fact that it can become a debilitating condition, valid and reliable self-report measures for assessing the occurrence and severity of procrastination are lacking, particularly for use in a clinical context. The current study explored the usefulness of the Swedish versions of three Internet-administered self-report measures for evaluating procrastination: the Pure Procrastination Scale, the Irrational Procrastination Scale, and the Susceptibility to Temptation Scale, all of which have good psychometric properties in English. In total, 710 participants were recruited for a clinical trial of Internet-based cognitive behavior therapy for procrastination. All of the participants completed the scales as well as self-report measures of depression, anxiety, and quality of life. Principal Component Analysis was performed to assess the factor validity of the scales, and internal consistency and correlations between the scales were also determined. Intraclass Correlation Coefficient, Minimal Detectable Change, and Standard Error of Measurement were calculated for the Irrational Procrastination Scale. The Swedish versions of the scales have a factor structure similar to that of the English versions, generated good internal consistencies, with Cronbach's α ranging from .76 to .87, and were moderately to highly intercorrelated. The Irrational Procrastination Scale had an Intraclass Correlation Coefficient of .83, indicating excellent reliability. Furthermore, Standard Error of Measurement was 1.61, and Minimal Detectable Change was 4.47, suggesting that a change of almost five points on the scale is necessary to determine a reliable change in self-reported procrastination severity. The current study revealed that the Pure Procrastination Scale, the Irrational Procrastination Scale, and the Susceptibility to Temptation Scale are both valid and reliable from a psychometric perspective, and that they might be used for assessing the occurrence and severity of procrastination via the Internet. The current study is part of a clinical trial assessing the efficacy of Internet-based cognitive behavior therapy for procrastination, and was registered 04/22/2013 on ClinicalTrials.gov (NCT01842945).
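The relationship between the reported Standard Error of Measurement and Minimal Detectable Change can be checked with the commonly used 95% formula, MDC = 1.96 × √2 × SEM. That the authors used exactly this formula is an assumption; the abstract does not state it.

```python
import math

def minimal_detectable_change(sem, z=1.96):
    """MDC at the confidence level implied by z, for a difference between two measurements."""
    return z * math.sqrt(2.0) * sem

# SEM of 1.61 is the value reported in the abstract.
mdc = minimal_detectable_change(1.61)
print(round(mdc, 2))  # 4.46
```

The result is within 0.01 of the reported 4.47; a gap of that size would be expected if the published SEM was rounded before the MDC was derived from it.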
Paesani, Daniel A; Guarda-Nardini, Luca; Gelos, Carlota; Salmaso, Luigi; Manfredini, Daniele
2014-03-01
The aim was to answer the clinical research question: is incisal/occlusal tooth wear assessment on dental casts performed by five professionals with expertise in different fields of dentistry reliable? Five examiners with different fields of expertise in the dental profession assessed tooth wear on dental casts of 45 subjects, based on a six-degree rating of incisal/occlusal wear. After a calibration meeting, the examiners evaluated the casts individually and various issues concerning interexaminer agreement and reliability were assessed. A total of 872 teeth were evaluated. The five examiners agreed on the rating of only 6.6% of the teeth. The teeth with the highest percentage of agreement were the premolars. Pairwise comparison of the assessments of examiners #1 (bruxism expert), #2 (orthodontist), #3 (temporomandibular disorders [TMD] and occlusion expert), and #4 (dental nurse) showed fair to moderate agreement, with κ-values ranging from 0.306 to 0.577, whilst examiner #5 (lab technician) achieved low interexaminer reliability values with all the other four examiners. The interexaminer reliability of tooth wear assessment on dental casts performed by five professionals with expertise in different fields of dentistry is highly variable. General practitioners should keep in mind that consensus decisions by the examiners and assessment by raters belonging to the same dental discipline are recommended strategies to increase the reliability of tooth wear evaluation in the clinical setting. This investigation adds to the literature suggesting that, in a clinical setting, a single examiner's assessment of tooth wear on dental casts does not have optimal reliability and that it may be a source of internal validity problems in the research setting.
Vital physical signals measurements using a webcam
NASA Astrophysics Data System (ADS)
Ouyang, Jianfei; Yan, Yonggang; Yao, Lifeng
2013-10-01
Non-contact and remote measurements of vital physical signals are important for reliable and comfortable physiological self-assessment. In this paper, we provide a new video-based methodology for remote and fast measurements of vital physical signals such as cardiac pulse and breathing rate. A webcam is used to record color video of a human face or wrist, and a photoplethysmography (PPG) technique is applied to perform the measurements of the vital signals. A novel sequential blind signal extraction methodology is applied to the color video under normal lighting conditions, based on correlation analysis between the green trace and the source signals. The approach is also applied successfully under different illumination conditions, in which the target signal can still be extracted accurately. To assess the advantages, the measuring time is recorded for a large number of cases. The experimental results show that it takes less than 30 seconds to measure the vital physical signals using the presented technique. The study indicates that the proposed approach is feasible for the PPG technique and provides a way to study the relationship of the signal across different regions of interest (ROI) in future research.
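To make the idea of reading a pulse rate from the green trace concrete, the sketch below searches for the strongest frequency in a plausible heart-rate band. This is a deliberately simpler stand-in (a periodogram peak search), not the sequential blind signal extraction method described in the abstract, and the 30 fps frame rate, the 0.7 to 4.0 Hz band, and the synthetic 1.2 Hz trace are all assumptions for illustration.

```python
import math

def dominant_frequency(signal, fps, f_min=0.7, f_max=4.0, step=0.01):
    """Return the frequency (Hz) with the largest spectral power in [f_min, f_max]."""
    mean = sum(signal) / len(signal)
    centered = [s - mean for s in signal]  # remove the constant (DC) level
    best_f, best_power = f_min, -1.0
    n_steps = int(round((f_max - f_min) / step))
    for k in range(n_steps + 1):
        f = f_min + k * step
        # Correlate the trace with a cosine and a sine at the candidate frequency.
        re = sum(x * math.cos(2 * math.pi * f * i / fps) for i, x in enumerate(centered))
        im = sum(x * math.sin(2 * math.pi * f * i / fps) for i, x in enumerate(centered))
        power = re * re + im * im
        if power > best_power:
            best_f, best_power = f, power
    return best_f

# Synthetic green-channel trace: 20 s at 30 fps with a 1.2 Hz (72 beats/min) component.
fps = 30
green_trace = [0.5 + 0.02 * math.sin(2 * math.pi * 1.2 * i / fps) for i in range(fps * 20)]

pulse_bpm = 60.0 * dominant_frequency(green_trace, fps)
print(round(pulse_bpm))
```

Real webcam traces contain motion and lighting artifacts that a plain peak search handles poorly, which is part of the motivation for the extraction step the authors propose.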
Angst, Ueli M.; Boschmann, Carolina; Wagner, Matthias; Elsener, Bernhard
2017-01-01
The aging of reinforced concrete infrastructure in developed countries imposes an urgent need for methods to reliably assess the condition of these structures. Corrosion of the embedded reinforcing steel is the most frequent cause for degradation. While it is well known that the ability of a structure to withstand corrosion depends strongly on factors such as the materials used or the age, it is common practice to rely on threshold values stipulated in standards or textbooks. These threshold values for corrosion initiation (Ccrit) are independent of the actual properties of a certain structure, which clearly limits the accuracy of condition assessments and service life predictions. The practice of using tabulated values can be traced to the lack of reliable methods to determine Ccrit on-site and in the laboratory. Here, an experimental protocol to determine Ccrit for individual engineering structures or structural members is presented. A number of reinforced concrete samples are taken from structures and laboratory corrosion testing is performed. The main advantage of this method is that it ensures real conditions concerning parameters that are well known to greatly influence Ccrit, such as the steel-concrete interface, which cannot be representatively mimicked in laboratory-produced samples. At the same time, the accelerated corrosion test in the laboratory permits the reliable determination of Ccrit prior to corrosion initiation on the tested structure; this is a major advantage over all common condition assessment methods that only permit estimating the conditions for corrosion after initiation, i.e., when the structure is already damaged. The protocol yields the statistical distribution of Ccrit for the tested structure. This serves as a basis for probabilistic prediction models for the remaining time to corrosion, which is needed for maintenance planning. 
This method can potentially be used in material testing of civil infrastructures, similar to established methods used for mechanical testing. PMID:28892023
Cost of enlarged operating zone for an existing Francis runner
NASA Astrophysics Data System (ADS)
Monette, Christine; Marmont, Hugues; Chamberland-Lauzon, Joël; Skagerstrand, Anders; Coutu, André; Carlevi, Jens
2016-11-01
Traditionally, hydro power plants have been operated close to the best efficiency point, the most stable operating condition, for which they have been designed. However, because of changes in the electricity market, many hydro power plant operators wish to operate their machines differently to fulfil those new market needs. New operating conditions can include whole range operation, many start/stops, extensive low load operation, synchronous condenser mode and power/frequency regulation. Many of these new operating conditions may impose more severe fatigue damage than the traditional base load operation close to the best efficiency point. Under these conditions, the fatigue life of the runner may be significantly reduced, and repair or replacement costs might be incurred sooner than expected. In order to design reliable Francis runners for those new challenging operating scenarios, Andritz Hydro has developed various proprietary tools and design rules. These are used within Andritz Hydro to design mechanically robust Francis runners for the operating scenarios that fulfil the customer's specifications. To estimate residual life under different operating scenarios of an existing runner designed years ago for best efficiency base load operation, Andritz Hydro's design rules and tools would necessarily lead to conservative results. While the geometry of a new runner can be modified to fulfil all conservative mechanical design rules, the predicted fatigue life of an existing runner under off-design operating conditions may appear rather short because of the conservative safety factor included in the calculations. The most precise and reliable way to calculate the residual life of an existing runner under different operating scenarios is to perform a strain gauge measurement campaign on the runner. 
This paper presents the runner strain gauge measurement campaign of a mid-head Francis turbine over all the operating conditions available during the test, the analysis of the measurement signals and the runner residual life assessment under different operating scenarios. With these results, the maintenance cost of the change in operating mode can then be calculated and foreseen by the power plant owner.
Dedy, Nicolas J; Szasz, Peter; Louridas, Marisa; Bonrath, Esther M; Husslein, Heinrich; Grantcharov, Teodor P
2015-06-01
Nontechnical skills are critical for patient safety in the operating room (OR). As a result, regulatory bodies for accreditation and certification have mandated the integration of these competencies into postgraduate education. A generally accepted approach to the in-training assessment of nontechnical skills, however, is lacking. The goal of the present study was to develop an evidence-based and reliable tool for the in-training assessment of residents' nontechnical performance in the OR. The Objective Structured Assessment of Nontechnical Skills tool was designed as a 5-point global rating scale with descriptive anchors for each item, based on existing evidence-based frameworks of nontechnical skills, as well as resident training requirements. The tool was piloted on scripted videos and refined in an iterative process. The final version was used to rate residents' performance in recorded OR crisis simulations and during live observations in the OR. A total of 37 simulations and 10 live procedures were rated. Interrater agreement was good for total mean scores, both in simulation and in the real OR, with intraclass correlation coefficients >0.90 in all settings for average and single measures. Internal consistency of the scale was high (Cronbach's alpha = 0.80). The Objective Structured Assessment of Nontechnical Skills global rating scale was developed as an evidence-based tool for the in-training assessment of residents' nontechnical performance in the OR. Unique descriptive anchors allow for a criterion-referenced assessment of performance. Good reliability was demonstrated in different settings, supporting applications in research and education. Copyright © 2015 Elsevier Inc. All rights reserved.
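For readers unfamiliar with the internal-consistency statistic quoted above, the following minimal sketch computes Cronbach's alpha from a subjects-by-items score table using the standard formula alpha = k/(k-1) * (1 - sum of item variances / variance of total scores). The function name and the example scores are made up for illustration and are not data from the study.

```python
from statistics import variance

def cronbach_alpha(scores):
    """Cronbach's alpha for a list of rows, one row of item scores per subject."""
    k = len(scores[0])                       # number of items
    items = list(zip(*scores))               # transpose to one tuple per item
    item_var_sum = sum(variance(item) for item in items)
    total_var = variance([sum(row) for row in scores])
    return (k / (k - 1)) * (1 - item_var_sum / total_var)

# Hypothetical ratings: three subjects scored on two items.
example_scores = [[2, 3], [4, 4], [3, 5]]
alpha = cronbach_alpha(example_scores)
```

Values near 1 indicate that the items vary together; the sample variance (n - 1 denominator) is used throughout, which is one common convention.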
Reliability analysis and utilization of PEMs in space application
NASA Astrophysics Data System (ADS)
Jiang, Xiujie; Wang, Zhihua; Sun, Huixian; Chen, Xiaomin; Zhao, Tianlin; Yu, Guanghua; Zhou, Changyi
2009-11-01
More and more plastic encapsulated microcircuits (PEMs) are used in space missions to achieve high performance. Since PEMs are designed for use in terrestrial operating conditions, their successful use in the harsh space environment is closely tied to reliability issues, which should be considered first. However, there is no ready-made methodology for PEMs in space applications. This paper discusses the reliability of PEMs used in space. The reliability analysis can be divided into five categories: radiation test, radiation hardness, screening test, reliability calculation and reliability assessment. One case study is also presented to illustrate the details of the process, in which a PEM part is used in a joint space program, the Double-Star Project, between the European Space Agency (ESA) and China. The influence of environmental constraints including radiation, humidity, temperature and mechanics on the PEM part has been considered. Both Double-Star Project satellites are still running well in space now.
VanGeest, Jonathan B; Wynia, Matthew K; Cummins, Deborah S; Wilson, Ira B
2002-06-01
This study examined the test-retest reliability of physicians' self-reported manipulation of reimbursement rules for patients. The test-retest reliability of self-report of three specific tactics were examined: (1) exaggerating the severity of patients' conditions, (2) changing a patient's official (billing) diagnosis, and (3) reporting signs or symptoms that patients did not have. The reliability of a scaled summary measure of physicians' manipulation of reimbursement rules was also assessed. Overall, the authors found high levels of test-retest agreement across all three items and the summary measure. These findings suggest that self-report can be used to produce reliable data on this controversial issue. Specifically, the three items reported here can be used to produce a reliable summary measure of physicians' manipulation of reimbursement rules to help patients obtain care that physicians perceive as necessary.
Reliability modeling of fault-tolerant computer based systems
NASA Technical Reports Server (NTRS)
Bavuso, Salvatore J.
1987-01-01
Digital fault-tolerant computer-based systems have become commonplace in military and commercial avionics. These systems hold the promise of increased availability, reliability, and maintainability over conventional analog-based systems through the application of replicated digital computers arranged in fault-tolerant configurations. Three tightly coupled factors of paramount importance, ultimately determining the viability of these systems, are reliability, safety, and profitability. Reliability, the major driver, affects virtually every aspect of design, packaging, and field operations, and eventually produces profit for commercial applications or increased national security. However, the utilization of digital computer systems makes the task of producing a credible reliability assessment a formidable one for the reliability engineer. The root of the problem lies in the digital computer's unique adaptability to changing requirements, computational power, and ability to test itself efficiently. Addressed here are the nuances of modeling the reliability of systems with large state sizes, in the Markov sense, which result from systems based on replicated redundant hardware, and the modeling of factors which can reduce reliability without concomitant depletion of hardware. Advanced fault-handling models are described and methods of acquiring and measuring parameters for these models are delineated.
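To make the idea of a Markov reliability model with imperfect fault handling concrete, here is a minimal sketch for a hypothetical triple-redundant system. It is a textbook-style three-state model, not one of the models from the report: the failure rate `lam`, the `coverage` probability (the chance that a first failure is handled correctly), and both function names are assumptions. A coverage below 1 lowers reliability even though no extra hardware has been lost.

```python
import math

def tmr_reliability(lam, coverage, t, steps=100_000):
    """Forward-Euler integration of a 3-state Markov model.

    States: three units up (p3), two units up (p2), system failed (implicit).
    A first failure leads to the two-unit state only with probability
    `coverage`; otherwise the system fails immediately.
    """
    p3, p2 = 1.0, 0.0
    dt = t / steps
    for _ in range(steps):
        d3 = -3 * lam * p3
        d2 = 3 * lam * coverage * p3 - 2 * lam * p2
        p3 += d3 * dt
        p2 += d2 * dt
    return p3 + p2

def tmr_closed_form(lam, coverage, t):
    """Analytic solution of the same model, used as a cross-check."""
    e2, e3 = math.exp(-2 * lam * t), math.exp(-3 * lam * t)
    return e3 + 3 * coverage * (e2 - e3)

r_numeric = tmr_reliability(1e-4, 0.99, 1000.0)
r_exact = tmr_closed_form(1e-4, 0.99, 1000.0)
```

With perfect coverage the closed form reduces to the familiar 3e^(-2λt) - 2e^(-3λt) expression for two-out-of-three redundancy.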
Multi-frequency bioimpedance in equine muscle assessment.
Harrison, Adrian Paul; Elbrønd, Vibeke Sødring; Riis-Olesen, Kiwa; Bartels, Else Marie
2015-03-01
Multi-frequency BIA (mfBIA) equipment has been shown to be a non-invasive and reliable method to assess a muscle as a whole or at fibre level. In the equine world this may be the future method of assessment of training condition or of muscle injury. The aim of this study was to test whether mfBIA can reliably be used to assess the condition of a horse's muscles in connection with health assessment, injury and both training and re-training. mfBIA measurements were carried out on 10 'hobby' horses and 5 selected cases with known anamnesis. Impedance, resistance, reactance, phase angle, centre frequency, membrane capacitance and both extracellular and intracellular resistance were measured. Platinum electrodes in connection with a conductance paste were used to accommodate the typical BIA frequencies and to facilitate accurate measurements. Use of mfBIA data to look into the effects of myofascial release treatment was also demonstrated. Our findings indicate that mfBIA provides a non-invasive, easily measurable and very precise assessment of the state of muscles in horses. This study also shows the potential of mfBIA as a diagnostic tool as well as a tool to monitor effects of treatment, e.g. myofascial release therapy and metabolic diseases, respectively.
Bond, Mary Lou; Cason, Carolyn L
2014-01-01
To assess the content validity and internal consistency reliability of the Healthcare Professions Education Program Self-Assessment (PSA) and the Institutional Self-Assessment for Factors Supporting Hispanic Student Retention (ISA). Health disparities among vulnerable populations are among the top priorities demanding attention in the United States. Efforts to recruit and retain Hispanic nursing students are essential. Based on a sample of provosts, deans/directors, and an author of the Model of Institutional Support, participants commented on the perceived validity and usefulness of each item of the PSA and ISA. Internal consistency reliability was calculated by Cronbach's alpha using responses from nursing schools in states with large Hispanic populations. The ISA and PSA were found to be reliable and valid tools for assessing institutional friendliness. The instruments highlight strengths and identify potential areas of improvement at institutional and program levels.
PROBABILISTIC AQUATIC EXPOSURE ASSESSMENT FOR PESTICIDES 1: FOUNDATIONS
Models that capture underlying mechanisms and processes are necessary for reliable extrapolation of laboratory chemical data to field conditions. For validation, these models require a major revision of the conventional model testing paradigm to better recognize the conflict betw...
Cordier, Reinie; Munro, Natalie; Wilkes-Gillan, Sarah; Speyer, Renée; Pearce, Wendy M
2014-07-01
There is a need for a reliable and valid assessment of childhood pragmatic language skills during peer-peer interactions. This study aimed to evaluate the psychometric properties of a newly developed pragmatic assessment, the Pragmatic Observational Measure (POM). The psychometric properties of the POM were investigated from observational data of two studies - study 1 involved 342 children aged 5-11 years (108 children with ADHD; 108 typically developing playmates; 126 children in the control group), and study 2 involved 9 children with ADHD who attended a 7-week play-based intervention. The psychometric properties of the POM were determined based on the COnsensus-based Standards for the selection of health status Measurement INstruments (COSMIN) taxonomy of psychometric properties and definitions for health-related outcomes; the Pragmatic Protocol was used as the reference tool against which the POM was evaluated. The POM demonstrated sound psychometric properties in all the reliability, validity and interpretability criteria against which it was assessed. The findings showed that the POM is a reliable and valid measure of pragmatic language skills of children with ADHD between the age of 5 and 11 years and has clinical utility in identifying children with pragmatic language difficulty. Copyright © 2014 The Authors. Published by Elsevier Ltd. All rights reserved.
Rosenblum, Uri; Melzer, Itshak
2017-01-01
About 90% of people with multiple sclerosis (PwMS) have gait instability and 50% fall. Reliable and clinically feasible methods of gait instability assessment are needed. The study investigated the reliability and validity of the Narrow Path Walking Test (NPWT) under single-task (ST) and dual-task (DT) conditions for PwMS. Thirty PwMS performed the NPWT on 2 different occasions, a week apart. Number of Steps, Trial Time, Trial Velocity, Step Length, Number of Step Errors, Number of Cognitive Task Errors, and Number of Balance Losses were measured. Intraclass correlation coefficients (ICC2,1) were calculated from the average values of NPWT parameters. Absolute reliability was quantified from standard error of measurement (SEM) and smallest real difference (SRD). Concurrent validity of NPWT with Functional Reach Test, Four Square Step Test (FSST), 12-item Multiple Sclerosis Walking Scale (MSWS-12), and 2 Minute Walking Test (2MWT) was determined using partial correlations. Intraclass correlation coefficients (ICCs) for most NPWT parameters during ST and DT ranged from 0.46-0.94 and 0.55-0.95, respectively. The highest relative reliability was found for Number of Step Errors (ICC = 0.94 and 0.93, for ST and DT, respectively) and Trial Velocity (ICC = 0.83 and 0.86, for ST and DT, respectively). Absolute reliability was high for Number of Step Errors in ST (SEM % = 19.53%) and DT (SEM % = 18.14%) and low for Trial Velocity in ST (SEM % = 6.88%) and DT (SEM % = 7.29%). Significant correlations for Number of Step Errors and Trial Velocity were found with FSST, MSWS-12, and 2MWT. In PwMS performing the NPWT, Number of Step Errors and Trial Velocity were highly reliable parameters. 
Based on correlations with other measures of gait instability, Number of Step Errors was the most valid parameter of dynamic balance under the conditions of our test. Video Abstract available for more insights from the authors (see Supplemental Digital Content 1, available at: http://links.lww.com/JNPT/A159).
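The SEM and SEM % figures quoted above are commonly derived from the between-subject standard deviation and the ICC. The sketch below uses the widely used relation SEM = SD * sqrt(1 - ICC) and expresses it as a percentage of the mean; whether the authors used exactly this form is an assumption, and the input numbers are invented for the example.

```python
import math

def standard_error_of_measurement(sd, icc):
    """SEM from the between-subject SD and a reliability coefficient (ICC)."""
    return sd * math.sqrt(1.0 - icc)

def sem_percent(sd, icc, mean):
    """SEM expressed as a percentage of the sample mean."""
    return 100.0 * standard_error_of_measurement(sd, icc) / mean

# Hypothetical parameter: SD = 10, ICC = 0.75, mean = 50 (arbitrary units).
sem_value = standard_error_of_measurement(10.0, 0.75)
sem_pct = sem_percent(10.0, 0.75, 50.0)
```

A smaller SEM % means less measurement noise relative to the typical score, which is the sense in which absolute reliability is usually compared across parameters.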
Visconti, Luca; Martin, Conchita
2013-01-01
The aim of this study was to evaluate both intra- and interoperator reliability of a radiological three-dimensional classification system (KPG index) for the assessment of degree of difficulty for orthodontic treatment of maxillary canine impactions. Cone beam computed tomography (CBCT) scans of fifty impacted canines, obtained using three different scanners (NewTom, Kodak, and Planmeca), were classified using the KPG index by three independent orthodontists. Measurements were repeated one month later. Based on these two sessions, several recommendations on KPG Index scoring were elaborated. After a joint calibration session, these recommendations were explained to nine orthodontists and the two measurement sessions were repeated. There was a moderate intrarater agreement in the precalibration measurement sessions. After the calibration session, both intra- and interrater agreement were almost perfect. Indexes assessed with Kodak Dental Imaging 3D module software showed a better reliability in z-axis values, whereas indexes assessed with Planmeca Romexis software showed a better reliability in x- and y-axis values. No differences were found between the CBCT scanners used. Taken together, these findings indicate that the application of the instructions elaborated during this study improved KPG index reliability, which was nevertheless variously influenced by the use of different software for images evaluation. PMID:24235889
Lovett, Rosemary; Summerfield, Quentin; Vickers, Deborah
2013-06-01
The Toy Discrimination Test measures children's ability to discriminate spoken words. Previous assessments of reliability tested children with normal hearing or mild hearing impairment, and most studies used a version of the test without a masking sound. We assessed test-retest reliability for children with hearing impairment using maskers of broadband noise and two-talker babble. Stimuli were presented from a loudspeaker. The signal-to-noise ratio (SNR) was varied adaptively to estimate the speech-reception threshold (SRT) corresponding to 70.7% correct performance. Participants completed each masked condition twice. Fifty-five children with permanent hearing impairment participated, aged 3.0 to 6.3 years. Thirty-four children used acoustic hearing aids; 21 children used cochlear implants. For the noise masker, the within-subject standard deviation of SRTs was 2.4 dB, and the correlation between first and second SRT was + 0.73. For the babble masker, corresponding values were 2.7 dB and + 0.60. Reliability was similar for children with hearing aids and children with cochlear implants. The results can inform the interpretation of scores from individual children. If a child completes a condition twice in different listening situations (e.g. aided and unaided), a difference between scores ≥ 7.5 dB would be statistically significant (p <.05).
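The 7.5 dB criterion can be checked against the reported within-subject standard deviations. The sketch below assumes the usual critical-difference formula for two measurements, 1.96 * sqrt(2) * within-subject SD; the abstract does not state the formula, so this is one plausible reading rather than the authors' documented calculation.

```python
import math

def critical_difference(within_subject_sd, z=1.96):
    """Smallest difference between two scores that is significant at p < .05."""
    return z * math.sqrt(2) * within_subject_sd

noise_cd = critical_difference(2.4)    # within-subject SD for the noise masker
babble_cd = critical_difference(2.7)   # within-subject SD for the babble masker
```

Under this assumption the babble masker gives about 7.5 dB and the noise masker about 6.7 dB, so the quoted 7.5 dB is consistent with the larger of the two standard deviations.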
Life-assessment technique for nuclear power plant cables
NASA Astrophysics Data System (ADS)
Bartoníček, B.; Hnát, V.; Plaček, V.
1998-06-01
The condition of polymer-based cable material can be best characterized by measuring the elongation at break of its insulating materials. However, it is often not possible to take sufficiently large samples for measurement with the tensile testing machine. The problem has been conveniently solved by utilizing the differential scanning calorimetry technique. From the tested cable, several microsamples are taken and the oxidation induction time (OIT) is determined. For each cable whose lifetime is to be assessed, the correlation of OIT with elongation at break and the correlation of elongation at break with the cable service time have to be established. A reliable assessment of the cable lifetime depends on the accuracy of these correlations. Consequently, the synergistic effects well known at this time (dose rate effects and effects resulting from the different sequence of applying radiation and elevated temperature) must be taken into account.
System reliability analysis of granular filter for protection against piping in dams
NASA Astrophysics Data System (ADS)
Srivastava, A.; Sivakumar Babu, G. L.
2015-09-01
Granular filters are provided in water-retaining structures for protection against piping failure. Piping is triggered when the base soil to be protected starts migrating in the direction of seepage flow under the influence of the seepage force. To protect the base soil from migration, the voids in the filter media should be small enough, but not so small that they block the smooth passage of seeping water. Fulfilling these two contradictory design requirements at the same time is a major concern for the successful performance of granular filter media. Since the Terzaghi era, the particle size distribution (PSD) of granular filters has conventionally been designed based on the particle size distribution characteristics of the base soil to be protected. The design approach provides a range of D15f values within which the PSD of the granular filter media should fall, and infinitely many possibilities exist. Further, safety against the two critical design requirements cannot be ensured. Although used successfully for many decades, the existing filter design guidelines are purely empirical in nature, accompanied by experience and good engineering judgment. In the present study, analytical solutions for obtaining the factor of safety with respect to base soil particle migration and soil permeability considerations, as proposed by the authors, are first discussed. The solution takes into consideration the basic geotechnical properties of the base soil and filter media as well as the existing hydraulic conditions, and provides a comprehensive solution to granular filter design with the ability to assess stability in terms of a factor of safety. Considering that geotechnical properties are variable in nature, probabilistic analysis is further suggested to evaluate the system reliability of the filter media, which may help in risk assessment and risk management for decision making.
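A system-reliability calculation of the kind suggested above can be sketched with a simple Monte Carlo simulation. The example treats the two design requirements as a series system that fails if either factor of safety drops below 1. The lognormal distributions and their parameters are purely hypothetical placeholders; the authors' analytical factor-of-safety expressions are not given in the abstract and are not reproduced here.

```python
import random

def filter_failure_probabilities(n_trials=20_000, seed=0):
    """Estimate failure probabilities for two requirements and for the system.

    Each factor of safety is drawn from an assumed lognormal distribution;
    the two are sampled independently, which is itself an assumption.
    """
    rng = random.Random(seed)
    fail_retention = fail_permeability = fail_system = 0
    for _ in range(n_trials):
        fs_retention = rng.lognormvariate(0.4, 0.25)     # particle migration check
        fs_permeability = rng.lognormvariate(0.6, 0.30)  # permeability check
        r = fs_retention < 1.0
        p = fs_permeability < 1.0
        fail_retention += r
        fail_permeability += p
        fail_system += (r or p)   # series system: either failure fails the filter
    return (fail_retention / n_trials,
            fail_permeability / n_trials,
            fail_system / n_trials)

pf_retention, pf_permeability, pf_system = filter_failure_probabilities()
```

The system failure probability is never smaller than either individual one, which is why checking the two requirements separately can overstate the safety of the filter as a whole.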
Ferrario, M M; Cesana, G
2009-01-01
Owing to new legislation, the assessment of work stress has become compulsory in Italy for all enterprises. Work stress has become a leading health problem in work settings all over Europe. The two major approaches, expert-based direct observation and the measurement of perceived job strain, are briefly introduced, with emphasis on their strengths and weaknesses. Among the methods to assess perceived job stress, Karasek's Job Content Questionnaire (JCQ) has been used extensively in Italy, and the available results support its use because it is reliable and able to pick up major constrictiveness at work. In addition, because reference levels are now available, comparisons are possible for both public and private enterprises. Acknowledging the complexity of carrying out a reliable assessment of work stress, a multiphase approach is emphasised. First, an analysis of current data can be used to estimate the levels of turnover, down-sizing, outsourcing, extra hours, shift work, sickness absenteeism, changes of job titles, work accidents and work-related diseases. In a second step, the JCQ can be used to assess workers' perception of stress in groups of workers recognised as at risk in the first phase and in control groups. Finally, when conditions of constraint emerge, further investigations are required, including the intervention of experts in work organisation analysis and clinical psychological examinations of selected workers, to distinguish between work-related and personal psychological problems and health consequences.
Deskovitz, Mark A; Weed, Nathan C; McLaughlan, Joseph K; Williams, John E
2016-04-01
The reliability of six Minnesota Multiphasic Personality Inventory-Second edition (MMPI-2) computer-based test interpretation (CBTI) programs was evaluated across a set of 20 commonly appearing MMPI-2 profile codetypes in clinical settings. Evaluation of CBTI reliability comprised examination of (a) interrater reliability, the degree to which raters arrive at similar inferences based on the same CBTI profile and (b) interprogram reliability, the level of agreement across different CBTI systems. Profile inferences drawn by four raters were operationalized using q-sort methodology. Results revealed no significant differences overall with regard to interrater and interprogram reliability. Some specific CBTI/profile combinations (e.g., the CBTI by Automated Assessment Associates on a within normal limits profile) and specific profiles (e.g., the 4/9 profile displayed greater interprogram reliability than the 2/4 profile) were interpreted with variable consensus (α range = .21-.95). In practice, users should consider that certain MMPI-2 profiles are interpreted more or less consensually and that some CBTIs show variable reliability depending on the profile. © The Author(s) 2015.
Risk assessment for construction projects of transport infrastructure objects
NASA Astrophysics Data System (ADS)
Titarenko, Boris
2017-10-01
The paper analyzes and compares different methods of risk assessment for construction projects of transport objects. Managing projects of this type demands special probabilistic and statistical methods because of the high level of uncertainty in their implementation. The aim of the work is to develop a methodology for using traditional methods in combination with robust methods that allow reliable risk assessments to be obtained in projects. The robust approach is based on the principle of maximum likelihood and, in assessing risk, allows the researcher to obtain reliable results in situations of great uncertainty. The application of robust procedures makes it possible to carry out a quantitative assessment of the main risk indicators of projects when solving the tasks of managing innovation-investment projects. The damage from the onset of a risky event can be calculated by any competent specialist, whereas assessing the probability of occurrence of a risky event requires special probabilistic methods based on the proposed robust approaches. Practice shows the effectiveness and reliability of the results. The methodology developed in the article can be used to create information technologies and to apply them in automated control systems for complex projects.
Støre-Valen, Jakob; Ryum, Truls; Pedersen, Geir A F; Pripp, Are H; Jose, Paul E; Karterud, Sigmund
2015-09-01
The Global Assessment of Functioning (GAF) Scale is used in routine clinical practice and research to estimate symptom and functional severity and longitudinal change. Concerns about poor interrater reliability have been raised, and the present study evaluated the effect of a Web-based GAF training program designed to improve interrater reliability in routine clinical practice. Clinicians rated up to 20 vignettes online, and received deviation scores as immediate feedback (i.e., own scores compared with expert raters) after each rating. Growth curves of absolute SD scores across the vignettes were modeled. A linear mixed effects model, using the clinician's deviation scores from expert raters as the dependent variable, indicated an improvement in reliability during training. Moderation by content of scale (symptoms; functioning), scale range (average; extreme), previous experience with GAF rating, profession, and postgraduate training were assessed. Training reduced deviation scores for inexperienced GAF raters, for individuals in clinical professions other than nursing and medicine, and for individuals with no postgraduate specialization. In addition, training was most beneficial for cases with average severity of symptoms compared with cases with extreme severity. The results support the use of Web-based training with feedback routines as a means to improve the reliability of GAF ratings performed by clinicians in mental health practice. These results especially pertain to clinicians in mental health practice who do not have a masters or doctoral degree. (c) 2015 APA, all rights reserved.
Developing a Danish version of the "Impact on Participation and Autonomy Questionnaire".
Ghaziani, Emma; Krogh, Anne Grethe; Lund, Hans
2013-05-01
To translate the "Impact on Participation and Autonomy Questionnaire" into Danish (IPAQ-DK), and estimate its internal consistency and test-retest reliability in order to promote participation-based interventions and research. Translation and two successive reliability assessments through test-retest. 137 adults with varying degrees of impairment; of these, 67 participated in the final reliability assessment. The translation followed guidelines set forth by the "European Group for Quality of Life Assessment and Health Measurement". Internal consistency for subscales was estimated by Cronbach's alpha. Weighted kappa coefficients and intraclass correlation coefficients were calculated to assess the test-retest reliability at item and subscale level, respectively. A preliminary reliability assessment revealed residual issues regarding the translation and cultural adaptation of the instrument. The revised version (IPAQ-DK) was subsequently subjected to a similar assessment, demonstrating Cronbach's alpha values from 0.698 to 0.817. Weighted kappa ranged from 0.370 to 0.880; 78% of these values were higher than 0.600. The intraclass correlation coefficient covered values from 0.701 to 0.818. IPAQ-DK is a useful instrument for identifying person-perceived participation restrictions and satisfaction with participation. Further studies of IPAQ-DK's floor/ceiling effects and responsiveness to change are recommended, as is an examination of whether certain items need further linguistic improvement.
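Item-level test-retest agreement of the kind reported above can be illustrated with a small weighted-kappa calculation. The sketch uses linear disagreement weights; the abstract does not say whether linear or quadratic weights were applied, so that choice, the function name, and the example ratings are all assumptions.

```python
from collections import Counter

def weighted_kappa(ratings_a, ratings_b, categories):
    """Linearly weighted kappa for two rating occasions on ordered categories.

    Assumes both occasions use more than one category overall; otherwise the
    expected disagreement is zero and kappa is undefined.
    """
    index = {c: i for i, c in enumerate(categories)}
    k = len(categories)
    n = len(ratings_a)
    observed = Counter((index[a], index[b]) for a, b in zip(ratings_a, ratings_b))
    marg_a = Counter(index[a] for a in ratings_a)
    marg_b = Counter(index[b] for b in ratings_b)
    num = den = 0.0
    for i in range(k):
        for j in range(k):
            w = abs(i - j) / (k - 1)                       # linear disagreement weight
            num += w * observed[(i, j)] / n                # observed disagreement
            den += w * (marg_a[i] / n) * (marg_b[j] / n)   # chance disagreement
    return 1.0 - num / den

# Hypothetical test and retest answers on a 4-point item.
test_scores = [1, 2, 3, 4, 2, 3]
retest_scores = [1, 2, 3, 4, 2, 3]
kappa_same = weighted_kappa(test_scores, retest_scores, [1, 2, 3, 4])
```

Identical answers on both occasions give a kappa of 1; near misses on adjacent categories are penalised less than distant ones, which is the reason for weighting ordinal items.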
An Evidential Reasoning-Based CREAM to Human Reliability Analysis in Maritime Accident Process.
Wu, Bing; Yan, Xinping; Wang, Yang; Soares, C Guedes
2017-10-01
This article proposes a modified cognitive reliability and error analysis method (CREAM) for estimating the human error probability in the maritime accident process on the basis of an evidential reasoning approach. This modified CREAM is developed to precisely quantify the linguistic variables of the common performance conditions and to overcome the problem of ignoring the uncertainty caused by incomplete information in the existing CREAM models. Moreover, this article views maritime accident development from the sequential perspective, where a scenario- and barrier-based framework is proposed to describe the maritime accident process. This evidential reasoning-based CREAM approach together with the proposed accident development framework are applied to human reliability analysis of a ship capsizing accident. It will facilitate subjective human reliability analysis in different engineering systems where uncertainty exists in practice. © 2017 Society for Risk Analysis.
Reliability Assessment Approach for Stirling Convertors and Generators
NASA Technical Reports Server (NTRS)
Shah, Ashwin R.; Schreiber, Jeffrey G.; Zampino, Edward; Best, Timothy
2004-01-01
Stirling power conversion is being considered for use in a Radioisotope Power System for deep-space science missions because it offers a multifold increase in the conversion efficiency of heat to electric power. Quantifying the reliability of a Radioisotope Power System that utilizes Stirling power conversion technology is important in developing and demonstrating the capability for long-term success. A description of the Stirling power convertor is provided, along with a discussion about some of the key components. Ongoing efforts to understand component life, design variables at the component and system levels, related sources, and the nature of uncertainties are discussed. The requirement for reliability also is discussed, and some of the critical areas of concern are identified. A section on the objectives of the performance model development and a computation of reliability is included to highlight the goals of this effort. Also, a viable physics-based reliability plan to model the design-level variable uncertainties at the component and system levels is outlined, and potential benefits are elucidated. The plan involves the interaction of different disciplines, maintaining the physical and probabilistic correlations at all the levels, and a verification process based on rational short-term tests. In addition, both top-down and bottom-up coherency were maintained to follow the physics-based design process and mission requirements. The outlined reliability assessment approach provides guidelines to improve the design and identifies governing variables to achieve high reliability in the Stirling Radioisotope Generator design.
Sevdalis, Nick; Undre, Shabnam; Henry, Janet; Sydney, Elaine; Koutantji, Mary; Darzi, Ara; Vincent, Charles A
2009-09-01
The recent emergence of the Systems Approach to the safety and quality of surgical care has triggered individual and team skills training modules for surgeons and anaesthetists, and relevant observational assessment tools have been developed. The aim was to develop an observational tool that captures operating room (OR) nurses' technical skill and can be used for assessment and training. The Imperial College Assessment of Technical Skills for Nurses (ICATS-N) assesses (i) gowning and gloving, (ii) setting up instrumentation, (iii) draping, and (iv) maintaining sterility. Three to five observable behaviours have been identified for each skill and are rated on 1-6 scales. Feasibility and aspects of reliability and validity were assessed in 20 simulation-based crisis management training modules for trainee nurses and doctors, carried out in a Simulated Operating Room. The tool was feasible to use in the context of simulation-based training. Satisfactory reliability (Cronbach's alpha) was obtained across trainers' and trainees' scores (analysed jointly and separately). Moreover, trainer nurses' ratings of the four skills correlated positively, thus indicating adequate content validity. Trainers' and trainees' ratings did not correlate. Assessment of OR nurses' technical skill is becoming a training priority. The present evidence suggests that the ICATS-N could be considered for use as an assessment/training tool for junior OR nurses.
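The reliability statistic named in this abstract, Cronbach's alpha, can be illustrated with a minimal sketch. The function name `cronbach_alpha` and the `ratings` values below are hypothetical and are not data from the study; the abstract does not report how alpha was computed, so this uses the standard textbook formula, alpha = k/(k-1) * (1 - sum of item variances / variance of total scores), with sample variances.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a table of ratings.

    scores: list of rows, one row per observed session; each row holds
    the k item ratings given in that session (hypothetical layout).
    """
    if len(scores) < 2:
        raise ValueError("need at least two rows to estimate variances")
    k = len(scores[0])
    if k < 2:
        raise ValueError("need at least two items")
    if any(len(row) != k for row in scores):
        raise ValueError("every row must contain the same number of items")

    # Sample variance (n - 1 denominator) of each item across sessions.
    item_vars = [variance([row[i] for row in scores]) for i in range(k)]
    # Sample variance of the per-session total score.
    total_var = variance([sum(row) for row in scores])
    if total_var == 0:
        raise ValueError("total scores have zero variance")

    return k / (k - 1) * (1 - sum(item_vars) / total_var)


# Made-up ratings on a 1-6 scale: four sessions, three items each.
ratings = [
    [4, 5, 4],
    [3, 3, 2],
    [5, 6, 5],
    [2, 2, 3],
]
alpha = cronbach_alpha(ratings)
print(f"Cronbach's alpha: {alpha:.3f}")
```

Values closer to 1 indicate that the items vary together across sessions; what counts as "satisfactory" is a convention that the abstract does not specify.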
ERIC Educational Resources Information Center
Srsen, Katja Groleger; Vidmar, Gaj; Pikl, Masa; Vrecar, Irena; Burja, Cirila; Krusec, Klavdija
2012-01-01
The Halliwick concept is widely used in different settings to promote joyful movement in water and swimming. To assess the swimming skills and progression of an individual swimmer, a valid and reliable measure should be used. The Halliwick-concept-based Swimming with Independent Measure (SWIM) was introduced for this purpose. We aimed to determine…