reliability demonstration testing: Topics by Science.gov

Sample records for reliability demonstration testing

Overview of RICOR's reliability theoretical analysis, accelerated life demonstration test results and verification by field data

NASA Astrophysics Data System (ADS)

Vainshtein, Igor; Baruch, Shlomi; Regev, Itai; Segal, Victor; Filis, Avishai; Riabzev, Sergey

2018-05-01

The growing demand for EO applications that operate around the clock, 24 hours a day and 7 days a week, such as border surveillance systems, emphasizes the need for a highly reliable cryocooler with increased operational availability and optimized Integrated Logistic Support (ILS) for the system. To meet this need, RICOR developed linear and rotary cryocoolers that successfully achieved this goal. Cryocooler MTTF was analyzed by theoretical reliability evaluation methods, demonstrated by normal and accelerated life tests at the cryocooler level, and finally verified by analysis of field data from cryocoolers operating at the system level. This paper reviews the theoretical reliability analysis methods and analyzes reliability test results from standard and accelerated life demonstration tests performed at RICOR's advanced reliability laboratory. To summarize the work process, reliability verification data are presented as feedback from fielded systems.
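As a rough illustration of how a life test can demonstrate an MTTF, the sketch below computes a one-sided lower confidence bound after a failure-free test. It assumes exponentially distributed lifetimes, zero failures, and a known acceleration factor; none of these assumptions, nor the function name or the numbers, come from the RICOR paper.

```python
import math


def mttf_lower_bound_zero_failures(test_hours, acceleration_factor, confidence):
    """One-sided lower confidence bound on MTTF after a failure-free test.

    Assumes exponential lifetimes (an assumption of this sketch). With zero
    failures, the chi-square bound 2T / chi2(confidence, 2 dof) reduces to
    T / -ln(1 - confidence), so no statistics library is needed.
    """
    if test_hours <= 0 or acceleration_factor <= 0 or not 0 < confidence < 1:
        raise ValueError("hours and factor must be positive, confidence in (0, 1)")
    # Accelerated hours are converted to equivalent hours at use conditions.
    equivalent_hours = test_hours * acceleration_factor
    return equivalent_hours / -math.log(1.0 - confidence)


# Hypothetical numbers: 10,000 cumulative device-hours, acceleration factor 3.
bound = mttf_lower_bound_zero_failures(10_000, 3.0, 0.90)
print(f"MTTF lower bound at 90% confidence: {bound:.0f} h")
```

The bound scales linearly with the acceleration factor, which is why the factor itself needs independent justification before the result is trusted.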
Reliability of Single-Leg Balance and Landing Tests in Rugby Union; Prospect of Using Postural Control to Monitor Fatigue

PubMed Central

Troester, Jordan C.; Jasmin, Jason G.; Duffield, Rob

2018-01-01

The present study examined the inter-trial (within test) and inter-test (between test) reliability of single-leg balance and single-leg landing measures performed on a force plate in professional rugby union players using commercially available software (SpartaMARS, Menlo Park, USA). Twenty-four players undertook test – re-test measures on two occasions (7 days apart) on the first training day of two respective pre-season weeks following 48h rest and similar weekly training loads. Two 20s single-leg balance trials were performed on a force plate with eyes closed. Three single-leg landing trials were performed by jumping off two feet and landing on one foot in the middle of a force plate 1m from the starting position. Single-leg balance results demonstrated acceptable inter-trial reliability (ICC = 0.60-0.81, CV = 11-13%) for sway velocity, anterior-posterior sway velocity, and mediolateral sway velocity variables. Acceptable inter-test reliability (ICC = 0.61-0.89, CV = 7-13%) was evident for all variables except mediolateral sway velocity on the dominant leg (ICC = 0.41, CV = 15%). Single-leg landing results only demonstrated acceptable inter-trial reliability for force based measures of relative peak landing force and impulse (ICC = 0.54-0.72, CV = 9-15%). Inter-test results indicate improved reliability through the averaging of three trials with force based measures again demonstrating acceptable reliability (ICC = 0.58-0.71, CV = 7-14%). Of the variables investigated here, total sway velocity and relative landing impulse are the most reliable measures of single-leg balance and landing performance, respectively. These measures should be considered for monitoring potential changes in postural control in professional rugby union. Key points Single-leg balance demonstrated acceptable inter-trial and inter-test reliability. 
Single-leg landing demonstrated good inter-trial and inter-test reliability for measures of relative peak landing force and relative impulse, but not time to stabilization. Of the variables investigated, sway velocity and relative landing impulse are the most reliable measures of single-leg balance and landing respectively, and should be considered for monitoring changes in postural control. PMID:29769817
Multiple objective optimization in reliability demonstration test

DOE PAGES

Lu, Lu; Anderson-Cook, Christine Michaela; Li, Mingyang

2016-10-01

Reliability demonstration tests are usually performed in product design or validation processes to demonstrate whether a product meets specified requirements on reliability. For binomial demonstration tests, the zero-failure test has been most commonly used due to its simplicity and use of minimum sample size to achieve an acceptable consumer’s risk level. However, this test can often result in unacceptably high risk for producers as well as a low probability of passing the test even when the product has good reliability. This paper explicitly explores the interrelationship between multiple objectives that are commonly of interest when planning a demonstration test and proposes structured decision-making procedures using a Pareto front approach for selecting an optimal test plan based on simultaneously balancing multiple criteria. Different strategies are suggested for scenarios with different user priorities and graphical tools are developed to help quantify the trade-offs between choices and to facilitate informed decision making. As a result, potential impacts of some subjective user inputs on the final decision are studied to offer insights and useful guidance for general applications.
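The trade-off described in this abstract can be made concrete with the standard zero-failure binomial plan. The sketch below is not the paper's Pareto-front procedure; it only shows the textbook sample-size rule and the resulting chance of passing. The function names and numeric targets are illustrative assumptions.

```python
import math


def zero_failure_sample_size(reliability_target, consumer_risk):
    """Smallest n with reliability_target ** n <= consumer_risk.

    If the true reliability is only at the target, n failure-free units
    occur with probability R**n, so this caps the consumer's risk.
    """
    if not (0 < reliability_target < 1 and 0 < consumer_risk < 1):
        raise ValueError("both arguments must lie strictly between 0 and 1")
    return math.ceil(math.log(consumer_risk) / math.log(reliability_target))


def pass_probability(true_reliability, n):
    """Probability that all n tested units survive (zero-failure plan)."""
    return true_reliability ** n


n = zero_failure_sample_size(0.90, 0.10)   # demonstrate R >= 0.90 at 10% risk
print(n)                                    # 22
print(pass_probability(0.95, n))            # chance a product with R = 0.95 passes
```

With these hypothetical targets, a product whose true reliability is 0.95 passes only about a third of the time, which is the high producer's risk the abstract refers to.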
Vestibular Assessments in Children With Global Developmental Delay: An Exploratory Study.

PubMed

Dannenbaum, Elizabeth; Horne, Victoria; Malik, Farwa; Villeneuve, Myriam; Salvo, Lora; Chilingaryan, Gevorg; Lamontagne, Anouk

2016-01-01

To compare results of 3 clinical vestibular tests between children with global developmental delay (GDD) and children with typical development (TD) and investigate the test-retest reliability. Twenty children with GDD (aged 4.1-12.1 years) and 11 age-matched controls with TD participated. Participants with GDD underwent 2 sessions of testing. Each session consisted of the Clinical Test of Sensory Interaction and Balance (CTSIB), Dynamic Visual Acuity (DVA) test, and the modified Emory Clinical Vestibular Chair Test (m-ECVCT). Up to 33% of the children with GDD had abnormal DVA scores. m-ECVCT results of children with GDD demonstrated larger variance than children with TD. The CTSIB score was significantly reduced in the group with GDD. The test-retest reliability varied, with good reliability for the m-ECVCT and CTSIB, and fair reliability for the DVA. Findings suggest vestibular involvement in children in GDD. The clinical tests demonstrated moderate test-retest reliability.
Reliability demonstration test for load-sharing systems with exponential and Weibull components

PubMed Central

Xu, Jianyu; Hu, Qingpei; Yu, Dan; Xie, Min

2017-01-01

Conducting a Reliability Demonstration Test (RDT) is a crucial step in production. Products are tested under certain schemes to demonstrate whether their reliability indices reach pre-specified thresholds. Test schemes for RDT have been studied in different situations, e.g., lifetime testing, degradation testing and accelerated testing. Systems designed with several structures are also investigated in many RDT plans. Despite the availability of a range of test plans for different systems, RDT planning for load-sharing systems hasn’t yet received the attention it deserves. In this paper, we propose a demonstration method for two specific types of load-sharing systems with components subject to two distributions: exponential and Weibull. Based on the assumptions and interpretations made in several previous works on such load-sharing systems, we set the mean time to failure (MTTF) of the total system as the demonstration target. We represent the MTTF as a summation of mean time between successive component failures. Next, we introduce generalized test statistics for both the underlying distributions. Finally, RDT plans for the two types of systems are established on the basis of these test statistics. PMID:29284030
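The statement that system MTTF is a sum of mean times between successive component failures can be sketched for the exponential case. The model below is an assumption made for illustration (identical components, equal load sharing, system failure when the last component fails) and is not necessarily the exact model of the paper; the rates are hypothetical.

```python
def load_sharing_mttf(rates_per_stage):
    """MTTF of an n-component load-sharing system with exponential lifetimes.

    rates_per_stage[i] is the failure rate of EACH surviving component after
    i components have failed. With n - i survivors, the time to the next
    failure is exponential with rate (n - i) * rates_per_stage[i], so the
    system MTTF is the sum of the stage means.
    """
    n = len(rates_per_stage)
    if n == 0 or any(rate <= 0 for rate in rates_per_stage):
        raise ValueError("need at least one strictly positive rate")
    return sum(1.0 / ((n - i) * rate) for i, rate in enumerate(rates_per_stage))


# Hypothetical rates (per hour) that double each time a component fails.
print(load_sharing_mttf([0.001, 0.002, 0.004]))
```

Because each survivor carries more load after a failure, the later stages contribute less to the total than they would in an ordinary parallel system with constant rates.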
A reliability as an independent variable (RAIV) methodology for optimizing test planning for liquid rocket engines

NASA Astrophysics Data System (ADS)

Strunz, Richard; Herrmann, Jeffrey W.

2011-12-01

The hot fire test strategy for liquid rocket engines has always been a concern of space industry and agency alike because no recognized standard exists. Previous hot fire test plans focused on the verification of performance requirements but did not explicitly include reliability as a dimensioning variable. The stakeholders are, however, concerned about a hot fire test strategy that balances reliability, schedule, and affordability. A multiple criteria test planning model is presented that provides a framework to optimize the hot fire test strategy with respect to stakeholder concerns. The Staged Combustion Rocket Engine Demonstrator, a program of the European Space Agency, is used as example to provide the quantitative answer to the claim that a reduced thrust scale demonstrator is cost beneficial for a subsequent flight engine development. Scalability aspects of major subsystems are considered in the prior information definition inside the Bayesian framework. The model is also applied to assess the impact of an increase of the demonstrated reliability level on schedule and affordability.
Reliability and validity of the test of incremental respiratory endurance measures of inspiratory muscle performance in COPD.

PubMed

Formiga, Magno F; Roach, Kathryn E; Vital, Isabel; Urdaneta, Gisel; Balestrini, Kira; Calderon-Candelario, Rafael A; Campos, Michael A; Cahalin, Lawrence P

2018-01-01

The Test of Incremental Respiratory Endurance (TIRE) provides a comprehensive assessment of inspiratory muscle performance by measuring maximal inspiratory pressure (MIP) over time. The integration of MIP over inspiratory duration (ID) provides the sustained maximal inspiratory pressure (SMIP). Evidence on the reliability and validity of these measurements in COPD is not currently available. Therefore, we assessed the reliability, responsiveness and construct validity of the TIRE measures of inspiratory muscle performance in subjects with COPD. Test-retest reliability, known-groups and convergent validity assessments were implemented simultaneously in 81 male subjects with mild to very severe COPD. TIRE measures were obtained using the portable PrO2 device, following standard guidelines. All TIRE measures were found to be highly reliable, with SMIP demonstrating the strongest test-retest reliability with a nearly perfect intraclass correlation coefficient (ICC) of 0.99, while MIP and ID clustered closely together behind SMIP with ICC values of about 0.97. Our findings also demonstrated known-groups validity of all TIRE measures, with SMIP and ID yielding larger effect sizes when compared to MIP in distinguishing between subjects of different COPD status. Finally, our analyses confirmed convergent validity for both SMIP and ID, but not MIP. The TIRE measures of MIP, SMIP and ID have excellent test-retest reliability and demonstrated known-groups validity in subjects with COPD. SMIP and ID also demonstrated evidence of moderate convergent validity and appear to be more stable measures in this patient population than the traditional MIP.
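The relationship between MIP, ID, and SMIP described above can be sketched as numerical integration of a pressure trace. The sampling scheme, units, and values below are hypothetical, and the PrO2 device may compute SMIP differently.

```python
def tire_measures(times_s, pressures):
    """Return (MIP, ID, SMIP) from one sampled inspiratory effort.

    MIP  : peak pressure in the trace
    ID   : inspiratory duration (last time minus first time)
    SMIP : area under the pressure-time curve (trapezoidal rule)
    """
    if len(times_s) != len(pressures) or len(times_s) < 2:
        raise ValueError("need two or more matching samples")
    smip = 0.0
    for i in range(1, len(times_s)):
        dt = times_s[i] - times_s[i - 1]
        smip += 0.5 * (pressures[i] + pressures[i - 1]) * dt
    return max(pressures), times_s[-1] - times_s[0], smip


# Hypothetical trace: pressure in cmH2O sampled once per second.
mip, duration, smip = tire_measures([0, 1, 2, 3], [0, 80, 60, 0])
print(mip, duration, smip)  # 80 3 140.0
```

Two efforts with the same peak pressure can have very different areas, which is one reason SMIP and MIP need not behave alike.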
Demonstrating the Safety and Reliability of a New System or Spacecraft: Incorporating Analyses and Reviews of the Design and Processing in Determining the Number of Tests to be Conducted

NASA Technical Reports Server (NTRS)

Vesely, William E.; Colon, Alfredo E.

2010-01-01

Design Safety/Reliability is associated with the probability of no failure-causing faults existing in a design. Confidence in the non-existence of failure-causing faults is increased by performing tests with no failure. Reliability-Growth testing requirements are based on initial assurance and fault detection probability. Using binomial tables generally gives too many required tests compared to reliability-growth requirements. Reliability-Growth testing requirements are based on reliability principles and factors and should be used.
Test-Retest Reliability of a Survey to Measure Transport-Related Physical Activity in Adults

ERIC Educational Resources Information Center

Badland, Hannah; Schofield, Grant

2006-01-01

The present research details test-retest reliability of a newly developed, telephone-administered TPA survey for adults. This instrument examines barriers, perceptions, and current travel behaviors to place of work/study and local convenience shops. Demonstrated test-retest reliability of the Active Friendly Environments-Transport-Related Physical…
Validity and reliability of the NAB Naming Test.

PubMed

Sachs, Bonnie C; Rush, Beth K; Pedraza, Otto

2016-05-01

Confrontation naming is commonly assessed in neuropsychological practice, but few standardized measures of naming exist and those that do are susceptible to the effects of education and culture. The Neuropsychological Assessment Battery (NAB) Naming Test is a 31-item measure used to assess confrontation naming. Despite adequate psychometric information provided by the test publisher, there has been limited independent validation of the test. In this study, we investigated the convergent and discriminant validity, internal consistency, and alternate forms reliability of the NAB Naming Test in a sample of adults (Form 1: n = 247, Form 2: n = 151) clinically referred for neuropsychological evaluation. Results indicate adequate-to-good internal consistency and alternate forms reliability. We also found strong convergent validity as demonstrated by relationships with other neurocognitive measures. We found preliminary evidence that the NAB Naming Test demonstrates a more pronounced ceiling effect than other commonly used measures of naming. To our knowledge, this represents the largest published independent validation study of the NAB Naming Test in a clinical sample. Our findings suggest that the NAB Naming Test demonstrates adequate validity and reliability and merits consideration in the test arsenal of clinical neuropsychologists.
Demonstration of the test-retest reliability and sensitivity of the Lower Limb Functional Index-10 as a measure of functional recovery post burn injury: a cross-sectional repeated measures study design.

PubMed

Ryland, Margaret E; Grisbrook, Tiffany L; Wood, Fiona M; Phillips, Michael; Edgar, Dale W

2016-01-01

Lower limb burns can significantly delay recovery of function. Measuring lower limb functional outcomes is challenging in the unique burn patient population and necessitates the use of reliable and valid tools. The aims of this study were to examine the test-retest reliability, sensitivity, and internal consistency of Sections 1 and 3 of the Lower Limb Functional Index-10 (LLFI-10) questionnaire for measuring functional ability in patients with lower limb burns over time. Twenty-nine adult patients who had sustained a lower limb burn injury in the previous 12 months completed the test-retest procedure of the study. In addition, the minimal detectable change (MDC) was calculated for Sections 1 and 3 of the LLFI-10. Section 1 is focused on the activity limitations experienced by patients with a lower limb disorder whereas Section 3 involves patients indicating their current percentage of pre-injury duties. Section 1 of the LLFI-10 demonstrated excellent test-retest reliability (intra-class correlation coefficient (ICC) 0.98, 95% CI 0.96-0.99) whilst Section 3 demonstrated high test-retest reliability (ICC 0.88, 95% CI 0.79-0.94). MDC scores for Sections 1 and 3 were 1.27 points and 30.22%, respectively. Internal consistency was demonstrated with a significant negative association (r_s = -0.83) between Sections 1 and 3 of the LLFI-10 (p < 0.001). This study demonstrates that Sections 1 and 3 of the LLFI-10 are reliable for measuring functional ability in patients who have sustained lower limb burns in the previous 12 months, and furthermore, Section 1 is sensitive to changes in patient function over time.
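A common way to obtain a minimal detectable change from a test-retest ICC is through the standard error of measurement. The sketch below uses that conventional formula, which may differ from the exact procedure in the study; the standard deviation is a hypothetical value, since the abstract does not report one.

```python
import math


def standard_error_of_measurement(sd, icc):
    """SEM = SD * sqrt(1 - ICC)."""
    if sd < 0 or not 0 <= icc <= 1:
        raise ValueError("sd must be non-negative and icc in [0, 1]")
    return sd * math.sqrt(1.0 - icc)


def mdc95(sd, icc):
    """MDC at 95% confidence = 1.96 * sqrt(2) * SEM."""
    return 1.96 * math.sqrt(2.0) * standard_error_of_measurement(sd, icc)


# Hypothetical between-subject SD of 10 points with an ICC of 0.98.
print(round(mdc95(10.0, 0.98), 2))  # 3.92
```

The factor sqrt(2) appears because a change score is the difference of two measurements, each carrying its own measurement error.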
Inter-Rater Reliability and Validity of the Australian Football League’s Kicking and Handball Tests

PubMed Central

Cripps, Ashley J.; Hopper, Luke S.; Joyce, Christopher

2015-01-01

Talent identification tests used at the Australian Football League’s National Draft Combine assess the capacities of athletes to compete at a professional level. Tests created for the National Draft Combine are also commonly used for talent identification and athlete development in development pathways. The skills tests created by the Australian Football League required players to either handball (striking the ball with the hand) or kick to a series of 6 randomly generated targets. Assessors subjectively rate each skill execution giving a 0-5 score for each disposal. This study aimed to investigate the inter-rater reliability and validity of the skills tests at an adolescent sub-elite level. Male Australian footballers were recruited from sub-elite adolescent teams (n = 121, age = 15.7 ± 0.3 years, height = 1.77 ± 0.07 m, mass = 69.17 ± 8.08 kg). The coaches (n = 7) of each team were also recruited. Inter-rater reliability was assessed using intraclass correlation coefficients (ICC) and Limits of Agreement statistics. Both the kicking (ICC = 0.96, p < .01) and handball tests (ICC = 0.89, p < .01) demonstrated strong reliability and acceptable levels of absolute agreement. Content validity was determined by examining the test scores’ sensitivity to laterality and distance. Concurrent validity was assessed by comparing coaches’ perceptions of skill to actual test outcomes. Multivariate analysis of variance (MANOVA) examined the main effect of laterality, with scores on the dominant hand (p = .04) and foot (p < .01) significantly higher compared to the non-dominant side. Follow-up univariate analysis reported significant differences at every distance in the kicking test. A poor correlation was found between coaches’ perceptions of skill and testing outcomes. The results of this study show that both skill tests have acceptable inter-rater reliability.
Partial content validity was confirmed for the kicking test; however, further research is required to confirm validity of the handball test. Key points The skill tests created by the AFL demonstrated acceptable levels of relative and absolute inter-rater reliability. Both the AFL’s skills tests are able to differentiate between athletes’ dominant and non-dominant limbs. However, only the kicking test could consistently differentiate between score outcomes over a range of Australian Football specific disposal distances. Both tests demonstrated poor concurrent validity, with no correlation found between coaches’ perceptions of technical skills and actual skill outcomes measured. PMID:26336356
Reliability Estimation When a Test Is Split into Two Parts of Unknown Effective Length.

ERIC Educational Resources Information Center

Feldt, Leonard S.

2002-01-01

Considers the situation in which content or administrative considerations limit the way in which a test can be partitioned to estimate the internal consistency reliability of the total test score. Demonstrates that a single-valued estimate of the total score reliability is possible only if an assumption is made about the comparative size of the…
Processes and Procedures for Estimating Score Reliability and Precision

ERIC Educational Resources Information Center

Bardhoshi, Gerta; Erford, Bradley T.

2017-01-01

Precision is a key facet of test development, with score reliability determined primarily according to the types of error one wants to approximate and demonstrate. This article identifies and discusses several primary forms of reliability estimation: internal consistency (i.e., split-half, KR-20, α), test-retest, alternate forms, interscorer, and…
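Of the internal-consistency estimates listed, coefficient α is easy to sketch from its usual definition. The data layout (one row per respondent, one column per item) and the scores below are hypothetical and do not come from the article.

```python
import statistics


def cronbach_alpha(scores):
    """Coefficient alpha for a respondents-by-items score matrix.

    alpha = k / (k - 1) * (1 - sum(item variances) / variance(total scores)),
    using sample variances throughout.
    """
    n_items = len(scores[0])
    if n_items < 2 or len(scores) < 2:
        raise ValueError("need two or more items and two or more respondents")
    item_variances = [statistics.variance([row[j] for row in scores])
                      for j in range(n_items)]
    total_variance = statistics.variance([sum(row) for row in scores])
    return n_items / (n_items - 1) * (1.0 - sum(item_variances) / total_variance)


# Hypothetical data: three respondents, three perfectly consistent items.
print(cronbach_alpha([[1, 2, 3], [2, 3, 4], [3, 4, 5]]))
```

For dichotomous (0/1) items the same formula corresponds to KR-20 when population variances are used consistently; this sketch uses sample variances.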
Interhemispheric Inhibition Measurement Reliability in Stroke: A Pilot Study

PubMed Central

Cassidy, Jessica M.; Chu, Haitao; Chen, Mo; Kimberley, Teresa J.; Carey, James R.

2016-01-01

Objective: Reliable transcranial magnetic stimulation (TMS) measures for probing corticomotor excitability are important when assessing the physiological effects of non-invasive brain stimulation. The primary objective of this study was to examine test-retest reliability of an interhemispheric inhibition (IHI) index measurement in stroke. Materials and Methods: Ten subjects with chronic stroke (≥ 6 months) completed two IHI testing sessions per week for three weeks (six testing sessions total). A single investigator measured IHI in the contra-to-ipsilesional primary motor cortex direction and in the opposite direction using bilateral paired-pulse TMS. Weekly sessions were separated by 24 hours with a 1-week washout period separating testing weeks. To determine if motor-evoked potential (MEP) quantification method affected measurement reliability, IHI indices computed from both MEP amplitude and area responses were found. Reliability was assessed with two-way, mixed intraclass correlation coefficients (ICC(3,k)). Standard error of measurement and minimal detectable difference statistics were also determined. Results: With the exception of the initial testing week, IHI indices measured in the contra-to-ipsilesional hemisphere direction demonstrated moderate to excellent reliability (ICC = 0.725 – 0.913). Ipsi-to-contralesional IHI indices depicted poor or invalid reliability estimates throughout the three-week testing duration (ICC = −1.153 – 0.105). The overlap of ICC 95% confidence intervals suggested that IHI indices using MEP amplitude vs. area measures did not differ with respect to reliability. Conclusions: IHI indices demonstrated varying magnitudes of reliability irrespective of MEP quantification method. Several strategies for improving IHI index measurement reliability are discussed. PMID:27333364
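The negative ICC values reported above follow directly from how ICC(3,k) is defined. The sketch below computes the consistency-type ICC(3,1) and ICC(3,k) from a subjects-by-sessions table using the usual two-way ANOVA mean squares; the data are invented for illustration and are unrelated to the study.

```python
def icc3(scores):
    """Return (ICC(3,1), ICC(3,k)) for a subjects-by-sessions score table.

    MSR is the between-subjects mean square and MSE the residual mean square
    of a two-way ANOVA without replication:
        ICC(3,1) = (MSR - MSE) / (MSR + (k - 1) * MSE)
        ICC(3,k) = (MSR - MSE) / MSR
    MSR must be non-zero, i.e. subjects must differ in their mean score.
    """
    n, k = len(scores), len(scores[0])
    grand = sum(sum(row) for row in scores) / (n * k)
    row_means = [sum(row) / k for row in scores]
    col_means = [sum(row[j] for row in scores) / n for j in range(k)]
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for row in scores for x in row)
    msr = ss_rows / (n - 1)
    mse = (ss_total - ss_rows - ss_cols) / ((n - 1) * (k - 1))
    return (msr - mse) / (msr + (k - 1) * mse), (msr - mse) / msr


print(icc3([[1, 2], [2, 3], [3, 4]]))  # sessions differ only by a constant shift
print(icc3([[1, 4], [2, 2], [4, 1]]))  # subject ranking reverses between sessions
```

When the residual mean square exceeds the between-subjects mean square, both estimates become negative, and ICC(3,k) can fall below −1, as in the ipsi-to-contralesional results.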
Reliability of fitness tests using methods and time periods common in sport and occupational management.

PubMed

Burnstein, Bryan D; Steele, Russell J; Shrier, Ian

2011-01-01

Fitness testing is used frequently in many areas of physical activity, but the reliability of these measurements under real-world, practical conditions is unknown. To evaluate the reliability of specific fitness tests using the methods and time periods used in the context of real-world sport and occupational management. Cohort study. Eighteen different Cirque du Soleil shows. Cirque du Soleil physical performers who completed 4 consecutive tests (6-month intervals) and were free of injury or illness at each session (n = 238 of 701 physical performers). Performers completed 6 fitness tests on each assessment date: dynamic balance, Harvard step test, handgrip, vertical jump, pull-ups, and 60-second jump test. We calculated the intraclass correlation coefficient (ICC) and limits of agreement between baseline and each time point and the ICC over all 4 time points combined. Reliability was acceptable (ICC > 0.6) over an 18-month time period for all pairwise comparisons and all time points together for the handgrip, vertical jump, and pull-up assessments. The Harvard step test and 60-second jump test had poor reliability (ICC < 0.6) between baseline and other time points. When we excluded the baseline data and calculated the ICC for 6-month, 12-month, and 18-month time points, both the Harvard step test and 60-second jump test demonstrated acceptable reliability. Dynamic balance was unreliable in all contexts. Limit-of-agreement analysis demonstrated considerable intraindividual variability for some tests and a learning effect by administrators on others. Five of the 6 tests in this battery had acceptable reliability over an 18-month time frame, but the values for certain individuals may vary considerably from time to time for some tests. Specific tests may require a learning period for administrators.
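Limits of agreement complement the ICC by showing how far an individual's score may move between sessions. The sketch below uses the conventional Bland-Altman form (mean difference ± 1.96 standard deviations of the differences); the scores are hypothetical and the study's exact computation may differ.

```python
import statistics


def limits_of_agreement(baseline, follow_up):
    """Return (bias, lower_limit, upper_limit) for paired measurements."""
    if len(baseline) != len(follow_up) or len(baseline) < 2:
        raise ValueError("need two or more paired measurements")
    differences = [after - before for before, after in zip(baseline, follow_up)]
    bias = statistics.mean(differences)
    spread = 1.96 * statistics.stdev(differences)  # sample SD of the differences
    return bias, bias - spread, bias + spread


# Hypothetical handgrip scores (kg) for four performers at two sessions.
bias, lower, upper = limits_of_agreement([10, 12, 14, 16], [11, 12, 15, 18])
print(bias, round(lower, 2), round(upper, 2))
```

A non-zero bias can indicate a systematic shift such as a learning effect, while wide limits indicate large intraindividual variability even when the ICC is acceptable.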
Evaluation of Environmental Profiles for Reliability Demonstration

DTIC Science & Technology

1975-09-01

... the increase in the ram air flow rate. As a result, one cannot generalize in advance about the effect of velocity increase on air-conditioner turbine ... Forced Cooling Air Temperature/Flow Schedule ... Sample Test Profile ... Profiles for Multi... Profiles for Reliability Demonstration ... Study Flow ... Typical MIL-STD-781 Profile ... Test Cycle A - Ambient Cooled
Reliability Quantification of Advanced Stirling Convertor (ASC) Components

NASA Technical Reports Server (NTRS)

Shah, Ashwin R.; Korovaichuk, Igor; Zampino, Edward

2010-01-01

The Advanced Stirling Convertor is intended to provide power for an unmanned planetary spacecraft and has an operational life requirement of 17 years. Over this 17 year mission, the ASC must provide power with desired performance and efficiency and require no corrective maintenance. Reliability demonstration testing for the ASC was found to be very limited due to schedule and resource constraints. Reliability demonstration must involve the application of analysis, system and component level testing, and simulation models, taken collectively. Therefore, computer simulation with limited test data verification is a viable approach to assess the reliability of ASC components. This approach is based on physics-of-failure mechanisms and involves the relationship among the design variables based on physics, mechanics, material behavior models, interaction of different components and their respective disciplines such as structures, materials, fluid, thermal, mechanical, electrical, etc. In addition, these models are based on the available test data, which can be updated, and analysis refined as more data and information become available. The failure mechanisms and causes of failure are included in the analysis, especially in light of the new information, in order to develop guidelines to improve design reliability and better operating controls to reduce the probability of failure. Quantified reliability assessment based on fundamental physical behavior of components and their relationship with other components has demonstrated itself to be a superior technique to conventional reliability approaches based on utilizing failure rates derived from similar equipment or simply expert judgment.

Reliability of Measurement of Glenohumeral Internal Rotation, External Rotation, and Total Arc of Motion in 3 Test Positions

PubMed Central

Kevern, Mark A.; Beecher, Michael; Rao, Smita

2014-01-01

Context: Athletes who participate in throwing and racket sports consistently demonstrate adaptive changes in glenohumeral-joint internal and external rotation in the dominant arm. Measurements of these motions have demonstrated excellent intrarater and poor interrater reliability. Objective: To determine intrarater reliability, interrater reliability, and standard error of measurement for shoulder internal rotation, external rotation, and total arc of motion using an inclinometer in 3 testing procedures in National Collegiate Athletic Association Division I baseball and softball athletes. Design: Cross-sectional study. Setting: Athletic department. Patients or Other Participants: Thirty-eight players participated in the study. Shoulder internal rotation, external rotation, and total arc of motion were measured by 2 investigators in 3 test positions. The standard supine position was compared with a side-lying test position, as well as a supine test position without examiner overpressure. Results: Excellent intrarater reliability was noted for all 3 test positions and ranges of motion, with intraclass correlation coefficient values ranging from 0.93 to 0.99. Results for interrater reliability were less favorable. Reliability for internal rotation was highest in the side-lying position (0.68) and reliability for external rotation and total arc was highest in the supine-without-overpressure position (0.774 and 0.713, respectively). The supine-with-overpressure position yielded the lowest interrater reliability results in all positions. The side-lying position had the most consistent results, with very little variation among intraclass correlation coefficient values for the various test positions. Conclusions: The results of our study clearly indicate that the side-lying test procedure is of equal or greater value than the traditional supine-with-overpressure method. PMID:25188316
Validity and Reliability of the 8-Item Work Limitations Questionnaire.

PubMed

Walker, Timothy J; Tullar, Jessica M; Diamond, Pamela M; Kohl, Harold W; Amick, Benjamin C

2017-12-01

Purpose To evaluate factorial validity, scale reliability, test-retest reliability, convergent validity, and discriminant validity of the 8-item Work Limitations Questionnaire (WLQ) among employees from a public university system. Methods A secondary analysis using de-identified data from employees who completed an annual Health Assessment between the years 2009-2015 tested research aims. Confirmatory factor analysis (CFA) (n = 10,165) tested the latent structure of the 8-item WLQ. Scale reliability was determined using a CFA-based approach while test-retest reliability was determined using the intraclass correlation coefficient. Convergent/discriminant validity was tested by evaluating relations between the 8-item WLQ with health/performance variables for convergent validity (health-related work performance, number of chronic conditions, and general health) and demographic variables for discriminant validity (gender and institution type). Results A 1-factor model with three correlated residuals demonstrated excellent model fit (CFI = 0.99, TLI = 0.99, RMSEA = 0.03, and SRMR = 0.01). The scale reliability was acceptable (0.69, 95% CI 0.68-0.70) and the test-retest reliability was very good (ICC = 0.78). Low-to-moderate associations were observed between the 8-item WLQ and the health/performance variables while weak associations were observed between the demographic variables. Conclusions The 8-item WLQ demonstrated sufficient reliability and validity among employees from a public university system. Results suggest the 8-item WLQ is a usable alternative for studies when the more comprehensive 25-item WLQ is not available.
Developing an oropharyngeal cancer (OPC) knowledge and behaviors survey.

PubMed

Dodd, Virginia J; Riley III, Joseph L; Logan, Henrietta L

2012-09-01

To use the community participation research model to (1) develop a survey assessing knowledge about mouth and throat cancer and (2) field test the newly developed instrument and establish its test-retest reliability. Cognitive interviews were conducted with primarily rural African American adults to assess their perception and interpretation of survey items. Test-retest reliability was established with a racially diverse rural population. Test-retest reliabilities ranged from .40 to .79 for screening awareness and from .19 to .74 for knowledge. Coefficients increased for composite scores. Community participation methodology provided a culturally appropriate survey instrument that demonstrated acceptable levels of reliability.
Manual unloading of the lumbar spine: can it identify immediate responders to mechanical traction in a low back pain population? A study of reliability and criterion referenced predictive validity

PubMed Central

Swanson, Brian T.; Riley, Sean P.; Cote, Mark P.; Leger, Robin R.; Moss, Isaac L.; Carlos, John

2016-01-01

Background To date, no research has examined the reliability or predictive validity of manual unloading tests of the lumbar spine to identify potential responders to lumbar mechanical traction. Purpose To determine: (1) the intra and inter-rater reliability of a manual unloading test of the lumbar spine and (2) the criterion referenced predictive validity for the manual unloading test. Methods Ten volunteers with low back pain (LBP) underwent a manual unloading test to establish reliability. In a separate procedure, 30 consecutive patients with LBP (age 50·86±11·51) were assessed for pain in their most provocative standing position (visual analog scale (VAS) 49·53±25·52 mm). Patients were assessed with a manual unloading test in their most provocative position followed by a single application of intermittent mechanical traction. Post traction, pain in the provocative position was reassessed and utilized as the outcome criterion. Results The test of unloading demonstrated substantial intra and inter-rater reliability K = 1·00, P = 0·002, K = 0·737, P = 0·001, respectively. There were statistically significant within group differences for pain response following traction for patients with a positive manual unloading test (P<0·001), while patients with a negative manual unloading test did not demonstrate a statistically significant change (P>0·05). There were significant between group differences for proportion of responders to traction based on manual unloading response (P = 0·031), and manual unloading response demonstrated a moderate to strong relationship with traction response Phi = 0·443, P = 0·015. Discussion and conclusion The manual unloading test appears to be a reliable test and has a moderate to strong correlation with pain relief that exceeds minimal clinically important difference (MCID) following traction supporting the validity of this test. PMID:27559274
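As a rough consistency check on the phi coefficient reported in the abstract above: for a 2×2 table, phi and the Pearson chi-square statistic are related by phi² = chi²/n. The sketch below assumes the 30-patient sample and an uncorrected chi-square test with 1 degree of freedom; the abstract does not state which test produced the P value, so this is an illustration rather than the authors' procedure. The function name is hypothetical.

```python
import math


def phi_to_p_value(phi: float, n: int) -> float:
    """Two-sided p-value implied by a 2x2 phi coefficient.

    Assumes an uncorrected Pearson chi-square, so chi2 = n * phi**2 with 1 df.
    """
    chi2 = n * phi ** 2
    # Survival function of chi-square with 1 df: P(X > x) = erfc(sqrt(x / 2)).
    return math.erfc(math.sqrt(chi2 / 2.0))


# Values taken from the abstract: Phi = 0.443, 30 consecutive patients.
p = phi_to_p_value(0.443, 30)
print(round(p, 3))
```

Under these assumptions the result lands close to the reported P = 0·015, which suggests the phi value and its P value are at least mutually consistent.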
Comparison of three instruments for measuring patient anxiety in a coronary care unit.

PubMed

Elliott, D

1993-09-01

This paper compares the State-Trait Anxiety Inventory (STAI), Hospital Anxiety and Depression Scale (HAD Scale) and a Linear Analogue Anxiety Scale (LAAS) for evaluating anxiety in patients with acute ischaemic heart disease. The instruments were examined for correlation, reliability and internal consistency. Strong associations were demonstrated at pre-test between the STAI and the other scales. Moderate coefficients between HAD-A and HAD-D/LAAS were also apparent. Lower correlations were found at post-test than at pre-test. At post-test, strong inter-correlations occurred for STAI/LAAS. The HAD Scale demonstrated high test-retest reliability, while the STAI and LAAS were moderate in their reliability in this sample. The adequate correlation between the instruments suggests that each is a valid and appropriate measure of anxiety in this clinical sample.
Technology Demonstration Summary Technology Evaluation Report, Site Demonstration Test, Hazcon Solidification, Douglassville, Pennsylvania

EPA Science Inventory

The major objective of the HAZCON Solidification SITE Program Demonstration Test was to develop reliable performance and cost information. The demonstration occurred at a 50-acre site of a former oil reprocessing plant at Douglassville, PA containing a wide range of organic...
Expert Reliability for the World Health Organization Standardized Ultrasound Classification of Cystic Echinococcosis

PubMed Central

Solomon, Nadia; Fields, Paul J.; Tamarozzi, Francesca; Brunetti, Enrico; Macpherson, Calum N. L.

2017-01-01

Cystic echinococcosis (CE), a parasitic zoonosis, results in cyst formation in the viscera. Cyst morphology depends on developmental stage. In 2003, the World Health Organization (WHO) published a standardized ultrasound (US) classification for CE, for use among experts as a standard of comparison. This study examined the reliability of this classification. Eleven international CE and US experts completed an assessment of eight WHO classification images and 88 test images representing cyst stages. Inter- and intraobserver reliability and observer performance were assessed using Fleiss' and Cohen's kappa. Interobserver reliability was moderate for WHO images (κ = 0.600, P < 0.0001) and substantial for test images (κ = 0.644, P < 0.0001), with substantial to almost perfect interobserver reliability for stages with pathognomonic signs (CE1, CE2, and CE3) for WHO (0.618 < κ < 0.904) and test images (0.642 < κ < 0.768). Comparisons of expert performances against the majority classification for each image were significant for WHO (0.413 < κ < 1.000, P < 0.005) and test images (0.718 < κ < 0.905, P < 0.0001); and intraobserver reliability was significant for WHO (0.520 < κ < 1.000, P < 0.005) and test images (0.690 < κ < 0.896, P < 0.0001). Findings demonstrate moderate to substantial interobserver and substantial to almost perfect intraobserver reliability for the WHO classification, with substantial to almost perfect interobserver reliability for pathognomonic stages. This confirms experts' abilities to reliably identify WHO-defined pathognomonic signs of CE, demonstrating that the WHO classification provides a reproducible way of staging CE. PMID:28070008
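For readers unfamiliar with the statistic used in the abstract above, Cohen's kappa compares the observed agreement between two raters with the agreement expected by chance from their marginal frequencies. The following is a minimal sketch in plain Python; the stage labels and ratings are invented for illustration and are not data from the study, and the multi-rater Fleiss' kappa is not covered here.

```python
from collections import Counter


def cohens_kappa(rater_a, rater_b):
    """Cohen's kappa for two raters assigning nominal labels to the same items."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("both raters must label the same non-empty set of items")
    n = len(rater_a)
    # Proportion of items on which the two raters agree.
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Chance agreement from each rater's marginal label frequencies.
    counts_a = Counter(rater_a)
    counts_b = Counter(rater_b)
    expected = sum(counts_a[label] * counts_b[label] for label in counts_a) / (n * n)
    if expected == 1.0:
        raise ValueError("kappa is undefined when chance agreement equals 1")
    return (observed - expected) / (1.0 - expected)


# Hypothetical ratings of four images by two observers.
observer_1 = ["CE1", "CE1", "CE2", "CE2"]
observer_2 = ["CE1", "CE1", "CE2", "CE1"]
print(cohens_kappa(observer_1, observer_2))
```

Here the raters agree on 3 of 4 images (0.75) while chance agreement is 0.5, so kappa is 0.5.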
Development and positioning reliability of a TMS coil holder for headache research.

PubMed

Chronicle, Edward P; Pearson, A Jane; Matthews, Cheryl

2005-01-01

Accurate and reproducible coil positioning is important for headache research using transcranial magnetic stimulation protocols. We aimed to design a transcranial magnetic stimulation coil holder and demonstrate reliability of test-retest coil positioning. A coil holder was developed and manufactured according to three principles of stability, durability, and three-dimensional positional accuracy. Reliability of coil positioning was assessed by stimulating over the motor cortex of four neurologically normal subjects and recording finger muscle responses, both at a test phase and a retest phase several hours later. In all four subjects, repositioning of the transcranial magnetic stimulation coil solely on the basis of coil holder coordinates was accurate to within 2 mm. The coil holder demonstrated good test-retest reliability of coil positioning, and is thus a promising tool for transcranial magnetic stimulation-based headache research, particularly studies of prophylactic drug effect where several laboratory visits with identical coil positioning are necessary.
Reliability of Fitness Tests Using Methods and Time Periods Common in Sport and Occupational Management

PubMed Central

Burnstein, Bryan D.; Steele, Russell J.; Shrier, Ian

2011-01-01

Context: Fitness testing is used frequently in many areas of physical activity, but the reliability of these measurements under real-world, practical conditions is unknown. Objective: To evaluate the reliability of specific fitness tests using the methods and time periods used in the context of real-world sport and occupational management. Design: Cohort study. Setting: Eighteen different Cirque du Soleil shows. Patients or Other Participants: Cirque du Soleil physical performers who completed 4 consecutive tests (6-month intervals) and were free of injury or illness at each session (n = 238 of 701 physical performers). Intervention(s): Performers completed 6 fitness tests on each assessment date: dynamic balance, Harvard step test, handgrip, vertical jump, pull-ups, and 60-second jump test. Main Outcome Measure(s): We calculated the intraclass correlation coefficient (ICC) and limits of agreement between baseline and each time point and the ICC over all 4 time points combined. Results: Reliability was acceptable (ICC > 0.6) over an 18-month time period for all pairwise comparisons and all time points together for the handgrip, vertical jump, and pull-up assessments. The Harvard step test and 60-second jump test had poor reliability (ICC < 0.6) between baseline and other time points. When we excluded the baseline data and calculated the ICC for 6-month, 12-month, and 18-month time points, both the Harvard step test and 60-second jump test demonstrated acceptable reliability. Dynamic balance was unreliable in all contexts. Limit-of-agreement analysis demonstrated considerable intraindividual variability for some tests and a learning effect by administrators on others. Conclusions: Five of the 6 tests in this battery had acceptable reliability over an 18-month time frame, but the values for certain individuals may vary considerably from time to time for some tests. Specific tests may require a learning period for administrators. PMID:22488138
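The limits-of-agreement analysis mentioned in the abstract above is commonly computed in the Bland-Altman style: the mean of the paired differences (bias) plus or minus 1.96 standard deviations of those differences. The sketch below assumes that convention; the abstract does not spell out the exact procedure, and the scores are invented for illustration.

```python
import statistics


def limits_of_agreement(baseline, follow_up):
    """Return (lower, bias, upper) 95% limits of agreement for paired scores."""
    if len(baseline) != len(follow_up) or len(baseline) < 2:
        raise ValueError("need at least two paired observations")
    diffs = [later - first for first, later in zip(baseline, follow_up)]
    bias = statistics.mean(diffs)      # systematic change between sessions
    spread = statistics.stdev(diffs)   # sample SD of the individual differences
    return bias - 1.96 * spread, bias, bias + 1.96 * spread


# Hypothetical handgrip scores for four performers at two assessments.
lower, bias, upper = limits_of_agreement([10, 12, 14, 16], [11, 12, 15, 18])
print(lower, bias, upper)
```

A wide interval with a small bias corresponds to the pattern the authors describe: acceptable group-level reliability alongside considerable variation for individual performers.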
Development and psychometric evaluation of an information literacy self-efficacy survey and an information literacy knowledge test.

PubMed

Tepe, Rodger; Tepe, Chabha

2015-03-01

To develop and psychometrically evaluate an information literacy (IL) self-efficacy survey and an IL knowledge test. In this test-retest reliability study, a 25-item IL self-efficacy survey and a 50-item IL knowledge test were developed and administered to a convenience sample of 53 chiropractic students. Item analyses were performed on all questions. The IL self-efficacy survey demonstrated good reliability (test-retest correlation = 0.81) and good/very good internal consistency (mean κ = .56 and Cronbach's α = .92). A total of 25 questions with the best item analysis characteristics were chosen from the 50-item IL knowledge test, resulting in a 25-item IL knowledge test that demonstrated good reliability (test-retest correlation = 0.87), very good internal consistency (mean κ = .69, KR20 = 0.85), and good item discrimination (mean point-biserial = 0.48). This study resulted in the development of three instruments: a 25-item IL self-efficacy survey, a 50-item IL knowledge test, and a 25-item IL knowledge test. The information literacy self-efficacy survey and the 25-item version of the information literacy knowledge test have shown preliminary evidence of adequate reliability and validity to justify continuing study with these instruments.
Reliability and Validity of Dual-Task Mobility Assessments in People with Chronic Stroke

PubMed Central

Yang, Lei; He, Chengqi; Pang, Marco Yiu Chung

2016-01-01

Background The ability to perform a cognitive task while walking simultaneously (dual-tasking) is important in real life. However, the psychometric properties of dual-task walking tests have not been well established in stroke. Objective To assess the test-retest reliability, concurrent and known-groups validity of various dual-task walking tests in people with chronic stroke. Design Observational measurement study with a test-retest design. Methods Eighty-eight individuals with chronic stroke participated. The testing protocol involved four walking tasks (walking forward at self-selected and maximal speed, walking backward at self-selected speed, and crossing over obstacles) performed simultaneously with each of the three attention-demanding tasks (verbal fluency, serial 3 subtractions or carrying a cup of water). For each dual-task condition, the time taken to complete the walking task, the correct response rate (CRR) of the cognitive task, and the dual-task effect (DTE) for the walking time and CRR were calculated. Forty-six of the participants were tested twice within 3–4 days to establish test-retest reliability. Results The walking time in various dual-task assessments demonstrated good to excellent reliability [intraclass correlation coefficient, ICC(2,1) = 0.70–0.93; relative minimal detectable change at 95% confidence level (MDC95%) = 29%–45%]. The reliability of the CRR [ICC(2,1) = 0.58–0.81] and the DTE in walking time [ICC(2,1) = 0.11–0.80] was more varied. The reliability of the DTE in CRR [ICC(2,1) = -0.31 to 0.40] was poor to fair. The walking time and CRR obtained in various dual-task walking tests were moderately to strongly correlated with those of the dual-task Timed-up-and-Go test, thus demonstrating good concurrent validity. None of the tests could discriminate fallers (those who had sustained at least one fall in the past year) from non-fallers. Limitation The results are generalizable to community-dwelling individuals with chronic stroke only. Conclusions The walking time derived from the various dual-task assessments generally demonstrated good to excellent reliability, making them potentially useful in clinical practice and future research endeavors. However, the usefulness of these measurements in predicting falls needs to be further explored. Relatively low reliability was shown in the cognitive outcomes and DTE, which may not be preferred measurements for assessing dual-task performance. PMID:26808662
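The ICC(2,1) and MDC95 figures in the abstract above can be made concrete with a short sketch. ICC(2,1) is the two-way random-effects, absolute-agreement, single-measure coefficient computed from ANOVA mean squares, and MDC95 is conventionally 1.96 × √2 × SEM. The code assumes those standard definitions; the ratings table is illustrative only and not data from the study, and the function names are hypothetical.

```python
import math


def icc_2_1(scores):
    """ICC(2,1) from a table with one row per subject and one column per session."""
    n = len(scores)
    k = len(scores[0])
    grand = sum(sum(row) for row in scores) / (n * k)
    row_means = [sum(row) / k for row in scores]
    col_means = [sum(row[j] for row in scores) / n for j in range(k)]

    # Two-way ANOVA sums of squares: subjects, sessions, and residual error.
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for row in scores for x in row)
    ss_error = ss_total - ss_rows - ss_cols

    ms_rows = ss_rows / (n - 1)
    ms_cols = ss_cols / (k - 1)
    ms_error = ss_error / ((n - 1) * (k - 1))
    return (ms_rows - ms_error) / (
        ms_rows + (k - 1) * ms_error + k * (ms_cols - ms_error) / n
    )


def mdc95(sem):
    """Minimal detectable change at 95% confidence from a standard error of measurement."""
    return 1.96 * math.sqrt(2.0) * sem


# Illustrative table: 6 subjects measured on 4 occasions.
ratings = [
    [9, 2, 5, 8],
    [6, 1, 3, 2],
    [8, 4, 6, 8],
    [7, 1, 2, 6],
    [10, 5, 6, 9],
    [6, 2, 4, 7],
]
print(round(icc_2_1(ratings), 2))
print(round(mdc95(1.0), 2))
```

Because ICC(2,1) penalizes systematic shifts between sessions, large column differences (as in this table) pull the coefficient down even when subjects keep their rank order. A relative MDC95, as reported in the abstract, would divide the result by the mean score and express it as a percentage.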
Practical Procedures for Constructing Mastery Tests to Minimize Errors of Classification and to Maximize or Optimize Decision Reliability.

ERIC Educational Resources Information Center

Byars, Alvin Gregg

The objectives of this investigation are to develop, describe, assess, and demonstrate procedures for constructing mastery tests to minimize errors of classification and to maximize decision reliability. The guidelines are based on conditions where item exchangeability is a reasonable assumption and the test constructor can control the number of…
Accuracy and Feasibility of Video Analysis for Assessing Hamstring Flexibility and Validity of the Sit-and-Reach Test

ERIC Educational Resources Information Center

Mier, Constance M.

2011-01-01

The accuracy of video analysis of the passive straight-leg raise test (PSLR) and the validity of the sit-and-reach test (SR) were tested in 60 men and women. Computer software measured static hip-joint flexion accurately. High within-session reliability of the PSLR was demonstrated (R greater than 0.97). Test-retest (separate days) reliability for…
Reliability of the Client-Centeredness of Goal Setting (C-COGS) Scale in Acquired Brain Injury Rehabilitation.

PubMed

Doig, Emmah; Prescott, Sarah; Fleming, Jennifer; Cornwell, Petrea; Kuipers, Pim

2016-01-01

To examine the internal reliability and test-retest reliability of the Client-Centeredness of Goal Setting (C-COGS) scale. The C-COGS scale was administered to 42 participants with acquired brain injury after completion of multidisciplinary goal planning. Internal reliability of scale items was examined using item-partial total correlations and Cronbach's α coefficient. The scale was readministered within a 1-mo period to a subsample of 12 participants to examine test-retest reliability by calculating exact and close percentage agreement for each item. After examination of item-partial total correlations, test items were revised. The revised items demonstrated stronger internal consistency than the original items. Preliminary evaluation of test-retest reliability was fair, with an average exact percent agreement across all test items of 67%. Findings support the preliminary reliability of the C-COGS scale as a tool to evaluate and promote client-centered goal planning in brain injury rehabilitation. Copyright © 2016 by the American Occupational Therapy Association, Inc.
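Cronbach's α, used in the abstract above to examine internal reliability, compares the sum of the individual item variances with the variance of the total score: α = k/(k−1) × (1 − Σ item variances / total-score variance). The sketch below assumes sample variances and complete data; the item responses are invented for illustration.

```python
import statistics


def cronbach_alpha(items):
    """Cronbach's alpha; `items` holds one list of respondent scores per item."""
    k = len(items)
    if k < 2:
        raise ValueError("alpha needs at least two items")
    # Total score per respondent, summing across items.
    totals = [sum(scores) for scores in zip(*items)]
    total_var = statistics.variance(totals)
    if total_var == 0:
        raise ValueError("alpha is undefined when total scores do not vary")
    item_var_sum = sum(statistics.variance(scores) for scores in items)
    return (k / (k - 1)) * (1.0 - item_var_sum / total_var)


# Hypothetical responses of four participants to three scale items.
responses = [
    [2, 4, 6, 8],
    [1, 2, 3, 4],
    [1, 3, 2, 4],
]
print(round(cronbach_alpha(responses), 3))
```

Items that rise and fall together inflate the total-score variance relative to the item variances, which pushes α toward 1.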
The reliability of three devices used for measuring vertical jump height.

PubMed

Nuzzo, James L; Anning, Jonathan H; Scharfenberg, Jessica M

2011-09-01

The purpose of this investigation was to assess the intrasession and intersession reliability of the Vertec, Just Jump System, and Myotest for measuring countermovement vertical jump (CMJ) height. Forty male and 39 female university students completed 3 maximal-effort CMJs during 2 testing sessions, which were separated by 24-48 hours. The height of the CMJ was measured from all 3 devices simultaneously. Systematic error, relative reliability, absolute reliability, and heteroscedasticity were assessed for each device. Systematic error across the 3 CMJ trials was observed within both sessions for males and females, and this was most frequently observed when the CMJ height was measured by the Vertec. No systematic error was discovered across the 2 testing sessions when the maximum CMJ heights from the 2 sessions were compared. In males, the Myotest demonstrated the best intrasession reliability (intraclass correlation coefficient [ICC] = 0.95; SEM = 1.5 cm; coefficient of variation [CV] = 3.3%) and intersession reliability (ICC = 0.88; SEM = 2.4 cm; CV = 5.3%; limits of agreement = -0.08 ± 4.06 cm). Similarly, in females, the Myotest demonstrated the best intrasession reliability (ICC = 0.91; SEM = 1.4 cm; CV = 4.5%) and intersession reliability (ICC = 0.92; SEM = 1.3 cm; CV = 4.1%; limits of agreement = 0.33 ± 3.53 cm). Additional analysis revealed that heteroscedasticity was present in the CMJ when measured from all 3 devices, indicating that better jumpers demonstrate greater fluctuations in CMJ scores across testing sessions. To attain reliable CMJ height measurements, practitioners are encouraged to familiarize athletes with the CMJ technique and then allow the athletes to complete numerous repetitions until performance plateaus, particularly if the Vertec is being used.
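The absolute reliability indices in the abstract above (SEM and CV) can be illustrated with a short sketch. One common formula is SEM = SD × √(1 − ICC), and the coefficient of variation is the standard deviation expressed as a percentage of the mean. Whether the authors used exactly these forms is an assumption (SEM is sometimes taken from the ANOVA error term instead, and CV is often computed per subject and then averaged). The jump heights are invented.

```python
import math
import statistics


def sem_from_icc(sd, icc):
    """Standard error of measurement from a between-subject SD and an ICC."""
    if not 0.0 <= icc <= 1.0:
        raise ValueError("this formula expects an ICC between 0 and 1")
    return sd * math.sqrt(1.0 - icc)


def coefficient_of_variation(values):
    """Sample SD expressed as a percentage of the mean."""
    return 100.0 * statistics.stdev(values) / statistics.mean(values)


# Hypothetical countermovement jump heights (cm) from three trials.
print(sem_from_icc(10.0, 0.75))
print(round(coefficient_of_variation([40, 42, 44]), 2))
```

The first call shows how quickly SEM shrinks as ICC rises: with an SD of 10 cm, an ICC of 0.75 still leaves an SEM of 5 cm.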
Functional performance testing of the hip in athletes: a systematic review for reliability and validity.

PubMed

Kivlan, Benjamin R; Martin, Robroy L

2012-08-01

The purpose of this study was to systematically review the literature for functional performance tests with evidence of reliability and validity that could be used for a young, athletic population with hip dysfunction. A search of the PubMed and SPORTDiscus databases was performed to identify movement, balance, hop/jump, or agility functional performance tests from the current peer-reviewed literature used to assess function of the hip in young, athletic subjects. The single-leg stance, deep squat, single-leg squat, and star excursion balance tests (SEBT) demonstrated evidence of validity and normative data for score interpretation. The single-leg stance test and SEBT have evidence of validity with association to hip abductor function. The deep squat test demonstrated evidence as a functional performance test for evaluating femoroacetabular impingement (FAI). Hop/jump tests and agility tests have no reported evidence of reliability or validity in a population of subjects with hip pathology. Use of functional performance tests in the assessment of hip dysfunction has not been well established in the current literature. Diminished squat depth and provocation of pain during the single-leg balance test have been associated with patients diagnosed with FAI and gluteal tendinopathy, respectively. The SEBT and single-leg squat tests provided evidence of convergent validity through an analysis of kinematics and muscle function in normal subjects. Reliability of functional performance tests has not been established in patients with hip dysfunction. Further study is needed to establish reliability and validity of functional performance tests that can be used in a young, athletic population with hip dysfunction. Level of evidence: 2b (Systematic Review of Literature).
Identifying and classifying hyperostosis frontalis interna via computerized tomography.

PubMed

May, Hila; Peled, Nathan; Dar, Gali; Hay, Ori; Abbas, Janan; Masharawi, Youssef; Hershkovitz, Israel

2010-12-01

The aim of this study was to recognize the radiological characteristics of hyperostosis frontalis interna (HFI) and to establish a valid and reliable method for its identification and classification. A reliability test was carried out on 27 individuals who had undergone a head computerized tomography (CT) scan. Intra-observer reliability was obtained by examining the images three times, by the same researcher, with a 2-week interval between each sample ranking. The inter-observer test was performed by three independent researchers. A validity test was carried out using two methods for identifying and classifying HFI: 46 cadaver skullcaps were ranked twice via computerized tomography scans and then by direct observation. Reliability and validity were calculated using Kappa test (SPSS 15.0). Reliability tests of ranking HFI via CT scans demonstrated good results (K > 0.7). As for validity, a very good consensus was obtained between the CT and direct observation, when moderate and advanced types of HFI were present (K = 0.82). The suggested classification method for HFI, using CT, demonstrated a sensitivity of 84%, specificity of 90.5%, and positive predictive value of 91.3%. In conclusion, volume rendering is a reliable and valid tool for identifying HFI. The suggested three-scale classification is most suitable for radiological diagnosis of the phenomenon. Considering the increasing awareness of HFI as an early indicator of a developing malady, this study may assist radiologists in identifying and classifying the phenomenon.
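The sensitivity, specificity, and positive predictive value quoted in the abstract above all derive from a single 2×2 table of CT result against direct observation. The sketch below shows the arithmetic. The counts used (21 true positives, 4 false negatives, 19 true negatives, 2 false positives) are inferred here as one table that is consistent with the reported percentages and the 46 skullcaps; the abstract itself does not report them, so treat them as an assumption.

```python
def diagnostic_metrics(tp, fn, tn, fp):
    """Sensitivity, specificity and positive predictive value from 2x2 counts."""
    sensitivity = tp / (tp + fn)   # true positives among all with the condition
    specificity = tn / (tn + fp)   # true negatives among all without it
    ppv = tp / (tp + fp)           # true positives among all positive calls
    return sensitivity, specificity, ppv


# Inferred counts (assumption): 25 skullcaps with HFI, 21 without, 46 in total.
sens, spec, ppv = diagnostic_metrics(tp=21, fn=4, tn=19, fp=2)
print(round(100 * sens, 1), round(100 * spec, 1), round(100 * ppv, 1))
```

With these counts the three values round to 84.0, 90.5, and 91.3 percent, matching the figures in the abstract.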
The reliability of the quantitative timed up and go test (QTUG) measured over five consecutive days under single and dual-task conditions in community dwelling older adults.

PubMed

Smith, Erin; Walsh, Lorcan; Doyle, Julie; Greene, Barry; Blake, Catherine

2016-01-01

The timed up and go (TUG) test is a commonly used assessment in older people, with variations including the addition of a motor or cognitive dual-task; however, in high-functioning older adults it is more difficult to assess change. The quantified TUG (QTUG) uses inertial sensors to detect test and gait parameters during the test. If it is to be used in the longitudinal assessment of older adults, it is important that we know which parameters are reliable and under which conditions. This study aims to examine the relative reliability of the QTUG over five consecutive days under single, motor and cognitive dual-task conditions. Twelve community dwelling older adults (10 females, mean age 74.17 (3.88)) performed the QTUG under three conditions for five consecutive days. The relative reliability of each of the gait parameters was assessed using intra-class correlation coefficient (ICC 3,1) and standard error of measurement (SEM). Five of the measures demonstrated excellent reliability (ICC>0.70) under all three conditions (time to complete test, walk time, number of gait cycles, number of steps and return from turn time). Measures of variability and turn derived parameters demonstrated weak reliability under all three conditions (ICC=0.05-0.49). For the most reliable parameters under single-task conditions, the addition of a cognitive task resulted in a reduction in reliability, suggesting caution when interpreting results under these conditions. Certain sensor derived parameters during the QTUG test may provide an additional resource in the longitudinal assessment of older people and earlier identification of falls risk. Copyright © 2015 Elsevier B.V. All rights reserved.
Reliability Correction for Functional Connectivity: Theory and Implementation

PubMed Central

Mueller, Sophia; Wang, Danhong; Fox, Michael D.; Pan, Ruiqi; Lu, Jie; Li, Kuncheng; Sun, Wei; Buckner, Randy L.; Liu, Hesheng

2016-01-01

Network properties can be estimated using functional connectivity MRI (fcMRI). However, regional variation of the fMRI signal causes systematic biases in network estimates including correlation attenuation in regions of low measurement reliability. Here we computed the spatial distribution of fcMRI reliability using longitudinal fcMRI datasets and demonstrated how pre-estimated reliability maps can correct for correlation attenuation. As a test case of reliability-based attenuation correction we estimated properties of the default network, where reliability was significantly lower than average in the medial temporal lobe and higher in the posterior medial cortex, heterogeneity that impacts estimation of the network. Accounting for this bias using attenuation correction revealed that the medial temporal lobe’s contribution to the default network is typically underestimated. To render this approach useful to a greater number of datasets, we demonstrate that test-retest reliability maps derived from repeated runs within a single scanning session can be used as a surrogate for multi-session reliability mapping. Using data segments with different scan lengths between 1 and 30 min, we found that test-retest reliability of connectivity estimates increases with scan length while the spatial distribution of reliability is relatively stable even at short scan lengths. Finally, analyses of tertiary data revealed that reliability distribution is influenced by age, neuropsychiatric status and scanner type, suggesting that reliability correction may be especially important when studying between-group differences. Collectively, these results illustrate that reliability-based attenuation correction is an easily implemented strategy that mitigates certain features of fMRI signal nonuniformity. PMID:26493163
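The idea of correcting correlations for low measurement reliability, described in the abstract above, can be illustrated with the classical Spearman correction for attenuation: the observed correlation is divided by the square root of the product of the two measures' reliabilities. Whether the paper applies exactly this form to its reliability maps is an assumption; the function name and the numbers are hypothetical.

```python
import math


def correct_attenuation(observed_r, reliability_x, reliability_y):
    """Spearman's correction: r_true ~ r_observed / sqrt(rel_x * rel_y)."""
    if not (0.0 < reliability_x <= 1.0 and 0.0 < reliability_y <= 1.0):
        raise ValueError("reliabilities must lie in (0, 1]")
    return observed_r / math.sqrt(reliability_x * reliability_y)


# Hypothetical example: a low-reliability region paired with a higher-reliability one.
print(round(correct_attenuation(0.30, 0.50, 0.72), 3))
```

In this toy case an observed correlation of 0.30 rises to 0.50 after correction, which mirrors the abstract's point that contributions from low-reliability regions tend to be underestimated. Corrected values can exceed 1 when reliabilities are themselves poorly estimated, so they are usually interpreted with caution.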
Test-retest reliability of jump execution variables using mechanography: a comparison of jump protocols.

PubMed

Fitzgerald, John S; Johnson, LuAnn; Tomkinson, Grant; Stein, Jesse; Roemmich, James N

2018-05-01

Mechanography during the vertical jump may enhance screening and determining mechanistic causes underlying physical performance changes. Utility of jump mechanography for evaluation is limited by scant test-retest reliability data on force-time variables. This study examined the test-retest reliability of eight jump execution variables assessed from mechanography. Thirty-two women (mean±SD: age 20.8 ± 1.3 yr) and 16 men (age 22.1 ± 1.9 yr) attended a familiarization session and two testing sessions, all one week apart. Participants performed two variations of the squat jump with squat depth self-selected and controlled using a goniometer to 80° knee flexion. Test-retest reliability was quantified as the systematic error (using effect size between jumps), random error (using coefficients of variation), and test-retest correlations (using intra-class correlation coefficients). Overall, jump execution variables demonstrated acceptable reliability, evidenced by small systematic errors (mean±95%CI: 0.2 ± 0.07), moderate random errors (mean±95%CI: 17.8 ± 3.7%), and very strong test-retest correlations (range: 0.73-0.97). Differences in random errors between controlled and self-selected protocols were negligible (mean±95%CI: 1.3 ± 2.3%). Jump execution variables demonstrated acceptable reliability, with no meaningful differences between the controlled and self-selected jump protocols. To simplify testing, a self-selected jump protocol can be used to assess force-time variables with negligible impact on measurement error.

Reliability of heart rate measures during walking before and after running maximal efforts.

PubMed

Boullosa, D A; Barros, E S; del Rosso, S; Nakamura, F Y; Leicht, A S

2014-11-01

Previous studies on HR recovery (HRR) measures have utilized the supine and the seated postures. However, the most common recovery mode in sport and clinical settings after running exercise is active walking. The aim of the current study was to examine the reliability of HR measures during walking (4 km·h⁻¹) before and following a maximal test. Twelve endurance athletes performed an incremental running test on 2 days separated by 48 h. Absolute (coefficient of variation, CV, %) and relative [Intraclass correlation coefficient, (ICC)] reliability of time domain and non-linear measures of HR variability (HRV) from 3 min recordings, and HRR parameters over 5 min were assessed. Moderate to very high reliability was identified for most HRV indices with short-term components of time domain and non-linear HRV measures demonstrating the greatest reliability before (CV: 12-22%; ICC: 0.73-0.92) and after exercise (CV: 14-32%; ICC: 0.78-0.91). Most HRR indices and parameters of HRR kinetics demonstrated high to very high reliability with HR values at a given point and the asymptotic value of HR being the most reliable (CV: 2.5-10.6%; ICC: 0.81-0.97). These findings demonstrate these measures as reliable tools for the assessment of autonomic control of HR during walking before and after maximal efforts. © Georg Thieme Verlag KG Stuttgart · New York.
Test-retest reliability of the Capute scales for neurodevelopmental screening of a high risk sample: Impact of test-retest interval and degree of neonatal risk.

PubMed

McCurdy, M; Bellows, A; Deng, D; Leppert, M; Mahone, E; Pritchard, A

2015-01-01

Reliable and valid screening and assessment tools are necessary to identify children at risk for neurodevelopmental disabilities who may require additional services. This study evaluated the test-retest reliability of the Capute Scales in a high-risk sample, hypothesizing adequate reliability across 6- and 12-month intervals. Capute Scales scores (N = 66) were collected via retrospective chart review from a NICU follow-up clinic within a large urban medical center spanning three age-ranges: 12-18, 19-24, and 25-36 months. On average, participants were classified as very low birth weight and premature. Reliability of the Capute Scales was evaluated with intraclass correlation coefficients across length of test-retest interval, age at testing, and degree of neonatal complications. The Capute Scales demonstrated high reliability, regardless of length of test-retest interval (ranging from 6 to 14 months) or age of participant, for all index scores, including overall Developmental Quotient (DQ), language-based skill index (CLAMS) and nonverbal reasoning index (CAT). Linear regressions revealed that greater neonatal risk was related to poorer test-retest reliability; however, reliability coefficients remained strong. The Capute Scales afford clinicians a reliable and valid means of screening and assessing for neurodevelopmental delay within high-risk infant populations.
Development and psychometric evaluation of an information literacy self-efficacy survey and an information literacy knowledge test

PubMed Central

Tepe, Rodger; Tepe, Chabha

2015-01-01

Objective To develop and psychometrically evaluate an information literacy (IL) self-efficacy survey and an IL knowledge test. Methods In this test–retest reliability study, a 25-item IL self-efficacy survey and a 50-item IL knowledge test were developed and administered to a convenience sample of 53 chiropractic students. Item analyses were performed on all questions. Results The IL self-efficacy survey demonstrated good reliability (test–retest correlation = 0.81) and good/very good internal consistency (mean κ = .56 and Cronbach's α = .92). A total of 25 questions with the best item analysis characteristics were chosen from the 50-item IL knowledge test, resulting in a 25-item IL knowledge test that demonstrated good reliability (test–retest correlation = 0.87), very good internal consistency (mean κ = .69, KR20 = 0.85), and good item discrimination (mean point-biserial = 0.48). Conclusions This study resulted in the development of three instruments: a 25-item IL self-efficacy survey, a 50-item IL knowledge test, and a 25-item IL knowledge test. The information literacy self-efficacy survey and the 25-item version of the information literacy knowledge test have shown preliminary evidence of adequate reliability and validity to justify continuing study with these instruments. PMID:25517736
Environmental education curriculum evaluation questionnaire: A reliability and validity study

NASA Astrophysics Data System (ADS)

Minner, Daphne Diane

The intention of this research project was to bridge the gap between social science research and application to the environmental domain through the development of a theoretically derived instrument designed to give educators a template by which to evaluate environmental education curricula. The theoretical base for instrument development was provided by several developmental theories such as Piaget's theory of cognitive development, Developmental Systems Theory, Life-span Perspective, as well as curriculum research within the area of environmental education. This theoretical base fueled the generation of a list of components which were then translated into a questionnaire with specific questions relevant to the environmental education domain. The specific research question for this project is: Can a valid assessment instrument based largely on human development and education theory be developed that reliably discriminates high, moderate, and low quality in environmental education curricula? The types of analyses conducted to answer this question were interrater reliability (percent agreement, Cohen's Kappa coefficient, Pearson's Product-Moment correlation coefficient), test-retest reliability (percent agreement, correlation), and criterion-related validity (correlation). Face validity and content validity were also assessed through thorough reviews. Overall results indicate that 29% of the questions on the questionnaire demonstrated a high level of interrater reliability and 43% of the questions demonstrated a moderate level of interrater reliability. Seventy-one percent of the questions demonstrated a high test-retest reliability and 5% a moderate level. Fifty-five percent of the questions on the questionnaire were reliable (high or moderate) both across time and raters. Only eight questions (8%) did not show either interrater or test-retest reliability. 
The global overall rating of high, medium, or low quality was reliable across both coders and time, indicating that the questionnaire can discriminate differences in quality of environmental education curricula. Of the 35 curricula evaluated, 6 were high quality, 14 were medium quality and 15 were low quality. The criterion-related validity of the instrument cannot currently be established because of the lack of comparable measures or a concretely usable set of multidisciplinary standards. Face and content validity were sufficiently demonstrated.
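Two of the interrater statistics named in this record, percent agreement and Cohen's kappa, can be sketched for two raters assigning categorical quality ratings. The function name and the ratings are hypothetical and are not the study's data.

```python
from collections import Counter

def cohen_kappa(rater_a, rater_b):
    """Cohen's kappa for two raters who rated the same items.

    kappa = (p_o - p_e) / (1 - p_e), where p_o is the observed agreement and
    p_e is the agreement expected by chance from each rater's marginal rates.
    """
    n = len(rater_a)
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    count_a, count_b = Counter(rater_a), Counter(rater_b)
    # Counter returns 0 for categories one rater never used.
    expected = sum(count_a[c] * count_b[c] for c in count_a) / n ** 2
    return (observed - expected) / (1 - expected)

# Hypothetical ratings of six curricula by two coders.
coder_1 = ["high", "high", "medium", "low", "low", "medium"]
coder_2 = ["high", "medium", "medium", "low", "low", "low"]
percent_agreement = sum(a == b for a, b in zip(coder_1, coder_2)) / len(coder_1)
kappa = cohen_kappa(coder_1, coder_2)
```

In this toy case the coders agree on 4 of 6 items, but kappa is lower than the raw agreement because some of that agreement would be expected by chance. The function is undefined when both raters use a single identical category throughout.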
Interexaminer reliability in physical examination of patients with low back pain.

PubMed

Strender, L E; Sjöblom, A; Sundell, K; Ludwig, R; Taube, A

1997-04-01

Seventy-one patients with low back pain were examined by two physiotherapists (50 patients) and two physicians (21 patients). The two physiotherapists had worked together for many years, but the two physicians had not. The interexaminer reliability of the clinical tests included in the physical examination was evaluated. The objective was to evaluate the interexaminer reliability of clinical tests used in the physical examination of patients with low back pain under ideal circumstances, which was the case for the physiotherapists. Numerous clinical tests are used in the evaluation of patients with low back pain. To reach the correct diagnosis, only tests with an acceptable validity and reliability should be used. Previous studies have mainly shown low reliability. It is important that clinical tests not be rejected because of low reliability caused by differences between examiners in performance of the examination and in their definition of normal results. Two examiners, either two physiotherapists or two physicians, independently examined patients with low back pain. In approximately half of the clinical tests studied, an acceptable reliability was demonstrated. On the basis of the physiotherapists' series, the reliability was acceptable for a number of clinical tests that are used in the evaluation of patients with low back pain. The results suggest that clinical tests should be standardized to a much higher degree than they are today.
The development and reliability of a simple field based screening tool to assess core stability in athletes.

PubMed

O'Connor, S; McCaffrey, N; Whyte, E; Moran, K

2016-07-01

To adapt the trunk stability test to facilitate further sub-classification of higher levels of core stability in athletes for use as a screening tool. To establish the inter-tester and intra-tester reliability of this adapted core stability test. Reliability study. Collegiate athletic therapy facilities. Fifteen physically active male subjects (19.46 ± 0.63) free from any orthopaedic or neurological disorders were recruited from a convenience sample of collegiate students. The intraclass correlation coefficients (ICC) and 95% Confidence Intervals (CI) were computed to establish inter-tester and intra-tester reliability. Excellent ICC values were observed in the adapted core stability test for inter-tester reliability (0.97) and good to excellent intra-tester reliability (0.73-0.90). While the 95% CIs were narrow for inter-tester reliability, the 95% CIs for Testers A and C were wide compared with those for Tester B. The adapted core stability test developed in this study is a quick and simple field based test to administer that can further subdivide athletes with high levels of core stability. The test demonstrated high inter-tester and intra-tester reliability. Copyright © 2015 Elsevier Ltd. All rights reserved.
A Structured Clinical Interview for Kleptomania (SCI-K): preliminary validity and reliability testing.

PubMed

Grant, Jon E; Kim, Suck Won; McCabe, James S

2006-06-01

Kleptomania presents difficulties in diagnosis for clinicians. This study aimed to develop and test a DSM-IV-based diagnostic instrument for kleptomania. To assess for current kleptomania the Structured Clinical Interview for Kleptomania (SCI-K) was administered to 112 consecutive subjects requesting psychiatric outpatient treatment for a variety of disorders. Reliability and validity were determined. Classification accuracy was examined using the longitudinal course of illness. The SCI-K demonstrated excellent test-retest (Phi coefficient = 0.956 (95% CI = 0.937, 0.970)) and inter-rater reliability (phi coefficient = 0.718 (95% CI = 0.506, 0.848)) in the diagnosis of kleptomania. Concurrent validity was observed with a self-report measure using DSM-IV kleptomania criteria (phi coefficient = 0.769 (95% CI = 0.653, 0.850)). Discriminant validity was observed with a measure of depression (point biserial coefficient = -0.020 (95% CI = -0.205, 0.166)). The SCI-K demonstrated both high sensitivity and specificity based on longitudinal assessment. The SCI-K demonstrated excellent reliability and validity in diagnosing kleptomania in subjects presenting with various psychiatric problems. These findings require replication in larger groups, including non-psychiatric populations, to examine their generalizability. Copyright (c) 2006 John Wiley & Sons, Ltd.
Test-retest reliability and smallest detectable change of the Bristol Impact of Hypermobility (BIoH) questionnaire.

PubMed

Palmer, S; Manns, S; Cramp, F; Lewis, R; Clark, E M

2017-12-01

The Bristol Impact of Hypermobility (BIoH) questionnaire is a patient-reported outcome measure developed in conjunction with adults with Joint Hypermobility Syndrome (JHS). It has demonstrated strong concurrent validity with the Short Form-36 (SF-36) physical component score but other psychometric properties have yet to be established. This study aimed to determine its test-retest reliability and smallest detectable change (SDC). A test-retest reliability study. Participants were recruited from the Hypermobility Syndromes Association, a patient organisation in the United Kingdom. Recruitment packs were sent to 1080 adults who had given permission to be contacted about research. BIoH and SF-36 questionnaires were administered at baseline and repeated two weeks later. An 11-point global rating of change scale (-5 to +5) was also administered at two weeks. Test-retest analysis and calculation of the SDC was conducted on 'stable' patients (defined as global rating of change -1 to +1). 462 responses were received. 233 patients reported a 'stable' condition and were included in analysis (95% women; mean (SD) age 44.5 (13.9) years; BIoH score 223.6 (54.0)). The BIoH questionnaire demonstrated excellent test-retest reliability (ICC 0.923, 95% CI 0.900-0.940). The SDC was 42 points (equivalent to 19% of the mean baseline score). The SF-36 physical and mental component scores demonstrated poorer test-retest reliability and larger SDCs (as a proportion of the mean baseline scores). The results provide further evidence of the potential of the BIoH questionnaire to underpin research and clinical practice for people with JHS. Copyright © 2017 Elsevier Ltd. All rights reserved.
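The abstract does not say how the smallest detectable change was calculated. A commonly used approach derives it from the standard error of measurement (SEM), and the sketch below applies that approach to the figures quoted above. Treating the reported baseline SD (54.0) as the SD in the SEM formula is an assumption, as are the function names.

```python
import math

def sem_from_icc(sd, icc):
    """Standard error of measurement: SEM = SD * sqrt(1 - ICC)."""
    return sd * math.sqrt(1.0 - icc)

def smallest_detectable_change(sem, z=1.96):
    """SDC at 95% confidence: z * sqrt(2) * SEM.

    The sqrt(2) accounts for measurement error in both test and retest.
    """
    return z * math.sqrt(2.0) * sem

# Figures quoted in the abstract: baseline mean 223.6, SD 54.0, ICC 0.923.
sem = sem_from_icc(sd=54.0, icc=0.923)
sdc = smallest_detectable_change(sem)
sdc_percent = 100.0 * sdc / 223.6
```

Under these assumptions the SDC comes out between 41 and 42 points, or just under 19% of the mean baseline score, which is in line with the reported 42 points and 19%.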
FUNCTIONAL PERFORMANCE TESTING OF THE HIP IN ATHLETES: A SYSTEMATIC REVIEW FOR RELIABILITY AND VALIDITY

PubMed Central

Martin, RobRoy L.

2012-01-01

Purpose/Background: The purpose of this study was to systematically review the literature for functional performance tests with evidence of reliability and validity that could be used for a young, athletic population with hip dysfunction. Methods: A search of the PubMed and SPORTDiscus databases was performed to identify movement, balance, hop/jump, or agility functional performance tests from the current peer-reviewed literature used to assess function of the hip in young, athletic subjects. Results: The single-leg stance, deep squat, single-leg squat, and star excursion balance tests (SEBT) demonstrated evidence of validity and normative data for score interpretation. The single-leg stance test and SEBT have evidence of validity with association to hip abductor function. The deep squat test demonstrated evidence as a functional performance test for evaluating femoroacetabular impingement. Hop/Jump tests and agility tests have no reported evidence of reliability or validity in a population of subjects with hip pathology. Conclusions: Use of functional performance tests in the assessment of hip dysfunction has not been well established in the current literature. Diminished squat depth and provocation of pain during the single-leg balance test have been associated with patients diagnosed with FAI and gluteal tendinopathy, respectively. The SEBT and single-leg squat tests provided evidence of convergent validity through an analysis of kinematics and muscle function in normal subjects. Reliability of functional performance tests has not been established in patients with hip dysfunction. Further study is needed to establish reliability and validity of functional performance tests that can be used in a young, athletic population with hip dysfunction. Level of Evidence: 2b (Systematic Review of Literature) PMID:22893860
The validity and reliability of a dynamic neuromuscular stabilization-heel sliding test for core stability.

PubMed

Cha, Young Joo; Lee, Jae Jin; Kim, Do Hyun; You, Joshua Sung H

2017-10-23

Core stabilization plays an important role in the regulation of postural stability. To overcome shortcomings associated with pain and severe core instability during conventional core stabilization tests, we recently developed the dynamic neuromuscular stabilization-based heel sliding (DNS-HS) test. The purpose of this study was to establish the criterion validity and test-retest reliability of the novel DNS-HS test. Twenty young adults with core instability completed both the bilateral straight leg lowering test (BSLLT) and DNS-HS test for the criterion validity study and repeated the DNS-HS test for the test-retest reliability study. Criterion validity was determined by comparing hip joint angle data that were obtained from BSLLT and DNS-HS measures. The test-retest reliability was determined by comparing hip joint angle data. Criterion validity was (ICC2,3) = 0.700 (p < 0.05), suggesting a good relationship between the two core stability measures. Test-retest reliability was (ICC3,3) = 0.953 (p < 0.05), indicating excellent consistency between the repeated DNS-HS measurements. Criterion validity data demonstrated a good relationship between the gold standard BSLLT and DNS-HS core stability measures. Test-retest reliability data suggest that the DNS-HS test is a reliable measure of core stability. Clinically, the DNS-HS test is useful to objectively quantify core instability and allow early detection and evaluation.
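The ICC(2,·) and ICC(3,·) coefficients quoted above differ in whether systematic differences between sessions or raters count as error. The sketch below computes both families from a subjects-by-measurements table using the Shrout and Fleiss two-way ANOVA formulas. The function name and the joint-angle values are hypothetical and are not the study's data.

```python
def icc_two_way(ratings):
    """Two-way ICCs for n subjects (rows) by k measurements (columns).

    Returns single-measure and average-measure forms of ICC(2,.)
    (absolute agreement) and ICC(3,.) (consistency).
    """
    n, k = len(ratings), len(ratings[0])
    grand = sum(map(sum, ratings)) / (n * k)
    row_means = [sum(r) / k for r in ratings]
    col_means = [sum(r[j] for r in ratings) / n for j in range(k)]

    # Sums of squares for subjects, measurements, and residual error.
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for r in ratings for x in r)
    ss_err = ss_total - ss_rows - ss_cols

    msr = ss_rows / (n - 1)              # between-subjects mean square
    msc = ss_cols / (k - 1)              # between-measurements mean square
    mse = ss_err / ((n - 1) * (k - 1))   # residual mean square

    return {
        "ICC(2,1)": (msr - mse) / (msr + (k - 1) * mse + k * (msc - mse) / n),
        "ICC(3,1)": (msr - mse) / (msr + (k - 1) * mse),
        "ICC(2,k)": (msr - mse) / (msr + (msc - mse) / n),
        "ICC(3,k)": (msr - mse) / msr,
    }

# Hypothetical angles: the retest reads a constant 2 degrees higher.
angles = [[10, 12], [20, 22], [30, 32], [40, 42]]
result = icc_two_way(angles)
```

In this toy table the retest preserves the subjects' ordering and spacing exactly, so the consistency form ICC(3,1) is 1, while the absolute-agreement form ICC(2,1) is slightly lower because the constant offset is counted against it.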
Validation of the Simple Shoulder Test in a Portuguese-Brazilian population. Is the latent variable structure and validation of the Simple Shoulder Test Stable across cultures?

PubMed

Neto, Jose Osni Bruggemann; Gesser, Rafael Lehmkuhl; Steglich, Valdir; Bonilauri Ferreira, Ana Paula; Gandhi, Mihir; Vissoci, João Ricardo Nickenig; Pietrobon, Ricardo

2013-01-01

The validation of widely used scales facilitates the comparison across international patient samples. The objective of this study was to translate, culturally adapt and validate the Simple Shoulder Test into Brazilian Portuguese. We also tested the stability of factor analysis across different cultures. The Simple Shoulder Test was translated from English into Brazilian Portuguese, translated back into English, and evaluated for accuracy by an expert committee. It was then administered to 100 patients with shoulder conditions. Psychometric properties were analyzed including factor analysis, internal reliability, test-retest reliability at seven days, and construct validity in relation to the Short Form 36 health survey (SF-36). Factor analysis demonstrated a three factor solution. Cronbach's alpha was 0.82. Test-retest reliability index as measured by intra-class correlation coefficient (ICC) was 0.84. Associations were observed in the hypothesized direction with all subscales of SF-36 questionnaire. The Simple Shoulder Test translation and cultural adaptation to Brazilian-Portuguese demonstrated adequate factor structure, internal reliability, and validity, ultimately allowing for its use in the comparison with international patient samples.
Validation of the Simple Shoulder Test in a Portuguese-Brazilian Population. Is the Latent Variable Structure and Validation of the Simple Shoulder Test Stable across Cultures?

PubMed Central

Neto, Jose Osni Bruggemann; Gesser, Rafael Lehmkuhl; Steglich, Valdir; Bonilauri Ferreira, Ana Paula; Gandhi, Mihir; Vissoci, João Ricardo Nickenig; Pietrobon, Ricardo

2013-01-01

Background The validation of widely used scales facilitates the comparison across international patient samples. Objective The objective of this study was to translate, culturally adapt and validate the Simple Shoulder Test into Brazilian Portuguese. We also tested the stability of factor analysis across different cultures. Methods The Simple Shoulder Test was translated from English into Brazilian Portuguese, translated back into English, and evaluated for accuracy by an expert committee. It was then administered to 100 patients with shoulder conditions. Psychometric properties were analyzed including factor analysis, internal reliability, test-retest reliability at seven days, and construct validity in relation to the Short Form 36 health survey (SF-36). Results Factor analysis demonstrated a three factor solution. Cronbach’s alpha was 0.82. Test-retest reliability index as measured by intra-class correlation coefficient (ICC) was 0.84. Associations were observed in the hypothesized direction with all subscales of SF-36 questionnaire. Conclusion The Simple Shoulder Test translation and cultural adaptation to Brazilian-Portuguese demonstrated adequate factor structure, internal reliability, and validity, ultimately allowing for its use in the comparison with international patient samples. PMID:23675436
Post-Test Analysis of a 10-Year Sodium Heat Pipe Life Test

NASA Technical Reports Server (NTRS)

Rosenfeld, John H.; Locci, Ivan E.; Sanzi, James L.; Hull, David R.; Geng, Steven M.

2011-01-01

High-temperature heat pipes are being evaluated for use in energy conversion applications such as fuel cells, gas turbine re-combustors, Stirling cycle heat sources; and with the resurgence of space nuclear power both as reactor heat removal elements and as radiator elements. Long operating life and reliable performance are critical requirements for these applications. Accordingly, long-term materials compatibility is being evaluated through the use of high-temperature life test heat pipes. Thermacore, Inc., has carried out a sodium heat pipe 10-year life test to establish long-term operating reliability. Sodium heat pipes have demonstrated favorable materials compatibility and heat transport characteristics at high operating temperatures in air over long time periods. A representative one-tenth segment Stirling Space Power Converter heat pipe with an Inconel 718 envelope and a stainless steel screen wick has operated for over 87,000 hr (10 years) at nearly 700 C. These life test results have demonstrated the potential for high-temperature heat pipes to serve as reliable energy conversion system components for power applications that require long operating lifetime with high reliability. Detailed design specifications, operating history, and post-test analysis of the heat pipe and sodium working fluid are described. Lessons learned and future life test plans are also discussed.
Comprehensive proficiency-based inanimate training for robotic surgery: reliability, feasibility, and educational benefit.

PubMed

Arain, Nabeel A; Dulan, Genevieve; Hogg, Deborah C; Rege, Robert V; Powers, Cathryn E; Tesfay, Seifu T; Hynan, Linda S; Scott, Daniel J

2012-10-01

We previously developed a comprehensive proficiency-based robotic training curriculum demonstrating construct, content, and face validity. This study aimed to assess reliability, feasibility, and educational benefit associated with curricular implementation. Over an 11-month period, 55 residents, fellows, and faculty (robotic novices) from general surgery, urology, and gynecology were enrolled in a 2-month curriculum: online didactics, half-day hands-on tutorial, and self-practice using nine inanimate exercises. Each trainee completed a questionnaire and performed a single proctored repetition of each task before (pretest) and after (post-test) training. Tasks were scored for time and errors using modified FLS metrics. For inter-rater reliability (IRR), three trainees were scored by two raters and analyzed using intraclass correlation coefficients (ICC). Data from eight experts were analyzed using ICC and Cronbach's α to determine test-retest reliability and internal consistency, respectively. Educational benefit was assessed by comparing baseline (pretest) and final (post-test) trainee performance; comparisons used the Wilcoxon signed-rank test. Of the 55 trainees who pretested, 53 (96%) completed all curricular components in 9-17 h and reached proficiency after completing an average of 72 ± 28 repetitions over 5 ± 1 h. Trainees indicated minimal prior robotic experience and "poor comfort" with robotic skills at baseline (1.8 ± 0.9) compared to final testing (3.1 ± 0.8, p < 0.001). IRR data for the composite score revealed an ICC of 0.96 (p < 0.001). Test-retest reliability was 0.91 (p < 0.001) and internal consistency was 0.81. Performance improved significantly after training for all nine tasks and according to composite scores (548 ± 176 vs. 914 ± 81, p < 0.001), demonstrating educational benefit. This curriculum is associated with high reliability measures, demonstrated feasibility for a large cohort of trainees, and yielded significant educational benefit.
Further studies and adoption of this curriculum are encouraged.
Reliability of externally fixed dynamometry hamstring strength testing in elite youth football players.

PubMed

Wollin, Martin; Purdam, Craig; Drew, Michael K

2016-01-01

To investigate inter and intra-tester reliability of an externally fixed dynamometry unilateral hamstring strength test, in the elite sports setting. Reliability study. Sixteen injury-free, elite male youth football players (age=16.81±0.54 years, height=180.22±5.29cm, weight 73.88±6.54kg, BMI=22.57±1.42) gave written informed consent. Unilateral maximum isometric peak hamstring force was evaluated by externally fixed dynamometry for inter-tester, intra-day and intra-tester, inter-week reliability. The test position was standardised to correlate with the terminal swing phase of the running gait cycle. Inter and intra-tester values demonstrated good to high levels of reliability. The intra-class coefficient (ICC) for inter-tester, intra-day reliability was 0.87 (95% CI=0.75-0.93) with standard error of measure percentage (SEM%) 4.7 and minimal detectable change percentage (MDC%) 12.9. Intra-tester, inter-week reliability results were ICC 0.86 (95% CI, 0.74-0.93), SEM% 5.0 and MDC% 14.0. This study demonstrates good to high inter and intra-tester reliability of isometric externally fixed dynamometry unilateral hamstring strength testing in the regular elite sport setting involving elite male youth football players. The intra-class coefficient in association with the low standard error of measure and minimal detectable change percentages suggests that this procedure is appropriate for clinical and academic use as well as monitoring hamstring strength in the elite sport setting. Crown Copyright © 2015. Published by Elsevier Ltd. All rights reserved.
An alternative to the balance error scoring system: using a low-cost balance board to improve the validity/reliability of sports-related concussion balance testing.

PubMed

Chang, Jasper O; Levy, Susan S; Seay, Seth W; Goble, Daniel J

2014-05-01

Recent guidelines advise sports medicine professionals to use balance tests to assess sensorimotor status in the management of concussions. The present study sought to determine whether a low-cost balance board could provide a valid, reliable, and objective means of performing this balance testing. Criterion validity testing relative to a gold standard and 7 day test-retest reliability. University biomechanics laboratory. Thirty healthy young adults. Balance ability was assessed on 2 days separated by 1 week using (1) a gold standard measure (ie, scientific grade force plate), (2) a low-cost Nintendo Wii Balance Board (WBB), and (3) the Balance Error Scoring System (BESS). Validity of the WBB center of pressure path length and BESS scores was determined relative to the force plate data. Test-retest reliability was established based on intraclass correlation coefficients. Composite scores for the WBB had excellent validity (r = 0.99) and test-retest reliability (R = 0.88). Both the validity (r = 0.10-0.52) and test-retest reliability (r = 0.61-0.78) were lower for the BESS. These findings demonstrate that a low-cost balance board can provide improved balance testing accuracy/reliability compared with the BESS. This approach provides a potentially more valid/reliable, yet affordable, means of assessing sports-related concussion compared with current methods.
Evaluating information skills training in health libraries: a systematic review.

PubMed

Brettle, Alison

2007-12-01

Systematic reviews have shown that there is limited evidence to demonstrate that the information literacy training health librarians provide is effective in improving clinicians' information skills or has an impact on patient care. Studies lack measures which demonstrate validity and reliability in evaluating the impact of training. To determine what measures have been used; the extent to which they are valid and reliable; to provide guidance for health librarians who wish to evaluate the impact of their information skills training. Systematic review methodology involved searching seven databases, and personal files. Studies were included if they were about information skills training, used an objective measure to assess outcomes, and occurred in a health setting. Fifty-four studies were included in the review. Most outcome measures used in the studies were not tested for the key criteria of validity and reliability. Three tested for validity and reliability are described in more detail. Selecting an appropriate measure to evaluate the impact of training is a key factor in carrying out any evaluation. This systematic review provides guidance to health librarians by highlighting measures used in various circumstances, and those that demonstrate validity and reliability.
Reliability and validity of the test of incremental respiratory endurance measures of inspiratory muscle performance in COPD

PubMed Central

Formiga, Magno F; Roach, Kathryn E; Vital, Isabel; Urdaneta, Gisel; Balestrini, Kira; Calderon-Candelario, Rafael A

2018-01-01

Purpose The Test of Incremental Respiratory Endurance (TIRE) provides a comprehensive assessment of inspiratory muscle performance by measuring maximal inspiratory pressure (MIP) over time. The integration of MIP over inspiratory duration (ID) provides the sustained maximal inspiratory pressure (SMIP). Evidence on the reliability and validity of these measurements in COPD is not currently available. Therefore, we assessed the reliability, responsiveness and construct validity of the TIRE measures of inspiratory muscle performance in subjects with COPD. Patients and methods Test–retest reliability, known-groups and convergent validity assessments were implemented simultaneously in 81 male subjects with mild to very severe COPD. TIRE measures were obtained using the portable PrO2 device, following standard guidelines. Results All TIRE measures were found to be highly reliable, with SMIP demonstrating the strongest test–retest reliability with a nearly perfect intraclass correlation coefficient (ICC) of 0.99, while MIP and ID clustered closely together behind SMIP with ICC values of about 0.97. Our findings also demonstrated known-groups validity of all TIRE measures, with SMIP and ID yielding larger effect sizes when compared to MIP in distinguishing between subjects of different COPD status. Finally, our analyses confirmed convergent validity for both SMIP and ID, but not MIP. Conclusion The TIRE measures of MIP, SMIP and ID have excellent test–retest reliability and demonstrated known-groups validity in subjects with COPD. SMIP and ID also demonstrated evidence of moderate convergent validity and appear to be more stable measures in this patient population than the traditional MIP. PMID:29805255
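The abstract describes SMIP as the integral of inspiratory pressure over the inspiratory duration. A minimal numerical sketch of that idea, using the trapezoidal rule on a sampled pressure trace, is given below. The function name, sampling scheme, and pressure values are illustrative assumptions, and nothing here reflects how the PrO2 device actually computes its outputs.

```python
def tire_measures(times_s, pressures):
    """Summarise one sampled inspiratory effort.

    times_s:   sample times in seconds, increasing.
    pressures: inspiratory pressure magnitude at each sample.
    Returns the peak pressure (MIP), the inspiratory duration (ID), and the
    area under the pressure-time curve (SMIP) by the trapezoidal rule.
    """
    area = 0.0
    for i in range(1, len(times_s)):
        dt = times_s[i] - times_s[i - 1]
        area += 0.5 * (pressures[i] + pressures[i - 1]) * dt
    return {
        "MIP": max(pressures),
        "ID": times_s[-1] - times_s[0],
        "SMIP": area,
    }

# Hypothetical traces: a pressure of 80 units held for 5 s, and a brief spike.
held = tire_measures([0, 1, 2, 3, 4, 5], [80, 80, 80, 80, 80, 80])
spike = tire_measures([0.0, 1.0, 2.0], [0.0, 10.0, 0.0])
```

The two toy traces show why SMIP carries information that MIP alone does not: a sustained effort and a brief spike can have similar peaks but very different areas.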
Reliability and Validity of the Behavioral Addiction Measure for Video Gaming.

PubMed

Sanders, James L; Williams, Robert J

2016-01-01

Most tests of video game addiction have weak construct validity and limited ability to correctly identify people in denial. The purpose of the present research was to investigate the reliability and validity of a new test of video game addiction (Behavioral Addiction Measure-Video Gaming [BAM-VG]) that was developed in part to address these deficiencies. Regular adult video gamers (n = 506) were recruited from a Canadian online panel and completed a survey containing three measures of excessive video gaming (BAM-VG; DSM-5 criteria for Internet Gaming Disorder [IGD]; and the IGD-20), as well as questions concerning extensiveness of video game involvement and self-report of problems associated with video gaming. One month later, they were reassessed for the purposes of establishing test-retest reliability. The BAM-VG demonstrated good internal consistency as well as 1 month test-retest reliability. Criterion-related validity was demonstrated by significant correlations with the following: time spent playing, self-identification of video game problems, and scores on other instruments designed to assess video game addiction (DSM-5 IGD, IGD-20). Consistent with the theory, principal component analysis identified two components underlying the BAM-VG that roughly correspond with impaired control and significant negative consequences deriving from this impaired control. Together with its excellent construct validity and other technical features, the BAM-VG represents a reliable and valid test of video game addiction.
Ten Year Operating Test Results and Post-Test Analysis of a 1/10 Segment Stirling Sodium Heat Pipe, Phase III

NASA Technical Reports Server (NTRS)

Rosenfeld, John H.; Minnerly, Kenneth G.; Dyson, Christopher M.

2012-01-01

High-temperature heat pipes are being evaluated for use in energy conversion applications such as fuel cells, gas turbine re-combustors, Stirling cycle heat sources; and with the resurgence of space nuclear power both as reactor heat removal elements and as radiator elements. Long operating life and reliable performance are critical requirements for these applications. Accordingly, long-term materials compatibility is being evaluated through the use of high-temperature life test heat pipes. Thermacore, Inc., has carried out a sodium heat pipe 10-year life test to establish long-term operating reliability. Sodium heat pipes have demonstrated favorable materials compatibility and heat transport characteristics at high operating temperatures in air over long time periods. A representative one-tenth segment Stirling Space Power Converter heat pipe with an Inconel 718 envelope and a stainless steel screen wick has operated for over 87,000 hr (10 yr) at nearly 700 C. These life test results have demonstrated the potential for high-temperature heat pipes to serve as reliable energy conversion system components for power applications that require long operating lifetime with high reliability. Detailed design specifications, operating history, and post-test analysis of the heat pipe and sodium working fluid are described.

Establishing Inter- and Intrarater Reliability for High-Stakes Testing Using Simulation.

PubMed

Kardong-Edgren, Suzan; Oermann, Marilyn H; Rizzolo, Mary Anne; Odom-Maryon, Tamara

This article reports one method to develop a standardized training method to establish the inter- and intrarater reliability of a group of raters for high-stakes testing. Simulation is used increasingly for high-stakes testing, but without research into the development of inter- and intrarater reliability for raters. Eleven raters were trained using a standardized methodology. Raters scored 28 student videos over a six-week period. Raters then rescored all videos over a two-day period to establish both intra- and interrater reliability. One rater demonstrated poor intrarater reliability; a second rater failed all students. Kappa statistics improved from the moderate to substantial agreement range with the exclusion of the two outlier raters' scores. There may be faculty who, for different reasons, should not be included in high-stakes testing evaluations. All faculty are content experts, but not all are expert evaluators.
Estimating Between-Person and Within-Person Subscore Reliability with Profile Analysis.

PubMed

Bulut, Okan; Davison, Mark L; Rodriguez, Michael C

2017-01-01

Subscores are of increasing interest in educational and psychological testing due to their diagnostic function for evaluating examinees' strengths and weaknesses within particular domains of knowledge. Previous studies about the utility of subscores have mostly focused on the overall reliability of individual subscores and ignored the fact that subscores should be distinct and have added value over the total score. This study introduces a profile reliability approach that partitions the overall subscore reliability into within-person and between-person subscore reliability. The estimation of between-person reliability and within-person reliability coefficients is demonstrated using subscores from number-correct scoring, unidimensional and multidimensional item response theory scoring, and augmented scoring approaches via a simulation study and a real data study. The effects of various testing conditions, such as subtest length, correlations among subscores, and the number of subtests, are examined. Results indicate that there is a substantial trade-off between within-person and between-person reliability of subscores. Profile reliability coefficients can be useful in determining the extent to which subscores provide distinct and reliable information under various testing conditions.
Integrating Formal Methods and Testing 2002

NASA Technical Reports Server (NTRS)

Cukic, Bojan

2002-01-01

Traditionally, qualitative program verification methodologies and program testing are studied in separate research communities. Neither alone is powerful and practical enough to provide sufficient confidence in ultra-high reliability assessment when used exclusively. Significant advances can be made by accounting not only for formal verification and program testing, but also for the impact of many other standard V&V techniques, in a unified software reliability assessment framework. The first year of this research resulted in a statistical framework that, given the assumptions on the success of the qualitative V&V and QA procedures, significantly reduces the amount of testing needed to confidently assess reliability at so-called high and ultra-high levels (10^-4 or higher). The coming years shall address the methodologies to realistically estimate the impacts of various V&V techniques on system reliability and include the impact of operational risk in reliability assessment. A) Combine formal correctness verification, process and product metrics, and other standard qualitative software assurance methods with statistical testing with the aim of gaining higher confidence in software reliability assessment for high-assurance applications. B) Quantify the impact of these methods on software reliability. C) Demonstrate that accounting for the effectiveness of these methods reduces the number of tests needed to attain a certain confidence level. D) Quantify and justify the reliability estimate for systems developed using various methods.
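To give a sense of why testing alone becomes impractical at these levels, the sketch below computes the classical zero-failure demonstration test count. It is not the framework described in the abstract; it assumes independent test runs with a constant per-run failure probability, and the function name is hypothetical.

```python
import math


def zero_failure_tests(max_failure_prob: float, confidence: float) -> int:
    """Smallest number of consecutive failure-free runs needed to claim,
    at the given confidence, that the per-run failure probability does
    not exceed max_failure_prob.

    Assumption (not from the abstract): runs are independent and
    identically distributed, so (1 - p)^n <= 1 - C must hold.
    """
    if not (0.0 < max_failure_prob < 1.0 and 0.0 < confidence < 1.0):
        raise ValueError("both arguments must lie strictly between 0 and 1")
    # Solve (1 - p)^n <= 1 - C for n; log1p keeps precision for tiny p.
    n = math.log(1.0 - confidence) / math.log1p(-max_failure_prob)
    return math.ceil(n)


if __name__ == "__main__":
    for p in (1e-2, 1e-3, 1e-4):
        print(p, zero_failure_tests(p, 0.99))
```

Under these assumptions the required count grows roughly in proportion to 1/p, which is the burden that prior evidence from other V&V activities would be meant to reduce.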
Measuring social alienation in adolescence: translation and validation of the Jessor and Jessor Social Alienation Scale.

PubMed

Safipour, Jalal; Tessma, Mesfin Kassaye; Higginbottom, Gina; Emami, Azita

2010-12-01

The objective of the study is to translate and examine the reliability and validity of the Jessor and Jessor Social Alienation Scale for use in a Swedish context. The study involved four phases of testing: (1) translation and back-translation; (2) a pilot test to evaluate the translation; (3) reliability testing; and (4) a validity test. The main participants of this study were 446 students (age range = 15-19 years, mean = 17, SD = 1.01). Results from the reliability test showed high internal consistency and stability. Face, content and construct validity were demonstrated using experts and confirmatory factor analysis. The results of testing the Swedish version of the alienation scale revealed an acceptable level of reliability and validity, and the scale is appropriate for use in the Swedish context. © 2010 The Authors. Scandinavian Journal of Psychology © 2010 The Scandinavian Psychological Associations.
Reliability of instruments in a cooperative, multisite study: employment intervention demonstration program.

PubMed

Salyers, M P; McHugo, G J; Cook, J A; Razzano, L A; Drake, R E; Mueser, K T

2001-09-01

Reliability of well-known instruments was examined in 202 people with severe mental illness participating in a multisite vocational study. We examined interrater reliability of the Positive and Negative Syndrome Scale (PANSS) and the internal consistency and test-retest reliability of the PANSS, the Rosenberg Self-Esteem Scale, the Medical Outcomes Study Short Form-36 (SF-36), and the Quality of Life Interview. Most scales had good levels of reliability, with intraclass correlation coefficients (ICCs) and coefficient alphas above .70. However, the SF-36 scales were generally less stable over time, particularly Social Functioning (ICC = .55). Test-retest reliability was lower among less educated respondents and among ethnic minorities. We recommend close monitoring of psychometric issues in future multisite studies.
Reliability and validity of a questionnaire for self-assessment of complete dentures.

PubMed

Komagamine, Yuriko; Kanazawa, Manabu; Kaiba, Yoshinori; Sato, Yusuke; Minakuchi, Shunsuke

2014-05-02

Demand for complete denture treatment is expected to rise over several decades. However, to date, no questionnaire on complete dentures, as evaluated by edentulous patients, has been shown to be reliable and valid. This study sought to assess the reliability and validity of the Patient's Denture Assessment (PDA), which provides a multidimensional evaluation of dentures among edentulous patients. Patients who had new complete dentures fabricated at the University Hospital of Dentistry, Tokyo Medical and Dental University from 2009 to 2010 were enrolled. The reliability of the PDA was determined by examining internal consistency and test-retest reliability. Internal consistency for all of the question items and the six subscales was measured using Cronbach's α and average inter-item correlation coefficients among 93 participants. For 33 of these participants, test-retest reliability was determined at a 2-month interval using the intraclass correlation coefficients (ICCs) and 95% confidence interval for the summary scores and the six subscale scores. The PDA was validated in 93 participants by examining the difference in the summary score and the six subscale scores of the PDA before and after replacement with new dentures by the paired t-test. Ability to detect change was also tested in 93 patients using effect size. The Cronbach's α for the PDA ranged from 0.56 to 0.93. The average inter-item correlation coefficients ranged from 0.28 to 0.83. ICCs for the PDA ranged from 0.37 to 0.83. The paired t-test showed a significant difference between the summary score and the six subscale scores before and after replacement with new dentures (p < 0.05) and the effect size was 0.97. The PDA demonstrated good reliability by assessing internal consistency and test-retest reliability. In addition, the PDA demonstrated good validity by assessing discriminant validity. Thus, the PDA could help dentists obtain a detailed understanding of the patients' perceptions in using their dentures.
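Cronbach's α, used here and in several other records in this list as the internal-consistency statistic, can be sketched as follows. The function name and the response matrix are hypothetical and are not data from the study; the code uses the standard formula α = k/(k-1) × (1 - Σ item variances / variance of total scores).

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a respondents-by-items score matrix.

    scores: list of rows, one row per respondent, each row holding the
    k item scores for that respondent (k >= 2, at least 2 respondents).
    """
    n_items = len(scores[0])
    if n_items < 2 or len(scores) < 2:
        raise ValueError("need at least 2 items and 2 respondents")
    # Sample variance of each item (column) across respondents.
    item_vars = [variance([row[j] for row in scores]) for j in range(n_items)]
    # Sample variance of each respondent's total score.
    total_var = variance([sum(row) for row in scores])
    return n_items / (n_items - 1) * (1 - sum(item_vars) / total_var)


if __name__ == "__main__":
    # Hypothetical 5 respondents x 4 items on a 1-5 scale.
    demo = [
        [4, 5, 4, 4],
        [2, 3, 2, 3],
        [5, 5, 4, 5],
        [3, 3, 3, 2],
        [1, 2, 2, 1],
    ]
    print(round(cronbach_alpha(demo), 3))
```

Items that move together push the total-score variance well above the sum of the item variances, which drives α toward 1; unrelated items drive it toward 0.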
Motivational Interviewing Skills in Health Care Encounters (MISHCE): Development and psychometric testing of an assessment tool.

PubMed

Petrova, Tatjana; Kavookjian, Jan; Madson, Michael B; Dagley, John; Shannon, David; McDonough, Sharon K

2015-01-01

Motivational interviewing (MI) has demonstrated a significant impact as an intervention strategy for addiction management, change in lifestyle behaviors, and adherence to prescribed medication and other treatments. Key elements to studying MI include training in MI of professionals who will use it, assessment of skills acquisition in trainees, and the use of a validated skills assessment tool. The purpose of this research project was to develop a psychometrically valid and reliable tool designed to assess MI skills competence in health care provider trainees. The goal was to develop an assessment tool that would evaluate the acquisition and use of specific MI skills and principles, as well as the quality of the patient-provider therapeutic alliance in brief health care encounters. To address this purpose, specific steps were followed, beginning with a literature review. This review contributed to the development of relevant conceptual and operational definitions, the selection of a scaling technique and response format, and methods for analyzing validity and reliability. Internal consistency reliability was established on 88 video recorded interactions. The inter-rater and test-retest reliability were established using 18 interactions randomly selected from the 88. The assessment tool Motivational Interviewing Skills for Health Care Encounters (MISHCE) and a manual for use of the tool were developed. Validity and reliability of MISHCE were examined. Face and content validity were supported with well-defined conceptual and operational definitions and feedback from an expert panel. Reliability was established through internal consistency, inter-rater reliability, and test-retest reliability. The overall internal consistency reliability (Cronbach's alpha) for all fifteen items was 0.75. MISHCE demonstrated good inter-rater reliability and good to excellent test-retest reliability. MISHCE assesses the health provider's level of knowledge and skills in brief disease management encounters. MISHCE also evaluates quality of the patient-provider therapeutic alliance, i.e., the "flow" of the interaction. Copyright © 2015 Elsevier Inc. All rights reserved.
Eddy current crack detection capability assessment approach using crack specimens with differing electrical conductivity

NASA Astrophysics Data System (ADS)

Koshti, Ajay M.

2018-03-01

Like other NDE methods, eddy current surface crack detectability is determined using probability of detection (POD) demonstration. The POD demonstration involves eddy current testing of surface crack specimens with known crack sizes. Reliably detectable flaw size, denoted by a90/95, is determined by statistical analysis of POD test data. The surface crack specimens shall be made from a similar material with electrical conductivity close to the part conductivity. A calibration standard with electro-discharge machined (EDM) notches is typically used in eddy current testing for surface crack detection. The calibration standard conductivity shall be within +/- 15% of the part conductivity. This condition is also applicable to the POD demonstration crack set. Here, a case is considered where conductivity of the crack specimens available for POD testing differs by more than 15% from that of the part to be inspected. Therefore, a direct POD demonstration of reliably detectable flaw size is not applicable. Additional testing is necessary to use the demonstrated POD test data. An approach to estimate the reliably detectable flaw size in eddy current testing for a part made from material A using POD crack specimens made from material B with different conductivity is provided. The approach uses additional test data obtained on EDM notch specimens made from materials A and B. EDM notch test data from the two materials is used to create a transfer function between the demonstrated a90/95 size on crack specimens made of material B and the estimated a90/95 size for a part made of material A. Two methods are given. For method A, the a90/95 crack size for material B is given and POD data is available. The objective of method A is to determine the a90/95 crack size for material A using the same relative decision threshold that was used for material B. For method B, the target crack size a90/95 for material A is known. The objective is to determine the decision threshold for inspecting material A.
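The "90/95" in a90/95 means 90% probability of detection demonstrated with 95% confidence. The abstract does not say which statistical analysis was applied, so the sketch below assumes the simple binomial pass/fail (hit/miss) demonstration at a single flaw size; the function name is hypothetical.

```python
from math import comb


def passes_pod_demo(n_trials: int, n_misses: int,
                    pod: float = 0.90, confidence: float = 0.95) -> bool:
    """Return True if n_misses missed flaws out of n_trials demonstrates
    the target POD at the target confidence.

    Assumption (not from the abstract): independent hit/miss trials on
    flaws of one size, judged with a one-sided binomial criterion.
    """
    p_miss = 1.0 - pod
    # Probability of observing this few misses (or fewer) if the true
    # POD were only equal to the target value.
    prob = sum(
        comb(n_trials, i) * p_miss**i * pod ** (n_trials - i)
        for i in range(n_misses + 1)
    )
    # The demonstration passes when that probability is at most 1 - C.
    return prob <= 1.0 - confidence


if __name__ == "__main__":
    for trials, misses in [(28, 0), (29, 0), (45, 1), (46, 1)]:
        print(trials, misses, passes_pod_demo(trials, misses))
```

Under this criterion 29 hits out of 29 is the smallest all-hit demonstration that passes, and one miss requires at least 46 trials.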
Assessment of a condition-specific quality-of-life measure for patients with developmentally absent teeth: validity and reliability testing.

PubMed

Akram, A J; Ireland, A J; Postlethwaite, K C; Sandy, J R; Jerreat, A S

2013-11-01

This article describes the process of validity and reliability testing of a condition-specific quality-of-life measure for patients with hypodontia presenting for orthodontic treatment. The development of the instrument is described in a previous article. Royal Devon and Exeter NHS Foundation Trust & Musgrove Park Hospital, Taunton. The child perception questionnaire was used as a standard against which to test criterion validity. The Bland and Altman method was used to check agreement between the two questionnaires. Construct validity was tested using principal component analysis on the four sections of the questionnaire. Test-retest reliability was tested using intraclass correlation coefficient and Bland and Altman method. Cronbach's alpha was used to test internal consistency reliability. Overall the questionnaire showed good reliability, criterion and construct validity. This together with previous evidence of good face and content validity suggests that the instrument may prove useful in clinical practice and further research. This study has demonstrated that the newly developed condition-specific quality-of-life questionnaire is both valid and reliable for use in young patients with hypodontia. © 2013 John Wiley & Sons A/S. Published by Blackwell Publishing Ltd.
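The Bland and Altman method mentioned above summarizes agreement between two sets of paired measurements by the mean difference (bias) and the 95% limits of agreement. A minimal sketch follows; the function name and the paired scores are hypothetical and are not data from the study.

```python
from statistics import mean, stdev


def bland_altman(first, second):
    """Return (bias, lower_limit, upper_limit) for paired measurements.

    bias is the mean of the paired differences; the limits of agreement
    are bias +/- 1.96 sample standard deviations of the differences.
    """
    if len(first) != len(second) or len(first) < 2:
        raise ValueError("need two equally long lists with at least 2 pairs")
    diffs = [a - b for a, b in zip(first, second)]
    bias = mean(diffs)
    spread = 1.96 * stdev(diffs)
    return bias, bias - spread, bias + spread


if __name__ == "__main__":
    # Hypothetical questionnaire totals at test and retest.
    test_scores = [42, 38, 51, 47, 35, 44]
    retest_scores = [40, 39, 50, 49, 36, 41]
    print(bland_altman(test_scores, retest_scores))
```

Narrow limits centered near zero indicate that the two measurements can be used interchangeably for practical purposes.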
Demonstration of Essential Reliability Services by a 300-MW Solar Photovoltaic Power Plant

DOE Office of Scientific and Technical Information (OSTI.GOV)

Loutan, Clyde; Klauer, Peter; Chowdhury, Sirajul

The California Independent System Operator (CAISO), First Solar, and the National Renewable Energy Laboratory (NREL) conducted a demonstration project on a large utility-scale photovoltaic (PV) power plant in California to test its ability to provide essential ancillary services to the electric grid. With increasing shares of solar- and wind-generated energy on the electric grid, traditional generation resources equipped with automatic governor control (AGC) and automatic voltage regulation controls -- specifically, fossil thermal -- are being displaced. The deployment of utility-scale, grid-friendly PV power plants that incorporate advanced capabilities to support grid stability and reliability is essential for the large-scale integration of PV generation into the electric power grid, among other technical requirements. A typical PV power plant consists of multiple power electronic inverters and can contribute to grid stability and reliability through sophisticated 'grid-friendly' controls. In this way, PV power plants can be used to mitigate the impact of variability on the grid, a role typically reserved for conventional generators. In August 2016, testing was completed on First Solar's 300-MW PV power plant, and a large amount of test data was produced and analyzed that demonstrates the ability of PV power plants to use grid-friendly controls to provide essential reliability services. These data showed how the development of advanced power controls can enable PV to become a provider of a wide range of grid services, including spinning reserves, load following, voltage support, ramping, frequency response, variability smoothing, and frequency regulation to power quality. Specifically, the tests conducted included various forms of active power control such as AGC and frequency regulation; droop response; and reactive power, voltage, and power factor controls. 
This project demonstrated that advanced power electronics and solar generation can be controlled to contribute to system-wide reliability. It was shown that the First Solar plant can provide essential reliability services related to different forms of active and reactive power controls, including plant participation in AGC, primary frequency control, ramp rate control, and voltage regulation. For AGC participation in particular, by comparing the PV plant testing results to the typical performance of individual conventional technologies, we showed that regulation accuracy by the PV plant is 24-30 points better than fast gas turbine technologies. The plant's ability to provide volt-ampere reactive control during periods of extremely low power generation was demonstrated as well. The project team developed a pioneering demonstration concept and test plan to show how various types of active and reactive power controls can leverage PV generation's value from being a simple variable energy resource to a resource that provides a wide range of ancillary services. With this project's approach to a holistic demonstration on an actual, large, utility-scale, operational PV power plant and dissemination of the obtained results, the team sought to close some gaps in perspectives that exist among various stakeholders in California and nationwide by providing real test data.
Getting the story straight: evaluating the test-retest reliability of a university health history questionnaire.

PubMed

Gilkison, C R; Fenton, M V; Lester, J W

1992-05-01

This study was designed to establish the reliability of a health history questionnaire used as a screening tool for incoming university students. The authors used a test-retest design, with a test interval of 6 months, on a sample of medical and nursing students. The analysis focused on overall reliability of the questionnaire and reproducibility of specific items, based on question format. Questionnaire items of specific interest were those with dichotomous yes/no response options versus open-ended format questions, those using the words frequently or recently, or those that asked multiple questions. Demographic characteristics of the subjects were considered in the evaluation of reliability. Overall reliability of the questionnaire (93.6%) was above the anticipated level of 90%, and subject sex or program of study did not show any significant differences in reproducibility of responses. Although wording of questions did not affect item reliability, dichotomous format questions demonstrated a higher degree of reliability (96.4%) than the overall reliability of the questionnaire. Recommendations for enhancing the reliability of the questionnaire are based on item analysis and information gathered from interviews with subjects.
Development, test-retest reliability, and construct validity of the resistance training skills battery.

PubMed

Lubans, David R; Smith, Jordan J; Harries, Simon K; Barnett, Lisa M; Faigenbaum, Avery D

2014-05-01

The aim of this study was to describe the development and assess test-retest reliability and construct validity of the Resistance Training Skills Battery (RTSB) for adolescents. The RTSB provides an assessment of resistance training skill competency and includes 6 exercises (i.e., body weight squat, push-up, lunge, suspended row, standing overhead press, and front support with chest touches). Scoring for each skill is based on the number of performance criteria successfully demonstrated. An overall resistance training skill quotient (RTSQ) is created by adding participants' scores for the 6 skills. Participants (44 boys and 19 girls, mean age = 14.5 ± 1.2 years) completed the RTSB on 2 occasions separated by 7 days. Participants also completed the following fitness tests, which were used to create a muscular fitness score (MFS): handgrip strength, timed push-up, and standing long jump tests. Intraclass correlation (ICC), paired samples t-tests, and typical error were used to assess test-retest reliability. To assess construct validity, gender and RTSQ were entered into a regression model predicting MFS. The rank order repeatability of the RTSQ was high (ICC = 0.88). The model explained 39% of the variance in MFS (p ≤ 0.001) and RTSQ (r = 0.40, p ≤ 0.001) was a significant predictor. This study has demonstrated the construct validity and test-retest reliability of the RTSB in a sample of adolescents. The RTSB can reliably rank participants in regards to their resistance training competency and has the necessary sensitivity to detect small changes in resistance training skill proficiency.
Reliability and validity of urinary nerve growth factor measurement in women with lower urinary tract symptoms.

PubMed

Vijaya, Gopalan; Cartwright, Rufus; Bhide, Alka; Derpapas, Alexandros; Fernando, Ruwan; Khullar, Vik

2016-11-01

The validity and reliability of measurement of urinary NGF as a diagnostic biomarker in women with lower urinary tract dysfunction (LUTD) is uncertain. We aimed to evaluate both the diagnostic and discriminant validity, and the test-retest reliability of urinary NGF measurement in women with LUTD. Urinary NGF was measured in women with LUTD (n = 205) and asymptomatic subjects (n = 31). Urinary NGF was assayed using an ELISA method and normalized against urinary creatinine. NGF/creatinine ratios were compared between symptom subgroups using the Mann-Whitney U test, and between different urodynamic diagnoses using the Kruskal-Wallis test. Receiver Operator Characteristic (ROC) analysis was employed to evaluate the diagnostic performance of urinary NGF. Test-retest reliability of NGF measurement was assessed using intra-class correlation (ICC). Urinary NGF was significantly but non-specifically increased in symptomatic patients when compared to controls (13.33 vs. 2.05 ng NGF/g Cr, P < 0.001). On multivariate logistic regression NGF was a good predictor of patients having OAB or not; however, the adjusted odds ratio was only 1.006. ROC analysis demonstrated poor discriminant ability between different symptomatic groups and urodynamic groups. Using a cut-off of 13.0 ng NGF/g creatinine, the test provides a sensitivity of 81%, but a specificity of only 39% for overactive bladder. The assays demonstrated good test-retest reliability with an ICC of 0.889. Although urinary NGF can be reliably assayed, and is increased in various LUTDs, it discriminates poorly between these disorders and therefore has very limited potential as a biomarker. Neurourol. Urodynam. 35:944-948, 2016. © 2015 Wiley Periodicals, Inc.
Measuring the Process and Quality of Informed Consent for Clinical Research: Development and Testing

PubMed Central

Cohn, Elizabeth Gross; Jia, Haomiao; Smith, Winifred Chapman; Erwin, Katherine; Larson, Elaine L.

2013-01-01

Purpose/Objectives: To develop and assess the reliability and validity of an observational instrument, the Process and Quality of Informed Consent (P-QIC). Design: A pilot study of the psychometrics of a tool designed to measure the quality and process of the informed consent encounter in clinical research. The study used professionally filmed, simulated consent encounters designed to vary in process and quality. Setting: A major urban teaching hospital in the northeastern region of the United States. Sample: 63 students enrolled in health-related programs participated in psychometric testing, 16 students participated in test-retest reliability, and 5 investigator-participant dyads were observed for the actual consent encounters. Methods: For reliability and validity testing, students watched videotaped simulations of four consent encounters intentionally varied in process and content and rated them with the proposed instrument. Test-retest reliability was established by raters watching the videotaped simulations twice. Inter-rater reliability was demonstrated by two simultaneous but independent raters observing an actual consent encounter. Main Research Variables: The essential elements of information and communication for informed consent. Findings: The initial testing of the P-QIC demonstrated reliable and valid psychometric properties in both the simulated standardized consent encounters and actual consent encounters in the hospital setting. Conclusions: The P-QIC is an easy-to-use observational tool that provides a quick assessment of the areas of strength and areas that need improvement in a consent encounter. It can be used in the initial trainings of new investigators or consent administrators and in ongoing programs of improvement for informed consent. Implications for Nursing: The development of a validated observational instrument will allow investigators to assess the consent process more accurately and evaluate strategies designed to improve it.
PMID:21708532
Advanced Grid-Friendly Controls Demonstration Project for Utility-Scale PV Power Plants

DOE Office of Scientific and Technical Information (OSTI.GOV)

Gevorgian, Vahan; O'Neill, Barbara

A typical photovoltaic (PV) power plant consists of multiple power electronic inverters and can contribute to grid stability and reliability through sophisticated 'grid-friendly' controls. The availability and dissemination of actual test data showing the viability of advanced utility-scale PV controls among all industry stakeholders can leverage PV's value from being simply an energy resource to providing additional ancillary services that range from variability smoothing and frequency regulation to power quality. Strategically partnering with a selected utility and/or PV power plant operator is a key condition for a successful demonstration project. The U.S. Department of Energy's (DOE's) Solar Energy Technologies Office selected the National Renewable Energy Laboratory (NREL) to be a principal investigator in a two-year project with goals to (1) identify a potential partner(s), (2) develop a detailed scope of work and test plan for a field project to demonstrate the grid-friendly capabilities of utility-scale PV power plants, (3) facilitate conducting actual demonstration tests, and (4) disseminate test results among industry stakeholders via a joint NREL/DOE publication and participation in relevant technical conferences. The project implementation took place in FY 2014 and FY 2015. In FY14, NREL established collaborations with AES and First Solar Electric, LLC, to conduct demonstration testing on their utility-scale PV power plants in Puerto Rico and Texas, respectively, and developed test plans for each partner. Both Puerto Rico Electric Power Authority and the Electric Reliability Council of Texas expressed interest in this project because of the importance of such advanced controls for the reliable operation of their power systems under high penetration levels of variable renewable generation. During FY15, testing was completed on both plants, and a large amount of test data was produced and analyzed that demonstrates the ability of PV power plants to provide various types of new grid-friendly controls.
Reliability of the Fox-walk test in patients with rheumatoid arthritis.

PubMed

Verberkt, Cornelia Antonia; Fridén, Cecilia; Grooten, Wilhelmus Johannes Andreas; Opava, Christina H

2012-01-01

The Fox-walk test is a new method used to estimate aerobic capacity outside a clinical environment, which may be useful in the implementation of daily health-enhancing physical activity. The aim of our study was to investigate the reliability of the test in people with rheumatoid arthritis (RA). Fifteen participants performed the Fox-walk test three times with weekly intervals. The intraclass correlation coefficient (ICC), the standard error of measurement (SEM) and the smallest detectable change (SDC) were used to estimate the reliability. General health perception, lower limb pain and fatigue were measured to determine their potential influence on the reliability. There were no systematic differences between the three test occasions (p = 0.190) and the reliability was almost perfect (ICC = 0.982). None of the covariates influenced the reliability. The SEM was 0.999 ml/kg/min or 3.4% and the SDC was 2.769 ml/kg/min or 9.4%. These findings demonstrate that the Fox-walk test is reliable in people with RA and enables differentiation between people with RA and monitoring progress. The validity of the test among people with RA is still to be determined. • The Fox-walk test is a new method to estimate aerobic capacity and could be performed walking or running. • The test is self administered without expensive equipment and is available in 150 public places in Sweden and several other European countries. • The Fox-walk test is a reliable test for use among people with rheumatoid arthritis monitoring the progress of their physical activity.
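The relationship between the reported SEM and SDC can be checked with the conventional formulas SEM = SD × √(1 - ICC) and SDC = 1.96 × √2 × SEM. The abstract does not state which formulas were used, so this is a sketch under that assumption; the function names are hypothetical, and the between-subject SD in the example is back-calculated rather than reported.

```python
import math


def sem_from_icc(between_subject_sd: float, icc: float) -> float:
    """Standard error of measurement from the sample SD and the ICC."""
    return between_subject_sd * math.sqrt(1.0 - icc)


def sdc_from_sem(sem: float, z: float = 1.96) -> float:
    """Smallest detectable change at the 95% level for one individual.

    The factor sqrt(2) reflects that a change score involves the
    measurement error of two separate test occasions.
    """
    return z * math.sqrt(2.0) * sem


if __name__ == "__main__":
    reported_sem = 0.999  # ml/kg/min, from the abstract
    print(round(sdc_from_sem(reported_sem), 3))  # 2.769, matching the abstract
    # Implied SD (an inference, not a reported value): SEM / sqrt(1 - ICC)
    print(round(reported_sem / math.sqrt(1.0 - 0.982), 2))
```

The reported SDC of 2.769 ml/kg/min is reproduced by the first formula, which suggests, but does not confirm, that the conventional definition was applied.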
The reliability of eyetracking to assess attentional bias to threatening words in healthy individuals.

PubMed

Skinner, Ian W; Hübscher, Markus; Moseley, G Lorimer; Lee, Hopin; Wand, Benedict M; Traeger, Adrian C; Gustin, Sylvia M; McAuley, James H

2017-08-15

Eyetracking is commonly used to investigate attentional bias. Although some studies have investigated the internal consistency of eyetracking, data are scarce on the test-retest reliability and agreement of eyetracking to investigate attentional bias. This study reports the test-retest reliability, measurement error, and internal consistency of 12 commonly used outcome measures thought to reflect the different components of attentional bias: overall attention, early attention, and late attention. Healthy participants completed a preferential-looking eyetracking task that involved the presentation of threatening (sensory words, general threat words, and affective words) and nonthreatening words. We used intraclass correlation coefficients (ICCs) to measure test-retest reliability (ICC > .70 indicates adequate reliability). The ICCs(2, 1) ranged from -.31 to .71. Reliability varied according to the outcome measure and threat word category. Sensory words had a lower mean ICC (.08) than either affective words (.32) or general threat words (.29). A longer exposure time was associated with higher test-retest reliability. All of the outcome measures, except second-run dwell time, demonstrated low measurement error (<6%). Most of the outcome measures reported high internal consistency (α > .93). Recommendations are discussed for improving the reliability of eyetracking tasks in future research.
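For readers unfamiliar with the ICC(2,1) notation, the sketch below implements the two-way random-effects, absolute-agreement, single-measurement form from its usual mean-square definition. The function name and data are hypothetical; the study's own software and data are not given in the abstract.

```python
def icc_2_1(ratings):
    """ICC(2,1) for a subjects-by-sessions matrix of scores.

    ratings: list of rows, one per subject, each with k scores (one per
    session or rater). Requires at least 2 subjects and 2 sessions.
    """
    n, k = len(ratings), len(ratings[0])
    grand = sum(sum(row) for row in ratings) / (n * k)
    row_means = [sum(row) / k for row in ratings]
    col_means = [sum(row[j] for row in ratings) / n for j in range(k)]

    # Two-way ANOVA sums of squares without replication.
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for row in ratings for x in row)
    ss_error = ss_total - ss_rows - ss_cols

    ms_rows = ss_rows / (n - 1)              # between subjects
    ms_cols = ss_cols / (k - 1)              # between sessions
    ms_error = ss_error / ((n - 1) * (k - 1))

    return (ms_rows - ms_error) / (
        ms_rows + (k - 1) * ms_error + k * (ms_cols - ms_error) / n
    )


if __name__ == "__main__":
    # Hypothetical dwell times (ms) for 4 participants over 2 sessions.
    print(icc_2_1([[310, 295], [420, 450], [275, 300], [380, 360]]))
```

Because this form measures absolute agreement, a constant shift between sessions lowers the coefficient even when the rank order is preserved, and estimates can be negative when within-subject variation exceeds between-subject variation, as with the lowest value reported above.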
Improving the quality of discrete-choice experiments in health: how can we assess validity and reliability?

PubMed

Janssen, Ellen M; Marshall, Deborah A; Hauber, A Brett; Bridges, John F P

2017-12-01

The recent endorsement of discrete-choice experiments (DCEs) and other stated-preference methods by regulatory and health technology assessment (HTA) agencies has placed a greater focus on demonstrating the validity and reliability of preference results. Areas covered: We present a practical overview of tests of validity and reliability that have been applied in the health DCE literature and explore other study qualities of DCEs. From the published literature, we identify a variety of methods to assess the validity and reliability of DCEs. We conceptualize these methods to create a conceptual model with four domains: measurement validity, measurement reliability, choice validity, and choice reliability. Each domain consists of three categories that can be assessed using one to four procedures (for a total of 24 tests). We present how these tests have been applied in the literature and direct readers to applications of these tests in the health DCE literature. Based on a stakeholder engagement exercise, we consider the importance of study characteristics beyond traditional concepts of validity and reliability. Expert commentary: We discuss study design considerations to assess the validity and reliability of a DCE, consider limitations to the current application of tests, and discuss future work to consider the quality of DCEs in healthcare.
Characterizing reliability in a product/process design-assurance program

DOE Office of Scientific and Technical Information (OSTI.GOV)

Kerscher, W.J. III; Booker, J.M.; Bement, T.R.

1997-10-01

Over the years many advancing techniques in the area of reliability engineering have surfaced in the military sphere of influence, and one of these techniques is Reliability Growth Testing (RGT). Private industry has reviewed RGT as part of the solution to their reliability concerns, but many practical considerations have slowed its implementation. Its objective is to demonstrate the reliability requirement of a new product with a specified confidence. This paper speaks directly to that objective but discusses a somewhat different approach to achieving it. Rather than conducting testing as a continuum and developing statistical confidence bands around the results, this Bayesian updating approach starts with a reliability estimate characterized by large uncertainty and then proceeds to reduce the uncertainty by folding in fresh information in a Bayesian framework.
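The idea of starting from an uncertain estimate and narrowing it as evidence arrives can be illustrated with the simplest conjugate case, a Beta prior on reliability updated with pass/fail test results. The abstract does not specify the authors' prior or likelihood, so the Beta-binomial model, the function names, and the numbers below are all assumptions for illustration.

```python
import math


def beta_update(a: float, b: float, successes: int, failures: int):
    """Posterior Beta parameters after observing pass/fail test results."""
    return a + successes, b + failures


def beta_mean(a: float, b: float) -> float:
    """Mean of a Beta(a, b) distribution (the reliability point estimate)."""
    return a / (a + b)


def beta_sd(a: float, b: float) -> float:
    """Standard deviation of Beta(a, b), used here as the uncertainty."""
    return math.sqrt(a * b / ((a + b) ** 2 * (a + b + 1)))


if __name__ == "__main__":
    a, b = 1.0, 1.0  # hypothetical vague prior: uniform on [0, 1]
    # Hypothetical test campaigns as (successes, failures).
    for successes, failures in [(9, 1), (19, 1), (50, 0)]:
        a, b = beta_update(a, b, successes, failures)
        print(f"mean={beta_mean(a, b):.3f}  sd={beta_sd(a, b):.3f}")
```

Each batch of results shifts the estimate and shrinks its spread, which is the qualitative behavior the abstract describes.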
Reliability, validity and responsiveness of the German Manchester-Oxford Foot Questionnaire (MOXFQ) in patients with foot or ankle surgery.

PubMed

Arbab, Dariusch; Kuhlmann, Katharina; Ringendahl, Hubert; Bouillon, Bertil; Eysel, Peer; König, Dietmar

2017-06-13

Patient-reported outcome measures are a critical tool in evaluating the efficacy of orthopaedic procedures. The intention of this study was to develop and culturally adapt a German version of the Manchester-Oxford Foot Questionnaire (MOXFQ) and to evaluate reliability, validity and responsiveness. Forward and backward translation was performed according to guidelines. The German MOXFQ was investigated in 177 consecutive patients before and 6 months after foot or ankle surgery. All patients completed the MOXFQ, Foot and Ankle Outcome Score (FAOS), Short Form 36 (SF-36) and numeric rating scales (NRS) for pain and disability. Test-retest reliability, internal consistency, floor and ceiling effects, construct validity and minimal important change were analyzed. The German MOXFQ demonstrated excellent test-retest reliability with ICC values >0.9. Cronbach's alpha (α) values demonstrated strong internal consistency. No floor or ceiling effects were observed. As hypothesized, MOXFQ subscales correlated strongly with corresponding FAOS and SF-36 domains. All subscales showed excellent (ES/SRM >0.8) responsiveness between preoperative assessment and postoperative follow-up. The German version of the MOXFQ demonstrated good psychometric properties. It proved to be a valid and reliable instrument for use in foot and ankle patients. Copyright © 2017 European Foot and Ankle Society. Published by Elsevier Ltd. All rights reserved.

Optimal periodic proof test based on cost-effective and reliability criteria

NASA Technical Reports Server (NTRS)

Yang, J.-N.

1976-01-01

An exploratory study for the optimization of periodic proof tests for fatigue-critical structures is presented. The optimal proof load level and the optimal number of periodic proof tests are determined by minimizing the total expected (statistical average) cost, while the constraint on the allowable level of structural reliability is satisfied. The total expected cost consists of the expected cost of proof tests, the expected cost of structures destroyed by proof tests, and the expected cost of structural failure in service. It is demonstrated by numerical examples that significant cost saving and reliability improvement for fatigue-critical structures can be achieved by the application of the optimal periodic proof test. The present study is relevant to the establishment of optimal maintenance procedures for fatigue-critical structures.
The Brighton musculoskeletal Patient-Reported Outcome Measure (BmPROM): An assessment of validity, reliability, and responsiveness.

PubMed

Bryant, Elizabeth; Murtagh, Shemane; Finucane, Laura; McCrum, Carol; Mercer, Christopher; Smith, Toby; Canby, Guy; Rowe, David A; Moore, Ann P

2018-05-11

In response to the need for a freely available, stand-alone, validated outcome measure for use within musculoskeletal (MSK) physiotherapy practice, sensitive enough to measure clinical effectiveness, we developed an MSK patient-reported outcome measure. This study examined the validity and reliability of the newly developed Brighton musculoskeletal Patient-Reported Outcome Measure (BmPROM) within physiotherapy outpatient settings. Two hundred twenty-four patients attending physiotherapy outpatient departments in South East England with an MSK condition participated in this study. The BmPROM was assessed for user friendliness (rated feedback, N = 224), reliability (internal consistency and test-retest reliability, n = 42), validity (internal and external construct validity, N = 224), and responsiveness (internal, n = 25). Exploratory factor analysis indicated that a two-factor model provides a good fit to the data. Factors were representative of "Functionality" and "Wellbeing". Correlations observed between the BmPROM and SF-36 domains provided evidence of convergent validity. Reliability results indicated that both subscales were internally consistent with alphas above the acceptable limits for both "Functionality" (α = .85, 95% CI [.81, .88]) and "Wellbeing" (α = .80, 95% CI [.75, .84]). Test-retest analyses (n = 42) demonstrated a high degree of reliability between "Functionality" (ICC = .84; 95% CI [.72, .91]) and "Wellbeing" scores (ICC = .84; 95% CI [.72, .91]). Further examination of test-retest reliability through the Bland-Altman analysis demonstrated that the difference between "Functionality" and "Wellbeing" test scores did not vary as a function of absolute test score. Large treatment effect sizes were found for both subscales (Functionality d = 1.10; Wellbeing d = 1.03). The BmPROM is a reliable and valid outcome measure for use in evaluating physiotherapy treatment of MSK conditions. Copyright © 2018 John Wiley & Sons, Ltd.
The 5K70SK automatically tuned, high power, S-band klystron

NASA Technical Reports Server (NTRS)

Goldfinger, A.

1977-01-01

Primary objectives include delivery of 44 5K70SK klystron amplifier tubes and 26 remote tuner assemblies with spare parts kits. Results of a reliability demonstration on a klystron test cavity are discussed, along with reliability tests performed on a remote tuning unit. Production problems and one design modification are reported and discussed. Results of PAT and DVT are included.
Plastic Encapsulated Microcircuits (PEMs) Reliability Guide

NASA Technical Reports Server (NTRS)

Sandor, M.

2000-01-01

It is reported by some users and has been demonstrated by others via testing and qualification that the quality and reliability of plastic-encapsulated microcircuits (PEMs) manufactured today are excellent in commercial applications and closely equivalent, and in some cases superior, to their hermetic counterparts.
Test-retest reliability of the Military Pre-training Questionnaire.

PubMed

Robinson, M; Stokes, K; Bilzon, J; Standage, M; Brown, P; Thompson, D

2010-09-01

Musculoskeletal injuries are a significant cause of morbidity during military training. A brief, inexpensive and user-friendly tool that demonstrates reliability and validity is warranted to effectively monitor the relationship between multiple predictor variables and injury incidence in military populations. To examine the test-retest reliability of the Military Pre-training Questionnaire (MPQ), designed specifically to assess risk factors for injury among military trainees across five domains (physical activity, injury history, diet, alcohol and smoking). Analyses were based on a convenience sample of 58 male British Army trainees. Kappa (kappa), weighted kappa (kappa(w)) and intraclass correlation coefficients (ICC) were used to evaluate the 2-week test-retest reliability of the MPQ. For index measures constituting the assessment of a given construct, internal consistency was assessed by Cronbach's alpha (alpha) coefficients. Reliability of individual items ranged from poor to almost perfect (kappa range = 0.45-0.86; kappa(w) range = 0.11-0.91; ICC range = 0.34-0.86) with most items demonstrating moderate reliability. Overall scores related to physical activity, diet, alcohol and smoking constructs were reliable between both administrations (ICC = 0.63-0.85). Support for the internal consistency of the incorporated alcohol (alpha = 0.78) and cigarette (alpha = 0.75) scales was also provided. The MPQ is a reliable self-report instrument for assessing multiple injury-related risk factors during initial military training. Further assessment of the psychometric properties of the MPQ (e.g. different types of validity) with military populations/samples will support its interpretation and use in future surveillance and epidemiological studies.
Reliability evaluation of microgrid considering incentive-based demand response

NASA Astrophysics Data System (ADS)

Huang, Ting-Cheng; Zhang, Yong-Jun

2017-07-01

Incentive-based demand response (IBDR) can guide customers to adjust their electricity consumption behaviour and actively curtail load. Meanwhile, distributed generation (DG) and energy storage systems (ESS) can provide time for the implementation of IBDR. The paper focuses on the reliability evaluation of a microgrid considering IBDR. Firstly, the mechanism of IBDR and its impact on power supply reliability are analysed. Secondly, the IBDR dispatch model considering the customer's comprehensive assessment and the customer response model are developed. Thirdly, a reliability evaluation method considering IBDR based on Monte Carlo simulation is proposed. Finally, the validity of the above models and method is studied through numerical tests on a modified RBTS Bus6 test system. Simulation results demonstrate that IBDR can improve the reliability of the microgrid.
Test-retest reliability of fMRI during nonverbal semantic decisions in moderate-severe nonfluent aphasia patients

PubMed Central

Kurland, Jacquie; Naeser, Margaret A.; Baker, Errol H.; Doron, Karl; Martin, Paula I.; Seekins, Heidi E.; Bogdan, Andrew; Renshaw, Perry; Yurgelun-Todd, Deborah

2005-01-01

Cortical reorganization in poststroke aphasia is not well understood. Few studies have investigated neural mechanisms underlying language recovery in severe aphasia patients, who are typically viewed as having a poor prognosis for language recovery. Although test-retest reliability is routinely demonstrated during collection of language data in single-subject aphasia research, this is rarely examined in fMRI studies investigating the underlying neural mechanisms in aphasia recovery. The purpose of this study was to acquire fMRI test-retest data examining semantic decisions both within and between two aphasia patients. Functional MRI was utilized to image individuals with chronic, moderate-severe nonfluent aphasia during nonverbal, yes/no button-box semantic judgments of iconic sentences presented in the Computer-assisted Visual Communication (C-ViC) program. We investigated the critical issue of intra-subject reliability by exploring similarities and differences in regions of activation during participants’ performance of identical tasks twice on the same day. Each participant demonstrated high intra-subject reliability, with response decrements typical of task familiarity. Differences between participants included greater left hemisphere perilesional activation in the individual with better response to C-ViC training. This study provides fMRI reliability in chronic nonfluent aphasia, and adds to evidence supporting differences in individual cortical reorganization in aphasia recovery. PMID:15706052
Technology Demonstration Summary Site Program Demonstration Test Soliditech Inc Solidification-stabilization Process

EPA Science Inventory

The major objective of the Soliditech, Inc., SITE demonstration was to develop reliable performance and cost information about the Soliditech solidification, stabilization technology. The Soliditech process mixes hazardous waste materials with Portland cement or pozzolanic m...
Adaptation of the ToxRTool to Assess the Reliability of Toxicology Studies Conducted with Genetically Modified Crops and Implications for Future Safety Testing.

PubMed

Koch, Michael S; DeSesso, John M; Williams, Amy Lavin; Michalek, Suzanne; Hammond, Bruce

2016-01-01

To determine the reliability of food safety studies carried out in rodents with genetically modified (GM) crops, a Food Safety Study Reliability Tool (FSSRTool) was adapted from the European Centre for the Validation of Alternative Methods' (ECVAM) ToxRTool. Reliability was defined as the inherent quality of the study with regard to use of standardized testing methodology, full documentation of experimental procedures and results, and the plausibility of the findings. Codex guidelines for GM crop safety evaluations indicate toxicology studies are not needed when comparability of the GM crop to its conventional counterpart has been demonstrated. This guidance notwithstanding, animal feeding studies have routinely been conducted with GM crops, but their conclusions on safety are not always consistent. To accurately evaluate potential risks from GM crops, risk assessors need clearly interpretable results from reliable studies. The development of the FSSRTool, which provides the user with a means of assessing the reliability of a toxicology study to inform risk assessment, is discussed. Its application to the body of literature on GM crop food safety studies demonstrates that reliable studies report no toxicologically relevant differences between rodents fed GM crops or their non-GM comparators.
Assessment of the severity of dementia: validity and reliability of the Chinese (Cantonese) version of the Hierarchic Dementia Scale (CV-HDS).

PubMed

Poon, Vickie Wan-kei; Lam, Linda Chiu-wa; Wong, Samuel Yeung-shan

2008-09-01

With the rapid growth of the older population, early detection of cognitive deficits is crucial in slowing down functional deterioration of elderly persons. To examine the validity and reliability of the Chinese (Cantonese) version of the Hierarchic Dementia Scale (CV-HDS) for Chinese older persons in Hong Kong. The HDS was translated into Cantonese Chinese. The content and cultural validity were evaluated by six expert panel members. Sixty-two participants with a diagnosis of dementia were recruited for evaluation. Inter-rater reliability, test-retest reliability, internal consistency and concurrent validity were examined. The CV-HDS demonstrated satisfactory psychometric properties. Inter-rater reliability and test-retest reliability were high (alpha=0.89 and alpha=0.94 respectively). A high value of Cronbach's alpha (alpha=0.94) demonstrated good internal consistency. The concurrent validity of the CV-HDS, through correlation of its scores with those of the Chinese version of the Mini Mental Status Examination, was established (ranged from r=0.58 to r=0.78, p<0.01). The CV-HDS is a reliable and valid instrument for assessing severity of cognitive impairment in Cantonese-speaking Chinese people with dementia. It facilitates treatment planning to optimize the effects of functional training and rehabilitation.
Reliability of ultrasound thickness measurement of the abdominal muscles during clinical isometric endurance tests.

PubMed

ShahAli, Shabnam; Arab, Amir Massoud; Talebian, Saeed; Ebrahimi, Esmaeil; Bahmani, Andia; Karimi, Noureddin; Nabavi, Hoda

2015-07-01

The study was designed to evaluate the intra-examiner reliability of ultrasound (US) thickness measurement of abdominal muscles activity when supine lying and during two isometric endurance tests in subjects with and without Low back pain (LBP). A total of 19 women (9 with LBP, 10 without LBP) participated in the study. Within-day reliability of the US thickness measurements at supine lying and the two isometric endurance tests were assessed in all subjects. The intra-class correlation coefficient (ICC) was used to assess the relative reliability of thickness measurement. The standard error of measurement (SEM), minimal detectable change (MDC) and the coefficient of variation (CV) were used to evaluate the absolute reliability. Results indicated high ICC scores (0.73-0.99) and also small SEM and MDC scores for within-day reliability assessment. The Bland-Altman plots of agreement in US measurement of the abdominal muscles during the two isometric endurance tests demonstrated that 95% of the observations fall between the limits of agreement for test and retest measurements. Together the results indicate high intra-tester reliability for the US measurement of the thickness of abdominal muscles in all the positions tested. According to the study's findings, US imaging can be used as a reliable method for assessment of abdominal muscles activity in supine lying and the two isometric endurance tests employed, in participants with and without LBP. Copyright © 2014 Elsevier Ltd. All rights reserved.
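The absolute-reliability statistics named in the abstract above are commonly derived from the ICC and the between-subject standard deviation. The sketch below uses the widely cited formulas SEM = SD × sqrt(1 − ICC) and MDC95 = 1.96 × sqrt(2) × SEM; whether the authors used exactly these forms is an assumption, and the function names and input values are hypothetical.

```python
import math


def sem_from_icc(sd, icc):
    """Standard error of measurement from the sample SD and the ICC."""
    return sd * math.sqrt(1.0 - icc)


def mdc95(sem):
    """Minimal detectable change at the 95% confidence level for two measurements."""
    return 1.96 * math.sqrt(2.0) * sem


# Hypothetical values: muscle thickness SD of 2.0 mm and an ICC of 0.91.
sem = sem_from_icc(sd=2.0, icc=0.91)
print(f"SEM   = {sem:.2f} mm")
print(f"MDC95 = {mdc95(sem):.2f} mm")
```

A change between two sessions smaller than the MDC95 cannot be distinguished from measurement error under these assumptions.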
Reliability of tensiomyography and myotonometry in detecting mechanical and contractile characteristics of the lumbar erector spinae in healthy volunteers.

PubMed

Lohr, Christine; Braumann, Klaus-Michael; Reer, Ruediger; Schroeder, Jan; Schmidt, Tobias

2018-04-20

Tensiomyography™ (TMG) and MyotonPRO® (MMT) are two non-invasive devices for monitoring muscle contractile and mechanical characteristics. This study aimed to evaluate the test-retest reliability of TMG and MMT parameters for measuring (TMG:) muscle displacement (Dm), contraction time (Tc), and velocity (Vc) and (MMT:) frequency (F), stiffness (S), and decrement (D) of the erector spinae muscles (ES) in healthy adults. A particular focus was set on the establishment of reliability measures for the previously barely evaluated secondary TMG parameter Vc. Twenty-four subjects (13 female and 11 male, mean ± SD, 38.0 ± 12.0 years) were measured using TMG and MMT over 2 consecutive days. Absolute and relative reliability was calculated by standard error of measurement (SEM, SEM%), minimum detectable change (MDC, MDC%), coefficient of variation (CV%) and intraclass correlation coefficient (ICC, 3.1) with a 95% confidence interval (CI). The ICCs for all variables and test-retest intervals ranged from 0.75 to 0.99, indicating good to excellent relative reliability for both TMG and MMT, with the lowest values for TMG Tc and between-day MMT D (ICC < 0.90). Absolute reliability was suitable for all parameters (CV 2-8%) except for Dm (10-12%). Vc proved to be the most reliable and repeatable TMG parameter (ICC > 0.95, CV < 8%). The reliability for TMG Vc could be established successfully. Its further applicability needs to be confirmed in future studies. MMT was found to be more reliable on repeated testing than the two other TMG parameters Dm and Tc.
Development of a clinical static and dynamic standing balance measurement tool appropriate for use in adolescents.

PubMed

Emery, Carolyn A; Cassidy, J David; Klassen, Terry P; Rosychuk, Rhonda J; Rowe, Brian B

2005-06-01

There is a need in sports medicine for a static and dynamic standing balance measure to quantify balance ability in adolescents. The purposes of this study were to determine the test-retest reliability of timed static (eyes open) and dynamic (eyes open and eyes closed) unipedal balance measurements and to examine factors associated with balance. Adolescents (n=123) were randomly selected from 10 Calgary high schools. This study used a repeated-measures design. One rater measured unipedal standing balance, including timed eyes-closed static (ECS), eyes-open dynamic (EOD), and eyes-closed dynamic (ECD) balance at baseline and 1 week later. Dynamic balance was measured on a foam surface. Reliability was examined using both intraclass correlation coefficients (ICCs) and Bland and Altman statistical techniques. Multiple linear regressions were used to examine other potentially influencing factors. Based on ICCs, test-retest reliability was adequate for ECS, EOD, and ECD balance (ICC=.69, .59, and .46, respectively). The results of Bland and Altman methods, however, suggest that caution is required in interpreting reliability based on ICCs alone. Although both ECS balance and ECD balance appear to demonstrate adequate test-retest reliability by ICC, Bland and Altman methods of agreement demonstrate sufficient reliability for ECD balance only. Thirty percent of the subjects reached the 180-second maximum on EOD balance, suggesting that this test is not appropriate for use in this population. Balance ability (ECS and ECD) was better in adolescents with no past history of lower-extremity injury. Timed ECD balance is an appropriate and reliable clinical measurement for use in adolescents and is influenced by previous injury.
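The abstract above contrasts ICCs with Bland and Altman agreement. As a minimal sketch of the latter, the 95% limits of agreement are usually taken as the mean test-retest difference plus or minus 1.96 standard deviations of the differences. The function name and the balance times below are hypothetical and are not data from the study.

```python
import statistics


def bland_altman_limits(test, retest):
    """Return (bias, lower limit, upper limit) of agreement for paired measurements."""
    diffs = [a - b for a, b in zip(test, retest)]
    bias = statistics.mean(diffs)
    sd = statistics.stdev(diffs)  # sample SD of the paired differences
    return bias, bias - 1.96 * sd, bias + 1.96 * sd


# Hypothetical timed balance scores (seconds) at baseline and one week later.
baseline = [12.0, 8.5, 20.1, 15.3, 9.8, 17.6]
one_week = [11.2, 9.9, 18.4, 16.0, 8.1, 19.0]

bias, lower, upper = bland_altman_limits(baseline, one_week)
print(f"bias = {bias:.2f} s, limits of agreement = [{lower:.2f}, {upper:.2f}] s")
```

Wide limits relative to the typical score can indicate poor agreement even when the ICC looks adequate, which is the caution the abstract raises.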
Data mining-based coefficient of influence factors optimization of test paper reliability

NASA Astrophysics Data System (ADS)

Xu, Peiyao; Jiang, Huiping; Wei, Jieyao

2018-05-01

Testing is a significant part of the teaching process. It demonstrates the final outcome of school teaching through teachers' teaching level and students' scores. The analysis of a test paper is a complex operation characterized by non-linear relations among the length of the paper, time duration and the degree of difficulty. It is therefore difficult, with general methods, to optimize the coefficients of the influence factors under different conditions in order to obtain test papers with clearly higher reliability [1]. With data mining techniques such as Support Vector Regression (SVR) and Genetic Algorithm (GA), we can model the test paper analysis and optimize the coefficients of the influence factors for higher reliability. The test results show that the combination of SVR and GA yields an effective improvement in reliability. The optimized coefficients of the influence factors are practical in real applications, and the whole optimization procedure can provide a model basis for test paper analysis.
Reference values for the muscle power sprint test in 6- to 12-year-old children.

PubMed

Douma-van Riet, Danielle; Verschuren, Olaf; Jelsma, Dorothee; Kruitwagen, Cas; Smits-Engelsman, Bouwien; Takken, Tim

2012-01-01

The aims of this study were (1) to develop centile reference values for anaerobic performance of Dutch children tested using the Muscle Power Sprint Test (MPST) and (2) to examine the test-retest reliability of the MPST. Children who were developing typically (178 boys and 201 girls) and aged 6 to 12 years (mean = 8.9 years) were recruited. The MPST was administered to 379 children, and test-retest reliability was examined in 47 children. MPST scores were transformed into centile curves, which were created using generalized additive models for location, scale, and shape. Height-related reference curves were created for both genders. Excellent (intraclass correlation coefficient = 0.98) test-retest reliability was demonstrated. The reference values for the MPST of children who are developing typically and aged 6 to 12 years can serve as a clinical standard in pediatric physical therapy practice. The MPST is a reliable and practical method for determining anaerobic performance in children.
DOE Office of Scientific and Technical Information (OSTI.GOV)

Agalgaonkar, Yashodhan P.; Hammerstrom, Donald J.

The Pacific Northwest Smart Grid Demonstration (PNWSGD) was a smart grid technology performance evaluation project that included multiple U.S. states and cooperation from multiple electric utilities in the northwest region. One of the local objectives for the project was to achieve improved distribution system reliability. Toward this end, some PNWSGD utilities automated their distribution systems, including the application of fault detection, isolation, and restoration and advanced metering infrastructure. In light of this investment, a major challenge was to establish a correlation between implementation of these smart grid technologies and actual improvements of distribution system reliability. This paper proposes using Welch's t-test to objectively determine and quantify whether distribution system reliability is improving over time. The proposed methodology is generic, and it can be implemented by any utility after calculation of the standard reliability indices. The effectiveness of the proposed hypothesis testing approach is demonstrated through comprehensive practical results. It is believed that wider adoption of the proposed approach can help utilities to evaluate a realistic long-term performance of smart grid technologies.
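As a rough illustration of the hypothesis test named in the abstract above, the sketch below computes Welch's t statistic and the Welch-Satterthwaite degrees of freedom for two samples with unequal variances. The choice of index (an annual outage-duration figure), the sample values, and the function name are hypothetical; the abstract does not state which indices or data the authors used.

```python
import statistics


def welch_t(sample_a, sample_b):
    """Welch's t statistic and Welch-Satterthwaite degrees of freedom."""
    n_a, n_b = len(sample_a), len(sample_b)
    # Squared standard errors of each sample mean (unequal variances allowed).
    se2_a = statistics.variance(sample_a) / n_a
    se2_b = statistics.variance(sample_b) / n_b
    t = (statistics.mean(sample_a) - statistics.mean(sample_b)) / (se2_a + se2_b) ** 0.5
    df = (se2_a + se2_b) ** 2 / (se2_a ** 2 / (n_a - 1) + se2_b ** 2 / (n_b - 1))
    return t, df


# Hypothetical yearly outage-duration index (minutes per customer), before and after automation.
before = [132.0, 148.5, 121.3, 160.2, 139.9]
after = [110.4, 118.7, 105.1, 125.0, 99.8, 112.3]

t_stat, dof = welch_t(before, after)
print(f"t = {t_stat:.2f}, df = {dof:.1f}")
```

The statistic would then be compared against a Student t distribution with the computed degrees of freedom to obtain a p-value; that step is omitted here to keep the sketch free of third-party dependencies.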
Development and testing of mobile technology for community park improvements: validity and reliability of the eCPAT application with youth.

PubMed

Besenyi, Gina M; Diehl, Paul; Schooley, Benjamin; Turner-McGrievy, Brie M; Wilcox, Sara; Stanis, Sonja A Wilhelm; Kaczynski, Andrew T

2016-12-01

Creation of mobile technology environmental audit tools can provide a more interactive way for youth to engage with communities and facilitate participation in health promotion efforts. This study describes the development and validity and reliability testing of an electronic version of the Community Park Audit Tool (eCPAT). eCPAT consists of 149 items and incorporates a variety of technology benefits. Criterion-related validity and inter-rater reliability were evaluated using data from 52 youth across 47 parks in Greenville County, SC. A large portion of items (>70%) demonstrated either fair or moderate to perfect validity and reliability. All but six items demonstrated excellent percent agreement. The eCPAT app is a user-friendly tool that provides a comprehensive assessment of park environments. Given the proliferation of smartphones, tablets, and other electronic devices among both adolescents and adults, the eCPAT app has potential to be distributed and used widely for a variety of health promotion purposes.
Reliability and validity of migraine disability assessment questionnaire-Thai version (Thai-MIDAS).

PubMed

Seethong, Piman; Nimmannit, Akarin; Chaisewikul, Rungsan; Prayoonwiwat, Naraporn; Chotinaiwattarakul, Wattanachai

2013-02-01

To assess the validity and test-retest reliability of a Thai translation of the Migraine Disability Assessment (MIDAS) Questionnaire in Thai patients with migraine. Migraineurs from the Headache Clinic in Siriraj Hospital were recruited and asked to complete a 13-week diary and answer the Thai-MIDAS at once. Some participants were asked to complete a second Thai-MIDAS within the next 2 weeks for test-retest reliability. Ninety-three patients completed the 13-week diaries. Age range was 18-58 years with mean 37.69 +/- 9.60 years. All 5 items and the total score of Thai-MIDAS were moderately correlated with data from the 13-week diary (Spearman's correlation coefficient = 0.32-0.62). The test-retest reliability of the total score of Thai-MIDAS in 30 patients demonstrated a high degree of intraclass correlation (ICC = 0.76, 95% CI 0.49-0.88). The present study reveals that the Thai-MIDAS has satisfactory validity and reliability in comparison with the original English MIDAS version.
Test-retest reliability of sensor-based sit-to-stand measures in young and older adults.

PubMed

Regterschot, G Ruben H; Zhang, Wei; Baldus, Heribert; Stevens, Martin; Zijlstra, Wiebren

2014-01-01

This study investigated test-retest reliability of sensor-based sit-to-stand (STS) peak power and other STS measures in young and older adults. In addition, test-retest reliability of the sensor method was compared to test-retest reliability of the Timed Up and Go Test (TUGT) and Five-Times-Sit-to-Stand Test (FTSST) in older adults. Ten healthy young female adults (20-23 years) and 31 older adults (21 females; 73-94 years) participated in two assessment sessions separated by 3-8 days. Vertical peak power was assessed during three (young adults) and five (older adults) normal and fast STS trials with a hybrid motion sensor worn on the hip. Older adults also performed the FTSST and TUGT. The average sensor-based STS peak power of the normal STS trials and the average sensor-based STS peak power of the fast STS trials showed excellent test-retest reliability in young adults (intra-class correlation (ICC)≥0.90; zero in 95% confidence interval of mean difference between test and retest (95%CI of D); standard error of measurement (SEM)≤6.7% of mean peak power) and older adults (ICC≥0.91; zero in 95%CI of D; SEM≤9.9%). Test-retest reliability of sensor-based STS peak power and TUGT (ICC=0.98; zero in 95%CI of D; SEM=8.5%) was comparable in older adults, whereas test-retest reliability of the FTSST was lower (ICC=0.73; zero outside 95%CI of D; SEM=14.4%). Sensor-based STS peak power demonstrated excellent test-retest reliability and may therefore be useful for clinical assessment of functional status and fall risk. Copyright © 2014 Elsevier B.V. All rights reserved.
Reliability of a Computerized Neurocognitive Test in Baseline Concussion Testing of High School Athletes.

PubMed

MacDonald, James; Duerson, Drew

2015-07-01

Baseline assessments using computerized neurocognitive tests are frequently used in the management of sport-related concussions. Such testing is often done on an annual basis in a community setting. Reliability is a fundamental test characteristic that should be established for such tests. Our study examined the test-retest reliability of a computerized neurocognitive test in high school athletes over 1 year. Repeated measures design. Two American high schools. High school athletes (N = 117) participating in American football or soccer during the 2011-2012 and 2012-2013 academic years. All study participants completed 2 baseline computerized neurocognitive tests taken 1 year apart at their respective schools. The test measures performance on 4 cognitive tasks: identification speed (Attention), detection speed (Processing Speed), one card learning accuracy (Learning), and one back speed (Working Memory). Reliability was assessed by measuring the intraclass correlation coefficient (ICC) between the repeated measures of the 4 cognitive tasks. Pearson and Spearman correlation coefficients were calculated as a secondary outcome measure. The measure for identification speed performed best (ICC = 0.672; 95% confidence interval, 0.559-0.760) and the measure for one card learning accuracy performed worst (ICC = 0.401; 95% confidence interval, 0.237-0.542). All tests had marginal or low reliability. In a population of high school athletes, computerized neurocognitive testing performed in a community setting demonstrated low to marginal test-retest reliability on baseline assessments 1 year apart. Further investigation should focus on (1) improving the reliability of individual tasks tested, (2) controlling for external factors that might affect test performance, and (3) identifying the ideal time interval to repeat baseline testing in high school athletes. 
Computerized neurocognitive tests are used frequently in high school athletes, often within a model of baseline testing of asymptomatic individuals before the start of a sporting season. This study adds to the evidence that suggests in this population such testing may lack sufficient reliability to support clinical decision making.

Test-Retest Reliability of the Short-Form Survivor Unmet Needs Survey.

PubMed

Taylor, Karen; Bulsara, Max; Monterosso, Leanne

2018-01-01

Reliable and valid needs assessment measures are important assessment tools in cancer survivorship care. A new 30-item short-form version of the Survivor Unmet Needs Survey (SF-SUNS) was developed and validated with cancer survivors, including hematology cancer survivors; however, test-retest reliability has not been established. The objective of this study was to assess the test-retest reliability of the SF-SUNS with a cohort of lymphoma survivors (n = 40). Test-retest reliability of the SF-SUNS was conducted at two time points: baseline (time 1) and 5 days later (time 2). Test-retest data were collected from lymphoma cancer survivors (n = 40) in a large tertiary cancer center in Western Australia. Intraclass correlation analyses compared data at time 1 (baseline) and time 2 (5 days later). Cronbach's alpha analyses were performed to assess the internal consistency at both time points. The majority (23/30, 77%) of items achieved test-retest reliability scores 0.45-0.74 (fair to good). A high degree of overall internal consistency was demonstrated (time 1 = 0.92, time 2 = 0.95), with scores 0.65-0.94 across subscales for both time points. Mixed test-retest reliability of the SF-SUNS was established. Our results indicate the SF-SUNS is responsive to the changing needs of lymphoma cancer survivors. Routine use of cancer survivorship specific needs-based assessments is required in oncology care today. Nurses are well placed to administer these assessments and provide tailored information and resources. Further assessment of test-retest reliability in hematology and other cancer cohorts is warranted.
Measuring Nutrition Literacy in Spanish-Speaking Latinos: An Exploratory Validation Study.

PubMed

Gibbs, Heather D; Camargo, Juliana M T B; Owens, Sarah; Gajewski, Byron; Cupertino, Ana Paula

2017-11-21

Nutrition is important for preventing and treating chronic diseases highly prevalent among Latinos, yet no tool exists for measuring nutrition literacy among Spanish speakers. This study aimed to adapt the validated Nutrition Literacy Assessment Instrument for Spanish-speaking Latinos. This study was developed in two phases: adaptation and validity testing. Adaptation included translation, expert item content review, and interviews with Spanish speakers. For validity testing, 51 participants completed the Short Assessment of Health Literacy-Spanish (SAHL-S), the Nutrition Literacy Assessment Instrument in Spanish (NLit-S), and socio-demographic questionnaire. Validity and reliability statistics were analyzed. Content validity was confirmed with a Scale Content Validity Index of 0.96. Validity testing demonstrated NLit-S scores were strongly correlated with SAHL-S scores (r = 0.52, p < 0.001). Entire reliability was substantial at 0.994 (CI 0.992-0.996) and internal consistency was excellent (Cronbach's α = 0.92). The NLit-S demonstrates validity and reliability for measuring nutrition literacy among Spanish-speakers.
The 20 GHz solid state transmitter design, impatt diode development and reliability assessment

NASA Technical Reports Server (NTRS)

Picone, S.; Cho, Y.; Asmus, J. R.

1984-01-01

A single drift gallium arsenide (GaAs) Schottky barrier IMPATT diode and related components were developed. The IMPATT diode reliability was assessed. A proof of concept solid state transmitter design and a technology assessment study were performed. The transmitter design utilizes technology which, upon implementation, will demonstrate readiness for development of a POC model within the 1982 time frame and will provide an information base for flight hardware capable of deployment in a 1985 to 1990 demonstrational 30/20 GHz satellite communication system. Life test data for Schottky barrier GaAs diodes and grown junction GaAs diodes are described. The results demonstrate the viability of GaAs IMPATTs as high performance, reliable RF power sources which, based on the recommendation made herein, will surpass device reliability requirements consistent with a ten year spaceborne solid state power amplifier mission.
Reliability of the Timed Up and Go test and Ten-Metre Timed Walk Test in Pregnant Women with Pelvic Girdle Pain.

PubMed

Evensen, Natalie M; Kvåle, Alice; Braekken, Ingeborg H

2015-09-01

There is a lack of functional objective tests available to measure functional status in women with pelvic girdle pain (PGP). The purpose of this study was to establish test-retest and intertester reliability of the Timed Up and Go (TUG) test and Ten-metre Timed Walk Test (10mTWT) in pregnant women with PGP. A convenience sample of women was recruited over a 4-month period and tested on two occasions, 1 week apart to determine test-retest reliability. Intertester reliability was established between two assessors at the first testing session. Subjects were instructed to undertake the TUG and 10mTWT at maximum speed. One practise trial and two timed trials for each walking test were undertaken on Day 1 and one practise trial and one timed trial on Day 2. Seventeen women with PGP aged 31.1 years (SD [standard deviation] = 2.3) and 28.7 weeks pregnant (SD = 7.4) completed gait testing. Test-retest reliability using the intraclass correlation coefficient (ICC) was excellent for the TUG (0.88) and good for the 10mTWT (0.74). Intertester reliability was determined in the first 13 participants with excellent ICC values being found for both walking tests (TUG: 0.95; 10mTWT: 0.94). This study demonstrated that the TUG and 10mTWT undertaken at fast pace are reliable, objective functional tests in pregnant women with PGP. While both tests are suitable for use in the clinical and research settings, we would recommend the TUG given the findings of higher test-retest reliability and as this test requires less space and time to set up and score. Future studies in a larger sample size are warranted to confirm the results of this study. Copyright © 2015 John Wiley & Sons, Ltd.
Investigation of four self-report instruments (FABT, TSK-HC, Back-PAQ, HC-PAIRS) to measure healthcare practitioners' attitudes and beliefs toward low back pain: Reliability, convergent validity and survey of New Zealand osteopaths and manipulative physiotherapists.

PubMed

Moran, Robert W; Rushworth, Wendy M; Mason, Jesse

2017-12-01

Healthcare practitioner beliefs influence advice and management provided to patients with back pain. Several instruments measuring practitioner beliefs have been developed but psychometric properties for some have not been investigated. To investigate internal consistency, test-retest reliability and convergent validity of the Fear Avoidance Beliefs Tool (FABT), the Tampa Scale of Kinesiophobia for Health Care Providers (TSK-HC), the Back Pain Attitudes Questionnaire (Back-PAQ), and the Health Care Pain and Impairment Relationship Scale (HC-PAIRS). A secondary aim was to explore beliefs of New Zealand osteopaths and physiotherapists regarding low back pain. FABT, TSK-HC, Back-PAQ, and HC-PAIRS were administered twice, 14 days apart. Data from 91 osteopaths and 35 physiotherapists were analysed. The FABT, TSK-HC and Back-PAQ each demonstrated excellent internal consistency, (Cronbach's α = 0.92, 0.91, and 0.91 respectively), and excellent test-retest reliability (lower limit of 95% CI for intraclass correlation coefficient >0.75). Correlations between instruments (Pearson's r = 0.51 to 0.77, p < 0.001) demonstrated good convergent validity. There was a medium to large effect (Cohen's d > 0.47) for mean differences in scores, for all instruments, between professions. This study found excellent internal consistency, test-retest reliability and good convergent validity for the FABT, TSK-HC, and Back-PAQ. Previously reported internal consistency, test-retest and convergent validity of the HC-PAIRS were confirmed, and test-retest reliability was excellent. There were significant scoring differences on each instrument between professions, and while both groups demonstrated fear avoidant beliefs, physiotherapist respondent scores indicated that as a group, they held fewer fear-avoidant beliefs than osteopath respondents. Copyright © 2017 Elsevier Ltd. All rights reserved.
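As a rough illustration of the internal-consistency statistic reported in the abstract above, the following Python sketch computes Cronbach's α from a respondents-by-items score matrix. The function name `cronbach_alpha` and the response data are hypothetical and are not taken from the study.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondents, each a list of k item scores."""
    k = len(scores[0])
    # Sample variance of each item across respondents
    item_vars = [variance([row[i] for row in scores]) for i in range(k)]
    # Sample variance of each respondent's total score
    total_var = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


# Hypothetical responses: 5 respondents x 4 items (illustration only)
data = [
    [3, 4, 3, 4],
    [2, 2, 3, 2],
    [5, 4, 5, 5],
    [1, 2, 1, 2],
    [4, 4, 4, 3],
]
alpha = cronbach_alpha(data)
print(f"Cronbach's alpha = {alpha:.2f}")
```

Alpha approaches 1 when items rise and fall together across respondents, which is why values above roughly 0.9 are usually read as excellent internal consistency.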
Validity and reliability of a nutrition knowledge survey for assessment in elementary school children.

PubMed

Gower, Jared R; Moyer-Mileur, Laurie J; Wilkinson, Robert D; Slater, Hillarie; Jordan, Kristine C

2010-03-01

Limited surveys are available to assess the nutrition knowledge of children. The goals of this study were to test the validity and reliability of a computer nutrition knowledge survey for elementary school students and to evaluate the impact of the "Fit Kids 'r' Healthy Kids" nutrition intervention via the knowledge survey. During survey development, a sample (n=12) of health educators, elementary school teachers, and registered dietitians assessed the survey. The target population consisted of first- through fourth-grade students from Salt Lake City, UT, metropolitan area schools. Participants were divided into reliability (n=68), intervention (n=74), and control groups (n=59). The reliability group took the survey twice (2 weeks apart); the intervention and control groups also took the survey twice, but at pre- and post-intervention (4 weeks later). Only students from the intervention group participated in four weekly nutrition classes. Reliability was assessed by Pearson's correlation coefficients for knowledge scores. Results demonstrated appropriate content validity, as indicated by expert peer ratings. Test-retest reliability correlations were found to be significant for the overall survey (r=0.54; P<0.001) and for all subscales: food groups, healthful foods, and food functions (r=0.51, 0.65, and 0.49, respectively; P<0.001). Nutrition knowledge was assessed upon program completion with paired samples t tests. Students from the intervention group demonstrated improvement in nutrition knowledge (12.2+/-1.9 to 13.5+/-1.6; P<0.001), while scores for the control group remained unchanged. The difference in total scores from pre- to post-intervention between the two groups was significant (P<0.001). These results suggest that the computerized nutrition survey demonstrated content validity and test-retest reliability for first- through fourth-grade elementary school children. 
Also, the study results imply that the Fit Kids 'r' Healthy Kids intervention promoted gains in nutrition knowledge. Overall, the computer survey shows promise as an appealing medium for assessing nutrition knowledge in children. Copyright 2010 American Dietetic Association. Published by Elsevier Inc. All rights reserved.
Short-distance walking speed tests in people with Parkinson disease: reliability, responsiveness, and validity.

PubMed

Combs, Stephanie A; Diehl, M Dyer; Filip, Jacqueline; Long, Erin

2014-02-01

The aims of this study were to determine test-retest reliability and responsiveness of short-distance walking speed tests for persons with Parkinson disease (PD). Discriminant and convergent validity of walking speed tests were also examined. Eighty-eight participants with PD (mean age, 66 years) with mild to moderate severity (stages 1-4 on the Hoehn and Yahr Scale) were tested on medications. Measures of activity included the comfortable and fast 10-m walk tests (CWT, FWT), 6-min walk test (6MWT), mini balance evaluations systems test (mini-BEST Test), fear of falling (FoF), and the Activity-Specific Balance Confidence Scale (ABC). The mobility subsection of the PD quality of life-39 (PDQ39-M) served as a participation-based measure. Test-retest reliability was high for both walking speed measures (CWT, ICC(2,1) = 0.98; FWT, ICC(2,1) = 0.99). Minimal detectable change (MDC(95)) for the CWT and FWT was 0.09 m/s and 0.13 m/s respectively. Participants at Hoehn & Yahr levels 3/4 demonstrated significantly slower walking speed with the CWT and FWT than participants at Hoehn & Yahr levels 1 and 2 (P < .01). The CWT and FWT were both significantly (P ≤ .002) correlated with all activity and participation-based measures. Short-distance walking speed tests are clinically useful measures for persons with PD. The CWT and FWT are highly reliable and responsive to change in persons with PD. Short distance walking speed can be used to discriminate differences in gait function between persons with mild and moderate PD severity. The CWT and FWT had moderate to strong associations with other activity and participation based measures demonstrating convergent validity. Copyright © 2013 Elsevier B.V. All rights reserved.
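The ICC(2,1) values quoted above come from a two-way random-effects, absolute-agreement, single-measurement model. A minimal sketch of that calculation, using the Shrout and Fleiss mean-square formula, is given below. The function name `icc_2_1` and the walking-speed data are hypothetical; the study's own data and software are not described in the abstract.

```python
def icc_2_1(ratings):
    """ICC(2,1) for an n-subjects x k-sessions table (list of equal-length lists)."""
    n = len(ratings)
    k = len(ratings[0])
    grand = sum(sum(row) for row in ratings) / (n * k)
    row_means = [sum(row) / k for row in ratings]
    col_means = [sum(row[j] for row in ratings) / n for j in range(k)]

    # Two-way ANOVA sums of squares
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)   # between subjects
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)   # between sessions
    ss_total = sum((x - grand) ** 2 for row in ratings for x in row)
    ss_err = ss_total - ss_rows - ss_cols                    # residual

    bms = ss_rows / (n - 1)
    jms = ss_cols / (k - 1)
    ems = ss_err / ((n - 1) * (k - 1))
    return (bms - ems) / (bms + (k - 1) * ems + k * (jms - ems) / n)


# Hypothetical comfortable walking speeds (m/s), 5 participants x 2 sessions
speeds = [
    [1.10, 1.12],
    [0.85, 0.83],
    [1.30, 1.33],
    [0.95, 0.97],
    [1.20, 1.18],
]
print(f"ICC(2,1) = {icc_2_1(speeds):.3f}")
```

Because the between-session mean square enters the denominator, a systematic shift between sessions lowers ICC(2,1) even when participants keep the same rank order.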
Reliability modelling and analysis of thermal MEMS

NASA Astrophysics Data System (ADS)

Muratet, Sylvaine; Lavu, Srikanth; Fourniols, Jean-Yves; Bell, George; Desmulliez, Marc P. Y.

2006-04-01

This paper presents a MEMS reliability study methodology based on the novel concept of 'virtual prototyping'. This methodology can be used for the development of reliable sensors or actuators and also to characterize their behaviour in specific use conditions and applications. The methodology is demonstrated on the U-shaped micro electro thermal actuator used as test vehicle. To demonstrate this approach, a 'virtual prototype' has been developed with the modeling tools MatLab and VHDL-AMS. A best practice FMEA (Failure Mode and Effect Analysis) is applied on the thermal MEMS to investigate and assess the failure mechanisms. Reliability study is performed by injecting the identified defaults into the 'virtual prototype'. The reliability characterization methodology predicts the evolution of the behavior of these MEMS as a function of the number of cycles of operation and specific operational conditions.
Assessment of reliability, validity, responsiveness and minimally important change of the German Hip dysfunction and osteoarthritis outcome score (HOOS) in patients with osteoarthritis of the hip.

PubMed

Arbab, Dariusch; van Ochten, Johannes H M; Schnurr, Christoph; Bouillon, Bertil; König, Dietmar

2017-12-01

Patient-reported outcome measures are a critical tool in evaluating the efficacy of orthopedic procedures. The intention of this study was to evaluate reliability, validity, responsiveness and minimally important change of the German version of the Hip dysfunction and osteoarthritis outcome score (HOOS). The German HOOS was investigated in 251 consecutive patients before and 6 months after total hip arthroplasty. All patients completed HOOS, Oxford-Hip Score (OHS), Short-Form (SF-36) and numeric scales for pain and disability. Test-retest reliability, internal consistency, floor and ceiling effects, construct validity and minimal important change were analyzed. The German HOOS demonstrated excellent test-retest reliability with intraclass correlation coefficient values > 0.7. Cronbach's alpha values demonstrated strong internal consistency. As hypothesized, HOOS subscales strongly correlated with corresponding OHS and SF-36 domains. All subscales showed excellent (effect size/standardized response means > 0.8) responsiveness between preoperative assessment and postoperative follow-up. The HOOS and all subdomains showed changes larger than the minimal detectable change, which indicates true change. The German version of the HOOS demonstrated good psychometric properties. It proved to be a valid, reliable, and responsive instrument for use in patients with hip osteoarthritis undergoing total hip replacement.
Reliability and Validity of the Korean Version of the Internet Addiction Test among College Students

PubMed Central

Lee, Kounseok; Lee, Hye-Kyung; Gyeong, Hyunsu; Yu, Byeongkwan; Song, Yul-Mai

2013-01-01

We developed a Korean translation of the Internet Addiction Test (KIAT), a widely used self-report measure of internet addiction, and tested its reliability and validity in a sample of college students. Two hundred seventy-nine college students at a national university completed the KIAT. Internal consistency and two-week test-retest reliability were calculated from the data, and principal component factor analysis was conducted. Participants also completed the Internet Addiction Diagnostic Questionnaire (IADQ), the Korea Internet addiction scale (K-scale), and the Patient Health Questionnaire-9 for the criterion validity. Cronbach's alpha of the whole scale was 0.91, and test-retest reliability was also good (r = 0.73). The IADQ, the K-scale, and depressive symptoms were significantly correlated with the KIAT scores, demonstrating concurrent and convergent validity. The factor analysis extracted four factors (Excessive use, Dependence, Withdrawal, and Avoidance of reality) that accounted for 59% of total variance. The KIAT has outstanding internal consistency and high test-retest reliability. Also, the factor structure and validity data show that the KIAT is comparable to the original version. Thus, the KIAT is a psychometrically sound tool for assessing internet addiction in the Korean-speaking population. PMID:23678270
The 10m incremental shuttle walk test is a highly reliable field exercise test for patients referred to cardiac rehabilitation: a retest reliability study.

PubMed

Hanson, Lisa C; Taylor, Nicholas F; McBurney, Helen

2016-09-01

To determine the retest reliability of the 10m incremental shuttle walk test (ISWT) in a mixed cardiac rehabilitation population. Participants completed two 10m ISWTs in a single session in a repeated measures study. Ten participants completed a third 10m ISWT as part of a pilot study. Hospital physiotherapy department. Sixty-two adults with a mean age of 68 years (SD 10) referred to a cardiac rehabilitation program. Retest reliability of the 10m ISWT expressed as relative reliability and measurement error. Relative reliability was expressed as a ratio in the form of an intraclass correlation coefficient (ICC) and measurement error in the form of the standard error of measurement (SEM) and 95% confidence intervals for the group and individual. There was a high level of relative reliability over the two walks with an ICC of .99. The SEM(agreement) was 17m, and a change of at least 23m for the group and 54m for the individual would be required to be 95% confident of exceeding measurement error. The 10m ISWT demonstrated good retest reliability and is sufficiently reliable to be applied in practice in this population without the use of a practice test. Copyright © 2015 Chartered Society of Physiotherapy. Published by Elsevier Ltd. All rights reserved.
Reliability of perceived neighbourhood conditions and the effects of measurement error on self-rated health across urban and rural neighbourhoods.

PubMed

Pruitt, Sandi L; Jeffe, Donna B; Yan, Yan; Schootman, Mario

2012-04-01

Limited psychometric research has examined the reliability of self-reported measures of neighbourhood conditions, the effect of measurement error on associations between neighbourhood conditions and health, and potential differences in the reliabilities between neighbourhood strata (urban vs rural and low vs high poverty). We assessed overall and stratified reliability of self-reported perceived neighbourhood conditions using five scales (social and physical disorder, social control, social cohesion, fear) and four single items (multidimensional neighbouring). We also assessed measurement error-corrected associations of these conditions with self-rated health. Using random-digit dialling, 367 women without breast cancer (matched controls from a larger study) were interviewed twice, 2-3 weeks apart. Test-retest (intraclass correlation coefficients (ICC)/weighted κ) and internal consistency reliability (Cronbach's α) were assessed. Differences in reliability across neighbourhood strata were tested using bootstrap methods. Regression calibration corrected estimates for measurement error. All measures demonstrated satisfactory internal consistency (α ≥ 0.70) and either moderate (ICC/κ=0.41-0.60) or substantial (ICC/κ=0.61-0.80) test-retest reliability in the full sample. Internal consistency did not differ by neighbourhood strata. Test-retest reliability was significantly lower among rural (vs urban) residents for two scales (social control, physical disorder) and two multidimensional neighbouring items; test-retest reliability was higher for physical disorder and lower for one multidimensional neighbouring item among the high (vs low) poverty strata. After measurement error correction, the magnitude of associations between neighbourhood conditions and self-rated health were larger, particularly in the rural population. Research is needed to develop and test reliable measures of perceived neighbourhood conditions relevant to the health of rural populations.
Accuracy and reliability of peer assessment of athletic training psychomotor laboratory skills.

PubMed

Marty, Melissa C; Henning, Jolene M; Willse, John T

2010-01-01

Peer assessment is defined as students judging the level or quality of a fellow student's understanding. No researchers have yet demonstrated the accuracy or reliability of peer assessment in athletic training education. To determine the accuracy and reliability of peer assessment of athletic training students' psychomotor skills. Cross-sectional study. Entry-level master's athletic training education program. First-year (n = 5) and second-year (n = 8) students. Participants evaluated 10 videos of a peer performing 3 psychomotor skills (middle deltoid manual muscle test, Faber test, and Slocum drawer test) on 2 separate occasions using a valid assessment tool. Accuracy of each peer-assessment score was examined through percentage correct scores. We used a generalizability study to determine how reliable athletic training students were in assessing a peer performing the aforementioned skills. Decision studies using generalizability theory demonstrated how the peer-assessment scores were affected by the number of participants and number of occasions. Participants had a high percentage of correct scores: 96.84% for the middle deltoid manual muscle test, 94.83% for the Faber test, and 97.13% for the Slocum drawer test. They were not able to reliably assess a peer performing any of the psychomotor skills on only 1 occasion. However, the φ increased (exceeding the 0.70 minimal standard) when 2 participants assessed the skill on 3 occasions (φ = 0.79) for the Faber test, with 1 participant on 2 occasions (φ = 0.76) for the Slocum drawer test, and with 3 participants on 2 occasions for the middle deltoid manual muscle test (φ = 0.72). Although students did not detect all errors, they assessed their peers with an average of 96% accuracy. Having only 1 student assess a peer performing certain psychomotor skills was less reliable than having more than 1 student assess those skills on more than 1 occasion. 
Peer assessment of psychomotor skills could be an important part of the learning process and a tool to supplement instructor assessment.
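The decision-study reasoning in the abstract above (how φ changes with the number of assessors and occasions) can be sketched as follows. This assumes a fully crossed persons x raters x occasions design, which the abstract does not confirm. The function name `phi_coefficient` and all variance components are hypothetical illustration values, not the study's estimates.

```python
def phi_coefficient(var, n_raters, n_occasions):
    """Index of dependability (phi) for a crossed p x r x o design.

    var: dict of estimated variance components keyed by
         'p', 'r', 'o', 'pr', 'po', 'ro', 'pro,e'.
    """
    cells = n_raters * n_occasions
    # Absolute error variance: every non-person component, averaged over the
    # number of raters and/or occasions it is sampled across
    absolute_error = (
        var["r"] / n_raters
        + var["o"] / n_occasions
        + var["pr"] / n_raters
        + var["po"] / n_occasions
        + var["ro"] / cells
        + var["pro,e"] / cells
    )
    return var["p"] / (var["p"] + absolute_error)


# Hypothetical variance components (illustration only)
components = {"p": 0.50, "r": 0.05, "o": 0.02, "pr": 0.10,
              "po": 0.05, "ro": 0.01, "pro,e": 0.30}

for raters, occasions in [(1, 1), (1, 2), (2, 3), (3, 2)]:
    phi = phi_coefficient(components, raters, occasions)
    print(f"{raters} rater(s), {occasions} occasion(s): phi = {phi:.2f}")
```

Averaging over more raters or occasions shrinks the error terms, so φ rises toward 1. That matches the pattern the abstract reports, where a single assessment on one occasion fell short of the 0.70 standard.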
Intrarater and interrater reliability of the Anteromedial Reach Test in healthy participants

PubMed Central

Bent, Nicholas P; Rushton, Alison B; Wright, Chris C; Petherick, Emma-Jane; Batt, Mark E

2014-01-01

Background The Anteromedial Reach Test is a performance-based outcome measure for evaluating dynamic knee stability in patients with anterior cruciate ligament injury. No previously published study has adequately evaluated intrarater or interrater reliability of the Anteromedial Reach Test, so the purpose of this study was to assess these measurement properties in healthy participants prior to their investigation in patients with anterior cruciate ligament injury. Methods Two raters (A and B) tested 39 healthy university staff and students (20 men, 19 women). For the intrarater reliability investigation, rater A tested participants on three separate test occasions (days 1, 2, and 3) at the same time of day. For the interrater reliability investigation, raters A and B independently tested participants on the same test occasion (day 3). Results There was no significant systematic bias between test occasions or raters. Values of the intraclass correlation coefficient (2,1) were 0.96 for intrarater reliability of both the dominant leg and nondominant leg and 0.97 (dominant leg) and 0.98 (nondominant leg) for interrater reliability. Values for the standard error of measurement were 1.46 (dominant leg) and 1.62 (nondominant leg) for the intrarater investigation, and 1.26 (dominant leg) and 1.04 (nondominant leg) for the interrater investigation. At the 90% confidence level, the minimum detectable change was 3.8% and the error in an individual’s score at a given point in time was ±2.7%. Conclusion The Anteromedial Reach Test demonstrated excellent intrarater and interrater reliability in healthy participants. This provides a basis for future investigation of the measurement properties of the Anteromedial Reach Test in patients with anterior cruciate ligament injury. PMID:24648776
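The step from a standard error of measurement (SEM) to the two error figures in the abstract above can be illustrated with the usual formulas: error band = z x SEM and minimum detectable change = z x √2 x SEM. The sketch below assumes these conventional formulas. The abstract does not state which SEM the authors used, so the input of 1.62 (the reported nondominant-leg intrarater value) is an assumption, and the function names are hypothetical.

```python
from math import sqrt
from statistics import NormalDist


def error_band(sem, confidence):
    """Half-width of the confidence band around a single observed score."""
    z = NormalDist().inv_cdf(0.5 + confidence / 2)  # two-sided critical value
    return z * sem


def minimum_detectable_change(sem, confidence):
    """Smallest change between two measurements that exceeds measurement error."""
    # sqrt(2) accounts for error being present in both measurements
    return error_band(sem, confidence) * sqrt(2)


sem = 1.62  # assumed input; reported nondominant-leg intrarater SEM
band = error_band(sem, 0.90)
mdc = minimum_detectable_change(sem, 0.90)
print(f"error band = +/-{band:.1f}, MDC90 = {mdc:.1f}")
```

With this input the sketch gives figures close to the ±2.7% and 3.8% quoted in the abstract, though that agreement does not establish how the authors actually derived them.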
Multiple Choice Testing and the Retrieval Hypothesis of the Testing Effect

ERIC Educational Resources Information Center

Sensenig, Amanda E.

2010-01-01

Taking a test often leads to enhanced later memory for the tested information, a phenomenon known as the "testing effect". This memory advantage has been reliably demonstrated with recall tests but not multiple choice tests. One potential explanation for this finding is that multiple choice tests do not rely on retrieval processes to the same…
Quantitative Accelerated Life Testing of MEMS Accelerometers

PubMed Central

Bâzu, Marius; Gălăţeanu, Lucian; Ilian, Virgil Emil; Loicq, Jerome; Habraken, Serge; Collette, Jean-Paul

2007-01-01

Quantitative Accelerated Life Testing (QALT) is a solution for assessing the reliability of Micro Electro Mechanical Systems (MEMS). A procedure for QALT is shown in this paper and an attempt to assess the reliability level for a batch of MEMS accelerometers is reported. The testing plan is application-driven and contains combined tests: thermal (high temperature) and mechanical stress. Two variants of mechanical stress are used: vibration (at a fixed frequency) and tilting. Original equipment for testing at tilting and high temperature is used. Tilting is appropriate as application-driven stress, because the tilt movement is a natural environment for devices used for automotive and aerospace applications. Also, tilting is used by MEMS accelerometers for anti-theft systems. The test results demonstrated the excellent reliability of the studied devices, the failure rate in the “worst case” being smaller than 10⁻⁷ h⁻¹. PMID:28903265
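The abstract above does not say how the worst-case failure-rate bound was obtained. One common way to demonstrate a bound of this kind is a zero-failure test under a constant-failure-rate (exponential) model, sketched below. The model, the acceleration factor, the sample size, and the test duration are all assumptions made for illustration. None of them come from the paper, and the function names are hypothetical.

```python
from math import log


def equivalent_device_hours(n_devices, test_hours, acceleration_factor):
    """Use-condition device-hours represented by an accelerated test."""
    return n_devices * test_hours * acceleration_factor


def failure_rate_upper_bound(device_hours, confidence=0.90):
    """One-sided upper bound on a constant failure rate after zero failures.

    With zero failures, P(no failure) = exp(-rate * T); setting this equal to
    1 - confidence and solving for the rate gives the bound.
    """
    if device_hours <= 0:
        raise ValueError("device_hours must be positive")
    return -log(1.0 - confidence) / device_hours


# Hypothetical test plan (illustration only)
hours = equivalent_device_hours(n_devices=50, test_hours=1000,
                                acceleration_factor=500)
bound = failure_rate_upper_bound(hours, confidence=0.90)
print(f"{hours:.2e} equivalent device-hours -> rate <= {bound:.2e} per hour")
```

Under these assumptions, demonstrating a rate below 10⁻⁷ h⁻¹ at 90% confidence takes roughly 2.3 x 10⁷ failure-free equivalent device-hours, which is why acceleration is needed to keep such tests practical.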
Structural Test Laboratory | Water Power | NREL

Science.gov Websites

NREL engineers design and configure structural components can validate models, demonstrate system reliability, inform design margins, and assess , including mass and center of gravity, to ensure compliance with design goals Dynamic Characterization Use
Using Penelope to assess the correctness of NASA Ada software: A demonstration of formal methods as a counterpart to testing

NASA Technical Reports Server (NTRS)

Eichenlaub, Carl T.; Harper, C. Douglas; Hird, Geoffrey

1993-01-01

Life-critical applications warrant a higher level of software reliability than has yet been achieved. Since it is not certain that traditional methods alone can provide the required ultra reliability, new methods should be examined as supplements or replacements. This paper describes a mathematical counterpart to the traditional process of empirical testing. ORA's Penelope verification system is demonstrated as a tool for evaluating the correctness of Ada software. Grady Booch's Ada calendar utility package, obtained through NASA, was specified in the Larch/Ada language. Formal verification in the Penelope environment established that many of the package's subprograms met their specifications. In other subprograms, failed attempts at verification revealed several errors that had escaped detection by testing.
A study of the development of the Korean version of PedsQL(TM) 3.0 cerebral palsy module and reliability and validity.

PubMed

Yun, Young-Ju; Shin, Yong-Beom; Kim, Soo-Yeon; Shin, Myung-Jun; Kim, Ra-Jin; Oh, Tae-Young

2016-07-01

[Purpose] The purpose of this study was to develop the Korean version of the PedsQL(TM) 3.0 Cerebral Palsy Module to evaluate the health-related quality of life of children with cerebral palsy and to test the reliability and validity. [Subjects and Methods] The study included 108 caregivers of children with cerebral palsy aged 2 to 4 years and 72 caregivers of children aged 5 to 7 years, who visited multiple sites between February and August 2015. The Translation Commission performed the first translation with the approval of the Mapi Research Trust Company to create a Korean-version of the PedsQL(TM). Afterwards, back-translation was performed by one translator specializing in health and medical treatment who was a native English-speaker fluent in Korean, and one native Korean-speaker fluent in English. The consistency of each question was confirmed and a translation-integrated version was created. Test components were explained to caregivers during a one-on-one interview; caregivers then completed the PedsQL(TM) questionnaire and a Pediatric Evaluation Disability Inventory (PEDI) questionnaire. Subjects contributing to test-retest measures were asked to repeat the PedsQL questionnaire one week later and return it by mail. To assess data quality for the survey question results, non-response rate, ceiling effect, and floor effect were analyzed. Test-retest reliability and internal consistency reliability were assessed. For test-retest reliability, an intraclass correlation coefficient (ICC) was calculated, and for internal consistency reliability, Cronbach's alpha was used. To test criterion-related validity, Pearson's correlation coefficient was used. [Results] The content validity of the PedsQL 3.0 Cerebral Palsy Module was high for both age groups, and demonstrated significant internal consistency (>0.7) in all areas. For test-retest reliability, both groups demonstrated a significant ICC (>0.61). 
Correlation with the PEDI was statistically significant in all areas except pain and hurt. [Conclusion] The Korean version of the PedsQL(TM) 3.0 Cerebral Palsy Module was found to be reliable and valid, and is expected to contribute greatly to the evaluation of the quality of life of children with cerebral palsy.
Reliability of TMS metrics in patients with chronic incomplete spinal cord injury.

PubMed

Potter-Baker, K A; Janini, D P; Frost, F S; Chabra, P; Varnerin, N; Cunningham, D A; Sankarasubramanian, V; Plow, E B

2016-11-01

Test-retest reliability analysis in individuals with chronic incomplete spinal cord injury (iSCI). The purpose of this study was to examine the reliability of neurophysiological metrics acquired with transcranial magnetic stimulation (TMS) in individuals with chronic incomplete tetraplegia. Cleveland Clinic Foundation, Cleveland, Ohio, USA. TMS metrics of corticospinal excitability, output, inhibition and motor map distribution were collected in muscles with a higher MRC grade and muscles with a lower MRC grade on the more affected side of the body. Metrics denoting upper limb function were also collected. All metrics were collected at two sessions separated by a minimum of two weeks. Reliability between sessions was determined using Spearman's correlation coefficients and concordance correlation coefficients (CCCs). We found that TMS metrics that were acquired in higher MRC grade muscles were approximately two times more reliable than those collected in lower MRC grade muscles. TMS metrics of motor map output, however, demonstrated poor reliability regardless of muscle choice (P=0.34; CCC=0.51). Correlation analysis indicated that patients with more baseline impairment and/or those in a more chronic phase of iSCI demonstrated greater variability of metrics. In iSCI, reliability of TMS metrics varies depending on the muscle grade of the tested muscle. Variability is also influenced by factors such as baseline motor function and time post SCI. Future studies that use TMS metrics in longitudinal study designs to understand functional recovery should be cautious as choice of muscle and clinical characteristics can influence reliability.

The Reliability and Validity of Measures of Gait Variability in Community-Dwelling Older Adults

PubMed Central

Brach, Jennifer S.; Perera, Subashan; Studenski, Stephanie; Newman, Anne B.

2009-01-01

Objective To examine the test-retest reliability and concurrent validity of variability of gait characteristics. Design Cross-sectional study. Setting Research laboratory. Participants Older adults (N=558) from the Cardiovascular Health Study. Interventions Not applicable. Main Outcome Measures Gait characteristics were measured using a 4-m computerized walkway. SDs determined from the steps recorded were used as the measures of variability. Intraclass correlation coefficients (ICC) were calculated to examine test-retest reliability of a 4-m walk and two 4-m walks. To establish concurrent validity, the measures of gait variability were compared across levels of health, functional status, and physical activity using independent t tests and analysis of variance. Results Gait variability measures from the two 4-m walks demonstrated greater test-retest reliability than those from the single 4-m walk (ICC=.40–.63 and ICC=.22–.48, respectively). Greater step length and stance time variability were associated with poorer health, functional status and physical activity (P<.05). Conclusions Gait variability calculated from a limited number of steps has fair to good test-retest reliability and concurrent validity. Reliability of gait variability calculated from a greater number of steps should be assessed to determine if the consistency can be improved. PMID:19061741
Natural Gas Engine-Driven Heat Pump Demonstration at DoD Installations: Performance and Reliability Summary

DTIC Science & Technology

2009-06-09

ERDC/CERL TR-09-10. Natural Gas Engine-Driven Heat Pump Demonstration at DoD Installations: Performance and Reliability Summary ... Laboratory. Approved for public release; distribution is unlimited. ERDC/CERL TR-09-10, June 2009. Natural Gas Engine-Driven Heat Pump ... Abstract: Results of field testing natural gas engine-driven heat pumps (GHP) at six southwestern U.S. Department of Defense (DoD
Monitoring sedation status over time in ICU patients: reliability and validity of the Richmond Agitation-Sedation Scale (RASS).

PubMed

Ely, E Wesley; Truman, Brenda; Shintani, Ayumi; Thomason, Jason W W; Wheeler, Arthur P; Gordon, Sharon; Francis, Joseph; Speroff, Theodore; Gautam, Shiva; Margolin, Richard; Sessler, Curtis N; Dittus, Robert S; Bernard, Gordon R

2003-06-11

Goal-directed delivery of sedative and analgesic medications is recommended as standard care in intensive care units (ICUs) because of the impact these medications have on ventilator weaning and ICU length of stay, but few of the available sedation scales have been appropriately tested for reliability and validity. To test the reliability and validity of the Richmond Agitation-Sedation Scale (RASS). Prospective cohort study. Adult medical and coronary ICUs of a university-based medical center. Thirty-eight medical ICU patients enrolled for reliability testing (46% receiving mechanical ventilation) from July 21, 1999, to September 7, 1999, and an independent cohort of 275 patients receiving mechanical ventilation were enrolled for validity testing from February 1, 2000, to May 3, 2001. Interrater reliability of the RASS, Glasgow Coma Scale (GCS), and Ramsay Scale (RS); validity of the RASS correlated with reference standard ratings, assessments of content of consciousness, GCS scores, doses of sedatives and analgesics, and bispectral electroencephalography. In 290-paired observations by nurses, results of both the RASS and RS demonstrated excellent interrater reliability (weighted kappa, 0.91 and 0.94, respectively), which were both superior to the GCS (weighted kappa, 0.64; P<.001 for both comparisons). Criterion validity was tested in 411-paired observations in the first 96 patients of the validation cohort, in whom the RASS showed significant differences between levels of consciousness (P<.001 for all) and correctly identified fluctuations within patients over time (P<.001). 
In addition, 5 methods were used to test the construct validity of the RASS, including correlation with an attention screening examination (r = 0.78, P<.001), GCS scores (r = 0.91, P<.001), quantity of different psychoactive medication dosages 8 hours prior to assessment (eg, lorazepam: r = -0.31, P<.001), successful extubation (P = .07), and bispectral electroencephalography (r = 0.63, P<.001). Face validity was demonstrated via a survey of 26 critical care nurses, in which 92% agreed or strongly agreed with the RASS scoring scheme and 81% agreed or strongly agreed that the instrument provided a consensus for goal-directed delivery of medications. The RASS demonstrated excellent interrater reliability and criterion, construct, and face validity. This is the first sedation scale to be validated for its ability to detect changes in sedation status over consecutive days of ICU care, against constructs of level of consciousness and delirium, and correlated with the administered dose of sedative and analgesic medications.
Impact of Measurement Error on Statistical Power: Review of an Old Paradox.

ERIC Educational Resources Information Center

Williams, Richard H.; And Others

1995-01-01

The paradox that a Student t-test based on pretest-posttest differences can attain its greatest power when the difference score reliability is zero was explained by demonstrating that power is not a mathematical function of reliability unless either true score variance or error score variance is constant. (SLD)
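The point made in the abstract above can be illustrated numerically. What follows is a minimal sketch, not the authors' analysis: it assumes each difference score is a true change plus independent error, uses a normal approximation to the power of a two-sided test on the mean difference, and the function names (`reliability`, `approx_power`) and all variances are hypothetical.

```python
from math import sqrt
from statistics import NormalDist


def reliability(true_var, error_var):
    """Reliability of the difference score: true variance over observed variance."""
    return true_var / (true_var + error_var)


def approx_power(mean_diff, true_var, error_var, n, alpha=0.05):
    """Normal approximation to the power of a two-sided test that the
    mean difference score is zero (assumed model: D = true change + error)."""
    z = NormalDist()
    # Standardized effect: mean difference over its standard error.
    delta = mean_diff * sqrt(n) / sqrt(true_var + error_var)
    z_crit = z.inv_cdf(1 - alpha / 2)
    return z.cdf(delta - z_crit) + z.cdf(-delta - z_crit)


if __name__ == "__main__":
    # Error variance held constant: reliability rises, power falls.
    for true_var in (0.0, 4.0, 12.0):
        print("fixed error:", reliability(true_var, 4.0),
              round(approx_power(1.0, true_var, 4.0, 20), 3))
    # True-score variance held constant: reliability rises, power rises.
    for error_var in (12.0, 4.0, 0.0):
        print("fixed true: ", reliability(4.0, error_var),
              round(approx_power(1.0, 4.0, error_var, 20), 3))
```

Under these assumptions, power depends only on the total observed variance, so the same reliability value can accompany high or low power depending on which variance component is held fixed.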
Anthropological and Psychological Merge: Design of a Stress Measure for Mexican Farmworkers

PubMed Central

Thompson, Beti; O'Connor, Kathleen; Godina, Ruby; Ibarra, Genoveva

2010-01-01

This study implements qualitative and quantitative methodologies in the development of a culturally appropriate instrument of stress for Mexican immigrant farmworkers. Focus groups were used to uncover culturally based perspectives on life stressors, definitions of stress, and stress mediators. Qualitative data were analyzed using QSR NVivo and then used to develop a 23-item stress scale. The scale was tested for reliability and validity in an independent sample and demonstrates excellent reliability (α = 0.9123). Test-retest coefficients of the stress scale are also strong (r = 0.8344, p = 0.0000). Qualitative analyses indicated three major sources of stress: work, family, and community. Emotional aspects of stress also emerged, demonstrating a cultural perspective of stress closely related to feelings of despair and not being able to find a way out of despairing situations. This paper reveals themes gathered from the qualitative data and identifies reliability and validity constructs associated with the scale. The stress scale developed as part of this investigation is a reliable and culturally appropriate instrument for assessing stress among Mexican immigrant farmworkers. PMID:17955350
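For readers unfamiliar with the internal-consistency coefficient (α) reported above, here is a minimal sketch of how Cronbach's alpha is commonly computed from item-level scores. It is not the authors' code; the function name `cronbach_alpha` and the small data set are hypothetical, and sample variances are assumed.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondents, each a list of k item scores.

    alpha = k / (k - 1) * (1 - sum of item variances / variance of total score)
    """
    k = len(scores[0])
    item_vars = [variance([row[i] for row in scores]) for i in range(k)]
    total_var = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


if __name__ == "__main__":
    # Hypothetical responses: 5 respondents, 4 items on a 1-5 scale.
    responses = [
        [4, 5, 4, 4],
        [2, 2, 3, 2],
        [5, 4, 5, 5],
        [1, 2, 1, 2],
        [3, 3, 4, 3],
    ]
    print(round(cronbach_alpha(responses), 3))
```

Alpha approaches 1 when items rise and fall together across respondents, which is why it is read as a measure of internal consistency.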
The development and evaluation of a novel repurposing of a peripheral gaming device for the acquisition of forces applied to a hydraulic treatment plinth.

PubMed

Cooper, Darren; Bevins, Joe; Corbett, Mark

2018-01-13

This technical note details the stages taken to create an instrumented hydraulic treatment plinth for the measurement of applied forces in the vertical axis. The modification used a widely available low-cost peripheral gaming device and required only basic construction and computer skills. The instrumented treatment plinth was validated against a laboratory grade force platform across a range of applied masses from 0.5-15 kg, mock Gr I-IV vertebral mobilisations and a dynamic response test. Intraclass correlation coefficients demonstrated poor reliability (0.46) for low masses of 0.5 kg, improving to excellent for larger masses up to 15 kg; excellent to good reliability (0.97-0.86) for the mock mobilisations; and moderate reliability (0.51) for the dynamic response test. The study demonstrates how a cheap peripheral gaming device can be repurposed so that forces applied to a hydraulic treatment plinth can be collected reliably when applied in a clinically reasoned manner. Copyright © 2018 Elsevier Ltd. All rights reserved.
Test of Gross Motor Development-3 (TGMD-3) with the Use of Visual Supports for Children with Autism Spectrum Disorder: Validity and Reliability.

PubMed

Allen, K A; Bredero, B; Van Damme, T; Ulrich, D A; Simons, J

2017-03-01

The validity and reliability of the Test of Gross Motor Development-3 (TGMD-3) were measured, taking into consideration the preference for visual learning of children with autism spectrum disorder (ASD). The TGMD-3 was administered to 14 children with ASD (4-10 years) and 21 age-matched typically developing children under two conditions: TGMD-3 traditional protocol, and TGMD-3 visual support protocol. Excellent levels of internal consistency, test-retest, interrater and intrarater reliability were achieved for the TGMD-3 visual support protocol. TGMD-3 raw scores of children with ASD were significantly lower than those of typically developing peers; however, they improved significantly with the TGMD-3 visual support protocol. This demonstrates that the TGMD-3 visual support protocol is a valid and reliable assessment of gross motor performance for children with ASD.
[Evaluation of the reliability of freight elevator operators].

PubMed

Gosk, A; Borodulin-Nadzieja, L; Janocha, A; Salomon, E

1991-01-01

The study involved 58 workers employed at winding machines. Their reliability was estimated from the precision of psychomotor tests, the condition of the vegetative nervous system, and the results of psychological tests. The tests were carried out at the laboratory and at the workplaces, with all distracting factors and functional connections of the work process present. We have found that the reliability of the workers may be affected by a variety of factors. Among the winding machine operators, work monotony can lead to "monotony syndrome". Among the signalists, the awareness of great responsibility can lead to unpredictable and inadequate reactions. From both groups, persons displaying a lower-than-average precision were singled out. All those persons demonstrated a reckless attitude and the opinion of their superiors about them was poor. Those persons constitute a potential risk for the reliable operation of the discussed team.
Validity and reliability of Patient-Reported Outcomes Measurement Information System (PROMIS) Instruments in Osteoarthritis

PubMed Central

Broderick, Joan E.; Schneider, Stefan; Junghaenel, Doerte U.; Schwartz, Joseph E.; Stone, Arthur A.

2013-01-01

Objective Evaluation of known group validity, ecological validity, and test-retest reliability of four domain instruments from the Patient-Reported Outcomes Measurement Information System (PROMIS) in osteoarthritis (OA) patients. Methods Recruitment of an osteoarthritis sample and a comparison general population (GP) through an Internet survey panel. Pain intensity, pain interference, physical functioning, and fatigue were assessed for 4 consecutive weeks with PROMIS short forms on a daily basis and compared with same-domain Computer Adaptive Test (CAT) instruments that use a 7-day recall. Known group validity (comparison of OA and GP), ecological validity (comparison of aggregated daily measures with CATs), and test-retest reliability were evaluated. Results The recruited samples matched (age, sex, race, ethnicity) the demographic characteristics of the U.S. sample for arthritis and the 2009 Census for the GP. Compliance with repeated measurements was excellent: > 95%. Known group validity for CATs was demonstrated with large effect sizes (pain intensity: 1.42, pain interference: 1.25, and fatigue: .85). Ecological validity was also established through high correlations between aggregated daily measures and weekly CATs (≥ .86). Test-retest reliability (7-day) was very good (≥ .80). Conclusion PROMIS CAT instruments demonstrated known group and ecological validity in a comparison of osteoarthritis patients with a general population sample. Adequate test-retest reliability was also observed. These data provide encouraging initial data on the utility of these PROMIS instruments for clinical and research outcomes in osteoarthritis patients. PMID:23592494
Demonstrating Test-Retest Reliability of Electrophysiological Measures for Healthy Adults in a Multisite Study of Biomarkers of Antidepressant Treatment Response

PubMed Central

Tenke, Craig E.; Kayser, Jürgen; Pechtel, Pia; Webb, Christian A.; Dillon, Daniel G.; Goer, Franziska; Murray, Laura; Deldin, Patricia; Kurian, Benji T.; McGrath, Patrick J.; Parsey, Ramin; Trivedi, Madhukar; Fava, Maurizio; Weissman, Myrna M.; McInnis, Melvin; Abraham, Karen; Alvarenga, Jorge; Alschuler, Daniel M.; Cooper, Crystal; Pizzagalli, Diego A.; Bruder, Gerard E.

2016-01-01

Growing evidence suggests that loudness dependency of auditory evoked potentials (LDAEP) and resting EEG alpha and theta may be biological markers for predicting response to antidepressants. In spite of this promise, little is known about the joint reliability of these markers, and thus their clinical applicability. New, standardized procedures were developed to improve the compatibility of data acquired with different EEG platforms, and used to examine test-retest reliability for the three electrophysiological measures selected for a multisite project—Establishing Moderators and Biosignatures of Antidepressant Response for Clinical Care (EMBARC). Thirty nine healthy controls across four clinical research sites were tested in two sessions separated by about one week. Resting EEG (eyes-open and eyes-closed conditions) was recorded and LDAEP measured using binaural tones (1000 Hz, 40 ms) at five intensities (60–100 dB SPL). Principal components analysis (PCA) of current source density (CSD) waveforms reduced volume conduction and provided reference-free measures of resting EEG alpha and N1 dipole activity to tones from auditory cortex. Low Resolution Electromagnetic Tomography (LORETA) extracted resting theta current density measures corresponding to rostral anterior cingulate (rACC), which has been implicated in treatment response. There were no significant differences in posterior alpha, N1 dipole or rACC theta across sessions. Test-retest reliability was .84 for alpha, .87 for N1 dipole, and .70 for theta rACC current density. The demonstration of good-to-excellent reliability for these measures provides a template for future EEG/ERP studies from multiple testing sites, and an important step for evaluating them as biomarkers for predicting treatment response. PMID:28000259
Demonstrating test-retest reliability of electrophysiological measures for healthy adults in a multisite study of biomarkers of antidepressant treatment response.

PubMed

Tenke, Craig E; Kayser, Jürgen; Pechtel, Pia; Webb, Christian A; Dillon, Daniel G; Goer, Franziska; Murray, Laura; Deldin, Patricia; Kurian, Benji T; McGrath, Patrick J; Parsey, Ramin; Trivedi, Madhukar; Fava, Maurizio; Weissman, Myrna M; McInnis, Melvin; Abraham, Karen; E Alvarenga, Jorge; Alschuler, Daniel M; Cooper, Crystal; Pizzagalli, Diego A; Bruder, Gerard E

2017-01-01

Growing evidence suggests that loudness dependency of auditory evoked potentials (LDAEP) and resting EEG alpha and theta may be biological markers for predicting response to antidepressants. In spite of this promise, little is known about the joint reliability of these markers, and thus their clinical applicability. New standardized procedures were developed to improve the compatibility of data acquired with different EEG platforms, and used to examine test-retest reliability for the three electrophysiological measures selected for a multisite project, Establishing Moderators and Biosignatures of Antidepressant Response for Clinical Care (EMBARC). Thirty-nine healthy controls across four clinical research sites were tested in two sessions separated by about 1 week. Resting EEG (eyes-open and eyes-closed conditions) was recorded and LDAEP measured using binaural tones (1000 Hz, 40 ms) at five intensities (60-100 dB SPL). Principal components analysis of current source density waveforms reduced volume conduction and provided reference-free measures of resting EEG alpha and N1 dipole activity to tones from auditory cortex. Low-resolution electromagnetic tomography (LORETA) extracted resting theta current density measures corresponding to rostral anterior cingulate (rACC), which has been implicated in treatment response. There were no significant differences in posterior alpha, N1 dipole, or rACC theta across sessions. Test-retest reliability was .84 for alpha, .87 for N1 dipole, and .70 for theta rACC current density. The demonstration of good-to-excellent reliability for these measures provides a template for future EEG/ERP studies from multiple testing sites, and an important step for evaluating them as biomarkers for predicting treatment response. © 2016 Society for Psychophysiological Research.
The retest reliability of the six-minute walk test in patients referred to a cardiac rehabilitation programme.

PubMed

Hanson, Lisa C; McBurney, Helen; Taylor, Nicholas F

2012-03-01

The purpose of this paper was to determine if the Six-minute Walk Test (6MWT) was a reliable exercise test for patients referred to cardiac rehabilitation when up to three tests were performed and to determine if test scores differed according to between-test time interval. Thirty adults aged 63 ± 7.9 years referred to cardiac rehabilitation participated in a repeated measures reliability trial. Participants completed three 6MWTs within a one-week period. Participants were randomly allocated to one of three groups: on the first day, Group A completed three walks, Group B completed two walks and Group C completed one walk. Relative reliability was expressed as a ratio (ICC(2,1)), and absolute reliability was expressed in metres (95% confidence intervals) for group and individuals. The 6MWT demonstrated a high level of relative reliability (intraclass correlation coefficient [ICC] = 0.94) across the three walks. There was no statistically significant difference between the test scores of the three groups. However, there was an increase in distance walked from the first to the second to the third 6MWT. Absolute reliability indicated that a change of at least 44 m would be required to be interpreted as true change in a group, and at least 95 m to be interpreted as true change in an individual with 95% confidence. Three 6MWTs completed in relatively short timeframes were not sufficient for reliable results as there was an increase in the distance walked, and relatively large increases in distances would be required to be interpreted as change. It did not make any difference whether the tests were all completed on one day or over one week. This study highlighted problems that may arise when relying on reliability coefficients alone to interpret reliability. These results suggest that the 6MWT may not have sufficient reliability to be a suitable test to evaluate exercise tolerance in patients referred to cardiac rehabilitation. Copyright © 2011 John Wiley & Sons, Ltd.
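The relative reliability statistic named in the abstract above, ICC(2,1), can be sketched as follows. This is an illustrative implementation of the standard two-way random-effects, absolute-agreement, single-measurement form computed from ANOVA mean squares; it is not the study's code, the function name `icc_2_1` is hypothetical, and the walk distances are made up.

```python
def icc_2_1(scores):
    """ICC(2,1): two-way random effects, absolute agreement, single measurement.

    scores: list of n subjects, each a list of k repeated measurements.
    """
    n = len(scores)
    k = len(scores[0])
    grand = sum(sum(row) for row in scores) / (n * k)
    row_means = [sum(row) / k for row in scores]
    col_means = [sum(row[j] for row in scores) / n for j in range(k)]

    # Sums of squares for subjects (rows), trials (columns), and residual.
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for row in scores for x in row)
    ss_error = ss_total - ss_rows - ss_cols

    ms_rows = ss_rows / (n - 1)
    ms_cols = ss_cols / (k - 1)
    ms_error = ss_error / ((n - 1) * (k - 1))

    return (ms_rows - ms_error) / (
        ms_rows + (k - 1) * ms_error + k * (ms_cols - ms_error) / n
    )


if __name__ == "__main__":
    # Hypothetical distances (metres) for 5 participants over 3 walks.
    walks = [
        [400, 420, 430],
        [510, 520, 535],
        [350, 365, 380],
        [460, 470, 490],
        [300, 310, 325],
    ]
    print(round(icc_2_1(walks), 3))
```

Because the between-trial mean square appears in the denominator, a systematic shift from one trial to the next lowers this coefficient even when participants keep the same rank order. The coefficient can nonetheless remain high when between-subject differences are large, which is one reason to report absolute reliability alongside it.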
Test-retest reliability and construct validity of the ENERGY-child questionnaire on energy balance-related behaviours and their potential determinants: the ENERGY-project.

PubMed

Singh, Amika S; Vik, Froydis N; Chinapaw, Mai J M; Uijtdewilligen, Léonie; Verloigne, Maïté; Fernández-Alvira, Juan M; Stomfai, Sarolta; Manios, Yannis; Martens, Marloes; Brug, Johannes

2011-12-09

Insight into children's energy balance-related behaviours (EBRBs) and their determinants is important to inform obesity prevention research. Therefore, reliable and valid tools to measure these variables in large-scale population research are needed. To examine the test-retest reliability and construct validity of the child questionnaire used in the ENERGY-project, measuring EBRBs and their potential determinants among 10-12-year-old children. We collected data among 10-12-year-old children (n = 730 in the test-retest reliability study; n = 96 in the construct validity study) in six European countries, i.e. Belgium, Greece, Hungary, the Netherlands, Norway, and Spain. Test-retest reliability was assessed using the intra-class correlation coefficient (ICC) and percentage agreement comparing scores from two measurements, administered one week apart. To assess construct validity, the agreement between questionnaire responses and a subsequent face-to-face interview was assessed using ICC and percentage agreement. Of the 150 questionnaire items, 115 (77%) showed good to excellent test-retest reliability as indicated by ICCs > .60 or percentage agreement ≥ 75%. Test-retest reliability was moderate for 34 items (23%) and poor for one item. Construct validity appeared to be good to excellent for 70 (47%) of the 150 items, as indicated by ICCs > .60 or percentage agreement ≥ 75%. Of the other 80 items, construct validity was moderate for 39 (26%) and poor for 41 items (27%). Our results demonstrate that the ENERGY-child questionnaire, assessing EBRBs of the child as well as personal, family, and school-environmental determinants related to these EBRBs, has good test-retest reliability and moderate to good construct validity for the large majority of items.
Test-retest reliability and construct validity of the ENERGY-child questionnaire on energy balance-related behaviours and their potential determinants: the ENERGY-project

PubMed Central

2011-01-01

Background Insight into children's energy balance-related behaviours (EBRBs) and their determinants is important to inform obesity prevention research. Therefore, reliable and valid tools to measure these variables in large-scale population research are needed. Objective To examine the test-retest reliability and construct validity of the child questionnaire used in the ENERGY-project, measuring EBRBs and their potential determinants among 10-12-year-old children. Methods We collected data among 10-12-year-old children (n = 730 in the test-retest reliability study; n = 96 in the construct validity study) in six European countries, i.e. Belgium, Greece, Hungary, the Netherlands, Norway, and Spain. Test-retest reliability was assessed using the intra-class correlation coefficient (ICC) and percentage agreement comparing scores from two measurements, administered one week apart. To assess construct validity, the agreement between questionnaire responses and a subsequent face-to-face interview was assessed using ICC and percentage agreement. Results Of the 150 questionnaire items, 115 (77%) showed good to excellent test-retest reliability as indicated by ICCs > .60 or percentage agreement ≥ 75%. Test-retest reliability was moderate for 34 items (23%) and poor for one item. Construct validity appeared to be good to excellent for 70 (47%) of the 150 items, as indicated by ICCs > .60 or percentage agreement ≥ 75%. Of the other 80 items, construct validity was moderate for 39 (26%) and poor for 41 items (27%). Conclusions Our results demonstrate that the ENERGY-child questionnaire, assessing EBRBs of the child as well as personal, family, and school-environmental determinants related to these EBRBs, has good test-retest reliability and moderate to good construct validity for the large majority of items. PMID:22152048
Issues in cross-cultural validity: example from the adaptation, reliability, and validity testing of a Turkish version of the Stanford Health Assessment Questionnaire.

PubMed

Küçükdeveci, Ayse A; Sahin, Hülya; Ataman, Sebnem; Griffiths, Bridget; Tennant, Alan

2004-02-15

Guidelines have been established for cross-cultural adaptation of outcome measures. However, invariance across cultures must also be demonstrated through analysis of Differential Item Functioning (DIF). This is tested in the context of a Turkish adaptation of the Health Assessment Questionnaire (HAQ). Internal construct validity of the adapted HAQ is assessed by Rasch analysis; reliability, by internal consistency and the intraclass correlation coefficient; external construct validity, by association with impairments and American College of Rheumatology functional stages. Cross-cultural validity is tested through DIF by comparison with data from the UK version of the HAQ. The adapted version of the HAQ demonstrated good internal construct validity through fit of the data to the Rasch model (mean item fit 0.205; SD 0.998). Reliability was excellent (alpha = 0.97) and external construct validity was confirmed by expected associations. DIF for culture was found in only 1 item. Cross-cultural validity was found to be sufficient for use in international studies between the UK and Turkey. Future adaptation of instruments should include analysis of DIF at the field testing stage in the adaptation process.
Method matters: Understanding diagnostic reliability in DSM-IV and DSM-5.

PubMed

Chmielewski, Michael; Clark, Lee Anna; Bagby, R Michael; Watson, David

2015-08-01

Diagnostic reliability is essential for the science and practice of psychology, in part because reliability is necessary for validity. Recently, the DSM-5 field trials documented lower diagnostic reliability than past field trials and the general research literature, resulting in substantial criticism of the DSM-5 diagnostic criteria. Rather than indicating specific problems with DSM-5, however, the field trials may have revealed long-standing diagnostic issues that have been hidden due to a reliance on audio/video recordings for estimating reliability. We estimated the reliability of DSM-IV diagnoses using both the standard audio-recording method and the test-retest method used in the DSM-5 field trials, in which different clinicians conduct separate interviews. Psychiatric patients (N = 339) were diagnosed using the SCID-I/P; 218 were diagnosed a second time by an independent interviewer. Diagnostic reliability using the audio-recording method (N = 49) was "good" to "excellent" (M κ = .80) and comparable to the DSM-IV field trials estimates. Reliability using the test-retest method (N = 218) was "poor" to "fair" (M κ = .47) and similar to DSM-5 field-trials' estimates. Despite low test-retest diagnostic reliability, self-reported symptoms were highly stable. Moreover, there was no association between change in self-report and change in diagnostic status. These results demonstrate the influence of method on estimates of diagnostic reliability. (c) 2015 APA, all rights reserved.
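The κ values quoted above are chance-corrected agreement coefficients. As a rough illustration only, the sketch below computes an unweighted Cohen's kappa for two raters assigning a present/absent diagnosis; the function name `cohen_kappa` and the ratings are hypothetical, and the study's exact kappa variant is not stated in the abstract.

```python
from collections import Counter


def cohen_kappa(rater_a, rater_b):
    """Unweighted Cohen's kappa for two equal-length lists of category labels."""
    n = len(rater_a)
    # Observed proportion of cases on which the two raters agree.
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Agreement expected by chance from each rater's marginal frequencies.
    count_a = Counter(rater_a)
    count_b = Counter(rater_b)
    expected = sum(count_a[c] * count_b[c] for c in count_a) / (n * n)
    return (observed - expected) / (1 - expected)


if __name__ == "__main__":
    # Hypothetical diagnoses from two independent interviewers.
    first = ["present", "present", "absent", "absent", "present", "absent"]
    second = ["present", "absent", "absent", "absent", "present", "present"]
    print(round(cohen_kappa(first, second), 3))
```

Kappa is 1 for perfect agreement and 0 when agreement is no better than the marginal frequencies would produce by chance, so two methods with similar raw agreement can still yield different kappa values.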
An Overview of Long Duration Sodium Heat Pipe Tests

NASA Astrophysics Data System (ADS)

Rosenfeld, John H.; Ernst, Donald M.; Lindemuth, James E.; Sanzi, James L.; Geng, Steven M.; Zuo, Jon

2004-02-01

High temperature heat pipes are being evaluated for use in energy conversion applications such as fuel cells, gas turbine re-combustors, and Stirling cycle heat sources; with the resurgence of space nuclear power, additional applications include reactor heat removal elements and radiator elements. Long operating life and reliable performance are critical requirements for these applications. Accordingly long-term materials compatibility is being evaluated through the use of high temperature life test heat pipes. Thermacore, Inc. has carried out several sodium heat pipe life tests to establish long term operating reliability. Four sodium heat pipes have recently demonstrated favorable materials compatibility and heat transport characteristics at high operating temperatures in air over long time periods. A 316L stainless steel heat pipe with a sintered porous nickel wick structure and an integral brazed cartridge heater has successfully operated at 650C to 700C for over 115,000 hours without signs of failure. A second 316L stainless steel heat pipe with a specially-designed Inconel 601 rupture disk and a sintered nickel powder wick has demonstrated over 83,000 hours at 600C to 650C with similar success. A representative one-tenth segment Stirling Space Power Converter heat pipe with an Inconel 718 envelope and a stainless steel screen wick has operated for over 41,000 hours at nearly 700C. A hybrid (i.e. gas-fired and solar) heat pipe with a Haynes 230 envelope and a sintered porous nickel wick structure was operated for about 20,000 hours at nearly 700C without signs of degradation. These life test results collectively have demonstrated the potential for high temperature heat pipes to serve as reliable energy conversion system components for power applications that require long operating lifetime with high reliability. Detailed design specifications, operating history, and test results are described for each of these sodium heat pipes. 
Lessons learned and future life test plans are also discussed.
An Overview of Long Duration Sodium Heat Pipe Tests

NASA Technical Reports Server (NTRS)

Rosenfeld, John H.; Ernst, Donald M.; Lindemuth, James E.; Sanzi, James L.; Geng, Steven M.; Zuo, Jon

2004-01-01

High temperature heat pipes are being evaluated for use in energy conversion applications such as fuel cells, gas turbine re-combustors, and Stirling cycle heat sources; with the resurgence of space nuclear power, additional applications include reactor heat removal elements and radiator elements. Long operating life and reliable performance are critical requirements for these applications. Accordingly long-term materials compatibility is being evaluated through the use of high temperature life test heat pipes. Thermacore International, Inc., has carried out several sodium heat pipe life tests to establish long term operating reliability. Four sodium heat pipes have recently demonstrated favorable materials compatibility and heat transport characteristics at high operating temperatures in air over long time periods. A 316L stainless steel heat pipe with a sintered porous nickel wick structure and an integral brazed cartridge heater has successfully operated at 650 to 700 C for over 115,000 hours without signs of failure. A second 316L stainless steel heat pipe with a specially-designed Inconel 601 rupture disk and a sintered nickel powder wick has demonstrated over 83,000 hours at 600 to 650 C with similar success. A representative one-tenth segment Stirling Space Power Converter heat pipe with an Inconel 718 envelope and a stainless steel screen wick has operated for over 41,000 hours at nearly 700 C. A hybrid (i.e. gas-fired and solar) heat pipe with a Haynes 230 envelope and a sintered porous nickel wick structure was operated for about 20,000 hours at nearly 700 C without signs of degradation. These life test results collectively have demonstrated the potential for high temperature heat pipes to serve as reliable energy conversion system components for power applications that require long operating lifetime with high reliability. Detailed design specifications, operating history, and test results are described for each of these sodium heat pipes. 
Lessons learned and future life test plans are also discussed.
Reliability, validity and responsiveness of the German self-reported foot and ankle score (SEFAS) in patients with foot or ankle surgery.

PubMed

Arbab, Dariusch; Kuhlmann, Katharina; Schnurr, Christoph; Bouillon, Bertil; Lüring, Christian; König, Dietmar

2017-10-10

Patient-reported outcome measures are a critical tool in evaluating the efficacy of orthopedic procedures and are increasingly used in clinical trials to assess outcomes of health care. The intention of this study was to develop and culturally adapt a German version of the Self-reported Foot and Ankle Score (SEFAS) and to evaluate reliability, validity and responsiveness. Forward and backward translation was performed according to the Cross Cultural Adaptation of Self-Reported Measure guidelines. The German SEFAS was investigated in 177 consecutive patients. All 177 patients completed the German SEFAS, Foot and Ankle Outcome Score (FAOS), Short-Form 36 and numeric scales for pain and disability (NRS) before foot or ankle surgery, and 118 patients completed them 6 months after surgery. Test-retest reliability, internal consistency, floor and ceiling effects, construct validity and minimal important change were analyzed. The German SEFAS demonstrated excellent test-retest reliability with ICC values of 0.97. A Cronbach's alpha (α) value of 0.89 demonstrated strong internal consistency. No floor or ceiling effects were observed for the German version of the SEFAS. As hypothesized, SEFAS correlated strongly with FAOS and SF-36 domains. It showed moderate (ES/SRM > 0.5) responsiveness between preoperative assessment and postoperative follow-up. The German version of the SEFAS demonstrated good psychometric properties. It proved to be a valid and reliable instrument for use in foot and ankle patients. DRKS00007585.
A psychometric study of the Test of Everyday Attention for Children in the Chinese setting.

PubMed

Chan, Raymond C K; Wang, Li; Ye, Jiawen; Leung, Winnie W Y; Mok, Monica Y K

2008-07-01

To explore the psychometric properties of the Test of Everyday Attention for Children (TEA-Ch) in the context of a Chinese setting. Confirmatory factor analysis was conducted to examine the construct validity of the Chinese version of the TEA-Ch among a group of 232 children without attention deficit hyperactivity disorder (ADHD). Test-retest reliability was tested on a random sub-sample of 20 children at a 4-week interval. Clinical discrimination was also examined by comparing children with and without ADHD (22 in each group) on the performances of the TEA-Ch. The current Chinese sample demonstrated a three-factor solution for attentional performance among children without ADHD, namely selective attention, executive control/switch, and sustained attention (χ²(24) = 34.56; RMSEA = .044; p = .075). Moreover, the whole test demonstrated acceptable test-retest reliability at a 4-week interval among a small sub-sample. Children with ADHD performed significantly more poorly than healthy controls in most of the subtests of the TEA-Ch. The results of the present study demonstrate that the test items remain useful in China, a culture very different from that in which the test originated. Finally, the TEA-Ch also presents several advantages when compared to other conventional objective measures of attention.

The Trunk Impairment Scale - modified to ordinal scales in the Norwegian version.

PubMed

Gjelsvik, Bente; Breivik, Kyrre; Verheyden, Geert; Smedal, Tori; Hofstad, Håkon; Strand, Liv Inger

2012-01-01

To translate the Trunk Impairment Scale (TIS), a measure of trunk control in patients after stroke, into Norwegian (TIS-NV), and to explore its construct validity, internal consistency, intertester and test-retest reliability. TIS was translated according to international guidelines. The validity study was performed on data from 201 patients with acute stroke. Fifty patients with stroke and acquired brain injury were recruited to examine intertester and test-retest reliability. Construct validity was analyzed with exploratory and confirmatory factor analysis and item response theory, internal consistency with Cronbach's alpha test, and intertester and test-retest reliability with kappa and intraclass correlation coefficient tests. The back-translated version of TIS-NV was validated by the original developer. The subscale Static sitting balance was removed. By combining items from the subscales Dynamic sitting balance and Coordination, six ordinal superitems (testlets) were constructed. The TIS-NV was renamed the modified TIS-NV (TIS-modNV). After modifications the TIS-modNV fitted well to a locally dependent unidimensional item response theory model. It demonstrated good construct validity, excellent internal consistency, and high intertester and test-retest reliability for the total score. This study supports that the TIS-modNV is a valid and reliable scale for use in clinical practice and research.
Evaluating abdominal core muscle fatigue: Assessment of the validity and reliability of the prone bridging test.

PubMed

De Blaiser, C; De Ridder, R; Willems, T; Danneels, L; Vanden Bossche, L; Palmans, T; Roosen, P

2018-02-01

The aims of this study were to research the amplitude and median frequency characteristics of selected abdominal, back, and hip muscles of healthy subjects during a prone bridging endurance test, based on surface electromyography (sEMG), (a) to determine if the prone bridging test is a valid field test to measure abdominal muscle fatigue, and (b) to evaluate if the current method of administering the prone bridging test is reliable. Thirty healthy subjects participated in this experiment. The sEMG activity of seven abdominal, back, and hip muscles was bilaterally measured. Normalized median frequencies were computed from the EMG power spectra. The prone bridging tests were repeated on separate days to evaluate inter- and intratester reliability. Significant differences in normalized median frequency slope (NMF slope) values between several abdominal, back, and hip muscles could be demonstrated. Moderate-to-high correlation coefficients were shown between NMF slope values and endurance time. Multiple backward linear regression revealed that the test endurance time could only be significantly predicted by the NMF slope of the rectus abdominis. Statistical analysis showed excellent reliability (ICC=0.87-0.89). The findings of this study support the validity and reliability of the prone bridging test for evaluating abdominal muscle fatigue. © 2017 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Stability of person ability measures in people with acquired brain injury in the use of everyday technology: the test-retest reliability of the Management of Everyday Technology Assessment (META).

PubMed

Malinowsky, Camilla; Kassberg, Ann-Charlotte; Larsson-Lund, Maria; Kottorp, Anders

2016-01-01

To evaluate the test-retest reliability of the Management of Everyday Technology Assessment (META) in a sample of people with acquired brain injury (ABI). The META was administered twice within a two-week period to 25 people with ABI. A Rasch measurement model was used to convert the META ordinal raw scores into equal-interval linear measures of each participant's ability to manage everyday technology (ET). Test-retest reliability of the stability of the person ability measures in the META was examined by a standardized difference Z-test and an intra-class correlations analysis (ICC 1). The results showed that the paired person ability measures generated from the META were stable over the test-retest period for 22 of the 25 subjects. The ICC 1 correlation was 0.63, which indicates good overall reliability. The META demonstrated acceptable test-retest reliability in a sample of people with ABI. The results illustrate the importance of using sufficiently challenging ETs (relative to a person's abilities) to generate stable META measurements over time. Implications for Rehabilitation The findings add evidence regarding the test-retest reliability of the person ability measures generated from the observation assessment META in a sample of people with ABI. The META might support professionals in the evaluation of interventions that are designed to improve clients' performance of activities including the ability to manage ET.
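The intra-class correlation reported above can be illustrated with a short sketch. It assumes that "ICC 1" refers to the one-way random-effects, single-measurement form often written ICC(1,1), which is an interpretation and not something the abstract states. The function name and the example ratings are hypothetical.

```python
def icc_1_1(ratings):
    """One-way random-effects, single-measurement ICC.

    ratings: list of rows, one row per subject, each row holding the
    k repeated measurements (e.g. test and retest) for that subject.
    ICC(1,1) = (MSB - MSW) / (MSB + (k - 1) * MSW)
    """
    n = len(ratings)        # number of subjects
    k = len(ratings[0])     # measurements per subject
    grand_mean = sum(sum(row) for row in ratings) / (n * k)
    subject_means = [sum(row) / k for row in ratings]

    # Between-subjects mean square.
    msb = k * sum((m - grand_mean) ** 2 for m in subject_means) / (n - 1)
    # Within-subjects mean square.
    msw = sum(
        (x - m) ** 2 for row, m in zip(ratings, subject_means) for x in row
    ) / (n * (k - 1))

    return (msb - msw) / (msb + (k - 1) * msw)


# Hypothetical test-retest measures for three subjects.
print(round(icc_1_1([[1, 2], [3, 4], [5, 6]]), 3))
```

Other ICC forms, such as the two-way ICC(2,1) used in some records in this listing, partition the variance differently and generally give different values for the same data.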
Reliability of the test of gross motor development second edition (TGMD-2) for Kindergarten children in Myanmar

PubMed Central

Aye, Thanda; Oo, Khin Saw; Khin, Myo Thuzar; Kuramoto-Ahuja, Tsugumi; Maruyama, Hitoshi

2017-01-01

[Purpose] The purpose of this study was to investigate the reliability of the test of gross motor development second edition (TGMD-2) for Kindergarten children in Myanmar. [Subjects and Methods] Fifty healthy Kindergarten children (23 males, 27 females) whose parents/guardians had given written consent participated. All 12 gross motor skills of the TGMD-2 were explained and demonstrated to the subjects before the assessment. Each subject individually performed two trials for each gross motor skill and the performance was video recorded. Three raters separately watched the video recordings and rated them for inter-rater reliability. The second assessment was done one month later with 25 of the 50 subjects for test-retest reliability. The video recordings of 12 subjects were randomly selected from the first 50 recordings for intra-rater reliability six weeks after the first assessment. The agreement on the locomotor and object control raw scores and the gross motor quotient (GMQ) was calculated. [Results] All reliability coefficients for the locomotor and object control raw scores and the GMQ indicated good to excellent reliability. [Conclusion] The results indicated that the TGMD-2 is a highly reliable and appropriate assessment tool for assessing gross motor skill development of Kindergarten children in Myanmar. PMID:29184278
Reliability of the test of gross motor development second edition (TGMD-2) for Kindergarten children in Myanmar.

PubMed

Aye, Thanda; Oo, Khin Saw; Khin, Myo Thuzar; Kuramoto-Ahuja, Tsugumi; Maruyama, Hitoshi

2017-10-01

[Purpose] The purpose of this study was to investigate the reliability of the test of gross motor development second edition (TGMD-2) for Kindergarten children in Myanmar. [Subjects and Methods] Fifty healthy Kindergarten children (23 males, 27 females) whose parents/guardians had given written consent participated. All 12 gross motor skills of the TGMD-2 were explained and demonstrated to the subjects before the assessment. Each subject individually performed two trials for each gross motor skill and the performance was video recorded. Three raters separately watched the video recordings and rated them for inter-rater reliability. The second assessment was done one month later with 25 of the 50 subjects for test-retest reliability. The video recordings of 12 subjects were randomly selected from the first 50 recordings for intra-rater reliability six weeks after the first assessment. The agreement on the locomotor and object control raw scores and the gross motor quotient (GMQ) was calculated. [Results] All reliability coefficients for the locomotor and object control raw scores and the GMQ indicated good to excellent reliability. [Conclusion] The results indicated that the TGMD-2 is a highly reliable and appropriate assessment tool for assessing gross motor skill development of Kindergarten children in Myanmar.
Development and reliability of a Turkish version of the Short Form-Joint Protection Behavior Assessment (JPBA-S).

PubMed

Tonga, Eda; Atasavun Uysal, Songul; Karayazgan, Sedef; Hayran, Mutlu; Düger, Tülin

2016-01-01

Clinical measurement. To adapt the original JPBA-S to a Turkish version (TUR-JPBA-S) and to investigate its reliability in assessing patients with rheumatoid arthritis (RA). Twenty-two participants with RA and 21 healthy people were videotaped while performing tasks listed in the TUR-JPBA-S. Two raters scored the video recordings to evaluate inter-rater reliability. One rater re-analyzed the recordings at a different time point for intra-rater reliability. Participants with RA were asked to perform the same tasks after three to four weeks, which was also recorded to evaluate test-retest reliability. Internal consistency (Cronbach's α value) was found to be high (0.89) for participants with RA. Our results demonstrate excellent intra-rater (ICC: 0.99, SEM 1.2) and inter-rater (ICC: 0.99, SEM 1.7) reliability, as well as excellent test-retest reliability (ICC: 0.96). The TUR-JPBA-S is a valid and reliable instrument for assessing JP behavior in patients with RA in Turkey. Level 2. Copyright © 2016 Hanley & Belfus. Published by Elsevier Inc. All rights reserved.
Reliability and validity of the Wolfram Unified Rating Scale (WURS)

PubMed Central

2012-01-01

Background Wolfram syndrome (WFS) is a rare, neurodegenerative disease that typically presents with childhood onset insulin dependent diabetes mellitus, followed by optic atrophy, diabetes insipidus, deafness, and neurological and psychiatric dysfunction. There is no cure for the disease, but recent advances in research have improved understanding of the disease course. Measuring disease severity and progression with reliable and validated tools is a prerequisite for clinical trials of any new intervention for neurodegenerative conditions. To this end, we developed the Wolfram Unified Rating Scale (WURS) to measure the severity and individual variability of WFS symptoms. The aim of this study is to develop and test the reliability and validity of the Wolfram Unified Rating Scale (WURS). Methods A rating scale of disease severity in WFS was developed by modifying a standardized assessment for another neurodegenerative condition (Batten disease). WFS experts scored the representativeness of WURS items for the disease. The WURS was administered to 13 individuals with WFS (6-25 years of age). Motor, balance, mood and quality of life were also evaluated with standard instruments. Inter-rater reliability, internal consistency reliability, concurrent, predictive and content validity of the WURS were calculated. Results The WURS had high inter-rater reliability (ICCs>.93), moderate to high internal consistency reliability (Cronbach’s α = 0.78-0.91) and demonstrated good concurrent and predictive validity. There were significant correlations between the WURS Physical Assessment and motor and balance tests (rs>.67, p<.03), between the WURS Behavioral Scale and reports of mood and behavior (rs>.76, p<.04) and between WURS Total scores and quality of life (rs=-.86, p=.001). The WURS demonstrated acceptable content validity (Scale-Content Validity Index=0.83). 
Conclusions These preliminary findings demonstrate that the WURS has acceptable reliability and validity and captures individual differences in disease severity in children and young adults with WFS. PMID:23148655
The Vanderbilt Holistic Face Processing Test: A short and reliable measure of holistic face processing

PubMed Central

Richler, Jennifer J.; Floyd, R. Jackie; Gauthier, Isabel

2014-01-01

Efforts to understand individual differences in high-level vision necessitate the development of measures that have sufficient reliability, which is generally not a concern in group studies. Holistic processing is central to research on face recognition and, more recently, to the study of individual differences in this area. However, recent work has shown that the most popular measure of holistic processing, the composite task, has low reliability. This is particularly problematic for the recent surge in interest in studying individual differences in face recognition. Here, we developed and validated a new measure of holistic face processing specifically for use in individual-differences studies. It avoids some of the pitfalls of the standard composite design and capitalizes on the idea that trial variability allows for better traction on reliability. Across four experiments, we refine this test and demonstrate its reliability. PMID:25228629
SMART empirical approaches for predicting field performance of PV modules from results of reliability tests

NASA Astrophysics Data System (ADS)

Hardikar, Kedar Y.; Liu, Bill J. J.; Bheemreddy, Venkata

2016-09-01

Gaining an understanding of degradation mechanisms and their characterization are critical in developing relevant accelerated tests to ensure PV module performance warranty over a typical lifetime of 25 years. As newer technologies are adapted for PV, including new PV cell technologies, new packaging materials, and newer product designs, the availability of field data over extended periods of time for product performance assessment cannot be expected within the typical timeframe for business decisions. In this work, to enable product design decisions and product performance assessment for PV modules utilizing newer technologies, Simulation and Mechanism based Accelerated Reliability Testing (SMART) methodology and empirical approaches to predict field performance from accelerated test results are presented. The method is demonstrated for field life assessment of flexible PV modules based on degradation mechanisms observed in two accelerated tests, namely, Damp Heat and Thermal Cycling. The method is based on design of accelerated testing scheme with the intent to develop relevant acceleration factor models. The acceleration factor model is validated by extensive reliability testing under different conditions going beyond the established certification standards. Once the acceleration factor model is validated for the test matrix a modeling scheme is developed to predict field performance from results of accelerated testing for particular failure modes of interest. Further refinement of the model can continue as more field data becomes available. While the demonstration of the method in this work is for thin film flexible PV modules, the framework and methodology can be adapted to other PV products.
Developing Reliable Life Support for Mars

NASA Technical Reports Server (NTRS)

Jones, Harry W.

2017-01-01

A human mission to Mars will require highly reliable life support systems. Mars life support systems may recycle water and oxygen using systems similar to those on the International Space Station (ISS). However, achieving sufficient reliability is less difficult for ISS than it will be for Mars. If an ISS system has a serious failure, it is possible to provide spare parts, or directly supply water or oxygen, or if necessary bring the crew back to Earth. Life support for Mars must be designed, tested, and improved as needed to achieve high demonstrated reliability. A quantitative reliability goal should be established and used to guide development. The designers should select reliable components and minimize interface and integration problems. In theory a system can achieve the component-limited reliability, but testing often reveals unexpected failures due to design mistakes or flawed components. Testing should extend long enough to detect any unexpected failure modes and to verify the expected reliability. Iterated redesign and retest may be required to achieve the reliability goal. If the reliability is less than required, it may be improved by providing spare components or redundant systems. The number of spares required to achieve a given reliability goal depends on the component failure rate. If the failure rate is underestimated, the number of spares will be insufficient and the system may fail. If the design is likely to have undiscovered design or component problems, it is advisable to use dissimilar redundancy, even though this multiplies the design and development cost. In the ideal case, a human tended closed system operational test should be conducted to gain confidence in operations, maintenance, and repair. The difficulty in achieving high reliability in unproven complex systems may require the use of simpler, more mature, intrinsically higher reliability systems. 
The limitations of budget, schedule, and technology may suggest accepting lower and less certain expected reliability. A plan to develop reliable life support is needed to achieve the best possible reliability.
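The dependence of spares count on failure rate described above can be illustrated with a small sketch. It is not the report's method: it assumes a single component type with a constant failure rate, immediate replacement on failure, and failures over the mission following a Poisson distribution. The function names and the numeric inputs are hypothetical.

```python
import math


def poisson_cdf(k, mean):
    """P(N <= k) for N ~ Poisson(mean)."""
    return sum(math.exp(-mean) * mean**i / math.factorial(i) for i in range(k + 1))


def spares_needed(failure_rate_per_hour, mission_hours, reliability_goal):
    """Smallest number of spares s such that P(failures <= s) >= goal.

    Assumes a constant failure rate, so the expected number of failures
    over the mission is failure_rate_per_hour * mission_hours.
    """
    expected_failures = failure_rate_per_hour * mission_hours
    spares = 0
    while poisson_cdf(spares, expected_failures) < reliability_goal:
        spares += 1
    return spares


# Hypothetical numbers: a 10,000-hour mission and a 0.99 reliability goal.
planned = spares_needed(1e-4, 10_000, 0.99)   # estimated failure rate
actual = spares_needed(2e-4, 10_000, 0.99)    # true rate is twice the estimate
print(planned, actual)
```

Under these assumptions, doubling the failure rate raises the required spares from 4 to 6, so a stock sized from the underestimated rate would fall short of the goal.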
Interrater and Test-Retest Reliability and Minimal Detectable Change of the Balance Evaluation Systems Test (BESTest) and Subsystems With Community-Dwelling Older Adults.

PubMed

Wang-Hsu, Elizabeth; Smith, Susan S

2017-01-10

Falls are a common cause of injuries and hospital admissions in older adults. Balance limitation is a potentially modifiable factor contributing to falls. The Balance Evaluation Systems Test (BESTest), a clinical balance measure, categorizes balance into 6 underlying subsystems. Each of the subsystems is scored individually and summed to obtain a total score. The reliability of the BESTest and its individual subsystems has been reported in patients with various neurological disorders and cancer survivors. However, the reliability and minimal detectable change (MDC) of the BESTest with community-dwelling older adults have not been reported. The purposes of our study were to (1) determine the interrater and test-retest reliability of the BESTest total and subsystem scores; and (2) estimate the MDC of the BESTest and its individual subsystem scores with community-dwelling older adults. We used a prospective cohort methodological design. Community-dwelling older adults (N = 70; aged 70-94 years; mean = 85.0 [5.5] years) were recruited from a senior independent living community. Trained testers (N = 3) administered the BESTest. All participants were tested with the BESTest by the same tester initially and then retested 7 to 14 days later. With 32 of the participants, a second tester concurrently scored the retest for interrater reliability. Testers were blinded to each other's scores. Intraclass correlation coefficients [ICC(2,1)] were used to determine the interrater and test-retest reliability. Test-retest reliability was also analyzed using method error and the associated coefficients of variation (CVME). MDC was calculated using standard error of measurement. Interrater reliability (N = 32) of the BESTest total score was ICC(2, 1) = 0.97 (95% confidence interval [CI], 0.94-0.99). The ICCs for the individual subsystem scores ranged from 0.85 to 0.94. Test-retest reliability (N = 70) of the BESTest total score was ICC(2,1) = 0.93 (95% CI, 0.89-0.96). 
ICCs for the individual subsystem scores ranged from 0.72 to 0.89. The CVME (N = 70) of the BESTest total score was 4.1%. The CVME for the subsystem scores ranged from 5.0% to 10.7%. MDC (N = 70) for the BESTest total score at the 95% CI was 7.6%, or 8.2 points. MDC at the 95% CI for subsystem scores ranged from 11.7% to 19.0% (2.1-3.4 points). Results demonstrated generally good to excellent interrater and test-retest reliability in both the BESTest total and subsystem scores with community-dwelling older adults. The BESTest total and individual subsystem scores demonstrate good to excellent interrater and test-retest reliability with community-dwelling older adults. A change of 7.6% (8.2 points) or more in the BESTest total score, and a percentage change ranging from 11.7% to 19.0% (2.1-3.4 points) in the subsystem scores, are suggested for clinicians to be 95% confident of true change when evaluating change in this population.
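The abstract states that MDC was calculated from the standard error of measurement. A minimal sketch of the commonly used formulas follows, assuming SEM = SD * sqrt(1 - ICC) and MDC95 = 1.96 * sqrt(2) * SEM; the abstract does not give the exact formulas or the score standard deviation, so the SD used below is a hypothetical value and the output is not meant to reproduce the reported 8.2 points.

```python
import math


def standard_error_of_measurement(sd, icc):
    """SEM from the between-subject SD and a reliability coefficient."""
    return sd * math.sqrt(1.0 - icc)


def mdc95(sd, icc):
    """Minimal detectable change at the 95% confidence level.

    The sqrt(2) factor accounts for measurement error in both the
    test and the retest score.
    """
    return 1.96 * math.sqrt(2.0) * standard_error_of_measurement(sd, icc)


# Hypothetical SD of 10 points with the reported test-retest ICC of 0.93.
print(round(mdc95(10.0, 0.93), 2))
```

Because the MDC scales with sqrt(1 - ICC), a measure with lower reliability needs a proportionally larger observed change before it can be distinguished from measurement error.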
Test-retest reliability and gender differences in the sexual discounting task among cocaine-dependent individuals.

PubMed

Johnson, Matthew W; Bruner, Natalie R

2013-08-01

The Sexual Discounting Task uses the delay discounting framework to examine sexual HIV risk behavior. Previous research showed task performance to be significantly correlated with self-reported HIV risk behavior in cocaine dependence. Test-retest reliability and gender differences had remained unexamined. The present study examined the test-retest reliability of the Sexual Discounting Task. Cocaine-dependent individuals (18 men, 13 women) completed the task in two laboratory visits ∼7 days apart. Participants selected photographs of individuals with whom they were willing to have casual sex. Among these, participants identified the individual most (and least) likely to have a sexually transmitted infection (STI), and the individual with whom he or she most (and least) wanted to have sex. In reference to these individuals, participants rated their likelihood of having unprotected sex versus waiting to have sex with a condom, at various delays. A money delay discounting task was also completed at the first visit. Significant differences in discounting among partner conditions were shown. Differential stability was demonstrated by significant, positive correlations between test and retest for all four partner conditions. Absolute stability was demonstrated by statistical equivalence tests between test and retest, and also supported by a lack of significant differences between test and retest. Men generally discounted significantly more than women for sexual outcomes but not money. Results suggest the Sexual Discounting Task to be a reliable measure in cocaine-dependent individuals, which supports its use as a repeated measure in clinical research, for example, studies examining acute drug effects on sexual risk and the effects of addiction treatment and HIV prevention interventions on sexual risk. PsycINFO Database Record (c) 2013 APA, all rights reserved
The Validity and Reliability Test of the Indonesian Version of Gastroesophageal Reflux Disease Quality of Life (GERD-QOL) Questionnaire.

PubMed

Siahaan, Laura A; Syam, Ari F; Simadibrata, Marcellus; Setiati, Siti

2017-01-01

To obtain a valid and reliable GERD-QOL questionnaire for Indonesian application. At the initial stage, the GERD-QOL questionnaire was first translated into the Indonesian language and the translated questionnaire was subsequently translated back into the original language (back-to-back translation). The results were evaluated by the research team and an Indonesian version of the GERD-QOL questionnaire was developed. Ninety-one patients who had been clinically diagnosed with GERD based on the Montreal criteria were interviewed using the Indonesian version of the GERD-QOL questionnaire and the SF-36 questionnaire. Validity was evaluated using construct validity and external validity, and reliability was tested by internal consistency and test-retest methods. The Indonesian version of the GERD-QOL questionnaire had good internal consistency reliability, with a Cronbach alpha of 0.687-0.842, and good test-retest reliability, with an intra-class correlation coefficient of 0.756-0.936 (p<0.05). The questionnaire was also demonstrated to have good validity, with a high correlation to each question of the SF-36 (p<0.05). The Indonesian version of the GERD-QOL questionnaire has been proven valid and reliable for evaluating the quality of life of GERD patients.
Clinical use of the Mayo-Portland Adaptability Inventory in rehabilitation after paediatric acquired brain injury.

PubMed

Oddson, Bruce; Rumney, Peter; Johnson, Patricia; Thomas-Stonell, Nancy

2006-11-01

The Mayo-Portland Adaptability Inventory (MPAI; designed to be administered by clinicians) is a popular measure of disability following head injury in adults. Its acceptability, validity, and reliability were assessed for use with children. There were 335 children and adolescents (215 males, 120 females) aged between 1 and 19 years at injury (median age 9y 8mo [SD 5y]) in our sample. The test was acceptable to respondents, rapidly and easily administered, and required only small modifications. It demonstrated validity against client and parent reports of major symptoms. It demonstrated test-retest reliability within the limitations of our data and excellent interrater accord. Consequently, the MPAI is recommended for paediatric use for evaluating rehabilitation needs and therapy outcome.
The validity and reliability of Systemic Lupus Erythematosus Quality of Life Questionnaire (L-QoL) in a Turkish population.

PubMed

Duruöz, M T; Unal, C; Toprak, C Sanal; Sezer, I; Yilmaz, F; Ulutatar, F; Atagündüz, P; Baklacioglu, H S

2017-12-01

Background Systemic lupus erythematosus (SLE) may have a profound impact on quality of life. There is increasing interest in measuring quality of life in lupus patients. The purpose of this study was to investigate the validity and reliability of the SLE Quality of Life Questionnaire (L-QoL) in Turkish SLE patients. Methods Patients with SLE according to the 2012 Systemic Lupus International Collaborating Clinics Classification Criteria were recruited into the study. Demographic data, clinical parameters and disease activity measured with the Systemic Lupus Erythematosus Disease Activity Index-2000 (SLEDAI-2K) were noted. The Nottingham Health Profile and Health Assessment Questionnaire were filled out in addition to the Turkish L-QoL (LQoL-TR). Internal consistency, test-retest reliability, and convergent and discriminant validity were evaluated. Results The mean age of participants was 43.55 ± 14.33 years and the mean disease duration was 89.8 ± 92.1 months. The patients filled out the LQoL-TR in 2.5 min. Strong correlations of the LQoL-TR with all subgroups of the Nottingham Health Profile and the Health Assessment Questionnaire were established, showing convergent validity. The highest correlations were with the emotional reactions (rho = 0.72) and sleep (rho = 0.65) components of the Nottingham Health Profile scale (p < 0.0001). Its poor and non-significant correlation with nonfunctional parameters (age, disease duration, perceived general health, SLEDAI-2K) showed its discriminative properties. The LQoL-TR demonstrated good internal reliability, with a Cronbach's α of 0.93, and good test-retest reliability, with an intraclass correlation coefficient of 0.87. Conclusion The LQoL-TR is a practical and useful tool which demonstrates good validity and reliability.
Validity and Reliability of Thai Version of the Foot and Ankle Ability Measure (FAAM) Subjective Form.

PubMed

Arunakul, Marut; Arunakul, Preeyaphan; Suesiritumrong, Chakhrist; Angthong, Chayanin; Chernchujit, Bancha

2015-06-01

Self-administered questionnaires have become an important part of clinical outcome assessment of foot and ankle-related problems. The Foot and Ankle Ability Measure (FAAM) subjective form is a widely used region-specific questionnaire that has shown sufficient validity and reliability in previous studies. The objective was to translate the original English version of the FAAM into a Thai version and to evaluate the validity and reliability of the Thai FAAM in patients with foot and ankle-related problems. The FAAM subjective form was translated into Thai using a forward-backward translation protocol. Afterward, reliability and validity were tested. Responses from 60 consecutive patients to two questionnaires, the Thai FAAM subjective form and the short form (SF)-36, were used. Validity was tested by correlating the scores from both questionnaires. Reliability was assessed by measuring test-retest reliability and internal consistency. Thai FAAM scores, including the activity of daily life (ADL) and Sport subscales, demonstrated sufficient correlations with the physical functioning (PF) and physical composite score (PCS) domains of the SF-36 (statistically significant at the p < 0.001 level, with values ≥ 0.5). The test-retest study revealed high intra-class correlation coefficients of 0.8 and 0.77, respectively. The internal consistency was strong (Cronbach alpha = 0.94 and 0.88, respectively). The Thai version of the FAAM subjective form retained the characteristics of the original version and has proved to be a reliable evaluation instrument for patients with foot and ankle-related problems.
Sensitivity, reliability and the effects of diurnal variation on a test battery of field usable upper limb fatigue measures.

PubMed

Yung, Marcus; Wells, Richard P

2017-07-01

Fatigue has been linked to deficits in production quality and productivity and, if of long duration, work-related musculoskeletal disorders. It may thus be a useful risk indicator and design and evaluation tool. However, there is limited information on the test-retest reliability, the sensitivity and the effects of diurnal fluctuation on field usable fatigue measures. This study reports on an evaluation of 11 measurement tools and their 14 parameters. Eight measures were found to have test-retest ICC values greater than 0.8. Four measures were particularly responsive during an intermittent fatiguing condition. However, two responsive measures demonstrated rhythmic behaviour, with significant time effects from 08:00 to mid-afternoon and early evening. Action tremor, muscle mechanomyography and perceived fatigue were found to be most reliable and most responsive; but additional analytical considerations might be required when interpreting daylong responses of MMG and action tremor. Practitioner Summary: This paper presents findings from test-retest and daylong reliability and responsiveness evaluations of 11 fatigue measures. This paper suggests that action tremor, muscle mechanomyography and perceived fatigue were most reliable and most responsive. However, mechanomyography and action tremor may be susceptible to diurnal changes.
21 CFR 315.5 - Evaluation of effectiveness.

Code of Federal Regulations, 2010 CFR

2010-04-01

..., physiological, or biochemical assessment is established by demonstrating in a defined clinical setting reliable measurement of function(s) or physiological, biochemical, or molecular process(es). (3) The claim of disease... demonstrating in a defined clinical setting that the test is useful in diagnostic or therapeutic patient...
21 CFR 601.34 - Evaluation of effectiveness.

Code of Federal Regulations, 2010 CFR

2010-04-01

..., physiological, or biochemical assessment is established by demonstrating in a defined clinical setting reliable measurement of function(s) or physiological, biochemical, or molecular process(es). (3) The claim of disease... demonstrating in a defined clinical setting that the test is useful in diagnostic or therapeutic patient...
21 CFR 315.5 - Evaluation of effectiveness.

Code of Federal Regulations, 2014 CFR

2014-04-01

..., physiological, or biochemical assessment is established by demonstrating in a defined clinical setting reliable measurement of function(s) or physiological, biochemical, or molecular process(es). (3) The claim of disease... demonstrating in a defined clinical setting that the test is useful in diagnostic or therapeutic patient...

21 CFR 315.5 - Evaluation of effectiveness.

Code of Federal Regulations, 2012 CFR

2012-04-01

..., physiological, or biochemical assessment is established by demonstrating in a defined clinical setting reliable measurement of function(s) or physiological, biochemical, or molecular process(es). (3) The claim of disease... demonstrating in a defined clinical setting that the test is useful in diagnostic or therapeutic patient...
21 CFR 601.34 - Evaluation of effectiveness.

Code of Federal Regulations, 2014 CFR

2014-04-01

..., physiological, or biochemical assessment is established by demonstrating in a defined clinical setting reliable measurement of function(s) or physiological, biochemical, or molecular process(es). (3) The claim of disease... demonstrating in a defined clinical setting that the test is useful in diagnostic or therapeutic patient...
21 CFR 601.34 - Evaluation of effectiveness.

Code of Federal Regulations, 2012 CFR

2012-04-01

..., physiological, or biochemical assessment is established by demonstrating in a defined clinical setting reliable measurement of function(s) or physiological, biochemical, or molecular process(es). (3) The claim of disease... demonstrating in a defined clinical setting that the test is useful in diagnostic or therapeutic patient...
21 CFR 315.5 - Evaluation of effectiveness.

Code of Federal Regulations, 2011 CFR

2011-04-01

..., physiological, or biochemical assessment is established by demonstrating in a defined clinical setting reliable measurement of function(s) or physiological, biochemical, or molecular process(es). (3) The claim of disease... demonstrating in a defined clinical setting that the test is useful in diagnostic or therapeutic patient...
21 CFR 601.34 - Evaluation of effectiveness.

Code of Federal Regulations, 2011 CFR

2011-04-01

..., physiological, or biochemical assessment is established by demonstrating in a defined clinical setting reliable measurement of function(s) or physiological, biochemical, or molecular process(es). (3) The claim of disease... demonstrating in a defined clinical setting that the test is useful in diagnostic or therapeutic patient...
21 CFR 601.34 - Evaluation of effectiveness.

Code of Federal Regulations, 2013 CFR

2013-04-01

..., physiological, or biochemical assessment is established by demonstrating in a defined clinical setting reliable measurement of function(s) or physiological, biochemical, or molecular process(es). (3) The claim of disease... demonstrating in a defined clinical setting that the test is useful in diagnostic or therapeutic patient...
21 CFR 315.5 - Evaluation of effectiveness.

Code of Federal Regulations, 2013 CFR

2013-04-01

..., physiological, or biochemical assessment is established by demonstrating in a defined clinical setting reliable measurement of function(s) or physiological, biochemical, or molecular process(es). (3) The claim of disease... demonstrating in a defined clinical setting that the test is useful in diagnostic or therapeutic patient...
Reliability and validity of the Brief Pain Inventory in individuals with chronic obstructive pulmonary disease.

PubMed

Chen, Y-W; HajGhanbari, B; Road, J D; Coxson, H O; Camp, P G; Reid, W D

2018-06-08

Pain is prevalent in chronic obstructive pulmonary disease (COPD) and the Brief Pain Inventory (BPI) appears to be a feasible questionnaire to assess this symptom. However, the reliability and validity of the BPI have not been determined in individuals with COPD. This study aimed to determine the internal consistency, test-retest reliability and validity (construct, convergent, divergent and discriminant) of the BPI in individuals with COPD. In order to examine the test-retest reliability, individuals with COPD were recruited from pulmonary rehabilitation programmes to complete the BPI twice 1 week apart. In order to investigate validity, de-identified data was retrieved from two previous studies, including forced expiratory volume in 1-s, age, sex and data from four questionnaires: the BPI, short-form McGill Pain Questionnaire (SF-MPQ), 36-Item Short Form Survey (SF-36) and Community Health Activities Model Program for Seniors (CHAMPS) questionnaire. In total, 123 participants were included in the analyses (eligible data were retrieved from 86 participants and additional 37 participants were recruited). The BPI demonstrated excellent internal consistency and test-retest reliability. It also showed convergent validity with the SF-MPQ and divergent validity with the SF-36. The factor analysis yielded two factors of the BPI, which demonstrated that the two domains of the BPI measure the intended constructs. The BPI can also discriminate pain levels among COPD patients with varied levels of quality of life (SF-36) and physical activity (CHAMPS). The BPI is a reliable and valid pain questionnaire that can be used to evaluate pain in COPD. This study formally established the reliability and validity of the BPI in individuals with COPD, which have not been determined in this patient group. The results of this study provide strong evidence that assessment results from this pain questionnaire are reliable and valid. © 2018 European Pain Federation - EFIC®.
Measuring verbal and non-verbal communication in aphasia: reliability, validity, and sensitivity to change of the Scenario Test.

PubMed

van der Meulen, Ineke; van de Sandt-Koenderman, W Mieke E; Duivenvoorden, Hugo J; Ribbers, Gerard M

2010-01-01

This study explores the psychometric qualities of the Scenario Test, a new test to assess daily-life communication in severe aphasia. The test is innovative in that it: (1) examines the effectiveness of verbal and non-verbal communication; and (2) assesses patients' communication in an interactive setting, with a supportive communication partner. To determine the reliability, validity, and sensitivity to change of the Scenario Test and discuss its clinical value. The Scenario Test was administered to 122 persons with aphasia after stroke and to 25 non-aphasic controls. Analyses were performed for the entire group of persons with aphasia, as well as for a subgroup of persons unable to communicate verbally (n = 43). Reliability (internal consistency, test-retest reliability, inter-judge, and intra-judge reliability) and validity (internal validity, convergent validity, known-groups validity) and sensitivity to change were examined using standard psychometric methods. The Scenario Test showed high levels of reliability. Internal consistency (Cronbach's alpha = 0.96; item-rest correlations = 0.58-0.82) and test-retest reliability (ICC = 0.98) were high. Agreement between judges in total scores was good, as indicated by the high inter- and intra-judge reliability (ICC = 0.86-1.00). Agreement in scores on the individual items was also good (square-weighted kappa values 0.61-0.92). The test demonstrated good levels of validity. A principal component analysis for categorical data identified two dimensions, interpreted as general communication and communicative creativity. Correlations with three other instruments measuring communication in aphasia, that is, Spontaneous Speech interview from the Aachen Aphasia Test (AAT), Amsterdam-Nijmegen Everyday Language Test (ANELT), and Communicative Effectiveness Index (CETI), were moderate to strong (0.50-0.85) suggesting good convergent validity. 
Group differences were observed between persons with aphasia and non-aphasic controls, as well as between persons with aphasia unable to use speech to convey information and those able to communicate verbally; this indicates good known-groups validity. The test was sensitive to changes in performance, measured over a period of 6 months. The data support the reliability and validity of the Scenario Test as an instrument for examining daily-life communication in aphasia. The test focuses on multimodal communication; its psychometric qualities enable future studies on the effect of Alternative and Augmentative Communication (AAC) training in aphasia.
Translation, Cultural Adaptation and Validation of the Simple Shoulder Test to Spanish

PubMed Central

Arcuri, Francisco; Barclay, Fernando; Nacul, Ivan

2015-01-01

Background: The validation of widely used scales facilitates the comparison across international patient samples. Objective: The objective was to translate, culturally adapt and validate the Simple Shoulder Test into Argentinian Spanish. Methods: The Simple Shoulder Test was translated from English into Argentinian Spanish by two independent translators, translated back into English and evaluated for accuracy by an expert committee to correct possible discrepancies. It was then administered to 50 patients with different shoulder conditions. Psychometric properties were analyzed, including internal consistency, measured with Cronbach's alpha, and test-retest reliability at 15 days, measured with the intraclass correlation coefficient. Results: The internal consistency was an alpha of 0.808, evaluated as good. The test-retest reliability index as measured by the intraclass correlation coefficient (ICC) was 0.835, evaluated as excellent. Conclusion: The Simple Shoulder Test translation and its cultural adaptation to Argentinian Spanish demonstrated adequate internal reliability and validity, ultimately allowing for its use in comparison with international patient samples.
Validity and reliability of a scale to measure genital body image.

PubMed

Zielinski, Ruth E; Kane-Low, Lisa; Miller, Janis M; Sampselle, Carolyn

2012-01-01

Women's body image dissatisfaction extends to body parts usually hidden from view--their genitals. Ability to measure genital body image is limited by lack of valid and reliable questionnaires. We subjected a previously developed questionnaire, the Genital Self Image Scale (GSIS) to psychometric testing using a variety of methods. Five experts determined the content validity of the scale. Then using four participant groups, factor analysis was performed to determine construct validity and to identify factors. Further construct validity was established using the contrasting groups approach. Internal consistency and test-retest reliability were determined. Twenty-one of 29 items were considered content valid. Two items were added based on expert suggestions. Factor analysis was undertaken resulting in four factors, identified as Genital Confidence, Appeal, Function, and Comfort. The revised scale (GSIS-20) included 20 items explaining 59.4% of the variance. Women indicating an interest in genital cosmetic surgery exhibited significantly lower scores on the GSIS-20 than those who did not. The final 20-item scale exhibited internal reliability across all sample groups as well as test-retest reliability. The GSIS-20 provides a measure of genital body image demonstrating reliability and validity across several populations of women.
Validity and reliability of the Turkish Migraine Disability Assessment (MIDAS) questionnaire.

PubMed

Ertaş, Mustafa; Siva, Aksel; Dalkara, Turgay; Uzuner, Nevzat; Dora, Babür; Inan, Levent; Idiman, Fethi; Sarica, Yakup; Selçuki, Deniz; Sirin, Hadiye; Oğuzhanoğlu, Atilla; Irkeç, Ceyla; Ozmenoğlu, Mehmet; Ozbenli, Taner; Oztürk, Musa; Saip, Sabahattin; Neyal, Münife; Zarifoğlu, Mehmet

2004-09-01

The aim of this study is to assess the comprehensibility, internal consistency, patient-physician reliability, test-retest reliability, and validity of Turkish version of Migraine Disability Assessment (MIDAS) questionnaire in patients with headache. MIDAS questionnaire has been developed by Stewart et al and shown to be reliable and valid to determine the degree of disability caused by migraine. This study was designed as a national multicenter study to demonstrate the reliability and validity of Turkish version of MIDAS questionnaire. Patients applying to 17 Neurology Clinics in Turkey were evaluated at the baseline (visit 1), week 4 (visit 2), and week 12 (visit 3) visits in terms of disease severity and comprehensibility, internal consistency, test-retest reliability, and validity of MIDAS. Since the severity of the disease has been found to change significantly at visit 2 compared to visit 1, test-retest reliability was assessed using the MIDAS scores of a subgroup of patients whose disease severity remained unchanged (up to +/-3 days difference in the number of days with headache between visits 1 and 2). A total of 306 patients (86.2% female, mean age: 35.0 +/- 9.8 years) were enrolled into the study. A total of 65.7%, 77.5%, 82.0% of patients reported that "they had fully understood the MIDAS questionnaire" in visits 1, 2, and 3, respectively. A highly positive correlation was found between physician and patient and the applied total MIDAS scores in all three visits (Spearman correlation coefficients were R= 0.87, 0.83, and 0.90, respectively, P <.001). Internal consistency of MIDAS was assessed using Cronbach's alpha and was found at acceptable (>0.7) or excellent (>0.8) levels in both patient and physician applied MIDAS scores, respectively. Total MIDAS score showed good test-retest reliability (R= 0.68). 
Both the number of days with headache and the total MIDAS scores were positively correlated at all visits with correlation coefficients between 0.47 and 0.63. There was also a moderate degree of correlation (R= 0.54) between the total MIDAS score at week 12 and the number of days with headache at visit 2 + visit 3, which quantify headache-related disability over a 3-month period similar to MIDAS questionnaire. These findings demonstrated that the Turkish translation is equivalent to the English version of MIDAS in terms of internal consistency, test-retest reliability, and validity. Physicians can reliably use the Turkish translation of the MIDAS questionnaire in defining the severity of illness and its treatment strategy when applied as a self-administered report by migraine patients themselves.
Assessing the validity and reliability of the Pool Activity Level (PAL) Checklist for use with older people with dementia.

PubMed

Wenborn, Jennifer; Challis, David; Pool, Jackie; Burgess, Jane; Elliott, Nicola; Orrell, Martin

2008-03-01

Activity is key to maintaining physical and mental health and well-being. However, as dementia affects the ability to engage in activity, care-givers can find it difficult to provide appropriate activities. The Pool Activity Level (PAL) Checklist guides the selection of appropriate, personally meaningful activities. The aim of this study was to assess the reliability and validity of the PAL Checklist when used with older people with dementia. A postal questionnaire sent to activity providers assessed content validity. Validity and reliability were measured in a sample of 60 older people with dementia. The questionnaire response rate was 83% (102/122). Most respondents felt no important items were missing. Seven of the nine activities were ranked as 'very important' or 'essential' by at least 77% of the sample, indicating very good content validity. Correlation with measures of cognition, severity of dementia and activity performance demonstrated strong concurrent validity. Inter-item correlation indicated strong construct validity. Cronbach's alpha coefficient measured internal consistency as excellent (0.95). All items achieved acceptable test-retest reliability, and the majority demonstrated acceptable inter-rater reliability. We conclude that the PAL Checklist demonstrates adequate validity and reliability when used with older people with dementia and appears a useful tool for a variety of care settings.
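Several records in this listing report internal consistency as Cronbach's alpha. As a minimal sketch of how that statistic is usually computed (the function name and the toy scores are illustrative assumptions, not data or code from any study listed here), alpha compares the sum of the item variances with the variance of the total score:

```python
from statistics import variance


def cronbach_alpha(items):
    """Cronbach's alpha for a list of items.

    items: one list per questionnaire item, each holding one score per
    respondent (all lists the same length, at least two items).
    """
    k = len(items)
    # Total score of each respondent across all items.
    totals = [sum(scores) for scores in zip(*items)]
    # Sum of the sample variances of the individual items.
    item_variance_sum = sum(variance(item) for item in items)
    return k / (k - 1) * (1 - item_variance_sum / variance(totals))


# Hypothetical scores: 2 items answered by 3 respondents.
toy_items = [[1, 2, 3], [1, 3, 2]]
print(round(cronbach_alpha(toy_items), 3))  # 0.667
```

The formula is undefined when every respondent has the same total score, because the total-score variance is then zero.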
Effects of a common transcranial direct current stimulation (tDCS) protocol on motor evoked potentials found to be highly variable within individuals over 9 testing sessions.

PubMed

Horvath, Jared Cooney; Vogrin, Simon J; Carter, Olivia; Cook, Mark J; Forte, Jason D

2016-09-01

Transcranial direct current stimulation (tDCS) uses a weak electric current to modulate neuronal activity. A neurophysiologic outcome measure to demonstrate reliable tDCS modulation at the group level is transcranial magnetic stimulation engendered motor evoked potentials (MEPs). Here, we conduct a study testing the reliability of individual MEP response patterns following a common tDCS protocol. Fourteen participants (7m/7f) each underwent nine randomized sessions of 1 mA, 10 min tDCS (3 anode; 3 cathode; 3 sham) delivered using an M1/orbito-frontal electrode montage (sessions separated by an average of ~5.5 days). Fifteen MEPs were obtained prior to, immediately following and in 5 min intervals for 30 min following tDCS. TMS was delivered at 130 % resting motor threshold using neuronavigation to ensure consistent coil localization. A number of non-experimental variables were collected during each session. At the individual level, considerable variability was seen among different testing sessions. No participant demonstrated an excitatory response ≥20 % to all three anodal sessions, and no participant demonstrated an inhibitory response ≥20 % to all three cathodal sessions. Intra-class correlation revealed poor anodal and cathodal test-retest reliability [anode: ICC(2,1) = 0.062; cathode: ICC(2,1) = 0.055] and moderate sham test-retest reliability [ICC(2,1) = 0.433]. Results also revealed no significant effect of tDCS at the group level. Using this common protocol, we found the effects of tDCS on MEP amplitudes to be highly variable at the individual level. In addition, no significant effects of tDCS on MEP amplitude were found at the group level. Future studies should consider utilizing a more strict experimental protocol to potentially account for intra-individual response variations.
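The ICC(2,1) notation in the record above refers to the two-way random-effects, absolute-agreement, single-measurement form of the intraclass correlation. The sketch below shows one common way to compute it from two-way ANOVA mean squares; the function name and the small ratings table are illustrative assumptions and are unrelated to the study's data.

```python
def icc_2_1(scores):
    """ICC(2,1) from a subjects-by-sessions table.

    scores: list of rows, one row per subject, one column per session
    (or rater). Requires at least two subjects and two sessions.
    """
    n = len(scores)        # number of subjects
    k = len(scores[0])     # number of sessions or raters
    grand = sum(sum(row) for row in scores) / (n * k)
    row_means = [sum(row) / k for row in scores]
    col_means = [sum(row[j] for row in scores) / n for j in range(k)]

    # Sums of squares for the two-way layout without replication.
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for row in scores for x in row)
    ss_error = ss_total - ss_rows - ss_cols

    ms_rows = ss_rows / (n - 1)
    ms_cols = ss_cols / (k - 1)
    ms_error = ss_error / ((n - 1) * (k - 1))

    return (ms_rows - ms_error) / (
        ms_rows + (k - 1) * ms_error + k * (ms_cols - ms_error) / n
    )


# Illustrative ratings: 6 subjects, 4 sessions.
toy_scores = [
    [9, 2, 5, 8],
    [6, 1, 3, 2],
    [8, 4, 6, 8],
    [7, 1, 2, 6],
    [10, 5, 6, 9],
    [6, 2, 4, 7],
]
print(round(icc_2_1(toy_scores), 2))  # approximately 0.29
```

Because this form penalizes systematic differences between sessions as well as random error, it can be low even when subjects keep the same rank order across sessions.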
The Childhood Asperger Syndrome Test (CAST): Test-Retest Reliability in a High Scoring Sample

ERIC Educational Resources Information Center

Allison, Carrie; Williams, Jo; Scott, Fiona; Stott, Carol; Bolton, Patrick; Baron-Cohen, Simon; Brayne, Carol

2007-01-01

The Childhood Asperger Syndrome Test (CAST) is a 37-item parental self-completion questionnaire designed to screen for high-functioning autism spectrum conditions in epidemiological research. The CAST has previously demonstrated good accuracy for use as a screening test, with high sensitivity in studies with primary school aged children in…
Emulation applied to reliability analysis of reconfigurable, highly reliable, fault-tolerant computing systems

NASA Technical Reports Server (NTRS)

Migneault, G. E.

1979-01-01

Emulation techniques applied to the analysis of the reliability of highly reliable computer systems for future commercial aircraft are described. The lack of credible precision in reliability estimates obtained by analytical modeling techniques is first established. The difficulty is shown to be an unavoidable consequence of: (1) a high reliability requirement so demanding as to make system evaluation by use testing infeasible; (2) a complex system design technique, fault tolerance; (3) system reliability dominated by errors due to flaws in the system definition; and (4) elaborate analytical modeling techniques whose precision outputs are quite sensitive to errors of approximation in their input data. Next, the technique of emulation is described, indicating how its input is a simple description of the logical structure of a system and its output is the consequent behavior. Use of emulation techniques is discussed for pseudo-testing systems to evaluate bounds on the parameter values needed for the analytical techniques. Finally an illustrative example is presented to demonstrate from actual use the promise of the proposed application of emulation.
Reliability of measuring half-cycle cervical range of motion may be increased using a spirit level for calibration.

PubMed

Wilke, Jan; Niederer, Daniel; Vogt, Lutz; Banzer, Winfried

2018-02-01

Assessments of range of motion (ROM) represent an essential part of clinical diagnostics. Ultrasonic movement analyses have been demonstrated to provide reliable results when analyzing complete amplitudes (e.g., flexion-extension). However, due to subjective determination of the starting position, the assessment of half-cycle movements (e.g., flexion only) is less reproducible. The present study aimed to examine the reliability of measuring half-cycle cervical ROM using a spirit level for calibration. 20 healthy subjects (30 ± 12yrs, 7♂, 13♀) participated in the randomized, controlled, cross-over trial. In two testing sessions with one week of wash-out in between, cervical ROM was measured by means of an ultrasonic 3D movement analysis system using a test-retest design (baseline and 5 min post baseline). The sessions differed with reference to the mask carrying the ultrasound markers. It was removed during the 5 min break (mask off) or not (mask on). To determine the resting position, a bull's eye spirit level was used in each measurement. With ICC values of 0.90-0.98 (mask on, p < 0.001) and 0.90 to 0.97 (mask off, p < 0.001), both examined conditions demonstrated excellent test-retest reliability for separating the cycles regarding all movement planes. Cervical ROM during half-cycle movements can be assessed with excellent reliability using a spirit level. In contrast to subjective determination of the starting position, analyzing complete movement planes does not increase reliability. Using a defined and objective zero positioning allows the evaluation of repositioning tasks. Copyright © 2017 Elsevier Ltd. All rights reserved.
Further Validation of the Learning Alliance Inventory: The Roles of Working Alliance, Rapport, and Immediacy in Student Learning

ERIC Educational Resources Information Center

Rogers, Daniel T.

2015-01-01

This study further examined the reliability and validity of the Learning Alliance Inventory (LAI), a self-report measure designed to assess the working alliance between a student and a teacher. The LAI was found to have good internal consistency and test--retest reliability, and it demonstrated the predicted convergence with measures of immediacy…
Psychometric Properties of Korean Version of the Second Victim Experience and Support Tool (K-SVEST).

PubMed

Kim, Eun-Mi; Kim, Sun-Aee; Lee, Ju-Ry; Burlison, Jonathan D; Oh, Eui Geum

2018-02-13

"Second victims" are defined as healthcare professionals whose wellness is influenced by adverse clinical events. The Second Victim Experience and Support Tool (SVEST) was used to measure the second-victim experience and quality of support resources. Although the reliability and validity of the original SVEST have been validated, those for the Korean tool have not been validated. The aim of the study was to evaluate the psychometric properties of the Korean version of the SVEST. The study included 305 clinical nurses as participants. The SVEST was translated into Korean via back translation. Content validity was assessed by seven experts, and test-retest reliability was evaluated by 30 clinicians. Internal consistency and construct validity were assessed via confirmatory factor analysis. The analyses were performed using SPSS 23.0 and STATA 13.0 software. The content validity index value demonstrated validity; item- and scale-level content validity index values were both 0.95. Test-retest reliability and internal consistency reliability were satisfactory: the intraclass correlation coefficient was 0.71, and Cronbach α values ranged from 0.59 to 0.87. The CFA showed a significantly good fit for an eight-factor structure (χ² = 578.21, df = 303, comparative fit index = 0.92, Tucker-Lewis index = 0.90, root mean square error of approximation = 0.05). The K-SVEST demonstrated good psychometric properties and adequate validity and reliability. The results showed that the Korean version of SVEST demonstrated the extent of second victimhood and support resources in Korean healthcare workers and could aid in the development of support programs and evaluation of their effectiveness.
Measuring the Consistency in Change in Hepatitis B Knowledge among Three Different Types of Tests: True/False, Multiple Choice, and Fill in the Blanks Tests.

ERIC Educational Resources Information Center

Sahai, Vic; Demeyere, Petra; Poirier, Sheila; Piro, Felice

1998-01-01

The recall of information about Hepatitis B demonstrated by 180 seventh graders was tested with three test types: (1) short-answer; (2) true/false; and (3) multiple-choice. Short answer testing was the most reliable. Suggestions are made for the use of short-answer tests in evaluating student knowledge. (SLD)

Measuring Scale Invariance between and within Subjects.

ERIC Educational Resources Information Center

Benson, Jeri; Hocevar, Dennis

The present paper represents a demonstration of how LISREL V can be used to investigate scale invariance (1) across time (its relationship to test-retest reliability), and (2) across groups. Five criteria were established to test scale invariance across time and four criteria were established to test scale invariance across groups. Using the…
Validation of a Detailed Scoring Checklist for Use During Advanced Cardiac Life Support Certification

PubMed Central

McEvoy, Matthew D.; Smalley, Jeremy C.; Nietert, Paul J.; Field, Larry C.; Furse, Cory M.; Blenko, John W.; Cobb, Benjamin G.; Walters, Jenna L.; Pendarvis, Allen; Dalal, Nishita S.; Schaefer, John J.

2012-01-01

Introduction: Defining valid, reliable, defensible, and generalizable standards for the evaluation of learner performance is a key issue in assessing both baseline competence and mastery in medical education. However, prior to setting these standards of performance, the reliability of the scores yielding from a grading tool must be assessed. Accordingly, the purpose of this study was to assess the reliability of scores generated from a set of grading checklists used by non-expert raters during simulations of American Heart Association (AHA) MegaCodes. Methods: The reliability of scores generated from a detailed set of checklists, when used by four non-expert raters, was tested by grading team leader performance in eight MegaCode scenarios. Videos of the scenarios were reviewed and rated by trained faculty facilitators and by a group of non-expert raters. The videos were reviewed “continuously” and “with pauses.” Two content experts served as the reference standard for grading, and four non-expert raters were used to test the reliability of the checklists. Results: Our results demonstrate that non-expert raters are able to produce reliable grades when using the checklists under consideration, demonstrating excellent intra-rater reliability and agreement with a reference standard. The results also demonstrate that non-expert raters can be trained in the proper use of the checklist in a short amount of time, with no discernible learning curve thereafter. Finally, our results show that a single trained rater can achieve reliable scores of team leader performance during AHA MegaCodes when using our checklist in continuous mode, as measures of agreement in total scoring were very strong (Lin’s Concordance Correlation Coefficient = 0.96; Intraclass Correlation Coefficient = 0.97). 
Discussion: We have shown that our checklists can yield reliable scores, are appropriate for use by non-expert raters, and are able to be employed during continuous assessment of team leader performance during the review of a simulated MegaCode. This checklist may be more appropriate for use by Advanced Cardiac Life Support (ACLS) instructors during MegaCode assessments than current tools provided by the AHA. PMID: 22863996
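Lin's concordance correlation coefficient, cited in the record above, measures how closely paired scores fall on the line of identity rather than on any straight line. A minimal sketch follows, using the population (divide-by-n) moments of Lin's original definition; the function name and sample values are illustrative assumptions, not the study's data.

```python
def lins_ccc(x, y):
    """Lin's concordance correlation coefficient for paired scores."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Population variances and covariance (divide by n).
    var_x = sum((a - mean_x) ** 2 for a in x) / n
    var_y = sum((b - mean_y) ** 2 for b in y) / n
    cov_xy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y)) / n
    return 2 * cov_xy / (var_x + var_y + (mean_x - mean_y) ** 2)


# Hypothetical scores: the second rater is consistently one point higher.
rater = [1, 2, 3]
reference = [2, 3, 4]
print(round(lins_ccc(rater, reference), 3))  # 0.571
```

In this toy case the Pearson correlation would be 1.0, while the concordance coefficient drops to about 0.57 because of the constant one-point offset.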
The push-off test: development of a simple, reliable test of upper extremity weight-bearing capability.

PubMed

Vincent, Joshua I; MacDermid, Joy C; Michlovitz, Susan L; Rafuse, Richard; Wells-Rowsell, Christina; Wong, Owen; Bisbee, Leslie

2014-01-01

Longitudinal clinical measurement study. The push-off test (POT) is a novel and simple measure of upper extremity weight-bearing that can be measured with a grip dynamometer. There are no published studies on the validity and reliability of the POT. The relationship between upper extremity self-report activity/participation and impairment measures remain an unexplored realm. The primary purpose of this study is to estimate the intra and inter-rater reliability and construct validity of the POT. The secondary purpose is to estimate the relationship between upper extremity self-report activity/participation questionnaires and impairment measures. A convenience sample of 22 patients with wrist or elbow injuries were tested for POT, wrist/elbow range of motion (ROM), isometric wrist extension strength (WES) and grip strength; and completed two self-report activity/participation questionnaires: Disability of the Arm, Shoulder and the Hand (DASH) and Work Limitations Questionnaire (WLQ-26). POT's inter and intra-rater reliability and construct validity was tested. Pearson's correlations were run between the impairment measures and self-report questionnaires to look into the relationship amongst them. The POT demonstrated high inter-rater reliability (ICC affected = 0.97; 95% C.I. 0.93-0.99; ICC unaffected = 0.85; 95% C.I. 0.68-0.94) and intra-rater reliability (ICC affected = 0.96; 95% C.I. 0.92-0.97; ICC unaffected = 0.92; 95% C.I. 0.85-0.97). The POT was correlated moderately with the DASH (r = -0.47; p = 0.03). While examining the relationship between upper extremity self-reported activity/participation questionnaires and impairment measures the strongest correlation was between the DASH and the POT (r = -0.47; p = 0.03) and none of the correlations with the other physical impairment measures reached significance. At-work disability demonstrated insignificant correlations with physical impairments. 
The POT test provides a reliable and easily administered quantitative measure of ability to bear the load through an injured arm. Preliminary evidence supports a moderate relationship between load bearing measured by the POT and upper extremity function measured by the DASH. 1b. Copyright © 2014 Hanley & Belfus. Published by Elsevier Inc. All rights reserved.
Fatigue after stroke: the development and evaluation of a case definition.

PubMed

Lynch, Joanna; Mead, Gillian; Greig, Carolyn; Young, Archie; Lewis, Susan; Sharpe, Michael

2007-11-01

While fatigue after stroke is a common problem, it has no generally accepted definition. Our aim was to develop a case definition for post-stroke fatigue and to test its psychometric properties. A case definition with face validity and an associated structured interview was constructed. After initial piloting, the feasibility, reliability (test-retest and inter-rater) and concurrent validity (in relation to four fatigue severity scales) were determined in 55 patients with stroke. All participating patients provided satisfactory answers to all the case definition probe questions, demonstrating its feasibility. For test-retest reliability, kappa was 0.78 (95% CI, 0.57-0.94, P<.01) and for inter-rater reliability kappa was 0.80 (95% CI, 0.62-0.99, P<.01). Patients fulfilling the case definition also had substantially higher fatigue scores on four fatigue severity scales (P<.001) indicating concurrent validity. The proposed case definition is feasible to administer and reliable in practice, and there is evidence of concurrent validity. It requires further evaluation in different settings.
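The kappa values in the record above quantify agreement on a categorical judgment (case or non-case) after correcting for agreement expected by chance. A minimal sketch of unweighted Cohen's kappa for two raters is shown below; the function name and the example classifications are illustrative assumptions and are not taken from the study.

```python
from collections import Counter


def cohens_kappa(rater_a, rater_b):
    """Unweighted Cohen's kappa for two raters' categorical labels."""
    n = len(rater_a)
    # Observed proportion of cases on which the raters agree.
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Chance agreement from each rater's marginal label frequencies.
    count_a = Counter(rater_a)
    count_b = Counter(rater_b)
    labels = set(count_a) | set(count_b)
    expected = sum(count_a[c] * count_b[c] for c in labels) / (n * n)
    return (observed - expected) / (1 - expected)


# Hypothetical case/non-case decisions (1 = meets the case definition).
first = [1, 1, 0, 0, 1, 0, 1, 0]
second = [1, 1, 0, 0, 1, 0, 0, 1]
print(cohens_kappa(first, second))  # 0.5
```

Kappa is undefined when chance agreement equals 1, for example when both raters assign the same single label to every case.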
Validation of the Spanish Addiction Severity Index Multimedia Version (S-ASI-MV).

PubMed

Butler, Stephen F; Redondo, José Pedro; Fernandez, Kathrine C; Villapiano, Albert

2009-01-01

This study aimed to develop and test the reliability and validity of a Spanish adaptation of the ASI-MV, a computer administered version of the Addiction Severity Index, called the S-ASI-MV. Participants were 185 native Spanish-speaking adult clients from substance abuse treatment facilities serving Spanish-speaking clients in Florida, New Mexico, California, and Puerto Rico. Participants were administered the S-ASI-MV as well as Spanish versions of the general health subscale of the SF-36, the work and family unit subscales of the Social Adjustment Scale Self-Report, the Michigan Alcohol Screening Test, the alcohol and drug subscales of the Personality Assessment Inventory, and the Hopkins Symptom Checklist-90. Three-to-five-day test-retest reliability was examined along with criterion validity, convergent/discriminant validity, and factorial validity. Measurement invariance between the English and Spanish versions of the ASI-MV was also examined. The S-ASI-MV demonstrated good test-retest reliability (ICCs for composite scores between .59 and .93), criterion validity (rs for composite scores between .66 and .87), and convergent/discriminant validity. Factorial validity and measurement invariance were demonstrated. These results compared favorably with those reported for the original interviewer version of the ASI and the English version of the ASI-MV.
Development and evaluation of oral Cancer quality-of-life questionnaire (QOL-OC).

PubMed

Nie, Min; Liu, Chang; Pan, Yi-Chen; Jiang, Chen-Xi; Li, Bao-Ru; Yu, Xi-Jie; Wu, Xin-Yu; Zheng, Shu-Ning

2018-05-03

In this study scales and items for the Oral Cancer Quality-of-life Questionnaire (QOL-OC) were designed and the instrument was evaluated. The QOL-OC was developed and modified using the international definition of quality of life (QOL) promulgated by the European Organization for Research and Treatment of Cancer (EORTC) and analysis of the precedent measuring instruments. The contents of each item were determined in the context of the specific characteristics of oral cancer. Two hundred thirteen oral cancer patients were asked to complete both the EORTC core quality of life questionnaire (EORTC QLQ-C30) and the QOL-OC. Data collected was used to conduct factor analysis, test-retest reliability, internal consistency, and construct validity. Questionnaire compliance was relatively high. Fourteen of the 213 subjects accepted the same tests after 24 to 48 h demonstrating a high test-retest reliability for all five scales. Overall internal consistency surpasses 0.8. The outcome of the factor analysis coincides substantially with our theoretical conception. Each item shows a higher correlation coefficient within its own scale than the others which indicates high construct validity. QOL-OC demonstrates fairly good statistical reliability, validity, and feasibility. However, further tests and modification are needed to ensure its applicability to the quality-of-life assessment of Chinese oral cancer patients.
Testing of the SEE and OEE post-hip fracture.

PubMed

Resnick, Barbara; Orwig, Denise; Zimmerman, Sheryl; Hawkes, William; Golden, Justine; Werner-Bronzert, Michelle; Magaziner, Jay

2006-08-01

The purpose of this study was to test the reliability and validity of the Self-Efficacy for Exercise (SEE) and the Outcome Expectations for Exercise (OEE) scales in a sample of 166 older women post-hip fracture. There was some evidence of validity of the SEE and OEE based on confirmatory factor analysis and Rasch model testing, criterion based and convergent validity, and evidence of internal consistency based on alpha coefficients and separation indices and reliability based on R2 estimates. Rasch model testing demonstrated that some items had high variability. Based on these findings suggestions are made for how items could be revised and the scales improved for future use.
General inattentiveness is a long-term reliable trait independently predictive of psychological health: Danish validation studies of the Mindful Attention Awareness Scale.

PubMed

Jensen, Christian Gaden; Niclasen, Janni; Vangkilde, Signe Allerup; Petersen, Anders; Hasselbalch, Steen Gregers

2016-05-01

The Mindful Attention Awareness Scale (MAAS) measures perceived degree of inattentiveness in different contexts and is often used as a reversed indicator of mindfulness. MAAS is hypothesized to reflect a psychological trait or disposition when used outside attentional training contexts, but the long-term test-retest reliability of MAAS scores is virtually untested. It is unknown whether MAAS predicts psychological health after controlling for standardized socioeconomic status classifications. First, MAAS translated to Danish was validated psychometrically within a randomly invited healthy adult community sample (N = 490). Factor analysis confirmed that MAAS scores quantified a unifactorial construct of excellent composite reliability and consistent convergent validity. Structural equation modeling revealed that MAAS scores contributed independently to predicting psychological distress and mental health, after controlling for age, gender, income, socioeconomic occupational class, stressful life events, and social desirability (β = 0.32-.42, ps < .001). Second, MAAS scores showed satisfactory short-term test-retest reliability in 100 retested healthy university students. Finally, MAAS sample mean scores as well as individuals' scores demonstrated satisfactory test-retest reliability across a 6 months interval in the adult community (retested N = 407), intraclass correlations ≥ .74. MAAS scores displayed significantly stronger long-term test-retest reliability than scores measuring psychological distress (z = 2.78, p = .005). Test-retest reliability estimates did not differ within demographic and socioeconomic strata. Scores on the Danish MAAS were psychometrically validated in healthy adults. MAAS's inattentiveness scores reflected a unidimensional construct, long-term reliable disposition, and a factor of independent significance for predicting psychological health. (PsycINFO Database Record (c) 2016 APA, all rights reserved).
Flight Testing of the Capillary Pumped Loop 3 Experiment

NASA Technical Reports Server (NTRS)

Ottenstein, Laura; Butler, Dan; Ku, Jentung; Cheung, Kwok; Baldauff, Robert; Hoang, Triem

2002-01-01

The Capillary Pumped Loop 3 (CAPL 3) experiment was a multiple evaporator capillary pumped loop experiment that flew in the Space Shuttle payload bay in December 2001 (STS-108). The main objective of CAPL 3 was to demonstrate in micro-gravity a multiple evaporator capillary pumped loop system, capable of reliable start-up, reliable continuous operation, and heat load sharing, with hardware for a deployable radiator. Tests performed on orbit included start-ups, power cycles, low power tests (100 W total), high power tests (up to 1447 W total), heat load sharing, variable/fixed conductance transition tests, and saturation temperature change tests. The majority of the tests were completed successfully, although the experiment did exhibit an unexpected sensitivity to shuttle maneuvers. This paper describes the experiment, the tests performed during the mission, and the test results.
The validity and reliability of script concordance test in otolaryngology residency training.

PubMed

Iravani, Kamyar; Amini, Mitra; Doostkam, Aida; Dehbozorgian, Mahnaz

2016-04-01

The script concordance test (SCT) is one of the best tools used to evaluate clinical reasoning in ill-defined clinical situations. The aim of this study was to demonstrate SCT application in otolaryngology residency training. A 20-item otolaryngology SCT containing 60 questions was administered to 26 otolaryngology residents. The test was prepared by two otolaryngologists familiar with medical education. The questions were validated by otolaryngology experts. The panel consisted of 9 academic staff in the field of otolaryngology. The Pearson correlation test was used to assess the reliability of the test. The obtained mean scores were 68.4 ± 5.8 (out of 100) for residents and 78.2 ± 6.4 (out of 100) for experts. There was a significant difference between the two scores (p<0.005). Cronbach's alpha value was 0.80. The SCT is a reliable tool to evaluate clinical reasoning in otolaryngology residents. It should be included in otolaryngology residency training.
High reliability level on single-mode 980nm-1060 nm diode lasers for telecommunication and industrial applications

NASA Astrophysics Data System (ADS)

Van de Casteele, J.; Bettiati, M.; Laruelle, F.; Cargemel, V.; Pagnod-Rossiaux, P.; Garabedian, P.; Raymond, L.; Laffitte, D.; Fromy, S.; Chambonnet, D.; Hirtz, J. P.

2008-02-01

We demonstrate a very high reliability level on 980-1060nm high-power single-mode lasers through multi-cell tests. First, we show how our chip design and technology enable high reliability levels. Then, we aged 758 devices for 9500 hours across 6 cells with high current (0.8A-1.2A) and high submount temperature (65°C-105°C) for the reliability demonstration. Sudden catastrophic failure is the main degradation mechanism observed. A statistical failure rate model gives an Arrhenius thermal activation energy of 0.51eV and a power law forward current acceleration factor of 5.9. For high-power submarine applications (360mW pump module output optical power), this model exhibits a failure rate as low as 9 FIT at 13°C, while ultra-high power terrestrial modules (600mW) lie below 220 FIT at 25°C. Wear-out is observed only at very high current levels, with no reliability impact below 1.1A. For the 1060nm chip, step-stress tests were performed and a set of devices was aged for more than 2000 hours in different stress conditions. First results are in accordance with the 980nm product, with an estimated MTTF of more than 100 khours. These reliability and performance features of 980-1060nm laser diodes will make high-power single-mode emitters the best choice for a number of telecommunication and industrial applications in the next few years.
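The failure rate model in this abstract combines an Arrhenius temperature term with a power law in forward current. A minimal Python sketch of such a model follows. It assumes that the reported 5.9 is the power-law exponent and that temperatures enter as absolute temperatures; the function name and the example operating points are hypothetical and are not taken from the paper.

```python
import math

BOLTZMANN_EV_PER_K = 8.617333262e-5  # Boltzmann constant in eV/K


def acceleration_factor(ea_ev, n, t_use_c, t_stress_c, i_use_a, i_stress_a):
    """Ratio of failure rate at stress conditions to failure rate at use conditions.

    Assumed model: rate ~ I**n * exp(-Ea / (k * T)), with T in kelvin.
    """
    t_use_k = t_use_c + 273.15
    t_stress_k = t_stress_c + 273.15
    # Arrhenius term: hotter stress temperature gives a factor above 1.
    thermal = math.exp((ea_ev / BOLTZMANN_EV_PER_K) * (1.0 / t_use_k - 1.0 / t_stress_k))
    # Power-law term in forward current.
    current = (i_stress_a / i_use_a) ** n
    return thermal * current


# Ea = 0.51 eV and n = 5.9 come from the abstract; the operating points are illustrative only.
af = acceleration_factor(0.51, 5.9, t_use_c=25.0, t_stress_c=105.0, i_use_a=0.8, i_stress_a=1.2)
print(f"acceleration factor: {af:.0f}")
```

Dividing a failure rate observed under stress by this factor gives the extrapolated rate at use conditions, which is how a low FIT figure can be supported by a test of limited duration.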
The Rothschild Scale for Antidepressant Tachyphylaxis: reliability and validity.

PubMed

Rothschild, Anthony J

2008-01-01

After successful treatment of an episode of major depression, many patients complain of symptoms of apathy or decreased motivation (described by patients as "the blahs"), fatigue, dullness in cognitive function, sleep disturbance, weight gain, and sexual dysfunction; however, the characterization of this phenomenon of antidepressant tachyphylaxis has been hampered by the lack of an accepted definition and a reliable and valid assessment tool. To address this problem, the development and assessment of the Rothschild Scale for Antidepressant Tachyphylaxis (RSAT) are described. The RSAT consists of 6 self-report items assessing energy level, motivation and interest, cognitive functioning, weight gain, sleep, and sexual functioning. A seventh item, affect, is assessed by the interviewer. Each item is measured within a 5-point ordinal scale with anchor points developed to illustrate each rating. This study assesses the internal consistency, test-retest reliability, convergent and discriminant validity, sensitivity, specificity, and positive and negative predictive values of the RSAT. The RSAT demonstrated excellent internal consistency and scale reliability (Cronbach alpha = .902). The RSAT also demonstrated strong test-retest reliability (for depressed patients: r = 0.822, P < .01; for control subjects: r = 0.887, P < .01). The total RSAT score did not correlate with severity of depression as measured by the total Hamilton Depression Rating Scale score or the Hamilton Depression Rating Scale item 1 (depressed mood), supporting the discriminant validity of the RSAT for use in antidepressant tachyphylaxis. The RSAT is a reliable measure of antidepressant tachyphylaxis.
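Internal consistency of the kind reported here is usually summarized with Cronbach's alpha. The sketch below shows the standard formula in Python. The function name and the toy data are hypothetical, and nothing here reproduces the RSAT data or the authors' software.

```python
from statistics import variance


def cronbach_alpha(item_scores):
    """Cronbach's alpha for a list of respondents, each a list of k item scores."""
    k = len(item_scores[0])
    # Transpose so that each tuple holds all respondents' scores for one item.
    items = list(zip(*item_scores))
    item_variance_sum = sum(variance(column) for column in items)
    total_variance = variance([sum(row) for row in item_scores])
    return (k / (k - 1)) * (1.0 - item_variance_sum / total_variance)


# Toy data: four respondents answering two items (illustrative only).
print(round(cronbach_alpha([[1, 2], [2, 1], [3, 3], [4, 5]]), 3))
```

The function uses sample variances throughout and is undefined when all respondents have the same total score.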
Clinimetric properties of the Tinetti Mobility Test, Four Square Step Test, Activities-specific Balance Confidence Scale, and spatiotemporal gait measures in individuals with Huntington's disease.

PubMed

Kloos, Anne D; Fritz, Nora E; Kostyk, Sandra K; Young, Gregory S; Kegelmeyer, Deb A

2014-09-01

Individuals with Huntington's disease (HD) experience balance and gait problems that lead to falls. Clinicians currently have very little information about the reliability and validity of outcome measures to determine the efficacy of interventions that aim to reduce balance and gait impairments in HD. This study examined the reliability and concurrent validity of spatiotemporal gait measures, the Tinetti Mobility Test (TMT), Four Square Step Test (FSST), and Activities-specific Balance Confidence (ABC) Scale in individuals with HD. Participants with HD [n = 20; mean age ± SD=50.9 ± 13.7; 7 male] were tested on spatiotemporal gait measures and the TMT, FSST, and ABC Scale before and after a six week period to determine test-retest reliability and minimal detectable change (MDC) values. Linear relationships between gait and clinical measures were estimated using Pearson's correlation coefficients. Spatiotemporal gait measures, the TMT total and the FSST showed good to excellent test-retest reliability (ICC > 0.75). MDC values were 0.30 m/s and 0.17 m/s for velocity in forward and backward walking respectively, four points for the TMT, and 3s for the FSST. The TMT and FSST were highly correlated with most spatiotemporal measures. The ABC Scale demonstrated lower reliability and less concurrent validity than other measures. The high test-retest reliability over a six week period and concurrent validity between the TMT, FSST, and spatiotemporal gait measures suggest that the TMT and FSST may be useful outcome measures for future intervention studies in ambulatory individuals with HD. Copyright © 2014 Elsevier B.V. All rights reserved.
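Minimal detectable change is commonly derived from the standard error of measurement, which in turn depends on the ICC and the between-subject standard deviation. The abstract does not state which formula the authors used, so the Python sketch below is an assumption based on the widely used 95% form; the function names and input values are hypothetical.

```python
import math


def standard_error_of_measurement(sd, icc):
    """SEM = SD * sqrt(1 - ICC)."""
    return sd * math.sqrt(1.0 - icc)


def minimal_detectable_change(sd, icc, z=1.96):
    """MDC at the confidence level implied by z: z * sqrt(2) * SEM."""
    return z * math.sqrt(2.0) * standard_error_of_measurement(sd, icc)


# Illustrative inputs only: SD of 2.0 points and ICC of 0.75.
print(round(minimal_detectable_change(2.0, 0.75), 2))
```

The factor sqrt(2) reflects that a change score involves measurement error from two test occasions.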
Questionnaire-based assessment of executive functioning: Psychometrics.

PubMed

Castellanos, Irina; Kronenberger, William G; Pisoni, David B

2018-01-01

The psychometric properties of the Learning, Executive, and Attention Functioning (LEAF) scale were investigated in an outpatient clinical pediatric sample. As a part of clinical testing, the LEAF scale, which broadly measures neuropsychological abilities related to executive functioning and learning, was administered to parents of 118 children and adolescents referred for psychological testing at a pediatric psychology clinic; 85 teachers also completed LEAF scales to assess reliability across different raters and settings. Scores on neuropsychological tests of executive functioning and academic achievement were abstracted from charts. Psychometric analyses of the LEAF scale demonstrated satisfactory internal consistency, parent-teacher inter-rater reliability in the small to large effect size range, and test-retest reliability in the large effect size range, similar to values for other executive functioning checklists. Correlations between corresponding subscales on the LEAF and other behavior checklists were large, while most correlations with neuropsychological tests of executive functioning and achievement were significant but in the small to medium range. Results support the utility of the LEAF as a reliable and valid questionnaire-based assessment of delays and disturbances in executive functioning and learning. Applications and advantages of the LEAF and other questionnaire measures of executive functioning in clinical neuropsychology settings are discussed.
Clinical assessment of scapular positioning in musicians: an intertester reliability study.

PubMed

Struyf, Filip; Nijs, Jo; De Coninck, Kris; Giunta, Marco; Mottram, Sarah; Meeusen, Romain

2009-01-01

The reliability of the measurement of the distance between the posterior border of the acromion and the wall and the reliability of the modified lateral scapular slide test have not been studied. Overall, the reliability of the clinical tools used to assess scapular positioning has not been studied in musicians. To examine the intertester reliability of scapular observation and 2 clinical tests for the assessment of scapular positioning in musicians. Intertester reliability study. University research laboratory. Thirty healthy student musicians at a single university. Two assessors performed a standardized observation protocol, the measurement of the distance between the posterior border of the acromion and the wall, and the modified lateral scapular slide test. Each assessor was blinded to the other's findings. The intertester reliability coefficients (kappa) for the observation in relaxed position, during unloaded movement, and during loaded movement were 0.41, 0.63, and 0.36, respectively. The kappa values for the observation of tilting and winging at rest were 0.48 and 0.42, respectively; during unloaded movement, the kappa values were 0.52 and 0.78, respectively; and with a 1-kg load, the kappa values were 0.24 and 0.50, respectively. The intraclass correlation coefficient (ICC) of the measurement of the acromial distance was 0.72 in relaxed position and 0.75 with the participant actively retracting both shoulders. The ICCs for the modified lateral scapular slide test varied between 0.63 and 0.58. Our results demonstrated that the modified lateral scapular slide test was not a reliable tool to assess scapular positioning in these participants. Our data indicated that scapular observation in the relaxed position and during unloaded abduction in the frontal plane was a reliable assessment tool. The reliability of the measurement of the distance between the posterior border of the acromion and the wall in healthy musicians was moderate.
Clinical Assessment of Scapular Positioning in Musicians: An Intertester Reliability Study

PubMed Central

Struyf, Filip; Nijs, Jo; De Coninck, Kris; Giunta, Marco; Mottram, Sarah; Meeusen, Romain

2009-01-01

Abstract Context: The reliability of the measurement of the distance between the posterior border of the acromion and the wall and the reliability of the modified lateral scapular slide test have not been studied. Overall, the reliability of the clinical tools used to assess scapular positioning has not been studied in musicians. Objective: To examine the intertester reliability of scapular observation and 2 clinical tests for the assessment of scapular positioning in musicians. Design: Intertester reliability study. Setting: University research laboratory. Patients or Other Participants: Thirty healthy student musicians at a single university. Main Outcome Measure(s): Two assessors performed a standardized observation protocol, the measurement of the distance between the posterior border of the acromion and the wall, and the modified lateral scapular slide test. Each assessor was blinded to the other's findings. Results: The intertester reliability coefficients (κ) for the observation in relaxed position, during unloaded movement, and during loaded movement were 0.41, 0.63, and 0.36, respectively. The κ values for the observation of tilting and winging at rest were 0.48 and 0.42, respectively; during unloaded movement, the κ values were 0.52 and 0.78, respectively; and with a 1-kg load, the κ values were 0.24 and 0.50, respectively. The intraclass correlation coefficient (ICC) of the measurement of the acromial distance was 0.72 in relaxed position and 0.75 with the participant actively retracting both shoulders. The ICCs for the modified lateral scapular slide test varied between 0.63 and 0.58. Conclusions: Our results demonstrated that the modified lateral scapular slide test was not a reliable tool to assess scapular positioning in these participants. Our data indicated that scapular observation in the relaxed position and during unloaded abduction in the frontal plane was a reliable assessment tool. 
The reliability of the measurement of the distance between the posterior border of the acromion and the wall in healthy musicians was moderate. PMID:19771291
Reliability of measuring hip abductor strength following total knee arthroplasty using a hand-held dynamometer.

PubMed

Schache, Margaret B; McClelland, Jodie A; Webster, Kate E

2016-01-01

To investigate the test-retest reliability of measuring hip abductor strength in patients with total knee arthroplasty (TKA) using a hand-held dynamometer (HHD) with two different types of resistance: belt and manual resistance. Test-retest reliability of 30 subjects (17 female, 13 male, 71.9 ± 7.4 years old), 9.2 ± 2.7 days post TKA was measured using belt and therapist resistance. Retest reliability was calculated with intra-class coefficients (ICC3,1) and 95% confidence intervals (CI) for both the group average and the individual scores. A paired t-test assessed whether a difference existed between the belt and therapist methods of resistance. ICCs were 0.82 and 0.80 for the belt and therapist resisted methods, respectively. Hip abductor strength increases of 8 N (14%) for belt resisted and 14 N (17%) for therapist resisted measurements of the group average exceeded the 95% CI and may represent real change. For individuals, hip abductor strength increases of 33 N (72%) (belt resisted) and 57 N (79%) (therapist resisted) could be interpreted as real change. Hip abductor strength can be reliably measured using HHD in the clinical setting with the described protocol. Belt resistance demonstrated slightly higher test-retest reliability. Reliable measurement of hip abductor muscle strength in patients with TKA is important to ensure deficiencies are addressed in rehabilitation programs and function is maximized. Hip abductor strength can be reliably measured with a hand-held dynamometer in the clinical setting using manual or belt resistance.
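The ICC(3,1) named in this abstract is the two-way mixed-effects, consistency, single-measurement form. A minimal Python sketch computed from the two-way ANOVA mean squares is given below; the function name and the toy data are hypothetical and do not come from the study.

```python
def icc_3_1(scores):
    """ICC(3,1) for a list of subjects, each a list of k repeated measurements."""
    n = len(scores)
    k = len(scores[0])
    grand = sum(sum(row) for row in scores) / (n * k)
    row_means = [sum(row) / k for row in scores]
    col_means = [sum(row[j] for row in scores) / n for j in range(k)]
    # Partition the total sum of squares into subjects, sessions and residual error.
    ss_total = sum((x - grand) ** 2 for row in scores for x in row)
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_error = ss_total - ss_rows - ss_cols
    ms_rows = ss_rows / (n - 1)
    ms_error = ss_error / ((n - 1) * (k - 1))
    return (ms_rows - ms_error) / (ms_rows + (k - 1) * ms_error)


# Toy test-retest data: four subjects measured in two sessions (illustrative only).
print(round(icc_3_1([[1, 2], [2, 1], [3, 4], [4, 3]]), 3))
```

Because this form measures consistency rather than absolute agreement, a constant shift between sessions does not lower the coefficient.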
The influence of validity criteria on Immediate Post-Concussion Assessment and Cognitive Testing (ImPACT) test-retest reliability among high school athletes.

PubMed

Brett, Benjamin L; Solomon, Gary S

2017-04-01

Research findings to date on the stability of Immediate Post-Concussion Assessment and Cognitive Testing (ImPACT) Composite scores have been inconsistent, requiring further investigation. The use of test validity criteria across these studies also has been inconsistent. Using multiple measures of stability, we examined test-retest reliability of repeated ImPACT baseline assessments in high school athletes across various validity criteria reported in previous studies. A total of 1146 high school athletes completed baseline cognitive testing using the online ImPACT test battery on two occasions approximately two years apart. No participant sustained a concussion between assessments. Five forms of validity criteria used in previous test-retest studies were applied to the data, and differences in reliability were compared. Intraclass correlation coefficients (ICCs) ranged in composite scores from .47 (95% confidence interval, CI [.38, .54]) to .83 (95% CI [.81, .85]) and showed little change across a two-year interval for all five sets of validity criteria. Regression based methods (RBMs) examining the test-retest stability demonstrated a lack of significant change in composite scores across the two-year interval for all forms of validity criteria, with no cases falling outside the expected range of 90% confidence intervals. The application of more stringent validity criteria does not alter test-retest reliability, nor does it account for some of the variation observed across previously performed studies. As such, the ImPACT manual validity criteria should be used in the determination of test validity and in the individualized approach to concussion management. Potential future efforts to improve test-retest reliability are discussed.
Markov chains for testing redundant software

NASA Technical Reports Server (NTRS)

White, Allan L.; Sjogren, Jon A.

1988-01-01

A preliminary design for a validation experiment has been developed that addresses several problems unique to assuring the extremely high quality of multiple-version programs in process-control software. The procedure uses Markov chains to model the error states of the multiple version programs. The programs are observed during simulated process-control testing, and estimates are obtained for the transition probabilities between the states of the Markov chain. The experimental Markov chain model is then expanded into a reliability model that takes into account the inertia of the system being controlled. The reliability of the multiple version software is computed from this reliability model at a given confidence level using confidence intervals obtained for the transition probabilities during the experiment. An example demonstrating the method is provided.
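The estimation step described here, transition probabilities with confidence intervals obtained from observed state sequences, can be sketched in Python as follows. The abstract does not say which interval the authors used, so the Wilson score interval is an assumption, and the state labels and function names are hypothetical.

```python
import math
from collections import Counter


def wilson_interval(successes, trials, z=1.96):
    """Wilson score interval for a binomial proportion."""
    if trials == 0:
        return (0.0, 1.0)
    p = successes / trials
    denom = 1.0 + z * z / trials
    centre = (p + z * z / (2 * trials)) / denom
    half = z * math.sqrt(p * (1 - p) / trials + z * z / (4 * trials * trials)) / denom
    return (max(0.0, centre - half), min(1.0, centre + half))


def estimate_transitions(sequence):
    """Map each observed (source, destination) pair to (estimate, (lower, upper))."""
    counts = Counter(zip(sequence, sequence[1:]))
    totals = Counter()
    for (src, _dst), n in counts.items():
        totals[src] += n
    return {
        (src, dst): (n / totals[src], wilson_interval(n, totals[src]))
        for (src, dst), n in counts.items()
    }


# Hypothetical observed error states of a multiple-version program.
observed = ["ok", "ok", "err", "ok", "ok", "ok"]
for pair, (p, ci) in estimate_transitions(observed).items():
    print(pair, round(p, 2), tuple(round(b, 2) for b in ci))
```

The interval bounds, rather than the point estimates, would then be carried into the reliability model to obtain a result at a stated confidence level.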
Software development predictors, error analysis, reliability models and software metric analysis

NASA Technical Reports Server (NTRS)

Basili, Victor

1983-01-01

The use of dynamic characteristics as predictors for software development was studied. It was found that there are some significant factors that could be useful as predictors. From a study on software errors and complexity, it was shown that meaningful results can be obtained which allow insight into software traits and the environment in which it is developed. Reliability models were studied. The research included the field of program testing because the validity of some reliability models depends on the answers to some unanswered questions about testing. In studying software metrics, data collected from seven software engineering laboratory (FORTRAN) projects were examined and three effort reporting accuracy checks were applied to demonstrate the need to validate a data base. Results are discussed.

The evaluation of lumbar multifidus muscle function via palpation: reliability and validity of a new clinical test.

PubMed

Hebert, Jeffrey J; Koppenhaver, Shane L; Teyhen, Deydre S; Walker, Bruce F; Fritz, Julie M

2015-06-01

The lumbar multifidus muscle provides an important contribution to lumbar spine stability, and the restoration of lumbar multifidus function is a frequent goal of rehabilitation. Currently, there are no reliable and valid physical examination procedures available to assess lumbar multifidus function among patients with low back pain. To examine the inter-rater reliability and concurrent validity of the multifidus lift test (MLT) to identify lumbar multifidus dysfunction among patients with low back pain. A cross-sectional analysis of reliability and concurrent validity performed in a university outpatient research facility. Thirty-two persons aged 18 to 60 years with current low back pain and a minimum modified Oswestry disability score of 20%. Study participants were excluded if they reported a history of lumbar spine surgery, lumbar radiculopathy, medical red flags, osteoporosis, or had recently been treated with spinal manipulation or trunk stabilization exercises. Concurrent measures of lumbar multifidus muscle function at the L4-L5 and L5-S1 levels were obtained with the MLT (index test) and real-time ultrasound imaging (reference standard). The inter-rater reliability of the MLT was examined by measuring the level of agreement between two blinded examiners. Concurrent validity of the MLT was investigated by comparing clinicians' judgments with real-time ultrasound imaging measures of lumbar multifidus function. Inter-rater reliability of the MLT was substantial to excellent (κ=0.75 to 0.81, p≤.01) and free from errors of bias and prevalence. When performed at L4-L5 or L5-S1, the MLT demonstrated evidence of concurrent validity through its relationship with the reference standard results at L4-L5 (rbis=0.59-0.73, p≤.01). The MLT generally failed to demonstrate a relationship with the reference standard results from the L5-S1 level. 
Our results provide preliminary evidence supporting the reliability and validity of the MLT to assess lumbar multifidus function at the L4-L5 spinal level. Additional research examining the measurement properties and utility of this test should be undertaken before confident implementation with patients. Copyright © 2015 Elsevier Inc. All rights reserved.
Reliability of reports of childhood trauma in bipolar disorder: A test-retest study over 18 months.

PubMed

Shannon, Ciaran; Hanna, Donncha; Tumelty, Leo; Waldron, Daniel; Maguire, Chrissie; Mowlds, William; Meenagh, Ciaran; Mulholland, Ciaran

2016-01-01

This study aimed to explore the reliability of self-reported trauma histories in a population with a diagnosis of bipolar disorder using the Childhood Trauma Questionnaire. Previous studies in other populations suggest high reliability of trauma histories over time, and it was postulated that a similar high reliability would be demonstrated in this population. A total of 39 patients with a confirmed diagnosis (Diagnostic and Statistical Manual of Mental Disorders, 4th Edition, criteria) were followed up and readministered the Childhood Trauma Questionnaire after 18 months. Cohen's kappa scores and intraclass correlations suggested reasonable test-retest reliability over the 18-month time period of the study for all types of childhood abuse, namely, emotional, physical, and sexual abuse and physical and emotional neglect. Intraclass correlations ranged from r = .50 (sexual abuse) to r = .96 (physical abuse). Cohen's kappas ranged from .44 (sexual abuse) to .76 (physical abuse). Retrospective reports of childhood trauma can be seen as reliable and are in keeping with results found with other mental health populations.
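Cohen's kappa, used here for test-retest agreement on categorical reports, corrects the observed agreement for the agreement expected by chance. A minimal Python sketch for two sets of categorical ratings follows; the function name and the toy data are hypothetical and unrelated to the study data.

```python
from collections import Counter


def cohen_kappa(ratings_a, ratings_b):
    """Cohen's kappa for two equal-length sequences of categorical ratings."""
    n = len(ratings_a)
    observed = sum(a == b for a, b in zip(ratings_a, ratings_b)) / n
    freq_a = Counter(ratings_a)
    freq_b = Counter(ratings_b)
    # Chance agreement from the marginal frequencies of each occasion.
    expected = sum(freq_a[c] * freq_b[c] for c in freq_a) / (n * n)
    return (observed - expected) / (1.0 - expected)


# Toy data: abuse reported (1) or not (0) at baseline and at follow-up.
baseline = [1, 1, 0, 0, 1, 0]
follow_up = [1, 1, 0, 0, 0, 0]
print(round(cohen_kappa(baseline, follow_up), 3))
```

The function is undefined when chance agreement equals 1, which happens if every rating on both occasions falls in a single category.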
Lifetime Reliability Evaluation of Structural Ceramic Parts with the CARES/LIFE Computer Program

NASA Technical Reports Server (NTRS)

Nemeth, Noel N.; Powers, Lynn M.; Janosik, Lesley A.; Gyekenyesi, John P.

1993-01-01

The computer program CARES/LIFE calculates the time-dependent reliability of monolithic ceramic components subjected to thermomechanical and/or proof test loading. This program is an extension of the CARES (Ceramics Analysis and Reliability Evaluation of Structures) computer program. CARES/LIFE accounts for the phenomenon of subcritical crack growth (SCG) by utilizing the power law, Paris law, or Walker equation. The two-parameter Weibull cumulative distribution function is used to characterize the variation in component strength. The effects of multiaxial stresses are modeled using either the principle of independent action (PIA), Weibull's normal stress averaging method (NSA), or Batdorf's theory. Inert strength and fatigue parameters are estimated from rupture strength data of naturally flawed specimens loaded in static, dynamic, or cyclic fatigue. Two example problems demonstrating cyclic fatigue parameter estimation and component reliability analysis with proof testing are included.
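The two-parameter Weibull cumulative distribution function mentioned in this abstract can be illustrated with a short Python sketch. It is a simplification that assumes a single uniformly stressed specimen and ignores the volume and surface integration, multiaxial stress models, and subcritical crack growth that CARES/LIFE handles; the function name and example values are hypothetical.

```python
import math


def weibull_failure_probability(stress, scale, modulus):
    """Two-parameter Weibull CDF: P_f = 1 - exp(-(stress / scale) ** modulus)."""
    if stress <= 0.0:
        return 0.0
    return 1.0 - math.exp(-((stress / scale) ** modulus))


# Illustrative values: characteristic strength 400 MPa and Weibull modulus 10.
for applied in (200.0, 300.0, 400.0):
    print(applied, round(weibull_failure_probability(applied, 400.0, 10.0), 4))
```

A higher Weibull modulus means less scatter in strength, so the failure probability rises more sharply as the applied stress approaches the characteristic strength.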
Reliability and Validity Evidence of Multiple Balance Assessments in Athletes With a Concussion

PubMed Central

Murray, Nicholas; Salvatore, Anthony; Powell, Douglas; Reed-Jones, Rebecca

2014-01-01

Context: An estimated 300 000 sport-related concussion injuries occur in the United States annually. Approximately 30% of individuals with concussions experience balance disturbances. Common methods of balance assessment include the Clinical Test of Sensory Organization and Balance (CTSIB), the Sensory Organization Test (SOT), the Balance Error Scoring System (BESS), and the Romberg test; however, the National Collegiate Athletic Association recommended the Wii Fit as an alternative measure of balance in athletes with a concussion. A central concern regarding the implementation of the Wii Fit is whether it is reliable and valid for measuring balance disturbance in athletes with concussion. Objective: To examine the reliability and validity evidence for the CTSIB, SOT, BESS, Romberg test, and Wii Fit for detecting balance disturbance in athletes with a concussion. Data Sources: Literature considered for review included publications with reliability and validity data for the assessments of balance (CTSIB, SOT, BESS, Romberg test, and Wii Fit) from PubMed, PsycINFO, and CINAHL. Data Extraction: We identified 63 relevant articles for consideration in the review. Of the 63 articles, 28 were considered appropriate for inclusion and 35 were excluded. Data Synthesis: No current reliability or validity information supports the use of the CTSIB, SOT, Romberg test, or Wii Fit for balance assessment in athletes with a concussion. The BESS demonstrated moderate to high reliability (interclass correlation coefficient = 0.87) and low to moderate validity (sensitivity = 34%, specificity = 87%). However, the Romberg test and Wii Fit have been shown to be reliable tools in the assessment of balance in Parkinson patients. Conclusions: The BESS can evaluate balance problems after a concussion. However, it lacks the ability to detect balance problems after the third day of recovery. 
Further investigation is needed to establish the use of the CTSIB, SOT, Romberg test, and Wii Fit for assessing balance in athletes with concussions. PMID:24933431
Test-retest reliability of the safe driving behavior measure for community-dwelling elderly drivers.

PubMed

Song, Chiang-Soon; Lee, Joo-Hyun; Han, Sang-Woo

2016-06-01

[Purpose] The Safe Driving Behavior Measure (SDBM) is a self-report measurement tool that assesses the safe-driving behaviors of the elderly. The purpose of this study was to evaluate the test-retest reliability of the SDBM among community-dwelling elderly drivers. [Subjects and Methods] A total of sixty-one community-dwelling elderly were enrolled to investigate the reliability of the SDBM. The SDBM was assessed in two sessions that were conducted three days apart in a quiet and well-organized assessment room. The test-retest reliability of the overall scores and three domain scores of the SDBM was statistically evaluated using intraclass correlation coefficients [ICC (2,1)]. Pearson correlation coefficients were used to quantify bivariate associations among the three domains of the SDBM. [Results] The SDBM demonstrated excellent test-retest reliability for community-dwelling elderly drivers. The Cronbach alpha coefficients of the three domains of person-vehicle (0.979), person-environment (0.944), and person-vehicle-environment (0.971) of the SDBM indicate high internal consistency. [Conclusion] The results of this study suggest that the SDBM is a reliable measure for evaluating the safe driving of automobiles by community-dwelling elderly, and is adequate for detecting changes in scores in clinical settings.
Reliability and validity of pendulum test measures of spasticity obtained with the Polhemus tracking system from patients with chronic stroke

PubMed Central

Bohannon, Richard W; Harrison, Steven; Kinsella-Shaw, Jeffrey

2009-01-01

Background Spasticity is a common impairment accompanying stroke. Spasticity of the quadriceps femoris muscle can be quantified using the pendulum test. The measurement properties of pendular kinematics captured using a magnetic tracking system have not been studied among patients who have experienced a stroke. Therefore, this study describes the test-retest reliability and known groups and convergent validity of the pendulum test measures obtained with the Polhemus tracking system. Methods Eight patients with chronic stroke underwent pendulum tests with their affected and unaffected lower limbs, with and without the addition of a 2.2 kg cuff weight at the ankle, using the Polhemus magnetic tracking system. Also measured bilaterally were knee resting angles, Ashworth scores (grades 0–4) of quadriceps femoris muscles, patellar tendon (knee jerk) reflexes (grades 0–4), and isometric knee extension force. Results Three measures obtained from pendular traces of the affected side were reliable (intraclass correlation coefficient ≥ .844). Known groups validity was confirmed by demonstration of a significant difference in the measurements between sides. Convergent validity was supported by correlations ≥ .57 between pendulum test measures and other measures reflective of spasticity. Conclusion Pendulum test measures obtained with the Polhemus tracking system from the affected side of patients with stroke have good test-retest reliability and both known groups and convergent validity. PMID:19642989
Reliability and validity of pendulum test measures of spasticity obtained with the Polhemus tracking system from patients with chronic stroke.

PubMed

Bohannon, Richard W; Harrison, Steven; Kinsella-Shaw, Jeffrey

2009-07-30

Spasticity is a common impairment accompanying stroke. Spasticity of the quadriceps femoris muscle can be quantified using the pendulum test. The measurement properties of pendular kinematics captured using a magnetic tracking system have not been studied among patients who have experienced a stroke. Therefore, this study describes the test-retest reliability and known groups and convergent validity of the pendulum test measures obtained with the Polhemus tracking system. Eight patients with chronic stroke underwent pendulum tests with their affected and unaffected lower limbs, with and without the addition of a 2.2 kg cuff weight at the ankle, using the Polhemus magnetic tracking system. Also measured bilaterally were knee resting angles, Ashworth scores (grades 0-4) of quadriceps femoris muscles, patellar tendon (knee jerk) reflexes (grades 0-4), and isometric knee extension force. Three measures obtained from pendular traces of the affected side were reliable (intraclass correlation coefficient ≥ .844). Known groups validity was confirmed by demonstration of a significant difference in the measurements between sides. Convergent validity was supported by correlations ≥ .57 between pendulum test measures and other measures reflective of spasticity. Pendulum test measures obtained with the Polhemus tracking system from the affected side of patients with stroke have good test-retest reliability and both known groups and convergent validity.
Validity and reliability of Internet-based physiotherapy assessment for musculoskeletal disorders: a systematic review.

PubMed

Mani, Suresh; Sharma, Shobha; Omar, Baharudin; Paungmali, Aatit; Joseph, Leonard

2017-04-01

Purpose The purpose of this review is to systematically explore and summarise the validity and reliability of telerehabilitation (TR)-based physiotherapy assessment for musculoskeletal disorders. Method A comprehensive systematic literature review was conducted using a number of electronic databases (PubMed, EMBASE, PsycINFO, Cochrane Library and CINAHL) for articles published between January 2000 and May 2015. Studies that examined the validity and the inter- and intra-rater reliabilities of TR-based physiotherapy assessment for musculoskeletal conditions were included. Two independent reviewers used the Quality Appraisal Tool for studies of diagnostic Reliability (QAREL) and the Quality Assessment of Diagnostic Accuracy Studies (QUADAS) tool to assess the methodological quality of reliability and validity studies respectively. Results A total of 898 hits were retrieved, of which 11 articles meeting the inclusion criteria were reviewed. Nine studies explored the concurrent validity, inter- and intra-rater reliabilities, while two studies examined only the concurrent validity. Reviewed studies were moderate to good in methodological quality. The physiotherapy assessments such as pain, swelling, range of motion, muscle strength, balance, gait and functional assessment demonstrated good concurrent validity. However, the reported concurrent validity of lumbar spine posture, special orthopaedic tests, neurodynamic tests and scar assessments ranged from low to moderate. Conclusion TR-based physiotherapy assessment was technically feasible with overall good concurrent validity and excellent reliability, except for lumbar spine posture, orthopaedic special tests, neurodynamic tests and scar assessment.
Assessing the World Health Organization's Alcohol Use Disorder Identification Test among Incarcerated Women.

ERIC Educational Resources Information Center

El-Bassel, Nabila; Schilling, Robert; Ivanoff, Andre; Chen, Duan-Rung; Hanson, Meredith

1998-01-01

Describes the results of administering the World Health Organization's Alcohol Use Disorder Identification Test (AUDIT) to 400 incarcerated drug-using women. Reports on AUDIT's utility, validity, and reliability. Results demonstrate that AUDIT can be used to identify problem drinkers among incarcerated, drug-using women. (MKA)
INTRA-RATER RELIABILITY OF THE MULTIPLE SINGLE-LEG HOP-STABILIZATION TEST AND RELATIONSHIPS WITH AGE, LEG DOMINANCE AND TRAINING.

PubMed

Sawle, Leanne; Freeman, Jennifer; Marsden, Jonathan

2017-04-01

Balance is a complex construct, affected by multiple components such as strength and co-ordination. However, whilst assessing an athlete's dynamic balance is an important part of clinical examination, there is no gold standard measure. The multiple single-leg hop-stabilization test is a functional test which may offer a method of evaluating the dynamic attributes of balance, but it needs to show adequate intra-tester reliability. The purpose of this study was to assess the intra-rater reliability of a dynamic balance test, the multiple single-leg hop-stabilization test, on the dominant and non-dominant legs. Intra-rater reliability study. Fifteen active participants were tested twice with a 10-minute break between tests. The outcome measure was the multiple single-leg hop-stabilization test score, based on a clinically assessed numerical scoring system. Results were analysed using an Intraclass Correlation Coefficient (ICC 2,1) and Bland-Altman plots. Regression analyses explored relationships between test scores, leg dominance, age and training (an alpha level of p = 0.05 was selected). ICCs for intra-rater reliability were 0.85 for the dominant and non-dominant legs (confidence intervals = 0.62-0.95 and 0.61-0.95 respectively). Bland-Altman plots showed scores within two standard deviations. A significant correlation was observed between the dominant and non-dominant leg on balance scores (R² = 0.49, p<0.05), and better balance was associated with younger participants in their non-dominant leg (R² = 0.28, p<0.05) and their dominant leg (R² = 0.39, p<0.05), and with a higher number of hours spent training for the non-dominant leg (R² = 0.37, p<0.05). The multiple single-leg hop-stabilisation test demonstrated strong intra-tester reliability with active participants. Younger participants who trained more had better balance scores. This test may be a useful measure for evaluating the dynamic attributes of balance.
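Several records in this listing report an ICC(2,1). As a rough illustration of how that coefficient is commonly computed (two-way random effects, absolute agreement, single measurement), here is a minimal Python sketch. The function name and the ratings matrix are illustrative assumptions, not data or code from any of the studies listed.

```python
def icc_2_1(scores):
    """ICC(2,1) from a subjects-by-measurements table.

    `scores` is a list of rows, one per subject; each row holds the k
    measurements taken on that subject (e.g. test and retest sessions).
    """
    n = len(scores)        # number of subjects
    k = len(scores[0])     # number of measurements per subject
    grand = sum(sum(row) for row in scores) / (n * k)
    row_means = [sum(row) / k for row in scores]
    col_means = [sum(row[j] for row in scores) / n for j in range(k)]

    # Mean squares from the two-way ANOVA decomposition.
    ms_rows = k * sum((m - grand) ** 2 for m in row_means) / (n - 1)
    ms_cols = n * sum((m - grand) ** 2 for m in col_means) / (k - 1)
    ss_err = sum(
        (scores[i][j] - row_means[i] - col_means[j] + grand) ** 2
        for i in range(n)
        for j in range(k)
    )
    ms_err = ss_err / ((n - 1) * (k - 1))

    return (ms_rows - ms_err) / (
        ms_rows + (k - 1) * ms_err + k * (ms_cols - ms_err) / n
    )


# Illustrative ratings only: 6 subjects, 4 measurements each.
ratings = [
    [9, 2, 5, 8],
    [6, 1, 3, 2],
    [8, 4, 6, 8],
    [7, 1, 2, 6],
    [10, 5, 6, 9],
    [6, 2, 4, 7],
]
print(round(icc_2_1(ratings), 2))  # prints 0.29
```

Because ICC(2,1) penalises systematic differences between sessions (the column term), it can be much lower than a Pearson correlation computed on the same data.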
Reliability and validity of selected measures associated with increased fall risk in females over the age of 45 years with distal radius fracture - A pilot study.

PubMed

Mehta, Saurabh P; MacDermid, Joy C; Richardson, Julie; MacIntyre, Norma J; Grewal, Ruby

2015-01-01

Clinical measurement. This study examined test-retest reliability and convergent/divergent construct validity of selected tests and measures that assess balance impairment, fear of falling (FOF), impaired physical activity (PA), and lower extremity muscle strength (LEMS) in females >45 years of age after distal radius fracture (DRF). Twenty-one female participants with DRF were assessed on two occasions. Timed Up and Go, Functional Reach, and One Leg Standing tests assessed balance impairment. Shortened Falls Efficacy Scale, Activity-specific Balance Confidence scale, and Fall Risk Perception Questionnaire assessed FOF. International Physical Activity Questionnaire and Rapid Assessment of Physical Activity were administered to assess PA level. Chair stand test and isometric muscle strength testing for hip and knee assessed LEMS. Intraclass correlation coefficients (ICC) examined the test-retest reliability of the measures. Pearson correlation coefficients (r) examined concurrent relationships between the measures. The results demonstrated fair to excellent test-retest reliability (ICC between 0.50 and 0.96) and low to moderate concordance between the measures (low if r ≤ 0.4; moderate if r = 0.4-0.7). The results provide preliminary estimates of test-retest reliability and convergent/divergent construct validity of selected measures associated with increased risk for falling in females >45 years of age after DRF. Further research directions to advance knowledge regarding fall risk assessment in the DRF population have been identified. Copyright © 2015 Hanley & Belfus. Published by Elsevier Inc. All rights reserved.
Technology demonstrator program for Space Station Environmental Control Life Support System

NASA Technical Reports Server (NTRS)

Adams, Alan M.; Platt, Gordon K.; Claunch, William C.; Humphries, William R.

1987-01-01

The main objectives and requirements of the NASA/Marshall Space Flight Center Technology Demonstration Program are discussed. The program consists of a comparative test and a 90-day manned system test to evaluate an Environmental Control and Life Support System (ECLSS). In the comparative test phase, 14 types of subsystems which perform oxygen and water reclamation functions are to be examined in terms of performance, maintenance/service requirements, reliability, and safety. The manned chamber testing phase involves a four-person crew using a partial ECLSS for 90 days. The schedule for the program and the program hardware requirements are described.
Reliability evaluation methodology for NASA applications

NASA Technical Reports Server (NTRS)

Taneja, Vidya S.

1992-01-01

Liquid rocket engine technology has been characterized by the development of complex systems containing a large number of subsystems, components, and parts. The trend toward even larger and more complex systems is continuing. Liquid rocket engineers have been focusing mainly on performance-driven designs to increase payload delivery of a launch vehicle for a given mission. In other words, although the failure of a single inexpensive part or component may cause the failure of the system, reliability in general has not been considered as one of the system parameters like cost or performance. Up till now, quantification of reliability has not been a consideration during system design and development in the liquid rocket industry. Engineers and managers have long been aware of the fact that the reliability of the system increases during development, but no serious attempts have been made to quantify reliability. As a result, a method to quantify reliability during design and development is needed. This includes application of probabilistic models which utilize both engineering analysis and test data. Classical methods require the use of operating data for reliability demonstration. In contrast, the method described in this paper is based on similarity, analysis, and testing combined with Bayesian statistical analysis.
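The abstract does not say which Bayesian model the paper uses. Purely as a generic illustration of combining prior engineering judgment with pass/fail test data, the Python sketch below applies a conjugate Beta-binomial update. The function names, the Beta prior and every number are assumptions made for the example, not details taken from the paper.

```python
def update_beta(prior_a, prior_b, successes, failures):
    """Conjugate update: a Beta(a, b) prior plus binomial test results."""
    return prior_a + successes, prior_b + failures


def beta_mean(a, b):
    """Mean of a Beta(a, b) distribution, used here as a point estimate."""
    return a / (a + b)


# Hypothetical prior from similarity to earlier hardware: roughly
# "9 successes in 10 trials" worth of information.
prior = (9.0, 1.0)

# Hypothetical new test campaign: 20 firings, no failures.
posterior = update_beta(*prior, successes=20, failures=0)

print("prior mean reliability:    ", round(beta_mean(*prior), 3))
print("posterior mean reliability:", round(beta_mean(*posterior), 3))
```

The design point this shows is that the prior acts like a block of pseudo-tests, so a reliability estimate can be formed before much operating data exists and then sharpened as tests accumulate.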
SMART Rotor Development and Wind Tunnel Test

DTIC Science & Technology

2009-09-01

amplifier and control system, and data acquisition, processing, and display systems. Boeing's LRTS (Fig. 2) consists of a sled structure that... [Figure 2: System integration test at whirl tower] ...demonstrated. Finally, the reliability of the flap actuation system was successfully proven in more than 60 hours of wind tunnel testing
RELIABILITY OF ANKLE-FOOT MORPHOLOGY, MOBILITY, STRENGTH, AND MOTOR PERFORMANCE MEASURES.

PubMed

Fraser, John J; Koldenhoven, Rachel M; Saliba, Susan A; Hertel, Jay

2017-12-01

Assessments of foot posture, morphology, intersegmental mobility, strength and motor control of the ankle-foot complex are commonly used clinically, but measurement properties of many assessments are unclear. To determine test-retest and inter-rater reliability, standard error of measurement, and minimal detectable change of morphology, joint excursion and play, strength, and motor control of the ankle-foot complex. Reliability study. Twenty-four healthy, recreationally-active young adults without history of ankle-foot injury were assessed by two clinicians on two occasions, three to ten days apart. Measurement properties were assessed for foot morphology (foot posture index, total and truncated length, width, arch height), joint excursion (weight-bearing dorsiflexion, rearfoot and hallux goniometry, forefoot inclinometry, 1st metatarsal displacement) and joint play, strength (handheld dynamometry), and motor control rating during intrinsic foot muscle (IFM) exercises. Clinician order was randomized using a Latin Square. The clinicians performed independent examinations and did not confer on the findings for the duration of the study. Test-retest and inter-tester reliability and agreement were assessed using intraclass correlation coefficients (ICC 2,k) and weighted kappa (Kw). Test-retest reliability ICC were as follows: morphology: .80-1.00, joint excursion: .58-.97, joint play: -.67 to .84, strength: .67-.92, IFM motor rating: Kw -.01 to .71. Inter-rater reliability ICC were as follows: morphology: .81-1.00, joint excursion: .32-.97, joint play: -1.06 to 1.00, strength: .53-.90, and IFM motor rating: Kw .02-.56. Measures of ankle-foot posture, morphology, joint excursion, and strength demonstrated fair to excellent test-retest and inter-rater reliability. Test-retest reliability for rating of perceived difficulty and motor performance was good to excellent for short-foot, toe-spread-out, and hallux exercises and poor to fair for lesser toe extension. Joint play measures had poor to fair reliability overall. The findings of this study should be considered when choosing methods of clinical assessment and outcome measures in practice and research.
Reliability Measure of a Clinical Test: Appreciation of Music in Cochlear Implantees (AMICI)

PubMed Central

Cheng, Min-Yu; Spitzer, Jaclyn B.; Shafiro, Valeriy; Sheft, Stanley; Mancuso, Dean

2014-01-01

Purpose The goals of this study were (1) to investigate the reliability of a clinical music perception test, Appreciation of Music in Cochlear Implantees (AMICI), and (2) examine associations between the perception of music and speech. AMICI was developed as a clinical instrument for assessing music perception in persons with cochlear implants (CIs). The test consists of four subtests: (1) music versus environmental noise discrimination, (2) musical instrument identification (closed-set), (3) musical style identification (closed-set), and (4) identification of musical pieces (open-set). To be clinically useful, it is crucial for AMICI to demonstrate high test-retest reliability, so that CI users can be assessed and retested after changes in maps or programming strategies. Research Design Thirteen CI subjects were tested with AMICI for the initial visit and retested again 10–14 days later. Two speech perception tests (consonant-nucleus-consonant [CNC] and Bamford-Kowal-Bench Speech-in-Noise [BKB-SIN]) were also administered. Data Analysis Test-retest reliability and equivalence of the test’s three forms were analyzed using paired t-tests and correlation coefficients, respectively. Correlation analysis was also conducted between results from the music and speech perception tests. Results Results showed no significant difference between test and retest (p > 0.05) with adequate power (0.9) as well as high correlations between the three forms (Forms A and B, r = 0.91; Forms A and C, r = 0.91; Forms B and C, r = 0.95). Correlation analysis showed high correlation between AMICI and BKB-SIN (r = −0.71), and moderate correlation between AMICI and CNC (r = 0.4). Conclusions The study showed AMICI is highly reliable for assessing musical perception in CI users. PMID:24384082
A Statistical Perspective on Highly Accelerated Testing

DOE Office of Scientific and Technical Information (OSTI.GOV)

Thomas, Edward V.

Highly accelerated life testing has been heavily promoted at Sandia (and elsewhere) as a means to rapidly identify product weaknesses caused by flaws in the product's design or manufacturing process. During product development, a small number of units are forced to fail at high stress. The failed units are then examined to determine the root causes of failure. The identification of the root causes of product failures exposed by highly accelerated life testing can instigate changes to the product's design and/or manufacturing process that result in a product with increased reliability. It is widely viewed that this qualitative use of highly accelerated life testing (often associated with the acronym HALT) can be useful. However, highly accelerated life testing has also been proposed as a quantitative means for "demonstrating" the reliability of a product where unreliability is associated with loss of margin via an identified and dominating failure mechanism. It is assumed that the dominant failure mechanism can be accelerated by changing the level of a stress factor that is assumed to be related to the dominant failure mode. In extreme cases, a minimal number of units (often from a pre-production lot) are subjected to a single highly accelerated stress relative to normal use. If no (or sufficiently few) units fail at this high stress level, some might claim that a certain level of reliability has been demonstrated (relative to normal use conditions). Underlying this claim are assumptions regarding the level of knowledge associated with the relationship between the stress level and the probability of failure.
The primary purpose of this document is to discuss (from a statistical perspective) the efficacy of using accelerated life testing protocols (and, in particular, "highly accelerated" protocols) to make quantitative inferences concerning the performance of a product (e.g., reliability) when in fact there is lack-of-knowledge and uncertainty concerning the assumed relationship between the stress level and performance. In addition, this document contains recommendations for conducting more informative accelerated tests.
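To make the "no failures, therefore reliability demonstrated" argument concrete, the Python sketch below uses the standard zero-failure binomial bound: if n units are tested with no failures, the lower confidence bound on reliability at confidence C is (1 - C)^(1/n). The function names and numbers are illustrative assumptions and are not taken from the report.

```python
import math


def demonstrated_reliability(n_units, confidence):
    """Lower confidence bound on reliability after n_units pass with zero failures."""
    return (1.0 - confidence) ** (1.0 / n_units)


def units_required(target_reliability, confidence):
    """Smallest zero-failure sample size that demonstrates target_reliability."""
    return math.ceil(math.log(1.0 - confidence) / math.log(target_reliability))


# With only three units and no failures, the 90% lower bound is modest.
print(round(demonstrated_reliability(3, 0.90), 3))  # prints 0.464

# Demonstrating 0.95 reliability at 95% confidence needs far more units.
print(units_required(0.95, 0.95))  # prints 59
```

This bound applies only to the condition actually tested. Carrying it over to normal-use conditions requires an assumed stress-to-failure relationship, which is the source of uncertainty the abstract highlights.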
Initial Development and Validation of the BullyHARM: The Bullying, Harassment, and Aggression Receipt Measure.

PubMed

Hall, William J

2016-11-01

This article describes the development and preliminary validation of the Bullying, Harassment, and Aggression Receipt Measure (BullyHARM). The development of the BullyHARM involved a number of steps and methods, including a literature review, expert review, cognitive testing, readability testing, data collection from a large sample, reliability testing, and confirmatory factor analysis. A sample of 275 middle school students was used to examine the psychometric properties and factor structure of the BullyHARM, which consists of 22 items and 6 subscales: physical bullying, verbal bullying, social/relational bullying, cyber-bullying, property bullying, and sexual bullying. First-order and second-order factor models were evaluated. Results demonstrate that the first-order factor model had superior fit. Results of reliability testing indicate that the BullyHARM scale and subscales have very good internal consistency reliability. Findings indicate that the BullyHARM has good properties regarding content validation and respondent-related validation and is a promising instrument for measuring bullying victimization in school.
Initial Development and Validation of the BullyHARM: The Bullying, Harassment, and Aggression Receipt Measure

PubMed Central

Hall, William J.

2017-01-01

This article describes the development and preliminary validation of the Bullying, Harassment, and Aggression Receipt Measure (BullyHARM). The development of the BullyHARM involved a number of steps and methods, including a literature review, expert review, cognitive testing, readability testing, data collection from a large sample, reliability testing, and confirmatory factor analysis. A sample of 275 middle school students was used to examine the psychometric properties and factor structure of the BullyHARM, which consists of 22 items and 6 subscales: physical bullying, verbal bullying, social/relational bullying, cyber-bullying, property bullying, and sexual bullying. First-order and second-order factor models were evaluated. Results demonstrate that the first-order factor model had superior fit. Results of reliability testing indicate that the BullyHARM scale and subscales have very good internal consistency reliability. Findings indicate that the BullyHARM has good properties regarding content validation and respondent-related validation and is a promising instrument for measuring bullying victimization in school. PMID:28194041
Reliability Demonstration Approach for Advanced Stirling Radioisotope Generator

NASA Technical Reports Server (NTRS)

Ha, CHuong; Zampino, Edward; Penswick, Barry; Spronz, Michael

2010-01-01

Developed for future space missions as a high-efficiency power system, the Advanced Stirling Radioisotope Generator (ASRG) has a design life requirement of 14 yr in space following a potential storage of 3 yr after fueling. In general, the demonstration of long-life dynamic systems remains difficult in part due to the perception that the wearout of moving parts cannot be minimized, and associated failures are unpredictable. This paper shows that a combination of systematic analytical methods, extensive experience gained from technology development, and well-planned tests can be used to ensure a high level of reliability of ASRG. With this approach, all potential risks from each life phase of the system are evaluated and the mitigation adequately addressed. This paper also provides a summary of important test results obtained to date for ASRG and the planned effort for system-level extended operation.

Five times sit-to-stand test in subjects with total knee replacement: Reliability and relationship with functional mobility tests.

PubMed

Medina-Mirapeix, Francesc; Vivo-Fernández, Iván; López-Cañizares, Juan; García-Vidal, José A; Benítez-Martínez, Josep Carles; Del Baño-Aledo, María Elena

2018-01-01

The objective was to determine the inter-observer and test-retest reliability of the "Five-repetition sit-to-stand" (5STS) test in patients with total knee replacement (TKR) and to explore the correlation between 5STS and two mobility tests. A reliability study was conducted among 24 outpatients with TKR (mean age 72.13, S.D. 10.67; 50% were women). They were recruited from a traumatology unit of a public hospital via convenience sampling. A physiotherapist and a trauma physician assessed each patient at the same time. The same physiotherapist performed a second 5STS measurement 45-60 min after the first one. Reliability was assessed with intraclass correlation coefficients (ICCs) and Bland-Altman plots. The Pearson coefficient was calculated to assess the correlation between 5STS, the timed up and go test (TUG) and four-meter gait speed (4MGS). ICCs for inter-observer and test-retest reliability of the 5STS were 0.998 (95% confidence interval [CI], 0.995-0.999) and 0.982 (95% CI, 0.959-0.992). The inter-observer Bland-Altman plot showed limits between -0.82 and 1.06 with a mean of 0.11 and no heteroscedasticity within the data. The Bland-Altman plot for test-retest showed the limits between 1.76 and 4.16, a mean of 1.20 and heteroscedasticity within the data. The Pearson correlation coefficient revealed significant correlation between 5STS and TUG (r=0.7, p<0.001) and 4MGS (r=-0.583, p=0.003). This study demonstrates excellent inter-observer and test-retest reliability of the 5STS when it is used in people with TKR, and also significant correlation with other functional mobility tests. These findings support the use of 5STS as an outcome measure in the TKR population. Copyright © 2017 Elsevier B.V. All rights reserved.
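Bland-Altman limits of agreement such as those quoted above are conventionally the mean difference plus or minus 1.96 standard deviations of the paired differences. The abstract does not state the multiplier it used, so the Python sketch below assumes that convention. The function name and the timing values are made up for illustration.

```python
import statistics


def bland_altman(first, second):
    """Return (bias, lower limit, upper limit) for paired measurements."""
    diffs = [a - b for a, b in zip(first, second)]
    bias = statistics.mean(diffs)   # mean difference between the two series
    sd = statistics.stdev(diffs)    # sample SD of the paired differences
    return bias, bias - 1.96 * sd, bias + 1.96 * sd


# Hypothetical sit-to-stand times in seconds from two observers.
observer_a = [10, 12, 11, 13]
observer_b = [9, 12, 12, 11]

bias, lower, upper = bland_altman(observer_a, observer_b)
print(f"bias={bias:.2f}, limits of agreement=({lower:.2f}, {upper:.2f})")
```

By construction the bias lies midway between the two limits, which is a quick consistency check when reading reported values.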
RELIABILITY OF THE ONE REPETITION-MAXIMUM POWER CLEAN TEST IN ADOLESCENT ATHLETES

PubMed Central

Faigenbaum, Avery D.; McFarland, James E.; Herman, Robert; Naclerio, Fernando; Ratamess, Nicholas A.; Kang, Jie; Myer, Gregory D.

2013-01-01

Although the power clean test is routinely used to assess strength and power performance in adult athletes, the reliability of this measure in younger populations has not been examined. Therefore, the purpose of this study was to determine the reliability of the one repetition maximum (1 RM) power clean in adolescent athletes. Thirty-six male athletes (age 15.9 ± 1.1 yrs, body mass 79.1 ± 20.3 kg, height 175.1 ± 7.4 cm) who had more than 1 year of training experience with weightlifting exercises performed a 1 RM power clean on two nonconsecutive days in the afternoon following standardized procedures. All test procedures were supervised by a senior level weightlifting coach and consisted of a systematic progression in test load until the maximum resistance that could be lifted for one repetition using proper exercise technique was determined. Data were analyzed using an intraclass correlation coefficient (ICC [2,k]), Pearson correlation coefficient (r), repeated measures ANOVA, Bland-Altman plot, and typical error analyses. Analysis of the data revealed that the test measures were highly reliable, demonstrating a test-retest ICC of 0.98 (95% CI = 0.96–0.99). Testing also demonstrated a strong relationship between 1 RM measures on trial 1 and trial 2 (r=0.98, p<0.0001) with no significant difference in power clean performance between trials (70.6 ± 19.8 vs. 69.8 ± 19.8 kg). Bland-Altman plots confirmed no systematic shift in 1 RM between trial 1 and trial 2. The typical error to be expected between 1 RM power clean trials is 2.9 kg and a change of at least 8.0 kg is indicated to determine a real change in lifting performance between tests in young lifters. No injuries occurred during the study period and the testing protocol was well-tolerated by all subjects. 
These findings indicate that 1 RM power clean testing has a high degree of reproducibility in trained male adolescent athletes when standardized testing procedures are followed and qualified instruction is present. PMID:22233786
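The abstract does not say how the 8.0 kg threshold was derived from the 2.9 kg typical error. One common convention, assumed here, is the minimal detectable change at 95% confidence, MDC95 = 1.96 × √2 × typical error, which gives a figure of the same size. The function name in this Python sketch is illustrative.

```python
import math


def mdc95(typical_error):
    """Minimal detectable change at 95% confidence from a typical error (SEM)."""
    # sqrt(2) accounts for measurement error being present in both trials.
    return 1.96 * math.sqrt(2) * typical_error


print(round(mdc95(2.9), 1))  # prints 8.0 (kg)
```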
Development of an International Odor Identification Test for Children: The Universal Sniff Test.

PubMed

Schriever, Valentin A; Agosin, Eduardo; Altundag, Aytug; Avni, Hadas; Cao Van, Helene; Cornejo, Carlos; de Los Santos, Gonzalo; Fishman, Gad; Fragola, Claudio; Guarneros, Marco; Gupta, Neelima; Hudson, Robyn; Kamel, Reda; Knaapila, Antti; Konstantinidis, Iordanis; Landis, Basile N; Larsson, Maria; Lundström, Johan N; Macchi, Alberto; Mariño-Sánchez, Franklin; Martinec Nováková, Lenka; Mori, Eri; Mullol, Joaquim; Nord, Marie; Parma, Valentina; Philpott, Carl; Propst, Evan J; Rawan, Ahmed; Sandell, Mari; Sorokowska, Agnieszka; Sorokowski, Piotr; Sparing-Paschke, Lisa-Marie; Stetzler, Carolin; Valder, Claudia; Vodicka, Jan; Hummel, Thomas

2018-07-01

To assess olfactory function in children and to create and validate an odor identification test to diagnose olfactory dysfunction in children, which we called the Universal Sniff (U-Sniff) test. This is a multicenter study involving 19 countries. The U-Sniff test was developed in 3 phases including 1760 children age 5-7 years. Phase 1: identification of potentially recognizable odors; phase 2: selection of odorants for the odor identification test; and phase 3: evaluation of the test and acquisition of normative data. Test-retest reliability was evaluated in a subgroup of children (n = 27), and the test was validated using children with congenital anosmia (n = 14). Twelve odors were familiar to children and, therefore, included in the U-Sniff test. Children scored a mean ± SD of 9.88 ± 1.80 points out of 12. Normative data was obtained and reported for each country. The U-Sniff test demonstrated a high test-retest reliability (r 27 = 0.83, P < .001) and enabled discrimination between normosmia and children with congenital anosmia with a sensitivity of 100% and specificity of 86%. The U-Sniff is a valid and reliable method of testing olfaction in children and can be used internationally. Copyright © 2018 Elsevier Inc. All rights reserved.
Validity and Reliability of the Brazilian Version of the Rapid Estimate of Adult Literacy in Dentistry--BREALD-30.

PubMed

Junkes, Monica C; Fraiz, Fabian C; Sardenberg, Fernanda; Lee, Jessica Y; Paiva, Saul M; Ferreira, Fernanda M

2015-01-01

The aim of the present study was to translate, perform the cross-cultural adaptation of the Rapid Estimate of Adult Literacy in Dentistry to Brazilian-Portuguese language and test the reliability and validity of this version. After translation and cross-cultural adaptation, interviews were conducted with 258 parents/caregivers of children in treatment at the pediatric dentistry clinics and health units in Curitiba, Brazil. To test the instrument's validity, the scores of Brazilian Rapid Estimate of Adult Literacy in Dentistry (BREALD-30) were compared based on occupation, monthly household income, educational attainment, general literacy, use of dental services and three dental outcomes. The BREALD-30 demonstrated good internal reliability. Cronbach's alpha ranged from 0.88 to 0.89 when words were deleted individually. The analysis of test-retest reliability revealed excellent reproducibility (intraclass correlation coefficient = 0.983 and Kappa coefficient ranging from moderate to nearly perfect). In the bivariate analysis, BREALD-30 scores were significantly correlated with the level of general literacy (rs = 0.593) and income (rs = 0.327) and significantly associated with occupation, educational attainment, use of dental services, self-rated oral health and the respondent's perception regarding his/her child's oral health. However, only the association between the BREALD-30 score and the respondent's perception regarding his/her child's oral health remained significant in the multivariate analysis. The BREALD-30 demonstrated satisfactory psychometric properties and is therefore applicable to adults in Brazil.
Validity and Reliability of the Brazilian Version of the Rapid Estimate of Adult Literacy in Dentistry – BREALD-30

PubMed Central

Junkes, Monica C.; Fraiz, Fabian C.; Sardenberg, Fernanda; Lee, Jessica Y.; Paiva, Saul M.; Ferreira, Fernanda M.

2015-01-01

Objective The aim of the present study was to translate, perform the cross-cultural adaptation of the Rapid Estimate of Adult Literacy in Dentistry to Brazilian-Portuguese language and test the reliability and validity of this version. Methods After translation and cross-cultural adaptation, interviews were conducted with 258 parents/caregivers of children in treatment at the pediatric dentistry clinics and health units in Curitiba, Brazil. To test the instrument's validity, the scores of Brazilian Rapid Estimate of Adult Literacy in Dentistry (BREALD-30) were compared based on occupation, monthly household income, educational attainment, general literacy, use of dental services and three dental outcomes. Results The BREALD-30 demonstrated good internal reliability. Cronbach’s alpha ranged from 0.88 to 0.89 when words were deleted individually. The analysis of test-retest reliability revealed excellent reproducibility (intraclass correlation coefficient = 0.983 and Kappa coefficient ranging from moderate to nearly perfect). In the bivariate analysis, BREALD-30 scores were significantly correlated with the level of general literacy (rs = 0.593) and income (rs = 0.327) and significantly associated with occupation, educational attainment, use of dental services, self-rated oral health and the respondent’s perception regarding his/her child's oral health. However, only the association between the BREALD-30 score and the respondent’s perception regarding his/her child's oral health remained significant in the multivariate analysis. Conclusion The BREALD-30 demonstrated satisfactory psychometric properties and is therefore applicable to adults in Brazil. PMID:26158724
Development and validation of a fatigue assessment scale for U.S. construction workers.

PubMed

Zhang, Mingzong; Sparer, Emily H; Murphy, Lauren A; Dennerlein, Jack T; Fang, Dongping; Katz, Jeffrey N; Caban-Martinez, Alberto J

2015-02-01

To develop a fatigue assessment scale and test its reliability and validity for commercial construction workers. Using a two-phased approach, we first identified items (first phase) for the development of a Fatigue Assessment Scale for Construction Workers (FASCW) through review of existing scales in the scientific literature, key informant interviews (n = 11) and focus groups (three groups with six workers each) with construction workers. The second phase included assessment for the reliability, validity, and sensitivity of the new scale using a repeated-measures study design with a convenience sample of construction workers (n = 144). Phase one resulted in a 16-item preliminary scale that after factor analysis yielded a final 10-item scale with two sub-scales ("Lethargy" and "Bodily Ailment"). During phase two, the FASCW and its subscales demonstrated satisfactory internal consistency (alpha coefficients were FASCW [0.91], Lethargy [0.86] and Bodily Ailment [0.84]) and acceptable test-retest reliability (Pearson Correlations Coefficients: 0.59-0.68; Intraclass Correlation Coefficients: 0.74-0.80). Correlation analysis substantiated concurrent and convergent validity. A discriminant analysis demonstrated that the FASCW differentiated between groups with arthritis status and different work hours. The 10-item FASCW with good reliability and validity is an effective tool for assessing the severity of fatigue among construction workers. © 2015 Wiley Periodicals, Inc.
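Internal consistency figures like the alpha coefficients above are typically Cronbach's alpha, computed as k/(k-1) × (1 - sum of item variances / variance of total scores) for a k-item scale. The Python sketch below shows that calculation. The function name and the response matrix are hypothetical and unrelated to the FASCW data.

```python
import statistics


def cronbach_alpha(responses):
    """Cronbach's alpha for a respondents-by-items table of scores."""
    k = len(responses[0])  # number of items on the scale
    item_variances = [
        statistics.variance([row[j] for row in responses]) for j in range(k)
    ]
    total_variance = statistics.variance([sum(row) for row in responses])
    return k / (k - 1) * (1 - sum(item_variances) / total_variance)


# Hypothetical answers from four respondents to a two-item scale.
answers = [
    [1, 2],
    [2, 1],
    [3, 4],
    [4, 3],
]
print(round(cronbach_alpha(answers), 2))  # prints 0.75
```

Alpha approaches 1 as items move together across respondents, which is why it is read as evidence that the items measure one underlying construct.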
A prospective study evaluating cochlear implant management skills: development and validation of the Cochlear Implant Management Skills survey.

PubMed

Bennett, R J; Jayakody, D M P; Eikelboom, R H; Taljaard, D S; Atlas, M D

2016-02-01

To investigate the ability of cochlear implant (CI) recipients to physically handle and care for their hearing implant device(s) and to identify factors that may influence skills. To assess device management skills, a clinical survey was developed and validated on a clinical cohort of CI recipients. Survey development and validation. A prospective convenience cohort design study. Specialist hearing implant clinic. Forty-nine post-lingually deafened, adult CI recipients, at least 12 months postoperative. Survey test-retest reliability, interobserver reliability and responsiveness. Correlations between management skills and participant demographic, audiometric, clinical outcomes and device factors. The Cochlear Implant Management Skills survey was developed, demonstrating high test-retest reliability (0.878), interobserver reliability (0.972) and responsiveness to intervention (skills training) [t(20) = -3.913, P = 0.001]. Cochlear Implant Management Skills survey scores range from 54.69% to 100% (mean: 83.45%, sd: 12.47). No associations were found between handling skills and participant factors. This is the first study to demonstrate a range in cochlear implant device handling skills in CI recipients and offers clinicians and researchers a tool to systematically and objectively identify shortcomings in CI recipients' device handling skills. © 2015 John Wiley & Sons Ltd.
Gossamer-1: Mission concept and technology for a controlled deployment of gossamer spacecraft

NASA Astrophysics Data System (ADS)

Seefeldt, Patric; Spietz, Peter; Sproewitz, Tom; Grundmann, Jan Thimo; Hillebrandt, Martin; Hobbie, Catherin; Ruffer, Michael; Straubel, Marco; Tóth, Norbert; Zander, Martin

2017-01-01

Gossamer structures for innovative space applications, such as solar sails, require technology that allows their controlled and thereby safe deployment. Before employing such technology for a dedicated science mission, it is desirable, if not necessary, to demonstrate its reliability with a Technology Readiness Level (TRL) of six or higher. The aim of the work presented here is to provide reliable technology that enables the controlled deployment and verification of its functionality with various laboratory tests, thereby qualifying the hardware for a first demonstration in low Earth orbit (LEO). The development was made in the Gossamer-1 project of the German Aerospace Center (DLR). This paper provides an overview of the Gossamer-1 mission and hardware development. The system is designed based on the requirements of a technology demonstration mission. The design rests on a crossed boom configuration with triangular sail segments. Employing engineering models, all aspects of the deployment were tested under ambient environment. Several components were also subjected to environmental qualification testing. An innovative stowing and deployment strategy for a controlled deployment, as well as the designs of the bus system, mechanisms and electronics are described. The tests conducted provide insights into the deployment process and allow a mechanical characterization of that deployment process, in particular the measurement of the deployment forces. Deployment on system level could be successfully demonstrated to be robust and controllable. The deployment technology is on TRL four approaching level five, with a qualification model for environmental testing currently being built.
Personality traits in companion dogs-Results from the VIDOPET.

PubMed

Turcsán, Borbála; Wallis, Lisa; Virányi, Zsófia; Range, Friederike; Müller, Corsin A; Huber, Ludwig; Riemer, Stefanie

2018-01-01

Individual behavioural differences in pet dogs are of great interest from a basic and applied research perspective. Most existing dog personality tests have specific (practical) goals in mind and so focused only on a limited aspect of dogs' personality, such as identifying problematic (aggressive or fearful) behaviours, assessing suitability as working dogs, or improving the results of adoption. Here we aimed to create a comprehensive test of personality in pet dogs that goes beyond traditional practical evaluations by exposing pet dogs to a range of situations they might encounter in everyday life. The Vienna Dog Personality Test (VIDOPET) consists of 15 subtests and was performed on 217 pet dogs. A two-step data reduction procedure (principal component analysis on each subtest followed by an exploratory factor analysis on the subtest components) yielded five factors: Sociability-obedience, Activity-independence, Novelty seeking, Problem orientation, and Frustration tolerance. A comprehensive evaluation of reliability and validity measures demonstrated excellent inter- and intra-observer reliability and adequate internal consistency of all factors. Moreover the test showed good temporal consistency when re-testing a subsample of dogs after an average of 3.8 years-a considerably longer test-retest interval than assessed for any other dog personality test, to our knowledge. The construct validity of the test was investigated by analysing the correlations between the results of video coding and video rating methods and the owners' assessment via a dog personality questionnaire. The results demonstrated good convergent as well as discriminant validity. To conclude, the VIDOPET is not only a highly reliable and valid tool for measuring dog personality, but also the first test to show consistent behavioural traits related to problem solving ability and frustration tolerance in pet dogs.
Personality traits in companion dogs—Results from the VIDOPET

PubMed Central

Wallis, Lisa; Virányi, Zsófia; Range, Friederike; Müller, Corsin A.; Huber, Ludwig; Riemer, Stefanie

2018-01-01

Individual behavioural differences in pet dogs are of great interest from a basic and applied research perspective. Most existing dog personality tests have specific (practical) goals in mind and so focused only on a limited aspect of dogs’ personality, such as identifying problematic (aggressive or fearful) behaviours, assessing suitability as working dogs, or improving the results of adoption. Here we aimed to create a comprehensive test of personality in pet dogs that goes beyond traditional practical evaluations by exposing pet dogs to a range of situations they might encounter in everyday life. The Vienna Dog Personality Test (VIDOPET) consists of 15 subtests and was performed on 217 pet dogs. A two-step data reduction procedure (principal component analysis on each subtest followed by an exploratory factor analysis on the subtest components) yielded five factors: Sociability-obedience, Activity-independence, Novelty seeking, Problem orientation, and Frustration tolerance. A comprehensive evaluation of reliability and validity measures demonstrated excellent inter- and intra-observer reliability and adequate internal consistency of all factors. Moreover the test showed good temporal consistency when re-testing a subsample of dogs after an average of 3.8 years—a considerably longer test-retest interval than assessed for any other dog personality test, to our knowledge. The construct validity of the test was investigated by analysing the correlations between the results of video coding and video rating methods and the owners’ assessment via a dog personality questionnaire. The results demonstrated good convergent as well as discriminant validity. To conclude, the VIDOPET is not only a highly reliable and valid tool for measuring dog personality, but also the first test to show consistent behavioural traits related to problem solving ability and frustration tolerance in pet dogs. PMID:29634747
Accelerated testing of module-level power electronics for long-term reliability

DOE Office of Scientific and Technical Information (OSTI.GOV)

Flicker, Jack David; Tamizhmani, Govindasamy; Moorthy, Mathan Kumar

This work has applied a suite of long-term-reliability accelerated tests to a variety of module-level power electronics (MLPE) devices (such as microinverters and optimizers) from five different manufacturers. This dataset is one of the first (only the paper by Parker et al. entitled “Dominant factors affecting reliability of alternating current photovoltaic modules,” in Proc. 42nd IEEE Photovoltaic Spec. Conf., 2015, is reported for reliability testing in the literature), as well as the largest, experimental sets in public literature, both in the sample size (five manufacturers including both dc/dc and dc/ac units and 20 units for each test) and the number of experiments (six different experimental test conditions) for MLPE devices. The accelerated stress tests (thermal cycling test per IEC 61215 profile, damp heat test per IEC 61215 profile, and static temperature tests at 100 and 125 °C) were performed under powered and unpowered conditions. The first independent long-term experimental data regarding damp heat and grid transient testing, as well as the longest term (>9 month) testing of MLPE units reported in the literature for thermal cycling and high-temperature operating life, are included in these experiments. Additionally, this work is the first to show in situ power measurements, as well as periodic efficiency measurements over a series of experimental tests, demonstrating whether certain tests result in long-term degradation or immediate catastrophic failures. Lastly, the result of this testing highlights the performance of MLPE units under the application of several accelerated environmental stressors.
Accelerated testing of module-level power electronics for long-term reliability

DOE PAGES

Flicker, Jack David; Tamizhmani, Govindasamy; Moorthy, Mathan Kumar; ...

2016-11-10

This work has applied a suite of long-term-reliability accelerated tests to a variety of module-level power electronics (MLPE) devices (such as microinverters and optimizers) from five different manufacturers. This dataset is one of the first (only the paper by Parker et al. entitled “Dominant factors affecting reliability of alternating current photovoltaic modules,” in Proc. 42nd IEEE Photovoltaic Spec. Conf., 2015, is reported for reliability testing in the literature), as well as the largest, experimental sets in public literature, both in the sample size (five manufacturers including both dc/dc and dc/ac units and 20 units for each test) and the number of experiments (six different experimental test conditions) for MLPE devices. The accelerated stress tests (thermal cycling test per IEC 61215 profile, damp heat test per IEC 61215 profile, and static temperature tests at 100 and 125 °C) were performed under powered and unpowered conditions. The first independent long-term experimental data regarding damp heat and grid transient testing, as well as the longest term (>9 month) testing of MLPE units reported in the literature for thermal cycling and high-temperature operating life, are included in these experiments. Additionally, this work is the first to show in situ power measurements, as well as periodic efficiency measurements over a series of experimental tests, demonstrating whether certain tests result in long-term degradation or immediate catastrophic failures. Lastly, the result of this testing highlights the performance of MLPE units under the application of several accelerated environmental stressors.
Measuring first-line nurse manager work: instrument: development and testing.

PubMed

Cadmus, Edna; Wisniewska, Edyta K

2013-12-01

The objective of this study was to develop and test a 1st-line nurse manager (FLNM) work instrument to measure categories of work and frequency of activities. First-line nurse managers have been demonstrated to be key contributors in meeting organizational outcomes and patient and nurse satisfaction. Identifying the work of FLNMs is essential to help in the development of prioritization and sequence. The need for an instrument that can measure and categorize the work of FLNMs is indicated. The author-developed instrument was administered as a pilot study to 173 FLNMs in New Jersey. Descriptive statistics were analyzed, and validity and reliability were measured. Content validity was established through 2 focus groups using 10 FLNMs and conducting a survey of 5 chief nursing officers. Reliability was assessed by 13 of 16 FLNM participants using the test/retest method and quantified using percent agreement within a 10-day period. Those items with 70% agreement or more were identified as reliable and retained on the instrument. The content validity of the instrument is strong; further refinement and testing of the tool are indicated to improve the reliability and generalizability across multiple populations of leaders and settings.
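The retention rule described in this abstract (keep items with 70% test/retest agreement or more) can be sketched as follows. This is only an illustration under stated assumptions: the item names, the answer codes and the function names are hypothetical, and the abstract does not say how agreement was tallied in the study itself.

```python
def percent_agreement(test, retest):
    """Percentage of respondents who gave the same answer on both occasions."""
    if len(test) != len(retest) or not test:
        raise ValueError("need two equally long, non-empty answer lists")
    matches = sum(a == b for a, b in zip(test, retest))
    return 100.0 * matches / len(test)


def retained_items(responses, threshold=70.0):
    """Return the items whose test/retest agreement meets the threshold.

    `responses` maps an item name to a (test answers, retest answers) pair.
    The 70% default follows the cut-off quoted in the abstract.
    """
    return [
        item
        for item, (test, retest) in responses.items()
        if percent_agreement(test, retest) >= threshold
    ]


# Hypothetical answers from 13 respondents (invented for illustration only).
first_round = [1, 2, 3, 1, 2, 3, 1, 2, 3, 1, 2, 3, 1]
demo = {
    # 10 of 13 answers repeat, so this item clears the threshold.
    "staffing": (first_round, [1, 2, 3, 1, 2, 3, 1, 2, 3, 1, 3, 1, 2]),
    # 8 of 13 answers repeat, so this item falls below it.
    "budgeting": (first_round, [1, 2, 3, 1, 2, 3, 1, 2, 1, 2, 3, 1, 2]),
}
kept = retained_items(demo)
```

Percent agreement is easy to interpret but does not correct for agreement expected by chance, which is one reason chance-corrected indices are often reported alongside it.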
Satisfactory reliability among nursing students using the instrument PVC ASSESS to evaluate management of peripheral venous catheters.

PubMed

Ahlqvist, Margary; Berglund, Britta; Nordström, Gun; Klang, Birgitta; Johansson, Eva

2014-01-01

Nursing students should be given opportunities to participate in clinical audits during their education. However, audit tools are seldom tested for reliability among nursing students. The aim of this study was to present reliability among nursing students using the instrument PVC assess to assess management of peripheral venous catheters (PVCs) and PVC-related signs of thrombophlebitis. PVC assess was used to assess 67 inserted PVCs in 60 patients at ten wards at a university hospital. One group of nursing students (n=4) assessed PVCs at the bedside (inter-rater reliability) and photographs of these PVCs were taken. Another group of students (n=3) assessed the PVCs in the photographs after 4 weeks (test-retest reliability). To determine reliability, proportion of agreement [P(A)] and Cohen's kappa coefficient (κ) were calculated. For bedside assessment of PVCs, P(A) ranged from good to excellent (0.80-1.0) in 55% of the 26 PVC assess items that were tested. P(A) was poor (<0.70) for two items: "adherence of inner dressing to the skin" and "PVC location." In 81% of the items, κ was between moderate and almost perfect: moderate (n=5), substantial (n=3), almost perfect (n=5). For edema at insertion site and two items on PVC dressing, κ was fair (0.21-0.40). Regarding test-retest reliability, P(A) varied between good and excellent (0.81-1) in 85%-95% of the items, and the κ ranged between moderate and almost perfect (0.41-1) in 90%-95%. PVC assess demonstrated satisfactory reliability among nursing students. However, students need training in how to use the instrument before assessing PVCs.
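The two indices named in this abstract, proportion of agreement P(A) and Cohen's kappa (κ), can be computed for two raters as in the minimal sketch below. The ratings are invented for illustration and the function names are hypothetical. The study involved more than two student raters, and the abstract does not say how their ratings were paired, so this shows the textbook two-rater definitions only.

```python
from collections import Counter


def proportion_agreement(rater_1, rater_2):
    """P(A): share of cases on which the two raters give the same rating."""
    if len(rater_1) != len(rater_2) or not rater_1:
        raise ValueError("need two equally long, non-empty rating lists")
    return sum(a == b for a, b in zip(rater_1, rater_2)) / len(rater_1)


def cohens_kappa(rater_1, rater_2):
    """Cohen's kappa: observed agreement corrected for chance agreement."""
    n = len(rater_1)
    p_observed = proportion_agreement(rater_1, rater_2)
    counts_1, counts_2 = Counter(rater_1), Counter(rater_2)
    # Chance agreement from the product of each rater's marginal proportions.
    p_expected = sum(
        counts_1[c] * counts_2[c] for c in counts_1.keys() | counts_2.keys()
    ) / n**2
    if p_expected == 1.0:
        raise ValueError("kappa is undefined when chance agreement is 1")
    return (p_observed - p_expected) / (1.0 - p_expected)


# Hypothetical ratings of ten catheters on one yes/no item.
rater_a = ["yes"] * 5 + ["no"] * 5
rater_b = ["yes", "yes", "yes", "yes", "no", "no", "no", "no", "no", "yes"]

p_a = proportion_agreement(rater_a, rater_b)
kappa = cohens_kappa(rater_a, rater_b)
```

In this invented example the raters agree on 8 of 10 cases, yet kappa is noticeably lower than P(A) because half of that agreement would be expected by chance. The same effect can make an item look good on P(A) and only fair on κ.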
The reliability of a maximal isometric hip strength and simultaneous surface EMG screening protocol in elite, junior rugby league athletes.

PubMed

Charlton, Paula C; Mentiplay, Benjamin F; Grimaldi, Alison; Pua, Yong-Hao; Clark, Ross A

2017-02-01

Firstly, to describe the reliability of assessing maximal isometric strength of the hip abductor and adductor musculature using a hand-held dynamometry (HHD) protocol with simultaneous wireless surface electromyographic (sEMG) evaluation of the gluteus medius (GM) and adductor longus (AL). Secondly, to describe the correlation between isometric strength recorded with the HHD protocol and a laboratory standard isokinetic device. Reliability and correlational study. A sample of 24 elite, male, junior, rugby league athletes, age 16-20 years participated in repeated HHD and isometric Kin-Com (KC) strength testing with simultaneous sEMG assessment, on average (range) 6 (5-7) days apart by a single assessor. Strength tests included: unilateral hip abduction (ABD) and adduction (ADD) and bilateral ADD assessed with squeeze (SQ) tests in 0 and 45° of hip flexion. HHD demonstrated good to excellent inter-session reliability for all outcome measures (ICC (2,1) =0.76-0.91) and good to excellent association with the laboratory reference KC (ICC (2,1) =0.80-0.88). Whilst intra-session, inter-trial reliability of EMG activation and co-activation outcome measures ranged from moderate to excellent (ICC (2,1) =0.70-0.94), inter-session reliability was poor (all ICC (2,1) <0.50). Isometric strength testing of the hip ABD and ADD musculature using HHD may be measured reliably in elite, junior rugby league athletes. Due to the poor inter-session reliability of sEMG measures, it is not recommended for athlete screening purposes if using the techniques implemented in this study. Copyright © 2016 Sports Medicine Australia. Published by Elsevier Ltd. All rights reserved.
Assessing Perceptions AbouT Hazardous Substances (PATHS): The PATHS questionnaire

PubMed Central

Amlôt, Richard; Page, Lisa; Pearce, Julia; Wessely, Simon

2013-01-01

How people perceive the nature of a hazardous substance may determine how they respond when potentially exposed to it. We tested a new Perceptions AbouT Hazardous Substances (PATHS) questionnaire. In Study 1 (N = 21), we assessed the face validity of items concerning perceptions about eight properties of a hazardous substance. In Study 2 (N = 2030), we tested the factor structure, reliability and validity of the PATHS questionnaire across four qualitatively different substances. In Study 3 (N = 760), we tested the impact of information provision on Perceptions AbouT Hazardous Substances scores. Our results showed that our eight measures demonstrated good reliability and validity when used for non-contagious hazards. PMID:23104995
[Difference analysis among majors in medical parasitology exam papers by test item bank proposition].

PubMed

Jia, Lin-Zhi; Ya-Jun, Ma; Cao, Yi; Qian, Fen; Li, Xiang-Yu

2012-04-30

The quality indices of "Medical Parasitology" exam papers and the measured data for students in three majors at the university in 2010 were compared and analyzed. The exam papers were assembled from the test item bank. The alpha reliability coefficients of the three exam papers were all above 0.70. The knowledge structure and capacity structure of the exam papers were basically balanced. However, the alpha reliability coefficient for the second major was the lowest, mainly owing to the quality of the test items in that exam paper and the failure to revise the item bank index in time. This observation demonstrated that revising the test items and their index in the item bank according to the measured data can improve the quality of test item bank proposition and reduce the difference among exam papers.
Validation of the FASH (Functional Assessment Scale for Acute Hamstring Injuries) questionnaire for German-speaking football players.

PubMed

Lohrer, Heinz; Nauck, Tanja; Korakakis, Vasileios; Malliaropoulos, Nikos

2016-10-24

The FASH (Functional Assessment Scale for Acute Hamstring Injuries) questionnaire has been recently developed as a disease-specific self-administered questionnaire for use in Greek, English, and German languages. Its psychometric qualities (validity and reliability) were tested only in Greek-speaking patients mainly representing track and field athletes. As hamstring injuries represent the most common football injury, we tested the validity and reliability of the FASH-G (G = German version) questionnaire in German-speaking footballers suffering from acute hamstring injuries. The FASH-G questionnaire was tested for reliability and validity, in 16 footballers with hamstring injuries (patients' group), 77 asymptomatic footballers (healthy group), and 19 field hockey players (at-risk group). Known-group validity was tested by comparing the total FASH-G scores of the injured and non-injured groups. Reliability of the FASH-G questionnaire was analysed in 18 asymptomatic footballers using the intra-class coefficient. Known-group validity was demonstrated by significant differences between injured and non-injured participants (p < 0.001). The FASH-G exhibited very good test-retest reliability (intra-class correlation coefficient = 0.982, p < 0.001). Internal consistency was excellent (α = 0.938). Compared with the results presented in the original publication, no statistical differences were found between healthy athletes (p = 0.257), but patients' groups and at-risk groups presented scoring differences (p = 0.040 and <0.001, respectively). The FASH-G is a valid and reliable instrument to assess and determine the severity of hamstring injuries in German footballers.
An Examination of the True Reliability of Lower Limb Stiffness Measures During Overground Hopping.

PubMed

Diggin, David; Anderson, Ross; Harrison, Andrew J

2016-06-01

Evidence suggests reports describing the reliability of leg-spring (kleg) and joint stiffness (kjoint) measures are contaminated by artifacts originating from digital filtering procedures. In addition, the intraday reliability of kleg and kjoint requires investigation. This study examined the effects of experimental procedures on the inter- and intraday reliability of kleg and kjoint. Thirty-two participants completed 2 trials of single-legged hopping at 1.5, 2.2, and 3.0 Hz at the same time of day across 3 days. On the final test day a fourth experimental bout took place 6 hours before or after participants' typical testing time. Kinematic and kinetic data were collected throughout. Stiffness was calculated using models of kleg and kjoint. Classifications of measurement agreement were established using thresholds for absolute and relative reliability statistics. Results illustrated that kleg and kankle exhibited strong agreement. In contrast, kknee and khip demonstrated weak-to-moderate consistency. Results suggest limits in kjoint reliability persist despite employment of appropriate filtering procedures. Furthermore, diurnal fluctuations in lower-limb muscle-tendon stiffness exhibit little effect on intraday reliability. The present findings support the existence of kleg as an attractor state during hopping, achieved through fluctuations in kjoint variables. Limits to kjoint reliability appear to represent biological function rather than measurement artifact.
Ovarian and cervical cancer awareness: development of two validated measurement tools.

PubMed

Simon, Alice E; Wardle, Jane; Grimmett, Chloe; Power, Emily; Corker, Elizabeth; Menon, Usha; Matheson, Lauren; Waller, Jo

2012-07-01

The aim of the study was to develop and validate measures of awareness of symptoms and risk factors for ovarian and cervical cancer (Ovarian and Cervical Cancer Awareness Measures). Potentially relevant items were extracted from the literature and generated by experts. Four validation studies were carried out to establish reliability and validity. Women aged 21-67 years (n=146) and ovarian and cervical cancer experts (n=32) were included in the studies. Internal reliability was assessed psychometrically. Test-retest reliability was assessed over a 1-week interval. To establish construct validity, Cancer Awareness Measure (CAM) scores of cancer experts were compared with equally well-educated comparison groups. Sensitivity to change was tested by randomly assigning participants to read either a leaflet giving information about ovarian/cervical cancer or a leaflet with control information, and then completing the ovarian/cervical CAM. Internal reliability (Cronbach's α=0.88 for the ovarian CAM and α=0.84 for the cervical CAM) and test-retest reliability (r=0.84 and r=0.77 for the ovarian and cervical CAMs, respectively) were both high. Validity was demonstrated with cancer experts achieving higher scores than controls [ovarian CAM: t(36)= -5.6, p<0.001; cervical CAM: t(38)= -3.7, p=0.001], and volunteers who were randomised to read a cancer leaflet scored higher than those who received a control leaflet [ovarian CAM: t(49)=7.5, p<0.001; cervical CAM: t(48)= -5.5, p<0.001]. This study demonstrates the psychometric properties of the ovarian and cervical CAMs and supports their utility in assessing ovarian and cervical cancer awareness in the general population.
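Internal reliability in this abstract is reported as Cronbach's α. As a minimal sketch of how that coefficient is usually computed from item-level scores (the function name and the score matrix are hypothetical, and nothing here reproduces the study's data or software):

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondents, each a list of k item scores.

    alpha = k / (k - 1) * (1 - sum of item variances / variance of total scores)
    Sample variances (n - 1 denominator) are used throughout.
    """
    if len(scores) < 2:
        raise ValueError("need at least two respondents")
    k = len(scores[0])
    if k < 2 or any(len(row) != k for row in scores):
        raise ValueError("every respondent needs the same k >= 2 item scores")
    item_variances = [variance([row[i] for row in scores]) for i in range(k)]
    total_variance = variance([sum(row) for row in scores])
    if total_variance == 0:
        raise ValueError("alpha is undefined when total scores do not vary")
    return k / (k - 1) * (1.0 - sum(item_variances) / total_variance)


# Hypothetical scores: four respondents answering three items.
demo_scores = [
    [2, 3, 3],
    [4, 4, 5],
    [1, 2, 1],
    [5, 4, 5],
]
alpha = cronbach_alpha(demo_scores)
```

Alpha rises when items vary together across respondents, so it reflects how consistently the items rank people, not whether the scale measures the intended construct. That is why the abstract reports validity separately.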

Ovarian and cervical cancer awareness: development of two validated measurement tools

PubMed Central

Simon, Alice E; Wardle, Jane; Grimmett, Chloe; Power, Emily; Corker, Elizabeth; Menon, Usha; Matheson, Lauren; Waller, Jo

2012-01-01

Background The aim of the study was to develop and validate measures of awareness of symptoms and risk factors for ovarian and cervical cancer (Ovarian and Cervical Cancer Awareness Measures). Methods Potentially relevant items were extracted from the literature and generated by experts. Four validation studies were carried out to establish reliability and validity. Women aged 21–67 years (n=146) and ovarian and cervical cancer experts (n=32) were included in the studies. Internal reliability was assessed psychometrically. Test-retest reliability was assessed over a 1-week interval. To establish construct validity, Cancer Awareness Measure (CAM) scores of cancer experts were compared with equally well-educated comparison groups. Sensitivity to change was tested by randomly assigning participants to read either a leaflet giving information about ovarian/cervical cancer or a leaflet with control information, and then completing the ovarian/cervical CAM. Results Internal reliability (Cronbach's α=0.88 for the ovarian CAM and α=0.84 for the cervical CAM) and test-retest reliability (r=0.84 and r=0.77 for the ovarian and cervical CAMs, respectively) were both high. Validity was demonstrated with cancer experts achieving higher scores than controls [ovarian CAM: t(36)= –5.6, p<0.001; cervical CAM: t(38)= –3.7, p=0.001], and volunteers who were randomised to read a cancer leaflet scored higher than those who received a control leaflet [ovarian CAM: t(49)=7.5, p<0.001; cervical CAM: t(48)= –5.5, p<0.001]. Conclusions This study demonstrates the psychometric properties of the ovarian and cervical CAMs and supports their utility in assessing ovarian and cervical cancer awareness in the general population. PMID:21933805
Measuring cognitive change with ImPACT: the aggregate baseline approach.

PubMed

Bruce, Jared M; Echemendia, Ruben J; Meeuwisse, Willem; Hutchison, Michael G; Aubry, Mark; Comper, Paul

2017-11-01

The Immediate Post-Concussion Assessment and Cognitive Test (ImPACT) is commonly used to assess baseline and post-injury cognition among athletes in North America. Despite this, several studies have questioned the reliability of ImPACT when given at intervals employed in clinical practice. Poor test-retest reliability reduces test sensitivity to cognitive decline, increasing the likelihood that concussed athletes will be returned to play prematurely. We recently showed that the reliability of ImPACT can be increased when using a new composite structure and the aggregate of two baselines to predict subsequent performance. The purpose of the present study was to confirm our previous findings and determine whether the addition of a third baseline would further increase the test-retest reliability of ImPACT. Data from 97 English speaking professional hockey players who had received at least 4 ImPACT baseline evaluations were extracted from a National Hockey League Concussion Program database. Linear regression was used to determine whether each of the first three testing sessions accounted for unique variance in the fourth testing session. Results confirmed that the aggregate baseline approach improves the psychometric properties of ImPACT, with most indices demonstrating adequate or better test-retest reliability for clinical use. The aggregate baseline approach provides a modest clinical benefit when recent baselines are available - and a more substantial benefit when compared to approaches that obtain baseline measures only once during the course of a multi-year playing career. Pending confirmation in diverse samples, neuropsychologists are encouraged to use the aggregate baseline approach to best quantify cognitive change following sports concussion.
Evaluating Random Error in Clinician-Administered Surveys: Theoretical Considerations and Clinical Applications of Interobserver Reliability and Agreement.

PubMed

Bennett, Rebecca J; Taljaard, Dunay S; Olaithe, Michelle; Brennan-Jones, Chris; Eikelboom, Robert H

2017-09-18

The purpose of this study is to raise awareness of interobserver concordance and the differences between interobserver reliability and agreement when evaluating the responsiveness of a clinician-administered survey and, specifically, to demonstrate the clinical implications of data types (nominal/categorical, ordinal, interval, or ratio) and statistical index selection (for example, Cohen's kappa, Krippendorff's alpha, or interclass correlation). In this prospective cohort study, 3 clinical audiologists, who were masked to each other's scores, administered the Practical Hearing Aid Skills Test-Revised to 18 adult owners of hearing aids. Interobserver concordance was examined using a range of reliability and agreement statistical indices. The importance of selecting statistical measures of concordance was demonstrated with a worked example, wherein the level of interobserver concordance achieved varied from "no agreement" to "almost perfect agreement" depending on data types and statistical index selected. This study demonstrates that the methodology used to evaluate survey score concordance can influence the statistical results obtained and thus affect clinical interpretations.
Reliability and validity of the Arabic version of the computerized Battery for Neuropsychological Evaluation of Children (BENCI).

PubMed

Fasfous, Ahmed F; Peralta-Ramirez, Maria Isabel; Pérez-Marfil, María Nieves; Cruz-Quintana, Francisco; Catena-Martinez, Andrés; Pérez-García, Miguel

2015-01-01

Batería de Evaluación Neuropsicológica Infantil (BENCI) is a computerized battery for the neuropsychological evaluation of children. This battery has been used in different studies to evaluate neuropsychological functions and neurodevelopment in children. The objective of this study is to test the validity and reliability of the first Arabic version of the BENCI in an Arabic population, where neuropsychological tests are very scarce. We administered the BENCI to 198 school-age children (98 boys and 100 girls) from Morocco. To examine the test-retest reliability of the BENCI battery, we administered the battery twice to 43 children (23 boys and 20 girls), with 15 days between the pre- and posttest. The results revealed good validity and reliability of the battery in Arabic children. Also, the BENCI battery has demonstrated the capacity to differentiate between children by their age group. This battery can be of great use to both the research and clinical areas of Arabic countries and/or in assistance to Arabic immigrants that live outside of their native country.
Aiming for excellence - A simulation-based study on adapting and testing an instrument for developing non-technical skills in Norwegian student nurse anaesthetists.

PubMed

Flynn, Fiona M; Sandaker, Kjersti; Ballangrud, Randi

2017-01-01

There is increasing focus on building safety into anaesthesia practice, with excellence in anaesthesia as an aspirational goal. Non-technical skills are an important factor in excellence and improved patient safety, though there have been few systematic attempts at integrating them into anaesthesia nursing education. This study aimed to test the reliability of NANTS-no, a specially adapted behavioural marker system for nurse anaesthetists in Norway, and explore the development of non-technical skills in student nurse anaesthetists. The pre-test post-test design incorporated a 10-week simulation-based programme, where non-technical skills in 14 student nurse anaesthetists were rated on three different occasions during high-fidelity simulation, before and after taking part in a training course. NANTS-no demonstrated high overall inter-rater reliability (ICC = 0.91), high test-retest reliability (ICC = 0.94) and good internal consistency (Cronbach's α of 0.85-0.92). A significant improvement was demonstrated across all categories of non-technical skills, with greatest improvements between the first and third and second and third sessions. There was also a significant improvement in two categories between the first and second sessions. NANTS-no is therefore suitable for assessing non-technical skills during simulation training in anaesthesia nursing education. More research is needed to validate its use in clinical practice. Copyright © 2016 Elsevier Ltd. All rights reserved.
Time-dependent reliability analysis of ceramic engine components

NASA Technical Reports Server (NTRS)

Nemeth, Noel N.

1993-01-01

The computer program CARES/LIFE calculates the time-dependent reliability of monolithic ceramic components subjected to thermomechanical and/or proof test loading. This program is an extension of the CARES (Ceramics Analysis and Reliability Evaluation of Structures) computer program. CARES/LIFE accounts for the phenomenon of subcritical crack growth (SCG) by utilizing either the power or Paris law relations. The two-parameter Weibull cumulative distribution function is used to characterize the variation in component strength. The effects of multiaxial stresses are modeled using either the principle of independent action (PIA), the Weibull normal stress averaging method (NSA), or the Batdorf theory. Inert strength and fatigue parameters are estimated from rupture strength data of naturally flawed specimens loaded in static, dynamic, or cyclic fatigue. Two example problems demonstrating proof testing and fatigue parameter estimation are given.
The design organization test: further demonstration of reliability and validity as a brief measure of visuospatial ability.

PubMed

Killgore, William D S; Gogel, Hannah

2014-01-01

Neuropsychological assessments are frequently time-consuming and fatiguing for patients. Brief screening evaluations may reduce test duration and allow more efficient use of time by permitting greater attention toward neuropsychological domains showing probable deficits. The Design Organization Test (DOT) was initially developed as a 2-min paper-and-pencil alternative for the Block Design (BD) subtest of the Wechsler scales. Although initially validated for clinical neurologic patients, we sought to further establish the reliability and validity of this test in a healthy, more diverse population. Two alternate versions of the DOT and the Wechsler Abbreviated Scale of Intelligence (WASI) were administered to 61 healthy adult participants. The DOT showed high alternate forms reliability (r = .90-.92), and the two versions yielded equivalent levels of performance. The DOT was highly correlated with BD (r = .76-.79) and was significantly correlated with all subscales of the WASI. The DOT proved useful when used in lieu of BD in the calculation of WASI IQ scores. Findings support the reliability and validity of the DOT as a measure of visuospatial ability and suggest its potential worth as an efficient estimate of intellectual functioning in situations where lengthier tests may be inappropriate or unfeasible.
Resting-state fMRI correlations: From link-wise unreliability to whole brain stability.

PubMed

Pannunzi, Mario; Hindriks, Rikkert; Bettinardi, Ruggero G; Wenger, Elisabeth; Lisofsky, Nina; Martensson, Johan; Butler, Oisin; Filevich, Elisa; Becker, Maxi; Lochstet, Martyna; Kühn, Simone; Deco, Gustavo

2017-08-15

The functional architecture of spontaneous BOLD fluctuations has been characterized in detail by numerous studies, demonstrating its potential relevance as a biomarker. However, the systematic investigation of its consistency is still in its infancy. Here, we analyze within- and between-subject variability and test-retest reliability of resting-state functional connectivity (FC) in a unique data set comprising multiple fMRI scans (42) from 5 subjects, and 50 single scans from 50 subjects. We adopt a statistical framework that enables us to identify different sources of variability in FC. We show that the low reliability of single links can be significantly improved by using multiple scans per subject. Moreover, in contrast to earlier studies, we show that spatial heterogeneity in FC reliability is not significant. Finally, we demonstrate that despite the low reliability of individual links, the information carried by the whole-brain FC matrix is robust and can be used as a functional fingerprint to identify individual subjects from the population. Copyright © 2017 Elsevier Inc. All rights reserved.
Measuring the Characteristic Topography of Brain Stiffness with Magnetic Resonance Elastography

PubMed Central

Murphy, Matthew C.; Huston, John; Jack, Clifford R.; Glaser, Kevin J.; Senjem, Matthew L.; Chen, Jun; Manduca, Armando; Felmlee, Joel P.; Ehman, Richard L.

2013-01-01

Purpose To develop a reliable magnetic resonance elastography (MRE)-based method for measuring regional brain stiffness. Methods First, simulation studies were used to demonstrate how stiffness measurements can be biased by changes in brain morphometry, such as those due to atrophy. Adaptive postprocessing methods were created that significantly reduce the spatial extent of edge artifacts and eliminate atrophy-related bias. Second, a pipeline for regional brain stiffness measurement was developed and evaluated for test-retest reliability in 10 healthy control subjects. Results This technique indicates high test-retest repeatability with a typical coefficient of variation of less than 1% for global brain stiffness and less than 2% for the lobes of the brain and the cerebellum. Furthermore, this study reveals that the brain possesses a characteristic topography of mechanical properties, and also that lobar stiffness measurements tend to correlate with one another within an individual. Conclusion The methods presented in this work are resistant to noise- and edge-related biases that are common in the field of brain MRE, demonstrate high test-retest reliability, and provide independent regional stiffness measurements. This pipeline will allow future investigations to measure changes to the brain’s mechanical properties and how they relate to the characteristic topographies that are typical of many neurologic diseases. PMID:24312570
Psychometric properties of the Interpersonal Relationship Inventory-Short Form for active duty female service members.

PubMed

Nayback-Beebe, Ann M; Yoder, Linda H

2011-06-01

The Interpersonal Relationship Inventory-Short Form (IPRI-SF) has demonstrated psychometric consistency across several demographic and clinical populations; however, it has not been psychometrically tested in a military population. The purpose of this study was to psychometrically evaluate the reliability and component structure of the IPRI-SF in active duty United States Army female service members (FSMs). The reliability estimates were .93 for the social support subscale and .91 for the conflict subscale. Principal component analysis demonstrated an obliquely rotated three-component solution that accounted for 58.9% of the variance. The results of this study support the reliability and validity of the IPRI-SF for use in FSMs; however, a three-factor structure emerged in this sample of FSMs post-deployment that represents "cultural context." Copyright © 2011 Wiley Periodicals, Inc.
A menu of self-administered microcomputer-based neurotoxicology tests

NASA Technical Reports Server (NTRS)

Kennedy, Robert S.; Wilkes, Robert L.; Kuntz, Lois-Ann; Baltzley, Dennis R.

1988-01-01

This study examined the feasibility of repeated self-administration of a newly developed battery of mental acuity tests. Researchers developed this battery to be used to screen the fitness for duty of persons in at-risk occupations (astronauts, race car drivers), or those who may be exposed to environmental stress, toxic agents, or disease. The menu under study contained cognitive and motor tests implemented on a portable microcomputer including: a five-test core battery, lasting six minutes, which had demonstrable reliabilities and stability from several previous repeated-measures studies, and also 13 new tests, lasting 42 minutes, which had appeared in other batteries but had not yet been evaluated for repeated-measures implementation in this medium. Sixteen subjects self-administered the battery over 10 repeated sessions. The hardware performed well throughout the study and the tests appeared to be easily self-administered. Stabilities and reliabilities of the test from the core battery were comparable to those obtained previously under more controlled experimental conditions. Analyses of metric properties of the remaining 13 tests produced eight additional tests with satisfactory properties. Although the average retest reliability was high, cross-correlations between tests were low, indicating factorial richness. The menu can be used to form batteries of flexible total testing time which are likely to tap different mental processes and functions.
Forward Skirt Structural Testing on the Space Launch System (SLS) Program

NASA Technical Reports Server (NTRS)

Lohrer, J. D.; Wright, R. D.

2016-01-01

Structural testing was performed to evaluate heritage forward skirts from the Space Shuttle program for use on the NASA Space Launch System (SLS) program. Testing was needed because SLS ascent loads are 35% higher than Space Shuttle loads. Objectives of testing were to determine margins of safety, demonstrate reliability, and validate analytical models. Testing combined with analysis was able to show heritage forward skirts were acceptable to use on the SLS program.
The Adult Reading History Questionnaire (ARHQ) in Icelandic: Psychometric Properties and Factor Structure

ERIC Educational Resources Information Center

Bjornsdottir, Gyda; Halldorsson, Jonas G.; Steinberg, Stacy; Hansdottir, Ingunn; Kristjansson, Kristleifur; Stefansson, Hreinn; Stefansson, Kari

2014-01-01

This article describes psychometric testing of an Icelandic adaptation of the "Adult Reading History Questionnaire" (ARHQ), designed to detect a history of reading difficulties indicative of dyslexia. Tested in a large and diverse sample of 2,187 adults, the Icelandic adaptation demonstrated internal consistency reliability (Cronbach's…
Mississippi Scale for Combat-Related Posttraumatic Stress Disorder: Three Studies in Reliability and Validity.

ERIC Educational Resources Information Center

Keane, Terence M.; And Others

1988-01-01

Explored the psychometric properties of the Mississippi Scale for Combat-Related Posttraumatic Stress Disorder to assess its internal consistency and factor structure. Administered the test to Vietnam veterans seeking help at Veteran Centers. Demonstrated high test-retest reliability, sensitivity of .93, specificity .89, and overall hit rate .90…
Development and Testing of a High Stability Engine Control (HISTEC) System

NASA Technical Reports Server (NTRS)

Orme, John S.; DeLaat, John C.; Southwick, Robert D.; Gallops, George W.; Doane, Paul M.

1998-01-01

Flight tests were recently completed to demonstrate an inlet-distortion-tolerant engine control system. These flight tests were part of NASA's High Stability Engine Control (HISTEC) program. The objective of the HISTEC program was to design, develop, and flight demonstrate an advanced integrated engine control system that uses measurement-based, real-time estimates of inlet airflow distortion to enhance engine stability. With improved stability and tolerance of inlet airflow distortion, future engine designs may benefit from a reduction in design stall-margin requirements and enhanced reliability, with a corresponding increase in performance and decrease in fuel consumption. This paper describes the HISTEC methodology, presents an aircraft test bed description (including HISTEC-specific modifications) and verification and validation ground tests. Additionally, flight test safety considerations, test plan and technique design and approach, and flight operations are addressed. Some illustrative results are presented to demonstrate the type of analysis and results produced from the flight test program.
Improvement of reliability in multi-interferometer-based counterfactual deterministic communication with dissipation compensation.

PubMed

Liu, Chao; Liu, Jinhong; Zhang, Junxiang; Zhu, Shiyao

2018-02-05

Direct counterfactual quantum communication (DCQC) is a surprising phenomenon in which quantum information can be transmitted without using any physical particles as carriers. Nested interferometers are promising devices for realizing DCQC, provided the number of interferometers goes to infinity. Considering the inevitable loss or dissipation in practical experimental interferometers, we analyze the dependence of reliability on the number of interferometers, and show that the reliability of direct communication degrades rapidly as the number of interferometers becomes large. Furthermore, we simulate and test this counterfactual deterministic communication protocol with a finite number of interferometers, and demonstrate the improvement of the reliability using dissipation compensation in interferometers.
Reliability and commercialization of oxidized VCSEL

NASA Astrophysics Data System (ADS)

Li, Alice; Pan, Jin-Shan; Lai, Horng-Ching; Lee, Bor-Lin; Wu, Jack; Lin, Yung-Sen; Huo, Tai-Chan; Wu, Calvin; Huang, Kai-Feng

2003-06-01

The reliability of oxidized VCSELs is similar to that of implanted VCSELs. This paper presents our reliability data for oxidized VCSEL devices and a comparison with implanted VCSELs. The MTTF of the oxidized VCSEL is 2.73 × 10^6 hrs at 55°C, 6 mA, with a failure rate of ~1 FIT for the first 2 years of operation. Reliability data for the oxidized VCSEL, including activation energy, MTTF (mean time to failure), failure rate prediction, and 85°C / 85% humidity test results, are presented. Commercialization of the oxidized VCSEL is demonstrated, covering VCSEL structure, manufacturing facility, and packaging. A cost-effective approach is key to its success in applications such as Datacomm.
Reliability and validity of the Japanese version of the Resilience Scale and its short version.

PubMed

Nishi, Daisuke; Uehara, Ritei; Kondo, Maki; Matsuoka, Yutaka

2010-11-17

The clinical relevance of resilience has received considerable attention in recent years. The aim of this study is to demonstrate the reliability and validity of the Japanese version of the Resilience Scale (RS) and short version of the RS (RS-14). The original English version of RS was translated to Japanese and the Japanese version was confirmed by back-translation. Participants were 430 nursing and university psychology students. The RS, Center for Epidemiologic Studies Depression Scale (CES-D), Rosenberg Self-Esteem Scale (RSES), Social Support Questionnaire (SSQ), Perceived Stress Scale (PSS), and Sheehan Disability Scale (SDS) were administered. Internal consistency, convergent validity and factor loadings were assessed at initial assessment. Test-retest reliability was assessed using data collected from 107 students at 3 months after baseline. Mean score on the RS was 111.19. Cronbach's alpha coefficients for the RS and RS-14 were 0.90 and 0.88, respectively. The test-retest correlation coefficients for the RS and RS-14 were 0.83 and 0.84, respectively. Both the RS and RS-14 were negatively correlated with the CES-D and SDS, and positively correlated with the RSES, SSQ and PSS (all p < 0.05), although the correlation between the RS and CES-D was somewhat lower than that in previous studies. Factor analyses indicated a one-factor solution for RS-14, but as for RS, the result was not consistent with previous studies. This study demonstrates that the Japanese version of RS has psychometric properties with high degrees of internal consistency, high test-retest reliability, and relatively low concurrent validity. RS-14 was equivalent to the RS in internal consistency, test-retest reliability, and concurrent validity. 
Low scores on the RS, a positive correlation between the RS and perceived stress, and a relatively low correlation between the RS and depressive symptoms in this study suggest that validity of the Japanese version of the RS might be relatively low compared with the original English version.
Optical interconnection and packaging technologies for advanced avionics systems

NASA Astrophysics Data System (ADS)

Schroeder, J. E.; Christian, N. L.; Cotti, B.

1992-09-01

An optical backplane, developed to demonstrate the advantages of high-performance optical interconnections and supporting technologies and designed to be compatible with standard avionics racks, is described. The hardware demonstrates the three basic components of optical interconnects: optical sources, an optical signal distribution network, and optical receivers. Results from characterization and environmental tests, including a demonstration of the reliable transmission of serial data at 1 Gb/s, are reported.
Demonstration of Passive Fuel Cell Thermal Management Technology

NASA Technical Reports Server (NTRS)

Burke, Kenneth A.; Jakupca, Ian; Colozza, Anthony; Wynne, Robert; Miller, Michael; Meyer, Al; Smith, William

2012-01-01

The NASA Glenn Research Center is developing advanced passive thermal management technology to reduce the mass and improve the reliability of space fuel cell systems for the NASA Exploration program. The passive thermal management system relies on heat conduction within highly thermally conductive cooling plates to move the heat from the central portion of the cell stack out to the edges of the fuel cell stack. Using the passive approach eliminates the need for a coolant pump and other cooling loop components within the fuel cell system which reduces mass and improves overall system reliability. Previous development demonstrated the performance of suitable highly thermally conductive cooling plates and integrated heat exchanger technology to collect the heat from the cooling plates (Ref. 1). The next step in the development of this passive thermal approach was the demonstration of the control of the heat removal process and the demonstration of the passive thermal control technology in actual fuel cell stacks. Tests were run with a simulated fuel cell stack passive thermal management system outfitted with passive cooling plates, an integrated heat exchanger and two types of cooling flow control valves. The tests were run to demonstrate the controllability of the passive thermal control approach. Finally, successful demonstrations of passive thermal control technology were conducted with fuel cell stacks from two fuel cell stack vendors.

Development and validation of the Perceived Food Environment Questionnaire in a French-Canadian population.

PubMed

Carbonneau, Elise; Robitaille, Julie; Lamarche, Benoît; Corneau, Louise; Lemieux, Simone

2017-08-01

The present study aimed to develop and validate a questionnaire assessing perceived food environment in a French-Canadian population. A questionnaire, the Perceived Food Environment Questionnaire, was developed assessing perceived accessibility to healthy (nine items) and unhealthy foods (three items). A pre-test sample was recruited for a pilot testing of the questionnaire. For the validation study, another sample was recruited and completed the questionnaire twice. Exploratory factor analysis was performed on the items to assess the number of factors (subscales). Cronbach's α was used to measure internal consistency reliability. Test-retest reliability was assessed with Pearson correlations. Online survey. Men and women from the Québec City area (n 31 in the pre-test sample; n 150 in the validation study sample). The pilot testing did not lead to any change in the questionnaire. The exploratory factor analysis revealed a two-subscale structure. The first subscale is composed of six items assessing accessibility to healthy foods and the second includes three items related to accessibility to unhealthy foods. Three items were removed from the questionnaire due to low loading on the two subscales. The subscales demonstrated adequate internal consistency (Cronbach's α=0·77 for healthy foods and 0·62 for unhealthy foods) and test-retest reliability (r=0·59 and 0·60, respectively; both P<0·0001). The Perceived Food Environment Questionnaire was developed for a French-Canadian population and demonstrated good psychometric properties. Further validation is recommended if the questionnaire is to be used in other populations.
Cardiopulmonary exercise testing early after stroke using feedback-controlled robotics-assisted treadmill exercise: test-retest reliability and repeatability.

PubMed

Stoller, Oliver; de Bruin, Eling D; Schindelholz, Matthias; Schuster-Amft, Corina; de Bie, Rob A; Hunt, Kenneth J

2014-10-11

Exercise capacity is seriously reduced after stroke. While cardiopulmonary assessment and intervention strategies have been validated for the mildly and moderately impaired populations post-stroke, there is a lack of effective concepts for stroke survivors suffering from severe motor limitations. This study investigated the test-retest reliability and repeatability of cardiopulmonary exercise testing (CPET) using feedback-controlled robotics-assisted treadmill exercise (FC-RATE) in severely motor impaired individuals early after stroke. 20 subjects (age 44-84 years, <6 month post-stroke) with severe motor limitations (Functional Ambulatory Classification 0-2) were selected for consecutive constant load testing (CLT) and incremental exercise testing (IET) within a powered exoskeleton, synchronised with a treadmill and a body weight support system. A manual human-in-the-loop feedback system was used to guide individual work rate levels. Outcome variables focussed on standard cardiopulmonary performance parameters. Relative and absolute test-retest reliability were assessed by intraclass correlation coefficients (ICC), standard error of the measurement (SEM), and minimal detectable change (MDC). Mean difference, limits of agreement, and coefficient of variation (CoV) were estimated to assess repeatability. Peak performance parameters during IET yielded good to excellent relative reliability: absolute peak oxygen uptake (ICC =0.82), relative peak oxygen uptake (ICC =0.72), peak work rate (ICC =0.91), peak heart rate (ICC =0.80), absolute gas exchange threshold (ICC =0.91), relative gas exchange threshold (ICC =0.88), oxygen cost of work (ICC =0.87), oxygen pulse at peak oxygen uptake (ICC =0.92), ventilation rate versus carbon dioxide output slope (ICC =0.78). For these variables, SEM was 4-13%, MDC 12-36%, and CoV 0.10-0.36. CLT revealed high mean differences and insufficient test-retest reliability for all variables studied. 
This study presents first evidence on reliability and repeatability for CPET in severely motor impaired individuals early after stroke using a feedback-controlled robotics-assisted treadmill. The results demonstrate good to excellent test-retest reliability and appropriate repeatability for the most important peak cardiopulmonary performance parameters. These findings have important implications for the design and implementation of cardiovascular exercise interventions in severely impaired populations. Future research needs to develop advanced control strategies to enable the true limit of functional exercise capacity to be reached and to further assess test-retest reliability and repeatability in larger samples.
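The abstract above reports ICC, SEM, and MDC but not how they relate. A minimal sketch, assuming the commonly used conventions SEM = SD × sqrt(1 − ICC) and MDC95 = 1.96 × sqrt(2) × SEM (the abstract does not state which formulas the authors applied), is given below. The function names and the sample standard deviation are hypothetical and chosen only for illustration; the ICC value of 0.91 is the peak work rate figure quoted in the abstract.

```python
import math


def standard_error_of_measurement(sd: float, icc: float) -> float:
    """SEM = SD * sqrt(1 - ICC); assumes this common definition applies."""
    if not 0.0 <= icc <= 1.0:
        raise ValueError("ICC must lie in [0, 1]")
    return sd * math.sqrt(1.0 - icc)


def minimal_detectable_change(sem: float, z: float = 1.96) -> float:
    """MDC at the confidence level implied by z (1.96 -> 95%)."""
    return z * math.sqrt(2.0) * sem


if __name__ == "__main__":
    # Hypothetical between-subject SD of 10 units; ICC taken from the abstract.
    sem = standard_error_of_measurement(sd=10.0, icc=0.91)
    mdc = minimal_detectable_change(sem)
    print(f"SEM = {sem:.2f}, MDC95 = {mdc:.2f}")
```

Under these conventions MDC95 is always about 2.77 times SEM, which is roughly in line with the reported ranges (SEM 4-13%, MDC 12-36%).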
In vitro and in vivo testing of a totally implantable left ventricular assist system.

PubMed

Jassawalla, J S; Daniel, M A; Chen, H; Lee, J; LaForge, D; Billich, J; Ramasamy, N; Miller, P J; Oyer, P E; Portner, P M

1988-01-01

The totally implantable Novacor LVAS is being tested under NIH auspices to demonstrate safety and efficacy before clinical trials. Twelve complete systems (submerged in saline at 37 degrees C) are being tested, with an NIH goal of demonstrating 80% reliability for 2 year operation with a 60% confidence level. The systems, which are continuously monitored, are diurnally cycled between two output levels by automatically varying preload and afterload. Currently, 14.3 years of failure-free operation have been accumulated, with a mean duration of 14 months. Using an exponential failure distribution model, the mean time to failure (MTTF) is greater than 8.8 years, corresponding to a demonstrated reliability (for a 2 year mission time) of 80% (80% confidence level). Recent ovine experiments with VAS subsystems include a 767 day volume compensator implant, a 279 day pump/drive unit implant and a 1,448 day BST implant. The last 12 chronic pump/drive unit experiments had a mean duration of 153 days (excluding early postoperative complications). This compares favorably with the NIH goals for complete systems (5 month mean duration). Complete system experiments are currently underway.
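The figures quoted in this abstract are consistent with the standard zero-failure bound for an exponential life model, in which the one-sided lower confidence bound on MTTF is T / (-ln(1 - C)) for accumulated failure-free time T and confidence level C. The sketch below reproduces that arithmetic; the function names are hypothetical, and the abstract does not say that the authors computed their values in exactly this way.

```python
import math


def zero_failure_mttf_bound(total_time: float, confidence: float) -> float:
    """Lower confidence bound on MTTF after total_time with zero failures."""
    if not 0.0 < confidence < 1.0:
        raise ValueError("confidence must lie strictly between 0 and 1")
    return total_time / -math.log(1.0 - confidence)


def exponential_reliability(mission_time: float, mttf: float) -> float:
    """R(t) = exp(-t / MTTF) for a constant failure rate."""
    return math.exp(-mission_time / mttf)


def required_failure_free_time(mission_time: float, reliability: float,
                               confidence: float) -> float:
    """Failure-free time needed to demonstrate a reliability target."""
    required_mttf = mission_time / -math.log(reliability)
    return required_mttf * -math.log(1.0 - confidence)


if __name__ == "__main__":
    # Values from the abstract: 14.3 failure-free years, 80% confidence,
    # 2 year mission time.
    mttf = zero_failure_mttf_bound(14.3, 0.80)
    r2 = exponential_reliability(2.0, mttf)
    print(f"MTTF lower bound: {mttf:.2f} years, R(2 years): {r2:.3f}")
    # Time needed for the stated NIH goal (80% reliability, 60% confidence).
    print(f"{required_failure_free_time(2.0, 0.80, 0.60):.2f} years")
```

With 14.3 failure-free years at 80% confidence, the bound comes out just above 8.8 years and the 2 year reliability just under 0.80, matching the abstract.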
Reliability of plasma lipopolysaccharide-binding protein (LBP) from repeated measures in healthy adults.

PubMed

Citronberg, Jessica S; Wilkens, Lynne R; Lim, Unhee; Hullar, Meredith A J; White, Emily; Newcomb, Polly A; Le Marchand, Loïc; Lampe, Johanna W

2016-09-01

Plasma lipopolysaccharide-binding protein (LBP), a measure of internal exposure to bacterial lipopolysaccharide, has been associated with several chronic conditions and may be a marker of chronic inflammation; however, no studies have examined the reliability of this biomarker in a healthy population. We examined the temporal reliability of LBP measured in archived samples from participants in two studies. In Study one, 60 healthy participants had blood drawn at two time points: baseline and follow-up (either three, six, or nine months). In Study two, 24 individuals had blood drawn three to four times over a seven-month period. We measured LBP in archived plasma by ELISA. Test-retest reliability was estimated by calculating the intraclass correlation coefficient (ICC). Plasma LBP concentrations showed moderate reliability in Study one (ICC 0.60, 95 % CI 0.43-0.75) and Study two (ICC 0.46, 95 % CI 0.26-0.69). Restricting the follow-up period improved reliability. In Study one, the reliability of LBP over a three-month period was 0.68 (95 % CI: 0.41-0.87). In Study two, the ICC of samples taken ≤seven days apart was 0.61 (95 % CI 0.29-0.86). Plasma LBP concentrations demonstrated moderate test-retest reliability in healthy individuals with reliability improving over a shorter follow-up period.
Lifetime prediction and reliability estimation methodology for Stirling-type pulse tube refrigerators by gaseous contamination accelerated degradation testing

NASA Astrophysics Data System (ADS)

Wan, Fubin; Tan, Yuanyuan; Jiang, Zhenhua; Chen, Xun; Wu, Yinong; Zhao, Peng

2017-12-01

Lifetime and reliability are the two performance parameters of premium importance for modern space Stirling-type pulse tube refrigerators (SPTRs), which are required to operate in excess of 10 years. Demonstration of these parameters provides a significant challenge. This paper proposes a lifetime prediction and reliability estimation method that utilizes accelerated degradation testing (ADT) for SPTRs related to gaseous contamination failure. The method was experimentally validated via three groups of gaseous contamination ADT. First, the performance degradation model based on mechanism of contamination failure and material outgassing characteristics of SPTRs was established. Next, a preliminary test was performed to determine whether the mechanism of contamination failure of the SPTRs during ADT is consistent with normal life testing. Subsequently, the experimental program of ADT was designed for SPTRs. Then, three groups of gaseous contamination ADT were performed at elevated ambient temperatures of 40 °C, 50 °C, and 60 °C, respectively and the estimated lifetimes of the SPTRs under normal condition were obtained through acceleration model (Arrhenius model). The results show good fitting of the degradation model with the experimental data. Finally, we obtained the reliability estimation of SPTRs through using the Weibull distribution. The proposed novel methodology enables us to take less than one year time to estimate the reliability of the SPTRs designed for more than 10 years.
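The Arrhenius model named in this abstract relates lifetime at an elevated test temperature to lifetime at the use temperature through an acceleration factor AF = exp[(Ea / k) × (1/T_use − 1/T_stress)], with temperatures in kelvin. The sketch below evaluates that factor at the three stress temperatures given in the abstract. The activation energy (0.5 eV) and the use temperature (20 °C) are hypothetical placeholders, since the abstract reports neither, and the function name is illustrative only.

```python
import math

BOLTZMANN_EV_PER_K = 8.617333262e-5  # Boltzmann constant in eV/K


def arrhenius_acceleration_factor(ea_ev: float, t_use_c: float,
                                  t_stress_c: float) -> float:
    """Ratio of life at use temperature to life at stress temperature."""
    t_use_k = t_use_c + 273.15
    t_stress_k = t_stress_c + 273.15
    exponent = (ea_ev / BOLTZMANN_EV_PER_K) * (1.0 / t_use_k - 1.0 / t_stress_k)
    return math.exp(exponent)


if __name__ == "__main__":
    ASSUMED_EA_EV = 0.5      # hypothetical activation energy, not from the source
    ASSUMED_USE_TEMP_C = 20  # hypothetical normal operating temperature
    for stress_c in (40.0, 50.0, 60.0):  # stress levels from the abstract
        af = arrhenius_acceleration_factor(ASSUMED_EA_EV, ASSUMED_USE_TEMP_C,
                                           stress_c)
        print(f"{stress_c:.0f} C: acceleration factor {af:.1f}")
```

Multiplying a lifetime observed at a stress temperature by the corresponding factor gives the extrapolated lifetime at the use temperature, which is how a test of under one year can address a design life of more than 10 years.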
Improving the Validity and Reliability of a Health Promotion Survey for Physical Therapists

PubMed Central

Stephens, Jaca L.; Lowman, John D.; Graham, Cecilia L.; Morris, David M.; Kohler, Connie L.; Waugh, Jonathan B.

2013-01-01

Purpose Physical therapists (PTs) have a unique opportunity to intervene in the area of health promotion. However, no instrument has been validated to measure PTs’ views on health promotion in physical therapy practice. The purpose of this study was to evaluate the content validity and test-retest reliability of a health promotion survey designed for PTs. Methods An expert panel of PTs assessed the content validity of “The Role of Health Promotion in Physical Therapy Survey” and provided suggestions for revision. Item content validity was assessed using the content validity ratio (CVR) as well as the modified kappa statistic. Therapists then participated in the test-retest reliability assessment of the revised health promotion survey, which was assessed using a weighted kappa statistic. Results Based on feedback from the expert panelists, significant revisions were made to the original survey. The expert panel reached at least a majority consensus agreement for all items in the revised survey and the survey-CVR improved from 0.44 to 0.66. Only one item on the revised survey had substantial test-retest agreement, with 55% of the items having moderate agreement and 43% poor agreement. Conclusions All items on the revised health promotion survey demonstrated at least fair validity, but few items had reasonable test-retest reliability. Further modifications should be made to strengthen the validity and improve the reliability of this survey. PMID:23754935
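The content validity ratio mentioned in this abstract is commonly computed with Lawshe's formula, CVR = (n_e − N/2) / (N/2), where n_e is the number of panelists rating an item essential and N is the panel size. The sketch below assumes that definition and averages item CVRs to obtain a survey-level value; the abstract gives neither the formula nor the panel size, so the panel of 10 and the item counts here are hypothetical.

```python
from statistics import mean


def content_validity_ratio(n_essential: int, n_panelists: int) -> float:
    """Lawshe's CVR; ranges from -1 (no one) to +1 (everyone) rating essential."""
    if n_panelists <= 0 or not 0 <= n_essential <= n_panelists:
        raise ValueError("need 0 <= n_essential <= n_panelists and a non-empty panel")
    half = n_panelists / 2.0
    return (n_essential - half) / half


def survey_cvr(essential_counts: list[int], n_panelists: int) -> float:
    """Mean of item-level CVRs, used here as a survey-level summary."""
    return mean(content_validity_ratio(n, n_panelists) for n in essential_counts)


if __name__ == "__main__":
    # Hypothetical panel of 10 experts rating four items.
    counts = [10, 8, 9, 7]
    for n in counts:
        print(n, content_validity_ratio(n, 10))
    print("survey CVR:", survey_cvr(counts, 10))
```

A CVR of 0 means exactly half the panel rated the item essential, so any positive value reflects majority agreement.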
Standardization of Brief Inventory of Social Support Exchange Network (BISSEN) in Japan.

PubMed

Aiba, Miyuki; Tachikawa, Hirokazu; Fukuoka, Yoshiharu; Lebowitz, Adam; Shiratori, Yuki; Doi, Nagafumi; Matsui, Yutaka

2017-07-01

This study describes the Brief Inventory of Social Support Exchange Network (BISSEN) as a standardized brief inventory measuring various aspects of social support. We confirmed the reliability and validity for function and direction of support and standardized the BISSEN. For Sample 1, a stratified random sampling method was used to select 5200 residents in Japan. We conducted mail surveys and responses were retrieved from 2274 participants (collection rate 43.7%). Participants completed a questionnaire packet that included BISSEN, suicidal ideation, depression, support seeking, and the Multidimensional Scale of Perceived Social Support (MSPSS). Sample 2 surveys for test-retest reliability were conducted on 23 residents at approximately two-week intervals. Participants were asked about gender, age, and BISSEN. First, we assessed the internal consistency, test-retest reliability, and construct, convergent, and concurrent validity. McDonald's omega (.73-.92) and test-retest correlations (.78-.85) demonstrated adequate internal consistency and test-retest reliability. Depression, support seeking, and MSPSS were significantly correlated with all scores of BISSEN. The non-suicidal ideation group had significantly more support compared to the suicidal ideation group. Therefore, function and direction of support in BISSEN had sufficient reliability and validity. Next, we standardized BISSEN using Z-scores and percentile ranks for each of the 12 norm groups defined by age and gender. Copyright © 2017 Elsevier Ireland Ltd. All rights reserved.
Validation of an instrument to measure quality of life in British children with inflammatory bowel disease.

PubMed

Ogden, C A; Akobeng, A K; Abbott, J; Aggett, P; Sood, M R; Thomas, A G

2011-09-01

To validate IMPACT-III (UK), a health-related quality of life (HRQoL) instrument, in British children with inflammatory bowel disease (IBD). One hundred six children and parents were invited to participate. IMPACT-III (UK) was validated by inspection by health professionals and children to assess face and content validity, factor analysis to determine optimum domain structure, use of Cronbach alpha coefficients to test internal reliability, ANOVA to assess discriminant validity, correlation with the Child Health Questionnaire to assess concurrent validity, and use of intraclass correlation coefficients to assess test-retest reliability. The independent samples t test was used to measure differences between sexes and age groups, and between paper and computerised versions of IMPACT-III (UK). IMPACT-III (UK) had good face and content validity. The most robust factor solution was a 5-domain structure: body image, embarrassment, energy, IBD symptoms, and worries/concerns about IBD, all of which demonstrated good internal reliability (α = 0.74-0.88). Discriminant validity was demonstrated by significant (P < 0.05, P < 0.01) differences in HRQoL scores between the severe, moderate, and inactive/mild symptom severity groups for the embarrassment scale (63.7 vs 81.0 vs 81.2), IBD symptom scale (45.0 vs 64.2 vs 80.6), and the energy scale (46.4 vs 62.1 vs 77.7). Concurrent validity of IMPACT-III (UK) with comparable domains of the Child Health Questionnaire was confirmed. Test-retest reliability was confirmed with good intraclass correlation coefficients of 0.66 to 0.84. Paper and computer versions of IMPACT-III (UK) collected comparable scores, and there were no differences between the sexes and age groups. IMPACT-III (UK) appears to be a useful tool to measure HRQoL in British children with IBD.
Test-retest reliability of knee extensor rate of velocity and power development in older adults using the isotonic mode on a Biodex System 3 dynamometer.

PubMed

Van Driessche, Stijn; Van Roie, Evelien; Vanwanseele, Benedicte; Delecluse, Christophe

2018-01-01

Isotonic testing and measures of rapid power production are emerging as functionally relevant test methods for detection of muscle aging. Our objective was to assess reliability of rapid velocity and power measures in older adults using the isotonic mode of an isokinetic dynamometer. Sixty-three participants (aged 65 to 82 years) underwent a test-retest protocol with one week time interval. Isotonic knee extension tests were performed at four different loads: 0%, 25%, 50% and 75% of maximal isometric strength. Peak velocity (pV) and power (pP) were determined as the highest values of the velocity and power curve. Rate of velocity (RVD) and power development (RPD) were calculated as the linear slopes of the velocity- and power-time curve. Relative and absolute measures of test-retest reliability were analyzed using intraclass correlation coefficients (ICC), standard error of measurement (SEM) and Bland-Altman analyses. Overall, reliability was high for pV, pP, RVD and RPD at 0%, 25% and 50% load (ICC: .85 - .98, SEM: 3% - 10%). A trend for increased reliability at lower loads seemed apparent. The tests at 75% load led to range of motion failure and should be avoided. In addition, results demonstrated that caution is advised when interpreting early phase results (first 50ms). To conclude, our results support the use of the isotonic mode of an isokinetic dynamometer for testing rapid power and velocity characteristics in older adults, which is of high clinical relevance given that these muscle characteristics are emerging as the primary outcomes for preventive and rehabilitative interventions in aging research.
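The abstract above reports intraclass correlation coefficients and standard errors of measurement without stating which ICC form was used. As a hedged sketch, the code below assumes the Shrout and Fleiss ICC(2,1) form (two-way random effects, absolute agreement, single measure) and the common relation SEM = SD · sqrt(1 - ICC). The function names and the rating matrix are illustrative, not data from the study.

```python
import math


def icc_2_1(data):
    """ICC(2,1) from n subjects (rows) by k sessions or raters (columns).

    Assumed form: two-way random effects, absolute agreement, single measure.
    """
    n, k = len(data), len(data[0])
    grand = sum(sum(row) for row in data) / (n * k)
    row_means = [sum(row) / k for row in data]
    col_means = [sum(row[j] for row in data) / n for j in range(k)]
    # Two-way ANOVA sums of squares
    ss_total = sum((x - grand) ** 2 for row in data for x in row)
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_err = ss_total - ss_rows - ss_cols
    # Mean squares for subjects, sessions, and residual error
    ms_rows = ss_rows / (n - 1)
    ms_cols = ss_cols / (k - 1)
    ms_err = ss_err / ((n - 1) * (k - 1))
    return (ms_rows - ms_err) / (
        ms_rows + (k - 1) * ms_err + k * (ms_cols - ms_err) / n
    )


def sem_from_icc(sd, icc):
    """Standard error of measurement from a score SD and an ICC."""
    return sd * math.sqrt(1.0 - icc)


# Illustrative 6-subject by 4-rater matrix (not data from the study)
ratings = [
    [9, 2, 5, 8],
    [6, 1, 3, 2],
    [8, 4, 6, 8],
    [7, 1, 2, 6],
    [10, 5, 6, 9],
    [6, 2, 4, 7],
]
print(round(icc_2_1(ratings), 2))
```

Other ICC forms (for example consistency rather than absolute agreement) give different values on the same data, so the form should always be reported alongside the coefficient.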
Clinical evaluation of a new noninvasive ankle arthrometer.

PubMed

Nauck, Tanja; Lohrer, Heinz; Gollhofer, Albert

2010-06-01

A nonradiographic arthrometer was developed to objectively quantify anterior talar drawer instability in stable and unstable ankles. Diagnostic validity of this device was previously demonstrated in a cadaver study. The aim of the present study was to validate the ankle arthrometer in an in vivo setting. Twenty-three subjects participated in the study. An orthopedic surgeon first performed a manual anterior talar drawer test to classify the subjects' ankles as stable or unstable. The subjects were then evaluated using the ankle arthrometer, and filled out a validated self-reported questionnaire (German version of the Foot and Ankle Ability Measure [FAAM-G]). Ankle stiffness was calculated from the low linear region (40-60 N) of the load deformation curves obtained from the ankle arthrometer. Reliability testing of these stiffness values was done based on load deformation curves, with 150 and 200 N maximum anterior drawer loads applied in the ankle arthrometer. Using the manual anterior drawer test, 16 ankles were classified as stable and 7 were classified as unstable. Arthrometer stiffness analysis differentiated stable from unstable ankles (P = 0.00 and P = 0.01, respectively). Test-retest measurements demonstrated good reliability (intraclass correlation coefficient = 0.80). A significant correlation was found between both FAAM-G subscales and the arthrometer stiffness values (r = 0.43 and 0.54; P = 0.04 and 0.01). Discussion: Subjects with and without mechanical ankle instability could be differentiated by ankle arthrometer stiffness analysis and the FAAM-G questionnaire results. This nonradiographic device may be relevant for screening athletes at risk for ankle injuries, for clinical follow-up studies, and implementing preventive strategies. Validity and reliability of the new ankle arthrometer are demonstrated in a small cohort in an in vivo setting.
New International Program to Assess the Reliability of Emerging Nondestructive Techniques (PARENT)

DOE Office of Scientific and Technical Information (OSTI.GOV)

Prokofiev, Iouri; Cumblidge, Stephen E.; Csontos, Aladar A.

2013-01-25

The Nuclear Regulatory Commission (NRC) established the Program to Assess the Reliability of Emerging Nondestructive Techniques (PARENT) to follow on from the successful Program for the Inspection of Nickel alloy Components (PINC). The goal of PARENT is to conduct a confirmatory assessment of the reliability of nondestructive evaluation (NDE) techniques for detecting and sizing primary water stress corrosion cracks (PWSCC) and applying the lessons learned from PINC to a series of round-robin tests. These open and blind round-robin tests will comprise a new set of typical pressure boundary components including dissimilar metal welds (DMWs) and bottom-mounted instrumentation penetrations. Open round-robin tests will engage research and industry teams worldwide to investigate and demonstrate the reliability of emerging NDE techniques to detect and size flaws with a wide range of lengths, depths, orientations, and locations. Blind round-robin tests will utilize various testing organizations, whose inspectors and procedures are certified by the standards for the nuclear industry in their respective countries, to investigate the ability of established NDE techniques to detect and size flaws whose characteristics range from relatively easy to very difficult for detection and sizing. Blind and open round-robin testing started in late 2011 and early 2012, respectively. This paper will present the work scope with reports on progress, NDE methods evaluated, and project timeline for PARENT.
The Nutrition Literacy Assessment Instrument is a Valid and Reliable Measure of Nutrition Literacy in Adults with Chronic Disease.

PubMed

Gibbs, Heather D; Ellerbeck, Edward F; Gajewski, Byron; Zhang, Chuanwu; Sullivan, Debra K

2018-03-01

To test the reliability and validity of the Nutrition Literacy Assessment Instrument (NLit) in adult primary care and identify the relationship between nutrition literacy and diet quality. This instrument validation study included a cross-sectional sample participating in up to 2 visits 1 month apart. A total of 429 adults with nutrition-related chronic disease were recruited from clinics and a patient registry affiliated with a Midwestern university medical center. Nutrition literacy was measured by the NLit, which was composed of 6 subscales: nutrition and health, energy sources in food, food label and numeracy, household food measurement, food groups, and consumer skills. Diet quality was measured by Healthy Eating Index-2010 with nutrient data from Diet History Questionnaire II surveys. The researchers measured factor validity and reliability by using binary confirmatory factor analysis; test-retest reliability was measured by Pearson r and the intraclass correlation coefficient, and relationships between nutrition literacy and diet quality were analyzed by linear regression. The NLit demonstrated substantial factor validity and reliability (0.97; confidence interval, 0.96-0.98) and test-retest reliability (0.88; confidence interval, 0.85-0.90). Nutrition literacy was the most significant predictor of diet quality (β = .17; multivariate coefficient = 0.10; P < .001). The NLit is a valid and reliable tool for measuring nutrition literacy in adult primary care patients. Copyright © 2017 Society for Nutrition Education and Behavior. Published by Elsevier Inc. All rights reserved.
Quantitative Accelerated Life Testing of MEMS Accelerometers.

PubMed

Bâzu, Marius; Gălăţeanu, Lucian; Ilian, Virgil Emil; Loicq, Jerome; Habraken, Serge; Collette, Jean-Paul

2007-11-20

Quantitative Accelerated Life Testing (QALT) is a solution for assessing the reliability of Micro Electro Mechanical Systems (MEMS). A procedure for QALT is shown in this paper and an attempt to assess the reliability level for a batch of MEMS accelerometers is reported. The testing plan is application-driven and contains combined tests: thermal (high temperature) and mechanical stress. Two variants of mechanical stress are used: vibration (at a fixed frequency) and tilting. Original equipment for testing at tilting and high temperature is used. Tilting is appropriate as application-driven stress, because the tilt movement is a natural environment for devices used for automotive and aerospace applications. Also, tilting is used by MEMS accelerometers for anti-theft systems. The test results demonstrated the excellent reliability of the studied devices, the failure rate in the "worst case" being smaller than 10^-7 h^-1.
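The abstract above quotes a worst-case failure rate but does not say how the bound was derived. As a rough sketch of one common demonstration-test calculation, the code below assumes a constant failure rate (exponential lifetimes) and a test that ends with zero failures, in which case the one-sided upper confidence bound is λ ≤ -ln(1 - CL) / T for T accumulated device-hours. The function names and the 90% confidence level are assumptions for illustration only.

```python
import math


def zero_failure_rate_bound(device_hours, confidence=0.90):
    """Upper bound on a constant failure rate after a zero-failure test.

    Assumes exponential lifetimes; device_hours is the total (equivalent)
    test time summed over all units.
    """
    return -math.log(1.0 - confidence) / device_hours


def device_hours_needed(target_rate, confidence=0.90):
    """Zero-failure device-hours needed to demonstrate target_rate."""
    return -math.log(1.0 - confidence) / target_rate


# Device-hours needed to demonstrate 1e-7 per hour at 90% confidence
print(f"{device_hours_needed(1e-7):.3e}")
```

In an accelerated test the equivalent device-hours would be the actual test hours multiplied by an acceleration factor, which has to come from a separately justified stress model.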
Evaluation of the Early Childhood Oral Health Impact Scale in an Australian preschool child population.

PubMed

Arrow, P; Klobas, E

2015-09-01

Early childhood caries has significant impacts on children and their families. The Early Childhood Oral Health Impact Scale (ECOHIS) is an instrument for capturing the complex dimensions of preschool children's oral health. This study aimed to evaluate the reliability and validity of the instrument among Australian preschool children. Parents/children dyads (n = 286) participating in a treatment trial on early childhood caries completed the scale at baseline, and 33 parents repeated the questionnaire 2-3 weeks later. The validity and reliability of the ECOHIS was determined using tests for convergent and discriminant validity, internal reliability of the instrument and test-retest reliability. Scale impacts were strongly correlated with global oral health ratings (Spearman's correlations; r = 0.51, total score; r = 0.43, child impact; and r = 0.49, family impact; p < 0.001). The scale was significantly associated with children's caries experience, p < 0.001. Cronbach's alpha values were 0.87, 0.89 and 0.74 for the total, the child and the family domains, respectively. Test-retest reliability was 0.92, 0.89 and 0.78 for the total, child and family domains, respectively. The scale demonstrated acceptable validity and reliability for assessing the impact of early childhood caries among Australian preschool children. © 2015 Australian Dental Association.
Development of a Tablet-based symbol digit modalities test for reliably assessing information processing speed in patients with stroke.

PubMed

Tung, Li-Chen; Yu, Wan-Hui; Lin, Gong-Hong; Yu, Tzu-Ying; Wu, Chien-Te; Tsai, Chia-Yin; Chou, Willy; Chen, Mei-Hsiang; Hsieh, Ching-Lin

2016-09-01

To develop a Tablet-based Symbol Digit Modalities Test (T-SDMT) and to examine the test-retest reliability and concurrent validity of the T-SDMT in patients with stroke. The study had two phases. In the first phase, six experts, nine college students and five outpatients participated in the development and testing of the T-SDMT. In the second phase, 52 outpatients were evaluated twice (2 weeks apart) with the T-SDMT and SDMT to examine the test-retest reliability and concurrent validity of the T-SDMT. The T-SDMT was developed via expert input and college student/patient feedback. Regarding test-retest reliability, the practise effects of the T-SDMT and SDMT were both trivial (d = 0.12) but significant (p ≤ 0.015). The improvement in the T-SDMT (4.7%) was smaller than that in the SDMT (5.6%). The minimal detectable changes (MDC%) of the T-SDMT and SDMT were 6.7 (22.8%) and 10.3 (32.8%), respectively. The T-SDMT and SDMT were highly correlated with each other at the two time points (Pearson's r = 0.90-0.91). The T-SDMT demonstrated good concurrent validity with the SDMT. Because the T-SDMT had a smaller practise effect and less random measurement error (superior test-retest reliability), it is recommended over the SDMT for assessing information processing speed in patients with stroke. Implications for Rehabilitation: The Symbol Digit Modalities Test (SDMT), a common measure of information processing speed, showed a substantial practise effect and considerable random measurement error in patients with stroke. The Tablet-based SDMT (T-SDMT) has been developed to reduce the practise effect and random measurement error of the SDMT in patients with stroke. The T-SDMT had smaller practise effect and random measurement error than the SDMT, which can provide more reliable assessments of information processing speed.
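The minimal detectable change figures above are given without a formula. A minimal sketch, assuming the conventional 95% definition MDC = 1.96 · sqrt(2) · SEM and MDC% = 100 · MDC / mean score, is shown below. The function names are illustrative, and the mean score of 29.4 is back-calculated from the reported 6.7 and 22.8% rather than taken from the abstract.

```python
import math


def mdc95(sem):
    """Minimal detectable change at 95% confidence from the SEM."""
    return 1.96 * math.sqrt(2) * sem


def mdc_percent(mdc, mean_score):
    """MDC expressed as a percentage of the mean observed score."""
    return 100.0 * mdc / mean_score


# Back-calculated mean (assumption): 6.7 / 0.228 is roughly 29.4
print(round(mdc_percent(6.7, 29.4), 1))
```

A change smaller than the MDC cannot be distinguished from random measurement error for an individual patient, which is why the lower MDC% is read as better test-retest reliability.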
Reliability and criterion-related validity testing (construct) of the Endotracheal Suction Assessment Tool (ESAT©).

PubMed

Davies, Kylie; Bulsara, Max K; Ramelet, Anne-Sylvie; Monterosso, Leanne

2018-05-01

To establish criterion-related construct validity and test-retest reliability for the Endotracheal Suction Assessment Tool© (ESAT©). Endotracheal tube suction performed in children can significantly affect clinical stability. Previously identified clinical indicators for endotracheal tube suction were used as criteria when designing the ESAT©. Content validity was reported previously. The final stages of psychometric testing are presented. Observational testing was used to measure construct validity and determine whether the ESAT© could guide "inexperienced" paediatric intensive care nurses' decision-making regarding endotracheal tube suction. Test-retest reliability of the ESAT© was performed at two time points. The researchers and paediatric intensive care nurse "experts" developed 10 hypothetical clinical scenarios with predetermined endotracheal tube suction outcomes. "Experienced" (n = 12) and "inexperienced" (n = 14) paediatric intensive care nurses were presented with the scenarios and the ESAT© guiding decision-making about whether to perform endotracheal tube suction for each scenario. Outcomes were compared with those predetermined by the "experts" (n = 9). Test-retest reliability of the ESAT© was measured at two consecutive time points (4 weeks apart) with "experienced" and "inexperienced" paediatric intensive care nurses using the same scenarios and tool to guide decision-making. No differences were observed between endotracheal tube suction decisions made by "experts" (n = 9), "inexperienced" (n = 14) and "experienced" (n = 12) nurses confirming the tool's construct validity. No differences were observed between groups for endotracheal tube suction decisions at T1 and T2. Criterion-related construct validity and test-retest reliability of the ESAT© were demonstrated. Further testing is recommended to confirm reliability in the clinical setting with the "inexperienced" nurse to guide decision-making related to endotracheal tube suction. 
The ESAT© is the first validated tool to systematically guide endotracheal nursing practice for the "inexperienced" nurse. © 2018 John Wiley & Sons Ltd.
Development of the technology for the fabrication of reliable laminar flow control panels

NASA Technical Reports Server (NTRS)

Weiss, D. D.; Lindh, D. V.

1977-01-01

Various configurations of porous, perforated and slotted materials were flow tested to determine if they would meet the LFC surface smoothness and flow requirements. The candidate materials were then tested for susceptibility to clogging and for resistance to corrosion. Of the materials tested, perforated titanium, porous polyimide, and slotted assemblies demonstrated a much greater resistance to clogging than other porous materials.
Reliability and a correlational analysis of the 6MWT, ten-meter walk test, thirty second sit to stand, and the linear analog scale of function in patients with head and neck cancer.

PubMed

Eden, Melissa M; Tompkins, James; Verheijde, Joseph L

2018-03-01

The purpose of this study was to establish the test-retest reliability of and relationships between various measures of physical function in a cohort of individuals in the early treatment stages for head and neck cancer (HNC). The Six-Minute Walk Test (6MWT), 10-Meter Walk Test (10MWT), 30-Second Sit to Stand (30STS), and Linear Analog Scale of Function (LASF) were administered to 42 participants with a diagnosis of HNC. Test-retest reliability and correlations between the measures are reported. The 6MWT, 10MWT, 30STS, and LASF demonstrate excellent test-retest reliability (ICC = 0.901-0.960). The 6MWT exhibits a moderate to good relationship with the 10MWT (r = 0.684, p < 0.001), whereas the relationship between the 30STS and the 6MWT (r = 0.407, p = 0.007) and 10MWT (r = 0.322, p = 0.038) is fair. The LASF does not correlate significantly with the 6MWT, 10MWT, or 30STS. The 6MWT, 10MWT, 30STS, and LASF are reliable measurement instruments for patients treated for HNC. The 6MWT, 10MWT, and 30STS are significantly correlated suggesting they may measure subconstructs of physical function. The LASF does not correlate significantly with the 6MWT, 10MWT and 30STS in this sample.
Assessment of behavioral mechanisms maintaining encopresis: Virginia Encopresis-Constipation Apperception Test.

PubMed

Cox, Daniel J; Ritterband, Lee M; Quillian, Warren; Kovatchev, Boris; Morris, James; Sutphen, James; Borowitz, Stephen

2003-09-01

To develop and test a scale for parent and child, evaluating theoretical and clinical parameters relevant to children with encopresis. Encopretic children were hypothesized to have more bowel-specific, but not more generic, psychological problems, as compared with nonsymptomatic control children. In addition, mothers were also believed to be more discerning than children. The Virginia Encopresis-Constipation Apperception Test (VECAT) consists of 9 pairs of bowel-specific and 9 parallel generic drawings. Respondents selected the picture in each pair that best described them/their child. It was administered to encopretic children (N = 87), nonsymptomatic siblings (N = 27), and nonsymptomatic nonsiblings (N = 35). The mothers of all the participants also completed the VECAT. Encopretic children were retested 6 and 12 months posttreatment with Enhanced Toilet Training. The VECAT demonstrated good test-retest reliability and internal consistency. Encopretic children and their mothers reported more bowel-specific, but not more generic, problems. Bowel-specific scores improved significantly posttreatment only for those patients who demonstrated significant symptom improvement. Mothers were significantly more discerning than children. The VECAT is a reliable, valid, discriminating, and sensitive test. Bowel-specific problems appear to best differentiate children with and without encopresis.
Development and Validation of a Questionnaire to Assess Multimorbidity in Primary Care: An Indian Experience.

PubMed

Pati, Sanghamitra; Hussain, Mohammad Akhtar; Swain, Subhashisa; Salisbury, Chris; Metsemakers, Job F M; Knottnerus, J André; van den Akker, Marjan

2016-01-01

Multimorbidity remains an underexplored domain in Indian primary care. We undertook a study to assess the prevalence, correlates, and outcomes of multimorbidity in primary care settings in India. This paper describes the process of development and validation of our data collection tool "Multimorbidity Assessment Questionnaire for Primary Care (MAQ-PC)." An iterative process comprising desk review, chart review, and expert consultations was undertaken to generate the questionnaire. The MAQ-PC contained items on chronic conditions, health care utilization, health related quality of life, disease severity, and sociodemographics. It was first tested with twelve adults for comprehensibility followed by test-retest reliability with 103 patients from four primary care practices. For interrater reliability, two interviewers separately administered the questionnaire to sixteen patients. MAQ-PC displayed strong internal consistency (Cronbach's alpha: 0.69), interrater reliability (Cohen's Kappa: 0.78-1), and test-retest reliability (ICC: 0.970-0.741). Substantial concordance between self-report and physician diagnosis (Scott Kappa: 0.59-1.0) was observed for listed chronic conditions indicating strong concurrent validity. Nearly 54% had one chronic condition and 23.3% had multimorbidity. Our findings demonstrate MAQ-PC to be a valid and reliable measure of multimorbidity in primary care practice and suggest its potential utility in multimorbidity research in India.

Developing a Danish version of the "Impact on Participation and Autonomy Questionnaire".

PubMed

Ghaziani, Emma; Krogh, Anne Grethe; Lund, Hans

2013-05-01

To translate the "Impact on Participation and Autonomy Questionnaire" into Danish (IPAQ-DK), and estimate its internal consistency and test-retest reliability in order to promote participation-based interventions and research. Translation and two successive reliability assessments through test-retest. 137 adults with varying degrees of impairment; of these, 67 participated in the final reliability assessment. The translation followed guidelines set forth by the "European Group for Quality of Life Assessment and Health Measurement". Internal consistency for subscales was estimated by Cronbach's alpha. Weighted kappa coefficients and intraclass correlation coefficients were calculated to assess the test-retest reliability at item and subscale level, respectively. A preliminary reliability assessment revealed residual issues regarding the translation and cultural adaptation of the instrument. The revised version (IPAQ-DK) was subsequently subjected to a similar assessment demonstrating Cronbach's alpha values from 0.698 to 0.817. Weighted kappa ranged from 0.370 to 0.880; 78% of these values were higher than 0.600. The intraclass correlation coefficient covered values from 0.701 to 0.818. IPAQ-DK is a useful instrument for identifying person-perceived participation restrictions and satisfaction with participation. Further studies of IPAQ-DK's floor/ceiling effects and responsiveness to change are recommended, as is an assessment of whether certain items need further linguistic improvement.
Testing the reliability of the Fall Risk Screening Tool in an elderly ambulatory population.

PubMed

Fielding, Susan J; McKay, Michael; Hyrkas, Kristiina

2013-11-01

To identify and test the reliability of a fall risk screening tool in an ambulatory outpatient clinic. The Fall Risk Screening Tool (Albert Lea Medical Center, MN, USA) was scripted for an interview format. Two interviewers separately screened a convenience sample of 111 patients (age ≥ 65 years) in an ambulatory outpatient clinic in a northeastern US city. The interviewers' scoring of fall risk categories was similar. There was good internal consistency (Cronbach's α = 0.834-0.889) and inter-rater reliability [intra-class correlation coefficients (ICC) = 0.824-0.881] for total, Risk Factor and Client's Health Status subscales. The Physical Environment scores indicated acceptable internal consistency (Cronbach's α = 0.742) and adequate reliability (ICC = 0.688). Two Physical Environment items (furniture and medical equipment condition) had low reliabilities [Kappa (K) = 0.323, P = 0.08; K = -0.078, P = 0.648, respectively]. The scripted Fall Risk Screening Tool demonstrated good reliability in this sample. Rewording two Physical Environment items will be considered. A reliable instrument such as the scripted Fall Risk Screening Tool provides a standardised assessment for identifying high fall risk patients. This tool is especially useful because it assesses personal, behavioural and environmental factors specific to community-dwelling patients; the interview format also facilitates patient-provider interaction. © 2013 John Wiley & Sons Ltd.
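The item-level kappa values above, including one slightly negative value, measure agreement between the two interviewers beyond what chance alone would produce. As an illustrative sketch (the abstract does not say whether an unweighted or weighted kappa was used), the function below computes unweighted Cohen's kappa, κ = (p_o - p_e) / (1 - p_e), for two raters. The function name and the ratings are hypothetical.

```python
from collections import Counter


def cohen_kappa(rater1, rater2):
    """Unweighted Cohen's kappa for two equal-length lists of category labels."""
    if len(rater1) != len(rater2) or not rater1:
        raise ValueError("ratings must be non-empty and of equal length")
    n = len(rater1)
    # Observed proportion of exact agreement
    p_o = sum(a == b for a, b in zip(rater1, rater2)) / n
    # Agreement expected by chance from each rater's marginal frequencies
    c1, c2 = Counter(rater1), Counter(rater2)
    p_e = sum(c1[c] * c2[c] for c in set(c1) | set(c2)) / (n * n)
    if p_e == 1.0:
        raise ValueError("kappa is undefined when both raters use one category")
    return (p_o - p_e) / (1.0 - p_e)


# Hypothetical yes/no ratings of one item by two interviewers
print(cohen_kappa(["y", "y", "n", "n"], ["y", "n", "n", "n"]))
```

A kappa near zero or below it means the raters agree no more often than chance predicts, which fits the authors' plan to reword the two low-scoring items.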
Calculating system reliability with SRFYDO

DOE Office of Scientific and Technical Information (OSTI.GOV)

Morzinski, Jerome; Anderson-Cook, Christine M; Klamann, Richard M

2010-01-01

SRFYDO is a process for estimating reliability of complex systems. Using information from all applicable sources, including full-system (flight) data, component test data, and expert (engineering) judgment, SRFYDO produces reliability estimates and predictions. It is appropriate for series systems with possibly several versions of the system which share some common components. It models reliability as a function of age and up to 2 other lifecycle (usage) covariates. Initial output from its Exploratory Data Analysis mode consists of plots and numerical summaries so that the user can check data entry and model assumptions, and help determine a final form for the system model. The System Reliability mode runs a complete reliability calculation using Bayesian methodology. This mode produces results that estimate reliability at the component, sub-system, and system level. The results include estimates of uncertainty, and can predict reliability at some not-too-distant time in the future. This paper presents an overview of the underlying statistical model for the analysis, discusses model assumptions, and demonstrates usage of SRFYDO.
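The abstract above does not give SRFYDO's model, so the following is only a generic sketch of the two ideas it names: a series system, whose reliability is the product of its component reliabilities, and a Bayesian estimate with uncertainty. It assumes independent components, pass/fail component test data, and a uniform Beta(1, 1) prior, and it ignores age and usage covariates entirely. All names and numbers are hypothetical and do not reflect SRFYDO's actual interface.

```python
import math
import random


def series_reliability(component_reliabilities):
    """Reliability of a series system of independent components."""
    return math.prod(component_reliabilities)


def posterior_system_draws(test_data, n_draws=10000, seed=0):
    """Sorted Monte Carlo draws of system reliability.

    test_data: list of (successes, trials) per component.
    Each component gets a Beta(1 + successes, 1 + failures) posterior,
    which assumes a uniform prior (an assumption, not SRFYDO's model).
    """
    rng = random.Random(seed)
    draws = []
    for _ in range(n_draws):
        comps = [rng.betavariate(1 + s, 1 + (n - s)) for s, n in test_data]
        draws.append(series_reliability(comps))
    return sorted(draws)


# Hypothetical component test results: (successes, trials)
draws = posterior_system_draws([(50, 50), (48, 50)])
lower, upper = draws[int(0.05 * len(draws))], draws[int(0.95 * len(draws))]
print(f"90% interval for system reliability: {lower:.3f} to {upper:.3f}")
```

Sampling each component posterior and multiplying the draws carries component-level uncertainty up to the system level, which is the general reason a Bayesian calculation can report uncertainty at every level.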
Validation of the Japanese version of the Pediatric Quality of Life Inventory (PedsQL) Cancer Module.

PubMed

Tsuji, Naoko; Kakee, Naoko; Ishida, Yasushi; Asami, Keiko; Tabuchi, Ken; Nakadate, Hisaya; Iwai, Tsuyako; Maeda, Miho; Okamura, Jun; Kazama, Takuro; Terao, Yoko; Ohyama, Wataru; Yuza, Yuki; Kaneko, Takashi; Manabe, Atsushi; Kobayashi, Kyoko; Kamibeppu, Kiyoko; Matsushima, Eisuke

2011-04-10

The PedsQL 3.0 Cancer Module is a widely used instrument to measure pediatric cancer specific health-related quality of life (HRQOL) for children aged 2 to 18 years. We developed the Japanese version of the PedsQL Cancer Module and investigated its reliability and validity among Japanese children and their parents. Participants were 212 children with cancer and 253 of their parents. Reliability was determined by internal consistency using Cronbach's coefficient alpha and test-retest reliability using intra-class correlation coefficient (ICC). Validity was assessed through factor validity, convergent and discriminant validity, concurrent validity, and clinical validity. Factor validity was examined by exploratory factor analysis. Convergent and discriminant validity were examined by multitrait scaling analysis. Concurrent validity was assessed using Spearman's correlation coefficients between the Cancer Module and Generic Core Scales, and the comparison of the scores of child self-reports with those of other self-rating depression scales for children. Clinical validity was assessed by comparing the on- and off- treatment scores using Kruskal-Wallis and Mann-Whitney U tests. Cronbach's coefficient alpha was over 0.70 for the total scale and over 0.60 for each subscale by age except for the 'pain and hurt' subscale for children aged 5 to 7 years. For test-retest reliability, the ICC exceeded 0.70 for the total scale for each age. Exploratory factor analysis demonstrated sufficient factorial validity. Multitrait scaling analysis showed high success rates. Strong correlations were found between the reports by children and their parents, and the scores of the Cancer Module and the Generic Core Scales except for 'treatment anxiety' subscales for child reports. The Depression Self-Rating Scale for Children (DSRS-C) scores were significantly correlated with emotional domains and the total score of the cancer module. 
Children who had been off treatment over 12 months demonstrated significantly higher scores than those on treatment. The results demonstrate the reliability and validity of the Japanese version of the PedsQL Cancer Module among Japanese children.
Thermal Protection for Mars Sample Return Earth Entry Vehicle: A Grand Challenge for Design Methodology and Reliability Verification

NASA Technical Reports Server (NTRS)

Venkatapathy, Ethiraj; Gage, Peter; Wright, Michael J.

2017-01-01

Mars Sample Return is our Grand Challenge for the coming decade. TPS (Thermal Protection System) nominal performance is not the key challenge. The main difficulty for designers is the need to verify unprecedented reliability for the entry system: current guidelines for prevention of backward contamination require that the probability of spores larger than 1 micron diameter escaping into the Earth environment be lower than 1 in 1 million for the entire system, and the allocation to TPS would be more stringent than that. For reference, the reliability allocation for Orion TPS is closer to 1 in 1000, and the demonstrated reliability for previous human Earth return systems was closer to 1 in 100. Improving reliability by more than 3 orders of magnitude is a grand challenge indeed. The TPS community must embrace the possibility of new architectures that are focused on reliability above thermal performance and mass efficiency. MSR (Mars Sample Return) EEV (Earth Entry Vehicle) will be hit with MMOD (Micrometeoroid and Orbital Debris) prior to reentry. A chute-less aero-shell design which allows for self-righting shape was baselined in prior MSR studies, with the assumption that a passive system will maximize EEV robustness. Hence the aero-shell along with the TPS has to take ground impact and not break apart. System verification will require testing to establish ablative performance and thermal failure but also testing of damage from MMOD, and structural performance at ground impact. Mission requirements will demand analysis, testing and verification that are focused on establishing reliability of the design. In this proposed talk, we will focus on the grand challenge of MSR EEV TPS and the need for innovative approaches to address challenges in modeling, testing, manufacturing and verification.
NEWS for Africa: adaptation and reliability of a built environment questionnaire for physical activity in seven African countries.

PubMed

Oyeyemi, Adewale L; Kasoma, Sandra S; Onywera, Vincent O; Assah, Felix; Adedoyin, Rufus A; Conway, Terry L; Moss, Sarah J; Ocansey, Reginald; Kolbe-Alexander, Tracy L; Akinroye, Kingsley K; Prista, Antonio; Larouche, Richard; Gavand, Kavita A; Cain, Kelli L; Lambert, Estelle V; Aryeetey, Richmond; Bartels, Clare; Tremblay, Mark S; Sallis, James F

2016-03-08

Built environment and policy interventions are effective strategies for controlling the growing worldwide deaths from physical inactivity-related non-communicable diseases. To improve built environment research and develop African specific evidence, it is important to first tailor built environment measures to African contexts and assess their psychometric properties across African countries. This study reports on the adaptation and test-retest reliability of the Neighborhood Environment Walkability Scale in seven sub-Saharan African countries (NEWS-Africa). The original NEWS comprising 8 subscales measuring reported physical and social attributes of neighborhood environments was systematically adapted for Africa through extensive input from physical activity and public health researchers, built environment professionals, and residents in seven African countries: Cameroon, Ghana, Kenya, Mozambique, Nigeria, South Africa and Uganda. Cognitive testing of NEWS-Africa was conducted among diverse residents (N = 109, 50 youth [12 - 17 years] and 59 adults [22 - 67 years], 69 % from low socioeconomic status [SES] neighborhoods). NEWS-Africa was translated into local languages and evaluated for 2-week test-retest reliability in adult participants (N = 301; female = 50.2 %; age = 32.3 ± 12.9 years) purposively recruited from neighborhoods varying in walkability (high and low walkable) and SES (high and low income) and from villages in six of seven participating countries. The original 67 NEWS items were expanded to 89 scores (76 individual NEWS items and 13 computed scales). Several modifications were made to individual items, and some new items were added to capture important attributes in the African environment. A new scale on personal safety was created, and the aesthetics scale was enlarged to reflect African specific characteristics. 
Over 95 % of all NEWS-Africa scores (items plus computed scales) demonstrated evidence of "excellent" (ICCs > 0.75) or "good" (ICCs = 0.60 to 0.74) reliability. Seven (53.8 %) of the 13 computed NEWS scales demonstrated "excellent" agreement and the other six had "good" agreement. No items or scales demonstrated "poor" reliability (ICCs < 0.40). The systematic adaptation and initial psychometric evaluation of NEWS-Africa indicates the instrument is feasible and reliable for use with adults of diverse demographic characteristics in Africa. The measure is likely to be useful for research, surveillance of built environment conditions for planning purposes, and to evaluate physical activity and policy interventions in Africa.
Reliability of high-power QCW arrays

NASA Astrophysics Data System (ADS)

Feeler, Ryan; Junghans, Jeremy; Remley, Jennifer; Schnurbusch, Don; Stephens, Ed

2010-02-01

Northrop Grumman Cutting Edge Optronics has developed a family of arrays for high-power QCW operation. These arrays are built using CTE-matched heat sinks and hard solder in order to maximize the reliability of the devices. A summary of a recent life test is presented in order to quantify the reliability of QCW arrays and associated laser gain modules. A statistical analysis of the raw lifetime data is presented in order to quantify the data in such a way that is useful for laser system designers. The life tests demonstrate the high level of reliability of these arrays in a number of operating regimes. For single-bar arrays, a MTTF of 19.8 billion shots is predicted. For four-bar samples, a MTTF of 14.6 billion shots is predicted. In addition, data representing a large pump source is analyzed and shown to have an expected lifetime of 13.5 billion shots. This corresponds to an expected operational lifetime of greater than ten thousand hours at repetition rates less than 370 Hz.
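The conversion from a shot-count lifetime to operating hours in the abstract above can be checked with a few lines of arithmetic. The following is a minimal sketch, assuming continuous operation at a fixed repetition rate; the function name `shots_to_hours` is illustrative and not taken from the paper.

```python
SECONDS_PER_HOUR = 3600


def shots_to_hours(shots: float, rep_rate_hz: float) -> float:
    """Hours of continuous operation needed to accumulate `shots` pulses.

    Assumes a constant repetition rate (an idealisation; duty cycles and
    downtime are ignored).
    """
    return shots / rep_rate_hz / SECONDS_PER_HOUR


if __name__ == "__main__":
    # Figures quoted in the abstract: 13.5 billion shots, 370 Hz.
    hours = shots_to_hours(13.5e9, 370.0)
    print(f"{hours:.0f} h")  # about 10,135 h, i.e. just over ten thousand hours
```

At any repetition rate below 370 Hz the same shot count takes longer to accumulate, which is consistent with the "greater than ten thousand hours" statement.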
Development of a Digital-Based Instrument to Assess Perceived Motor Competence in Children: Face Validity, Test-Retest Reliability, and Internal Consistency

PubMed Central

Palmer, Kara K.

2017-01-01

Assessing children’s perceptions of their movement abilities (i.e., perceived competence) is traditionally done using picture scales—Pictorial Scale of Perceived Competence and Acceptance for Young Children or Pictorial Scale of Perceived Movement Skill Competence. Pictures fail to capture the temporal components of movement. To address this limitation, we created a digital-based instrument to assess perceived motor competence: the Digital Scale of Perceived Motor Competence. The purpose of this study was to determine the validity, reliability, and internal consistency of the Digital-based Scale of Perceived Motor Skill Competence. The Digital-based Scale of Perceived Motor Skill Competence is based on the twelve fundamental motor skills from the Test of Gross Motor Development-2nd Edition with a similar layout and item structure as the Pictorial Scale of Perceived Movement Skill Competence. Face Validity of the instrument was examined in Phase I (n = 56; Mage = 8.6 ± 0.7 years, 26 girls). Test-retest reliability and internal consistency were assessed in Phase II (n = 54, Mage = 8.7 years ± 0.5 years, 26 girls). Intra-class correlations (ICC) and Cronbach’s alpha were conducted to determine test-retest reliability and internal consistency for all twelve skills along with locomotor and object control subscales. The Digital Scale of Perceived Motor Competence demonstrates excellent test-retest reliability (ICC = 0.83, total; ICC = 0.77, locomotor; ICC = 0.79, object control) and acceptable/good internal consistency (α = 0.62, total; α = 0.57, locomotor; α = 0.49, object control). Findings provide evidence of the reliability of the three level digital-based instrument of perceived motor competence for older children. PMID:29910408
Clinimetric properties of the Tinetti Mobility Test, Four Square Step Test, Activities-specific Balance Confidence Scale, and spatiotemporal gait measures in individuals with Huntington's disease

PubMed Central

Kloos, Anne D.; Fritz, Nora E.; Kostyk, Sandra K.; Young, Gregory S.; Kegelmeyer, Deb A.

2014-01-01

Background and purpose Individuals with Huntington's disease (HD) experience balance and gait problems that lead to falls. Clinicians currently have very little information about the reliability and validity of outcome measures to determine the efficacy of interventions that aim to reduce balance and gait impairments in HD. This study examined the reliability and concurrent validity of spatiotemporal gait measures, the Tinetti Mobility Test (TMT), Four Square Step Test (FSST), and Activities-specific Balance Confidence (ABC) Scale in individuals with HD. Methods Participants with HD [n = 20; mean age ± SD = 50.9 ± 13.7; 7 male] were tested on spatiotemporal gait measures, the TMT, FSST, and ABC Scale before and after a six week period to determine test–retest reliability and minimal detectable change (MDC) values. Linear relationships between gait and clinical measures were estimated using Pearson's correlation coefficients. Results Spatiotemporal gait measures, the TMT total and the FSST showed good to excellent test–retest reliability (ICC > 0.75). MDC values were 0.30 m/s and 0.17 m/s for velocity in forward and backward walking respectively, four points for the TMT, and 3 s for the FSST. The TMT and FSST were highly correlated with most spatiotemporal measures. The ABC Scale demonstrated lower reliability and less concurrent validity than other measures. Conclusions The high test–retest reliability over a six week period and concurrent validity between the TMT, FSST, and spatiotemporal gait measures suggest that the TMT and FSST may be useful outcome measures for future intervention studies in ambulatory individuals with HD. PMID:25128156
Significant lexical relationships

DOE Office of Scientific and Technical Information (OSTI.GOV)

Pedersen, T.; Kayaalp, M.; Bruce, R.

Statistical NLP inevitably deals with a large number of rare events. As a consequence, NLP data often violates the assumptions implicit in traditional statistical procedures such as significance testing. We describe a significance test, an exact conditional test, that is appropriate for NLP data and can be performed using freely available software. We apply this test to the study of lexical relationships and demonstrate that the results obtained using this test are both theoretically more reliable and different from the results obtained using previously applied tests.
Test-retest reliability of the Mandarin versions of the Hypertension Self-Care Profile instrument.

PubMed

Ngoh, Soh Heng Agnes; Lim, Hazel Wai Ling; Koh, Yi Ling Eileen; Tan, Ngiap Chuan

2017-11-01

Self-efficacy in essential hypertension can be measured using scales, such as the "Hypertension Self-Care Profile" (HTN-SCP) questionnaire. It assesses "Behavior", "Motivation", and "Self-efficacy" in 3 domains, respectively. This study aimed to validate the Mandarin version of HTN-SCP instrument (HTN-SCP-Mn) targeted at patients of Chinese ethnicity with hypertension. Our study recruited Chinese patients, aged 40 years and older, with essential hypertension from a public primary healthcare clinic in Singapore. The 60-item HTN-SCP-Mn questionnaire was completed online using a tablet or smartphone on enrolment. A retest was conducted 2 weeks after the initial test. Reliability was assessed by internal consistency and test-retest reliability using Cronbach alpha and intraclass correlation coefficients (ICC). Differences between the overall HTN-SCP-Mn scores of the patients and their self-reported self-management activities were also determined using independent t test. Of the 153 patients who completed the HTN-SCP-Mn during the initial test, 79 responded to the test-retest evaluation. Reliability of the 3 domains "Behavior", "Motivation", and "Self-efficacy" obtained high internal consistency (Cronbach alpha = 0.838, 0.929, and 0.927, respectively). The item total correlation ranged from 0.058 to 0.677 for Behavior, 0.374 to 0.798 for Motivation, and 0.326 to 0.767 for self-efficacy. The ICC indicated fair to good test-retest reliability with scores of 0.643, 0.579, and 0.710 for the respective domains. The results showed face validity of the HTN-SCP-Mn instrument, indicating its potential application in Mandarin-proficient patients. Further study is needed to correlate its scores with objective demonstration of self-efficacy.
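Cronbach's alpha, the internal-consistency statistic reported in the abstract above, can be computed from a respondents-by-items score matrix as alpha = k/(k-1) × (1 - sum of item variances / variance of total scores). The sketch below uses only the Python standard library; the function name and the toy data are illustrative assumptions and are not the study's data or software.

```python
from statistics import variance
from typing import Sequence


def cronbach_alpha(scores: Sequence[Sequence[float]]) -> float:
    """Cronbach's alpha for a matrix of scores.

    `scores` holds one row per respondent and one column per item.
    Sample variances (n - 1 denominator) are used throughout.
    """
    k = len(scores[0])  # number of items
    item_vars = [variance([row[i] for row in scores]) for i in range(k)]
    total_var = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


if __name__ == "__main__":
    # Hypothetical toy data: 3 respondents, 2 items.
    toy = [[1, 2], [2, 1], [3, 3]]
    print(round(cronbach_alpha(toy), 3))
```

When every item ranks respondents identically, the total-score variance is as large as it can be relative to the item variances and alpha reaches 1; weakly related items pull it toward 0.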
Reliability and Validity of Ten Consumer Activity Trackers Depend on Walking Speed.

PubMed

Fokkema, Tryntsje; Kooiman, Thea J M; Krijnen, Wim P; VAN DER Schans, Cees P; DE Groot, Martijn

2017-04-01

To examine the test-retest reliability and validity of ten activity trackers for step counting at three different walking speeds. Thirty-one healthy participants walked twice on a treadmill for 30 min while wearing 10 activity trackers (Polar Loop, Garmin Vivosmart, Fitbit Charge HR, Apple Watch Sport, Pebble Smartwatch, Samsung Gear S, Misfit Flash, Jawbone Up Move, Flyfit, and Moves). Participants walked at three speeds for 10 min each: slow (3.2 km·h⁻¹), average (4.8 km·h⁻¹), and vigorous (6.4 km·h⁻¹). To measure test-retest reliability, intraclass correlations (ICC) were determined between the first and second treadmill test. Validity was determined by comparing the trackers with the gold standard (hand counting), using mean differences, mean absolute percentage errors, and ICC. Statistical differences were calculated by paired-sample t tests, Wilcoxon signed-rank tests, and by constructing Bland-Altman plots. Test-retest reliability varied with ICC ranging from -0.02 to 0.97. Validity varied between trackers and different walking speeds with mean differences between the gold standard and activity trackers ranging from 0.0 to 26.4%. Most trackers showed relatively low ICC and broad limits of agreement of the Bland-Altman plots at the different speeds. For the slow walking speed, the Garmin Vivosmart and Fitbit Charge HR showed the most accurate results. The Garmin Vivosmart and Apple Watch Sport demonstrated the best accuracy at an average walking speed. For vigorous walking, the Apple Watch Sport, Pebble Smartwatch, and Samsung Gear S exhibited the most accurate results. Test-retest reliability and validity of activity trackers depend on walking speed. In general, consumer activity trackers perform better at an average and vigorous walking speed than at a slower walking speed.
Advanced flight control system study

NASA Technical Reports Server (NTRS)

Hartmann, G. L.; Wall, J. E., Jr.; Rang, E. R.; Lee, H. P.; Schulte, R. W.; Ng, W. K.

1982-01-01

A fly by wire flight control system architecture designed for high reliability includes spare sensor and computer elements to permit safe dispatch with failed elements, thereby reducing unscheduled maintenance. A methodology capable of demonstrating that the architecture does achieve the predicted performance characteristics consists of a hierarchy of activities ranging from analytical calculations of system reliability and formal methods of software verification to iron bird testing followed by flight evaluation. Interfacing this architecture to the Lockheed S-3A aircraft for flight test is discussed. This testbed vehicle can be expanded to support flight experiments in advanced aerodynamics, electromechanical actuators, secondary power systems, flight management, new displays, and air traffic control concepts.
A Factor Analytic Validation Study of the Scale of Teachers' Attitudes towards Inclusive Classrooms (STATIC)

ERIC Educational Resources Information Center

Nishimura, Trisha Sugita; Busse, Randy T.

2015-01-01

General and special education teachers (N = 125) completed the Scale of Teachers' Attitudes towards Inclusive Classrooms (STATIC). The internal consistency of the instrument was strong with an alpha of 0.89. The measure demonstrated excellent test-retest reliability (r = 0.99) and a dependent t-test was non-significant, indicating mean group…
Head and neck cancer-specific quality of life: instrument validation.

PubMed

Terrell, J E; Nanavati, K A; Esclamado, R M; Bishop, J K; Bradford, C R; Wolf, G T

1997-10-01

The disfigurement and dysfunction associated with head and neck cancer affect emotional well-being and some of the most basic functions of life. Most cancer-specific quality-of-life assessments give a single composite score for head and neck cancer-related quality of life. To develop and evaluate an improved multidimensional instrument to assess head and neck cancer-related functional status and well-being. The item selection process included literature review, interviews with health care workers, and patient surveys. A survey with 37 disease-specific questions and the SF-12 survey were administered to 253 patients in 3 large medical centers. Factor analysis was performed to identify disease-specific domains. Domain scores were calculated as the standardized score of the component items. These domains were assessed for construct validity based on clinical hypotheses and test-retest reliability. Four relevant domains were identified: Eating (6 items), Communication (4 items), Pain (4 items), and Emotion (6 items). Each had an internal consistency (Cronbach alpha value) of greater than 0.80. Construct validity was demonstrated by moderate correlations with the SF-12 Physical and Mental component scores (r=0.43-0.60). Test-retest reliability for each domain demonstrated strong reliability between the 2 time points. Correlations were strong for each individual question, ranging from 0.53 to 0.93. Construct validity testing demonstrated that the direction of differences for each domain was as hypothesized. The Head and Neck Quality of Life questionnaire is a promising multidimensional tool with which to assess head and neck cancer-specific quality of life.
Tunable nanoblock lasers and stretching sensors.

PubMed

Lu, T W; Wang, C; Hsiao, C F; Lee, P T

2016-09-22

Reconfigurable, reliable, and robust nanolasers with wavelengths tunable in the telecommunication bands are currently being sought after for use as flexible light sources in photonic integrated circuits. Here, we propose and demonstrate tunable nanolasers based on 1D nanoblocks embedded within stretchable polydimethylsiloxane. Our lasers show a large wavelength tunability of 7.65 nm per 1% elongation. Moreover, this tunability is reconfigurable and reliable under repeated stretching/relaxation tests. By applying excessive stretching, wide wavelength tuning over a range of 80 nm (spanning the S, C, and L telecommunication bands) is successfully demonstrated. Furthermore, as a stretching sensor, an enhanced wavelength response to elongation of 9.9 nm per % is obtained via the signal differential from two nanoblock lasers positioned perpendicular to each other. The minimum detectable elongation is as small as 0.056%. Nanoblock lasers can function as reliable tunable light sources in telecommunications and highly sensitive on-chip structural deformation sensors.
Development and Validation of a Fatigue Assessment Scale for U.S. Construction Workers

PubMed Central

Zhang, Mingzong; Sparer, Emily H.; Murphy, Lauren A.; Dennerlein, Jack T.; Fang, Dongping; Katz, Jeffrey N.; Caban-Martinez, Alberto J.

2015-01-01

Objective To develop a fatigue assessment scale and test its reliability and validity for commercial construction workers. Methods Using a two-phased approach, we first identified items for the development of a Fatigue Assessment Scale for Construction Workers (FASCW) through review of existing scales in the scientific literature, key informant interviews (n=11) and focus groups (3 groups with 6 workers each) with construction workers. The second phase included assessment for the reliability, validity and sensitivity of the new scale using a repeated-measures study design with a convenience sample of construction workers (n=144). Results Phase one resulted in a 16-item preliminary scale that after factor analysis yielded a final 10-item scale with two sub-scales (“Lethargy” and “Bodily Ailment”). During phase two, the FASCW and its subscales demonstrated satisfactory internal consistency (alpha coefficients were FASCW (0.91), Lethargy (0.86) and Bodily Ailment (0.84)) and acceptable test-retest reliability (Pearson Correlation Coefficients: 0.59–0.68; Intraclass Correlation Coefficients: 0.74–0.80). Correlation analysis substantiated concurrent and convergent validity. A discriminant analysis demonstrated that the FASCW differentiated between groups with arthritis status and different work hours. Conclusions The 10-item FASCW with good reliability and validity is an effective tool for assessing the severity of fatigue among construction workers. PMID:25603944
46 CFR 62.20-3 - Plans for information.

Code of Federal Regulations, 2011 CFR

2011-10-01

... detected by the crew, alternatives available to the crew, and possible design verification tests necessary... reliability of the design. It should be conducted to a level of detail necessary to demonstrate compliance... at an early stage of design. ...
Establishing the reliability and concurrent validity of physical performance tests using virtual reality equipment for community-dwelling healthy elders.

PubMed

Griswold, David; Rockwell, Kyle; Killa, Carri; Maurer, Michael; Landgraff, Nancy; Learman, Ken

2015-01-01

The aim of this study was to determine the reliability and concurrent validity of commonly used physical performance tests using the OmniVR Virtual Rehabilitation System for healthy community-dwelling elders. Participants (N = 40) were recruited by the authors and were screened for eligibility. The initial method of measurement was randomized to either virtual reality (VR) or clinically based measures (CM). Physical performance tests included the five times sit to stand, Timed Up and Go (TUG), Forward Functional Reach (FFR) and 30-s stand test. A random number generator determined the testing order. The test-re-test reliability for the VR and CM was determined. Furthermore, concurrent validity was determined using a Pearson product moment correlation (Pearson r). The VR demonstrated excellent reliability for 5 × STS intraclass correlation coefficient (ICC) = 0.931(3,1), FFR ICC = 0.846(3,1) and the TUG ICC = 0.944(3,1). The concurrent validity data for the VR and CM (ICC 3, k) were moderate for FFR ICC = 0.682, excellent 5 × STS ICC = 0.889 and excellent for the TUG ICC = 0.878. The concurrent validity of the 30-s stand test was good ICC = 0.735(3,1). This study supports the use of VR equipment for measuring physical performance tests in the clinic for healthy community-dwelling elders. Virtual reality equipment is not only used to treat balance impairments but it is also used to measure and determine physical impairments through the use of physical performance tests. Virtual reality equipment is a reliable and valid tool for collecting physical performance data for the 5 × STS, FFR, TUG and 30-s stand test for healthy community-dwelling elders.
Spanish validation of the Person-centered Care Assessment Tool (P-CAT).

PubMed

Martínez, Teresa; Suárez-Álvarez, Javier; Yanguas, Javier; Muñiz, José

2016-01-01

Person-centered Care (PCC) is an innovative approach which seeks to improve the quality of care services given to the care-dependent elderly. At present there are no Spanish language instruments for the evaluation of PCC delivered by elderly care services. The aim of this work is the adaptation and validation of the Person-centered Care Assessment Tool (P-CAT) for a Spanish population. The P-CAT was translated and adapted into Spanish, then given to a sample of 1339 front-line care professionals from 56 residential elderly care homes. The reliability and validity of the P-CAT were analyzed, within the frameworks of Classical Test Theory and Item Response Theory models. The Spanish P-CAT demonstrated good reliability, with an alpha coefficient of .88 and a test-retest reliability coefficient of .79. The P-CAT information function indicates that the test measures with good precision for the majority of levels of the measured variables (θ values between -2 and +1). The factorial structure of the test is essentially one-dimensional and the item discrimination indices are high, with values between .26 and .61. In terms of predictive validity, the correlations which stand out are between the P-CAT and organizational climate (r = .689), and the burnout factors; personal accomplishment (r = .382), and emotional exhaustion (r = - .510). The Spanish version of the P-CAT demonstrates good psychometric properties for its use in the evaluation of elderly care homes both professionally and in research.

Intrarater test-retest reliability of static and dynamic stability indexes measurement using the Biodex Stability System during unilateral stance.

PubMed

Arifin, Nooranida; Abu Osman, Noor Azuan; Wan Abas, Wan Abu Bakar

2014-04-01

The measurements of postural balance often involve measurement error, which affects the analysis and interpretation of the outcomes. In most of the existing clinical rehabilitation research, the ability to produce reliable measures is a prerequisite for an accurate assessment of an intervention after a period of time. Although clinical balance assessment has been performed in previous studies, none has determined the intrarater test-retest reliability of static and dynamic stability indexes during dominant single stance. In this study, one rater examined 20 healthy university students (female=12, male=8) in two sessions separated by a 7-day interval. Three stability indexes--the overall stability index (OSI), anterior/posterior stability index (APSI), and medial/lateral stability index (MLSI) in static and dynamic conditions--were measured during single dominant stance. Intraclass correlation coefficient (ICC), standard error of measurement (SEM) and 95% confidence interval (95% CI) were calculated. Test-retest ICCs for OSI, APSI, and MLSI were 0.85, 0.78, and 0.84 during static condition and were 0.77, 0.77, and 0.65 during dynamic condition, respectively. We concluded that the postural stability assessment using Biodex stability system demonstrates good-to-excellent test-retest reliability over a 1 week time interval.
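The standard error of measurement mentioned in the abstract above is commonly derived from the ICC as SEM = SD × sqrt(1 − ICC), and a minimal detectable change at the 95% level is often taken as 1.96 × sqrt(2) × SEM. The sketch below illustrates those two textbook formulas; whether the authors used exactly this form is not stated in the abstract, and the standard deviation in the example is a hypothetical placeholder because the abstract reports none.

```python
import math


def sem_from_icc(sd: float, icc: float) -> float:
    """Standard error of measurement from a between-subject SD and an ICC."""
    return sd * math.sqrt(1.0 - icc)


def mdc95(sem: float) -> float:
    """Minimal detectable change at the 95% confidence level."""
    return 1.96 * math.sqrt(2.0) * sem


if __name__ == "__main__":
    icc_osi_static = 0.85   # reported in the abstract
    assumed_sd = 1.0        # hypothetical: not reported in the abstract
    sem = sem_from_icc(assumed_sd, icc_osi_static)
    print(f"SEM = {sem:.3f}, MDC95 = {mdc95(sem):.3f}")
```

Because SEM scales with sqrt(1 − ICC), the lower dynamic-condition ICCs (for example 0.65 for MLSI) imply a noticeably larger measurement error for the same spread of scores.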
A systematic review of the factor structure and reliability of the Spence Children's Anxiety Scale.

PubMed

Orgilés, Mireia; Fernández-Martínez, Iván; Guillén-Riquelme, Alejandro; Espada, José P; Essau, Cecilia A

2016-01-15

The Spence Children's Anxiety Scale (SCAS) is a widely used instrument for assessing symptoms of anxiety disorders among children and adolescents. Previous studies have demonstrated its good reliability for children and adolescents from different backgrounds. However, remarkable variability in the reliability of the SCAS across studies and inconsistent results regarding its factor structure have been found. The present study aims to examine the SCAS factor structure by means of a systematic review with narrative synthesis, the mean reliability of the SCAS by means of a meta-analysis, and the influence of the moderators on the SCAS reliability. Databases employed to collect the studies included Scholar Google, PsycARTICLES, PsycINFO, Web of Science, and Scopus since 1997. Twenty-nine and 32 studies, which examined the factor structure and the internal consistency of the SCAS, respectively, were included. The SCAS was found to have strong internal consistency, influenced by different moderators. The systematic review demonstrated that the original six-factor model was supported by most studies. Factorial invariance studies (across age, gender, country) and test-retest reliability of the SCAS were not examined in this study. It is concluded that the SCAS is a reliable instrument for cross-cultural use, and it is suggested that the original six-factor model is appropriate for cross-cultural application. Copyright © 2015 Elsevier B.V. All rights reserved.
General test plan redundant sensor strapdown IMU evaluation program

NASA Technical Reports Server (NTRS)

Hartwell, T.; Irwin, H. A.; Miyatake, Y.; Wedekind, D. E.

1971-01-01

The general test plan for a redundant sensor strapdown inertial measuring unit evaluation program is presented. The inertial unit contains six gyros and three orthogonal accelerometers. The software incorporates failure detection and correction logic and a land vehicle navigation program. The principal objective of the test is a demonstration of the practicability, reliability, and performance of the inertial measuring unit with failure detection and correction in operational environments.
Testing the feasibility of eliciting preferences for health states from adolescents using direct methods.

PubMed

Crump, R Trafford; Lau, Ryan; Cox, Elizabeth; Currie, Gillian; Panepinto, Julie

2018-06-22

Measuring adolescents' preferences for health states can play an important role in evaluating the delivery of pediatric healthcare. However, formal evaluation of the common direct preference elicitation methods for health states has not been done with adolescents. Therefore, the purpose of this study is to test how these methods perform in terms of their feasibility, reliability, and validity for measuring health state preferences in adolescents. This study used a web-based survey of adolescents, 18 years of age or younger, living in the United States. The survey included four health states, each comprised of six attributes. Preferences for these health states were elicited using the visual analogue scale, time trade-off, and standard gamble. The feasibility, test-retest reliability, and construct validity of each of these preference elicitation methods were tested and compared. A total of 144 participants were included in this study. Using a web-based survey format to elicit preferences for health states from adolescents was feasible. A majority of participants completed all three elicitation methods, ranked those methods as being easy, with very few requiring assistance from someone else. However, all three elicitation methods demonstrated weak test-retest reliability, with Kendall's tau-a values ranging from 0.204 to 0.402. Similarly, all three methods demonstrated poor construct validity, with 9-50% of all rankings aligning with our expectations. There were no significant differences across age groups. Using a web-based survey format to elicit preferences for health states from adolescents is feasible. However, the reliability and construct validity of the methods used to elicit these preferences when using this survey format are poor. Further research into the effects of a web-based survey approach to eliciting preferences for health states from adolescents is needed before health services researchers or pediatric clinicians widely employ these methods.
Reliability and validity of the new Tanaka B Intelligence Scale scores: a group intelligence test.

PubMed

Uno, Yota; Mizukami, Hitomi; Ando, Masahiko; Yukihiro, Ryoji; Iwasaki, Yoko; Ozaki, Norio

2014-01-01

The present study evaluated the reliability and concurrent validity of the new Tanaka B Intelligence Scale, which is an intelligence test that can be administered to groups within a short period of time. The new Tanaka B Intelligence Scale and Wechsler Intelligence Scale for Children-Third Edition were administered to 81 subjects (mean age ± SD 15.2 ± 0.7 years) residing in a juvenile detention home; reliability was assessed using Cronbach's alpha coefficient, and concurrent validity was assessed using the one-way analysis of variance intraclass correlation coefficient. Moreover, receiver operating characteristic analysis for screening for individuals who have a deficit in intellectual function (an FIQ<70) was performed. In addition, stratum-specific likelihood ratios for detection of intellectual disability were calculated. The Cronbach's alpha for the new Tanaka B Intelligence Scale IQ (BIQ) was 0.86, and the intraclass correlation coefficient with FIQ was 0.83. Receiver operating characteristic analysis demonstrated an area under the curve of 0.89 (95% CI: 0.85-0.96). In addition, the stratum-specific likelihood ratio for the BIQ≤65 stratum was 13.8 (95% CI: 3.9-48.9), and the stratum-specific likelihood ratio for the BIQ≥76 stratum was 0.1 (95% CI: 0.03-0.4). Thus, intellectual disability could be ruled out or determined. The present results demonstrated that the new Tanaka B Intelligence Scale score had high reliability and concurrent validity with the Wechsler Intelligence Scale for Children-Third Edition score. Moreover, the post-test probability for the BIQ could be calculated when screening for individuals who have a deficit in intellectual function. The new Tanaka B Intelligence Test is convenient and can be administered within a variety of settings. This enables evaluation of intellectual development even in settings where performing intelligence tests has previously been difficult.
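The post-test probability referred to in the abstract above follows from a likelihood ratio through the odds form of Bayes' rule: post-test odds = pre-test odds × likelihood ratio. The sketch below applies this to the two stratum-specific likelihood ratios quoted in the abstract; the pre-test probability of 0.20 is a hypothetical value chosen only for illustration and does not come from the study.

```python
def post_test_probability(pre_test_prob: float, likelihood_ratio: float) -> float:
    """Convert a pre-test probability to a post-test probability.

    Uses post-test odds = pre-test odds * likelihood ratio.
    """
    pre_odds = pre_test_prob / (1.0 - pre_test_prob)
    post_odds = pre_odds * likelihood_ratio
    return post_odds / (1.0 + post_odds)


if __name__ == "__main__":
    assumed_pre_test = 0.20  # hypothetical prevalence, not from the study
    for label, lr in (("BIQ<=65", 13.8), ("BIQ>=76", 0.1)):
        p = post_test_probability(assumed_pre_test, lr)
        print(f"{label}: post-test probability = {p:.3f}")
```

Under that assumed prevalence, a score in the low stratum raises the probability of intellectual disability to roughly three in four, while a score in the high stratum lowers it to a few percent, which is the sense in which the abstract says the condition could be "ruled out or determined".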
Reliability and Validity of the New Tanaka B Intelligence Scale Scores: A Group Intelligence Test

PubMed Central

Uno, Yota; Mizukami, Hitomi; Ando, Masahiko; Yukihiro, Ryoji; Iwasaki, Yoko; Ozaki, Norio

2014-01-01

Objective The present study evaluated the reliability and concurrent validity of the new Tanaka B Intelligence Scale, which is an intelligence test that can be administered to groups within a short period of time. Methods The new Tanaka B Intelligence Scale and Wechsler Intelligence Scale for Children-Third Edition were administered to 81 subjects (mean age ± SD 15.2±0.7 years) residing in a juvenile detention home; reliability was assessed using Cronbach’s alpha coefficient, and concurrent validity was assessed using the one-way analysis of variance intraclass correlation coefficient. Moreover, receiver operating characteristic analysis for screening for individuals who have a deficit in intellectual function (an FIQ<70) was performed. In addition, stratum-specific likelihood ratios for detection of intellectual disability were calculated. Results The Cronbach’s alpha for the new Tanaka B Intelligence Scale IQ (BIQ) was 0.86, and the intraclass correlation coefficient with FIQ was 0.83. Receiver operating characteristic analysis demonstrated an area under the curve of 0.89 (95% CI: 0.85–0.96). In addition, the stratum-specific likelihood ratio for the BIQ≤65 stratum was 13.8 (95% CI: 3.9–48.9), and the stratum-specific likelihood ratio for the BIQ≥76 stratum was 0.1 (95% CI: 0.03–0.4). Thus, intellectual disability could be ruled out or determined. Conclusion The present results demonstrated that the new Tanaka B Intelligence Scale score had high reliability and concurrent validity with the Wechsler Intelligence Scale for Children-Third Edition score. Moreover, the post-test probability for the BIQ could be calculated when screening for individuals who have a deficit in intellectual function. The new Tanaka B Intelligence Test is convenient and can be administered within a variety of settings. This enables evaluation of intellectual development even in settings where performing intelligence tests has previously been difficult. PMID:24940880
Validating survey measurement scales for AIDS-related knowledge and stigma among construction workers in South Africa.

PubMed

Bowen, Paul; Govender, Rajen; Edwards, Peter

2016-01-23

Construction workers in South Africa are regarded as a high-risk group in the context of HIV/AIDS. HIV testing is pivotal to controlling HIV transmission and providing palliative care, and AIDS-related knowledge and stigma are key issues in addressing the likelihood of testing behaviour. In exploring these issues, various studies have employed an 11-item AIDS-related knowledge scale (Kalichman and Simbayi, AIDS Care 16:572-580, 2004) and a 9-item stigma scale (Kalichman et al., AIDS Behav 9:135-143, 2005), but little evidence exists confirming the psychometric properties of these scales. Using survey data from 512 construction workers in the Western Cape, South Africa, this research examines the validity and reliability of the two scales through exploratory and confirmatory factor analysis and internal consistency tests. From confirmatory factor analysis, a revised 10-item knowledge scale was developed (χ2 /df ratio = 1.675, CFI = 0.982, RMSEA = 0.038, and Hoelter (95 %) = 393). A revised 8-item stigma scale was also developed (χ2 /df ratio = 1.929, CFI = 0.974, RMSEA = 0.045, and Hoelter (95 %) = 380). Both revised scales demonstrated good model fit and all factor loadings were significant (p < 0.01). Reliability analysis demonstrated excellent to good internal consistency, with alpha values of 0.80 and 0.74, respectively. Both revised scales also demonstrated satisfactory convergent and divergent validity. Limitations of the original survey from which the data was obtained include the failure to properly account for respondent selection of language for completion of the survey, use of ethnicity as a proxy for identifying the native language of participants, the limited geographical area from which the survey data was collected, and the limitations associated with the convenience sample. A limitation of the validation study was the lack of available data for a more robust examination of reliability beyond internal consistency, such as test-retest reliability.
The revised knowledge and stigma scales offered here hold considerable promise as measures of AIDS-related knowledge and stigma among South African construction workers.
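Several records in this listing report internal consistency as Cronbach's alpha (for example, the values of 0.80 and 0.74 above). As a minimal sketch of how such a coefficient is commonly computed, the following uses the standard formula alpha = k/(k-1) × (1 − Σ item variances / variance of total scores). The function name and the response data are hypothetical and are not taken from any of the studies listed here.

```python
from statistics import variance


def cronbach_alpha(items):
    """Cronbach's alpha for a list of item-score lists.

    `items` holds one list per questionnaire item; position j in every list
    is the score given by respondent j. Sample variances (n - 1) are used.
    """
    k = len(items)
    if k < 2:
        raise ValueError("at least two items are required")
    n = len(items[0])
    if any(len(col) != n for col in items):
        raise ValueError("every item needs a score from every respondent")
    totals = [sum(scores) for scores in zip(*items)]  # total score per respondent
    item_variance_sum = sum(variance(col) for col in items)
    return k / (k - 1) * (1 - item_variance_sum / variance(totals))


# Hypothetical data: 3 items answered by 5 respondents on a 1-5 scale.
responses = [
    [2, 3, 4, 4, 5],
    [1, 3, 3, 4, 5],
    [2, 2, 4, 5, 5],
]
alpha = cronbach_alpha(responses)
print(f"alpha = {alpha:.3f}")
```

Items that move together across respondents push alpha towards 1; unrelated items push it towards 0.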
Development and validation of the Smartphone Addiction Inventory (SPAI).

PubMed

Lin, Yu-Hsuan; Chang, Li-Ren; Lee, Yang-Han; Tseng, Hsien-Wei; Kuo, Terry B J; Chen, Sue-Huei

2014-01-01

The aim of this study was to develop a self-administered scale based on the special features of smartphones. The reliability and validity of the Smartphone Addiction Inventory (SPAI) were demonstrated. A total of 283 participants were recruited from Dec. 2012 to Jul. 2013 to complete a set of questionnaires, including a 26-item SPAI modified from the Chinese Internet Addiction Scale and a phantom vibration and ringing syndrome questionnaire. There were 260 males and 23 females, aged 22.9 ± 2.0 years. Exploratory factor analysis, internal-consistency test, test-retest, and correlation analysis were conducted to verify the reliability and validity of the SPAI. Correlations between each subscale and phantom vibration and ringing were also explored. Exploratory factor analysis yielded four factors: compulsive behavior, functional impairment, withdrawal and tolerance. Test-retest reliabilities (intraclass correlations = 0.74-0.91) and internal consistency (Cronbach's α = 0.94) were all satisfactory. The four subscales had moderate to high correlations (0.56-0.78), but had no or very low correlation to phantom vibration/ringing syndrome. This study provides evidence that the SPAI is a valid and reliable, self-administered screening tool to investigate smartphone addiction. Phantom vibration and ringing might be independent entities of smartphone addiction.
Construct validity, test-retest reliability and internal consistency of the Thai version of the disabilities of the arm, shoulder and hand questionnaire (DASH-TH) in patients with carpal tunnel syndrome.

PubMed

Buntragulpoontawee, Montana; Phutrit, Suphatha; Tongprasert, Siam; Wongpakaran, Tinakon; Khunachiva, Jeeranan

2018-03-27

This study evaluated additional psychometric properties of the Thai version of the disabilities of the arm, shoulder and hand questionnaire (DASH-TH), including test-retest reliability, construct validity, and internal consistency, in patients with carpal tunnel syndrome. To determine construct validity, the Thai EuroQOL questionnaire (EQ-5D-5L) was also administered in order to examine convergent and divergent validity. Fifty patients completed both questionnaires. The DASH-TH showed excellent test-retest reliability (intraclass correlation coefficient = 0.811) and internal consistency (Cronbach's alpha = 0.911). The exploratory factor analysis yielded a six-factor solution while the confirmatory factor analysis denoted that the hypothesized model adequately fit the data with a comparative fit index of 0.967 and a Tucker-Lewis index of 0.964. The related subscales between the DASH-TH and the Thai EQ-5D-5L were significantly correlated, indicating the DASH-TH's convergent and discriminant validity. The DASH-TH demonstrated good reliability, internal consistency, construct validity, and multidimensionality in assessing upper extremity function in carpal tunnel syndrome patients.
Automated Measurement of Visual Acuity in Pediatric Ophthalmic Patients Using Principles of Game Design and Tablet Computers.

PubMed

Aslam, Tariq M; Tahir, Humza J; Parry, Neil R A; Murray, Ian J; Kwak, Kun; Heyes, Richard; Salleh, Mahani M; Czanner, Gabriela; Ashworth, Jane

2016-10-01

To report on the utility of a computer tablet-based method for automated testing of visual acuity in children based on the principles of game design. We describe the testing procedure and present repeatability as well as agreement of the score with accepted visual acuity measures. Reliability and validity study. Setting: Manchester Royal Eye Hospital Pediatric Ophthalmology Outpatients Department. Total of 112 sequentially recruited patients. For each patient 1 eye was tested with the Mobile Assessment of Vision by intERactIve Computer for Children (MAVERIC-C) system, consisting of a software application running on a computer tablet, housed in a bespoke viewing chamber. The application elicited touch screen responses using a game design to encourage compliance and automatically acquire visual acuity scores of participating patients. Acuity was then assessed by an examiner with a standard chart-based near ETDRS acuity test before the MAVERIC-C assessment was repeated. Reliability of MAVERIC-C near visual acuity score and agreement of MAVERIC-C score with near ETDRS chart for visual acuity. Altogether, 106 children (95%) completed the MAVERIC-C system without assistance. The vision scores demonstrated satisfactory reliability, with test-retest VA scores having a mean difference of 0.001 (SD ±0.136) and limits of agreement of 2 SD (LOA) of ±0.267. Comparison with the near ETDRS chart showed agreement with a mean difference of -0.0879 (±0.106) with LOA of ±0.208. This study demonstrates promising utility for software using a game design to enable automated testing of acuity in children with ophthalmic disease in an objective and accurate manner. Copyright © 2016 Elsevier Inc. All rights reserved.
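The limits of agreement quoted in the abstract above can be reproduced from the reported standard deviations. The sketch below assumes the usual Bland-Altman convention of a half-width equal to 1.96 × SD of the differences; the abstract describes it as "2 SD", and the reported figures appear consistent with the 1.96 multiplier. The function name is illustrative only.

```python
def limits_of_agreement(mean_diff, sd_diff, z=1.96):
    """Return (lower limit, upper limit, half-width) for Bland-Altman agreement.

    Assumes approximately normally distributed paired differences.
    """
    half_width = z * sd_diff
    return mean_diff - half_width, mean_diff + half_width, half_width


# Mean differences and SDs as reported in the abstract above.
retest = limits_of_agreement(0.001, 0.136)    # MAVERIC-C test vs. retest
chart = limits_of_agreement(-0.0879, 0.106)   # MAVERIC-C vs. near ETDRS chart

print(f"test-retest half-width: {retest[2]:.3f}")
print(f"chart comparison half-width: {chart[2]:.3f}")
```

With these inputs the half-widths round to 0.267 and 0.208, matching the reported LOA values.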
The 6-min mastication test: a unique test to assess endurance of continuous chewing, normal values, reliability, reproducibility and usability in patients with mitochondrial disease.

PubMed

van den Engel-Hoek, L; Knuijt, S; van Gerven, M H J C; Lagarde, M L J; Groothuis, J T; de Groot, I J M; Janssen, M C H

2017-03-01

In patients with mitochondrial disease, fatigue and muscle problems are the most common complaints. They also experience these complaints during mastication. To measure endurance of continuous mastication in patients with mitochondrial diseases, the 6-min mastication test (6MMT) was developed. This study included the collection of normal data for the 6MMT in a healthy population (children and adults). During 6 min of continuous mastication on a chew tube, chewing cycles per minute, total number of chewing cycles and the difference between minute 1 (M1) and minute 6 (M6) were collected in 271 healthy participants (5-80 years old). These results were compared with those of nine paediatric and 25 adult patients with a mitochondrial disease. Visual analogue scale (VAS) scores were collected directly after the test and after 5 min. A qualitative rating was made on masticatory movements. The reproducibility of the 6MMT in the healthy population with an interval of approximately 2 weeks was good. The inter-rater reliability for the observations was excellent. The patient group demonstrated a lower total number of chewing cycles or had greater differences between M1 and M6. The 6MMT is a reliable and objective test to assess endurance of continuous chewing. It demonstrates the ability of healthy children and adults to chew during 6 min with a highly stable frequency of mastication movements. The test may give an explanation for the masticatory problems in patient groups who complain of pain and fatigue during mastication. © 2017 John Wiley & Sons Ltd.
Translation and validation of the Dutch new Knee Society Scoring System ©.

PubMed

Van Der Straeten, Catherine; Witvrouw, Erik; Willems, Tine; Bellemans, Johan; Victor, Jan

2013-11-01

A new version of The Knee Society Knee Scoring System(©) (KSS) has recently been developed. Before this scale can be used in non-English-speaking populations, it has to be translated and validated for a particular population. We evaluated the construct and content validity, the test-retest reliability, and the internal consistency of the Dutch version of the New Knee Society KSS. A Dutch translation was performed using a forward-backward translation protocol. We tested the construct validity of the Dutch New KSS by comparing it with the Dutch versions of the WOMAC, Knee Injury and Osteoarthritis Outcome Score (KOOS), and SF-12 scores in 137 patients undergoing total knee arthroplasty (TKA). Content validity was assessed by comparing pre- and postoperative scores and by checking floor and ceiling effects. To evaluate test-retest reliability and consistency, 47 patients completed the questionnaire a second time with a mean interval of 8 days (range, 2-20 days) between tests. Construct validity was demonstrated because the Dutch New KSS correlated well with the Dutch WOMAC (r = -0.751; p < 0.001), Dutch KOOS (r = -0.723; p < 0.001), and Dutch SF-12 (r = 0.569; p < 0.001). There was a significant difference between pre- and postoperative scores (p < 0.001) in line with the other scores. Test-retest reliability proved excellent with an intraclass correlation coefficient between 0.73 and 0.92 depending on the domain tested. Consistency as indicated by Cronbach's alpha ranging from 0.84 to 0.96 was good to excellent. As demonstrated by the validation procedure, the Dutch New KSS is an excellent instrument to evaluate TKA outcome in Dutch-speaking patients.
Comparison of validity and reliability of the Migraine disability assessment (MIDAS) versus headache impact test (HIT) in an Iranian population.

PubMed

Ghorbani, Abbas; Chitsaz, Ahmad

2011-01-01

Migraine is one of the most common types of headache, affecting 11% or more of the adult population. Recently, researchers have designed two questionnaires, namely the Headache Impact Test (HIT) and the Migraine Disability Assessment (MIDAS), with the aim of improving migraine care. These two tests provide a standard measurement of migraine's effects on people's lifestyle and divide patients into 4 groups (grades) based on headache intensity. The aim of this study was to compare the validity and reliability of these two tests. This study was designed as a multicenter, descriptive study to compare the validity and reliability of the Persian versions of the MIDAS and HIT questionnaires in 240 males and females with a migraine diagnosis according to the criteria for headache and facial pain of the International Headache Society (IHS). The patients were enrolled in the study from 3 neurology clinics in Isfahan, Iran, between July 2004 and January 2005 and were evaluated at baseline (visit 1) and 4 weeks later (visit 2). According to our study, there was a high correlation between the two tests (r = 0.94). This decreased their MIDAS grade in comparison to their grade HIT questionnaire. These findings demonstrated that the Persian version of the HIT has the same validity and reliability as the MIDAS. Replying to the HIT questionnaire was easier than replying to the MIDAS for Iranian patients. Physicians can reliably use the Persian translation of both the MIDAS and HIT questionnaires to define the severity of illness and its treatment strategy as a self-administered report by migraine patients. However, we recommend the HIT for its simplicity in headache clinics.
Development of a scale to measure individuals’ ratings of peace

PubMed Central

2014-01-01

Background: The evolving concept of peace-building and the interplay between peace and health is examined in many venues, including at the World Health Assembly. However, without a metric to determine effectiveness of intervention programs all efforts are prone to subjective assessment. This paper develops a psychometric index that lays the foundation for measuring community peace stemming from intervention programs. Methods: After developing a working definition of ‘peace’ and delineating a Peace Evaluation Across Cultures and Environments (PEACE) scale with seven constructs comprised of 71 items, a beta version of the index was pilot-tested. Two hundred and fifty subjects in three sites in the U.S. were studied using a five-point Likert scale to evaluate the psychometric functioning of the PEACE scale. Known groups validation was performed using the SOS-10. In addition, test-retest reliability was performed on 20 subjects. Results: The preliminary data demonstrated that the scale has acceptable psychometric properties for measuring an individual’s level of peacefulness. The study also provides reliability and validity data for the scale. The data demonstrated internal consistency, correlation between data and psychological well-being, and test-retest reliability. Conclusions: The PEACE scale may serve as a novel assessment tool in the health sector and be valuable in monitoring and evaluating the peace-building impact of health initiatives in conflict-affected regions. PMID:25298781
Test-retest reliability and comparability of paper and computer questionnaires for the Finnish version of the Tampa Scale of Kinesiophobia.

PubMed

Koho, P; Aho, S; Kautiainen, H; Pohjolainen, T; Hurri, H

2014-12-01

To estimate the internal consistency, test-retest reliability and comparability of paper and computer versions of the Finnish version of the Tampa Scale of Kinesiophobia (TSK-FIN) among patients with chronic pain. In addition, patients' personal experiences of completing both versions of the TSK-FIN and preferences between these two methods of data collection were studied. Test-retest reliability study. Paper and computer versions of the TSK-FIN were completed twice on two consecutive days. The sample comprised 94 consecutive patients with chronic musculoskeletal pain participating in a pain management or individual rehabilitation programme. The group rehabilitation design consisted of physical and functional exercises, evaluation of the social situation, psychological assessment of pain-related stress factors, and personal pain management training in order to regain overall function and mitigate the inconvenience of pain and fear-avoidance behaviour. The mean TSK-FIN score was 37.1 [standard deviation (SD) 8.1] for the computer version and 35.3 (SD 7.9) for the paper version. The mean difference between the two versions was 1.9 (95% confidence interval 0.8 to 2.9). Test-retest reliability was 0.89 for the paper version and 0.88 for the computer version. Internal consistency was considered to be good for both versions. The intraclass correlation coefficient for comparability was 0.77 (95% confidence interval 0.66 to 0.85), indicating substantial reliability between the two methods. Both versions of the TSK-FIN demonstrated substantial intertest reliability, good test-retest reliability, good internal consistency and acceptable limits of agreement, suggesting their suitability for clinical use. However, subjects tended to score higher when using the computer version. As such, in an ideal situation, data should be collected in a similar manner throughout the course of rehabilitation or clinical research. Copyright © 2014 Chartered Society of Physiotherapy. Published by Elsevier Ltd. All rights reserved.
Test-Retest Reliability of the Multiple Sleep Latency Test in Narcolepsy without Cataplexy and Idiopathic Hypersomnia

PubMed Central

Trotti, Lynn Marie; Staab, Beth A.; Rye, David B.

2013-01-01

Study Objectives: Differentiation of narcolepsy without cataplexy from idiopathic hypersomnia relies entirely upon the multiple sleep latency test (MSLT). However, the test-retest reliability for these central nervous system hypersomnias has never been determined. Methods: Patients with narcolepsy without cataplexy, idiopathic hypersomnia, and physiologic hypersomnia who underwent two diagnostic multiple sleep latency tests were identified retrospectively. Correlations between the mean sleep latencies on the two studies were evaluated, and we probed for demographic and clinical features associated with reproducibility versus change in diagnosis. Results: Thirty-six patients (58% women, mean age 34 years) were included. Inter-test interval was 4.2 ± 3.8 years (range 2.5 months to 16.9 years). Mean sleep latencies on the first and second tests were 5.5 (± 3.7 SD) and 7.3 (± 3.9) minutes, respectively, with no significant correlation (r = 0.17, p = 0.31). A change in diagnosis occurred in 53% of patients, and was accounted for by a difference in the mean sleep latency (N = 15, 42%) or the number of sleep onset REM periods (N = 11, 31%). The only feature predictive of a diagnosis change was a history of hypnagogic or hypnopompic hallucinations. Conclusions: The multiple sleep latency test demonstrates poor test-retest reliability in a clinical population of patients with central nervous system hypersomnia evaluated in a tertiary referral center. Alternative diagnostic tools are needed. Citation: Trotti LM; Staab BA; Rye DB. Test-retest reliability of the multiple sleep latency test in narcolepsy without cataplexy and idiopathic hypersomnia. J Clin Sleep Med 2013;9(8):789-795. PMID:23946709
Psychometrics of the MHSIP Adult Consumer Survey.

PubMed

Jerrell, Jeanette M

2006-10-01

The reliability and validity of the Mental Health Statistics Improvement Program (MHSIP) Adult Consumer Survey were assessed in a statewide convenience sample of 459 persons with severe mental illness served through a public mental health system. Consistent with previous findings and the intent of its developers, three factors were identified that demonstrate good internal consistency, moderate test-retest reliability, and good convergent validity with consumer perceptions of other aspects of their care. The reliability and validity of the MHSIP Adult Consumer Survey documented in this study underscore its scientific and practical utility as an abbreviated tool for assessing access, quality and appropriateness, and outcome in mental health service systems.
Field reliability of Ricor microcoolers

NASA Astrophysics Data System (ADS)

Pundak, N.; Porat, Z.; Barak, M.; Zur, Y.; Pasternak, G.

2009-05-01

Over the past 25 years, Ricor has fielded in excess of 50,000 Stirling cryocoolers, among which approximately 30,000 units are of the micro integral rotary driven type. The statistical population of the fielded units is counted in thousands or hundreds per application category. In contrast to MTTF values as gathered and presented based on standard reliability demonstration tests, where the failure of the weakest component dictates the end of product life, in the case of field reliability, where design and workmanship failures are counted and considered, the values are usually reported in number of failures per million hours of operation. These values are important and relevant to the prediction of service capabilities and plans.
Refrigerant leak detector

NASA Technical Reports Server (NTRS)

Byrne, E. J.

1979-01-01

Quantitative leak detector visually demonstrates refrigerant loss from precision volume of large refrigeration system over established period of time from single test point. Mechanical unit is less costly than electronic "sniffers" and is more reliable due to absence of electronic circuits that are susceptible to drift.
Choosing a reliability inspection plan for interval censored data

DOE PAGES

Lu, Lu; Anderson-Cook, Christine Michaela

2017-04-19

Reliability test plans are important for producing precise and accurate assessment of reliability characteristics. This paper explores different strategies for choosing between possible inspection plans for interval censored data given a fixed testing timeframe and budget. A new general cost structure is proposed for guiding precise quantification of total cost in inspection test plan. Multiple summaries of reliability are considered and compared as the criteria for choosing the best plans using an easily adapted method. Different cost structures and representative true underlying reliability curves demonstrate how to assess different strategies given the logistical constraints and nature of the problem. Results show several general patterns exist across a wide variety of scenarios. Given the fixed total cost, plans that inspect more units with less frequency based on equally spaced time points are favored due to the ease of implementation and consistent good performance across a large number of case study scenarios. Plans with inspection times chosen based on equally spaced probabilities offer improved reliability estimates for the shape of the distribution, mean lifetime, and failure time for a small fraction of population only for applications with high infant mortality rates. The paper uses a Monte Carlo simulation based approach in addition to the common evaluation based on the asymptotic variance and offers comparison and recommendation for different applications with different objectives. Additionally, the paper outlines a variety of different reliability metrics to use as criteria for optimization, presents a general method for evaluating different alternatives, as well as provides case study results for different common scenarios.
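The abstract above contrasts inspection times that are equally spaced in time with times that are equally spaced in failure probability. The following is a minimal sketch of one plausible reading of those two strategies, assuming a Weibull lifetime model. The distribution choice, the parameter values, and the function names are assumptions made for illustration and are not taken from the paper.

```python
import math


def weibull_cdf(t, shape, scale):
    """Probability that a unit has failed by time t under a Weibull model."""
    return 1.0 - math.exp(-((t / scale) ** shape))


def weibull_quantile(p, shape, scale):
    """Time by which a fraction p of units is expected to have failed."""
    return scale * (-math.log(1.0 - p)) ** (1.0 / shape)


def equal_time_plan(horizon, k):
    """k inspection times evenly spaced over the test horizon."""
    return [horizon * i / k for i in range(1, k + 1)]


def equal_probability_plan(horizon, k, shape, scale):
    """k inspection times splitting the failure probability accumulated
    by the horizon into k equal parts (requires a planning guess of the
    lifetime distribution)."""
    p_end = weibull_cdf(horizon, shape, scale)
    return [weibull_quantile(p_end * i / k, shape, scale) for i in range(1, k + 1)]


# Hypothetical planning values: shape < 1 mimics high infant mortality.
horizon, k, shape, scale = 1000.0, 4, 0.5, 1000.0
by_time = equal_time_plan(horizon, k)
by_prob = equal_probability_plan(horizon, k, shape, scale)
print("equal time:       ", [round(t, 1) for t in by_time])
print("equal probability:", [round(t, 1) for t in by_prob])
```

Under a shape parameter below 1, the equal-probability plan places its early inspections much sooner than the equal-time plan, which fits the abstract's remark that such plans help mainly when infant mortality is high.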

Cross-Cultural Adaptation, Reliability and Validity Study of the Persian Version of the Clinical COPD Questionnaire.

PubMed

Hasanpour, Neda; Attarbashi Moghadam, Behrouz; Sami, Ramin; Tavakol, Kamran

2016-08-01

The clinical COPD questionnaire (CCQ) has been developed to measure the health status of COPD patients. The aim of this study was to translate the CCQ into the Persian language and assess the validity and reliability of the translated version. We used a forward-backward procedure to translate the questionnaire. In a cross-sectional study, 100 COPD patients and 50 healthy subjects over 40 years old were selected to assess the reliability and construct validity of the instrument. Face and content validity were used to assess the questionnaire's validity. Validity was examined in a population of patients with COPD, using the Persian validated version of the St George's Respiratory Questionnaire (PSGRQ). In order to assess the questionnaire's reliability, the intraclass correlation coefficient (ICC) and Cronbach's alpha were calculated. Test-retest reliability was tested by re-administering the Persian version of the CCQ (PCCQ) after 1 week. The test-retest data demonstrate that the PCCQ has excellent reliability (ICCs for all 3 domains were higher than 0.9). Internal consistency was found by Cronbach's alpha to be 0.96, 0.94, 0.97, and 0.98 for the symptom, mental state, functional state and total scores, respectively. In addition, the correlation between the components of the PCCQ and PSGRQ showed satisfactory construct validity. Analyzing the data from healthy subjects and patients revealed that the PCCQ has acceptable discriminant validity. In general, the PCCQ had satisfactory reliability and validity for assessing the health-related quality of life status of Iranian COPD patients.
A fiber-coupled 9xx module with tap water cooling

NASA Astrophysics Data System (ADS)

Schleuning, D.; Anthon, D.; Chryssis, A.; Ryu, G.; Liu, G.; Winhold, H.; Fan, L.; Xu, Z.; Tanbun-Ek, T.; Lehkonen, S.; Acklin, B.

2016-03-01

A novel, 9XX nm fiber-coupled module using arrays of highly reliable laser diode bars has been developed. The module is capable of multi-kW output power in a beam parameter product of 80 mm-mrad. The module incorporates a hard-soldered, isolated stack package compatible with tap-water cooling. Using extensive, accelerated multi-cell life-testing, with more than ten million device hours of test, we have demonstrated an MTTF for emitters of >500,000 hrs. In addition, we have qualified the module in hard-pulse on-off cycling and stringent environmental tests. Finally, we have demonstrated promising results for a next generation 9xx nm chip design currently in applications and qualification testing.
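A demonstrated MTTF of the kind quoted above is typically a one-sided lower confidence bound derived from accumulated device hours and the number of failures observed. The abstract does not state the failure count, the confidence level, or the acceleration factors, so the sketch below is only an illustration under stated assumptions: a constant failure rate (exponential lifetimes), a time-terminated test, and device hours already converted to use conditions. The function names and the example inputs are hypothetical.

```python
import math


def poisson_cdf(r, mu):
    """P(N <= r) for a Poisson count N with mean mu."""
    term = math.exp(-mu)
    total = term
    for k in range(1, r + 1):
        term *= mu / k
        total += term
    return total


def mttf_lower_bound(device_hours, failures, confidence=0.90):
    """One-sided lower confidence bound on MTTF for a time-terminated test.

    Finds the largest expected failure count mu that is still compatible
    with seeing `failures` or fewer failures, i.e. P(N <= failures | mu)
    equals 1 - confidence, then returns device_hours / mu.
    """
    target = 1.0 - confidence
    lo, hi = 0.0, 1.0
    while poisson_cdf(failures, hi) > target:  # expand until the root is bracketed
        hi *= 2.0
    for _ in range(200):                       # bisection on a decreasing function
        mid = (lo + hi) / 2.0
        if poisson_cdf(failures, mid) > target:
            lo = mid
        else:
            hi = mid
    return device_hours / hi


# Hypothetical inputs: 10 million device hours, illustrative failure counts.
for r in (0, 5, 10):
    print(r, "failures ->", round(mttf_lower_bound(1e7, r, 0.90)), "h")
```

With zero failures the bound reduces to the closed form T / (−ln(1 − confidence)); each additional failure lowers the MTTF that the same number of device hours can demonstrate.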
Technology verification phase. Dynamic isotope power system. Final report

DOE Office of Scientific and Technical Information (OSTI.GOV)

Halsey, D.G.

1982-03-10

The Phase I requirements of the Kilowatt Isotope Power System (KIPS) program were to make a detailed Flight System Conceptual Design (FSCD) for an isotope fueled organic Rankine cycle power system and to build and test a Ground Demonstration System (GDS) which simulated as closely as possible the operational characteristics of the FSCD. The activities and results of Phase II, the Technology Verification Phase, of the program are reported. The objectives of this phase were to increase system efficiency to 18.1% by component development, to demonstrate system reliability by a 5000 h endurance test and to update the flight system design. During Phase II, system performance was improved from 15.1% to 16.6%, an endurance test of 2000 h was performed while the flight design analysis was limited to a study of the General Purpose Heat Source, a study of the regenerator manufacturing technique and analysis of the hardness of the system to a laser threat. It was concluded from these tests that the GDS is basically prototypic of a flight design; all components necessary for satisfactory operation were demonstrated successfully at the system level; over 11,000 total h of operation without any component failure attested to the inherent reliability of this type of system; and some further development is required, specifically in the area of performance. (LCL)
Advanced Stirling Convertor Heater Head Durability and Reliability Quantification

NASA Technical Reports Server (NTRS)

Krause, David L.; Shah, Ashwin R.; Korovaichuk, Igor; Kalluri, Sreeramesh

2008-01-01

The National Aeronautics and Space Administration (NASA) has identified the high efficiency Advanced Stirling Radioisotope Generator (ASRG) as a candidate power source for long duration Science missions, such as lunar applications, Mars rovers, and deep space missions, that require reliable design lifetimes of up to 17 years. Resistance to creep deformation of the MarM-247 heater head (HH), a structurally critical component of the ASRG Advanced Stirling Convertor (ASC), under high temperatures (up to 850 C) is a key design driver for durability. Inherent uncertainties in the creep behavior of the thin-walled HH and the variations in the wall thickness, control temperature, and working gas pressure need to be accounted for in the life and reliability prediction. Due to the availability of very limited test data, assuring life and reliability of the HH is a challenging task. The NASA Glenn Research Center (GRC) has adopted an integrated approach combining available uniaxial MarM-247 material behavior testing, HH benchmark testing and advanced analysis in order to demonstrate the integrity, life and reliability of the HH under expected mission conditions. The proposed paper describes analytical aspects of the deterministic and probabilistic approaches and results. The deterministic approach involves development of the creep constitutive model for the MarM-247 (akin to the Oak Ridge National Laboratory master curve model used previously for Inconel 718 (Special Metals Corporation)) and nonlinear finite element analysis to predict the mean life. The probabilistic approach includes evaluation of the effect of design variable uncertainties in material creep behavior, geometry and operating conditions on life and reliability for the expected life. The sensitivity of the uncertainties in the design variables on the HH reliability is also quantified, and guidelines to improve reliability are discussed.
The Healthy Brain Network Serial Scanning Initiative: a resource for evaluating inter-individual differences and their reliabilities across scan conditions and sessions

PubMed Central

O’Connor, David; Potler, Natan Vega; Kovacs, Meagan; Xu, Ting; Ai, Lei; Pellman, John; Vanderwal, Tamara; Parra, Lucas C.; Cohen, Samantha; Ghosh, Satrajit; Escalera, Jasmine; Grant-Villegas, Natalie; Osman, Yael; Bui, Anastasia; Craddock, R. Cameron

2017-01-01

Background: Although typically measured during the resting state, a growing literature is illustrating the ability to map intrinsic connectivity with functional MRI during task and naturalistic viewing conditions. These paradigms are drawing excitement due to their greater tolerability in clinical and developing populations and because they enable a wider range of analyses (e.g., inter-subject correlations). To be clinically useful, the test-retest reliability of connectivity measured during these paradigms needs to be established. This resource provides data for evaluating test-retest reliability for full-brain connectivity patterns detected during each of four scan conditions that differ with respect to level of engagement (rest, abstract animations, movie clips, flanker task). Data are provided for 13 participants, each scanned in 12 sessions with 10 minutes for each scan of the four conditions. Diffusion kurtosis imaging data was also obtained at each session. Findings: Technical validation and demonstrative reliability analyses were carried out at the connection-level using the Intraclass Correlation Coefficient and at network-level representations of the data using the Image Intraclass Correlation Coefficient. Variation in intrinsic functional connectivity across sessions was generally found to be greater than that attributable to scan condition. Between-condition reliability was generally high, particularly for the frontoparietal and default networks. Between-session reliabilities obtained separately for the different scan conditions were comparable, though notably lower than between-condition reliabilities. Conclusions: This resource provides a test-bed for quantifying the reliability of connectivity indices across subjects, conditions and time. The resource can be used to compare and optimize different frameworks for measuring connectivity and data collection parameters such as scan length. Additionally, investigators can explore the unique perspectives of the brain's functional architecture offered by each of the scan conditions. PMID:28369458
Using a dry electrode EEG device during balance tasks in healthy young-adult males: Test-retest reliability analysis.

PubMed

Collado-Mateo, Daniel; Adsuar, Jose C; Olivares, Pedro R; Cano-Plasencia, Ricardo; Gusi, Narcis

2015-01-01

The analysis of brain activity during balance is an important topic in different fields of science. Given that all measurements involve an error that is caused by different agents, such as the instrument, the researcher, or natural human variability, a test-retest reliability evaluation of the electroencephalographic assessment is a necessary starting point. However, there is a lack of information about the reliability of electroencephalographic measurements, especially for a new wireless device with dry electrodes. The current study aims to analyze the reliability of electroencephalographic measurements from a wireless device using dry electrodes during two different balance tests. Seventeen healthy male volunteers performed two different static balance tasks on a Biodex Balance Platform: (a) with two feet on the platform and (b) with one foot on the platform. Electroencephalographic data was recorded using Enobio (Neuroelectrics). The mean power spectrum of the alpha band of the central and frontal channels was calculated. Relative and absolute indices of reliability were also calculated. In general terms, the intraclass correlation coefficient (ICC) values of all the assessed channels can be classified as excellent (>0.90). The percentage standard error of measurement ranged from 0.54% to 1.02% and the percentage smallest real difference ranged from 1.50% to 2.82%. Electroencephalographic assessment with an Enobio device during balance tasks has excellent reliability. However, its utility was not demonstrated because responsiveness was not assessed.
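The absolute reliability indices in the abstract above are related by a simple formula. As a hedged sketch, the code below uses the commonly cited definitions SEM = SD × √(1 − ICC) and SRD = 1.96 × √2 × SEM. The abstract does not state which formulas the authors used, so treating these as their method is an assumption, and the function names are illustrative.

```python
import math


def sem_from_icc(sd, icc):
    """Standard error of measurement from the between-subject SD and the ICC."""
    return sd * math.sqrt(1.0 - icc)


def smallest_real_difference(sem, z=1.96):
    """Smallest change exceeding measurement error at the chosen z level.

    The sqrt(2) factor reflects that a change score combines the error of
    two measurements. Works equally for absolute or percentage SEM values.
    """
    return z * math.sqrt(2.0) * sem


# Percentage SEM range reported in the abstract above.
srd_low = smallest_real_difference(0.54)
srd_high = smallest_real_difference(1.02)
print(f"%SRD range: {srd_low:.2f}% to {srd_high:.2f}%")
```

Applied to the reported percentage SEM range of 0.54% to 1.02%, this gives roughly 1.50% to 2.83%, close to the reported smallest real difference range of 1.50% to 2.82%.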
The Clinician-Administered PTSD Scale for DSM-5 (CAPS-5): Development and initial psychometric evaluation in military veterans.

PubMed

Weathers, Frank W; Bovin, Michelle J; Lee, Daniel J; Sloan, Denise M; Schnurr, Paula P; Kaloupek, Danny G; Keane, Terence M; Marx, Brian P

2018-03-01

The Clinician-Administered PTSD Scale (CAPS) is an extensively validated and widely used structured diagnostic interview for posttraumatic stress disorder (PTSD). The CAPS was recently revised to correspond with PTSD criteria in the fifth edition of the Diagnostic and Statistical Manual of Mental Disorders (DSM-5; American Psychiatric Association, 2013). This article describes the development of the CAPS for DSM-5 (CAPS-5) and presents the results of an initial psychometric evaluation of CAPS-5 scores in 2 samples of military veterans (Ns = 165 and 207). CAPS-5 diagnosis demonstrated strong interrater reliability (κ = .78 to 1.00, depending on the scoring rule) and test-retest reliability (κ = .83), as well as strong correspondence with a diagnosis based on the CAPS for DSM-IV (CAPS-IV; κ = .84 when optimally calibrated). CAPS-5 total severity score demonstrated high internal consistency (α = .88) and interrater reliability (ICC = .91) and good test-retest reliability (ICC = .78). It also demonstrated good convergent validity with total severity score on the CAPS-IV (r = .83) and PTSD Checklist for DSM-5 (r = .66) and good discriminant validity with measures of anxiety, depression, somatization, functional impairment, psychopathy, and alcohol abuse (rs = .02 to .54). Overall, these results indicate that the CAPS-5 is a psychometrically sound measure of DSM-5 PTSD diagnosis and symptom severity. Importantly, the CAPS-5 strongly corresponds with the CAPS-IV, which suggests that backward compatibility with the CAPS-IV was maintained and that the CAPS-5 provides continuity in evidence-based assessment of PTSD in the transition from DSM-IV to DSM-5 criteria.
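The agreement statistics for diagnosis in the abstract above are kappa coefficients. A minimal sketch of Cohen's kappa for two raters is given below to show how chance agreement is discounted; the function name and the ten ratings are hypothetical and are not CAPS-5 data, and the study's own scoring rules are not reproduced.

```python
from collections import Counter


def cohens_kappa(rater_a, rater_b):
    """Cohen's kappa for two raters assigning categorical labels."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("ratings must be non-empty and of equal length")
    n = len(rater_a)
    # Observed proportion of cases on which the raters agree.
    p_observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Agreement expected by chance from each rater's marginal frequencies.
    counts_a, counts_b = Counter(rater_a), Counter(rater_b)
    categories = set(rater_a) | set(rater_b)
    p_expected = sum(counts_a[c] * counts_b[c] for c in categories) / n**2
    # Undefined (division by zero) if both raters use a single category.
    return (p_observed - p_expected) / (1.0 - p_expected)


# Hypothetical diagnoses (1 = PTSD, 0 = no PTSD) from two interviewers.
interviewer_1 = [1, 1, 1, 1, 0, 0, 0, 0, 1, 0]
interviewer_2 = [1, 1, 1, 0, 0, 0, 0, 0, 1, 1]
kappa = cohens_kappa(interviewer_1, interviewer_2)
print(round(kappa, 2))
```

In this toy data the raters agree on 8 of 10 cases, but half of that agreement is expected by chance, so kappa is lower than the raw 80% agreement.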
Effect of knee and trunk angle on kinetic variables during the isometric midthigh pull: test-retest reliability.

PubMed

Comfort, Paul; Jones, Paul A; McMahon, John J; Newton, Robert

2015-01-01

The isometric midthigh pull (IMTP) has been used to monitor changes in force, maximum rate of force development (mRFD), and impulse, with performance in this task being associated with performance in athletic tasks. Numerous postures have been adopted in the literature, which may affect the kinetic variables during the task; therefore, the aim of this investigation was to determine whether different knee-joint angles (120°, 130°, 140°, and 150°) and hip-joint angles (125° and 145°), including the subjects' preferred posture, affect force, mRFD, and impulse during the IMTP. Intraclass correlation coefficients demonstrated high within-session reliability (r ≥ .870, P < .001) for all kinetic variables determined in all postures, excluding impulse measures during the 130° knee-flexion, 125° hip-flexion posture, which showed a low to moderate reliability (r = .666-.739, P < .001), while between-sessions testing demonstrated high reliability (r > .819, P < .001) for all kinetic variables. There were no significant differences in peak force (P > .05, Cohen d = 0.037, power = .408), mRFD (P > .05, Cohen d = 0.037, power = .409), or impulse at 100 ms (P > .05, Cohen d = 0.056, power = .609), 200 ms (P > .05, Cohen d = 0.057, power = .624), or 300 ms (P > .05, Cohen d = 0.061, power = .656) across postures. Smallest detectable differences demonstrated that changes in performance of >1.3% in peak isometric force, >10.3% in mRFD, >5.3% in impulse at 100 ms, >4.4% in impulse at 200 ms, and >7.1% in impulse at 300 ms should be considered meaningful, irrespective of posture.
Development, scoring, and reliability of the Microscale Audit of Pedestrian Streetscapes (MAPS)

PubMed Central

2013-01-01

Background Streetscape (microscale) features of the built environment can influence people’s perceptions of their neighborhoods’ suitability for physical activity. Many microscale audit tools have been developed, but few have published systematic scoring methods. We present the development, scoring, and reliability of the Microscale Audit of Pedestrian Streetscapes (MAPS) tool and its theoretically-based subscales. Methods MAPS was based on prior instruments and was developed to assess details of streetscapes considered relevant for physical activity. MAPS sections (route, segments, crossings, and cul-de-sacs) were scored by two independent raters for reliability analyses. There were 290 route pairs, 516 segment pairs, 319 crossing pairs, and 53 cul-de-sac pairs in the reliability sample. Individual inter-rater item reliability analyses were computed using Kappa, intra-class correlation coefficient (ICC), and percent agreement. A conceptual framework for subscale creation was developed using theory, expert consensus, and policy relevance. Items were grouped into subscales, and subscales were analyzed for inter-rater reliability at tiered levels of aggregation. Results There were 160 items included in the subscales (out of 201 items total). Of those included in the subscales, 80 items (50.0%) had good/excellent reliability, 41 items (25.6%) had moderate reliability, and 18 items (11.3%) had low reliability, with limited variability in the remaining 21 items (13.1%). Seventeen of the 20 route section subscales, valence (positive/negative) scores, and overall scores (85.0%) demonstrated good/excellent reliability and 3 demonstrated moderate reliability. Of the 16 segment subscales, valence scores, and overall scores, 12 (75.0%) demonstrated good/excellent reliability, three demonstrated moderate reliability, and one demonstrated poor reliability. 
Of the 8 crossing subscales, valence scores, and overall scores, 6 (75.0%) demonstrated good/excellent reliability, and 2 demonstrated moderate reliability. The cul-de-sac subscale demonstrated good/excellent reliability. Conclusions MAPS items and subscales predominantly demonstrated moderate to excellent reliability. The subscales and scoring system represent a theoretically based framework for using these complex microscale data and may be applicable to other similar instruments. PMID:23621947
Validity and reliability assessment of the Brazilian version of the game addiction scale (GAS).

PubMed

Lemos, Igor Lins; Cardoso, Adriana; Sougey, Everton Botelho

2016-05-01

The uncontrolled use of video games can be addictive. The Game Addiction Scale (GAS) is an instrument that was developed to assess this type of addiction. The GAS consists of 21 items that are divided into the following seven factors: salience, tolerance, mood modification, relapse, withdrawal, conflict and problems. This study assessed the convergent validity and reliability of the GAS according to measures of internal consistency and test-retest stability. Three hundred and eighty-four students completed the GAS, the Internet Addiction Test (IAT), the Liebowitz Social Anxiety Scale (LSAS), the Beck Depression Inventory (BDI) and the Video Game Addiction Test (VAT). A subgroup of the participants (n=76) completed the GAS again after 30 days to determine test-retest stability. The GAS demonstrated excellent internal consistency (Cronbach's alpha=0.92), was highly correlated with the VAT (r=0.883) and was moderately correlated with the BDI (r=0.358), the LSAS (r=0.326) and the IAT (r=0.454). In the Brazilian Portuguese population, the GAS shows good internal consistency. These data indicate that the GAS can be used to assess video game addiction due to its demonstrated psychometric validity.
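Internal consistency in the abstract above is summarized with Cronbach's alpha. The sketch below shows the standard formula, alpha = k/(k − 1) × (1 − sum of item variances / variance of total scores); the function name and the small response matrix are hypothetical and unrelated to the GAS data.

```python
from statistics import variance


def cronbach_alpha(responses):
    """Cronbach's alpha for a list of respondents, each a list of item scores."""
    k = len(responses[0])
    if k < 2 or len(responses) < 2:
        raise ValueError("need at least two items and two respondents")
    # Sample variance of each item across respondents.
    item_variances = [variance([row[j] for row in responses]) for j in range(k)]
    # Sample variance of each respondent's total score.
    total_variance = variance([sum(row) for row in responses])
    return k / (k - 1) * (1.0 - sum(item_variances) / total_variance)


# Hypothetical answers from four respondents to three items.
answers = [
    [2, 3, 3],
    [4, 4, 5],
    [1, 2, 2],
    [5, 5, 4],
]
alpha = cronbach_alpha(answers)
print(round(alpha, 3))
```

Alpha rises when items vary together across respondents, which is why it is read as a measure of internal consistency rather than of stability over time.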
Descriptive Model of Generic WAMS

DOE Office of Scientific and Technical Information (OSTI.GOV)

Hauer, John F.; DeSteese, John G.

The Department of Energy’s (DOE) Transmission Reliability Program is supporting the research, deployment, and demonstration of various wide area measurement system (WAMS) technologies to enhance the reliability of the Nation’s electrical power grid. Pacific Northwest National Laboratory (PNNL) was tasked by the DOE National SCADA Test Bed Program to conduct a study of WAMS security. This report represents achievement of the milestone to develop a generic WAMS model description that will provide a basis for the security analysis planned in the next phase of this study.
Test-Retest Reliability of Pediatric Heart Rate Variability: A Meta-Analysis.

PubMed

Weiner, Oren M; McGrath, Jennifer J

2017-01-01

Heart rate variability (HRV), an established index of autonomic cardiovascular modulation, is associated with health outcomes (e.g., obesity, diabetes) and mortality risk. Time- and frequency-domain HRV measures are commonly reported in longitudinal adult and pediatric studies of health. While test-retest reliability has been established among adults, less is known about the psychometric properties of HRV among infants, children, and adolescents. The objective was to conduct a meta-analysis of the test-retest reliability of time- and frequency-domain HRV measures from infancy to adolescence. Electronic searches (PubMed, PsycINFO; January 1970-December 2014) identified studies with nonclinical samples aged ≤ 18 years; ≥ 2 baseline HRV recordings separated by ≥ 1 day; and sufficient data for effect size computation. Forty-nine studies (N = 5,170) met inclusion criteria. Methodological variables coded included factors relevant to study protocol, sample characteristics, electrocardiogram (ECG) signal acquisition and preprocessing, and HRV analytical decisions. Fisher's Z was derived as the common effect size. Analyses were age-stratified (infant/toddler < 5 years, n = 3,329; child/adolescent 5-18 years, n = 1,841) due to marked methodological differences across the pediatric literature. Meta-analytic results revealed HRV demonstrated moderate reliability; child/adolescent studies (Z = 0.62, r = 0.55) had significantly higher reliability than infant/toddler studies (Z = 0.42, r = 0.40). Relative to other reported measures, HF exhibited the highest reliability among infant/toddler studies (Z = 0.42, r = 0.40), while rMSSD exhibited the highest reliability among child/adolescent studies (Z = 1.00, r = 0.76).
Moderator analyses indicated greater reliability with shorter test-retest interval length, reported exclusion criteria based on medical illness/condition, lower proportion of males, prerecording acclimatization period, and longer recording duration; differences were noted across age groups. HRV is reliable among pediatric samples. Reliability is sensitive to pertinent methodological decisions that require careful consideration by the researcher. Limited methodological reporting precluded several a priori moderator analyses. Suggestions for future research, including standards specified by Task Force Guidelines, are discussed.
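The effect sizes in the abstract above are reported both as Fisher's Z and as r. A minimal sketch of the standard conversion (Z = atanh(r), r = tanh(Z)) follows. The function names are illustrative, and the weighting and pooling steps of the meta-analysis are not reproduced here.

```python
import math


def r_to_fisher_z(r):
    """Fisher's variance-stabilizing transform of a correlation coefficient."""
    if not -1.0 < r < 1.0:
        raise ValueError("r must lie strictly between -1 and 1")
    return math.atanh(r)


def fisher_z_to_r(z):
    """Back-transform a Fisher's Z value to the correlation scale."""
    return math.tanh(z)


# Z values quoted in the abstract, back-transformed to r.
for z in (0.42, 0.62, 1.00):
    print(z, round(fisher_z_to_r(z), 2))
```

Back-transforming the quoted Z values gives r of about 0.40, 0.55, and 0.76, which matches the pairs reported in the abstract.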
Infant polysomnography: reliability and validity of infant arousal assessment.

PubMed

Crowell, David H; Kulp, Thomas D; Kapuniai, Linda E; Hunt, Carl E; Brooks, Lee J; Weese-Mayer, Debra E; Silvestri, Jean; Ward, Sally Davidson; Corwin, Michael; Tinsley, Larry; Peucker, Mark

2002-10-01

Infant arousal scoring based on the Atlas Task Force definition of transient EEG arousal was evaluated to determine (1) whether transient arousals can be identified and assessed reliably in infants and (2) whether arousal and no-arousal epochs scored previously by trained raters can be validated reliably by independent sleep experts. Phase I for inter- and intrarater reliability scoring was based on two datasets of sleep epochs selected randomly from nocturnal polysomnograms of healthy full-term, preterm, idiopathic apparent life-threatening event cases, and siblings of Sudden Infant Death Syndrome infants of 35 to 64 weeks postconceptional age. After training, test set 1 reliability was assessed and discrepancies identified. After retraining, test set 2 was scored by the same raters to determine interrater reliability. Later, three raters from the trained group rescored test set 2 to assess inter- and intrarater reliabilities. Interrater and intrarater reliability kappas, with 95% confidence intervals, ranged from substantial to almost perfect levels of agreement. Interrater reliabilities for spontaneous arousals were initially moderate and then substantial. During the validation phase, 315 previously scored epochs were presented to four sleep experts to rate as containing arousal or no-arousal events. Interrater expert agreements were diverse and considered as noninterpretable. Concordance in sleep experts' agreements, based on identification of the previously sampled arousal and no-arousal epochs, was used as a secondary evaluative technique. Results showed agreement by two or more experts on 86% of the Collaborative Home Infant Monitoring Evaluation Study arousal scored events. Conversely, only 1% of the Collaborative Home Infant Monitoring Evaluation Study-scored no-arousal epochs were rated as an arousal.
In summary, this study presents an empirically tested model with procedures and criteria for attaining improved reliability in transient EEG arousal assessments in infants using the modified Atlas Task Force standards. With training based on specific criteria, substantial inter- and intrarater agreement in identifying infant arousals was demonstrated. Corroborative validation results were too disparate for meaningful interpretation. Alternate evaluation based on concordance agreements supports reliance on infant EEG criteria for assessment. Results mandate additional confirmatory validation studies with specific training on infant EEG arousal assessment criteria.
Test-Retest Reliability of Pediatric Heart Rate Variability

PubMed Central

Weiner, Oren M.; McGrath, Jennifer J.

2017-01-01

Heart rate variability (HRV), an established index of autonomic cardiovascular modulation, is associated with health outcomes (e.g., obesity, diabetes) and mortality risk. Time- and frequency-domain HRV measures are commonly reported in longitudinal adult and pediatric studies of health. While test-retest reliability has been established among adults, less is known about the psychometric properties of HRV among infants, children, and adolescents. The objective was to conduct a meta-analysis of the test-retest reliability of time- and frequency-domain HRV measures from infancy to adolescence. Electronic searches (PubMed, PsycINFO; January 1970–December 2014) identified studies with nonclinical samples aged ≤ 18 years; ≥ 2 baseline HRV recordings separated by ≥ 1 day; and sufficient data for effect size computation. Forty-nine studies (N = 5,170) met inclusion criteria. Methodological variables coded included factors relevant to study protocol, sample characteristics, electrocardiogram (ECG) signal acquisition and preprocessing, and HRV analytical decisions. Fisher’s Z was derived as the common effect size. Analyses were age-stratified (infant/toddler < 5 years, n = 3,329; child/adolescent 5–18 years, n = 1,841) due to marked methodological differences across the pediatric literature. Meta-analytic results revealed HRV demonstrated moderate reliability; child/adolescent studies (Z = 0.62, r = 0.55) had significantly higher reliability than infant/toddler studies (Z = 0.42, r = 0.40). Relative to other reported measures, HF exhibited the highest reliability among infant/toddler studies (Z = 0.42, r = 0.40), while rMSSD exhibited the highest reliability among child/adolescent studies (Z = 1.00, r = 0.76). 
Moderator analyses indicated greater reliability with shorter test-retest interval length, reported exclusion criteria based on medical illness/condition, lower proportion of males, prerecording acclimatization period, and longer recording duration; differences were noted across age groups. HRV is reliable among pediatric samples. Reliability is sensitive to pertinent methodological decisions that require careful consideration by the researcher. Limited methodological reporting precluded several a priori moderator analyses. Suggestions for future research, including standards specified by Task Force Guidelines, are discussed. PMID:29307951
Reverse lactate threshold: a novel single-session approach to reliable high-resolution estimation of the anaerobic threshold.

PubMed

Dotan, Raffy

2012-06-01

The multisession maximal lactate steady-state (MLSS) test is the gold standard for anaerobic threshold (AnT) estimation. However, it is highly impractical, requires high fitness level, and suffers additional shortcomings. Existing single-session AnT-estimating tests are of compromised validity, reliability, and resolution. The presented reverse lactate threshold test (RLT) is a single-session, AnT-estimating test, aimed at avoiding the pitfalls of existing tests. It is based on the novel concept of identifying blood lactate's maximal appearance-disappearance equilibrium by approaching the AnT from higher, rather than from lower exercise intensities. Rowing, cycling, and running case data (4 recreational and competitive athletes, male and female, aged 17-39 y) are presented. Subjects performed the RLT test and, on a separate session, a single 30-min MLSS-type verification test at the RLT-determined intensity. The RLT and its MLSS verification exhibited exceptional agreement at 0.5% discrepancy or better. The RLT's training sensitivity was demonstrated by a case of 2.5-mo training regimen following which the RLT's 15-W improvement was fully MLSS-verified. The RLT's test-retest reliability was examined in 10 trained and untrained subjects. Test 2 differed from test 1 by only 0.3% with an intraclass correlation of 0.997. The data suggest RLT to accurately and reliably estimate AnT (as represented by MLSS verification) with high resolution and in distinctly different sports and to be sensitive to training adaptations. Compared with MLSS, the single-session RLT is highly practical and its lower fitness requirements make it applicable to athletes and untrained individuals alike. Further research is needed to establish RLT's validity and accuracy in larger samples.
Validation of new psychosocial factors questionnaires: a Colombian national study.

PubMed

Villalobos, Gloria H; Vargas, Angélica M; Rondón, Martin A; Felknor, Sarah A

2013-01-01

The study of workers' health problems possibly associated with stressful conditions requires valid and reliable tools for monitoring risk factors. The present study validates two questionnaires to assess psychosocial risk factors for stress-related illnesses within a sample of Colombian workers. The validation process was based on a representative sample survey of 2,360 Colombian employees, aged 18-70 years. Worker response rate was 90%; 46% of the responders were women. Internal consistency was calculated, construct validity was tested with factor analysis and concurrent validity was tested with Spearman correlations. The questionnaires demonstrated adequate reliability (0.88-0.95). Factor analysis confirmed the dimensions proposed in the measurement model. Concurrent validity resulted in significant correlations with stress and health symptoms. The "Work and Non-work Psychosocial Factors Questionnaires" were found to be valid and reliable for the assessment of workers' psychosocial factors, and they provide information for research and intervention.
Pitfalls and important issues in testing reliability using intraclass correlation coefficients in orthopaedic research.

PubMed

Lee, Kyoung Min; Lee, Jaebong; Chung, Chin Youb; Ahn, Soyeon; Sung, Ki Hyuk; Kim, Tae Won; Lee, Hui Jong; Park, Moon Seok

2012-06-01

Intra-class correlation coefficients (ICCs) provide a statistical means of testing reliability. However, their interpretation is not well documented in the orthopedic field. The purpose of this study was to investigate the use of ICCs in the orthopedic literature and to demonstrate pitfalls regarding their use. First, orthopedic articles that used ICCs were retrieved from the PubMed database, and journal demography, ICC models and concurrent statistics used were evaluated. Second, a reliability test was performed on three common physical examinations in cerebral palsy, namely, the Thomas test, the Staheli test, and popliteal angle measurement. Thirty patients were assessed by three orthopedic surgeons to explore the statistical methods testing reliability. Third, the factors affecting the ICC values were examined by simulating the data sets based on the physical examination data where the ranges, slopes, and interobserver variability were modified. Of the 92 orthopedic articles identified, 58 articles (63%) did not clarify the ICC model used, and only 5 articles (5%) described all models, types, and measures. In reliability testing, although the popliteal angle showed a larger mean absolute difference than the Thomas test and the Staheli test, the ICC of popliteal angle was higher, which was believed to be contrary to the context of measurement. In addition, the ICC values were affected by the model, type, and measures used. In simulated data sets, the ICC showed higher values when the range of the data sets was larger, the slopes of the data sets were parallel, and the interobserver variability was smaller. Care should be taken when interpreting the absolute ICC values, i.e., a higher ICC does not necessarily mean less variability because the ICC values can also be affected by various factors. The authors recommend that researchers clarify the ICC models used and interpret ICC values in the context of measurement.
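The range effect described in the abstract above can be illustrated with a small sketch. It computes a two-way random-effects, absolute-agreement, single-measurement ICC (often written ICC(2,1)) from ANOVA mean squares; choosing this model is an assumption, since the abstract stresses that results depend on the model used. The function name and both data sets are hypothetical: the rater disagreements are identical in the two sets, and only the spread of the subjects' underlying values differs.

```python
def icc_2_1(scores):
    """ICC(2,1) for a list of subjects, each a list of ratings (one per rater)."""
    n, k = len(scores), len(scores[0])
    grand = sum(sum(row) for row in scores) / (n * k)
    row_means = [sum(row) / k for row in scores]
    col_means = [sum(row[j] for row in scores) / n for j in range(k)]
    # Two-way ANOVA sums of squares: subjects, raters, residual.
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for row in scores for x in row)
    ss_error = ss_total - ss_rows - ss_cols
    ms_rows = ss_rows / (n - 1)
    ms_cols = ss_cols / (k - 1)
    ms_error = ss_error / ((n - 1) * (k - 1))
    return (ms_rows - ms_error) / (
        ms_rows + (k - 1) * ms_error + k * (ms_cols - ms_error) / n
    )


# Hypothetical rater deviations (5 subjects x 3 raters), reused in both sets.
deviations = [[1, -1, 0], [-1, 0, 1], [0, 1, -1], [1, 0, -1], [-1, 1, 0]]


def build(true_values):
    """Add the fixed rater deviations to each subject's underlying value."""
    return [[t + d for d in row] for t, row in zip(true_values, deviations)]


icc_narrow = icc_2_1(build([10, 11, 12, 13, 14]))  # narrow range of subjects
icc_wide = icc_2_1(build([10, 20, 30, 40, 50]))    # wide range of subjects
print(round(icc_narrow, 3), round(icc_wide, 3))
```

Although the raters disagree by exactly the same amounts in both sets, the wider range produces a much higher ICC, which is the pitfall the abstract warns about when ICC values are read without the context of measurement.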
A Theory-Grounded Measure of Adolescents’ Response to a Media Literacy Intervention

PubMed Central

Greene, Kathryn; Yanovitzky, Itzhak; Carpenter, Amanda; Banerjee, Smita C.; Magsamen-Conrad, Kate; Hecht, Michael L.; Elek, Elvira

2016-01-01

Media literacy interventions offer promising avenues for the prevention of risky health behaviors among children and adolescents, but current literature remains largely equivocal about their efficacy. The primary objective of this study was to develop and test theoretically-grounded measures of audiences’ degree of engagement with the content of media literacy programs based on the recognition that engagement (and not participation per se) can better explain and predict individual variations in the effects of these programs. We tested the validity and reliability of a measure of engagement with two different samples of 10th grade high school students who participated in a pilot and actual test of a brief media literacy curriculum. Four message evaluation factors (involvement, perceived novelty, critical thinking, personal reflection) emerged and demonstrate acceptable reliability. PMID:28042522
Two-year Test-Retest Reliability in High School Athletes Using the Four- and Two-Factor ImPACT Composite Structures: The Effects of Learning Disorders and Headache/Migraine Treatment History.

PubMed

Brett, Benjamin L; Solomon, Gary S; Hill, Jennifer; Schatz, Philip

2018-03-01

This study examined the test-retest reliability of the four- and two-factor structures (i.e., Memory and Speed) of ImPACT over a 2-year interval across multiple groups with premorbid conditions, including those with a history of special education or learning disorders (LD; n = 114), treatment history for headache/migraine (n = 81), and a control group (n = 792). Nine hundred and eighty-seven high school athletes completed baseline testing using online ImPACT across a 2-year interval. Paired-samples t-tests documented improvement from initial to follow-up assessments. Test stability was examined using regression-based measures (RBM) and reliable change indices (RCI). Reliability was examined using intraclass correlation coefficients (ICC). Significant improvement on all four composites was observed for the control group over a 2-year interval, whereas significant differences were observed only on Visual Motor Speed for the LD and headache/migraine treatment history groups. ICC ranges were similar across groups, and greater or comparable reliability was observed for the two-factor structure on Memory (0.67-0.73) and Speed (0.76-0.78) composites. RCIs and RBMs demonstrated stability for the four- and two-factor structures, with few cases falling outside the range of expected change within a healthy sample at the 90% and 95% CIs. Typical practices of obtaining new baselines every 2 years in the high school population can be applied to athletes with a history of special education or LD and headache/migraine treatment. The two-factor structure has potential to increase test-retest reliability. Further research regarding clinical utility is needed.

Test-retest reliability of the multiple sleep latency test in narcolepsy without cataplexy and idiopathic hypersomnia.

PubMed

Trotti, Lynn Marie; Staab, Beth A; Rye, David B

2013-08-15

Differentiation of narcolepsy without cataplexy from idiopathic hypersomnia relies entirely upon the multiple sleep latency test (MSLT). However, the test-retest reliability for these central nervous system hypersomnias has never been determined. Patients with narcolepsy without cataplexy, idiopathic hypersomnia, and physiologic hypersomnia who underwent two diagnostic multiple sleep latency tests were identified retrospectively. Correlations between the mean sleep latencies on the two studies were evaluated, and we probed for demographic and clinical features associated with reproducibility versus change in diagnosis. Thirty-six patients (58% women, mean age 34 years) were included. Inter-test interval was 4.2 ± 3.8 years (range 2.5 months to 16.9 years). Mean sleep latencies on the first and second tests were 5.5 (± 3.7 SD) and 7.3 (± 3.9) minutes, respectively, with no significant correlation (r = 0.17, p = 0.31). A change in diagnosis occurred in 53% of patients, and was accounted for by a difference in the mean sleep latency (N = 15, 42%) or the number of sleep onset REM periods (N = 11, 31%). The only feature predictive of a diagnosis change was a history of hypnagogic or hypnopompic hallucinations. The multiple sleep latency test demonstrates poor test-retest reliability in a clinical population of patients with central nervous system hypersomnia evaluated in a tertiary referral center. Alternative diagnostic tools are needed.
Structural Area Inspection Frequency Evaluation (SAIFE). Volume III. Demonstration Input, Inspection Survey, and MRR Data

DTIC Science & Technology

1978-04-01

1.7 Production Rate Change Time; 1.8 Time of Fatigue Test Start; 1.9 Fatigue Test Acceleration Factor; 1.10 Corrosion ... simulation logic. SAIFE accounts for the following factors: (1) aircraft design analysis; (2) component and full-scale fatigue testing; (3) production ... reliability; production, service, and corrosion defects; crack or corrosion detection probability; crack ...
Validity and reliability of the Japanese version of the FIM + FAM in patients with cerebrovascular accident.

PubMed

Miki, Emi; Yamane, Shingo; Yamaoka, Mai; Fujii, Hiroe; Ueno, Hiroka; Kawahara, Toshie; Tanaka, Keiko; Tamashiro, Hiroaki; Inoue, Eiji; Okamoto, Takatsugu; Kuriyama, Masaru

2016-09-01

The study aim was to investigate the validity and reliability of the Functional Independence Measure and Functional Assessment Measure (FIM + FAM), which is unfamiliar in Japan, by using its Japanese version (FIM + FAM-j) in patients with cerebrovascular accident (CVA). Forty-two CVA patients participated. Criterion validity was examined by correlating the full scale and subscales of FIM + FAM-j with several well-established measurements using Spearman's correlation coefficient. Reliability was evaluated by internal consistency (tested by Cronbach's alpha coefficient) and intra-rater reliability (tested by Kendall's tau correlation coefficient). Good-to-excellent criterion validity was found between the full scale and motor subscales of the FIM + FAM-j and the Barthel Index, National Institutes of Health Stroke Scale, modified Rankin Scale, and lower extremity Brunnstrom Recovery Stage. High internal consistency was observed within the full-scale FIM + FAM-j and the motor and cognitive subscales (Cronbach's alphas were 0.968, 0.954, and 0.948, respectively). Additionally, good intra-rater reliability was observed within the full scale and motor subscales, and excellent reliability for the cognitive subscales (taus were 0.83, 0.80, and 0.98, respectively). This study showed that the FIM + FAM-j demonstrated acceptable levels of validity and reliability when used for CVA as a measure of disability.
Space reliability technology - A historical perspective

NASA Technical Reports Server (NTRS)

Cohen, H.

1984-01-01

The progressive improvement in the reliability of launch vehicles is traced from the Vanguard rocket to the STS. The Vanguard, built with minimal redundancy and a high mass ratio, was used as an operational vehicle midway through its test program in an attempt to meet the perceived challenge represented by the Sputnik. The fourth Vanguard failed due to inadequate contamination prevention and lack of inspection ports. Automatic firing sequences were adopted for the Titan rockets, which were an order of magnitude larger than the Vanguard and therefore had room for interior inspections. Qualification testing and reporting were introduced for components, along with X ray inspection of fuel tank welds. Dual systems were added for flight critical components when the Titan became man-rated for the Gemini program. Designs incorporated full failure mode effects and criticality analyses for the Apollo program, which exposed the limits of applicability of numerical reliability models. Fault tree analyses and program milestone reviews were initiated. The worth of man-in-the-loop in space activities for reliability was demonstrated with the rescue of Skylab after solar panel and meteoroid shield failures. It is now the reliability of the payload, rather than the vehicle, that is questioned for Shuttle launches.
[Reliability and validity of the Braden Scale for predicting pressure sore risk].

PubMed

Boes, C

2000-12-01

For more accurate and objective pressure sore risk assessment, various risk assessment tools have been developed, mainly in the USA and Great Britain. The Braden Scale for Predicting Pressure Sore Risk is one such tool. By means of a literature analysis of German and English texts referring to the Braden Scale, the scientific criteria of reliability and validity are examined and the consequences for applying the scale in Germany are shown. Analysis of 4 reliability studies shows an exclusive focus on interrater reliability. Further, although the 19 validity studies were conducted in many different settings, their examination is limited to the criteria of sensitivity and specificity (accuracy). Sensitivity and specificity range from 35% to 100%. The recommended cut-off points range from 10 to 19 points. The studies prove not to be comparable with each other. Furthermore, distortions that affect the accuracy of the scale can be found in these studies. The results of the analysis presented here show insufficient proof of reliability and validity in the American studies. In Germany, the Braden Scale has not yet been tested under scientific criteria. Such testing is needed before the scale is used in different German settings. In the course of such testing, the design and study procedures of the American studies can be used as a basis, as can the problems identified in this analysis.
Inventory of college challenges for ethnic minority students: psychometric properties of a new instrument in Chinese Americans.

PubMed

Ying, Yu-Wen; Lee, Peter Allen; Tsai, Jeanne L

2004-11-01

The Inventory of College Challenges for Ethnic Minority Students (ICCEMS) is a newly developed instrument that assesses challenges faced by ethnic minority college students across a range of cultural, academic, social, and practical domains. The present study tested the ICCEMS among Chinese American students in an attempt to identify its factor structure and assess its psychometric properties. A total of 13 factor domains emerged. The Cronbach's alpha and 1-month test-retest reliability of the subscales and the overall scale supported their reliability. Both criterion and construct validities were also demonstrated. Chinese American college students faced the greatest challenges in terms of unclear career direction and academic demands. 2004 APA
Proof test methodology for composites

NASA Technical Reports Server (NTRS)

Wu, Edward M.; Bell, David K.

1992-01-01

The special requirements for proof test of composites are identified based on the underlying failure process of composites. Two proof test methods are developed to eliminate the inevitable weak fiber sites without also causing flaw clustering which weakens the post-proof-test composite. Significant reliability enhancement by these proof test methods has been experimentally demonstrated for composite strength and composite life in tension. This basic proof test methodology is relevant to the certification and acceptance of critical composite structures. It can also be applied to the manufacturing process development to achieve zero-reject for very large composite structures.
Test-retest reliability of sudden ankle inversion measurements in subjects with healthy ankle joints.

PubMed

Eechaute, Christophe; Vaes, Peter; Duquet, William; Van Gheluwe, Bart

2007-01-01

Sudden ankle inversion tests have been used to investigate whether the onset of peroneal muscle activity is delayed in patients with chronically unstable ankle joints. Before interpreting test results of latency times in patients with chronic ankle instability and healthy subjects, the reliability of these measures must be first demonstrated. To investigate the test-retest reliability of variables measured during a sudden ankle inversion movement in standing subjects with healthy ankle joints. Validation study. Research laboratory. 15 subjects with healthy ankle joints (30 ankles). Subjects stood on an ankle inversion platform with both feet tightly fixed to independently moveable trapdoors. An unexpected sudden ankle inversion of 50 degrees was imposed. We measured latency and motor response times and electromechanical delay of the peroneus longus muscle, along with the time and angular position of the first and second decelerating moments, the mean and maximum inversion speed, and the total inversion time. Correlation coefficients and standard error of measurements were calculated. Intraclass correlation coefficients ranged from 0.17 for the electromechanical delay of the peroneus longus muscle (standard error of measurement = 2.7 milliseconds) to 0.89 for the maximum inversion speed (standard error of measurement = 34.8 milliseconds). The reliability of the latency and motor response times of the peroneus longus muscle, the time of the first and second decelerating moments, and the mean and maximum inversion speed was acceptable in subjects with healthy ankle joints and supports the investigation of the reliability of these measures in subjects with chronic ankle instability. The lower reliability of the electromechanical delay of the peroneus longus muscle and the angular positions of both decelerating moments calls the use of these variables into question.
Development of the adult PedsQL™ neurofibromatosis type 1 module: initial feasibility, reliability and validity.

PubMed

Nutakki, Kavitha; Hingtgen, Cynthia M; Monahan, Patrick; Varni, James W; Swigonski, Nancy L

2013-02-21

Neurofibromatosis type 1 (NF1) is a common autosomal dominant genetic disorder with significant impact on health-related quality of life (HRQOL). Research in understanding the pathogenetic mechanisms of neurofibroma development has led to the use of new clinical trials for the treatment of NF1. One of the most important outcomes of a trial is improvement in quality of life; however, no condition-specific HRQOL instrument for NF1 exists. The objective of this study was to develop an NF1 HRQOL instrument as a module of PedsQL™ and to test for its initial feasibility, internal consistency reliability and validity in adults with NF1. The NF1-specific HRQOL instrument was developed using a standard method of PedsQL™ module development: literature review, focus group/semi-structured interviews, cognitive interviews and experts' review of initial draft, pilot testing and field testing. Field testing involved 134 adults with NF1. Feasibility was measured by the percentage of missing responses, internal consistency reliability was measured with Cronbach's alpha and validity was measured by the known-groups method. Feasibility, measured by the percentage of missing responses, was 4.8% for all subscales on the adult version of the NF1-specific instrument. Internal consistency reliability for the Total Score (alpha = 0.97) and subscale reliabilities ranging from 0.72 to 0.96 were acceptable for group comparisons. The PedsQL™ NF1 module distinguished between NF1 adults with excellent to very good, good, and fair to poor health status. The results demonstrate the initial feasibility, reliability and validity of the PedsQL™ NF1 module in adult patients. The PedsQL™ NF1 Module can be used to understand the multidimensional impact of NF1 on the HRQOL of patients with this disorder.
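Several records in this listing report internal consistency as Cronbach's alpha. As a minimal sketch of how that statistic is usually computed from an item-response matrix, the Python below applies the standard formula alpha = k/(k-1) * (1 - sum of item variances / variance of total scores). The function name `cronbach_alpha` and the small response matrix are illustrative assumptions and are not taken from any of the studies.

```python
import numpy as np


def cronbach_alpha(scores):
    """Cronbach's alpha for a 2-D array (rows = respondents, columns = items)."""
    scores = np.asarray(scores, dtype=float)
    k = scores.shape[1]                          # number of items
    item_vars = scores.var(axis=0, ddof=1)       # sample variance of each item
    total_var = scores.sum(axis=1).var(ddof=1)   # variance of respondents' total scores
    return k / (k - 1) * (1.0 - item_vars.sum() / total_var)


# Hypothetical responses: 5 respondents answering 3 items (made-up numbers).
demo = np.array([
    [3, 4, 3],
    [2, 2, 3],
    [4, 5, 5],
    [1, 2, 1],
    [3, 3, 4],
])
alpha = cronbach_alpha(demo)
print(f"alpha = {alpha:.3f}")
```

Values near 1 indicate that the items vary together; the thresholds used to call a value "acceptable" differ between studies and are not fixed by the formula.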
Reliability of the Balance Evaluation Systems Test (BESTest) and BESTest sections for adults with hemiparesis

PubMed Central

Rodrigues, Letícia C.; Marques, Aline P.; Barros, Paula B.; Michaelsen, Stella M.

2014-01-01

BACKGROUND: The Balance Evaluation Systems Test (BESTest) was recently created to allow the development of treatments according to the specific balance system affected in each patient. The Brazilian version of the BESTest has not been specifically tested after stroke. OBJECTIVE: To evaluate the intra- and inter-rater reliability and concurrent and convergent validity of the total score of the BESTest and BESTest sections for adults with hemiparesis after stroke. METHOD: The study included 16 subjects (61.1±7.5 years) with chronic hemiparesis (54.5±43.5 months after stroke). The BESTest was administered by two raters in the same week and one of the raters repeated the test after a one-week interval. Intraclass correlation coefficient (ICC) was calculated to assess intra- and interrater reliability. Concurrent validity with the Berg Balance Scale (BBS) and convergent validity with the Activities-specific Balance Confidence scale (ABC-Brazil) were assessed using Pearson's correlation coefficient. RESULTS: Both the BESTest total score (ICC=0.98) and the BESTest sections (ICC between 0.85 and 0.96) have excellent intrarater reliability. Interrater reliability for the total score was excellent (ICC=0.93) and, for the sections, it ranged between 0.71 and 0.94. The correlation coefficient between the BESTest and the BBS and ABC-Brazil were 0.78 and 0.59, respectively. CONCLUSIONS: The Brazilian version of the BESTest demonstrated adequate reliability when measured by sections and could identify what balance system was affected in patients after stroke. Concurrent validity was excellent with the BBS total score and good to excellent with the sections. The total scores but not the sections present adequate convergent validity with the ABC-Brazil. However, other psychometric properties should be further investigated. PMID:25003281
Probabilistic Assessment of National Wind Tunnel

NASA Technical Reports Server (NTRS)

Shah, A. R.; Shiao, M.; Chamis, C. C.

1996-01-01

A preliminary probabilistic structural assessment of the critical section of National Wind Tunnel (NWT) is performed using NESSUS (Numerical Evaluation of Stochastic Structures Under Stress) computer code. Thereby, the capabilities of NESSUS code have been demonstrated to address reliability issues of the NWT. Uncertainties in the geometry, material properties, loads and stiffener location on the NWT are considered to perform the reliability assessment. Probabilistic stress, frequency, buckling, fatigue and proof load analyses are performed. These analyses cover the major global and some local design requirements. Based on the assumed uncertainties, the results reveal the assurance of minimum 0.999 reliability for the NWT. Preliminary life prediction analysis results show that the life of the NWT is governed by the fatigue of welds. Also, reliability based proof test assessment is performed.
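The abstract above does not describe how NESSUS computes reliability, so the following is not a reproduction of that code. It is a textbook stress-strength interference sketch, assuming independent, normally distributed strength and stress, in which reliability is the probability that strength exceeds stress. The function names and all numeric inputs are hypothetical.

```python
import math


def normal_cdf(z):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def interference_reliability(mean_strength, sd_strength, mean_stress, sd_stress):
    """P(strength > stress) for independent normal strength and stress."""
    # Reliability index: mean safety margin divided by its standard deviation.
    beta = (mean_strength - mean_stress) / math.hypot(sd_strength, sd_stress)
    return normal_cdf(beta)


# Hypothetical inputs (arbitrary consistent units), chosen only for illustration.
rel = interference_reliability(500.0, 30.0, 380.0, 40.0)
print(f"reliability = {rel:.4f}")
```

With real structures the variables are rarely all normal or independent, which is one reason dedicated probabilistic codes are used instead of this closed form.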
Reliability Analysis of Uniaxially Ground Brittle Materials

NASA Technical Reports Server (NTRS)

Salem, Jonathan A.; Nemeth, Noel N.; Powers, Lynn M.; Choi, Sung R.

1995-01-01

The fast fracture strength distribution of uniaxially ground, alpha silicon carbide was investigated as a function of grinding angle relative to the principal stress direction in flexure. Both as-ground and ground/annealed surfaces were investigated. The resulting flexural strength distributions were used to verify reliability models and predict the strength distribution of larger plate specimens tested in biaxial flexure. Complete fractography was done on the specimens. Failures occurred from agglomerates, machining cracks, or hybrid flaws that consisted of a machining crack located at a processing agglomerate. Annealing eliminated failures due to machining damage. Reliability analyses were performed using two and three parameter Weibull and Batdorf methodologies. The Weibull size effect was demonstrated for machining flaws. Mixed mode reliability models reasonably predicted the strength distributions of uniaxial flexure and biaxial plate specimens.
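As a rough illustration of the Weibull size effect mentioned above, the sketch below uses only the two-parameter Weibull form, P_f = 1 - exp(-(sigma/sigma_0)^m), and the usual weakest-link scaling of characteristic strength with stressed size, sigma_2 = sigma_1 * (S_1/S_2)^(1/m). It does not implement the three-parameter or Batdorf models named in the abstract, and the function names, the modulus, and the size ratio are assumptions made for the example.

```python
import math


def weibull_failure_probability(stress, char_strength, modulus):
    """Two-parameter Weibull probability of failure at a given stress."""
    return 1.0 - math.exp(-((stress / char_strength) ** modulus))


def scaled_characteristic_strength(char_strength, size_ref, size_new, modulus):
    """Weakest-link size scaling: larger stressed area or volume lowers strength."""
    return char_strength * (size_ref / size_new) ** (1.0 / modulus)


# Hypothetical values: modulus m = 10, reference strength 400 (arbitrary units),
# and a specimen with 1024 times the effective stressed size of the reference.
m = 10.0
small = 400.0
large = scaled_characteristic_strength(small, size_ref=1.0, size_new=1024.0, modulus=m)
print(f"scaled characteristic strength = {large:.1f}")
print(f"P_f at the characteristic strength = "
      f"{weibull_failure_probability(small, small, m):.4f}")
```

Whether the size term should be an effective area or an effective volume depends on whether the strength-limiting flaws are surface or volume flaws; the sketch leaves that choice to the caller.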
Reliability and Construct Validity of the NEI VFQ-25 in a Subset of Patients With Geographic Atrophy From the Phase 2 Mahalo Study.

PubMed

Sivaprasad, Sobha; Tschosik, Elizabeth; Kapre, Audrey; Varma, Rohit; Bressler, Neil M; Kimel, Miriam; Dolan, Chantal; Silverman, David

2018-06-01

Geographic atrophy (GA) is an advanced form of age-related macular degeneration characterized by progressive, irreversible visual function loss. This analysis evaluates the psychometric properties of the 25-Item National Eye Institute Visual Function Questionnaire (NEI VFQ-25) composite, near activity, and distance activity scores in patients with GA. Reliability and validity study. Reliability and validity were tested with NEI VFQ-25 data collected from 100 subjects with GA from United States' sites of the phase 2 Mahalo study of lampalizumab (ClinicalTrials.gov identifier: NCT01229215). Strong internal consistency and reproducibility were demonstrated for the NEI VFQ-25 composite (Cronbach's α, 0.95; intraclass correlation coefficient [ICC], 0.86), near activity (Cronbach's α, 0.84; ICC, 0.80), and distance activity (Cronbach's α, 0.84; ICC, 0.84) scores. Convergent validity with the binocular measures, Minnesota Low-Vision Reading Test (MNRead) reading speed and Functional Reading Independence (FRI) index score, was demonstrated for baseline NEI VFQ-25 composite (Pearson correlation [r] = 0.61 and 0.69, respectively), near activities (r = 0.69 and 0.73), and distance activities (r = 0.57 and 0.64) scores. Known-group validity testing for baseline mean NEI VFQ-25 scores (composite, near activities, and distance activities) showed differences between patients with mean maximum MNRead reading speed ≥ 80 vs < 80 words per minute, and between mean FRI index score ≥ 2.5 vs < 2.5 (all P < .0001). Psychometric evidence supports the NEI VFQ-25 as a reliable and valid cross-sectional measure of the impact of GA on patient visual function and vision-related quality of life. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.
The script concordance test in radiation oncology: validation study of a new tool to assess clinical reasoning

PubMed Central

Lambert, Carole; Gagnon, Robert; Nguyen, David; Charlin, Bernard

2009-01-01

Background The Script Concordance test (SCT) is a reliable and valid tool to evaluate clinical reasoning in complex situations where experts' opinions may be divided. Scores reflect the degree of concordance between the performance of examinees and that of a reference panel of experienced physicians. The purpose of this study is to demonstrate SCT's usefulness in radiation oncology. Methods A 90-item radiation oncology SCT was administered to 155 participants. Three levels of experience were tested: medical students (n = 70), radiation oncology residents (n = 38) and radiation oncologists (n = 47). Statistical tests were performed to assess reliability and to document validity. Results After item optimization, the test comprised 30 cases and 70 questions. Cronbach alpha was 0.90. Mean scores were 51.62 (± 8.19) for students, 71.20 (± 9.45) for residents and 76.67 (± 6.14) for radiation oncologists. The difference between the three groups was statistically significant when compared by the Kruskal-Wallis test (p < 0.001). Conclusion The SCT is reliable and useful to discriminate among participants according to their level of experience in radiation oncology. It appears to be a useful tool to document the progression of reasoning during residency training. PMID:19203358
Preliminary validation and reliability of the Short Form Chronic Respiratory Disease Questionnaire in a lung cancer population.

PubMed

Charalambous, A; Molassiotis, A

2017-01-01

The Short Form Chronic Respiratory Questionnaire (SF-CRQ) is frequently used in patients with obstructive pulmonary disease and it has demonstrated excellent psychometric properties. Since there is no psychometric information for its use with lung cancer patients, this study explored its validity and reliability in this population. Forty-six patients were assessed at two time points (with a 4-week interval) using the SF-CRQ, the modified Borg Scale, five numerical rating scales related to Perceived Severity of Breathlessness, and the Hospital Anxiety and Depression Scale. Internal consistency reliability was investigated by Cronbach's alpha reliability coefficient and test-retest reliability by the Spearman-Brown reliability coefficient (P); content validity, as well as convergent validity by Pearson's correlation coefficient between the SF-CRQ and the conceptually similar scales mentioned above, were also explored. A principal component factor analysis was performed. The internal consistency was high [α = 0.88 (baseline) and 0.91 (after 1 month)]. The SF-CRQ had good stability with test-retest reliability ranging from r = 0.64 to 0.78, P < 0.001. Factor analysis suggests a single construct in this population. The preliminary data analyses supported the convergent, content, and construct validity of the SF-CRQ, providing promising evidence that this can be a valid and reliable instrument for the assessment of quality of life related to breathlessness in lung cancer patients. © 2015 John Wiley & Sons Ltd.
Reliability of a new test battery for fitness assessment of the European Astronaut corps.

PubMed

Petersen, Nora; Thieschäfer, Lutz; Ploutz-Snyder, Lori; Damann, Volker; Mester, Joachim

2015-01-01

To optimise health for space missions, European astronauts follow specific conditioning programs before, during and after their flights. To evaluate the effectiveness of these programs, the European Space Agency conducts an Astronaut Fitness Assessment (AFA), but the test-retest reliability of elements within it remains unexamined. The reliability study described here presents a scientific basis for implementing the AFA, but also highlights challenges faced by operational teams supporting humans in such unique environments, especially with respect to health and fitness monitoring of crew members travelling not only into space, but also across the world. The AFA tests assessed parameters known to be affected by prolonged exposure to microgravity: aerobic capacity (VO2max), muscular strength (one repetition max, 1 RM) and power (vertical jumps), core stability, flexibility and balance. Intraclass correlation coefficients (ICC3.1), standard error of measurement and coefficient of variation were used to assess relative and absolute test-retest reliability. Squat and bench 1 RM (ICC3.1 = 0.94-0.99), hip flexion (ICC3.1 = 0.99) and left and right handgrip strength (ICC3.1 = 0.95 and 0.97), showed the highest test-retest reliability, followed by VO2max (ICC3.1 = 0.91), core strength (ICC3.1 = 0.78-0.89), hip extension (ICC3.1 = 0.63), the countermeasure (ICC3.1 = 0.76) and squat (ICC3.1 = 0.63) jumps, and single right- and left-leg jump height (ICC3.1 = 0.51 and 0.14). For balance, relative reliability ranged from ICC3.1 = 0.78 for path length (two legs, head tilted back, eyes open) to ICC3.1 = 0.04 for average rotation velocity (one leg, eyes closed). In a small sample (n = 8) of young, healthy individuals, the AFA battery of tests demonstrated acceptable test-retest reliability for most parameters except some balance and single-leg jump tasks. 
These findings suggest that, for the application with astronauts, most AFA tests appear appropriate to be maintained in the test battery, but that some elements may be unreliable, and require either modification (duration, selection of task) or removal (single-leg jump, balance test on sphere) from the battery. The test battery is mobile and universally applicable for occupational and general fitness assessment by its comprehensive composition of tests covering many systems involved in whole body movement.
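The record above reports ICC3.1 together with the standard error of measurement (SEM). As a minimal sketch of what those statistics involve, the Python below computes ICC(3,1) (two-way mixed effects, consistency, single measurement) from a subjects-by-sessions matrix using the usual mean-square formula, and an SEM using one common convention, SD * sqrt(1 - ICC). The abstract does not say which SEM convention the authors used, so that choice is an assumption, as are the function names and the made-up scores.

```python
import numpy as np


def icc_3_1(data):
    """ICC(3,1): rows = subjects, columns = repeated sessions or raters."""
    x = np.asarray(data, dtype=float)
    n, k = x.shape
    grand = x.mean()
    ss_rows = k * ((x.mean(axis=1) - grand) ** 2).sum()   # between subjects
    ss_cols = n * ((x.mean(axis=0) - grand) ** 2).sum()   # between sessions
    ss_err = ((x - grand) ** 2).sum() - ss_rows - ss_cols # residual
    ms_rows = ss_rows / (n - 1)
    ms_err = ss_err / ((n - 1) * (k - 1))
    return (ms_rows - ms_err) / (ms_rows + (k - 1) * ms_err)


def sem_from_icc(data, icc):
    """SEM as SD * sqrt(1 - ICC), with SD taken over all scores (one convention)."""
    sd = np.asarray(data, dtype=float).std(ddof=1)
    return sd * np.sqrt(max(0.0, 1.0 - icc))


# Hypothetical test-retest scores for 5 subjects over 2 sessions (made-up numbers).
scores = np.array([
    [41.0, 43.0],
    [55.0, 52.0],
    [38.0, 40.0],
    [60.0, 61.0],
    [47.0, 44.0],
])
icc = icc_3_1(scores)
print(f"ICC(3,1) = {icc:.3f}, SEM = {sem_from_icc(scores, icc):.2f}")
```

Because ICC(3,1) measures consistency, a constant shift between sessions (for example, every subject improving by the same amount) does not lower it; an agreement-type ICC would be needed to penalize such a shift.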
Validity and intra-rater reliability of an android phone application to measure cervical range-of-motion.

PubMed

Quek, June; Brauer, Sandra G; Treleaven, Julia; Pua, Yong-Hao; Mentiplay, Benjamin; Clark, Ross Allan

2014-04-17

Concurrent validity and intra-rater reliability using a customized Android phone application to measure cervical-spine range-of-motion (ROM) has not been previously validated against a gold-standard three-dimensional motion analysis (3DMA) system. Twenty-one healthy individuals (age:31 ± 9.1 years, male:11) participated, with 16 re-examined for intra-rater reliability 1-7 days later. An Android phone was fixed on a helmet, which was then securely fastened on the participant's head. Cervical-spine ROM in flexion, extension, lateral flexion and rotation were performed in sitting with concurrent measurements obtained from both a 3DMA system and the phone. The phone demonstrated moderate to excellent (ICC = 0.53-0.98, Spearman ρ = 0.52-0.98) concurrent validity for ROM measurements in cervical flexion, extension, lateral-flexion and rotation. However, cervical rotation demonstrated both proportional and fixed bias. Excellent intra-rater reliability was demonstrated for cervical flexion, extension and lateral flexion (ICC = 0.82-0.90), but poor for right- and left-rotation (ICC = 0.05-0.33) using the phone. Possible reasons for the outcome are that flexion, extension and lateral-flexion measurements are detected by gravity-dependent accelerometers while rotation measurements are detected by the magnetometer which can be adversely affected by surrounding magnetic fields. The results of this study demonstrate that the tested Android phone application is valid and reliable to measure ROM of the cervical-spine in flexion, extension and lateral-flexion but not in rotation likely due to magnetic interference. The clinical implication of this study is that therapists should be mindful of the plane of measurement when using the Android phone to measure ROM of the cervical-spine.
Validity and intra-rater reliability of an Android phone application to measure cervical range-of-motion

PubMed Central

2014-01-01

Background Concurrent validity and intra-rater reliability using a customized Android phone application to measure cervical-spine range-of-motion (ROM) has not been previously validated against a gold-standard three-dimensional motion analysis (3DMA) system. Findings Twenty-one healthy individuals (age:31 ± 9.1 years, male:11) participated, with 16 re-examined for intra-rater reliability 1–7 days later. An Android phone was fixed on a helmet, which was then securely fastened on the participant’s head. Cervical-spine ROM in flexion, extension, lateral flexion and rotation were performed in sitting with concurrent measurements obtained from both a 3DMA system and the phone. The phone demonstrated moderate to excellent (ICC = 0.53-0.98, Spearman ρ = 0.52-0.98) concurrent validity for ROM measurements in cervical flexion, extension, lateral-flexion and rotation. However, cervical rotation demonstrated both proportional and fixed bias. Excellent intra-rater reliability was demonstrated for cervical flexion, extension and lateral flexion (ICC = 0.82-0.90), but poor for right- and left-rotation (ICC = 0.05-0.33) using the phone. Possible reasons for the outcome are that flexion, extension and lateral-flexion measurements are detected by gravity-dependent accelerometers while rotation measurements are detected by the magnetometer which can be adversely affected by surrounding magnetic fields. Conclusion The results of this study demonstrate that the tested Android phone application is valid and reliable to measure ROM of the cervical-spine in flexion, extension and lateral-flexion but not in rotation likely due to magnetic interference. The clinical implication of this study is that therapists should be mindful of the plane of measurement when using the Android phone to measure ROM of the cervical-spine. PMID:24742001
A literature review of clinical tests for lumbar instability in low back pain: validity and applicability in clinical practice.

PubMed

Ferrari, Silvano; Manni, Tiziana; Bonetti, Francesca; Villafañe, Jorge Hugo; Vanti, Carla

2015-01-01

Several clinical tests have been proposed for low back pain (LBP), but their usefulness in detecting lumbar instability is not yet clear. The objective of this literature review was to investigate the clinical validity of the main clinical tests used for the diagnosis of lumbar instability in individuals with LBP and to verify their applicability in everyday clinical practice. We searched studies of the accuracy and/or reliability of Prone Instability Test (PIT), Passive Lumbar Extension Test (PLE), Aberrant Movements Pattern (AMP), Posterior Shear Test (PST), Active Straight Leg Raise Test (ASLR) and Prone and Supine Bridge Tests (PB and SB) in Medline, Embase, Cinahl, PubMed, and Scopus databases. Only the studies in which each test was investigated by at least one study concerning both the accuracy and the reliability were considered eligible. The quality of the studies was evaluated by QUADAS and QAREL scales. Six papers considering 333 LBP patients were included. The PLE was the most accurate and informative clinical test, with high sensitivity (0.84, 95% CI: 0.69 - 0.91) and high specificity (0.90, 95% CI: 0.85 - 0.97). The diagnostic accuracy of AMP depends on each individual test. The PIT and the PST demonstrated fair to moderate sensitivity and specificity [PIT sensitivity = 0.71 (95% CI: 0.51 - 0.83), PIT specificity = 0.57 (95% CI: 0.39 - 0.78); PST sensitivity = 0.50 (95% CI: 0.41 - 0.76), PST specificity = 0.48 (95% CI: 0.22 - 0.58)]. The PLE showed a good reliability (k = 0.76), but this result comes from a single study. The inter-rater reliability of the PIT ranged from slight (k = 0.10 and 0.04) to good (k = 0.87). The inter-rater reliability of the AMP ranged from slight (k = -0.07) to moderate (k = 0.64), whereas the inter-rater reliability of the PST was fair (k = 0.27). The data from the studies provided information on the methods used and suggest that PLE is the most appropriate test to detect lumbar instability in specific LBP. 
However, due to the lack of available papers on other lumbar conditions, these findings should be confirmed with studies on non-specific LBP patients.
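The review above summarizes diagnostic accuracy as sensitivity and specificity and inter-rater reliability as kappa (k). As a hedged sketch of how these quantities are derived from 2x2 tables, the Python below uses the standard definitions. The function names and every count are hypothetical and are not the data behind the figures quoted in the abstract.

```python
def sensitivity_specificity(tp, fn, tn, fp):
    """Return (sensitivity, specificity) from 2x2 diagnostic counts."""
    sensitivity = tp / (tp + fn)   # proportion of true cases the test detects
    specificity = tn / (tn + fp)   # proportion of non-cases the test clears
    return sensitivity, specificity


def cohen_kappa(both_pos, r1_pos_r2_neg, r1_neg_r2_pos, both_neg):
    """Cohen's kappa for two raters giving a positive/negative judgement."""
    n = both_pos + r1_pos_r2_neg + r1_neg_r2_pos + both_neg
    observed = (both_pos + both_neg) / n
    # Chance agreement from each rater's marginal positive and negative rates.
    r1_pos = both_pos + r1_pos_r2_neg
    r2_pos = both_pos + r1_neg_r2_pos
    expected = (r1_pos * r2_pos + (n - r1_pos) * (n - r2_pos)) / n ** 2
    return (observed - expected) / (1.0 - expected)


# Hypothetical counts, chosen only to illustrate the arithmetic.
sens, spec = sensitivity_specificity(tp=42, fn=8, tn=45, fp=5)
kappa = cohen_kappa(both_pos=20, r1_pos_r2_neg=5, r1_neg_r2_pos=10, both_neg=15)
print(f"sensitivity = {sens:.2f}, specificity = {spec:.2f}, kappa = {kappa:.2f}")
```

Kappa can be negative when observed agreement falls below what the raters' marginal rates would produce by chance, which is how a value such as k = -0.07 can arise.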
SPIDERS Bi-Directional Charging Station Interconnection Testing

DOE Office of Scientific and Technical Information (OSTI.GOV)

Simpson, M.

2013-09-01

The Smart Power Infrastructure Demonstration for Energy Reliability and Security (SPIDERS) program is a multi-year Department of Defense-Department of Energy (DOE) collaborative effort that will demonstrate integration of renewables into island-able microgrids using on-site generation control, demand response, and energy storage with robust security features at multiple installations. Fort Carson, Colorado, will be the initial development and demonstration site for use of plug-in electric vehicles as energy storage (also known as vehicle-to-grid or V2G).

Development and validation of the University of Washington Clinical Assessment of Music Perception test.

PubMed

Kang, Robert; Nimmons, Grace Liu; Drennan, Ward; Longnion, Jeff; Ruffin, Chad; Nie, Kaibao; Won, Jong Ho; Worman, Tina; Yueh, Bevan; Rubinstein, Jay

2009-08-01

Assessment of cochlear implant outcomes centers around speech discrimination. Despite dramatic improvements in speech perception, music perception remains a challenge for most cochlear implant users. No standardized test exists to quantify music perception in a clinically practical manner. This study presents the University of Washington Clinical Assessment of Music Perception (CAMP) test as a reliable and valid music perception test for English-speaking, adult cochlear implant users. Forty-two cochlear implant subjects were recruited from the University of Washington Medical Center cochlear implant program and referred by two implant manufacturers. Ten normal-hearing volunteers were drawn from the University of Washington Medical Center and associated campuses. A computer-driven, self-administered test was developed to examine three specific aspects of music perception: pitch direction discrimination, melody recognition, and timbre recognition. The pitch subtest used an adaptive procedure to determine just-noticeable differences for complex tone pitch direction discrimination within the range of 1 to 12 semitones. The melody and timbre subtests assessed recognition of 12 commonly known melodies played with complex tones in an isochronous manner and eight musical instruments playing an identical five-note sequence, respectively. Testing was repeated for cochlear implant subjects to evaluate test-retest reliability. Normal-hearing volunteers were also tested to demonstrate differences in performance in the two populations. For cochlear implant subjects, pitch direction discrimination just-noticeable differences ranged from 1 to 8.0 semitones (Mean = 3.0, SD = 2.3). Melody and timbre recognition ranged from 0 to 94.4% correct (mean = 25.1, SD = 22.2) and 20.8 to 87.5% (mean = 45.3, SD = 16.2), respectively. 
Each subtest significantly correlated at least moderately with both Consonant-Nucleus-Consonant (CNC) word recognition scores and spondee recognition thresholds in steady state noise and two-talker babble. Intraclass coefficients demonstrating test-retest correlations for pitch, melody, and timbre were 0.85, 0.92, and 0.69, respectively. Normal-hearing volunteers had a mean pitch direction discrimination threshold of 1.0 semitone, the smallest interval tested, and mean melody and timbre recognition scores of 87.5 and 94.2%, respectively. The CAMP test discriminates a wide range of music perceptual ability in cochlear implant users. Moderate correlations were seen between music test results and both Consonant-Nucleus-Consonant word recognition scores and spondee recognition thresholds in background noise. Test-retest reliability was moderate to strong. The CAMP test provides a reliable and valid metric for a clinically practical, standardized evaluation of music perception in adult cochlear implant users.
Test-retest reliability of subliminal facial affective priming.

PubMed

Dannlowski, Udo; Suslow, Thomas

2006-02-01

Since the seminal 1993 demonstrations of Murphy and Zajonc, researchers have replicated and extended findings concerning subliminal affective priming. So far, however, no data on test-retest reliability of affective priming effects are available. A subliminal facial affective priming task was administered to 22 healthy individuals (15 women and 7 men) twice about 7 wk. apart. Happy and sad facial expressions were used as affective primes and neutral Chinese ideographs served as target masks, which had to be evaluated. Neutral facial primes and a no-face condition served as baselines. All participants reported not having seen any of the prime faces at either testing session. Priming scores for affective faces compared to the baselines were computed. Acceptable test-retest correlations (rs) of up to .74 were found for the affective priming scores. Although measured almost 2 mo. apart, subliminal affective priming seems to be a temporally stable effect.
Development and psychometric testing of a semantic differential scale of sexual attitude for the older person.

PubMed

Park, Hyojung; Shin, Sunhwa

2015-12-01

The purpose of this study was to develop and test a semantic differential scale of sexual attitudes for older people in Korea. The scale was based on items derived from a literature review and focus group interviews. A methodological study was used to test the reliability and validity of the instrument. A total of 368 older men and women were recruited to complete the semantic differential scale. Fifteen pairs of adjective ratings were extracted through factor analysis. Total variance explained was 63.40%. To test for construct validity, group comparisons were implemented. The total score of sexual attitudes showed significant differences depending on gender and availability of sexual activity. Cronbach's alpha coefficient for internal consistency was 0.96. The findings of this study demonstrate that the semantic differential scale of sexual attitude is a reliable and valid instrument. © 2015 Wiley Publishing Asia Pty Ltd.
Quality of prenatal care questionnaire: instrument development and testing.

PubMed

Heaman, Maureen I; Sword, Wendy A; Akhtar-Danesh, Noori; Bradford, Amanda; Tough, Suzanne; Janssen, Patricia A; Young, David C; Kingston, Dawn A; Hutton, Eileen K; Helewa, Michael E

2014-06-03

Utilization indices exist to measure quantity of prenatal care, but currently there is no published instrument to assess quality of prenatal care. The purpose of this study was to develop and test a new instrument, the Quality of Prenatal Care Questionnaire (QPCQ). Data for this instrument development study were collected in five Canadian cities. Items for the QPCQ were generated through interviews with 40 pregnant women and 40 health care providers and a review of prenatal care guidelines, followed by assessment of content validity and rating of importance of items. The preliminary 100-item QPCQ was administered to 422 postpartum women to conduct item reduction using exploratory factor analysis. The final 46-item version of the QPCQ was then administered to another 422 postpartum women to establish its construct validity, and internal consistency and test-retest reliability. Exploratory factor analysis reduced the QPCQ to 46 items, factored into 6 subscales, which subsequently were validated by confirmatory factor analysis. Construct validity was also demonstrated using a hypothesis testing approach; there was a significant positive association between women's ratings of the quality of prenatal care and their satisfaction with care (r = 0.81). Convergent validity was demonstrated by a significant positive correlation (r = 0.63) between the "Support and Respect" subscale of the QPCQ and the "Respectfulness/Emotional Support" subscale of the Prenatal Interpersonal Processes of Care instrument. The overall QPCQ had acceptable internal consistency reliability (Cronbach's alpha = 0.96), as did each of the subscales. The test-retest reliability result (Intra-class correlation coefficient = 0.88) indicated stability of the instrument on repeat administration approximately one week later. 
Temporal stability testing confirmed that women's ratings of their quality of prenatal care did not change as a result of giving birth or between the early postpartum period and 4 to 6 weeks postpartum. The QPCQ is a valid and reliable instrument that will be useful in future research as an outcome measure to compare quality of care across geographic regions, populations, and service delivery models, and to assess the relationship between quality of care and maternal and infant health outcomes.
The ABC’s of Suicide Risk Assessment: Applying a Tripartite Approach to Individual Evaluations

PubMed Central

Harris, Keith M.; Syu, Jia-Jia; Lello, Owen D.; Chew, Y. L. Eileen; Willcox, Christopher H.; Ho, Roger H. M.

2015-01-01

There is considerable need for accurate suicide risk assessment for clinical, screening, and research purposes. This study applied the tripartite affect-behavior-cognition theory, the suicidal barometer model, classical test theory, and item response theory (IRT), to develop a brief self-report measure of suicide risk that is theoretically-grounded, reliable and valid. An initial survey (n = 359) employed an iterative process to an item pool, resulting in the six-item Suicidal Affect-Behavior-Cognition Scale (SABCS). Three additional studies tested the SABCS and a highly endorsed comparison measure. Studies included two online surveys (Ns = 1007 and 713), and one prospective clinical survey (n = 72; Time 2, n = 54). Factor analyses demonstrated SABCS construct validity through unidimensionality. Internal reliability was high (α = .86-.93, split-half = .90-.94). The scale was predictive of future suicidal behaviors and suicidality (r = .68, .73, respectively), showed convergent validity, and the SABCS-4 demonstrated clinically relevant sensitivity to change. IRT analyses revealed the SABCS captured more information than the comparison measure, and better defined participants at low, moderate, and high risk. The SABCS is the first suicide risk measure to demonstrate no differential item functioning by sex, age, or ethnicity. In all comparisons, the SABCS showed incremental improvements over a highly endorsed scale through stronger predictive ability, reliability, and other properties. The SABCS is in the public domain, with this publication, and is suitable for clinical evaluations, public screening, and research. PMID:26030590
Integrating field methodology and web-based data collection to assess the reliability of the Alcohol Use Disorders Identification Test (AUDIT).

PubMed

Celio, Mark A; Vetter-O'Hagen, Courtney S; Lisman, Stephen A; Johansen, Gerard E; Spear, Linda P

2011-12-01

Field methodologies offer a unique opportunity to collect ecologically valid data on alcohol use and its associated problems within natural drinking environments. However, limitations in follow-up data collection methods have left unanswered questions regarding the psychometric properties of field-based measures. The aim of the current study is to evaluate the reliability of self-report data collected in a naturally occurring environment - as indexed by the Alcohol Use Disorders Identification Test (AUDIT) - compared to self-report data obtained through an innovative web-based follow-up procedure. Individuals recruited outside of bars (N=170; mean age=21; range 18-32) provided a BAC sample and completed a self-administered survey packet that included the AUDIT. BAC feedback was provided anonymously through a dedicated web page. Upon sign in, follow-up participants (n=89; 52%) were again asked to complete the AUDIT before receiving their BAC feedback. Reliability analyses demonstrated that AUDIT scores - both continuous and dichotomized at the standard cut-point - were stable across field- and web-based administrations. These results suggest that self-report data obtained from acutely intoxicated individuals in naturally occurring environments are reliable when compared to web-based data obtained after a brief follow-up interval. Furthermore, the results demonstrate the feasibility, utility, and potential of integrating field methods and web-based data collection procedures. Copyright © 2011 Elsevier Ireland Ltd. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)

Berardinelli, S.P.; Rusczek, R.A.; Mickelsen, R.L.

The National Institute for Occupational Safety and Health (NIOSH), in cooperation with Monsanto Chemical Company, conducted an on-site evaluation of chemical protective clothing at Monsanto's Nitro, West Virginia plant. The Monsanto plant manufactures additives for the rubber industry including antioxidants, pre-vulcanization inhibitors, accelerators, etc. This survey evaluated six raw materials that have a potential for skin absorption: aniline, cyclohexylamine, diisopropylamine, tertiary butylamine, morpholine and carbon disulfide. Five generic glove materials were tested against these chemicals: nitrile, neoprene, polyvinylchloride, natural latex and natural rubber. The NIOSH chemical permeation portable test system was used to generate breakthrough time data. The results were compared to permeation data reported in the literature that were obtained by using the ASTM F739-85 test method. The test data demonstrated that aniline has too low a vapor pressure for reliable analysis on the portable direct reading detectors used. The chemical permeation test system, however, provided comparable, reliable permeation data for the other tested chemicals. Monsanto has used this data to better select chemical protective clothing for its intended use.
Psychometric Properties of the Chinese Version of the Eating Attitudes Test in Young Female Patients with Eating Disorders in Mainland China.

PubMed

Kang, Qing; Chan, Raymond C K; Li, Xiaoping; Arcelus, Jon; Yue, Ling; Huang, Jiabin; Gu, Lian; Fan, Qing; Zhang, Haiyin; Xiao, Zeping; Chen, Jue

2017-11-01

The study aimed to investigate the reliability and validity of the Chinese version of the eating attitudes test (EAT-26) among female adolescents and young adults in Mainland China. This scale was administered to 396 female eating disorder patients and 406 non-eating-disorder healthy controls; in addition, 35 healthy controls completed a retest after a 4-week interval. Tests for reliability, convergent validity and receiver operating characteristic analysis were performed to detect the psychometric properties. The EAT-26 demonstrated good internal consistency (Cronbach's alpha = 0.822-0.922), test-retest reliability (interclass correlation coefficient = 0.817) and convergent validity (r = 0.450-0.750). The receiver operating characteristic analysis showed that the cut-off 14 for anorexia nervosa and 15 for bulimia nervosa represented good compromises with approximate sensitivity (0.66-0.68) and specificity (0.85-0.86). Our findings provided evidence that the Chinese version of the EAT-26 was a psychometrically reliable and valid self-rating instrument for identifying people suffering from an eating disorder in Mainland China. A clinical cut-off range between 14 and 15 could be used, but caution should be exercised because of the low sensitivity of the tool. Copyright © 2017 John Wiley & Sons, Ltd and Eating Disorders Association.
Kappa and Rater Accuracy: Paradigms and Parameters

ERIC Educational Resources Information Center

Conger, Anthony J.

2017-01-01

Drawing parallels to classical test theory, this article clarifies the difference between rater accuracy and reliability and demonstrates how category marginal frequencies affect rater agreement and Cohen's kappa. Category assignment paradigms are developed: comparing raters to a standard (index) versus comparing two raters to one another…
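The effect of category marginal frequencies on Cohen's kappa described in this abstract can be illustrated with a small sketch. It is not taken from the article: the function name `cohens_kappa` and both 2x2 agreement tables are hypothetical, chosen so that the two raters agree on 80% of cases in each table while the marginals differ.

```python
def cohens_kappa(table):
    """Cohen's kappa for a square agreement table.

    table[i][j] = number of cases rater A put in category i
    and rater B put in category j.
    """
    n = sum(sum(row) for row in table)
    k = len(table)
    # Observed agreement: proportion of cases on the diagonal.
    p_observed = sum(table[i][i] for i in range(k)) / n
    # Marginal proportions for each rater.
    row_marg = [sum(table[i]) / n for i in range(k)]
    col_marg = [sum(table[i][j] for i in range(k)) / n for j in range(k)]
    # Agreement expected by chance from the marginals alone.
    p_chance = sum(row_marg[i] * col_marg[i] for i in range(k))
    return (p_observed - p_chance) / (1 - p_chance)


# Hypothetical data: both tables have 80% observed agreement.
balanced = [[40, 10], [10, 40]]  # marginals 50% / 50%
skewed = [[75, 10], [10, 5]]     # marginals 85% / 15%

print(round(cohens_kappa(balanced), 3))  # 0.6
print(round(cohens_kappa(skewed), 3))    # 0.216
```

With identical observed agreement, the skewed marginals raise chance agreement from 0.50 to 0.745, which is why kappa drops.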
Towards Autonomous Inspection of Space Systems Using Mobile Robotic Sensor Platforms

NASA Technical Reports Server (NTRS)

Wong, Edmond; Saad, Ashraf; Litt, Jonathan S.

2007-01-01

The space transportation systems required to support NASA's Exploration Initiative will demand a high degree of reliability to ensure mission success. This reliability can be realized through autonomous fault/damage detection and repair capabilities. It is crucial that such capabilities are incorporated into these systems since it will be impractical to rely upon Extra-Vehicular Activity (EVA), visual inspection or tele-operation due to the costly, labor-intensive and time-consuming nature of these methods. One approach to achieving this capability is through the use of an autonomous inspection system comprised of miniature mobile sensor platforms that will cooperatively perform high confidence inspection of space vehicles and habitats. This paper will discuss the efforts to develop a small scale demonstration test-bed to investigate the feasibility of using autonomous mobile sensor platforms to perform inspection operations. Progress will be discussed in technology areas including: the hardware implementation and demonstration of robotic sensor platforms, the implementation of a hardware test-bed facility, and the investigation of collaborative control algorithms.
Diagnosing prosopagnosia in East Asian individuals: Norms for the Cambridge Face Memory Test-Chinese.

PubMed

McKone, Elinor; Wan, Lulu; Robbins, Rachel; Crookes, Kate; Liu, Jia

2017-07-01

The Cambridge Face Memory Test (CFMT) is widely accepted as providing a valid and reliable tool in diagnosing prosopagnosia (inability to recognize people's faces). Previously, large-sample norms have been available only for Caucasian-face versions, suitable for diagnosis in Caucasian observers. These are invalid for observers of different races due to potentially severe other-race effects. Here, we provide large-sample norms (N = 306) for East Asian observers on an Asian-face version (CFMT-Chinese). We also demonstrate methodological suitability of the CFMT-Chinese for prosopagnosia diagnosis (high internal reliability, approximately normal distribution, norm-score range sufficiently far above chance). Additional findings were a female advantage on mean performance, plus a difference between participants living in the East (China) or the West (international students, second-generation children of immigrants), which we suggest might reflect personality differences associated with willingness to emigrate. Finally, we demonstrate suitability of the CFMT-Chinese for individual differences studies that use correlations within the normal range.
Design, Development, And Testing of Umbilical System Mechanisms for the X-33 Advanced Technology Demonstrator

NASA Technical Reports Server (NTRS)

Littlefield, Alan C.; Melton, Gregory S.

2000-01-01

The X-33 Advanced Technology Demonstrator is an un-piloted, vertical take-off, horizontal landing spacecraft. The purpose of the X-33 program is to demonstrate technologies that will dramatically lower the cost of access to space. The rocket-powered X-33 will reach an altitude of up to 100 km and speeds between Mach 13 and 15. Fifteen flight tests are planned, beginning in 2000. Some of the key technologies demonstrated will be the linear aerospike engine, improved thermal protection systems, composite fuel tanks and reduced operational timelines. The X-33 vehicle umbilical connections provide monitoring, power, cooling, purge, and fueling capability during horizontal processing and vertical launch operations. Two "rise-off" umbilicals for the X-33 have been developed, tested, and installed. The X-33 umbilical systems mechanisms incorporate several unique design features to simplify horizontal operations and provide reliable disconnect during launch.
Design, Development, and Testing of Umbilical System Mechanisms for the X-33 Advanced Technology Demonstrator

NASA Technical Reports Server (NTRS)

Littlefield, Alan C.; Melton, Gregory S.

1999-01-01

The X-33 Advanced Technology Demonstrator is an un-piloted, vertical take-off, horizontal landing spacecraft. The purpose of the X-33 program is to demonstrate technologies that will dramatically lower the cost of access to space. The rocket-powered X-33 will reach an altitude of up to 100 km and speeds between Mach 13 and 15. Fifteen flight tests are planned, beginning in 2000. Some of the key technologies demonstrated will be the linear aerospike engine, improved thermal protection systems, composite fuel tanks and reduced operational timelines. The X-33 vehicle umbilical connections provide monitoring, power, cooling, purge, and fueling capability during horizontal processing and vertical launch operations. Two "rise-off" umbilicals for the X-33 have been developed, tested, and installed. The X-33 umbilical systems mechanisms incorporate several unique design features to simplify horizontal operations and provide reliable disconnect during launch.
Validity and reliability of the session-RPE method for quantifying training in Australian football: a comparison of the CR10 and CR100 scales.

PubMed

Scott, Tannath J; Black, Cameron R; Quinn, John; Coutts, Aaron J

2013-01-01

The purpose of this study was to examine and compare the criterion validity and test-retest reliability of the CR10 and CR100 rating of perceived exertion (RPE) scales for team sport athletes who undertake high-intensity, intermittent exercise. Twenty-one male Australian football (AF) players (age: 19.0 ± 1.8 years, body mass: 83.92 ± 7.88 kg) participated in the first part (part A) of this study, which examined the construct validity of the session-RPE (sRPE) method for quantifying training load in AF. Ten male athletes (age: 16.1 ± 0.5 years) participated in the second part of the study (part B), which compared the test-retest reliability of the CR10 and CR100 RPE scales. In part A, the validity of the sRPE method was assessed by examining the relationships between sRPE and objective measures of internal (i.e., heart rate) and external training load (i.e., distance traveled), collected from AF training sessions. Part B of the study assessed the reliability of sRPE through examining the test-retest reliability of sRPE during 3 different intensities of controlled intermittent running (10, 11.5, and 13 km·h⁻¹). Results from part A demonstrated strong correlations for CR10- and CR100-derived sRPE with measures of internal training load (Banister's TRIMP and Edwards' TRIMP) (CR10: r = 0.83 and 0.83, and CR100: r = 0.80 and 0.81, p < 0.05). Correlations between sRPE and external training load (distance, higher speed running and player load) for both the CR10 (r = 0.81, 0.71, and 0.83) and CR100 (r = 0.78, 0.69, and 0.80) were significant (p < 0.05). Results from part B demonstrated poor reliability for both the CR10 (31.9% CV) and CR100 (38.6% CV) RPE scales after short bouts of intermittent running. Collectively, these results suggest both CR10- and CR100-derived sRPE methods have good construct validity for assessing training load in AF. 
The poor levels of reliability revealed under field testing indicate that the sRPE method may not be sensitive enough to detect small changes in exercise intensity during brief intermittent running bouts. Despite this limitation, the sRPE remains a valid method to quantify training loads in high-intensity, intermittent team sport.
Validation of the German version of the Ford Insomnia Response to Stress Test.

PubMed

Dieck, Arne; Helbig, Susanne; Drake, Christopher L; Backhaus, Jutta

2018-06-01

The purpose of this study was to assess the psychometric properties of a German version of the Ford Insomnia Response to Stress Test with groups with and without sleep problems. Three studies were analysed. Data set 1 was based on an initial screening for a sleep training program (n = 393), data set 2 was based on a study to test the test-retest reliability of the Ford Insomnia Response to Stress Test (n = 284) and data set 3 was based on a study to examine the influence of competitive sport on sleep (n = 37). Data sets 1 and 2 were used to test internal consistency, factor structure, convergent validity, discriminant validity and test-retest reliability of the Ford Insomnia Response to Stress Test. Content validity was tested using data set 3. Cronbach's alpha of the Ford Insomnia Response to Stress Test was good (α = 0.80) and test-retest reliability was satisfactory (r = 0.72). Overall, the one-factor model showed the best fit. Furthermore, significant positive correlations between the Ford Insomnia Response to Stress Test and impaired sleep quality, depression and stress reactivity were in line with the expectations regarding the convergent validity. Subjects with sleep problems had significantly higher scores in the Ford Insomnia Response to Stress Test than subjects without sleep problems (P < 0.01). Competitive athletes with higher scores in the Ford Insomnia Response to Stress Test had significantly lower sleep quality (P = 0.01), demonstrating that vulnerability for stress-induced sleep disturbances accompanies poorer sleep quality in stressful episodes. The findings show that the German version of the Ford Insomnia Response to Stress Test is a reliable and valid questionnaire to assess the vulnerability to stress-induced sleep disturbances. © 2017 European Sleep Research Society.
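Several records in this listing, including the one above, report internal consistency as Cronbach's alpha. As a minimal sketch of how that statistic is computed from item-level data, assuming the standard formula alpha = k/(k-1) * (1 - sum of item variances / variance of total scores), the following uses a hypothetical function name and made-up responses, not data from any study listed here.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondents.

    scores: one list per respondent, each holding k item scores.
    Uses sample variances (n - 1 denominator).
    """
    k = len(scores[0])
    # Variance of each item across respondents.
    item_vars = [variance([row[j] for row in scores]) for j in range(k)]
    # Variance of each respondent's total score.
    total_var = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


# Hypothetical two-item questionnaire answered by four respondents.
responses = [[1, 2], [2, 1], [3, 4], [4, 3]]
print(round(cronbach_alpha(responses), 2))  # 0.75
```

Alpha approaches 1 as the items rise and fall together across respondents; it says nothing about stability over time, which is why the abstracts report test-retest coefficients separately.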
Timeline historical review of income and financial transactions: a reliable assessment of personal finances.

PubMed

Black, Anne C; Serowik, Kristin L; Ablondi, Karen M; Rosen, Marc I

2013-01-01

The need for accurate and reliable information about income and resources available to individuals with psychiatric disabilities is critical for the assessment of need and evaluation of programs designed to alleviate financial hardship or affect finance allocation. Measurement of finances is ubiquitous in studies of economics, poverty, and social services. However, evidence has demonstrated that these measures often contain error. We compare the 1-week test-retest reliability of income and finance data from 24 adult psychiatric outpatients using assessment-as-usual (AAU) and a new instrument, the Timeline Historical Review of Income and Financial Transactions (THRIFT). Reliability estimates obtained with the THRIFT for Income (0.77), Expenses (0.91), and Debt (0.99) domains were significantly better than those obtained with AAU. Reliability estimates for Balance did not differ. THRIFT reduced measurement error and provided more reliable information than AAU for assessment of personal finances in psychiatric patients receiving Social Security benefits. The instrument also may be useful with other low-income groups.
CONSISTENCY OF FIELD-BASED MEASURES OF NEUROMUSCULAR CONTROL USING FORCE PLATE DIAGNOSTICS IN ELITE MALE YOUTH SOCCER PLAYERS

PubMed Central

READ, PAUL; OLIVER, JON L.; DE STE CROIX, MARK B.A.; MYER, GREGORY D.; LLOYD, RHODRI S.

2016-01-01

Deficits in neuromuscular control during movement patterns such as landing are suggested pathomechanics that underlie sport-related injury. A common mode of assessment is measurement of landing forces during jumping tasks; however, these measures have been used less frequently in male youth soccer players and reliability data is sparse. The aim of this study was to examine the reliability of a field-based neuromuscular control screening battery using force plate diagnostics in this cohort. Twenty six pre-peak height velocity (PHV) and twenty five post-PHV elite male youth soccer players completed a drop vertical jump (DVJ), single leg 75% horizontal hop and stick (75%HOP) and single leg countermovement jump (SLCMJ). Measures of peak landing vertical ground reaction force (pVGRF), time to stabilisation (TTS), time to pVGRF, and pVGRF asymmetry were recorded. A test, re-test design was used and reliability statistics included: change in mean, intraclass correlation coefficient (ICC) and coefficient of variation (CV). No significant differences in mean score were reported for any of the assessed variables between test sessions. In both groups, pVGRF and asymmetry during the 75%HOP and SLCMJ demonstrated largely acceptable reliability (CV ≤ 10%). Greater variability was evident in DVJ pVGRF and all other assessed variables, across the three protocols (CV range = 13.8 – 49.7%). ICC values ranged from small to large and were generally higher in the post-PHV players. The results of this study suggest that pVGRF and asymmetry can be reliably assessed using a 75%HOP and SLCMJ in this cohort. These measures could be utilized to support a screening battery for elite male youth soccer players and for test re-test comparison. PMID:27075641
Questionnaire for measuring organisational attributes in dental-care practices: psychometric properties and test-retest reliability.

PubMed

Goetz, Katja; Hasse, Philipp; Szecsenyi, Joachim; Campbell, Stephen M

2016-04-01

The consideration of organisational aspects, such as shared goals and clear communication, within the health care team is important to ensure good quality care. In primary health care, the instrument Survey of Organizational Attributes for Primary Care (SOAPC) is available to measure organisational attributes of care. However, there is no instrument available for dental care. The aim of the present study was to investigate psychometric properties and test-retest reliability of the version of SOAPC adapted for dental care, namely the Survey of Organizational Attributes in Dental Care (SOADC). The SOADC consists of 21 items in the following four subscales: communication; decision making; stress/chaos; and history of change. Convergent construct validity was measured using the job satisfaction scale. A total of 287 dental-care practices were asked to participate in the validation study. Psychometric properties and test-retest reliability were observed. A total of 43 dental-care practices responded to the survey. At baseline, 178 dental-care staff completed the questionnaire, and 4 weeks later 138 did so. Internal consistency, measured by Cronbach's alpha, was 0.718 or higher in the subscales. The test-retest reliability for each subscale and the overall SOADC score demonstrated good correlations over the 4-week test-retest interval, except for 'history of change'. A strong correlation with the aggregated job-satisfaction scale showed high convergent construct validity of SOADC. The consideration of organisational aspects from the perspective of dental-care teams is important for providing good quality of care. The SOADC is a reliable instrument with good psychometric properties and is suitable for the evaluation of organisational attributes in dental-care practices. © 2015 FDI World Dental Federation.
Test-retest reliability of posture measurements in adolescents with idiopathic scoliosis.

PubMed

Heitz, Pierre-Henri; Aubin-Fournier, Jean-François; Parent, Éric; Fortin, Carole

2018-05-07

Posture changes are a major consequence of idiopathic scoliosis (IS). Posture changes can lead to psychosocial and physical impairments in adolescents with IS. Therefore, it is important to assess posture, but the test-retest reliability of posture measurements remains unknown in this population. The primary objective was to determine the test-retest reliability of 25 head and trunk posture indices using the Clinical Photographic Postural Assessment Tool (CPPAT) in adolescents with IS. The secondary objective was to determine the standard error of measurement and the minimal detectable change. This is a prospective test-retest reliability study carried out at two tertiary university hospital centers. Forty-one adolescents with IS, aged 10 to 16 years old with curves of 10 to 45° and treated non-operatively, were recruited. Two posture assessments were done using the CPPAT five to 10 days apart following a standardized procedure. Photographs were analyzed with the CPPAT software by digitizing reference landmarks placed on the participant by a physiotherapist evaluator. Generalizability theory was used to obtain a coefficient of dependability, standard error of measurement and the minimal detectable change at the 90% confidence level. This project was supported by the Canadian Pediatric Spine Society (CPSS: $10,000). There are no study-specific conflict of interest-associated biases. Fourteen of 25 posture indices had a good reliability (ϕ ≥ 0.78), ten of 25 had moderate reliability (ϕ = 0.55 to 0.74) and one had poor reliability (ϕ = 0.45). The most reliable posture indices were waist angles asymmetry (ϕ = 0.93), right waist angle (ϕ = 0.91) and frontal trunk list (ϕ = 0.92). Right sagittal trunk list was the least reliable posture index (ϕ = 0.45). The MDC90 values ranged from 2.6 to 10.3° for angular measurements and from 8.4 to 35.1 mm for linear measurements. 
This study demonstrates that most posture indices, especially the trunk posture indices, are reproducible in time among adolescents with IS and provides reference values. Clinicians and researchers can use these reference values in order to assess change in posture over time attributable to treatment effectiveness. Copyright © 2018. Published by Elsevier Inc.
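The relationship between the standard error of measurement (SEM) and the minimal detectable change at 90% confidence (MDC90) reported above can be sketched as follows. This assumes the commonly used conversion MDC90 = 1.645 × √2 × SEM; the abstract does not state which formula the authors applied, and the SEM value in the example is hypothetical.

```python
import math

Z_90 = 1.645  # standard normal quantile for a two-sided 90% interval


def minimal_detectable_change(sem, z=Z_90):
    """MDC from SEM: the sqrt(2) accounts for error in both measurements."""
    return z * math.sqrt(2) * sem


def sem_from_mdc(mdc, z=Z_90):
    """Inverse conversion, useful when only the MDC is reported."""
    return mdc / (z * math.sqrt(2))


# Hypothetical SEM of 2.0 degrees for an angular posture index.
print(round(minimal_detectable_change(2.0), 2))  # 4.65
```

Under this assumption, a change between two assessments smaller than the MDC90 cannot be distinguished from measurement error at the 90% level.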
Robust hard-solder packaging of conduction cooled laser diode bars

NASA Astrophysics Data System (ADS)

Schleuning, David; Griffin, Mike; James, Phillip; McNulty, John; Mendoza, Dan; Morales, John; Nabors, David; Peters, Mike; Zhou, Hailong; Reed, Murray

2007-02-01

We present the reliability of high-power laser diodes utilizing hard solder (AuSn) on a conduction-cooled package (HCCP). We present results of 50 W hard-pulse operation at 8xx nm and demonstrate a reliability of MTTF > 27 khrs (90% CL), which is an order of magnitude improvement over traditional packaging. We also present results at 9xx nm with a reliability of MTTF >17 khrs (90% CL) at 75 W. We discuss finite element analysis (FEA) modeling and time dependent temperature measurements combined with experimental life-test data to quantify true hard-pulse operation. We also discuss FEA and measured stress profiles across laser bars comparing soft and hard solder packaging.
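A statement such as "MTTF > 27 khrs (90% CL)" is a one-sided lower confidence bound from a life test. The abstract does not give the test hours, failure counts, or the statistical model used, so the sketch below is only an illustration under stated assumptions: exponentially distributed lifetimes (constant failure rate), a time-terminated test, and hypothetical inputs. The bound is found by solving P(Poisson(T/MTTF) ≤ r) = 1 − CL for MTTF, which is equivalent to the usual chi-square formula 2T/χ²(CL, 2r+2).

```python
import math


def poisson_cdf(r, mu):
    """P(X <= r) for X ~ Poisson(mu)."""
    term = math.exp(-mu)
    total = term
    for k in range(1, r + 1):
        term *= mu / k
        total += term
    return total


def mttf_lower_bound(device_hours, failures, confidence=0.90):
    """One-sided lower confidence bound on MTTF (exponential lifetimes).

    device_hours: total accumulated test time over all units.
    failures: number of failures observed during the test.
    """
    alpha = 1.0 - confidence
    lo, hi = 0.0, 1.0
    # Grow the bracket until the Poisson CDF falls below alpha.
    while poisson_cdf(failures, hi) > alpha:
        hi *= 2.0
    # Bisection on the expected failure count mu = device_hours / MTTF.
    for _ in range(200):
        mid = (lo + hi) / 2.0
        if poisson_cdf(failures, mid) > alpha:
            lo = mid
        else:
            hi = mid
    return device_hours / ((lo + hi) / 2.0)


# Hypothetical test: 62,170 device-hours with zero failures.
print(round(mttf_lower_bound(62_170, 0, 0.90)))  # 27000
```

With zero failures the bound reduces to T / ln(1/(1 − CL)), so about 2.3 times the claimed MTTF in failure-free device-hours is needed at 90% confidence; each observed failure raises the required test time.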

Robotic Assembly of Truss Structures for Space Systems and Future Research Plans

NASA Technical Reports Server (NTRS)

Doggett, William

2002-01-01

Many initiatives under study by both the space science and earth science communities require large space systems, i.e. with apertures greater than 15 m or dimensions greater than 20 m. This paper reviews the effort in NASA Langley Research Center's Automated Structural Assembly Laboratory which laid the foundations for robotic construction of these systems. In the Automated Structural Assembly Laboratory reliable autonomous assembly and disassembly of an 8 meter planar structure composed of 102 truss elements covered by 12 panels was demonstrated. The paper reviews the hardware and software design philosophy which led to reliable operation during weeks of near continuous testing. Special attention is given to highlight the features enhancing assembly reliability.
Psychometric Properties of Translation of the Child Perception Questionnaire (CPQ11-14) in Telugu Speaking Indian Children.

PubMed

Kumar, Santhosh; Kroon, Jeroen; Lalloo, Ratilal; Johnson, Newell W

2016-01-01

Oral health related quality of life research among children in India is still nascent and no measures have been validated to date. Although CPQ11-14 has been previously used in studies from the Indian sub-continent, the instrument has never been tested for cross-cultural adaptability. This study aimed to assess the validity and reliability of CPQ11-14 in Telugu speaking Indian school children. Primary school children of Medak district, Telangana State, India, were recruited by a multi-stage probability sampling method. The translated questionnaire was initially pilot tested on a small subset of children (n = 40). Children with informed consent from parents (N = 1342) were then provided with questionnaires containing the Telugu translation of CPQ11-14, followed by a clinical examination conducted by a single examiner, using Basic WHO survey methods for dental caries, malocclusion, and Dean's Fluorosis index. Children (n = 161) in randomly chosen schools were re-administered the same questionnaire after a two week interval to test reliability of CPQ11-14 on repeated administrations. Internal consistency and test-retest reliability as determined by Cronbach's alpha and Intra-class correlation coefficient for overall CPQ11-14 scale were 0.925 and 0.923, respectively. CPQ11-14 discriminated between the categories of fluorosis and malocclusion while its discriminant validity with respect to dental caries was limited. CPQ11-14 also demonstrated good construct validity with both overall CPQ11-14 and its subscales having significant positive correlation with global ratings of oral health and overall wellbeing, even after adjusting for confounding variables. CPQ11-14 had a correlation of 0.405 with self-evaluated oral health and 0.407 with self-evaluated impact of oral health on overall wellbeing. In conclusion, Telugu translation of CPQ11-14 demonstrated good internal consistency and excellent reliability on repeated administrations after two weeks. 
It also exhibited good discriminant and construct validity.
Development and validation of the Myasthenia Gravis Impairment Index.

PubMed

Barnett, Carolina; Bril, Vera; Kapral, Moira; Kulkarni, Abhaya; Davis, Aileen M

2016-08-30

We aimed to develop a measure of myasthenia gravis impairment using a previously developed framework and to evaluate reliability and validity, specifically face, content, and construct validity. The first draft of the Myasthenia Gravis Impairment Index (MGII) included examination items from available measures enriched with newly developed, patient-reported items, modified after patient input. International neuromuscular specialists evaluated face and content validity via an e-mail survey. Test-retest reliability was assessed in stable patients at a 3-week interval and interrater reliability was evaluated on the same day. Construct validity was assessed through correlations between the MGII and other measures and by comparing scores in different patient groups. The first draft was assessed by 18 patients, and 72 specialists answered the survey. The second draft had 7 examination and 22 patient-reported items. Field testing included 200 patients, with 54 patients completing the reliability studies. Test-retest reliability of the total score was good (intraclass correlation coefficient 0.92; 95% confidence interval 0.79-0.94), as was interrater reliability of the examination component (intraclass correlation coefficient 0.81; 95% confidence interval 0.79-0.94). The MGII correlated well with comparison measures, with higher correlations with the MG-activities of daily living (r = 0.91) and MG-specific quality of life 15-item scale (r = 0.78). When assessing different patient groups, the scores followed expected patterns. The MGII was developed using a patient-centered framework of myasthenia-related impairments and incorporating patient input throughout the development process. It is reliable in an outpatient setting and has demonstrated construct validity. Responsiveness studies are under way. © 2016 American Academy of Neurology.
The multiple sclerosis work difficulties questionnaire: translation and cross-cultural adaptation to Turkish and assessment of validity and reliability.

PubMed

Kahraman, Turhan; Özdoğar, Asiye Tuba; Honan, Cynthia Alison; Ertekin, Özge; Özakbaş, Serkan

2018-05-09

To linguistically and culturally adapt the Multiple Sclerosis Work Difficulties Questionnaire-23 (MSWDQ-23) for use in Turkey, and to examine its reliability and validity. Following standard forward-back translation of the MSWDQ-23, it was administered to 124 people with multiple sclerosis (MS). Validity was evaluated using related outcome measures including those related to employment status and expectations, disability level, fatigue, walking, and quality of life. Randomly selected participants were asked to complete the MSWDQ-23 again to assess test-retest reliability. Confirmatory factor analysis on the MSWDQ-23 demonstrated a good fit for the data, and the internal consistency of each subscale was excellent. The test-retest reliability for the total score and for the psychological/cognitive barriers, physical barriers, and external barriers subscales was high. The MSWDQ-23 and its subscales were positively correlated with the employment, disability level, walking, and fatigue outcome measures. This study suggests that the Turkish version of the MSWDQ-23 has high reliability and adequate validity, and it can be used to determine the difficulties faced by people with multiple sclerosis in the workplace. Moreover, the study provides evidence about the test-retest reliability of the questionnaire. Implications for rehabilitation Multiple sclerosis affects young people of working age. Understanding work-related problems is crucial to improving the likelihood that people with multiple sclerosis maintain their jobs. The Multiple Sclerosis Work Difficulties Questionnaire-23 (MSWDQ-23) is a valid and reliable measure of perceived workplace difficulties in people with multiple sclerosis: we present its validation in Turkish. Professionals working in the field of vocational rehabilitation may benefit from using the MSWDQ-23 to predict the current work outcomes and future employment expectations.
Reliability of a quantitative clinical posture assessment tool among persons with idiopathic scoliosis.

PubMed

Fortin, Carole; Feldman, Debbie Ehrmann; Cheriet, Farida; Gravel, Denis; Gauthier, Frédérique; Labelle, Hubert

2012-03-01

To determine overall, test-retest and inter-rater reliability of posture indices among persons with idiopathic scoliosis. A reliability study using two raters and two test sessions. Tertiary care paediatric centre. Seventy participants aged between 10 and 20 years with different types of idiopathic scoliosis (Cobb angle 15 to 60°) were recruited from the scoliosis clinic. Based on the XY co-ordinates of natural reference points (e.g., eyes) as well as markers placed on several anatomical landmarks, 32 angular and linear posture indices taken from digital photographs in the standing position were calculated from a specially developed software program. Generalisability theory served to estimate the reliability and standard error of measurement (SEM) for the overall, test-retest and inter-rater designs. Bland and Altman's method was also used to document agreement between sessions and raters. In the random design, dependability coefficients demonstrated a moderate level of reliability for six posture indices (ϕ=0.51 to 0.72) and a good level of reliability for 26 posture indices out of 32 (ϕ≥0.79). Error attributable to marker placement was negligible for most indices. Limits of agreement and SEM values were larger for shoulder protraction, trunk list, Q angle, cervical lordosis and scoliosis angles. The most reproducible indices were waist angles and knee valgus and varus. Posture can be assessed in a global fashion from photographs in persons with idiopathic scoliosis. Despite the good reliability of marker placement, other studies are needed to minimise measurement errors in order to provide a suitable tool for monitoring change in posture over time. Copyright © 2011 Chartered Society of Physiotherapy. Published by Elsevier Ltd. All rights reserved.
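The Bland and Altman agreement analysis mentioned in the abstract above can be illustrated with a minimal sketch. It assumes paired measurements of one posture index from two sessions and the conventional 95% limits (bias ± 1.96 × SD of the differences); the function name and the sample values are hypothetical and are not taken from the study.

```python
from statistics import mean, stdev

def limits_of_agreement(session_a, session_b):
    """Return (bias, lower limit, upper limit) for paired measurements.

    bias is the mean of the paired differences; the limits are
    bias -/+ 1.96 times the sample SD of those differences.
    """
    diffs = [a - b for a, b in zip(session_a, session_b)]
    bias = mean(diffs)
    spread = 1.96 * stdev(diffs)
    return bias, bias - spread, bias + spread

# Hypothetical angles (degrees) for three participants, two sessions
bias, lower, upper = limits_of_agreement([10, 12, 14], [9, 12, 15])
```

Wide limits relative to the change one hopes to detect would suggest the index is poorly suited to monitoring posture over time, which is the concern the abstract raises for indices such as trunk list and Q angle.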
Development and validation of an objective instrument to measure surgical performance at tonsillectomy.

PubMed

Roberson, David W; Kentala, Erna; Forbes, Peter

2005-12-01

The goals of this project were 1) to develop and validate an objective instrument to measure surgical performance at tonsillectomy, 2) to assess its interobserver and interobservation reliability and construct validity, and 3) to select those items with best reliability and most independent information to design a simplified form suitable for routine use in otolaryngology surgical evaluation. Prospective, observational data collection for an educational quality improvement project. The evaluation instrument was based on previous instruments developed in general surgery with input from attending otolaryngologic surgeons and experts in medical education. It was pilot tested and subjected to iterative improvements. After the instrument was finalized, a total of 55 tonsillectomies were observed and scored during academic year 2002 to 2003: 45 cases by residents at different points during their rotation, 5 by fellows, and 5 by faculty. Results were assessed for interobserver reliability, interobservation reliability, and construct validity. Factor analysis was used to identify items with independent information. Interobserver and interobservation reliability was high. On technical items, faculty substantially outperformed fellows, who in turn outperformed residents (P < .0001 for both comparisons). On the "global" scale (overall assessment), residents improved an average of 1 full point (on a 5 point scale) during a 3 month rotation (P = .01). In the subscale of "patient care," results were less clear cut: fellows outperformed residents, who in turn outperformed faculty, but only the fellows to faculty comparison was statistically significant (P = .04), and residents did not clearly improve over time (P = .36). Factor analysis demonstrated that technical items and patient care items factor separately and thus represent separate skill domains in surgery. 
It is possible to objectively measure surgical skill at tonsillectomy with high reliability and good construct validity. Factor analysis demonstrated that patient care is a distinct domain in surgical skill. Although the interobserver reliability for some patient care items reached statistical significance, it was not high enough for "high stakes testing" purposes. Using reliability and factor analysis results, we propose a simplified instrument for use in evaluating trainees in otolaryngologic surgery.
Test-retest reliability of cardinal plane isokinetic hip torque and EMG.

PubMed

Claiborne, Tina L; Timmons, Mark K; Pincivero, Danny M

2009-10-01

The objective of the present study was to establish test-retest reliability of isokinetic hip torque and prime mover electromyogram (EMG) through the three cardinal planes of motion. Thirteen healthy young adults participated in two experimental sessions, separated by approximately one week. During each session, isokinetic hip torque was evaluated on the Biodex Isokinetic Dynamometer at a velocity of 60 deg/s. Subjects performed three maximal-effort concentric and eccentric contractions, separately, for right and left hip abduction/adduction, flexion/extension, and internal/external rotation. Surface EMGs were sampled from the gluteus maximus, gluteus medius, adductor, medial and lateral hamstring, and rectus femoris muscles during all contractions. Intraclass correlation coefficients (ICC - 2,1) and standard errors of measurement (SEM) were calculated for peak torque for each movement direction and contraction mode, while ICCs were only computed for the EMG data. Motions that demonstrated high torque reliability included concentric hip abduction (right and left), flexion (right and left), extension (right) and internal rotation (right and left), and eccentric hip abduction (left), adduction (left), flexion (right), and extension (right and left) (ICC range=0.81-0.91). Motions with moderate torque reliability included concentric hip adduction (right), extension (left), internal rotation (left), and external rotation (right), and eccentric hip abduction and adduction (right), flexion (left), internal rotation (right and left), and external rotation (right and left) (ICC range=0.49-0.79). The majority of the EMG sampled muscles (n=12 and n=11 for concentric and eccentric contractions, respectively) demonstrated high reliability (ICC=0.81-0.95). Instances of low, or unacceptable, EMG reliability values occurred for the medial hamstring muscle of the left leg (both contraction modes) and the adductor muscle of the right leg during eccentric internal rotation. 
The major finding revealed high and moderate levels of between-day reliability of isokinetic hip peak torque and prime mover EMG. It is recommended that the day-to-day variability estimates concomitant with acceptable levels of reliability be considered when attempting to objectify intervention effects on hip muscle performance.
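The ICC(2,1) and SEM statistics named in the abstract above can be sketched as follows. This is an illustrative implementation of the Shrout and Fleiss two-way random-effects, single-measure formula, not the authors' analysis code; the function names are hypothetical, and the SEM uses the common convention SD × √(1 − ICC) with the SD pooled over all scores, which is an assumption.

```python
from math import sqrt
from statistics import mean, stdev

def icc_2_1(scores):
    """ICC(2,1) for a table with one row per subject and one column per session."""
    n, k = len(scores), len(scores[0])
    grand = mean(v for row in scores for v in row)
    row_means = [mean(row) for row in scores]
    col_means = [mean(row[j] for row in scores) for j in range(k)]
    # Two-way ANOVA sums of squares
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((v - grand) ** 2 for row in scores for v in row)
    ss_err = ss_total - ss_rows - ss_cols
    msr = ss_rows / (n - 1)              # between-subjects mean square
    msc = ss_cols / (k - 1)              # between-sessions mean square
    mse = ss_err / ((n - 1) * (k - 1))   # residual mean square
    return (msr - mse) / (msr + (k - 1) * mse + k * (msc - mse) / n)

def standard_error_of_measurement(scores, icc):
    """SEM = SD * sqrt(1 - ICC), with SD pooled over all scores (assumed convention)."""
    return stdev([v for row in scores for v in row]) * sqrt(1 - icc)
```

For a test-retest design like the one described, each row would hold one subject's peak torque from session 1 and session 2.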
Whole Arm Water Displacement Volumetry Is a Reliable and Sensitive Measure: A Pilot to Assess Acute Postburn Volume Change.

PubMed

Edgar, Dale W; Briffa, N Kathy; Wood, Fiona M

Water displacement volumetry (WDV) is a reliable method for measurement of wrist and hand volume in lymphedema patients. However, within session WDV reliability for the whole upper limb (UL) lacks comprehensive investigation, particularly in acute edema populations. This study aimed to confirm the reliability and investigate the impact of time between repeated trials on the sensitivity of WDV as a measure of whole UL volume change in an uninjured cohort and a burn injured pilot group. Within session, duplicate measures of whole UL WDV were recorded in two groups of noninjured volunteers and a group of burn patients. Each noninjured group differed only in the time between WDV repeats. The reliability trials were performed <10 minutes apart (T10) and 20 to 30 minutes apart (T20). The time between repetitions for burn patients was 20 to 30 minutes, based on the results of the noninjured participant trials. All trial groups demonstrated excellent correlation between trials (ICCT10 = 0.999, ICCT20 = 0.997). The minimum detectable difference calculated for WDV when measuring whole UL volume change was >50 ml for noninjured and >100 ml for burn patients. Despite this, a systematic bias was demonstrated between the T10 group means. The T20 group trials did not indicate such error on statistical testing (P = .297). The study confirms that WDV measurement of whole ULs is reliable and sensitive, if used at least 20 minutes apart. However, a significant and clinically relevant subject-by-method interaction was demonstrated. Researchers and clinicians are reminded to be aware of the performance of the technique when designing investigations in patient populations.
Relative and absolute reliability of the clinical version of the Narrow Path Walking Test (NPWT) under single and dual task conditions.

PubMed

Gimmon, Yoav; Jacob, Grinshpon; Lenoble-Hoskovec, Constanze; Büla, Christophe; Melzer, Itshak

2013-01-01

Decline in gait stability has been associated with increased fall risk in older adults. Reliable and clinically feasible methods of gait instability assessment are needed. This study evaluated the relative and absolute reliability and concurrent validity of the testing procedure of the clinical version of the Narrow Path Walking Test (NPWT) under single task (ST) and dual task (DT) conditions. Thirty independent community-dwelling older adults (65-87 years) were tested twice. Participants were instructed to walk within the 6-m narrow path without stepping out. Trial time, number of steps, trial velocity, number of step errors, and number of cognitive task errors were determined. Intraclass correlation coefficients (ICCs) were calculated as indices of agreement, and a graphic approach called "mountain plot" was applied to help interpret the direction and magnitude of disagreements between testing procedures. Smallest detectable change and smallest real difference (SRD) were computed to determine clinically relevant improvement at group and individual levels, respectively. Concurrent validity was assessed using Performance Oriented Mobility Assessment Tool (POMA) and the Short Physical Performance Battery (SPPB). Test-retest agreement (ICC1,2) varied from 0.77 to 0.92 in ST and from 0.78 to 0.92 in DT conditions, with no apparent systematic differences between testing procedures demonstrated by the mountain plot graphs. Smallest detectable change and smallest real change were small for motor task performance and larger for cognitive errors. Significant correlations were observed for trial velocity and trial time with POMA and SPPB. The present results indicate that the NPWT testing procedure is highly reliable and reproducible. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.
Cross-cultural adaptation and validation of the Italian version of the Kerlan-Jobe Orthopaedic Clinic Shoulder and Elbow score.

PubMed

Merolla, Giovanni; Corona, Katia; Zanoli, Gustavo; Cerciello, Simone; Giannotti, Stefano; Porcellini, Giuseppe

2017-12-01

The Kerlan-Jobe Orthopaedic Clinic (KJOC) Shoulder and Elbow score is a reliable and sensitive tool to measure the performance of overhead athletes. The purpose of this study was to carry out a cross-cultural adaptation and validation of the KJOC questionnaire in Italian and to assess its reliability, validity, and responsiveness. Ninety professional athletes with a painful shoulder were included in this study and were assigned to the "injury group" (n = 32) or the "overuse group" (n = 58); 65 were managed conservatively and 25 were treated by arthroscopic surgery. To assess the reliability of the KJOC score, patients were asked to fill in the questionnaire at baseline and after 2 weeks. To test the construct validity, KJOC scores were compared to those obtained with the Italian version of the Disabilities of the Arm, Shoulder, and Hand (DASH) scale, and with the DASH sports/performing arts module. To test KJOC score responsiveness, the follow-up KJOC scores of the participants treated conservatively were compared to those of the patients treated by arthroscopic surgery. Statistical analysis demonstrated that the KJOC questionnaire is reliable in terms of the single items and the overall score (ICC 0.95-0.99); that it has high construct validity (rs = -0.697; p < 0.01); and that it is responsive to clinical differences in shoulder function (p < 0.0001). The Italian version of the KJOC Shoulder and Elbow score performed in a similar way to the English version and demonstrated good validity, reliability, and responsiveness after conservative and surgical treatment. II.
Reliability and validity of an arabic version of the dyspnea-12 questionnaire for Saudi nationals with chronic obstructive pulmonary disease.

PubMed

Alyami, Mohammed M; Jenkins, Sue C; Lababidi, Hani; Hill, Kylie

2015-01-01

Dyspnea is a distressing symptom experienced by people with chronic obstructive pulmonary disease (COPD). The dyspnea-12 (D-12) questionnaire comprises 12 items and assesses the quality of this symptom, its severity and the emotional response. The original (English) version of the D-12 is reliable and valid for the measurement of dyspnea in pulmonary diseases. To translate the D-12 into Arabic and determine whether this version is reliable and valid in Saudi nationals with COPD. The D-12 was translated into Arabic and reviewed by an expert panel before being back-translated into English. The Arabic version was administered to five patients with COPD to test whether it was easily understood, after which a final Arabic version was produced. Thereafter, 40 patients with COPD (aged 63 ± 9 years; 33 [82.5%] males; forced expiratory volume in one second (FEV1) 47 ± 16% predicted) completed the D-12, the COPD Assessment Test (CAT) and the Chronic Respiratory Disease Questionnaire (CRDQ). Lung function and 6-minute walk distance were also measured. The D-12 was re-administered two weeks later. The Arabic version of the D-12 demonstrated good reliability over the two administrations (intraclass correlation coefficient = 0.94, P = 0.01). Strong associations were demonstrated between the (1) total score for the D-12 and the CAT, (2) quality sub-score of the D-12 and the CAT and (3) emotional response sub-score of the D-12 and emotional function domain of the CRDQ (r ≥ 0.6, all P < 0.01). The Arabic version of the D-12 is a reliable and valid instrument in Saudi nationals with COPD.
Digital electronic engine control history

NASA Technical Reports Server (NTRS)

Putnam, T. W.

1984-01-01

Full authority digital electronic engine controls (DEECs) were studied, developed, and ground tested because of projected benefits in operability, improved performance, reduced maintenance, improved reliability, and lower life cycle costs. The issues of operability and improved performance, however, are assessed in a flight test program. The DEEC on a F100 engine in an F-15 aircraft was demonstrated and evaluated. The events leading to the flight test program are chronicled and important management and technical results are identified.
Psychometric Properties of the Adolescent Reinforcement Survey Schedule – Alcohol Use Version with College Student Drinkers

PubMed Central

Hallgren, Kevin A.; Greenfield, Brenna L.; Ladd, Benjamin O.

2016-01-01

Background Behavioral economic theories of drinking posit that the reinforcing value of engaging in activities with versus without alcohol influences drinking behavior. Measures of the reinforcement value of drugs and alcohol have been used in previous research, but little work has examined the psychometric properties of these measures. Objectives The present study aims to evaluate the factor structure, test-retest reliability, and concurrent validity of an alcohol-only version of the Adolescent Reinforcement Survey Schedule (ARSS-AUV). Methods A sample of 157 college student drinkers completed the ARSS-AUV at two time points 2–3 days apart. Test-retest reliability, hierarchical factor analysis, and correlations with other drinking measures were examined. Results Single, unidimensional general factors accounted for a majority of the variance in alcohol and alcohol-free reinforcement items. Residual factors emerged that typically represented alcohol or alcohol-free reinforcement while doing activities with friends, romantic or sexual partners, and family members. Individual ARSS-AUV items had fair-to-good test-retest reliability, while general and residual factors had excellent test-retest reliability. General alcohol reinforcement and alcohol reinforcement from friends and romantic partners were positively correlated with past-year alcohol consumption, heaviest drinking episode, and alcohol-related negative consequences. Alcohol-free reinforcement indices were unrelated to alcohol use or consequences. Conclusions/Importance The ARSS-AUV appears to demonstrate good reliability and mixed concurrent validity among college student drinkers. The instrument may provide useful information about alcohol reinforcement from various activities and people and could provide clinically-relevant information for prevention and treatment programs. PMID:27096713
Psychometric Properties of the Adolescent Reinforcement Survey Schedule-Alcohol Use Version with College Student Drinkers.

PubMed

Hallgren, Kevin A; Greenfield, Brenna L; Ladd, Benjamin O

2016-06-06

Behavioral economic theories of drinking posit that the reinforcing value of engaging in activities with versus without alcohol influences drinking behavior. Measures of the reinforcement value of drugs and alcohol have been used in previous research, but little work has examined the psychometric properties of these measures. The present study aims to evaluate the factor structure, test-retest reliability, and concurrent validity of an alcohol-only version of the Adolescent Reinforcement Survey Schedule (ARSS-AUV). A sample of 157 college student drinkers completed the ARSS-AUV at two time points 2-3 days apart. Test-retest reliability, hierarchical factor analysis, and correlations with other drinking measures were examined. Single, unidimensional general factors accounted for a majority of the variance in alcohol and alcohol-free reinforcement items. Residual factors emerged that typically represented alcohol or alcohol-free reinforcement while doing activities with friends, romantic or sexual partners, and family members. Individual ARSS-AUV items had fair-to-good test-retest reliability, while general and residual factors had excellent test-retest reliability. General alcohol reinforcement and alcohol reinforcement from friends and romantic partners were positively correlated with past-year alcohol consumption, heaviest drinking episode, and alcohol-related negative consequences. Alcohol-free reinforcement indices were unrelated to alcohol use or consequences. The ARSS-AUV appears to demonstrate good reliability and mixed concurrent validity among college student drinkers. The instrument may provide useful information about alcohol reinforcement from various activities and people and could provide clinically-relevant information for prevention and treatment programs.
Multiple Sclerosis Walking Scale-12, translation, adaptation and validation for the Persian language population.

PubMed

Nakhostin Ansari, Noureddin; Naghdi, Soofia; Mohammadi, Roghaye; Hasson, Scott

2015-02-01

The Multiple Sclerosis Walking Scale-12 (MSWS-12) is a multi-item rating scale used to assess the perspectives of patients about the impact of MS on their walking ability. The aim of this study was to examine the reliability and validity of the MSWS-12 in Persian speaking patients with MS. The MSWS-12 questionnaire was translated into Persian language according to internationally adopted standards involving forward-backward translation, reviewed by an expert committee and tested on the pre-final version. In this cross-sectional study, 100 participants (50 patients with MS and 50 healthy subjects) were included. The MSWS-12 was administered twice 7 days apart to 30 patients with MS for test and retest reliability. Internal consistency reliability was Cronbach's α 0.96 for test and 0.97 for retest. There were no significant floor or ceiling effects. Test-retest reliability was excellent (intraclass correlation coefficient [ICC] agreement of 0.98, 95% CI, 0.95-0.99) confirming the reproducibility of the Persian MSWS-12. Construct validity using known group methods was demonstrated through a significant difference in the Persian MSWS-12 total score between the patients with MS and healthy subjects. Factor analysis extracted 2 latent factors (79.24% of the total variance). A second factor analysis suggested the 9-item Persian MSWS as a unidimensional scale for patients with MS. The Persian MSWS-12 was found to be valid and reliable for assessing walking ability in Persian speaking patients with MS. Copyright © 2014 Elsevier B.V. All rights reserved.
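Cronbach's α, the internal-consistency statistic reported in the abstract above, can be computed with a short sketch. It assumes a table with one row per respondent and one column per questionnaire item and uses the standard formula α = k/(k − 1) × (1 − Σ item variances / variance of total scores); the function name is hypothetical and the code is illustrative rather than the authors' procedure.

```python
from statistics import variance

def cronbach_alpha(responses):
    """Cronbach's alpha for rows of respondents and columns of item scores."""
    k = len(responses[0])
    # Sample variance of each item across respondents
    item_variances = [variance([row[j] for row in responses]) for j in range(k)]
    # Sample variance of each respondent's total score
    total_variance = variance([sum(row) for row in responses])
    return k / (k - 1) * (1 - sum(item_variances) / total_variance)
```

Values close to 1, such as the 0.96 and 0.97 reported here, indicate that the items vary together strongly across respondents.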
Development and validation of the Survey of Organizational Research Climate (SORC).

PubMed

Martinson, Brian C; Thrush, Carol R; Lauren Crain, A

2013-09-01

Development and targeting efforts by academic organizations to effectively promote research integrity can be enhanced if they are able to collect reliable data to benchmark baseline conditions, to assess areas needing improvement, and to subsequently assess the impact of specific initiatives. To date, no standardized and validated tool has existed to serve this need. A web- and mail-based survey was administered in the second half of 2009 to 2,837 randomly selected biomedical and social science faculty and postdoctoral fellows at 40 academic health centers in top-tier research universities in the United States. Measures included the Survey of Organizational Research Climate (SORC) as well as measures of perceptions of organizational justice. Exploratory and confirmatory factor analyses yielded seven subscales of organizational research climate, all of which demonstrated acceptable internal consistency (Cronbach's α ranging from 0.81 to 0.87) and adequate test-retest reliability (Pearson r ranging from 0.72 to 0.83). A broad range of correlations between the seven subscales and five measures of organizational justice (unadjusted regression coefficients ranging from 0.13 to 0.95) document both construct and discriminant validity of the instrument. The SORC demonstrates good internal (alpha) and external reliability (test-retest) as well as both construct and discriminant validity.
Development and Validation of the Survey of Organizational Research Climate (SORC)

PubMed Central

Martinson, Brian C.; Thrush, Carol R.; Crain, A. Lauren

2012-01-01

Background Development and targeting efforts by academic organizations to effectively promote research integrity can be enhanced if they are able to collect reliable data to benchmark baseline conditions, to assess areas needing improvement, and to subsequently assess the impact of specific initiatives. To date, no standardized and validated tool has existed to serve this need. Methods A web- and mail-based survey was administered in the second half of 2009 to 2,837 randomly selected biomedical and social science faculty and postdoctoral fellows at 40 academic health centers in top-tier research universities in the United States. Measures included the Survey of Organizational Research Climate (SORC) as well as measures of perceptions of organizational justice. Results Exploratory and confirmatory factor analyses yielded seven subscales of organizational research climate, all of which demonstrated acceptable internal consistency (Cronbach’s α ranging from 0.81 to 0.87) and adequate test-retest reliability (Pearson r ranging from 0.72 to 0.83). A broad range of correlations between the seven subscales and five measures of organizational justice (unadjusted regression coefficients ranging from .13 to .95) document both construct and discriminant validity of the instrument. Conclusions The SORC demonstrates good internal (alpha) and external reliability (test-retest) as well as both construct and discriminant validity. PMID:23096775
Remote FLS testing in the real world: ready for "prime time".

PubMed

Okrainec, Allan; Vassiliou, Melina; Jimenez, M Carolina; Henao, Oscar; Kaneva, Pepa; Matt Ritter, E

2016-07-01

Maintaining the existing FLS test centers requires considerable investment in human and financial resources. It can also be particularly challenging for those outside of North America to become certified due to the limited number of international test centers. Preliminary work suggests that it is possible to reliably score the FLS manual skills component remotely using low-cost videoconferencing technology. Significant work remains to ensure that testing procedures adhere to standards defined by SAGES for this approach to be considered equivalent to standard on-site testing. To validate the integrity and validity of the FLS manual skills examination administered remotely in a real-world environment according to FLS testing protocols and to evaluate participants' experience with the setting. Individuals with various levels of training from the University of Toronto completed a pre- and a post-test questionnaire. Participants presented to one of the two FLS testing rooms available for the study, each connected via Skype to a separate room with a FLS proctor who administered and scored the test remotely (RP). An on-site proctor (OP) was present in the room as a control. An invigilator was also present in the testing room to follow directions from the RP and ensure the integrity of test materials. Twenty-one participants were recruited, and 20 completed the test. There was no significant difference between scores by RP and OP. Interrater reliability between the RP and OP was excellent. One critical error was missed by the RP, but this would not have affected the test outcome. Participants reported being highly satisfied. We demonstrate that proctors located remotely can administer the FLS skills test in a secure and reliable fashion, with excellent interrater reliability compared to an on-site proctor. Remote proctoring of the FLS examination could become a strategy to increase certification rates while containing costs.
Thermal Expert System (TEXSYS): Systems autonomy demonstration project, volume 1. Overview

NASA Technical Reports Server (NTRS)

Glass, B. J. (Editor)

1992-01-01

The Systems Autonomy Demonstration Project (SADP) produced a knowledge-based real-time control system for control and fault detection, isolation, and recovery (FDIR) of a prototype two-phase Space Station Freedom external active thermal control system (EATCS). The Thermal Expert System (TEXSYS) was demonstrated in recent tests to be capable of reliable fault anticipation and detection, as well as ordinary control of the thermal bus. Performance requirements were addressed by adopting a hierarchical symbolic control approach, layering model-based expert system software on a conventional, numerical data acquisition and control system. The model-based reasoning capabilities of TEXSYS were shown to be advantageous over typical rule-based expert systems, particularly for detection of unforeseen faults and sensor failures. Volume 1 gives a project overview and testing highlights. Volume 2 provides detail on the EATCS test bed, test operations, and online test results. Appendix A is a test archive, while Appendix B is a compendium of design and user manuals for the TEXSYS software.
Three-dimensional assessment of the asymptomatic and post-stroke shoulder: intra-rater test-retest reliability and within-subject repeatability of the palpation and digitization approach.

PubMed

Pain, Liza A M; Baker, Ross; Sohail, Qazi Zain; Richardson, Denyse; Zabjek, Karl; Mogk, Jeremy P M; Agur, Anne M R

2018-03-23

Altered three-dimensional (3D) joint kinematics can contribute to shoulder pathology, including post-stroke shoulder pain. Reliable assessment methods enable comparative studies between asymptomatic shoulders of healthy subjects and painful shoulders of post-stroke subjects, and could inform treatment planning for post-stroke shoulder pain. The study purpose was to establish intra-rater test-retest reliability and within-subject repeatability of a palpation/digitization protocol, which assesses 3D clavicular/scapular/humeral rotations, in asymptomatic and painful post-stroke shoulders. Repeated measurements of 3D clavicular/scapular/humeral joint/segment rotations were obtained using palpation/digitization in 32 asymptomatic and six painful post-stroke shoulders during four reaching postures (rest/flexion/abduction/external rotation). Intra-class correlation coefficients (ICCs), standard error of the measurement and 95% confidence intervals were calculated. All ICC values indicated high to very high test-retest reliability (≥0.70), with lower reliability for scapular anterior/posterior tilt during external rotation in asymptomatic subjects, and scapular medial/lateral rotation, humeral horizontal abduction/adduction and axial rotation during abduction in post-stroke subjects. All standard error of measurement values demonstrated within-subject repeatability error ≤5° for all clavicular/scapular/humeral joint/segment rotations (asymptomatic ≤3.75°; post-stroke ≤5.0°), except for humeral axial rotation (asymptomatic ≤5°; post-stroke ≤15°). This noninvasive, clinically feasible palpation/digitization protocol was reliable and repeatable in asymptomatic shoulders, and in a smaller sample of painful post-stroke shoulders. Implications for Rehabilitation In the clinical setting, a reliable and repeatable noninvasive method for assessment of three-dimensional (3D) clavicular/scapular/humeral joint orientation and range of motion (ROM) is currently required. 
The established reliability and repeatability of this proposed palpation/digitization protocol will enable comparative 3D ROM studies between asymptomatic and post-stroke shoulders, which will further inform treatment planning. Intra-rater test-retest repeatability, which is measured by the standard error of the measure, indicates the range of error associated with a single test measure. Therefore, clinicians can use the standard error of the measure to determine the "true" differences between pre-treatment and post-treatment test scores.
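The closing point above, that the standard error of the measurement helps separate "true" change from measurement noise, is often operationalised as a minimal detectable change. The sketch below assumes the common MDC95 convention, 1.96 × √2 × SEM; that convention is not stated in the abstract, and the function names and example values are hypothetical.

```python
from math import sqrt

def mdc95(sem):
    """Minimal detectable change at 95% confidence for a given SEM."""
    return 1.96 * sqrt(2) * sem

def exceeds_measurement_error(pre, post, sem):
    """True if the pre-to-post change is larger than the MDC95."""
    return abs(post - pre) > mdc95(sem)

# Hypothetical rotation angles (degrees) with an SEM of 5 degrees
changed = exceeds_measurement_error(pre=30.0, post=48.0, sem=5.0)
```

Under this convention, an SEM of 5° implies that a change of roughly 14° or more is needed before a difference between two sessions is unlikely to be measurement error alone.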

Aging aircraft NDI Development and Demonstration Center (AANC): An overview. [nondestructive inspection]

NASA Technical Reports Server (NTRS)

Walter, Patrick L.

1992-01-01

A major center with emphasis on validation of nondestructive inspection (NDI) techniques for aging aircraft, the Aging Aircraft NDI Development and Demonstration Center (AANC), has been funded by the FAA at Sandia National Laboratories. The Center has been assigned specific tasks in developing techniques for the nondestructive inspection of static engine parts, assessing inspection reliability (POD experiments), developing testbeds for NDI validation, maintaining a FAA library of characterized aircraft structural test specimens, and leasing a hangar to house a high flight cycle transport aircraft for use as a full scale test bed.
Development and validation of a questionnaire to evaluate patient satisfaction with diabetes disease management.

PubMed

Paddock, L E; Veloski, J; Chatterton, M L; Gevirtz, F O; Nash, D B

2000-07-01

To develop a reliable and valid questionnaire to measure patient satisfaction with diabetes disease management programs. Questions related to structure, process, and outcomes were categorized into 14 domains defining the essential elements of diabetes disease management. Health professionals confirmed the content validity. Face validity was established by a patient focus group. The questionnaire was mailed to 711 patients with diabetes who participated in a disease management program. To reduce the number of questionnaire items, a principal components analysis was performed using a varimax rotation. The Scree test was used to select significant components. To further assess reliability and validity, Cronbach's alpha and product-moment correlations were calculated for components having ≥3 items with loadings >0.50. The validated 73-item mailed satisfaction survey had a 34.1% response rate. Principal components analysis yielded 13 components with eigenvalues > 1.0. The Scree test proposed a 6-component solution (39 items), which explained 59% of the total variation. Internal consistency reliabilities computed for the first 6 components (alpha = 0.79-0.95) were acceptable. The final questionnaire, the Diabetes Management Evaluation Tool (DMET), was designed to assess patient satisfaction with diabetes disease management programs. Although more extensive testing of the questionnaire is appropriate, preliminary reliability and validity of the DMET have been demonstrated.
Validity and reliability of the Malay version multidimensional scale of perceived social support (MSPSS-M) among teachers.

PubMed

Lee, Soo Cheng; Moy, Foong Ming; Hairi, Noran Naqiah

2017-01-01

The multidimensional scale of perceived social support (MSPSS) was developed to measure perceived social support. It has been translated and culturally adapted among natives literate in the Malay language. However, its psychometric properties among teachers, the majority of whom are female and married, have not been assessed. This was a cross-sectional study conducted among the public secondary school teachers in the central region of Peninsular Malaysia from May to July 2013. A total of 150 and 203 teachers were recruited to perform exploratory factor analysis and confirmatory factor analysis (CFA), respectively. Reliability testing was evaluated on 141 teachers via internal consistency and two-week interval test-retest. The 12-item three-factor structure of MSPSS-M was revised to 8-item two-factor structure. The revised MSPSS-M demonstrated excellent fit in CFA with adequate divergent and convergent validity and good factor loadings (0.80-0.90). The revised MSPSS-M also displayed good internal consistency with Cronbach's alpha of 0.91, 0.93 and 0.92 and good test-retest reliability with intraclass correlation of 0.89, 0.88 and 0.88 in the total scale, family and friends factors, respectively. The revised 8-item MSPSS-M is a reliable and valid tool for assessment of perceived social support among teachers.
Gearbox Reliability Collaborative Phase 3 Gearbox 3 Test Report

DOE Office of Scientific and Technical Information (OSTI.GOV)

Keller, Jonathan; Wallen, Robb

Many gearboxes in wind turbines do not achieve their expected design life; they do, however, commonly meet or exceed the design criteria specified in current standards in the gear, bearing, and wind turbine industry as well as third-party certification criteria. The cost of gearbox replacements and rebuilds, as well as the downtime associated with these failures, increases the cost of wind energy. In 2007, the U.S. Department of Energy established the National Renewable Energy Laboratory (NREL) Gearbox Reliability Collaborative (GRC). Its goals are to understand the root causes of premature gearbox failures and to improve their reliability. The GRC is examining a hypothesis that the gap between design-estimated and actual wind turbine gearbox reliability is caused by underestimation of loads, inaccurate design tools, the absence of critical elements in the design process, or insufficient testing. This report describes the recently completed tests of GRC Gearbox 3 in the National Wind Technology Center dynamometer and documents any modifications to the original test plan. In this manner, it serves as a guide for interpreting the publicly released data sets with brief analyses to illustrate the data. The primary test objective was to measure the planetary load-sharing characteristics in the same conditions as the original GRC gearbox design. If the measured load-sharing characteristics are close to the design model, the projected improvement in planetary section fatigue life and the efficacy of preloaded TRBs in mitigating the planetary bearing fatigue failure mode will have been demonstrated. Detailed analysis of that test objective will be presented in subsequent publications.
National audit of continence care: laying the foundation.

PubMed

Mian, Sarah; Wagg, Adrian; Irwin, Penny; Lowe, Derek; Potter, Jonathan; Pearson, Michael

2005-12-01

National audit provides a basis for establishing performance against national standards, benchmarking against other service providers and improving standards of care. For effective audit, clinical indicators are required that are valid, feasible to apply and reliable. This study describes the methods used to develop clinical indicators of continence care in preparation for a national audit. To describe the methods used to develop and test clinical indicators of continence care with regard to validity, feasibility and reliability. A multidisciplinary working group developed clinical indicators that measured the structure, process and outcome of care as well as case-mix variables. Literature searching, consensus workshops and a Delphi process were used to develop the indicators. The indicators were tested in 15 secondary care sites, 15 primary care sites and 15 long-term care settings. The process of development produced indicators that received a high degree of consensus within the Delphi process. Testing of the indicators demonstrated an internal reliability of 0.7 and an external reliability of 0.6. Data collection required significant investment in terms of staff time and training. The method used produced indicators that achieved a high degree of acceptance from health care professionals. The reliability of data collection was high for this audit and was similar to the level seen in other successful national audits. Data collection for the indicators was feasible; however, issues of time and staffing were identified as limitations to such data collection. The study has described a systematic method for developing clinical indicators for national audit. The indicators proved robust and reliable in primary and secondary care as well as long-term care settings.
Proposal and validation of a clinical trunk control test in individuals with spinal cord injury.

PubMed

Quinzaños, J; Villa, A R; Flores, A A; Pérez, R

2014-06-01

One of the problems that arise in spinal cord injury (SCI) is alteration in trunk control. Despite the need for standardized scales, these do not exist for evaluating trunk control in SCI. To propose and validate a trunk control test in individuals with SCI. National Institute of Rehabilitation, Mexico. The test was developed and later evaluated for reliability and criterion, content, and construct validity. We carried out 531 tests on 177 patients and found high inter- and intra-rater reliability. In terms of criterion validity, analysis of variance demonstrated a statistically significant difference in the test score of patients with adequate or inadequate trunk control according to the assessment of a group of experts. A receiver operating characteristic curve was plotted for optimizing the instrument's cutoff point, which was determined at 13 points, with a sensitivity of 98% and a specificity of 92.2%. With regard to construct validity, the correlation between the proposed test and the spinal cord independence measure (SCIM) was 0.873 (P=0.001) and that with the evolution time was 0.437 (P=0.001). For testing the hypothesis with qualitative variables, the Kruskal-Wallis test was performed, which resulted in a statistically significant difference between the scores in the proposed scale of each group defined by these variables. It was proven experimentally that the proposed trunk control test is valid and reliable. Furthermore, the test can be used for all patients with SCI regardless of the type and level of injury.
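The sensitivity and specificity quoted for the 13-point cutoff come from comparing test scores with the experts' classification. A minimal sketch of that calculation follows, assuming that a score at or above the cutoff counts as a positive result and that the expert judgment is the reference standard; the abstract does not state either convention, and the function name and sample data are hypothetical.

```python
def sensitivity_specificity(scores, reference_positive, cutoff):
    """Sensitivity and specificity of a score dichotomized at a cutoff.

    scores: numeric test scores, one per patient.
    reference_positive: booleans from the reference standard (assumed here
        to be the expert classification), True meaning positive.
    cutoff: a score >= cutoff is treated as a positive test result
        (an assumption made for this sketch).
    """
    if len(scores) != len(reference_positive):
        raise ValueError("scores and reference labels must have equal length")
    tp = fn = tn = fp = 0
    for score, is_positive in zip(scores, reference_positive):
        test_positive = score >= cutoff
        if is_positive and test_positive:
            tp += 1
        elif is_positive:
            fn += 1
        elif test_positive:
            fp += 1
        else:
            tn += 1
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("need at least one positive and one negative case")
    return tp / (tp + fn), tn / (tn + fp)


# Hypothetical scores and expert labels, evaluated at a cutoff of 13 points.
example_scores = [15, 14, 12, 10, 13, 9]
example_labels = [True, True, True, False, False, False]
sens, spec = sensitivity_specificity(example_scores, example_labels, 13)
```

Repeating the calculation over every candidate cutoff gives the points of a receiver operating characteristic curve, from which a cutoff can then be chosen.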
Preliminary Evaluation Of Commercial Supercapacitors For Space Applications

NASA Astrophysics Data System (ADS)

Gineste, Valery; Loup, Didier; Mattesco, Patrick; Neugnot, Nicolas

2011-10-01

Supercapacitors have been identified for years as a new technology enabling energy storage together with high power delivery capability to the system. A recent ESA study [1] led by Astrium has demonstrated the interest of these devices for space application, provided that reliability and end of life performances are demonstrated. A realistic commercial off-the-shelf (COTS) approach (or with limited design modification approved by potential suppliers) has been favoured (as for batteries). This paper presents preliminary test results obtained by Astrium on COTS supercapacitors: accelerated life tests, calendar life tests, technology analyses. Based on these results, assessment and lessons learnt are drawn in view of future exhaustive supercapacitor validation and future qualification.
A psychometric comparison of three scales and a single-item measure to assess sexual satisfaction.

PubMed

Mark, Kristen P; Herbenick, Debby; Fortenberry, J Dennis; Sanders, Stephanie; Reece, Michael

2014-01-01

This study was designed to systematically compare and contrast the psychometric properties of three scales developed to measure sexual satisfaction and a single-item measure of sexual satisfaction. The Index of Sexual Satisfaction (ISS), Global Measure of Sexual Satisfaction (GMSEX), and the New Sexual Satisfaction Scale-Short (NSSS-S) were compared to one another and to a single-item measure of sexual satisfaction. Conceptualization of the constructs, distribution of scores, internal consistency, convergent validity, test-retest reliability, and factor structure were compared between the measures. A total of 211 men and 214 women completed the scales and a measure of relationship satisfaction, with 33% (n = 139) of the sample reassessed two months later. All scales demonstrated appropriate distribution of scores and adequate internal consistency. The GMSEX, NSSS-S, and the single-item measure demonstrated convergent validity. Test-retest reliability was demonstrated by the ISS, GMSEX, and NSSS-S, but not the single-item measure. Taken together, the GMSEX received the strongest psychometric support in this sample for a unidimensional measure of sexual satisfaction and the NSSS-S received the strongest psychometric support in this sample for a bidimensional measure of sexual satisfaction.
Validity and reliability of a video questionnaire to assess physical function in older adults.

PubMed

Balachandran, Anoop; N Verduin, Chelsea; Potiaumpai, Melanie; Ni, Meng; Signorile, Joseph F

2016-08-01

Self-report questionnaires are widely used to assess physical function in older adults. However, they often lack a clear frame of reference and hence interpreting and rating task difficulty levels can be problematic for the responder. Consequently, the usefulness of traditional self-report questionnaires for assessing higher-level functioning is limited. Video-based questionnaires can overcome some of these limitations by offering a clear and objective visual reference for the performance level against which the subject is to compare his or her perceived capacity. Hence the purpose of the study was to develop and validate a novel, video-based questionnaire to assess physical function in older adults independently living in the community. A total of 61 community-living adults, 60 years or older, were recruited. To examine validity, 35 of the subjects completed the video questionnaire and two types of physical performance tests: a test of instrumental activity of daily living (IADL) included in the Short Physical Functional Performance battery (PFP-10), and a composite of 3 performance tests (30s chair stand, single-leg balance and usual gait speed). To ascertain reliability, two-week test-retest reliability was assessed in the remaining 26 subjects who did not participate in validity testing. The video questionnaire showed a moderate correlation with the IADLs (Spearman rho=0.64, p<0.001; 95% CI (0.4, 0.8)), and a lower correlation with the composite score of physical performance tests (Spearman rho=0.49, p<0.01; 95% CI (0.18, 0.7)). The test-retest assessment yielded an intra-class correlation (ICC) of 0.87 (p<0.001; 95% CI (0.70, 0.94)) and a Cronbach's alpha of 0.89 demonstrating good reliability and internal consistency. Our results show that the video questionnaire developed to evaluate physical function in community-living older adults is a valid and reliable assessment tool; however, further validation is needed for definitive conclusions.
Copyright © 2016 Elsevier Inc. All rights reserved.
Retrogression and Re-Ageing In-Service Demonstrator Reliability Trials: Stage 3 Component Test Report

DTIC Science & Technology

2012-03-01

Table of contents excerpt: 4.5 Component, Furnace and Quench Bath Thermometry; 4.6 Component Heat Treatment; 4.6.2 Post-Retrogression Quench; 4.6.3 ...; 5.5.2 Temperature Profile – Post-Retrogression Quenching; 5.5.3 Temperature ...
CTEPP STANDARD OPERATING PROCEDURE FOR TRANSLATING VIDEOTAPES OF CHILD ACTIVITIES (SOP-4.13)

EPA Science Inventory

The EPA will conduct a two-day video translation workshop to demonstrate to coders the procedures for translating the activity patterns of preschool children on videotape. The coders will be required to pass reliability tests to successfully complete the training requirements of ...
Evaluation of lower leg function in patients with Achilles tendinopathy.

PubMed

Silbernagel, Karin Grävare; Gustavsson, Alexander; Thomeé, Roland; Karlsson, Jon

2006-11-01

Achilles tendinopathy is considered to be one of the most common overuse injuries in elite and recreational athletes. However, the effect that the Achilles tendinopathy has on patients' physical performance is still unclear. The purpose of this study was to evaluate if Achilles tendinopathy caused functional deficits on the injured side compared with the non-injured side in patients. A test battery comprised of tests for different aspects of muscle-tendon function of the gastrocnemius, soleus and Achilles tendon complex was developed to evaluate lower leg function. The test battery's test-retest reliability and sensitivity (the percent probability that the tests would demonstrate abnormal lower limb symmetry index in patients) were also evaluated. The test battery consisted of three jump tests, a counter movement jump (CMJ), a drop counter movement jump (drop CMJ) and hopping, and two strength tests, concentric toe-raises, eccentric-concentric toe-raises and toe-raises for endurance. The reliability was evaluated through a test-retest design on 15 healthy subjects. The test battery's sensitivity and possible functional deficits in patients with Achilles tendinopathy were evaluated on 42 patients (19 women and 23 men). An excellent reliability was found between test days 1-2 and 2-3 for all tests (ICC = 0.76-0.94) except for concentric toe-raise, test 2-3, which had fair reliability (ICC = 0.73). The methodological error ranged from 8 to 17%. There were significant differences (P = 0.001-0.049) between the non-injured (or least symptomatic) side and injured (most symptomatic) side for hopping, drop CMJ, concentric and eccentric-concentric toe-raises, and significant differences (P = 0.000-0.012) in the level of pain during CMJ, hopping, and drop CMJ. The sensitivity of the test battery at a 90% capacity was 88.
Achilles tendinopathy causes not only pain and symptoms in patients but also apparent impairments in various aspects of lower leg muscle-tendon function as measured with the test battery. This test battery is reliable and able to detect differences in lower leg function between the injured or "most symptomatic" and non-injured or "least symptomatic" side in patients with Achilles tendinopathy. The test battery has higher demand on patients' function compared with each individual test.
Validation of alternative methods for toxicity testing.

PubMed Central

Bruner, L H; Carr, G J; Curren, R D; Chamberlain, M

1998-01-01

Before nonanimal toxicity tests may be officially accepted by regulatory agencies, it is generally agreed that the validity of the new methods must be demonstrated in an independent, scientifically sound validation program. Validation has been defined as the demonstration of the reliability and relevance of a test method for a particular purpose. This paper provides a brief review of the development of the theoretical aspects of the validation process and updates current thinking about objectively testing the performance of an alternative method in a validation study. Validation of alternative methods for eye irritation testing is a specific example illustrating important concepts. Although discussion focuses on the validation of alternative methods intended to replace current in vivo toxicity tests, the procedures can be used to assess the performance of alternative methods intended for other uses. PMID:9599695
The Jebsen Taylor Test of Hand Function: A Pilot Test-Retest Reliability Study in Typically Developing Children.

PubMed

Reedman, Sarah Elizabeth; Beagley, Simon; Sakzewski, Leanne; Boyd, Roslyn N

2016-08-01

The aim of this pilot study was to evaluate reproducibility of the Jebsen Taylor Test of Hand Function (JTTHF) in children. Eighty-seven typically developing children 5 to 10 years old were included from five Outside School Hours Care centers in the Greater Brisbane Region, Australia. Hand function was assessed on two occasions with a modified JTTHF, then reproducibility was assessed using Intraclass Correlation Coefficient (ICC [3,1]) and the Standard Error of Measurement (SEM). Total scores for male and female children were not significantly different. Five-year-old children were significantly different to all other age groups and were excluded from further analysis. Results for 71 children, 6 to 10 years old were analyzed (mean age 8.31 years (SD 1.32); 33 males). Test-retest reliability for total scores on the dominant and nondominant hands were ICC 0.74 (95% CI 0.61, 0.83) and ICC 0.72 (95% CI 0.59, 0.82), respectively. 'Writing' and 'Simulated Feeding' subtests demonstrated poor reproducibility. The Smallest Real Difference was 5.09 seconds for total score on the dominant hand. Findings indicate good test-retest reliability for the JTTHF total score to measure hand function in typically developing children aged 6 to 10 years.
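As a rough illustration of how the Standard Error of Measurement and the Smallest Real Difference usually relate, the block below assumes the conventional definitions at a 95% confidence level. The abstract does not state which formulas the authors used, so the back-calculated SEM for the dominant hand is only indicative.

```latex
\begin{align*}
\mathrm{SEM} &= \mathrm{SD}\,\sqrt{1-\mathrm{ICC}} \\
\mathrm{SRD} &= 1.96 \times \sqrt{2} \times \mathrm{SEM} \\
\mathrm{SEM} &\approx \frac{5.09\ \text{s}}{1.96 \times \sqrt{2}} \approx 1.84\ \text{s}
\end{align*}
```

Under these assumptions, a change in a child's total score smaller than about 5 seconds could not be distinguished from measurement error.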
Preliminary test results from a free-piston Stirling engine technology demonstration program to support advanced radioisotope space power applications

NASA Astrophysics Data System (ADS)

White, Maurice A.; Qiu, Songgang; Augenblick, Jack E.

2000-01-01

Free-piston Stirling engines offer a relatively mature, proven, long-life technology that is well-suited for advanced, high-efficiency radioisotope space power systems. Contracts from DOE and NASA are being conducted by Stirling Technology Company (STC) for the purpose of demonstrating the Stirling technology in a configuration and power level that is representative of an eventual space power system. The long-term objective is to develop a power system with an efficiency exceeding 20% that can function with a high degree of reliability for up to 15 years on deep space missions. The current technology demonstration convertors (TDC's) are completing shakedown testing and have recently demonstrated performance levels that are virtually identical to projections made during the preliminary design phase. This paper describes preliminary test results for power output, efficiency, and vibration levels. These early results demonstrate the ability of the free-piston Stirling technology to exceed objectives by approximately quadrupling the efficiency of conventional radioisotope thermoelectric generators (RTG's).
Small sample estimation of the reliability function for technical products

NASA Astrophysics Data System (ADS)

Lyamets, L. L.; Yakimenko, I. V.; Kanishchev, O. A.; Bliznyuk, O. A.

2017-12-01

It is demonstrated that, in the absence of large statistical samples obtained as a result of testing complex technical products for failure, statistical estimation of the reliability function of initial elements can be made by the moments method. A formal description of the moments method is given and its advantages in the analysis of small censored samples are discussed. A modified algorithm is proposed for the implementation of the moments method with the use of only the moments at which the failures of initial elements occur.
Reliability of risk assessment measures used in sexually violent predator proceedings.

PubMed

Miller, Cailey S; Kimonis, Eva R; Otto, Randy K; Kline, Suzonne M; Wasserman, Adam L

2012-12-01

The field interrater reliability of three assessment tools frequently used by mental health professionals when evaluating sex offenders' risk for reoffending--the Psychopathy Checklist-Revised (PCL-R), the Minnesota Sex Offender Screening Tool-Revised (MnSOST-R) and the Static-99--was examined within the context of sexually violent predator program proceedings. Rater agreement was highest for the Static-99 (intraclass correlation coefficient [ICC₁] = .78) and lowest for the PCL-R (ICC₁ = .60; MnSOST-R ICC₁ = .74), although all instruments demonstrated lower field reliability than that reported in their test manuals. Findings raise concerns about the reliability of risk assessment tools that are used to inform judgments of risk in high-stake sexually violent predator proceedings. Implications for future research and suggestions for improving evaluator training to increase accuracy when informing legal decision making are discussed.
Development and psychometric testing of the Dogs and WalkinG Survey (DAWGS).

PubMed

Richards, Elizabeth A; McDonough, Meghan H; Edwards, Nancy E; Lyle, Roseann M; Troped, Philip J

2013-12-01

Dog owners represent 40% of the population, a promising audience to increase population levels of physical activity. The purpose of this study was to develop and test the psychometric properties of a new instrument to assess social-cognitive theory constructs related to dog walking. Dog owners (N = 431) completed the Dogs and WalkinG Survey (DAWGS). Survey items assessed dog-walking behaviors and self-efficacy, social support, outcome expectations, and outcome expectancies for dog walking. Test-retest reliability was assessed among 252 (58%) survey respondents who completed the survey twice. Factorial validity and factorial invariance by age and walking level were tested using confirmatory factor analysis. DAWGS items demonstrated moderate test-retest reliability (ρ = .39-.79; κ = .41-.89). Acceptable model fit was found for all subscales. All subscales were invariant by age and walking level, except self-efficacy, which showed mixed evidence of invariance. The DAWGS is a psychometrically sound instrument for examining individual and interpersonal correlates of dog walking.
Perspectives on Validation of High-Throughput Assays Supporting 21st Century Toxicity Testing

PubMed Central

Judson, Richard; Kavlock, Robert; Martin, Matt; Reif, David; Houck, Keith; Knudsen, Thomas; Richard, Ann; Tice, Raymond R.; Whelan, Maurice; Xia, Menghang; Huang, Ruili; Austin, Christopher; Daston, George; Hartung, Thomas; Fowle, John R.; Wooge, William; Tong, Weida; Dix, David

2014-01-01

Summary: In vitro, high-throughput screening (HTS) assays are seeing increasing use in toxicity testing. HTS assays can simultaneously test many chemicals, but have seen limited use in the regulatory arena, in part because of the need to undergo rigorous, time-consuming formal validation. Here we discuss streamlining the validation process, specifically for prioritization applications in which HTS assays are used to identify a high-concern subset of a collection of chemicals. The high-concern chemicals could then be tested sooner rather than later in standard guideline bioassays. The streamlined validation process would continue to ensure the reliability and relevance of assays for this application. We discuss the following practical guidelines: (1) follow current validation practice to the extent possible and practical; (2) make increased use of reference compounds to better demonstrate assay reliability and relevance; (3) deemphasize the need for cross-laboratory testing; and (4) implement a web-based, transparent and expedited peer review process. PMID:23338806
Test-retest reliability of fMRI-based graph theoretical properties during working memory, emotion processing, and resting state.

PubMed

Cao, Hengyi; Plichta, Michael M; Schäfer, Axel; Haddad, Leila; Grimm, Oliver; Schneider, Michael; Esslinger, Christine; Kirsch, Peter; Meyer-Lindenberg, Andreas; Tost, Heike

2014-01-01

The investigation of the brain connectome with functional magnetic resonance imaging (fMRI) and graph theory analyses has recently gained much popularity, but little is known about the robustness of these properties, in particular those derived from active fMRI tasks. Here, we studied the test-retest reliability of brain graphs calculated from 26 healthy participants with three established fMRI experiments (n-back working memory, emotional face-matching, resting state) and two parcellation schemes for node definition (AAL atlas, functional atlas proposed by Power et al.). We compared the intra-class correlation coefficients (ICCs) of five different data processing strategies and demonstrated a superior reliability of task-regression methods with condition-specific regressors. The between-task comparison revealed significantly higher ICCs for resting state relative to the active tasks, and a superiority of the n-back task relative to the face-matching task for global and local network properties. While the mean ICCs were typically lower for the active tasks, overall fair to good reliabilities were detected for global and local connectivity properties, and for the n-back task with both atlases, smallworldness. For all three tasks and atlases, low mean ICCs were seen for the local network properties. However, node-specific good reliabilities were detected for node degree in regions known to be critical for the challenged functions (resting-state: default-mode network nodes, n-back: fronto-parietal nodes, face-matching: limbic nodes). Between-atlas comparison demonstrated significantly higher reliabilities for the functional parcellations for global and local network properties. Our findings can inform the choice of processing strategies, brain atlases and outcome properties for fMRI studies using active tasks, graph theory methods, and within-subject designs, in particular future pharmaco-fMRI studies. © 2013 Elsevier Inc. All rights reserved.

Validation of the Persian version of the dysphagia handicap index in patients with neurological disorders

PubMed Central

Barzegar-Bafrooei, Ebrahim; Bakhtiary, Jalal; Khatoonabadi, Ahmad Reza; Fatehi, Farzad; Maroufizadeh, Saman; Fathali, Mojtaba

2016-01-01

Background: Dysphagia is a common condition affecting many aspects of the patient’s life. The Dysphagia Handicap Index (DHI) is a reliable self-reported questionnaire developed specifically to measure the impact of dysphagia on the patient’s quality of life. The aim of this study was to translate the questionnaire to Persian and to measure its validity and reliability in patients with neurogenic oropharyngeal dysphagia. Methods: A formal forward-backward translation of DHI was performed based on the guidelines for the cross-cultural adaptation of self-report measures. A total of 57 patients with neurogenic dysphagia who were referred to the neurology clinics of Tehran University of Medical Sciences, Iran, participated in this study. Internal consistency reliability of the DHI was examined using Cronbach’s alpha, and test-retest reliability of the scale was evaluated using intraclass correlation coefficient (ICC). Results: The internal consistency of the Persian DHI (P-DHI) was considered to be good; Cronbach’s alpha coefficient for the total P-DHI was 0.88. The test-retest reliability for the total and three subscales of the P-DHI ranged from 0.95 to 0.98 using ICC. Conclusion: The P-DHI demonstrated good reliability, and it can be a valid instrument for evaluating the effects of dysphagia on quality of life among the Persian-speaking population. PMID:27648173
Reliability and Validity of the Work and Well-Being Inventory (WBI) for Employees.

PubMed

Vendrig, A A; Schaafsma, F G

2018-06-01

Purpose: The purpose of this study is to measure the psychometric properties of the Work and Wellbeing Inventory (WBI) (in Dutch: VAR-2), a screening tool that is used within occupational health care and rehabilitation. Our research question focused on the reliability and validity of this inventory. Methods: Over the years, seven different samples of workers, patients and sick-listed workers, varying in size between 89 and 912 participants (total: 2514), were used to measure the test-retest reliability, the internal consistency, the construct and concurrent validity, and the criterion and predictive validity. Results: The 13 scales displayed good internal consistency and test-retest reliability. The construct validity of the WBI could clearly be demonstrated in both patients and healthy workers. Confirmatory factor analyses revealed a CFI >.90 for all scales. The depression scale predicted future work absenteeism (>6 weeks) because of a common mental disorder in healthy workers. The job strain scale and the illness behavior scale predicted long term absenteeism (>3 months) in workers with short-term absenteeism. The illness behavior scale moderately predicted return to work in rehab patients attending an intensive multidisciplinary program. Conclusions: The WBI is a valid and reliable tool for occupational health practitioners to screen for risk factors for prolonged or future sickness absence. With this tool they will have reliable indications for further advice and interventions to restore the work ability.
Reliability of tristimulus colourimetry in the assessment of cutaneous bruise colour.

PubMed

Scafide, Katherine N; Sheridan, Daniel J; Taylor, Laura A; Hayat, Matthew J

2016-06-01

Bruising is one of the most common types of injury clinicians observe among victims of violence and other trauma patients. However, research has shown commonly used qualitative description of cutaneous bruise colour via the naked eye is subjective and unreliable. No published work has formally evaluated the reliability of tristimulus colourimetry as an alternative for assessing bruise colour, despite its clinical and research applications in accurately assessing skin colour. The purpose of this study was to systematically evaluate the test-retest and inter-observer reliability of tristimulus colourimetry in the assessment of cutaneous bruise colour. Two researchers obtained repeated tristimulus colourimetry measures of cutaneous bruises with participants of diverse skin colour. Measures were obtained using the Minolta CR-400 Chroma Meter. Commission Internationale d'Eclairage (CIE) L*a*b* colour space was used. Data was analysed using intraclass correlation coefficients (ICC), Cronbach's alpha, and minimal detectable change (MDC) on all three L*a*b* values. The colorimeter demonstrated excellent test-retest or intra-rater reliability (L* ICC=0.999; a* ICC=0.973; b* ICC=0.892) and inter-rater reliability (L* ICC=0.997; a* ICC=0.976; b* ICC=0.982). With consistent placement, tristimulus colourimetry is reliable for the objective assessment and documentation of cutaneous bruise colour for purposes of clinical practice and research. Recommendations for use in practice/research are provided. Copyright © 2016 Elsevier Ltd. All rights reserved.
Masticatory muscle activity assessment and reliability of a portable electromyographic instrument.

PubMed

Bowley, J F; Marx, D B

2001-03-01

Masticatory muscle hyperactivity is thought to produce muscle pain and tension headaches and can cause excessive wear or breakage of restorative dental materials used in the treatment of prosthodontic patients. The quantification and identification of this type of activity is an important consideration in the preoperative diagnosis and treatment planning phase of prosthodontic care. This study investigated the quantification process in complete denture/overdenture patients with natural mandibular tooth abutments and explored the reliability of instrumentation used to assess this parafunctional activity. The nocturnal EMG activity in asymptomatic complete denture/overdenture subjects was assessed with and without prostheses worn during sleep. Because of the large variance within and between subjects, the investigators evaluated the reliability of the 3 instruments used to test nocturnal EMG activity in the sample. Electromyographic activity data of denture/overdenture subjects revealed no differences between prostheses worn versus not worn during sleep but demonstrated a very large variance factor. Further investigation of the instrumentation demonstrated a consistent in vitro as well as in vivo reliability in controlled laboratory studies. The portable EMG instrumentation used in this study revealed a large, uncontrollable variance factor within and between subjects that greatly complicated the diagnosis of parafunctional activity in prosthodontic patients.
Reliability and agreement in the use of four- and six-point ordinal scales for the assessment of erythema in digital images of canine skin.

PubMed

Hill, Peter B

2015-06-01

Grading of erythema in clinical practice is a subjective assessment that cannot be confirmed using a definitive test; nevertheless, erythema scores are typically measured in clinical trials assessing the response to treatment interventions. Most commonly, ordinal scales are used for this purpose, but the optimal number of categories in such scales has not been determined. This study aimed to compare the reliability and agreement of a four-point and a six-point ordinal scale for the assessment of erythema in digital images of canine skin. Fifteen digital images showing varying degrees of erythema were assessed by specialist dermatologists and laypeople, using either the four-point or the six-point scale. Reliability between the raters was assessed using intraclass correlation coefficients and Cronbach's α. Agreement was assessed using the variation ratio (the percentage of respondents who chose the mode, the most common answer). Intraobserver variability was assessed by comparing the results of two grading sessions, at least 6 weeks apart. Both scales demonstrated high reliability, with intraclass correlation coefficient values and Cronbach's α above 0.99. However, the four-point scale demonstrated significantly superior agreement, with variation ratios for the four-point scale averaging 74.8%, compared with 56.2% for the six-point scale. Intraobserver consistency for the four-point scale was very high. Although both scales demonstrated high reliability, the four-point scale was superior in terms of agreement. For the assessment of erythema in clinical trials, a four-point ordinal scale is recommended. © 2014 ESVD and ACVD.
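The agreement measure described above (the percentage of respondents who chose the mode) can be sketched in a few lines. This is a minimal illustration that follows the abstract's own definition; the function name `modal_agreement` and the ratings are hypothetical and are not taken from the study.

```python
from collections import Counter


def modal_agreement(scores):
    """Percentage of raters who chose the most common (modal) score."""
    if not scores:
        raise ValueError("scores must not be empty")
    # most_common(1) returns [(score, count)] for the modal category.
    _, mode_count = Counter(scores).most_common(1)[0]
    return 100.0 * mode_count / len(scores)


# Hypothetical gradings of one image by ten raters on a four-point scale (0-3).
ratings = [2, 2, 2, 1, 2, 3, 2, 2, 1, 2]
print(modal_agreement(ratings))  # 7 of 10 raters chose the mode -> 70.0
```

Averaging this value over all images would give one summary figure per scale, which is how a comparison such as 74.8% versus 56.2% could be formed.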
A comparison of the shuttle and 6 minute walking tests with measured peak oxygen consumption in patients with heart failure.

PubMed

Green, D J; Watts, K; Rankin, S; Wong, P; O'Driscoll, J G

2001-09-01

This study investigated the use of an incremental, externally-paced 10 m shuttle walk test (SWT) as an objective, reliable and predictive test of functional capacity in patients with heart failure (CHF). The SWT was compared to a 6 minute walk test (6WT) and a maximal symptom-limited treadmill peak oxygen consumption (VO2peak) test. Experiment 1 examined the reproducibility of the SWT. Two SWT trials were performed and distance ambulated (DA), heart rate (HR) and rate of perceived exertion (RPE) results compared. In experiment 2, SWT, 6WT, and VO2 peak tests were performed and HR, RPE and ambulatory VO2 compared. The SWT demonstrated strong test/retest reliability for DA (r = 0.98), HR (r = 0.96) and RPE (r = 0.89). Treadmill VO2 peak was significantly correlated with DA during the SWT (r = 0.83, P < 0.05), but not the 6WT. SWT peak VO2 (18.5 +/- 1.8 ml.kg(-1) x min(-1)) and treadmill VO2 peak (18.3 +/- 2.0 ml.kg(-1) x min(-1)) were also highly correlated (r = 0.78, P < 0.05). Conversely, 6WT peak VO2 and treadmill VO2 peak were not significantly correlated. This study suggests the SWT is a reliable, objective test, highly predictive of VO2 peak, which may be a better field exercise test than the self-paced 6WT.
Test-re-test reliability and inter-rater reliability of a digital pelvic inclinometer in young, healthy males and females.

PubMed

Beardsley, Chris; Egerton, Tim; Skinner, Brendon

2016-01-01

Objective. The purpose of this study was to investigate the reliability of a digital pelvic inclinometer (DPI) for measuring sagittal plane pelvic tilt in 18 young, healthy males and females. Method. The inter-rater reliability and test-re-test reliabilities of the DPI for measuring pelvic tilt in standing on both the right and left sides of the pelvis were measured by two raters carrying out two rating sessions of the same subjects, three weeks apart. Results. For measuring pelvic tilt, inter-rater reliability was designated as good on both sides (ICC = 0.81-0.88), test-re-test reliability within a single rating session was designated as good on both sides (ICC = 0.88-0.95), and test-re-test reliability between two rating sessions was designated as moderate on the left side (ICC = 0.65) and good on the right side (ICC = 0.85). Conclusion. Inter-rater reliability and test-re-test reliability within a single rating session of the DPI in measuring pelvic tilt were both good, while test-re-test reliability between rating sessions was moderate-to-good. Caution is required regarding the interpretation of the test-re-test reliability within a single rating session, as the raters were not blinded. Further research is required to establish validity.
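The abstract reports ICC values but does not say which ICC model was used. As a hedged illustration only, the sketch below computes the two-way random-effects, absolute-agreement, single-measure form, often written ICC(2,1), from a subjects-by-raters table. The choice of model, the function name `icc_2_1`, and the small ratings table are assumptions made for this example.

```python
def icc_2_1(table):
    """ICC(2,1): two-way random effects, absolute agreement, single measure.

    `table` is a list of rows, one row per subject, one column per rater.
    """
    n = len(table)        # number of subjects
    k = len(table[0])     # number of raters
    grand = sum(sum(row) for row in table) / (n * k)
    row_means = [sum(row) / k for row in table]
    col_means = [sum(row[j] for row in table) / n for j in range(k)]

    # Sums of squares from the two-way ANOVA without replication.
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for row in table for x in row)
    ss_error = ss_total - ss_rows - ss_cols

    ms_rows = ss_rows / (n - 1)
    ms_cols = ss_cols / (k - 1)
    ms_error = ss_error / ((n - 1) * (k - 1))

    return (ms_rows - ms_error) / (
        ms_rows + (k - 1) * ms_error + k * (ms_cols - ms_error) / n
    )


# Illustrative table: 6 subjects (rows) scored by 4 raters (columns).
ratings = [
    [9, 2, 5, 8],
    [6, 1, 3, 2],
    [8, 4, 6, 8],
    [7, 1, 2, 6],
    [10, 5, 6, 9],
    [6, 2, 4, 7],
]
print(round(icc_2_1(ratings), 2))  # -> 0.29
```

The low value in this example comes from large systematic differences between raters, which the absolute-agreement form penalises; a consistency-type ICC on the same table would be higher.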
Levodopa responsiveness of dysphagia in advanced Parkinson's disease and reliability testing of the FEES-Levodopa-test.

PubMed

Warnecke, Tobias; Suttrup, Inga; Schröder, Jens B; Osada, Nani; Oelenberg, Stephan; Hamacher, Christina; Suntrup, Sonja; Dziewas, Rainer

2016-07-01

It remains controversial whether central dopaminergic stimulation improves swallowing ability in Parkinson's disease (PD). We evaluated the effect of oral levodopa application on dysphagia in advanced PD patients with motor fluctuations. In 15 PD patients (mean age 71.93 ± 8.29 years, mean disease duration 14.33 ± 5.94 years) with oropharyngeal dysphagia and motor fluctuations, endoscopic swallowing evaluation was performed in the off state and on state condition following a specifically developed protocol (FEES-levodopa-test). The respective dysphagia score covered three salient parameters, i.e. premature spillage, penetration/aspiration events and residues, each tested with liquid as well as semisolid and solid food consistencies. An improvement of >30% in this score indicated levodopa responsiveness of dysphagia. Measures were compared between the off- and on-state condition by using the Wilcoxon Test and marginal homogeneity test. Inter- and intrarater reliability was also investigated. Severity of swallowing dysfunction in the off state varied widely. The lowest dysphagia score was 15 points (dysphagia without any aspiration risk). The highest dysphagia score was 84 points (dysphagia with aspiration of all consistencies). Seven patients showed a marked improvement of dysphagia in the on state condition. Eight PD patients did not respond. Inter- and intrarater reliability was excellent for all three subscales in the off state and on state conditions. A significant proportion of advanced PD patients with motor fluctuations and mild to moderate oropharyngeal dysphagia may demonstrate a clinically relevant improvement of swallowing after levodopa challenge. The FEES-levodopa-test is a reliable and sensitive tool to differentiate these responders from non-responders. Copyright © 2016 Elsevier Ltd. All rights reserved.
Energy Efficient Engine integrated core/low spool design and performance report

NASA Technical Reports Server (NTRS)

Stearns, E. Marshall

1985-01-01

The Energy Efficient Engine (E3) is a NASA program to create fuel saving technology for future transport aircraft engines. The E3 technology advancements were demonstrated to operate reliably and achieve goal performance in tests of the Integrated Core/Low Spool vehicle. The first build of this undeveloped technology research engine set a record for low fuel consumption. Its design and detailed test results are herein presented.
Quality Inspection Test, Demonstration and Evaluation Report for High Reliability Laser Polarizers. Revision

DTIC Science & Technology

1988-02-24

switch timing, the length of the optical resonator , or both. The output of the test source (Nd:YAG) is set to the desired intensity with a variable...1 2.0 POLARIZER TECHNICAL SPECIFICATION .......................... 3 3.0 PRISM FABRICATION AND INSPECTION ........................... 4...was updated before the beginning of the Phase III work. The primary changes were consolidating the prism half and assembly drawings so surfaces are
DOE Office of Scientific and Technical Information (OSTI.GOV)

V Yashchuk; R Conley; E Anderson

Verification of the reliability of metrology data from high quality X-ray optics requires that adequate methods for test and calibration of the instruments be developed. For such verification for optical surface profilometers in the spatial frequency domain, a modulation transfer function (MTF) calibration method based on binary pseudo-random (BPR) gratings and arrays has been suggested [1] and [2] and proven to be an effective calibration method for a number of interferometric microscopes, a phase shifting Fizeau interferometer, and a scatterometer [5]. Here we describe the details of development of binary pseudo-random multilayer (BPRML) test samples suitable for characterization of scanning (SEM) and transmission (TEM) electron microscopes. We discuss the results of TEM measurements with the BPRML test samples fabricated from a WiSi2/Si multilayer coating with pseudo-randomly distributed layers. In particular, we demonstrate that significant information about the metrological reliability of the TEM measurements can be extracted even when the fundamental frequency of the BPRML sample is smaller than the Nyquist frequency of the measurements. The measurements demonstrate a number of problems related to the interpretation of the SEM and TEM data. Note that similar BPRML test samples can be used to characterize X-ray microscopes. Corresponding work with X-ray microscopes is in progress.
The Stigma Resistance Scale: A multi-sample validation of a new instrument to assess mental illness stigma resistance.

PubMed

Firmin, Ruth L; Lysaker, Paul H; McGrew, John H; Minor, Kyle S; Luther, Lauren; Salyers, Michelle P

2017-12-01

Although associated with key recovery outcomes, stigma resistance remains under-studied largely due to limitations of existing measures. This study developed and validated a new measure of stigma resistance. Preliminary items, derived from qualitative interviews of people with lived experience, were pilot tested online with people self-reporting a mental illness diagnosis (n = 489). Best performing items were selected, and the refined measure was administered to an independent sample of people with mental illness at two state mental health consumer recovery conferences (n = 202). Confirmatory factor analyses (CFA) guided by theory were used to test item fit, correlations between the refined stigma resistance measure and theoretically relevant measures were examined for validity, and test-retest correlations of a subsample were examined for stability. CFA demonstrated strong fit for a 5-factor model. The final 20-item measure demonstrated good internal consistency for each of the 5 subscales, adequate test-retest reliability at 3 weeks, and strong construct validity (i.e., positive associations with quality of life, recovery, and self-efficacy, and negative associations with overall symptoms, defeatist beliefs, and self-stigma). The new measure offers a more reliable and nuanced assessment of stigma resistance. It may afford greater personalization of interventions targeting stigma resistance. Copyright © 2017 Elsevier B.V. All rights reserved.
Test-retest reliability of Physical Activity Neighborhood Environment Scale among urban men and women in Nanjing, China.

PubMed

Zhao, L; Wang, Z; Qin, Z; Leslie, E; He, J; Xiong, Y; Xu, F

2018-03-01

The identification of physical-activity-friendly built environment (BE) constructs is highly useful for physical activity promotion and maintenance. The Physical Activity Neighborhood Environment Scale (PANES) was developed for assessing BE correlates. However, PANES reliability has not been investigated among adults in China. A cross-sectional study. With multistage sampling approaches, 1568 urban adults (aged 35-74 years) were recruited for the initial survey on all 17 items of the PANES Chinese version (PANES-CHN), with the survey repeated 7 days later for each participant. The intraclass correlation coefficient (ICC) was used to assess the test-retest reliability of PANES-CHN for each item. In total, 1551 participants completed both surveys (follow-up rate = 98.9%). Among participants (mean age: 54.7 ± 11.1 years), 47.8% were men, 22.1% were elders, and 22.7% had ≥13 years of education. Overall, the PANES-CHN demonstrated at least substantial reliability with ICCs ranging from 0.66 to 0.95 (core items), from 0.75 to 0.95 (recommended items), and from 0.78 to 0.87 (optional items). Similar outcomes were observed when data were analyzed by gender or age groups. The PANES-CHN has excellent test-retest reliability and thus has valuable utility for assessing urban BE attributes among Chinese adults. Copyright © 2017 The Royal Society for Public Health. Published by Elsevier Ltd. All rights reserved.
Cross-cultural adaptation and validation of Persian Achilles tendon Total Rupture Score.

PubMed

Ansari, Noureddin Nakhostin; Naghdi, Soofia; Hasanvand, Sahar; Fakhari, Zahra; Kordi, Ramin; Nilsson-Helander, Katarina

2016-04-01

To cross-culturally adapt the Achilles tendon Total Rupture Score (ATRS) to the Persian language and to preliminarily evaluate the reliability and validity of a Persian ATRS. A cross-sectional and prospective cohort study was conducted to translate and cross-culturally adapt the ATRS to the Persian language (ATRS-Persian) following steps described in guidelines. Thirty patients with total Achilles tendon rupture and 30 healthy subjects participated in this study. Psychometric properties of floor/ceiling effects (responsiveness), internal consistency reliability, test-retest reliability, standard error of measurement (SEM), smallest detectable change (SDC), construct validity, and discriminant validity were tested. Factor analysis was performed to determine the ATRS-Persian structure. There were no floor or ceiling effects that indicate the content and responsiveness of ATRS-Persian. Internal consistency was high (Cronbach's α 0.95). Item-total correlations exceeded the acceptable standard of 0.3 for all items (0.58-0.95). The test-retest reliability was excellent [(ICC)agreement 0.98]. SEM and SDC were 3.57 and 9.9, respectively. Construct validity was supported by a significant correlation between the ATRS-Persian total score and the Persian Foot and Ankle Outcome Score (PFAOS) total score and PFAOS subscales (r = 0.55-0.83). The ATRS-Persian significantly discriminated between patients and healthy subjects. Explanatory factor analysis revealed 1 component. The ATRS was cross-culturally adapted to Persian and demonstrated to be a reliable and valid instrument to measure functional outcomes in Persian patients with Achilles tendon rupture. II.
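The reported SEM (3.57) and SDC (9.9) are numerically consistent with the widely used relation SDC = 1.96 × √2 × SEM. The abstract does not state which formula the authors applied, so the sketch below is an assumption-based check of the arithmetic; the function names and the SD/ICC pair in the second call are hypothetical.

```python
import math


def sem_from_icc(sd, icc):
    """Standard error of measurement from a between-subject SD and an ICC."""
    return sd * math.sqrt(1.0 - icc)


def smallest_detectable_change(sem, z=1.96):
    """SDC at the confidence level implied by z (1.96 for 95%).

    The sqrt(2) factor reflects that a change score is the difference
    of two measurements, each carrying measurement error.
    """
    return z * math.sqrt(2.0) * sem


# Check against the figures quoted in the abstract (SEM = 3.57).
print(round(smallest_detectable_change(3.57), 1))  # -> 9.9

# Hypothetical example: SD = 10 points, ICC = 0.96 gives SEM of about 2.0.
print(round(sem_from_icc(10.0, 0.96), 2))  # -> 2.0
```

Under this reading, a change in an individual's score smaller than about 9.9 points could not be distinguished from measurement error.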
Reliability and Validity of the Footprint Assessment Method Using Photoshop CS5 Software.

PubMed

Gutiérrez-Vilahú, Lourdes; Massó-Ortigosa, Núria; Costa-Tutusaus, Lluís; Guerra-Balic, Myriam

2015-05-01

Several sophisticated methods of footprint analysis currently exist. However, it is sometimes useful to apply standard measurement methods of recognized evidence with an easy and quick application. We sought to assess the reliability and validity of a new method of footprint assessment in a healthy population using Photoshop CS5 software (Adobe Systems Inc, San Jose, California). Forty-two footprints, corresponding to 21 healthy individuals (11 men with a mean ± SD age of 20.45 ± 2.16 years and 10 women with a mean ± SD age of 20.00 ± 1.70 years) were analyzed. Footprints were recorded in static bipedal standing position using optical podography and digital photography. Three trials for each participant were performed. The Hernández-Corvo, Chippaux-Smirak, and Staheli indices and the Clarke angle were calculated by manual method and by computerized method using Photoshop CS5 software. Test-retest was used to determine reliability. Validity was obtained by intraclass correlation coefficient (ICC). The reliability test for all of the indices showed high values (ICC, 0.98-0.99). Moreover, the validity test clearly showed no difference between techniques (ICC, 0.99-1). The reliability and validity of a method to measure, assess, and record the podometric indices using Photoshop CS5 software has been demonstrated. This provides a quick and accurate tool useful for the digital recording of morphostatic foot study parameters and their control.
Validation of Yoruba Version of Family Burden Interview Schedule (Y-FBIS) on Caregivers of Schizophrenia Patients

PubMed Central

Lasebikan, Victor Olufolahan

2012-01-01

Objective. To validate the Yoruba version of Family Burden Interview Schedule (Y-FBIS) for assessing the burden on caregivers of persons with schizophrenia. Methods. Three hundred and sixty-eight dyads of persons with schizophrenia and their caregivers were recruited from a psychiatric outpatient clinic. The Y-FBIS and the Yoruba version of the GHQ-12 (Y-GHQ-12) were applied to the caregivers. Patients' level of social functioning was assessed using the Global Assessment of Functioning scale. Results. All (368) caregivers were used for tests of internal consistency, 180 for interrater reliability, and another 180 for test-retest reliability. Internal consistency of the Y-FBIS was demonstrated by a significant Cronbach α of between 0.62 and 0.82 for each item. Concurrent validity of the Y-FBIS was illustrated by its significant positive correlation with Y-GHQ-12 (r = 0.633, P < 0.01). Split-half reliability was 0.849. Intraclass correlation coefficient for the total score of Y-FBIS was 0.849 at 95% confidence interval. Test-retest reliability of individual scales ranged from 0.780 to 0.874 and was 0.830 for total objective scale score. Convergent validity was shown by the significant positive correlation (r = 0.83) between the objective burden score and subjective burden score of Y-FBIS. ROC curve area was 0.981. Conclusion. The Y-FBIS is a valid, reliable, and sensitive instrument for assessing the burden on caregivers of persons with schizophrenia in Nigeria. PMID:23738196
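Internal consistency in records like the one above is usually summarised with Cronbach's α. The following is a minimal sketch of the standard formula, α = k/(k−1) × (1 − Σ item variances / variance of total scores); the function name `cronbach_alpha` and the response data are hypothetical and unrelated to the Y-FBIS data set.

```python
from statistics import pvariance


def cronbach_alpha(items):
    """Cronbach's alpha for a list of item columns.

    `items[i][r]` is the score of respondent r on item i. The same
    variance estimator is used for items and totals, so the choice of
    population versus sample variance does not change the ratio.
    """
    k = len(items)
    if k < 2:
        raise ValueError("need at least two items")
    totals = [sum(resp) for resp in zip(*items)]  # total score per respondent
    item_var_sum = sum(pvariance(col) for col in items)
    return k / (k - 1) * (1.0 - item_var_sum / pvariance(totals))


# Hypothetical responses: 3 items answered by 5 respondents.
items = [
    [2, 3, 4, 4, 5],
    [1, 3, 3, 5, 5],
    [2, 2, 4, 4, 4],
]
print(round(cronbach_alpha(items), 3))
```

Items that move together inflate the variance of the total score relative to the summed item variances, which pushes α toward 1.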
Advanced Guidance and Control Methods for Reusable Launch Vehicles: Test Results

NASA Technical Reports Server (NTRS)

Hanson, John M.; Jones, Robert E.; Krupp, Don R.; Fogle, Frank R. (Technical Monitor)

2002-01-01

There are a number of approaches to advanced guidance and control (AG&C) that have the potential for achieving the goals of significantly increasing reusable launch vehicle (RLV) safety/reliability and reducing the cost. In this paper, we examine some of these methods and compare the results. We briefly introduce the various methods under test, list the test cases used to demonstrate that the desired results are achieved, show an automated test scoring method that greatly reduces the evaluation effort required, and display results of the tests. Results are shown for the algorithms that have entered testing so far.
DOE Office of Scientific and Technical Information (OSTI.GOV)

Yashchuk, V.V.; Conley, R.; Anderson, E.H.

Verification of the reliability of metrology data from high quality X-ray optics requires that adequate methods for test and calibration of the instruments be developed. For such verification for optical surface profilometers in the spatial frequency domain, a modulation transfer function (MTF) calibration method based on binary pseudo-random (BPR) gratings and arrays has been suggested and proven to be an effective calibration method for a number of interferometric microscopes, a phase shifting Fizeau interferometer, and a scatterometer. Here we describe the details of development of binary pseudo-random multilayer (BPRML) test samples suitable for characterization of scanning (SEM) and transmission (TEM) electron microscopes. We discuss the results of TEM measurements with the BPRML test samples fabricated from a WiSi2/Si multilayer coating with pseudo-randomly distributed layers. In particular, we demonstrate that significant information about the metrological reliability of the TEM measurements can be extracted even when the fundamental frequency of the BPRML sample is smaller than the Nyquist frequency of the measurements. The measurements demonstrate a number of problems related to the interpretation of the SEM and TEM data. Note that similar BPRML test samples can be used to characterize X-ray microscopes. Corresponding work with X-ray microscopes is in progress.
Acoustic stapedial reflexes in healthy neonates: normative data and test-retest reliability.

PubMed

Kei, Joseph

2012-01-01

The acoustic stapedial reflex (ASR) test provides useful information about the function of the auditory system. While it is frequently used with adults and children in a clinical setting, its use with young infants is limited. Presently, there are few data for neonates and inadequate research into the test-retest reliability of the ASR test. This study aimed to establish normative data and evaluate the test-retest reliability of the ASR test in healthy neonates. A cross-sectional experimental design was used to establish ASR normative data and assess the test-retest reliability of ASR thresholds obtained from healthy neonates. Sixty-eight full-term neonates with mean chronological age of 2.5 days (SD = 1.8 day), who passed the automated auditory brainstem response, transient evoked otoacoustic emission, and high frequency (1 kHz) tympanometry (HFT) tests. One randomly selected ear from each neonate was tested using TEOAE (transient evoked otoacoustic emission), HFT, and ASR tests using a 1 kHz probe tone. ASR thresholds were elicited by presenting pure tones of 0.5, 2, and 4 kHz and broadband noise (BBN) separately to the test ear in an ipsilateral stimulation mode. The ASR procedure was repeated to acquire retest data within the same testing session. Descriptive statistics, χ2, and analysis of variance with repeated measures tests were used to analyze ASR data. All neonates exhibited ASR when stimulated by tonal stimuli or BBN. The mean ASRTs (acoustic stapedial reflex thresholds) for the 0.5, 2, and 4 kHz tones were 81.6 ± 7.9, 71.3 ± 7.9, and 65.4 ± 8.7 dB HL, respectively. The mean ASRT for the BBN was estimated to be smaller than 57.2 dB HL, given the limitation of the equipment. The 95th percentiles of the ASRT were 95, 85, 80, and 75 dB HL for the 0.5, 2, and 4 kHz and BBN, respectively. The test-retest reliability of the ASR test for all stimuli was high, with no significant difference in mean ASRTs across the test and retest conditions. 
Test-retest differences were within 10 dB for more than 91% of ASRT data across all stimuli. There was a slight trend of ASRTs being more repeatable in the medium ASRT range than in the higher or lower range. This study demonstrated that ASRTs obtained from healthy neonates were highly repeatable across test and retest sessions. Given the availability of normative data and the high test-retest reliability, the ASR test will be useful as a diagnostic tool in a battery of tests to evaluate the auditory function of neonates. American Academy of Audiology.
Improving fMRI reliability in presurgical mapping for brain tumours.

PubMed

Stevens, M Tynan R; Clarke, David B; Stroink, Gerhard; Beyea, Steven D; D'Arcy, Ryan Cn

2016-03-01

Functional MRI (fMRI) is becoming increasingly integrated into clinical practice for presurgical mapping. Current efforts are focused on validating data quality, with reliability being a major factor. In this paper, we demonstrate the utility of a recently developed approach that uses receiver operating characteristic-reliability (ROC-r) to: (1) identify reliable versus unreliable data sets; (2) automatically select processing options to enhance data quality; and (3) automatically select individualised thresholds for activation maps. Presurgical fMRI was conducted in 16 patients undergoing surgical treatment for brain tumours. Within-session test-retest fMRI was conducted, and ROC-reliability of the patient group was compared to a previous healthy control cohort. Individually optimised preprocessing pipelines were determined to improve reliability. Spatial correspondence was assessed by comparing the fMRI results to intraoperative cortical stimulation mapping, in terms of the distance to the nearest active fMRI voxel. The average ROC-r reliability for the patients was 0.58±0.03, as compared to 0.72±0.02 in healthy controls. For the patient group, this increased significantly to 0.65±0.02 by adopting optimised preprocessing pipelines. Co-localisation of the fMRI maps with cortical stimulation was significantly better for more reliable versus less reliable data sets (8.3±0.9 vs 29±3 mm, respectively). We demonstrated ROC-r analysis for identifying reliable fMRI data sets, choosing optimal postprocessing pipelines, and selecting patient-specific thresholds. Data sets with higher reliability also showed closer spatial correspondence to cortical stimulation. ROC-r can thus identify poor fMRI data at time of scanning, allowing for repeat scans when necessary. ROC-r analysis provides optimised and automated fMRI processing for improved presurgical mapping. Published by the BMJ Publishing Group Limited. 

Assessment Literacy: Building a Base for Better Teaching and Learning

ERIC Educational Resources Information Center

Rogler, Dawn

2014-01-01

This article presents principles and practices of effective assessment, outlining seven key concepts--usefulness, reliability, validity, practicality, washback, authenticity, and transparency--and demonstrating how to apply them in creating an exam blueprint. The article also discusses the importance of providing feedback after a test has been…
Testing of printed circuit board solder joints by optical correlation

NASA Technical Reports Server (NTRS)

Espy, P. N.

1975-01-01

An optical correlation technique for the nondestructive evaluation of printed circuit board solder joints was evaluated. Reliable indications of induced stress levels in solder joint lead wires are achievable. Definite relations between the inherent strength of a solder joint, with its associated ability to survive stress, are demonstrable.
Assessing Performance in Shoulder Arthroscopy: The Imperial Global Arthroscopy Rating Scale (IGARS).

PubMed

Bayona, Sofia; Akhtar, Kash; Gupte, Chinmay; Emery, Roger J H; Dodds, Alexander L; Bello, Fernando

2014-07-02

Surgical training is undergoing major changes with reduced resident work hours and an increasing focus on patient safety and surgical aptitude. The aim of this study was to create a valid, reliable method for an assessment of arthroscopic skills that is independent of time and place and is designed for both real and simulated settings. The validity of the scale was tested using a virtual reality shoulder arthroscopy simulator. The study consisted of two parts. In the first part, an Imperial Global Arthroscopy Rating Scale for assessing technical performance was developed using a Delphi method. Application of this scale required installing a dual-camera system to synchronously record the simulator screen and body movements of trainees to allow an assessment that is independent of time and place. The scale includes aspects such as efficient portal positioning, angles of instrument insertion, proficiency in handling the arthroscope and adequately manipulating the camera, and triangulation skills. In the second part of the study, a validation study was conducted. Two experienced arthroscopic surgeons, blinded to the identities and experience of the participants, each assessed forty-nine subjects performing three different tests using the Imperial Global Arthroscopy Rating Scale. Results were analyzed using two-way analysis of variance with measures of absolute agreement. The intraclass correlation coefficient was calculated for each test to assess inter-rater reliability. The scale demonstrated high internal consistency (Cronbach alpha, 0.918). The intraclass correlation coefficient demonstrated high agreement between the assessors: 0.91 (p < 0.001). Construct validity was evaluated using Kruskal-Wallis one-way analysis of variance (chi-square test, 29.826; p < 0.001), demonstrating that the Imperial Global Arthroscopy Rating Scale distinguishes significantly between subjects with different levels of experience utilizing a virtual reality simulator. 
The Imperial Global Arthroscopy Rating Scale has a high internal consistency and excellent inter-rater reliability and offers an approach for assessing technical performance in basic arthroscopy on a virtual reality simulator. The Imperial Global Arthroscopy Rating Scale provides detailed information on surgical skills. Although it requires further validation in the operating room, this scale, which is independent of time and place, offers a robust and reliable method for assessing arthroscopic technical skills. Copyright © 2014 by The Journal of Bone and Joint Surgery, Incorporated.
Using exogenous variables in testing for monotonic trends in hydrologic time series

USGS Publications Warehouse

Alley, William M.

1988-01-01

One approach that has been used in performing a nonparametric test for monotonic trend in a hydrologic time series consists of a two-stage analysis. First, a regression equation is estimated for the variable being tested as a function of an exogenous variable. A nonparametric trend test such as the Kendall test is then performed on the residuals from the equation. By analogy to stagewise regression and through Monte Carlo experiments, it is demonstrated that this approach will tend to underestimate the magnitude of the trend and to result in some loss in power as a result of ignoring the interaction between the exogenous variable and time. An alternative approach, referred to as the adjusted variable Kendall test, is demonstrated to generally have increased statistical power and to provide more reliable estimates of the trend slope. In addition, the utility of including an exogenous variable in a trend test is examined under selected conditions.
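The two procedures contrasted above can be sketched in plain Python. The two-stage approach runs a Kendall test of time against the residuals of the response regressed on the exogenous variable. The adjusted-variable version, as understood here, also regresses time on the exogenous variable and tests the two sets of residuals against each other. All names, the synthetic data, and the tie-free variance formula are assumptions for illustration and do not reproduce the paper's Monte Carlo setup.

```python
import math
import random


def ols_residuals(y, x):
    """Residuals from a simple least-squares regression of y on x."""
    n = len(y)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return [yi - (intercept + slope * xi) for xi, yi in zip(x, y)]


def sign(v):
    return (v > 0) - (v < 0)


def kendall_s(a, b):
    """Kendall S statistic: concordant minus discordant pairs."""
    n = len(a)
    return sum(
        sign(a[j] - a[i]) * sign(b[j] - b[i])
        for i in range(n - 1)
        for j in range(i + 1, n)
    )


def kendall_z(s, n):
    """Normal approximation with continuity correction, assuming no ties."""
    var_s = n * (n - 1) * (2 * n + 5) / 18.0
    if s > 0:
        return (s - 1) / math.sqrt(var_s)
    if s < 0:
        return (s + 1) / math.sqrt(var_s)
    return 0.0


# Synthetic (hypothetical) series: the response depends on an exogenous
# variable (e.g. streamflow) plus a small upward trend in time.
rng = random.Random(0)
years = list(range(30))
flow = [rng.gauss(0.0, 1.0) for _ in years]
conc = [2.0 * f + 0.05 * t + rng.gauss(0.0, 0.5) for f, t in zip(flow, years)]

resid_y = ols_residuals(conc, flow)

# Two-stage test: Kendall test of time against the residuals of y on x.
z_two_stage = kendall_z(kendall_s(years, resid_y), len(years))

# Adjusted-variable sketch: also remove the exogenous variable from time.
resid_t = ols_residuals(years, flow)
z_adjusted = kendall_z(kendall_s(resid_t, resid_y), len(years))

print(z_two_stage, z_adjusted)
```

When the exogenous variable is itself correlated with time, the two statistics can differ noticeably, which is the situation the abstract identifies as causing loss of power in the two-stage approach.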
Reliability and Validity of a New Method for Isometric Back Extensor Strength Evaluation Using A Hand-Held Dynamometer.

PubMed

Park, Hee-Won; Baek, Sora; Kim, Hong Young; Park, Jung-Gyoo; Kang, Eun Kyoung

2017-10-01

To investigate the reliability and validity of a new method for isometric back extensor strength measurement using a portable dynamometer. A chair equipped with a small portable dynamometer was designed (Power Track II Commander Muscle Tester). A total of 15 men (mean age, 34.8±7.5 years) and 15 women (mean age, 33.1±5.5 years) with no current back problems or previous history of back surgery were recruited. Subjects were asked to push the back of the chair while seated, and their isometric back extensor strength was measured by the portable dynamometer. Test-retest reliability was assessed with intraclass correlation coefficient (ICC). For the validity assessment, isometric back extensor strength of all subjects was measured by a widely used physical performance evaluation instrument, BTE PrimusRS system. The limit of agreement (LoA) from the Bland-Altman plot was evaluated between two methods. The test-retest reliability was excellent (ICC=0.82; 95% confidence interval, 0.65-0.91). The Bland-Altman plots demonstrated acceptable agreement between the two methods: the lower 95% LoA was -63.1 N and the upper 95% LoA was 61.1 N. This study shows that isometric back extensor strength measurement using a portable dynamometer has good reliability and validity.
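The Bland-Altman limits of agreement quoted above are conventionally the mean difference between two methods plus or minus 1.96 standard deviations of the differences. Assuming that convention, the reported limits (-63.1 N and 61.1 N) would imply a mean difference of about -1.0 N. The sketch below shows the calculation; the function name and the paired readings are hypothetical, not study data.

```python
from statistics import mean, stdev


def limits_of_agreement(method_a, method_b, z=1.96):
    """Return (lower LoA, bias, upper LoA) for paired measurements."""
    diffs = [a - b for a, b in zip(method_a, method_b)]
    bias = mean(diffs)          # systematic difference between methods
    spread = z * stdev(diffs)   # half-width of the agreement interval
    return bias - spread, bias, bias + spread


# Hypothetical paired strength readings (newtons) from two devices.
portable = [412.0, 388.0, 455.0, 501.0, 367.0, 430.0]
reference = [420.0, 380.0, 470.0, 490.0, 372.0, 425.0]

lower, bias, upper = limits_of_agreement(portable, reference)
print(round(lower, 1), round(bias, 1), round(upper, 1))
```

Whether limits of a given width count as acceptable is a clinical judgement about how large a disagreement matters, not something the calculation itself decides.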
Validation of the MISSCARE-BRASIL survey - A tool to assess missed nursing care.

PubMed

Siqueira, Lillian Dias Castilho; Caliri, Maria Helena Larcher; Haas, Vanderlei José; Kalisch, Beatrice; Dantas, Rosana Aparecida Spadoti

2017-12-21

To analyze the metric validity and reliability properties of the MISSCARE-BRASIL survey. Methodological research was conducted by assessing construct validity and reliability via confirmatory factor analysis, known-groups validation, convergent construct validation, analysis of internal consistency and test-retest reliability. The sample consisted of 330 nursing professionals, of whom 86 participated in the retest phase. Of the 330 participants, 39.7% were aides, 33% technicians, 20.9% nurses, and 6.4% nurses with administrative roles. Confirmatory factor analysis demonstrated that the Brazilian Portuguese version of the instrument is adequately adjusted to the dimensional structure the scale authors originally proposed. The correlation between "satisfaction with position/role" and "satisfaction with teamwork" and the survey's missed care variables was moderate (Spearman's coefficient = 0.35; p<0.001). The results of the Student's t-test indicated known-group validity. Professionals from closed units reported lower levels of missed care in comparison with the other units. The reliability showed a strong correlation, with the exception of "institutional management/leadership style" (intraclass correlation coefficient (ICC)=0.15; p=0.04). The internal consistency was adequate (Cronbach's alpha was greater than 0.70). The MISSCARE-BRASIL was valid and reliable in the group studied. The application of the MISSCARE-BRASIL can contribute to identifying solutions for missed nursing care.
Large Liquid Rocket Testing: Strategies and Challenges

NASA Technical Reports Server (NTRS)

Rahman, Shamim A.; Hebert, Bartt J.

2005-01-01

Rocket propulsion development is enabled by rigorous ground testing in order to mitigate the propulsion systems risks that are inherent in space flight. This is true for virtually all propulsive devices of a space vehicle including liquid and solid rocket propulsion, chemical and non-chemical propulsion, boost stage and in-space propulsion and so forth. In particular, large liquid rocket propulsion development and testing over the past five decades of human and robotic space flight has involved a combination of component-level testing and engine-level testing to first demonstrate that the propulsion devices were designed to meet the specified requirements for the Earth to Orbit launchers that they powered. This was followed by a vigorous test campaign to demonstrate the designed propulsion articles over the required operational envelope, and over robust margins, such that a sufficiently reliable propulsion system is delivered prior to first flight. It is possible that hundreds of tests, and on the order of a hundred thousand test seconds, are needed to achieve a high-reliability, flight-ready, liquid rocket engine system. This paper overviews aspects of earlier and recent experience of liquid rocket propulsion testing at NASA Stennis Space Center, where full scale flight engines and flight stages, as well as a significant amount of development testing has taken place in the past decade. The liquid rocket testing experience discussed includes testing of engine components (gas generators, preburners, thrust chambers, pumps, powerheads), as well as engine systems and complete stages. The number of tests, accumulated test seconds, and years of test stand occupancy needed to meet varying test objectives, will be selectively discussed and compared for the wide variety of ground test work that has been conducted at Stennis for subscale and full scale liquid rocket devices. 
Since rocket propulsion is a crucial long-lead element of any space system acquisition or development, the appropriate plan and strategy must be put in place at the outset of the development effort. A deferment of this test planning, or inattention to strategy, will compromise the ability of the development program to achieve its systems reliability requirements and/or its development milestones. It is important for the government leadership and support team, as well as the vehicle and propulsion development team, to give early consideration to this aspect of space propulsion and space transportation work.
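One textbook way to see why demonstration campaigns can require hundreds of tests is the zero-failure (success-run) binomial relation, n = ln(1 - C) / ln(R), for demonstrating reliability R at confidence C. The sketch below is an illustrative assumption only: the abstract does not say which statistical method the authors use, the function name is hypothetical, and the relation treats every test as an independent pass/fail trial.

```python
import math

def zero_failure_tests_needed(reliability, confidence):
    """Smallest number of consecutive failure-free tests that demonstrates
    `reliability` at `confidence` under a simple binomial model."""
    if not (0 < reliability < 1 and 0 < confidence < 1):
        raise ValueError("reliability and confidence must lie strictly between 0 and 1")
    return math.ceil(math.log(1 - confidence) / math.log(reliability))
```

Under these assumptions, demonstrating 0.99 reliability at 90% confidence takes 230 failure-free tests, while 0.90 reliability at the same confidence takes 22.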
Thermal Expert System (TEXSYS): Systems autonomy demonstration project, volume 2. Results

NASA Technical Reports Server (NTRS)

Glass, B. J. (Editor)

1992-01-01

The Systems Autonomy Demonstration Project (SADP) produced a knowledge-based real-time control system for control and fault detection, isolation, and recovery (FDIR) of a prototype two-phase Space Station Freedom external active thermal control system (EATCS). The Thermal Expert System (TEXSYS) was demonstrated in recent tests to be capable of reliable fault anticipation and detection, as well as ordinary control of the thermal bus. Performance requirements were addressed by adopting a hierarchical symbolic control approach-layering model-based expert system software on a conventional, numerical data acquisition and control system. The model-based reasoning capabilities of TEXSYS were shown to be advantageous over typical rule-based expert systems, particularly for detection of unforeseen faults and sensor failures. Volume 1 gives a project overview and testing highlights. Volume 2 provides detail on the EATCS testbed, test operations, and online test results. Appendix A is a test archive, while Appendix B is a compendium of design and user manuals for the TEXSYS software.
Thermal Expert System (TEXSYS): Systems autonomy demonstration project, volume 2. Results

NASA Astrophysics Data System (ADS)

Glass, B. J.

1992-10-01

The Systems Autonomy Demonstration Project (SADP) produced a knowledge-based real-time control system for control and fault detection, isolation, and recovery (FDIR) of a prototype two-phase Space Station Freedom external active thermal control system (EATCS). The Thermal Expert System (TEXSYS) was demonstrated in recent tests to be capable of reliable fault anticipation and detection, as well as ordinary control of the thermal bus. Performance requirements were addressed by adopting a hierarchical symbolic control approach-layering model-based expert system software on a conventional, numerical data acquisition and control system. The model-based reasoning capabilities of TEXSYS were shown to be advantageous over typical rule-based expert systems, particularly for detection of unforeseen faults and sensor failures. Volume 1 gives a project overview and testing highlights. Volume 2 provides detail on the EATCS testbed, test operations, and online test results. Appendix A is a test archive, while Appendix B is a compendium of design and user manuals for the TEXSYS software.
Reliability, validity and minimal detectable change of the Mini-BESTest in Greek participants with chronic stroke.

PubMed

Lampropoulou, Sofia I; Billis, Evdokia; Gedikoglou, Ingrid A; Michailidou, Christina; Nowicky, Alexander V; Skrinou, Dimitra; Michailidi, Fotini; Chandrinou, Danae; Meligkoni, Margarita

2018-02-23

This study aimed to investigate the psychometric characteristics of reliability, validity and ability to detect change of a newly developed balance assessment tool, the Mini-BESTest, in Greek patients with stroke. A prospective, observational design study with test-retest measures was conducted. A convenience sample of 21 Greek patients with chronic stroke (14 male, 7 female; age of 63 ± 16 years) was recruited. Two independent examiners administered the scale, for the inter-rater reliability, twice within 10 days for the test-retest reliability. Bland Altman Analysis for repeated measures assessed the absolute reliability and the Standard Error of Measurement (SEM) and the Minimum Detectable Change at 95% confidence interval (MDC95%) were established. The Greek Mini-BESTest (Mini-BESTest GR) was correlated with the Greek Berg Balance Scale (BBS GR) for assessing the concurrent validity and with the Timed Up and Go (TUG), the Functional Reach Test (FRT) and the Greek Falls Efficacy Scale-International (FES-I GR) for the convergent validity. The Mini-BESTest GR demonstrated excellent inter-rater reliability (ICC (95%CI) = 0.997 (0.995-0.999), SEM = 0.46) with the scores of two raters within the limits of agreement (mean dif = -0.143 ± 0.727, p > 0.05) and test-retest reliability (ICC (95%CI) = 0.966 (0.926-0.988), SEM = 1.53). Additionally, the Mini-BESTest GR yielded very strong to moderate correlations with BBS GR (r = 0.924, p < 0.001), TUG (r = -0.823, p < 0.001), FES-I GR (r = -0.734, p < 0.001) and FRT (r = 0.689, p < 0.001). MDC95% was 4.25 points. The exceptionally high reliability and the equally good validity of the Mini-BESTest GR strongly support its utility in Greek people with chronic stroke. Its ability to identify clinically meaningful changes and falls risk needs further investigation.
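The relationship between the SEM and the minimum detectable change quoted above can be checked with the conventional formulas SEM = SD × sqrt(1 - ICC) and MDC95 = 1.96 × sqrt(2) × SEM. This is a sketch under the assumption that the authors used these standard definitions, which the abstract does not state explicitly; the function names are hypothetical.

```python
import math

def sem_from_icc(sd, icc):
    """Standard error of measurement from the between-subject SD and the ICC."""
    return sd * math.sqrt(1 - icc)

def mdc95(sem):
    """Minimum detectable change at 95% confidence for a test-retest difference."""
    return 1.96 * math.sqrt(2) * sem
```

With the reported test-retest SEM of 1.53, `mdc95(1.53)` evaluates to about 4.24 points, close to the 4.25 points given in the abstract; the small gap is consistent with rounding of the SEM.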
High power diode lasers emitting from 639 nm to 690 nm

NASA Astrophysics Data System (ADS)

Bao, L.; Grimshaw, M.; DeVito, M.; Kanskar, M.; Dong, W.; Guan, X.; Zhang, S.; Patterson, J.; Dickerson, P.; Kennedy, K.; Li, S.; Haden, J.; Martinsen, R.

2014-03-01

There is increasing market demand for high power reliable red lasers for display and cinema applications. Due to the fundamental material system limit at this wavelength range, red diode lasers have lower efficiency and are more temperature sensitive, compared to 790-980 nm diode lasers. In terms of reliability, red lasers are also more sensitive to catastrophic optical mirror damage (COMD) due to the higher photon energy. Thus developing higher-power reliable red lasers is very challenging. This paper will present nLIGHT's released red products from 639 nm to 690 nm, with established high performance and long-term reliability. These single emitter diode lasers can work as stand-alone single-emitter units or efficiently integrate into our compact, passively-cooled Pearl™ fiber-coupled module architectures for higher output power and improved reliability. In order to further improve power and reliability, new chip optimizations have been focused on improving epitaxial design/growth, chip configuration/processing and optical facet passivation. Initial optimization has demonstrated promising results for 639 nm diode lasers to be reliably rated at 1.5 W and 690 nm diode lasers to be reliably rated at 4.0 W. Accelerated life testing has started, and further design optimization is underway.
Pure-tone audiometry outside a sound booth using earphone attenuation, integrated noise monitoring, and automation.

PubMed

Swanepoel, De Wet; Matthysen, Cornelia; Eikelboom, Robert H; Clark, Jackie L; Hall, James W

2015-01-01

Accessibility of audiometry is hindered by the cost of sound booths and shortage of hearing health personnel. This study investigated the validity of an automated mobile diagnostic audiometer with increased attenuation and real-time noise monitoring for clinical testing outside a sound booth. Attenuation characteristics and reference ambient noise levels for the computer-based audiometer (KUDUwave) were evaluated alongside the validity of environmental noise monitoring. Clinical validity was determined by comparing air- and bone-conduction thresholds obtained inside and outside the sound booth (23 subjects). Participants were twenty-three normal-hearing subjects (age range, 20-75 years; average age 35.5), with a subgroup of 11 subjects used to establish test-retest reliability. Improved passive attenuation and valid environmental noise monitoring were demonstrated. Clinically, air-conduction thresholds inside and outside the sound booth corresponded within 5 dB or less in > 90% of instances (mean absolute difference 3.3 ± 3.2 SD). Bone conduction thresholds corresponded within 5 dB or less in 80% of comparisons between test environments, with a mean absolute difference of 4.6 dB (3.7 SD). Threshold differences were not statistically significant. Mean absolute test-retest differences outside the sound booth were similar to those in the booth. Diagnostic pure-tone audiometry outside a sound booth, using automated testing, improved passive attenuation, and real-time environmental noise monitoring, demonstrated reliable hearing assessments.
The Healthy Brain Network Serial Scanning Initiative: a resource for evaluating inter-individual differences and their reliabilities across scan conditions and sessions.

PubMed

O'Connor, David; Potler, Natan Vega; Kovacs, Meagan; Xu, Ting; Ai, Lei; Pellman, John; Vanderwal, Tamara; Parra, Lucas C; Cohen, Samantha; Ghosh, Satrajit; Escalera, Jasmine; Grant-Villegas, Natalie; Osman, Yael; Bui, Anastasia; Craddock, R Cameron; Milham, Michael P

2017-02-01

Although typically measured during the resting state, a growing literature is illustrating the ability to map intrinsic connectivity with functional MRI during task and naturalistic viewing conditions. These paradigms are drawing excitement due to their greater tolerability in clinical and developing populations and because they enable a wider range of analyses (e.g., inter-subject correlations). To be clinically useful, the test-retest reliability of connectivity measured during these paradigms needs to be established. This resource provides data for evaluating test-retest reliability for full-brain connectivity patterns detected during each of four scan conditions that differ with respect to level of engagement (rest, abstract animations, movie clips, flanker task). Data are provided for 13 participants, each scanned in 12 sessions with 10 minutes for each scan of the four conditions. Diffusion kurtosis imaging data was also obtained at each session. Technical validation and demonstrative reliability analyses were carried out at the connection-level using the Intraclass Correlation Coefficient and at network-level representations of the data using the Image Intraclass Correlation Coefficient. Variation in intrinsic functional connectivity across sessions was generally found to be greater than that attributable to scan condition. Between-condition reliability was generally high, particularly for the frontoparietal and default networks. Between-session reliabilities obtained separately for the different scan conditions were comparable, though notably lower than between-condition reliabilities. This resource provides a test-bed for quantifying the reliability of connectivity indices across subjects, conditions and time. The resource can be used to compare and optimize different frameworks for measuring connectivity and data collection parameters such as scan length. 
Additionally, investigators can explore the unique perspectives of the brain's functional architecture offered by each of the scan conditions. © The Author 2017. Published by Oxford University Press.
Development and Testing of the Nurse Manager EBP Competency Scale.

PubMed

Shuman, Clayton J; Ploutz-Snyder, Robert J; Titler, Marita G

2018-02-01

The purpose of this study was to develop and evaluate the validity and reliability of an instrument to measure nurse manager competencies regarding evidence-based practice (EBP). The Nurse Manager EBP Competency Scale consists of 16 items for respondents to indicate their perceived level of competency on a 0 to 3 Likert-type scale. Content validity was demonstrated through expert panel review and pilot testing. Principal axis factoring and Cronbach's alpha evaluated construct validity and internal consistency reliability, respectively. Eighty-three nurse managers completed the scale. Exploratory factor analysis resulted in a 16-item scale with two subscales, EBP Knowledge ( n = 6 items, α = .90) and EBP Activity ( n = 10 items, α = .94). Cronbach's alpha for the entire scale was .95. The Nurse Manager EBP Competency Scale is a brief measure of nurse manager EBP competency with evidence of validity and reliability. The scale can enhance our understanding in future studies regarding how nurse manager EBP competency affects implementation.
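Cronbach's alpha, the internal consistency statistic reported above, is computed from the item variances and the variance of the total score. A minimal sketch, assuming complete data with one row per respondent and one column per item (the function name and the example scores are hypothetical):

```python
from statistics import variance

def cronbach_alpha(rows):
    """Cronbach's alpha for a list of respondents, each a list of item scores."""
    k = len(rows[0])                                      # number of items
    item_variances = [variance(col) for col in zip(*rows)]
    total_variance = variance([sum(row) for row in rows])
    return k / (k - 1) * (1 - sum(item_variances) / total_variance)
```

For example, `cronbach_alpha([[1, 2], [2, 1], [3, 3]])` is about 0.67, and two perfectly parallel items such as `[[1, 1], [2, 2], [3, 3]]` give 1.0.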
Development and validation of parenting measures for body image and eating patterns in childhood.

PubMed

Damiano, Stephanie R; Hart, Laura M; Paxton, Susan J

2015-01-01

Evidence-based parenting interventions are important in assisting parents to help their children develop healthy body image and eating patterns. To adequately assess the impact of parenting interventions, valid parent measures are required. The aim of this study was to develop and assess the validity and reliability of two new parent measures, the Parenting Intentions for Body image and Eating patterns in Childhood (Parenting Intentions BEC) and the Knowledge Test for Body image and Eating patterns in Childhood (Knowledge Test BEC). Participants were 27 professionals working in research or clinical treatment of body dissatisfaction or eating disorders, and 75 parents of children aged 2-6 years, who completed the measures via an online questionnaire. Seven scenarios were developed for the Parenting Intentions BEC to describe common experiences about the body and food that parents might need to respond to in front of their child. Parents ranked four behavioural intentions, derived from the current literature on parenting risk factors for body dissatisfaction and unhealthy eating patterns in children. Two subscales were created, one representing positive behavioural intentions, the other negative behavioural intentions. After piloting a larger pool of items, 13 statements were used to construct the Knowledge Test BEC. These were designed to be factual statements about the influence of parent language, media, family meals, healthy eating, and self-esteem on child eating and body image. The validity of both measures was tested by comparing parent and professional scores, and reliability was assessed by comparing parent scores over two testing occasions. Compared with parents, professionals reported significantly higher scores on the Positive Intentions subscale and significantly lower on the Negative Intentions subscale of the Parenting Intentions BEC; confirming the discriminant validity of six out of the seven scenarios. 
Test-retest reliability was also confirmed as parent scores on the two Parenting Intentions subscales did not differ over time. Eleven out of the 13 Knowledge Test items demonstrated sufficient discriminant validity and test-retest reliability. Overall, results indicated that the six-scenario Parenting Intentions BEC and the 11-item Knowledge Test BEC are valid and reliable measures for parents of young children.
The Chinese version of the Outcome Expectations for Exercise scale: validation study.

PubMed

Lee, Ling-Ling; Chiu, Yu-Yun; Ho, Chin-Chih; Wu, Shu-Chen; Watson, Roger

2011-06-01

Estimates of the reliability and validity of the English nine-item Outcome Expectations for Exercise (OEE) scale have been tested and found to be valid for use in various settings, particularly among older people, with good internal consistency and validity. Data on the use of the OEE scale among older Chinese people living in the community and how cultural differences might affect the administration of the OEE scale are limited. To test the validity and reliability of the Chinese version of the Outcome Expectations for Exercise scale among older people. A cross-sectional validation study was designed to test the Chinese version of the OEE scale (OEE-C). Reliability was examined by testing both the internal consistency for the overall scale and the squared multiple correlation coefficient for the single item measure. The validity of the scale was tested on the basis of both a traditional psychometric test and a confirmatory factor analysis using structural equation modelling. The Mokken Scaling Procedure (MSP) was used to investigate if there were any hierarchical, cumulative sets of items in the measure. The OEE-C scale was tested in a group of older people in Taiwan (n=108, mean age=77.1). There was acceptable internal consistency (alpha=.85) and model fit in the scale. Evidence of the validity of the measure was demonstrated by the tests for criterion-related validity and construct validity. There was a statistically significant correlation between exercise outcome expectations and exercise self-efficacy (r=.34, p<.01). An analysis of the Mokken Scaling Procedure found that nine items of the scale were all retained in the analysis and the resulting scale was reliable and statistically significant (p=.0008). The results obtained in the present study provided acceptable levels of reliability and validity evidence for the Chinese Outcome Expectations for Exercise scale when used with older people in Taiwan. 
Future testing of the OEE-C scale needs to be carried out to see whether these results are generalisable to older Chinese people living in urban areas. Copyright © 2010 Elsevier Ltd. All rights reserved.
Is the kidney disease quality of life-36 (KDQOL-36) a valid instrument for Chinese dialysis patients?

PubMed

Chow, Susan Ka Yee; Tam, Bonnie Mee Ling

2014-12-15

The aim of this study is to determine the validity and reliability of the Cantonese Chinese version of the Kidney Disease Quality of Life-36 (KDQOL-36™) questionnaire. The scale has been translated into Cantonese Chinese, but has not been tested among the Cantonese-speaking populations. A total of 110 dialysis patients and 122 renal transplant patients were recruited. The data for the KDQOL-36™ were extracted from the KDQOL-Short Form. The criterion validity and scale equivalence were examined using the KDQOL-Short Form scores as the gold standard. The Hospital Anxiety and Depression scale was used to identify the correlations between depression, anxiety, and quality of life to establish the convergent validity. Discriminant validity was examined using the transplant patients to compare the quality of life of dialysis patients. The Cronbach's alpha coefficient and test-retest were used for estimating reliability. There were very strong positive correlations for the physical and mental component summary between the KDQOL-36™ and KDQOL-Short Form. Despite the strong correlations, the effect size was 0.6 and 0.13 for the physical composite summary and mental composite summary score, respectively. Most of the subscales demonstrated significant moderate correlations with the Hospital Anxiety and Depression Scale, from -0.265 to -0.516. The discriminant validity was confirmed with a significant difference between the dialysis and transplant group patients. A high intraclass correlation of >0.98 was demonstrated in the test-retest. The Cantonese Chinese KDQOL-36™ was reliable. Further testing will be required to determine its validity for the physical health summary scale.
Development and thermal management of 10 kW CW, direct diode laser source

NASA Astrophysics Data System (ADS)

Zhu, Hongbo; Hao, Mingming; Zhang, Jianwei; Ji, Wenyu; Lin, Xingchen; Zhang, Jinsheng; Ning, Yongqiang

2016-01-01

We report on the development of a direct diode laser source with high power and high reliability. The laser source was realized by the polarization and wavelength combination of four diode laser stacks. At an operating current of 122 A, the source was capable of producing 10,120 W output while maintaining 46% electro-optical conversion efficiency. The maximum temperature on the lens was decreased from 442.2 K to 320 K by utilizing an efficient thermal dissipation structure, and the corresponding maximum von Mises stress was reduced from 75.4 MPa to 14 MPa. In addition, a reliability test demonstrated that the laser source is reliable and has potential in laser cladding and heat treatment applications.
Development Status of the CECE Cryogenic Deep Throttling Demonstrator Engine

NASA Technical Reports Server (NTRS)

2008-01-01

As one of the first technology development programs awarded by NASA under the U.S. Space Exploration Policy (USSEP), the Pratt & Whitney Rocketdyne (PWR) Deep Throttling, Common Extensible Cryogenic Engine (CECE) program was selected by NASA in November 2004 to begin technology development and demonstration toward a deep throttling, cryogenic engine supporting ongoing trade studies for NASA's Lunar Lander descent stage. The CECE program leverages the maturity and previous investment of a flight-proven hydrogen/oxygen expander cycle engine, the PWR RL10, to develop and demonstrate an unprecedented combination of reliability, safety, durability, throttlability, and restart capabilities in a high-energy, cryogenic engine. The testbed selected for the deep throttling demonstration phases of this program was a minimally modified RL10 engine, allowing for maximum current production engine commonality and extensibility with minimum program cost. Two series of demonstrator engine tests, the first in April-May 2006 and the second in March-April 2007, have demonstrated in excess of 10:1 throttling of the hydrogen/oxygen expander cycle engine. Both test series have explored a combustion instability ("chug") environment at low throttled power levels. These tests have provided an early demonstration of an enabling cryogenic propulsion concept with invaluable system-level technology data acquisition toward design and development risk mitigation for future CECE Demonstrator engine tests.
Reliability and validity of Short Form 36 Version 2 to measure health perceptions in a sub-group of individuals with fatigue.

PubMed

Davenport, Todd E; Stevens, Staci R; Baroni, Katie; Van Ness, J Mark; Snell, Christopher R

2011-01-01

To determine the validity and reliability of Short Form 36 Version 2 (SF36v2) in sub-groups of individuals with fatigue. Thirty subjects participated in this study, including n = 16 subjects who met case definition criteria for chronic fatigue syndrome (CFS) and n = 14 non-disabled sedentary matched control subjects. SF36v2 and Multidimensional Fatigue Inventory (MFI-20) were administered before two maximal cardiopulmonary exercise tests (CPETs) administered 24 h apart and an open-ended recovery questionnaire was administered 7 days after CPET challenge. The main outcome measures were self-reported time to recover to pre-challenge functional and symptom status, frequency of post-exertional symptoms and SF36v2 sub-scale scores. Individuals with CFS demonstrated significantly lower SF36v2 and MFI-20 sub-scale scores prior to CPET. Between-group differences remained significant post-CPET, however, there were no significant group by test interaction effects. Subjects with CFS reported significantly more total symptoms (p < 0.001), as well as reports of fatigue (p < 0.001), neuroendocrine (p < 0.001), immune (p < 0.01), pain (p < 0.01) and sleep disturbance (p < 0.01) symptoms than control subjects as a result of CPET. Many symptom counts demonstrated significant relationships with SF36v2 sub-scale scores (p < 0.05). SF36v2 and MFI-20 sub-scale scores demonstrated significant correlations (p < 0.05). Various SF36v2 sub-scale scores demonstrated significant predictive validity to identify subjects who recovered from CPET challenge within 1 day and 7 days (p < 0.05). Potential floor effects were observed for both questionnaires for individuals with CFS. Various sub-scales of SF36v2 demonstrated adequate reliability and validity for clinical and research applications. Adequacy of sensitivity to change of SF36v2 as a result of a fatiguing stressor should be the subject of additional study.

Temporal overlap of humans and giant lizards (Varanidae; Squamata) in Pleistocene Australia

NASA Astrophysics Data System (ADS)

Price, Gilbert J.; Louys, Julien; Cramb, Jonathan; Feng, Yue-xing; Zhao, Jian-xin; Hocknull, Scott A.; Webb, Gregory E.; Nguyen, Ai Duc; Joannes-Boyau, Renaud

2015-10-01

An obvious but key prerequisite to testing hypotheses concerning the role of humans in the extinction of late Quaternary 'megafauna' is demonstrating that humans and the extinct taxa overlapped, both temporally and spatially. In many regions, a paucity of reliably dated fossil occurrences of megafauna makes it challenging, if not impossible, to test many of the leading extinction hypotheses. The giant monitor lizards of Australia are a case in point. Despite commonly being argued to have suffered extinction at the hands of the first human colonisers (who arrived by 50 ka), it has never been reliably demonstrated that giant monitors and humans temporally overlapped in Australia. Here we present the results of an integrated U-Th and 14C dating study of a late Pleistocene fossil deposit that has yielded the youngest dated remains of giant monitor lizards in Australia. The site, Colosseum Chamber, is a cave deposit in the Mt Etna region, central eastern Australia. Sixteen new dates were generated and demonstrate that the bulk of the material in the deposit accumulated since ca. 50 ka. The new monitor fossil is, minimally, 30 ky younger than the previous youngest reliably dated record for giant lizards in Australia and for the first time, demonstrates that on a continental scale, humans and giant lizards overlapped in time. The new record brings the existing geochronological dataset for Australian giant monitor lizards to seven dated occurrences. With such sparse data, we are hesitant to argue that our new date represents the time of their extinction from the continent. Rather, we suspect that future fossil collecting will yield new samples both older and younger than 50 ka. Nevertheless, we unequivocally demonstrate that humans and giant monitor lizards overlapped temporally in Australia, and thus, humans can only now be considered potential drivers for their extinction.
Development and validation of a VISA tendinopathy questionnaire for greater trochanteric pain syndrome, the VISA-G.

PubMed

Fearon, A M; Ganderton, C; Scarvell, J M; Smith, P N; Neeman, T; Nash, C; Cook, J L

2015-12-01

Greater trochanteric pain syndrome (GTPS) is common, resulting in significant pain and disability. There is no condition specific outcome score to evaluate the degree of severity of disability associated with GTPS in patients with this condition. To develop a reliable and valid outcome measurement capable of evaluating the severity of disability associated with GTPS. A phenomenological framework using in-depth semi-structured interviews of patients and medical experts, and focus groups of physiotherapists was used in the item generation. Item and format clarification was undertaken via piloting. Multivariate analysis provided the basis for item reduction. The resultant VISA-G was tested for reliability with the intraclass correlation coefficient (ICC), internal consistency (Cronbach's Alpha), and construct validity (correlation co-efficient) on 52 naïve participants with GTPS and 31 asymptomatic participants. The resultant outcome measurement tool is consistent in style with existing tendinopathy outcome measurement tools, namely the suite of VISA scores. The VISA-G was found to have a test-retest reliability of ICC2,1 (95% CI) of 0.827 (0.638-0.923). Internal consistency was high with a Cronbach's Alpha of 0.809. Construct validity was demonstrated: the VISA-G measures different constructs than tools previously used in assessing GTPS, the Harris Hip Score and the Oswestry Disability Index (Spearman Rho: 0.020 and 0.0205 respectively). The VISA-G did not demonstrate any floor or ceiling effect in symptomatic participants. The VISA-G is a reliable and valid score for measuring the severity of disability associated with GTPS. Copyright © 2015 Elsevier Ltd. All rights reserved.
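The ICC2,1 named in this abstract is the two-way random-effects, absolute-agreement, single-measurement intraclass correlation. A minimal sketch of the standard mean-squares formula, assuming a complete table with one row per participant and one column per test occasion (the function name and the example data are hypothetical, and no confidence interval is computed):

```python
def icc_2_1(table):
    """ICC(2,1) from a complete subjects-by-occasions table of scores."""
    n = len(table)       # subjects
    k = len(table[0])    # occasions or raters
    grand = sum(sum(row) for row in table) / (n * k)
    row_means = [sum(row) / k for row in table]
    col_means = [sum(col) / n for col in zip(*table)]

    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for row in table for x in row)
    ss_error = ss_total - ss_rows - ss_cols

    ms_rows = ss_rows / (n - 1)
    ms_cols = ss_cols / (k - 1)
    ms_error = ss_error / ((n - 1) * (k - 1))

    return (ms_rows - ms_error) / (
        ms_rows + (k - 1) * ms_error + k * (ms_cols - ms_error) / n
    )
```

Because this form measures absolute agreement, a constant shift between occasions lowers the coefficient: `icc_2_1([[1, 2], [2, 3], [3, 4]])` is about 0.67 even though the ranking of participants is identical on both occasions.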
Development and psychometric evaluation of a cardiovascular risk and disease management knowledge assessment tool.

PubMed

Rosneck, James S; Hughes, Joel; Gunstad, John; Josephson, Richard; Noe, Donald A; Waechter, Donna

2014-01-01

This article describes the systematic construction and psychometric analysis of a knowledge assessment instrument for phase II cardiac rehabilitation (CR) patients measuring risk modification disease management knowledge and behavioral outcomes derived from national standards relevant to secondary prevention and management of cardiovascular disease. First, using adult curriculum based on disease-specific learning outcomes and competencies, a systematic test item development process was completed by clinical staff. Second, a panel of educational and clinical experts used an iterative process to identify test content domain and arrive at consensus in selecting items meeting criteria. Third, the resulting 31-question instrument, the Cardiac Knowledge Assessment Tool (CKAT), was piloted in CR patients to ensure use of application. Validity and reliability analyses were performed on 3638 adults before test administrations with additional focused analyses on 1999 individuals completing both pretreatment and posttreatment administrations within 6 months. Evidence of CKAT content validity was substantiated, with 85% agreement among content experts. Evidence of construct validity was demonstrated via factor analysis identifying key underlying factors. Estimates of internal consistency, for example, Cronbach's α = .852 and Spearman-Brown split-half reliability = 0.817 on pretesting, support test reliability. Item analysis, using point biserial correlation, measured relationships between performance on single items and total score (P < .01). Analyses using item difficulty and item discrimination indices further verified item stability and validity of the CKAT. A knowledge instrument specifically designed for an adult CR population was systematically developed and tested in a large representative patient population, satisfying psychometric parameters, including validity and reliability.
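The Spearman-Brown split-half figure reported above comes from stepping up the correlation between two half-tests to the full test length, r_full = 2r / (1 + r). A minimal sketch of that formula and its inverse (the function names are hypothetical; the abstract does not report the half-test correlation itself):

```python
def spearman_brown(half_correlation):
    """Full-length reliability predicted from the correlation between two half-tests."""
    return 2 * half_correlation / (1 + half_correlation)

def implied_half_correlation(full_reliability):
    """Half-test correlation that would produce the given Spearman-Brown reliability."""
    return full_reliability / (2 - full_reliability)
```

Inverting the reported value, `implied_half_correlation(0.817)` is roughly 0.69, so the two halves of the CKAT would have correlated at about that level if the standard two-half formula was used.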
Simple shoulder test and Oxford Shoulder Score: Persian translation and cross-cultural validation.

PubMed

Naghdi, Soofia; Nakhostin Ansari, Noureddin; Rustaie, Nilufar; Akbari, Mohammad; Ebadi, Safoora; Senobari, Maryam; Hasson, Scott

2015-12-01

To translate, culturally adapt, and validate the simple shoulder test (SST) and Oxford Shoulder Score (OSS) into Persian language using a cross-sectional and prospective cohort design. A standard forward and backward translation was followed to culturally adapt the SST and the OSS into Persian language. Psychometric properties of floor and ceiling effects, construct convergent validity, discriminant validity, internal consistency reliability, test-retest reliability, standard error of the measurement (SEM), smallest detectable change (SDC), and factor structure were determined. One hundred patients with shoulder disorders and 50 healthy subjects participated in the study. The PSST and the POSS showed no missing responses. No floor or ceiling effects were observed. Both the PSST and POSS detected differences between patients and healthy subjects supporting their discriminant validity. Construct convergent validity was confirmed by a very good correlation between the PSST and POSS (r = 0.68). There was high internal consistency for both the PSST (α = 0.73) and the POSS (α = 0.91 and 0.92). Test-retest reliability with 1-week interval was excellent (ICCagreement = 0.94 for PSST and 0.90 for POSS). Factor analyses demonstrated a three-factor solution for the PSST (49.7 % of variance) and a two-factor solution for the POSS (61.6 % of variance). The SEM/SDC was satisfactory for PSST (5.5/15.3) and POSS (6.8/18.8). The PSST and POSS are valid and reliable outcome measures for assessing functional limitations in Persian-speaking patients with shoulder disorders.
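The relationship between the SEM and SDC values reported above can be sketched as follows. This assumes the commonly used formulas SEM = SD × √(1 − ICC) and SDC = 1.96 × √2 × SEM; the abstract does not state which formulas the authors applied, and the function names are illustrative.

```python
import math


def standard_error_of_measurement(sd, icc):
    """SEM from the between-subject SD and a reliability coefficient (assumed formula)."""
    return sd * math.sqrt(1 - icc)


def smallest_detectable_change(sem, z=1.96):
    """SDC at the 95% level for a difference between two measurements (assumed formula)."""
    return z * math.sqrt(2) * sem


sdc_psst = smallest_detectable_change(5.5)  # reported SEM for the PSST
sdc_poss = smallest_detectable_change(6.8)  # reported SEM for the POSS
```

Under these assumptions the sketch gives roughly 15.2 and 18.8, close to the reported 15.3 and 18.8; the small gap for the PSST would be consistent with the SEM having been rounded before publication.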
Assessment of isometric muscle strength and rate of torque development with hand-held dynamometry: Test-retest reliability and relationship with gait velocity after stroke.

PubMed

Mentiplay, Benjamin F; Tan, Dawn; Williams, Gavin; Adair, Brooke; Pua, Yong-Hao; Bower, Kelly J; Clark, Ross A

2018-04-27

Isometric rate of torque development examines how quickly force can be exerted and may resemble everyday task demands more closely than isometric strength. Rate of torque development may provide further insight into the relationship between muscle function and gait following stroke. Aims of this study were to examine the test-retest reliability of hand-held dynamometry to measure isometric rate of torque development following stroke, to examine associations between strength and rate of torque development, and to compare the relationships of strength and rate of torque development to gait velocity. Sixty-three post-stroke adults participated (60 years, 34 male). Gait velocity was assessed using the fast-paced 10 m walk test. Isometric strength and rate of torque development of seven lower-limb muscle groups were assessed with hand-held dynamometry. Intraclass correlation coefficients were calculated for reliability and Spearman's rho correlations were calculated for associations. Regression analyses using partial F-tests were used to compare strength and rate of torque development in their relationship with gait velocity. Good to excellent reliability was shown for strength and rate of torque development (0.82-0.97). Strong associations were found between strength and rate of torque development (0.71-0.94). Despite high correlations between strength and rate of torque development, rate of torque development failed to provide significant value to regression models that already contained strength. Assessment of isometric rate of torque development with hand-held dynamometry is reliable following stroke, however isometric strength demonstrated greater relationships with gait velocity. Further research should examine the relationship between dynamic measures of muscle strength/torque and gait after stroke. Copyright © 2018 Elsevier Ltd. All rights reserved.
Reliability and validity of a smartphone pulse rate application for the assessment of resting and elevated pulse rate.

PubMed

Mitchell, Katy; Graff, Megan; Hedt, Corbin; Simmons, James

2016-08-01

Purpose/hypothesis: This study was designed to investigate the test-retest reliability, concurrent validity, and the standard error of measurement (SEm) of a pulse rate assessment application (Azumio®'s Instant Heart Rate) on both Android® and iOS® (iPhone operating system) smartphones as compared to a FT7 Polar® Heart Rate monitor. Number of subjects: 111. Resting (sitting) pulse rate was assessed twice and then the participants were asked to complete a 1-min standing step test and then immediately re-assessed. The smartphone assessors were blinded to their measurements. Test-retest reliability (intraclass correlation coefficient [ICC 2,1] and 95% confidence interval) for the three tools at rest (time 1/time 2): iOS® (0.76 [0.67-0.83]); Polar® (0.84 [0.78-0.89]); and Android® (0.82 [0.75-0.88]). Concurrent validity at rest time 2 (ICC 2,1) with the Polar® device: iOS® (0.92 [0.88-0.94]) and Android® (0.95 [0.92-0.96]). Concurrent validity post-exercise (time 3) (ICC) with the Polar® device: iOS® (0.90 [0.86-0.93]) and Android® (0.94 [0.91-0.96]). The SEm values for the three devices at rest: iOS® (5.77 beats per minute [BPM]), Polar® (4.56 BPM) and Android® (4.96 BPM). The Android®, iOS®, and Polar® devices showed acceptable test-retest reliability at rest and post-exercise. Both the smartphone platforms demonstrated concurrent validity with the Polar® at rest and post-exercise. The Azumio® Instant Heart Rate application when used by either platform appears to be a reliable and valid tool to assess pulse rate in healthy individuals.
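For readers unfamiliar with the ICC(2,1) statistic reported here, the following is a minimal sketch of the two-way random-effects, absolute-agreement, single-measurement form (Shrout and Fleiss). The function name and the toy ratings are hypothetical and are not data from the study.

```python
def icc_2_1(ratings):
    """ICC(2,1) for n subjects (rows) each measured k times (columns)."""
    n, k = len(ratings), len(ratings[0])
    grand = sum(sum(row) for row in ratings) / (n * k)
    row_means = [sum(row) / k for row in ratings]
    col_means = [sum(row[j] for row in ratings) / n for j in range(k)]

    # Two-way ANOVA sums of squares: subjects, measurements, residual.
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for row in ratings for x in row)
    ss_err = ss_total - ss_rows - ss_cols

    msr = ss_rows / (n - 1)
    msc = ss_cols / (k - 1)
    mse = ss_err / ((n - 1) * (k - 1))
    return (msr - mse) / (msr + (k - 1) * mse + k * (msc - mse) / n)


# Hypothetical pulse readings (BPM): three people, two sessions,
# with the second session reading exactly 1 BPM higher each time.
offset_example = icc_2_1([[60, 61], [70, 71], [80, 81]])
```

Because ICC(2,1) measures absolute agreement, a constant offset between sessions lowers the coefficient even when the ranking of subjects is perfectly preserved.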
Adaptation, translation and reliability of the Australian 'Juniors Enjoying Cricket Safely' injury risk perception questionnaire for Sri Lanka.

PubMed

Gamage, Prasanna J; Fortington, Lauren V; Finch, Caroline F

2018-01-01

Cricket is a very popular sport in Sri Lanka. In this setting there has been limited research; specifically, there is little knowledge of cricket injuries. To support future research possibilities, the aim of this study was to cross-culturally adapt, translate and test the reliability of an Australian-developed questionnaire for the Sri Lankan context. The Australian 'Juniors Enjoying Cricket Safely' (JECS-Aus) injury risk perception questionnaire was cross-culturally adapted to suit the Sri Lankan context and subsequently translated into the two main languages (Sinhala and Tamil) based on standard forward-back translation. The translated questionnaires were examined for content validity by two language schoolteachers. The questionnaires were completed twice, 2 weeks apart, by two groups of school cricketers (males) aged 11-15 years (Sinhala (n=24), Tamil (n=30)) to assess reliability. Test-retest scores were evaluated for agreement. Where responses were <100% agreement, Cohen's kappa (κ) statistics were calculated. Questions with moderate-to-poor test-retest reliability (κ<0.6) were reconsidered for modification. Both the Sinhala and Tamil questionnaires had 100% agreement for questions on demographic data, and 88%-100% agreement for questions on participation in cricket and injury history. Of the injury risk perception questions, 72% (Sinhala) and 90% (Tamil) questions showed a substantial (κ=0.61-0.8) and almost perfect (κ=0.81-1.0) test-retest agreement. The adapted and translated JECS-SL questionnaire demonstrated strong reliability. This is the first study to adapt the JECS-Aus questionnaire for use in a different population, providing an outcome measure for assessing injury risk perceptions in Sri Lankan junior cricketers.
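The agreement statistic used above, Cohen's kappa, can be sketched for a single question answered on two occasions. The function name and the toy responses are hypothetical.

```python
from collections import Counter


def cohens_kappa(first, second):
    """Cohen's kappa for paired categorical responses from two administrations."""
    n = len(first)
    # Observed proportion of identical answers.
    p_observed = sum(a == b for a, b in zip(first, second)) / n
    # Agreement expected by chance from the marginal frequencies.
    c1, c2 = Counter(first), Counter(second)
    p_expected = sum(c1[c] * c2[c] for c in set(first) | set(second)) / n ** 2
    return (p_observed - p_expected) / (1 - p_expected)


# Hypothetical yes/no answers from four respondents, two weeks apart.
kappa = cohens_kappa(["y", "y", "n", "n"], ["y", "n", "n", "n"])
```

The denominator is zero when every respondent gives the same single answer on both occasions, which is one practical reason to compute kappa only where agreement is below 100%, as the study did.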
The behavioral regulation in sport questionnaire (BRSQ): instrument development and initial validity evidence.

PubMed

Lonsdale, Chris; Hodge, Ken; Rose, Elaine A

2008-06-01

The purpose of the four studies described in this article was to develop and test a new measure of competitive sport participants' intrinsic motivation, extrinsic motivation, and amotivation (self-determination theory; Deci & Ryan, 1985). The items for the new measure, named the Behavioral Regulation in Sport Questionnaire (BRSQ), were constructed using interviews, expert review, and pilot testing. Analyses supported the internal consistency, test-retest reliability, and factorial validity of the BRSQ scores. Nomological validity evidence was also supportive, as BRSQ subscale scores were correlated in the expected pattern with scores derived from measures of motivational consequences. When directly compared with scores derived from the Sport Motivation Scale (SMS; Pelletier, Fortier, Vallerand, Tuson, & Blais, 1995) and a revised version of that questionnaire (SMS-6; Mallett, Kawabata, Newcombe, Otero-Forero, & Jackson, 2007), BRSQ scores demonstrated equal or superior reliability and factorial validity as well as better nomological validity.
Security-Enhanced Autonomous Network Management

NASA Technical Reports Server (NTRS)

Zeng, Hui

2015-01-01

Ensuring reliable communication in next-generation space networks requires a novel network management system to support greater levels of autonomy and greater awareness of the environment and assets. Intelligent Automation, Inc., has developed a security-enhanced autonomous network management (SEANM) approach for space networks through cross-layer negotiation and network monitoring, analysis, and adaptation. The underlying technology is bundle-based delay/disruption-tolerant networking (DTN). The SEANM scheme allows a system to adaptively reconfigure its network elements based on awareness of network conditions, policies, and mission requirements. Although SEANM is generically applicable to any radio network, for validation purposes it has been prototyped and evaluated on two specific networks: a commercial off-the-shelf hardware test-bed using Institute of Electrical and Electronics Engineers (IEEE) 802.11 Wi-Fi devices and a military hardware test-bed using AN/PRC-154 Rifleman Radio platforms. Testing has demonstrated that SEANM provides autonomous network management resulting in reliable communications in delay/disruption-prone environments.
Vacuum decay container/closure integrity testing technology. Part 2. Comparison to dye ingress tests.

PubMed

Wolf, Heinz; Stauffer, Tony; Chen, Shu-Chen Y; Lee, Yoojin; Forster, Ronald; Ludzinski, Miron; Kamat, Madhav; Mulhall, Brian; Guazzo, Dana Morton

2009-01-01

Part 1 of this series demonstrated that a container closure integrity test performed according to ASTM F2338-09 Standard Test Method for Nondestructive Detection of Leaks in Packages by Vacuum Decay Method using a VeriPac 325/LV vacuum decay leak tester by Packaging Technologies & Inspection, LLC (PTI) is capable of detecting leaks ≥ 5.0 µm (nominal diameter) in rigid, nonporous package systems, such as prefilled glass syringes. The current study compared USP, Ph.Eur. and ISO dye ingress integrity test methods to PTI's vacuum decay technology for the detection of these same 5-, 10-, and 15-µm laser-drilled hole defects in 1-mL glass prefilled syringes. The study was performed at three test sites using several inspectors and a variety of inspection conditions. No standard dye ingress method was found to reliably identify all holed syringes. Modifications to these standard dye tests' challenge conditions increased the potential for dye ingress, and adjustments to the visual inspection environment improved dye ingress detection. However, the risk of false positive test results with dye ingress tests remained. In contrast, the nondestructive vacuum decay leak test method reliably identified syringes with holes ≥ 5.0 µm.
Investigating the Intersession Reliability of Dynamic Brain-State Properties.

PubMed

Smith, Derek M; Zhao, Yrian; Keilholz, Shella D; Schumacher, Eric H

2018-06-01

Dynamic functional connectivity metrics have much to offer to the neuroscience of individual differences of cognition. Yet, despite the recent expansion in dynamic connectivity research, limited resources have been devoted to the study of the reliability of these connectivity measures. To address this, resting-state functional magnetic resonance imaging data from 100 Human Connectome Project subjects were compared across 2 scan days. Brain states (i.e., patterns of coactivity across regions) were identified by classifying each time frame using k-means clustering. This was done with and without global signal regression (GSR). Multiple gauges of reliability indicated consistency in the brain-state properties across days, and GSR attenuated the reliability of the brain states. Changes in the brain-state properties across the course of the scan were investigated as well. The results demonstrate that summary metrics describing the clustering of individual time frames have adequate test/retest reliability, and thus, these patterns of brain activation may hold promise for individual-difference research.
Design and test of the 172K fluidic rudder

NASA Technical Reports Server (NTRS)

Belsterling, C. A.

1978-01-01

Progress in the development of concepts for control of aircraft without moving parts or a separate source of power is described. The design and wind tunnel tests of a full scale fluidic rudder for a Cessna 172K aircraft, intended for subsequent flight tests were documented. The 172K fluidic rudder was designed to provide a control force equivalent to 3.3 degrees of deflection of the conventional rudder. In spite of an extremely thin airfoil, cascaded fluidic amplifiers were built to fit, with the capacity for generating the required level of control force. Wind tunnel tests demonstrated that the principles of lift control using ram air power are sound and reliable under all flight conditions. The tests also demonstrated that the performance of the 172K fluidic rudder is not acceptable for flight tests until the design of the scoop is modified to prevent interference with the lift control phenomenon.
Validation of an iPad activity to measure preschool children's food and physical activity knowledge and preferences.

PubMed

Wiseman, Nicola; Harris, Neil; Downes, Martin

2017-02-01

Preschool children's knowledge of, and preference for, food and physical activity play an important role in the development of lifestyle behaviors throughout childhood. Valid and reliable instruments that are interactive and appealing to preschool children are needed to obtain quality information in a way that actively engages children and encourages willing participation. The purpose of the current research is to assess the reliability and validity of an adapted computerized (iPad) version of the photo-pair food and exercise questionnaire (PPFEQ). The adaptation of the PPFEQ involved generating the questionnaire as an iPad-based tool, updating the photo-pairs within the questionnaire and testing for validity and reliability. This involved four phases of investigation to assess test-retest reliability, internal consistency, sensitivity to change and percent agreement of the questionnaire. The adaptation of the PPFEQ resulted in an 18-item questionnaire, titled the preschool food and play questionnaire (Pre-FPQ). The Pre-FPQ demonstrated acceptable reliability and sensitivity to change. Test-retest reliability and internal consistency improved with age; however, it was evident that the tool was not suitable for children younger than 4 years of age. Children encounter a dynamic world that shapes their knowledge, preferences, choices and behaviors. The Pre-FPQ is an innovative tool to measure preschool children's knowledge of and preference for food and physical activity. The questionnaire offers the advantage of being presented in a well-received modality for preschool children as well as being easy and inexpensive to administer. This new tool is likely to be useful for the assessment of the effectiveness of healthy lifestyle programs implemented in the childcare setting. Future work is needed to refine and improve measures of physical activity preference in preschool children.
The reliability and validity of the Turkish version of Fullerton Advanced Balance (FAB-T) scale.

PubMed

Iyigun, Gozde; Kirmizigil, Berkiye; Angin, Ender; Oksuz, Sevim; Can, Filiz; Eker, Levent; Rose, Debra J

2018-06-04

The aim of this study was to evaluate the reliability and validity of the Turkish version of the FAB (FAB-T) scale in older Turkish adults. The reliability and validity of the scale were tested on 200 community-dwelling older adults. The FAB-T scale was scored by different physiotherapists on different days to evaluate inter-rater and intra-rater reliability. The Berg Balance Scale (BBS) was used for the evaluation of convergent validity, and the content validity of the FAB-T scale was investigated. The FAB-T scale showed very high inter- and intra-rater reliability. For inter-rater agreement, the ICC values for the individual test items and total score were 0.92 (95% CI; 0.90-0.94) and 0.96 (95% CI; 0.95-0.97), respectively. For intra-rater agreement, the ICC values for the individual test items and total score were 0.93 (95% CI; 0.91-0.95) and 0.96 (95% CI; 0.95-0.97), respectively. There was a good agreement between the FAB-T and BBS scales. A high correlation was found between the BBS and FAB-T scales [rho = 0.70 (95% CI; 0.62-0.76)], indicating good convergent validity. Considering the content validity of the FAB-T scale, no floor (floor score: 0%) or ceiling (ceiling score: 6.5%) effect was detected. The FAB-T scale was successfully translated from the original English version (FAB) and demonstrated strong psychometric features. It was found that the FAB-T scale has very high inter-rater and intra-rater reliability. Considering the convergent validity, the scale has high correlation with the BBS. The FAB-T has no floor and ceiling effect. Copyright © 2018 Elsevier B.V. All rights reserved.
The Vocal Cord Dysfunction Questionnaire: Validity and Reliability of the Persian Version.

PubMed

Ghaemi, Hamide; Khoddami, Seyyedeh Maryam; Soleymani, Zahra; Zandieh, Fariborz; Jalaie, Shohreh; Ahanchian, Hamid; Khadivi, Ehsan

2017-12-25

The aim of this study was to develop, validate, and assess the reliability of the Persian version of Vocal Cord Dysfunction Questionnaire (VCDQ-P). The study design was cross-sectional or cultural survey. Forty-four patients with vocal fold dysfunction (VFD) and 40 healthy volunteers were recruited for the study. To assess the content validity, the prefinal questions were given to 15 experts to comment on their essentiality. Ten patients with VFD rated the importance of VCDQ-P in detecting face validity. Eighteen of the patients with VFD completed the VCDQ 1 week later for test-retest reliability. To detect absolute reliability, standard error of measurement and smallest detected change were calculated. Concurrent validity was assessed by completing the Persian Chronic Obstructive Pulmonary Disease (COPD) Assessment Test (CAT) by 34 patients with VFD. Discriminant validity was measured from 34 participants. The VCDQ was further validated by administering the questionnaire to 40 healthy volunteers. Validation of the VCDQ as a treatment outcome tool was conducted in 18 patients with VFD using pre- and posttreatment scores. The internal consistency was confirmed (Cronbach α = 0.78). The test-retest reliability was excellent (intraclass correlation coefficient = 0.97). The standard error of measurement and smallest detected change values were acceptable (0.39 and 1.08, respectively). There was a significant correlation between the VCDQ-P and the CAT total scores (P < 0.05). Discriminative validity was significantly different. The VCDQ scores in patients with VFD before and after treatment were significantly different (P < 0.001). The VCDQ was cross-culturally adapted to Persian and demonstrated to be a valid and reliable self-administered questionnaire in the Persian-speaking population. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Validity and reliability of Optojump photoelectric cells for estimating vertical jump height.

PubMed

Glatthorn, Julia F; Gouge, Sylvain; Nussbaumer, Silvio; Stauffacher, Simone; Impellizzeri, Franco M; Maffiuletti, Nicola A

2011-02-01

Vertical jump is one of the most prevalent acts performed in several sport activities. It is therefore important to ensure that the measurements of vertical jump height made as a part of research or athlete support work have adequate validity and reliability. The aim of this study was to evaluate concurrent validity and reliability of the Optojump photocell system (Microgate, Bolzano, Italy) with force plate measurements for estimating vertical jump height. Twenty subjects were asked to perform maximal squat jumps and countermovement jumps, and flight time-derived jump heights obtained by the force plate were compared with those provided by Optojump, to examine its concurrent (criterion-related) validity (study 1). Twenty other subjects completed the same jump series on 2 different occasions (separated by 1 week), and jump heights of session 1 were compared with session 2, to investigate test-retest reliability of the Optojump system (study 2). Intraclass correlation coefficients (ICCs) for validity were very high (0.997-0.998), even if a systematic difference was consistently observed between force plate and Optojump (-1.06 cm; p < 0.001). Test-retest reliability of the Optojump system was excellent, with ICCs ranging from 0.982 to 0.989, low coefficients of variation (2.7%), and low random errors (±2.81 cm). The Optojump photocell system demonstrated strong concurrent validity and excellent test-retest reliability for the estimation of vertical jump height. We propose the following equation that allows force plate and Optojump results to be used interchangeably: force plate jump height (cm) = 1.02 × Optojump jump height + 0.29. In conclusion, the use of Optojump photoelectric cells is legitimate for field-based assessments of vertical jump height.
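The conversion equation proposed in this abstract can be written as a one-line function. The coefficients 1.02 and 0.29 come from the abstract; the function name and the example input are illustrative.

```python
def force_plate_height_cm(optojump_height_cm):
    """Estimate force plate jump height (cm) from an Optojump reading (cm)."""
    return 1.02 * optojump_height_cm + 0.29


# Hypothetical example: an Optojump reading of 30.0 cm.
estimated = force_plate_height_cm(30.0)
```

For a 30.0 cm Optojump reading the equation gives about 30.89 cm, reflecting the direction of the systematic difference the authors observed between the two systems.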
Behavioral and cognitive outcomes for clinical trials in children with neurofibromatosis type 1.

PubMed

van der Vaart, Thijs; Rietman, André B; Plasschaert, Ellen; Legius, Eric; Elgersma, Ype; Moll, Henriëtte A

2016-01-12

To evaluate the appropriateness of cognitive and behavioral outcome measures in clinical trials in neurofibromatosis type 1 (NF1) by analyzing the degree of deficits compared to reference groups, test-retest reliability, and how scores correlate between outcome measures. Data were analyzed from the Simvastatin for cognitive deficits and behavioral problems in patients with neurofibromatosis type 1 (NF1-SIMCODA) trial, a randomized placebo-controlled trial of simvastatin for cognitive deficits and behavioral problems in children with NF1. Outcome measures were compared with age-specific reference groups to identify domains of dysfunction. Pearson r was computed for before and after measurements within the placebo group to assess test-retest reliability. Principal component analysis was used to identify the internal structure in the outcome data. Strongest mean score deviations from the reference groups were observed for full-scale intelligence (-1.1 SD), Rey Complex Figure Test delayed recall (-2.0 SD), attention problems (-1.2 SD), and social problems (-1.1 SD). Long-term test-retest reliability was excellent for Wechsler scales (r > 0.88), but poor to moderate for other neuropsychological tests (r range 0.52-0.81) and Child Behavioral Checklist subscales (r range 0.40-0.79). The correlation structure revealed 2 strong components in the outcome measures, behavior and cognition, with no correlation between these components. Scores on psychosocial quality of life correlate strongly with behavioral problems and less with cognitive deficits. Children with NF1 show distinct deficits in multiple domains. Many outcome measures showed weak test-retest correlations over the 1-year trial period. Cognitive and behavioral outcomes are complementary. This analysis demonstrates the need to include reliable outcome measures on a variety of cognitive and behavioral domains in clinical trials for NF1. © 2015 American Academy of Neurology.
Translation, validity and reliability of the British Sign Language (BSL) version of the EQ-5D-5L.

PubMed

Rogers, Katherine D; Pilling, Mark; Davies, Linda; Belk, Rachel; Nassimi-Green, Catherine; Young, Alys

2016-07-01

To translate the health questionnaire EuroQol EQ-5D-5L into British Sign Language (BSL), to test its reliability with the signing Deaf population of BSL users in the UK and to validate its psychometric properties. The EQ-5D-5L BSL was developed following the international standard for translation required by EuroQol, with additional agreed features appropriate to a visual language. Data collection used an online platform to view the signed (BSL) version of the tests. The psychometric testing included content validity, assessed by interviewing a small sample of Deaf people. Reliability was tested by internal consistency of the items and test-retest, and convergent validity was assessed by determining how well EQ-5D-5L BSL correlates with CORE-10 BSL and CORE-6D BSL. The psychometric properties of the EQ-5D-5L BSL are good, indicating that it can be used to measure health status in the Deaf signing population in the UK. Convergent validity between EQ-5D-5L BSL and CORE-10 BSL and CORE-6D BSL is consistent, demonstrating that the BSL version of EQ-5D-5L is a good measure of the health status of an individual. The test-retest reliability of EQ-5D-5L BSL, for each dimension of health, was shown to have Cohen's kappa values of 0.47-0.61; these were in the range of moderate to good and were therefore acceptable. This is the first time EQ-5D-5L has been translated into a signed language for use with Deaf people and is a significant step forward towards conducting studies of health status and cost-effectiveness in this population.
Fault-tolerant onboard digital information switching and routing for communications satellites

NASA Technical Reports Server (NTRS)

Shalkhauser, Mary Jo; Quintana, Jorge A.; Soni, Nitin J.; Kim, Heechul

1993-01-01

The NASA Lewis Research Center is developing an information-switching processor for future meshed very-small-aperture terminal (VSAT) communications satellites. The information-switching processor will switch and route baseband user data onboard the VSAT satellite to connect thousands of Earth terminals. Fault tolerance is a critical issue in developing information-switching processor circuitry that will provide and maintain reliable communications services. In parallel with the conceptual development of the meshed VSAT satellite network architecture, NASA designed and built a simple test bed for developing and demonstrating baseband switch architectures and fault-tolerance techniques. The meshed VSAT architecture and the switching demonstration test bed are described, and the initial switching architecture and the fault-tolerance techniques that were developed and tested are discussed.
Some practical observations on the accelerated testing of Nickel-Cadmium Cells

NASA Technical Reports Server (NTRS)

Mcdermott, P. P.

1979-01-01

A large scale test of 6.0 Ah Nickel-Cadmium Cells conducted at the Naval Weapons Support Center, Crane, Indiana has demonstrated a methodology for predicting battery life based on failure data from cells cycled in an accelerated mode. After examining eight variables used to accelerate failure, it was determined that temperature and depth of discharge were the most reliable and efficient parameters for use in accelerating failure and for predicting life.

DOE Office of Scientific and Technical Information (OSTI.GOV)

Yashchuk, Valeriy V; Conley, Raymond; Anderson, Erik H.

We discuss the results of SEM and TEM measurements with the BPRML test samples fabricated from a BPRML (WSi2/Si with fundamental layer thickness of 3 nm) with a Dual Beam FIB (focused ion beam)/SEM technique. In particular, we demonstrate that significant information about the metrological reliability of the TEM measurements can be extracted even when the fundamental frequency of the BPRML sample is smaller than the Nyquist frequency of the measurements. The measurements demonstrate a number of problems related to the interpretation of the SEM and TEM data. Note that similar BPRML test samples can be used to characterize x-ray microscopes. Corresponding work with x-ray microscopes is in progress.
Reliability of adherence and competence assessment in cognitive behavioral therapy: influence of clinical experience.

PubMed

Weck, Florian; Hilling, Christine; Schermelleh-Engel, Karin; Rudari, Visar; Stangier, Ulrich

2011-04-01

The use of highly experienced expert judges was suggested for the assessment of therapists' adherence and competence. However, such an approach implies high costs. It can be questioned whether only experts are able to evaluate therapists' adherence and competence reliably. To test this, 4 judges evaluated therapist adherence and competence in 30 randomly selected videotapes of cognitive therapy sessions for depression. Of these, 2 judges had extensive clinical experience (experts), whereas 2 judges did not (novices). We could demonstrate that novices evaluated an aggregated adherence and competence measure with high reliability. However, several adherence and competence aspects were not assessed with satisfactory reliability by novices. Although adherence ratings of experts and novices showed high concordance, the concordance of competence ratings was only moderate. Results revealed that therapists' adherence could be evaluated satisfactorily by trained novices with some restrictions, but not their competence.
Validation and cross-cultural pilot testing of compliance with standard precautions scale: self-administered instrument for clinical nurses.

PubMed

Lam, Simon C

2014-05-01

To perform detailed psychometric testing of the compliance with standard precautions scale (CSPS) in measuring compliance with standard precautions of clinical nurses and to conduct cross-cultural pilot testing and assess the relevance of the CSPS on an international platform. A cross-sectional and correlational design with repeated measures. Nursing students from a local registered nurse training university, nurses from different hospitals in Hong Kong, and experts in an international conference. The psychometric properties of the CSPS were evaluated via internal consistency, 2-week and 3-month test-retest reliability, concurrent validation, and construct validation. The cross-cultural pilot testing and relevance check was examined by experts on infection control from various developed and developing regions. Among 453 participants, 193 were nursing students, 165 were enrolled nurses, and 95 were registered nurses. The results showed that the CSPS had satisfactory reliability (Cronbach α = 0.73; intraclass correlation coefficient, 0.79 for 2-week test-retest and 0.74 for 3-month test-retest) and validity (optimum correlation with criterion measure; r = 0.76, P < .001; satisfactory results on known-group method and hypothesis testing). A total of 19 experts from 16 countries assured that most of the CSPS findings were relevant and globally applicable. The CSPS demonstrated satisfactory results on the basis of the standard international criteria on psychometric testing, which ascertained the reliability and validity of this instrument in measuring the compliance of clinical nurses with standard precautions. The cross-cultural pilot testing further reinforced the instrument's relevance and applicability in most developed and developing regions.
Examining the reliability and validity of a modified version of the International Physical Activity Questionnaire, long form (IPAQ-LF) in Nigeria: a cross-sectional study.

PubMed

Oyeyemi, Adewale L; Bello, Umar M; Philemon, Saratu T; Aliyu, Habeeb N; Majidadi, Rebecca W; Oyeyemi, Adetoyeje Y

2014-12-01

To investigate the reliability and an aspect of validity of a modified version of the long International Physical Activity Questionnaire (Hausa IPAQ-LF) in Nigeria. Cross-sectional study, examining the reliability and construct validity of the Hausa IPAQ-LF compared with anthropometric and biological variables. Metropolitan Maiduguri, the capital city of Borno State in Nigeria. 180 Nigerian adults (50% women) with a mean age of 35.6 (SD=10.3) years, recruited from neighbourhoods with diverse socioeconomic status and walkability. Domains (domestic physical activity (PA), occupational PA, leisure-time PA, active transportation and sitting time) and intensities of PA (vigorous, moderate and walking) were measured with the Hausa IPAQ-LF on two different occasions, 8 days apart. Outcomes for construct validity were measured body mass index (BMI), systolic blood pressure (SBP) and diastolic blood pressure (DBP). The Hausa IPAQ-LF demonstrated good test-retest reliability (intraclass correlation coefficient, ICC>0.75) for total PA (ICC=0.79, 95% CI 0.65 to 0.82), occupational PA (ICC=0.77, 95% CI 0.68 to 0.82), active transportation (ICC=0.82, 95% CI 0.75 to 0.87) and vigorous intensity activities (ICC=0.82, 95% CI 0.76 to 0.87). Reliability was substantially higher for total PA (ICC=0.80), occupational PA (ICC=0.78), leisure-time PA (ICC=0.75) and active transportation (ICC=0.80) in men than in women, but domestic PA (ICC=0.38) and sitting time (ICC=0.71) demonstrated more substantial reliability coefficients in women than in men. For the construct validity, domestic PA was significantly related mainly with SBP (r=-0.27) and DBP (r=-0.17), and leisure-time PA and total PA were significantly related only with SBP (r=-0.16) and BMI (r=-0.29), respectively. Similarly, moderate-intensity PA was mainly related with SBP (r=-0.16, p<0.05) and DBP (r=-0.21, p<0.01), but vigorous-intensity PA was only related with BMI (r=-0.11, p<0.05).
The modified Hausa IPAQ-LF demonstrated sufficient evidence of test-retest reliability and may be valid for assessing context specific PA behaviours of adults in Nigeria. Published by the BMJ Publishing Group Limited.
A Computer-Adaptive Disability Instrument for Lower Extremity Osteoarthritis Research Demonstrated Promising Breadth, Precision and Reliability

PubMed Central

Jette, Alan M.; McDonough, Christine M.; Haley, Stephen M.; Ni, Pengsheng; Olarsch, Sippy; Latham, Nancy; Hambleton, Ronald K.; Felson, David; Kim, Young-jo; Hunter, David

2012-01-01

Objective: To develop and evaluate a prototype measure (OA-DISABILITY-CAT) for osteoarthritis research using Item Response Theory (IRT) and Computer Adaptive Test (CAT) methodologies. Study Design and Setting: We constructed an item bank consisting of 33 activities commonly affected by lower extremity (LE) osteoarthritis. A sample of 323 adults with LE osteoarthritis reported their degree of limitation in performing everyday activities and completed the Health Assessment Questionnaire-II (HAQ-II). We used confirmatory factor analyses to assess scale unidimensionality and IRT methods to calibrate the items and examine the fit of the data. Using CAT simulation analyses, we examined the performance of OA-DISABILITY-CATs of different lengths compared to the full item bank and the HAQ-II. Results: One distinct disability domain was identified. The 10-item OA-DISABILITY-CAT demonstrated a high degree of accuracy compared with the full item bank (r=0.99). The item bank and the HAQ-II scales covered a similar estimated scoring range. In terms of reliability, 95% of OA-DISABILITY reliability estimates were over 0.83 versus 0.60 for the HAQ-II. Except at the highest scores, the 10-item OA-DISABILITY-CAT demonstrated superior precision to the HAQ-II. Conclusion: The prototype OA-DISABILITY-CAT demonstrated promising measurement properties compared to the HAQ-II, and is recommended for use in LE osteoarthritis research. PMID:19216052
Modification and Validation of an Automotive Data Processing Unit, Compressed Video System, and Communications Equipment

DOE Office of Scientific and Technical Information (OSTI.GOV)

Carter, R.J.

1997-04-01

The primary purpose of the "modification and validation of an automotive data processing unit (DPU), compressed video system, and communications equipment" cooperative research and development agreement (CRADA) was to modify and validate both hardware and software, developed by Scientific Atlanta, Incorporated (S-A) for defense applications (e.g., rotary-wing airplanes), for the commercial sector surface transportation domain (i.e., automobiles and trucks). S-A also furnished a state-of-the-art compressed video digital storage and retrieval system (CVDSRS), and off-the-shelf data storage and transmission equipment to support the data acquisition system for crash avoidance research (DASCAR) project conducted by Oak Ridge National Laboratory (ORNL). In turn, S-A received access to hardware and technology related to DASCAR. DASCAR was subsequently removed completely and installation was repeated a number of times to gain an accurate idea of complete installation, operation, and removal of DASCAR. Upon satisfactory completion of the DASCAR construction and preliminary shakedown, ORNL provided NHTSA with an operational demonstration of DASCAR at their East Liberty, OH test facility. The demonstration included an on-the-road demonstration of the entire data acquisition system using NHTSA's test track. In addition, the demonstration also consisted of a briefing, containing the following: ORNL generated a plan for validating the prototype data acquisition system with regard to: removal of DASCAR from an existing vehicle, and installation and calibration in other vehicles; reliability of the sensors and systems; data collection and transmission process (data integrity); impact on the drivability of the vehicle and obtrusiveness of the system to the driver; data analysis procedures; conspicuousness of the vehicle to other drivers; and DASCAR installation and removal training and documentation.
In order to identify any operational problems not captured by the systems testing and evaluation, the validation plan also addressed a short-term pilot research program to manipulate DASCAR under operational conditions using "naive" drivers. The effort exercised the full capabilities of the data acquisition system. ORNL subsequently evaluated and pilot tested the data acquisition system using the validation plan. The plan was implemented in full at the NHTSA East Liberty, OH test facility, and was carried out as a cooperative effort with the Vehicle Research and Test Center staff. ORNL determined the reliability of the sensors and systems by exercising DASCAR. For one vehicle type, ORNL evaluated systems reliability over a continuous period of 30 days with particular attention paid to maintenance of calibration and data integrity.
Using personal qualities assessment to measure the moral orientation and personal qualities of medical students in a non-Western culture.

PubMed

Tsou, Kuo-Inn; Lin, Chaou-Shune; Cho, Shu-Ling; Powis, David; Bore, Miles; Munro, Don; Sze, Daniel Man-Yuen; Wu, Hsi-Chin; Hsieh, Ming-Shium; Lin, Chyi-Her

2013-06-01

How to select candidates with appropriate personal qualities for medical school is an important issue. This study examined the psychometric properties and group differences of the Personal Qualities Assessment (PQA) to test the feasibility of using it as a tool to assess the medical school applicants in a non-Western culture. Seven hundred forty-six medical students in Taiwan completed two psychometric measures: Mojac to assess moral orientation and NACE to assess four aspects of interpersonal relationships. Thirty-one students completed the tests twice to establish test-retest reliability. A subsample of 127 students also completed a measure of the "Big Five" personality traits to examine the construct validity of these scales. Both Mojac and NACE had acceptable internal consistency and test-retest reliability. Conceptually, coherent and significant relationships were observed between test components and between the NACE and Big Five. NACE but not Mojac varied significantly between different sociodemographic groups. Both tests demonstrated acceptable psychometric properties. However, the predictive validity of PQA requires future studies.
Design of high-reliability low-cost amorphous silicon modules for high energy yield

NASA Astrophysics Data System (ADS)

Jansen, Kai W.; Varvar, Anthony; Twesme, Edward; Berens, Troy; Dhere, Neelkanth G.

2008-08-01

For PV modules to fulfill their intended purpose, they must generate sufficient economic return over their lifetime to justify their initial cost. Not only must modules be manufactured at a low cost/Wp with a high energy yield (kWh/kWp), they must also be designed to withstand the significant environmental stresses experienced throughout their 25+ year lifetime. Based on field experience, the most common factors affecting the lifetime energy yield of glass-based amorphous silicon (a-Si) modules have been identified; these include: 1) light-induced degradation; 2) moisture ingress and thin film corrosion; 3) transparent conductive oxide (TCO) delamination; and 4) glass breakage. The current approaches to mitigating the effect of these degradation mechanisms are discussed and the accelerated tests designed to simulate some of the field failures are described. In some cases, novel accelerated tests have been created to facilitate the development of improved manufacturing processes, including a unique test to screen for TCO delamination. Modules using the most reliable designs are tested in high voltage arrays at customer and internal test sites, as well as at independent laboratories. Data from tests at the Florida Solar Energy Center has shown that a-Si tandem modules can demonstrate an energy yield exceeding 1200 kWh/kWp/yr in a subtropical climate. In the same study, the test arrays demonstrated low long-term power loss over two years of data collection, after initial stabilization. The absolute power produced by the test arrays varied seasonally by approximately +/-7%, as expected.
Test-Retest Reliability of a Serious Game for Delirium Screening in the Emergency Department.

PubMed

Tong, Tiffany; Chignell, Mark; Tierney, Mary C; Lee, Jacques S

2016-01-01

Introduction: Cognitive screening in settings such as emergency departments (ED) is frequently carried out using paper-and-pencil tests that require administration by trained staff. These assessments often compete with other clinical duties and thus may not be routinely administered in these busy settings. Literature has shown that cognitive impairments such as dementia and delirium are often missed in older ED patients. Failure to recognize delirium can have devastating consequences including increased mortality (Kakuma et al., 2003). Given the demands on emergency staff, an automated cognitive test to screen for delirium onset could be a valuable tool to support delirium prevention and management. In earlier research we examined the concurrent validity of a serious game, and carried out an initial assessment of its potential as a delirium screening tool (Tong et al., 2016). In this paper, we examine the test-retest reliability of the game, as it is an important criterion in a cognitive test for detecting risk of delirium onset. Objective: To demonstrate the test-retest reliability of the screening tool over time in a clinical sample of older emergency patients. A secondary objective is to assess whether there are practice effects that might make game performance unstable over repeated presentations. Materials and Methods: Adults over the age of 70 were recruited from a hospital ED. Each patient played our serious game in an initial session soon after they arrived in the ED, and in follow up sessions conducted at 8-h intervals (for each participant there were up to five follow up sessions, depending on how long the person stayed in the ED). Results: A total of 114 adults (61 females, 53 males) between the ages of 70 and 104 years (M = 81 years, SD = 7) participated in our study after screening out delirious patients.
We observed a test-retest reliability of the serious game (as assessed by correlation r-values) between 0.5 and 0.8 across adjacent sessions. Conclusion: The game-based assessment for cognitive screening has relatively strong test-retest reliability and little evidence of practice effects among elderly emergency patients, and may be a useful supplement to existing cognitive assessment methods.
The Reliability of Microalloyed Sn-Ag-Cu Solder Interconnections Under Cyclic Thermal and Mechanical Shock Loading

NASA Astrophysics Data System (ADS)

Mattila, Toni T.; Hokka, Jussi; Paulasto-Kröckel, Mervi

2014-11-01

In this study, the performance of three microalloyed Sn-Ag-Cu solder interconnection compositions (Sn-3.1Ag-0.52Cu, Sn-3.0Ag-0.52Cu-0.24Bi, and Sn-1.1Ag-0.52Cu-0.1Ni) was compared under mechanical shock loading (JESD22-B111 standard) and cyclic thermal loading (40 ± 125°C, 42 min cycle) conditions. In the drop tests, the component boards with the low-silver nickel-containing composition (Sn-Ag-Cu-Ni) showed the highest average number of drops-to-failure, while those with the bismuth-containing alloy (Sn-Ag-Cu-Bi) showed the lowest. Results of the thermal cycling tests showed that boards with Sn-Ag-Cu-Bi interconnections performed the best, while those with Sn-Ag-Cu-Ni performed the worst. Sn-Ag-Cu was placed in the middle in both tests. In this paper, we demonstrate that solder strength is an essential reliability factor and that higher strength can be beneficial for thermal cycling reliability but detrimental to drop reliability. We discuss these findings from the perspective of the microstructures and mechanical properties of the three solder interconnection compositions and, based on a comprehensive literature review, investigate how the differences in the solder compositions influence the mechanical properties of the interconnections and discuss how the differences are reflected in the failure mechanisms under both loading conditions.
Psychometric properties of the Swedish PedsQL, Pediatric Quality of Life Inventory 4.0 generic core scales.

PubMed

Petersen, Solveig; Hägglöf, Bruno; Stenlund, Hans; Bergström, Erik

2009-09-01

To study the psychometric performance of the Swedish version of the Pediatric Quality of Life Inventory (PedsQL) 4.0 generic core scales in a general child population in Sweden. PedsQL forms were distributed to 2403 schoolchildren and 888 parents in two different school settings. Reliability and validity were studied for self-reports and proxy reports, full forms and short forms. Confirmatory factor analysis tested the factor structure and multigroup confirmatory factor analysis tested measurement invariance between boys and girls. Test-retest reliability was demonstrated for all scales and internal consistency reliability was shown with alpha values exceeding 0.70 for all scales but one (self-report short form: social functioning). Child-parent agreement was low to moderate. The four-factor structure of the PedsQL and factorial invariance across sex subgroups were confirmed for the self-report forms and for the proxy short form, while model fit indices suggested improvement of several proxy full-form scales. The Swedish PedsQL 4.0 generic core scales are a reliable and valid tool for health-related quality of life (HRQoL) assessment in Swedish child populations. The proxy full form, however, should be used with caution. The study also supports continued use of the PedsQL as a four-factor model, capable of revealing meaningful HRQoL differences between boys and girls.
Validation of the Middlesex Elderly Assessment of Mental State (MEAMS) as a cognitive screening test in patients with acquired brain injury in Turkey.

PubMed

Kutlay, Sehim; Kuçukdeveci, Ayse A; Elhan, Atilla H; Yavuzer, Gunes; Tennant, Alan

2007-02-28

Assessment of cognitive impairment with a valid cognitive screening tool is essential in neurorehabilitation. The aim of this study was to test the reliability and validity of the Turkish-adapted version of the Middlesex Elderly Assessment of Mental State (MEAMS) among acquired brain injury patients in Turkey. Some 155 patients with acquired brain injury admitted for rehabilitation were assessed by the adapted version of MEAMS at admission and discharge. Reliability was tested by internal consistency, intra-class correlation coefficient (ICC) and person separation index; internal construct validity by Rasch analysis; external construct validity by associations with physical and cognitive disability (FIM); and responsiveness by Effect Size. Reliability was found to be good with Cronbach's alpha of 0.82 at both admission and discharge; and likewise an ICC of 0.80. Person separation index was 0.813. Internal construct validity was good by fit of the data to the Rasch model (mean item fit -0.178; SD 1.019). Items were substantially free of differential item functioning. External construct validity was confirmed by expected associations with physical and cognitive disability. Effect size was 0.42 compared with 0.22 for cognitive FIM. The reliability and validity of the Turkish version of MEAMS as a cognitive impairment screening tool in acquired brain injury has been demonstrated.
Between-day reliability of a method for non-invasive estimation of muscle composition.

PubMed

Simunič, Boštjan

2012-08-01

Tensiomyography is a method for valid and non-invasive estimation of skeletal muscle fibre type composition. The validity of selected temporal tensiomyographic measures has been well established recently; there is, however, no evidence regarding the method's between-day reliability. Therefore it is the aim of this paper to establish the between-day repeatability of tensiomyographic measures in three skeletal muscles. For three consecutive days, 10 healthy male volunteers (mean±SD: age 24.6 ± 3.0 years; height 177.9 ± 3.9 cm; weight 72.4 ± 5.2 kg) were examined in a supine position. Four temporal measures (delay, contraction, sustain, and half-relaxation time) and maximal amplitude were extracted from the displacement-time tensiomyogram. A reliability analysis was performed with calculations of bias, random error, coefficient of variation (CV), standard error of measurement, and intra-class correlation coefficient (ICC) with a 95% confidence interval. An analysis of ICC demonstrated excellent agreement (ICC were over 0.94 in 14 out of 15 tested parameters). However, lower CV was observed in half-relaxation time, presumably because of the specifics of the parameter definition itself. These data indicate that for the three muscles tested, tensiomyographic measurements were reproducible across consecutive test days. Furthermore, we indicated the most possible origin of the lowest reliability detected in half-relaxation time.
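Two of the reliability statistics named in this record, the coefficient of variation and the standard error of measurement, can be sketched in a few lines. This is only an illustration under common textbook definitions (per-subject CV averaged across subjects, and SEM = SD × sqrt(1 − ICC)); the abstract does not state which variants the author used, and the function names and numbers below are hypothetical.

```python
from math import sqrt
from statistics import mean, stdev


def within_subject_cv(trials):
    """Mean per-subject coefficient of variation (%) over repeated measurements."""
    return mean(100 * stdev(t) / mean(t) for t in trials)


def sem_from_icc(values, icc):
    """Standard error of measurement: SD of all values times sqrt(1 - ICC)."""
    return stdev(values) * sqrt(1 - icc)


# Hypothetical contraction times (ms) for 3 subjects on 3 consecutive days
days = [
    [22.1, 22.8, 21.9],
    [30.4, 29.7, 30.9],
    [25.0, 25.6, 24.7],
]
cv = within_subject_cv(days)
sem = sem_from_icc([x for subject in days for x in subject], icc=0.94)
print(f"CV = {cv:.1f}%, SEM = {sem:.2f} ms")
```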
Validation of the comprehensive feeding practices questionnaire in parents of preschool children in Brazil.

PubMed

Warkentin, Sarah; Mais, Laís Amaral; Latorre, Maria do Rosário Dias de Oliveira; Carnell, Susan; Taddei, José Augusto de Aguiar Carrazedo

2016-07-19

Recent national surveys in Brazil have demonstrated a decrease in the consumption of traditional food and a parallel increase in the consumption of ultra-processed food, which has contributed to a rise in obesity prevalence in all age groups. Environmental factors, especially familial factors, have a strong influence on the food intake of preschool children, and this has led to the development of psychometric scales to measure parents' feeding practices. The aim of this study was to test the validity of a translated and adapted Comprehensive Feeding Practices Questionnaire in a sample of Brazilian preschool-aged children enrolled in private schools. A transcultural adaptation process was performed in order to develop a modified questionnaire (43 items). After piloting, the questionnaire was sent to parents, along with additional questions about family characteristics. Test-retest reliability was assessed in one of the schools. Factor analysis with oblique rotation was performed. Internal reliability was tested using Cronbach's alpha and correlations between factors, discriminant validity using marker variables of child's food intake, and convergent validity via correlations with parental perceptions of perceived responsibility for feeding and concern about the child's weight were also performed. The final sample consisted of 402 preschool children. Factor analysis resulted in a final questionnaire of 43 items distributed over 6 factors. Cronbach alpha values were adequate (0.74 to 0.88), between-factor correlations were low, and discriminant validity and convergent validity were acceptable. The modified CFPQ demonstrated significant internal reliability in this urban Brazilian sample. Scale validation within different cultures is essential for a more comprehensive understanding of parental feeding practices for preschoolers.
An investigation of the psychometric properties of the Chinese (Cantonese) version of Subjective Index of Physical and Social Outcome (SIPSO).

PubMed

Kwong, Patrick Wh; Ng, Shamay Sm; Ng, Gabriel Yf

2017-11-01

The objectives of this study were 1) to translate and make cultural adaptations to the English version of the SIPSO questionnaire to create a Chinese (Cantonese) version, 2) to evaluate the internal consistency and test-retest reliability of the C-SIPSO questionnaire, and 3) to compare the SIPSO-C scores of stroke survivors with different demographic characteristics to establish the discriminant validity of the questionnaire. Design: Translation of questionnaire, cross sectional study. University-based clinical research laboratory. Subjects: Community-dwelling chronic stroke survivors. Not applicable. Subjective Index of Physical and Social Outcome, Geriatric Depression Scale, 10-metre Walk test. Two bilingual professional translators translated the SIPSO questionnaire independently. An expert panel comprising five registered physiotherapists verified the content validity of the final version (C-SIPSO). C-SIPSO demonstrated good internal consistency (Cronbach's α = 0.83) and excellent test-retest reliability (ICC(3,1) = 0.866) in ninety-two community-dwelling chronic stroke survivors. Stroke survivors who scored higher than 10 on the Geriatric Depression Scale (U = 555.0, P < 0.001) and those with a comfortable walking speed lower than 0.8 m/s (U = 726.5; P = 0.012) scored significantly lower on SIPSO-C. SIPSO-C is a reliable instrument that can be used to measure the level of community integration in community-dwelling stroke survivors in Hong Kong and southern China. Stroke survivors who were at high risk of minor depression and with limited community ambulation ability demonstrated a lower level of community integration as measured with SIPSO-C.
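The test-retest coefficient reported in this record is the Shrout and Fleiss ICC(3,1) form (two-way mixed effects, consistency, single measurement). As a rough sketch of how that coefficient is obtained from a subjects-by-sessions table, assuming the usual ANOVA mean-square formula and using made-up scores rather than study data:

```python
def icc_3_1(data):
    """ICC(3,1) for n subjects (rows) measured on k sessions (columns)."""
    n, k = len(data), len(data[0])
    grand = sum(sum(row) for row in data) / (n * k)
    row_means = [sum(row) / k for row in data]
    col_means = [sum(row[j] for row in data) / n for j in range(k)]
    # Partition the total sum of squares into subjects, sessions, and residual
    ss_total = sum((x - grand) ** 2 for row in data for x in row)
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_err = ss_total - ss_rows - ss_cols
    msr = ss_rows / (n - 1)               # between-subjects mean square
    mse = ss_err / ((n - 1) * (k - 1))    # residual mean square
    return (msr - mse) / (msr + (k - 1) * mse)


# Hypothetical test and retest totals for 5 respondents (not study data)
retest = [[31, 33], [24, 22], [38, 37], [19, 23], [28, 29]]
print(f"ICC(3,1) = {icc_3_1(retest):.3f}")
```

Because ICC(3,1) measures consistency rather than absolute agreement, a constant shift between sessions (for example, every respondent scoring one point higher at retest) does not lower the coefficient.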
Development and validation of a self-efficacy scale for postoperative rehabilitation management of lung cancer patients.

PubMed

Huang, Fei-Fei; Yang, Qing; Han, Xuan Ye; Zhang, Jing-Ping; Lin, Ting

2017-08-01

The purpose of this study was to develop a Self-Efficacy Scale for Rehabilitation Management designed specifically for postoperative lung cancer patients (SESPRM-LC) and to evaluate its psychometric properties. Based on the concept of self-management of chronic disease, items were developed from literature review and semistructured interviews of 10 lung cancer patients and screened by expert consultation and pilot testing. Psychometric evaluation was done with 448 postoperative lung cancer patients recruited from 5 tertiary hospitals in Fuzhou, China, by incorporating classical test theory and item response theory methods. A 6-factor structure was illustrated by exploratory factor analysis and confirmed by confirmatory factor analysis, explaining 60.753% of the total variance. The SESPRM-LC achieved Cronbach's α of 0.694 to 0.893, 2-week test-retest reliability of 0.652 to 0.893, and marginal reliability of 0.565 to 0.934. The predictive and criterion validities were demonstrated by significant association with theoretically supported quality-of-life variables (r = 0.211-0.392, P < .01), and General Perceived Self-efficacy Scale (r = 0.465, P < .01), respectively. Item response theory analysis showed that the SESPRM-LC offers information about a broad range of self-efficacy measures and discriminates well between patients with high and low levels of self-efficacy. We demonstrated initial support for the reliability and validity of the 27-item SESPRM-LC, as a developmentally appropriate instrument for assessing self-efficacy among lung cancer patients during postoperative rehabilitation.
Development and initial validation of the appropriate antibiotic use self-efficacy scale.

PubMed

Hill, Erin M; Watkins, Kaitlin

2018-06-04

While there are various medication self-efficacy scales that exist, none assess self-efficacy for appropriate antibiotic use. The Appropriate Antibiotic Use Self-Efficacy Scale (AAUSES) was developed, pilot tested, and its psychometric properties were examined. Following pilot testing of the scale, a 28-item questionnaire was examined using a sample (n = 289) recruited through the Amazon Mechanical Turk platform. Participants also completed other scales and items, which were used in assessing discriminant, convergent, and criterion-related validity. Test-retest reliability was also examined. After examining the scale and removing items that did not assess appropriate antibiotic use, an exploratory factor analysis was conducted on 13 items from the original scale. Three factors were retained that explained 65.51% of the variance. The scale and its subscales had adequate internal consistency. The scale had excellent test-retest reliability, as well as demonstrated convergent, discriminant, and criterion-related validity. The AAUSES is a valid and reliable scale that assesses three domains of appropriate antibiotic use self-efficacy. The AAUSES may have utility in clinical and research settings in understanding individuals' beliefs about appropriate antibiotic use and related behavioral correlates. Future research is needed to examine the scale's utility in these settings.
Effects of test method and participant musical training on preference ratings of stimuli with different reverberation times.

PubMed

Lawless, Martin S; Vigeant, Michelle C

2017-10-01

Selecting an appropriate listening test design for concert hall research depends on several factors, including listening test method and participant critical-listening experience. Although expert listeners afford more reliable data, their perceptions may not be broadly representative. The present paper contains two studies that examined the validity and reliability of the data obtained from two listening test methods, a successive and a comparative method, and two types of participants, musicians and non-musicians. Participants rated their overall preference of auralizations generated from eight concert hall conditions with a range of reverberation times (0.0-7.2 s). Study 1, with 34 participants, assessed the two methods. The comparative method yielded similar results and reliability as the successive method. Additionally, the comparative method was rated as less difficult and more preferable. For study 2, an additional 37 participants rated the stimuli using the comparative method only. An analysis of variance of the responses from both studies revealed that musicians are better than non-musicians at discerning their preferences across stimuli. This result was confirmed with a k-means clustering analysis on the entire dataset that revealed five preference groups. Four groups exhibited clear preferences to the stimuli, while the fifth group, predominantly comprising non-musicians, demonstrated no clear preference.
Development and psychometric testing of a trans-professional evidence-based practice profile questionnaire.

PubMed

McEvoy, Maureen Patricia; Williams, Marie T; Olds, Timothy Stephen

2010-01-01

Previous survey tools operationalising knowledge, attitudes or beliefs about evidence-based practice (EBP) have shortcomings in content, psychometric properties and target audience. This study developed and psychometrically assessed a self-report trans-professional questionnaire to describe an EBP profile. Sixty-six items were collated from existing EBP questionnaires and administered to 526 academics and students from health and non-health backgrounds. Principal component factor analysis revealed the presence of five factors (Relevance, Terminology, Confidence, Practice and Sympathy). Following expert panel review and pilot testing, the 58-item final questionnaire was disseminated to 105 subjects on two occasions. Test-retest and internal reliability were quantified using intra-class correlation coefficients (ICCs) and Cronbach's alpha, convergent validity against a commonly used EBP questionnaire by Pearson's correlation coefficient and discriminative validity via analysis of variance (ANOVA) based on exposure to EBP training. The final questionnaire demonstrated acceptable internal consistency (Cronbach's alpha 0.96), test-retest reliability (ICCs range 0.77-0.94) and convergent validity (Practice 0.66, Confidence 0.80 and Sympathy 0.54). Three factors (Relevance, Terminology and Confidence) distinguished EBP exposure groups (ANOVA p < 0.001-0.004). The evidence-based practice profile (EBP(2)) questionnaire is a reliable instrument with the ability to discriminate for three factors, between respondents with differing EBP exposures.
Development of a fulcrum methodology to replicate the lateral ankle sprain mechanism and measure dynamic inversion speed.

PubMed

Knight, Adam C; Weimar, Wendi H

2012-09-01

When the ankle is forced into inversion, the speed at which this movement occurs may affect the extent of injury. The purpose of this investigation was to develop a fulcrum device to mimic the mechanism of a lateral ankle sprain and to determine the reliability and validity of the temporal variables produced by this device. Additionally, this device was used to determine if a single previous lateral ankle sprain or ankle taping affected the time to maximum inversion and/or mean inversion speed. Twenty-six participants (13 with history of a single lateral ankle sprain and 13 with no history of injury) completed the testing. The participants completed testing on three separate days, performing 10 trials with the fulcrum per leg on each testing day, and tape was applied to both ankles on one testing day. No significant interactions or main effects were found for either previous injury or ankle taping, but good reliability was found for time to maximum inversion (ICC = .81) and mean inversion speed (ICC = .79). The findings suggest that although neither variable was influenced by the history of a single previous lateral ankle sprain or ankle taping, both variables demonstrated good reliability and construct validity, but not discriminative validity.
