reliability evaluation applied: Topics by Science.gov

Sample records for reliability evaluation applied

A method to evaluate performance reliability of individual subjects in laboratory research applied to work settings.

DOT National Transportation Integrated Search

1978-10-01

This report presents a method that may be used to evaluate the reliability of performance of individual subjects, particularly in applied laboratory research. The method is based on analysis of variance of a tasks-by-subjects data matrix, with all sc...
Reliability analysis of composite structures

NASA Technical Reports Server (NTRS)

Kan, Han-Pin

1992-01-01

A probabilistic static stress analysis methodology has been developed to estimate the reliability of a composite structure. Closed form stress analysis methods are the primary analytical tools used in this methodology. These structural mechanics methods are used to identify independent variables whose variations significantly affect the performance of the structure. Once these variables are identified, scatter in their values is evaluated and statistically characterized. The scatter in applied loads and structural parameters is then fitted to appropriate probabilistic distribution functions. Numerical integration techniques are applied to compute the structural reliability. The predicted reliability accounts for scatter due to variability in material strength, applied load, fabrication and assembly processes. The influence of structural geometry and mode of failure is also considered in the evaluation. Example problems are given to illustrate various levels of analytical complexity.
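The numerical-integration step described in this record can be illustrated with a minimal stress-strength interference sketch. It assumes, purely for illustration, that applied stress and material strength are independent and normally distributed; the record does not say which distributions or integration scheme were used, and all function names and parameter values here are hypothetical.

```python
import math


def normal_pdf(x, mean, sd):
    """Density of a normal distribution."""
    z = (x - mean) / sd
    return math.exp(-0.5 * z * z) / (sd * math.sqrt(2.0 * math.pi))


def normal_cdf(x, mean, sd):
    """Cumulative distribution of a normal distribution."""
    return 0.5 * (1.0 + math.erf((x - mean) / (sd * math.sqrt(2.0))))


def reliability_numeric(load_mean, load_sd, strength_mean, strength_sd, steps=20000):
    """R = integral of f_load(s) * P(strength > s) ds, by the trapezoidal rule."""
    lo = load_mean - 8.0 * load_sd
    hi = load_mean + 8.0 * load_sd
    h = (hi - lo) / steps
    total = 0.0
    for i in range(steps + 1):
        s = lo + i * h
        weight = 0.5 if i in (0, steps) else 1.0
        survive = 1.0 - normal_cdf(s, strength_mean, strength_sd)
        total += weight * normal_pdf(s, load_mean, load_sd) * survive
    return total * h


def reliability_closed_form(load_mean, load_sd, strength_mean, strength_sd):
    """For independent normals, strength minus load is normal, so R = Phi(beta)."""
    beta = (strength_mean - load_mean) / math.sqrt(load_sd ** 2 + strength_sd ** 2)
    return normal_cdf(beta, 0.0, 1.0)
```

With normal inputs the closed form serves as a check on the integration; for other fitted distributions only the numerical route would remain, with `normal_pdf` and `normal_cdf` swapped for the fitted functions.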
Translation, adaptation and inter-rater reliability of the administration manual for the Fugl-Meyer assessment.

PubMed

Michaelsen, Stella M; Rocha, André S; Knabben, Rodrigo J; Rodrigues, Luciano P; Fernandes, Claudia G C

2011-01-01

Recently, the reliability of the Brazilian version of the Fugl-Meyer Assessment (FMA) was assessed through the scoring given according to observations made by a single evaluator who applied the test. When different raters apply the scale, the reliability may depend on the interpretation given to the assessment sheet. In such cases, a clear administration manual is essential for ensuring homogeneity of application. The objectives were to translate and adapt the French Canadian version of the FMA administration manual into Brazilian Portuguese and to evaluate the inter-rater reliability when different evaluators apply the FMA on the basis of the information contained in the manual. Eighteen adults (59±10 years) with chronic hemiparesis (38±35 months after a stroke) took part in this study. Eight patients participated in the first part of the study and 10 in the second part. Based on the results from part 1, an adapted version was developed, in which information and photos were added to illustrate the positions of the patient and evaluator. The inter-rater reliability was assessed using the intraclass correlation coefficient (ICC). The reliability of the FMA based on the adapted version of the manual was excellent for the total motor scores for the upper limbs (ICC=0.98) and lower limbs (ICC=0.90), as well as for movement sense (ICC=0.98) and upper and lower-limb passive range of motion (ICC=0.84 and 0.90, respectively). The reliability was moderate for tactile sensitivity (ICC=0.75). The joint pain assessment presented low reliability. The results showed that, except for pain assessment, application of the FMA based on the adapted version of the application manual for Brazilian Portuguese presented adequate inter-rater reliability.
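For readers unfamiliar with the ICC, the sketch below computes one common form, the two-way random-effects, absolute-agreement, single-rater coefficient often written ICC(2,1). The record does not state which ICC form the authors used, so this choice is an assumption, and the function name is hypothetical.

```python
def icc_2_1(ratings):
    """ICC(2,1) from a subjects-by-raters table.

    ratings: list of rows, one row per subject, one score per rater in each row.
    """
    n = len(ratings)        # number of subjects
    k = len(ratings[0])     # number of raters
    grand = sum(sum(row) for row in ratings) / (n * k)
    row_means = [sum(row) / k for row in ratings]
    col_means = [sum(row[j] for row in ratings) / n for j in range(k)]

    # Two-way ANOVA sums of squares without replication.
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for row in ratings for x in row)
    ss_err = ss_total - ss_rows - ss_cols

    msr = ss_rows / (n - 1)                # between-subjects mean square
    msc = ss_cols / (k - 1)                # between-raters mean square
    mse = ss_err / ((n - 1) * (k - 1))     # residual mean square
    return (msr - mse) / (msr + (k - 1) * mse + k * (msc - mse) / n)
```

Because this form penalises systematic differences between raters, it is a reasonable default when different evaluators are meant to be interchangeable, as in the study design described here.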
Covariate-free and Covariate-dependent Reliability.

PubMed

Bentler, Peter M

2016-12-01

Classical test theory reliability coefficients are said to be population specific. Reliability generalization, a meta-analysis method, is the main procedure for evaluating the stability of reliability coefficients across populations. A new approach is developed to evaluate the degree of invariance of reliability coefficients to population characteristics. Factor or common variance of a reliability measure is partitioned into parts that are, and are not, influenced by control variables, resulting in a partition of reliability into a covariate-dependent and a covariate-free part. The approach can be implemented in a single sample and can be applied to a variety of reliability coefficients.
Standards and reliability in evaluation: when rules of thumb don't apply.

PubMed

Norcini, J J

1999-10-01

The purpose of this paper is to identify situations in which two rules of thumb in evaluation do not apply. The first rule is that all standards should be absolute. When selection decisions are being made or when classroom tests are given, however, relative standards may be better. The second rule of thumb is that every test should have a reliability of .80 or better. Depending on the circumstances, though, the standard error of measurement, the consistency of pass/fail classifications, and the domain-referenced reliability coefficients may be better indicators of reproducibility.
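The standard error of measurement mentioned in this record follows from the classical test theory relation SEM = SD × sqrt(1 − reliability). The sketch below applies it; the function names and the example score scale are hypothetical and are not taken from the paper.

```python
import math


def standard_error_of_measurement(score_sd, reliability):
    """Classical test theory: SEM = SD * sqrt(1 - reliability)."""
    if not 0.0 <= reliability <= 1.0:
        raise ValueError("reliability must lie between 0 and 1")
    return score_sd * math.sqrt(1.0 - reliability)


def score_band(observed, score_sd, reliability, z=1.96):
    """Approximate band of observed score plus or minus z * SEM."""
    sem = standard_error_of_measurement(score_sd, reliability)
    return observed - z * sem, observed + z * sem
```

On a hypothetical scale with a standard deviation of 10, a reliability of .80 corresponds to an SEM of about 4.47 points, which shows why the same coefficient can be adequate for some decisions and too coarse for others.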
Evaluating reliability of WSN with sleep/wake-up interfering nodes

NASA Astrophysics Data System (ADS)

Distefano, Salvatore

2013-10-01

A wireless sensor network (WSN) (singular and plural of acronyms are spelled the same) is a distributed system composed of autonomous sensor nodes wirelessly connected and randomly scattered over a geographical area to cooperatively monitor physical or environmental conditions. Adequate techniques and strategies are required to manage a WSN so that it works properly, observing specific quantities and metrics to evaluate the WSN operational conditions. Among them, one of the most important is reliability. Considering a WSN as a system composed of sensor nodes, the system reliability approach can be applied, thus expressing the WSN reliability in terms of its nodes' reliability. More specifically, since standby power management policies are often applied at node level and interference among nodes may arise, a WSN can be considered as a dynamic system. In this article we therefore consider the WSN reliability evaluation problem from the dynamic system reliability perspective. Static-structural interactions are specified by the WSN topology. Sleep/wake-up standby policies and interference due to wireless communications can instead be considered as dynamic aspects. Thus, in order to represent and to evaluate the WSN reliability, we use dynamic reliability block diagrams and Petri nets. The proposed technique makes it possible to overcome the limits of Markov models when considering non-linear discharge processes, since they cannot adequately represent the aging processes. In order to demonstrate the effectiveness of the technique, we investigate some specific WSN network topologies, providing guidelines for their representation and evaluation.
Human Reliability Analysis in Support of Risk Assessment for Positive Train Control

DOT National Transportation Integrated Search

2003-06-01

This report describes an approach to evaluating the reliability of human actions that are modeled in a probabilistic risk assessment (PRA) of train control operations. This approach to human reliability analysis (HRA) has been applied in the case o...
Automation of reliability evaluation procedures through CARE - The computer-aided reliability estimation program.

NASA Technical Reports Server (NTRS)

Mathur, F. P.

1972-01-01

Description of an on-line interactive computer program called CARE (Computer-Aided Reliability Estimation) which can model self-repair and fault-tolerant organizations and perform certain other functions. Essentially CARE consists of a repository of mathematical equations defining the various basic redundancy schemes. These equations, under program control, are then interrelated to generate the desired mathematical model to fit the architecture of the system under evaluation. The mathematical model is then supplied with ground instances of its variables and is then evaluated to generate values for the reliability-theoretic functions applied to the model.
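The idea of a repository of redundancy equations that are composed to fit a system architecture can be sketched as follows. These are standard textbook formulas (exponential failure law, series, parallel, and triple modular redundancy with a perfect voter); they are assumptions chosen for illustration, not the equations actually stored in CARE, and all names are hypothetical.

```python
import math


def simplex(failure_rate, hours):
    """Reliability of a single unit under a constant failure rate."""
    return math.exp(-failure_rate * hours)


def series(*reliabilities):
    """All units must work."""
    result = 1.0
    for r in reliabilities:
        result *= r
    return result


def parallel(*reliabilities):
    """At least one unit must work."""
    all_fail = 1.0
    for r in reliabilities:
        all_fail *= (1.0 - r)
    return 1.0 - all_fail


def tmr(r):
    """Triple modular redundancy with a perfect voter: at least 2 of 3 units work."""
    return 3.0 * r ** 2 - 2.0 * r ** 3


def example_system(failure_rate, hours):
    """Hypothetical architecture: a TMR processor in series with duplexed memory."""
    unit = simplex(failure_rate, hours)
    return series(tmr(unit), parallel(unit, unit))
```

Composing small functions in this way mirrors the record's description of interrelating basic equations to model a given architecture and then supplying concrete values for the variables.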
Retest Reliability of the Rosenzweig Picture-Frustration Study and Similar Semiprojective Techniques

ERIC Educational Resources Information Center

Rosenzweig, Saul; And Others

1975-01-01

The research dealing with the reliability of the Rosenzweig Picture-Frustration Study is surveyed. Analyses of various split-half and retest procedures are reviewed and their relative effectiveness evaluated. Reliability measures as applied to projective techniques in general are discussed. (Author/DEP)
Emulation applied to reliability analysis of reconfigurable, highly reliable, fault-tolerant computing systems

NASA Technical Reports Server (NTRS)

Migneault, G. E.

1979-01-01

Emulation techniques applied to the analysis of the reliability of highly reliable computer systems for future commercial aircraft are described. The lack of credible precision in reliability estimates obtained by analytical modeling techniques is first established. The difficulty is shown to be an unavoidable consequence of: (1) a high reliability requirement so demanding as to make system evaluation by use testing infeasible; (2) a complex system design technique, fault tolerance; (3) system reliability dominated by errors due to flaws in the system definition; and (4) elaborate analytical modeling techniques whose precision outputs are quite sensitive to errors of approximation in their input data. Next, the technique of emulation is described, indicating how its input is a simple description of the logical structure of a system and its output is the consequent behavior. Use of emulation techniques is discussed for pseudo-testing systems to evaluate bounds on the parameter values needed for the analytical techniques. Finally an illustrative example is presented to demonstrate from actual use the promise of the proposed application of emulation.
[Validation of the Polish version of The Authentic Leadership Questionnaire for the purpose of evaluation of nursing management staff in national hospital wards].

PubMed

Sierpińska, Lidia

2013-09-01

The Authentic Leadership Questionnaire (ALQ) is a standardized research instrument for evaluating the individual elements of a leader's conduct that contribute to authentic leadership. Applying this questionnaire in Polish conditions required a validation process. The aim of the study was to evaluate the validity and reliability of the Polish version of the American research instrument for evaluating the authenticity of leadership of nursing management in Polish hospitals. The study covered 286 nurses (143 head nurses and 143 of their subordinates) employed in 45 hospitals in Poland. Theoretical validity of the instrument was evaluated using Fisher's transformation (Pearson's r correlation coefficient), while the criterion validity of the ALQ was evaluated using Spearman's rho correlation coefficient and the BOHIPSZO questionnaire. The reliability of the ALQ was assessed by means of Cronbach's alpha coefficient. The ALQ questionnaire applied for the evaluation of authenticity of leadership of the nursing management in Polish hospital wards shows acceptable theoretical and criterion validity and reliability (Cronbach's alpha coefficient 0.80). The Polish version of the ALQ is valid and reliable, and may be applied in studies concerning the evaluation of authenticity of leadership of the nursing management in Polish hospital wards.
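Cronbach's alpha, the internal-consistency coefficient reported here, can be computed from a respondents-by-items score table as sketched below. The function name and data layout are hypothetical; the formula is the usual one, alpha = k/(k − 1) × (1 − sum of item variances / variance of total scores).

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha.

    scores: list of respondents, each a list with one score per questionnaire item.
    Uses sample variances (n - 1 denominator) throughout.
    """
    k = len(scores[0])  # number of items
    item_variances = [variance([row[j] for row in scores]) for j in range(k)]
    total_variance = variance([sum(row) for row in scores])
    return k / (k - 1) * (1.0 - sum(item_variances) / total_variance)
```

The coefficient rises as items covary more strongly relative to their individual variances, which is why it is read as a measure of how consistently the items measure one construct.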
Research on Novel Algorithms for Smart Grid Reliability Assessment and Economic Dispatch

NASA Astrophysics Data System (ADS)

Luo, Wenjin

In this dissertation, several studies of electric power system reliability and economy assessment methods are presented. To be more precise, several algorithms for evaluating power system reliability and economy are studied. Furthermore, two novel algorithms are applied to this field and their simulation results are compared with conventional results. As the electrical power system develops towards extra-high voltage, long-distance transmission, large capacity and regional networking, a number of new technologies and equipment have been applied, the electricity market system has gradually been established, and the consequences of power cuts have become more and more serious. The electrical power system needs the highest possible reliability because of its complexity and security requirements. In this dissertation the Boolean logic Driven Markov Process (BDMP) method is studied and applied to evaluate power system reliability. This approach has several benefits. It allows complex dynamic models to be defined while remaining as readable as conventional methods. This method has been applied to evaluate the IEEE reliability test system. The simulation results obtained are close to IEEE experimental data, which means that it could be used for future study of system reliability. Besides reliability, modern power systems are expected to be more economical. This dissertation presents a novel evolutionary algorithm named the quantum evolutionary membrane algorithm (QEPS), which combines the concept and theory of quantum-inspired evolutionary algorithms and membrane computation, to solve the economic dispatch problem in a renewable power system with onshore and offshore wind farms. A case derived from real data is used for simulation tests. Another conventional evolutionary algorithm is also used to solve the same problem for comparison. The experimental results show that the proposed method quickly and accurately obtains the optimal solution, which is the minimum cost of electricity supplied by the wind farm system.
Dynamic decision-making for reliability and maintenance analysis of manufacturing systems based on failure effects

NASA Astrophysics Data System (ADS)

Zhang, Ding; Zhang, Yingjie

2017-09-01

A framework for reliability and maintenance analysis of job shop manufacturing systems is proposed in this paper. An efficient preventive maintenance (PM) policy in terms of failure effects analysis (FEA) is proposed. Subsequently, reliability evaluation and component importance measure based on FEA are performed under the PM policy. A job shop manufacturing system is used to validate the reliability evaluation and dynamic maintenance policy. The results obtained are compared with existing methods and the effectiveness is validated. Some poorly understood issues, such as network modelling, vulnerability identification, the evaluation criteria of repairable systems, and PM policy during manufacturing system reliability analysis, are clarified. This framework can help with reliability optimisation and rational allocation of maintenance resources in job shop manufacturing systems.
Coefficient Alpha: A Reliability Coefficient for the 21st Century?

ERIC Educational Resources Information Center

Yang, Yanyun; Green, Samuel B.

2011-01-01

Coefficient alpha is almost universally applied to assess reliability of scales in psychology. We argue that researchers should consider alternatives to coefficient alpha. Our preference is for structural equation modeling (SEM) estimates of reliability because they are informative and allow for an empirical evaluation of the assumptions…
77 FR 53877 - Commission Information Collection Activities (FERC-715); Comment Request; Extension

Federal Register 2010, 2011, 2012, 2013, 2014

2012-09-04

...; A detailed description of the transmission planning reliability criteria used to evaluate system... reliability criteria are applied and the steps taken in performing transmission planning studies); and A... reliability criteria using its stated assessment practices. The FERC-715 enables the Commission to use the...
Reliability Evaluation of Machine Center Components Based on Cascading Failure Analysis

NASA Astrophysics Data System (ADS)

Zhang, Ying-Zhi; Liu, Jin-Tong; Shen, Gui-Xiang; Long, Zhe; Sun, Shu-Guang

2017-07-01

Traditional reliability evaluation of machine center components overlooks failure propagation, so the component reliability model deviates and the evaluated reliability is too low. To rectify these problems, a new reliability evaluation method based on cascading failure analysis and assessment of the failure influenced degree is proposed. A directed graph model of cascading failure among components is established according to cascading failure mechanism analysis and graph theory. The failure influenced degrees of the system components are assessed by the adjacency matrix and its transpose, combined with the PageRank algorithm. Based on the comprehensive failure probability function and total probability formula, the inherent failure probability function is determined to realize the reliability evaluation of the system components. Finally, the method is applied to a machine center, with the following results: 1) The reliability evaluation values of the proposed method are at least 2.5% higher than those of the traditional method; 2) The difference between the comprehensive and inherent reliability of the system component presents a positive correlation with the failure influenced degree of the system component, which provides a theoretical basis for reliability allocation of the machine center system.
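The graph step can be sketched with a plain power-iteration PageRank run on a cascading-failure adjacency matrix and on its transpose. How the paper combines the two rankings into a failure influenced degree is not stated in the abstract, so this sketch only produces the two rankings; the damping factor, function names and edge convention are assumptions.

```python
def pagerank(adj, damping=0.85, tol=1e-12, max_iter=1000):
    """Power-iteration PageRank.

    adj[i][j] > 0 means a failure of component i can propagate to component j.
    Rank of nodes without outgoing edges is spread evenly over all nodes.
    """
    n = len(adj)
    out_weight = [sum(row) for row in adj]
    rank = [1.0 / n] * n
    for _ in range(max_iter):
        dangling = sum(rank[i] for i in range(n) if out_weight[i] == 0)
        new_rank = []
        for j in range(n):
            inflow = sum(
                rank[i] * adj[i][j] / out_weight[i]
                for i in range(n)
                if out_weight[i] > 0
            )
            new_rank.append((1.0 - damping) / n + damping * (inflow + dangling / n))
        if sum(abs(a - b) for a, b in zip(new_rank, rank)) < tol:
            return new_rank
        rank = new_rank
    return rank


def transpose(adj):
    """Reverse every edge of the graph."""
    return [list(col) for col in zip(*adj)]
```

Under this edge convention, ranking on the original matrix highlights components that tend to be affected by failures elsewhere, while ranking on the transpose highlights components whose failures tend to spread.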
Evaluation of the psychometric properties of the main meal quality index when applied in the UK population.

PubMed

Gorgulho, B M; Pot, G K; Marchioni, D M

2017-05-01

The aim of this study was to evaluate the validity and reliability of the Main Meal Quality Index when applied to the UK population. The indicator was developed to assess meal quality in different populations, and is composed of 10 components: fruit, vegetables (excluding potatoes), ratio of animal protein to total protein, fiber, carbohydrate, total fat, saturated fat, processed meat, sugary beverages and desserts, and energy density, resulting in a score range of 0-100 points. The performance of the indicator was measured using strategies for assessing content validity, construct validity, discriminant validity and reliability, including principal component analysis, linear regression models and Cronbach's alpha. The indicator presented good reliability. The Main Meal Quality Index has been shown to be valid for use as an instrument to evaluate, monitor and compare the quality of meals consumed by adults in the United Kingdom.
Subject-level reliability analysis of fast fMRI with application to epilepsy.

PubMed

Hao, Yongfu; Khoo, Hui Ming; von Ellenrieder, Nicolas; Gotman, Jean

2017-07-01

Recent studies have applied the new magnetic resonance encephalography (MREG) sequence to the study of interictal epileptic discharges (IEDs) in the electroencephalogram (EEG) of epileptic patients. However, there are no criteria to quantitatively evaluate different processing methods, to properly use the new sequence. We evaluated different processing steps of this new sequence under the common generalized linear model (GLM) framework by assessing the reliability of results. A bootstrap sampling technique was first used to generate multiple replicated data sets; a GLM with different processing steps was then applied to obtain activation maps, and the reliability of these maps was assessed. We applied our analysis in an event-related GLM related to IEDs. A higher reliability was achieved by using a GLM with head motion confound regressor with 24 components rather than the usual 6, with an autoregressive model of order 5 and with a canonical hemodynamic response function (HRF) rather than variable latency or patient-specific HRFs. Comparison of activation with IED field also favored the canonical HRF, consistent with the reliability analysis. The reliability analysis helps to optimize the processing methods for this fast fMRI sequence, in a context in which we do not know the ground truth of activation areas. Magn Reson Med 78:370-382, 2017. © 2016 International Society for Magnetic Resonance in Medicine.
Development and validation of a tool to evaluate the quality of medical education websites in pathology.

PubMed

Alyusuf, Raja H; Prasad, Kameshwar; Abdel Satir, Ali M; Abalkhail, Ali A; Arora, Roopa K

2013-01-01

The exponential use of the internet as a learning resource, coupled with the varied quality of many websites, has led to a need to identify suitable websites for teaching purposes. The aim of this study is to develop and to validate a tool, which evaluates the quality of undergraduate medical educational websites; and apply it to the field of pathology. A tool was devised through several steps of item generation, reduction, weightage, pilot testing, post-pilot modification of the tool and validating the tool. Tool validation included measurement of inter-observer reliability; and generation of criterion related, construct related and content related validity. The validated tool was subsequently tested by applying it to a population of pathology websites. Reliability testing showed a high internal consistency reliability (Cronbach's alpha = 0.92), high inter-observer reliability (Pearson's correlation r = 0.88), intraclass correlation coefficient = 0.85 and κ = 0.75. It showed high criterion related, construct related and content related validity. The tool showed moderately high concordance with the gold standard (κ = 0.61); 92.2% sensitivity, 67.8% specificity, 75.6% positive predictive value and 88.9% negative predictive value. The validated tool was applied to 278 websites; 29.9% were rated as recommended, 41.0% as recommended with caution and 29.1% as not recommended. A systematic tool was devised to evaluate the quality of websites for medical educational purposes. The tool was shown to yield reliable and valid inferences through its application to pathology websites.
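The concordance measures reported against the gold standard (sensitivity, specificity, predictive values and κ) all derive from a 2×2 table of tool rating versus gold-standard rating. The sketch below shows the standard definitions; the function names are hypothetical, and the underlying counts from the study are not given in the abstract, so none are reproduced here.

```python
def diagnostic_metrics(tp, fp, fn, tn):
    """Standard measures from a 2x2 table (tool result vs gold standard)."""
    return {
        "sensitivity": tp / (tp + fn),   # true positives among gold-standard positives
        "specificity": tn / (tn + fp),   # true negatives among gold-standard negatives
        "ppv": tp / (tp + fp),           # positive predictive value
        "npv": tn / (tn + fn),           # negative predictive value
    }


def cohen_kappa(tp, fp, fn, tn):
    """Cohen's kappa: agreement beyond what chance alone would produce."""
    n = tp + fp + fn + tn
    observed = (tp + tn) / n
    expected = ((tp + fp) * (tp + fn) + (fn + tn) * (fp + tn)) / (n * n)
    return (observed - expected) / (1.0 - expected)
```

Reporting κ alongside sensitivity and specificity is useful because κ corrects raw agreement for the agreement expected from the marginal totals alone.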
Detecting long-term growth trends using tree rings: a critical evaluation of methods.

PubMed

Peters, Richard L; Groenendijk, Peter; Vlam, Mart; Zuidema, Pieter A

2015-05-01

Tree-ring analysis is often used to assess long-term trends in tree growth. A variety of growth-trend detection methods (GDMs) exist to disentangle age/size trends in growth from long-term growth changes. However, these detrending methods strongly differ in approach, with possible implications for their output. Here, we critically evaluate the consistency, sensitivity, reliability and accuracy of the four most widely used GDMs: conservative detrending (CD) applies mathematical functions to correct for decreasing ring widths with age; basal area correction (BAC) transforms diameter into basal area growth; regional curve standardization (RCS) detrends individual tree-ring series using average age/size trends; and size class isolation (SCI) calculates growth trends within separate size classes. First, we evaluated whether these GDMs produce consistent results applied to an empirical tree-ring data set of Melia azedarach, a tropical tree species from Thailand. Three GDMs yielded similar results - a growth decline over time - but the widely used CD method did not detect any change. Second, we assessed the sensitivity (probability of correct growth-trend detection), reliability (100% minus probability of detecting false trends) and accuracy (whether the strength of imposed trends is correctly detected) of these GDMs, by applying them to simulated growth trajectories with different imposed trends: no trend, strong trends (-6% and +6% change per decade) and weak trends (-2%, +2%). All methods except CD showed high sensitivity, reliability and accuracy to detect strong imposed trends. However, these were considerably lower in the weak or no-trend scenarios. BAC showed good sensitivity and accuracy, but low reliability, indicating uncertainty of trend detection using this method. Our study reveals that the choice of GDM influences results of growth-trend studies. We recommend applying multiple methods when analysing trends and encourage performing sensitivity and reliability analysis. Finally, we recommend SCI and RCS, as these methods showed highest reliability to detect long-term growth trends. © 2014 John Wiley & Sons Ltd.

Evaluation Applied to Reliability Analysis of Reconfigurable, Highly Reliable, Fault-Tolerant, Computing Systems for Avionics

NASA Technical Reports Server (NTRS)

Migneault, G. E.

1979-01-01

Emulation techniques are proposed as a solution to a difficulty arising in the analysis of the reliability of highly reliable computer systems for future commercial aircraft. The difficulty, viz., the lack of credible precision in reliability estimates obtained by analytical modeling techniques, is established. The difficulty is shown to be an unavoidable consequence of: (1) a high reliability requirement so demanding as to make system evaluation by use testing infeasible, (2) a complex system design technique, fault tolerance, (3) system reliability dominated by errors due to flaws in the system definition, and (4) elaborate analytical modeling techniques whose precision outputs are quite sensitive to errors of approximation in their input data. The technique of emulation is described, indicating how its input is a simple description of the logical structure of a system and its output is the consequent behavior. The use of emulation techniques is discussed for pseudo-testing systems to evaluate bounds on the parameter values needed for the analytical techniques.
The development and evaluation of a novel repurposing of a peripheral gaming device for the acquisition of forces applied to a hydraulic treatment plinth.

PubMed

Cooper, Darren; Bevins, Joe; Corbett, Mark

2018-01-13

This technical note details the stages taken to create an instrumented hydraulic treatment plinth for the measurement of applied forces in the vertical axis. The modification used a widely available low-cost peripheral gaming device and required only basic construction and computer skills. The instrumented treatment plinth was validated against a laboratory grade force platform across a range of applied masses from 0.5-15 kg, mock Gr I-IV vertebral mobilisations and a dynamic response test. Intraclass correlation coefficients demonstrated poor reliability (0.46) for low masses of 0.5 kg, improving to excellent for larger masses up to 15 kg; excellent to good reliability (0.97-0.86) for the mock mobilisations; and moderate reliability (0.51) for the dynamic response test. The study demonstrates how a cheap peripheral gaming device can be repurposed so that forces applied to a hydraulic treatment plinth can be collected reliably when applied in a clinically reasoned manner. Copyright © 2018 Elsevier Ltd. All rights reserved.
Development and validation of a tool to evaluate the quality of medical education websites in pathology

PubMed Central

Alyusuf, Raja H.; Prasad, Kameshwar; Abdel Satir, Ali M.; Abalkhail, Ali A.; Arora, Roopa K.

2013-01-01

Background: The exponential use of the internet as a learning resource, coupled with the varied quality of many websites, has led to a need to identify suitable websites for teaching purposes. Aim: The aim of this study is to develop and to validate a tool, which evaluates the quality of undergraduate medical educational websites; and apply it to the field of pathology. Methods: A tool was devised through several steps of item generation, reduction, weightage, pilot testing, post-pilot modification of the tool and validating the tool. Tool validation included measurement of inter-observer reliability; and generation of criterion related, construct related and content related validity. The validated tool was subsequently tested by applying it to a population of pathology websites. Results and Discussion: Reliability testing showed a high internal consistency reliability (Cronbach's alpha = 0.92), high inter-observer reliability (Pearson's correlation r = 0.88), intraclass correlation coefficient = 0.85 and κ = 0.75. It showed high criterion related, construct related and content related validity. The tool showed moderately high concordance with the gold standard (κ = 0.61); 92.2% sensitivity, 67.8% specificity, 75.6% positive predictive value and 88.9% negative predictive value. The validated tool was applied to 278 websites; 29.9% were rated as recommended, 41.0% as recommended with caution and 29.1% as not recommended. Conclusion: A systematic tool was devised to evaluate the quality of websites for medical educational purposes. The tool was shown to yield reliable and valid inferences through its application to pathology websites. PMID:24392243
The Program Evaluation Standards Applied for Metaevaluation Purposes: Investigating Interrater Reliability and Implications for Use

ERIC Educational Resources Information Center

Wingate, Lori A.

2009-01-01

Metaevaluation is the evaluation of evaluation. Metaevaluation may focus on particular evaluation cases, evaluation systems, or the discipline overall. Leading scholars within the discipline consider metaevaluation to be a professional imperative, demonstrating that evaluation is a reflexive enterprise. Various criteria have been set forth for what…
HTGR plant availability and reliability evaluations. Volume I. Summary of evaluations

DOE Office of Scientific and Technical Information (OSTI.GOV)

Cadwallader, G.J.; Hannaman, G.W.; Jacobsen, F.K.

1976-12-01

The report (1) describes a reliability assessment methodology for systematically locating and correcting areas which may contribute to unavailability of new and uniquely designed components and systems, (2) illustrates the methodology by applying it to such components in a high-temperature gas-cooled reactor (Public Service Company of Colorado's Fort St. Vrain 330-MW(e) HTGR), and (3) compares the results of the assessment with actual experience. The methodology can be applied to any component or system; however, it is particularly valuable for assessments of components or systems which provide essential functions, or the failure or mishandling of which could result in relatively large economic losses.
Interrater reliability levels of multiple clinical examiners in the evaluation of a schizophrenic patient: quality of life, level of functioning, and neuropsychological symptomatology.

PubMed

Cicchetti, D V; Rosenheck, R; Showalter, D; Charney, D; Cramer, J

1999-05-01

Sir Ronald Fisher used a single-subject design to derive the concepts of appropriate research design, randomization, sensitivity, and tests of statistical significance. The seminal work of Broca demonstrated that valid and generalizable findings can and have emerged from studies of a single patient in neuropsychology. In order to assess the reliability and/or validity of any clinical phenomena that derive from single subject research, it becomes necessary to apply appropriate biostatistical methodology. The authors develop just such an approach and apply it successfully to the evaluation of the functioning, quality of life, and neuropsychological symptomatology of a single schizophrenic patient.
Low cost MATLAB-based pulse oximeter for deployment in research and development applications.

PubMed

Shokouhian, M; Morling, R C S; Kale, I

2013-01-01

Problems such as motion artifact and effects of ambient light have forced developers to design different signal processing techniques and algorithms to increase the reliability and accuracy of the conventional pulse oximeter device. To evaluate the robustness of these techniques, they are applied either to recorded data or are implemented on chip to be applied to real-time data. Evaluation on recorded data is the most common method; however, it is not as reliable as real-time measurement. On the other hand, hardware implementation can be both expensive and time consuming. This paper presents a low cost MATLAB-based pulse oximeter that can be used for rapid evaluation of newly developed signal processing techniques and algorithms. The flexibility to apply different signal processing techniques, the availability of both processed and unprocessed data, and the low implementation cost are the important features of this design, which make it ideal for research and development purposes, as well as commercial, hospital and healthcare applications.
Establishing the Validity and Reliability of Course Evaluation Questionnaires

ERIC Educational Resources Information Center

Kember, David; Leung, Doris Y. P.

2008-01-01

This article uses the case of designing a new course questionnaire to discuss the issues of validity, reliability and diagnostic power in good questionnaire design. Validity is often not well addressed in course questionnaire design as there are no straightforward tests that can be applied to an individual instrument. The authors propose the…
Evaluation of the fast orthogonal search method for forecasting chloride levels in the Deltona groundwater supply (Florida, USA)

NASA Astrophysics Data System (ADS)

El-Jaat, Majda; Hulley, Michael; Tétreault, Michel

2018-02-01

Despite the broad impact and importance of saltwater intrusion in coastal aquifers, little research has been directed towards forecasting saltwater intrusion in areas where the source of saltwater is uncertain. Saline contamination in inland groundwater supplies is a concern for numerous communities in the southern US including the city of Deltona, Florida. Furthermore, conventional numerical tools for forecasting saltwater contamination are heavily dependent on reliable characterization of the physical characteristics of underlying aquifers, information that is often absent or challenging to obtain. To overcome these limitations, a reliable alternative data-driven model for forecasting salinity in a groundwater supply was developed for Deltona using the fast orthogonal search (FOS) method. FOS was applied on monthly water-demand data and corresponding chloride concentrations at water supply wells. Groundwater salinity measurements from Deltona water supply wells were applied to evaluate the forecasting capability and accuracy of the FOS model. Accurate and reliable groundwater salinity forecasting is necessary to support effective and sustainable coastal-water resource planning and management. The available (27) water supply wells for Deltona were randomly split into three test groups for the purposes of FOS model development and performance assessment. Based on four performance indices (RMSE, RSR, NSEC, and R), the FOS model proved to be a reliable and robust forecaster of groundwater salinity. FOS is relatively inexpensive to apply, is not based on rigorous physical characterization of the water supply aquifer, and yields reliable estimates of groundwater salinity in active water supply wells.
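The four performance indices named in the abstract above can be sketched as follows. The abstract does not give formulas, so this assumes the commonly used definitions: RMSE as root-mean-square error, RSR as RMSE divided by the standard deviation of the observations, NSEC as the Nash-Sutcliffe efficiency coefficient, and R as the Pearson correlation coefficient. The function name and inputs are hypothetical.

```python
import math

def performance_indices(observed, predicted):
    """Return RMSE, RSR, NSEC and R for paired observed/predicted values.

    Definitions are the commonly used ones (an assumption; the abstract
    does not state the exact formulas used in the study).
    """
    n = len(observed)
    mean_obs = sum(observed) / n
    mean_pred = sum(predicted) / n
    # Sum of squared errors and total sum of squares of the observations
    sse = sum((o - p) ** 2 for o, p in zip(observed, predicted))
    sst = sum((o - mean_obs) ** 2 for o in observed)
    rmse = math.sqrt(sse / n)
    rsr = math.sqrt(sse) / math.sqrt(sst)   # RMSE / std. dev. of observations
    nsec = 1.0 - sse / sst                  # Nash-Sutcliffe efficiency
    cov = sum((o - mean_obs) * (p - mean_pred)
              for o, p in zip(observed, predicted))
    var_pred = sum((p - mean_pred) ** 2 for p in predicted)
    r = cov / math.sqrt(sst * var_pred)     # Pearson correlation
    return {"RMSE": rmse, "RSR": rsr, "NSEC": nsec, "R": r}
```

Under these definitions NSEC equals 1 minus RSR squared, so the two indices carry the same information on different scales, while R ignores any constant bias in the forecasts.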
Interrater and intrarater reliability of FDI criteria applied to photographs of posterior tooth-colored restorations.

PubMed

Kim, Dohyun; Ahn, So-Yeon; Kim, Junyoung; Park, Sung-Ho

2017-07-01

Since 2007, the FDI World Dental Federation (FDI) criteria have been used for the clinical evaluation of dental restorations. However, the reliability of the FDI criteria has not been sufficiently addressed. The purpose of this study was to assess and compare the interrater and intrarater reliability of the FDI criteria by evaluating posterior tooth-colored restorations photographically. A total of 160 clinical photographs of posterior tooth-colored restorations were evaluated independently by 5 raters with 9 of the FDI criteria suitable for photographic evaluation. The raters recorded the score of each restoration by using 5 grades, and the score was dichotomized into the clinical evaluation scores. After 1 month, 2 of the raters reevaluated the same set of 160 photographs in random order. To estimate the interrater reliability among the 5 raters, the proportion of agreement was calculated, and the Fleiss multirater kappa statistic was used. For the intrarater reliability, the proportion of agreement was calculated, and the Cohen standard kappa statistic was used for each of the 2 raters. The interrater proportion of agreement was 0.41 to 0.57, and the kappa value was 0.09 to 0.39. Overall, the intrarater reliability was higher than the interrater reliability, and rater 1 demonstrated higher intrarater reliability than rater 2. The proportion of agreement and kappa values increased when the 5 scores were dichotomized. The reliability was relatively lower for the esthetic properties compared with the functional or biological properties. Within the limitations of this study, the FDI criteria presented slight to fair interrater reliability and fair to excellent intrarater reliability in the photographic evaluation of posterior tooth-colored restorations. The reliability was improved by simplifying the evaluation scores. Copyright © 2016 Editorial Council for the Journal of Prosthetic Dentistry. Published by Elsevier Inc. All rights reserved.
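As an illustration of the multirater statistic mentioned above, the following is a minimal sketch of Fleiss' kappa. It assumes the ratings are already tabulated as one row per photograph, with each entry counting how many raters chose that grade; the function name and input layout are hypothetical, and the study's own software is not described in the abstract.

```python
def fleiss_kappa(counts):
    """Fleiss' multirater kappa.

    counts[i][j] = number of raters who assigned subject i to category j.
    Every row must sum to the same number of raters.
    Returns (mean pairwise agreement, kappa).
    """
    n_subjects = len(counts)
    n_raters = sum(counts[0])
    n_categories = len(counts[0])
    # Per-subject proportion of agreeing rater pairs
    p_i = [
        (sum(c * c for c in row) - n_raters) / (n_raters * (n_raters - 1))
        for row in counts
    ]
    p_bar = sum(p_i) / n_subjects
    # Overall share of all ratings that fall in each category
    p_j = [
        sum(row[j] for row in counts) / (n_subjects * n_raters)
        for j in range(n_categories)
    ]
    p_e = sum(p * p for p in p_j)  # agreement expected by chance
    return p_bar, (p_bar - p_e) / (1 - p_e)
```

Collapsing five grades into two categories changes both the observed and the chance agreement, which is why kappa has to be recomputed after dichotomization rather than rescaled.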
The National Aeronautics and Space Administration Nondestructive Evaluation Program for Safe and Reliable Operations

NASA Technical Reports Server (NTRS)

Generazio, Ed

2005-01-01

The National Aeronautics and Space Administration (NASA) Nondestructive Evaluation (NDE) Program is presented. As a result of the loss of seven astronauts and the Space Shuttle Columbia on February 1, 2003, NASA has undergone many changes in its organization. NDE is one of the key areas that the Columbia Accident Investigation Board (CAIB) recognized as needing to be strengthened, by warranting NDE as a discipline with Independent Technical Authority (iTA). The current NASA NDE system and activities are presented, including the latest developments in inspection technologies being applied to the Space Transportation System (STS). The unfolding trends and directions in NDE for the future are discussed as they apply to assuring safe and reliable operations.
Techniques to evaluate the importance of common cause degradation on reliability and safety of nuclear weapons.

DOE Office of Scientific and Technical Information (OSTI.GOV)

Darby, John L.

2011-05-01

As the nuclear weapon stockpile ages, there is increased concern about common degradation ultimately leading to common cause failure of multiple weapons that could significantly impact reliability or safety. Current acceptable limits for the reliability and safety of a weapon are based on upper limits on the probability of failure of an individual item, assuming that failures among items are independent. We expanded the current acceptable limits to apply to situations with common cause failure. Then, we developed a simple screening process to quickly assess the importance of observed common degradation for both reliability and safety to determine if further action is necessary. The screening process conservatively assumes that common degradation is common cause failure. For a population with between 100 and 5000 items we applied the screening process and conclude the following. In general, for a reliability requirement specified in the Military Characteristics (MCs) for a specific weapon system, common degradation is of concern if more than 100(1-x)% of the weapons are susceptible to common degradation, where x is the required reliability expressed as a fraction. Common degradation is of concern for the safety of a weapon subsystem if more than 0.1% of the population is susceptible to common degradation. Common degradation is of concern for the safety of a weapon component or overall weapon system if two or more components/weapons in the population are susceptible to degradation. Finally, we developed a technique for detailed evaluation of common degradation leading to common cause failure for situations that are determined to be of concern using the screening process. The detailed evaluation requires that best estimates of common cause and independent failure probabilities be produced. Using these techniques, observed common degradation can be evaluated for effects on reliability and safety.
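The three screening thresholds quoted in the abstract above can be written down directly. The sketch below only transcribes those stated rules; the function name, argument names, and the dictionary keys are hypothetical, and the report's detailed evaluation technique is not reproduced.

```python
def screening_flags(population, susceptible, required_reliability):
    """Apply the three screening rules stated in the abstract.

    population           -- number of items (the abstract covers 100 to 5000)
    susceptible          -- number of items susceptible to common degradation
    required_reliability -- x, the required reliability as a fraction
    """
    fraction = susceptible / population
    return {
        # Concern if more than 100(1 - x)% of the weapons are susceptible
        "reliability": fraction > (1.0 - required_reliability),
        # Concern if more than 0.1% of the population is susceptible
        "subsystem_safety": fraction > 0.001,
        # Concern if two or more components/weapons are susceptible
        "component_or_system_safety": susceptible >= 2,
    }
```

For example, with a required reliability of 0.99 the reliability rule is triggered once more than 1% of the population is susceptible, while the subsystem safety rule triggers at a tenth of that fraction.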
An Evaluation of the Reliability of the Food Label Literacy Questionnaire in Russian

ERIC Educational Resources Information Center

Gurevich, Konstantin G.; Reynolds, Jesse; Bifulco, Lauren; Doughty, Kimberly; Njike, Valentine; Katz, David L.

2016-01-01

Objective: School-based nutrition education can promote the development of skills, such as food label reading, that can contribute to making healthier food choices. The purpose of this study was to assess the reliability of a Russian language version of the previously validated Food Label Literacy for Applied Nutrition Knowledge (FLLANK)…
Compliance of LC50 and NOEC data with Benford's Law: an indication of reliability?

PubMed

de Vries, Pepijn; Murk, Albertinka J

2013-12-01

Reliability of research data is essential, especially when potentially far-reaching conclusions will be based on them. This is also, amongst others, the case for ecotoxicological data used in risk assessment. Currently, several approaches are available to classify the reliability of ecotoxicological data. The process of classification, such as using the Klimisch score, is time-consuming and focuses on the application of standardised protocols and the documentation of the study. The presence of irregularities and the integrity of the performed work, however, are not addressed. The present study shows that Benford's Law, based on the occurrence of first digits following a logarithmic scale, can be applied to ecotoxicity test data for identifying irregularities. This approach is already successfully applied in accounting. Benford's Law can be used as reliability indicator, in addition to existing reliability classifications. The law can be used to efficiently trace irregularities in large data sets of interpolated (no) effect concentrations such as LC50s (possibly the result of data manipulation), without having to evaluate the source of each individual record. Application of the law to systems in which large amounts of toxicity data are registered (e.g., European Commission Regulation concerning the Registration, Evaluation, Authorisation and Restriction of Chemicals) can therefore be valuable. © 2013 Elsevier Inc. All rights reserved.
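A first-digit check of the kind described above can be sketched in a few lines. Benford's Law gives the expected share of leading digit d as log10(1 + 1/d). Using a chi-square statistic to measure the deviation is an assumption made here for illustration; the abstract does not say which compliance test the authors used, and the function names are hypothetical.

```python
import math
from collections import Counter

def first_digit(x):
    """Leading significant digit of a nonzero number."""
    # In scientific notation the first character is the leading digit
    return int(f"{abs(x):.15e}"[0])

def benford_chi_square(values):
    """Chi-square statistic of observed first digits against Benford's Law.

    Larger values indicate a stronger departure from the expected
    logarithmic first-digit distribution. Zeros are skipped.
    """
    digits = [first_digit(v) for v in values if v != 0]
    n = len(digits)
    observed = Counter(digits)
    chi2 = 0.0
    for d in range(1, 10):
        expected = n * math.log10(1 + 1 / d)
        chi2 += (observed.get(d, 0) - expected) ** 2 / expected
    return chi2
```

The statistic would be compared against a chi-square distribution with 8 degrees of freedom. A data set spanning several orders of magnitude, such as a large collection of LC50 values, is the setting in which such a comparison is meaningful.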
Web Site Design Benchmarking within Industry Groups.

ERIC Educational Resources Information Center

Kim, Sung-Eon; Shaw, Thomas; Schneider, Helmut

2003-01-01

Discussion of electronic commerce focuses on Web site evaluation criteria and applies them to different industry groups in Korea. Defines six categories of Web site evaluation criteria: business function, corporate credibility, contents reliability, Web site attractiveness, systematic structure, and navigation; and discusses differences between…
Reliability testing of two classification systems for osteoarthritis and post-traumatic arthritis of the elbow.

PubMed

Amini, Michael H; Sykes, Joshua B; Olson, Stephen T; Smith, Richard A; Mauck, Benjamin M; Azar, Frederick M; Throckmorton, Thomas W

2015-03-01

The severity of elbow arthritis is one of many factors that surgeons must evaluate when considering treatment options for a given patient. Elbow surgeons have historically used the Broberg and Morrey (BM) and Hastings and Rettig (HR) classification systems to radiographically stage the severity of post-traumatic arthritis (PTA) and primary osteoarthritis (OA). We proposed to compare the intraobserver and interobserver reliability between systems for patients with either PTA or OA. The radiographs of 45 patients were evaluated at least 2 weeks apart by 6 evaluators of different levels of training. Intraobserver and interobserver reliability were calculated by Spearman correlation coefficients with 95% confidence intervals. Agreement was considered almost perfect for coefficients >0.80 and substantial for coefficients of 0.61 to 0.80. In patients with both PTA and OA, intraobserver reliability and interobserver reliability were substantial, with no difference between classification systems. There were no significant differences in intraobserver or interobserver reliability between attending physicians and trainees for either classification system (all P > .10). The presence of fracture implants did not affect reliability in the BM system but did substantially worsen reliability in the HR system (intraobserver P = .04 and interobserver P = .001). The BM and HR classifications both showed substantial intraobserver and interobserver reliability for PTA and OA. Training level differences did not affect reliability for either system. Both trainees and fellowship-trained surgeons may easily and reliably apply each classification system to the evaluation of primary elbow OA and PTA, although the HR system was less reliable in the presence of fracture implants. Copyright © 2015 Journal of Shoulder and Elbow Surgery Board of Trustees. Published by Elsevier Inc. All rights reserved.
Inter-rater Reliability of Sustained Aberrant Movement Patterns as a Clinical Assessment of Muscular Fatigue

PubMed Central

Aerts, Frank; Carrier, Kathy; Alwood, Becky

2016-01-01

Background: The assessment of clinical manifestation of muscle fatigue is an effective procedure in establishing therapeutic exercise dose. Few studies have evaluated physical therapist reliability in establishing muscle fatigue through detection of changes in quality of movement patterns in a live setting. Objective: The purpose of this study is to evaluate the inter-rater reliability of physical therapists’ ability to detect altered movement patterns due to muscle fatigue. Design: A reliability study in a live setting with multiple raters. Participants: Forty-four healthy individuals (ages 19-35) were evaluated by six physical therapists in a live setting. Methods: Participants were evaluated by physical therapists for altered movement patterns during resisted shoulder rotation. Each participant completed a total of four tests: right shoulder internal rotation, right shoulder external rotation, left shoulder internal rotation and left shoulder external rotation. Results: For all tests combined, the inter-rater reliability for a single rater scoring ICC (2,1) was .65 (95%, .60, .71). This corresponds to moderate inter-rater reliability between physical therapists. Limitations: The results of this study apply only to healthy participants and therefore cannot be generalized to a symptomatic population. Conclusion: Moderate inter-rater reliability was found between physical therapists in establishing muscle fatigue through the observation of sustained altered movement patterns during dynamic resistive shoulder internal and external rotation. PMID:27347241
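For readers unfamiliar with the ICC (2,1) notation used above, the following is a minimal sketch of that coefficient in the Shrout and Fleiss form: a two-way random-effects model, single rater, absolute agreement. It assumes a complete matrix with one row per participant and one column per rater; the function name is hypothetical and the study's own software is not stated in the abstract.

```python
def icc_2_1(ratings):
    """ICC(2,1): two-way random effects, single rater, absolute agreement.

    ratings[i][j] = score given to subject i by rater j (no missing cells).
    """
    n = len(ratings)        # subjects
    k = len(ratings[0])     # raters
    grand = sum(sum(row) for row in ratings) / (n * k)
    row_means = [sum(row) / k for row in ratings]
    col_means = [sum(row[j] for row in ratings) / n for j in range(k)]
    # Two-way ANOVA sums of squares
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for row in ratings for x in row)
    ss_error = ss_total - ss_rows - ss_cols
    # Mean squares for subjects, raters and residual
    ms_rows = ss_rows / (n - 1)
    ms_cols = ss_cols / (k - 1)
    ms_error = ss_error / ((n - 1) * (k - 1))
    return (ms_rows - ms_error) / (
        ms_rows + (k - 1) * ms_error + k * (ms_cols - ms_error) / n
    )
```

Because the rater mean square appears in the denominator, systematic differences between raters lower ICC (2,1), which is what distinguishes it from consistency-type coefficients.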
Implicit Review Instrument to Evaluate Quality of Care Delivered by Physicians to Children in Emergency Departments.

PubMed

Marcin, James P; Romano, Patrick S; Dharmar, Madan; Chamberlain, James M; Dudley, Nanette; Macias, Charles G; Nigrovic, Lise E; Powell, Elizabeth C; Rogers, Alexander J; Sonnett, Meridith; Tzimenatos, Leah; Alpern, Elizabeth R; Andrews-Dickert, Rebecca; Borgialli, Dominic A; Sidney, Erika; Casper, Charlie; Dean, Jonathan Michael; Kuppermann, Nathan

2018-06-01

To evaluate the consistency, reliability, and validity of an implicit review instrument that measures the quality of care provided to children in the emergency department (ED). Medical records of randomly selected children from 12 EDs in the Pediatric Emergency Care Applied Research Network (PECARN). Eight pediatric emergency medicine physicians applied the instrument to 620 medical records. We determined internal consistency using Cronbach's alpha and inter-rater reliability using the intraclass correlation coefficient (ICC). We evaluated the validity of the instrument by correlating scores with four condition-specific explicit review instruments. Individual reviewers' Cronbach's alpha had a mean of 0.85 with a range of 0.76-0.97; overall Cronbach's alpha was 0.90. The ICC was 0.49 for the summary score with a range from 0.40 to 0.46. Correlations between the quality of care score and the four condition-specific explicit review scores ranged from 0.24 to 0.38. The quality of care instrument demonstrated good internal consistency, moderate inter-rater reliability, high inter-rater agreement, and evidence supporting validity. The instrument could be useful for systems' assessment and research in evaluating the care delivered to children in the ED. © Health Research and Educational Trust.
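Internal consistency as measured by Cronbach's alpha, reported above, can be sketched as follows. The layout assumed here is one row per reviewed record and one column per instrument item; the function name is hypothetical, and the instrument's actual items are not listed in the abstract.

```python
from statistics import variance

def cronbach_alpha(scores):
    """Cronbach's alpha for a complete subjects-by-items score matrix.

    scores[i][j] = score of subject i on item j.
    """
    k = len(scores[0])  # number of items
    # Sample variance of each item across subjects
    item_variances = [
        variance([row[j] for row in scores]) for j in range(k)
    ]
    # Sample variance of the per-subject total score
    total_variance = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - sum(item_variances) / total_variance)
```

Alpha rises when items covary strongly relative to their individual variances, so a high alpha alongside a moderate ICC, as reported above, is not contradictory: the first concerns agreement among items, the second agreement among reviewers.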
Use of a structured functional evaluation process for independent medical evaluations of claimants presenting with disabling mental illness: rationale and design for a multi-center reliability study.

PubMed

Bachmann, Monica; de Boer, Wout; Schandelmaier, Stefan; Leibold, Andrea; Marelli, Renato; Jeger, Joerg; Hoffmann-Richter, Ulrike; Mager, Ralph; Schaad, Heinz; Zumbrunn, Thomas; Vogel, Nicole; Bänziger, Oskar; Busse, Jason W; Fischer, Katrin; Kunz, Regina

2016-07-29

Work capacity evaluations by independent medical experts are widely used to inform insurers whether injured or ill workers are capable of engaging in competitive employment. In many countries, evaluation processes lack a clearly structured approach, standardized instruments, and an explicit focus on claimants' functional abilities. Evaluation of subjective complaints, such as mental illness, present additional challenges in the determination of work capacity. We have therefore developed a process for functional evaluation of claimants with mental disorders which complements usual psychiatric evaluation. Here we report the design of a study to measure the reliability of our approach in determining work capacity among patients with mental illness applying for disability benefits. We will conduct a multi-center reliability study, in which 20 psychiatrists trained in our functional evaluation process will assess 30 claimants presenting with mental illness for eligibility to receive disability benefits [Reliability of Functional Evaluation in Psychiatry, RELY-study]. The functional evaluation process entails a five-step structured interview and a reporting instrument (Instrument of Functional Assessment in Psychiatry [IFAP]) to document the severity of work-related functional limitations. We will videotape all evaluations which will be viewed by three psychiatrists who will independently rate claimants' functional limitations. Our primary outcome measure is the evaluation of claimant's work capacity as a percentage (0 to 100 %), and our secondary outcomes are the 12 mental functions and 13 functional capacities assessed by the IFAP-instrument. Inter-rater reliability of four psychiatric experts will be explored using multilevel models to estimate the intraclass correlation coefficient (ICC). Additional analyses include subgroups according to mental disorder, the typicality of claimants, and claimant perceived fairness of the assessment process. 
We hypothesize that a structured functional approach will show moderate reliability (ICC ≥ 0.6) of psychiatric evaluation of work capacity. Enrollment of actual claimants with mental disorders referred for evaluation by disability/accident insurers will increase the external validity of our findings. Finding moderate levels of reliability, we will continue with a randomized trial to test the reliability of a structured functional approach versus evaluation-as-usual.
Nutrition Environment Measures Survey in stores (NEMS-S): development and evaluation.

PubMed

Glanz, Karen; Sallis, James F; Saelens, Brian E; Frank, Lawrence D

2007-04-01

Eating, or nutrition, environments are believed to contribute to obesity and chronic diseases. There is a need for valid, reliable measures of nutrition environments. This article reports on the development and evaluation of measures of nutrition environments in retail food stores. The Nutrition Environment Measures Study developed observational measures of the nutrition environment within retail food stores (NEMS-S) to assess availability of healthy options, price, and quality. After pretesting, measures were completed by independent raters to evaluate inter-rater reliability and across two occasions to assess test-retest reliability in grocery and convenience stores in four neighborhoods differing on income and community design in the Atlanta metropolitan area. Data were collected and analyzed in 2004 and 2005. Ten food categories (e.g., fruits) or indicator food items (e.g., ground beef) were evaluated in 85 stores. Inter-rater reliability and test-retest reliability of availability were high: inter-rater reliability kappas were 0.84 to 1.00, and test-retest reliabilities were .73 to 1.00. Inter-rater reliability for quality across fresh produce was moderate (kappas, 0.44 to 1.00). Healthier options were higher priced for hot dogs, lean ground beef, and baked chips. More healthful options were available in grocery than convenience stores and in stores in higher income neighborhoods. The NEMS-S tool was found to have a high degree of inter-rater and test-retest reliability, and to reveal significant differences across store types and neighborhoods of high and low socioeconomic status. These observational measures of nutrition environments can be applied in multilevel studies of community nutrition, and can inform new approaches to conducting and evaluating nutrition interventions.

Lifetime evaluation of large format CMOS mixed signal infrared devices

NASA Astrophysics Data System (ADS)

Linder, A.; Glines, Eddie

2015-09-01

New large scale foundry processes continue to produce reliable products. These new large scale devices continue to use industry best practice to screen for failure mechanisms and validate their long lifetime. The Failure-in-Time analysis in conjunction with foundry qualification information can be used to evaluate large format device lifetimes. This analysis is a helpful tool when zero failure life tests are typical. The reliability of the device is estimated by applying the failure rate to the use conditions. JEDEC publications continue to be the industry accepted methods.
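A Failure-in-Time estimate from a zero-failure life test can be sketched as below. The abstract gives no formulas, so this assumes a common approach: a constant failure rate (exponential model), for which the upper confidence bound with zero failures is -ln(1 - CL) divided by the equivalent device-hours, and an Arrhenius acceleration factor to translate stress-test hours to use conditions. The function names, the activation energy, and the default confidence level are illustrative assumptions, not values from the paper.

```python
import math

BOLTZMANN_EV_PER_K = 8.617333262e-5  # Boltzmann constant in eV/K

def arrhenius_acceleration(ea_ev, use_temp_c, stress_temp_c):
    """Arrhenius acceleration factor between stress and use temperatures."""
    t_use = use_temp_c + 273.15
    t_stress = stress_temp_c + 273.15
    return math.exp(ea_ev / BOLTZMANN_EV_PER_K * (1 / t_use - 1 / t_stress))

def fit_upper_bound_zero_failures(n_devices, test_hours, acceleration,
                                  confidence=0.6):
    """Upper bound on the failure rate in FIT (failures per 1e9 device-hours).

    Assumes zero observed failures and a constant failure rate.
    """
    equivalent_device_hours = n_devices * test_hours * acceleration
    failure_rate = -math.log(1 - confidence) / equivalent_device_hours
    return failure_rate * 1e9

# Hypothetical usage: 77 devices, 1000 h at 125 C, use at 55 C, Ea = 0.7 eV
af = arrhenius_acceleration(0.7, 55.0, 125.0)
fit = fit_upper_bound_zero_failures(77, 1000.0, af)
```

With zero failures the bound depends only on the accumulated equivalent device-hours and the chosen confidence level, which is why the acceleration factor dominates the result.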
Method for evaluating the reliability of compressor impeller of turbocharger for vehicle application in plateau area

NASA Astrophysics Data System (ADS)

Wang, Zheng; Wang, Zengquan; Wang, A.-na; Zhuang, Li; Wang, Jinwei

2016-10-01

As turbocharged diesel engines for vehicle application are increasingly used in plateau areas, the environmental adaptability of these engines has drawn more attention. On this problem, existing studies focus mostly on optimizing the performance match between turbocharger and engine, and the reliability of the turbocharger is largely ignored. This paper studies the reliability of the turbocharger compressor impeller when diesel engines operate in plateau areas. Firstly, the rule by which the rotational speed of the turbocharger changes with altitude is presented, and the potential failure modes of the compressor impeller are analyzed. Then, failure behavior models of the compressor impeller are built, and reliability models of the compressor impeller operating in plateau areas are developed. Finally, the rule by which the reliability of the compressor impeller changes with altitude is studied, and measures for improving the reliability of turbocharger compressor impellers operating in plateau areas are given. The results indicate that, at a fixed diesel engine operating speed, the rotational speed of the turbocharger increases with altitude, and the failure risk of the compressor impeller from hub fatigue and blade resonance increases. The reliability of the compressor impeller decreases as altitude increases and as the number of engine mission profile cycles increases. The proposed method can be used not only to evaluate the reliability of the compressor impeller when diesel engines operate in plateau areas but also to guide the structural optimization of the compressor impeller.
Reliability of the ADI-R for the Single Case-Part II: Clinical versus Statistical Significance

ERIC Educational Resources Information Center

Cicchetti, Domenic V.; Lord, Catherine; Koenig, Kathy; Klin, Ami; Volkmar, Fred R.

2014-01-01

In an earlier investigation, the authors assessed the reliability of the ADI-R when multiple clinicians evaluated a single case, here a female 3 year old toddler suspected of having an autism spectrum disorder (Cicchetti et al. in "J Autism Dev Disord" 38:764-770, 2008). Applying the clinical criteria of Cicchetti and Sparrow ("Am J…
DOE Office of Scientific and Technical Information (OSTI.GOV)

Kamiya, Shoji; Sato, Hisashi; Nishida, Masahiro

Reliability of electronic devices has been an issue of serious importance. One of the potential factors to spoil the reliability is possible local drops of strength on the interface of multilayered structure. A new technique for the evaluation of local interface adhesion energy was applied to the interface between Cu and cap layer in a Cu damascene interconnect structure, in order to elucidate variation in adhesion strength as a function of measurement location.
A Performance-Based Method of Student Evaluation

ERIC Educational Resources Information Center

Nelson, G. E.; And Others

1976-01-01

The Problem Oriented Medical Record (which allows practical definition of the behavioral terms thoroughness, reliability, sound analytical sense, and efficiency as they apply to the identification and management of patient problems) provides a vehicle to use in performance based type evaluation. A test-run use of the record is reported. (JT)
DG Planning with Amalgamation of Operational and Reliability Considerations

NASA Astrophysics Data System (ADS)

Battu, Neelakanteshwar Rao; Abhyankar, A. R.; Senroy, Nilanjan

2016-04-01

Distributed generation (DG) plays a vital role in dealing with issues related to distribution systems. This paper presents an approach that provides policymakers with a set of solutions for DG placement to optimize reliability and real power loss of the system. The optimal location of a distributed generator is evaluated based on performance indices derived for the reliability index and real power loss. The proposed approach is applied to a 15-bus radial distribution system and an 18-bus radial distribution system with conventional and wind distributed generators individually.
A new method for computing the reliability of consecutive k-out-of-n:F systems

NASA Astrophysics Data System (ADS)

Gökdere, Gökhan; Gürcan, Mehmet; Kılıç, Muhammet Burak

2016-01-01

Consecutive k-out-of-n system models have been applied to reliability evaluation in many physical systems, such as those encountered in telecommunications, the design of integrated circuits, microwave relay stations, oil pipeline systems, vacuum systems in accelerators, computer ring networks, and spacecraft relay stations. These systems are characterized as logical connections among the components of the systems placed in lines or circles. In the literature, a great deal of attention has been paid to the reliability evaluation of consecutive k-out-of-n systems. In this paper, we propose a new method to compute the reliability of consecutive k-out-of-n:F systems with n linearly and circularly arranged components. The proposed method provides a simple way of determining the system failure probability. We also write R-Project code based on the proposed method to compute the reliability of linear and circular systems that have a large number of components.
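To make the system definition concrete: a linear consecutive k-out-of-n:F system fails if and only if at least k adjacent components fail. The sketch below computes its reliability with a standard dynamic program over the length of the current run of failed components. This is not the new method proposed in the paper, whose details are not given in the abstract. It assumes independent components, handles only the linear arrangement, and uses a hypothetical function name.

```python
def linear_consecutive_k_out_of_n_f(reliabilities, k):
    """Reliability of a linear consecutive k-out-of-n:F system.

    reliabilities -- working probability of each component, in line order
    k             -- number of consecutive failures that fails the system
    Components are assumed to fail independently.
    """
    # state[j] = P(system has not failed and the last j components failed)
    state = [1.0] + [0.0] * (k - 1)
    for p in reliabilities:
        q = 1.0 - p
        new_state = [0.0] * k
        # A working component resets the failure run to zero
        new_state[0] = p * sum(state)
        # A failed component extends the run; a run of length k is dropped
        for j in range(1, k):
            new_state[j] = q * state[j - 1]
        state = new_state
    return sum(state)
```

The cost is proportional to n times k, so systems with a large number of components remain tractable. A circular arrangement needs extra bookkeeping for runs that wrap around the ends and is not covered here.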
The weakest t-norm based intuitionistic fuzzy fault-tree analysis to evaluate system reliability.

PubMed

Kumar, Mohit; Yadav, Shiv Prasad

2012-07-01

In this paper, a new approach of intuitionistic fuzzy fault-tree analysis is proposed to evaluate system reliability and to find the most critical system component that affects the system reliability. Here weakest t-norm based intuitionistic fuzzy fault tree analysis is presented to calculate the fault interval of system components by integrating experts' knowledge and experience in terms of providing the possibility of failure of bottom events. It applies fault-tree analysis, α-cut of intuitionistic fuzzy set and T(ω) (the weakest t-norm) based arithmetic operations on triangular intuitionistic fuzzy sets to obtain the fault interval and reliability interval of the system. This paper also modifies Tanaka et al.'s fuzzy fault-tree definition. In numerical verification, a malfunction of the weapon system "automatic gun" is presented as a numerical example. The result of the proposed method is compared with existing reliability analysis methods. Copyright © 2012 ISA. Published by Elsevier Ltd. All rights reserved.
Reviews of Single Subject Research Designs: Applications to Special Education and School Psychology

ERIC Educational Resources Information Center

Nevin, Ann I., Ed.

2004-01-01

The authors of this collection of research reviews studied how single subject research designs might be a useful method to apply as part of being accountable to clients. The single subject research studies were evaluated in accordance with the following criteria: Was the study applied, behavioral, reliable, analytic, effective, and generalizable?…
Development of Reliable and Validated Tools to Evaluate Technical Resuscitation Skills in a Pediatric Simulation Setting: Resuscitation and Emergency Simulation Checklist for Assessment in Pediatrics.

PubMed

Faudeux, Camille; Tran, Antoine; Dupont, Audrey; Desmontils, Jonathan; Montaudié, Isabelle; Bréaud, Jean; Braun, Marc; Fournier, Jean-Paul; Bérard, Etienne; Berlengi, Noémie; Schweitzer, Cyril; Haas, Hervé; Caci, Hervé; Gatin, Amélie; Giovannini-Chami, Lisa

2017-09-01

To develop a reliable and validated tool to evaluate technical resuscitation skills in a pediatric simulation setting. Four Resuscitation and Emergency Simulation Checklist for Assessment in Pediatrics (RESCAPE) evaluation tools were created, following international guidelines: intraosseous needle insertion, bag mask ventilation, endotracheal intubation, and cardiac massage. We applied a modified Delphi methodology evaluation to binary rating items. Reliability was assessed comparing the ratings of 2 observers (1 in real time and 1 after a video-recorded review). The tools were assessed for content, construct, and criterion validity, and for sensitivity to change. Inter-rater reliability, evaluated with Cohen kappa coefficients, was perfect or near-perfect (>0.8) for 92.5% of items and each Cronbach alpha coefficient was ≥0.91. Principal component analyses showed that all 4 tools were unidimensional. Significant increases in median scores with increasing levels of medical expertise were demonstrated for RESCAPE-intraosseous needle insertion (P = .0002), RESCAPE-bag mask ventilation (P = .0002), RESCAPE-endotracheal intubation (P = .0001), and RESCAPE-cardiac massage (P = .0037). Significantly increased median scores over time were also demonstrated during a simulation-based educational program. RESCAPE tools are reliable and validated tools for the evaluation of technical resuscitation skills in pediatric settings during simulation-based educational programs. They might also be used for medical practice performance evaluations. Copyright © 2017 Elsevier Inc. All rights reserved.
Reliable change, sensitivity, and specificity of a multidimensional concussion assessment battery: implications for caution in clinical practice.

PubMed

Register-Mihalik, Johna K; Guskiewicz, Kevin M; Mihalik, Jason P; Schmidt, Julianne D; Kerr, Zachary Y; McCrea, Michael A

2013-01-01

To provide reliable change confidence intervals for common clinical concussion measures using a healthy sample of collegiate athletes and to apply these reliable change parameters to a sample of concussed collegiate athletes. Two independent samples were included in the study and evaluated on common clinical measures of concussion. The healthy sample included male, collegiate football student-athletes (n = 38) assessed at 2 time points. The concussed sample included college-aged student-athletes (n = 132) evaluated before and after a concussion. Outcome measures included symptom severity scores, Automated Neuropsychological Assessment Metrics throughput scores, and Sensory Organization Test composite scores. Application of the reliable change parameters suggests that a small percentage of concussed participants were impaired on each measure. We identified a low sensitivity of the entire battery (all measures combined) of 50% but high specificity of 96%. Clinicians should be trained in understanding clinical concussion measures and should be aware of evidence suggesting the multifaceted battery is more sensitive than any single measure. Clinicians should be cautioned that sensitivity to balance and neurocognitive impairments was low for each individual measure. Applying the confidence intervals to our injured sample suggests that these measures do not adequately identify postconcussion impairments when used in isolation.
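A reliable change confidence interval of the kind described above is often built from the baseline standard deviation and the test-retest reliability of the measure. The sketch below assumes the widely used Jacobson and Truax formulation (standard error of the difference equal to the square root of 2 times the standard error of measurement); the abstract does not state which formulation the authors used, and the function names and default z value are hypothetical.

```python
import math

def reliable_change_half_width(baseline_sd, test_retest_r, z=1.96):
    """Half-width of the reliable change confidence interval.

    Assumes the Jacobson-Truax formulation; z=1.96 gives a 95% interval.
    """
    sem = baseline_sd * math.sqrt(1 - test_retest_r)  # std. error of measurement
    se_diff = math.sqrt(2) * sem                      # std. error of a difference
    return z * se_diff

def is_reliable_change(baseline, follow_up, baseline_sd, test_retest_r, z=1.96):
    """True if the change exceeds what measurement error alone would explain."""
    half_width = reliable_change_half_width(baseline_sd, test_retest_r, z)
    return abs(follow_up - baseline) > half_width
```

A measure with low test-retest reliability yields a wide interval, so only large post-injury changes are flagged. That is consistent with the low single-measure sensitivity the authors report.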
Noninvasive identification of the total peripheral resistance baroreflex

NASA Technical Reports Server (NTRS)

Mukkamala, Ramakrishna; Toska, Karin; Cohen, Richard J.

2003-01-01

We propose two identification algorithms for quantitating the total peripheral resistance (TPR) baroreflex, an important contributor to short-term arterial blood pressure (ABP) regulation. Each algorithm analyzes beat-to-beat fluctuations in ABP and cardiac output, which may both be obtained noninvasively in humans. For a theoretical evaluation, we applied both algorithms to a realistic cardiovascular model. The results contrasted with only one of the algorithms proving to be reliable. This algorithm was able to track changes in the static gains of both the arterial and cardiopulmonary TPR baroreflex. We then applied both algorithms to a preliminary set of human data and obtained contrasting results much like those obtained from the cardiovascular model, thereby making the theoretical evaluation results more meaningful. This study suggests that, with experimental testing, the reliable identification algorithm may provide a powerful, noninvasive means for quantitating the TPR baroreflex. This study also provides an example of the role that models can play in the development and initial evaluation of algorithms aimed at quantitating important physiological mechanisms.
Reliability of Lactation Assessment Tools Applied to Overweight and Obese Women.

PubMed

Chapman, Donna J; Doughty, Katherine; Mullin, Elizabeth M; Pérez-Escamilla, Rafael

2016-05-01

The interrater reliability of lactation assessment tools has not been evaluated in overweight/obese women. This study aimed to compare the interrater reliability of 4 lactation assessment tools in this population. A convenience sample of 45 women (body mass index > 27.0) was videotaped while breastfeeding (twice daily on days 2, 4, and 7 postpartum). Three International Board Certified Lactation Consultants independently rated each videotaped session using 4 tools (Infant Breastfeeding Assessment Tool [IBFAT], modified LATCH [mLATCH], modified Via Christi [mVC], and Riordan's Tool [RT]). For each day and tool, we evaluated interrater reliability with 1-way repeated-measures analyses of variance, intraclass correlation coefficients (ICCs), and percentage absolute agreement between raters. Analyses of variance showed significant differences between raters' scores on day 2 (all scales) and day 7 (RT). Intraclass correlation coefficient values reflected good (mLATCH) to excellent reliability (IBFAT, mVC, and RT) on days 2 and 7. All day 4 ICCs reflected good reliability. The ICC for mLATCH was significantly lower than all others on day 2 and was significantly lower than IBFAT (day 7). Percentage absolute interrater agreement for scale components ranged from 31% (day 2: observable swallowing, RT) to 92% (day 7: IBFAT, fixing; and mVC, latch time). Swallowing scores on all scales had the lowest levels of interrater agreement (31%-64%). We demonstrated differences in the interrater reliability of 4 lactation assessment tools when applied to overweight/obese women, with the lowest values observed on day 4. Swallowing assessment was particularly unreliable. Researchers and clinicians using these scales should be aware of the differences in their psychometric behavior. © The Author(s) 2015.
Evaluating the Performance of the IEEE Standard 1366 Method for Identifying Major Event Days

DOE Office of Scientific and Technical Information (OSTI.GOV)

Eto, Joseph H.; LaCommare, Kristina Hamachi; Sohn, Michael D.

IEEE Standard 1366 offers a method for segmenting reliability performance data to isolate the effects of major events from the underlying year-to-year trends in reliability. Recent analysis by the IEEE Distribution Reliability Working Group (DRWG) has found that reliability performance of some utilities differs from the expectations that helped guide the development of the Standard 1366 method. This paper proposes quantitative metrics to evaluate the performance of the Standard 1366 method in identifying major events and in reducing year-to-year variability in utility reliability. The metrics are applied to a large sample of utility-reported reliability data to assess performance of the method with alternative specifications that have been considered by the DRWG. We find that none of the alternatives perform uniformly 'better' than the current Standard 1366 method. That is, none of the modifications uniformly lowers the year-to-year variability in System Average Interruption Duration Index without major events. Instead, for any given alternative, while it may lower the value of this metric for some utilities, it also increases it for other utilities (sometimes dramatically). Thus, we illustrate some of the trade-offs that must be considered in using the Standard 1366 method and highlight the usefulness of the metrics we have proposed in conducting these evaluations.
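The abstract does not spell out the segmentation rule. As commonly described, the Standard 1366 approach (often called the 2.5 beta method) sets a major event day threshold from the mean and standard deviation of the natural logarithms of daily SAIDI values. The sketch below assumes that form, assumes the sample standard deviation, and uses hypothetical data and names.

```python
import math
import statistics


def major_event_day_threshold(daily_saidi, k=2.5):
    """Threshold T_MED = exp(alpha + k * beta) on daily SAIDI values.

    alpha and beta are the mean and (assumed sample) standard deviation of
    the natural logs of the non-zero daily SAIDI values.
    """
    logs = [math.log(v) for v in daily_saidi if v > 0]
    if len(logs) < 2:
        raise ValueError("need at least two non-zero daily SAIDI values")
    alpha = statistics.mean(logs)
    beta = statistics.stdev(logs)
    return math.exp(alpha + k * beta)


def split_major_event_days(daily_saidi, threshold):
    """Separate days above the threshold from ordinary days."""
    major = [v for v in daily_saidi if v > threshold]
    normal = [v for v in daily_saidi if v <= threshold]
    return major, normal


# Hypothetical daily SAIDI history (minutes per customer), with one storm day.
history = [0.8, 1.1, 0.9, 1.3, 0.7, 1.0, 1.2, 0.95, 1.05, 60.0]
t_med = major_event_day_threshold(history)
major_days, normal_days = split_major_event_days(history, t_med)
print(len(major_days), len(normal_days))
```

The alternatives discussed in the abstract can be thought of as changes to this rule (for example a different multiplier k), which is why a change can help some utilities while hurting others.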
Development Of Methodologies Using PhabrOmeter For Fabric Drape Evaluation

NASA Astrophysics Data System (ADS)

Lin, Chengwei

Evaluation of fabric drape is important for the textile industry as it reveals the aesthetics and functionality of cloth and apparel. Although many fabric drape measuring methods have been developed over several decades, they are falling behind the industry's need for fast product development. To meet this need, it is necessary to develop an effective and reliable method to evaluate fabric drape. The purpose of the present study is to determine whether the PhabrOmeter can be applied to fabric drape evaluation. The PhabrOmeter is a fabric sensory performance evaluation instrument developed to provide fast and reliable quality testing results. This study sought to determine the relationship between fabric drape and other fabric attributes. In addition, a series of conventional methods including AATCC, ASTM and ISO standards were used to characterize the fabric samples. All the data were compared and analyzed with a linear correlation method. The results indicate that the PhabrOmeter is a reliable and effective instrument for fabric drape evaluation. In addition, factors including fabric structure and testing direction were examined for their impact on fabric drape.
Quantifying Children's Aggregate (Dietary and Residential) Exposure and Dose to Permethrin: Application and Evaluation of EPA's Probabilistic SHEDS-Multimedia Model

EPA Science Inventory

Reliable, evaluated human exposure and dose models are important for understanding the health risks from chemicals. A case study focusing on permethrin was conducted because of this insecticide’s widespread use and potential health effects. SHEDS-Multimedia was applied to estimat...
Reliability verification of vehicle speed estimate method in forensic videos.

PubMed

Kim, Jong-Hyuk; Oh, Won-Taek; Choi, Ji-Hun; Park, Jong-Chan

2018-06-01

In various types of traffic accidents, including car-to-car crash, vehicle-pedestrian collision, and hit-and-run accident, driver overspeed is one of the critical issues of traffic accident analysis. Hence, analysis of vehicle speed at the moment of accident is necessary. The present article proposes a vehicle speed estimate method (VSEM) applying a virtual plane and a virtual reference line to a forensic video. The reliability of the VSEM was verified by comparing the results obtained by applying the VSEM to videos from a test vehicle driving with a global positioning system (GPS)-based Vbox speed. The VSEM verified by these procedures was applied to real traffic accident examples to evaluate the usability of the VSEM. Copyright © 2018 Elsevier B.V. All rights reserved.
Comment on Hall et al. (2017), "How to Choose Between Measures of Tinnitus Loudness for Clinical Research? A Report on the Reliability and Validity of an Investigator-Administered Test and a Patient-Reported Measure Using Baseline Data Collected in a Phase IIa Drug Trial".

PubMed

Sabour, Siamak

2018-03-08

The purpose of this letter, in response to Hall, Mehta, and Fackrell (2017), is to provide important knowledge about methodology and statistical issues in assessing the reliability and validity of an audiologist-administered tinnitus loudness matching test and a patient-reported tinnitus loudness rating. The author uses reference textbooks and published articles regarding scientific assessment of the validity and reliability of a clinical test to discuss the statistical test and the methodological approach in assessing validity and reliability in clinical research. Depending on the type of the variable (qualitative or quantitative), well-known statistical tests can be applied to assess reliability and validity. The qualitative variables of sensitivity, specificity, positive predictive value, negative predictive value, false positive and false negative rates, likelihood ratio positive and likelihood ratio negative, as well as odds ratio (i.e., ratio of true to false results), are the most appropriate estimates to evaluate validity of a test compared to a gold standard. In the case of quantitative variables, depending on distribution of the variable, Pearson r or Spearman rho can be applied. Diagnostic accuracy (validity) and diagnostic precision (reliability or agreement) are two completely different methodological issues. Depending on the type of the variable (qualitative or quantitative), well-known statistical tests can be applied to assess validity.
Measurement of the Inter-Rater Reliability Rate Is Mandatory for Improving the Quality of a Medical Database: Experience with the Paulista Lung Cancer Registry.

PubMed

Lauricella, Leticia L; Costa, Priscila B; Salati, Michele; Pego-Fernandes, Paulo M; Terra, Ricardo M

2018-06-01

Database quality measurement should be considered a mandatory step to ensure an adequate level of confidence in data used for research and quality improvement. Several metrics have been described in the literature, but no standardized approach has been established. We aimed to describe a methodological approach applied to measure the quality and inter-rater reliability of a regional multicentric thoracic surgical database (Paulista Lung Cancer Registry). Data from the first 3 years of the Paulista Lung Cancer Registry underwent an audit process with 3 metrics: completeness, consistency, and inter-rater reliability. The first 2 methods were applied to the whole data set, and the last method was calculated using 100 cases randomized for direct auditing. Inter-rater reliability was evaluated using percentage of agreement between the data collector and auditor and through calculation of Cohen's κ and intraclass correlation. The overall completeness per section ranged from 0.88 to 1.00, and the overall consistency was 0.96. Inter-rater reliability showed many variables with high disagreement (>10%). For numerical variables, intraclass correlation was a better metric than inter-rater reliability. Cohen's κ showed that most variables had moderate to substantial agreement. The methodological approach applied to the Paulista Lung Cancer Registry showed that completeness and consistency metrics did not sufficiently reflect the real quality status of a database. The inter-rater reliability associated with κ and intraclass correlation was a better quality metric than completeness and consistency metrics because it could determine the reliability of specific variables used in research or benchmark reports. This report can be a paradigm for future studies of data quality measurement. Copyright © 2018 American College of Surgeons. Published by Elsevier Inc. All rights reserved.
An Evaluative Measure for Outputs in Student-Run Public Relations Firms and Applied Courses

ERIC Educational Resources Information Center

Deemer, Rebecca A.

2012-01-01

A valid, reliable survey instrument was created to be used by public relations student-run firms and other applied public relations courses to gauge client satisfaction. A series of focus groups and pilot tests were conducted to ascertain themes, refine questions, and then to refine the entire instrument. Six constructs to be measured, including…

Assessment and Evaluation.

ERIC Educational Resources Information Center

Bachman, Lyle F.

1989-01-01

Applied linguistics and psychometrics have influenced language testing, providing additional tools for investigating factors affecting language test performance and assuring measurement reliability. An examination is presented of language testing, including the theoretical issues involved, the methodological advances, language test development,…
Reliability and validity of the test of gross motor development-II in Korean preschool children: applying AHP.

PubMed

Kim, Chung-Il; Han, Dong-Wook; Park, Il-Hyeok

2014-04-01

The Test of Gross Motor Development-II (TGMD-II) is a frequently used assessment tool for measuring motor ability. The purpose of this study is to investigate the reliability and validity of TGMD-II's weighting scores (by comparing pre-weighted TGMD-II scores with post ones) as well as examine applicability of the TGMD-II on Korean preschool children. A total of 121 Korean children (three kindergartens) participated in this study. There were 65 preschoolers who were 5-years-old (37 boys and 28 girls) and 56 preschoolers who were 6-years-old (34 boys and 22 girls). For internal consistency, reliability, and construct validity, only one researcher evaluated all of the children using the TGMD-II in the following areas: running; galloping; sliding; hopping; leaping; horizontal jumping; overhand throwing; underhand rolling; striking a stationary ball; stationary dribbling; kicking; and catching. For concurrent validity, the evaluator measured physical fitness (strength, flexibility, power, agility, endurance, and balance). The key findings were as follows: first, the reliability coefficient and the validity coefficient between pre-weighted and post-weighted TGMD-II scores were quite similar. Second, the research showed adequate reliability and validity of the TGMD-II for Korean preschool children. The TGMD-II is a proper instrument to test Korean children's motor development. Yet, applying relative weighting on the TGMD-II should be a point of consideration. Copyright © 2014 Elsevier Ltd. All rights reserved.
Interrater Reliability of mHealth App Rating Measures: Analysis of Top Depression and Smoking Cessation Apps.

PubMed

Powell, Adam C; Torous, John; Chan, Steven; Raynor, Geoffrey Stephen; Shwarts, Erik; Shanahan, Meghan; Landman, Adam B

2016-02-10

There are over 165,000 mHealth apps currently available to patients, but few have undergone an external quality review. Furthermore, no standardized review method exists, and little has been done to examine the consistency of the evaluation systems themselves. We sought to determine which measures for evaluating the quality of mHealth apps have the greatest interrater reliability. We identified 22 measures for evaluating the quality of apps from the literature. A panel of 6 reviewers reviewed the top 10 depression apps and 10 smoking cessation apps from the Apple iTunes App Store on these measures. Krippendorff's alpha was calculated for each of the measures and reported by app category and in aggregate. The measure for interactiveness and feedback was found to have the greatest overall interrater reliability (alpha=.69). Presence of password protection (alpha=.65), whether the app was uploaded by a health care agency (alpha=.63), the number of consumer ratings (alpha=.59), and several other measures had moderate interrater reliability (alphas>.5). There was the least agreement over whether apps had errors or performance issues (alpha=.15), stated advertising policies (alpha=.16), and were easy to use (alpha=.18). There were substantial differences in the interrater reliabilities of a number of measures when they were applied to depression versus smoking apps. We found wide variation in the interrater reliability of measures used to evaluate apps, and some measures are more robust across categories of apps than others. The measures with the highest degree of interrater reliability tended to be those that involved the least rater discretion. Clinical quality measures such as effectiveness, ease of use, and performance had relatively poor interrater reliability. Subsequent research is needed to determine consistent means for evaluating the performance of apps. 
Patients and clinicians should consider conducting their own assessments of apps, in conjunction with evaluating information from reviews.
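For readers unfamiliar with the agreement statistic used in this study, the following is a minimal sketch of Krippendorff's alpha for nominal ratings, built from the coincidence counts of rating pairs within each rated unit. It is an illustrative implementation under simplifying assumptions (nominal data only); the function name and sample data are hypothetical.

```python
from collections import Counter
from itertools import permutations


def krippendorff_alpha_nominal(units):
    """Krippendorff's alpha for nominal data.

    units: list of lists; each inner list holds the ratings one unit (for
    example one app) received. Missing ratings are simply left out.
    """
    coincidences = Counter()
    for values in units:
        m = len(values)
        if m < 2:
            continue  # a unit with fewer than two ratings cannot be paired
        for a, b in permutations(values, 2):
            coincidences[(a, b)] += 1.0 / (m - 1)
    totals = Counter()
    for (a, _), weight in coincidences.items():
        totals[a] += weight
    n = sum(totals.values())
    if n < 2:
        raise ValueError("need at least two pairable ratings")
    # Observed and expected disagreement (off-diagonal coincidences).
    observed = sum(w for (a, b), w in coincidences.items() if a != b)
    expected = sum(
        totals[a] * totals[b] for a in totals for b in totals if a != b
    ) / (n - 1)
    if expected == 0:
        return 1.0  # only one category was ever used
    return 1.0 - observed / expected


# Hypothetical yes/no ratings from two reviewers on four apps.
ratings = [[1, 1], [1, 0], [0, 0], [0, 0]]
print(round(krippendorff_alpha_nominal(ratings), 3))
```

Unlike Cohen's kappa, this statistic extends naturally to more than two raters and tolerates missing ratings, which suits a panel of several reviewers.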
Interrater Reliability of mHealth App Rating Measures: Analysis of Top Depression and Smoking Cessation Apps

PubMed Central

Chan, Steven; Raynor, Geoffrey Stephen; Shwarts, Erik; Shanahan, Meghan; Landman, Adam B

2016-01-01

Background There are over 165,000 mHealth apps currently available to patients, but few have undergone an external quality review. Furthermore, no standardized review method exists, and little has been done to examine the consistency of the evaluation systems themselves. Objective We sought to determine which measures for evaluating the quality of mHealth apps have the greatest interrater reliability. Methods We identified 22 measures for evaluating the quality of apps from the literature. A panel of 6 reviewers reviewed the top 10 depression apps and 10 smoking cessation apps from the Apple iTunes App Store on these measures. Krippendorff’s alpha was calculated for each of the measures and reported by app category and in aggregate. Results The measure for interactiveness and feedback was found to have the greatest overall interrater reliability (alpha=.69). Presence of password protection (alpha=.65), whether the app was uploaded by a health care agency (alpha=.63), the number of consumer ratings (alpha=.59), and several other measures had moderate interrater reliability (alphas>.5). There was the least agreement over whether apps had errors or performance issues (alpha=.15), stated advertising policies (alpha=.16), and were easy to use (alpha=.18). There were substantial differences in the interrater reliabilities of a number of measures when they were applied to depression versus smoking apps. Conclusions We found wide variation in the interrater reliability of measures used to evaluate apps, and some measures are more robust across categories of apps than others. The measures with the highest degree of interrater reliability tended to be those that involved the least rater discretion. Clinical quality measures such as effectiveness, ease of use, and performance had relatively poor interrater reliability. Subsequent research is needed to determine consistent means for evaluating the performance of apps. 
Patients and clinicians should consider conducting their own assessments of apps, in conjunction with evaluating information from reviews. PMID:26863986
[Reliability and validity of the Chinese version on Comprehensive Scores for Financial Toxicity based on the patient-reported outcome measures].

PubMed

Yu, H H; Bi, X; Liu, Y Y

2017-08-10

Objective: To evaluate the reliability and validity of the Chinese version of the Comprehensive Score for Financial Toxicity (COST), based on patient-reported outcome measures. Methods: A total of 118 cancer patients were interviewed face-to-face by well-trained investigators. Cronbach's α and the Pearson correlation coefficient were used to evaluate reliability. The content validity index (CVI) and exploratory factor analysis (EFA) were used to evaluate content validity and construct validity, respectively. Results: Cronbach's α coefficient was 0.889 for the whole questionnaire, and test-retest results were between 0.77 and 0.98. The scale-content validity index (S-CVI) was 0.82, with item-content validity indices (I-CVI) between 0.83 and 1.00. Two components were extracted by the exploratory factor analysis, with a cumulative rate of 68.04% and a loading >0.60 on every item. Conclusion: The Chinese version of the COST scale showed high reliability and good validity and can therefore be applied to assess the financial situation of cancer patients.
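To make the internal consistency statistic reported here concrete, the sketch below computes Cronbach's alpha from a respondents-by-items score table. The function name and the example scores are hypothetical and unrelated to the COST data.

```python
import statistics


def cronbach_alpha(scores):
    """Cronbach's alpha; scores is a list of respondents, each a list of item scores."""
    k = len(scores[0])
    if k < 2 or len(scores) < 2:
        raise ValueError("need at least two items and two respondents")
    # Sample variance of each item across respondents.
    item_variances = [
        statistics.variance([row[i] for row in scores]) for i in range(k)
    ]
    # Sample variance of each respondent's total score.
    total_variance = statistics.variance([sum(row) for row in scores])
    if total_variance == 0:
        raise ValueError("total scores have zero variance")
    return (k / (k - 1)) * (1 - sum(item_variances) / total_variance)


# Hypothetical answers from five respondents to a four-item scale.
answers = [
    [3, 4, 3, 4],
    [2, 2, 3, 2],
    [4, 4, 4, 5],
    [1, 2, 1, 2],
    [3, 3, 4, 3],
]
print(round(cronbach_alpha(answers), 3))
```

Alpha approaches 1 when items rise and fall together across respondents, so a high value indicates consistency among items rather than validity of the scale.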
On the Effectiveness of Nature-Inspired Metaheuristic Algorithms for Performing Phase Equilibrium Thermodynamic Calculations

PubMed Central

Fateen, Seif-Eddeen K.; Bonilla-Petriciolet, Adrian

2014-01-01

The search for reliable and efficient global optimization algorithms for solving phase stability and phase equilibrium problems in applied thermodynamics is an ongoing area of research. In this study, we evaluated and compared the reliability and efficiency of eight selected nature-inspired metaheuristic algorithms for solving difficult phase stability and phase equilibrium problems. These algorithms are the cuckoo search (CS), intelligent firefly (IFA), bat (BA), artificial bee colony (ABC), MAKHA, a hybrid between monkey algorithm and krill herd algorithm, covariance matrix adaptation evolution strategy (CMAES), magnetic charged system search (MCSS), and bare bones particle swarm optimization (BBPSO). The results clearly showed that CS is the most reliable of all methods as it successfully solved all thermodynamic problems tested in this study. CS proved to be a promising nature-inspired optimization method to perform applied thermodynamic calculations for process design. PMID:24967430
On the effectiveness of nature-inspired metaheuristic algorithms for performing phase equilibrium thermodynamic calculations.

PubMed

Fateen, Seif-Eddeen K; Bonilla-Petriciolet, Adrian

2014-01-01

The search for reliable and efficient global optimization algorithms for solving phase stability and phase equilibrium problems in applied thermodynamics is an ongoing area of research. In this study, we evaluated and compared the reliability and efficiency of eight selected nature-inspired metaheuristic algorithms for solving difficult phase stability and phase equilibrium problems. These algorithms are the cuckoo search (CS), intelligent firefly (IFA), bat (BA), artificial bee colony (ABC), MAKHA, a hybrid between monkey algorithm and krill herd algorithm, covariance matrix adaptation evolution strategy (CMAES), magnetic charged system search (MCSS), and bare bones particle swarm optimization (BBPSO). The results clearly showed that CS is the most reliable of all methods as it successfully solved all thermodynamic problems tested in this study. CS proved to be a promising nature-inspired optimization method to perform applied thermodynamic calculations for process design.
Probabilistic risk assessment for a loss of coolant accident in McMaster Nuclear Reactor and application of reliability physics model for modeling human reliability

NASA Astrophysics Data System (ADS)

Ha, Taesung

A probabilistic risk assessment (PRA) was conducted for a loss of coolant accident (LOCA) in the McMaster Nuclear Reactor (MNR). A level 1 PRA was completed including event sequence modeling, system modeling, and quantification. To support the quantification of the accident sequence identified, data analysis using the Bayesian method and human reliability analysis (HRA) using the accident sequence evaluation procedure (ASEP) approach were performed. Since human performance in research reactors is significantly different from that in power reactors, a time-oriented HRA model (reliability physics model) was applied for the human error probability (HEP) estimation of the core relocation. This model is based on two competing random variables: phenomenological time and performance time. The response surface and direct Monte Carlo simulation with Latin Hypercube sampling were applied for estimating the phenomenological time, whereas the performance time was obtained from interviews with operators. An appropriate probability distribution for the phenomenological time was assigned by statistical goodness-of-fit tests. The human error probability (HEP) for the core relocation was estimated from these two competing quantities: phenomenological time and operators' performance time. The sensitivity of each probability distribution in human reliability estimation was investigated. In order to quantify the uncertainty in the predicted HEPs, a Bayesian approach was selected due to its capability of incorporating uncertainties in the model itself and the parameters in that model. The HEP from the current time-oriented model was compared with that from the ASEP approach. Both results were used to evaluate the sensitivity of alternative human reliability modeling for the manual core relocation in the LOCA risk model. This exercise demonstrated the applicability of a reliability physics model supplemented with a Bayesian approach for modeling human reliability and its potential usefulness for quantifying model uncertainty as sensitivity analysis in the PRA model.
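The idea of two competing random variables can be sketched as a simple Monte Carlo estimate: the human error probability is the chance that the operators' performance time exceeds the phenomenological (available) time. The sketch below uses plain random sampling rather than the Latin Hypercube sampling mentioned in the abstract, and the lognormal distributions and their parameters are purely hypothetical.

```python
import math
import random


def estimate_hep(sample_available, sample_performance, n_trials=100_000, seed=0):
    """Estimate P(performance time > available time) by direct sampling."""
    rng = random.Random(seed)
    failures = 0
    for _ in range(n_trials):
        if sample_performance(rng) > sample_available(rng):
            failures += 1
    return failures / n_trials


# Hypothetical distributions (minutes); not taken from the MNR study.
def available_time(rng):
    return rng.lognormvariate(math.log(30.0), 0.3)


def performance_time(rng):
    return rng.lognormvariate(math.log(12.0), 0.5)


hep = estimate_hep(available_time, performance_time)
print(f"estimated HEP: {hep:.4f}")
```

Because both times are sampled rather than fixed, the sensitivity of the result to each assumed distribution can be explored by swapping in alternative sampling functions.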
[Validation of a knowledge-questionnaire about asthma applied to teachers of elementary school of Monterrey, Mexico].

PubMed

González Diaz, Sandra Nora; Cruz, Alfredo Arias; González González, Arya Yannel; Félix Berumen, José Alfredo; Weinmann, Alejandra Macías

2010-01-01

Asthma is one of the most common chronic childhood diseases; it is increasing in prevalence and is an important cause of school absenteeism. Previous studies have failed to evaluate knowledge about asthma among elementary school teachers worldwide because of the lack of validated questionnaires. To validate a questionnaire about asthma knowledge for elementary school teachers in Monterrey, Nuevo Leon. An observational, cross-sectional, descriptive study was conducted from February to December 2004 by applying a questionnaire to a group of elementary school teachers in Monterrey, Nuevo Leon. The questionnaire is a translation and adaptation of the 13-question questionnaire used to assess knowledge about asthma among parents, according to the National Asthma Education Program of the US. A total of 179 questionnaires were applied, in which 6 of the 13 questions were answered correctly by more than 90% of the teachers. The internal consistency reliability was adequate, with a Cronbach α coefficient of 0.75. In order to obtain reliable data using questionnaires, these must undergo a validation process. Our questionnaire was validated on the basis of the reliability shown by the internal consistency analysis.
A novel evaluation strategy for fatigue reliability of flexible nanoscale films

NASA Astrophysics Data System (ADS)

Zheng, Si-Xue; Luo, Xue-Mei; Wang, Dong; Zhang, Guang-Ping

2018-03-01

In order to evaluate the fatigue reliability of nanoscale metal films on flexible substrates, we propose an effective evaluation method to obtain the critical fatigue cracking strain based on direct observation of fatigue damage sites using a conventional dynamic bending testing technique. By this method, the fatigue properties and damage behaviors of 930 nm-thick Au films and 600 nm-thick Mo-W multilayers with an individual layer thickness of 100 nm on flexible polyimide substrates were investigated. A Coffin-Manson relationship between the fatigue life and the applied strain range was obtained for the Au films and Mo-W multilayers. The characterization of fatigue damage behaviors verifies the feasibility of this method, which appears easier and more effective than other testing methods.
Interim reliability-evaluation program: analysis of the Browns Ferry, Unit 1, nuclear plant. Appendix B - system descriptions and fault trees

DOE Office of Scientific and Technical Information (OSTI.GOV)

Mays, S.E.; Poloski, J.P.; Sullivan, W.H.

1982-07-01

This report describes a risk study of the Browns Ferry, Unit 1, nuclear plant. The study is one of four such studies sponsored by the NRC Office of Research, Division of Risk Assessment, as part of its Interim Reliability Evaluation Program (IREP), Phase II. This report is contained in four volumes: a main report and three appendixes. Appendix B provides a description of Browns Ferry, Unit 1, plant systems and the failure evaluation of those systems as they apply to accidents at Browns Ferry. Information is presented concerning front-line system fault analysis; support system fault analysis; human error models and probabilities; and generic control circuit analyses.
Test-retest reliability of quantitative sensory testing for mechanical somatosensory and pain modulation assessment of masticatory structures.

PubMed

Costa, Y M; Morita-Neto, O; de Araújo-Júnior, E N S; Sampaio, F A; Conti, P C R; Bonjardim, L R

2017-03-01

Assessing the reliability of medical measurements is a crucial step towards the elaboration of an applicable clinical instrument. There are few studies that evaluate the reliability of somatosensory assessment and pain modulation of masticatory structures. This study estimated the test-retest reliability, that is, reliability over time, of the mechanical somatosensory assessment of anterior temporalis, masseter and temporomandibular joint (TMJ) and the conditioned pain modulation (CPM) using the anterior temporalis as the test site. Twenty healthy women were evaluated in two sessions (1 week apart) by the same examiner. Mechanical detection threshold (MDT), mechanical pain threshold (MPT), wind-up ratio (WUR) and pressure pain threshold (PPT) were assessed on the skin overlying the anterior temporalis, masseter and TMJ of the dominant side. CPM was tested by comparing PPT before and during the hand immersion in a hot water bath. ANOVA and intra-class correlation coefficients (ICCs) were applied to the data (α = 5%). The overall ICCs showed acceptable values for the test-retest reliability of mechanical somatosensory assessment of masticatory structures. The ICC values of 75% of all quantitative sensory measurements were considered fair to excellent (fair = 8·4%, good = 33·3% and excellent = 33·3%). However, the CPM paradigm presented poor reliability (ICC = 0·25). The mechanical somatosensory assessment of the masticatory structures, but not the proposed CPM protocol, can be considered sufficiently reliable over time to evaluate the trigeminal sensory function. © 2016 John Wiley & Sons Ltd.
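The abstract does not say which ICC form was used. As a sketch, the function below derives two common single-measure forms from a two-way ANOVA decomposition of a subjects-by-sessions table: an absolute-agreement form, often written ICC(2,1), and a consistency form, often written ICC(3,1). The function name and data are hypothetical.

```python
def icc_two_way(ratings):
    """Return (agreement ICC, consistency ICC) for an n-subjects x k-sessions table."""
    n = len(ratings)
    k = len(ratings[0])
    if n < 2 or k < 2:
        raise ValueError("need at least two subjects and two sessions")
    grand = sum(sum(row) for row in ratings) / (n * k)
    row_means = [sum(row) / k for row in ratings]
    col_means = [sum(row[j] for row in ratings) / n for j in range(k)]
    # Two-way ANOVA sums of squares: subjects, sessions, residual.
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for row in ratings for x in row)
    ss_error = ss_total - ss_rows - ss_cols
    ms_rows = ss_rows / (n - 1)
    ms_cols = ss_cols / (k - 1)
    ms_error = ss_error / ((n - 1) * (k - 1))
    agreement = (ms_rows - ms_error) / (
        ms_rows + (k - 1) * ms_error + k * (ms_cols - ms_error) / n
    )
    consistency = (ms_rows - ms_error) / (ms_rows + (k - 1) * ms_error)
    return agreement, consistency


# Hypothetical thresholds for four subjects; session 2 reads exactly 1 unit higher.
sessions = [[1, 2], [2, 3], [3, 4], [4, 5]]
agreement, consistency = icc_two_way(sessions)
print(round(agreement, 3), round(consistency, 3))
```

In this example the consistency form is perfect while the agreement form is penalized by the systematic shift between sessions, which shows why the chosen ICC form matters when interpreting test-retest results.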
Testing Historical Skills.

ERIC Educational Resources Information Center

Baillie, Ray

1980-01-01

Outlines methods for including skill testing in teacher-made history tests. Focuses on distinguishing fact and fiction, evaluating the reliability of a source, distinguishing between primary and secondary sources, recognizing statements which support generalizations, testing with media, mapping geo-politics, and applying knowledge to new…
Reliability analysis and fault-tolerant system development for a redundant strapdown inertial measurement unit. [inertial platforms

NASA Technical Reports Server (NTRS)

Motyka, P.

1983-01-01

A methodology is developed and applied for quantitatively analyzing the reliability of a dual, fail-operational redundant strapdown inertial measurement unit (RSDIMU). A Markov evaluation model is defined in terms of the operational states of the RSDIMU to predict system reliability. A 27 state model is defined based upon a candidate redundancy management system which can detect and isolate a spectrum of failure magnitudes. The results of parametric studies are presented which show the effect on reliability of the gyro failure rate, both the gyro and accelerometer failure rates together, false alarms, probability of failure detection, probability of failure isolation, and probability of damage effects and mission time. A technique is developed and evaluated for generating dynamic thresholds for detecting and isolating failures of the dual, separated IMU. Special emphasis is given to the detection of multiple, nonconcurrent failures. Digital simulation time histories are presented which show the thresholds obtained and their effectiveness in detecting and isolating sensor failures.
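The 27-state model itself is not given in the abstract. To illustrate the general idea of a Markov reliability model with imperfect failure detection and isolation, the sketch below uses a deliberately simplified three-state chain for two redundant units: both units working, one unit working after a correctly handled failure, and system failed. All parameter names and values are hypothetical.

```python
import math


def duplex_reliability(failure_rate, coverage, mission_time, steps=10_000):
    """Probability the two-unit system is still operational at mission_time.

    coverage is the probability that a first failure is detected and isolated;
    an uncovered first failure is assumed to fail the system immediately.
    The state equations are integrated with simple forward Euler steps.
    """
    dt = mission_time / steps
    p_both, p_one = 1.0, 0.0
    for _ in range(steps):
        leave_both = 2.0 * failure_rate * p_both * dt  # either unit fails
        leave_one = failure_rate * p_one * dt          # remaining unit fails
        p_both -= leave_both
        p_one += coverage * leave_both - leave_one
    return p_both + p_one


# Hypothetical failure rate (per hour) and mission time (hours).
for c in (1.0, 0.95, 0.0):
    print(c, round(duplex_reliability(0.001, c, 100.0), 5))
```

Varying the coverage parameter in this toy model mirrors, in miniature, the parametric studies of detection and isolation probability described in the abstract.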
Values of a Patient and Observer Scar Assessment Scale to Evaluate the Facial Skin Graft Scar.

PubMed

Chae, Jin Kyung; Kim, Jeong Hee; Kim, Eun Jung; Park, Kun

2016-10-01

The patient and observer scar assessment scale (POSAS) recently emerged as a promising method, reflecting both the observer's and the patient's opinions in evaluating scars. This tool was shown to be consistent and reliable in burn scar assessment, but it has not been tested in the setting of skin graft scars in skin cancer patients. To evaluate facial skin graft scars with the POSAS and to compare the results with objective scar assessment tools. Twenty-three patients who were diagnosed with facial cutaneous malignancy and underwent skin grafting after Mohs micrographic surgery were recruited. Observer assessment was performed by three independent raters using the observer component of the POSAS and the Vancouver scar scale (VSS). Patient self-assessment was performed using the patient component of the POSAS. To quantify scar color and scar thickness more objectively, spectrophotometry and ultrasonography were applied. Inter-observer reliability was substantial with both the VSS and the observer component of the POSAS (average measure intraclass correlation coefficient, 0.76 and 0.80, respectively). The observer component consistently showed significant correlations with patients' ratings for the parameters of the POSAS (all p-values<0.05). The correlation between subjective assessment using the POSAS and objective assessment using spectrophotometry and ultrasonography was low. In facial skin graft scar assessment in skin cancer patients, the POSAS showed acceptable inter-observer reliability. This tool was more comprehensive and had higher correlation with the patient's opinion.
Specific algorithm method of scoring the Clock Drawing Test applied in cognitively normal elderly

PubMed Central

Mendes-Santos, Liana Chaves; Mograbi, Daniel; Spenciere, Bárbara; Charchat-Fichman, Helenice

2015-01-01

The Clock Drawing Test (CDT) is an inexpensive, fast and easily administered measure of cognitive function, especially in the elderly. This instrument is a popular clinical tool widely used in screening for cognitive disorders and dementia. The CDT can be applied in different ways and scoring procedures also vary. Objective The aims of this study were to analyze the performance of elderly on the CDT and evaluate inter-rater reliability of the CDT scored by using a specific algorithm method adapted from Sunderland et al. (1989). Methods We analyzed the CDT of 100 cognitively normal elderly aged 60 years or older. The CDT ("free-drawn") and Mini-Mental State Examination (MMSE) were administered to all participants. Six independent examiners scored the CDT of 30 participants to evaluate inter-rater reliability. Results and Conclusion A score of 5 on the proposed algorithm ("Numbers in reverse order or concentrated"), equivalent to 5 points on the original Sunderland scale, was the most frequent (53.5%). The CDT specific algorithm method used had high inter-rater reliability (p<0.01), and mean score ranged from 5.06 to 5.96. The high frequency of an overall score of 5 points may suggest the need to create more nuanced evaluation criteria, which are sensitive to differences in levels of impairment in visuoconstructive and executive abilities during aging. PMID:29213954
Good practices in normal childbirth: reliability analysis of an instrument by Cronbach's Alpha.

PubMed

Gottems, Leila Bernarda Donato; Carvalho, Elisabete Mesquita Peres De; Guilhem, Dirce; Pires, Maria Raquel Gomes Maia

2018-01-01

To analyze the internal consistency of an instrument that evaluates professionals' adherence to good practices in childbirth and birth care, using Cronbach's alpha coefficient for each of the dimensions and for the total instrument. This is a descriptive, cross-sectional study performed in obstetric centers of eleven public hospitals in the Federal District, with a questionnaire applied to 261 professionals who worked in delivery care. The participants were 261 professionals: 42.5% (111) nurses and 57.5% (150) physicians. The reliability evaluation of the instrument by Cronbach's alpha resulted in 0.53, 0.78 and 0.76 for dimensions 1, 2 and 3, after a refinement that excluded 11 items. The instrument obtained a Cronbach's alpha of 0.80. There is a need for improvement in the items of dimension 1, which refer to attitudes, knowledge, and practices regarding the organization of the network of care for gestation, childbirth, and birth. However, the instrument can be applied as it is to evaluate childbirth care practices based on scientific evidence.
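For readers unfamiliar with the statistic, Cronbach's alpha can be computed from a respondents-by-items table as k/(k-1) times one minus the ratio of summed item variances to the variance of the total score. The sketch below is a generic illustration with made-up answers; the function name `cronbach_alpha` and the data are hypothetical and unrelated to the study's questionnaire.

```python
from statistics import variance  # sample variance (n - 1 denominator)


def cronbach_alpha(responses):
    """Cronbach's alpha for a table with one row per respondent, one column per item."""
    k = len(responses[0])  # number of items
    item_vars = [variance([row[j] for row in responses]) for j in range(k)]
    total_var = variance([sum(row) for row in responses])
    return (k / (k - 1)) * (1.0 - sum(item_vars) / total_var)


# Hypothetical answers of four respondents to three Likert-type items
answers = [
    [4, 5, 4],
    [2, 3, 2],
    [5, 5, 4],
    [3, 2, 3],
]
print(round(cronbach_alpha(answers), 3))
```

Computing the coefficient per dimension, as the study did, amounts to calling the function on the subset of columns that belong to each dimension.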
Out-of-Level Testing for Special Education Students with Mild Learning Handicaps.

ERIC Educational Resources Information Center

Jones, Eric D.; And Others

The purpose of this study was to evaluate the utility of out-of-level testing (OLT) when it is applied to the assessment of special education students with mild learning handicaps. This evaluation of OLT involved testing hypotheses related to: (1) the adequacy of vertical scaling, (2) the reliability and (3) the validity of OLT scores. Fifty-eight…
Issues in providing a reliable multicast facility

NASA Technical Reports Server (NTRS)

Dempsey, Bert J.; Strayer, W. Timothy; Weaver, Alfred C.

1990-01-01

Issues involved in point-to-multipoint communication are presented and the literature for proposed solutions and approaches surveyed. Particular attention is focused on the ideas and implementations that align with the requirements of the environment of interest. The attributes of multicast receiver groups that might lead to useful classifications, what the functionality of a management scheme should be, and how the group management module can be implemented are examined. The services that multicasting facilities can offer are presented, followed by mechanisms within the communications protocol that implements these services. The metrics of interest when evaluating a reliable multicast facility are identified and applied to four transport layer protocols that incorporate reliable multicast.
Clinical utility of measures of breathlessness.

PubMed

Cullen, Deborah L; Rodak, Bernadette

2002-09-01

The clinical utility of measures of dyspnea has been debated in the health care community. Although breathlessness can be evaluated with various instruments, the most effective dyspnea measurement tool for patients with chronic lung disease or for measuring treatment effectiveness remains uncertain. Understanding the evidence for the validity and reliability of these instruments may provide a basis for appropriate clinical application. Evaluate instruments designed to measure breathlessness, either as single-symptom or multidimensional instruments, based on psychometrics foundations such as validity, reliability, and discriminative and evaluative properties. Classification of each dyspnea measurement instrument will recommend clinical application in terms of exercise, benchmarking patients, activities of daily living, patient outcomes, clinical trials, and responsiveness to treatment. Eleven dyspnea measurement instruments were selected. Each instrument was assessed as discriminative or evaluative and then analyzed as to its psychometric properties and purpose of design. Descriptive data from all studies were described according to their primary patient application (ie, chronic obstructive pulmonary disease, asthma, or other patient populations). The Borg Scale and the Visual Analogue Scale are applicable to exertion and thus can be applied to any cardiopulmonary patient to determine dyspnea. All other measures were determined appropriate for chronic obstructive pulmonary disease, whereas the Shortness of Breath Questionnaire can be applied to cystic fibrosis and lung transplant patients. The most appropriate utility for all instruments was measuring the effects on activities of daily living and for benchmarking patient progress. Instruments that quantify function and health-related quality of life have great utility for documenting outcomes but may be limited as to documenting treatment responsiveness in terms of clinically important changes. 
The dyspnea measurement instruments we studied meet important standards of validity and reliability. Discriminative measures have limited clinical utility and, when used for populations or conditions for which they are not designed or validated, the data collected may not be clinically relevant. Evaluative measures have greater clinical utility and can be applied for outcome purposes. Measures should be applied to the populations and conditions for which they were designed. The relationship between clinical therapies and the measurement of dyspnea as an outcome can develop as respiratory therapists become more comfortable with implementing dyspnea measurement instruments and use the data to improve patient treatment. Dyspnea evaluation should be considered for all clinical practice guidelines and care pathways.

A Human Reliability Based Usability Evaluation Method for Safety-Critical Software

DOE Office of Scientific and Technical Information (OSTI.GOV)

Phillippe Palanque; Regina Bernhaupt; Ronald Boring

2006-04-01

Recent years have seen an increasing use of sophisticated interaction techniques, including in the field of safety-critical interactive software [8]. The use of such techniques has been required in order to increase the bandwidth between the users and systems and thus to help them deal efficiently with increasingly complex systems. These techniques come from research and innovation done in the field of human-computer interaction (HCI). A significant effort is currently being undertaken by the HCI community in order to apply and extend current usability evaluation techniques to these new kinds of interaction techniques. However, very little has been done to improve the reliability of software offering these kinds of interaction techniques. Even testing basic graphical user interfaces remains a challenge that has rarely been addressed in the field of software engineering [9]. However, the unreliability of interactive software can jeopardize usability evaluation by showing unexpected or undesired behaviors. The aim of this SIG is to provide a forum for both researchers and practitioners interested in testing interactive software. Our goal is to define a roadmap of activities to cross-fertilize usability and reliability testing of these kinds of systems to minimize duplicate efforts in both communities.
Strength Analysis and Reliability Evaluation for Speed Reducers

NASA Astrophysics Data System (ADS)

Tsai, Yuo-Tern; Hsu, Yung-Yuan

2017-09-01

This paper studies the structural stresses of differential drive (DD) and harmonic drive (HD) reducers for design improvement. The design principles of the two reducers are reported for functional comparison. The critical components of the reducers are modeled for motion simulation and stress analysis. The DD design is based on differential displacement of the decelerated gear ring, whereas the HD design is based on a flexible spline. The finite element method (FEM) is used to analyze the structural stresses, including the dynamic properties of the reducers. The stresses and kinematic properties of the two reducers are compared to characterize the designs. The results of the analysis are applied to identify the allowable loads of the reducers in use. The reliabilities of the reducers under different loads are further calculated according to the variation of stress. The results are useful for engineering analysis and reliability evaluation when designing a speed reducer with high ratios.
Health measurement using the ICF: Test-retest reliability study of ICF codes and qualifiers in geriatric care

PubMed Central

Okochi, Jiro; Utsunomiya, Sakiko; Takahashi, Tai

2005-01-01

Background The International Classification of Functioning, Disability and Health (ICF) was published by the World Health Organization (WHO) to standardize descriptions of health and disability. Little is known about the reliability and clinical relevance of measurements using the ICF and its qualifiers. This study examines the test-retest reliability of ICF codes, and the rate of immeasurability in long-term care settings of the elderly to evaluate the clinical applicability of the ICF and its qualifiers, and the ICF checklist. Methods Reliability of 85 body function (BF) items and 152 activity and participation (AP) items of the ICF was studied using a test-retest procedure with a sample of 742 elderly persons from 59 institutional and at home care service centers. Test-retest reliability was estimated using the weighted kappa statistic. The clinical relevance of the ICF was estimated by calculating immeasurability rate. The effect of the measurement settings and evaluators' experience was analyzed by stratification of these variables. The properties of each item were evaluated using both the kappa statistic and immeasurability rate to assess the clinical applicability of WHO's ICF checklist in the elderly care setting. Results The median of the weighted kappa statistics of 85 BF and 152 AP items were 0.46 and 0.55 respectively. The reproducibility statistics improved when the measurements were performed by experienced evaluators. Some chapters such as genitourinary and reproductive functions in the BF domain and major life area in the AP domain contained more items with lower test-retest reliability measures and rated as immeasurable than in the other chapters. Some items in the ICF checklist were rated as unreliable and immeasurable. Conclusion The reliability of the ICF codes when measured with the current ICF qualifiers is relatively low. 
The increase in reliability with evaluators' experience suggests that proper education would help raise reliability. The ICF checklist contains some items that are difficult to apply in geriatric care settings. Improvements should be achieved by selecting the most relevant items for each measurement and by developing appropriate qualifiers for each code according to the interests of the users. PMID:16050960
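The weighted kappa statistic used for test-retest agreement on ordinal ratings can be sketched as follows. The abstract does not state the weighting scheme, so this sketch assumes linear disagreement weights by default (with quadratic weights available via `power=2`); the function name and the sample ratings on a 0 to 4 scale are hypothetical.

```python
def weighted_kappa(first, second, n_categories, power=1):
    """Cohen's weighted kappa for two ratings of the same subjects.

    Ratings are integers in range(n_categories). Disagreement weights are
    (|i - j| / (n_categories - 1)) ** power: power=1 linear, power=2 quadratic.
    """
    n = len(first)
    observed = [[0] * n_categories for _ in range(n_categories)]
    for a, b in zip(first, second):
        observed[a][b] += 1
    row_totals = [sum(observed[i]) for i in range(n_categories)]
    col_totals = [
        sum(observed[i][j] for i in range(n_categories)) for j in range(n_categories)
    ]

    weighted_obs = 0.0  # observed weighted disagreement
    weighted_exp = 0.0  # disagreement expected by chance
    for i in range(n_categories):
        for j in range(n_categories):
            w = (abs(i - j) / (n_categories - 1)) ** power
            weighted_obs += w * observed[i][j] / n
            weighted_exp += w * row_totals[i] * col_totals[j] / (n * n)
    return 1.0 - weighted_obs / weighted_exp


# Hypothetical test and retest ratings (0 = no problem ... 4 = complete problem)
test = [0, 1, 2, 3, 4, 2, 1, 0]
retest = [0, 1, 3, 3, 4, 1, 1, 0]
print(round(weighted_kappa(test, retest, n_categories=5), 3))
```

Because near-misses on an ordinal scale are penalized less than distant disagreements, weighted kappa is generally preferred over unweighted kappa for qualifier scales of this kind.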
Validation of the Brazilian Portuguese Version of Geriatric Anxiety Inventory--GAI-BR.

PubMed

Massena, Patrícia Nitschke; de Araújo, Narahyana Bom; Pachana, Nancy; Laks, Jerson; de Pádua, Analuiza Camozzato

2015-07-01

The Geriatric Anxiety Inventory (GAI) is a recently developed scale aiming to evaluate symptoms of anxiety in later life. This 20-item scale uses dichotomous answers highlighting non-somatic anxiety complaints of elderly people. The present study aimed to evaluate the psychometric properties of the Brazilian Portuguese version GAI (GAI-BR) in a sample from community and outpatient psychogeriatric clinic. A mixed convenience sample of 72 subjects was recruited for answering the research protocol. The interview procedures were structured with questionnaires about sociodemographic data, clinical health status, anxiety, and depression previously validated instruments, Mini-Mental State Examination, Mini International Neuropsychiatric Interview, and GAI-BR. Twenty-two percent of the sample were interviewed twice for test-retest reliability. For internal consistency analyses, the Cronbach's α test was applied. The Spearman correlation test was applied to evaluate the test-retest GAI-BR reliability. A ROC (receiver operating characteristic) curve study was made to estimate the GAI-BR area under curve, cut-off points, sensitivity, and specificity for the Generalized Anxiety Disorder diagnosis. The GAI-BR version showed high internal consistency (Cronbach's α = 0.91) and strong and significant test-retest reliability (ρ = 0.85, p < 0.001). It also showed moderate and significant correlation with the Beck Anxiety Inventory (ρ = 0.68, p < 0.001) and the State-Trait Anxiety Inventory (ρ = 0.61, p < 0.001) showing evidence of concurrent validation. The cut-off point of 13 estimated by ROC curve analyses showed sensitivity of 83.3% and specificity of 84.6% to detect Generalized Anxiety Disorder (DSM-IV). GAI-BR has demonstrated very good psychometric properties and can be a reliable instrument to measure anxiety in Brazilian elderly people.
Proposed Criteria for Evaluation of the Reliability Improvement Warranty Concept.

DTIC Science & Technology

1980-06-01

future applications of AFIT thesis research. Please return completed questionnaires to: AFIT/LSH (Thesis Feedback), Wright-Patterson AFB, Ohio 45433. ... Overview ... Background ... II. PROBLEM DEFINITION ... Evaluation Progress ... maintainability of its weapon system's equipment (17:1). In an effort to achieve this end, the Air Force, in FY 1969, first applied the concept of the
Assessing the Reliability of Material Flow Analysis Results: The Cases of Rhenium, Gallium, and Germanium in the United States Economy.

PubMed

Meylan, Grégoire; Reck, Barbara K; Rechberger, Helmut; Graedel, Thomas E; Schwab, Oliver

2017-10-17

Decision-makers traditionally expect "hard facts" from scientific inquiry, an expectation that the results of material flow analyses (MFAs) can hardly meet. MFA limitations are attributable to incompleteness of flowcharts, limited data quality, and model assumptions. Moreover, MFA results are, for the most part, based less on empirical observation than on social knowledge construction processes. Developing, applying, and improving the means of evaluating and communicating the reliability of MFA results is imperative. We apply two recently proposed approaches for making quantitative statements on MFA reliability to national minor metals systems: rhenium, gallium, and germanium in the United States in 2012. We discuss the reliability of results in policy and management contexts. The first approach consists of assessing data quality based on systematic characterization of MFA data and the associated meta-information and quantifying the "information content" of MFAs. The second is a quantification of data inconsistencies indicated by the "degree of data reconciliation" between the data and the model. A high information content and a low degree of reconciliation indicate reliable or certain MFA results. This article contributes to reliability and uncertainty discourses in MFA, exemplifying the usefulness of the approaches in policy and management, and to raw material supply discussions by providing country-level information on three important minor metals often considered critical.
Test-retest reliability of the prefrontal response to affective pictures based on functional near-infrared spectroscopy

NASA Astrophysics Data System (ADS)

Huang, Yuxia; Mao, Mengchai; Zhang, Zong; Zhou, Hui; Zhao, Yang; Duan, Lian; Kreplin, Ute; Xiao, Xiang; Zhu, Chaozhe

2017-01-01

Functional near-infrared spectroscopy (fNIRS) is being increasingly applied to affective and social neuroscience research; however, the reliability of this method is still unclear. This study aimed to evaluate the test-retest reliability of the fNIRS-based prefrontal response to emotional stimuli. Twenty-six participants viewed unpleasant and neutral pictures, and were simultaneously scanned by fNIRS in two sessions three weeks apart. The reproducibility of the prefrontal activation map was evaluated at three spatial scales (mapwise, clusterwise, and channelwise) at both the group and individual levels. The influence of the time interval was also explored and comparisons were made between longer (intersession) and shorter (intrasession) time intervals. The reliabilities of the activation map at the group level for the mapwise (up to 0.88, the highest value appeared in the intersession assessment) and clusterwise scales (up to 0.91, the highest appeared in the intrasession assessment) were acceptable, indicating that fNIRS may be a reliable tool for emotion studies, especially for a group analysis and under larger spatial scales. However, it should be noted that the individual-level and the channelwise fNIRS prefrontal responses were not sufficiently stable. Future studies should investigate which factors influence reliability, as well as the validity of fNIRS used in emotion studies.
Weight concerns scale applied to college students: comparison between pencil-and-paper and online formats.

PubMed

Dias, Juliana Chioda Ribeiro; Maroco, João; Campos, Juliana Alvares Duarte Bonini

2015-03-01

Online data collection is becoming increasingly common and has some advantages compared to traditional paper-and-pencil formats, such as reducing loss of data, increasing participants' privacy, and decreasing the effect of social desirability. However, the validity and reliability of this administration format must be established before results can be considered acceptable. The aim of this study was to evaluate the validity, reliability, and equivalence of paper-and-pencil and online versions of the Weight Concerns Scale (WCS) when applied to Brazilian university students. A crossover design was used, and the Portuguese version of the WCS (in both paper-and-pencil and online formats) was completed by 100 college students. The results indicated adequate fit in both formats. The simultaneous fit of data for both groups was excellent, with strong invariance between models. Adequate convergent validity, internal consistency, and mean score equivalence of the WCS in both formats were observed. Thus, the WCS presented adequate reliability and validity in both administration formats, with equivalence/stability between answers.
Cross-cultural adaptation, reliability and validity of the Turkish version of the Hospital for Special Surgery (HSS) Knee Score.

PubMed

Narin, Selnur; Unver, Bayram; Bakırhan, Serkan; Bozan, Ozgür; Karatosun, Vasfi

2014-01-01

The purpose of this study was to adapt the English version of the Hospital for Special Surgery (HSS) knee score for use in a Turkish population and to evaluate its validity, reliability and cultural adaptation. Standard forward-back translation of the HSS knee score was performed and the Turkish version was applied in 73 patients. The Western Ontario and McMaster Universities Osteoarthritis Index (WOMAC), Mini-Mental State Examination and sit-to-stand test were also performed and analyzed. Internal consistency reliability was tested using Cronbach's alpha. The intraclass correlation coefficient (ICC) was used to calculate the test-retest reliability at one-week intervals. Validity was assessed by calculating the Pearson correlation between the HSS, WOMAC and sit-to-stand test scores. The ICC ranged from 0.98 to 0.99 with high internal consistency (Cronbach's alpha: 0.87). The WOMAC score correlated with total HSS score (r: -0.80, p<0.001) and sit-to-stand score (r: 0.12, p: 0.312). The Turkish version of the HSS knee score is reliable and valid in evaluating the total knee arthroplasty in Turkish patients.
The Effect of Power Protection Equipment on Explosion Hazards and on the Reliability of Power Supply to Longwall Systems

NASA Astrophysics Data System (ADS)

Boron, Sergiusz

2017-06-01

Operational safety of electrical machines and equipment depends, inter alia, on the hazards resulting from their use and on the scope of applied protective measures. The use of insufficient protection against existing hazards leads to reduced operational safety, particularly under fault conditions. On the other hand, excessive (in relation to existing hazards) level of protection may compromise the reliability of power supply. This paper analyses the explosion hazard created by earth faults in longwall power supply systems and evaluates existing protection equipment from the viewpoint of its protective performance, particularly in the context of explosion hazards, and also assesses its effect on the reliability of power supply.
Values of a Patient and Observer Scar Assessment Scale to Evaluate the Facial Skin Graft Scar

PubMed Central

Chae, Jin Kyung; Kim, Eun Jung; Park, Kun

2016-01-01

Background The patient and observer scar assessment scale (POSAS) recently emerged as a promising method, reflecting both the observer's and the patient's opinions in evaluating scars. This tool was shown to be consistent and reliable in burn scar assessment, but it has not been tested in the setting of skin graft scars in skin cancer patients. Objective To evaluate facial skin graft scars with the POSAS and to compare the results with objective scar assessment tools. Methods Twenty-three patients who were diagnosed with facial cutaneous malignancy and underwent skin grafting after Mohs micrographic surgery were recruited. Observer assessment was performed by three independent raters using the observer component of the POSAS and the Vancouver scar scale (VSS). Patient self-assessment was performed using the patient component of the POSAS. To quantify scar color and scar thickness more objectively, spectrophotometry and ultrasonography were applied. Results Inter-observer reliability was substantial with both the VSS and the observer component of the POSAS (average measure intraclass correlation coefficient, 0.76 and 0.80, respectively). The observer component consistently showed significant correlations with patients' ratings for the parameters of the POSAS (all p-values<0.05). The correlation between subjective assessment using the POSAS and objective assessment using spectrophotometry and ultrasonography was low. Conclusion In facial skin graft scar assessment in skin cancer patients, the POSAS showed acceptable inter-observer reliability. This tool was more comprehensive and had higher correlation with the patient's opinion. PMID:27746642
Reliable change indices and standardized regression-based change score norms for evaluating neuropsychological change in children with epilepsy.

PubMed

Busch, Robyn M; Lineweaver, Tara T; Ferguson, Lisa; Haut, Jennifer S

2015-06-01

Reliable change indices (RCIs) and standardized regression-based (SRB) change score norms permit evaluation of meaningful changes in test scores following treatment interventions, like epilepsy surgery, while accounting for test-retest reliability, practice effects, score fluctuations due to error, and relevant clinical and demographic factors. Although these methods are frequently used to assess cognitive change after epilepsy surgery in adults, they have not been widely applied to examine cognitive change in children with epilepsy. The goal of the current study was to develop RCIs and SRB change score norms for use in children with epilepsy. Sixty-three children with epilepsy (age range: 6-16; M=10.19, SD=2.58) underwent comprehensive neuropsychological evaluations at two time points an average of 12 months apart. Practice effect-adjusted RCIs and SRB change score norms were calculated for all cognitive measures in the battery. Practice effects were quite variable across the neuropsychological measures, with the greatest differences observed among older children, particularly on the Children's Memory Scale and Wisconsin Card Sorting Test. There was also notable variability in test-retest reliabilities across measures in the battery, with coefficients ranging from 0.14 to 0.92. Reliable change indices and SRB change score norms for use in assessing meaningful cognitive change in children following epilepsy surgery are provided for measures with reliability coefficients above 0.50. This is the first study to provide RCIs and SRB change score norms for a comprehensive neuropsychological battery based on a large sample of children with epilepsy. Tables to aid in evaluating cognitive changes in children who have undergone epilepsy surgery are provided for clinical use. An Excel sheet to perform all relevant calculations is also available to interested clinicians or researchers. Copyright © 2015 Elsevier Inc. All rights reserved.
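A practice effect-adjusted reliable change index can be sketched in a few lines. The abstract does not give its formulas, so this sketch assumes one common formulation: the observed change minus the mean practice effect, divided by the standard error of the difference, where SEdiff = sqrt(2) x SD x sqrt(1 - r). The function name, parameter names, and numbers are hypothetical and are not norms from the study.

```python
import math


def reliable_change_index(baseline, retest, sd_baseline, reliability, practice_effect=0.0):
    """Practice-adjusted RCI (one common formulation, assumed here).

    baseline, retest : a single child's scores at the two time points
    sd_baseline      : standard deviation of baseline scores in the reference sample
    reliability      : test-retest reliability coefficient of the measure
    practice_effect  : mean retest-minus-baseline change in the reference sample
    """
    sem = sd_baseline * math.sqrt(1.0 - reliability)  # standard error of measurement
    se_diff = math.sqrt(2.0) * sem                    # standard error of the difference
    return ((retest - baseline) - practice_effect) / se_diff


# Hypothetical example: a 12-point gain on a measure with SD 15 and r = 0.80,
# where the reference sample gained 3 points on average from practice alone
rci = reliable_change_index(95, 107, sd_baseline=15, reliability=0.80, practice_effect=3.0)
print(round(rci, 2), "reliable change" if abs(rci) > 1.645 else "within expected variation")
```

The cut-off of 1.645 corresponds to a 90% confidence interval, which is a frequent but not universal choice; low reliability coefficients widen the interval, which is one reason norms are often restricted to measures above a reliability threshold.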
Psychometric properties of the School Anxiety Inventory-Short Version in Spanish secondary education students.

PubMed

García-Fernández, José M; Inglés, Cándido J; Marzo, Juan C; Martínez-Monteagudo, María C

2014-05-01

The School Anxiety Inventory (SAI) can be applied in different fields of psychology. However, due to the inventory's administration time, it may not be useful in certain situations. To address this concern, the present study developed a short version of the SAI (the SAI-SV). This study examined the reliability and validity evidence drawn from the scores of the School Anxiety Inventory-Short Version (SAI-SV) using a sample of 2,367 (47.91% boys) Spanish secondary school students, ranging from 12 to 18 years of age. To analyze the dimensional structure of the SAI-SV, exploratory and confirmatory factor analyses were applied. Internal consistency and test-retest reliability were calculated for SAI-SV scores. A correlated three-factor structure related to school situations (Anxiety about Aggression, Anxiety about Social Evaluation, and Anxiety about Academic Failure) and a three-factor structure related to the response systems of anxiety (Physiological Anxiety, Cognitive Anxiety, and Behavioral Anxiety) were identified and supported. The internal consistency and test-retest reliability were determined to be appropriate. The reliability and validity evidence based on the internal structure of SAI-SV scores was satisfactory.
Validity and reliability of the Turkish Migraine Disability Assessment (MIDAS) questionnaire.

PubMed

Ertaş, Mustafa; Siva, Aksel; Dalkara, Turgay; Uzuner, Nevzat; Dora, Babür; Inan, Levent; Idiman, Fethi; Sarica, Yakup; Selçuki, Deniz; Sirin, Hadiye; Oğuzhanoğlu, Atilla; Irkeç, Ceyla; Ozmenoğlu, Mehmet; Ozbenli, Taner; Oztürk, Musa; Saip, Sabahattin; Neyal, Münife; Zarifoğlu, Mehmet

2004-09-01

The aim of this study is to assess the comprehensibility, internal consistency, patient-physician reliability, test-retest reliability, and validity of the Turkish version of the Migraine Disability Assessment (MIDAS) questionnaire in patients with headache. The MIDAS questionnaire was developed by Stewart et al and has been shown to be reliable and valid for determining the degree of disability caused by migraine. This study was designed as a national multicenter study to demonstrate the reliability and validity of the Turkish version of the MIDAS questionnaire. Patients applying to 17 Neurology Clinics in Turkey were evaluated at the baseline (visit 1), week 4 (visit 2), and week 12 (visit 3) visits in terms of disease severity and the comprehensibility, internal consistency, test-retest reliability, and validity of MIDAS. Since the severity of the disease was found to change significantly at visit 2 compared to visit 1, test-retest reliability was assessed using the MIDAS scores of a subgroup of patients whose disease severity remained unchanged (up to +/-3 days difference in the number of days with headache between visits 1 and 2). A total of 306 patients (86.2% female, mean age: 35.0 +/- 9.8 years) were enrolled in the study. A total of 65.7%, 77.5%, and 82.0% of patients reported that "they had fully understood the MIDAS questionnaire" in visits 1, 2, and 3, respectively. A highly positive correlation was found between physician-applied and patient-applied total MIDAS scores in all three visits (Spearman correlation coefficients were R = 0.87, 0.83, and 0.90, respectively, P < .001). Internal consistency of MIDAS was assessed using Cronbach's alpha and was found to be at acceptable (>0.7) or excellent (>0.8) levels in patient-applied and physician-applied MIDAS scores, respectively. Total MIDAS score showed good test-retest reliability (R = 0.68). 
Both the number of days with headache and the total MIDAS scores were positively correlated at all visits with correlation coefficients between 0.47 and 0.63. There was also a moderate degree of correlation (R= 0.54) between the total MIDAS score at week 12 and the number of days with headache at visit 2 + visit 3, which quantify headache-related disability over a 3-month period similar to MIDAS questionnaire. These findings demonstrated that the Turkish translation is equivalent to the English version of MIDAS in terms of internal consistency, test-retest reliability, and validity. Physicians can reliably use the Turkish translation of the MIDAS questionnaire in defining the severity of illness and its treatment strategy when applied as a self-administered report by migraine patients themselves.
77 FR 4282 - Gulf of Mexico Fishery Management Council; Public Meeting

Federal Register 2010, 2011, 2012, 2013, 2014

2012-01-27

... Workshop will evaluate the data used in the assessment and whether data uncertainties acknowledged/reported are within normal or expected levels, e.g., recruitment deviations; whether data were applied properly within the assessment model; are input data series reliable and sufficient to support the assessment...
Analyzing Creative Products: Refinement and Test of a Judging Instrument.

ERIC Educational Resources Information Center

Besemer, Susan; O'Quin, Karen

1986-01-01

The Creative Product Analysis Matrix was evaluated and refined by asking 133 undergraduate students to apply the questionnaire items in three areas (novelty, resolution, elaboration/synthesis) to two T-shirts, only one of predetermined creative design. Results indicated the instrument reliably assessed overall perceptions of the product. (DB)
Physical-Mechanisms Based Reliability Analysis For Emerging Technologies

DTIC Science & Technology

2017-05-05

irradiation is greatly enhanced by biasing the...devices during irradiation and/or applying high field stress before irradiation. The resulting defect energy distributions were evaluated after... irradiation and/or high field stress via low-frequency noise measurements. Significant increases were observed in acceptor densities for defects with
AMULET: A MUlti-cLuE Approach to Image Forensics

DTIC Science & Technology

2014-12-31

celebrities have been substituted in the other two pictures. 3.2.5 Choice of reliability properties Let us now apply the BBA mapping approach proposed in...Jiang, and L. Ma, “Ds evidence theory based digital image trustworthiness evaluation model,” in MINES 2009, International Conference on Multimedia
Facial Angiofibroma Severity Index (FASI): reliability assessment of a new tool developed to measure severity and responsiveness to therapy in tuberous sclerosis-associated facial angiofibroma.

PubMed

Salido-Vallejo, R; Ruano, J; Garnacho-Saucedo, G; Godoy-Gijón, E; Llorca, D; Gómez-Fernández, C; Moreno-Giménez, J C

2014-12-01

Tuberous sclerosis complex (TSC) is an autosomal dominant neurocutaneous disorder characterized by the development of multisystem hamartomatous tumours. Topical sirolimus has recently been suggested as a potential treatment for TSC-associated facial angiofibroma (FA). To validate a reproducible scale created for the assessment of clinical severity and treatment response in these patients. We developed a new tool, the Facial Angiofibroma Severity Index (FASI) to evaluate the grade of erythema and the size and extent of FAs. In total, 30 different photographs of patients with TSC were shown to 56 dermatologists at each evaluation. Three evaluations using the same photographs but in a different random order were performed 1 week apart. Test and retest reliability and interobserver reproducibility were determined. There was good agreement between the investigators. Inter-rater reliability showed strong correlations (> 0.98; range 0.97-0.99) with inter-rater correlation coefficients (ICCs) for the FASI. The global estimated kappa coefficient for the degree of intra-rater agreement (test-retest) was 0.94 (range 0.91-0.97). The FASI is a valid and reliable tool for measuring the clinical severity of TSC-associated FAs, which can be applied in clinical practice to evaluate the response to treatment in these patients. © 2014 British Association of Dermatologists.
Reliable change of the sensory organization test.

PubMed

Broglio, Steven P; Ferrara, Michael S; Sopiarz, Kay; Kelly, Michael S

2008-03-01

To establish the sensitivity and specificity of the NeuroCom Sensory Organization Test (SOT) and provide practitioners with cut-scores for clinical decision making using estimates of reliable change. Retrospective cohort study. Research laboratory. Healthy (n = 66) and concussed (n = 63) young adult participants. Postural control assessments on the NeuroCom SOT were completed twice (baseline and follow-up) for both groups. Postconcussion assessments were administered within 24 hours of injury diagnosis. The reliable change technique was used to calculate cut-scores for each SOT variable (composite balance; somatosensory, visual, and vestibular ratios) at the 95%, 90%, 85%, 80%, 75%, and 70% confidence interval levels. When cut-scores were applied to the post-concussion evaluations, sensitivity and specificity varied with SOT variable and confidence interval. An evaluation for change on one or more SOT variables resulted in the highest combined sensitivity (57%) and specificity (80%) at the 75% confidence interval. Use of reliable change scores to detect significant changes in performance on the SOT resulted in decreased sensitivity and improved specificity compared to a previous report. These findings indicate that some concussed athletes may not show large changes in postconcussion postural control and this postural control evaluation should not be used in exclusion of other assessment techniques. The postural control assessment should be combined with other evaluative measures to gain the highest sensitivity to concussive injuries.
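The abstract above does not give the formula behind its cut-scores. As a rough illustration only, the sketch below uses the common Jacobson-Truax form of the reliable change index: the standard error of measurement is derived from a baseline standard deviation and a test-retest coefficient, and the cut-score is the change that exceeds measurement error at a chosen two-sided confidence level. The function names and all numbers are hypothetical and are not taken from the study, which may have used a different variant (for example, one adjusted for practice effects).

```python
from math import sqrt
from statistics import NormalDist


def reliable_change_cutoff(sd_baseline, test_retest_r, confidence=0.95):
    """Smallest absolute score change that exceeds measurement error.

    Assumes the Jacobson-Truax formulation (an assumption, not the
    study's documented method).
    """
    # Standard error of measurement from baseline SD and test-retest r.
    sem = sd_baseline * sqrt(1.0 - test_retest_r)
    # Standard error of the difference between two administrations.
    s_diff = sqrt(2.0) * sem
    # Two-sided critical z value for the requested confidence level.
    z = NormalDist().inv_cdf(0.5 + confidence / 2.0)
    return z * s_diff


def is_reliable_decline(baseline, follow_up, cutoff):
    """True when the follow-up score dropped by more than the cut-score."""
    return (baseline - follow_up) > cutoff


# Hypothetical inputs: SD of 10 points, test-retest r of 0.75.
for level in (0.95, 0.90, 0.75):
    cut = reliable_change_cutoff(10.0, 0.75, level)
    print(f"{level:.0%} cut-score: {cut:.2f} points")
```

Lowering the confidence level shrinks the cut-score, so more follow-up scores are flagged as changed. That is the sensitivity and specificity trade-off the abstract describes across its six confidence levels.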

Reliability-Productivity Curve, a Tool for Adaptation Measures Identification

NASA Astrophysics Data System (ADS)

Chávez-Jiménez, A.; Granados, A.; Garrote, L. M.

2015-12-01

Due to climate change effects, water scarcity problems are expected to intensify in several regions. These problems will negatively affect low-priority water demands, since these will be reduced in favor of high-priority ones. An example would be the reduction of agricultural water resources in favor of urban ones. The evaluation of adaptation measures is therefore important for better water resources management. An important tool to face this challenge is the economic valuation of the water demands' impact within a water resources system. In agriculture this valuation is usually performed through the water productivity evaluation. The water productivity evaluation requires detailed information regarding the different crops, such as the applied technology, the agricultural supplies management, the water availability, etc. This is a restriction for an evaluation at basin scale due to the difficulty of gathering this level of detailed information. Besides, only the water availability is taken into account, but not the period when the water is distributed (i.e. water resources reliability). Water resources reliability is one of the most important variables in water resources management. This research proposes a methodology to determine the agriculture water productivity, using as variables the crops information, the crops price, the water resources availability, and the water resources reliability, at a basin scale. This methodology would allow identifying general water resources adaptation measures, providing the basis for further detailed studies in critical regions.
Design and validation of an oral health questionnaire for preoperative anaesthetic evaluation.

PubMed

Ruíz-López Del Prado, Gema; Blaya-Nováková, Vendula; Saz-Parkinson, Zuleika; Álvarez-Montero, Óscar Luis; Ayala, Alba; Muñoz-Moreno, Maria Fe; Forjaz, Maria João

Dental injuries incurred during endotracheal intubation are more frequent in patients with previous oral pathology. The study objectives were to develop an oral health questionnaire for preanaesthesia evaluation, easy to apply for personnel without special dental training; and establish a cut-off value for detecting persons with poor oral health. Validation study of a self-administered questionnaire, designed according to a literature review and an expert group's recommendations. The questionnaire was applied to a sample of patients evaluated in a preanaesthesia consultation. Rasch analysis of the questionnaire psychometric properties included viability, acceptability, content validity and reliability of the scale. The sample included 115 individuals, 50.4% of men, with a median age of 58 years (range: 38-71). The final analysis of 11 items presented a Person Separation Index of 0.861 and good adjustment of data to the Rasch model. The scale was unidimensional and its items were not biased by sex, age or nationality. The oral health linear measure presented good construct validity. The cut-off value was set at 52 points. The questionnaire showed sufficient psychometric properties to be considered a reliable tool, valid for measuring the state of oral health in preoperative anaesthetic evaluations. Copyright © 2016 Sociedade Brasileira de Anestesiologia. Published by Elsevier Editora Ltda. All rights reserved.
[Design and validation of an oral health questionnaire for preoperative anaesthetic evaluation].

PubMed

Ruíz-López Del Prado, Gema; Blaya-Nováková, Vendula; Saz-Parkinson, Zuleika; Álvarez-Montero, Óscar Luis; Ayala, Alba; Muñoz-Moreno, Maria Fe; Forjaz, Maria João

Dental injuries incurred during endotracheal intubation are more frequent in patients with previous oral pathology. The study objectives were to develop an oral health questionnaire for preanaesthesia evaluation, easy to apply for personnel without special dental training; and establish a cut-off value for detecting persons with poor oral health. Validation study of a self-administered questionnaire, designed according to a literature review and an expert group's recommendations. The questionnaire was applied to a sample of patients evaluated in a preanaesthesia consultation. Rasch analysis of the questionnaire psychometric properties included viability, acceptability, content validity and reliability of the scale. The sample included 115 individuals, 50.4% of men, with a median age of 58 years (range: 38-71). The final analysis of 11 items presented a Person Separation Index of 0.861 and good adjustment of data to the Rasch model. The scale was unidimensional and its items were not biased by sex, age or nationality. The oral health linear measure presented good construct validity. The cut-off value was set at 52 points. The questionnaire showed sufficient psychometric properties to be considered a reliable tool, valid for measuring the state of oral health in preoperative anaesthetic evaluations. Copyright © 2016 Sociedade Brasileira de Anestesiologia. Publicado por Elsevier Editora Ltda. All rights reserved.
Good practices in normal childbirth: reliability analysis of an instrument by Cronbach’s Alpha

PubMed Central

Gottems, Leila Bernarda Donato; Carvalho, Elisabete Mesquita Peres De; Guilhem, Dirce; Pires, Maria Raquel Gomes Maia

2018-01-01

ABSTRACT Objectives: to analyze the internal consistency of the instrument for evaluating adherence to good practices of childbirth and birth care among professionals, through Cronbach’s alpha coefficient for each of the dimensions and for the total instrument. Method: this is a descriptive and cross-sectional study performed in obstetric centers of eleven public hospitals in the Federal District, with a questionnaire applied to 261 professionals who worked in delivery care. Results: the study included 261 professionals, 42.5% (111) nurses and 57.5% (150) physicians. The reliability evaluation of the instrument by Cronbach’s alpha resulted in 0.53, 0.78 and 0.76 for dimensions 1, 2 and 3, after item refinement that resulted in the exclusion of 11 items. Conclusions: the instrument obtained a Cronbach’s alpha of 0.80. There is a need for improvement in the items of dimension 1 that refer to attitudes, knowledge, and practices of the organization of the network of care for gestation, childbirth, and birth. However, it can be applied in its current form to evaluate practices based on scientific evidence of childbirth care. PMID:29791667
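For readers unfamiliar with the coefficient reported above, here is a minimal sketch of the standard Cronbach's alpha formula: k/(k-1) multiplied by one minus the ratio of summed item variances to the variance of the total score. The response matrix is invented for illustration and has nothing to do with the study's data; the function name is likewise hypothetical.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a respondents-by-items matrix (list of rows)."""
    k = len(scores[0])                       # number of items
    items = list(zip(*scores))               # transpose: one tuple per item
    item_var_sum = sum(variance(col) for col in items)
    total_var = variance([sum(row) for row in scores])
    return (k / (k - 1)) * (1.0 - item_var_sum / total_var)


# Hypothetical 5 respondents x 3 Likert items.
responses = [
    [4, 5, 4],
    [2, 3, 2],
    [5, 5, 4],
    [1, 2, 2],
    [3, 3, 3],
]
print(round(cronbach_alpha(responses), 3))
```

Running the same function on each dimension's columns separately, and then on all columns together, would give per-dimension and whole-instrument values of the kind the abstract reports.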
A psychometric evaluation of the digital logic concept inventory

NASA Astrophysics Data System (ADS)

Herman, Geoffrey L.; Zilles, Craig; Loui, Michael C.

2014-10-01

Concept inventories hold tremendous promise for promoting the rigorous evaluation of teaching methods that might remedy common student misconceptions and promote deep learning. The measurements from concept inventories can be trusted only if the concept inventories are evaluated both by expert feedback and statistical scrutiny (psychometric evaluation). Classical Test Theory and Item Response Theory provide two psychometric frameworks for evaluating the quality of assessment tools. We discuss how these theories can be applied to assessment tools generally and then apply them to the Digital Logic Concept Inventory (DLCI). We demonstrate that the DLCI is sufficiently reliable for research purposes when used in its entirety and as a post-course assessment of students' conceptual understanding of digital logic. The DLCI can also discriminate between students across a wide range of ability levels, providing the most information about weaker students' ability levels.
[Reconsidering evaluation criteria regarding health care research: toward an integrative framework of quantitative and qualitative criteria].

PubMed

Miyata, Hiroaki; Kai, Ichiro

2006-05-01

Debate about the relationship between quantitative and qualitative paradigms is often muddled and confused, and the clutter of terms and arguments has resulted in the concepts becoming obscure and unrecognizable. It is therefore very important to reconsider evaluation criteria regarding rigor in social science. As Lincoln & Guba have already compared quantitative paradigms (validity, reliability, neutrality, generalizability) with qualitative paradigms (credibility, dependability, confirmability, transferability), we discuss the use of evaluation criteria from a pragmatic perspective. Validity/Credibility is the paradigm concerned with the observational framework, while Reliability/Dependability refers to the range of stability in observations, Neutrality/Confirmability reflects influences between observers and subjects, and Generalizability/Transferability differ epistemologically in the way findings are applied. Qualitative studies, however, do not always choose the qualitative paradigms. If we assume stability to some extent, it is better to use the quantitative paradigm (reliability). Moreover, as a quantitative study cannot always guarantee a perfect observational framework with stability in all phases of observations, it is useful to use qualitative paradigms to enhance the rigor of the study.
Translation and validation of the new version of the Knee Society Score - The 2011 KS Score - into Brazilian Portuguese.

PubMed

Silva, Adriana Lucia Pastore E; Croci, Alberto Tesconi; Gobbi, Riccardo Gomes; Hinckel, Betina Bremer; Pecora, José Ricardo; Demange, Marco Kawamura

2017-01-01

Translation, cultural adaptation, and validation of the new version of the Knee Society Score - The 2011 KS Score - into Brazilian Portuguese and verification of its measurement properties, reproducibility, and validity. In 2012, the new version of the Knee Society Score was developed and validated. This scale comprises four separate subscales: (a) objective knee score (seven items: 100 points); (b) patient satisfaction score (five items: 40 points); (c) patient expectations score (three items: 15 points); and (d) functional activity score (19 items: 100 points). A total of 90 patients aged 55-85 years were evaluated in a clinical cross-sectional study. The pre-operative translated version was applied to patients with TKA referral, and the post-operative translated version was applied to patients who underwent TKA. Each patient answered the same questionnaire twice and was evaluated by two experts in orthopedic knee surgery. Evaluations were performed pre-operatively and three, six, or 12 months post-operatively. The reliability of the questionnaire was evaluated using the intraclass correlation coefficient (ICC) between the two applications. Internal consistency was evaluated using Cronbach's alpha. The ICC found no difference between the means of the pre-operative, three-month, and six-month post-operative evaluations between sub-scale items. The Brazilian Portuguese version of The 2011 KS Score is a valid and reliable instrument for objective and subjective evaluation of the functionality of Brazilian patients who undergo TKA and revision TKA.
Human alteration of the rural landscape: Variations in visual perception

DOE Office of Scientific and Technical Information (OSTI.GOV)

Cloquell-Ballester, Vicente-Agustin, E-mail: cloquell@dpi.upv.es; Carmen Torres-Sibille, Ana del; Cloquell-Ballester, Victor-Andres

2012-01-15

The objective of this investigation is to evaluate how visual perception varies as the rural landscape is altered by human interventions of varying character. An experiment is carried out using Semantic Differential Analysis to analyse the effect of the character and the type of the intervention on perception. Interventions are divided into 'elements of permanent industrial character', 'elements of permanent rural character' and 'elements of temporary character', and these categories are sub-divided into smaller groups according to the type of development. To increase the reliability of the results, the Intraclass Correlation Coefficient tool is applied to validate the semantic space of the perceptual responses and to determine the number of subjects required for a reliable evaluation of the scenes.
Validity and reliability of the abdominal test and evaluation systems tool (ABTEST) to accurately measure abdominal force.

PubMed

Glenn, Jordan M; Galey, Madeline; Edwards, Abigail; Rickert, Bradley; Washington, Tyrone A

2015-07-01

Ability to generate force from the core musculature is a critical factor for sports and general activities with insufficiencies predisposing individuals to injury. This study evaluated isometric force production as a valid and reliable method of assessing abdominal force using the abdominal test and evaluation systems tool (ABTEST). Secondary analysis estimated 1-repetition maximum on commercially available abdominal machine compared to maximum force and average power on ABTEST system. This study utilized test-retest reliability and comparative analysis for validity. Reliability was measured using test-retest design on ABTEST. Validity was measured via comparison to estimated 1-repetition maximum on a commercially available abdominal device. Participants applied isometric, abdominal force against a transducer and muscular activation was evaluated measuring normalized electromyographic activity at the rectus-abdominus, rectus-femoris, and erector-spinae. Test, re-test force production on ABTEST was significantly correlated (r=0.84; p<0.001). Mean electromyographic activity for the rectus-abdominus (72.93% and 75.66%), rectus-femoris (6.59% and 6.51%), and erector-spinae (6.82% and 5.48%) were observed for trial-1 and trial-2, respectively. Significant correlations for the estimated 1-repetition maximum were found for average power (r=0.70, p=0.002) and maximum force (r=0.72, p<0.001). Data indicate the ABTEST can accurately measure rectus-abdominus force isolated from hip-flexor involvement. Negligible activation of erector-spinae substantiates little subjective effort among participants in the lower back. Results suggest ABTEST is a valid and reliable method of evaluating abdominal force. Copyright © 2014 Sports Medicine Australia. Published by Elsevier Ltd. All rights reserved.
Operation Reliability Assessment for Cutting Tools by Applying a Proportional Covariate Model to Condition Monitoring Information

PubMed Central

Cai, Gaigai; Chen, Xuefeng; Li, Bing; Chen, Baojia; He, Zhengjia

2012-01-01

The reliability of cutting tools is critical to machining precision and production efficiency. The conventional statistic-based reliability assessment method aims at providing a general and overall estimation of reliability for a large population of identical units under given and fixed conditions. However, it has limited effectiveness in depicting the operational characteristics of a cutting tool. To overcome this limitation, this paper proposes an approach to assess the operation reliability of cutting tools. A proportional covariate model is introduced to construct the relationship between operation reliability and condition monitoring information. The wavelet packet transform and an improved distance evaluation technique are used to extract sensitive features from vibration signals, and a covariate function is constructed based on the proportional covariate model. Ultimately, the failure rate function of the cutting tool being assessed is calculated using the baseline covariate function obtained from a small sample of historical data. Experimental results and a comparative study show that the proposed method is effective for assessing the operation reliability of cutting tools. PMID:23201980
The reliability and validity of cervical auscultation in the diagnosis of dysphagia: a systematic review.

PubMed

Lagarde, Marloes L J; Kamalski, Digna M A; van den Engel-Hoek, Lenie

2016-02-01

To systematically review the available evidence for the reliability and validity of cervical auscultation in diagnosing the several aspects of dysphagia in adults and children suffering from dysphagia. Medline (PubMed), Embase and the Cochrane Library databases. The systematic review was carried out applying the steps of the PRISMA-statement. The methodological quality of the included studies was evaluated using the Dutch 'Cochrane checklist for diagnostic accuracy studies'. A total of 90 articles were identified through the search strategy, and after applying the inclusion and exclusion criteria, six articles were included in this review. In the six studies, 197 patients were assessed with cervical auscultation. Two of the six articles were considered to be of 'good' quality and three studies were of 'moderate' quality. One article was excluded because of 'poor' methodological quality. Sensitivity ranges from 23%-94% and specificity ranges from 50%-74%. Inter-rater reliability was 'poor' or 'fair' in all studies. The intra-rater reliability shows a wide variance among speech language therapists. In this systematic review, conflicting evidence is found for the validity of cervical auscultation. The reliability of cervical auscultation is insufficient when used as a stand-alone tool in the diagnosis of dysphagia in adults. There is no available evidence for the validity and reliability of cervical auscultation in children. Cervical auscultation should not be used as a stand-alone instrument to diagnose dysphagia. © The Author(s) 2015.
Novel Strength Test Battery to Permit Evidence-Based Paralympic Classification

PubMed Central

Beckman, Emma M.; Newcombe, Peter; Vanlandewijck, Yves; Connick, Mark J.; Tweedy, Sean M.

2014-01-01

Abstract Ordinal-scale strength assessment methods currently used in Paralympic athletics classification prevent the development of evidence-based classification systems. This study evaluated a battery of 7 ratio-scale isometric tests with the aim of facilitating the development of evidence-based methods of classification. This study aimed to report sex-specific normal performance ranges, evaluate test–retest reliability, and evaluate the relationship between the measures and body mass. Body mass and strength measures were obtained from 118 participants—63 males and 55 females—ages 23.2 years ± 3.7 (mean ± SD). Seventeen participants completed the battery twice to evaluate test–retest reliability. The body mass–strength relationship was evaluated using Pearson correlations and allometric exponents. Conventional patterns of force production were observed. Reliability was acceptable (mean intraclass correlation = 0.85). Eight measures had moderate significant correlations with body size (r = 0.30–0.61). Allometric exponents were higher in males than in females (mean 0.99 vs 0.30). Results indicate that this comprehensive and parsimonious battery is an important methodological advance because it has psychometric properties critical for the development of evidence-based classification. Measures were interrelated with body size, indicating further research is required to determine whether raw measures require normalization in order to be validly applied in classification. PMID:25068950
Reliability of Pressure Ulcer Rates: How Precisely Can We Differentiate Among Hospital Units, and Does the Standard Signal‐Noise Reliability Measure Reflect This Precision?

PubMed Central

Cramer, Emily

2016-01-01

Abstract Hospital performance reports often include rankings of unit pressure ulcer rates. Differentiating among units on the basis of quality requires reliable measurement. Our objectives were to describe and apply methods for assessing reliability of hospital‐acquired pressure ulcer rates and evaluate a standard signal‐noise reliability measure as an indicator of precision of differentiation among units. Quarterly pressure ulcer data from 8,199 critical care, step‐down, medical, surgical, and medical‐surgical nursing units from 1,299 US hospitals were analyzed. Using beta‐binomial models, we estimated between‐unit variability (signal) and within‐unit variability (noise) in annual unit pressure ulcer rates. Signal‐noise reliability was computed as the ratio of between‐unit variability to the total of between‐ and within‐unit variability. To assess precision of differentiation among units based on ranked pressure ulcer rates, we simulated data to estimate the probabilities of a unit's observed pressure ulcer rate rank in a given sample falling within five and ten percentiles of its true rank, and the probabilities of units with ulcer rates in the highest quartile and highest decile being identified as such. We assessed the signal‐noise measure as an indicator of differentiation precision by computing its correlations with these probabilities. Pressure ulcer rates based on a single year of quarterly or weekly prevalence surveys were too susceptible to noise to allow for precise differentiation among units, and signal‐noise reliability was a poor indicator of precision of differentiation. To ensure precise differentiation on the basis of true differences, alternative methods of assessing reliability should be applied to measures purported to differentiate among providers or units based on quality. © 2016 The Authors. Research in Nursing & Health published by Wiley Periodicals, Inc. PMID:27223598
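The signal-noise ratio defined in the abstract above can be written in a few lines. The sketch below is an illustration only: it takes the between-unit (signal) and within-unit (noise) variances as given, whereas the study estimated them with beta-binomial models. The `binomial_noise` helper is a simplifying assumption, and every number is hypothetical.

```python
def signal_noise_reliability(between_var, within_var):
    """Between-unit variance divided by total (between + within) variance."""
    if between_var < 0 or within_var < 0:
        raise ValueError("variances must be non-negative")
    total = between_var + within_var
    if total == 0:
        raise ValueError("total variance is zero; reliability is undefined")
    return between_var / total


def binomial_noise(rate, n_patients):
    """Sampling variance of an observed rate under a plain binomial model.

    This is a stand-in for the within-unit component, not the study's
    beta-binomial estimate.
    """
    return rate * (1.0 - rate) / n_patients


# Hypothetical unit: true-rate variance across units of 0.0004,
# a 5% ulcer rate, and a varying number of surveyed patients.
for n in (50, 200, 800):
    noise = binomial_noise(0.05, n)
    print(n, round(signal_noise_reliability(0.0004, noise), 3))
```

Under these assumptions the ratio rises as more patients are surveyed per unit, because the noise term shrinks while the signal term stays fixed. The abstract's caution still applies: a high ratio does not by itself guarantee that ranked units can be differentiated precisely.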
[Validation and reliability study of the parent concerns about surgery questionnaire: What worries parents?]

PubMed

Gironés Muriel, Alberto; Campos Segovia, Ana; Ríos Gómez, Patricia

2018-01-01

The study of mediating variables and psychological responses to child surgery involves the evaluation of both the patient and the parents as regards different stressors. To have a reliable and reproducible valid evaluation tool that assesses the level of paternal involvement in relation to different stressors in the setting of surgery. A self-report questionnaire study was completed by 123 subjects of both sexes, subdivided into 2 populations, due to their relationship with the hospital setting. The items were determined by a group of experts and analysed using the Lawshe validity index to determine a first validity of content. Subsequently, the reliability of the tool was determined by an item-re-item analysis of the 2 sub-populations. A factor analysis was performed to analyse construct validity, using maximum likelihood extraction with varimax rotation. A questionnaire of paternal concern was offered, consisting of 21 items with a Cronbach coefficient of 0.97, giving good precision and stability. The subsequent factor analysis gives adequate validity to the questionnaire, with the determination of 10 common stressors that cover 74.08% of the common and non-common variance of the questionnaire. The proposed questionnaire is reliable, valid and easy to apply, and is developed to assess the level of paternal concern about the surgery of a child and to be able to apply measures and programs through the prior assessment of these elements. Copyright © 2016 Asociación Española de Pediatría. Publicado por Elsevier España, S.L.U. All rights reserved.
Software Reliability 2002

NASA Technical Reports Server (NTRS)

Wallace, Dolores R.

2003-01-01

In FY01 we learned that hardware reliability models need substantial changes to account for differences in software, thus making software reliability measurements more effective, accurate, and easier to apply. These reliability models are generally based on familiar distributions or parametric methods. An obvious question is 'What new statistical and probability models can be developed using non-parametric and distribution-free methods instead of the traditional parametric method?' Two approaches to software reliability engineering appear somewhat promising. The first study, begun in FY01, is based in hardware reliability, a very well established science that has many aspects that can be applied to software. This research effort has investigated mathematical aspects of hardware reliability and has identified those applicable to software. Currently the research effort is applying and testing these approaches to software reliability measurement. These parametric models require much project data that may be difficult to apply and interpret. Projects at GSFC are often complex in both technology and schedules. Assessing and estimating reliability of the final system is extremely difficult when various subsystems are tested and completed long before others. Parametric and distribution free techniques may offer a new and accurate way of modeling failure time and other project data to provide earlier and more accurate estimates of system reliability.
Application of reliability-centered-maintenance to BWR ECCS motor operator valve performance

DOE Office of Scientific and Technical Information (OSTI.GOV)

Feltus, M.A.; Choi, Y.A.

1993-01-01

This paper describes the application of reliability-centered maintenance (RCM) methods to plant probabilistic risk assessment (PRA) and safety analyses for four boiling water reactor emergency core cooling systems (ECCSs): (1) high-pressure coolant injection (HPCI); (2) reactor core isolation cooling (RCIC); (3) residual heat removal (RHR); and (4) core spray systems. Reliability-centered maintenance is a system function-based technique for improving a preventive maintenance program that is applied on a component basis. Those components that truly affect plant function are identified, and maintenance tasks are focused on preventing their failures. The RCM evaluation establishes the relevant criteria that preserve system function so that an RCM-focused approach can be flexible and dynamic.
Reliability of quantitative EEG (qEEG) measures and LORETA current source density at 30 days.

PubMed

Cannon, Rex L; Baldwin, Debora R; Shaw, Tiffany L; Diloreto, Dominic J; Phillips, Sherman M; Scruggs, Annie M; Riehl, Timothy C

2012-06-14

There is a growing interest in using quantitative EEG and LORETA current source density in clinical and research settings. Importantly, if these indices are to be employed in clinical settings then the reliability of these measures is of great concern. Neuroguide (Applied Neurosciences) is sophisticated software developed for the analyses of power and connectivity measures of the EEG as well as LORETA current source density. To date there are relatively few data evaluating topographical EEG reliability contrasts for all 19 channels and no studies have evaluated reliability for LORETA calculations. We obtained 4 min eyes-closed and eyes-opened EEG recordings at 30-day intervals. The EEG was analyzed in Neuroguide, and FFT power, coherence and phase were computed for traditional frequency bands (delta, theta, alpha and beta), and LORETA current source density was calculated in 1 Hz increments and summed for total power in eight regions of interest (ROI). In order to obtain a robust measure of reliability we utilized a random effects model with an absolute agreement definition. The results show very good reproducibility for total absolute power and coherence. Phase shows lower reliability coefficients. LORETA current source density shows very good reliability with an average 0.81 for ECB and 0.82 for EOB. Similarly, the eight regions of interest show good to very good agreement across time. Implications for future directions and use of qEEG and LORETA in clinical populations are discussed. Copyright © 2012 Elsevier Ireland Ltd. All rights reserved.
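A "random effects model with an absolute agreement definition" is usually reported as a two-way random-effects intraclass correlation. The sketch below computes the single-measurement form, often labelled ICC(2,1), from the mean squares of a subjects-by-sessions table. The abstract does not say whether single or average measures were used, so that choice is an assumption, and the data are made up.

```python
def icc_absolute_agreement(data):
    """ICC(2,1): two-way random effects, absolute agreement, single measure.

    `data` is a list of rows, one per subject, with one value per session.
    """
    n = len(data)        # subjects
    k = len(data[0])     # sessions (or raters)
    grand = sum(sum(row) for row in data) / (n * k)
    row_means = [sum(row) / k for row in data]
    col_means = [sum(row[j] for row in data) / n for j in range(k)]

    # Two-way ANOVA sums of squares without replication.
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for row in data for x in row)
    ss_error = ss_total - ss_rows - ss_cols

    ms_rows = ss_rows / (n - 1)
    ms_cols = ss_cols / (k - 1)
    ms_error = ss_error / ((n - 1) * (k - 1))

    return (ms_rows - ms_error) / (
        ms_rows + (k - 1) * ms_error + k * (ms_cols - ms_error) / n
    )


# Hypothetical absolute power values for 5 subjects at day 0 and day 30.
power = [
    [10.2, 10.8],
    [14.1, 13.5],
    [8.7, 9.4],
    [12.0, 12.3],
    [15.6, 14.9],
]
print(round(icc_absolute_agreement(power), 3))
```

Unlike a consistency-type coefficient, the absolute agreement form penalises a systematic shift between sessions. A constant offset at day 30 lowers the value even when subjects keep the same rank order.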
Evaluating Written Patient Information for Eczema in German: Comparing the Reliability of Two Instruments, DISCERN and EQIP

PubMed Central

McCool, Megan E.; Wahl, Josepha; Schlecht, Inga; Apfelbacher, Christian

2015-01-01

Patients actively seek information about how to cope with their health problems, but the quality of the information available varies. A number of instruments have been developed to assess the quality of patient information, primarily though in English. Little is known about the reliability of these instruments when applied to patient information in German. The objective of our study was to investigate and compare the reliability of two validated instruments, DISCERN and EQIP, in order to determine which of these instruments is better suited for a further study pertaining to the quality of information available to German patients with eczema. Two independent raters evaluated a random sample of 20 informational brochures in German. All the brochures addressed eczema as a disorder and/or therapy options and care. Intra-rater and inter-rater reliability were assessed by calculating intra-class correlation coefficients, agreement was tested with weighted kappas, and the correlation of the raters’ scores for each instrument was measured with Pearson’s correlation coefficient. DISCERN demonstrated substantial intra- and inter-rater reliability. It also showed slightly better agreement than EQIP. There was a strong correlation of the raters’ scores for both instruments. The findings of this study support the reliability of both DISCERN and EQIP. However, based on the results of the inter-rater reliability, agreement and correlation analyses, we consider DISCERN to be the more precise tool for our project on patient information concerning the treatment and care of eczema. PMID:26440612
Evaluating Written Patient Information for Eczema in German: Comparing the Reliability of Two Instruments, DISCERN and EQIP.

PubMed

McCool, Megan E; Wahl, Josepha; Schlecht, Inga; Apfelbacher, Christian

2015-01-01

Patients actively seek information about how to cope with their health problems, but the quality of the information available varies. A number of instruments have been developed to assess the quality of patient information, primarily though in English. Little is known about the reliability of these instruments when applied to patient information in German. The objective of our study was to investigate and compare the reliability of two validated instruments, DISCERN and EQIP, in order to determine which of these instruments is better suited for a further study pertaining to the quality of information available to German patients with eczema. Two independent raters evaluated a random sample of 20 informational brochures in German. All the brochures addressed eczema as a disorder and/or therapy options and care. Intra-rater and inter-rater reliability were assessed by calculating intra-class correlation coefficients, agreement was tested with weighted kappas, and the correlation of the raters' scores for each instrument was measured with Pearson's correlation coefficient. DISCERN demonstrated substantial intra- and inter-rater reliability. It also showed slightly better agreement than EQIP. There was a strong correlation of the raters' scores for both instruments. The findings of this study support the reliability of both DISCERN and EQIP. However, based on the results of the inter-rater reliability, agreement and correlation analyses, we consider DISCERN to be the more precise tool for our project on patient information concerning the treatment and care of eczema.
Estimation of reliability and dynamic property for polymeric material at high strain rate using SHPB technique and probability theory

NASA Astrophysics Data System (ADS)

Kim, Dong Hyeok; Lee, Ouk Sub; Kim, Hong Min; Choi, Hye Bin

2008-11-01

A modified Split Hopkinson Pressure Bar (SHPB) technique with aluminum pressure bars and a pulse shaper is used to achieve a closer impedance match between the pressure bars and specimen materials such as high-temperature degraded POM (Poly Oxy Methylene) and PP (Poly Propylene). More distinguishable experimental signals were obtained, allowing a more accurate evaluation of the dynamic deformation behavior of the materials under high strain rate loading. A pulse shaping technique is introduced to reduce the non-equilibrium in the dynamic material response by modulating the incident wave during the short test period. This increases the rise time of the incident pulse in the SHPB experiment. For the dynamic stress-strain curve obtained from the SHPB experiment, the Johnson-Cook model is applied as a constitutive equation. The applicability of this constitutive equation is verified by using the probabilistic reliability estimation method. Two reliability methodologies, the FORM and the SORM, are considered. The limit state function (LSF) includes the Johnson-Cook model and applied stresses. The LSF in this study allows more statistical flexibility on the yield stress than a previously published paper. It is found that the failure probability estimated by using the SORM is more reliable than that of the FORM. It is also noted that the failure probability increases with increasing applied stress. Moreover, according to the sensitivity analysis, the Johnson-Cook parameters A and n and the applied stress affect the failure probability more severely than the other random variables.

Study of SEM induced current and voltage contrast modes to assess semiconductor reliability

NASA Technical Reports Server (NTRS)

Beall, J. R.

1976-01-01

The purpose of the scanning electron microscopy study was to review the failure history of existing integrated circuit technologies to identify predominant failure mechanisms, and to evaluate the feasibility of their detection using SEM application techniques. The study investigated the effects of E-beam irradiation damage and contamination deposition rates; developed the necessary methods for applying the techniques to the detection of latent defects and weaknesses in integrated circuits; and made recommendations for applying the techniques.
Evaluation of fecal indicator and pathogenic bacteria originating from swine manure applied to agricultural lands using culture-based and quantitative real-time PCR methods.

EPA Science Inventory

Fecal bacteria, including those originating from concentrated animal feeding operations, are a leading contributor to water quality impairments in agricultural areas. Rapid and reliable methods are needed that can accurately characterize fecal pollution in agricultural settings....
Evaluation of Fecal Indicator and Pathogenic Bacteria Originating from Swine Manure Applied to Agricultural Lands Using Culture-Based and Quantitative Real-Time PCR Methods

EPA Science Inventory

Fecal bacteria, including those originating from concentrated animal feeding operations, are a leading contributor to water quality impairments in agricultural areas. Rapid and reliable methods are needed that can accurately characterize fecal pollution in agricultural settings....
The Recovery Knowledge Inventory for Measurement of Nursing Student Views on Recovery-oriented Mental Health Services.

PubMed

Happell, Brenda; Byrne, Louise; Platania-Phung, Chris

2015-01-01

Recovery-oriented services are a goal for policy and practice in the Australian mental health service system. Evidence-based reform requires an instrument to measure knowledge of recovery concepts. The Recovery Knowledge Inventory (RKI) was designed for this purpose; however, its suitability and validity for student health professionals have not been evaluated. The purpose of the current article is to report the psychometric features of the RKI for measuring nursing students' views on recovery. The RKI, a self-report measure, consists of four scales: (I) Roles and Responsibilities, (II) Non-Linearity of the Recovery Process, (III) Roles of Self-Definition and Peers, and (IV) Expectations Regarding Recovery. Confirmatory and exploratory factor analyses of the baseline data (n = 167) were applied to assess validity and reliability. Exploratory factor analyses generally replicated the item structure suggested by the three main scales; however, more stringent analyses (confirmatory factor analysis) did not provide strong support for convergent validity. A refined RKI with 16 items had internal reliabilities of α = .75 for Roles and Responsibilities, α = .49 for Roles of Self-Definition and Peers, and α = .72 for Recovery as Non-Linear Process. If the RKI is to be applied to nursing student populations, the conceptual underpinning of the instrument needs to be reworked, and new items should be generated to evaluate and improve scale validity and reliability.
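The internal reliabilities reported above are given as α values, which are conventionally Cronbach's alpha (an assumption here, since the abstract does not name the coefficient). A minimal Python sketch of that statistic, using made-up item scores and a hypothetical function name:

```python
from statistics import variance


def cronbach_alpha(items):
    """Cronbach's alpha for one scale.

    items: list of per-item score lists, each with one entry per respondent.
    Uses sample variances (n - 1 denominator).
    """
    k = len(items)
    # Total scale score per respondent
    totals = [sum(scores) for scores in zip(*items)]
    item_variance_sum = sum(variance(col) for col in items)
    return k / (k - 1) * (1 - item_variance_sum / variance(totals))


# Made-up responses: 2 items answered by 4 respondents
scale = [
    [1, 2, 3, 4],
    [2, 1, 4, 3],
]
print(cronbach_alpha(scale))
```

Alpha rises with the number of items and with their average intercorrelation, so a short subscale with loosely related items can yield a value as low as the .49 reported above.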
Performance of regional oxygen saturation monitoring by near-infrared spectroscopy (NIRS) in pediatric inter-hospital transports with special reference to air ambulance transports: a methodological study.

PubMed

Hamrin, Tova Hannegård; Radell, Peter J; Fläring, Urban; Berner, Jonas; Eksborg, Staffan

2017-12-28

The aim of the present study was to evaluate the performance of regional oxygen saturation (rSO2) monitoring with near infrared spectroscopy (NIRS) during pediatric inter-hospital transports and to optimize processing of the electronically stored data. Cerebral (rSO2-C) and abdominal (rSO2-A) NIRS sensors were used during transport in air ambulance and connecting ground ambulance. Data were electronically stored by the monitor during transport, extracted and analyzed off-line after the transport. After removal of all zero and floor effect values, the Savitzky-Golay algorithm of data smoothing was applied on the NIRS-signal. The second order of smoothing polynomial was used and the optimal number of neighboring points for the smoothing procedure was evaluated. NIRS-data from 38 pediatric patients was examined. Reliability, defined as measurements without values of 0 or 15%, was acceptable during transport (> 90% of all measurements). There were, however, individual patients with < 90% reliable measurements during transport, while no patient was found to have < 90% reliable measurements in hospital. Satisfactory noise reduction of the signal, without distortion of the underlying information, was achieved when 20-50 neighbors ("window-size") were used. The use of NIRS for measuring rSO2 in clinical studies during pediatric transport in ground and air-ambulance is feasible but hampered by unreliable values and signal interference. By applying the Savitzky-Golay algorithm, the signal-to-noise ratio was improved and enabled better post-hoc signal evaluation.
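The processing steps described above (drop zero and floor values, report the share of reliable samples, then apply second-order Savitzky-Golay smoothing) can be sketched in Python as follows. The function names, the edge handling, and the choice to simply drop invalid samples rather than keep their time stamps are assumptions made for illustration, not details taken from the study.

```python
def clean(values, invalid=(0, 15)):
    """Remove zero and floor-effect readings; return kept values and reliability.

    Reliability is the fraction of samples that are neither 0 nor 15 (%).
    Simplification: invalid samples are dropped, so spacing in time is lost.
    """
    kept = [v for v in values if v not in invalid]
    return kept, len(kept) / len(values)


def sg_weights(m):
    """Closed-form Savitzky-Golay weights for a quadratic fit, window 2*m + 1."""
    denom = (2 * m - 1) * (2 * m + 1) * (2 * m + 3)
    return [3 * (3 * m * m + 3 * m - 1 - 5 * i * i) / denom
            for i in range(-m, m + 1)]


def smooth(values, m):
    """Smooth with a centered window of m neighbors on each side.

    Assumption: the first and last m samples lack a full window
    and are returned unchanged.
    """
    w = sg_weights(m)
    out = list(values)
    for t in range(m, len(values) - m):
        out[t] = sum(wi * values[t + i - m] for i, wi in enumerate(w))
    return out


raw = [70, 71, 0, 69, 15, 72, 74, 73, 71, 70, 72, 75]
kept, reliability = clean(raw)
print(reliability, smooth(kept, 2))
```

A quadratic window fit reproduces any locally quadratic trend exactly, which is why a wider window can reduce noise without flattening slow changes in the signal.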
Using G-Theory to Enhance Evidence of Reliability and Validity for Common Uses of the Paulhus Deception Scales.

PubMed

Vispoel, Walter P; Morris, Carrie A; Kilinc, Murat

2018-01-01

We applied a new approach to Generalizability theory (G-theory) involving parallel splits and repeated measures to evaluate common uses of the Paulhus Deception Scales based on polytomous and four types of dichotomous scoring. G-theory indices of reliability and validity accounting for specific-factor, transient, and random-response measurement error supported use of polytomous over dichotomous scores as contamination checks; as control, explanatory, and outcome variables; as aspects of construct validation; and as indexes of environmental effects on socially desirable responding. Polytomous scoring also provided results for flagging faking as dependable as those when using dichotomous scoring methods. These findings argue strongly against the nearly exclusive use of dichotomous scoring for the Paulhus Deception Scales in practice and underscore the value of G-theory in demonstrating this. We provide guidelines for applying our G-theory techniques to other objectively scored clinical assessments, for using G-theory to estimate how changes to a measure might improve reliability, and for obtaining software to conduct G-theory analyses free of charge.
Reliability Stress-Strength Models for Dependent Observations with Applications in Clinical Trials

NASA Technical Reports Server (NTRS)

Kushary, Debashis; Kulkarni, Pandurang M.

1995-01-01

We consider the applications of stress-strength models in studies involving clinical trials. When studying the effects and side effects of certain procedures (treatments), it is often the case that observations are correlated due to subject effect, repeated measurements and observing many characteristics simultaneously. We develop maximum likelihood estimator (MLE) and uniform minimum variance unbiased estimator (UMVUE) of the reliability which in clinical trial studies could be considered as the chances of increased side effects due to a particular procedure compared to another. The results developed apply to both univariate and multivariate situations. Also, for the univariate situations we develop simple to use lower confidence bounds for the reliability. Further, we consider the cases when both stress and strength constitute time dependent processes. We define the future reliability and obtain methods of constructing lower confidence bounds for this reliability. Finally, we conduct simulation studies to evaluate all the procedures developed and also to compare the MLE and the UMVUE.
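In a stress-strength model the reliability is the probability that one random quantity falls below another, R = P(X < Y). As a hedged illustration of how dependence between observations enters, the Python sketch below assumes X and Y are jointly normal with correlation rho; the distributional assumption and the function names are illustrative and are not stated in the abstract.

```python
from math import erf, sqrt


def normal_cdf(z):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1 + erf(z / sqrt(2)))


def stress_strength_reliability(mu_x, sd_x, mu_y, sd_y, rho=0.0):
    """R = P(X < Y) for bivariate normal (X, Y) with correlation rho.

    Y - X is normal with mean mu_y - mu_x and variance
    sd_x^2 + sd_y^2 - 2 * rho * sd_x * sd_y.
    """
    sd_diff = sqrt(sd_x ** 2 + sd_y ** 2 - 2 * rho * sd_x * sd_y)
    return normal_cdf((mu_y - mu_x) / sd_diff)


# Same marginals, with and without positive correlation
print(stress_strength_reliability(0, 1, 1, 1, rho=0.0))
print(stress_strength_reliability(0, 1, 1, 1, rho=0.5))
```

Positive correlation shrinks the variance of the difference, so for a fixed mean gap the reliability moves further from 0.5 than it would under independence.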
Quality Evaluation of Raw Moutan Cortex Using the AHP and Gray Correlation-TOPSIS Method

PubMed Central

Zhou, Sujuan; Liu, Bo; Meng, Jiang

2017-01-01

Background: Raw Moutan cortex (RMC) is an important Chinese herbal medicine. Comprehensive and objective quality evaluation of Chinese herbal medicine has been one of the most important issues in modern herbal development. Objective: To evaluate and compare the quality of RMC using the weighted gray correlation-Technique for Order Preference by Similarity to an Ideal Solution (TOPSIS) method. Materials and Methods: The percentage composition of gallic acid, catechin, oxypaeoniflorin, paeoniflorin, quercetin, benzoylpaeoniflorin, and paeonol in different batches of RMC was determined, and MATLAB programming was then used to construct the gray correlation-TOPSIS assessment model for quality evaluation of RMC. Results: The quality evaluation results of model evaluation and objective evaluation were consistent, reliable, and stable. Conclusion: The gray correlation-TOPSIS model can be well applied to the quality evaluation of traditional Chinese medicine with multiple components and has broad prospects in application. SUMMARY The experiment tries to construct a model to evaluate the quality of RMC using the weighted gray correlation-Technique for Order Preference by Similarity to an Ideal Solution (TOPSIS) method. Results show the model is reliable and provides a feasible way of evaluating the quality of traditional Chinese medicine with multiple components. PMID:28839384
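For orientation, the core TOPSIS ranking step can be sketched in Python as below. This is plain weighted TOPSIS only: it omits the gray correlation step and the AHP weighting used in the study, treats every criterion as a benefit criterion, and uses made-up batch data and weights.

```python
from math import sqrt


def topsis(matrix, weights):
    """Relative closeness of each alternative (row) to the ideal solution.

    Assumptions: all criteria are benefit criteria (larger is better),
    and the alternatives are not all identical (otherwise both distances
    are zero and the ratio is undefined).
    """
    n_crit = len(weights)
    # Vector-normalize each criterion column, then apply the weights
    norms = [sqrt(sum(row[j] ** 2 for row in matrix)) for j in range(n_crit)]
    v = [[weights[j] * row[j] / norms[j] for j in range(n_crit)]
         for row in matrix]

    best = [max(r[j] for r in v) for j in range(n_crit)]
    worst = [min(r[j] for r in v) for j in range(n_crit)]

    scores = []
    for r in v:
        d_best = sqrt(sum((r[j] - best[j]) ** 2 for j in range(n_crit)))
        d_worst = sqrt(sum((r[j] - worst[j]) ** 2 for j in range(n_crit)))
        scores.append(d_worst / (d_best + d_worst))
    return scores


# Made-up contents (%) of three components in three hypothetical batches
batches = [
    [0.30, 1.20, 2.10],
    [0.25, 1.50, 1.80],
    [0.20, 0.90, 1.60],
]
print(topsis(batches, [0.2, 0.3, 0.5]))
```

A closeness near 1 means the batch sits near the best observed value on every weighted component; the third batch, lowest on all three components, scores 0.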
Evaluation of Commercial Automotive-Grade BME Capacitors

NASA Technical Reports Server (NTRS)

Liu, Donhang

2014-01-01

Three Ni-BaTiO3 ceramic capacitor lots with the same specification (chip size, capacitance, and rated voltage) and the same reliability level, made by three different manufacturers, were degraded using highly accelerated life stress testing (HALST) with the same temperature and applied voltage conditions. The reliability, as characterized by mean time to failure (MTTF), differed by more than one order of magnitude among the capacitor lots. A theoretical model based on the existence of depletion layers at grain boundaries and the entrapment of oxygen vacancies has been proposed to explain the MTTF difference among these BME capacitors. It is the conclusion of this model that reliability will not be improved simply by increasing the insulation resistance of a BME capacitor. Indeed, Ni-BaTiO3 ceramic capacitors with a smaller degradation rate constant K will always give rise to a longer reliability life.
Reliably detectable flaw size for NDE methods that use calibration

NASA Astrophysics Data System (ADS)

Koshti, Ajay M.

2017-04-01

Probability of detection (POD) analysis is used in assessing reliably detectable flaw size in nondestructive evaluation (NDE). MIL-HDBK-1823 and the associated mh1823 POD software give the most common methods of POD analysis. In this paper, POD analysis is applied to an NDE method, such as eddy current testing, where calibration is used. NDE calibration standards have artificial flaws of known size, such as electro-discharge machined (EDM) notches and flat bottom hole (FBH) reflectors, which are used to set instrument sensitivity for detection of real flaws. Real flaws such as cracks and crack-like flaws are desired to be detected using these NDE methods. A reliably detectable crack size is required for safe life analysis of fracture critical parts. Therefore, it is important to correlate signal responses from real flaws with signal responses from artificial flaws used in the calibration process to determine reliably detectable flaw size.
Reliably Detectable Flaw Size for NDE Methods that Use Calibration

NASA Technical Reports Server (NTRS)

Koshti, Ajay M.

2017-01-01

Probability of detection (POD) analysis is used in assessing reliably detectable flaw size in nondestructive evaluation (NDE). MIL-HDBK-1823 and the associated mh1823 POD software give the most common methods of POD analysis. In this paper, POD analysis is applied to an NDE method, such as eddy current testing, where calibration is used. NDE calibration standards have artificial flaws of known size, such as electro-discharge machined (EDM) notches and flat bottom hole (FBH) reflectors, which are used to set instrument sensitivity for detection of real flaws. Real flaws such as cracks and crack-like flaws are desired to be detected using these NDE methods. A reliably detectable crack size is required for safe life analysis of fracture critical parts. Therefore, it is important to correlate signal responses from real flaws with signal responses from artificial flaws used in the calibration process to determine reliably detectable flaw size.
Integrated optimization of nonlinear R/C frames with reliability constraints

NASA Technical Reports Server (NTRS)

Soeiro, Alfredo; Hoit, Marc

1989-01-01

A structural optimization algorithm that includes global displacements as decision variables was investigated. The algorithm was applied to planar reinforced concrete frames with nonlinear material behavior subjected to static loading. The flexural performance of the elements was evaluated as a function of the actual stress-strain diagrams of the materials. Formation of rotational hinges with strain hardening was allowed and the equilibrium constraints were updated accordingly. The adequacy of the frames was guaranteed by imposing as constraints required reliability indices for the members, maximum global displacements for the structure and a maximum system probability of failure.
Standardizing an approach to the evaluation of implementation science proposals.

PubMed

Crable, Erika L; Biancarelli, Dea; Walkey, Allan J; Allen, Caitlin G; Proctor, Enola K; Drainoni, Mari-Lynn

2018-05-29

The fields of implementation and improvement sciences have experienced rapid growth in recent years. However, research that seeks to inform health care change may have difficulty translating core components of implementation and improvement sciences within the traditional paradigms used to evaluate efficacy and effectiveness research. A review of implementation and improvement sciences grant proposals within an academic medical center using a traditional National Institutes of Health framework highlighted the need for tools that could assist investigators and reviewers in describing and evaluating proposed implementation and improvement sciences research. We operationalized existing recommendations for writing implementation science proposals as the ImplemeNtation and Improvement Science Proposals Evaluation CriTeria (INSPECT) scoring system. The resulting system was applied to pilot grants submitted to a call for implementation and improvement science proposals at an academic medical center. We evaluated the reliability of the INSPECT system using Krippendorff's alpha coefficients and explored the utility of the INSPECT system to characterize common deficiencies in implementation research proposals. We scored 30 research proposals using the INSPECT system. Proposals received a median cumulative score of 7 out of a possible score of 30. Across individual elements of INSPECT, proposals scored highest for criteria rating evidence of a care or quality gap. Proposals generally performed poorly on all other criteria. Most proposals received scores of 0 for criteria identifying an evidence-based practice or treatment (50%), conceptual model and theoretical justification (70%), setting's readiness to adopt new services/treatment/programs (54%), implementation strategy/process (67%), and measurement and analysis (70%). 
Inter-coder reliability testing showed excellent reliability (Krippendorff's alpha coefficient 0.88) for the application of the scoring system overall and demonstrated reliability scores ranging from 0.77 to 0.99 for individual elements. The INSPECT scoring system presents a new scoring criteria with a high degree of inter-rater reliability and utility for evaluating the quality of implementation and improvement sciences grant proposals.
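Krippendorff's alpha, the inter-coder statistic reported above, compares observed disagreement with the disagreement expected by chance. The Python sketch below implements only the nominal-data version; treating the scores as nominal categories is an assumption (the study's rating scale may call for an ordinal metric), and the function name and the two coders' ratings are made up.

```python
from collections import Counter
from itertools import permutations


def krippendorff_alpha_nominal(units):
    """Krippendorff's alpha for nominal codes.

    units: list of lists; each inner list holds the codes that the coders
    assigned to one unit (missing codes are simply left out).
    Assumption: at least two different codes occur overall, otherwise the
    expected disagreement is zero and alpha is undefined.
    """
    # Coincidence matrix: every ordered pair of codes within a unit,
    # weighted by 1 / (number of codes in that unit - 1)
    coincidences = Counter()
    for codes in units:
        m = len(codes)
        if m < 2:
            continue  # a unit coded only once cannot be paired
        for a, b in permutations(codes, 2):
            coincidences[(a, b)] += 1 / (m - 1)

    totals = Counter()
    for (a, _), count in coincidences.items():
        totals[a] += count
    n = sum(totals.values())

    observed = sum(c for (a, b), c in coincidences.items() if a != b) / n
    expected = sum(totals[a] * totals[b]
                   for a in totals for b in totals if a != b) / (n * (n - 1))
    return 1 - observed / expected


# Made-up binary codes from two coders on ten proposals
coder_1 = [0, 1, 0, 0, 0, 0, 0, 0, 1, 0]
coder_2 = [1, 1, 1, 0, 0, 1, 0, 0, 0, 0]
print(krippendorff_alpha_nominal([list(pair) for pair in zip(coder_1, coder_2)]))
```

Values near 1 indicate agreement well beyond chance, while values near 0 indicate agreement no better than chance, even when raw percent agreement looks respectable.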
A New Look to Nuclear Data

DOE PAGES

McCutchan, E. A.; Brown, D. A.; Sonzogni, A. A.

2017-03-30

Databases of evaluated nuclear data form a cornerstone on which we build academic nuclear structure physics, reaction physics, astrophysics, and many applied nuclear technologies. In basic research, nuclear data are essential for selecting, designing and conducting experiments, and for the development and testing of theoretical models to understand the fundamental properties of atomic nuclei. Likewise, the applied fields of nuclear power, homeland security, stockpile stewardship and nuclear medicine, all have deep roots requiring evaluated nuclear data. Each of these fields requires rapid and easy access to up-to-date, comprehensive and reliable databases. The DOE-funded US Nuclear Data Program is a specific and coordinated effort tasked to compile, evaluate and disseminate nuclear structure and reaction data such that it can be used by the world-wide nuclear physics community.
A New Look to Nuclear Data

DOE Office of Scientific and Technical Information (OSTI.GOV)

McCutchan, E. A.; Brown, D. A.; Sonzogni, A. A.

Databases of evaluated nuclear data form a cornerstone on which we build academic nuclear structure physics, reaction physics, astrophysics, and many applied nuclear technologies. In basic research, nuclear data are essential for selecting, designing and conducting experiments, and for the development and testing of theoretical models to understand the fundamental properties of atomic nuclei. Likewise, the applied fields of nuclear power, homeland security, stockpile stewardship and nuclear medicine, all have deep roots requiring evaluated nuclear data. Each of these fields requires rapid and easy access to up-to-date, comprehensive and reliable databases. The DOE-funded US Nuclear Data Program is a specific and coordinated effort tasked to compile, evaluate and disseminate nuclear structure and reaction data such that it can be used by the world-wide nuclear physics community.
The probability estimation of the electronic lesson implementation taking into account software reliability

NASA Astrophysics Data System (ADS)

Gurov, V. V.

2017-01-01

Software tools for educational purposes, such as e-lessons and computer-based testing systems, have a number of distinctive features from the point of view of reliability. The main ones are the need to ensure a sufficiently high probability of faultless operation for a specified time and the impossibility of rapid recovery by replacing a failed program with a similar running program during classes. The article considers the peculiarities of reliability evaluation of programs in contrast to assessments of hardware reliability. The basic requirements to reliability of software used for carrying out practical and laboratory classes in the form of computer-based training programs are given. The essential requirements applicable to the reliability of software used for conducting the practical and laboratory studies in the form of computer-based teaching programs are also described. A mathematical tool based on Markov chains is presented; it makes it possible to determine the degree of debugging of a training program for use in the educational process by applying the graph of software module interactions.
Characterizing the reliability of a bioMEMS-based cantilever sensor

NASA Astrophysics Data System (ADS)

Bhalerao, Kaustubh D.

2004-12-01

The cantilever-based BioMEMS sensor represents one instance from many competing ideas of biosensor technology based on Micro Electro Mechanical Systems. The advancement of BioMEMS from laboratory-scale experiments to applications in the field will require standardization of their components and manufacturing procedures as well as frameworks to evaluate their performance. Reliability, the likelihood with which a system performs its intended task, is a compact mathematical description of its performance. The mathematical and statistical foundation of systems-reliability has been applied to the cantilever-based BioMEMS sensor. The sensor is designed to detect one aspect of human ovarian cancer, namely the over-expression of the folate receptor surface protein (FR-alpha). Even as the application chosen is clinically motivated, the objective of this study was to demonstrate the underlying systems-based methodology used to design, develop and evaluate the sensor. The framework development can be readily extended to other BioMEMS-based devices for disease detection and will have an impact in the rapidly growing $30 bn industry. The Unified Modeling Language (UML) is a systems-based framework for design and development of object-oriented information systems which has potential application for use in systems designed to interact with biological environments. The UML has been used to abstract and describe the application of the biosensor, to identify key components of the biosensor, and the technology needed to link them together in a coherent manner. The use of the framework is also demonstrated in computation of system reliability from first principles as a function of the structure and materials of the biosensor. The outcomes of applying the systems-based framework to the study are the following: (1) Characterizing the cantilever-based MEMS device for disease (cell) detection. 
(2) Development of a novel chemical interface between the analyte and the sensor that provides a degree of selectivity towards the disease. (3) Demonstrating the performance and measuring the reliability of the biosensor prototype, and (4) Identification of opportunities in technological development in order to further refine the proposed biosensor. Application of the methodology to design, develop and evaluate the reliability of BioMEMS devices will be beneficial in streamlining the growth of the BioMEMS industry, while providing a decision-support tool for comparing and adopting suitable technologies from available competing options.
Reliable Change Indices and Standardized Regression-Based Change Score Norms for Evaluating Neuropsychological Change in Children with Epilepsy

PubMed Central

Busch, Robyn M.; Lineweaver, Tara T.; Ferguson, Lisa; Haut, Jennifer S.

2015-01-01

Reliable change index scores (RCIs) and standardized regression-based change score norms (SRBs) permit evaluation of meaningful changes in test scores following treatment interventions, like epilepsy surgery, while accounting for test-retest reliability, practice effects, score fluctuations due to error, and relevant clinical and demographic factors. Although these methods are frequently used to assess cognitive change after epilepsy surgery in adults, they have not been widely applied to examine cognitive change in children with epilepsy. The goal of the current study was to develop RCIs and SRBs for use in children with epilepsy. Sixty-three children with epilepsy (age range 6–16; M=10.19, SD=2.58) underwent comprehensive neuropsychological evaluations at two time points an average of 12 months apart. Practice-adjusted RCIs and SRBs were calculated for all cognitive measures in the battery. Practice effects were quite variable across the neuropsychological measures, with the greatest differences observed among older children, particularly on the Children’s Memory Scale and Wisconsin Card Sorting Test. There was also notable variability in test-retest reliabilities across measures in the battery, with coefficients ranging from 0.14 to 0.92. RCIs and SRBs for use in assessing meaningful cognitive change in children following epilepsy surgery are provided for measures with reliability coefficients above 0.50. This is the first study to provide RCIs and SRBs for a comprehensive neuropsychological battery based on a large sample of children with epilepsy. Tables to aid in evaluating cognitive changes in children who have undergone epilepsy surgery are provided for clinical use. An Excel sheet to perform all relevant calculations is also available to interested clinicians or researchers. PMID:26043163
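A practice-adjusted reliable change index is often computed by subtracting the mean practice effect from the observed change and dividing by the standard error of the difference. The Python sketch below uses one widely cited form of that calculation; the abstract does not give the exact formula the authors used, so the formula choice, the function name, and the numbers are assumptions.

```python
from math import sqrt


def practice_adjusted_rci(baseline, retest, mean_practice, sd_baseline, r_xx):
    """Practice-adjusted reliable change index for one child on one measure.

    Assumed formula (one common variant, not confirmed by the abstract):
        SEM    = sd_baseline * sqrt(1 - r_xx)
        SEdiff = sqrt(2) * SEM
        RCI    = ((retest - baseline) - mean_practice) / SEdiff
    mean_practice is the average retest gain in the reference sample and
    r_xx is the test-retest reliability coefficient of the measure.
    """
    sem = sd_baseline * sqrt(1 - r_xx)
    se_diff = sqrt(2) * sem
    return ((retest - baseline) - mean_practice) / se_diff


# Made-up example: a 10-point drop on a measure with SD 10, r = 0.75,
# and an average practice gain of 2 points
rci = practice_adjusted_rci(100, 90, 2, 10, 0.75)
print(round(rci, 2))
```

An absolute value above a chosen cutoff (1.645 for a 90% interval is a common convention) is read as change beyond measurement error and practice. Lower test-retest reliability inflates the standard error, which is consistent with norms being reported only for measures with coefficients above 0.50.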
Thermal Adaptation Methods of Urban Plaza Users in Asia's Hot-Humid Regions: A Taiwan Case Study.

PubMed

Wu, Chen-Fa; Hsieh, Yen-Fen; Ou, Sheng-Jung

2015-10-27

Thermal adaptation studies provide researchers great insight to help understand how people respond to thermal discomfort. This research aims to assess outdoor urban plaza conditions in hot and humid regions of Asia by conducting an evaluation of thermal adaptation. We also propose questionnaire items that are appropriate for determining thermal adaptation strategies adopted by urban plaza users. A literature review was conducted, and first-hand data from field observations and interviews were used to collect information on thermal adaptation strategies. Item analysis, comprising Exploratory Factor Analysis (EFA) and Confirmatory Factor Analysis (CFA), was applied to refine the questionnaire items and determine the reliability of the questionnaire evaluation procedure. The reliability and validity of the items and the construction process were also analyzed. Then, researchers facilitated an evaluation procedure for assessing the thermal adaptation strategies of urban plaza users in hot and humid regions of Asia and formulated a questionnaire survey that was distributed in Taichung's Municipal Plaza in Taiwan. Results showed that most users responded with behavioral adaptation when experiencing thermal discomfort. However, if the thermal discomfort could not be alleviated, they then adopted psychological strategies. In conclusion, the evaluation procedure for assessing thermal adaptation strategies and the questionnaire developed in this study can be applied to future research on thermal adaptation strategies adopted by urban plaza users in hot and humid regions of Asia.

Thermal Adaptation Methods of Urban Plaza Users in Asia’s Hot-Humid Regions: A Taiwan Case Study

PubMed Central

Wu, Chen-Fa; Hsieh, Yen-Fen; Ou, Sheng-Jung

2015-01-01

Thermal adaptation studies provide researchers great insight to help understand how people respond to thermal discomfort. This research aims to assess outdoor urban plaza conditions in hot and humid regions of Asia by conducting an evaluation of thermal adaptation. We also propose that questionnaire items are appropriate for determining thermal adaptation strategies adopted by urban plaza users. A literature review was conducted and first hand data collected by field observations and interviews used to collect information on thermal adaptation strategies. Item analysis—Exploratory Factor Analysis (EFA) and Confirmatory Factor Analysis (CFA)—were applied to refine the questionnaire items and determine the reliability of the questionnaire evaluation procedure. The reliability and validity of items and constructing process were also analyzed. Then, researchers facilitated an evaluation procedure for assessing the thermal adaptation strategies of urban plaza users in hot and humid regions of Asia and formulated a questionnaire survey that was distributed in Taichung’s Municipal Plaza in Taiwan. Results showed that most users responded with behavioral adaptation when experiencing thermal discomfort. However, if the thermal discomfort could not be alleviated, they then adopted psychological strategies. In conclusion, the evaluation procedure for assessing thermal adaptation strategies and the questionnaire developed in this study can be applied to future research on thermal adaptation strategies adopted by urban plaza users in hot and humid regions of Asia. PMID:26516881
Looking a gift horse in the mouth: Evaluation of wide-field asteroid photometric surveys

NASA Astrophysics Data System (ADS)

Harris, Alan W.; Pravec, Petr; Warner, Brian D.

2012-09-01

It has recently become possible to do a photometric survey of many asteroids at once, rather than observing single asteroids one (or occasionally a couple) at a time. We evaluate two such surveys. Dermawan et al. (Dermawan et al. [2011]. Publ. Astron. Soc. Jpn. 63, S555-S576) observed one night on the Subaru 8.2 m telescope, and Masiero et al. (Masiero, J., Jedicke, R., Durech, J., Gwen, S., Denneau, L., Larsen, J. [2009]. Icarus 204, 145-171) observed six nights over 2 weeks with the 3.6 m CFHT. Dermawan claimed 83 rotation periods from 127 detected asteroids; Masiero et al. claimed 218 rotation periods from 828 detections. Both teams claim a number of super-fast rotators (P < 2.2 h) among main belt asteroids larger than 250 m diameter, some up to several km in diameter. This would imply that the spin rate distribution of main belt asteroids differs from like-sized NEAs, that there are larger super-fast rotators (monolithic asteroids) in the main belt than among NEAs. Here we evaluate these survey results, applying the same criteria for reliability of results that we apply to all results listed in our Lightcurve Database (Warner, B.D., Harris, A.W., Pravec, P. [2009a]. Icarus 202, 134-146). In doing so, we assigned reliability estimates judged sufficient for inclusion in statistical studies for only 27 out of 83 (33%) periods claimed by Dermawan, and only 87 out of 218 (40%) periods reported by Masiero et al.; none of the super-fast rotators larger than about 250 m diameter claimed by either survey received a reliability rating judged sufficient for analysis. We find no reliable basis for the claim of different rotation properties between main belt and near-Earth asteroids. Our analysis presents a cautionary message for future surveys.
A Reliability Model for Ni-BaTiO3-Based (BME) Ceramic Capacitors

NASA Technical Reports Server (NTRS)

Liu, Donhang

2014-01-01

The evaluation of multilayer ceramic capacitors (MLCCs) with base-metal electrodes (BMEs) for potential NASA space project applications requires an in-depth understanding of their reliability. The reliability of an MLCC is defined as the ability of the dielectric material to retain its insulating properties under stated environmental and operational conditions for a specified period of time t. In this presentation, a general mathematical expression of a reliability model for a BME MLCC is developed and discussed. The reliability model consists of three parts: (1) a statistical distribution that describes the individual variation of properties in a test group of samples (Weibull, log normal, normal, etc.), (2) an acceleration function that describes how a capacitor's reliability responds to external stresses such as applied voltage and temperature (all units in the test group should follow the same acceleration function if they share the same failure mode, independent of individual units), and (3) the effect and contribution of the structural and constructional characteristics of a multilayer capacitor device, such as the number of dielectric layers N, dielectric thickness d, average grain size r, and capacitor chip size S. In general, a two-parameter Weibull statistical distribution model is used in the description of a BME capacitor's reliability as a function of time. The acceleration function that relates a capacitor's reliability to external stresses is dependent on the failure mode. Two failure modes have been identified in BME MLCCs: catastrophic and slow degradation. A catastrophic failure is characterized by a time-accelerating increase in leakage current that is mainly due to existing processing defects such as voids, cracks, and delamination (extrinsic defects). A slow degradation failure is characterized by a near-linear increase in leakage current against the stress time; this is caused by the electromigration of oxygen vacancies (intrinsic defects).
The two identified failure modes follow different acceleration functions. Catastrophic failures follow the traditional power-law relationship to the applied voltage. Slow degradation failures fit well to an exponential law relationship to the applied electrical field. Finally, the impact of capacitor structure on the reliability of BME capacitors is discussed with respect to the number of dielectric layers in an MLCC unit, the number of BaTiO3 grains per dielectric layer, and the chip size of the capacitor device.
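The two building blocks named in this abstract, a two-parameter Weibull reliability function and a voltage or field acceleration function, can be sketched as follows. This is a minimal sketch using the textbook forms of each law; the parameter names (eta, beta, n, b) and the example values are assumptions and are not taken from the presentation itself.

```python
import math

def weibull_reliability(t, eta, beta):
    """Two-parameter Weibull reliability R(t) = exp(-(t/eta)**beta).

    eta is the scale parameter (characteristic life), beta the shape parameter.
    """
    return math.exp(-((t / eta) ** beta))

def power_law_acceleration(v_stress, v_use, n):
    """Textbook power-law acceleration factor in applied voltage
    (the form the abstract associates with catastrophic failures)."""
    return (v_stress / v_use) ** n

def exponential_acceleration(e_stress, e_use, b):
    """Textbook exponential-law acceleration factor in applied field
    (the form the abstract associates with slow degradation);
    b is a hypothetical fitted coefficient."""
    return math.exp(b * (e_stress - e_use))

# Invented example: scale a characteristic life measured under 2x rated
# voltage back to use conditions, then evaluate reliability at t = 1000 h.
af = power_law_acceleration(v_stress=2.0, v_use=1.0, n=3.0)
eta_use = 500.0 * af  # hypothetical eta of 500 h under stress
r_1000 = weibull_reliability(1000.0, eta_use, beta=2.0)
```

The structural terms the abstract mentions (number of dielectric layers, grain size, chip size) are deliberately left out, since the abstract does not give their functional form.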
An Evaluation of a Computer-Based Training on the Visual Analysis of Single-Subject Data

ERIC Educational Resources Information Center

Snyder, Katie

2013-01-01

Visual analysis is the primary method of analyzing data in single-subject methodology, which is the predominant research method used in the fields of applied behavior analysis and special education. Previous research on the reliability of visual analysis suggests that judges often disagree about what constitutes an intervention effect. Considering…
Appraising the reliability of visual impact assessment methods

Treesearch

Nickolaus R. Feimer; Kenneth H. Craik; Richard C. Smardon; Stephen R.J. Sheppard

1979-01-01

This paper presents the research approach and selected results of an empirical investigation aimed at the evaluation of selected observer-based visual impact assessment (VIA) methods. The VIA methods under examination were chosen to cover a range of VIA methods currently in use in both applied and research settings. Variation in three facets of VIA methods were...
Unreliable Yet Still Replicable: A Comment on LeBel and Paunonen (2011)

PubMed Central

De Schryver, Maarten; Hughes, Sean; Rosseel, Yves; De Houwer, Jan

2016-01-01

Lebel and Paunonen (2011) highlight that despite their importance and popularity in both theoretical and applied research, many implicit measures continue to be plagued by a persistent and troublesome issue—low reliability. In their paper, they offer a conceptual analysis of the relationship between reliability, power and replicability, and then provide a series of recommendations for researchers interested in using implicit measures in an experimental setting. At the core of their account is the idea that reliability can be equated with statistical power, such that “lower levels of reliability are associated with decreasing probabilities of detecting a statistically significant effect, given one exists in the population” (p. 573). They also take the additional step of equating reliability and replicability. In our commentary, we draw attention to the fact that there is no direct, fixed or one-to-one relation between reliability and power or replicability. More specifically, we argue that when adopting an experimental (rather than a correlational) approach, researchers strive to minimize inter-individual variation, which has a direct impact on sample based reliability estimates. We evaluate the strengths and weaknesses of the LeBel and Paunonen's recommendations and refine them where appropriate. PMID:26793150
Test-retest reliability of an infectious disease questionnaire and evaluation of self-assessed vulnerability to infections: findings of Pretest 2 of the German National Cohort.

PubMed

Castell, S; Akmatov, M K; Obi, N; Flesh-Janys, D; Nieters, A; Kemmling, Y; Pessler, F; Krause, G

2014-11-01

Large-scale population-based studies focusing on infectious diseases are scarce. This may be explained by methodological obstacles concerning ascertainment of data on infectious diseases requiring, e.g., collection of data on relatively short-term symptoms and/or collection of biosamples for pathogen identification during a narrow time window. In the German National Cohort (GNC), a novel self-administered questionnaire will be used in addition to biosampling to collect data on selected infectious diseases and symptoms. The aim of this study was to evaluate, in Pretest 2 of the GNC, newly added items on self-assessed vulnerability to several infectious diseases and to assess test-retest reliability of the questionnaire. The study was conducted in two study centres (Hamburg and Hanover) during Pretest 2 of the GNC. A self-administered paper questionnaire was applied. In Hamburg, participants were asked to fill in the questionnaire during their regular visit at the study centre. For test-retest reliability, participants in Hanover filled in the same questionnaire at home twice. To evaluate agreement, item-related percentage agreement and kappa (κ) were calculated. In addition, we computed Bennett's S and Krippendorff's alpha (α). Items on self-assessed vulnerability to infections were evaluated by comparing them with the corresponding self-reported frequency of infections. An explanatory factor analysis was applied to construct the scores of self-reported infection frequency and self-assessed vulnerability to infections. The evaluation of the internal consistency of the five-item instrument of self-assessed vulnerability to infections resulted in a Cronbach's α of 0.78. The factor analysis yielded evidence of one factor.
The factor was divided into three groups (lowest quintile classified as "less prone to infections" compared to peers; second, middle and fourth quintiles classified as "similarly prone to infections" and highest quintile classified as "more prone to infections"). Participants classified as "less prone to infections" reported fewer infections than participants classified as "more prone to infections". Spearman's correlation of the two scores (self-reported infection frequency and self-assessed vulnerability to infection) was 0.50 (p < 0.0001). For quantifying reliability, 88 participants with a median time of 8 days between filling in both questionnaires could be included in the analysis; for items sensitive to disease occurrence between both questionnaires only participants with no relevant disease in this time interval were included (n = 75). The weighted κ ranged between 0.65 and 0.87 for the items on infectious disease frequency in the last 12 months, for items on symptom frequency in the past 12 months between 0.77 and 0.90, and for items on vulnerability compared to peers between 0.68 and 0.76. A five-item instrument on self-assessed vulnerability to infections seems to be promising, but requires further evaluation. Overall, the questionnaire on self-reported infectious diseases used in Pretest 2 of the GNC is a moderately reliable instrument and, thus, can be applied in future studies on infectious diseases.
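The item-level agreement statistics mentioned in this abstract can be illustrated with a small sketch. Note that it computes percentage agreement and the unweighted Cohen's kappa, not the weighted kappa the study reports, and the response data are invented for the example.

```python
from collections import Counter

def percent_agreement(first, second):
    """Share of paired responses that are identical at test and retest."""
    if len(first) != len(second) or not first:
        raise ValueError("need two non-empty response lists of equal length")
    return sum(a == b for a, b in zip(first, second)) / len(first)

def cohens_kappa(first, second):
    """Unweighted Cohen's kappa: (p_o - p_e) / (1 - p_e)."""
    n = len(first)
    p_o = percent_agreement(first, second)
    counts_1, counts_2 = Counter(first), Counter(second)
    # Chance agreement from the marginal category frequencies.
    p_e = sum(counts_1[c] * counts_2[c]
              for c in set(first) | set(second)) / (n * n)
    if p_e == 1.0:
        raise ValueError("kappa is undefined when chance agreement is 1")
    return (p_o - p_e) / (1.0 - p_e)

# Invented answers of four participants to one yes/no item (1 = yes).
test_answers = [0, 0, 1, 1]
retest_answers = [0, 0, 1, 0]
agreement = percent_agreement(test_answers, retest_answers)
kappa = cohens_kappa(test_answers, retest_answers)
```

For ordinal items such as frequency categories, a weighted kappa gives partial credit for near misses, which is presumably why the study reports that variant.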
Validation and reliability of the Turkish Utian Quality-of-Life Scale in postmenopausal women.

PubMed

Abay, Halime; Kaplan, Sena

2016-04-01

There are a limited number of menopause-specific quality-of-life scales for the Turkish population. This study was conducted to evaluate the validity and reliability of the Turkish Utian Quality-of-Life Scale in postmenopausal women. The study group comprised 250 postmenopausal women who presented to the menopause clinic of a training and research hospital in Turkey. A survey form and the Turkish Utian Quality-of-Life Scale were used to collect data, and the Turkish version of Short Form-36 was used to evaluate reliability with an equivalent form. Language-validity, content-validity, and construct-validity methods were used to assess the validity of the scale, and Cronbach's α coefficient calculation and the equivalent-form reliability methods were used to assess the reliability of the scale. The Turkish Utian Quality-of-Life Scale was determined to be a valid and reliable instrument for measuring the quality of life of postmenopausal women. Confirmatory factor analysis demonstrated that the instrument fits well with 23 items and a four-factor model. The Cronbach's α coefficients for the quality-of-life domains were as follows: 0.88 overall, 0.79 health, 0.78 emotional, 0.76 sexual, and 0.75 occupational. Reliability of the instrument was confirmed through significant correlations between scores on the Turkish version of the Utian Quality-of-Life Scale and the Turkish version of the Short Form-36 (r = 0.745, P < 0.001). This research emphasizes that the Turkish Utian Quality-of-Life Scale is reliable and valid in postmenopausal women; it is a useful instrument for measuring quality of life during menopause.
From plastic to gold: a unified classification scheme for reference standards in medical image processing

NASA Astrophysics Data System (ADS)

Lehmann, Thomas M.

2002-05-01

Reliable evaluation of medical image processing is of major importance for routine applications. Nonetheless, evaluation is often omitted or methodically defective when novel approaches or algorithms are introduced. Adapting concepts from medical diagnosis, we define the following criteria to classify reference standards: 1. Reliance, if the generation or capturing of test images for evaluation follows an exactly determined and reproducible protocol. 2. Equivalence, if the image material or relationships considered within an algorithmic reference standard equal real-life data with respect to structure, noise, or other parameters of importance. 3. Independence, if any reference standard relies on a different procedure than that to be evaluated, or on other images or image modalities than those used routinely. This criterion bans the simultaneous use of one image for both the training and test phases. 4. Relevance, if the algorithm to be evaluated is self-reproducible. If random parameters or optimization strategies are applied, reliability of the algorithm must be shown before the reference standard is applied for evaluation. 5. Significance, if the number of reference standard images that are used for evaluation is sufficiently large to enable statistically founded analysis. We demand that a true gold standard must satisfy Criteria 1 to 3. Any standard satisfying only two criteria, i.e., Criterion 1 and Criterion 2 or Criterion 1 and Criterion 3, is referred to as a silver standard. Other standards are termed plastic standards. Before exhaustive evaluation based on gold or silver standards is performed, its relevance must be shown (Criterion 4) and sufficient tests must be carried out to support statistical analysis (Criterion 5). In this paper, examples are given for each class of reference standards.
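The gold/silver/plastic rule stated in this abstract is simple enough to write down directly. The sketch below encodes only Criteria 1 to 3, which determine the class; the function and argument names are illustrative and do not come from the paper.

```python
def classify_reference_standard(reliance, equivalence, independence):
    """Classify a reference standard from Criteria 1-3 of the abstract.

    gold:    Criteria 1, 2 and 3 all hold
    silver:  Criterion 1 plus exactly one of Criteria 2 and 3
    plastic: anything else
    """
    if reliance and equivalence and independence:
        return "gold"
    if reliance and (equivalence or independence):
        return "silver"
    return "plastic"

# A reproducible protocol with realistic data, but test images that were
# also used for training (Criterion 3 violated), rates as silver.
label = classify_reference_standard(reliance=True, equivalence=True,
                                    independence=False)
```

Criteria 4 (relevance) and 5 (significance) are preconditions for running the evaluation rather than part of the class label, so they are not inputs here.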
[Psychometric properties of the third version of family adaptability and cohesion evaluation scales (FACES-III): a study of Peruvian adolescents].

PubMed

Bazo-Alvarez, Juan Carlos; Bazo-Alvarez, Oscar Alfredo; Aguila, Jeins; Peralta, Frank; Mormontoy, Wilfredo; Bennett, Ian M

2016-01-01

Our aim was to evaluate the psychometric properties of the FACES-III among Peruvian high school students. This is a psychometric cross-sectional study. A probabilistic sampling was applied, defined by three stages: stratum one (school), stratum two (grade) and cluster (section). The participants were 910 adolescent students of both sexes, between 11 and 18 years of age. The instrument was also the object of study: the Olson's FACES-III. The analysis included a review of the structure / construct validity of the measure by factor analysis and assessment of internal consistency (reliability). The real-cohesion scale had moderately high reliability (Ω=.85) while the real-flexibility scale had moderate reliability (Ω=.74). The reliability found for the ideal-cohesion was moderately high (Ω=.89) like for the scale of ideal-flexibility (Ω=.86). Construct validity was confirmed by the goodness of fit of a two factor model (cohesion and flexibility) with 10 items each [Adjusted goodness of fit index (AGFI) = 0.96; Expected Cross Validation Index (ECVI) = 0.87; Normed fit index (NFI) = 0.93; Goodness of fit index (GFI) = 0.97; Root mean square error of approximation (RMSEA) = 0.06]. FACES-III has sufficient reliability and validity to be used in Peruvian adolescents for the purpose of group or individual assessment.
Evaluating the test-retest reliability of symptom indices associated with the ImPACT post-concussion symptom scale (PCSS).

PubMed

Merritt, Victoria C; Bradson, Megan L; Meyer, Jessica E; Arnett, Peter A

2018-05-01

The Immediate Post-Concussion Assessment and Cognitive Testing (ImPACT) is a commonly used tool in sports concussion assessment. While test-retest reliabilities have been established for the ImPACT cognitive composites, few studies have evaluated the psychometric properties of the ImPACT's Post-Concussion Symptom Scale (PCSS). The purpose of this study was to establish the test-retest reliability of symptom indices associated with the PCSS. Participants included 38 undergraduate students (50.0% male) who underwent neuropsychological testing as part of their participation in their psychology department's research subject pool. The majority of the participants were Caucasian (94.7%) and had no history of concussion (73.7%). All participants completed the ImPACT at two time points, approximately 6 weeks apart. The PCSS was the main outcome measure, and eight symptom indices were calculated (a total symptom score, three symptom summary indices, and four symptom clusters). Pearson correlations (r) and intraclass correlation coefficients (ICCs) were computed as measures of test-retest reliability. Overall, reliabilities ranged from low to high (r = .44 to .80; ICC = .44 to .77). The cognitive symptom cluster exhibited the highest test-retest reliability (r = .80, ICC = .77), followed by the positive symptom total (PST) index, an indicator of the total number of symptoms endorsed (r = .71, ICC = .69). In contrast, the commonly used total symptom score showed lower test-retest reliability (r = .67, ICC = .62). Paired-samples t tests revealed no significant differences between test and retest for any of the symptom variables (all p > .01). Finally, reliable change indices (RCI) were computed to determine whether differences observed between test and retest represented clinically significant change. RCI values were provided for each symptom index at the 80%, 90%, and 95% confidence intervals. 
These results suggest that evaluating additional symptom indices beyond the total symptom score from the PCSS is beneficial. Findings from this study can be applied to athlete samples to assess reliable change in symptoms following concussion.
Validity and reliability of an instrument for assessing case analyses in bioengineering ethics education.

PubMed

Goldin, Ilya M; Pinkus, Rosa Lynn; Ashley, Kevin

2015-06-01

Assessment in ethics education faces a challenge. From the perspectives of teachers, students, and third-party evaluators like the Accreditation Board for Engineering and Technology and the National Institutes of Health, assessment of student performance is essential. Because of the complexity of ethical case analysis, however, it is difficult to formulate assessment criteria, and to recognize when students fulfill them. Improvement in students' moral reasoning skills can serve as the focus of assessment. In previous work, Rosa Lynn Pinkus and Claire Gloeckner developed a novel instrument for assessing moral reasoning skills in bioengineering ethics. In this paper, we compare that approach to existing assessment techniques, and evaluate its validity and reliability. We find that it is sensitive to knowledge gain and that independent coders agree on how to apply it.
Evaluation of changes in pelvic belt tension during 2 weight-bearing functional tasks.

PubMed

Arumugam, Ashokan; Milosavljevic, Stephan; Woodley, Stephanie; Sole, Gisela

2012-06-01

The purposes of this study were to evaluate changes in pelvic belt tension during 2 weight-bearing functional tasks (transition from bipedal to unipedal stance [BUS] and walking) and to evaluate the reliability and the percentage variation for belt tension scores from trial to trial. A cross-sectional repeated-measures study was conducted with 10 healthy male participants (mean age, 28.3 ± 8.8 years). Participants performed 10 trials of BUS and walking while wearing a nonelastic pelvic compression belt (PCB) applied distal to the anterior superior iliac spines, with a load cell positioned in the center of the belt. The load cell was calibrated using known weights (1-10 kg) to define the relationship between the applied tension and voltage change (R² = 0.99). Load cell tension values were recorded in voltage signals and then converted to newtons of force using appropriate conversion values (0.012 V = 10 N). Mean and standard deviation values, intraclass correlation coefficients (ICC 3,1), and percentage standard error of measurements (% SEM) were analyzed for PCB tension recorded during the BUS and walking trials. The mean tension achieved with a PCB was found to be 41.02 (±4.23) N during BUS and 44.07 (±5.80) N during walking. The trial-to-trial reliability (ICC 3,1) was high (ICC ≥0.9), and the variation in PCB tension across 10 trials (% SEM) was 4% or less. The mean tension achieved during the tasks was 44 N or less. The reliability is high, and the variation is low across the trials, which implies that a PCB could be used to produce consistent effects during repetition of the tasks (BUS and walking). Copyright © 2012 National University of Health Sciences. Published by Mosby, Inc. All rights reserved.
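The voltage-to-force conversion quoted in this abstract (0.012 V = 10 N) and the % SEM statistic can be sketched as below. The conversion assumes a linear calibration through the origin, and the % SEM uses the common formula SEM = SD × sqrt(1 − ICC); neither detail is confirmed by the abstract, and the function names are hypothetical.

```python
import math

VOLTS_PER_TEN_NEWTONS = 0.012  # conversion value quoted in the abstract

def voltage_to_newtons(voltage_change):
    """Convert a load-cell voltage change to force in newtons,
    assuming a linear calibration that passes through zero."""
    return voltage_change / VOLTS_PER_TEN_NEWTONS * 10.0

def percent_sem(sd, icc, mean):
    """Standard error of measurement as a percentage of the mean,
    using the common (assumed) formula SEM = SD * sqrt(1 - ICC)."""
    return sd * math.sqrt(1.0 - icc) / mean * 100.0

force = voltage_to_newtons(0.048)
# Reported BUS mean and SD, with ICC taken at the reported lower bound 0.9.
bus_percent_sem = percent_sem(sd=4.23, icc=0.9, mean=41.02)
```

With the reported BUS values this assumed formula gives a % SEM of roughly 3.3%, which is in line with the "4% or less" stated in the abstract.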
The role and reliability of the Psychopathy Checklist-Revised in U.S. sexually violent predator evaluations: a case law survey.

PubMed

DeMatteo, David; Edens, John F; Galloway, Meghann; Cox, Jennifer; Smith, Shannon Toney; Formon, Dana

2014-06-01

The civil commitment of offenders as sexually violent predators (SVPs) is a highly contentious area of U.S. mental health law. The Psychopathy Checklist-Revised (PCL-R) is frequently used in mental health evaluations in these cases to aid legal decision making. Although generally perceived to be a useful assessment tool in applied settings, recent research has raised questions about the reliability of PCL-R scores in SVP cases. In this report, we review the use of the PCL-R in SVP trials identified as part of a larger project investigating its role in U.S. case law. After presenting data on how the PCL-R is used in SVP cases, we examine the reliability of scores reported in these cases. We located 214 cases involving the PCL-R, 88 of which included an actual score and 29 of which included multiple scores. In the 29 cases with multiple scores, the intraclass correlation coefficient for a single evaluator for the PCL-R scores was only .58, and only 41.4% of the difference scores were within 1 standard error of measurement unit. The average score reported by prosecution experts was significantly higher than the average score reported by defense-retained experts, and prosecution experts reported PCL-R scores of 30 or above in nearly 50% of the cases, compared with less than 10% of the cases for defense witnesses (κ = .29). In conjunction with other recently published findings demonstrating the unreliability of PCL-R scores in applied settings, our results raise questions as to whether this instrument should be admitted into SVP proceedings.
Toronto Bariatric Interprofessional Psychosocial Assessment Suitability Scale: Evaluating A New Clinical Assessment Tool for Bariatric Surgery Candidates.

PubMed

Thiara, Gurneet; Yanofksy, Richard; Abdul-Kader, Sayed; Santiago, Vincent A; Cassin, Stephanie; Okrainec, Allan; Jackson, Timothy; Hawa, Raed; Sockalingam, Sanjeev

2016-01-01

Patients who are referred for possible bariatric surgery (BS) intervention undergo a series of assessments conducted by an interdisciplinary health care team to determine suitability for surgery. Herein, we report the initial validation and reliability studies of the Bariatric Interprofessional Psychosocial Assessment Suitability Scale (BIPASS) and its relationship to interdisciplinary psychosocial assessment practices for BS. This study was conducted at the Toronto Western Hospital, a Level 1A BS center of excellence accredited by the American College of Surgeons. Phase I: a total of 4 blinded raters applied the BIPASS to 31 randomly selected BS cases referred to our program to establish interrater reliability. Phase II: in all, 3 raters with clinical experience in bariatric psychosocial care applied the BIPASS to 54 randomly selected BS cases. In total, 46 of 54 (85.1%) patients were women. The median age of all patient cases was 49 years (range: 21-74). Raters' BIPASS scores ranged from 4-52 (median = 19.24, standard deviation = 10.38). BIPASS scores were highly predictive of the BS psychosocial outcome (area under curve = 0.915; 95% CI: 0.844-0.985; p < 0.001). A BIPASS score of ≥16 was chosen as the cutoff score for further clinical assessment before proceeding with surgical evaluation based on a receiver operating characteristic curve analysis (sensitivity = 0.839; specificity = 0.783). The instrument has very good interrater reliability (Pearson correlation coefficient = 0.847) even among novice raters. The findings show that the BIPASS is a comprehensive screening tool in the psychosocial assessment of BS candidates, which standardizes the evaluation process and systematically identifies patients at risk of negative outcomes after BS. Copyright © 2016 The Academy of Psychosomatic Medicine. Published by Elsevier Inc. All rights reserved.
The ABC’s of Suicide Risk Assessment: Applying a Tripartite Approach to Individual Evaluations

PubMed Central

Harris, Keith M.; Syu, Jia-Jia; Lello, Owen D.; Chew, Y. L. Eileen; Willcox, Christopher H.; Ho, Roger H. M.

2015-01-01

There is considerable need for accurate suicide risk assessment for clinical, screening, and research purposes. This study applied the tripartite affect-behavior-cognition theory, the suicidal barometer model, classical test theory, and item response theory (IRT), to develop a brief self-report measure of suicide risk that is theoretically-grounded, reliable and valid. An initial survey (n = 359) employed an iterative process to an item pool, resulting in the six-item Suicidal Affect-Behavior-Cognition Scale (SABCS). Three additional studies tested the SABCS and a highly endorsed comparison measure. Studies included two online surveys (Ns = 1007 and 713), and one prospective clinical survey (n = 72; Time 2, n = 54). Factor analyses demonstrated SABCS construct validity through unidimensionality. Internal reliability was high (α = .86-.93, split-half = .90-.94). The scale was predictive of future suicidal behaviors and suicidality (r = .68, .73, respectively), showed convergent validity, and the SABCS-4 demonstrated clinically relevant sensitivity to change. IRT analyses revealed the SABCS captured more information than the comparison measure, and better defined participants at low, moderate, and high risk. The SABCS is the first suicide risk measure to demonstrate no differential item functioning by sex, age, or ethnicity. In all comparisons, the SABCS showed incremental improvements over a highly endorsed scale through stronger predictive ability, reliability, and other properties. The SABCS is in the public domain, with this publication, and is suitable for clinical evaluations, public screening, and research. PMID:26030590
Earthquake Damage Assessment Using Very High Resolution Satellite Imagery

NASA Astrophysics Data System (ADS)

Chiroiu, L.; André, G.; Bahoken, F.; Guillande, R.

Various studies in recent years have used satellite imagery to assess damage from natural hazards, most of them analyzing floods, hurricanes or landslides. For earthquakes, the medium or low spatial resolution data available in the recent past did not allow reliable identification of damage, because the elements of interest (e.g. buildings or other structures) were too small compared with the pixel size. Recent progress in remote sensing, in terms of spatial resolution and data processing, makes reliable detection of damage to the elements at risk possible. Remote sensing techniques applied to IKONOS (1 meter resolution) and IRS (5 meter resolution) imagery were used to evaluate seismic vulnerability and post-earthquake damage. A fast estimation of losses was performed using a multidisciplinary approach based on earthquake engineering and geospatial analysis. The results, integrated into a GIS database, could be transferred via satellite networks to the rescue teams deployed in the affected zone, in order to better coordinate the emergency operations. The methodology was applied to the cities of Bhuj and Anjar after the 2001 Gujarat (India) earthquake.
Evaluation on Cost Overrun Risks of Long-distance Water Diversion Project Based on SPA-IAHP Method

NASA Astrophysics Data System (ADS)

Yuanyue, Yang; Huimin, Li

2018-02-01

Large investments, long routes, and numerous change orders are among the main causes of cost overruns in long-distance water diversion projects. Building on existing research, this paper constructs a full-process cost overrun risk evaluation index system for water diversion projects, applies the SPA-IAHP method to set up a cost overrun risk evaluation model, and calculates and ranks the weight of every risk evaluation index. Finally, the cost overrun risks are comprehensively evaluated by calculating the linkage measure, and the comprehensive risk level is obtained. The authors report that the SPA-IAHP method evaluates risks accurately and with high reliability. As verified by a case calculation, it can provide construction companies with valid decision-making information on cost overruns.
Clinical indicators for routine use in the evaluation of early psychosis intervention: development, training support and inter-rater reliability.

PubMed

Catts, Stanley V; Frost, Aaron D J; O'Toole, Brian I; Carr, Vaughan J; Lewin, Terry; Neil, Amanda L; Harris, Meredith G; Evans, Russell W; Crissman, Belinda R; Eadie, Kathy

2011-01-01

Clinical practice improvement carried out in a quality assurance framework relies on routinely collected data using clinical indicators. Herein we describe the development, minimum training requirements, and inter-rater agreement of indicators that were used in an Australian multi-site evaluation of the effectiveness of early psychosis (EP) teams. Surveys of clinician opinion and face-to-face consensus-building meetings were used to select and conceptually define indicators. Operationalization of definitions was achieved by iterative refinement until clinicians could be quickly trained to code indicators reliably. Calculation of percentage agreement with expert consensus coding was based on ratings of paper-based clinical vignettes embedded in a 2-h clinician training package. Consensually agreed upon conceptual definitions for seven clinical indicators judged most relevant to evaluating EP teams were operationalized for ease-of-training. Brief training enabled typical clinicians to code indicators with acceptable percentage agreement (60% to 86%). For indicators of suicide risk, psychosocial function, and family functioning this level of agreement was only possible with less precise 'broad range' expert consensus scores. Estimated kappa values indicated fair to good inter-rater reliability (kappa > 0.65). Inspection of contingency tables (coding category by health service) and modal scores across services suggested consistent, unbiased coding across services. Clinicians are able to agree upon what information is essential to routinely evaluate clinical practice. Simple indicators of this information can be designed and coding rules can be reliably applied to written vignettes after brief training. The real world feasibility of the indicators remains to be tested in field trials.
Reproducibility of manual pressure force on provocation of the sacroiliac joint.

PubMed

Levin, U; Nilsson-Wikmar, L; Stenström, C H; Lundeberg, T

1998-01-01

Previous studies of pain-provocation sacroiliac (SI) joint tests have revealed conflicting results. The aim of the present study was to evaluate the intra- and inter-test reliability of pressure force applied during distraction test, compression test and pressure on the apex sacralis. Seventeen physiotherapists (PTs), median age 43 years and median clinical experience 11 years, all experienced in musculoskeletal evaluation and therapy, participated in the study. Each PT performed each test on the same healthy volunteer for 20 s, on three separate occasions, at intervals of one week using a specially constructed examination table which registered pressure force. The PTs were capable of maintaining a relatively constant pressure force for 20 s. The intra-test reliability was acceptable even though there were individual differences on different occasions between those PTs who used the SI joint tests often and those who seldom or never used them. The inter-test reliability was insufficient. The findings indicate the advantage of registering pressure force as a complement for standardized methods for pain-provoking tests and when learning provocation tests, since individual variability was considerable.

Reliability of Pressure Ulcer Rates: How Precisely Can We Differentiate Among Hospital Units, and Does the Standard Signal-Noise Reliability Measure Reflect This Precision?

PubMed

Staggs, Vincent S; Cramer, Emily

2016-08-01

Hospital performance reports often include rankings of unit pressure ulcer rates. Differentiating among units on the basis of quality requires reliable measurement. Our objectives were to describe and apply methods for assessing reliability of hospital-acquired pressure ulcer rates and evaluate a standard signal-noise reliability measure as an indicator of precision of differentiation among units. Quarterly pressure ulcer data from 8,199 critical care, step-down, medical, surgical, and medical-surgical nursing units from 1,299 US hospitals were analyzed. Using beta-binomial models, we estimated between-unit variability (signal) and within-unit variability (noise) in annual unit pressure ulcer rates. Signal-noise reliability was computed as the ratio of between-unit variability to the total of between- and within-unit variability. To assess precision of differentiation among units based on ranked pressure ulcer rates, we simulated data to estimate the probabilities of a unit's observed pressure ulcer rate rank in a given sample falling within five and ten percentiles of its true rank, and the probabilities of units with ulcer rates in the highest quartile and highest decile being identified as such. We assessed the signal-noise measure as an indicator of differentiation precision by computing its correlations with these probabilities. Pressure ulcer rates based on a single year of quarterly or weekly prevalence surveys were too susceptible to noise to allow for precise differentiation among units, and signal-noise reliability was a poor indicator of precision of differentiation. To ensure precise differentiation on the basis of true differences, alternative methods of assessing reliability should be applied to measures purported to differentiate among providers or units based on quality. © 2016 The Authors. Research in Nursing & Health published by Wiley Periodicals, Inc.
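The signal-noise ratio described in this abstract can be sketched in a few lines. This is only an illustration of the stated ratio (between-unit variability over the total of between- and within-unit variability). The function name and the variance values are hypothetical, and the beta-binomial estimation used by the authors is not reproduced here.

```python
def signal_noise_reliability(between_var: float, within_var: float) -> float:
    """Return between-unit variance divided by total variance.

    Both inputs are assumed to be variance components that were
    already estimated elsewhere (hypothetical values in this sketch).
    """
    if between_var < 0 or within_var < 0:
        raise ValueError("variance components must be non-negative")
    total = between_var + within_var
    if total == 0:
        raise ValueError("total variance must be positive")
    return between_var / total


# Hypothetical components: noise is three times the signal.
reliability = signal_noise_reliability(between_var=1.0, within_var=3.0)
print(reliability)
```

A value near 1 means most observed variation reflects true differences among units. The abstract's point is that even this ratio may say little about how precisely units can be ranked.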
Structural design of high-performance capacitive accelerometers using parametric optimization with uncertainties

NASA Astrophysics Data System (ADS)

Teves, André da Costa; Lima, Cícero Ribeiro de; Passaro, Angelo; Silva, Emílio Carlos Nelli

2017-03-01

Electrostatic or capacitive accelerometers are among the highest volume microelectromechanical systems (MEMS) products nowadays. The design of such devices is a complex task, since they depend on many performance requirements, which are often conflicting. Therefore, optimization techniques are often used in the design stage of these MEMS devices. Because of problems with reliability, the technology of MEMS is not yet well established. Thus, in this work, size optimization is combined with the reliability-based design optimization (RBDO) method to improve the performance of accelerometers. To account for uncertainties in the dimensions and material properties of these devices, the first order reliability method is applied to calculate the probabilities involved in the RBDO formulation. Practical examples of bulk-type capacitive accelerometer designs are presented and discussed to evaluate the potential of the implemented RBDO solver.
Reliability and construct validity of the Instrument to Measure the Impact of Valve Heart Disease on the Patient's Daily Life

PubMed Central

dos Anjos, Daniela Brianne Martins; Rodrigues, Roberta Cunha Matheus; Padilha, Kátia Melissa; Pedrosa, Rafaela Batista dos Santos; Gallani, Maria Cecília Bueno Jayme

2016-01-01

ABSTRACT Objective: to evaluate the practicality, acceptability, and floor and ceiling effects, estimate the reliability, and verify the convergent construct validity of the Instrument to Measure the Impact of Valve Heart Disease on the Patient's Daily Life (IDCV) in patients with mitral and/or aortic heart valve disease. Method: data were obtained from 86 patients with heart valve disease in three phases: a face-to-face interview for sociodemographic and clinical characterization, followed by two telephone calls to the interviewed patients to apply the instrument (test and retest). Results: regarding practicality and acceptability, the instrument was applied in an average time of 9.9 minutes and with 100% of responses, respectively. Ceiling and floor effects were observed for all domains, especially the floor effect. Reliability was tested using a test-retest design to give evidence of temporal stability of the measurement. Significant negative correlations of moderate to strong magnitude were found between the score of the generic question about the impact of the disease and the IDCV scores, which points to the convergent construct validity of the instrument. Conclusion: the instrument to measure the impact of valve heart disease on the patient's daily life showed evidence of reliability and validity when applied to patients with heart valve disease. PMID:27992024
Stress and Reliability Analysis of a Metal-Ceramic Dental Crown

NASA Technical Reports Server (NTRS)

Anusavice, Kenneth J; Sokolowski, Todd M.; Hojjatie, Barry; Nemeth, Noel N.

1996-01-01

Interaction of mechanical and thermal stresses with the flaws and microcracks within the ceramic region of metal-ceramic dental crowns can result in catastrophic or delayed failure of these restorations. The objective of this study was to determine the combined influence of induced functional stresses and pre-existing flaws and microcracks on the time-dependent probability of failure of a metal-ceramic molar crown. A three-dimensional finite element model of a porcelain fused-to-metal (PFM) molar crown was developed using the ANSYS finite element program. The crown consisted of a body porcelain, opaque porcelain, and a metal substrate. The model had a 300 Newton load applied perpendicular to one cusp, a load of 300 N applied at 30 degrees from the perpendicular load case, directed toward the center, and a 600 Newton vertical load. Ceramic specimens were subjected to a biaxial flexure test and the load-to-failure of each specimen was measured. The results of the finite element stress analysis and the flexure tests were incorporated in the NASA developed CARES/LIFE program to determine the Weibull and fatigue parameters and time-dependent fracture reliability of the PFM crown. CARES/LIFE calculates the time-dependent reliability of monolithic ceramic components subjected to thermomechanical and/or proof test loading. This program is an extension of the CARES (Ceramics Analysis and Reliability Evaluation of Structures) computer program.
A Performance Evaluation of NACK-Oriented Protocols as the Foundation of Reliable Delay-Tolerant Networking Convergence Layers

NASA Technical Reports Server (NTRS)

Iannicca, Dennis; Hylton, Alan; Ishac, Joseph

2012-01-01

Delay-Tolerant Networking (DTN) is an active area of research in the space communications community. DTN uses a standard layered approach with the Bundle Protocol operating on top of transport layer protocols known as convergence layers that actually transmit the data between nodes. Several different common transport layer protocols have been implemented as convergence layers in DTN implementations including User Datagram Protocol (UDP), Transmission Control Protocol (TCP), and Licklider Transmission Protocol (LTP). The purpose of this paper is to evaluate several stand-alone implementations of negative-acknowledgment based transport layer protocols to determine how they perform in a variety of different link conditions. The transport protocols chosen for this evaluation include Consultative Committee for Space Data Systems (CCSDS) File Delivery Protocol (CFDP), Licklider Transmission Protocol (LTP), NACK-Oriented Reliable Multicast (NORM), and Saratoga. The test parameters that the protocols were subjected to are characteristic of common communications links ranging from terrestrial to cis-lunar and apply different levels of delay, line rate, and error.
Radiation-Tolerance Assessment of a Redundant Wireless Device

NASA Astrophysics Data System (ADS)

Huang, Q.; Jiang, J.

2018-01-01

This paper presents a method to evaluate radiation tolerance without physical tests for a commercial off-the-shelf (COTS)-based monitoring device for high-level radiation fields, such as those found in post-accident conditions in a nuclear power plant (NPP). This paper specifically describes the analysis of the radiation environment in a severe accident, radiation damage in electronics, and the redundant solution used to prolong the life of the system, as well as the evaluation method for radiation protection and the analysis method of system reliability. As a case study, a wireless monitoring device with redundant and diversified channels is evaluated by using the developed method. The study results and system assessment data show that, under the given radiation condition, the redundant device performs more reliably and more robustly than non-redundant devices. The developed redundant wireless monitoring device can therefore be applied in such conditions (up to 10 Mrad (Si)) during a severe accident in an NPP.
The Development and Validation of a Rapid Assessment Tool of Primary Care in China

PubMed Central

Mei, Jie; Liang, Yuan; Shi, LeiYu; Zhao, JingGe; Wang, YuTan; Kuang, Li

2016-01-01

Introduction. With Chinese health care reform increasingly emphasizing the importance of primary care, the need for a tool to evaluate primary care performance and service delivery is clear. This study presents a methodology for a rapid assessment of primary care organizations and service delivery in China. Methods. The study translated and adapted the Primary Care Assessment Tool-Adult Edition (PCAT-AE) into a Chinese version to measure core dimensions of primary care, namely, first contact, continuity, comprehensiveness, and coordination. A cross-sectional survey was conducted to assess the validity and reliability of the Chinese Rapid Primary Care Assessment Tool (CR-PCAT). Eight community health centers in Guangdong province were selected to participate in the survey. Results. A total of 1465 effective samples were included for data analysis. Eight items were eliminated following principal component analysis and reliability testing. The principal component analysis extracted five multiple-item scales (first contact utilization, first contact accessibility, ongoing care, comprehensiveness, and coordination). The scaling assumptions were largely met. Conclusion. The standard psychometric evaluation indicates that the scales have achieved relatively good reliability and validity. The CR-PCAT provides a rapid and reliable measure of four core dimensions of primary care, which could be applied in various scenarios. PMID:26885509
Applicability and Limitations of Reliability Allocation Methods

NASA Technical Reports Server (NTRS)

Cruz, Jose A.

2016-01-01

The reliability allocation process may be described as the process of assigning reliability requirements to individual components within a system to attain the specified system reliability. For large systems, the allocation process is often performed at different stages of system design. The allocation process often begins at the conceptual stage. As the system design develops and more information about components and the operating environment becomes available, different allocation methods can be considered. Reliability allocation methods are usually divided into two categories: weighting factors and optimal reliability allocation. When properly applied, these methods can produce reasonable approximations. Reliability allocation techniques have limitations and implied assumptions that need to be understood by system engineers. Applying reliability allocation techniques without understanding their limitations and assumptions can produce unrealistic results. This report addresses weighting factors and optimal reliability allocation techniques, and identifies the applicability and limitations of each reliability allocation technique.
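As a rough illustration of the weighting-factor category mentioned in this abstract, the sketch below allocates a system reliability target across components in series. It assumes independent components and uses the textbook rule R_i = R_sys^(w_i / sum of w), which guarantees that the product of the allocated values equals the system target. The function name and the weights are hypothetical and are not taken from the report.

```python
import math


def allocate_series(r_system: float, weights: list[float]) -> list[float]:
    """Allocate a series-system reliability target to its components.

    Assumes independent components in series. A larger weight gives a
    component a larger share of the allowed unreliability, so it
    receives a lower reliability target.
    """
    if not 0.0 < r_system <= 1.0:
        raise ValueError("system reliability must be in (0, 1]")
    if not weights or any(w < 0 for w in weights):
        raise ValueError("weights must be non-empty and non-negative")
    total = sum(weights)
    if total == 0:
        raise ValueError("at least one weight must be positive")
    return [r_system ** (w / total) for w in weights]


# Hypothetical example: the third component is judged twice as complex.
targets = allocate_series(0.95, [1.0, 1.0, 2.0])
print([round(t, 4) for t in targets])
print(math.isclose(math.prod(targets), 0.95))
```

With equal weights this reduces to equal apportionment, where every component receives the n-th root of the system target. As the abstract cautions, the result is only as realistic as the independence and series assumptions behind it.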
The reliability of the Glasgow Coma Scale: a systematic review.

PubMed

Reith, Florence C M; Van den Brande, Ruben; Synnot, Anneliese; Gruen, Russell; Maas, Andrew I R

2016-01-01

The Glasgow Coma Scale (GCS) provides a structured method for assessment of the level of consciousness. Its derived sum score is applied in research and adopted in intensive care unit scoring systems. Controversy exists on the reliability of the GCS. The aim of this systematic review was to summarize evidence on the reliability of the GCS. A literature search was undertaken in MEDLINE, EMBASE and CINAHL. Observational studies that assessed the reliability of the GCS, expressed by a statistical measure, were included. Methodological quality was evaluated with the consensus-based standards for the selection of health measurement instruments checklist and its influence on results considered. Reliability estimates were synthesized narratively. We identified 52 relevant studies that showed significant heterogeneity in the type of reliability estimates used, patients studied, setting and characteristics of observers. Methodological quality was good (n = 7), fair (n = 18) or poor (n = 27). In good quality studies, kappa values were ≥0.6 in 85%, and all intraclass correlation coefficients indicated excellent reliability. Poor quality studies showed lower reliability estimates. Reliability for the GCS components was higher than for the sum score. Factors that may influence reliability include education and training, the level of consciousness and type of stimuli used. Only 13% of studies were of good quality and inconsistency in reported reliability estimates was found. Although the reliability was adequate in good quality studies, further improvement is desirable. From a methodological perspective, the quality of reliability studies needs to be improved. From a clinical perspective, a renewed focus on training/education and standardization of assessment is required.
An Evaluation Method of Equipment Reliability Configuration Management

NASA Astrophysics Data System (ADS)

Wang, Wei; Feng, Weijia; Zhang, Wei; Li, Yuan

2018-01-01

At present, many equipment development companies are aware of the great significance of reliability in equipment development. However, without an effective method for evaluating management, it is very difficult for an equipment development company to manage its own reliability work. The evaluation method of equipment reliability configuration management determines the reliability management capabilities of an equipment development company. Reliability is achieved not only through design but also through management. This paper evaluates reliability management capabilities using the reliability configuration capability maturity model (RCM-CMM) evaluation method.
Portuguese version of the EUROPEP questionnaire: contributions to the psychometric validation

PubMed Central

Roque, Hugo; Veloso, Ana; Ferreira, Pedro L

2016-01-01

ABSTRACT OBJECTIVE To assess the construct validity and reliability of the Portuguese version of the European Task Force on Patient Evaluation of General Practice Care questionnaire. METHODS We applied the Portuguese version of the European Task Force on Patient Evaluation of General Practice Care to 392 users of 20 Family Health Units from the North of Portugal. The validity of the construct was evaluated by exploratory factor analysis, with the Principal Axis Factoring method, by orthogonal rotation (varimax procedure), by the Kaiser normalization criteria (eigenvalue ≥ 1). The factorability of the data matrix was verified by the Kaiser-Meyer-Olkin and Bartlett’s sphericity test. We estimated the reliability by the indicator of internal consistency Cronbach’s alpha. To analyze the correlations between satisfaction and loyalty, we used the Pearson correlations. The predictor effect of satisfaction on loyalty was analyzed by simple linear regression. RESULTS Satisfaction presented five robust and well individualized dimensions – medical care, nursing care, clinical secretariat services, accessibility, and organization of services – with alpha values between 0.86 and 0.97, good levels of internal consistency. The loyalty showed alpha value of 0.72, considered a reasonable internal consistency. The satisfaction was predictive of loyalty. CONCLUSIONS The Portuguese European Task Force on Patient Evaluation of General Practice Care questionnaire is a robust and reliable instrument to measure the satisfaction and loyalty of users of the Family Health Units. PMID:27706374
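For readers unfamiliar with the internal consistency indicator used in this abstract, the following is a minimal sketch of Cronbach's alpha computed with sample variances. The data layout (one row per respondent, one column per item) and the example scores are assumptions made for illustration. They are not data from the study.

```python
from statistics import variance


def cronbach_alpha(scores: list[list[float]]) -> float:
    """Cronbach's alpha for scores[respondent][item].

    alpha = k / (k - 1) * (1 - sum(item variances) / variance(total score))
    """
    if len(scores) < 2:
        raise ValueError("need at least two respondents")
    k = len(scores[0])
    if k < 2 or any(len(row) != k for row in scores):
        raise ValueError("need at least two items and equal-length rows")
    item_columns = list(zip(*scores))
    item_var_sum = sum(variance(col) for col in item_columns)
    total_var = variance([sum(row) for row in scores])
    if total_var == 0:
        raise ValueError("total scores have zero variance")
    return k / (k - 1) * (1 - item_var_sum / total_var)


# Hypothetical answers of four respondents to three items (1-5 scale).
answers = [[4, 5, 4], [2, 3, 2], [5, 5, 4], [1, 2, 2]]
print(round(cronbach_alpha(answers), 3))
```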
Bulk electric system reliability evaluation incorporating wind power and demand side management

NASA Astrophysics Data System (ADS)

Huang, Dange

Electric power systems are experiencing dramatic changes with respect to structure, operation and regulation and are facing increasing pressure due to environmental and societal constraints. Bulk electric system reliability is an important consideration in power system planning, design and operation, particularly in the new competitive environment. A wide range of methods have been developed to perform bulk electric system reliability evaluation. Theoretically, sequential Monte Carlo simulation can include all aspects and contingencies in a power system and can be used to produce an informative set of reliability indices. It has become a practical and viable technique for large system reliability assessment owing to the development of computing power and is used in the studies described in this thesis. The well-being approach used in this research provides the opportunity to integrate an accepted deterministic criterion into a probabilistic framework. This research work includes the investigation of important factors that impact bulk electric system adequacy evaluation and security constrained adequacy assessment using the well-being analysis framework. Load forecast uncertainty is an important consideration in an electrical power system. This research includes load forecast uncertainty considerations in bulk electric system reliability assessment and the effects on system, load point and well-being indices and reliability index probability distributions are examined. There has been increasing worldwide interest in the utilization of wind power as a renewable energy source over the last two decades due to enhanced public awareness of the environment. Increasing penetration of wind power has significant impacts on power system reliability, and security analyses become more uncertain due to the unpredictable nature of wind power.
The effects of wind power additions in generating and bulk electric system reliability assessment considering site wind speed correlations and the interactive effects of wind power and load forecast uncertainty on system reliability are examined. The concept of the security cost associated with operating in the marginal state in the well-being framework is incorporated in the economic analyses associated with system expansion planning including wind power and load forecast uncertainty. Overall reliability cost/worth analyses including security cost concepts are applied to select an optimal wind power injection strategy in a bulk electric system. The effects of the various demand side management measures on system reliability are illustrated using the system, load point, and well-being indices, and the reliability index probability distributions. The reliability effects of demand side management procedures in a bulk electric system including wind power and load forecast uncertainty considerations are also investigated. The system reliability effects due to specific demand side management programs are quantified and examined in terms of their reliability benefits.
How to assess driver's interaction with partially automated driving systems - A framework for early concept assessment.

PubMed

van den Beukel, Arie P; van der Voort, Mascha C

2017-03-01

The introduction of partially automated driving systems changes the driving task into supervising the automation with an occasional need to intervene. To develop interface solutions that adequately support drivers in this new role, this study proposes and evaluates an assessment framework that allows designers to evaluate driver-support within relevant real-world scenarios. Aspects identified as requiring assessment in terms of driver-support within the proposed framework are Accident Avoidance, gained Situation Awareness (SA) and Concept Acceptance. Measurement techniques selected to operationalise these aspects and the associated framework are pilot-tested with twenty-four participants in a driving simulator experiment. The objective of the test is to determine the reliability of the applied measurements for the assessment of the framework and whether the proposed framework is effective in predicting the level of support offered by the concepts. Based on the congruency between measurement scores produced in the test and scores with predefined differences in concept-support, this study demonstrates the framework's reliability. A remaining concern is the framework's weak sensitivity to small differences in offered support. The article concludes that applying the framework is especially advantageous for evaluating early design phases and can successfully contribute to the efficient development of driver's in-control and safe means of operating partially automated vehicles. Copyright © 2016 Elsevier Ltd. All rights reserved.
Practical no-gold-standard evaluation framework for quantitative imaging methods: application to lesion segmentation in positron emission tomography

PubMed Central

Jha, Abhinav K.; Mena, Esther; Caffo, Brian; Ashrafinia, Saeed; Rahmim, Arman; Frey, Eric; Subramaniam, Rathan M.

2017-01-01

Abstract. Recently, a class of no-gold-standard (NGS) techniques have been proposed to evaluate quantitative imaging methods using patient data. These techniques provide figures of merit (FoMs) quantifying the precision of the estimated quantitative value without requiring repeated measurements and without requiring a gold standard. However, applying these techniques to patient data presents several practical difficulties including assessing the underlying assumptions, accounting for patient-sampling-related uncertainty, and assessing the reliability of the estimated FoMs. To address these issues, we propose statistical tests that provide confidence in the underlying assumptions and in the reliability of the estimated FoMs. Furthermore, the NGS technique is integrated within a bootstrap-based methodology to account for patient-sampling-related uncertainty. The developed NGS framework was applied to evaluate four methods for segmenting lesions from 18F-fluoro-2-deoxyglucose positron emission tomography images of patients with head-and-neck cancer on the task of precisely measuring the metabolic tumor volume. The NGS technique consistently predicted the same segmentation method as the most precise method. The proposed framework provided confidence in these results, even when gold-standard data were not available. The bootstrap-based methodology indicated improved performance of the NGS technique with larger numbers of patient studies, as was expected, and yielded consistent results as long as data from more than 80 lesions were available for the analysis. PMID:28331883
[Reliability and validity of a generic job exposure matrix applied on a small-business].

PubMed

Haro-García, Luis; Celis-Quintal, Germán; López-Rojas, Pablo; Sánchez-Román, Francisco Raúl; Juárez-Pérez, Cuauhtémoc Arturo

2007-01-01

To evaluate the reliability and validity of a generic job exposure matrix (JEM) applied in a small business. A JEM comprising six sections was evaluated: the number of exposed workers per area, frequency of exposure, time of exposure, level of exposure, safety controls, and proximity to the source of exposure. The JEM also obtains information about possible health effects from exposure to occupational/environmental agents. Two observers estimated the risk of exposure to epoxy resins in 31 workers of an epoxy resin facility in Mexico City. Agreement between the two observers was assessed through percent agreement (PA), weighted kappa (kappa(w)) and the intraclass correlation coefficient (ICC). Disagreements were greater for the number of exposed workers (PA = 61.3, kappa(w) = 0.24, ICC = 0.33), level of exposure (PA = 66.7, kappa(w) = 0.25, ICC = 0.56), and safety controls (PA = 54.8, kappa(w) = 0.23, ICC = 0.69) sections. Percent agreement and kappa(w) were 64% and 0.58, respectively. In accordance with the Landis and Koch, Altman, Fleiss, and Byrt classifications for the interpretation of kappa values, the weighted kappa (0.58) ranged from moderate to fair-to-good. Despite the discordance in some sections, the JEM proved to be useful to identify the risk of exposure in this type of small business.
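The two agreement statistics named in this abstract can be sketched as follows. This is a generic illustration that assumes ordinal ratings coded as integers 0..n_categories-1 and linear disagreement weights |i - j|. The abstract does not state which weighting scheme the authors used, and the ratings shown are hypothetical.

```python
from collections import Counter


def percent_agreement(a: list[int], b: list[int]) -> float:
    """Percentage of subjects on which both raters chose the same category."""
    if len(a) != len(b) or not a:
        raise ValueError("ratings must be non-empty and of equal length")
    return 100.0 * sum(x == y for x, y in zip(a, b)) / len(a)


def linear_weighted_kappa(a: list[int], b: list[int], n_categories: int) -> float:
    """Weighted kappa with linear disagreement weights |i - j|.

    kappa_w = 1 - (observed weighted disagreement) / (chance-expected one)
    """
    if len(a) != len(b) or not a:
        raise ValueError("ratings must be non-empty and of equal length")
    n = len(a)
    pairs = Counter(zip(a, b))
    marg_a, marg_b = Counter(a), Counter(b)
    observed = sum(abs(i - j) * count / n for (i, j), count in pairs.items())
    expected = sum(
        abs(i - j) * marg_a[i] * marg_b[j] / (n * n)
        for i in range(n_categories)
        for j in range(n_categories)
    )
    if expected == 0:
        raise ValueError("kappa is undefined when chance disagreement is zero")
    return 1.0 - observed / expected


# Hypothetical ratings by two observers on a three-level exposure scale.
rater_1 = [0, 1, 2, 2, 1, 0, 1, 2]
rater_2 = [0, 1, 2, 1, 1, 0, 2, 2]
print(percent_agreement(rater_1, rater_2))
print(round(linear_weighted_kappa(rater_1, rater_2, 3), 3))
```

Percent agreement ignores agreement expected by chance, which is why the abstract reports kappa alongside it.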
A reliability study on brain activation during active and passive arm movements supported by an MRI-compatible robot.

PubMed

Estévez, Natalia; Yu, Ningbo; Brügger, Mike; Villiger, Michael; Hepp-Reymond, Marie-Claude; Riener, Robert; Kollias, Spyros

2014-11-01

In neurorehabilitation, longitudinal assessment of arm movement related brain function in patients with motor disability is challenging due to variability in task performance. MRI-compatible robots monitor and control task performance, yielding more reliable evaluation of brain function over time. The main goals of the present study were first to define the brain network activated while performing active and passive elbow movements with an MRI-compatible arm robot (MaRIA) in healthy subjects, and second to test the reproducibility of this activation over time. For the fMRI analysis two models were compared. In model 1 movement onset and duration were included, whereas in model 2 force and range of motion were added to the analysis. Reliability of brain activation was tested with several statistical approaches applied on individual and group activation maps and on summary statistics. The activated network included mainly the primary motor cortex, primary and secondary somatosensory cortex, superior and inferior parietal cortex, medial and lateral premotor regions, and subcortical structures. Reliability analyses revealed robust activation for active movements with both fMRI models and all the statistical methods used. Imposed passive movements also elicited mainly robust brain activation for individual and group activation maps, and reliability was improved by including additional force and range of motion using model 2. These findings demonstrate that the use of robotic devices, such as MaRIA, can be useful to reliably assess arm movement related brain activation in longitudinal studies and may contribute in studies evaluating therapies and brain plasticity following injury in the nervous system.
Calibration and validation of the Physical Activity Barrier Scale for persons who are blind or visually impaired.

PubMed

Lee, Miyoung; Zhu, Weimo; Ackley-Holbrook, Elizabeth; Brower, Diana G; McMurray, Bryan

2014-07-01

It is critical to employ accurate measures when assessing physical activity (PA) barriers in any subpopulation, yet existing measures are not appropriate for persons with blindness or visual impairment (PBVI) due to a lack of validity or reliability evidence. To develop and calibrate a PA barrier scale for PBVI. An expert panel (n = 3) and 18 PBVI were recruited to establish content validity for a PA barriers subscale; 160 PBVI (96 females) completed the scale along with the Physical Activity Scale for Individuals with Physical Disabilities for calibration. To establish construct-related validity evidence, confirmatory factor analysis (CFA) and Rasch analysis were applied. To investigate internal consistency and reliability, Cronbach's alpha and the reliability coefficient (R) were employed, respectively. Following CFA and Rasch analyses, five items were eliminated due to misfits; reliability coefficients were unchanged upon deletion of these items. The barriers perceived by PBVI to have the most negative impact on PA included "lack of self-discipline" (logit = 1.40) and "lack of motivation" (logit = 1.27). "Too many stairs in the exercise facility" (logit = -1.49) was perceived to have the least impact. The newly-developed scale was found to be a valid and reliable tool for evaluating PA barriers in PBVI. To enhance promotion of health-producing levels of PA in PBVI, practitioners should consider applying this new tool as a precursor to programs aimed at improving PA participation in this group. Copyright © 2014 Elsevier Inc. All rights reserved.
Evaluating multidisciplinary health care teams: taking the crisis out of CRM.

PubMed

Sutton, Gigi

2009-08-01

High-reliability organisations are those, such as within the aviation industry, which operate in complex, hazardous environments and yet despite this are able to balance safety and effectiveness. Crew resource management (CRM) training is used to improve the non-technical skills of aviation crews and other high-reliability teams. To date, CRM within the health sector has been restricted to use with "crisis teams" and "crisis events". The purpose of this discussion paper is to examine the application of CRM to acute, ward-based multidisciplinary health care teams and more broadly to argue for the repositioning of health-based CRM to address effective everyday function, of which "crisis events" form just one part. It is argued that CRM methodology could be applied to evaluate ward-based health care teams and design non-technical skills training to increase their efficacy, promote better patient outcomes, and facilitate a range of positive personal and organisational level outcomes.
A flight test of laminar flow control leading-edge systems

NASA Technical Reports Server (NTRS)

Fischer, M. C.; Wright, A. S., Jr.; Wagner, R. D.

1983-01-01

NASA's program for development of a laminar flow technology base for application to commercial transports has made significant progress since its inception in 1976. Current efforts are focused on development of practical reliable systems for the leading-edge region where the most difficult problems in applying laminar flow exist. Practical solutions to these problems will remove many concerns about the ultimate practicality of laminar flow. To address these issues, two contractors performed studies, conducted development tests, and designed and fabricated fully functional leading-edge test articles for installation on the NASA JetStar aircraft. Systems evaluation and performance testing will be conducted to thoroughly evaluate all system capabilities and characteristics. A simulated airline service flight test program will be performed to obtain the operational sensitivity, maintenance, and reliability data needed to establish that practical solutions exist for the difficult leading-edge area of a future commercial transport employing laminar flow control.
Effects of computing time delay on real-time control systems

NASA Technical Reports Server (NTRS)

Shin, Kang G.; Cui, Xianzhong

1988-01-01

The reliability of a real-time digital control system depends not only on the reliability of the hardware and software used, but also on the speed in executing control algorithms. The latter is due to the negative effects of computing time delay on control system performance. For a given sampling interval, the effects of computing time delay are classified into the delay problem and the loss problem. Analysis of these two problems is presented as a means of evaluating real-time control systems. As an example, both the self-tuning predicted (STP) control and Proportional-Integral-Derivative (PID) control are applied to the problem of tracking robot trajectories, and their respective effects of computing time delay on control performance are comparatively evaluated. For this example, the STP (PID) controller is shown to outperform the PID (STP) controller in coping with the delay (loss) problem.

Evaluation of the CONSUME and FOFEM fuel consumption models in pine and mixed hardwood forests of the eastern United States

Treesearch

Susan J. Prichard; Eva C. Karau; Roger D. Ottmar; Maureen C. Kennedy; James B. Cronan; Clinton S. Wright; Robert E. Keane

2014-01-01

Reliable predictions of fuel consumption are critical in the eastern United States (US), where prescribed burning is frequently applied to forests and air quality is of increasing concern. CONSUME and the First Order Fire Effects Model (FOFEM), predictive models developed to estimate fuel consumption and emissions from wildland fires, have not been systematically...
Recent advances in computational structural reliability analysis methods

NASA Astrophysics Data System (ADS)

Thacker, Ben H.; Wu, Y.-T.; Millwater, Harry R.; Torng, Tony Y.; Riha, David S.

1993-10-01

The goal of structural reliability analysis is to determine the probability that the structure will adequately perform its intended function when operating under the given environmental conditions. Thus, the notion of reliability admits the possibility of failure. Given the fact that many different modes of failure are usually possible, achievement of this goal is a formidable task, especially for large, complex structural systems. The traditional (deterministic) design methodology attempts to assure reliability by the application of safety factors and conservative assumptions. However, the safety factor approach lacks a quantitative basis in that the level of reliability is never known and usually results in overly conservative designs because of compounding conservatisms. Furthermore, problem parameters that control the reliability are not identified, nor their importance evaluated. A summary of recent advances in computational structural reliability assessment is presented. A significant level of activity in the research and development community was seen recently, much of which was directed towards the prediction of failure probabilities for single mode failures. The focus is to present some early results and demonstrations of advanced reliability methods applied to structural system problems. This includes structures that can fail as a result of multiple component failures (e.g., a redundant truss), or structural components that may fail due to multiple interacting failure modes (e.g., excessive deflection, resonant vibration, or creep rupture). From these results, some observations and recommendations are made with regard to future research needs.
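The idea of a failure probability over one or several failure modes can be illustrated with a minimal Monte Carlo sketch. It is not the method of the paper above: the normal distributions, the limit-state functions `g_strength` and `g_load_cap`, and all numeric values are hypothetical, chosen only so that the single-mode case has a closed-form answer to compare against.

```python
import random
from statistics import NormalDist

def failure_probability(limit_states, sample, n=200_000, seed=0):
    """Crude Monte Carlo estimate of P(any limit state g(x) < 0)."""
    rng = random.Random(seed)
    failures = 0
    for _ in range(n):
        x = sample(rng)
        # Series-system assumption: the structure fails if ANY mode fails.
        if any(g(x) < 0.0 for g in limit_states):
            failures += 1
    return failures / n

def sample_rs(rng):
    """Hypothetical inputs: resistance R ~ N(10, 1), load S ~ N(6, 1.5)."""
    return rng.gauss(10.0, 1.0), rng.gauss(6.0, 1.5)

def g_strength(x):
    r, s = x
    return r - s          # failure when the load exceeds the resistance

def g_load_cap(x):
    _, s = x
    return 9.0 - s        # hypothetical second mode: load must stay below 9.0

pf_single = failure_probability([g_strength], sample_rs)
pf_system = failure_probability([g_strength, g_load_cap], sample_rs)

# Closed form for the single linear mode with independent normal variables:
# beta = (mu_R - mu_S) / sqrt(sigma_R^2 + sigma_S^2), Pf = Phi(-beta).
beta = (10.0 - 6.0) / (1.0**2 + 1.5**2) ** 0.5
pf_exact = NormalDist().cdf(-beta)

print(f"single mode: MC={pf_single:.4f}  exact={pf_exact:.4f}")
print(f"two modes (series system): MC={pf_system:.4f}")
```

Because both calls use the same seed, the system estimate can never fall below the single-mode estimate, which mirrors the point that additional failure modes can only add to the failure probability of a series system.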
Recent advances in computational structural reliability analysis methods

NASA Technical Reports Server (NTRS)

Thacker, Ben H.; Wu, Y.-T.; Millwater, Harry R.; Torng, Tony Y.; Riha, David S.

1993-01-01

The goal of structural reliability analysis is to determine the probability that the structure will adequately perform its intended function when operating under the given environmental conditions. Thus, the notion of reliability admits the possibility of failure. Given the fact that many different modes of failure are usually possible, achievement of this goal is a formidable task, especially for large, complex structural systems. The traditional (deterministic) design methodology attempts to assure reliability by the application of safety factors and conservative assumptions. However, the safety factor approach lacks a quantitative basis in that the level of reliability is never known and usually results in overly conservative designs because of compounding conservatisms. Furthermore, problem parameters that control the reliability are not identified, nor their importance evaluated. A summary of recent advances in computational structural reliability assessment is presented. A significant level of activity in the research and development community was seen recently, much of which was directed towards the prediction of failure probabilities for single mode failures. The focus is to present some early results and demonstrations of advanced reliability methods applied to structural system problems. This includes structures that can fail as a result of multiple component failures (e.g., a redundant truss), or structural components that may fail due to multiple interacting failure modes (e.g., excessive deflection, resonant vibration, or creep rupture). From these results, some observations and recommendations are made with regard to future research needs.
Evaluating the safety risk of roadside features for rural two-lane roads using reliability analysis.

PubMed

Jalayer, Mohammad; Zhou, Huaguo

2016-08-01

The severity of roadway departure crashes mainly depends on the roadside features, including the sideslope, fixed-object density, offset from fixed objects, and shoulder width. Common engineering countermeasures to improve roadside safety include: cross section improvements, hazard removal or modification, and delineation. It is not always feasible to maintain an object-free and smooth roadside clear zone as recommended in design guidelines. Currently, clear zone width and sideslope are used to determine roadside hazard ratings (RHRs) to quantify the roadside safety of rural two-lane roadways on a seven-point pictorial scale. Since these two variables are continuous and can be treated as random, probabilistic analysis can be applied as an alternative method to address existing uncertainties. Specifically, using reliability analysis, it is possible to quantify roadside safety levels by treating the clear zone width and sideslope as two continuous, rather than discrete, variables. The objective of this manuscript is to present a new approach for defining the reliability index for measuring roadside safety on rural two-lane roads. To evaluate the proposed approach, we gathered five years (2009-2013) of Illinois run-off-road (ROR) crash data and identified the roadside features (i.e., clear zone widths and sideslopes) of 4500 300ft roadway segments. Based on the obtained results, we confirm that reliability indices can serve as indicators to gauge safety levels, such that the greater the reliability index value, the lower the ROR crash rate. Copyright © 2016 Elsevier Ltd. All rights reserved.
Reliability and number of trials of Y Balance Test in adolescent athletes.

PubMed

Linek, Pawel; Sikora, Damian; Wolny, Tomasz; Saulicz, Edward

2017-10-01

The Star Excursion Balance Test (SEBT) is commonly used to evaluate dynamic equilibrium. The Y Balance Test (Y-BT) is a shortened version of the SEBT where a Y-Balance Kit is commonly used. To date, research concerning the protocol and reliability of the SEBT and Y-BT has been conducted only for adults. The aim of the study was to assess the protocol (the necessary number of trials to stabilize the results) and reliability of the Y-BT in adolescent athletes. One-way repeated-measures analysis of variance (ANOVA) and reliability study. The sample of 38 athletes (mean age: 15.6 years) was selected from a football club. A Y-Balance test kit was applied for the evaluation of dynamic balance. The analysis used the values normalized to the relative length of the lower limbs. After six attempts, three consecutive ones achieved stability for all directions and both extremities (p > 0.05). The intraclass correlation coefficient (ICC(3,1)), standard error of measurement and minimal detectable change values for the three attempts ranged from 0.57 to 0.82, from 3 to less than 6% and from 7.68 to 13.7%, respectively. In the study of adolescent dynamic equilibrium using the Y-BT, it is recommended to perform nine attempts (including six trial attempts and three measurements). In order to increase reliability it is recommended that the average of the three measured attempts is analysed. Copyright © 2017 Elsevier Ltd. All rights reserved.
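The three statistics reported above (ICC(3,1), standard error of measurement, minimal detectable change) can be computed from a subjects-by-trials table as in the following sketch. The data are illustrative and not taken from the study. The formulas used here are the commonly cited ones (two-way ANOVA mean squares for ICC(3,1), SEM = SD·sqrt(1 − ICC), MDC95 = 1.96·sqrt(2)·SEM), and the choice of the sample SD of all scores for the SEM is an assumption, since the abstract does not state which SD was used.

```python
from math import sqrt
from statistics import stdev

def icc_3_1(scores):
    """ICC(3,1): two-way mixed effects, consistency, single measurement.

    scores: list of rows, one row per subject, one column per trial/rater.
    """
    n, k = len(scores), len(scores[0])
    grand = sum(map(sum, scores)) / (n * k)
    row_means = [sum(row) / k for row in scores]
    col_means = [sum(row[j] for row in scores) / n for j in range(k)]
    ss_total = sum((x - grand) ** 2 for row in scores for x in row)
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)   # between subjects
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)   # between trials
    ss_err = ss_total - ss_rows - ss_cols                    # residual
    ms_rows = ss_rows / (n - 1)
    ms_err = ss_err / ((n - 1) * (k - 1))
    return (ms_rows - ms_err) / (ms_rows + (k - 1) * ms_err)

# Illustrative table: 6 subjects (rows) x 4 trials (columns).
scores = [
    [9, 2, 5, 8],
    [6, 1, 3, 2],
    [8, 4, 6, 8],
    [7, 1, 2, 6],
    [10, 5, 6, 9],
    [6, 2, 4, 7],
]

icc = icc_3_1(scores)
sd_all = stdev(x for row in scores for x in row)  # assumed SD choice
sem = sd_all * sqrt(1.0 - icc)                    # standard error of measurement
mdc95 = 1.96 * sqrt(2.0) * sem                    # minimal detectable change (95%)

print(f"ICC(3,1)={icc:.3f}  SEM={sem:.3f}  MDC95={mdc95:.3f}")
```

Expressing SEM and MDC as percentages, as the abstract does, would additionally require dividing by the mean score; that step is omitted here.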
Voltage-controlled magnetization switching in MRAMs in conjunction with spin-transfer torque and applied magnetic field

NASA Astrophysics Data System (ADS)

Munira, Kamaram; Pandey, Sumeet C.; Kula, Witold; Sandhu, Gurtej S.

2016-11-01

Voltage-controlled magnetic anisotropy (VCMA) effect has attracted a significant amount of attention in recent years because of its low cell power consumption during the anisotropy modulation of a thin ferromagnetic film. However, the applied voltage or electric field alone is not enough to completely and reliably reverse the magnetization of the free layer of a magnetic random access memory (MRAM) cell from anti-parallel to parallel configuration or vice versa. An additional symmetry-breaking mechanism needs to be employed to ensure the deterministic writing process. Combinations of voltage-controlled magnetic anisotropy together with spin-transfer torque (STT) and with an applied magnetic field (Happ) were evaluated for switching reliability, time taken to switch with low error rate, and energy consumption during the switching process. In order to get a low write error rate in the MRAM cell with VCMA switching mechanism, a spin-transfer torque current or an applied magnetic field comparable to the critical current and field of the free layer is necessary. In the hybrid processes, the VCMA effect lowers the duration during which the higher power hungry secondary mechanism is in place. Therefore, the total energy consumed during the hybrid writing processes, VCMA + STT or VCMA + Happ, is less than the energy consumed during pure spin-transfer torque or applied magnetic field switching.
A pilot rating scale for evaluating failure transients in electronic flight control systems

NASA Technical Reports Server (NTRS)

Hindson, William S.; Schroeder, Jeffery A.; Eshow, Michelle M.

1990-01-01

A pilot rating scale was developed to describe the effects of transients in helicopter flight-control systems on safety-of-flight and on pilot recovery action. The scale was applied to the evaluation of hardovers that could potentially occur in the digital flight-control system being designed for a variable-stability UH-60A research helicopter. Tests were conducted in a large moving-base simulator and in flight. The results of the investigation were combined with existing airworthiness criteria to determine quantitative reliability design goals for the control system.
The detection of tightly closed flaws by nondestructive testing (NDT) methods. [fatigue crack formation in aluminum alloy test specimens]

NASA Technical Reports Server (NTRS)

Rummel, W. D.; Rathke, R. A.; Todd, P. H., Jr.; Mullen, S. J.

1975-01-01

Liquid penetrant, ultrasonic, eddy current and X-radiographic techniques were optimized and applied to the evaluation of 2219-T87 aluminum alloy test specimens in integrally stiffened panel, and weld panel configurations. Fatigue cracks in integrally stiffened panels, lack-of-fusion in weld panels, and fatigue cracks in weld panels were the flaw types used for evaluation. A 2319 aluminum alloy weld filler rod was used for all welding to produce the test specimens. Forty seven integrally stiffened panels containing a total of 146 fatigue cracks, ninety three lack-of-penetration (LOP) specimens containing a total of 239 LOP flaws, and one-hundred seventeen welded specimens containing a total of 293 fatigue cracks were evaluated. Nondestructive test detection reliability enhancement was evaluated during separate inspection sequences in the specimens in the 'as-machined or as-welded', post etched and post proof loaded conditions. Results of the nondestructive test evaluations were compared to the actual flaw size obtained by measurement of the fracture specimens after completing all inspection sequences. Inspection data were then analyzed to provide a statistical basis for determining the flaw detection reliability.
[The organization of the comprehensive prevention of urolithiasis among ferrous metallurgy workers].

PubMed

Egorova, A M

2009-01-01

The purpose of the study was to evaluate the effectiveness of a set of preventive measures applied to 321 workers in the basic ferrous metallurgy specialties (steel makers, mill men, hot metal shearers). On clinical examination, the workers were divided into three groups: workers without any pathology (11.83%, the first group), workers with metabolic disorders only, without urolithiasis (64.81%, the second group), and workers with a urolithiasis diagnosis confirmed by ultrasonography (23.36%, the third group). The effectiveness of the rehabilitation measures (diet therapy, drinking regimen, medicinal plant treatment) was evaluated over half a year. After the course of preventive measures, the overall health condition of most workers improved and the proportion of workers with risk factors for developing urolithiasis decreased significantly, to 6-12%.
The brief multidimensional students' life satisfaction scale-college version.

PubMed

Zullig, Keith J; Huebner, E Scott; Patton, Jon M; Murray, Karen A

2009-01-01

To investigate the psychometric properties of the BMSLSS-College among 723 college students. Internal consistency estimates explored scale reliability, factor analysis explored construct validity, and known-groups validity was assessed using the National College Youth Risk Behavior Survey and Harvard School of Public Health College Alcohol Study. Criterion-related validity was explored through analyses with the CDC's health-related quality of life scale and a social isolation scale. Acceptable internal consistency reliability, construct, known-groups, and criterion-related validity were established. Findings offer preliminary support for the BMSLSS-C; it could be useful in large-scale research studies, applied screening contexts, and for program evaluation purposes toward achieving Healthy People 2010 objectives.
Construction of Response Surface with Higher Order Continuity and Its Application to Reliability Engineering

NASA Technical Reports Server (NTRS)

Krishnamurthy, T.; Romero, V. J.

2002-01-01

The usefulness of piecewise polynomials with C1 and C2 derivative continuity for response surface construction is examined. A Moving Least Squares (MLS) method is developed and compared with four other interpolation methods, including kriging. First, the selected methods are applied and compared with one another in a problem with two design variables and a known theoretical response function. Next, the methods are tested in a problem with four design variables from a reliability-based design application. In general, the piecewise polynomial methods with higher-order derivative continuity produce less error in the response prediction. The MLS method was found to be superior for response surface construction among the methods evaluated.
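A generic one-dimensional Moving Least Squares fit can be sketched as follows. This is a textbook-style illustration, not the formulation developed in the report: the linear basis, the Gaussian weight function, and the support width `h` are all assumptions made for the example.

```python
from math import exp, sin

def mls_predict(xs, ys, x, h=1.0):
    """Moving Least Squares prediction at x with a local linear basis.

    At each query point, a weighted least-squares line a + b*(xi - x) is
    fitted to the samples, with Gaussian weights centred on x; the fitted
    intercept a is the predicted response at x.
    """
    s0 = s1 = s2 = t0 = t1 = 0.0
    for xi, yi in zip(xs, ys):
        d = xi - x
        w = exp(-((d / h) ** 2))      # assumed Gaussian weight function
        s0 += w
        s1 += w * d
        s2 += w * d * d
        t0 += w * yi
        t1 += w * d * yi
    det = s0 * s2 - s1 * s1           # determinant of the 2x2 normal equations
    return (s2 * t0 - s1 * t1) / det  # intercept a, by Cramer's rule

# Sample points and two test responses: one linear, one smooth nonlinear.
xs = [float(i) for i in range(11)]
ys_linear = [2.0 * x + 1.0 for x in xs]
ys_sine = [sin(x) for x in xs]

y_lin = mls_predict(xs, ys_linear, 3.7)
y_sin = mls_predict(xs, ys_sine, 3.7, h=0.8)

print(f"linear data: predicted {y_lin:.6f}, exact {2.0 * 3.7 + 1.0:.6f}")
print(f"sine data:   predicted {y_sin:.4f}, exact {sin(3.7):.4f}")
```

With a linear basis the fit reproduces linear responses exactly, which makes a convenient sanity check; for curved responses the error depends on the sample spacing and on `h`.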
Use of iris recognition camera technology for the quantification of corneal opacification in mucopolysaccharidoses.

PubMed

Aslam, Tariq Mehmood; Shakir, Savana; Wong, James; Au, Leon; Ashworth, Jane

2012-12-01

Mucopolysaccharidoses (MPS) can cause corneal opacification that is currently difficult to objectively quantify. With newer treatments for MPS comes an increased need for a more objective, valid and reliable index of disease severity for clinical and research use. Clinical evaluation by slit lamp is very subjective and techniques based on colour photography are difficult to standardise. In this article the authors present evidence for the utility of dedicated image analysis algorithms applied to images obtained by a highly sophisticated iris recognition camera that is small, manoeuvrable and adapted to achieve rapid, reliable and standardised objective imaging in a wide variety of patients while minimising artefactual interference in image quality.
Outcomes validity and reliability of the modified Rankin scale: implications for stroke clinical trials: a literature review and synthesis.

PubMed

Banks, Jamie L; Marotta, Charles A

2007-03-01

The modified Rankin scale (mRS), a clinician-reported measure of global disability, is widely applied for evaluating stroke patient outcomes and as an end point in randomized clinical trials. Extensive evidence on the validity of the mRS exists across a large but fragmented literature. As new treatments for acute ischemic stroke are submitted for agency approval, an appreciation of the mRS's attributes, specifically its relationship to other stroke evaluation scales, would be valuable for decision-makers to properly assess the impact of a new drug on treatment paradigms. The purpose of this report is to assemble and systematically assess the properties of the mRS to provide decision-makers with pertinent evaluative information. A Medline search was conducted to identify reports in the peer-reviewed medical literature (1957-2006) that provide information on the structure, validation, scoring, and psychometric properties of the mRS and its use in clinical trials. The selection of articles was based on defined criteria that included relevance, study design and use of appropriate statistical methods. Of 224 articles identified by the literature search, 50 were selected for detailed assessment. Inter-rater reliability with the mRS is moderate and improves with structured interviews (kappa 0.56 versus 0.78); strong test-re-test reliability (kappa=0.81 to 0.95) has been reported. Numerous studies demonstrate the construct validity of the mRS by its relationships to physiological indicators such as stroke type, lesion size, perfusion and neurological impairment. Convergent validity between the mRS and other disability scales is well documented. Patient comorbidities and socioeconomic factors should be considered in properly applying and interpreting the mRS. Recent analyses suggest that randomized clinical trials of acute stroke treatments may require a smaller sample size if the mRS is used as a primary end point rather than the Barthel Index. 
Multiple types of evidence attest to the validity and reliability of the mRS. The reported data support the view that the mRS is a valuable instrument for assessing the impact of new stroke treatments.
Inter-rater reliability of three standardized functional tests in patients with low back pain

PubMed Central

Tidstrand, Johan; Horneij, Eva

2009-01-01

Background: Of all patients with low back pain, 85% are diagnosed with "non-specific lumbar pain". Lumbar instability has been described as one specific diagnosis, which several authors have characterized by delayed muscular responses, impaired postural control and impaired muscular coordination. This has mostly been measured and evaluated in a laboratory setting. There are few standardized and evaluated tests of functional muscular coordination that are also applicable outside the laboratory. In ordinary clinical work, tests of functional muscular coordination should be easy to apply. The aim of the present study was therefore to standardize and examine the inter-rater reliability of three functional tests of muscular functional coordination of the lumbar spine in patients with low back pain. Methods: Nineteen consecutive individuals, ten men and nine women, were included (mean age 42 years, SD ± 12 years). Two independent examiners assessed three tests on the same occasion: "single limb stance", "sitting on a Bobath ball with one leg lifted" and "unilateral pelvic lift". The standardization procedure took altered positions of the spine or pelvis and compensatory movements of the free extremities into account. The inter-rater reliability was analyzed by Cohen's kappa coefficient (κ) and by percentage agreement. Results: The inter-rater reliability for the right and the left leg, respectively, was very good for the single limb stance (κ: 0.88–1.0), good (κ: 0.79) and very good (κ: 0.88) for sitting on a Bobath ball, and good (κ: 0.61) and moderate (κ: 0.47) for the unilateral pelvic lift. Conclusion: The present study showed good to very good inter-rater reliability for two standardized tests, that is, the single limb stance and sitting on a Bobath ball with one leg lifted. Inter-rater reliability for the unilateral pelvic lift test was moderate to good. Validation of the tests in their ability to evaluate lumbar stability is required. PMID:19490644
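Cohen's kappa and percentage agreement, the two statistics named in the Methods above, can be computed for two raters as in this short sketch. The ratings are hypothetical (1 = test judged positive, 0 = negative) and are not data from the study.

```python
from collections import Counter

def percent_agreement(rater_a, rater_b):
    """Observed proportion of cases on which the two raters agree."""
    matches = sum(a == b for a, b in zip(rater_a, rater_b))
    return matches / len(rater_a)

def cohens_kappa(rater_a, rater_b):
    """Cohen's kappa: agreement corrected for chance agreement."""
    n = len(rater_a)
    p_observed = percent_agreement(rater_a, rater_b)
    counts_a, counts_b = Counter(rater_a), Counter(rater_b)
    # Chance agreement: product of the raters' marginal proportions,
    # summed over every category used by either rater.
    categories = set(counts_a) | set(counts_b)
    p_expected = sum((counts_a[c] / n) * (counts_b[c] / n) for c in categories)
    return (p_observed - p_expected) / (1.0 - p_expected)

# Hypothetical ratings of ten patients by two examiners.
examiner_1 = [1, 1, 0, 0, 1, 0, 1, 1, 0, 0]
examiner_2 = [1, 1, 0, 1, 1, 0, 1, 0, 0, 0]

agreement = percent_agreement(examiner_1, examiner_2)
kappa = cohens_kappa(examiner_1, examiner_2)

print(f"agreement={agreement:.2f}  kappa={kappa:.2f}")
```

In this example the raters agree on 8 of 10 cases, but because half of that agreement would be expected by chance, kappa is noticeably lower than the raw agreement.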
Aerospace reliability applied to biomedicine.

NASA Technical Reports Server (NTRS)

Lalli, V. R.; Vargo, D. J.

1972-01-01

An analysis is presented that indicates that the reliability and quality assurance methodology selected by NASA to minimize failures in aerospace equipment can be applied directly to biomedical devices to improve hospital equipment reliability. The Space Electric Rocket Test project is used as an example of NASA application of reliability and quality assurance (R&QA) methods. By analogy a comparison is made to show how these same methods can be used in the development of transducers, instrumentation, and complex systems for use in medicine.
In vivo estimation of target registration errors during augmented reality laparoscopic surgery.

PubMed

Thompson, Stephen; Schneider, Crispin; Bosi, Michele; Gurusamy, Kurinchi; Ourselin, Sébastien; Davidson, Brian; Hawkes, David; Clarkson, Matthew J

2018-06-01

Successful use of augmented reality for laparoscopic surgery requires that the surgeon has a thorough understanding of the likely accuracy of any overlay. Whilst the accuracy of such systems can be estimated in the laboratory, it is difficult to extend such methods to the in vivo clinical setting. Herein we describe a novel method that enables the surgeon to estimate in vivo errors during use. We show that the method enables quantitative evaluation of in vivo data gathered with the SmartLiver image guidance system. The SmartLiver system utilises an intuitive display to enable the surgeon to compare the positions of landmarks visible in both a projected model and in the live video stream. From this the surgeon can estimate the system accuracy when using the system to locate subsurface targets not visible in the live video. Visible landmarks may be either point or line features. We test the validity of the algorithm using an anatomically representative liver phantom, applying simulated perturbations to achieve clinically realistic overlay errors. We then apply the algorithm to in vivo data. The phantom results show that using projected errors of surface features provides a reliable predictor of subsurface target registration error for a representative human liver shape. Applying the algorithm to in vivo data gathered with the SmartLiver image-guided surgery system shows that the system is capable of accuracies around 12 mm; however, achieving this reliably remains a significant challenge. We present an in vivo quantitative evaluation of the SmartLiver image-guided surgery system, together with a validation of the evaluation algorithm. This is the first quantitative in vivo analysis of an augmented reality system for laparoscopic surgery.
A three-dimensional histological atlas of the human basal ganglia. II. Atlas deformation strategy and evaluation in deep brain stimulation for Parkinson disease.

PubMed

Bardinet, Eric; Bhattacharjee, Manik; Dormont, Didier; Pidoux, Bernard; Malandain, Grégoire; Schüpbach, Michael; Ayache, Nicholas; Cornu, Philippe; Agid, Yves; Yelnik, Jérôme

2009-02-01

The localization of any given target in the brain has become a challenging issue because of the increased use of deep brain stimulation to treat Parkinson disease, dystonia, and nonmotor diseases (for example, Tourette syndrome, obsessive compulsive disorders, and depression). The aim of this study was to develop an automated method of adapting an atlas of the human basal ganglia to the brains of individual patients. Magnetic resonance images of the brain specimen were obtained before extraction from the skull and histological processing. Adaptation of the atlas to individual patient anatomy was performed by reshaping the atlas MR images to the images obtained in the individual patient using a hierarchical registration applied to a region of interest centered on the basal ganglia, and then applying the reshaping matrix to the atlas surfaces. Results were evaluated by direct visual inspection of the structures visible on MR images and atlas anatomy, by comparison with electrophysiological intraoperative data, and with previous atlas studies in patients with Parkinson disease. The method was both robust and accurate, never failing to provide an anatomically reliable atlas to patient registration. The registration obtained did not exceed a 1-mm mismatch with the electrophysiological signatures in the region of the subthalamic nucleus. This registration method applied to the basal ganglia atlas forms a powerful and reliable method for determining deep brain stimulation targets within the basal ganglia of individual patients.
Outer skin protection of columbium Thermal Protection System (TPS) panels

NASA Technical Reports Server (NTRS)

Culp, J. D.

1973-01-01

A coated columbium alloy material system 0.04 centimeter thick was developed which provides for increased reliability to the load bearing character of the system in the event of physical damage to and loss of the exterior protective coating. The increased reliability to the load bearing columbium alloy (FS-85) was achieved by interposing an oxidation resistant columbium alloy (B-1) between the FS-85 alloy and a fused slurry silicide coating. The B-1 alloy was applied as a cladding to the FS-85 and the composite was fused slurry silicide coated. Results of material evaluation testing included cyclic oxidation testing of specimens with intentional coating defects, tensile testing of several material combinations exposed to reentry profile conditions, and emittance testing after cycling of up to 100 simulated reentries. The clad material, which was shown to provide greater reliability than unclad materials, holds significant promise for use in the thermal protection system of hypersonic reentry vehicles.
Probabilistic confidence for decisions based on uncertain reliability estimates

NASA Astrophysics Data System (ADS)

Reid, Stuart G.

2013-05-01

Reliability assessments are commonly carried out to provide a rational basis for risk-informed decisions concerning the design or maintenance of engineering systems and structures. However, calculated reliabilities and associated probabilities of failure often have significant uncertainties associated with the possible estimation errors relative to the 'true' failure probabilities. For uncertain probabilities of failure, a measure of 'probabilistic confidence' has been proposed to reflect the concern that uncertainty about the true probability of failure could result in a system or structure that is unsafe and could subsequently fail. The paper describes how the concept of probabilistic confidence can be applied to evaluate and appropriately limit the probabilities of failure attributable to particular uncertainties such as design errors that may critically affect the dependability of risk-acceptance decisions. This approach is illustrated with regard to the dependability of structural design processes based on prototype testing with uncertainties attributable to sampling variability.
Reliability Testing of NASA Piezocomposite Actuators

NASA Technical Reports Server (NTRS)

Wilkie, W.; High, J.; Bockman, J.

2002-01-01

NASA Langley Research Center has developed a low-cost piezocomposite actuator which has application for controlling vibrations in large inflatable smart space structures, space telescopes, and high performance aircraft. Tests show the NASA piezocomposite device is capable of producing large, directional, in-plane strains on the order of 2000 parts-per-million peak-to-peak, with no reduction in free-strain performance to 100 million electrical cycles. This paper describes methods, measurements, and preliminary results from our reliability evaluation of the device under externally applied mechanical loads and at various operational temperatures. Tests performed to date show no net reductions in actuation amplitude while the device was moderately loaded through 10 million electrical cycles. Tests were performed at both room temperature and at the maximum operational temperature of the epoxy resin system used in manufacture of the device. Initial indications are that actuator reliability is excellent, with no actuator failures or large net reduction in actuator performance.

Stress Testing of the Philips 60W Replacement Lamp L Prize Entry

DOE Office of Scientific and Technical Information (OSTI.GOV)

Poplawski, Michael E.; Ledbetter, Marc R.; Smith, Mark

2012-04-24

The Pacific Northwest National Laboratory, operated by Battelle for the U.S. Department of Energy, worked with Intertek to develop a procedure for stress testing medium screw-base light sources. This procedure, composed of alternating stress cycles and performance evaluation, was used to qualitatively compare and contrast the durability and reliability of the Philips 60W replacement lamp L Prize entry with market-proven compact fluorescent lamps (CFLs) with comparable light output and functionality. The stress cycles applied simultaneous combinations of electrical, thermal, vibration, and humidity stresses of increasing magnitude. Performance evaluations measured relative illuminance, x chromaticity and y chromaticity shifts after each stress cycle. The Philips L Prize entry lamps appear to be appreciably more durable than the incumbent energy-efficient technology, as represented by the evaluated CFLs, and with respect to the applied stresses. Through the course of testing, all 15 CFL samples permanently ceased to function as a result of the applied stresses, while only 1 Philips L Prize entry lamp exhibited a failure, the nature of which was minor, non-destructive, and a consequence of a known (and resolved) subcontractor issue. Given that current CFL technology appears to be moderately mature and no Philips L Prize entry failures could be produced within the stress envelope causing 100 percent failure of the benchmark CFLs, it seems that, in this particular implementation, light-emitting diode (LED) technology would be much more durable in the field than current CFL technology. However, the Philips L Prize entry lamps used for testing were carefully designed and built for the competition, while the benchmark CFLs were mass produced for retail sale—a distinction that should be taken into consideration. 
Further reliability testing on final production samples would be necessary to judge the extent to which the results of this analysis apply to production versions of the Philips L Prize entry.
[The effectiveness of physical therapy methods (Bobath and motor relearning program) in rehabilitation of stroke patients].

PubMed

Krutulyte, Grazina; Kimtys, Algimantas; Krisciūnas, Aleksandras

2003-01-01

The purpose of this study was to examine whether two different physiotherapy regimes caused any differences in outcome in rehabilitation after stroke. We examined 240 patients with stroke. The examination was carried out at the Rehabilitation Center of Kaunas Second Clinical Hospital. Patients were divided into 2 groups: the Bobath method was applied to the first (I) group (n=147), and the motor relearning program (MRP) method was applied to the second (II) group (n=93). Within each group, samples were established according to sex, age, hospitalization to the rehabilitation unit after the occurrence of CVA, and degree of disorder (hemiplegia, hemiparesis). The mobility of patients was evaluated according to the European Federation for Research in Rehabilitation (EFRR) scale. Activities of daily living were evaluated by the Barthel index. The groups were evaluated before physical therapy; this preliminary analysis showed no statistically significant differences between them (95% confidence level). The same statistical analysis was carried out after physical therapy. Differences between patient groups were compared using the chi-square (χ²) test. The Bobath method was applied with the first group of patients. The aim of the method is to improve the quality of movement on the affected side of the body so that both sides work as harmoniously as possible. When applying this method, the physical therapist guides the patient's body at key points, stimulating normal postural reactions and training a normal movement pattern. The MRP method was used with the second group of patients. This method is based on movement science, biomechanics and training of functional movement. The program is based on the idea that a movement pattern should not be trained; it must be relearned. CONCLUSION. 
This study indicates that physiotherapy with task-oriented strategies, represented by MRP, is preferable to physiotherapy with facilitation/inhibition strategies, such as the Bobath programme, in the rehabilitation of stroke patients (p < 0.05).
Similarity indices of meteo-climatic gauging stations: definition and comparison.

PubMed

Barca, Emanuele; Bruno, Delia Evelina; Passarella, Giuseppe

2016-07-01

Space-time dependencies among monitoring network stations have been investigated to detect and quantify similarity relationships among gauging stations. In this work, besides the well-known rank correlation index, two new similarity indices have been defined and applied to compute the similarity matrix related to the Apulian meteo-climatic monitoring network. The similarity matrices can be applied to address reliably the issue of missing data in space-time series. In order to establish the effectiveness of the similarity indices, a simulation test was then designed and performed with the aim of estimating missing monthly rainfall rates in a suitably selected gauging station. The results of the simulation allowed us to evaluate the effectiveness of the proposed similarity indices. Finally, the multiple imputation by chained equations method was used as a benchmark to have an absolute yardstick for comparing the outcomes of the test. In conclusion, the new proposed multiplicative similarity index resulted at least as reliable as the selected benchmark.
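The similarity-matrix idea in the abstract above can be illustrated with the rank correlation index it mentions. This is a minimal sketch only: the station names and rainfall values are invented, and the two new indices proposed by the authors are not reproduced because the abstract does not define them.

```python
from statistics import mean

def ranks(values):
    """Return 1-based ranks; tied values share their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = avg
        i = j + 1
    return result

def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    return cov / (vx * vy) ** 0.5

def spearman(x, y):
    """Spearman rank correlation: Pearson correlation of the ranks."""
    return pearson(ranks(x), ranks(y))

def similarity_matrix(series):
    """Pairwise rank-correlation similarity between all stations."""
    names = list(series)
    return {a: {b: spearman(series[a], series[b]) for b in names} for a in names}

# Hypothetical monthly rainfall (mm) at three invented stations.
rain = {
    "A": [10, 20, 30, 40, 50, 60],
    "B": [12, 25, 31, 45, 48, 70],
    "C": [60, 50, 40, 30, 20, 10],
}
sim = similarity_matrix(rain)
```

A station with a gap could then borrow information from the station with the highest similarity in its row, which is the general use of such a matrix for missing data that the abstract describes.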
Psychometric performance of the brazilian version of the Mini-cuestionario de calidad de vida en la hipertensión arterial (MINICHAL).

PubMed

Soutello, Ana Lúcia Soares; Rodrigues, Roberta Cunha Matheus; Jannuzzi, Fernanda Freire; Spana, Thaís Moreira; Gallani, Maria Cecília Bueno Jayme; Nadruz Junior, Wilson

2011-01-01

This study aimed to evaluate the feasibility, acceptability, ceiling and floor effects, reliability, and convergent construct validity of the Brazilian version of the Mini Cuestionario de Calidad de Vida en la Hipertensión Arterial (MINICHAL). The study included 200 hypertensive outpatients in a university hospital and a primary healthcare unit. The MINICHAL was applied in 3.0 (± 1.0) minutes with 100% of the items answered. A "ceiling effect" was observed in both dimensions and in the total score, as well as evidence of measurement stability (ICC=0.74). The convergent validity was confirmed by significant positive correlations between similar dimensions of the MINICHAL and the SF-36, and significant negative correlations with the Minnesota Living with Heart Failure Questionnaire - MLHFQ, however, correlations between dissimilar constructs were also observed. It was concluded that the Brazilian version of the MINICHAL presents evidence of reliability and validity when applied to hypertensive outpatients.
Evaluation of tools used to measure calcium and/or dairy consumption in adults.

PubMed

Magarey, Anthea; Baulderstone, Lauren; Yaxley, Alison; Markow, Kylie; Miller, Michelle

2015-05-01

To identify and critique tools for the assessment of Ca and/or dairy intake in adults, in order to ascertain the most accurate and reliable tools available. A systematic review of the literature was conducted using defined inclusion and exclusion criteria. Articles reporting on originally developed tools or testing the reliability or validity of existing tools that measure Ca and/or dairy intake in adults were included. Author-defined criteria for reporting reliability and validity properties were applied. Studies conducted in Western countries. Adults. Thirty papers, utilising thirty-six tools assessing intake of dairy, Ca or both, were identified. Reliability testing was conducted on only two dairy and five Ca tools, with results indicating that only one dairy and two Ca tools were reliable. Validity testing was conducted for all but four Ca-only tools. There was high reliance in validity testing on lower-order tests such as correlation and failure to differentiate between statistical and clinically meaningful differences. Results of the validity testing suggest one dairy and five Ca tools are valid. Thus one tool was considered both reliable and valid for the assessment of dairy intake and only two tools proved reliable and valid for the assessment of Ca intake. While several tools are reliable and valid, their application across adult populations is limited by the populations in which they were tested. These results indicate a need for tools that assess Ca and/or dairy intake in adults to be rigorously tested for reliability and validity.
SU-E-T-630: Predictive Modeling of Mortality, Tumor Control, and Normal Tissue Complications After Stereotactic Body Radiotherapy for Stage I Non-Small Cell Lung Cancer

DOE Office of Scientific and Technical Information (OSTI.GOV)

Lindsay, WD; Oncora Medical, LLC, Philadelphia, PA; Berlind, CG

Purpose: While rates of local control have been well characterized after stereotactic body radiotherapy (SBRT) for stage I non-small cell lung cancer (NSCLC), fewer data are available characterizing survival and normal tissue toxicities, and no validated models exist assessing these parameters after SBRT. We evaluate the reliability of various machine learning techniques when applied to radiation oncology datasets to create predictive models of mortality, tumor control, and normal tissue complications. Methods: A dataset of 204 consecutive patients with stage I non-small cell lung cancer (NSCLC) treated with stereotactic body radiotherapy (SBRT) at the University of Pennsylvania between 2009 and 2013 was used to create predictive models of tumor control, normal tissue complications, and mortality in this IRB-approved study. Nearly 200 data fields of detailed patient- and tumor-specific information, radiotherapy dosimetric measurements, and clinical outcomes data were collected. Predictive models were created for local tumor control, 1- and 3-year overall survival, and nodal failure using 60% of the data (leaving the remainder as a test set). After applying feature selection and dimensionality reduction, nonlinear support vector classification was applied to the resulting features. Models were evaluated for accuracy and area under ROC curve on the 81-patient test set. Results: Models for common events in the dataset (such as mortality at one year) had the highest predictive power (AUC = 0.67, p < 0.05). For rare occurrences such as radiation pneumonitis and local failure (each occurring in less than 10% of patients), too few events were present to create reliable models. Conclusion: Although this study demonstrates the validity of predictive analytics using information extracted from patient medical records and can most reliably predict for survival after SBRT, larger sample sizes are needed to develop predictive models for normal tissue toxicities, and more advanced machine learning methodologies need to be considered in the future.
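The area under the ROC curve used to evaluate the models above can be computed directly from held-out labels and classifier scores. The sketch below uses the pairwise-ranking definition of AUC; the labels and scores are invented, and it does not reproduce the authors' feature selection or support vector pipeline.

```python
def roc_auc(labels, scores):
    """AUC as the fraction of (positive, negative) pairs in which the
    positive case receives the higher score; ties count as half."""
    pos = [s for l, s in zip(labels, scores) if l == 1]
    neg = [s for l, s in zip(labels, scores) if l == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative case")
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

# Hypothetical test-set outcomes (1 = event) and model scores.
labels = [1, 0, 1, 0, 0, 1]
scores = [0.9, 0.4, 0.35, 0.2, 0.6, 0.7]
auc = roc_auc(labels, scores)
```

Because the estimate averages over positive-negative pairs, an outcome with only a handful of events yields very few pairs, which is one way to see why rare outcomes are hard to model reliably.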
Validity and Reliability of Published Comprehensive Theory of Mind Tests for Normal Preschool Children: A Systematic Review.

PubMed

Ziatabar Ahmadi, Seyyede Zohreh; Jalaie, Shohreh; Ashayeri, Hassan

2015-09-01

Theory of mind (ToM) or mindreading is an aspect of social cognition that evaluates mental states and beliefs of oneself and others. Validity and reliability are very important criteria when evaluating standard tests; and without them, these tests are not usable. The aim of this study was to systematically review the validity and reliability of published English comprehensive ToM tests developed for normal preschool children. We searched MEDLINE (PubMed interface), Web of Science, Science direct, PsycINFO, and also evidence base Medicine (The Cochrane Library) databases from 1990 to June 2015. Search strategy was Latin transcription of 'Theory of Mind' AND test AND children. Also, we manually studied the reference lists of all final searched articles and carried out a search of their references. Inclusion criteria were as follows: Valid and reliable diagnostic ToM tests published from 1990 to June 2015 for normal preschool children; and exclusion criteria were as follows: the studies that only used ToM tests and single tasks (false belief tasks) for ToM assessment and/or had no description about structure, validity or reliability of their tests. Methodological quality of the selected articles was assessed using the Critical Appraisal Skills Programme (CASP). In primary searching, we found 1237 articles in total databases. After removing duplicates and applying all inclusion and exclusion criteria, we selected 11 tests for this systematic review. There were a few valid, reliable and comprehensive ToM tests for normal preschool children. However, we had limitations concerning the included articles. The defined ToM tests were different in populations, tasks, mode of presentations, scoring, mode of responses, times and other variables. Also, they had various validities and reliabilities. Therefore, it is recommended that the researchers and clinicians select the ToM tests according to their psychometric characteristics, validity and reliability.
Validity and Reliability of Published Comprehensive Theory of Mind Tests for Normal Preschool Children: A Systematic Review

PubMed Central

Ziatabar Ahmadi, Seyyede Zohreh; Jalaie, Shohreh; Ashayeri, Hassan

2015-01-01

Objective: Theory of mind (ToM) or mindreading is an aspect of social cognition that evaluates mental states and beliefs of oneself and others. Validity and reliability are very important criteria when evaluating standard tests; and without them, these tests are not usable. The aim of this study was to systematically review the validity and reliability of published English comprehensive ToM tests developed for normal preschool children. Method: We searched MEDLINE (PubMed interface), Web of Science, Science direct, PsycINFO, and also evidence base Medicine (The Cochrane Library) databases from 1990 to June 2015. Search strategy was Latin transcription of ‘Theory of Mind’ AND test AND children. Also, we manually studied the reference lists of all final searched articles and carried out a search of their references. Inclusion criteria were as follows: Valid and reliable diagnostic ToM tests published from 1990 to June 2015 for normal preschool children; and exclusion criteria were as follows: the studies that only used ToM tests and single tasks (false belief tasks) for ToM assessment and/or had no description about structure, validity or reliability of their tests. Methodological quality of the selected articles was assessed using the Critical Appraisal Skills Programme (CASP). Result: In primary searching, we found 1237 articles in total databases. After removing duplicates and applying all inclusion and exclusion criteria, we selected 11 tests for this systematic review. Conclusion: There were a few valid, reliable and comprehensive ToM tests for normal preschool children. However, we had limitations concerning the included articles. The defined ToM tests were different in populations, tasks, mode of presentations, scoring, mode of responses, times and other variables. Also, they had various validities and reliabilities. 
Therefore, it is recommended that the researchers and clinicians select the ToM tests according to their psychometric characteristics, validity and reliability. PMID:27006666
Acoustic emission from composite materials. [nondestructive tests]

NASA Technical Reports Server (NTRS)

Visconti, I. C.; Teti, R.

1979-01-01

The two basic areas where the acoustic emission (AE) technique can be applied are materials research and the evaluation of structural reliability. This experimental method leads to a better understanding of fracture mechanisms and is an NDT technique particularly well suited for the study of propagating cracks. Experiments are described in which acoustic emissions were unambiguously correlated with microstructural fracture mechanisms. The advantages and limitations of the AE technique are noted.
Validation databases for simulation models: aboveground biomass and net primary productivity (NPP) estimation using eastwide FIA data

Treesearch

Jennifer C. Jenkins; Richard A. Birdsey

2000-01-01

As interest grows in the role of forest growth in the carbon cycle, and as simulation models are applied to predict future forest productivity at large spatial scales, the need for reliable and field-based data for evaluation of model estimates is clear. We created estimates of potential forest biomass and annual aboveground production for the Chesapeake Bay watershed...
Numerical and experimental evaluation of microfluidic sorting devices.

PubMed

Taylor, Jay K; Ren, Carolyn L; Stubley, G D

2008-01-01

The development of lab-on-a-chip devices calls for the isolation or separation of specific bioparticles or cells. The design of a miniaturized cell-sorting device for handheld operation must follow the strict parameters associated with lab-on-a-chip technology. The limitations include applied voltage, high efficiency of cell-separation, reliability, size, flow control, and cost, among others. Currently used designs have achieved successful levels of cell isolation; however, further improvements in the microfluidic chip design are important to incorporate into larger systems. This study evaluates specific design modifications that contribute to the reduction of required applied potential aiming for developing portable devices, improved operation reliability by minimizing induced pressure disturbance when electrokinetic pumping is employed, and improved flow control by incorporating directing streams achieving dynamic sorting and counting. The chip designs fabricated in glass and polymeric materials include asymmetric channel widths for sample focusing, nonuniform channel depth for minimizing induced pressure disturbance, directing streams to assist particle flow control, and online filters for reducing channel blockage. Fluorescence-based visualization experimental results of electrokinetic focusing, flow field phenomena, and dynamic sorting demonstrate the advantages of the chip design. Numerical simulations in COMSOL are validated by the experimental data and used to investigate the effects of channel geometry and fluid properties on the flow field.
Application of single-step genomic evaluation for crossbred performance in pig.

PubMed

Xiang, T; Nielsen, B; Su, G; Legarra, A; Christensen, O F

2016-03-01

Crossbreeding is predominant and intensively used in commercial meat production systems, especially in poultry and swine. Genomic evaluation has been successfully applied for breeding within purebreds but also offers opportunities of selecting purebreds for crossbred performance by combining information from purebreds with information from crossbreds. However, it generally requires that all relevant animals are genotyped, which is costly and presently does not seem to be feasible in practice. Recently, a novel single-step BLUP method for genomic evaluation of both purebred and crossbred performance has been developed that can incorporate marker genotypes into a traditional animal model. This new method has not been validated in real data sets. In this study, we applied this single-step method to analyze data for the maternal trait of total number of piglets born in Danish Landrace, Yorkshire, and two-way crossbred pigs in different scenarios. The genetic correlation between purebred and crossbred performances was investigated first, and then the impact of (crossbred) genomic information on prediction reliability for crossbred performance was explored. The results confirm the existence of a moderate genetic correlation, and it was seen that the standard errors on the estimates were reduced when including genomic information. Models with marker information, especially crossbred genomic information, improved model-based reliabilities for crossbred performance of purebred boars and also improved the predictive ability for crossbred animals and, to some extent, reduced the bias of prediction. We conclude that the new single-step BLUP method is a good tool in the genetic evaluation for crossbred performance in purebred animals.
An adaptive cubature formula for efficient reliability assessment of nonlinear structural dynamic systems

NASA Astrophysics Data System (ADS)

Xu, Jun; Kong, Fan

2018-05-01

Extreme value distribution (EVD) evaluation is a critical topic in reliability analysis of nonlinear structural dynamic systems. In this paper, a new method is proposed to obtain the EVD. The maximum entropy method (MEM) with fractional moments as constraints is employed to derive the entire range of EVD. Then, an adaptive cubature formula is proposed for fractional moments assessment involved in MEM, which is closely related to the efficiency and accuracy for reliability analysis. Three point sets, which include a total of 2d² + 1 integration points in the dimension d, are generated in the proposed formula. In this regard, the efficiency of the proposed formula is ensured. Besides, a "free" parameter is introduced, which makes the proposed formula adaptive with the dimension. The "free" parameter is determined by arranging one point set adjacent to the boundary of the hyper-sphere which contains the bulk of total probability. In this regard, the tail distribution may be better reproduced and the fractional moments could be evaluated with accuracy. Finally, the proposed method is applied to a ten-storey shear frame structure under seismic excitations, which exhibits strong nonlinearity. The numerical results demonstrate the efficacy of the proposed method.
Development of Equivalent Material Properties of Microbump for Simulating Chip Stacking Packaging

PubMed Central

Lee, Chang-Chun; Tzeng, Tzai-Liang; Huang, Pei-Chen

2015-01-01

A three-dimensional integrated circuit (3D-IC) structure with a significant scale mismatch causes difficulty in analytic model construction. This paper proposes a simulation technique to introduce an equivalent material composed of microbumps and their surrounding wafer level underfill (WLUF). The mechanical properties of this equivalent material, including Young’s modulus (E), Poisson’s ratio, shear modulus, and coefficient of thermal expansion (CTE), are directly obtained by applying either a tensile load or a constant displacement, and by increasing the temperature during simulations, respectively. Analytic results indicate that at least eight microbumps at the outermost region of the chip stacking structure need to be considered as an accurate stress/strain contour in the concerned region. In addition, a factorial experimental design with analysis of variance is proposed to optimize chip stacking structure reliability with four factors: chip thickness, substrate thickness, CTE, and E-value. Analytic results show that the most significant factor is CTE of WLUF. This factor affects microbump reliability and structural warpage under a temperature cycling load and high-temperature bonding process. WLUF with low CTE and high E-value are recommended to enhance the assembly reliability of the 3D-IC architecture. PMID:28793495
[Psychometric properties of a self-efficacy scale for physical activity in Brazilian adults].

PubMed

Rech, Cassiano Ricardo; Sarabia, Tais Taiana; Fermino, Rogério César; Hallal, Pedro Curi; Reis, Rodrigo Siqueira

2011-04-01

To test the validity and reliability of a self-efficacy scale for physical activity (PA) in Brazilian adults. A self-efficacy scale was applied jointly with a multidimensional questionnaire through face-to-face interviews with 1,418 individuals (63.4% women) aged ≥ 18 years. The scale was submitted to validity (factorial and construct) and reliability analysis (internal consistency and temporal stability). A test-retest procedure was conducted with 74 individuals to evaluate temporal stability. Exploratory factor analyses revealed two independent factors: self-efficacy for walking and self-efficacy for moderate and vigorous PA (MVPA). Together, these two factors explained 65.4% of the total variance of the scale (20.9% and 44.5% for walking and MVPA, respectively). Cronbach's alpha values were 0.83 for walking and 0.90 for MVPA, indicating high internal consistency. Both factors were significantly and positively correlated (rho ≥ 0.17, P < 0.001) with quality of life indicators (health perception, self-satisfaction, and energy for daily activities), indicating an adequate construct validity. The scale's validity, internal consistency, and reliability were adequate to evaluate self-efficacy for PA in Brazilian adults.
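Cronbach's alpha, reported above as the internal-consistency measure, follows from the item variances and the variance of the summed score. The sketch below uses the standard formula with sample variances; the item scores are invented and are not data from the study.

```python
from statistics import variance

def cronbach_alpha(items):
    """Cronbach's alpha for a list of items, each a list of scores
    from the same respondents in the same order."""
    k = len(items)
    if k < 2:
        raise ValueError("alpha needs at least two items")
    # Total score per respondent across all items.
    totals = [sum(resp) for resp in zip(*items)]
    item_var = sum(variance(item) for item in items)
    return k / (k - 1) * (1 - item_var / variance(totals))

# Hypothetical responses of five people to three items.
scores = [
    [1, 2, 3, 4, 5],
    [2, 3, 4, 5, 6],
    [1, 3, 3, 5, 5],
]
alpha = cronbach_alpha(scores)
```

Alpha approaches 1 when the items rise and fall together across respondents, which is why values such as 0.83 and 0.90 are read as high internal consistency.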
[Balanced scorecard for performance measurement of a nursing organization in a Korean hospital].

PubMed

Hong, Yoonmi; Hwang, Kyung Ja; Kim, Mi Ja; Park, Chang Gi

2008-02-01

The purpose of this study was to develop a balanced scorecard (BSC) for performance measurement of a Korean hospital nursing organization and to evaluate the validity and reliability of performance measurement indicators. Two hundred fifty-nine nurses in a Korean hospital participated in a survey questionnaire that included 29-item performance evaluation indicators developed by investigators of this study based on the Kaplan and Norton's BSC (1992). Cronbach's alpha was used to test the reliability of the BSC. Exploratory and confirmatory factor analysis with a structure equation model (SEM) was applied to assess the construct validity of the BSC. Cronbach's alpha of 29 items was .948. Factor analysis of the BSC showed 5 principal components (eigen value >1.0) which explained 62.7% of the total variance, and it included a new one, community service. The SEM analysis results showed that 5 components were significant for the hospital BSC tool. High degree of reliability and validity of this BSC suggests that it may be used for performance measurements of a Korean hospital nursing organization. Future studies may consider including a balanced number of nurse managers and staff nurses in the study. Further data analysis on the relationships among factors is recommended.
Safety assessment of a shallow foundation using the random finite element method

NASA Astrophysics Data System (ADS)

Zaskórski, Łukasz; Puła, Wojciech

2015-04-01

A complex structure of soil and its random character are reasons why soil modeling is a cumbersome task. Heterogeneity of soil has to be considered even within a homogenous layer of soil. Therefore an estimation of shear strength parameters of soil for the purposes of a geotechnical analysis causes many problems. In applicable standards (Eurocode 7) there is not presented any explicit method of an evaluation of characteristic values of soil parameters. Only general guidelines can be found how these values should be estimated. Hence many approaches of an assessment of characteristic values of soil parameters are presented in literature and can be applied in practice. In this paper, the reliability assessment of a shallow strip footing was conducted using a reliability index β. Therefore some approaches of an estimation of characteristic values of soil properties were compared by evaluating values of reliability index β which can be achieved by applying each of them. Method of Orr and Breysse, Duncan's method, Schneider's method, Schneider's method concerning influence of fluctuation scales and method included in Eurocode 7 were examined. Design values of the bearing capacity based on these approaches were referred to the stochastic bearing capacity estimated by the random finite element method (RFEM). Design values of the bearing capacity were conducted for various widths and depths of a foundation in conjunction with design approaches DA defined in Eurocode. RFEM was presented by Griffiths and Fenton (1993). It combines deterministic finite element method, random field theory and Monte Carlo simulations. Random field theory allows to consider a random character of soil parameters within a homogenous layer of soil. For this purpose a soil property is considered as a separate random variable in every element of a mesh in the finite element method with proper correlation structure between points of given area. 
RFEM was applied to estimate which theoretical probability distribution fits the empirical probability distribution of bearing capacity based on 3000 realizations. The assessed probability distribution was applied to compute design values of the bearing capacity and the related reliability indices β. The analyses were carried out for a cohesive soil. Hence the friction angle and the cohesion were defined as random parameters and characterized by two-dimensional random fields. The friction angle was described by a bounded distribution, as it varies within a limited range, while a lognormal distribution was applied in the case of the cohesion. Other properties (Young's modulus, Poisson's ratio and unit weight) were assumed to be deterministic because they have negligible influence on the stochastic bearing capacity. Griffiths D. V., & Fenton G. A. (1993). Seepage beneath water retaining structures founded on spatially random soil. Géotechnique, 43(6), 577-587.
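The link between a simulated failure probability and the reliability index β can be sketched as follows. This is not RFEM: a single lognormal random variable stands in for the bearing capacity that the random finite element model would produce, and the load, median capacity, and spread are invented values. Only the count of 3000 realizations is taken from the abstract.

```python
import math
import random
from statistics import NormalDist

def reliability_index(p_failure):
    """beta = -Phi^{-1}(p_f), with Phi the standard normal CDF."""
    if not 0.0 < p_failure < 1.0:
        raise ValueError("failure probability must lie strictly between 0 and 1")
    return -NormalDist().inv_cdf(p_failure)

def monte_carlo_failure_probability(load, median_capacity, sigma_ln, n, seed=0):
    """Fraction of realizations in which a lognormal capacity falls below the load."""
    rng = random.Random(seed)
    mu_ln = math.log(median_capacity)
    failures = sum(rng.lognormvariate(mu_ln, sigma_ln) < load for _ in range(n))
    return failures / n

# Hypothetical values: load 300 kPa, median capacity 500 kPa, log-std 0.25.
p_f = monte_carlo_failure_probability(300.0, 500.0, 0.25, n=3000)
beta = reliability_index(p_f)
```

With a few thousand realizations only moderately small failure probabilities can be estimated by direct counting, which is one reason to fit a theoretical distribution to the simulated capacities before computing design values.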
Hypertension Knowledge-Level Scale (HK-LS): a study on development, validity and reliability.

PubMed

Erkoc, Sultan Baliz; Isikli, Burhanettin; Metintas, Selma; Kalyoncu, Cemalettin

2012-03-01

This study was conducted to develop a scale to measure knowledge about hypertension among Turkish adults. The Hypertension Knowledge-Level Scale (HK-LS) was generated based on content, face, and construct validity, internal consistency, test re-test reliability, and discriminative validity procedures. The final scale had 22 items with six sub-dimensions. The scale was applied to 457 individuals aged ≥ 18 years, and 414 of them were re-evaluated for test-retest reliability. The six sub-dimensions encompassed 60.3% of the total variance. Cronbach alpha coefficients were 0.82 for the entire scale and 0.92, 0.59, 0.67, 0.77, 0.72, and 0.76 for the sub-dimensions of definition, medical treatment, drug compliance, lifestyle, diet, and complications, respectively. The scale ensured internal consistency in reliability and construct validity, as well as stability over time. Significant relationships were found between knowledge score and age, gender, educational level, and history of hypertension of the participants. No correlation was found between knowledge score and working at an income-generating job. The present scale, developed to measure the knowledge level of hypertension among Turkish adults, was found to be valid and reliable.
Defect recognition in CFRP components using various NDT methods within a smart manufacturing process

NASA Astrophysics Data System (ADS)

Schumacher, David; Meyendorf, Norbert; Hakim, Issa; Ewert, Uwe

2018-04-01

The manufacturing process of carbon fiber reinforced polymer (CFRP) components is gaining a more and more significant role when looking at the increasing amount of CFRPs used in industries today. The monitoring of the manufacturing process and hence the reliability of the manufactured products, is one of the major challenges we need to face in the near future. Common defects which arise during manufacturing process are e.g. porosity and voids which may lead to delaminations during operation and under load. To find irregularities and classify them as possible defects in an early stage of the manufacturing process is of high importance for the safety and reliability of the finished products, as well as of significant impact from an economical point of view. In this study we compare various NDT methods which were applied to similar CFRP laminate samples in order to detect and characterize regions of defective volume. Besides ultrasound, thermography and eddy current, different X-ray methods like radiography, laminography and computed tomography are used to investigate the samples. These methods are compared with the intention to evaluate their capability to reliably detect and characterize defective volume. Beyond the detection and evaluation of defects, we also investigate possibilities to combine various NDT methods within a smart manufacturing process in which the decision which method shall be applied is inherent within the process. Is it possible to design an in-line or at-line testing process which can recognize defects reliably and reduce testing time and costs? This study aims to show up opportunities of designing a smart NDT process synchronized to the production based on the concepts of smart production (Industry 4.0). A set of defective CFRP laminate samples and different NDT methods were used to demonstrate how effective defects are recognized and how communication between interconnected NDT sensors and the manufacturing process could be organized.
New International Program to Assess the Reliability of Emerging Nondestructive Techniques (PARENT)

DOE Office of Scientific and Technical Information (OSTI.GOV)

Prokofiev, Iouri; Cumblidge, Stephen E.; Csontos, Aladar A.

2013-01-25

The Nuclear Regulatory Commission (NRC) established the Program to Assess the Reliability of Emerging Nondestructive Techniques (PARENT) to follow on from the successful Program for the Inspection of Nickel alloy Components (PINC). The goal of the PARENT is to conduct a confirmatory assessment of the reliability of nondestructive evaluation (NDE) techniques for detecting and sizing primary water stress corrosion cracks (PWSCC) and applying the lessons learned from PINC to a series of round-robin tests. These open and blind round-robin tests will comprise a new set of typical pressure boundary components including dissimilar metal welds (DMWs) and bottom-mounted instrumentation penetrations. Open round-robin tests will engage research and industry teams worldwide to investigate and demonstrate the reliability of emerging NDE techniques to detect and size flaws with a wide range of lengths, depths, orientations, and locations. Blind round-robin tests will utilize various testing organizations, whose inspectors and procedures are certified by the standards for the nuclear industry in their respective countries, to investigate the ability of established NDE techniques to detect and size flaws whose characteristics range from relatively easy to very difficult for detection and sizing. Blind and open round-robin testing started in late 2011 and early 2012, respectively. This paper will present the work scope with reports on progress, NDE methods evaluated, and project timeline for PARENT.

Sport-specific endurance plank test for evaluation of global core muscle function.

PubMed

Tong, Tom K; Wu, Shing; Nie, Jinlei

2014-02-01

To examine the validity and reliability of a sports-specific endurance plank test for the evaluation of global core muscle function. Repeated-measures study. Laboratory environment. Twenty-eight male and eight female young athletes. Surface electromyography (sEMG) of selected trunk flexors and extensors, and an intervention of pre-fatigue core workout were applied for test validation. Intraclass correlation coefficient (ICC), coefficient of variation (CV), and the measurement bias ratio */÷ ratio limits of agreement (LOA) were calculated to assess reliability and measurement error. Test validity was shown by the sEMG of selected core muscles, which indicated >50% increase in muscle activation during the test; and the definite discrimination of the ∼30% reduction in global core muscle endurance subsequent to a pre-fatigue core workout. For test-retest reliability, when the first attempt of three repeated trials was considered as familiarisation, the ICC was 0.99 (95% CI: 0.98-0.99), CV was 2.0 ± 1.56% and the measurement bias ratio */÷ ratio LOA was 0.99 */÷ 1.07. The findings suggest that the sport-specific endurance plank test is a valid, reliable and practical method for assessing global core muscle endurance in athletes given that at least one familiarisation trial takes place prior to measurement. Copyright © 2013 Elsevier Ltd. All rights reserved.
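A test-retest intraclass correlation coefficient like the one reported above can be derived from a two-way analysis of variance of a subjects-by-trials table. The abstract does not say which ICC form was used, so the sketch below assumes the single-measure consistency form, often written ICC(3,1); the hold times are invented.

```python
from statistics import mean

def icc_consistency(data):
    """ICC(3,1) for a table with one row per subject and one column per trial."""
    n, k = len(data), len(data[0])
    grand = mean(v for row in data for v in row)
    row_means = [mean(row) for row in data]
    col_means = [mean(col) for col in zip(*data)]
    # Two-way ANOVA sums of squares: subjects, trials, and residual error.
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((v - grand) ** 2 for row in data for v in row)
    ss_err = ss_total - ss_rows - ss_cols
    ms_rows = ss_rows / (n - 1)
    ms_err = ss_err / ((n - 1) * (k - 1))
    return (ms_rows - ms_err) / (ms_rows + (k - 1) * ms_err)

# Hypothetical plank hold times (s) for five athletes on two trials.
hold_times = [
    [100.0, 103.0],
    [120.0, 118.0],
    [90.0, 93.0],
    [150.0, 149.0],
    [135.0, 138.0],
]
icc = icc_consistency(hold_times)
```

In the consistency form a constant shift between trials does not lower the coefficient, so a systematic learning effect has to be checked separately, for example through the bias ratio the abstract reports.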
Insights into the use of thermography to assess burn wound healing potential: a reliable and valid technique when compared to laser Doppler imaging

NASA Astrophysics Data System (ADS)

Jaspers, Mariëlle E. H.; Maltha, Ilse; Klaessens, John H. G. M.; de Vet, Henrica C. W.; Verdaasdonk, Rudolf M.; van Zuijlen, Paul P. M.

2016-09-01

Adequate assessment of burn wounds is crucial in the management of burn patients. Thermography, as a noninvasive measurement tool, can be utilized to detect the remaining perfusion over large burn wound areas by measuring temperature, thereby reflecting the healing potential (HP) (i.e., number of days that burns require to heal). The objective of this study was to evaluate the clinimetric properties (i.e., reliability and validity) of thermography for measuring burn wound HP. To evaluate reliability, two independent observers performed a thermography measurement of 50 burns. The intraclass correlation coefficient (ICC), the standard error of measurement (SEM), and the limits of agreement (LoA) were calculated. To assess validity, temperature differences between burned and nonburned skin (ΔT) were compared to the HP found by laser Doppler imaging (serving as the reference standard). By applying a visual method, one ΔT cutoff point was identified to differentiate between burns requiring conservative versus surgical treatment. The ICC was 0.99, expressing an excellent correlation between the two measurements. The SEM was calculated at 0.22°C, and the LoA at -0.58°C and 0.64°C. The ΔT cutoff point was -0.07°C (sensitivity 80%, specificity 80%). These results show that thermography is a reliable and valid technique in the assessment of burn wound HP.
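The abstract reports limits of agreement and an SEM but not the formulas behind them. The following is a minimal sketch, assuming the usual Bland-Altman definition (mean difference ± 1.96 standard deviations of the paired differences) and, as one common convention, an SEM equal to the standard deviation of the differences divided by √2. The function names and the two observer lists are hypothetical.

```python
import math
import statistics


def limits_of_agreement(obs_a, obs_b, z=1.96):
    """Bland-Altman 95% limits of agreement for paired measurements."""
    diffs = [a - b for a, b in zip(obs_a, obs_b)]
    mean_diff = statistics.mean(diffs)
    sd_diff = statistics.stdev(diffs)   # sample SD of the differences
    return mean_diff - z * sd_diff, mean_diff + z * sd_diff


def sem_from_differences(obs_a, obs_b):
    """SEM under the assumption SEM = SD of differences / sqrt(2)."""
    diffs = [a - b for a, b in zip(obs_a, obs_b)]
    return statistics.stdev(diffs) / math.sqrt(2)
```

Under these assumptions the reported figures appear mutually consistent: limits of -0.58°C and 0.64°C imply a standard deviation of the differences of about 0.31°C, and 0.31/√2 ≈ 0.22°C, matching the stated SEM.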
Use of Model-Based Design Methods for Enhancing Resiliency Analysis of Unmanned Aerial Vehicles

NASA Astrophysics Data System (ADS)

Knox, Lenora A.

The most common traditional non-functional requirement analysis is reliability. With systems becoming more complex, networked, and adaptive to environmental uncertainties, system resiliency has recently become the non-functional requirement analysis of choice. Analysis of system resiliency has challenges, which include defining resilience for domain areas, identifying resilience metrics, determining resilience modeling strategies, and understanding how to best integrate the concepts of risk and reliability into resiliency. Formal methods that integrate all of these concepts do not currently exist in specific domain areas. Leveraging RAMSoS, a model-based reliability analysis methodology for Systems of Systems (SoS), we propose an extension that accounts for resiliency analysis through evaluation of mission performance, risk, and cost using multi-criteria decision-making (MCDM) modeling and design trade study variability modeling evaluation techniques. This proposed methodology, coined RAMSoS-RESIL, is applied to a case study in the multi-agent unmanned aerial vehicle (UAV) domain to investigate the potential benefits of a mission architecture where functionality to complete a mission is disseminated across multiple UAVs (distributed) as opposed to being contained in a single UAV (monolithic). The case-study-based research demonstrates proof of concept for the proposed model-based technique and provides sufficient preliminary evidence to conclude which architectural design (distributed vs. monolithic) is most resilient based on insight into mission resilience performance, risk, and cost in addition to the traditional analysis of reliability.
Geometric classification of scalp hair for valid drug testing, 6 more reliable than 8 hair curl groups.

PubMed

Mkentane, K; Van Wyk, J C; Sishi, N; Gumedze, F; Ngoepe, M; Davids, L M; Khumalo, N P

2017-01-01

Curly hair is reported to contain higher lipid content than straight hair, which may influence incorporation of lipid soluble drugs. The use of race to describe hair curl variation (Asian, Caucasian and African) is unscientific yet common in medical literature (including reports of drug levels in hair). This study investigated the reliability of a geometric classification of hair (based on 3 measurements: the curve diameter, curl index and number of waves). After ethical approval and informed consent, proximal virgin (6 cm) hair sampled from the vertex of the scalp in 48 healthy volunteers was evaluated. Three raters each scored hairs from 48 volunteers on two occasions each for the 8 and 6-group classifications. One rater applied the 6-group classification to 80 additional volunteers in order to further confirm the reliability of this system. The Kappa statistic was used to assess intra and inter rater agreement. Each rater classified 480 hairs on each occasion. No rater classified any volunteer's 10 hairs into the same group; the most frequently occurring group was used for analysis. The inter-rater agreement was poor for the 8-groups (k = 0.418) but improved for the 6-groups (k = 0.671). The intra-rater agreement also improved (k = 0.444 to 0.648 versus 0.599 to 0.836) for 6-groups; that for the one evaluator for all volunteers was good (k = 0.754). Although small, this is the first study to test the reliability of a geometric classification. The 6-group method is more reliable. However, a digital classification system is likely to reduce operator error. A reliable objective classification of human hair curl is long overdue, particularly with the increasing use of hair as a testing substrate for treatment compliance in Medicine.
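The abstract names "the Kappa statistic" without saying which variant was used; with three raters a multi-rater form such as Fleiss' kappa is also plausible. As a minimal sketch, assuming Cohen's kappa for one pair of raters (or one rater on two occasions), chance-corrected agreement could be computed as below. The function name and the two label lists are hypothetical.

```python
from collections import Counter


def cohen_kappa(ratings_1, ratings_2):
    """Cohen's kappa for two equally long lists of category labels."""
    n = len(ratings_1)
    # Observed proportion of items given the same label
    observed = sum(a == b for a, b in zip(ratings_1, ratings_2)) / n
    # Agreement expected by chance from each rater's marginal frequencies
    counts_1, counts_2 = Counter(ratings_1), Counter(ratings_2)
    expected = sum(counts_1[c] * counts_2[c] for c in counts_1) / (n * n)
    return (observed - expected) / (1 - expected)
```

The statistic is undefined when both raters place every item in one and the same category, because the expected agreement is then 1; a caller would need to handle that case separately.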
European Organization for Research and Treatment of Cancer Quality of Life Questionnaire Core 30: factorial models to Brazilian cancer patients

PubMed Central

Campos, Juliana Alvares Duarte Bonini; Spexoto, Maria Cláudia Bernardes; da Silva, Wanderson Roberto; Serrano, Sergio Vicente; Marôco, João

2018-01-01

ABSTRACT Objective To evaluate the psychometric properties of the seven theoretical models proposed in the literature for European Organization for Research and Treatment of Cancer Quality of Life Questionnaire Core 30 (EORTC QLQ-C30), when applied to a sample of Brazilian cancer patients. Methods Content and construct validity (factorial, convergent, discriminant) were estimated. Confirmatory factor analysis was performed. Convergent validity was analyzed using the average variance extracted. Discriminant validity was analyzed using correlational analysis. Internal consistency and composite reliability were used to assess the reliability of instrument. Results A total of 1,020 cancer patients participated. The mean age was 53.3±13.0 years, and 62% were female. All models showed adequate factorial validity for the study sample. Convergent and discriminant validities and the reliability were compromised in all of the models for all of the single items referring to symptoms, as well as for the “physical function” and “cognitive function” factors. Conclusion All theoretical models assessed in this study presented adequate factorial validity when applied to Brazilian cancer patients. The choice of the best model for use in research and/or clinical protocols should be centered on the purpose and underlying theory of each model. PMID:29694609
[Development and validation of a questionnaire on perception of portfolio by undergraduate medical students].

PubMed

Riquelme, Arnoldo; Méndez, Benjamín; de la Fuente, Paloma; Padilla, Oslando; Benaglio, Carla; Sirhan, Marisol; Labarca, Jaime

2011-01-01

Portfolio is an innovative instrument that promotes reflection, creativity and professionalism among students. To describe the development and validation process of a questionnaire to evaluate the use of portfolio in undergraduate medical students. Focus groups with students and teachers were employed to identify aspects related with portfolio in undergraduate teaching. The Delphi technique was used to prioritize relevant aspects and construct the questionnaire. The validated questionnaire, consisting of 43 items and 6 factors, was applied to 97 students (response rate of 99.9%) in 2007 and 100 students (99.2%) in 2008. Each question had to be answered using a Likert scale, from 0 (completely disagree) to 4 (completely agree). The validity and reliability of the questionnaire were evaluated. The questionnaire showed a high reliability (Cronbach alpha = 0.9). The mean total scores obtained in 2007 and 2008 were 106.2 ± 21.2 (61.7% of the maximal obtainable score) and 104.6 ± 34.0 (60.8% of the maximal obtainable score), respectively. No significant differences were seen in the analysis by factors. Changes in portfolio during 2008 showed differences in items related with organization, evaluation and regulation. The questionnaire is a valid and highly reliable instrument, measuring perceptions about the portfolio by undergraduate medical students. The students perceived an improvement in their creativity and professionalism as one of the strengths of portfolio. The weaknesses identified during the implementation process helped us to focus changes in organization and evaluation to improve the portfolio as a dynamic process.
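For readers unfamiliar with the reliability coefficient reported here, the following is a minimal sketch of Cronbach's alpha, assuming sample variances and a hypothetical `responses` matrix with one row per student and one column per questionnaire item. It illustrates the standard formula only and is not the authors' analysis code.

```python
import statistics


def cronbach_alpha(responses):
    """Cronbach's alpha for a respondents-by-items matrix of numeric scores."""
    k = len(responses[0])   # number of items
    # Variance of each item across respondents
    item_vars = [
        statistics.variance([row[j] for row in responses]) for j in range(k)
    ]
    # Variance of each respondent's total score
    total_var = statistics.variance([sum(row) for row in responses])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)
```

With 43 items scored 0 to 4, the maximal obtainable total is 43 × 4 = 172, so the reported means of 106.2 and 104.6 correspond to 61.7% and 60.8% of the maximum, as stated in the abstract.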
Evaluation of Contrast Extravasation as a Diagnostic Criterion in the Evaluation of Arthroscopically Proven HAGL/pHAGL Lesions

PubMed Central

Maldjian, Catherine; Khanna, Vineet; Bradley, James; Adam, Richard

2014-01-01

Purpose. The validity of preoperative MRI in diagnosing HAGL lesions is debated. Various investigations have produced mixed results with regard to the utility of MRI. The purpose of this investigation is to apply a novel method of diagnosing HAGL/pHAGL lesions by looking at contrast extravasation and to evaluate the reliability of such extravasation of contrast into an extra-articular space as a sign of HAGL/pHAGL lesion. Methods. We utilized specific criteria to define contrast extravasation. We evaluated these criteria in 12 patients with arthroscopically proven HAGL/pHAGL lesion. We also evaluated these criteria in a control group. Results. Contrast extravasation occurred in over 83% of arthroscopically positive cases. Contrast extravasation as a diagnostic criterion in the evaluation of HAGL/pHAGL lesions demonstrated a high interobserver degree of agreement. Conclusions. In conclusion, extra-articular contrast extravasation may serve as a valid and reliable sign of HAGL and pHAGL lesions, provided stringent criteria are maintained to assure that the contrast lies in an extra-articular location. In cases where extravasation is not present, the “J” sign, though nonspecific, may be the only evidence of subtle HAGL and pHAGL lesions. Level of Evidence. Level IV, Retrospective Case-Control series. PMID:25530880
NASA reliability preferred practices for design and test

NASA Technical Reports Server (NTRS)

1991-01-01

Given here is a manual that was produced to communicate within the aerospace community design practices that have contributed to NASA mission success. The information represents the best technical advice that NASA has to offer on reliability design and test practices. Topics covered include reliability practices, including design criteria, test procedures, and analytical techniques that have been applied to previous space flight programs; and reliability guidelines, including techniques currently applied to space flight projects, where sufficient information exists to certify that the technique will contribute to mission success.
Reliability of cognitive tests of ELSA-Brasil, the Brazilian longitudinal study of adult health

PubMed Central

Batista, Juliana Alves; Giatti, Luana; Barreto, Sandhi Maria; Galery, Ana Roscoe Papini; Passos, Valéria Maria de Azeredo

2013-01-01

Cognitive function evaluation entails the use of neuropsychological tests, applied exclusively or in sequence. The results of these tests may be influenced by factors related to the environment, the interviewer or the interviewee. OBJECTIVES We examined the test-retest reliability of some tests of the Brazilian version from the Consortium to Establish a Registry for Alzheimer's disease. METHODS The ELSA-Brasil is a multicentre study of civil servants (35-74 years of age) from public institutions across six Brazilian States. The same tests were applied, in different order of appearance, by the same trained and certified interviewer, with an approximate 20-day interval, to 160 adults (51% men, mean age 52 years). The Intraclass Correlation Coefficient (ICC) was used to assess the reliability of the measures; and a dispersion graph was used to examine the patterns of agreement between them. RESULTS We observed higher retest scores in all tests as well as a shorter test completion time for the Trail Making Test B. ICC values for each test were as follows: Word List Learning Test (0.56), Word Recall (0.50), Word Recognition (0.35), Phonemic Verbal Fluency Test (VFT, 0.61), Semantic VFT (0.53) and Trail B (0.91). The Bland-Altman plot showed better correlation of executive function (VFT and Trail B) than of memory tests. CONCLUSIONS Better performance in retest may reflect a learning effect, and suggests that retest should be repeated using alternate forms or after longer periods. In this sample of adults with high schooling level, reliability was only moderate for memory tests whereas the measurement of executive function proved more reliable. PMID:29213860
A strategy for evaluating pathway analysis methods.

PubMed

Yu, Chenggang; Woo, Hyung Jun; Yu, Xueping; Oyama, Tatsuya; Wallqvist, Anders; Reifman, Jaques

2017-10-13

Researchers have previously developed a multitude of methods designed to identify biological pathways associated with specific clinical or experimental conditions of interest, with the aim of facilitating biological interpretation of high-throughput data. Before practically applying such pathway analysis (PA) methods, we must first evaluate their performance and reliability, using datasets where the pathways perturbed by the conditions of interest have been well characterized in advance. However, such 'ground truths' (or gold standards) are often unavailable. Furthermore, previous evaluation strategies that have focused on defining 'true answers' are unable to systematically and objectively assess PA methods under a wide range of conditions. In this work, we propose a novel strategy for evaluating PA methods independently of any gold standard, either established or assumed. The strategy involves the use of two mutually complementary metrics, recall and discrimination. Recall measures the consistency of the perturbed pathways identified by applying a particular analysis method to an original large dataset and those identified by applying the same method to a sub-dataset of the original dataset. In contrast, discrimination measures specificity: the degree to which the perturbed pathways identified by applying a particular method to a dataset from one experiment differ from those identified by applying the same method to a dataset from a different experiment. We used these metrics and 24 datasets to evaluate six widely used PA methods. The results highlighted the common challenge in reliably identifying significant pathways from small datasets. Importantly, we confirmed the effectiveness of our proposed dual-metric strategy by showing that previous comparative studies corroborate the performance evaluations of the six methods obtained by our strategy.
Unlike any previously proposed strategy for evaluating the performance of PA methods, our dual-metric strategy does not rely on any ground truth, either established or assumed, of the pathways perturbed by a specific clinical or experimental condition. As such, our strategy allows researchers to systematically and objectively evaluate pathway analysis methods by employing any number of datasets for a variety of conditions.
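The abstract describes recall and discrimination only in words, so the exact formulas are not available here. Purely as an illustrative sketch, and assuming set-overlap definitions that may differ from the authors' own, the two ideas could be expressed as follows. Both function names, the overlap formulas, and the pathway identifiers in the usage note are hypothetical.

```python
def recall(full_hits, sub_hits):
    """Fraction of pathways found on the full dataset that are also found
    on a sub-dataset (assumed definition, not necessarily the authors')."""
    full_hits, sub_hits = set(full_hits), set(sub_hits)
    if not full_hits:
        return 0.0
    return len(full_hits & sub_hits) / len(full_hits)


def discrimination(hits_a, hits_b):
    """One minus the Jaccard overlap between pathway sets from two different
    experiments (assumed definition, not necessarily the authors')."""
    a, b = set(hits_a), set(hits_b)
    union = a | b
    if not union:
        return 0.0
    return 1.0 - len(a & b) / len(union)
```

For example, `recall({"p1", "p2", "p3", "p4"}, {"p1", "p2"})` gives 0.5 under this definition. Neither function needs a gold-standard pathway list, which is the property the abstract emphasizes.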
Difficulties in applying numerical simulations to an evaluation of occupational hazards caused by electromagnetic fields

PubMed Central

Zradziński, Patryk

2015-01-01

Due to the various physical mechanisms of interaction between a worker's body and the electromagnetic field at various frequencies, the principles of numerical simulations have been discussed for three areas of worker exposure: to low frequency magnetic field, to low and intermediate frequency electric field and to radiofrequency electromagnetic field. This paper presents the identified difficulties in applying numerical simulations to evaluate physical estimators of direct and indirect effects of exposure to electromagnetic fields at various frequencies. Exposure of workers operating a plastic sealer has been taken as an example scenario of electromagnetic field exposure at the workplace for discussion of those difficulties in applying numerical simulations. The following difficulties in reliable numerical simulations of workers’ exposure to the electromagnetic field have been considered: workers’ body models (posture, dimensions, shape and grounding conditions), working environment models (objects most influencing electromagnetic field distribution) and an analysis of parameters for which exposure limitations are specified in international guidelines and standards. PMID:26323781
Reliability theory for repair service organization simulation and increase of innovative attraction of industrial enterprises

NASA Astrophysics Data System (ADS)

Dolzhenkova, E. V.; Iurieva, L. V.

2018-05-01

The study presents the authors' algorithm for simulating the organization of an industrial enterprise's repair service on the basis of reliability theory, together with the results of its application. It is proposed to monitor the repair service organization using the enterprise's state indexes for its main resources (equipment, labour, finances, repair areas). This allows the reliability level to be evaluated quantitatively as a summary rating of these parameters and helps ensure an appropriate level of operational reliability for the serviced technical objects. Under tough competition, the following principle applies: the higher the efficiency of production and of the repair service itself, the higher the innovative attractiveness of an industrial enterprise. The calculations show that applying reliability theory is advisable in order to prevent inefficient production losses and to reduce repair costs. The overall reliability rating calculated with the authors' algorithm has low values. Processing the statistical data yields reliability characteristics for the different workshops and services of an industrial enterprise, which makes it possible to determine the failure rates of the various units of equipment and to establish the reliability indexes needed for subsequent mathematical simulation. The proposed simulation algorithm contributes to increasing the efficiency of the repair service organization and to improving the innovative attractiveness of an industrial enterprise.
[Teacher's performance assessment in Family Medicine specialization].

PubMed

Martínez-González, Adrián; Gómez-Clavelina, Francisco J; Hernández-Torres, Isaías; Flores-Hernández, Fernando; Sánchez-Mendiola, Melchor

2016-01-01

In Mexico there is no systematic evaluation of teachers in medical specialties. It is difficult to identify appropriate teaching practices. The lack of evaluation has limited the recognition and improvement of teaching. The objective of this study was to analyze feedback from students about the teaching activities of the teacher-tutors responsible for the specialization course in family medicine, and to evaluate the evidence of reliability and validity of the instrument applied online. It was an observational, cross-sectional study. Seventy-eight teachers in the family medicine residency were evaluated on the basis of the opinions of 734 residents. The anonymous questionnaire assesses teaching performance from the residents' point of view and is composed of 5 dimensions rated on a Likert scale. Descriptive and inferential statistics (t test, one-way ANOVA and factor analysis) were used. Residents stated that teaching performance is acceptable, with an average of 4.25 ± 0.93. The best valued dimension was "Methodology", with an average of 4.34 ± 0.92, in contrast to the "Assessment" dimension, with 4.16 ± 1.04. According to the residents, teachers of the family medicine specialization have acceptable performance. The online assessment tool meets the criteria of validity and reliability.
Development and Evaluation of the Telephone Crisis Support Skills Scale.

PubMed

Kitchingman, Taneile A; Wilson, Coralie J; Caputi, Peter; Woodward, Alan; Hunt, Tara

2015-01-01

Although telephone services continue to play an important role in the delivery of front-line crisis support, published evidence of the standardized assessment of such services does not exist to date. To describe the development of the Telephone Crisis Support Skills Scale (TCSSS), an instrument to assess workers' intentions to use recommended skills with callers, and to evaluate its factor structure and reliability. TCSSS items were mapped to a national telephone crisis support practice model. A national sample of workers (n = 210) completed the TCSSS as part of a larger online survey. Principal axis factoring was used to evaluate the structure of the instrument. Internal consistency was assessed by Cronbach's α values. A single factor accounted for more than 40% of the variance within TCSSS ratings, indicating unidimensional structure. Cronbach's α coefficients suggested adequate internal consistency. Results indicate that the TCSSS is an internally consistent, unidimensional scale, sufficiently sensitive to detect workers' skill priorities for different caller problem types. Further study is required to confirm the factor structure and reliability of the TCSSS using workers from different organizations. Following further evaluation, the TCSSS may be applied to assessing readiness for and quality of service delivery.
Reliability of reported breastfeeding duration among reproductive-aged women from Mexico

PubMed Central

Cupul-Uicab, Lea A.; Gladen, Beth C.; Hernández-Ávila, Mauricio; Longnecker, Matthew P.

2010-01-01

Breastfed children have lower risk of infectious diseases, post-neonatal mortality and chronic diseases later in life. Because epidemiologic studies usually rely on reported history of previous breastfeeding, data on the accuracy and precision of recalled histories allow improved interpretation of the epidemiologic findings. We evaluated the reliability of two reported breastfeeding durations in 567 reproductive-aged women from Mexico using information obtained from nearly identical sets of questions applied at different times after weaning. We compared differences between reports, and examined the intra-class correlation coefficient (ICC) for any and for exclusive breastfeeding (EBF). Logistic regression was used to evaluate the determinants of poor recall (difference between reports of >20%). The reliability of duration of any breastfeeding was high (ICC 0.94). Overall, differences between reports of duration were usually <1 month, and for 385/567, the difference was ≤0.5 months. Predictors of poorer recall were having ≥4 children, and time between reports of >2 months. The only predictor of better recall was greater age of the baby at weaning. The reliability of EBF duration was lower (ICC 0.49). In this population with a relatively long duration of breastfeeding, reliability of any breast-feeding duration was high. Age, education and previous breastfeeding were not important predictors of recall, in contrast to findings in earlier studies. Consistent with previous reports, however, parity and length of recall were associated with poorer recall of duration of any breastfeeding. Future studies that use reported breastfeeding duration may want to consider the effect of these variables on recall. PMID:19292747
Reliability of stellar inclination estimated from asteroseismology: analytical criteria, mock simulations and Kepler data analysis

NASA Astrophysics Data System (ADS)

Kamiaka, Shoya; Benomar, Othman; Suto, Yasushi

2018-05-01

Advances in asteroseismology of solar-like stars now provide a unique method to estimate the stellar inclination i⋆. This enables the evaluation of the spin-orbit angle of transiting planetary systems, in a complementary fashion to the Rossiter-McLaughlin effect, a well-established method to estimate the projected spin-orbit angle λ. Although the asteroseismic method has been broadly applied to the Kepler data, its reliability has yet to be assessed intensively. In this work, we evaluate the accuracy of i⋆ from asteroseismology of solar-like stars using 3000 simulated power spectra. We find that the low signal-to-noise ratio of the power spectra induces a systematic under-estimate (over-estimate) bias for stars with high (low) inclinations. We derive analytical criteria for the reliable asteroseismic estimate, which indicate that reliable measurements are possible in the range of 20° ≲ i⋆ ≲ 80° only for stars with high signal-to-noise ratio. We also analyse and measure the stellar inclination of 94 Kepler main-sequence solar-like stars, among which 33 are planetary hosts. According to our reliability criteria, a third of them (9 with planets, 22 without) have accurate stellar inclination. Comparison of our asteroseismic estimate of vsin i⋆ against spectroscopic measurements indicates that the latter suffers from a large uncertainty possibly due to the modeling of macro-turbulence, especially for stars with projected rotation speed vsin i⋆ ≲ 5 km/s. This reinforces earlier claims, and the stellar inclination estimated from the combination of measurements from spectroscopy and photometric variation for slowly rotating stars needs to be interpreted with caution.
Text mining by Tsallis entropy

NASA Astrophysics Data System (ADS)

Jamaati, Maryam; Mehri, Ali

2018-01-01

Long-range correlations between the elements of natural languages enable them to convey very complex information. The complex structure of human language, as a manifestation of natural languages, motivates us to apply nonextensive statistical mechanics in text mining. Tsallis entropy appropriately ranks the terms' relevance to the document subject, taking advantage of their spatial correlation length. We apply this statistical concept as a new powerful word ranking metric in order to extract keywords of a single document. We carry out an experimental evaluation, which shows the capability of the presented method in keyword extraction. We find that Tsallis entropy has reliable word ranking performance, at the same level as the best previous ranking methods.
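As background for the metric named in this abstract, the Tsallis entropy of a discrete distribution p with entropic index q is S_q = (1 − Σ p_i^q) / (q − 1), which reduces to the Shannon entropy as q approaches 1. The sketch below computes only this quantity. How the authors build a distribution for each word, and which value of q they use, is not stated in the abstract, so the function name and inputs are hypothetical.

```python
import math


def tsallis_entropy(probs, q):
    """Tsallis entropy S_q of a discrete probability distribution.

    Zero-probability entries are skipped; q == 1 falls back to the
    Shannon entropy (natural logarithm), the limiting case.
    """
    positive = [p for p in probs if p > 0]
    if abs(q - 1.0) < 1e-12:
        return -sum(p * math.log(p) for p in positive)
    return (1.0 - sum(p ** q for p in positive)) / (q - 1.0)
```

A fully concentrated distribution such as `[1.0]` gives an entropy of zero for any q, while flatter distributions give larger values.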
A Systems Approach to Nitrogen Delivery

DOE Office of Scientific and Technical Information (OSTI.GOV)

Goins, Bobby

A systems based approach will be used to evaluate the nitrogen delivery process. This approach involves principles found in Lean, Reliability, Systems Thinking, and Requirements. This unique combination of principles and thought process yields a very in depth look into the system to which it is applied. By applying a systems based approach to the nitrogen delivery process there should be improvements in cycle time, efficiency, and a reduction in the required number of personnel needed to sustain the delivery process. This will in turn reduce the amount of demurrage charges that the site incurs. In addition there should be less frustration associated with the delivery process.
System Design under Uncertainty: Evolutionary Optimization of the Gravity Probe-B Spacecraft

NASA Technical Reports Server (NTRS)

Pullen, Samuel P.; Parkinson, Bradford W.

1994-01-01

This paper discusses the application of evolutionary random-search algorithms (Simulated Annealing and Genetic Algorithms) to the problem of spacecraft design under performance uncertainty. Traditionally, spacecraft performance uncertainty has been measured by reliability. Published algorithms for reliability optimization are seldom used in practice because they oversimplify reality. The algorithm developed here uses random-search optimization to allow us to model the problem more realistically. Monte Carlo simulations are used to evaluate the objective function for each trial design solution. These methods have been applied to the Gravity Probe-B (GP-B) spacecraft being developed at Stanford University for launch in 1999. Results of the algorithm developed here for GP-B are shown, and their implications for design optimization by evolutionary algorithms are discussed.

Applying signal-detection theory to the study of observer accuracy and bias in behavioral assessment.

PubMed

Lerman, Dorothea C; Tetreault, Allison; Hovanetz, Alyson; Bellaci, Emily; Miller, Jonathan; Karp, Hilary; Mahmood, Angela; Strobel, Maggie; Mullen, Shelley; Keyl, Alice; Toupard, Alexis

2010-01-01

We evaluated the feasibility and utility of a laboratory model for examining observer accuracy within the framework of signal-detection theory (SDT). Sixty-one individuals collected data on aggression while viewing videotaped segments of simulated teacher-child interactions. The purpose of Experiment 1 was to determine if brief feedback and contingencies for scoring accurately would bias responding reliably. Experiment 2 focused on one variable (specificity of the operational definition) that we hypothesized might decrease the likelihood of bias. The effects of social consequences and information about expected behavior change were examined in Experiment 3. Results indicated that feedback and contingencies reliably biased responding and that the clarity of the definition only moderately affected this outcome.
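The abstract does not state which signal-detection indices were computed. As a minimal sketch, assuming the common equal-variance Gaussian model, sensitivity (d′) and response bias (criterion c) can be derived from an observer's hit rate and false-alarm rate. The function name and arguments are hypothetical.

```python
from statistics import NormalDist


def sdt_indices(hit_rate, false_alarm_rate):
    """Return (d_prime, criterion) under the equal-variance Gaussian model.

    Both rates must lie strictly between 0 and 1, because the inverse
    normal CDF is undefined at the endpoints.
    """
    z = NormalDist().inv_cdf   # inverse of the standard normal CDF
    z_hit, z_fa = z(hit_rate), z(false_alarm_rate)
    d_prime = z_hit - z_fa               # separation of signal and noise
    criterion = -0.5 * (z_hit + z_fa)    # negative = liberal, positive = conservative
    return d_prime, criterion
```

In this framing, an observer who scores aggression more readily after feedback would show a shift in the criterion without necessarily any change in d′, which is the kind of bias the experiments were designed to detect.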
Automatic detection of sleep macrostructure based on a sensorized T-shirt.

PubMed

Bianchi, Anna M; Mendez, Martin O

2010-01-01

In the present work we apply a fully automatic procedure to the analysis of signals coming from a sensorized T-shirt, worn during the night, for sleep evaluation. The goodness and reliability of the signals recorded through the T-shirt were previously tested, while the employed algorithms for feature extraction and sleep classification were previously developed on standard ECG recordings and the obtained classification was compared to the standard clinical practice based on polysomnography (PSG). In the present work we combined T-shirt recordings and automatic classification and could obtain reliable sleep profiles, i.e. the sleep classification in WAKE, REM (rapid eye movement) and NREM stages, based on heart rate variability (HRV), respiration and movement signals.
Multi-objective optimization of GENIE Earth system models.

PubMed

Price, Andrew R; Myerscough, Richard J; Voutchkov, Ivan I; Marsh, Robert; Cox, Simon J

2009-07-13

The tuning of parameters in climate models is essential to provide reliable long-term forecasts of Earth system behaviour. We apply a multi-objective optimization algorithm to the problem of parameter estimation in climate models. This optimization process involves the iterative evaluation of response surface models (RSMs), followed by the execution of multiple Earth system simulations. These computations require an infrastructure that provides high-performance computing for building and searching the RSMs and high-throughput computing for the concurrent evaluation of a large number of models. Grid computing technology is therefore essential to make this algorithm practical for members of the GENIE project.
GAS DISCHARGE SWITCH EVALUATION FOR RHIC BEAM ABORT KICKER APPLICATION.

DOE Office of Scientific and Technical Information (OSTI.GOV)

ZHANG,W.; SANDBERG,J.; SHELDRAKE,R.

2002-06-30

A gas discharge switch, the EEV HX3002, is being evaluated at Brookhaven National Laboratory as a possible candidate for the main switch of the RHIC Beam Abort Kicker modulator. At higher beam energy and higher beam intensity, switch stability becomes crucial. The hollow anode thyratron used in the existing system is not rated for long reverse current conduction. De-rating of the thyratron hold-off voltage caused by reverse-voltage arcing has been the main limitation on system operation. To improve system reliability, a new type of gas discharge switch has been suggested by Marconi Applied Technology for its reverse conducting capability.
Accuracy, reliability, and timing of visual evaluations of decay in fresh-cut lettuce

PubMed Central

Hayes, Ryan J.

2018-01-01

Visual assessments are used for evaluating the quality of food products, such as fresh-cut lettuce packaged in bags with modified atmosphere. We have compared the accuracy and the reliability of visual evaluations of decay on fresh-cut lettuce performed with experienced and inexperienced raters. In addition, we have analyzed decay data from over 4.5 thousand bags to determine the optimum timing for evaluations to detect differences among accessions. Lin’s concordance coefficient (ρc) that takes into consideration both the closeness of the data and the conformance to the identity line showed high repeatability (intra-rater reliability, ρc = 0.97), reproducibility (inter-rater reliability, ρc = 0.92), and accuracy (ρc = 0.96) for experienced raters. Inexperienced raters did not perform as well and their ratings showed decreased repeatability (ρc = 0.93), and an even larger reduction in reproducibility (ρc = 0.80) and accuracy (ρc = 0.90). We have detected that 5.3% of ratings were outside of the 95% limits of agreement. These under- or overestimates were predominantly found for bags with intermediate levels of decay, which corresponds to the middle of the rating scale. This occurs because intermediate amounts of decay are more difficult to discriminate than extremes. The frequencies of aberrant ratings for experienced raters ranged from 0.6% to 4.4% (mean = 2.1%); for inexperienced raters the frequencies were substantially higher, ranging from 6.1% to 15.6% (mean = 9.4%). Therefore, we recommend that new raters receive training that includes practical examples in this range of decay, use of standard area diagrams, and continuing interaction with experienced raters (consultation during actual rating). Very high agreement among experienced raters indicates that visual ratings can be successfully used for evaluations of decay, until a more objective, rapid, and affordable method is developed.
We recommend evaluating samples at multiple time points until 42 days after processing (about 80% decay on average) and then combining these individual ratings into the area under the decay progress stairs (AUDePS) score. Applying this approach, experienced evaluators can accurately detect difference among lettuce accessions and identify lettuce cultivars with reduced decay. PMID:29664945
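As an illustration of the statistic named in this abstract, Lin's concordance coefficient can be sketched in a few lines of Python. This is a minimal sketch, not the authors' analysis: the function name `lins_ccc` and the rating values are hypothetical, and population (1/n) moments are assumed.

```python
from statistics import fmean


def lins_ccc(x, y):
    """Lin's concordance correlation coefficient for paired ratings.

    Penalizes both scatter around the best-fit line and any shift
    away from the identity line y = x.
    """
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two equal-length sequences with at least 2 pairs")
    n = len(x)
    mx, my = fmean(x), fmean(y)
    # Population (1/n) variances and covariance (an assumption of this sketch).
    vx = sum((a - mx) ** 2 for a in x) / n
    vy = sum((b - my) ** 2 for b in y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y)) / n
    return 2 * cov / (vx + vy + (mx - my) ** 2)


# Hypothetical decay ratings: rater B scores every bag one point higher than rater A.
rater_a = [1, 3, 5, 7, 9]
rater_b = [2, 4, 6, 8, 10]

ccc_shifted = lins_ccc(rater_a, rater_b)    # below 1 because of the constant offset
ccc_identical = lins_ccc(rater_a, rater_a)  # perfect concordance
print(ccc_shifted, ccc_identical)
```

The two hypothetical raters are perfectly correlated, yet the coefficient falls below 1 because of the constant one-point offset. That is the difference between agreement and mere association.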
Availability-Based Importance Framework for Supplier Selection

DTIC Science & Technology

2015-04-30

IMA Journal of Management Math, 15(2), 161–174. Chen, C.-T., Lin, C.-T., & Huang, S.-F. (2006). A fuzzy approach for supplier evaluation and...reliability modeling: Principles and applications. Hoboken, NJ: Wiley. Liao, C.-N., & Kao, H.-P. (2011). An integrated fuzzy TOPSIS and MCGP approach to...5307–5326. Wang, J.-W., Cheng, C.-H., & Huang, K.-C. (2009). Fuzzy hierarchical TOPSIS for supplier selection. Applied Soft Computing, 9(1), 377
Self-Evaluation of PANDA-FBG Based Sensing System for Dynamic Distributed Strain and Temperature Measurement.

PubMed

Zhu, Mengshi; Murayama, Hideaki; Wada, Daichi

2017-10-12

A novel method is introduced in this work for effectively evaluating the performance of the PANDA type polarization-maintaining fiber Bragg grating (PANDA-FBG) distributed dynamic strain and temperature sensing system. Conventionally, the errors during the measurement are unknown or evaluated by using other sensors such as strain gauge and thermocouples. This will make the sensing system complicated and decrease the efficiency since more than one kind of sensor is applied for the same measurand. In this study, we used the approximately constant ratio of primary errors in strain and temperature measurement and realized the self-evaluation of the sensing system, which can significantly enhance the applicability, as well as the reliability in strategy making.
The EORTC information questionnaire, EORTC QLQ-INFO25. Validation study for Spanish patients.

PubMed

Arraras, Juan Ignacio; Manterola, Ana; Hernández, Berta; Arias de la Vega, Fernando; Martínez, Maite; Vila, Meritxell; Eito, Clara; Vera, Ruth; Domínguez, Miguel Ángel

2011-06-01

The EORTC QLQ-INFO25 evaluates the information received by cancer patients. This study assesses the psychometric properties of the QLQ-INFO25 when applied to a sample of Spanish patients. A total of 169 patients with different cancers and stages of disease completed the EORTC QLQ-INFO25, the EORTC QLQ-C30 and the information scales of the inpatient satisfaction module EORTC IN-PATSAT32 on two occasions during the patients' treatment and follow-up period. Psychometric evaluation of the structure, reliability, validity and responsiveness to changes was conducted. Patient acceptability was assessed with a debriefing questionnaire. Multi-trait scaling confirmed the 4 multi-item scales (information about disease, medical tests, treatment and other services) and eight single items. All items met the standards for convergent validity and all except one met the standards of item discriminant validity. Internal consistency for all scales (α>0.70) and the whole questionnaire (α>0.90) was adequate in the three measurements, except information about the disease (0.67) and other services (0.68) in the first measurement, as was test-retest reliability (intraclass correlations >0.70). Correlations with related areas of IN-PATSAT32 (r>0.40) supported convergent validity. Divergent validity was confirmed through low correlations with EORTC QLQ-C30 scales (r<0.30). The EORTC QLQ-INFO25 discriminated among groups based on gender, age, education, levels of anxiety and depression, treatment line, wish for information and satisfaction. One scale and an item showed changes over time. The EORTC QLQ-INFO25 is a reliable and valid instrument when applied to a sample of Spanish cancer patients. These results are in line with those of the EORTC validation study.
Psychometric properties of the Persian version of the Time to Relapse Questionnaire (TRQ) in substance use disorder.

PubMed

Khazaee-Pool, Maryam; Moridi, Minoo; Ponnet, Koen; Turner, Nigel; Pashaei, Tahereh

2016-11-01

Predicting time to relapse provides an opportunity for the development of relapse prevention interventions in drug users. The aim of the present study was to describe the development of the Persian version of the 9-item Time to Relapse Questionnaire (TRQ) and to evaluate its psychometric properties in an Iranian sample of treatment-seeking individuals with substance dependence (n = 150). The forward-backward method was used to translate the TRQ scale from English into Persian. After linguistic validation and a pilot check, a cross-sectional study was performed, and psychometric properties of the Iranian version of the questionnaire were assessed. The reliability was evaluated by Cronbach's alpha and test-retest analyses. In addition, the factor structure of the scale was extracted by applying confirmatory factor analysis. The mean age of participants was 40.52 (SD = 11.30) years. The mean scores for the content validity index (CVI) and the content validity ratio (CVR) were 0.93 and 0.81, respectively. A confirmatory factor analysis (CFA) demonstrated that the three-factor model of the TRQ was a good fit for the data and thus replicated the factor structure of the original English language TRQ. Cronbach's alpha presented good internal consistency (alpha = 0.76), and test-retest reliability of the TRQ instrument with 2-week intervals was appropriate (ICC = 0.84). The findings demonstrate that the Persian version of the TRQ is a reliable and valid scale for measuring time to relapse in Iranian drug users. The TRQ can be applied at the start of treatment so that clinical interventions can be targeted toward the different relapse styles.
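The internal-consistency statistic reported in this abstract, Cronbach's alpha, can be sketched as follows. This is only an illustration under stated assumptions: the function name `cronbach_alpha` and the response matrix are hypothetical (not TRQ data), and sample (n − 1) variances are used.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a respondents-by-items score matrix.

    scores: list of rows, one row per respondent, each row holding
    that respondent's score on every item of the scale.
    """
    n_items = len(scores[0])
    if n_items < 2 or len(scores) < 2:
        raise ValueError("need at least 2 items and 2 respondents")
    if any(len(row) != n_items for row in scores):
        raise ValueError("every respondent must answer every item")
    # Sum of the per-item variances (columns of the matrix).
    item_variance_sum = sum(variance(col) for col in zip(*scores))
    # Variance of each respondent's total score.
    total_variance = variance([sum(row) for row in scores])
    return n_items / (n_items - 1) * (1 - item_variance_sum / total_variance)


# Hypothetical responses: 4 respondents, 3 items.
responses = [
    [2, 3, 3],
    [4, 4, 5],
    [1, 2, 2],
    [5, 5, 4],
]
alpha = cronbach_alpha(responses)
print(round(alpha, 3))
```

Alpha approaches 1 when the items rise and fall together across respondents, which is why it is read as a measure of internal consistency and not of test-retest stability.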
Reliability of Soft Tissue Model Based Implant Surgical Guides; A Methodological Mistake.

PubMed

Sabour, Siamak; Dastjerdi, Elahe Vahid

2012-08-20

We were interested to read the paper by Maney P and colleagues published in the July 2012 issue of J Oral Implantol. The authors, who aimed to assess the reliability of soft tissue model based implant surgical guides, reported that the accuracy was evaluated using software.1 I found the manuscript title of Maney P, et al. incorrect and misleading. Moreover, they reported that twenty-two sites (46.81%) were considered accurate (13 of 24 maxillary and 9 of 23 mandibular sites). As the authors point out in their conclusion, soft tissue models do not always provide sufficient accuracy for implant surgical guide fabrication. Reliability (precision) and validity (accuracy) are two different methodological issues in research. Sensitivity, specificity, PPV, NPV, likelihood ratio positive (true positive/false negative) and likelihood ratio negative (false positive/true negative) as well as odds ratio (true results/false results - preferably more than 50) are among the tests to evaluate the validity (accuracy) of a single test compared to a gold standard.2-4 It is not clear to which of the above-mentioned estimates for validity analysis the reported twenty-two sites (46.81%) considered accurate relate. Reliability (repeatability or reproducibility) is often assessed by statistical tests such as Pearson r, least squares and the paired t-test, all of which are among the common mistakes in reliability analysis.5 Briefly, the intraclass correlation coefficient (ICC) should be used for quantitative variables and weighted kappa for qualitative variables, with caution, because kappa has its own limitations too.
Regarding reliability or agreement, it is good to know that when computing the kappa value, only concordant cells are considered, whereas discordant cells should also be taken into account in order to reach a correct estimation of agreement (weighted kappa).2-4 As a take-home message, appropriate tests should be applied for reliability and validity analysis.
Validity and Interrater Reliability of the Visual Quarter-Waste Method for Assessing Food Waste in Middle School and High School Cafeteria Settings.

PubMed

Getts, Katherine M; Quinn, Emilee L; Johnson, Donna B; Otten, Jennifer J

2017-11-01

Measuring food waste (ie, plate waste) in school cafeterias is an important tool to evaluate the effectiveness of school nutrition policies and interventions aimed at increasing consumption of healthier meals. Visual assessment methods are frequently applied in plate waste studies because they are more convenient than weighing. The visual quarter-waste method has become a common tool in studies of school meal waste and consumption, but previous studies of its validity and reliability have used correlation coefficients, which measure association but not necessarily agreement. The aims of this study were to determine, using a statistic measuring interrater agreement, whether the visual quarter-waste method is valid and reliable for assessing food waste in a school cafeteria setting when compared with the gold standard of weighed plate waste. To evaluate validity, researchers used the visual quarter-waste method and weighed food waste from 748 trays at four middle schools and five high schools in one school district in Washington State during May 2014. To assess interrater reliability, researcher pairs independently assessed 59 of the same trays using the visual quarter-waste method. Both validity and reliability were assessed using a weighted κ coefficient. For validity, as compared with the measured weight, 45% of foods assessed using the visual quarter-waste method were in almost perfect agreement, 42% of foods were in substantial agreement, 10% were in moderate agreement, and 3% were in slight agreement. For interrater reliability between pairs of visual assessors, 46% of foods were in perfect agreement, 31% were in almost perfect agreement, 15% were in substantial agreement, and 8% were in moderate agreement. These results suggest that the visual quarter-waste method is a valid and reliable tool for measuring plate waste in school cafeteria settings. Copyright © 2017 Academy of Nutrition and Dietetics. Published by Elsevier Inc. All rights reserved.
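The weighted κ coefficient used in this study can be illustrated with a short sketch. The abstract does not state which weighting scheme was applied, so linear disagreement weights are assumed here (with a `power` argument for quadratic weights); the function name `weighted_kappa` and the tray ratings are hypothetical.

```python
def weighted_kappa(ratings_1, ratings_2, categories, power=1):
    """Weighted kappa for two raters scoring the same items on an ordinal scale.

    categories: the ordered list of possible ratings.
    power=1 gives linear disagreement weights, power=2 quadratic ones.
    """
    if len(ratings_1) != len(ratings_2) or not ratings_1:
        raise ValueError("need two equal-length, non-empty rating lists")
    k = len(categories)
    index = {c: i for i, c in enumerate(categories)}
    n = len(ratings_1)

    # Observed joint proportions of (rater 1 category, rater 2 category).
    observed = [[0.0] * k for _ in range(k)]
    for a, b in zip(ratings_1, ratings_2):
        observed[index[a]][index[b]] += 1 / n

    row = [sum(observed[i]) for i in range(k)]
    col = [sum(observed[i][j] for i in range(k)) for j in range(k)]

    weighted_observed = weighted_expected = 0.0
    for i in range(k):
        for j in range(k):
            weight = (abs(i - j) / (k - 1)) ** power  # 0 on the diagonal
            weighted_observed += weight * observed[i][j]
            weighted_expected += weight * row[i] * col[j]
    if weighted_expected == 0:
        raise ValueError("kappa is undefined when both raters use a single category")
    return 1 - weighted_observed / weighted_expected


# Hypothetical quarter-waste ratings (fraction of the item left on the tray).
scale = [0, 0.25, 0.5, 0.75, 1]
assessor_a = [0, 0.25, 0.5, 0.5, 1, 0.75, 0.25, 0]
assessor_b = [0, 0.25, 0.5, 0.75, 1, 0.75, 0.5, 0]
kappa_w = weighted_kappa(assessor_a, assessor_b, scale)
print(round(kappa_w, 3))
```

Because near-misses (one category apart) are penalized less than distant disagreements, the statistic is suited to ordinal scales such as the quarter-waste categories.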
A validation study of public health knowledge, skills, social responsibility and applied learning.

PubMed

Vackova, Dana; Chen, Coco K; Lui, Juliana N M; Johnston, Janice M

2018-06-22

To design and validate a questionnaire to measure medical students' Public Health (PH) knowledge, skills, social responsibility and applied learning as indicated in the four domains recommended by the Association of Schools & Programmes of Public Health (ASPPH). A cross-sectional study was conducted to develop an evaluation tool for PH undergraduate education through item generation, reduction, refinement and validation. The 74 preliminary items derived from the existing literature were reduced to 55 items based on expert panel review which included those with expertise in PH, psychometrics and medical education, as well as medical students. Psychometric properties of the preliminary questionnaire were assessed as follows: frequency of endorsement for item variance; principal component analysis (PCA) with varimax rotation for item reduction and factor estimation; Cronbach's Alpha, item-total correlation and test-retest validity for internal consistency and reliability. PCA yielded five factors: PH Learning Experience (6 items); PH Risk Assessment and Communication (5 items); Future Use of Evidence in Practice (6 items); Recognition of PH as a Scientific Discipline (4 items); and PH Skills Development (3 items), explaining 72.05% variance. Internal consistency and reliability tests were satisfactory (Cronbach's Alpha ranged from 0.87 to 0.90; item-total correlation > 0.59). Lower paired test-retest correlations reflected instability in a social science environment. An evaluation tool for community-centred PH education has been developed and validated. The tool measures PH knowledge, skills, social responsibilities and applied learning as recommended by the internationally recognised Association of Schools & Programmes of Public Health (ASPPH).
In vitro methods for evaluating skin hydration under diapers and incontinence products.

PubMed

Tate, M L; Wright, A S

2017-11-01

Excessive skin hydration from wearing wet undergarments, such as infant diapers and adult incontinence products, has been historically problematic. Skin damage occurs from wetness (urine) and limited product breathability. Evaporative water loss has been measured on adult arms (armband method) or infant torsos (on-baby method), after wearing a saline-insulted diaper product. The current study developed a reliable in vitro method of evaluating diaper and incontinence products for improvements in skin dryness. A simulated skin substrate was applied to a heated mechanical arm or baby torso. A disposable diaper or incontinence product was wrapped around the arm or baby torso, and loaded with saline. Hydration of the simulated skin was measured by evaporimetry and compared with clinical data from adult armband evaluations. The heated mechanical arm and baby torso accurately distinguished products for skin dryness. Eight diaper products were evaluated and compared to human test results. The torso in vitro and mechanical arm evaluations demonstrated strong correlations to human epidermal water loss evaluations, with repeatable results. Additionally, the bench test has been used for adult incontinence products, and it proved to differentiate those products as well as infant products. A rapid and reliable means of evaluation has been developed, and it is predictive of human subject testing. © 2017 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Development of an ESI-LC-MS-based assay for kinetic evaluation of Mycobacterium tuberculosis shikimate kinase activity and inhibition.

PubMed

Simithy, Johayra; Gill, Gobind; Wang, Yu; Goodwin, Douglas C; Calderón, Angela I

2015-02-17

A simple and reliable liquid chromatography-mass spectrometry (LC-MS) assay has been developed and validated for the kinetic characterization and evaluation of inhibitors of shikimate kinase from Mycobacterium tuberculosis (MtSK), a potential target for the development of novel antitubercular drugs. This assay is based on the direct determination of the reaction product shikimate-3-phosphate (S3P) using electrospray ionization (ESI) and a quadrupole time-of-flight (Q-TOF) detector. A comparative analysis of the kinetic parameters of MtSK obtained by the LC-MS assay with those obtained by a conventional UV-assay was performed. Kinetic parameters determined by LC-MS were in excellent agreement with those obtained from the UV assay, demonstrating the accuracy and reliability of this method. The validated assay was successfully applied to the kinetic characterization of a known inhibitor of shikimate kinase; inhibition constants and mode of inhibition were accurately delineated with LC-MS.
Reliability of Hull Girder Ultimate Strength of Steel Ships

NASA Astrophysics Data System (ADS)

Da-wei, Gao; Gui-jie, Shi

2018-03-01

Hull girder ultimate strength is an evaluation index reflecting the true safety margin or structural redundancy of container ships. In particular, since the hull girder fracture accident of the MOL COMFORT, an 8,000 TEU class large container ship, on June 17, 2013, the safety of larger container ships has received much more attention. In this paper, different methods of calculating hull girder ultimate strength are first discussed and compared. The bending ultimate strength can be analyzed by the nonlinear finite element method (NFEM) and the incremental-iterative method, and the shear ultimate strength can be analyzed by NFEM and simple equations. Then, the probability distributions of hull girder wave loads and still water loads of container ships are summarized. Finally, the reliability of hull girder ultimate strength under bending moment and shear forces for three container ships is analyzed using a first order method. The conclusions can be applied to give guidance for ship design and safety evaluation.
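To make the idea of a first order reliability calculation concrete, the simplest textbook case can be sketched: a linear limit state g = R − S with independent, normally distributed capacity R and load S. This is an assumption of the sketch, not the paper's model (which involves wave and still water load distributions); the function name and all numbers are hypothetical.

```python
import math
from statistics import NormalDist


def reliability_index(mean_capacity, sd_capacity, mean_load, sd_load):
    """Reliability index and failure probability for g = R - S.

    Assumes R (capacity) and S (load) are independent and normal, in
    which case the first-order result is exact.
    """
    beta = (mean_capacity - mean_load) / math.hypot(sd_capacity, sd_load)
    failure_probability = NormalDist().cdf(-beta)
    return beta, failure_probability


# Hypothetical ultimate bending capacity and total bending moment (same units).
beta, pf = reliability_index(mean_capacity=8.0, sd_capacity=0.6,
                             mean_load=5.0, sd_load=0.8)
print(f"beta = {beta:.2f}, Pf = {pf:.2e}")
```

For nonlinear limit states or non-normal variables, a first order method instead linearizes g at the most probable failure point, and the index becomes an approximation.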
Physics Metacognition Inventory Part II: Confirmatory factor analysis and Rasch analysis

NASA Astrophysics Data System (ADS)

Taasoobshirazi, Gita; Bailey, MarLynn; Farley, John

2015-11-01

The Physics Metacognition Inventory was developed to measure physics students' metacognition for problem solving. In one of our earlier studies, an exploratory factor analysis provided evidence of preliminary construct validity, revealing six components of students' metacognition when solving physics problems including knowledge of cognition, planning, monitoring, evaluation, debugging, and information management. The college students' scores on the inventory were found to be reliable and related to students' physics motivation and physics grade. However, the results of the exploratory factor analysis indicated that the questionnaire could be revised to improve its construct validity. The goal of this study was to revise the questionnaire and establish its construct validity through a confirmatory factor analysis. In addition, a Rasch analysis was applied to the data to better understand the psychometric properties of the inventory and to further evaluate the construct validity. Results indicated that the final, revised inventory is a valid, reliable, and efficient tool for assessing student metacognition for physics problem solving.
Life Cycle Assessment for desalination: a review on methodology feasibility and reliability.

PubMed

Zhou, Jin; Chang, Victor W-C; Fane, Anthony G

2014-09-15

As concerns of natural resource depletion and environmental degradation caused by desalination increase, research studies of the environmental sustainability of desalination are growing in importance. Life Cycle Assessment (LCA) is an ISO standardized method and is widely applied to evaluate the environmental performance of desalination. This study reviews more than 30 desalination LCA studies since the 2000s and identifies two major issues in need of improvement. The first is feasibility, covering three elements that support the implementation of the LCA to desalination, including accounting methods, supporting databases, and life cycle impact assessment approaches. The second is reliability, addressing three essential aspects that drive uncertainty in results, including the incompleteness of the system boundary, the unrepresentativeness of the database, and the omission of uncertainty analysis. This work can serve as a preliminary LCA reference for desalination specialists, but will also strengthen LCA as an effective method to evaluate the environmental footprint of desalination alternatives. Copyright © 2014 Elsevier Ltd. All rights reserved.
Neural networks and fault probability evaluation for diagnosis issues.

PubMed

Kourd, Yahia; Lefebvre, Dimitri; Guersi, Noureddine

2014-01-01

This paper presents a new FDI technique for fault detection and isolation in unknown nonlinear systems. The objective of the research is to construct and analyze residuals by means of artificial intelligence and probabilistic methods. Artificial neural networks are first used for modeling issues. Neural network models are designed for learning the fault-free and the faulty behaviors of the considered systems. Once the residuals are generated, an evaluation using probabilistic criteria is applied to them to determine the most likely fault among a set of candidate faults. The study also includes a comparison between the contributions of these tools and their limitations, particularly through the establishment of quantitative indicators to assess their performance. According to the computation of a confidence factor, the proposed method is suitable to evaluate the reliability of the FDI decision. The approach is applied to detect and isolate 19 fault candidates in the DAMADICS benchmark. The results obtained with the proposed scheme are compared with the results obtained according to a usual thresholding method.
Reliability Analysis and Reliability-Based Design Optimization of Circular Composite Cylinders Under Axial Compression

NASA Technical Reports Server (NTRS)

Rais-Rohani, Masoud

2001-01-01

This report describes the preliminary results of an investigation on component reliability analysis and reliability-based design optimization of thin-walled circular composite cylinders with average diameter and average length of 15 inches. Structural reliability is based on axial buckling strength of the cylinder. Both Monte Carlo simulation and First Order Reliability Method are considered for reliability analysis with the latter incorporated into the reliability-based structural optimization problem. To improve the efficiency of reliability sensitivity analysis and design optimization solution, the buckling strength of the cylinder is estimated using a second-order response surface model. The sensitivity of the reliability index with respect to the mean and standard deviation of each random variable is calculated and compared. The reliability index is found to be extremely sensitive to the applied load and elastic modulus of the material in the fiber direction. The cylinder diameter was found to have the third highest impact on the reliability index. Also the uncertainty in the applied load, captured by examining different values for its coefficient of variation, is found to have a large influence on cylinder reliability. The optimization problem for minimum weight is solved subject to a design constraint on element reliability index. The methodology, solution procedure and optimization results are included in this report.
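The influence of the load's coefficient of variation on reliability, noted in this abstract, can be illustrated with a small Monte Carlo sketch. It assumes independent normal variables for buckling strength and applied load, which is a simplification and not the report's response surface model; the function names and all numbers are hypothetical.

```python
import math
import random
from statistics import NormalDist


def exact_pf(mean_strength, sd_strength, mean_load, sd_load):
    """Closed-form failure probability for independent normal strength and load."""
    beta = (mean_strength - mean_load) / math.hypot(sd_strength, sd_load)
    return NormalDist().cdf(-beta)


def monte_carlo_pf(mean_strength, sd_strength, mean_load, sd_load,
                   n_samples=100_000, seed=0):
    """Estimate P(strength < load) by direct sampling with a fixed seed."""
    rng = random.Random(seed)
    failures = 0
    for _ in range(n_samples):
        strength = rng.gauss(mean_strength, sd_strength)
        load = rng.gauss(mean_load, sd_load)
        if strength < load:
            failures += 1
    return failures / n_samples


# Hypothetical buckling strength and mean applied load (same units).
MEAN_STRENGTH, SD_STRENGTH, MEAN_LOAD = 100.0, 8.0, 70.0

results = []
for load_cov in (0.05, 0.10, 0.20):
    sd_load = load_cov * MEAN_LOAD  # coefficient of variation = sd / mean
    pf_mc = monte_carlo_pf(MEAN_STRENGTH, SD_STRENGTH, MEAN_LOAD, sd_load)
    pf_exact = exact_pf(MEAN_STRENGTH, SD_STRENGTH, MEAN_LOAD, sd_load)
    results.append((load_cov, pf_mc, pf_exact))

for load_cov, pf_mc, pf_exact in results:
    print(f"CoV = {load_cov:.2f}  Monte Carlo Pf = {pf_mc:.5f}  exact Pf = {pf_exact:.5f}")
```

Under these assumptions the failure probability grows sharply as the load's coefficient of variation increases, and the sampled estimates can be checked against the closed form, which is the usual way to sanity-check a Monte Carlo setup before applying it to a model with no analytical solution.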
Reliability associated with the Roter Interaction Analysis System (RIAS) adapted for the telemedicine context.

PubMed

Nelson, Eve-Lynn; Miller, Edward Alan; Larson, Kiley A

2010-01-01

This study's purpose was to adapt the Roter Interaction Analysis System (RIAS) for telemedicine clinics and to investigate the adapted measure's reliability. The study also sought to better understand the volume of technology-related utterances in established telemedicine clinics and the feasibility of using the measure within the telemedicine setting. This initial evaluation is a first step before broadly using the adapted measure across technologies and raters. An expert panel adapted the RIAS for the telemedicine context. This involved accounting for all consultation participants (patient, provider, presenter, family) and adding technology-specific subcategories. Ten new and 36 follow-up telemedicine encounters were videotaped and double coded using the adapted RIAS. These consisted primarily of follow-up visits (78.0%) involving patients, providers, presenters, and other parties. Reliability was calculated for those categories with 15 or more utterances. Traditional RIAS categories related to socioemotional and task-focused clusters had fair to excellent levels of reliability in the telemedicine setting. Although there were too few utterances to calculate the reliability of the specific technology-related subcategories, the summary technology-related category proved reliable for patients, providers, and presenters. Overall patterns seen in traditional patient-provider interactions were observed, with the number of provider utterances far exceeding patient, presenter, and family utterances, and few technology-specific utterances. The traditional RIAS is reliable when applied across multiple participants in the telemedicine context. Reliability of technology-related subcategories could not be evaluated; however, the aggregate technology-related cluster was found to be reliable and may be especially relevant in understanding communication patterns with patients new to the telemedicine setting.
Use of the RIAS instrument is encouraged to facilitate comparison between traditional, face-to-face clinics and telemedicine; among diverse consultation mediums and technologies; and across different specialties. Future research is necessary to further investigate the reliability and validity of adding technology-related subcategories to the RIAS. The limited number of technology-related utterances, however, implies a certain degree of comfort with two-way interactive video consultation among study participants. Telemedicine continues to increase access to healthcare. The technology-related categories of the adapted RIAS were reliable when aggregated, thereby providing a tool to better understand how telemedicine affects provider-patient communication and outcomes.

The Role of Applied Epidemiology Methods in the Disaster Management Cycle

PubMed Central

Heumann, Michael; Perrotta, Dennis; Wolkin, Amy F.; Schnall, Amy H.; Podgornik, Michelle N.; Cruz, Miguel A.; Horney, Jennifer A.; Zane, David; Roisman, Rachel; Greenspan, Joel R.; Thoroughman, Doug; Anderson, Henry A.; Wells, Eden V.; Simms, Erin F.

2014-01-01

Disaster epidemiology (i.e., applied epidemiology in disaster settings) presents a source of reliable and actionable information for decision-makers and stakeholders in the disaster management cycle. However, epidemiological methods have yet to be routinely integrated into disaster response and fully communicated to response leaders. We present a framework consisting of rapid needs assessments, health surveillance, tracking and registries, and epidemiological investigations, including risk factor and health outcome studies and evaluation of interventions, which can be practiced throughout the cycle. Applying each method can result in actionable information for planners and decision-makers responsible for preparedness, response, and recovery. Disaster epidemiology, once integrated into the disaster management cycle, can provide the evidence base to inform and enhance response capability within the public health infrastructure. PMID:25211748
Current Methodologies of Identifying R&M (Reliability and Maintainability) Problems in Fielded Weapon Systems

DTIC Science & Technology

1988-09-01

applies to a one Air Transport Rack (ATR) volume LRU in an airborne, uninhabited, fighter environment.) The goal is to have a 2000 hour mean time between...benefits of applying reliability and maintainability improvements to these weapon systems or components. Examples will be given in this research of...where the Pareto Principle applies. The Pareto analysis applies to field failure types as well as to shop defect types. In the following automotive
Validity and reliability of intraoral scanners compared to conventional gypsum models measurements: a systematic review.

PubMed

Aragón, Mônica L C; Pontes, Luana F; Bichara, Lívia M; Flores-Mir, Carlos; Normando, David

2016-08-01

The development of 3D technology and the trend of increasing the use of intraoral scanners in dental office routine lead to the need for comparisons with conventional techniques. To determine if intra- and inter-arch measurements from digital dental models acquired by an intraoral scanner are as reliable and valid as the similar measurements achieved from dental models obtained through conventional intraoral impressions. An unrestricted electronic search of seven databases until February 2015. Studies that focused on the accuracy and reliability of images obtained from intraoral scanners compared to images obtained from conventional impressions. After study selection the QUADAS risk of bias assessment tool for diagnostic studies was used to assess the risk of bias (RoB) among the included studies. Four articles were included in the qualitative synthesis. The scanners evaluated were OrthoProof, Lava, iOC intraoral, Lava COS, iTero and D250. These studies evaluated the reliability of tooth widths, Bolton ratio measurements, and image superimposition. Two studies were classified as having low RoB; one had moderate RoB and the remaining one had high RoB. Only one study evaluated the time required to complete clinical procedures and patients' opinions about the procedure. Patients reported feeling more comfortable with the conventional dental impression method. Associated costs were not considered in any of the included studies. Inter- and intra-arch measurements from digital models produced from intraoral scans appeared to be reliable and accurate in comparison to those from conventional impressions. This assessment only applies to the intraoral scanner models considered in the finally included studies. Digital models produced by intraoral scan eliminate the need for impression materials; however, currently, a longer time is needed to take the digital images. PROSPERO (CRD42014009702). None. © The Author 2016.
Published by Oxford University Press on behalf of the European Orthodontic Society. All rights reserved. For permissions, please email: journals.permissions@oup.com.
The process group approach to reliable distributed computing

NASA Technical Reports Server (NTRS)

Birman, Kenneth P.

1992-01-01

The difficulty of developing reliable distributed software is an impediment to applying distributed computing technology in many settings. Experience with the ISIS system suggests that a structured approach based on virtually synchronous process groups yields systems that are substantially easier to develop, exploit sophisticated forms of cooperative computation, and achieve high reliability. Six years of research on ISIS, describing the model, its implementation challenges, and the types of applications to which ISIS has been applied are reviewed.
[Reliability and reproducibility of the Fitzpatrick phototype scale for skin sensitivity to ultraviolet light].

PubMed

Sánchez, Guillermo; Nova, John; Arias, Nilsa; Peña, Bibiana

2008-12-01

The Fitzpatrick phototype scale has been used to determine skin sensitivity to ultraviolet light. The reliability of this scale in estimating sensitivity permits risk evaluation of skin cancer based on phototype. Reliability and changes in intra- and inter-observer concordance were determined for the Fitzpatrick phototype scale after the assessment methods for establishing the phototype were standardized. An analytical study of intra- and inter-observer concordance was performed. The Fitzpatrick phototype scale was standardized using focus group methodology. To determine intra- and inter-observer agreement, the weighted kappa statistical method was applied. The standardization effect was measured using the equal kappa contrast hypothesis and Wald test for dependent measurements. The phototype scale was applied to 155 patients over 15 years of age who were assessed four times by two independent observers. The sample was drawn from patients of the Centro Dermatológico Federico Lleras Acosta. During the pre-standardization phase, the baseline and six-week inter-observer weighted kappa were 0.31 and 0.40, respectively. The intra-observer kappa values for observers A and B were 0.47 and 0.51, respectively. After the standardization process, the baseline and six-week inter-observer weighted kappa values were 0.77 and 0.82, respectively. Intra-observer kappa coefficients for observers A and B were 0.78 and 0.82. Statistically significant differences were found between coefficients before and after standardization (p<0.001) in all comparisons. Following a standardization exercise, the Fitzpatrick phototype scale yielded reliable, reproducible and consistent results.
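As an illustration of the agreement statistic named in the abstract above, the following is a minimal Python sketch of a weighted kappa for two observers rating the same patients on an ordinal scale. The abstract does not state which weighting scheme was used, so the linear disagreement weights here are an assumption, and the function name and the toy ratings are hypothetical rather than taken from the study.

```python
def weighted_kappa(rater_a, rater_b, categories):
    """Linear-weighted Cohen's kappa for two raters on an ordinal scale.

    `categories` lists the ordinal levels in order (e.g. phototypes 1..6).
    Linear weights are an assumption; the source does not name the scheme.
    """
    k = len(categories)
    if k < 2 or len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("need >= 2 categories and equal, non-empty ratings")
    index = {c: i for i, c in enumerate(categories)}
    n = len(rater_a)

    # Observed joint proportions of (rater A level, rater B level).
    observed = [[0.0] * k for _ in range(k)]
    for a, b in zip(rater_a, rater_b):
        observed[index[a]][index[b]] += 1.0 / n

    # Marginal proportions for each rater.
    row = [sum(observed[i]) for i in range(k)]
    col = [sum(observed[i][j] for i in range(k)) for j in range(k)]

    # Disagreement weight grows linearly with the distance between levels.
    def weight(i, j):
        return abs(i - j) / (k - 1)

    observed_dis = sum(weight(i, j) * observed[i][j]
                       for i in range(k) for j in range(k))
    expected_dis = sum(weight(i, j) * row[i] * col[j]
                       for i in range(k) for j in range(k))
    if expected_dis == 0.0:
        raise ValueError("kappa is undefined when both raters use one level")
    return 1.0 - observed_dis / expected_dis


# Hypothetical ratings of six patients by two observers.
phototypes = [1, 2, 3, 4, 5, 6]
observer_a = [1, 2, 3, 4, 5, 6]
observer_b = [1, 2, 3, 4, 5, 6]
kappa_identical = weighted_kappa(observer_a, observer_b, phototypes)
```

With linear weights a disagreement of one phototype level is penalised less than a disagreement of several levels, which is the usual reason for preferring a weighted over an unweighted kappa on ordinal scales.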
Item Response Theory analysis of Fagerström Test for Cigarette Dependence.

PubMed

Svicher, Andrea; Cosci, Fiammetta; Giannini, Marco; Pistelli, Francesco; Fagerström, Karl

2018-02-01

The Fagerström Test for Cigarette Dependence (FTCD) and the Heaviness of Smoking Index (HSI) are the gold standard measures to assess cigarette dependence. However, FTCD reliability and factor structure have been questioned and HSI psychometric properties are in need of further investigations. The present study examined the psychometric properties of the FTCD and the HSI via the Item Response Theory. The study was a secondary analysis of data collected in 862 Italian daily smokers. Confirmatory factor analysis was run to evaluate the dimensionality of FTCD. A Graded Response Model was applied to FTCD and HSI to verify the fit to the data. Both item and test functioning were analyzed and item statistics, Test Information Function, and scale reliabilities were calculated. Mokken Scale Analysis was applied to estimate homogeneity and Loevinger's coefficients were calculated. The FTCD showed unidimensionality and homogeneity for most of the items and for the total score. It also showed high sensitivity and good reliability from medium to high levels of cigarette dependence, although problems related to some items (i.e., items 3 and 5) were evident. HSI had good homogeneity, adequate item functioning, and high reliability from medium to high levels of cigarette dependence. Significant Differential Item Functioning was found for items 1, 4, 5 of the FTCD and for both items of HSI. HSI seems highly recommended in clinical settings addressed to heavy smokers while FTCD would be better used in smokers with a level of cigarette dependence ranging between low and high. Copyright © 2017 Elsevier Ltd. All rights reserved.
Assessing self-regulation strategies: development and validation of the tempest self-regulation questionnaire for eating (TESQ-E) in adolescents.

PubMed

De Vet, Emely; De Ridder, Denise; Stok, Marijn; Brunso, Karen; Baban, Adriana; Gaspar, Tania

2014-09-02

Applying self-regulation strategies has proven important in eating behaviors, but it remains subject to investigation what strategies adolescents report using to ensure healthy eating, and adequate measures are lacking. Therefore, we developed and validated a self-regulation questionnaire applied to eating (TESQ-E) for adolescents. Study 1 reports a four-step approach to develop the TESQ-E questionnaire (n = 1097). Study 2 was a cross-sectional survey among adolescents from nine European countries (n = 11,392) that assessed the TESQ-E, eating-related behaviors, dietary intake and background characteristics. In study 3, the TESQ-E was administered twice within four weeks to evaluate test-retest reliability (n = 140). Study 4 was a cross-sectional survey (n = 93) that assessed the TESQ-E and related psychological constructs (e.g., motivation, autonomy, self-control). All participants were aged between 10 and 17 years. Study 1 resulted in a 24-item questionnaire assessing adolescent-reported use of six specific strategies for healthy eating that represent three general self-regulation approaches. Study 2 showed that the easy-to-administer theory-based TESQ-E has a clear factor structure and good subscale reliabilities. The questionnaire was related to eating-related behaviors and dietary intake, indicating predictive validity. Study 3 showed good test-retest reliabilities for the TESQ-E. Study 4 indicated that TESQ-E was related to but also distinguishable from general self-regulation and motivation measures. The TESQ-E provides a reliable and valid measure to assess six theory-based self-regulation strategies that adolescents may use to ensure their healthy eating.
A study on reliability of power customer in distribution network

NASA Astrophysics Data System (ADS)

Liu, Liyuan; Ouyang, Sen; Chen, Danling; Ma, Shaohua; Wang, Xin

2017-05-01

The existing power supply reliability index system is oriented to the power system and does not consider actual electricity availability on the customer side. In addition, it is unable to reflect outages or customer equipment shutdowns caused by instantaneous interruptions and power quality problems. This paper thus makes a systematic study of the reliability of power customers. By comparison with power supply reliability, the reliability of power customers is defined and its evaluation requirements are extracted. An index system, consisting of seven customer indexes and two contrast indexes, is designed to describe the reliability of power customers in terms of continuity and availability. In order to comprehensively and quantitatively evaluate the reliability of power customers in distribution networks, a reliability evaluation method is proposed based on an improved entropy method and the punishment weighting principle. Practical application has shown that the reliability index system and evaluation method for power customers are reasonable and effective.
Hyperspectral imaging applied to complex particulate solids systems

NASA Astrophysics Data System (ADS)

Bonifazi, Giuseppe; Serranti, Silvia

2008-04-01

HyperSpectral Imaging (HSI) is based on the utilization of an integrated hardware and software (HW&SW) platform embedding conventional imaging and spectroscopy to attain both spatial and spectral information from an object. Although HSI was originally developed for remote sensing, it has recently emerged as a powerful process analytical tool, for non-destructive analysis, in many research and industrial sectors. The possibility to apply on-line HSI based techniques in order to identify and quantify specific particulate solid systems characteristics is presented and critically evaluated. The originally developed HSI based logics can be profitably applied in order to develop fast, reliable and low-cost strategies for: i) quality control of particulate products that must comply with specific chemical, physical and biological constraints, ii) performance evaluation of manufacturing strategies related to processing chains and/or real-time tuning of operative variables and iii) classification-sorting actions addressed to recognize and separate different particulate solid products. Case studies, related to recent advances in the application of HSI to different industrial sectors, such as agriculture, food, pharmaceuticals, solid waste handling and recycling, etc. and addressed to specific goals such as contaminant detection, defect identification, constituent analysis and quality evaluation are described, according to authors' originally developed application.
[Application of entropy-weight TOPSIS model in synthetical quality evaluation of Angelica sinensis growing in Gansu Province].

PubMed

Gu, Zhi-rong; Wang, Ya-li; Sun, Yu-jing; Dind, Jun-xia

2014-09-01

To investigate the establishment and application methods of entropy-weight TOPSIS model in synthetical quality evaluation of traditional Chinese medicine with Angelica sinensis growing in Gansu Province as an example. The contents of ferulic acid, 3-butylphthalide, Z-butylidenephthalide, Z-ligustilide, linolic acid, volatile oil, and ethanol soluble extractive were used as an evaluation index set. The weights of each evaluation index were determined by information entropy method. The entropy-weight TOPSIS model was established to synthetically evaluate the quality of Angelica sinensis growing in Gansu Province by Euclid closeness degree. The results based on established model were in line with the daodi meaning and the knowledge of clinical experience. The established model was simple in calculation, objective, reliable, and can be applied to synthetical quality evaluation of traditional Chinese medicine.
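The abstract above combines two steps, information-entropy weighting of the indices and a TOPSIS ranking by Euclidean closeness. The following Python sketch shows one common textbook form of that pipeline. It assumes every index is a "larger is better" criterion with strictly positive values, and the sample matrix, function names and normalisation choices are illustrative assumptions, not the authors' data or exact procedure.

```python
import math


def entropy_weights(matrix):
    """Entropy weights for the columns (indices) of a samples-by-indices matrix.

    Assumes positive values and at least one column that varies across samples.
    """
    m, n = len(matrix), len(matrix[0])
    divergence = []
    for j in range(n):
        column = [row[j] for row in matrix]
        total = sum(column)
        shares = [x / total for x in column]
        # Normalised Shannon entropy in [0, 1]; low entropy = informative index.
        entropy = -sum(p * math.log(p) for p in shares if p > 0) / math.log(m)
        divergence.append(1.0 - entropy)
    scale = sum(divergence)
    return [d / scale for d in divergence]


def topsis_closeness(matrix, weights):
    """Relative closeness of each sample to the ideal solution (benefit criteria)."""
    n = len(matrix[0])
    norms = [math.sqrt(sum(row[j] ** 2 for row in matrix)) for j in range(n)]
    weighted = [[weights[j] * row[j] / norms[j] for j in range(n)]
                for row in matrix]
    best = [max(r[j] for r in weighted) for j in range(n)]
    worst = [min(r[j] for r in weighted) for j in range(n)]
    scores = []
    for r in weighted:
        d_best = math.sqrt(sum((r[j] - best[j]) ** 2 for j in range(n)))
        d_worst = math.sqrt(sum((r[j] - worst[j]) ** 2 for j in range(n)))
        scores.append(d_worst / (d_best + d_worst))
    return scores


# Hypothetical contents of two indices measured in three samples.
samples = [[3.0, 9.0], [2.0, 6.0], [1.0, 3.0]]
index_weights = entropy_weights(samples)
closeness = topsis_closeness(samples, index_weights)
```

A sample that is best on every index gets closeness 1 and one that is worst on every index gets 0, so the scores can be read directly as a quality ranking.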
Reliability analysis of the AOSpine thoracolumbar spine injury classification system by a worldwide group of naïve spinal surgeons.

PubMed

Kepler, Christopher K; Vaccaro, Alexander R; Koerner, John D; Dvorak, Marcel F; Kandziora, Frank; Rajasekaran, Shanmuganathan; Aarabi, Bizhan; Vialle, Luiz R; Fehlings, Michael G; Schroeder, Gregory D; Reinhold, Maximilian; Schnake, Klaus John; Bellabarba, Carlo; Cumhur Öner, F

2016-04-01

The aims of this study were (1) to demonstrate the AOSpine thoracolumbar spine injury classification system can be reliably applied by an international group of surgeons and (2) to delineate those injury types which are difficult for spine surgeons to classify reliably. A previously described classification system of thoracolumbar injuries which consists of a morphologic classification of the fracture, a grading system for the neurologic status and relevant patient-specific modifiers was applied to 25 cases by 100 spinal surgeons from across the world twice independently, in grading sessions 1 month apart. The results were analyzed for classification reliability using the Kappa coefficient (κ). The overall Kappa coefficient for all cases was 0.56, which represents moderate reliability. Kappa values describing interobserver agreement were 0.80 for type A injuries, 0.68 for type B injuries and 0.72 for type C injuries, all representing substantial reliability. The lowest level of agreement for specific subtypes was for fracture subtype A4 (Kappa = 0.19). Intraobserver analysis demonstrated overall average Kappa statistic for subtype grading of 0.68 also representing substantial reproducibility. In a worldwide sample of spinal surgeons without previous exposure to the recently described AOSpine Thoracolumbar Spine Injury Classification System, we demonstrated moderate interobserver and substantial intraobserver reliability. These results suggest that most spine surgeons can reliably apply this system to spine trauma patients as or more reliably than previously described systems.
Applying reliability analysis to design electric power systems for More-electric aircraft

NASA Astrophysics Data System (ADS)

Zhang, Baozhu

The More-Electric Aircraft (MEA) is a type of aircraft that replaces conventional hydraulic and pneumatic systems with electrically powered components. These changes have significantly challenged the aircraft electric power system design. This thesis investigates how reliability analysis can be applied to automatically generate system topologies for the MEA electric power system. We first use a traditional method of reliability block diagrams to analyze the reliability level on different system topologies. We next propose a new methodology in which system topologies, constrained by a set reliability level, are automatically generated. The path-set method is used for analysis. Finally, we interface these sets of system topologies with control synthesis tools to automatically create correct-by-construction control logic for the electric power system.
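To make the two analysis techniques named in the abstract above concrete, here is a small Python sketch of reliability block diagram arithmetic (series and parallel blocks) and of a path-set evaluation done by brute-force enumeration of component states. It assumes statistically independent components, and the component names, reliability values and topology are hypothetical, not taken from the thesis.

```python
from itertools import product


def series(reliabilities):
    """A series block works only if every component works."""
    result = 1.0
    for r in reliabilities:
        result *= r
    return result


def parallel(reliabilities):
    """A parallel (redundant) block fails only if every component fails."""
    all_fail = 1.0
    for r in reliabilities:
        all_fail *= (1.0 - r)
    return 1.0 - all_fail


def path_set_reliability(component_reliability, path_sets):
    """System reliability from minimal path sets, by enumerating all states.

    The system is up when at least one path set has all its components up.
    Enumeration is exponential in the number of components, so this is only
    a sketch for small topologies.
    """
    names = sorted(component_reliability)
    total = 0.0
    for states in product([True, False], repeat=len(names)):
        up = {name for name, ok in zip(names, states) if ok}
        if any(set(path) <= up for path in path_sets):
            probability = 1.0
            for name, ok in zip(names, states):
                r = component_reliability[name]
                probability *= r if ok else (1.0 - r)
            total += probability
    return total


# Hypothetical topology: two redundant generators feeding one bus.
components = {"gen1": 0.9, "gen2": 0.9, "bus": 0.99}
paths = [{"gen1", "bus"}, {"gen2", "bus"}]
by_paths = path_set_reliability(components, paths)
by_blocks = series([parallel([0.9, 0.9]), 0.99])
```

For this simple series-parallel layout the two calculations agree; the path-set form is the one that still applies when a topology cannot be reduced to nested series and parallel blocks.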
A Novel Ontology Approach to Support Design for Reliability considering Environmental Effects

PubMed Central

Sun, Bo; Li, Yu; Ye, Tianyuan

2015-01-01

Environmental effects are not considered sufficiently in product design. Reliability problems caused by environmental effects are very prominent. This paper proposes a method to apply ontology approach in product design. During product reliability design and analysis, environmental effects knowledge reusing is achieved. First, the relationship of environmental effects and product reliability is analyzed. Then environmental effects ontology to describe environmental effects domain knowledge is designed. Related concepts of environmental effects are formally defined by using the ontology approach. This model can be applied to arrange environmental effects knowledge in different environments. Finally, rubber seals used in the subhumid acid rain environment are taken as an example to illustrate ontological model application on reliability design and analysis. PMID:25821857
A novel ontology approach to support design for reliability considering environmental effects.

PubMed

Sun, Bo; Li, Yu; Ye, Tianyuan; Ren, Yi

2015-01-01

Environmental effects are not considered sufficiently in product design. Reliability problems caused by environmental effects are very prominent. This paper proposes a method to apply ontology approach in product design. During product reliability design and analysis, environmental effects knowledge reusing is achieved. First, the relationship of environmental effects and product reliability is analyzed. Then environmental effects ontology to describe environmental effects domain knowledge is designed. Related concepts of environmental effects are formally defined by using the ontology approach. This model can be applied to arrange environmental effects knowledge in different environments. Finally, rubber seals used in the subhumid acid rain environment are taken as an example to illustrate ontological model application on reliability design and analysis.
Evaluation of tools used to measure calcium and/or dairy consumption in children and adolescents.

PubMed

Magarey, Anthea; Yaxley, Alison; Markow, Kylie; Baulderstone, Lauren; Miller, Michelle

2014-08-01

To identify and critique tools that assess Ca and/or dairy intake in children to ascertain the most accurate and reliable tools available. A systematic review of the literature was conducted using defined inclusion and exclusion criteria. Articles were included on the basis that they reported on a tool measuring Ca and/or dairy intake in children in Western countries and reported on originally developed tools or tested the validity or reliability of existing tools. Defined criteria for reporting reliability and validity properties were applied. Studies in Western countries. Children. Eighteen papers reporting on two tools that assessed dairy intake, ten that assessed Ca intake and five that assessed both dairy and Ca were identified. An examination of tool testing revealed high reliance on lower-order tests such as correlation and failure to differentiate between statistical and clinically meaningful significance. Only half of the tools were tested for reliability and results indicated that only one Ca tool and one dairy tool were reliable. Validation studies showed acceptable levels of agreement (<100 mg difference) and/or sensitivity (62-83 %) and specificity (55-77 %) in three Ca tools. With reference to the testing methodology and results, no tools were considered both valid and reliable for the assessment of dairy intake and only one tool proved valid and reliable for the assessment of Ca intake. These results clearly indicate the need for development and rigorous testing of tools to assess Ca and/or dairy intake in children and adolescents.
The cross-cultural adaptation, reliability, and validity of the Copenhagen Neck Functional Disability Scale in patients with chronic neck pain: Turkish version study.

PubMed

Yapali, Gökmen; Günel, Mintaze Kerem; Karahan, Sevilay

2012-05-15

The study design was cross-cultural adaptation and investigation of reliability and validity of the Copenhagen Neck Functional Disability Scale (CNFDS). The aim of this study was to translate the CNFDS into the Turkish language and assess its reliability and validity among patients with neck pain in the Turkish population. The CNFDS is a reliable and valid evaluation instrument for disability, but there is no published Turkish version of the CNFDS. One hundred one subjects who had chronic neck pain were included in this study. The CNFDS, Neck Pain and Disability Scale, and visual analogue scale were administered to all subjects. For investigating test-retest reliability, the CNFDS was applied twice at a 1-week interval; the intraclass correlation coefficient for test-retest reliability was 0.86 (95% confidence interval = 0.679-0.935). There was no difference between test-retest scores (P < 0.001). For investigating concurrent validity, the correlation between the total score of the CNFDS and the mean visual analogue scale was r = 0.73 (P < 0.001). Concurrent validity of the CNFDS was very good. For investigating construct validity, the correlation between the total score of the CNFDS and the Neck Pain and Disability Scale was r = 0.78 (P < 0.001). Construct validity of the CNFDS was also very good. Our results suggest that the Turkish version of the CNFDS is a reliable and valid instrument for Turkish people.
Developing evaluation scales for horticultural therapy.

PubMed

Im, Eun-Ae; Park, Sin-Ae; Son, Ki-Cheol

2018-04-01

This study developed evaluation scales for measuring the effects of horticultural therapy in practical settings. Qualitative and quantitative research, including three preliminary studies and a main study, was conducted. In the first study, a total of 779 horticultural therapists answered an open-ended questionnaire based on 58 items about elements of occupational therapy and seven factors about singularity of horticultural therapy. In the second study, 20 horticultural therapists participated in in-depth interviews. In the third study, a Delphi method was conducted with 24 horticultural therapists to build a model of assessment indexes and ensure the validity. In the final study, the reserve scales were tested by 121 horticultural therapists in their practical settings for 1045 clients, to verify their reliability and validity. Preliminary questions in the effects area of horticultural therapy were developed in the first study, and validity for the components in the second study. In the third study, an expert Delphi survey was conducted as part of content validity verification of the preliminary tool of horticultural therapy for physical, cognitive, psychological-emotional, and social areas. In the final study, the evaluation tool, which verified the construct, convergence, discriminant, and predictive validity and reliability test, was used to finalise the evaluation tool. The effects of horticultural therapy were classified as four different aspects, namely, physical, cognitive, psycho-emotional, and social, based on previous studies on the effects of horticultural therapy. 98 questions in the four aspects were selected as reserve scales. The reliability of each scale was calculated as 0.982 in physical, 0.980 in cognitive, 0.965 in psycho-emotional, and 0.972 in social aspects based on the Cronbach's test of intra-item internal consistency and half reliability of Spearman-Brown. This study was the first to demonstrate validity and reliability by simultaneously developing four measures of horticultural therapy effectiveness, namely, physical, cognitive, psychological-emotional, and social, both locally and externally. It is especially worthwhile in that it can be applied in common to people. Copyright © 2018 Elsevier Ltd. All rights reserved.
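The two reliability coefficients mentioned in the abstract above can be sketched in a few lines of Python. This is a generic illustration of Cronbach's alpha and the Spearman-Brown split-half correction using their standard formulas; the toy score matrix and function names are hypothetical, and nothing here reproduces the study's data or software.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a respondents-by-items score matrix.

    alpha = k / (k - 1) * (1 - sum of item variances / variance of totals),
    using sample variances. Requires at least two items and two respondents.
    """
    k = len(scores[0])
    item_variances = [variance([row[j] for row in scores]) for j in range(k)]
    total_variance = variance([sum(row) for row in scores])
    return k / (k - 1) * (1.0 - sum(item_variances) / total_variance)


def spearman_brown(half_correlation):
    """Full-test reliability predicted from the correlation of two halves."""
    return 2.0 * half_correlation / (1.0 + half_correlation)


# Hypothetical answers of four respondents to three perfectly consistent items.
answers = [[1, 1, 1], [2, 2, 2], [3, 3, 3], [4, 4, 4]]
alpha = cronbach_alpha(answers)
full_test = spearman_brown(0.5)
```

Because the three toy items always move together, alpha comes out at its maximum of 1; real scales such as the ones reported above fall below that as item responses diverge.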
Development and psychometric testing of the Knowledge, Attitudes and Practices (KAP) questionnaire among student Tuberculosis (TB) Patients (STBP-KAPQ) in China.

PubMed

Fan, Yahui; Zhang, Shaoru; Li, Yan; Li, Yuelu; Zhang, Tianhua; Liu, Weiping; Jiang, Hualin

2018-05-08

TB outbreaks in schools are extremely complex and present a major challenge for public health. Understanding the knowledge, attitudes and practices among student TB patients in such settings is fundamental when it comes to decreasing future TB cases. The objective of this study was to develop a Knowledge, Attitudes and Practices Questionnaire among Student Tuberculosis Patients (STBP-KAPQ), and evaluate its psychometric properties. This study was conducted in three stages: item construction, pilot testing in 10 student TB patients and psychometric testing, including reliability and validity. The item pool for the questionnaire was compiled from literature review and early individual interviews. The questionnaire items were evaluated by the Delphi method based on 12 experts. Reliability and validity were assessed using student TB patients (n = 416) and healthy students (n = 208). Reliability was examined with internal consistency reliability and test-retest reliability. Content validity was calculated by content validity index (CVI); construct validity was examined using exploratory factor analysis (EFA) and confirmatory factor analysis (CFA); the Public Tuberculosis Knowledge, Attitudes and Practices Questionnaire (PTB-KAPQ) was applied to evaluate criterion validity; for discriminant validity, a t-test was performed. The final STBP-KAPQ consisted of three dimensions and 25 items. Cronbach's α coefficient and intraclass correlation coefficient (ICC) were 0.817 and 0.765, respectively. Content validity index (CVI) was 0.962. Seven common factors were extracted by principal factor analysis and varimax rotation, with a cumulative contribution of 66.253%. The resulting CFA model of the STBP-KAPQ exhibited an appropriate model fit (χ2/df = 1.74, RMSEA = 0.082, CFI = 0.923, NNFI = 0.962). STBP-KAPQ and PTB-KAPQ had a strong correlation in the knowledge part, and the correlation coefficient was 0.606 (p < 0.05). Discriminant validity was supported through a significant difference between student TB patients and healthy students across all domains (p < 0.05). An instrument, "Knowledge, Attitudes and Practices Questionnaire among Student Tuberculosis Patients (STBP-KAPQ)" was developed. Psychometric testing indicated that it had adequate validity and reliability for use in KAP research with student TB patients in China. The new tool might help public health researchers evaluate the level of KAP in student TB patients, and it could also be used to examine the effects of TB health education.
Evaluating the Reliability of Emergency Response Systems for Large-Scale Incident Operations

PubMed Central

Jackson, Brian A.; Faith, Kay Sullivan; Willis, Henry H.

2012-01-01

The ability to measure emergency preparedness, that is, to predict the likely performance of emergency response systems in future events, is critical for policy analysis in homeland security. Yet it remains difficult to know how prepared a response system is to deal with large-scale incidents, whether it be a natural disaster, terrorist attack, or industrial or transportation accident. This research draws on the fields of systems analysis and engineering to apply the concept of system reliability to the evaluation of emergency response systems. The authors describe a method for modeling an emergency response system; identifying how individual parts of the system might fail; and assessing the likelihood of each failure and the severity of its effects on the overall response effort. The authors walk the reader through two applications of this method: a simplified example in which responders must deliver medical treatment to a certain number of people in a specified time window, and a more complex scenario involving the release of chlorine gas. The authors also describe an exploratory analysis in which they parsed a set of after-action reports describing real-world incidents, to demonstrate how this method can be used to quantitatively analyze data on past response performance. The authors conclude with a discussion of how this method of measuring emergency response system reliability could inform policy discussion of emergency preparedness, how system reliability might be improved, and the costs of doing so. PMID:28083267
Evaluating the Effect of Minimizing Screws on Stabilization of Symphysis Mandibular Fracture by 3D Finite Element Analysis.

PubMed

Kharmanda, Ghias; Kharma, Mohamed-Yaser

2017-06-01

The objective of this work is to integrate structural optimization and reliability concepts into the mini-plate fixation strategy used in symphysis mandibular fractures. The structural reliability levels are next estimated when considering a single failure mode and multiple failure modes. A 3-dimensional finite element model is developed in order to evaluate the ability to reduce the negative effect due to the stabilization of the fracture. A topology optimization process is considered in the conceptual design stage to predict possible fixation layouts. In the detailed design stage, suitable mini-plates are selected taking into account the resulting topology and different anatomical considerations. Several muscle forces are considered in order to obtain realistic predictions. Since some muscles can be cut or harmed during the surgery and cannot operate at their maximum capacity, there is a strong motivation to introduce the loading uncertainties in order to obtain reliable designs. The structural reliability is carried out for a single failure mode and multiple failure modes. The different results are validated with a clinical case of a male patient with symphysis fracture. In this case, although the upper plate fixation had four holes, only two screws were applied in order to protect adjacent vital structures. This does not affect the stability of the fracture. The proposed strategy to optimize bone plates leads to fewer complications and second surgeries, less patient discomfort, and shorter time of healing.

Evaluating the evidence for non-monotonic dose-response relationships: A systematic literature review and (re-)analysis of in vivo toxicity data in the area of food safety.

PubMed

Varret, C; Beronius, A; Bodin, L; Bokkers, B G H; Boon, P E; Burger, M; De Wit-Bos, L; Fischer, A; Hanberg, A; Litens-Karlsson, S; Slob, W; Wolterink, G; Zilliacus, J; Beausoleil, C; Rousselle, C

2018-01-15

This study aims to evaluate the evidence for the existence of non-monotonic dose-responses (NMDRs) of substances in the area of food safety. This review was performed following the systematic review methodology with the aim to identify in vivo studies published between January 2002 and February 2015 containing evidence for potential NMDRs. Inclusion and reliability criteria were defined and used to select relevant and reliable studies. A set of six checkpoints was developed to establish the likelihood that the data retrieved contained evidence for NMDR. In this review, 49 in vivo studies were identified as relevant and reliable, of which 42 were used for dose-response analysis. These studies contained 179 in vivo dose-response datasets with at least five dose groups (and a control group) as fewer doses cannot provide evidence for NMDR. These datasets were extracted and analyzed using the PROAST software package. The resulting dose-response relationships were evaluated for possible evidence of NMDRs by applying the six checkpoints. In total, 10 out of the 179 in vivo datasets fulfilled all six checkpoints. While these datasets could be considered as providing evidence for NMDR, replicated studies would still be needed to check if the results can be reproduced to rule out that the non-monotonicity was caused by incidental anomalies in that specific study. This approach, combining a systematic review with a set of checkpoints, is new and appears useful for future evaluations of the dose response datasets regarding evidence of non-monotonicity. Published by Elsevier Inc.
Can gastritis symptoms be evaluated in clinical trials? An overview of treatment of gastritis, nonulcer dyspepsia and Campylobacter-associated gastritis.

PubMed

Veldhuyzen van Zanten, S J; Tytgat, K M; Jalali, S; Goodacre, R L; Hunt, R H

1989-10-01

We carried out a review of the literature on Campylobacter pylori-associated gastritis and nonulcer dyspepsia (NUD) to determine whether or not symptoms related to these conditions can be measured reliably and whether or not any study to date has shown that treatment alters symptoms. Search strategies consisted of online Medline searching, a forward search of three articles using the Science Citation Index, a manual search of five gastroenterological journals, and a fully recursive search of cited references. Inclusion and quality criteria were applied to all retrieved studies. Nine of 23 studies did not fulfill the inclusion criteria. Of the 14 studies analyzed, two measured symptoms reliably. Neither showed a therapeutic benefit on symptoms. The difficulties encountered in conducting such studies and the methods of recording symptoms reliably are discussed. We conclude that to date, no treatment is of proven benefit in the relief of symptoms associated with C. pylori gastritis and NUD.
Research on the optimal structure configuration of dither RLG used in skewed redundant INS

NASA Astrophysics Data System (ADS)

Gao, Chunfeng; Wang, Qi; Wei, Guo; Long, Xingwu

2016-05-01

The actual combat effectiveness of weapon equipment is restricted by the performance of the Inertial Navigation System (INS), especially in situations requiring high reliability such as fighters, satellites and submarines. Through the use of skewed sensor geometries, redundancy techniques have been applied to reduce the cost and improve the reliability of the INS. In this paper, the structure configuration and the inertial sensor characteristics of the Skewed Redundant Strapdown Inertial Navigation System (SRSINS) using dithered Ring Laser Gyroscopes (RLG) are analyzed. Owing to the dither coupling effects of the dithered gyros, the system measurement errors can be amplified either when the individual gyro dither frequencies are near one another or when the structure of the SRSINS is unreasonable. Based on the characteristics of the RLG, research on the coupled vibration of dithered RLGs in the SRSINS is carried out. On the principle of optimal navigation performance, optimal reliability and optimal cost-effectiveness, a comprehensive evaluation scheme for the inertial sensor configuration of the SRSINS is given.
[Upper limb functional assessment scale for children with Duchenne muscular dystrophy and Spinal muscular atrophy].

PubMed

Escobar, Raúl G; Lucero, Nayadet; Solares, Carmen; Espinoza, Victoria; Moscoso, Odalie; Olguín, Polín; Muñoz, Karin T; Rosas, Ricardo

2016-08-16

Duchenne muscular dystrophy (DMD) and Spinal muscular atrophy (SMA) cause significant disability and progressive functional impairment. Readily available instruments that assess functionality, especially in advanced stages of the disease, are required to monitor the progress of the disease and the impact of therapeutic interventions. To describe the development of a scale to evaluate upper limb function (UL) in patients with DMD and SMA, and describe its validation process, which includes self-training for evaluators. The development of the scale included a review of published scales, an exploratory application of a pilot scale in healthy children and those with DMD, self-training of evaluators in applying the scale using a handbook and video tutorial, and assessment of a group of children with DMD and SMA using the final scale. Reliability was assessed using Cronbach and Kendall concordance and with intra- and inter-rater test-retest, and validity with concordance and factorial analysis. A high level of reliability was observed, with high internal consistency (Cronbach α=0.97), and inter-rater (Kendall W=0.96) and intra-rater concordance (r=0.97 to 0.99). The validity was demonstrated by the absence of significant differences between results by different evaluators with an expert evaluator (F=0.023, P>.5), and by the factor analysis that showed that four factors account for 85.44% of total variance. This scale is a reliable and valid tool for assessing UL functionality in children with DMD and SMA. It is also easily implementable due to the possibility of self-training and the use of simple and inexpensive materials. Copyright © 2016 Sociedad Chilena de Pediatría. Publicado por Elsevier España, S.L.U. All rights reserved.
Study on evaluation of construction reliability for engineering project based on fuzzy language operator

NASA Astrophysics Data System (ADS)

Shi, Yu-Fang; Ma, Yi-Yi; Song, Ping-Ping

2018-03-01

System reliability theory has been a research hotspot in management science and systems engineering in recent years, and construction reliability is useful for quantitative evaluation of the project management level. A definition of construction reliability is derived from reliability theory and the target system of engineering project management. Based on fuzzy mathematics theory and language operators, the value space of construction reliability is divided into seven fuzzy subsets; correspondingly, seven membership functions and fuzzy evaluation intervals are obtained through the operation of the language operators, which provides the method and parameters for evaluating construction reliability. The method is proved to be scientific and reasonable for construction conditions and is a useful attempt at theory and method research on engineering project system reliability.
Confirmatory factor analysis of different versions of the Body Shape Questionnaire applied to Brazilian university students.

PubMed

da Silva, Wanderson Roberto; Dias, Juliana Chioda Ribeiro; Maroco, João; Campos, Juliana Alvares Duarte Bonini

2014-09-01

This study aimed at evaluating the validity, reliability, and factorial invariance of the complete (34-item) and shortened (8-item and 16-item) versions of the Body Shape Questionnaire (BSQ) when applied to Brazilian university students. A total of 739 female students with a mean age of 20.44 (standard deviation=2.45) years participated. Confirmatory factor analysis was conducted to verify the degree to which the one-factor structure satisfies the proposal for the BSQ's expected structure. Two items of the 34-item version were excluded because they had factor weights (λ)<.40. All models had adequate convergent validity (average variance extracted=.43-.58; composite reliability=.85-.97) and internal consistency (α=.85-.97). The 8-item B version was considered the best shortened BSQ version (Akaike information criterion=84.07, Bayes information criterion=157.75, Browne-Cudeck criterion=84.46), with strong invariance for independent samples (Δχ²λ(7)=5.06, Δχ²Cov(8)=5.11, Δχ²Res(16)=19.30). Copyright © 2014 Elsevier Ltd. All rights reserved.
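The convergent-validity indices named above can be sketched from standardized factor loadings. This minimal example assumes the commonly used Fornell-Larcker formulas with uncorrelated errors; the abstract does not state which formulas the authors used, and the loadings below are hypothetical.

```python
def average_variance_extracted(loadings):
    """AVE: mean of the squared standardized loadings."""
    return sum(l * l for l in loadings) / len(loadings)


def composite_reliability(loadings):
    """CR: (sum of loadings)^2 over itself plus the summed error variances.

    Assumes standardized loadings, so each error variance is 1 - loading^2.
    """
    squared_sum = sum(loadings) ** 2
    error_variance = sum(1 - l * l for l in loadings)
    return squared_sum / (squared_sum + error_variance)


# Hypothetical standardized loadings for an 8-item one-factor model.
example_loadings = [0.72, 0.68, 0.75, 0.61, 0.70, 0.66, 0.74, 0.69]
print(round(average_variance_extracted(example_loadings), 2))
print(round(composite_reliability(example_loadings), 2))
```

Under these formulas, an item with a loading below .40 contributes less than .16 to the AVE sum, which is one reason low-loading items are often dropped.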
Effects of imperfect automation on decision making in a simulated command and control task.

PubMed

Rovira, Ericka; McGarry, Kathleen; Parasuraman, Raja

2007-02-01

Effects of four types of automation support and two levels of automation reliability were examined. The objective was to examine the differential impact of information and decision automation and to investigate the costs of automation unreliability. Research has shown that imperfect automation can lead to differential effects of stages and levels of automation on human performance. Eighteen participants performed a "sensor to shooter" targeting simulation of command and control. Dependent variables included accuracy and response time of target engagement decisions, secondary task performance, and subjective ratings of mental workload, trust, and self-confidence. Compared with manual performance, reliable automation significantly reduced decision times. Unreliable automation led to greater cost in decision-making accuracy under the higher automation reliability condition for three different forms of decision automation relative to information automation. At low automation reliability, however, there was a cost in performance for both information and decision automation. The results are consistent with a model of human-automation interaction that requires evaluation of the different stages of information processing to which automation support can be applied. If fully reliable decision automation cannot be guaranteed, designers should provide users with information automation support or other tools that allow for inspection and analysis of raw data.
Reliability and validity of the workplace social distance scale.

PubMed

Yoshii, Hatsumi; Mandai, Nozomu; Saito, Hidemitsu; Akazawa, Kouhei

2014-10-29

Self-stigma, defined by a negative attitude toward oneself combined with the consciousness of being a target of prejudice, is a critical problem for psychiatric patients. Self-stigma studies among psychiatric patients have indicated that high stigma is predictive of detrimental effects such as the delay of treatment and decreases in social participation in patients, and levels of self-stigma should be statistically evaluated. In this study, we developed the Workplace Social Distance Scale (WSDS), rephrasing the eight items of the Japanese version of the Social Distance Scale (SDSJ) to apply to the work setting in Japan. We examined the reliability and validity of the WSDS among 83 psychiatric patients. Factor analysis extracted three factors from the scale items: "work relations," "shallow relationships," and "employment." These factors are similar to the assessment factors of the SDSJ. Cronbach's alpha coefficient for the WSDS was 0.753. The split-half reliability for the WSDS was 0.801, indicating significant correlations. In addition, the WSDS was significantly correlated with the SDSJ. These findings suggest that the WSDS represents an approximation of self-stigma in the workplace among psychiatric patients. Our study assessed the reliability and validity of the WSDS for measuring self-stigma in Japan. Future studies should investigate the reliability and validity of the scale in other countries.
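A split-half coefficient like the one reported above can be sketched as follows. The abstract does not say how the items were split or whether a correction was applied, so this example assumes an odd/even item split with the Spearman-Brown correction; all names and scores are hypothetical.

```python
import math


def pearson(x, y):
    """Pearson correlation between two equal-length sequences."""
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (sd_x * sd_y)


def spearman_brown(r):
    """Step a half-test correlation up to full-test length."""
    return 2 * r / (1 + r)


def split_half_reliability(scores):
    """Odd/even split-half reliability for a subjects-by-items matrix."""
    first_half = [sum(row[0::2]) for row in scores]   # items 1, 3, 5, ...
    second_half = [sum(row[1::2]) for row in scores]  # items 2, 4, 6, ...
    return spearman_brown(pearson(first_half, second_half))


# Hypothetical responses of 5 participants to an 8-item scale.
example_scores = [
    [3, 4, 3, 4, 2, 3, 4, 3],
    [1, 2, 1, 1, 2, 1, 2, 2],
    [4, 4, 3, 4, 4, 3, 4, 4],
    [2, 2, 3, 2, 2, 3, 2, 1],
    [3, 3, 4, 3, 3, 4, 3, 3],
]
print(round(split_half_reliability(example_scores), 3))
```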
Measuring professional satisfaction in Greek nurses: combination of qualitative and quantitative investigation to evaluate the validity and reliability of the Index of Work Satisfaction.

PubMed

Karanikola, Maria N K; Papathanassoglou, Elizabeth D E

2015-02-01

The Index of Work Satisfaction (IWS) is a comprehensive scale assessing nurses' professional satisfaction. The aim of the present study was to explore: a) the applicability, reliability and validity of the Greek version of the IWS and b) contrasts among the factors addressed by IWS against the main themes emerging from a qualitative phenomenological investigation of nurses' professional experiences. A descriptive correlational design was applied using a sample of 246 emergency and critical care nurses. Internal consistency and test-retest reliability were tested. Construct and content validity were assessed by factor analysis, and through qualitative phenomenological analysis with a purposive sample of 12 nurses. Scale factors were contrasted to qualitative themes to assure that IWS embraces all aspects of Greek nurses' professional satisfaction. The internal consistency (α = 0.81) and test-retest (tau = 1, p < 0.0001) reliability were adequate. Following appropriate modifications, factor analysis confirmed the construct validity of the scale and subscales. The qualitative data partially clarified the low reliability of one subscale. The Greek version of the IWS scale is supported for use in acute care. The mixed methods approach constitutes a powerful tool for transferring scales to different cultures and healthcare systems. Copyright © 2014 Elsevier Inc. All rights reserved.
Evaluation of the Validity and Reliability of the Waterlow Pressure Ulcer Risk Assessment Scale

PubMed Central

Charalambous, Charalambos; Koulori, Agoritsa; Vasilopoulos, Aristidis; Roupa, Zoe

2018-01-01

Introduction Prevention is the ideal strategy to tackle the problem of pressure ulcers. Pressure ulcer risk assessment scales are one of the most pivotal measures applied to tackle the problem, but much criticism has been raised regarding the validity and reliability of these scales. Objective To investigate the validity and reliability of the Waterlow pressure ulcer risk assessment scale. Method The methodology used is a narrative literature review; the bibliography was reviewed through Cinahl, PubMed, EBSCO, Medline and Google Scholar, and 26 scientific articles were identified. The articles were chosen due to their direct correlation with the objective under study and their scientific relevance. Results The construct and face validity of the Waterlow appear adequate, but with regard to content validity, changes in the categories age and gender could be beneficial. The concurrent validity cannot be assessed. The predictive validity of the Waterlow is characterized by high specificity and low sensitivity. The inter-rater reliability has been demonstrated to be inadequate; this may be due to a lack of clear definitions within the categories and differing levels of knowledge between the users. Conclusion Due to the limitations presented regarding the validity and reliability of the Waterlow pressure ulcer risk assessment scale, the scale should be used in conjunction with clinical assessment to provide optimum results. PMID:29736104
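To make the "high specificity and low sensitivity" finding concrete, this small sketch derives both measures from a 2x2 table of risk predictions against observed outcomes. The counts are hypothetical and are not drawn from the reviewed articles.

```python
def sensitivity_specificity(true_pos, false_neg, true_neg, false_pos):
    """Return (sensitivity, specificity) from confusion-matrix counts."""
    sensitivity = true_pos / (true_pos + false_neg)  # at-risk patients flagged
    specificity = true_neg / (true_neg + false_pos)  # not-at-risk patients cleared
    return sensitivity, specificity


# Hypothetical counts: 40 patients developed an ulcer, 100 did not.
sens, spec = sensitivity_specificity(true_pos=10, false_neg=30, true_neg=90, false_pos=10)
print(sens, spec)
```

A scale with this profile rarely raises false alarms but misses many patients who go on to develop an ulcer, which is consistent with the recommendation to pair it with clinical assessment.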
Evaluation of the Validity and Reliability of the Waterlow Pressure Ulcer Risk Assessment Scale.

PubMed

Charalambous, Charalambos; Koulori, Agoritsa; Vasilopoulos, Aristidis; Roupa, Zoe

2018-04-01

Prevention is the ideal strategy to tackle the problem of pressure ulcers. Pressure ulcer risk assessment scales are one of the most pivotal measures applied to tackle the problem, but much criticism has been raised regarding the validity and reliability of these scales. To investigate the validity and reliability of the Waterlow pressure ulcer risk assessment scale. The methodology used is a narrative literature review; the bibliography was reviewed through Cinahl, PubMed, EBSCO, Medline and Google Scholar, and 26 scientific articles were identified. The articles were chosen due to their direct correlation with the objective under study and their scientific relevance. The construct and face validity of the Waterlow appear adequate, but with regard to content validity, changes in the categories age and gender could be beneficial. The concurrent validity cannot be assessed. The predictive validity of the Waterlow is characterized by high specificity and low sensitivity. The inter-rater reliability has been demonstrated to be inadequate; this may be due to a lack of clear definitions within the categories and differing levels of knowledge between the users. Due to the limitations presented regarding the validity and reliability of the Waterlow pressure ulcer risk assessment scale, the scale should be used in conjunction with clinical assessment to provide optimum results.
A neural network application to classification of health status of HIV/AIDS patients.

PubMed

Kwak, N K; Lee, C

1997-04-01

This paper presents an application of neural networks to classify and to predict the health status of HIV/AIDS patients. A neural network model in classifying both the well and not-well health status of HIV/AIDS patients is developed and evaluated in terms of validity and reliability of the test. Several different neural network topologies are applied to AIDS Cost and Utilization Survey (ACSUS) datasets in order to demonstrate the neural network's capability.
Inter-clinician and intra-clinician reliability of force application during joint mobilization: a systematic review.

PubMed

Gorgos, Kara S; Wasylyk, Nicole T; Van Lunen, Bonnie L; Hoch, Matthew C

2014-04-01

Joint mobilizations are commonly used by clinicians to decrease pain and restore joint arthrokinematics following musculoskeletal injury. The force applied during a joint mobilization treatment is subjective to the individual clinician but may have an effect on patient outcomes. The purpose of this systematic review was to critically appraise and synthesize the studies which examined the reliability of clinicians' force application during joint mobilization. A systematic search of PubMed and EBSCO Host databases from inception to March 1, 2013 was conducted to identify studies assessing the reliability of force application during joint mobilizations. Two reviewers utilized the Quality Appraisal of Reliability Studies (QAREL) assessment tool to determine the quality of included studies. The relative reliability of the included studies was examined through intraclass correlation coefficients (ICC) to synthesize study findings. All results were collated qualitatively with a level of evidence approach. A total of seven studies met the eligibility criteria and were included. Five studies were included that assessed inter-clinician reliability, and six studies were included that assessed intra-clinician reliability. The overall level of evidence for inter-clinician reliability was strong for poor-to-moderate reliability (ICC = -0.04 to 0.70). The overall level of evidence for intra-clinician reliability was strong for good reliability (ICC = 0.75-0.99). This systematic review indicates there is variability in force application between clinicians but individual clinicians apply forces consistently. The results of this systematic review suggest innovative instructional methods are needed to improve consistency and validate the forces applied during joint mobilization treatments. This is particularly evident for improving the consistency of force application across clinicians. Copyright © 2014 Elsevier Ltd. All rights reserved.
Enhancing clinical evidence by proactively building quality into clinical trials.

PubMed

Meeker-O'Connell, Ann; Glessner, Coleen; Behm, Mark; Mulinde, Jean; Roach, Nancy; Sweeney, Fergus; Tenaerts, Pamela; Landray, Martin J

2016-08-01

Stakeholders across the clinical trial enterprise have expressed concern that the current clinical trial enterprise is unsustainable. The cost and complexity of trials have continued to increase, threatening our ability to generate reliable evidence essential for making appropriate decisions concerning the benefits and harms associated with clinical interventions. Overcoming this inefficiency rests on improving protocol design, trial planning, and quality oversight. The Clinical Trials Transformation Initiative convened a project to evaluate methods to prospectively build quality into the scientific and operational design of clinical trials ("quality-by-design"), such that trials are feasible to conduct and important errors are prevented rather than remediated. A working group evaluated aspects of trial design and oversight and developed the Clinical Trials Transformation Initiative quality-by-design principles document, outlining a series of factors generally relevant to the reliability of trial conclusions and to patient safety. These principles were then applied and further refined during a series of hands-on workshops to evaluate their utility in facilitating proactive, cross-functional dialogue, and decision-making about trial design and planning. Following these workshops, independent qualitative interviews were conducted with 19 workshop attendees to explore the potential challenges for implementing a quality-by-design approach to clinical trials. The Clinical Trials Transformation Initiative project team subsequently developed recommendations and an online resource guide to support implementation of this approach. The Clinical Trials Transformation Initiative quality-by-design principles provide a framework for assuring that clinical trials adequately safeguard participants and provide reliable information on which to make decisions on the effects of treatments. 
The quality-by-design workshops highlighted the value of active discussions incorporating the different perspectives within and external to an organization (e.g. clinical investigators, research site staff, and trial participants) in improving trial design. Workshop participants also recognized the value of focusing oversight on those aspects of the trial where errors would have a major impact on participant safety and reliability of results. Applying the Clinical Trials Transformation Initiative quality-by-design recommendations and principles should enable organizations to prioritize the most critical determinants of a trial's quality, identify non-essential activities that can be eliminated to streamline trial conduct and oversight, and formulate appropriate plans to define, avoid, mitigate, monitor, and address important errors. © The Author(s) 2016.
Reliability and relationship of the fear-avoidance beliefs questionnaire with the shoulder pain and disability index and numeric pain rating scale in patients with shoulder pain.

PubMed

Riley, Sean P; Tafuto, Vincent; Cote, Mark; Brismée, Jean-Michel; Wright, Alexis; Cook, Chad

2018-03-20

The purpose of this study was to determine: 1) the test-retest reliability of Fear-Avoidance Beliefs Questionnaire (FABQ) Work (FABQW) subscale, FABQ Physical Activity (FABQPA) subscale, Shoulder Pain and Disability Index (SPADI) Pain subscale, SPADI Disability subscale, and Numeric Pain Rating scale (NPRS); and 2) the relationship between the FABQPA, FABQW, SPADI pain, SPADI disability, and NPRS after 4 weeks of pragmatically applied physical therapy (PT) in patients with shoulder pain. Prospective, single-group observational design. Data were collected at initial evaluation, the first follow-up visit prior to the initiation of treatment, and after 4 weeks of treatment. Statistically significant Intraclass Correlation Coefficient (ICC 2,1) values were reported for the FABQPA, FABQW, SPADI Pain, SPADI Disability, and NPRS. A statistically significant moderate relationship between the FABQPA subscale, SPADI subscale, and NPRS could not be established prior to and after 4 weeks of pragmatically applied PT. Statistically significant differences were observed between the initial evaluation and four-week follow-up for the FABQPA, SPADI Pain, SPADI Disability, and NPRS (p < 0.01). Since a meaningful relationship between the FABQ, SPADI, and NPRS did not exist, it suggests that the FABQPA may be measuring a metric other than pain. This study suggests that the FABQW may not be sensitive to change over time.
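The ICC(2,1) statistic named above can be sketched from a two-way ANOVA decomposition. This example assumes the Shrout and Fleiss two-way random-effects, absolute-agreement, single-measurement form, with test occasions playing the role of "raters" in a test-retest design. The function name and example scores are hypothetical.

```python
def icc_2_1(ratings):
    """ICC(2,1) for a subjects-by-measurements matrix.

    ratings: list of rows, one per subject; each column is one rater
    or one test occasion.
    """
    n = len(ratings)        # subjects
    k = len(ratings[0])     # raters or occasions
    grand = sum(sum(row) for row in ratings) / (n * k)
    row_means = [sum(row) / k for row in ratings]
    col_means = [sum(row[j] for row in ratings) / n for j in range(k)]

    # Two-way ANOVA sums of squares.
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for row in ratings for x in row)
    ss_error = ss_total - ss_rows - ss_cols

    ms_rows = ss_rows / (n - 1)
    ms_cols = ss_cols / (k - 1)
    ms_error = ss_error / ((n - 1) * (k - 1))

    return (ms_rows - ms_error) / (
        ms_rows + (k - 1) * ms_error + k * (ms_cols - ms_error) / n
    )


# Hypothetical questionnaire scores for 5 patients on two occasions.
example = [[12, 13], [20, 18], [7, 9], [15, 15], [22, 21]]
print(round(icc_2_1(example), 3))
```

Because this form penalizes systematic shifts between occasions as well as random error, it is stricter than a plain correlation between test and retest scores.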
Evaluation of bacterial communities belonging to natural whey starters for Grana Padano cheese by length heterogeneity-PCR.

PubMed

Lazzi, C; Rossetti, L; Zago, M; Neviani, E; Giraffa, G

2004-01-01

To detect bacteria present in controlled dairy ecosystems with defined composition by length-heterogeneity (LH)-PCR. LH-PCR makes it possible to distinguish different organisms on the basis of natural variations in the length of 16S rRNA gene sequences. LH-PCR was applied to depict population structure of the lactic acid bacteria (LAB) species recoverable from Grana Padano cheese whey starters. Typical bacterial species present in the LAB community were evidenced and well discriminated. Small differences in species composition, e.g. the frequent finding of Streptococcus thermophilus and the constant presence of thermophilic lactobacilli (Lactobacillus helveticus, Lact. delbrueckii subsp. lactis/bulgaricus and Lact. fermentum) were reliably highlighted. Specificity of LH-PCR was confirmed by species-specific PCR from total DNA of the cultures. LH-PCR is a useful tool to monitor microbial composition and population dynamics in dairy starter cultures. When present, non-dominant bacterial species in the whey starters, such as Strep. thermophilus, can easily be visualized and characterized without isolating and cultivating single strains. A similar approach can be applied to more complex dairy ecosystems such as milk or cheese curd. Community members and differences in population structure of controlled dairy ecosystems such as whey starters for hard cheeses can be evaluated and compared in a relatively easy, fast, reliable and highly reproducible way.
Evaluating and comparing methods of sinkhole susceptibility mapping in the Ebro Valley evaporite karst (NE Spain)

NASA Astrophysics Data System (ADS)

Galve, J. P.; Gutiérrez, F.; Remondo, J.; Bonachea, J.; Lucha, P.; Cendrero, A.

2009-10-01

Multiple sinkhole susceptibility models have been generated in three study areas of the Ebro Valley evaporite karst (NE Spain) applying different methods (nearest neighbour distance, sinkhole density, heuristic scoring system and probabilistic analysis) for each sinkhole type separately (cover collapse sinkholes, cover and bedrock collapse sinkholes and cover and bedrock sagging sinkholes). The quantitative and independent evaluation of the predictive capability of the models reveals that: (1) The most reliable susceptibility models are those derived from the nearest neighbour distance and sinkhole density. These models can be generated in a simple and rapid way from detailed geomorphological maps. (2) The reliability of the nearest neighbour distance and density models is conditioned by the degree of clustering of the sinkholes. Consequently, the karst areas in which sinkholes show a higher clustering are a priori more favourable for predicting new occurrences. (3) The predictive capability of the best models obtained in this research is significantly higher (12.5-82.5%) than that of the heuristic sinkhole susceptibility model incorporated into the General Urban Plan for the municipality of Zaragoza. Although the probabilistic approach provides lower quality results than the methods based on sinkhole proximity and density, it helps to identify the most significant factors and select the most effective mitigation strategies and may be applied to model susceptibility in different future scenarios.
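The nearest neighbour distance idea mentioned above can be illustrated with a minimal sketch: each map cell is scored by its distance to the closest mapped sinkhole, and cells are ranked so that the closest ones are treated as most susceptible. This is only an assumed simplification of the approach. The coordinates, function names, and ranking rule are hypothetical and are not taken from the paper.

```python
import math


def nearest_sinkhole_distance(point, sinkholes):
    """Euclidean distance from a point to the closest mapped sinkhole."""
    return min(math.dist(point, s) for s in sinkholes)


def rank_cells_by_proximity(cells, sinkholes):
    """Order cells from most to least susceptible (closest first)."""
    return sorted(cells, key=lambda c: nearest_sinkhole_distance(c, sinkholes))


# Hypothetical sinkhole locations and cell centres, in metres.
sinkholes = [(0.0, 0.0), (10.0, 0.0)]
cells = [(50.0, 50.0), (3.0, 4.0), (9.0, 1.0)]
print(rank_cells_by_proximity(cells, sinkholes))
```

A ranking of this kind is only informative when sinkholes cluster, which matches the abstract's point that reliability depends on the degree of clustering.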
Augmented reality (AR) and virtual reality (VR) applied in dentistry.

PubMed

Huang, Ta-Ko; Yang, Chi-Hsun; Hsieh, Yu-Hsin; Wang, Jen-Chyan; Hung, Chun-Cheng

2018-04-01

The OSCE is a reliable evaluation method for the preclinical examination of dental students. Ideally, an OSCE assessment uses an augmented reality simulator for evaluation. This literature review investigated recent developments in virtual reality (VR) and augmented reality (AR), from the history of dentistry to progress in dental skills. Where technology is lacking, other devices are needed to increase the success rate and decrease the risk of surgery. The development of tracking units has changed surgical practice and education. Clinical surgery is based on mature education. VR and AR have simultaneously affected skills training and navigation systems. More broadly, VR and AR are applied not only in dental training and surgery but also in many other fields of life. Copyright © 2018. Published by Elsevier Taiwan.
Public services for distribution of drinking water and liquid sanitation in urban zones in Morocco Relevance of introduction the performance indicators for preservation water resources.

NASA Astrophysics Data System (ADS)

Habib, Akka; Abdelhamid, Bouzidi; Said, Housni

2018-05-01

Because of the absence of regulations and specific national norms, the unilaterally applied indicators for performance evaluation of water distribution management services are insufficient. This does not pave the way for clear visibility of water resources. The indicators are also so heterogeneous that they are not in equilibrium with the applied management patterns. In fact: 1- The performance (yield and linear loss index) of drinking water networks presents a discrepancy between operators and a lack of homogeneity in terms of the parameters put in its equation. Hence, these indicators lose efficiency and reliability; 2- Liquid sanitation service has to go beyond the quantitative evaluation target in order to consider the qualitative aspects of water. To reach this aim, a reasonable enlargement of performance indicators is of paramount importance in order to better manage water resources, which are becoming scarce and insufficient.
Measuring quality of life in patients with stress urinary incontinence: is the ICIQ-UI-SF adequate?

PubMed

Kurzawa, Zuzanna; Sutherland, Jason M; Crump, Trafford; Liu, Guiping

2018-05-08

The International Consultation on Incontinence Questionnaire Short Form (ICIQ-UI-SF) is a widely used four-item patient-reported outcome (PRO) measure. Evaluations of this instrument are limited, restricting users' confidence in the instrument. This study conducts a comprehensive evaluation of the ICIQ-UI-SF on a sample of urological surgery patients in Canada. One hundred and seventy-seven surgical patients with stress urinary incontinence completed the ICIQ-UI-SF pre-operatively. Methods drawing from confirmatory factor analysis (CFA), measures of reliability, item response theory (IRT), and differential item functioning were applied. Ceiling effects were examined. Ceiling effects were identified. In the CFA, the factor loadings of items one and two differed significantly (p < 0.001) from item three, indicating possible multidimensionality. The first two items reflect symptom severity, not quality of life. Reliability was moderate as measured by Cronbach's alpha (0.63) and McDonald's coefficient (0.65). The IRT found the instrument does not discriminate between individuals with low incontinence-related quality of life. Due to low/moderate reliability, the ICIQ-UI-SF can be used as a complement to other data or used to report aggregated surgical outcomes among surgical patients. If the primary objective is to measure quality of life, other PROs should be considered.

The Effect of Achievement Test Selection on Identification of Learning Disabilities within a Patterns of Strengths and Weaknesses Framework

PubMed Central

Miciak, Jeremy; Taylor, Pat; Denton, Carolyn A.; Fletcher, Jack M.

2014-01-01

Purpose Few empirical investigations have evaluated learning disabilities (LD) identification methods based on a pattern of cognitive strengths and weaknesses (PSW). This study investigated the reliability of LD classification decisions of the concordance/discordance method (C/DM) across different psychoeducational assessment batteries. Methods C/DM criteria were applied to assessment data from 177 second grade students based on two psychoeducational assessment batteries. The achievement tests were different, but were highly correlated and measured the same latent construct. Resulting LD identifications were then evaluated for agreement across batteries on LD status and the academic domain of eligibility. Results The two batteries identified a similar number of participants as having LD (80 and 74). However, indices of agreement for classification decisions were low (kappa = .29), especially for percent positive agreement (62%). The two batteries demonstrated agreement on the academic domain of eligibility for only 25 participants. Conclusions Cognitive discrepancy frameworks for LD identification are inherently unstable because of imperfect reliability and validity at the observed level. Methods premised on identifying a PSW profile may never achieve high reliability because of these underlying psychometric factors. An alternative is to directly assess academic skills to identify students in need of intervention. PMID:25243467
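The agreement indices reported above can be sketched from a 2x2 cross-classification of the two batteries' decisions. This example assumes Cohen's kappa and one common definition of positive agreement (twice the joint positives over the sum of each battery's positives); the abstract does not give the study's exact formula, and the counts below are hypothetical rather than the study's table.

```python
def agreement_indices(both, only_a, only_b, neither):
    """Return (kappa, positive_agreement) for two binary classifications.

    both:    cases identified by both batteries
    only_a:  cases identified only by battery A
    only_b:  cases identified only by battery B
    neither: cases identified by neither battery
    """
    n = both + only_a + only_b + neither
    observed = (both + neither) / n
    rate_a = (both + only_a) / n
    rate_b = (both + only_b) / n
    # Agreement expected by chance from the two marginal rates.
    expected = rate_a * rate_b + (1 - rate_a) * (1 - rate_b)
    kappa = (observed - expected) / (1 - expected)
    positive_agreement = 2 * both / (2 * both + only_a + only_b)
    return kappa, positive_agreement


# Hypothetical counts for 100 students (not the study's data).
kappa, ppa = agreement_indices(both=20, only_a=10, only_b=10, neither=60)
print(round(kappa, 3), round(ppa, 3))
```

Two batteries can flag a similar number of students while agreeing on few individuals, so the marginal totals alone say little about classification reliability.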
Development and psychometric evaluation of the Premarital Sexual Behavior Assessment Scale for Young Women (PSAS-YW): an exploratory mixed method study.

PubMed

Rahmani, Azam; Merghati-Khoei, Effat; Moghadam-Banaem, Lida; Hajizadeh, Ebrahim; Hamdieh, Mostafa; Montazeri, Ali

2014-06-13

Premarital sexual behaviors are an important issue for women's health. The present study was designed to develop and examine the psychometric properties of a scale in order to identify young women who are at greater risk of premarital sexual behavior. This was an exploratory mixed method investigation conducted in two phases. In the first phase, qualitative methods (focus group discussion and individual interview) were applied to generate items and develop the questionnaire. In the second phase, psychometric properties (validity and reliability) of the questionnaire were assessed. In the first phase an item pool containing 53 statements related to premarital sexual behavior was generated. In the second phase item reduction was applied and the final version of the questionnaire containing 26 items was developed. The psychometric properties of this final version were assessed and the results showed that the instrument has good structure and reliability. The results from exploratory factor analysis indicated a 5-factor solution for the instrument that jointly accounted for 57.4% of the variance observed. The Cronbach's alpha coefficient for the instrument was found to be 0.87. This study provided a valid and reliable scale to identify premarital sexual behavior in young women. Assessment of premarital sexual behavior might help to improve women's sexual abstinence.
Development and psychometric evaluation of the Premarital Sexual Behavior Assessment Scale for Young Women (PSAS-YW): an exploratory mixed method study

PubMed Central

2014-01-01

Background Premarital sexual behaviors are an important issue for women’s health. The present study was designed to develop and examine the psychometric properties of a scale in order to identify young women who are at greater risk of premarital sexual behavior. Method This was an exploratory mixed method investigation conducted in two phases. In the first phase, qualitative methods (focus group discussion and individual interview) were applied to generate items and develop the questionnaire. In the second phase, psychometric properties (validity and reliability) of the questionnaire were assessed. Results In the first phase an item pool containing 53 statements related to premarital sexual behavior was generated. In the second phase item reduction was applied and the final version of the questionnaire containing 26 items was developed. The psychometric properties of this final version were assessed and the results showed that the instrument has good structure and reliability. The results from exploratory factor analysis indicated a 5-factor solution for the instrument that jointly accounted for 57.4% of the variance observed. The Cronbach’s alpha coefficient for the instrument was found to be 0.87. Conclusion This study provided a valid and reliable scale to identify premarital sexual behavior in young women. Assessment of premarital sexual behavior might help to improve women’s sexual abstinence. PMID:24924696
18 CFR 40.1 - Applicability.

Code of Federal Regulations, 2010 CFR

2010-04-01

... ENERGY REGULATIONS UNDER THE FEDERAL POWER ACT MANDATORY RELIABILITY STANDARDS FOR THE BULK-POWER SYSTEM... in section 201(f) of the Federal Power Act. (b) Each Reliability Standard made effective by § 40.2... Reliability Standard applies. ...
Computerized evaluation of holographic interferograms for fatigue crack detection in riveted lap joints

NASA Astrophysics Data System (ADS)

Zhou, Xiang

Using an innovative portable holographic inspection and testing system (PHITS) developed at the Australian Defence Force Academy, fatigue cracks in riveted lap joints can be detected by visually inspecting the abnormal fringe changes recorded on holographic interferograms. In this thesis, for automatic crack detection, some modern digital image processing techniques are investigated and applied to holographic interferogram evaluation. Fringe analysis algorithms are developed for identification of the crack-induced fringe changes. Theoretical analysis of PHITS and riveted lap joints and two typical experiments demonstrate that the fatigue cracks in lightly-clamped joints induce two characteristic fringe changes: local fringe discontinuities at the cracking sites; and the global crescent fringe distribution near to the edge of the rivet hole. Both of the fringe features are used for crack detection in this thesis. As a basis of the fringe feature extraction, an algorithm for local fringe orientation calculation is proposed. For high orientation accuracy and computational efficiency, Gaussian gradient filtering and neighboring direction averaging are used to minimize the effects of image background variations and random noise. The neighboring direction averaging is also used to approximate the fringe directions in centerlines of bright and dark fringes. Experimental results indicate that for high orientation accuracy the scales of the Gaussian filter and neighboring direction averaging should be chosen according to the local fringe spacings. The orientation histogram technique is applied to detect the local fringe discontinuity due to the fatigue cracks. The Fourier descriptor technique is used to characterize the global fringe distribution change from a circular to a crescent distribution with the fatigue crack growth. Experiments and computer simulations are conducted to analyze the detectability and reliability of crack detection using the two techniques. 
Results demonstrate that the Fourier descriptor technique is more promising in the detection of the short cracks near the edge of the rivet head. However, it is not as reliable as the fringe orientation technique for detection of the long through cracks. For reliability, both techniques should be used in practical crack detection. Neither the Fourier descriptor technique nor the orientation histogram technique has previously been applied to holographic interferometry. While this work relates primarily to interferograms of cracked rivets, the techniques could readily be applied to other areas of fringe pattern analysis.
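The local orientation step described in this abstract can be sketched roughly as follows. This is a minimal illustration, not the thesis's algorithm: it assumes a grayscale interferogram held in a NumPy array, uses a double-angle representation to average directions (a common choice that the abstract does not confirm), and the names `fringe_orientation`, `orientation_histogram`, `grad_sigma` and `avg_sigma` are hypothetical.

```python
import numpy as np

def gaussian_kernel(sigma):
    """1-D Gaussian kernel truncated at about three standard deviations."""
    radius = int(3 * sigma + 0.5)
    x = np.arange(-radius, radius + 1)
    k = np.exp(-x**2 / (2.0 * sigma**2))
    return k / k.sum()

def smooth(img, sigma):
    """Separable Gaussian smoothing (zero-padded at the image borders)."""
    k = gaussian_kernel(sigma)
    out = np.apply_along_axis(lambda r: np.convolve(r, k, mode="same"), 1, img)
    return np.apply_along_axis(lambda c: np.convolve(c, k, mode="same"), 0, out)

def fringe_orientation(img, grad_sigma=2.0, avg_sigma=4.0):
    """Estimate the local fringe direction (radians in [0, pi)) at each pixel.

    grad_sigma: scale of the Gaussian gradient filter (assumed parameter).
    avg_sigma:  scale of the neighbouring-direction averaging (assumed parameter).
    """
    gy, gx = np.gradient(smooth(img.astype(float), grad_sigma))
    # Directions are only defined modulo pi, so average the doubled angle.
    c2 = smooth(gx * gx - gy * gy, avg_sigma)
    s2 = smooth(2.0 * gx * gy, avg_sigma)
    gradient_dir = 0.5 * np.arctan2(s2, c2)
    # Fringes run perpendicular to the intensity gradient.
    return (gradient_dir + np.pi / 2.0) % np.pi

def orientation_histogram(theta, bins=18):
    """Normalised histogram of orientations over a region of interest."""
    hist, _ = np.histogram(theta, bins=bins, range=(0.0, np.pi))
    return hist / hist.sum()
```

In line with the abstract's remark that the filter scales should follow the local fringe spacing, `grad_sigma` and `avg_sigma` would be tuned per image; a local fringe discontinuity would then appear as an extra peak or spread in the regional orientation histogram.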
Validity and Reliability of the Persian Version of the Dysphagia Handicap Index (DHI).

PubMed

Asadollahpour, Faezeh; Baghban, Kowsar; Asadi, Mozhgan

2015-05-01

The Dysphagia Handicap Index (DHI) is one of the instruments used for measuring a dysphagic patient's self-assessment. In some ways, it reflects the patient's quality of life. Although it has been recognized and widely applied in English-speaking populations, it has not been used in its present form in Persian-speaking countries. The purpose of this study was to adapt a Persian version of the DHI and to evaluate its validity, consistency, and reliability in the Persian population with oropharyngeal dysphagia. Several stages of cross-cultural adaptation were performed, consisting of translation, synthesis, back translation, review by an expert committee, and final proofreading. The generated Persian DHI was administered to 85 patients with oropharyngeal dysphagia and 89 control subjects at Zahedan city between May 2013 and August 2013. The patients and control subjects answered the same questionnaire 2 weeks later to verify the test-retest reliability. Internal consistency and test-retest reliability were evaluated. The results of the patients and the control group were compared. The Persian DHI showed good internal consistency (Cronbach's alpha coefficients range from 0.82 to 0.94). Also, good test-retest reliability was found for the total scores of the Persian DHI (r=0.89). There was a significant difference between the DHI scores of the control group and those of the oropharyngeal dysphagia group (P<0.001). The Persian version of the DHI achieved face and translation validity. This study demonstrated that the Persian DHI is a valid tool for self-assessment of the handicapping effects of dysphagia on the physical, functional, and emotional aspects of patient life and can be a useful tool for screening and treatment planning for the Persian-speaking dysphagic patients, regardless of the cause or the severity of the dysphagia.
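For readers unfamiliar with the internal-consistency statistic reported here, Cronbach's alpha can be computed from a respondents-by-items score matrix as sketched below. The function name `cronbach_alpha` and the toy data are illustrative assumptions; the study's own data and software are not given in the abstract.

```python
import numpy as np

def cronbach_alpha(scores):
    """Cronbach's alpha for a 2-D array: rows = respondents, columns = items."""
    scores = np.asarray(scores, dtype=float)
    k = scores.shape[1]                          # number of items
    item_var = scores.var(axis=0, ddof=1)        # sample variance of each item
    total_var = scores.sum(axis=1).var(ddof=1)   # variance of the total score
    return k / (k - 1) * (1.0 - item_var.sum() / total_var)

# Toy example (made-up scores): three items that rise and fall together.
toy = [[1, 1, 1],
       [2, 2, 2],
       [3, 3, 3]]
alpha = cronbach_alpha(toy)
```

The test-retest coefficient (r) reported in the abstract is a correlation between the two administrations; with NumPy it could be obtained from `np.corrcoef(first_scores, second_scores)[0, 1]`, where both argument names are placeholders.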
An evidence-based decision assistance model for predicting training outcome in juvenile guide dogs.

PubMed

Harvey, Naomi D; Craigon, Peter J; Blythe, Simon A; England, Gary C W; Asher, Lucy

2017-01-01

Working dog organisations, such as Guide Dogs, need to regularly assess the behaviour of the dogs they train. In this study we developed a questionnaire-style behaviour assessment completed by training supervisors of juvenile guide dogs aged 5, 8 and 12 months old (n = 1,401), and evaluated aspects of its reliability and validity. Specifically, internal reliability, temporal consistency, construct validity, predictive criterion validity (comparing against later training outcome) and concurrent criterion validity (comparing against a standardised behaviour test) were evaluated. Thirty-nine questions were sourced either from previously published literature or created to meet requirements identified via Guide Dogs staff surveys and staff feedback. Internal reliability analyses revealed seven reliable and interpretable trait scales named according to the questions within them as: Adaptability; Body Sensitivity; Distractibility; Excitability; General Anxiety; Trainability and Stair Anxiety. Intra-individual temporal consistency of the scale scores between 5-8, 8-12 and 5-12 months was high. All scales except Body Sensitivity showed some degree of concurrent criterion validity. Predictive criterion validity was supported for all seven scales, since associations were found with training outcome at at least one age. Thresholds of z-scores on the scales were identified that were able to distinguish later training outcome by identifying 8.4% of all dogs withdrawn for behaviour and 8.5% of all qualified dogs, with 84% and 85% specificity. The questionnaire assessment was reliable and could detect traits that are consistent within individuals over time, despite juvenile dogs undergoing development during the study period. By applying thresholds to scores produced from the questionnaire this assessment could prove to be a highly valuable decision-making tool for Guide Dogs.
This is the first questionnaire-style assessment of juvenile dogs that has shown value in predicting the training outcome of individual working dogs.
Reliability, Validity, and Clinical Utility of the Dominic Interactive for Adolescents-Revised: A DSM-5-Based Self-Report Screen for Mental Disorders, Borderline Personality Traits, and Suicidality.

PubMed

Bergeron, Lise; Smolla, Nicole; Berthiaume, Claude; Renaud, Johanne; Breton, Jean-Jacques; St-Georges, Marie; Morin, Pauline; Zavaglia, Elissa; Labelle, Réal

2017-03-01

The Dominic Interactive for Adolescents-Revised (DIA-R) is a multimedia self-report screen for 9 mental disorders, borderline personality traits, and suicidality defined by the fifth edition of the Diagnostic and Statistical Manual of Mental Disorders (DSM-5). This study aimed to examine the reliability and the validity of this instrument. French- and English-speaking adolescents aged 12 to 15 years (N = 447) were recruited from schools and clinical settings in Montreal and were evaluated twice. The internal consistency was estimated by Cronbach alpha coefficients and the test-retest reliability by intraclass correlation coefficients. Cutoff points on the DIA-R scales were determined by using clinically relevant measures for defining external validation criteria: the Schedule for Affective Disorders and Schizophrenia for School-Aged Children, the Beck Hopelessness Scale, and the Abbreviated-Diagnostic Interview for Borderlines. Receiver operating characteristic (ROC) analyses provided accuracy estimates (area under the ROC curve, sensitivity, specificity, likelihood ratio) to evaluate the ability of the DIA-R scales to predict external criteria. For most of the DIA-R scales, reliability coefficients were excellent or moderate. High or moderate accuracy estimates from ROC analyses demonstrated the ability of the DIA-R thresholds to predict psychopathological conditions. These thresholds were generally capable of discriminating between clinical and school subsamples. However, the validity of the obsessions/compulsions scale was too low. Findings clearly support the reliability and the validity of the DIA-R. This instrument may be useful to assess a wide range of adolescents' mental health problems in the continuum of services. This conclusion applies to all scales, except the obsessions/compulsions one.
[Translation and Validation of the FOUR Scale for Children and its Use as Outcome Predictor: A Pilot Study].

PubMed

Ferreira, Sofia Simões; Meireles, Daniel; Pinto, Alexandra; Abecasis, Francisco

2017-09-29

The Full Outline of UnResponsiveness (FOUR) scale has been previously validated to assess impaired consciousness in the adult population. The aim of this study is the translation into Portuguese and validation of the FOUR scale in the pediatric population. The study also compares the FOUR scale and Glasgow coma scale score ratings and the clinical outcome of patients hospitalized in Pediatric Intensive Care Units. This study prospectively rated patients admitted to the Pediatric Intensive Care Units with impaired consciousness during one year. Both scales were applied daily to patients by three types of examiners: intensivists, residents and nurses, from the moment of admission until clinical discharge. Neurological sequelae were evaluated using the King's Outcome Scale for Childhood Head Injury (KOSCHI). Twenty-seven patients between one and 17 years of age were included. Both scales are reliable and inter-rater reliability was greater for the FOUR score. Glasgow coma scale showed a minimum score in eight evaluations, whereas the FOUR scale obtained the minimum score in only two of these evaluations. In both scales there was a strong association between the admission score and the patient's outcome (area under curve FOUR = 0.939, versus Glasgow coma scale = 0.925). The FOUR scale provides more neurological information than Glasgow coma scale in patients with impaired consciousness and has prognostic interest. The FOUR scale can be applied in patients admitted with impaired consciousness in Pediatric Intensive Care Units. We think that a multicenter study would be very beneficial for confirming and generalizing these results.
Universal first-order reliability concept applied to semistatic structures

NASA Technical Reports Server (NTRS)

Verderaime, V.

1994-01-01

A reliability design concept was developed for semistatic structures which combines the prevailing deterministic method with the first-order reliability method. The proposed method surmounts deterministic deficiencies in providing uniformly reliable structures and improved safety audits. It supports risk analyses and reliability selection criterion. The method provides a reliability design factor derived from the reliability criterion which is analogous to the current safety factor for sizing structures and verifying reliability response. The universal first-order reliability method should also be applicable for air and surface vehicles semistatic structures.
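As a rough illustration of how a first-order reliability index relates to a conventional safety factor, the sketch below treats strength and applied stress as independent normal variables, for which the first-order result is exact. The abstract gives no formulas, so this normal-normal case, the function names, and the numbers are assumptions for illustration only.

```python
from math import erf, sqrt

def reliability_index(mu_strength, sd_strength, mu_stress, sd_stress):
    """First-order reliability index for the linear limit state g = strength - stress."""
    return (mu_strength - mu_stress) / sqrt(sd_strength**2 + sd_stress**2)

def reliability(beta):
    """Probability of survival: standard normal CDF evaluated at beta."""
    return 0.5 * (1.0 + erf(beta / sqrt(2.0)))

def central_safety_factor(mu_strength, mu_stress):
    """Deterministic analogue: ratio of mean strength to mean stress."""
    return mu_strength / mu_stress

# Hypothetical values in MPa, chosen only to exercise the functions.
beta = reliability_index(500.0, 30.0, 380.0, 40.0)
survival = reliability(beta)
factor = central_safety_factor(500.0, 380.0)
```

The point of such a comparison is that two designs with the same safety factor can have different reliability indices when their scatter differs, which is the kind of non-uniformity the abstract attributes to the purely deterministic method.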
Universal first-order reliability concept applied to semistatic structures

NASA Astrophysics Data System (ADS)

Verderaime, V.

1994-07-01

A reliability design concept was developed for semistatic structures which combines the prevailing deterministic method with the first-order reliability method. The proposed method surmounts deterministic deficiencies in providing uniformly reliable structures and improved safety audits. It supports risk analyses and reliability selection criterion. The method provides a reliability design factor derived from the reliability criterion which is analogous to the current safety factor for sizing structures and verifying reliability response. The universal first-order reliability method should also be applicable for air and surface vehicles semistatic structures.
Research on dynamic routing mechanisms in wireless sensor networks.

PubMed

Zhao, A Q; Weng, Y N; Lu, Y; Liu, C Y

2014-01-01

WirelessHART is currently the most widely applied standard in wireless sensor networks. However, it does not provide any dynamic routing mechanism, which is important for the reliability and robustness of wireless network applications. In this paper, a dynamic routing mechanism based on the collection tree protocol is proposed for WirelessHART networks. The mechanism was evaluated through several simulation experiments in three respects: time for generating the topology, link quality, and network stability. In addition, the data transmission efficiency of the routing mechanism was analyzed. The simulation and evaluation results show that this mechanism can act as a dynamic routing mechanism for TDMA-based wireless sensor networks.
Reliability Analysis and Modeling of ZigBee Networks

NASA Astrophysics Data System (ADS)

Lin, Cheng-Min

The architecture of ZigBee networks focuses on developing low-cost, low-speed ubiquitous communication between devices. The ZigBee technique is based on IEEE 802.15.4, which specifies the physical layer and medium access control (MAC) for a low rate wireless personal area network (LR-WPAN). Currently, numerous wireless sensor networks have adopted the ZigBee open standard to develop various services to promote improved communication quality in our daily lives. The problem of system and network reliability in providing stable services has become more important because these services will be stopped if the system and network reliability is unstable. The ZigBee standard has three kinds of networks: star, tree and mesh. The paper models the ZigBee protocol stack from the physical layer to the application layer and analyzes the reliability and mean time to failure (MTTF) of these layers. Channel resource usage, device role, network topology and application objects are used to evaluate reliability in the physical, medium access control, network, and application layers, respectively. In the star or tree networks, a series system and the reliability block diagram (RBD) technique can be used to solve their reliability problem. For mesh networks, however, a division technique is applied because their network complexity is higher than that of the other two. A mesh network using the division technique is classified into several non-reducible series systems and edge parallel systems. Hence, the reliability of mesh networks is easily solved using series-parallel systems through our proposed scheme. The numerical results demonstrate that the reliability will increase for mesh networks when the number of edges in parallel systems increases, while the reliability quickly drops when the number of edges and the number of nodes increase for all three networks. Greater resource usage is another factor that decreases reliability.
However, lower network reliability will occur due to network complexity, more resource usage and complex object relationship.
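The series and parallel building blocks that this abstract relies on can be sketched with the standard reliability block diagram formulas. This is a generic illustration assuming independent components; it is not the paper's division scheme, and the component values are invented.

```python
from math import prod

def series(reliabilities):
    """A series system works only if every component works."""
    return prod(reliabilities)

def parallel(reliabilities):
    """A parallel system fails only if every component fails."""
    return 1.0 - prod(1.0 - r for r in reliabilities)

def mttf_series(failure_rates):
    """MTTF of a series system, assuming constant (exponential) failure rates."""
    return 1.0 / sum(failure_rates)

# Hypothetical fragment: a coordinator (0.95) in series with two
# redundant links (0.90 each) that form a parallel pair.
fragment = series([0.95, parallel([0.90, 0.90])])
```

Adding a redundant edge raises the parallel term, while every extra element placed in series multiplies in another factor below one, which matches the qualitative trends reported in the abstract.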
The Yale-Brown Obsessive Compulsive Scale: A Reliability Generalization Meta-Analysis.

PubMed

López-Pina, José Antonio; Sánchez-Meca, Julio; López-López, José Antonio; Marín-Martínez, Fulgencio; Núñez-Núñez, Rosa Maria; Rosa-Alcázar, Ana I; Gómez-Conesa, Antonia; Ferrer-Requena, Josefa

2015-10-01

The Yale-Brown Obsessive Compulsive Scale (Y-BOCS) is the most frequently applied test to assess obsessive compulsive symptoms. We conducted a reliability generalization meta-analysis on the Y-BOCS to estimate the average reliability, examine the variability among the reliability estimates, search for moderators, and propose a predictive model that researchers and clinicians can use to estimate the expected reliability of the Y-BOCS. We included studies where the Y-BOCS was applied to a sample of adults and reliability estimate was reported. Out of the 11,490 references located, 144 studies met the selection criteria. For the total scale, the mean reliability was 0.866 for coefficients alpha, 0.848 for test-retest correlations, and 0.922 for intraclass correlations. The moderator analyses led to a predictive model where the standard deviation of the total test and the target population (clinical vs. nonclinical) explained 38.6% of the total variability among coefficients alpha. Finally, clinical implications of the results are discussed. © The Author(s) 2014.
Air temperature thresholds to evaluate snow melting at the surface of Alpine glaciers by T-index models: the case study of Forni Glacier (Italy)

NASA Astrophysics Data System (ADS)

Senese, A.; Maugeri, M.; Vuillermoz, E.; Smiraglia, C.; Diolaiuti, G.

2014-03-01

The glacier melt conditions (i.e., null surface temperature and positive energy budget) can be assessed by analyzing meteorological and energy data acquired by a supraglacial Automatic Weather Station (AWS). When such a station is not available, assessing actual melting conditions and evaluating the melt amount is difficult, and simple methods based on T-index (or degree-day) models are generally applied. These models require the choice of a correct temperature threshold. In fact, melt does not necessarily occur at daily air temperatures higher than 273.15 K. In this paper, to detect the threshold most indicative of melt conditions in the April-June period, we analyzed air temperature data recorded from 2006 to 2012 by a supraglacial AWS set up at 2631 m a.s.l. on the ablation tongue of the Forni Glacier (Italian Alps), and by a weather station located outside the studied glacier (at Bormio, a village at 1225 m a.s.l.). Moreover, we evaluated the glacier energy budget and the Snow Water Equivalent (SWE) values during this time frame. The snow ablation amount was then estimated both from the surface energy balance (from supraglacial AWS data) and from the T-index method (from Bormio data, applying the mean tropospheric lapse rate and varying the air temperature threshold), and the results were compared. We found that the mean tropospheric lapse rate permits a good and reliable reconstruction of glacier air temperatures and that the major uncertainty in the computation of snow melt is driven by the choice of an appropriate temperature threshold. Our study indicates that using a threshold value 5.0 K lower than the widely applied 273.15 K permits the most reliable reconstruction of glacier melt.
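A T-index calculation of the kind described can be sketched as below. The station elevations and the 268.15 K threshold come from the abstract; the lapse rate of 6.5 K per km is the standard mean tropospheric value, and the degree-day factor `ddf` is a placeholder, since the abstract does not report the value used. Function names are hypothetical.

```python
LAPSE_RATE = -0.0065   # K per metre (standard mean tropospheric lapse rate)
Z_VALLEY = 1225.0      # m a.s.l., Bormio station (from the abstract)
Z_GLACIER = 2631.0     # m a.s.l., supraglacial AWS site (from the abstract)

def glacier_temperature(t_valley_k):
    """Shift a valley air temperature (K) to the glacier elevation."""
    return t_valley_k + LAPSE_RATE * (Z_GLACIER - Z_VALLEY)

def t_index_melt(daily_valley_temps_k, threshold_k=268.15, ddf=4.0):
    """Cumulative melt (mm w.e.) from daily mean valley temperatures.

    threshold_k: air temperature above which melt is assumed to occur.
    ddf:         degree-day factor in mm w.e. per K per day (assumed value).
    """
    melt = 0.0
    for t_valley in daily_valley_temps_k:
        excess = glacier_temperature(t_valley) - threshold_k
        if excess > 0.0:
            melt += ddf * excess
    return melt
```

For a single day at 283.15 K in the valley, the reconstructed glacier temperature is about 274.0 K, so lowering the threshold from 273.15 K to 268.15 K changes the computed melt by 5 K times the degree-day factor, which shows why the threshold choice dominates the uncertainty.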
Knowledge translation from continuing education to physiotherapy practice in classifying patients with low back pain.

PubMed

Karvonen, Eira; Paatelma, Markku; Kesonen, Jukka-Pekka; Heinonen, Ari O

2015-05-01

Physical therapists have used continuing education as a method of improving their skills in conducting clinical examination of patients with low back pain (LBP). The purpose of this study was to evaluate how well the pathoanatomical classification of patients with acute or subacute LBP can be learned and applied through a continuing education format. The patients were seen in a direct access setting. The study was carried out in a large health-care center in Finland. The analysis included a total of 57 patient evaluations generated by six physical therapists on patients with LBP. We analyzed the consistency and level of agreement of the diagnostic decisions of the six physiotherapists (PTs), who participated in a 5-day, intensive continuing education session, and compared those decisions with the diagnostic opinions of two expert physical therapists, who were blind to the original diagnostic decisions. Evaluation of the physical therapists' clinical examination of the patients was conducted by the two experts, in order to determine the accuracy and percentage agreement of the pathoanatomical diagnoses. The percentage of agreement between the experts and PTs was 72-77%. The overall inter-examiner reliability (kappa coefficient) for the subgroup classification between the six PTs and two experts was 0.63 [95% confidence interval (CI): 0.47-0.77], indicating good agreement between the PTs and the two experts. The overall inter-examiner reliability between the two experts was 0.63 (0.49-0.77), indicating a good level of agreement. Our results indicate that PTs were able to apply their continuing education training to clinical reasoning and make consistently accurate pathoanatomic-based diagnostic decisions for patients with LBP.
This would suggest that continuing education short-courses provide a reasonable format for knowledge translation (KT) by which physical therapists can learn and apply new information related to the examination and differential diagnosis of patients in acute or subacute LBP.
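The kappa coefficient reported above corrects raw percentage agreement for agreement expected by chance. A minimal two-rater version (Cohen's kappa) is sketched below; the study's exact multi-rater procedure is not described in the abstract, and the category labels in the example are invented.

```python
from collections import Counter

def cohens_kappa(rater_a, rater_b):
    """Cohen's kappa for two raters classifying the same cases."""
    n = len(rater_a)
    # Observed proportion of cases on which the raters agree.
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Chance agreement from each rater's marginal category frequencies.
    count_a, count_b = Counter(rater_a), Counter(rater_b)
    expected = sum(count_a[c] * count_b[c] for c in count_a) / n**2
    return (observed - expected) / (1.0 - expected)

# Hypothetical classifications of four patients by a PT and an expert.
pt_labels = ["disc", "disc", "facet", "facet"]
expert_labels = ["disc", "facet", "facet", "facet"]
kappa = cohens_kappa(pt_labels, expert_labels)
```

In this toy case the raters agree on 75% of patients, but half of that agreement would be expected by chance, so kappa is 0.5; this is why a kappa of 0.63 can accompany a raw agreement of 72-77%.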
The composition and initial evaluation of a grimace scale in ferrets after surgical implantation of a telemetry probe.

PubMed

Reijgwart, Marsinah L; Schoemaker, Nico J; Pascuzzo, Riccardo; Leach, Matthew C; Stodel, Melanie; de Nies, Loes; Hendriksen, Coenraad F M; van der Meer, Miriam; Vinke, Claudia M; van Zeeland, Yvonne R A

2017-01-01

Reliable recognition of pain is difficult in ferrets as many currently available parameters are non-specific, inconsistent and/or impractical. Grimace scales have successfully been applied to assess pain in different animal species and might also be applicable to ferrets. To compose a Ferret Grimace Scale (FGS), we studied the facial musculature of ferrets and compared lateral photographs of 19 ferret faces at six time points before and after intraperitoneal telemetry probe implantation. We identified the Action Units (AUs) orbital tightening, nose bulging, cheek bulging, ear changes and whisker retraction as potential indicators of pain in ferrets. To evaluate whether these AUs could reliably be used to identify photographs taken before and after surgery, the photographs were scored 0, 1 or 2 (not, moderately or obviously present) by 11 observers that were blinded to the treatment and timing of the photographs. All AU-scores assigned to the photographs taken five hours after surgery were significantly higher compared to their time-matched baseline scores. Further analysis using the weights that were obtained using a Linear Discriminant Analysis revealed that scoring orbital tightening alone was sufficient to make this distinction with high sensitivity, specificity and accuracy. Including weighted scores for nose bulging, cheek bulging and ear change did not change this. As these AUs had more missing values than orbital tightening, their descriptions should be re-evaluated. Including whisker retraction, which had a negative weight, resulted in lower accuracy and should therefore in its current form be left out of the FGS. Overall, the results of this study suggest that the FGS and the AU orbital tightening in particular could be useful in a multifactorial pain assessment protocol for ferrets. 
However, before applying the FGS in practice, it should be further validated by incorporating more time points before and after applying (different) painful stimuli, and different levels of analgesia.
The composition and initial evaluation of a grimace scale in ferrets after surgical implantation of a telemetry probe

PubMed Central

Schoemaker, Nico J.; Pascuzzo, Riccardo; Leach, Matthew C.; Stodel, Melanie; de Nies, Loes; Hendriksen, Coenraad F. M.; van der Meer, Miriam; Vinke, Claudia M.; van Zeeland, Yvonne R. A.

2017-01-01

Reliable recognition of pain is difficult in ferrets as many currently available parameters are non-specific, inconsistent and/or impractical. Grimace scales have successfully been applied to assess pain in different animal species and might also be applicable to ferrets. To compose a Ferret Grimace Scale (FGS), we studied the facial musculature of ferrets and compared lateral photographs of 19 ferret faces at six time points before and after intraperitoneal telemetry probe implantation. We identified the Action Units (AUs) orbital tightening, nose bulging, cheek bulging, ear changes and whisker retraction as potential indicators of pain in ferrets. To evaluate whether these AUs could reliably be used to identify photographs taken before and after surgery, the photographs were scored 0, 1 or 2 (not, moderately or obviously present) by 11 observers that were blinded to the treatment and timing of the photographs. All AU-scores assigned to the photographs taken five hours after surgery were significantly higher compared to their time-matched baseline scores. Further analysis using the weights that were obtained using a Linear Discriminant Analysis revealed that scoring orbital tightening alone was sufficient to make this distinction with high sensitivity, specificity and accuracy. Including weighted scores for nose bulging, cheek bulging and ear change did not change this. As these AUs had more missing values than orbital tightening, their descriptions should be re-evaluated. Including whisker retraction, which had a negative weight, resulted in lower accuracy and should therefore in its current form be left out of the FGS. Overall, the results of this study suggest that the FGS and the AU orbital tightening in particular could be useful in a multifactorial pain assessment protocol for ferrets. 
However, before applying the FGS in practice, it should be further validated by incorporating more time points before and after applying (different) painful stimuli, and different levels of analgesia. PMID:29131858
Improving reliability of a residency interview process.

PubMed

Peeters, Michael J; Serres, Michelle L; Gundrum, Todd E

2013-10-14

To improve the reliability and discrimination of a pharmacy resident interview evaluation form, and thereby improve the reliability of the interview process. In phase 1 of the study, authors used a Many-Facet Rasch Measurement model to optimize an existing evaluation form for reliability and discrimination. In phase 2, interviewer pairs used the modified evaluation form within 4 separate interview stations. In phase 3, 8 interviewers individually-evaluated each candidate in one-on-one interviews. In phase 1, the evaluation form had a reliability of 0.98 with person separation of 6.56; reproducibly, the form separated applicants into 6 distinct groups. Using that form in phase 2 and 3, our largest variation source was candidates, while content specificity was the next largest variation source. The phase 2 g-coefficient was 0.787, while confirmatory phase 3 was 0.922. Process reliability improved with more stations despite fewer interviewers per station-impact of content specificity was greatly reduced with more interview stations. A more reliable, discriminating evaluation form was developed to evaluate candidates during resident interviews, and a process was designed that reduced the impact from content specificity.
Solid Insulated Switchgear and Investigation of its Mechanical and Electrical Reliability

NASA Astrophysics Data System (ADS)

Sato, Junichi; Kinoshita, Susumu; Sakaguchi, Osamu; Miyagawa, Masaru; Shimizu, Toshio; Homma, Mitsutaka

SF6 gas is widely applied to medium voltage switchgear because of its high insulation reliability and down-sizing ability. However, SF6 gas was placed on the list of greenhouse gases under the Kyoto Protocol in 1997. Since then, research and development on SF6-free or reduced-SF6 equipment has been actively pursued. We therefore turned our attention to solid material, which has higher dielectric strength than SF6, and have newly developed solid insulated switchgear (SIS) achieved by molding the entire main circuit. A new epoxy casting material is applied, which contains a large amount of spherical silica and a small amount of rubber particles. This new material has high mechanical strength, high thermal resistance, high toughness, and high dielectric strength, as required for directly molding the vacuum bottle, for down-sizing, and for reliability. This paper describes the technology of the new epoxy casting material that makes the SIS possible. In addition, mechanical and electrical reliability tests of the SIS using the new epoxy resin were carried out, and the effectiveness of the developed material and the mechanical and electrical reliability of the SIS were verified.

Claims about the Reliability of Student Evaluations of Instruction: The Ecological Fallacy Rides Again

ERIC Educational Resources Information Center

Morley, Donald D.

2012-01-01

The vast majority of the research on student evaluation of instruction has assessed the reliability of groups of courses and yielded either a single reliability coefficient for the entire group, or grouped reliability coefficients for each student evaluation of teaching (SET) item. This manuscript argues that these practices constitute a form of…
Development of the Portuguese version of the modified Japanese Orthopaedic Association Score: cross-cultural adaptation, reliability, validity and responsiveness.

PubMed

Augusto, Mateus Tomaz; Diniz, Juliete Melo; Rolemberg Dantas, Fernando Luiz; Fernandes de Oliveira, Matheus; Rotta, José Marcus; Botelho, Ricardo Vieira

2018-06-01

Spondylotic cervical myelopathy (SCM) is a common cause of spinal-related disability in the elderly. The assessment of this disability is a challenging task and depends on the subjective evaluation of the investigator. As a widely used scale, the modified scale of the Japanese Association of Orthopedics (mJOA) should be translated and culturally adapted into the Brazilian Portuguese language (mJOA-Br) to enable its clinical and research use. This study aims to translate, transculturally adapt and validate the mJOA in the Brazilian Portuguese language. Following the transcultural adaptation model described by Guillemin et al., the scale was translated into Brazilian Portuguese and back-translated to English. Afterwards, questionnaires were applied in consecutive patients with SCM and compared to a control group (without SCM). The final scale was compared to the Brazilian version of the Neck Disability Index (NDI) for validation. Sixty patients were submitted to the translated version of mJOA. There was a strong correlation between mJOA-Br scores and NDI scores in evaluating SCM symptoms (R=-0.75). mJOA-Br was considered a valid and reliable tool to evaluate SCM patients. Copyright © 2018 Elsevier Inc. All rights reserved.
[Research of the Epworth sleepiness scale based on fuzzy comprehensive evaluation].

PubMed

Li, P; Lv, Y H; Ma, L; Yang, S H; Xiang, Y; Lei, Q; Du, G D; Huang, D J

2017-03-05

Objective: This research explores the effect of Epworth sleepiness scale (ESS) items on domestic patients. Method: Four thousand six hundred and thirty-three suspected OSAHS patients with snoring were selected from the respiratory sleep center in the First People's Hospital, Yunnan province, between January 2006 and December 2012. These patients filled in the ESS before the PSG test. First, the questionnaires were preprocessed, and the null and incorrect ones were deleted. Then, fuzzy comprehensive evaluation was applied to obtain the value of each item in the ESS. Finally, the reliability was compared before and after the removal of the lowest-valued items. Result: Fuzzy comprehensive evaluation results show that the total value is 1.016, and that the item value of "Sitting and talking to someone" and "In a car, while stopped for a few minutes in traffic" is the lowest, at 0.131. The result of the reliability analysis shows that the value increases by 0.2% after the two items are deleted. Conclusion: Some items of the ESS are not suitable for Chinese patients, and they need to be deleted or modified to improve the screening efficiency. Copyright© by the Editorial Department of Journal of Clinical Otorhinolaryngology Head and Neck Surgery.
Interpreting the cross-sectional flow field in a river bank based on a genetic-algorithm two-dimensional heat-transport method (GA-VS2DH)

NASA Astrophysics Data System (ADS)

Su, Xiaoru; Shu, Longcang; Chen, Xunhong; Lu, Chengpeng; Wen, Zhonghui

2016-12-01

Interactions between surface waters and groundwater are of great significance for evaluating water resources and protecting ecosystem health. Heat as a tracer method is widely used in determination of the interactive exchange with high precision, low cost and great convenience. The flow in a river-bank cross-section occurs in vertical and lateral directions. In order to depict the flow path and its spatial distribution in bank areas, a genetic algorithm (GA) two-dimensional (2-D) heat-transport nested-loop method for variably saturated sediments, GA-VS2DH, was developed based on Microsoft Visual Basic 6.0. VS2DH was applied to model a 2-D bank-water flow field and GA was used to calibrate the model automatically by minimizing the difference between observed and simulated temperatures in bank areas. A hypothetical model was developed to assess the reliability of GA-VS2DH in inverse modeling in a river-bank system. Some benchmark tests were conducted to recognize the capability of GA-VS2DH. The results indicated that the simulated seepage velocity and parameters associated with GA-VS2DH were acceptable and reliable. Then GA-VS2DH was applied to two field sites in China with different sedimentary materials, to verify the reliability of the method. GA-VS2DH could be applied in interpreting the cross-sectional 2-D water flow field. The estimates of horizontal hydraulic conductivity at the Dawen River and Qinhuai River sites are 1.317 and 0.015 m/day, which correspond to sand and clay sediment in the two sites, respectively.
Engineering evaluation of SSME dynamic data from engine tests and SSV flights

NASA Technical Reports Server (NTRS)

1986-01-01

An engineering evaluation of dynamic data from SSME hot firing tests and SSV flights is summarized. The basic objective of the study is to provide analyses of vibration, strain and dynamic pressure measurements in support of MSFC performance and reliability improvement programs. A brief description of the SSME test program is given and a typical test evaluation cycle reviewed. Data banks generated to characterize SSME component dynamic characteristics are described and statistical analyses performed on these data base measurements are discussed. Analytical models applied to define the dynamic behavior of SSME components (such as turbopump bearing elements and the flight accelerometer safety cut-off system) are also summarized. Appendices are included to illustrate some typical tasks performed under this study.
Quality evaluation and control of end cap welds in PHWR fuel elements by ultrasonic examination

NASA Astrophysics Data System (ADS)

Choi, M. S.; Yang, M. S.

1991-02-01

The current quality control procedure for nuclear fuel end cap welds depends mainly on destructive metallographic examination. A nondestructive examination technique, i.e., ultrasonic examination, has been developed to identify and evaluate weld discontinuities. A few interesting results of the weld quality evaluation obtained by applying the developed ultrasonic examination technique to PHWR fuel welds are presented. In addition, the feasibility of weld quality control by ultrasonic examination is discussed. This study shows that ultrasonic examination is an effective and reliable method for detecting abnormal weld contours and weld discontinuities such as micro-fissure, crack, upset split and expulsion, and can be used as a quality control tool for the end cap welding process.
Reliability Generalization of the Psychopathy Checklist Applied in Youthful Samples

ERIC Educational Resources Information Center

Campbell, Justin S.; Pulos, Steven; Hogan, Mike; Murry, Francie

2005-01-01

This study examines the average reliability of Hare Psychopathy Checklists (PCLs) adapted for use in samples of youthful offenders (aged 12 to 21 years). Two forms of reliability are examined: 18 alpha estimates of internal consistency and 18 intraclass correlation (two or more raters) estimates of interrater reliability. The results, an average…
Reliability, Validity, and Minimal Detectable Change of Balance Evaluation Systems Test and Its Short Versions in Older Cancer Survivors: A Pilot Study.

PubMed

Huang, Min H; Miller, Kara; Smith, Kristin; Fredrickson, Kayle; Shilling, Tracy

2016-01-01

Cancer is primarily a disease of older adults. About 77% of all cancers are diagnosed in persons aged 55 years and older. Cancer and its treatment can cause diverse sequelae impacting body systems underlying balance control. No study has examined the psychometric properties of balance assessment tools in older cancer survivors, presenting a significant challenge in the selection of outcome measures for clinicians treating this fast-growing population. This study aimed to determine the reliability, validity, and minimal detectable change (MDC) of the Balance Evaluation Systems Test (BESTest), Mini-Balance Evaluation Systems Test (Mini-BESTest), and Brief-Balance Evaluation Systems Test (Brief-BESTest) in community-dwelling older cancer survivors. This study used a cross-sectional design. Twenty breast and 8 prostate cancer survivors participated [age (SD) = 68.4 (8.13) years]. The BESTest and Activity-specific Balance Confidence (ABC) Scale were administered during the first session. Scores of the Mini-BESTest and Brief-BESTest were extracted on the basis of the scores of the BESTest. The BESTest was repeated within 1 to 2 weeks by the same rater to determine the test-retest reliability. For the analysis of the inter-rater reliability, 21 participants were randomly selected to be evaluated by 2 raters. A primary rater administered the test. The 2 raters independently and concurrently scored the performance of the participants. Each rater recorded the ratings separately on the scoring sheet. No discussion among the raters was allowed throughout the testing. Intraclass correlation coefficients (ICCs), standard error of measurement, MDC, and Bland-Altman plots were calculated. Concurrent validity of these balance tests with the ABC Scale was examined using the Spearman correlation.
The BESTest, Mini-BESTest, and Brief-BESTest had high test-retest (ICC = 0.90-0.94) and interrater reliability (ICC = 0.86-0.96), small standard error of measurement (0.86-2.47 points), and MDC (2.39-6.86 points). The Bland-Altman plot revealed no systematic errors. The scores of the BESTest, Mini-BESTest, and Brief-BESTest were correlated significantly with those of the ABC Scale (P < .01), supporting their concurrent validity. The BESTest, Mini-BESTest, and Brief-BESTest showed high interrater and test-retest reliability, and excellent concurrent validity with the ABC Scale for community-dwelling cancer survivors aged 55 years and older who had completed cancer treatments for at least 3 months. Future studies are necessary to determine the predictive values for determining fall risks using balance assessment tools in older cancer survivors. Clinicians can utilize the BESTest and its short versions to evaluate balance problems in community-dwelling older cancer survivors and apply the established MDC to assess the intervention outcomes.
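
The abstract does not give the formulas behind its standard error of measurement (SEM) and MDC values. A minimal sketch of the commonly used definitions follows, assuming SEM = SD × sqrt(1 − ICC) and a 95% MDC of 1.96 × sqrt(2) × SEM; the reported SEM and MDC ranges are roughly consistent with that multiplier. The function names and input values are hypothetical and are not taken from the study.

```python
import math


def sem_from_icc(sd: float, icc: float) -> float:
    """Standard error of measurement from the sample SD and a reliability coefficient."""
    return sd * math.sqrt(1.0 - icc)


def mdc95(sem: float) -> float:
    """Minimal detectable change at the 95% confidence level for a test-retest difference."""
    return 1.96 * math.sqrt(2.0) * sem


# Hypothetical inputs: SD of 5 points and ICC of 0.91.
sem = sem_from_icc(sd=5.0, icc=0.91)
print(round(sem, 2), round(mdc95(sem), 2))
```

A change score smaller than the MDC cannot be distinguished from measurement error at the chosen confidence level, which is why the MDC is used to judge intervention outcomes.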
Wafer level reliability testing: An idea whose time has come

NASA Technical Reports Server (NTRS)

Trapp, O. D.

1987-01-01

Wafer level reliability testing has been nurtured in the DARPA supported workshops, held each autumn since 1982. The seeds planted in 1982 have produced an active crop of very large scale integration manufacturers applying wafer level reliability test methods. Computer Aided Reliability (CAR) is a new seed being nurtured. Users are now being awakened by the huge economic value of the wafer reliability testing technology.
Validity and reliability of criterion based clinical audit to assess obstetrical quality of care in West Africa.

PubMed

Pirkle, Catherine M; Dumont, Alexandre; Traore, Mamadou; Zunzunegui, Maria-Victoria

2012-10-29

In Mali and Senegal, over 1% of women die giving birth in hospital. At some hospitals, over a third of infants are stillborn. Many deaths are due to substandard medical practices. Criterion-based clinical audits (CBCA) are increasingly used to measure and improve obstetrical care in resource-limited settings, but their measurement properties have not been formally evaluated. In 2011, we published a systematic review of obstetrical CBCA highlighting insufficient considerations of validity and reliability. The objective of this study is to develop an obstetrical CBCA adapted to the West African context and assess its reliability and validity. This work was conducted as a sub-study within a cluster randomized trial known as QUARITE. Criteria were selected based on extensive literature review and expert opinion. In early 2010, two auditors applied the CBCA to identical samples at 8 sites in Mali and Senegal (n = 185) to evaluate inter-rater reliability. In 2010-11, we conducted CBCA at 32 hospitals to assess construct validity (n = 633 patients). We correlated hospital characteristics (resource availability, facility perinatal and maternal mortality) with mean hospital CBCA scores. We used generalized estimating equations to assess whether patient CBCA scores were associated with perinatal mortality. Results demonstrate substantial (ICC = 0.67, 95% CI 0.54; 0.76) to elevated inter-rater reliability (ICC = 0.84, 95% CI 0.77; 0.89) in Senegal and Mali, respectively. Resource availability was positively correlated with mean hospital CBCA scores, and maternal and perinatal mortality were inversely correlated with hospital CBCA scores. Poor CBCA scores, adjusted for hospital and patient characteristics, were significantly associated with perinatal mortality (OR 1.84, 95% CI 1.01-3.34). Our CBCA has substantial inter-rater reliability and there is compelling evidence of its validity as the tool performs according to theory. Current Controlled Trials ISRCTN46950658.
Measurement of the center edge angle and determination of the Severin classification using digital radiography, computer-assisted measurement tools, and a Severin algorithm: intraobserver and interobserver reliability revisited.

PubMed

Carroll, Kristen L; Murray, Kathleen A; MacLeod, Lynne M; Hennessey, Theresa A; Woiczik, Marcella R; Roach, James W

2011-06-01

Numerous studies underscore the poor intraobserver and interobserver reliability of both the center edge angle (CEA) and the Severin classification using plain film measurements. In this study, experienced observers applied a computer-assisted measurement program to determine the CEA in digital pelvic radiographs of adults who had been previously treated for developmental dysplasia of the hip (DDH). Using a teaching aid/algorithm of the Severin classification, the observers then assigned a Severin rating to these hips. Intraobserver and interobserver errors were then calculated on both the CEA measurements and the Severin classifications. Four pediatric orthopaedic surgeons and 1 pediatric radiologist calculated the CEAs using the OrthoView™ planning system and then determined the Severin classification on 41 blinded digital pelvic radiographs. The radiographs were evaluated by each examiner twice, with evaluations separated by 2 months. All examiners reviewed a Severin classification algorithm before making their Severin assignments. The intraobserver and interobserver reliability for both the CEA and the Severin classification were calculated using the interclass correlation coefficients and Cohen and Fleiss κ scores, respectively. The intraobserver and interobserver reliability for CEA measurement was moderate to almost perfect. When we separated the Severin classification into 3 clinically relevant groups of good (Severin I and II), dysplastic (Severin III), and poor (Severin IV and above), our interobserver reliability neared almost perfect. The Severin classification is an extremely useful and oft-used radiographic measure for the success of DDH treatment. Our research found that digital radiography, computer-aided measurement tools, the use of a Severin algorithm, and separating the Severin classification into 3 clinically relevant groups significantly increased the intraobserver and interobserver reliability of both the CEA and Severin classification.
This finding will assist future studies using the CEA and Severin classification in the radiographic assessment of DDH treatment outcomes.
Geometric classification of scalp hair for valid drug testing, 6 more reliable than 8 hair curl groups

PubMed Central

Mkentane, K.; Gumedze, F.; Ngoepe, M.; Davids, L. M.; Khumalo, N. P.

2017-01-01

Introduction: Curly hair is reported to contain higher lipid content than straight hair, which may influence incorporation of lipid-soluble drugs. The use of race to describe hair curl variation (Asian, Caucasian and African) is unscientific yet common in medical literature (including reports of drug levels in hair). This study investigated the reliability of a geometric classification of hair (based on 3 measurements: the curve diameter, curl index and number of waves). Materials and methods: After ethical approval and informed consent, proximal virgin (6 cm) hair samples from the vertex of the scalp of 48 healthy volunteers were evaluated. Three raters each scored hairs from 48 volunteers on two occasions each for the 8- and 6-group classifications. One rater applied the 6-group classification to 80 additional volunteers in order to further confirm the reliability of this system. The Kappa statistic was used to assess intra- and inter-rater agreement. Results: Each rater classified 480 hairs on each occasion. No rater classified any volunteer’s 10 hairs into the same group; the most frequently occurring group was used for analysis. The inter-rater agreement was poor for the 8-groups (k = 0.418) but improved for the 6-groups (k = 0.671). The intra-rater agreement also improved (k = 0.444 to 0.648 versus 0.599 to 0.836) for 6-groups; that for the one evaluator for all volunteers was good (k = 0.754). Conclusions: Although small, this is the first study to test the reliability of a geometric classification. The 6-group method is more reliable. However, a digital classification system is likely to reduce operator error. A reliable objective classification of human hair curl is long overdue, particularly with the increasing use of hair as a testing substrate for treatment compliance in Medicine. PMID:28570555
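
The abstract does not state which variant of the kappa statistic was computed. As a minimal sketch, the following assumes Cohen's kappa for one pair of raters, which corrects the observed agreement for the agreement expected by chance. The function name and the group labels are hypothetical and are not data from the study.

```python
from collections import Counter


def cohen_kappa(rater_a, rater_b):
    """Cohen's kappa for two raters assigning one category label to each subject."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("both raters must score the same non-empty set of subjects")
    n = len(rater_a)
    # Proportion of subjects on which the two raters agree.
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Agreement expected by chance from each rater's marginal category frequencies.
    freq_a = Counter(rater_a)
    freq_b = Counter(rater_b)
    expected = sum(freq_a[c] * freq_b[c] for c in freq_a) / (n * n)
    return (observed - expected) / (1.0 - expected)


# Hypothetical curl-group assignments for eight volunteers.
first_rater = [1, 1, 2, 2, 3, 3, 1, 2]
second_rater = [1, 1, 2, 3, 3, 3, 1, 1]
print(round(cohen_kappa(first_rater, second_rater), 3))
```

With more than two raters, a multi-rater statistic such as Fleiss' kappa, or an average of pairwise values, would be needed instead.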
Least Squares Best Fit Method for the Three Parameter Weibull Distribution: Analysis of Tensile and Bend Specimens with Volume or Surface Flaw Failure

NASA Technical Reports Server (NTRS)

Gross, Bernard

1996-01-01

Material characterization parameters obtained from naturally flawed specimens are necessary for reliability evaluation of non-deterministic advanced ceramic structural components. The least squares best fit method is applied to the three parameter uniaxial Weibull model to obtain the material parameters from experimental tests on volume or surface flawed specimens subjected to pure tension, pure bending, four point or three point loading. Several illustrative example problems are provided.
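
One common way to fit a three-parameter Weibull model by least squares is sketched below. It is an illustration under stated assumptions and not necessarily the procedure of the report: failure probabilities are assigned with median ranks, the model is linearized as ln(-ln(1 - F)) = m ln(x - x_u) - m ln(x_0), and the threshold x_u is chosen from a simple grid as the value giving the smallest squared residual. The function names, the grid step, and the synthetic data are hypothetical, and specimen size and flaw-type effects are ignored.

```python
import math


def fit_line(xs, ys):
    """Ordinary least-squares line; returns slope, intercept, and sum of squared residuals."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    sse = sum((y - (intercept + slope * x)) ** 2 for x, y in zip(xs, ys))
    return slope, intercept, sse


def fit_weibull3(strengths, step=1.0):
    """Return (modulus m, scale x_0, threshold x_u) from a grid search over the threshold."""
    data = sorted(strengths)
    n = len(data)
    # Median-rank estimate of the failure probability of the i-th weakest specimen.
    ys = [math.log(-math.log(1.0 - (i - 0.3) / (n + 0.4))) for i in range(1, n + 1)]
    best = None
    threshold = 0.0
    while threshold < data[0]:  # the threshold must lie below the weakest specimen
        xs = [math.log(s - threshold) for s in data]
        slope, intercept, sse = fit_line(xs, ys)
        if best is None or sse < best[0]:
            best = (sse, slope, math.exp(-intercept / slope), threshold)
        threshold += step
    _, modulus, scale, location = best
    return modulus, scale, location


# Synthetic strengths placed exactly on a Weibull curve with m = 3, x_0 = 200, x_u = 100.
n_specimens = 15
probs = [(i - 0.3) / (n_specimens + 0.4) for i in range(1, n_specimens + 1)]
sample = [100.0 + 200.0 * (-math.log(1.0 - p)) ** (1.0 / 3.0) for p in probs]
m_hat, x0_hat, xu_hat = fit_weibull3(sample)
print(m_hat, x0_hat, xu_hat)
```

The synthetic data lie exactly on the model curve, so the fit should recover the generating parameters; real strength data scatter about the line and the recovered threshold is then only as fine as the grid step.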
Synthetic Defects for Vibrothermography

NASA Astrophysics Data System (ADS)

Renshaw, Jeremy; Holland, Stephen D.; Thompson, R. Bruce; Eisenmann, David J.

2010-02-01

Synthetic defects are an important tool used for characterizing the performance of nondestructive evaluation techniques. Viscous material-filled synthetic defects were developed for use in vibrothermography (also known as sonic IR) as a tool to improve inspection accuracy and reliability. This paper describes how the heat-generation response of these VMF synthetic defects is similar to the response of real defects. It also shows how VMF defects can be applied to improve inspection accuracy for complex industrial parts and presents a study of their application in an aircraft engine stator vane.
Interviewer as instrument: accounting for human factors in evaluation research.

PubMed

Brown, Joel H

2006-04-01

This methodological study examines an original data collection model designed to incorporate human factors and enhance data richness in qualitative and evaluation research. Evidence supporting this model is drawn from in-depth youth and adult interviews in one of the largest policy/program evaluations undertaken in the United States, the Drug, Alcohol, and Tobacco Education evaluation (77 districts, 118 schools). When applying the explicit observation technique (EOT)--the strategic and nonjudgmental disclosure of nonverbal human factor cues by the interviewer to the respondent during interview--data revealed the observation disclosure pattern. Here, respondents linked perceptions with policy or program implementation or effectiveness evidence. Although more research is needed, it is concluded that the EOT yields richer data when compared with traditional semistructured interviews and, thus, holds promise to enhance qualitative and evaluation research methods. Validity and reliability as well as qualitative and evaluation research considerations are discussed.
Evaluation of Reliability Coefficients for Two-Level Models via Latent Variable Analysis

ERIC Educational Resources Information Center

Raykov, Tenko; Penev, Spiridon

2010-01-01

A latent variable analysis procedure for evaluation of reliability coefficients for 2-level models is outlined. The method provides point and interval estimates of group means' reliability, overall reliability of means, and conditional reliability. In addition, the approach can be used to test simple hypotheses about these parameters. The…
Life and reliability modeling of bevel gear reductions

NASA Technical Reports Server (NTRS)

Savage, M.; Brikmanis, C. K.; Lewicki, D. G.; Coy, J. J.

1985-01-01

A reliability model is presented for bevel gear reductions with either a single input pinion or dual input pinions of equal size. The dual pinions may or may not have the same power applied for the analysis. The gears may be straddle mounted or supported in a bearing quill. The reliability model is based on the Weibull distribution. The reduction's basic dynamic capacity is defined as the output torque which may be applied for one million output rotations of the bevel gear with a 90 percent probability of reduction survival.
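
The 90 percent survival definition above can be illustrated with a short sketch. It assumes a two-parameter Weibull survival function anchored so that reliability equals 0.9 at the rated life; the function names, the Weibull slope of 2.5, and the example lives are hypothetical and are not taken from the report.

```python
import math


def reliability(life, l10_life, weibull_slope):
    """Survival probability at `life`, equal to 0.9 when life == l10_life."""
    return math.exp(math.log(0.9) * (life / l10_life) ** weibull_slope)


def life_at_reliability(target, l10_life, weibull_slope):
    """Life at which the survival probability equals `target` (0 < target < 1)."""
    return l10_life * (math.log(target) / math.log(0.9)) ** (1.0 / weibull_slope)


# Rated life of one million output rotations, hypothetical Weibull slope of 2.5.
rated_life = 1.0e6
print(reliability(rated_life, rated_life, 2.5))        # survival at the rated life
print(reliability(2.0 * rated_life, rated_life, 2.5))  # survival at twice the rated life
print(life_at_reliability(0.99, rated_life, 2.5))      # life for 99 percent survival
```

Anchoring the curve at the 90 percent point means the rated life and the slope are the only two inputs needed to evaluate survival at any other life.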
External validation of Global Evaluative Assessment of Robotic Skills (GEARS).

PubMed

Aghazadeh, Monty A; Jayaratna, Isuru S; Hung, Andrew J; Pan, Michael M; Desai, Mihir M; Gill, Inderbir S; Goh, Alvin C

2015-11-01

We demonstrate the construct validity, reliability, and utility of Global Evaluative Assessment of Robotic Skills (GEARS), a clinical assessment tool designed to measure robotic technical skills, in an independent cohort using an in vivo animal training model. Using a cross-sectional observational study design, 47 voluntary participants were categorized as experts (>30 robotic cases completed as primary surgeon) or trainees. The trainee group was further divided into intermediates (≥5 but ≤30 cases) or novices (<5 cases). All participants completed a standardized in vivo robotic task in a porcine model. Task performance was evaluated by two expert robotic surgeons and self-assessed by the participants using the GEARS assessment tool. Kruskal-Wallis test was used to compare the GEARS performance scores to determine construct validity; Spearman's rank correlation measured interobserver reliability; and Cronbach's alpha was used to assess internal consistency. Performance evaluations were completed on nine experts and 38 trainees (14 intermediate, 24 novice). Experts demonstrated superior performance compared to intermediates and novices overall and in all individual domains (p < 0.0001). In comparing intermediates and novices, the overall performance difference trended toward significance (p = 0.0505), while the individual domains of efficiency and autonomy were significantly different between groups (p = 0.0280 and 0.0425, respectively). Interobserver reliability between expert ratings was confirmed with a strong correlation observed (r = 0.857, 95 % CI [0.691, 0.941]). Expert and participant scoring showed less agreement (r = 0.435, 95 % CI [0.121, 0.689] and r = 0.422, 95 % CI [0.081, 0.672]). Internal consistency was excellent for experts and participants (α = 0.96, 0.98, 0.93). In an independent cohort, GEARS was able to differentiate between different robotic skill levels, demonstrating excellent construct validity.
As a standardized assessment tool, GEARS maintained consistency and reliability for an in vivo robotic surgical task and may be applied for skills evaluation in a broad range of robotic procedures.
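
Cronbach's alpha, used above for internal consistency, can be computed directly from a subjects-by-items score matrix. The sketch below uses the standard formula alpha = k / (k - 1) × (1 - sum of item variances / variance of total scores); the function name and the scores are hypothetical and are not GEARS data.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of rows, one row per subject and one column per item."""
    n_items = len(scores[0])
    item_columns = list(zip(*scores))
    item_variance_sum = sum(variance(column) for column in item_columns)
    total_variance = variance([sum(row) for row in scores])
    return n_items / (n_items - 1) * (1.0 - item_variance_sum / total_variance)


# Hypothetical ratings: five subjects scored on three items.
ratings = [
    [2, 3, 3],
    [4, 4, 5],
    [1, 2, 1],
    [5, 4, 4],
    [3, 3, 2],
]
print(round(cronbach_alpha(ratings), 3))
```

Alpha approaches 1 when the items rise and fall together across subjects, which is the sense in which a high value indicates internal consistency.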
Evaluating the reliability of Late Quaternary landform ages: Integrating 10Be cosmogenic surface exposure dating with U-series dating of pedogenic carbonate on alluvial and fluvial deposits, Sonoran desert, California

NASA Astrophysics Data System (ADS)

Blisniuk, K.; Sharp, W. D.

2015-12-01

To assess the reliability of Quaternary age determinations of alluvial and fluvial deposits across the Sonoran Desert (Coachella Valley and Anza Borrego) in southern California, we applied both 10Be exposure age dating of surface clasts and U-series dating of pedogenic carbonate from subsurface clast-coatings to the same deposits. We consider agreement between dates from the two techniques to indicate reliable age estimates because each technique is subject to distinct assumptions and therefore their systematic uncertainties are largely independent. 10Be exposure dates should yield maximum ages when no correction is made for inheritance and post-depositional erosion is negligible. U-series dating, in contrast, provides minimum dates because pedogenic carbonate forms after deposition. Our results show that: (1) For deposits ca. 70 ka or younger, 10Be and U-series dates were generally concordant. We note, however, that in most cases U-series soil dates exceed 10Be exposure dates that are corrected for inheritance when using 10Be in modern alluvium. This suggests that 10Be concentrations of modern alluvium may exceed the 10Be acquired by late Pleistocene deposits during fluvial transport and hillslope residence (i.e., Pleistocene inherited 10Be). (2) For deposits older than ~70 ka, U-series dates are significantly younger than the 10Be dates. This implies that U-series dates in this region may significantly underestimate the depositional age of older alluvium, probably because of delayed onset of deposition, slow accumulation, or poor preservation of secondary carbonate in response to climatic controls. Thus, whenever possible, multiple dating methods should be applied to obtain reliable ages for late Quaternary deposits.
A New Tool for Nutrition App Quality Evaluation (AQEL): Development, Validation, and Reliability Testing

PubMed Central

Huang, Wenhao; Chapman-Novakofski, Karen M

2017-01-01

Background: The extensive availability and increasing use of mobile apps for nutrition-based health interventions makes evaluation of the quality of these apps crucial for integration of apps into nutritional counseling. Objective: The goal of this research was the development, validation, and reliability testing of the app quality evaluation (AQEL) tool, an instrument for evaluating apps’ educational quality and technical functionality. Methods: Items for evaluating app quality were adapted from website evaluations, with additional items added to evaluate the specific characteristics of apps, resulting in 79 initial items. Expert panels of nutrition and technology professionals and app users reviewed items for face and content validation. After recommended revisions, nutrition experts completed a second AQEL review to ensure clarity. On the basis of 150 sets of responses using the revised AQEL, principal component analysis was completed, reducing AQEL into 5 factors that underwent reliability testing, including internal consistency, split-half reliability, test-retest reliability, and interrater reliability (IRR). Two additional modifiable constructs for evaluating apps based on the age and needs of the target audience as selected by the evaluator were also tested for construct reliability. IRR testing using intraclass correlations (ICC) with all 7 constructs was conducted, with 15 dietitians evaluating one app. Results: Development and validation resulted in the 51-item AQEL. These were reduced to 25 items in 5 factors after principal component analysis, plus 9 modifiable items in two constructs that were not included in principal component analysis. Internal consistency and split-half reliability of the following constructs derived from principal components analysis was good (Cronbach alpha >.80, Spearman-Brown coefficient >.80): behavior change potential, support of knowledge acquisition, app function, and skill development.
App purpose split-half reliability was .65. Test-retest reliability showed no significant change over time (P>.05) for all but skill development (P=.001). Construct reliability was good for items assessing age appropriateness of apps for children, teens, and a general audience. In addition, construct reliability was acceptable for assessing app appropriateness for various target audiences (Cronbach alpha >.70). For the 5 main factors, ICC (1,k) was >.80, with a P value of <.05. When 15 nutrition professionals evaluated one app, ICC (2,15) was .98, with a P value of <.001 for all 7 constructs when the modifiable items were specified for adults seeking weight loss support. Conclusions: Our preliminary effort shows that AQEL is a valid, reliable instrument for evaluating nutrition apps’ qualities for clinical interventions by nutrition clinicians, educators, and researchers. Further efforts in validating AQEL in various contexts are needed. PMID:29079554
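
The Spearman-Brown coefficient reported above adjusts the correlation between two half-tests to the reliability expected for the full-length instrument. A minimal sketch of the standard formula follows; the function name and the input correlation are hypothetical and are not AQEL results.

```python
def spearman_brown(r_half: float, length_factor: float = 2.0) -> float:
    """Predicted reliability when a test is lengthened by `length_factor`.

    With length_factor = 2 this is the usual split-half correction 2r / (1 + r).
    """
    return length_factor * r_half / (1.0 + (length_factor - 1.0) * r_half)


# Hypothetical correlation of 0.70 between the two halves of a scale.
print(round(spearman_brown(0.70), 3))
```

The corrected value is always at least as large as the half-test correlation when that correlation is positive, because each half is shorter, and therefore noisier, than the full scale.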

Reliability Evaluation and Improvement Approach of Chemical Production Man-Machine-Environment System

NASA Astrophysics Data System (ADS)

Miao, Yongchun; Kang, Rongxue; Chen, Xuefeng

2017-12-01

In recent years, with the gradual extension of reliability research, the study of production system reliability has become a hot topic in various industries. A man-machine-environment system is a complex system composed of human factors, machinery and equipment, and the environment. The reliability of each individual factor must be analyzed first, before moving on to the study of three-factor reliability. Meanwhile, the dynamic relationship among man, machine, and environment should be considered in order to establish an effective fuzzy evaluation mechanism that analyzes the reliability of such systems accurately. In this paper, based on systems engineering, fuzzy theory, reliability theory, and theories of human error, environmental impact, and machinery and equipment failure, the reliabilities of the human factor, the machinery and equipment, and the environment of a chemical production system were studied by the method of fuzzy evaluation. Finally, the reliability of the man-machine-environment system was calculated as a weighted result, which indicated that the reliability value of this chemical production system was 86.29. According to the given evaluation domain, the reliability of the integrated man-machine-environment system is in good condition, and effective measures for further improvement were proposed on the basis of the fuzzy calculation results.
Overcoming the Challenges of Unstructured Data in Multisite, Electronic Medical Record-based Abstraction.

PubMed

Polnaszek, Brock; Gilmore-Bykovskyi, Andrea; Hovanes, Melissa; Roiland, Rachel; Ferguson, Patrick; Brown, Roger; Kind, Amy J H

2016-10-01

Unstructured data encountered during retrospective electronic medical record (EMR) abstraction have routinely been identified as challenging to abstract reliably, as these data are often recorded as free text, without limitations to format or structure. There is increased interest in reliably abstracting this type of data given its prominent role in care coordination and communication, yet limited methodological guidance exists. As standard abstraction approaches resulted in substandard data reliability for unstructured data elements collected as part of a multisite, retrospective EMR study of hospital discharge communication quality, our goal was to develop, apply and examine the utility of a phase-based approach to reliably abstract unstructured data. This approach is examined using the specific example of discharge communication for warfarin management. We adopted a "fit-for-use" framework to guide the development and evaluation of abstraction methods using a 4-step, phase-based approach including (1) team building; (2) identification of challenges; (3) adaptation of abstraction methods; and (4) systematic data quality monitoring. Unstructured data elements were the focus of this study, including elements communicating steps in warfarin management (eg, warfarin initiation) and medical follow-up (eg, timeframe for follow-up). After implementation of the phase-based approach, interrater reliability for all unstructured data elements demonstrated κ values of ≥0.89, an average increase of +0.25 for each unstructured data element. As compared with standard abstraction methodologies, this phase-based approach was more time intensive, but did markedly increase abstraction reliability for unstructured data elements within multisite EMR documentation.
Content validity and reliability of test of gross motor development in Chilean children

PubMed Central

Cano-Cappellacci, Marcelo; Leyton, Fernanda Aleitte; Carreño, Joshua Durán

2016-01-01

OBJECTIVE: To validate a Spanish version of the Test of Gross Motor Development (TGMD-2) for the Chilean population. METHODS: Descriptive, cross-sectional, non-experimental validity and reliability study. Four translators, three experts, and 92 Chilean children aged five to 10 years, students at a primary school in Santiago, Chile, participated. The Committee of Experts carried out translation, back-translation, and revision processes to determine the translinguistic equivalence and content validity of the test, using the content validity index in 2013. In addition, a pilot implementation was carried out to determine the reliability of the test in Spanish, using the intraclass correlation coefficient and the Bland-Altman method. We evaluated whether the results differed significantly when the bat was replaced with a racket, using the t-test. RESULTS: We obtained a content validity index higher than 0.80 for language clarity and relevance of the TGMD-2 for children. There were significant differences in the object control subtest when comparing the results with bat and racket. The intraclass correlation coefficient for inter-rater, intra-rater, and test-retest reliability was greater than 0.80 in all cases. CONCLUSIONS: The TGMD-2 has appropriate content validity to be applied in the Chilean population. The reliability of this test is within the appropriate parameters and its use could be recommended in this population after the establishment of normative data, setting a further precedent for the validation in other Latin American countries. PMID:26815160
Performance of the 'material Failure Forecast Method' in real-time situations: A Bayesian approach applied on effusive and explosive eruptions

NASA Astrophysics Data System (ADS)

Boué, A.; Lesage, P.; Cortés, G.; Valette, B.; Reyes-Dávila, G.; Arámbula-Mendoza, R.; Budi-Santoso, A.

2016-11-01

Most attempts at deterministic eruption forecasting are based on the material Failure Forecast Method (FFM). This method assumes that a precursory observable, such as the rate of seismic activity, can be described by a simple power law which presents a singularity at a time close to the eruption onset. Until now, this method has been applied only in a small number of cases, generally for forecasts in hindsight. In this paper, a rigorous Bayesian approach of the FFM designed for real-time applications is applied. Using an automatic recognition system, seismo-volcanic events are detected and classified according to their physical mechanism, and time series of probability distributions of the rates of events are calculated. At each time of observation, a Bayesian inversion provides estimates of the exponent of the power law and of the time of eruption, together with their probability density functions. Two criteria are defined in order to evaluate the quality and reliability of the forecasts. Our automated procedure has allowed the analysis of long, continuous seismic time series: 13 years from Volcán de Colima, Mexico, 10 years from Piton de la Fournaise, Reunion Island, France, and several months from Merapi volcano, Java, Indonesia. The new forecasting approach has been applied to 64 pre-eruptive sequences which present various types of dominant seismic activity (volcano-tectonic or long-period events) and patterns of seismicity with different levels of complexity. This has allowed us to test the FFM assumptions, to determine under which conditions the method can be applied, and to quantify the success rate of the forecasts. 62% of the precursory sequences analysed are suitable for the application of FFM and half of the total number of eruptions are successfully forecast in hindsight. In real-time, the method allows for the successful forecast of 36% of all the eruptions considered.
Nevertheless, real-time forecasts are successful for 83% of the cases that fulfil the reliability criteria. Therefore, good confidence in the method is obtained when the reliability criteria are met.
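
The power-law singularity assumed by the FFM can be illustrated with the classical inverse-rate extrapolation. This is a simplified stand-in and not the Bayesian inversion used in the paper: it assumes a power-law exponent of 2, for which the inverse of the event rate decreases linearly with time and reaches zero at the forecast time. The function name and the synthetic rates are hypothetical.

```python
def forecast_failure_time(times, rates):
    """Fit a straight line to 1/rate versus time and return the time where it reaches zero."""
    inverse_rates = [1.0 / r for r in rates]
    n = len(times)
    mean_t = sum(times) / n
    mean_y = sum(inverse_rates) / n
    sxx = sum((t - mean_t) ** 2 for t in times)
    sxy = sum((t - mean_t) * (y - mean_y) for t, y in zip(times, inverse_rates))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_t
    if slope >= 0.0:
        raise ValueError("inverse rate is not decreasing; no singularity to extrapolate")
    return -intercept / slope


# Synthetic accelerating sequence whose rate diverges at t = 10 (arbitrary time units).
observation_times = [float(t) for t in range(9)]
event_rates = [1.0 / (0.5 * (10.0 - t)) for t in observation_times]
print(forecast_failure_time(observation_times, event_rates))
```

On real seismic rates the inverse-rate points scatter about the line, and the guard against a non-decreasing trend is a crude analogue of rejecting sequences that do not satisfy the method's assumptions.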
A survey of NASA and military standards on fault tolerance and reliability applied to robotics

NASA Technical Reports Server (NTRS)

Cavallaro, Joseph R.; Walker, Ian D.

1994-01-01

There is currently increasing interest and activity in the area of reliability and fault tolerance for robotics. This paper discusses the application of Standards in robot reliability, and surveys the literature of relevant existing standards. A bibliography of relevant Military and NASA standards for reliability and fault tolerance is included.
Improving Reliability of a Residency Interview Process

PubMed Central

Serres, Michelle L.; Gundrum, Todd E.

2013-01-01

Objective. To improve the reliability and discrimination of a pharmacy resident interview evaluation form, and thereby improve the reliability of the interview process. Methods. In phase 1 of the study, authors used a Many-Facet Rasch Measurement model to optimize an existing evaluation form for reliability and discrimination. In phase 2, interviewer pairs used the modified evaluation form within 4 separate interview stations. In phase 3, 8 interviewers individually evaluated each candidate in one-on-one interviews. Results. In phase 1, the evaluation form had a reliability of 0.98 with person separation of 6.56; reproducibly, the form separated applicants into 6 distinct groups. Using that form in phases 2 and 3, our largest variation source was candidates, while content specificity was the next largest variation source. The phase 2 g-coefficient was 0.787, while confirmatory phase 3 was 0.922. Process reliability improved with more stations despite fewer interviewers per station; the impact of content specificity was greatly reduced with more interview stations. Conclusion. A more reliable, discriminating evaluation form was developed to evaluate candidates during resident interviews, and a process was designed that reduced the impact from content specificity. PMID:24159209
Evaluation of the Quality of Action Cameras with Wide-Angle Lenses in UAV Photogrammetry

NASA Astrophysics Data System (ADS)

Hastedt, H.; Ekkel, T.; Luhmann, T.

2016-06-01

The application of light-weight cameras in UAV photogrammetry is required due to restrictions in payload. In general, consumer cameras with normal lens type are applied to a UAV system. The availability of action cameras, like the GoPro Hero4 Black, including a wide-angle lens (fish-eye lens) offers new perspectives in UAV projects. With these investigations, different calibration procedures for fish-eye lenses are evaluated in order to quantify their accuracy potential in UAV photogrammetry. Herewith the GoPro Hero4 is evaluated using different acquisition modes. It is investigated to which extent the standard calibration approaches in OpenCV or Agisoft PhotoScan/Lens can be applied to the evaluation processes in UAV photogrammetry. Therefore different calibration setups and processing procedures are assessed and discussed. Additionally a pre-correction of the initial distortion by GoPro Studio and its application to the photogrammetric purposes will be evaluated. An experimental setup with a set of control points and a prospective flight scenario is chosen to evaluate the processing results using Agisoft PhotoScan. Herewith it is analysed to which extent a pre-calibration and pre-correction of a GoPro Hero4 will reinforce the reliability and accuracy of a flight scenario.
Development and validation of the simulation-based learning evaluation scale.

PubMed

Hung, Chang-Chiao; Liu, Hsiu-Chen; Lin, Chun-Chih; Lee, Bih-O

2016-05-01

The instruments that evaluate a student's perception of receiving simulated training are English versions and have not been tested for reliability or validity. The aim of this study was to develop and validate a Chinese version Simulation-Based Learning Evaluation Scale (SBLES). Four stages were conducted to develop and validate the SBLES. First, specific desired competencies were identified according to the National League for Nursing and Taiwan Nursing Accreditation Council core competencies. Next, the initial item pool was comprised of 50 items related to simulation that were drawn from the literature of core competencies. Content validity was established by use of an expert panel. Finally, exploratory factor analysis and confirmatory factor analysis were conducted for construct validity, and Cronbach's coefficient alpha determined the scale's internal consistency reliability. Two hundred and fifty students who had experienced simulation-based learning were invited to participate in this study. Two hundred and twenty-five students completed and returned questionnaires (response rate=90%). Six items were deleted from the initial item pool and one was added after an expert panel review. Exploratory factor analysis with varimax rotation revealed 37 items remaining in five factors which accounted for 67% of the variance. The construct validity of SBLES was substantiated in a confirmatory factor analysis that revealed a good fit of the hypothesized factor structure. The findings tally with the criterion of convergent and discriminant validity. The range of internal consistency for five subscales was .90 to .93. Items were rated on a 5-point scale from 1 (strongly disagree) to 5 (strongly agree). The results of this study indicate that the SBLES is valid and reliable. The authors recommend that the scale could be applied in the nursing school to evaluate the effectiveness of simulation-based learning curricula. Copyright © 2016 Elsevier Ltd. All rights reserved.
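As an illustration of the internal-consistency statistic named in this abstract, here is a minimal sketch of Cronbach's coefficient alpha computed from a respondents-by-items score matrix with the standard formula alpha = k/(k-1) * (1 - sum of item variances / variance of total scores). The ratings below are invented for illustration and are not SBLES data.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondents, each a list of k item ratings."""
    k = len(scores[0])
    # Sample variance of each item (column) across respondents.
    item_vars = [variance([row[i] for row in scores]) for i in range(k)]
    # Sample variance of each respondent's total score.
    total_var = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


# Hypothetical ratings on a 5-point scale: 4 respondents, 3 items.
ratings = [
    [4, 5, 4],
    [3, 3, 2],
    [5, 5, 5],
    [2, 2, 3],
]
alpha = cronbach_alpha(ratings)
print(f"Cronbach's alpha: {alpha:.3f}")
```

Values close to 1 indicate that the items vary together; a real analysis would use the full sample and report alpha per subscale, as the abstract does.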
Rasch analysis of the Edmonton Symptom Assessment System and research implications.

PubMed

Cheifetz, O; Packham, T L; Macdermid, J C

2014-04-01

Reliable and valid assessment of the disease burden across all forms of cancer is critical to the evaluation of treatment effectiveness and patient progress. The Edmonton Symptom Assessment System (ESAS) is used for routine evaluation of people attending for cancer care. In the present study, we used Rasch analysis to explore the measurement properties of the ESAS and to determine the effect of using Rasch-proposed interval-level ESAS scoring compared with traditional scoring when evaluating the effects of an exercise program for cancer survivors. Polytomous Rasch analysis (Andrich's rating-scale model) was applied to data from 26,645 ESAS questionnaires completed at the Juravinski Cancer Centre. The fit of the ESAS to the polytomous Rasch model was investigated, including evaluations of differential item functioning for sex, age, and disease group. The research implication was investigated by comparing the results of an observational research study previously analysed using a traditional approach with the results obtained by Rasch-proposed interval-level ESAS scoring. The Rasch reliability index was 0.73, falling short of the desired 0.80-0.90 level. However, the ESAS was found to fit the Rasch model, including the criteria for uni-dimensional data. The analysis suggests that the current ESAS scoring system of 0-10 could be collapsed to a 6-point scale. Use of the Rasch-proposed interval-level scoring yielded results that were different from those calculated using summarized ordinal-level ESAS scores. Differential item functioning was not found for sex, age, or diagnosis groups. The ESAS is a moderately reliable uni-dimensional measure of cancer disease burden and can provide interval-level scaling with Rasch-based scoring. Further, our study indicates that, compared with the traditional scoring metric, Rasch-based scoring could result in substantive changes to conclusions.
Standardization of a spinal cord lesion model and neurologic evaluation using mice

PubMed Central

Borges, Paulo Alvim; Cristante, Alexandre Fogaça; de Barros-Filho, Tarcísio Eloy Pessoa; Natalino, Renato Jose Mendonça; dos Santos, Gustavo Bispo; Marcon, Raphael Marcus

2018-01-01

OBJECTIVE: To standardize a spinal cord lesion mouse model. METHODS: Thirty BALB/c mice were divided into five groups: four experimental groups and one control group (sham). The experimental groups were subjected to spinal cord lesion by a weight drop from different heights after laminectomy whereas the sham group only underwent laminectomy. Mice were observed for six weeks, and functional behavior scales were applied. The mice were then euthanized, and histological investigations were performed to confirm and score spinal cord lesion. The findings were evaluated to prove whether the method of administering spinal cord lesion was effective and different among the groups. Additionally, we correlated the results of the functional scales with the results from the histology evaluations to identify which scale is more reliable. RESULTS: One mouse presented autophagia, and six mice died during the experiment. Because four of the mice that died were in Group 5, Group 5 was excluded from the study. All the functional scales assessed proved to be significantly different from each other, and mice presented functional evolution during the experiment. Spinal cord lesion was confirmed by histology, and the results showed a high correlation between the Basso, Beattie, Bresnahan Locomotor Rating Scale and the Basso Mouse Scale. The mouse function scale showed a moderate to high correlation with the histological findings, and the horizontal ladder test had a high correlation with neurologic degeneration but no correlation with the other histological parameters evaluated. CONCLUSION: This spinal cord lesion mouse model proved to be effective and reliable with the exception of lesions caused by a 10-g drop from 50 mm, which resulted in unacceptable mortality. The Basso, Beattie, Bresnahan Locomotor Rating Scale and Basso Mouse Scale are the most reliable functional assessments, but the horizontal ladder test is not recommended. PMID:29561931
Indices of Paraspinal Muscles Degeneration: Reliability and Association With Facet Joint Osteoarthritis: Feasibility Study.

PubMed

Kalichman, Leonid; Klindukhov, Alexander; Li, Ling; Linov, Lina

2016-11-01

A reliability and cross-sectional observational study. To introduce a scoring system for visible fat infiltration in paraspinal muscles; to evaluate intertester and intratester reliability of this system and its relationship with indices of muscle density; to evaluate the association between indices of paraspinal muscle degeneration and facet joint osteoarthritis. Current evidence suggests that the paraspinal muscles degeneration is associated with low back pain, facet joint osteoarthritis, spondylolisthesis, and degenerative disc disease. However, the evaluation of paraspinal muscles on computed tomography is not radiological routine, probably because of absence of simple and reliable indices of paraspinal degeneration. One hundred fifty consecutive computed tomography scans of the lower back (N=75) or abdomen (N=75) were evaluated. Mean radiographic density (in Hounsfield units) and SD of the density of multifidus and erector spinae were evaluated at the L4-L5 spinal level. A new index of muscle degeneration, radiographic density ratio=muscle density/SD of density, was calculated. To evaluate the visible fat infiltration in paraspinal muscles, we proposed a 3-graded scoring system. The prevalence of facet joint osteoarthritis was also evaluated. Intraclass correlation and κ statistics were used to evaluate inter-rater and intra-rater reliability. Logistic regression examined the association between paraspinal muscle indices and facet joint osteoarthritis. Intra-rater reliability for fat infiltration score (κ) ranged between 0.87 and 0.92; inter-rater reliability between 0.70 and 0.81. Intra-rater reliability (intraclass correlation) for mean density of paraspinal muscles ranged between 0.96 and 0.99, inter-rater reliability between 0.95 and 0.99; SD intra-rater reliability ranged between 0.82 and 0.91, inter-rater reliability between 0.80 and 0.89. 
Significant associations (P<0.01) were found between facet joint osteoarthritis, fat infiltration score, and radiographic density ratio. Two suggested indices of paraspinal muscle degeneration showed excellent reliability and were significantly associated with facet joint osteoarthritis. Additional studies are needed to evaluate the associations with other spinal degeneration features and low back pain.
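The index defined in this abstract, radiographic density ratio = muscle density / SD of density, can be sketched in a few lines. The abstract does not say whether a sample or population standard deviation was used; the sample SD is assumed here, and the Hounsfield-unit values are invented for illustration.

```python
from statistics import mean, stdev


def radiographic_density_ratio(hu_values):
    """Mean density divided by the SD of density for one muscle region.

    Assumption: sample standard deviation (n - 1 denominator).
    """
    return mean(hu_values) / stdev(hu_values)


# Hypothetical Hounsfield-unit samples from a region of interest.
region_hu = [50.0, 60.0, 40.0, 55.0, 45.0]
ratio = radiographic_density_ratio(region_hu)
print(f"radiographic density ratio: {ratio:.2f}")
```

A muscle with lower mean density or more heterogeneous density (larger SD) gets a lower ratio under this definition.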
Psychometric Properties of the Persian Translation of the Sexual Quality of Life–Male Questionnaire

PubMed Central

Maasoumi, Raziyeh; Mokarami, Hamidreza; Nazifi, Morteza; Stallones, Lorann; Taban, Abrahim; Yazdani Aval, Mohsen; Samimi, Kazem

2016-01-01

Sexual dysfunction has been demonstrated to be related to a poor quality of life. These dysfunctions are especially prevalent among men. This cross-sectional study aimed to investigate the psychometric properties of the Persian translation of the Sexual Quality of Life–Male (SQOL-M), translated and adapted to measure sexual quality of life among Iranian men. Forward–backward procedures were applied in translating the original SQOL-M into Persian, and then the psychometric properties of the Persian translation of the SQOL-M were studied. A total of 181 participants (23-60 years old) were included in the study. Validity was assessed by construct validity using confirmatory factor analysis, convergent validity, and content validity. The international index of erectile function (IIEF) and the work ability index were used to study the convergent validity. Reliability was evaluated through internal consistency and test–retest reliability analyses. The results from confirmatory factor analysis confirmed a one-factor solution for the Persian version of the SQOL-M. Content validity of the translated measure was endorsed by 10 specialists. Pearson correlations indicated that work ability index score, dimensions of the IIEF, and the IIEF total score were positively correlated with the Persian version of the SQOL-M (p < .001). Reliability evaluation indicated a high internal consistency and test–retest reliability. The Cronbach’s alpha coefficient and intraclass correlation coefficients were .96 and .95, respectively. Results indicated that the Persian version of the SQOL-M has good to excellent psychometric properties and can be used to assess the sexual quality of life among Iranian men. PMID:26856758
Psychometric Properties of the Persian Translation of the Sexual Quality of Life-Male Questionnaire.

PubMed

Maasoumi, Raziyeh; Mokarami, Hamidreza; Nazifi, Morteza; Stallones, Lorann; Taban, Abrahim; Yazdani Aval, Mohsen; Samimi, Kazem

2017-05-01

Sexual dysfunction has been demonstrated to be related to a poor quality of life. These dysfunctions are especially prevalent among men. This cross-sectional study aimed to investigate the psychometric properties of the Persian translation of the Sexual Quality of Life-Male (SQOL-M), translated and adapted to measure sexual quality of life among Iranian men. Forward-backward procedures were applied in translating the original SQOL-M into Persian, and then the psychometric properties of the Persian translation of the SQOL-M were studied. A total of 181 participants (23-60 years old) were included in the study. Validity was assessed by construct validity using confirmatory factor analysis, convergent validity, and content validity. The international index of erectile function (IIEF) and the work ability index were used to study the convergent validity. Reliability was evaluated through internal consistency and test-retest reliability analyses. The results from confirmatory factor analysis confirmed a one-factor solution for the Persian version of the SQOL-M. Content validity of the translated measure was endorsed by 10 specialists. Pearson correlations indicated that work ability index score, dimensions of the IIEF, and the IIEF total score were positively correlated with the Persian version of the SQOL-M ( p < .001). Reliability evaluation indicated a high internal consistency and test-retest reliability. The Cronbach's alpha coefficient and intraclass correlation coefficients were .96 and .95, respectively. Results indicated that the Persian version of the SQOL-M has good to excellent psychometric properties and can be used to assess the sexual quality of life among Iranian men.
Genomic selection in a commercial winter wheat population.

PubMed

He, Sang; Schulthess, Albert Wilhelm; Mirdita, Vilson; Zhao, Yusheng; Korzun, Viktor; Bothe, Reiner; Ebmeyer, Erhard; Reif, Jochen C; Jiang, Yong

2016-03-01

Genomic selection models can be trained using historical data, and filtering genotypes based on phenotyping intensity and a reliability criterion can increase the prediction ability. We implemented genomic selection based on a large commercial population incorporating 2325 European winter wheat lines. Our objectives were (1) to study whether modeling epistasis besides additive genetic effects results in enhancement on prediction ability of genomic selection, (2) to assess prediction ability when training population comprised historical or less-intensively phenotyped lines, and (3) to explore the prediction ability in subpopulations selected based on the reliability criterion. We found a 5% increase in prediction ability when shifting from additive to additive plus epistatic effects models. In addition, only a marginal loss from 0.65 to 0.50 in accuracy was observed using the data collected from 1 year to predict genotypes of the following year, revealing that stable genomic selection models can be accurately calibrated to predict subsequent breeding stages. Moreover, prediction ability was maximized when the genotypes evaluated in a single location were excluded from the training set but subsequently decreased again when the phenotyping intensity was increased above two locations, suggesting that the update of the training population should be performed considering all the selected genotypes but excluding those evaluated in a single location. The genomic prediction ability was substantially higher in subpopulations selected based on the reliability criterion, indicating that phenotypic selection for highly reliable individuals could be directly replaced by applying genomic selection to them. We empirically conclude that there is a high potential to assist commercial wheat breeding programs employing genomic selection approaches.
Evaluation of hydrate-screening methods.

PubMed

Cui, Yong; Yao, Erica

2008-07-01

The purpose of this work is to evaluate the effectiveness and reliability of several common hydrate-screening techniques, and to provide guidelines for designing hydrate-screening programs for new drug candidates. Ten hydrate-forming compounds were selected as model compounds and six hydrate-screening approaches were applied to these compounds in an effort to generate their hydrate forms. The results prove that no screening approach is universally effective in finding hydrates for small organic compounds. Rather, a combination of different methods should be used to improve screening reliability. Among the approaches tested, the dynamic water vapor sorption/desorption isotherm (DVI) method and storage under high humidity (HH) yielded 60-70% success ratios, the lowest among all techniques studied. The risk of false negatives arises in particular for nonhygroscopic compounds. On the other hand, both slurry in water (Slurry) and temperature cycling of aqueous suspension (TCS) showed high success rates (90%) with some exceptions. The mixed solvent systems (MSS) procedure also achieved high success rates (90%), and was found to be more suitable for water-insoluble compounds. For water-soluble compounds, MSS may not be the best approach because recrystallization is difficult in solutions with high water activity. Finally, vapor diffusion (VD) yielded a reasonably high success ratio in finding hydrates (80%). However, this method suffers from experimental difficulty and unreliable results for either highly water-soluble or water-insoluble compounds. This study indicates that a reliable hydrate-screening strategy should take into consideration the solubility and hygroscopicity of the compounds studied. A combination of the Slurry or TCS method with the MSS procedure could provide a screening strategy with reasonable reliability.
Scaled CMOS Technology Reliability Users Guide

NASA Technical Reports Server (NTRS)

White, Mark

2010-01-01

The desire to assess the reliability of emerging scaled microelectronics technologies through faster reliability trials and more accurate acceleration models is the precursor for further research and experimentation in this relevant field. The effect of semiconductor scaling on microelectronics product reliability is an important aspect to the high reliability application user. From the perspective of a customer or user, who in many cases must deal with very limited, if any, manufacturer's reliability data to assess the product for a highly-reliable application, product-level testing is critical in the characterization and reliability assessment of advanced nanometer semiconductor scaling effects on microelectronics reliability. A methodology on how to accomplish this and techniques for deriving the expected product-level reliability on commercial memory products are provided. Competing mechanism theory and the multiple failure mechanism model are applied to the experimental results of scaled SDRAM products. Accelerated stress testing at multiple conditions is applied at the product level of several scaled memory products to assess the performance degradation and product reliability. Acceleration models are derived for each case. For several scaled SDRAM products, retention time degradation is studied and two distinct soft error populations are observed with each technology generation: early breakdown, characterized by randomly distributed weak bits with Weibull slope (beta)=1, and a main population breakdown with an increasing failure rate. Retention time soft error rates are calculated and a multiple failure mechanism acceleration model with parameters is derived for each technology. Defect densities are calculated and reflect a decreasing trend in the percentage of random defective bits for each successive product generation. A normalized soft error failure rate of the memory data retention time in FIT/Gb and FIT/cm² for several scaled SDRAM generations is presented, revealing a power relationship. General models describing the soft error rates across scaled product generations are presented. The analysis methodology may be applied to other scaled microelectronic products and their key parameters.
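The two failure populations described above can be illustrated with the standard Weibull hazard function: a slope of beta = 1 gives a constant failure rate, while beta > 1 gives an increasing one. The sketch below also shows the usual definition of a FIT rate (failures per 10^9 device-hours). The parameter values and helper names are hypothetical and are not taken from the report.

```python
def weibull_hazard(t, beta, eta):
    """Instantaneous failure rate of a Weibull distribution (shape beta, scale eta)."""
    return (beta / eta) * (t / eta) ** (beta - 1)


def fit_rate(failures, devices, hours):
    """Failures in time: failures per 1e9 device-hours."""
    return failures / (devices * hours) * 1e9


# beta = 1: the hazard is the same early and late (random failures).
early_random = weibull_hazard(10.0, beta=1.0, eta=100.0)
late_random = weibull_hazard(1000.0, beta=1.0, eta=100.0)

# beta = 2: the hazard grows with time (wear-out-like behaviour).
early_wear = weibull_hazard(10.0, beta=2.0, eta=100.0)
late_wear = weibull_hazard(50.0, beta=2.0, eta=100.0)

# Hypothetical test: 2 failures among 1000 devices run for 1000 hours each.
example_fit = fit_rate(failures=2, devices=1000, hours=1000.0)
print(early_random, late_random, early_wear, late_wear, example_fit)
```

Dividing a FIT rate by memory capacity in gigabits, or by die area, would give normalized figures of the kind the abstract reports as FIT/Gb and FIT/cm².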
Nearest-neighbor guided evaluation of data reliability and its applications.

PubMed

Boongoen, Tossapon; Shen, Qiang

2010-12-01

The intuition of data reliability has recently been incorporated into the main stream of research on ordered weighted averaging (OWA) operators. Instead of relying on human-guided variables, the aggregation behavior is determined in accordance with the underlying characteristics of the data being aggregated. Data-oriented operators such as the dependent OWA (DOWA) utilize centralized data structures to generate reliable weights, however. Despite their simplicity, the approach taken by these operators neglects entirely any local data structure that represents a strong agreement or consensus. To address this issue, the cluster-based OWA (Clus-DOWA) operator has been proposed. It employs a cluster-based reliability measure that is effective to differentiate the accountability of different input arguments. Yet, its actual application is constrained by the high computational requirement. This paper presents a more efficient nearest-neighbor-based reliability assessment for which an expensive clustering process is not required. The proposed measure can be perceived as a stress function, from which the OWA weights and associated decision-support explanations can be generated. To illustrate the potential of this measure, it is applied to both the problem of information aggregation for alias detection and the problem of unsupervised feature selection (in which unreliable features are excluded from an actual learning process). Experimental results demonstrate that these techniques usually outperform their conventional state-of-the-art counterparts.
Reliability of isometric subtalar pronator and supinator strength testing.

PubMed

Hagen, Marco; Lahner, Matthias; Winhuysen, Martin; Maiwald, Christian

2015-01-01

Due to the specific anatomy of the subtalar joint with its oblique axis, isometric pronator and supinator strength is not well documented. The purpose of this study was to determine intra- and between-session reliability of pronator and supinator strength and lower leg muscle activity measurements during maximum voluntary isometric contractions (MVIC). Pronator and supinator peak torques (PT), with and without supplementary visual muscle strength biofeedback (FB), and muscular activities of peroneus longus (PL) and tibialis anterior (TA) were assessed twice 3 days apart by the same examiner in 21 healthy young male adults (mean age: 27.6 years; SD = 3.9). Limits of agreement (LoA) and minimum detectable change (MDC) were evaluated. By applying FB, reliability of both pronator and supinator PT was improved: LoA were reduced from 32% to 26% and from 20% to 18% and MDC from 20% to 15% and from 16% to 12% in supinator and pronator PT, respectively. Learning effects in pronator and supinator PT (p < 0.05), which were present without FB, were eliminated using FB. Except for TA during pronation, muscle activities showed low reliability indicated by LoA of 51% to 79%. Using supplementary biofeedback, isometric subtalar pronator and supinator strength testing is reliable in healthy subjects. LoA of 18% and 26% have to be exceeded for pronator and supinator PT, respectively, to detect relevant effects in repeated measures.
Highly reliable field electron emitters produced from reproducible damage-free carbon nanotube composite pastes with optimal inorganic fillers.

PubMed

Kim, Jae-Woo; Jeong, Jin-Woo; Kang, Jun-Tae; Choi, Sungyoul; Ahn, Seungjoon; Song, Yoon-Ho

2014-02-14

Highly reliable field electron emitters were developed using a formulation for reproducible damage-free carbon nanotube (CNT) composite pastes with optimal inorganic fillers and a ball-milling method. We carefully controlled the ball-milling sequence and time to avoid any damage to the CNTs, which incorporated fillers that were fully dispersed as paste constituents. The field electron emitters fabricated by printing the CNT pastes were found to exhibit almost perfect adhesion of the CNT emitters to the cathode, along with good uniformity and reproducibility. A high field enhancement factor of around 10,000 was achieved from the CNT field emitters developed. By selecting nano-sized metal alloys and oxides and using the same formulation sequence, we also developed reliable field emitters that could survive high-temperature post processing. These field emitters had high durability to post vacuum annealing at 950 °C, guaranteeing survival of the brazing process used in the sealing of field emission x-ray tubes. We evaluated the field emitters in a triode configuration in the harsh environment of a tiny vacuum-sealed vessel and observed very reliable operation for 30 h at a high current density of 350 mA cm⁻². The CNT pastes and related field emitters that were developed could be usefully applied in reliable field emission devices.
Highly reliable field electron emitters produced from reproducible damage-free carbon nanotube composite pastes with optimal inorganic fillers

NASA Astrophysics Data System (ADS)

Kim, Jae-Woo; Jeong, Jin-Woo; Kang, Jun-Tae; Choi, Sungyoul; Ahn, Seungjoon; Song, Yoon-Ho

2014-02-01

Highly reliable field electron emitters were developed using a formulation for reproducible damage-free carbon nanotube (CNT) composite pastes with optimal inorganic fillers and a ball-milling method. We carefully controlled the ball-milling sequence and time to avoid any damage to the CNTs, which incorporated fillers that were fully dispersed as paste constituents. The field electron emitters fabricated by printing the CNT pastes were found to exhibit almost perfect adhesion of the CNT emitters to the cathode, along with good uniformity and reproducibility. A high field enhancement factor of around 10,000 was achieved from the CNT field emitters developed. By selecting nano-sized metal alloys and oxides and using the same formulation sequence, we also developed reliable field emitters that could survive high-temperature post processing. These field emitters had high durability to post vacuum annealing at 950 °C, guaranteeing survival of the brazing process used in the sealing of field emission x-ray tubes. We evaluated the field emitters in a triode configuration in the harsh environment of a tiny vacuum-sealed vessel and observed very reliable operation for 30 h at a high current density of 350 mA cm⁻². The CNT pastes and related field emitters that were developed could be usefully applied in reliable field emission devices.

[The applicability of results].

PubMed

Marín-León, I

2015-11-01

The ultimate aim of the critical reading of medical literature is to use the scientific advances in clinical practice or for innovation. This requires an evaluation of the applicability of the results of the studies that have been published, which begins with a clear understanding of these results. When the studies do not provide sufficient guarantees of rigor in design and analysis, the conditions necessary for the applicability of the results are not met; however, the fact that the results are reliable is not enough to make it worth trying to use their conclusions. This article explains how carrying out studies in experimental or artificial conditions often moves them away from the real conditions in which they claim to apply their conclusions. To evaluate this applicability, the article proposes evaluating a set of items that will enable the reader to determine the likelihood that the benefits and risks reported in the studies will yield the least uncertainty in the clinical arena where they aim to be applied. Copyright © 2015 SERAM. Published by Elsevier España, S.L.U. All rights reserved.
Mathematics applied to the climate system: outstanding challenges and recent progress

PubMed Central

Williams, Paul D.; Cullen, Michael J. P.; Davey, Michael K.; Huthnance, John M.

2013-01-01

The societal need for reliable climate predictions and a proper assessment of their uncertainties is pressing. Uncertainties arise not only from initial conditions and forcing scenarios, but also from model formulation. Here, we identify and document three broad classes of problems, each representing what we regard to be an outstanding challenge in the area of mathematics applied to the climate system. First, there is the problem of the development and evaluation of simple physically based models of the global climate. Second, there is the problem of the development and evaluation of the components of complex models such as general circulation models. Third, there is the problem of the development and evaluation of appropriate statistical frameworks. We discuss these problems in turn, emphasizing the recent progress made by the papers presented in this Theme Issue. Many pressing challenges in climate science require closer collaboration between climate scientists, mathematicians and statisticians. We hope the papers contained in this Theme Issue will act as inspiration for such collaborations and for setting future research directions. PMID:23588054
Simultaneous determination of fluoride, chloride, sulfate, phosphate, monofluorophosphate, glycerophosphate, sorbate, and saccharin in gargles by ion chromatography

PubMed Central

Zhang, Yan-zhen; Zhou, Yan-chun; Liu, Li; Zhu, Yan

2007-01-01

Simple, reliable and sensitive analytical methods to determine anticariogenic agents, preservatives, and artificial sweeteners contained in commercial gargles are necessary for evaluating their effectiveness, safety, and quality. An ion chromatography (IC) method has been described to analyze simultaneously eight anions including fluoride, chloride, sulfate, phosphate, monofluorophosphate, glycerophosphate (anticariogenic agents), sorbate (a preservative), and saccharin (an artificial sweetener) in gargles. In this IC system, we applied a mobile phase gradient elution with KOH, separation by IonPac AS18 columns, and suppressed conductivity detection. Optimized analytical conditions were further evaluated for accuracy. The relative standard deviations (RSDs) of the inter-day’s retention time and peak area of all species were less than 0.938% and 8.731%, respectively, while RSDs of 5-day retention time and peak area were less than 1.265% and 8.934%, respectively. The correlation coefficients for targeted analytes ranged from 0.9997 to 1.0000. The spiked recoveries for the anions were 90% to 102.5%. We concluded that the method can be applied for comprehensive evaluation of commercial gargles. PMID:17610331
Simultaneous determination of fluoride, chloride, sulfate, phosphate, monofluorophosphate, glycerophosphate, sorbate, and saccharin in gargles by ion chromatography.

PubMed

Zhang, Yan-zhen; Zhou, Yan-chun; Liu, Li; Zhu, Yan

2007-07-01

Simple, reliable and sensitive analytical methods to determine anticariogenic agents, preservatives, and artificial sweeteners contained in commercial gargles are necessary for evaluating their effectiveness, safety, and quality. An ion chromatography (IC) method has been described to analyze simultaneously eight anions including fluoride, chloride, sulfate, phosphate, monofluorophosphate, glycerophosphate (anticariogenic agents), sorbate (a preservative), and saccharin (an artificial sweetener) in gargles. In this IC system, we applied a mobile phase gradient elution with KOH, separation by IonPac AS18 columns, and suppressed conductivity detection. Optimized analytical conditions were further evaluated for accuracy. The relative standard deviations (RSDs) of the inter-day's retention time and peak area of all species were less than 0.938% and 8.731%, respectively, while RSDs of 5-day retention time and peak area were less than 1.265% and 8.934%, respectively. The correlation coefficients for targeted analytes ranged from 0.9997 to 1.0000. The spiked recoveries for the anions were 90% to 102.5%. We concluded that the method can be applied for comprehensive evaluation of commercial gargles.
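The precision figures quoted in the record above are relative standard deviations. As a minimal sketch of how such a figure is usually obtained (sample standard deviation divided by the mean, in percent), the following may help; the function name and the retention times are hypothetical and are not taken from the study.

```python
import statistics


def relative_standard_deviation(values):
    """Return the RSD (sample standard deviation / mean) in percent."""
    mean = statistics.mean(values)
    if mean == 0:
        raise ValueError("RSD is undefined when the mean is zero")
    return 100.0 * statistics.stdev(values) / abs(mean)


# Hypothetical retention times (minutes) for one anion on five days;
# illustrative values only, not data from the abstract.
retention_times = [4.02, 4.05, 3.99, 4.03, 4.01]
print(f"RSD = {relative_standard_deviation(retention_times):.3f}%")
```

The same function could be applied to peak areas; whether the authors used the sample or the population standard deviation is not stated in the abstract.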
A study on the real-time reliability of on-board equipment of train control system

NASA Astrophysics Data System (ADS)

Zhang, Yong; Li, Shiwei

2018-05-01

Real-time reliability evaluation is conducive to establishing a condition based maintenance system for the purpose of guaranteeing continuous train operation. According to the inherent characteristics of the on-board equipment, the connotation of reliability evaluation of on-board equipment is defined and the evaluation index of real-time reliability is provided in this paper. From the perspective of methodology and practical application, the real-time reliability of the on-board equipment is discussed in detail, and the method of evaluating the real-time reliability of on-board equipment at component level based on Hidden Markov Model (HMM) is proposed. In this method the performance degradation data is used directly to realize the accurate perception of the hidden state transition process of on-board equipment, which can achieve a better description of the real-time reliability of the equipment.
The next generation in aircraft protection against advanced MANPADS

NASA Astrophysics Data System (ADS)

Chapman, Stuart

2014-10-01

This paper discusses the advanced and novel technologies and underlying systems capabilities that Selex ES has applied during the development, test and evaluation of the twin head Miysis DIRCM System in order to ensure that it provides the requisite levels of protection against the latest, sophisticated all-aspect IR MANPADS. The importance of key performance parameters, including the fundamental need for "spherical" coverage, rapid time to energy-on-target, laser tracking performance and radiant intensity on seeker dome is covered. It also addresses the approach necessary to ensure that the equipment is suited to all air platforms from the smallest helicopters to large transports, while also ensuring that it achieves an inherent high reliability and an ease of manufacture and repair such that a step change in through-life cost in comparison to previous generation systems can be achieved. The benefits and issues associated with open architecture design are also considered. Finally, the need for extensive test and evaluation at every stage, including simulation, laboratory testing, platform and target dynamic testing in a System Integration Laboratory (SIL), flight trial, missile live-fire, environmental testing and reliability testing is also described.
Thick thermal barrier coatings for diesel engines

NASA Technical Reports Server (NTRS)

Beardsley, M. Brad

1995-01-01

Caterpillar's approach to applying thick thermal barrier coatings (TTBC's) to diesel engine combustion chambers has been to use advanced modeling techniques to predict engine conditions and combine this information with fundamental property evaluation of TTBC systems to predict engine performance and TTBC stress states. Engine testing has been used to verify the predicted performance of the TTBC systems and provide information on failure mechanisms. The objective of Caterpillar's program to date has been to advance the fundamental understanding of thick thermal barrier coating systems. Previous reviews of thermal barrier coating technology concluded that the current level of understanding of coating system behavior is inadequate and the lack of fundamental understanding may impede the application of TTBC's to diesel engines. Areas of TTBC technology being examined in this program include powder characteristics and chemistry; bond coat composition; coating design, microstructure, and thickness as they affect properties, durability, and reliability; and TTBC 'aging' effects (microstructural and property changes) under diesel engine operating conditions. Methods to evaluate the reliability and durability of TTBC's have been developed that attempt to understand the fundamental strength of TTBC's for particular stress states.
Thick thermal barrier coatings for diesel engines

NASA Technical Reports Server (NTRS)

Beardsley, M. B.

1995-01-01

Caterpillar's approach to applying Thick Thermal Barrier Coatings (TTBC's) to diesel engine combustion chambers has been to use advanced modeling techniques to predict engine conditions and combine this information with fundamental property evaluation of TTBC systems to predict engine performance and TTBC stress states. Engine testing has been used to verify the predicted performance of the TTBC systems and provide information on failure mechanisms. The objective of Caterpillar's subcontract with ORNL is to advance the fundamental understanding of thick thermal barrier coating systems. Previous reviews of thermal barrier coating technology concluded that the current level of understanding of coating system behavior is inadequate and the lack of fundamental understanding may impede the application of TTBC's to diesel engines. Areas of TTBC technology being examined in this program include powder characteristics and chemistry; bond coat composition; coating design, microstructure, and thickness as they affect properties, durability, and reliability; and TTBC 'aging' effects (microstructural and property changes) under diesel engine operating conditions. Methods to evaluate the reliability and durability of TTBC's have been developed that attempt to understand the fundamental strength of TTBC's for particular stress states.
A Topology Control Strategy with Reliability Assurance for Satellite Cluster Networks in Earth Observation

PubMed Central

Chen, Qing; Zhang, Jinxiu; Hu, Ze

2017-01-01

This article investigates the dynamic topology control problem of satellite cluster networks (SCNs) in Earth observation (EO) missions by applying a novel metric of stability for inter-satellite links (ISLs). The properties of the periodicity and predictability of satellites’ relative position are involved in the link cost metric which is to give a selection criterion for choosing the most reliable data routing paths. Also, a cooperative work model with reliability is proposed for the situation of emergency EO missions. Based on the link cost metric and the proposed reliability model, a reliability assurance topology control algorithm and its corresponding dynamic topology control (RAT) strategy are established to maximize the stability of data transmission in the SCNs. The SCNs scenario is tested through some numeric simulations of the topology stability of average topology lifetime and average packet loss rate. Simulation results show that the proposed reliable strategy applied in SCNs significantly improves the data transmission performance and prolongs the average topology lifetime. PMID:28241474
A Topology Control Strategy with Reliability Assurance for Satellite Cluster Networks in Earth Observation.

PubMed

Chen, Qing; Zhang, Jinxiu; Hu, Ze

2017-02-23

This article investigates the dynamic topology control problem of satellite cluster networks (SCNs) in Earth observation (EO) missions by applying a novel metric of stability for inter-satellite links (ISLs). The properties of the periodicity and predictability of satellites' relative position are involved in the link cost metric which is to give a selection criterion for choosing the most reliable data routing paths. Also, a cooperative work model with reliability is proposed for the situation of emergency EO missions. Based on the link cost metric and the proposed reliability model, a reliability assurance topology control algorithm and its corresponding dynamic topology control (RAT) strategy are established to maximize the stability of data transmission in the SCNs. The SCNs scenario is tested through some numeric simulations of the topology stability of average topology lifetime and average packet loss rate. Simulation results show that the proposed reliable strategy applied in SCNs significantly improves the data transmission performance and prolongs the average topology lifetime.
Interlaboratory study for the assessment of potential irritative properties of hygiene products on the hamster cheek pouch.

PubMed

Bourrinet, P; Conduzorgues, J P; Dutertre, H; Macabies, J; Masson, P; Maurin, J; Mercier, O

1995-02-01

An interlaboratory study was carried out to determine the feasibility and reliability of a method using the hamster cheek pouch as a model for assessing the potential irritative properties of substances intended to be applied to the lips or other mucous membranes. The test substances were applied once daily to both pouches for 14 consecutive days. Local and general tolerances were appraised throughout the study. At the end of the study, histologic examination of the pouches and the main organs was performed. Results of the feasibility study, conducted on various types of commercial products, indicated that this model is suitable for preparations of various consistence and composition. Results of the reliability study, carried out on gel-type preparations containing various concentrations of a known irritant, sodium lauryl sulfate, indicated that the method elicits a dose-dependent reaction for this compound. This hamster cheek pouch method was reproducible for the various parameters under consideration: local tolerance, general tolerance, histologic examination. For all products, results were in good agreement among the various laboratories participating in the study. The French regulatory authorities of the Fraud Repression Department have accepted it as an official method for the evaluation of the potential irritative properties of cosmetics and hygiene products intended to be applied to the lips or other mucous membranes.
The Infeasibility of Quantifying the Reliability of Life-Critical Real-Time Software

NASA Technical Reports Server (NTRS)

Butler, Ricky W.; Finelli, George B.

1991-01-01

This paper affirms that the quantification of life-critical software reliability is infeasible using statistical methods whether applied to standard software or fault-tolerant software. The classical methods of estimating reliability are shown to lead to exorbitant amounts of testing when applied to life-critical software. Reliability growth models are examined and also shown to be incapable of overcoming the need for excessive amounts of testing. The key assumption of software fault tolerance, that separately programmed versions fail independently, is shown to be problematic. This assumption cannot be justified by experimentation in the ultrareliability region and subjective arguments in its favor are not sufficiently strong to justify it as an axiom. Also, the implications of the recent multiversion software experiments support this affirmation.
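To illustrate why classical estimation leads to very long test campaigns, here is a rough sketch under stated assumptions: a constant failure rate (exponential model) and a test in which no failure is observed. Under that model the total test time needed to claim a rate below lambda0 at confidence C is T = -ln(1 - C) / lambda0. The 1e-9 per hour target and the function name are illustrative assumptions, not figures or code from the report.

```python
import math

HOURS_PER_YEAR = 8766  # average year length, including leap years


def required_test_hours(max_failure_rate, confidence, units=1):
    """Failure-free test hours per unit needed to support the claim
    'failure rate < max_failure_rate' at the given confidence level,
    assuming exponentially distributed failure times."""
    if not 0.0 < confidence < 1.0:
        raise ValueError("confidence must lie strictly between 0 and 1")
    if max_failure_rate <= 0 or units < 1:
        raise ValueError("rate must be positive and units at least 1")
    total_hours = -math.log(1.0 - confidence) / max_failure_rate
    return total_hours / units


# Assumed target: 1e-9 failures per hour at 99% confidence.
hours = required_test_hours(1e-9, 0.99)
print(f"{hours:.3e} hours, about {hours / HOURS_PER_YEAR:,.0f} years on one unit")
```

Running many units in parallel divides the calendar time by the number of units, which still leaves an impractical duration for targets of this order.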
Newly developed double neural network concept for reliable fast plasma position control

NASA Astrophysics Data System (ADS)

Jeon, Young-Mu; Na, Yong-Su; Kim, Myung-Rak; Hwang, Y. S.

2001-01-01

Neural networks are considered as parameter estimation tools in plasma control for next-generation tokamaks such as ITER. Neural networks have been reported to be so accurate and fast for plasma equilibrium identification that they may be applied to the control of complex tokamak plasmas. For this application, the reliability of the conventional neural network needs to be improved. In this study, a new double neural network concept is developed to achieve this. The new concept has been applied to simple plasma position identification in the KSTAR tokamak as a feasibility test. The concept shows higher reliability and fault tolerance even under severe fault conditions, which may make neural networks reliably and widely applicable to plasma control in future tokamaks.
Analysis of key technologies for virtual instruments metrology

NASA Astrophysics Data System (ADS)

Liu, Guixiong; Xu, Qingui; Gao, Furong; Guan, Qiuju; Fang, Qiang

2008-12-01

Virtual instruments (VIs) require metrological verification when applied as measuring instruments. Owing to the software-centered architecture, metrological evaluation of VIs includes two aspects: measurement functions and software characteristics. Complexity of software imposes difficulties on metrological testing of VIs. Key approaches and technologies for metrology evaluation of virtual instruments are investigated and analyzed in this paper. The principal issue is evaluation of measurement uncertainty. The nature and regularity of measurement uncertainty caused by software and algorithms can be evaluated by modeling, simulation, analysis, testing and statistics with support of powerful computing capability of PC. Another concern is evaluation of software features like correctness, reliability, stability, security and real-time of VIs. Technologies from software engineering, software testing and computer security domain can be used for these purposes. For example, a variety of black-box testing, white-box testing and modeling approaches can be used to evaluate the reliability of modules, components, applications and the whole VI software. The security of a VI can be assessed by methods like vulnerability scanning and penetration analysis. In order to facilitate metrology institutions to perform metrological verification of VIs efficiently, an automatic metrological tool for the above validation is essential. Based on technologies of numerical simulation, software testing and system benchmarking, a framework for the automatic tool is proposed in this paper. Investigation on implementation of existing automatic tools that perform calculation of measurement uncertainty, software testing and security assessment demonstrates the feasibility of the automatic framework advanced.
Molecular Risk Factors for Schizophrenia.

PubMed

Modai, Shira; Shomron, Noam

2016-03-01

Schizophrenia (SZ) is a complex and strongly heritable mental disorder, which is also associated with developmental-environmental triggers. As opposed to most diagnosable diseases (yet similar to other mental disorders), SZ diagnosis is commonly based on psychiatric evaluations. Recently, large-scale genetic and epigenetic approaches have been applied to SZ research with the goal of potentially improving diagnosis. Increased computational analyses and applied statistical algorithms may shed some light on the complex genetic and epigenetic pathways contributing to SZ pathogenesis. This review discusses the latest advances in molecular risk factors and diagnostics for SZ. Approaches such as these may lead to a more accurate definition of SZ and assist in creating extended and reliable clinical diagnoses with the potential for personalized treatment. Copyright © 2016 Elsevier Ltd. All rights reserved.
Overall Key Performance Indicator to Optimizing Operation of High-Pressure Homogenizers for a Reliable Quantification of Intracellular Components in Pichia pastoris.

PubMed

Garcia-Ortega, Xavier; Reyes, Cecilia; Montesinos, José Luis; Valero, Francisco

2015-01-01

The most commonly used cell disruption procedures may present lack of reproducibility, which introduces significant errors in the quantification of intracellular components. In this work, an approach consisting of the definition of an overall key performance indicator (KPI) was implemented for a lab scale high-pressure homogenizer (HPH) in order to determine the disruption settings that allow the reliable quantification of a wide range of intracellular components. This innovative KPI was based on the combination of three independent reporting indicators: decrease of absorbance, release of total protein, and release of alkaline phosphatase activity. The yeast Pichia pastoris growing on methanol was selected as model microorganism because it presents an important widening of the cell wall, needing more severe methods and operating conditions than Escherichia coli and Saccharomyces cerevisiae. From the outcome of the reporting indicators, the cell disruption efficiency achieved using HPH was about fourfold higher than other lab standard cell disruption methodologies, such as bead milling or cell permeabilization. This approach was also applied to a pilot plant scale HPH, validating the methodology in a scale-up of the disruption process. This innovative non-complex approach developed to evaluate the efficacy of a disruption procedure or equipment can be easily applied to optimize the most common disruption processes, in order to reach not only reliable quantification but also recovery of intracellular components from cell factories of interest.
Overall Key Performance Indicator to Optimizing Operation of High-Pressure Homogenizers for a Reliable Quantification of Intracellular Components in Pichia pastoris

PubMed Central

Garcia-Ortega, Xavier; Reyes, Cecilia; Montesinos, José Luis; Valero, Francisco

2015-01-01

The most commonly used cell disruption procedures may present lack of reproducibility, which introduces significant errors in the quantification of intracellular components. In this work, an approach consisting of the definition of an overall key performance indicator (KPI) was implemented for a lab scale high-pressure homogenizer (HPH) in order to determine the disruption settings that allow the reliable quantification of a wide range of intracellular components. This innovative KPI was based on the combination of three independent reporting indicators: decrease of absorbance, release of total protein, and release of alkaline phosphatase activity. The yeast Pichia pastoris growing on methanol was selected as model microorganism because it presents an important widening of the cell wall, needing more severe methods and operating conditions than Escherichia coli and Saccharomyces cerevisiae. From the outcome of the reporting indicators, the cell disruption efficiency achieved using HPH was about fourfold higher than other lab standard cell disruption methodologies, such as bead milling or cell permeabilization. This approach was also applied to a pilot plant scale HPH, validating the methodology in a scale-up of the disruption process. This innovative non-complex approach developed to evaluate the efficacy of a disruption procedure or equipment can be easily applied to optimize the most common disruption processes, in order to reach not only reliable quantification but also recovery of intracellular components from cell factories of interest. PMID:26284241
[Reliability of a bibliometric tool used in France for hospital founding].

PubMed

Darmoni, Stefan J; Ladner, Joël; Devos, Patrick; Gehanno, Jean-François

2009-01-01

SIGAPS is a bibliometric score that aims at making an inventory of, evaluating and promoting the scientific publications of hospitals that perform research. It has become a major stake in France since it is one of the most important components of the MERRI (Mission Training, Research, Reference and Innovation) funding of hospitals. This score is based on the points attributed to the authors of articles published in journals indexed in Medline, according to the rank of the authors and the Impact Factor of the journal. The aim was to compare the reliability of the score when applying different ways of computing it, and different weights for the rank or the Impact Factor. We computed the scores of all the physicians of a University Hospital, using the rules that are actually applied at the national level. We then used 4 different scenarios, with different weights given to the rank of authors or the Impact Factor. We compared the scores obtained by each author according to the different scenarios with the Spearman's rank and Pearson's correlation coefficients. The score is not significantly affected when no points are given to the fourth authors and above, when the last author gets more points, or when the points are changed according to the Impact Factor of the journal. The different scenarios do not lead to significant changes for the physicians' scores, and therefore for the cumulated score of the hospital. Despite the well known limits of bibliometric indicators, the SIGAPS score appears reliable to compare the hospitals for funding decisions.
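The comparison of scoring scenarios described above rests on rank and linear correlation between two score lists. The sketch below shows one way such a comparison could be computed with the standard library only; the author scores are hypothetical, and the actual SIGAPS weighting rules are not reproduced here.

```python
import math


def average_ranks(values):
    """Rank values from 1..n, giving tied values their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1  # average of the 1-based positions i+1..j+1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def spearman(x, y):
    """Spearman rank correlation: Pearson correlation of the ranks."""
    return pearson(average_ranks(x), average_ranks(y))


# Hypothetical scores of five authors under two weighting scenarios.
baseline = [120, 45, 300, 80, 15]
scenario = [110, 50, 280, 85, 10]
print(spearman(baseline, scenario), pearson(baseline, scenario))
```

A rank correlation close to 1 would indicate that a change of weights leaves the ordering of authors essentially unchanged, which is the kind of stability the abstract reports.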
Are Validity and Reliability "Relevant" in Qualitative Evaluation Research?

ERIC Educational Resources Information Center

Goodwin, Laura D.; Goodwin, William L.

1984-01-01

The views of prominent qualitative methodologists on the appropriateness of validity and reliability estimation for the measurement strategies employed in qualitative evaluations are summarized. A case is made for the relevance of validity and reliability estimation. Definitions of validity and reliability for qualitative measurement are presented…
Reliability Abstracts and Technical Reviews January - December 1970. Volume 10, Nos. 1-12; R70-14805 - R70-15438

NASA Technical Reports Server (NTRS)

1970-01-01

Reliability Abstracts and Technical Reviews is an abstract and critical analysis service covering published and report literature on reliability. The service is designed to provide information on theory and practice of reliability as applied to aerospace and an objective appraisal of the quality, significance, and applicability of the literature abstracted.

Designing a valid and reliable Likert attitude scale on the generation of electricity from nuclear power plants

DOE Office of Scientific and Technical Information (OSTI.GOV)

Calhoun, L.D.

A 15-step flowchart model was applied to the construction of a 20-item long form and a 6-item short form of the scale. Both scales were field-tested on 829 respondents representing a diverse range of subjects: high school juniors and seniors, nuclear engineering students, pre-service teachers, and members of a citizens action group. Both scales are available for immediate use. The 20-item scale appears to be reliable, content valid, and construct valid. Content validity was examined through factor analysis and the use of two separate juries of nuclear experts. Construct validity was examined by application of the known-groups approach. Scale reliability and homogeneity were evidenced by a 0.93 coefficient alpha, a range of positive interim correlations of 0.15 to 0.73, and a range of adjusted item-total correlations of 0.46 to 0.80. The 20-item scale also has evaluative quality; means ranged from 2.80 to 3.70. Content validity for the 6-item scale was examined by a jury of nuclear experts. An obtained coefficient alpha of 0.82, a range of interim correlations of 0.51 to 0.72 suggest the scale is reliable and homogeneous. The 6-item short form also appears to have evaluative quality; means ranged from 2.37 to 3.18.
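The record above reports reliability as coefficient alpha. A minimal sketch of the usual Cronbach's alpha formula, alpha = k/(k-1) * (1 - sum of item variances / variance of total scores), is given below; the response matrix is hypothetical and the function name is an assumption, not something taken from the report.

```python
import statistics


def cronbach_alpha(scores):
    """Coefficient alpha for a respondents-by-items score matrix.

    scores: list of rows, one per respondent, each with k item scores.
    """
    k = len(scores[0])
    if k < 2 or len(scores) < 2:
        raise ValueError("need at least two items and two respondents")
    item_variances = [
        statistics.variance([row[i] for row in scores]) for i in range(k)
    ]
    total_variance = statistics.variance([sum(row) for row in scores])
    return (k / (k - 1)) * (1 - sum(item_variances) / total_variance)


# Hypothetical 5-point Likert responses: 5 respondents, 3 items.
responses = [
    [4, 5, 4],
    [2, 2, 3],
    [5, 4, 5],
    [3, 3, 3],
    [1, 2, 1],
]
print(round(cronbach_alpha(responses), 3))
```

Values near 1 indicate that the items vary together; the abstract's figures of 0.93 and 0.82 would be read on this scale.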
Quantitative Tumor Segmentation for Evaluation of Extent of Glioblastoma Resection to Facilitate Multisite Clinical Trials

PubMed Central

Cordova, James S; Schreibmann, Eduard; Hadjipanayis, Costas G; Guo, Ying; Shu, Hui-Kuo G; Shim, Hyunsuk; Holder, Chad A

2014-01-01

Standard-of-care therapy for glioblastomas, the most common and aggressive primary adult brain neoplasm, is maximal safe resection, followed by radiation and chemotherapy. Because maximizing resection may be beneficial for these patients, improving tumor extent of resection (EOR) with methods such as intraoperative 5-aminolevulinic acid fluorescence-guided surgery (FGS) is currently under evaluation. However, it is difficult to reproducibly judge EOR in these studies due to the lack of reliable tumor segmentation methods, especially for postoperative magnetic resonance imaging (MRI) scans. Therefore, a reliable, easily distributable segmentation method is needed to permit valid comparison, especially across multiple sites. We report a segmentation method that combines versatile region-of-interest blob generation with automated clustering methods. We applied this to glioblastoma cases undergoing FGS and matched controls to illustrate the method's reliability and accuracy. Agreement and interrater variability between segmentations were assessed using the concordance correlation coefficient, and spatial accuracy was determined using the Dice similarity index and mean Euclidean distance. Fuzzy C-means clustering with three classes was the best performing method, generating volumes with high agreement with manual contouring and high interrater agreement preoperatively and postoperatively. The proposed segmentation method allows tumor volume measurements of contrast-enhanced T1-weighted images in the unbiased, reproducible fashion necessary for quantifying EOR in multicenter trials. PMID:24772206
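Spatial accuracy in the record above is reported with the Dice similarity index. As a hedged illustration, the index for two binary segmentations A and B is 2|A ∩ B| / (|A| + |B|); the sketch below represents each mask as a set of voxel coordinates, which is a simplification chosen for this example rather than the representation used in the study.

```python
def dice_index(mask_a, mask_b):
    """Dice similarity index of two masks given as sets of voxel coordinates."""
    a, b = set(mask_a), set(mask_b)
    if not a and not b:
        # Convention assumed here: two empty masks agree perfectly.
        return 1.0
    return 2.0 * len(a & b) / (len(a) + len(b))


# Hypothetical 2-D masks: a manual contour and an automated segmentation.
manual = {(0, 0), (0, 1), (1, 0), (1, 1)}
automated = {(0, 1), (1, 0), (1, 1), (2, 1)}
print(dice_index(manual, automated))
```

A value of 1 means the two volumes coincide and 0 means they do not overlap at all.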
A novel HMM distributed classifier for the detection of gait phases by means of a wearable inertial sensor network.

PubMed

Taborri, Juri; Rossi, Stefano; Palermo, Eduardo; Patanè, Fabrizio; Cappa, Paolo

2014-09-02

In this work, we decided to apply a hierarchical weighted decision, proposed and used in other research fields, for the recognition of gait phases. The developed and validated novel distributed classifier is based on hierarchical weighted decision from outputs of scalar Hidden Markov Models (HMM) applied to angular velocities of foot, shank, and thigh. The angular velocities of ten healthy subjects were acquired via three uni-axial gyroscopes embedded in inertial measurement units (IMUs) during one walking task, repeated three times, on a treadmill. After validating the novel distributed classifier and the scalar and vectorial classifiers already proposed in the literature with a cross-validation, the classifiers were compared for sensitivity, specificity, and computational load for all combinations of the three targeted anatomical segments. Moreover, the performance of the novel distributed classifier in the estimation of gait variability in terms of mean time and coefficient of variation was evaluated. The highest values of specificity and sensitivity (>0.98) for the three classifiers examined here were obtained when the angular velocity of the foot was processed. Distributed and vectorial classifiers reached acceptable values (>0.95) when the angular velocity of shank and thigh were analyzed. Distributed and scalar classifiers showed values of computational load about 100 times lower than the one obtained with the vectorial classifier. In addition, distributed classifiers showed an excellent reliability for the evaluation of mean time and a good/excellent reliability for the coefficient of variation. In conclusion, due to its better performance and small computational load, the novel distributed classifier proposed here can be implemented in real-time applications of gait phase recognition, such as evaluating gait variability in patients or controlling active orthoses for the recovery of mobility of lower limb joints.
Measuring ability to assess claims about treatment effects: a latent trait analysis of items from the ‘Claim Evaluation Tools’ database using Rasch modelling

PubMed Central

Austvoll-Dahlgren, Astrid; Guttersrud, Øystein; Nsangi, Allen; Semakula, Daniel; Oxman, Andrew D

2017-01-01

Background: The Claim Evaluation Tools database contains multiple-choice items for measuring people’s ability to apply the key concepts they need to know to be able to assess treatment claims. We assessed items from the database using Rasch analysis to develop an outcome measure to be used in two randomised trials in Uganda. Rasch analysis is a form of psychometric testing relying on Item Response Theory. It is a dynamic way of developing outcome measures that are valid and reliable. Objectives: To assess the validity, reliability and responsiveness of 88 items addressing 22 key concepts using Rasch analysis. Participants: We administered four sets of multiple-choice items in English to 1114 people in Uganda and Norway, of which 685 were children and 429 were adults (including 171 health professionals). We scored all items dichotomously. We explored summary and individual fit statistics using the RUMM2030 analysis package. We used SPSS to perform distractor analysis. Results: Most items conformed well to the Rasch model, but some items needed revision. Overall, the four item sets had satisfactory reliability. We did not identify significant response dependence between any pairs of items and, overall, the magnitude of multidimensionality in the data was acceptable. The items had a high level of difficulty. Conclusion: Most of the items conformed well to the Rasch model’s expectations. Following revision of some items, we concluded that most of the items were suitable for use in an outcome measure for evaluating the ability of children or adults to assess treatment claims. PMID:28550019
Application and Evaluation of an Expert Judgment Elicitation Procedure for Correlations.

PubMed

Zondervan-Zwijnenburg, Mariëlle; van de Schoot-Hubeek, Wenneke; Lek, Kimberley; Hoijtink, Herbert; van de Schoot, Rens

2017-01-01

The purpose of the current study was to apply and evaluate a procedure to elicit expert judgments about correlations, and to update this information with empirical data. The result is a face-to-face group elicitation procedure with as its central element a trial roulette question that elicits experts' judgments expressed as distributions. During the elicitation procedure, a concordance probability question was used to provide feedback to the experts on their judgments. We evaluated the elicitation procedure in terms of validity and reliability by means of an application with a small sample of experts. Validity means that the elicited distributions accurately represent the experts' judgments. Reliability concerns the consistency of the elicited judgments over time. Four behavioral scientists provided their judgments with respect to the correlation between cognitive potential and academic performance for two separate populations enrolled at a specific school in the Netherlands that provides special education to youth with severe behavioral problems: youth with autism spectrum disorder (ASD), and youth with diagnoses other than ASD. Measures of face-validity, feasibility, convergent validity, coherence, and intra-rater reliability showed promising results. Furthermore, the current study illustrates the use of the elicitation procedure and elicited distributions in a social science application. The elicited distributions were used as a prior for the correlation, and updated with data for both populations collected at the school of interest. The current study shows that the newly developed elicitation procedure combining the trial roulette method with the elicitation of correlations is a promising tool, and that the results of the procedure are useful as prior information in a Bayesian analysis.
Reliability and precision of stress sonography of the ulnar collateral ligament.

PubMed

Bica, David; Armen, Joseph; Kulas, Anthony S; Youngs, Kevin; Womack, Zachary

2015-03-01

Musculoskeletal sonography has emerged as an additional diagnostic tool that can be used to assess medial elbow pain and laxity in overhead throwers. It provides a dynamic, rapid, and noninvasive modality in the evaluation of ligamentous structural integrity. Many studies have demonstrated the utility of dynamic sonography for medial elbow and ulnar collateral ligament (UCL) integrity. However, evaluating the reliability and precision of these measurements is critical if sonography is ultimately used as a clinical diagnostic tool. The purpose of this study was to evaluate the reliability and precision of stress sonography applied to the medial elbow. We conducted a cross-sectional study during the 2011 baseball off-season. Eighteen National Collegiate Athletic Association Division I pitchers were enrolled, and 36 elbows were studied. Using sonography, the medial elbow was assessed, and measurements of the UCL length and ulnohumeral joint gapping were performed twice under two conditions (unloaded and loaded) and bilaterally. Intraclass correlation coefficients (0.72-0.94) and standard errors of measurements (0.3-0.9 mm) for UCL length and ulnohumeral joint gapping were good to excellent. Mean differences between unloaded and loaded conditions for the dominant arms were 1.3 mm (gapping; P < .001) and 1.4 mm (UCL length; P < .001). Medial elbow stress sonography is a reliable and precise method for detecting changes in ulnohumeral joint gapping and UCL lengthening. Ultimately, this method may provide clinicians valuable information regarding the medial elbow's response to valgus loading and may help guide treatment options. © 2015 by the American Institute of Ultrasound in Medicine.
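The abstract reports both intraclass correlation coefficients and standard errors of measurement. One common textbook relation between the two is SEM = SD × sqrt(1 − ICC); the sketch below applies it. The abstract does not say which formula the authors used, and the input numbers are hypothetical, so this is only an illustration of the relation.

```python
import math


def standard_error_of_measurement(sd: float, icc: float) -> float:
    """SEM = SD * sqrt(1 - ICC), a common definition (assumed here, not taken from the study)."""
    if sd < 0.0:
        raise ValueError("sd must be non-negative")
    if not 0.0 <= icc <= 1.0:
        raise ValueError("icc must lie in [0, 1]")
    return sd * math.sqrt(1.0 - icc)


# Hypothetical values: between-subject SD of 2.0 mm and an ICC of 0.91.
sem = standard_error_of_measurement(sd=2.0, icc=0.91)
```

With these inputs the SEM is about 0.6 mm; a higher ICC at the same SD gives a smaller SEM.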
Evaluation of reliability modeling tools for advanced fault tolerant systems

NASA Technical Reports Server (NTRS)

Baker, Robert; Scheper, Charlotte

1986-01-01

The Computer Aided Reliability Estimation (CARE III) and Automated Reliability Interactive Estimation System (ARIES 82) reliability tools for application to advanced fault tolerant aerospace systems were evaluated. To determine reliability modeling requirements, the evaluation focused on the Draper Laboratories' Advanced Information Processing System (AIPS) architecture as an example architecture for fault tolerant aerospace systems. Advantages and limitations were identified for each reliability evaluation tool. The CARE III program was designed primarily for analyzing ultrareliable flight control systems. The ARIES 82 program's primary use was to support university research and teaching. Neither CARE III nor ARIES 82 was suited for determining the reliability of complex nodal networks of the type used to interconnect processing sites in the AIPS architecture. It was concluded that ARIES was not suitable for modeling advanced fault tolerant systems. It was further concluded that, subject to some limitations (the difficulty in modeling systems with unpowered spare modules, systems where equipment maintenance must be considered, systems where failure depends on the sequence in which faults occurred, and systems where multiple faults beyond double near-coincident faults must be considered), CARE III is best suited for evaluating the reliability of advanced fault tolerant systems for air transport.
Benchmarking the efficiency of the Chilean water and sewerage companies: a double-bootstrap approach.

PubMed

Molinos-Senante, María; Donoso, Guillermo; Sala-Garrido, Ramon; Villegas, Andrés

2018-03-01

Benchmarking the efficiency of water companies is essential to set water tariffs and to promote their sustainability. In doing so, most previous studies have applied conventional data envelopment analysis (DEA) models. However, DEA is a deterministic method that does not allow the identification of environmental factors influencing efficiency scores. To overcome this limitation, this paper evaluates the efficiency of a sample of Chilean water and sewerage companies by applying a double-bootstrap DEA model. Results showed that the ranking of water and sewerage companies changes notably depending on whether efficiency scores are computed with conventional or double-bootstrap DEA models. Moreover, it was found that the percentage of non-revenue water and customer density are factors influencing the efficiency of Chilean water and sewerage companies. This paper illustrates the importance of using a robust and reliable method to increase the relevance of benchmarking tools.
Reliability analysis applied to structural tests

NASA Technical Reports Server (NTRS)

Diamond, P.; Payne, A. O.

1972-01-01

The application of reliability theory to predict, from structural fatigue test data, the risk of failure of a structure under service conditions because its load-carrying capability is progressively reduced by the extension of a fatigue crack, is considered. The procedure is applicable to both safe-life and fail-safe structures and, for a prescribed safety level, it will enable an inspection procedure to be planned or, if inspection is not feasible, it will evaluate the life to replacement. The theory has been further developed to cope with the case of structures with initial cracks, such as can occur in modern high-strength materials which are susceptible to the formation of small flaws during the production process. The method has been applied to a structure of high-strength steel and the results are compared with those obtained by the current life estimation procedures. This has shown that the conventional methods can be unconservative in certain cases, depending on the characteristics of the structure and the design operating conditions. The suitability of the probabilistic approach to the interpretation of the results from full-scale fatigue testing of aircraft structures is discussed and the assumptions involved are examined.
POF-IMU sensor system: A fusion between inertial measurement units and POF sensors for low-cost and highly reliable systems

NASA Astrophysics Data System (ADS)

Leal-Junior, Arnaldo G.; Vargas-Valencia, Laura; dos Santos, Wilian M.; Schneider, Felipe B. A.; Siqueira, Adriano A. G.; Pontes, Maria José; Frizera, Anselmo

2018-07-01

This paper presents a low cost and highly reliable system for angle measurement based on a sensor fusion between inertial and fiber optic sensors. The system consists of the sensor fusion through Kalman filter of two inertial measurement units (IMUs) and an intensity variation-based polymer optical fiber (POF) curvature sensor. In addition, the IMU was applied as a reference for a compensation technique of POF curvature sensor hysteresis. The proposed system was applied on the knee angle measurement of a lower limb exoskeleton in flexion/extension cycles and in gait analysis. Results show the accuracy of the system, where the Root Mean Square Error (RMSE) between the POF-IMU sensor system and the encoder was below 4° in the worst case and about 1° in the best case. Then, the POF-IMU sensor system was evaluated as a wearable sensor for knee joint angle assessment without the exoskeleton, where its suitability for this purpose was demonstrated. The results obtained in this paper pave the way for future applications of sensor fusion between electronic and fiber optic sensors in movement analysis.
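The accuracy figure quoted above is a root mean square error between the fused sensor output and the encoder reference. A minimal sketch of that metric follows; the function name and the sample angle values are made up for illustration and are not data from the paper.

```python
import math


def rmse(estimates: list[float], reference: list[float]) -> float:
    """Root mean square error between two equally long angle series (degrees)."""
    if len(estimates) != len(reference) or not estimates:
        raise ValueError("series must be non-empty and of equal length")
    squared = [(e - r) ** 2 for e, r in zip(estimates, reference)]
    return math.sqrt(sum(squared) / len(squared))


# Hypothetical knee angles in degrees: fused sensor output vs. encoder reference.
sensor_deg = [10.0, 20.5, 31.0, 39.0]
encoder_deg = [10.0, 20.0, 30.0, 40.0]
error_deg = rmse(sensor_deg, encoder_deg)
```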
Reliable and energy-efficient communications for wireless biomedical implant systems.

PubMed

Ntouni, Georgia D; Lioumpas, Athanasios S; Nikita, Konstantina S

2014-11-01

Implant devices are used to measure biological parameters and transmit their results to remote off-body devices. As implants are characterized by strict requirements on size, reliability, and power consumption, applying the concept of cooperative communications to wireless body area networks offers several benefits. In this paper, we aim to minimize the power consumption of the implant device by utilizing on-body wearable devices, while providing the necessary reliability in terms of outage probability and bit error rate. Taking into account realistic power considerations and wireless propagation environments based on the IEEE P802.15 channel model, an exact theoretical analysis is conducted for evaluating several communication scenarios with respect to the position of the wearable device and the motion of the human body. The derived closed-form expressions are employed toward minimizing the required transmission power, subject to a minimum quality-of-service requirement. In this way, the complexity and power consumption are transferred from the implant device to the on-body relay, which is an efficient approach since on-body relays can be easily replaced, in contrast to the in-body implants.
Brazilian version of the Nottingham Sensory Assessment: validity, agreement and reliability.

PubMed

Lima, Daniela H F; Queiroz, Ana P; De Salvo, Geovana; Yoneyama, Simone M; Oberg, Telma D; Lima, Núbia M F V

2010-01-01

To investigate the inter-rater and intra-rater reliability, construct validity and internal consistency of the Brazilian version of the Nottingham Sensory Assessment for Stroke Patients (NSA). The instrument was translated into Portuguese from its original in English by a bilingual translator and was then back-translated into English. Twenty-one hemiparetics were evaluated by two examiners using the NSA and the Fugl-Meyer Assessment (FMA) of physical performance. A significant correlation was found between the FMA and the NSA (r=0.752). The NSA showed excellent internal consistency (0.86), and there was acceptable inter- and intra-rater reliability for all items of the NSA, except temperature. Significant ceiling effects were found for the NSA and the FMA. The Brazilian version of the NSA met the criteria for agreement, internal consistency and concurrent validity. It was quick and easy to apply, and it could be used within clinical practice in neuro-rehabilitation outpatient clinics to assess sensory functions following stroke. The significant ceiling effect for the NSA did not limit its use, given that for the same patients, the FMA also showed ceiling effects.
Validation of the Fatigue Impact Scale in Hungarian patients with multiple sclerosis.

PubMed

Losonczi, Erika; Bencsik, Krisztina; Rajda, Cecília; Lencsés, Gyula; Török, Margit; Vécsei, László

2011-03-01

Fatigue is one of the most frequent complaints of patients with multiple sclerosis (MS). The Fatigue Impact Scale (FIS), one of the 30 available fatigue questionnaires, is commonly applied because it evaluates multidimensional aspects of fatigue. The main purposes of this study were to test the validity, test-retest reliability, and internal consistency of the Hungarian version of the FIS. One hundred and eleven MS patients and 85 healthy control (HC) subjects completed the FIS and the Beck Depression Inventory, a large majority of them on two occasions, 3 months apart. The total FIS score and subscale scores differed statistically between the MS patients and the HC subjects in both FIS sessions. In the test-retest reliability assessment, statistically, the intraclass correlation coefficients were high in both the MS and HC groups. Cronbach's alpha values were also notably high. The results of this study indicate that the FIS can be regarded as a valid and reliable scale with which to improve our understanding of the impact of fatigue on the health-related quality of life in MS patients without severe disability.
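Internal consistency in studies like this one is usually summarized with Cronbach's alpha. The sketch below computes alpha from a respondents-by-items score table using the standard formula alpha = k/(k−1) × (1 − sum of item variances / variance of total scores). The scores are invented for the example and have nothing to do with the FIS data.

```python
from statistics import variance


def cronbach_alpha(scores: list[list[float]]) -> float:
    """Cronbach's alpha for a table with one row per respondent and one column per item."""
    n_items = len(scores[0])
    if n_items < 2 or len(scores) < 2:
        raise ValueError("need at least two items and two respondents")
    # Sample variance of each item (column) across respondents.
    item_variances = [variance([row[j] for row in scores]) for j in range(n_items)]
    # Sample variance of each respondent's total score.
    total_variance = variance([sum(row) for row in scores])
    return (n_items / (n_items - 1)) * (1.0 - sum(item_variances) / total_variance)


# Hypothetical responses: 5 respondents answering 3 items on a 1-5 scale.
responses = [
    [2, 3, 3],
    [4, 4, 5],
    [1, 2, 2],
    [5, 5, 4],
    [3, 3, 4],
]
alpha = cronbach_alpha(responses)
```

For this toy table alpha is about 0.93, because the three items rise and fall together across respondents.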
Improved ultrasonic standard reference blocks

NASA Technical Reports Server (NTRS)

Eitzen, D. G.

1975-01-01

A program to improve the quality, reproducibility and reliability of nondestructive testing through the development of improved ASTM-type ultrasonic reference standards is described. Reference blocks of aluminum, steel, and titanium alloys were considered. Equipment representing the state-of-the-art in laboratory and field ultrasonic equipment was obtained and evaluated. Some RF and spectral data on ten sets of ultrasonic reference blocks were taken as part of a task to quantify the variability in response from nominally identical blocks. Techniques for residual stress, preferred orientation, and microstructural measurements were refined and are applied to a reference block rejected by the manufacturer during fabrication in order to evaluate the effect of metallurgical condition on block response.
Application of a truncated normal failure distribution in reliability testing

NASA Technical Reports Server (NTRS)

Groves, C., Jr.

1968-01-01

A statistical truncated normal distribution function is applied as a time-to-failure distribution function in equipment reliability estimations. Age-dependent characteristics of the truncated function provide a basis for formulating a system of high-reliability testing that effectively merges statistical, engineering, and cost considerations.
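To make the idea of a truncated normal time-to-failure model concrete, the sketch below evaluates the reliability function and the hazard rate for a normal distribution truncated on the left at time zero. The parameterization (mu and sigma of the untruncated normal) and the example values are assumptions for illustration; the report's own formulation is not given in the abstract.

```python
import math


def _normal_pdf(z: float) -> float:
    return math.exp(-0.5 * z * z) / math.sqrt(2.0 * math.pi)


def _normal_cdf(z: float) -> float:
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def reliability(t: float, mu: float, sigma: float) -> float:
    """P(failure time > t) for a normal(mu, sigma) truncated on the left at t = 0."""
    if t <= 0.0:
        return 1.0
    return (1.0 - _normal_cdf((t - mu) / sigma)) / (1.0 - _normal_cdf(-mu / sigma))


def hazard(t: float, mu: float, sigma: float) -> float:
    """Instantaneous failure rate at age t >= 0; the truncation constant cancels out."""
    z = (t - mu) / sigma
    return _normal_pdf(z) / (sigma * (1.0 - _normal_cdf(z)))


# Hypothetical equipment: mean life 1000 h, standard deviation 300 h.
r_mid = reliability(1000.0, mu=1000.0, sigma=300.0)
rates = [hazard(t, mu=1000.0, sigma=300.0) for t in (500.0, 1000.0, 1500.0)]
```

The hazard rate rises with age, which is the age-dependent behavior the abstract refers to.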
77 FR 69615 - Commission Information Collection Activities (FERC-715); Comment Request

Federal Register 2010, 2011, 2012, 2013, 2014

2012-11-20

... transmission planning; A detailed description of the transmission planning reliability criteria used to..., but not limited to, how reliability criteria are applied and the steps taken in [[Page 69616... performance as measured against its stated reliability criteria using its stated assessment practices. The...
Blinded evaluation of interrater reliability of an operative competency assessment tool for direct laryngoscopy and rigid bronchoscopy.

PubMed

Ishman, Stacey L; Benke, James R; Johnson, Kaalan Erik; Zur, Karen B; Jacobs, Ian N; Thorne, Marc C; Brown, David J; Lin, Sandra Y; Bhatti, Nasir; Deutsch, Ellen S

2012-10-01

OBJECTIVES To confirm interrater reliability using blinded evaluation of a skills-assessment instrument to assess the surgical performance of resident and fellow trainees performing pediatric direct laryngoscopy and rigid bronchoscopy in simulated models. DESIGN Prospective, paired, blinded observational validation study. SUBJECTS Paired observers from multiple institutions simultaneously evaluated residents and fellows who were performing surgery in an animal laboratory or using high-fidelity manikins. The evaluators had no previous affiliation with the residents and fellows and did not know their year of training. INTERVENTIONS One- and 2-page versions of an objective structured assessment of technical skills (OSATS) assessment instrument composed of global and task-specific surgical items were used to evaluate surgical performance. RESULTS Fifty-two evaluations were completed by 17 attending evaluators. The instrument agreement for the 2-page assessment was 71.4% when measured as a binary variable (ie, competent vs not competent) (κ = 0.38; P = .08). Evaluation as a continuous variable revealed a 42.9% percentage agreement (κ = 0.18; P = .14). The intraclass correlation was 0.53, considered substantial/good interrater reliability (69% reliable). For the 1-page instrument, agreement was 77.4% when measured as a binary variable (κ = 0.53, P = .0015). Agreement when evaluated as a continuous measure was 71.0% (κ = 0.54, P < .001). The intraclass correlation was 0.73, considered high interrater reliability (85% reliable). CONCLUSIONS The OSATS assessment instrument is an effective tool for evaluating surgical performance among trainees with acceptable interrater reliability in a simulator setting. Reliability was good for both the 1- and 2-page OSATS checklists, and both serve as excellent tools to provide immediate formative feedback on operational competency.
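The results above pair raw percentage agreement with Cohen's kappa, which corrects agreement for what two raters would reach by chance. The sketch below computes both for a binary competent / not competent rating. The ratings are invented for the example and are not the study's data.

```python
from collections import Counter


def percent_agreement(rater_a: list, rater_b: list) -> float:
    """Share of cases on which the two raters gave the same rating."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("ratings must be non-empty and of equal length")
    return sum(a == b for a, b in zip(rater_a, rater_b)) / len(rater_a)


def cohens_kappa(rater_a: list, rater_b: list) -> float:
    """Cohen's kappa: (observed - chance agreement) / (1 - chance agreement)."""
    n = len(rater_a)
    observed = percent_agreement(rater_a, rater_b)
    counts_a, counts_b = Counter(rater_a), Counter(rater_b)
    categories = set(counts_a) | set(counts_b)
    expected = sum(counts_a[c] * counts_b[c] for c in categories) / (n * n)
    if expected == 1.0:
        raise ValueError("kappa is undefined when chance agreement is 1")
    return (observed - expected) / (1.0 - expected)


# Hypothetical ratings of 10 trainees: 1 = competent, 0 = not competent.
rater_1 = [1, 1, 1, 0, 0, 1, 0, 1, 1, 0]
rater_2 = [1, 1, 0, 0, 0, 1, 1, 1, 1, 0]
agreement = percent_agreement(rater_1, rater_2)
kappa = cohens_kappa(rater_1, rater_2)
```

Here the raters agree on 80% of cases, yet kappa is only about 0.58, which shows why the two figures in the abstract can differ so much.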
A New Tool for Nutrition App Quality Evaluation (AQEL): Development, Validation, and Reliability Testing.

PubMed

DiFilippo, Kristen Nicole; Huang, Wenhao; Chapman-Novakofski, Karen M

2017-10-27

The extensive availability and increasing use of mobile apps for nutrition-based health interventions makes evaluation of the quality of these apps crucial for integration of apps into nutritional counseling. The goal of this research was the development, validation, and reliability testing of the app quality evaluation (AQEL) tool, an instrument for evaluating apps' educational quality and technical functionality. Items for evaluating app quality were adapted from website evaluations, with additional items added to evaluate the specific characteristics of apps, resulting in 79 initial items. Expert panels of nutrition and technology professionals and app users reviewed items for face and content validation. After recommended revisions, nutrition experts completed a second AQEL review to ensure clarity. On the basis of 150 sets of responses using the revised AQEL, principal component analysis was completed, reducing AQEL into 5 factors that underwent reliability testing, including internal consistency, split-half reliability, test-retest reliability, and interrater reliability (IRR). Two additional modifiable constructs for evaluating apps based on the age and needs of the target audience as selected by the evaluator were also tested for construct reliability. IRR testing using intraclass correlations (ICC) with all 7 constructs was conducted, with 15 dietitians evaluating one app. Development and validation resulted in the 51-item AQEL. These were reduced to 25 items in 5 factors after principal component analysis, plus 9 modifiable items in two constructs that were not included in principal component analysis. Internal consistency and split-half reliability of the following constructs derived from principal components analysis were good (Cronbach alpha >.80, Spearman-Brown coefficient >.80): behavior change potential, support of knowledge acquisition, app function, and skill development. App purpose split-half reliability was .65.
Test-retest reliability showed no significant change over time (P>.05) for all but skill development (P=.001). Construct reliability was good for items assessing age appropriateness of apps for children, teens, and a general audience. In addition, construct reliability was acceptable for assessing app appropriateness for various target audiences (Cronbach alpha >.70). For the 5 main factors, ICC (1,k) was >.80, with a P value of <.05. When 15 nutrition professionals evaluated one app, ICC (2,15) was .98, with a P value of <.001 for all 7 constructs when the modifiable items were specified for adults seeking weight loss support. Our preliminary effort shows that AQEL is a valid, reliable instrument for evaluating nutrition apps' qualities for clinical interventions by nutrition clinicians, educators, and researchers. Further efforts in validating AQEL in various contexts are needed. ©Kristen Nicole DiFilippo, Wenhao Huang, Karen M. Chapman-Novakofski. Originally published in JMIR Mhealth and Uhealth (http://mhealth.jmir.org), 27.10.2017.
Assessing the Analytical Performance of Systems for Self-Monitoring of Blood Glucose: Concepts of Performance Evaluation and Definition of Metrological Key Terms

PubMed Central

Schnell, Oliver; Hinzmann, Rolf; Kulzer, Bernd; Freckmann, Guido; Erbach, Michael; Lodwig, Volker; Heinemann, Lutz

2013-01-01

Reliability of blood glucose (BG) measurements is a prerequisite for successful diabetes management. Publications on the evaluation of self-monitored glucose values, however, are frequently characterized by a confusion in terminology. We provide an inventory of key terms such as accuracy, trueness, precision, traceability, calibration, and matrix effect to avoid future misunderstanding. Definitions are taken from the metrological literature and international norms and explained in a language intended for nonspecialists in metrology. The terms are presented in light of the need to apply generally accepted definitions. In addition, a description of requirements and components for a sound evaluation of BG measurement systems is presented. These factors will also enable improvement in future comparisons of study results. PMID:24351185
Fuzzy Comprehensive Evaluation Method Applied in the Real Estate Investment Risks Research

NASA Astrophysics Data System (ADS)

Zhang, Minli; Yang, Wenpo

Real estate investment is a high-risk, high-return economic activity; the key to real estate analysis is identifying the types of investment risk and effectively guarding against each type. As the financial crisis sweeps the world, the real estate industry also faces enormous risks, and how to evaluate real estate investment risks effectively and correctly has become a concern of many scholars[1]. In this paper, real estate investment risks are summarized and analyzed, comparative analysis methods are discussed, and finally a fuzzy comprehensive evaluation method is presented. The method is scientifically sound in theory and reliable in application; it provides an effective means of real estate investment risk assessment and gives investors guidance on risk factors and forecasts.
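In its simplest form, a fuzzy comprehensive evaluation combines a weight vector over risk factors with a membership matrix that says how strongly each factor belongs to each risk grade. The sketch below uses the weighted-average operator (B = W · R) and picks the grade with the largest membership. The factors, weights, memberships, and the choice of operator are all hypothetical; the abstract does not state which ones the authors used.

```python
def fuzzy_comprehensive_evaluation(
    weights: list[float], membership: list[list[float]]
) -> list[float]:
    """Weighted-average composition B_j = sum_i weights[i] * membership[i][j]."""
    if len(weights) != len(membership):
        raise ValueError("one weight is needed per factor (row of the membership matrix)")
    n_grades = len(membership[0])
    return [
        sum(w * row[j] for w, row in zip(weights, membership))
        for j in range(n_grades)
    ]


grades = ["low", "medium", "high"]
# Hypothetical factor weights (policy risk, market risk, financing risk); they sum to 1.
weights = [0.40, 0.35, 0.25]
# Hypothetical membership of each factor in each grade; every row sums to 1.
membership = [
    [0.1, 0.3, 0.6],
    [0.2, 0.5, 0.3],
    [0.5, 0.4, 0.1],
]
result = fuzzy_comprehensive_evaluation(weights, membership)
overall_grade = grades[result.index(max(result))]
```

With these inputs the result vector is roughly (0.235, 0.395, 0.370), so the maximum-membership rule assigns the grade "medium".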

SPSS Macros for Assessing the Reliability and Agreement of Student Evaluations of Teaching

ERIC Educational Resources Information Center

Morley, Donald D.

2009-01-01

This article reports and demonstrates two SPSS macros for calculating Krippendorff's alpha and intraclass reliability coefficients in repetitive situations where numerous coefficients are needed. Specifically, the reported SPSS macros were used to evaluate the interrater agreement and reliability of student evaluations of teaching in thousands of…
Evaluation methodologies for an advanced information processing system

NASA Technical Reports Server (NTRS)

Schabowsky, R. S., Jr.; Gai, E.; Walker, B. K.; Lala, J. H.; Motyka, P.

1984-01-01

The system concept and requirements for an Advanced Information Processing System (AIPS) are briefly described, but the emphasis of this paper is on the evaluation methodologies being developed and utilized in the AIPS program. The evaluation tasks include hardware reliability, maintainability and availability, software reliability, performance, and performability. Hardware RMA and software reliability are addressed with Markov modeling techniques. The performance analysis for AIPS is based on queueing theory. Performability is a measure of merit which combines system reliability and performance measures. The probability laws of the performance measures are obtained from the Markov reliability models. Scalar functions of this law such as the mean and variance provide measures of merit in the AIPS performability evaluations.
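The abstract notes that hardware reliability, maintainability and availability are addressed with Markov models. The simplest model of that kind is a single repairable unit with two states (up and down) and constant failure and repair rates, for which the availability has a closed form. The sketch below is only that textbook case with made-up rates; it is not the AIPS model.

```python
import math


def availability(t: float, failure_rate: float, repair_rate: float) -> float:
    """Probability that a unit starting in the up state is up at time t.

    Two-state continuous-time Markov chain:
    A(t) = mu/(lam+mu) + lam/(lam+mu) * exp(-(lam+mu) * t)
    """
    total = failure_rate + repair_rate
    if total <= 0.0:
        raise ValueError("at least one rate must be positive")
    steady_state = repair_rate / total
    return steady_state + (failure_rate / total) * math.exp(-total * t)


# Hypothetical rates per hour: one failure per 1000 h, one repair per 10 h.
lam, mu = 1e-3, 1e-1
a_start = availability(0.0, lam, mu)
a_long_run = availability(1e6, lam, mu)
```

The long-run value tends to mu / (lam + mu), about 0.990 for these rates. Setting the repair rate to zero reduces the same expression to the usual exponential reliability exp(-lam * t).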
Parents' self-efficacy, outcome expectations, and self-reported task performance when managing atopic dermatitis in children: instrument reliability and validity.

PubMed

Mitchell, Amy E; Fraser, Jennifer A

2011-02-01

Support and education for parents faced with managing a child with atopic dermatitis is crucial to the success of current treatments. Interventions aiming to improve parent management of this condition are promising. Unfortunately, evaluation is hampered by lack of precise research tools to measure change. To develop a suite of valid and reliable research instruments to appraise parents' self-efficacy for performing atopic dermatitis management tasks; outcome expectations of performing management tasks; and self-reported task performance in a community sample of parents of children with atopic dermatitis. The Parents' Eczema Management Scale (PEMS) and the Parents' Outcome Expectations of Eczema Management Scale (POEEMS) were developed from an existing self-efficacy scale, the Parental Self-Efficacy with Eczema Care Index (PASECI). Each scale was presented in a single self-administered questionnaire, to measure self-efficacy, outcome expectations, and self-reported task performance related to managing child atopic dermatitis. Each was tested with a community sample of parents of children with atopic dermatitis, and psychometric evaluation of the scales' reliability and validity was conducted. A community-based convenience sample of 120 parents of children with atopic dermatitis completed the self-administered questionnaire. Participants were recruited through schools across Australia. Satisfactory internal consistency and test-retest reliability was demonstrated for all three scales. Construct validity was satisfactory, with positive relationships between self-efficacy for managing atopic dermatitis and general perceived self-efficacy; self-efficacy for managing atopic dermatitis and self-reported task performance; and self-efficacy for managing atopic dermatitis and outcome expectations. 
Factor analyses revealed two-factor structures for PEMS and PASECI alike, with both scales containing factors related to performing routine management tasks, and managing the child's symptoms and behaviour. Factor analysis was also applied to POEEMS resulting in a three-factor structure. Factors relating to independent management of atopic dermatitis by the parent, involving healthcare professionals in management, and involving the child in the management of atopic dermatitis were found. Parents' self-efficacy and outcome expectations had a significant influence on self-reported task performance. Findings suggest that PEMS and POEEMS are valid and reliable instruments worthy of further psychometric evaluation. Likewise, validity and reliability of PASECI was confirmed. Copyright © 2010 Elsevier Ltd. All rights reserved.
Radiological Determination of Postoperative Cervical Fusion: A Systematic Review.

PubMed

Rhee, John M; Chapman, Jens R; Norvell, Daniel C; Smith, Justin; Sherry, Ned A; Riew, K Daniel

2015-07-01

Systematic review. To determine best criteria for radiological determination of postoperative subaxial cervical fusion to be applied to current clinical practice and ongoing future research assessing fusion to standardize assessment and improve comparability. Despite availability of multiple imaging modalities and criteria, there remains no method of determining cervical fusion with absolute certainty, nor clear consensus on specific criteria to be applied. A systematic search in MEDLINE/Cochrane Collaboration Library (through March 2014). Included studies assessed C2 to C7 via anterior or posterior approach, at 12 weeks or more postoperative, with any graft or implant. Overall body of evidence with respect to 6 posited key questions was determined using Grading of Recommendations Assessment, Development and Evaluation and Agency for Healthcare Research and Quality precepts. Of plain radiographical modalities, there is moderate evidence that the interspinous process motion method (<1 mm) is more accurate than the Cobb angle method for assessing anterior cervical fusion. Of the advanced imaging modalities, there is moderate evidence that computed tomography (CT) is more accurate and reliable than magnetic resonance imaging in assessing anterior cervical fusion. There is insufficient evidence regarding the optimal modality and criteria for assessing posterior cervical fusions and insufficient evidence to support a single time point after surgery as being optimal for determining fusion, although some evidence suggests that reliability of radiography and CT improves with increasing time postoperatively. We recommend using less than 1-mm motion as the initial modality for determining anterior cervical arthrodesis for both clinical and research applications.
If further imaging is needed because of indeterminate radiographical evaluation, we recommend CT, which has relatively high accuracy and reliability, but due to greater radiation exposure and cost, it is not routinely suggested. We recommend that plain radiographs also be the initial method of determining posterior cervical fusion but suggest a lower threshold for obtaining CT scans because dynamic radiographs may not be as useful if spinous processes have been removed by laminectomy. 1.
Remote assessment of acne: the use of acne grading tools to evaluate digital skin images.

PubMed

Bergman, Hagit; Tsai, Kenneth Y; Seo, Su-Jean; Kvedar, Joseph C; Watson, Alice J

2009-06-01

Digital imaging of dermatology patients is a novel approach to remote data collection. A number of assessment tools have been developed to grade acne severity and to track clinical progress over time. Although these tools have been validated when used in a face-to-face setting, their efficacy and reliability when used to assess digital images have not been examined. The main purpose of this study was to determine whether specific assessment tools designed to grade acne during face-to-face visits can be applied to the evaluation of digital images. The secondary purpose was to ascertain whether images obtained by subjects are of adequate quality to allow such assessments to be made. Three hundred (300) digital images of patients with mild to moderate facial inflammatory acne from an ongoing randomized-controlled study were included in this analysis. These images were obtained from 20 patients and consisted of sets of 3 images taken over time. Of these images, 120 images were captured by subjects themselves and 180 were taken by study staff. Subjects were asked to retake their photographs if the initial images were deemed of poor quality by study staff. Images were evaluated by two dermatologists-in-training using validated acne assessment measures: Total Inflammatory Lesion Count, Leeds technique, and the Investigator's Global Assessment. Reliability of raters was evaluated using correlation coefficients and kappa statistics. Of the different acne assessment measures tested, the inter-rater reliability was highest for the total inflammatory lesion count (r = 0.871), but low for the Leeds technique (kappa = 0.381) and global assessment (kappa = 0.3119). Raters were able to evaluate over 89% of all images using each type of acne assessment measure despite the fact that images obtained by study staff were of higher quality than those obtained by patients (p < 0.001). 
Several existing clinical assessment measures can be used to evaluate digital images obtained from subjects with inflammatory acne lesions. The level of inter-rater agreement is highly variable across assessment measures, and we found the Total Inflammatory Lesion Count to be the most reliable. This measure could be used to allow a dermatologist to remotely track a patient's progress over time.
A Review on VSC-HVDC Reliability Modeling and Evaluation Techniques

NASA Astrophysics Data System (ADS)

Shen, L.; Tang, Q.; Li, T.; Wang, Y.; Song, F.

2017-05-01

With the fast development of power electronics, voltage-source converter (VSC) HVDC technology presents cost-effective ways for bulk power transmission. An increasing number of VSC-HVDC projects have been installed worldwide. Their reliability affects the profitability of the system and therefore has a major impact on the potential investors. In this paper, an overview of the recent advances in the area of reliability evaluation for VSC-HVDC systems is provided. Taking into account the latest multi-level converter topology, the VSC-HVDC system is categorized into several sub-systems and the reliability data for the key components is discussed based on sources with academic and industrial backgrounds. The development of reliability evaluation methodologies is reviewed and the issues surrounding the different computation approaches are briefly analysed. A general VSC-HVDC reliability evaluation procedure is illustrated in this paper.
Development of a multilocus-based approach for sponge (phylum Porifera) identification: refinement and limitations.

PubMed

Yang, Qi; Franco, Christopher M M; Sorokin, Shirley J; Zhang, Wei

2017-02-02

For sponges (phylum Porifera), there is no reliable molecular protocol available for species identification. To address this gap, we developed a multilocus-based Sponge Identification Protocol (SIP) validated by a sample of 37 sponge species belonging to 10 orders from South Australia. The universal barcode COI mtDNA, 28S rRNA gene (D3-D5), and the nuclear ITS1-5.8S-ITS2 region were evaluated for their suitability and capacity for sponge identification. The highest Bit Score was applied to infer the identity. The reliability of SIP was validated by phylogenetic analysis. The 28S rRNA gene and COI mtDNA performed better than the ITS region in classifying sponges at various taxonomic levels. A major limitation is that the databases are not well populated and possess low diversity, making it difficult to conduct the molecular identification protocol. The identification is also impacted by the accuracy of the morphological classification of the sponges whose sequences have been submitted to the database. Re-examination of the morphological identification further demonstrated and improved the reliability of sponge identification by SIP. Integrated with morphological identification, the multilocus-based SIP offers an improved protocol for more reliable and effective sponge identification, by coupling the accuracy of different DNA markers.
Development of a multilocus-based approach for sponge (phylum Porifera) identification: refinement and limitations

PubMed Central

Yang, Qi; Franco, Christopher M. M.; Sorokin, Shirley J.; Zhang, Wei

2017-01-01

For sponges (phylum Porifera), there is no reliable molecular protocol available for species identification. To address this gap, we developed a multilocus-based Sponge Identification Protocol (SIP) validated by a sample of 37 sponge species belonging to 10 orders from South Australia. The universal barcode COI mtDNA, 28S rRNA gene (D3–D5), and the nuclear ITS1-5.8S-ITS2 region were evaluated for their suitability and capacity for sponge identification. The highest Bit Score was applied to infer the identity. The reliability of SIP was validated by phylogenetic analysis. The 28S rRNA gene and COI mtDNA performed better than the ITS region in classifying sponges at various taxonomic levels. A major limitation is that the databases are not well populated and possess low diversity, making it difficult to conduct the molecular identification protocol. The identification is also impacted by the accuracy of the morphological classification of the sponges whose sequences have been submitted to the database. Re-examination of the morphological identification further demonstrated and improved the reliability of sponge identification by SIP. Integrated with morphological identification, the multilocus-based SIP offers an improved protocol for more reliable and effective sponge identification, by coupling the accuracy of different DNA markers. PMID:28150727
[Assessment of psychometric properties of the academic involvement questionnaire, expectations version].

PubMed

Pérez V, Cristhian; Ortiz M, Liliana; Fasce H, Eduardo; Parra P, Paula; Matus B, Olga; McColl C, Peter; Torres A, Graciela; Meyer K, Andrea; Márquez U, Carolina; Ortega B, Javiera

2015-11-01

The Academic Involvement Questionnaire, Expectations version (CIA-A), assesses the expectations of involvement in studies. It is a relevant predictor of student success. However, the evidence of its validity and reliability in Chile is scarce, and in the case of medical students, there is no evidence at all. The aim was to evaluate the factorial structure and internal consistency of the CIA-A in Chilean medical school freshmen. The survey was applied to 340 medical freshmen, chosen by non-probability quota sampling. They answered a version of the CIA-A back-translated from Portuguese to Spanish, plus a sociodemographic questionnaire. For psychometric analysis of the CIA-A, an exploratory factor analysis was carried out, the reliability of the factors was calculated, a descriptive analysis was conducted and their correlation was assessed. Five factors were identified: vocational, institutional and social involvement, use of resources and student participation. Their reliabilities, measured by Cronbach's alpha, ranged from 0.71 to 0.87. The factors also showed statistically significant correlations with each other. The identified factor structure is theoretically consistent with the structure of the original version, differing in only one factor. In addition, the factors' internal consistency was adequate for using them in research. This supports the construct validity and reliability of the CIA-A to assess involvement expectations in medical school freshmen.
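The internal-consistency figures in the record above are Cronbach's alpha values. As a minimal sketch of how such a coefficient is computed, assuming item scores are available as one list per item (the function name and the toy data are illustrative and are not taken from the study):

```python
from statistics import variance


def cronbach_alpha(items):
    """Cronbach's alpha for a list of item columns.

    `items` is a list of k lists; each inner list holds one item's scores,
    with one entry per respondent (same respondent order in every list).
    """
    k = len(items)
    if k < 2:
        raise ValueError("need at least two items")
    n = len(items[0])
    if any(len(col) != n for col in items):
        raise ValueError("all items must have the same number of respondents")
    # Total score of each respondent across all items.
    totals = [sum(col[i] for col in items) for i in range(n)]
    item_variance_sum = sum(variance(col) for col in items)
    total_variance = variance(totals)
    return k / (k - 1) * (1 - item_variance_sum / total_variance)


# Hypothetical data: two items that rank four respondents identically.
print(cronbach_alpha([[1, 2, 3, 4], [1, 2, 3, 4]]))
```

Alpha approaches 1 when items vary together and falls toward 0 when item scores are unrelated, which is why values such as 0.71 to 0.87 are usually read as adequate for research use.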
Using a fuzzy comprehensive evaluation method to determine product usability: A test case

PubMed Central

Zhou, Ronggang; Chan, Alan H. S.

2016-01-01

BACKGROUND: In order to take into account the inherent uncertainties during product usability evaluation, Zhou and Chan [1] proposed a comprehensive method of usability evaluation for products by combining the analytic hierarchy process (AHP) and fuzzy evaluation methods for synthesizing performance data and subjective response data. This method was designed to provide an integrated framework combining the inevitable vague judgments from the multiple stages of the product evaluation process. OBJECTIVE AND METHODS: In order to illustrate the effectiveness of the model, this study used a summative usability test case to assess the application and strength of the general fuzzy usability framework. To test the proposed fuzzy usability evaluation framework [1], a standard summative usability test was conducted to benchmark the overall usability of a specific network management software. Based on the test data, the fuzzy method was applied to incorporate both the usability scores and uncertainties involved in the multiple components of the evaluation. Then, with Monte Carlo simulation procedures, confidence intervals were used to compare the reliabilities among the fuzzy approach and two typical conventional methods combining metrics based on percentages. RESULTS AND CONCLUSIONS: This case study showed that the fuzzy evaluation technique can be applied successfully for combining summative usability testing data to achieve an overall usability quality for the network software evaluated. Greater differences of confidence interval widths between the method of averaging equally percentage and weighted evaluation method, including the method of weighted percentage averages, verified the strength of the fuzzy method. PMID:28035942
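The core step of a fuzzy comprehensive evaluation of the kind described in the record above can be sketched as a weighted combination of membership degrees. This is a minimal illustration assuming the weighted-average operator, criterion weights that sum to 1 (for example from an AHP step), and a made-up membership matrix; the operator and data the authors actually used are not given in the abstract.

```python
def fuzzy_comprehensive_evaluation(weights, membership):
    """Combine criterion weights with a fuzzy membership matrix.

    weights    : one weight per criterion, summing to 1 (assumed to come
                 from a prior weighting step such as AHP).
    membership : one row per criterion; each row gives the degree to which
                 that criterion belongs to each rating grade.
    Returns the overall membership degree for each grade.
    """
    if abs(sum(weights) - 1.0) > 1e-9:
        raise ValueError("weights must sum to 1")
    if len(weights) != len(membership):
        raise ValueError("one membership row is needed per criterion")
    n_grades = len(membership[0])
    return [
        sum(w * row[j] for w, row in zip(weights, membership))
        for j in range(n_grades)
    ]


# Hypothetical example: three usability criteria, grades (good, fair, poor).
weights = [0.5, 0.3, 0.2]
membership = [
    [0.7, 0.2, 0.1],
    [0.2, 0.5, 0.3],
    [0.1, 0.4, 0.5],
]
overall = fuzzy_comprehensive_evaluation(weights, membership)
print(overall)  # approximately [0.43, 0.33, 0.24]
```

Under the maximum-membership rule, the overall rating in this toy example would be the first grade, since it has the largest combined degree.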
Using a fuzzy comprehensive evaluation method to determine product usability: A test case.

PubMed

Zhou, Ronggang; Chan, Alan H S

2017-01-01

In order to take into account the inherent uncertainties during product usability evaluation, Zhou and Chan [1] proposed a comprehensive method of usability evaluation for products by combining the analytic hierarchy process (AHP) and fuzzy evaluation methods for synthesizing performance data and subjective response data. This method was designed to provide an integrated framework combining the inevitable vague judgments from the multiple stages of the product evaluation process. In order to illustrate the effectiveness of the model, this study used a summative usability test case to assess the application and strength of the general fuzzy usability framework. To test the proposed fuzzy usability evaluation framework [1], a standard summative usability test was conducted to benchmark the overall usability of a specific network management software. Based on the test data, the fuzzy method was applied to incorporate both the usability scores and uncertainties involved in the multiple components of the evaluation. Then, with Monte Carlo simulation procedures, confidence intervals were used to compare the reliabilities among the fuzzy approach and two typical conventional methods combining metrics based on percentages. This case study showed that the fuzzy evaluation technique can be applied successfully for combining summative usability testing data to achieve an overall usability quality for the network software evaluated. Greater differences of confidence interval widths between the method of averaging equally percentage and weighted evaluation method, including the method of weighted percentage averages, verified the strength of the fuzzy method.
Dynamic MRI to quantify musculoskeletal motion: A systematic review of concurrent validity and reliability, and perspectives for evaluation of musculoskeletal disorders.

PubMed

Borotikar, Bhushan; Lempereur, Mathieu; Lelievre, Mathieu; Burdin, Valérie; Ben Salem, Douraied; Brochard, Sylvain

2017-01-01

To report evidence for the concurrent validity and reliability of dynamic MRI techniques to evaluate in vivo joint and muscle mechanics, and to propose recommendations for their use in the assessment of normal and impaired musculoskeletal function. The search was conducted on articles published in Web of Science, PubMed, Scopus, Academic Search Premier, and Cochrane Library between 1990 and August 2017. Studies that reported the concurrent validity and/or reliability of dynamic MRI techniques for in vivo evaluation of joint or muscle mechanics were included after assessment by two independent reviewers. Selected articles were assessed using an adapted quality assessment tool and a data extraction process. Results for concurrent validity and reliability were categorized as poor, moderate, or excellent. Twenty articles fulfilled the inclusion criteria with a mean quality assessment score of 66% (±10.4%). Concurrent validity and/or reliability of eight dynamic MRI techniques were reported, with the knee being the most evaluated joint (seven studies). Moderate to excellent concurrent validity and reliability were reported for seven out of eight dynamic MRI techniques. Cine phase contrast and real-time MRI appeared to be the most valid and reliable techniques to evaluate joint motion, and spin tag for muscle motion. Dynamic MRI techniques are promising for the in vivo evaluation of musculoskeletal mechanics; however, results should be evaluated with caution since validity and reliability have not been determined for all joints and muscles, nor for many pathological conditions.
Stochastic Methods for Aircraft Design

NASA Technical Reports Server (NTRS)

Pelz, Richard B.; Ogot, Madara

1998-01-01

The global stochastic optimization method, simulated annealing (SA), was adapted and applied to various problems in aircraft design. The research was aimed at overcoming the problem of finding an optimal design in a space with multiple minima and roughness ubiquitous to numerically generated nonlinear objective functions. SA was modified to reduce the number of objective function evaluations for an optimal design, historically the main criticism of stochastic methods. SA was applied to many CFD/MDO problems including: low sonic-boom bodies, minimum drag on supersonic fore-bodies, minimum drag on supersonic aeroelastic fore-bodies, minimum drag on HSCT aeroelastic wings, FLOPS preliminary design code, another preliminary aircraft design study with vortex lattice aerodynamics, HSR complete aircraft aerodynamics. In every case, SA provided a simple, robust and reliable optimization method which found optimal designs on the order of 100 objective function evaluations. Perhaps most importantly, from this academic/industrial project, technology has been successfully transferred; this method is the method of choice for optimization problems at Northrop Grumman.
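For readers unfamiliar with simulated annealing, the following is a minimal textbook-style sketch on a one-dimensional function with several local minima. It is not the modified SA described in the record above: the Gaussian step, geometric cooling schedule, parameter values, and the `objective` function are all assumptions chosen for illustration.

```python
import math
import random


def simulated_annealing(f, x0, step=0.5, t0=1.0, cooling=0.995, iters=2000, seed=0):
    """Minimise f starting from x0 with a basic simulated annealing loop."""
    rng = random.Random(seed)
    x, fx = x0, f(x0)
    best_x, best_f = x, fx
    t = t0
    for _ in range(iters):
        candidate = x + rng.gauss(0.0, step)
        fc = f(candidate)
        # Always accept improvements; accept worse moves with a probability
        # that shrinks as the temperature t is lowered.
        if fc <= fx or rng.random() < math.exp((fx - fc) / t):
            x, fx = candidate, fc
            if fx < best_f:
                best_x, best_f = x, fx
        t *= cooling
    return best_x, best_f


def objective(x):
    """Hypothetical rough objective with multiple local minima."""
    return x * x + 10.0 * math.sin(3.0 * x)


best_x, best_f = simulated_annealing(objective, x0=4.0)
print(best_x, best_f)
```

The occasional acceptance of worse candidates is what lets the search escape local minima; this plain version uses far more function evaluations than the roughly 100 reported for the adapted method.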
Short Personality and Life Event scale for detection of suicide attempters.

PubMed

Artieda-Urrutia, Paula; Delgado-Gómez, David; Ruiz-Hernández, Diego; García-Vega, Juan Manuel; Berenguer, Nuria; Oquendo, Maria A; Blasco-Fontecilla, Hilario

2015-01-01

To develop a brief and reliable psychometric scale to identify individuals at risk for suicidal behaviour. Case-control study. 182 individuals (61 suicide attempters, 57 psychiatric controls, and 64 psychiatrically healthy controls) aged 18 or older, admitted to the Emergency Department at Puerta de Hierro University Hospital in Madrid, Spain. All participants completed a form including their socio-demographic and clinical characteristics, and the Personality and Life Events scale (27 items). To assess Axis I diagnoses, all psychiatric patients (including suicide attempters) were administered the Mini International Neuropsychiatric Interview. Descriptive statistics were computed for the socio-demographic factors. Additionally, χ² independence tests were applied to evaluate differences in socio-demographic and clinical variables, and the Personality and Life Events scale between groups. A stepwise linear regression with backward variable selection was conducted to build the Short Personality Life Event (S-PLE) scale. In order to evaluate the accuracy, a ROC analysis was conducted. The internal reliability was assessed using Cronbach's α, and the external reliability was evaluated using a test-retest procedure. The S-PLE scale, composed of just 6 items, showed good performance in discriminating between medical controls, psychiatric controls and suicide attempters in an independent sample. For instance, the S-PLE scale discriminated between past suicide and past non-suicide attempters with sensitivity of 80% and specificity of 75%. The area under the ROC curve was 88%. A factor analysis extracted only one factor, revealing a single dimension of the S-PLE scale. Furthermore, the S-PLE scale provides values of internal and external reliability between poor (test-retest: 0.55) and acceptable (Cronbach's α: 0.65) ranges. Administration time is about one minute.
The S-PLE scale is a useful and accurate instrument for estimating the risk of suicidal behaviour in settings where time is scarce. Copyright © 2015 SEP y SEPB. Published by Elsevier España. All rights reserved.
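The accuracy figures in the record above (sensitivity, specificity, area under the ROC curve) can be illustrated with a short sketch. The scores, labels, and threshold below are invented, and the AUC is computed by the pairwise-comparison (Mann-Whitney) formulation, which may differ from the software the authors used.

```python
def sensitivity_specificity(scores, labels, threshold):
    """Sensitivity and specificity when score >= threshold is called positive."""
    tp = sum(1 for s, y in zip(scores, labels) if y == 1 and s >= threshold)
    fn = sum(1 for s, y in zip(scores, labels) if y == 1 and s < threshold)
    tn = sum(1 for s, y in zip(scores, labels) if y == 0 and s < threshold)
    fp = sum(1 for s, y in zip(scores, labels) if y == 0 and s >= threshold)
    return tp / (tp + fn), tn / (tn + fp)


def roc_auc(scores, labels):
    """Probability that a random positive case outscores a random negative one."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum((p > q) + 0.5 * (p == q) for p in pos for q in neg)
    return wins / (len(pos) * len(neg))


# Hypothetical scale scores: label 1 = case, label 0 = control.
scores = [0.9, 0.8, 0.7, 0.4, 0.6, 0.3, 0.2, 0.1]
labels = [1, 1, 1, 1, 0, 0, 0, 0]
print(sensitivity_specificity(scores, labels, threshold=0.5))  # (0.75, 0.75)
print(roc_auc(scores, labels))  # 0.9375
```

Sensitivity and specificity depend on the chosen cut-off, whereas the AUC summarises discrimination across all possible cut-offs.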
German version, inter- and intrarater reliability and internal consistency of the "Agitated Behavior Scale" (ABS-G) in patients with moderate to severe traumatic brain injury.

PubMed

Hellweg, Stephanie; Schuster-Amft, Corina

2016-07-19

Agitation is frequently observed during early recovery after traumatic brain injury (TBI). Agitated behaviour often interferes with a goal-orientated rehabilitation and can be a substantial hindrance to therapy. Despite the relatively high occurrence of agitation in the TBI population, there is no objective assessment available in German (G). An existing scale with excellent psychometric properties is the "Agitated Behavior Scale (ABS)" developed by Corrigan in 1989. The aim of the study was to translate the Agitated Behavior Scale (ABS) into German (ABS-G) and investigate the inter- and intrarater reliability and internal consistency in patients with moderate to severe TBI. A formal nine-step translation and cross-cultural adaptation procedure (TCCA) was applied. Subsequently a prospective observational patient study was conducted. To examine the interrater reliability and internal consistency, two therapists rated 20 patients independently after a therapy session. This procedure was repeated twice on a weekly basis. The intrarater reliability was assessed through video recordings from three patients. Nine raters scored the demonstrated behaviour on the videotape with the ABS-G independently twice within one month. The inter- and intrarater reliability were evaluated with the Spearman rank correlation coefficient and the quadratic weighted kappa. The internal consistency was tested with Cronbach's alpha. Behaviour of 20 patients (18 males; mean age 41 ± 20.7; mean Functional Independence Measure (FIM) cognitive score on admission 7.1 ± 4.04; mean ABS-G score at first observation 17.3 ± 2.83) was assessed threefold. Interrater reliability yielded a correlation coefficient for ABS-G total score of all 60 paired observations of r_s = 0.845 and a weighted kappa of 0.738. Intrarater reliability for ABS-G total score ranged between r_s = 0.719 and 0.953 and showed a weighted kappa between 0.871 and 0.953. Cronbach's alpha indicated moderate internal consistency with 0.661.
This study demonstrates that the ABS-G is a reliable instrument for evaluating agitation in patients with moderate to severe TBI. Hereby it would be possible to monitor agitation objectively and optimise the management of agitated patients according to international recommendations.
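The quadratic weighted kappa named in the record above penalises disagreements by the squared distance between rating categories. A minimal sketch, assuming ordinal ratings drawn from a known ordered list of categories (the function name and the toy ratings are illustrative, not study data):

```python
def quadratic_weighted_kappa(rater_a, rater_b, categories):
    """Cohen's kappa with quadratic disagreement weights.

    rater_a, rater_b : equal-length lists of ratings for the same subjects.
    categories       : ordered list of all possible rating values.
    """
    k = len(categories)
    index = {c: i for i, c in enumerate(categories)}
    n = len(rater_a)
    # Observed contingency table of rating pairs.
    observed = [[0] * k for _ in range(k)]
    for a, b in zip(rater_a, rater_b):
        observed[index[a]][index[b]] += 1
    row_totals = [sum(observed[i]) for i in range(k)]
    col_totals = [sum(observed[i][j] for i in range(k)) for j in range(k)]
    weighted_observed = 0.0
    weighted_expected = 0.0
    for i in range(k):
        for j in range(k):
            weight = (i - j) ** 2 / (k - 1) ** 2
            weighted_observed += weight * observed[i][j]
            # Expected count under independence of the two raters.
            weighted_expected += weight * row_totals[i] * col_totals[j] / n
    if weighted_expected == 0:
        raise ValueError("kappa is undefined when all ratings fall in one category")
    return 1.0 - weighted_observed / weighted_expected


# Hypothetical ratings on a 1-3 ordinal scale.
print(quadratic_weighted_kappa([1, 2, 3, 4], [1, 2, 3, 4], [1, 2, 3, 4]))  # 1.0
print(quadratic_weighted_kappa([1, 2, 3], [3, 2, 1], [1, 2, 3]))           # -1.0
```

A value of 1 indicates perfect agreement, 0 indicates agreement no better than chance, and negative values indicate systematic disagreement.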
Performance Evaluation of Reliable Multicast Protocol for Checkout and Launch Control Systems

NASA Technical Reports Server (NTRS)

Shu, Wei Wennie; Porter, John

2000-01-01

The overall objective of this project is to study reliability and performance of Real Time Critical Network (RTCN) for checkout and launch control systems (CLCS). The major tasks include reliability and performance evaluation of Reliable Multicast (RM) package and fault tolerance analysis and design of dual redundant network architecture.
Students' Evaluation Strategies in a Web Research Task: Are They Sensitive to Relevance and Reliability?

ERIC Educational Resources Information Center

Rodicio, Héctor García

2015-01-01

When searching and using resources on the Web, students have to evaluate Web pages in terms of relevance and reliability. This evaluation can be done in a more or less systematic way, by either considering deep or superficial cues of relevance and reliability. The goal of this study was to examine how systematic students are when evaluating Web…
The Number of Feedbacks Needed for Reliable Evaluation. A Multilevel Analysis of the Reliability, Stability and Generalisability of Students' Evaluation of Teaching

ERIC Educational Resources Information Center

Rantanen, Pekka

2013-01-01

A multilevel analysis approach was used to analyse students' evaluation of teaching (SET). The low value of inter-rater reliability stresses that any solid conclusions on teaching cannot be made on the basis of single feedbacks. To assess a teacher's general teaching effectiveness, one needs to evaluate four randomly chosen course implementations.…
Comparison of veterinary health services expectations and perceptions between oncologic pet owners, non-oncologic pet owners and veterinary staff using the SERVQUAL methodology

PubMed Central

Gregório, Hugo; Santos, Patricia; Pires, Isabel; Prada, Justina; Queiroga, Felisbina Luísa

2016-01-01

Aim: Client satisfaction has gained great importance in health care as a measurement of service quality. One of the most popular methods to evaluate client satisfaction is the SERVQUAL inquiry, which measures service quality by evaluating client expectations and perceptions of a service in five dimensions: Tangibles, Empathy, Assurance, Reliability and Responsiveness. Materials and Methods: In order to evaluate whether owners of pets with cancer constitute a distinctive group from the general pet owner population and whether these differences were perceived by the hospital staff, we applied a SERVQUAL questionnaire to 51 owners of pets with cancer, 68 owners from the general pet population and 14 staff members. Results: Owners of oncologic pets had different expectations of an ideal service, granting importance to Assurance questions (6.75 vs 6.5, p = 0.045) while showing unmet needs in the Reliability and Empathy dimensions. Veterinarians failed to understand these specificities and overrated characteristics of the Tangibles dimension (6.75 vs 6.25, p = 0.027). Conclusion: Owners of pets with cancer seem to constitute a specific subpopulation with special needs, and veterinary staff should invest resources towards Assurance instead of privileging tangible aspects of veterinary services. By aligning professionals' expectations with those of pet owners, veterinarians can achieve better client satisfaction, improved compliance and stronger doctor-owner relationships. PMID:27956781
Comparison of veterinary health services expectations and perceptions between oncologic pet owners, non-oncologic pet owners and veterinary staff using the SERVQUAL methodology.

PubMed

Gregório, Hugo; Santos, Patricia; Pires, Isabel; Prada, Justina; Queiroga, Felisbina Luísa

2016-11-01

Client satisfaction has gained great importance in health care as a measurement of service quality. One of the most popular methods to evaluate client satisfaction is the SERVQUAL inquiry, which measures service quality by evaluating client expectations and perceptions of a service in five dimensions: Tangibles, Empathy, Assurance, Reliability and Responsiveness. In order to evaluate whether owners of pets with cancer constitute a distinctive group from the general pet owner population and whether these differences were perceived by the hospital staff, we applied a SERVQUAL questionnaire to 51 owners of pets with cancer, 68 owners from the general pet population and 14 staff members. Owners of oncologic pets had different expectations of an ideal service, granting importance to Assurance questions (6.75 vs 6.5, p = 0.045) while showing unmet needs in the Reliability and Empathy dimensions. Veterinarians failed to understand these specificities and overrated characteristics of the Tangibles dimension (6.75 vs 6.25, p = 0.027). Owners of pets with cancer seem to constitute a specific subpopulation with special needs, and veterinary staff should invest resources towards Assurance instead of privileging tangible aspects of veterinary services. By aligning professionals' expectations with those of pet owners, veterinarians can achieve better client satisfaction, improved compliance and stronger doctor-owner relationships.

An Investment Level Decision Method to Secure Long-term Reliability

NASA Astrophysics Data System (ADS)

Bamba, Satoshi; Yabe, Kuniaki; Seki, Tomomichi; Shibaya, Tetsuji

The slowdown in power demand growth and in facility replacement causes power facilities to age and become less reliable. This aging will be followed by a rapid increase in repairs and replacements when many facilities reach the end of their lifetime. This paper describes a method to estimate future repair and replacement costs by applying a life-cycle cost model and renewal theory to historical data. It also describes a method to decide the optimum investment plan, which replaces facilities in order of cost-effectiveness according to a replacement priority formula, and the minimum investment level needed to maintain reliability. Estimation examples applied to substation facilities show that a reasonable and leveled future cash-out can maintain reliability by lowering the percentage of replacements caused by fatal failures.
Assessing the Reliability of Student Evaluations of Teaching: Choosing the Right Coefficient

ERIC Educational Resources Information Center

Morley, Donald

2014-01-01

Many of the studies used to support the claim that student evaluations of teaching are reliable measures of teaching effectiveness have frequently calculated inappropriate reliability coefficients. This paper points to three coefficients that would be appropriate depending on if student evaluations were used for formative or summative purposes.…
Reliable and valid tools for measuring surgeons' teaching performance: residents' vs. self evaluation.

PubMed

Boerebach, Benjamin C M; Arah, Onyebuchi A; Busch, Olivier R C; Lombarts, Kiki M J M H

2012-01-01

In surgical education, there is a need for educational performance evaluation tools that yield reliable and valid data. This paper describes the development and validation of robust evaluation tools that provide surgeons with insight into their clinical teaching performance. We investigated (1) the reliability and validity of 2 tools for evaluating the teaching performance of attending surgeons in residency training programs, and (2) whether surgeons' self evaluation correlated with the residents' evaluation of those surgeons. We surveyed 343 surgeons and 320 residents as part of a multicenter prospective cohort study of faculty teaching performance in residency training programs. The reliability and validity of the SETQ (System for Evaluation Teaching Qualities) tools were studied using standard psychometric techniques. We then estimated the correlations between residents' and surgeons' evaluations. The response rate was 87% among surgeons and 84% among residents, yielding 2625 residents' evaluations and 302 self evaluations. The SETQ tools yielded reliable and valid data on 5 domains of surgical teaching performance, namely, learning climate, professional attitude towards residents, communication of goals, evaluation of residents, and feedback. The correlations between surgeons' self and residents' evaluations were low, with coefficients ranging from 0.03 for evaluation of residents to 0.18 for communication of goals. The SETQ tools for the evaluation of surgeons' teaching performance appear to yield reliable and valid data. The lack of strong correlations between surgeons' self and residents' evaluations suggest the need for using external feedback sources in informed self evaluation of surgeons. Copyright © 2012 Association of Program Directors in Surgery. Published by Elsevier Inc. All rights reserved.
Scoring haemophilic arthropathy on X-rays: improving inter- and intra-observer reliability and agreement using a consensus atlas.

PubMed

Foppen, Wouter; van der Schaaf, Irene C; Beek, Frederik J A; Verkooijen, Helena M; Fischer, Kathelijn

2016-06-01

The radiological Pettersson score (PS) is widely applied for classification of arthropathy to evaluate costly haemophilia treatment. This study aims to assess and improve inter- and intra-observer reliability and agreement of the PS. Two series of X-rays (bilateral elbows, knees, and ankles) of 10 haemophilia patients (120 joints) with haemophilic arthropathy were scored by three observers according to the PS (maximum score 13/joint). Subsequently, (dis-)agreement in scoring was discussed until consensus. Example images were collected in an atlas. Thereafter, a second series of 120 joints was scored using the atlas. One observer rescored the second series after three months. Reliability was assessed by intraclass correlation coefficients (ICC), agreement by limits of agreement (LoA). Median Pettersson score at joint level (PSjoint) of affected joints was 6 (interquartile range 3-9). Using the consensus atlas, inter-observer reliability of the PSjoint improved significantly from 0.94 (95% confidence interval (CI) 0.91-0.96) to 0.97 (CI 0.96-0.98). LoA improved from ±1.7 to ±1.1 for the PSjoint. Therefore, true differences in arthropathy were differences in the PSjoint of >2 points. Intra-observer reliability of the PSjoint was 0.98 (CI 0.97-0.98), intra-observer LoA were ±0.9 points. Reliability and agreement of the PS improved by using a consensus atlas. • Reliability of the Pettersson score significantly improved using the consensus atlas. • The presented consensus atlas improved the agreement among observers. • The consensus atlas could be recommended to obtain a reproducible Pettersson score.
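The limits of agreement (LoA) quoted in the record above are commonly computed with the Bland-Altman approach: the mean difference between two sets of scores plus or minus 1.96 standard deviations of the differences. The sketch below assumes that convention and uses invented joint scores; the abstract does not state the exact procedure used for three observers.

```python
from statistics import mean, stdev


def limits_of_agreement(scores_a, scores_b, z=1.96):
    """Bland-Altman limits of agreement for paired measurements.

    Returns (lower, upper), i.e. bias -/+ z * SD of the paired differences.
    """
    if len(scores_a) != len(scores_b):
        raise ValueError("paired scores must have equal length")
    differences = [a - b for a, b in zip(scores_a, scores_b)]
    bias = mean(differences)
    half_width = z * stdev(differences)
    return bias - half_width, bias + half_width


# Hypothetical joint scores from two observers.
observer_1 = [5, 6, 7, 8]
observer_2 = [4, 6, 8, 8]
print(limits_of_agreement(observer_1, observer_2))
```

Unlike a correlation-type coefficient, the LoA are expressed in the units of the score itself, which is what allows a statement such as "differences of more than 2 points reflect true change".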
Reliability of lower limb alignment measures using an established landmark-based method with a customized computer software program

PubMed Central

Sled, Elizabeth A.; Sheehy, Lisa M.; Felson, David T.; Costigan, Patrick A.; Lam, Miu; Cooke, T. Derek V.

2010-01-01

The objective of the study was to evaluate the reliability of frontal plane lower limb alignment measures using a landmark-based method by (1) comparing inter- and intra-reader reliability between measurements of alignment obtained manually with those using a computer program, and (2) determining inter- and intra-reader reliability of computer-assisted alignment measures from full-limb radiographs. An established method for measuring alignment was used, involving selection of 10 femoral and tibial bone landmarks. 1) To compare manual and computer methods, we used digital images and matching paper copies of five alignment patterns simulating healthy and malaligned limbs drawn using AutoCAD. Seven readers were trained in each system. Paper copies were measured manually and repeat measurements were performed daily for 3 days, followed by a similar routine with the digital images using the computer. 2) To examine the reliability of computer-assisted measures from full-limb radiographs, 100 images (200 limbs) were selected as a random sample from 1,500 full-limb digital radiographs which were part of the Multicenter Osteoarthritis (MOST) Study. Three trained readers used the software program to measure alignment twice from the batch of 100 images, with two or more weeks between batch handling. Manual and computer measures of alignment showed excellent agreement (intraclass correlations [ICCs] 0.977 – 0.999 for computer analysis; 0.820 – 0.995 for manual measures). The computer program applied to full-limb radiographs produced alignment measurements with high inter- and intra-reader reliability (ICCs 0.839 – 0.998). In conclusion, alignment measures using a bone landmark-based approach and a computer program were highly reliable between multiple readers. PMID:19882339
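Several records in this listing report intraclass correlation coefficients (ICCs) without stating the form used. As one common choice, the sketch below computes the two-way random-effects, absolute-agreement, single-measure coefficient, often written ICC(2,1), from the mean squares of a subjects-by-raters table. The choice of form and the ratings are assumptions for illustration only.

```python
def icc_2_1(ratings):
    """ICC(2,1) for a table with one row per subject and one column per rater."""
    n = len(ratings)        # number of subjects
    k = len(ratings[0])     # number of raters
    grand_mean = sum(sum(row) for row in ratings) / (n * k)
    row_means = [sum(row) / k for row in ratings]
    col_means = [sum(row[j] for row in ratings) / n for j in range(k)]
    # Two-way ANOVA sums of squares without replication.
    ss_rows = k * sum((m - grand_mean) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand_mean) ** 2 for m in col_means)
    ss_total = sum((x - grand_mean) ** 2 for row in ratings for x in row)
    ss_error = ss_total - ss_rows - ss_cols
    ms_rows = ss_rows / (n - 1)
    ms_cols = ss_cols / (k - 1)
    ms_error = ss_error / ((n - 1) * (k - 1))
    return (ms_rows - ms_error) / (
        ms_rows + (k - 1) * ms_error + k * (ms_cols - ms_error) / n
    )


# Illustrative ratings: six subjects scored by four raters.
ratings = [
    [9, 2, 5, 8],
    [6, 1, 3, 2],
    [8, 4, 6, 8],
    [7, 1, 2, 6],
    [10, 5, 6, 9],
    [6, 2, 4, 7],
]
print(round(icc_2_1(ratings), 2))  # 0.29
```

In this example the raters rank subjects similarly but differ in overall level, so an absolute-agreement ICC is low; a consistency-type ICC on the same data would be considerably higher.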
MEASUREMENT: ACCOUNTING FOR RELIABILITY IN PERFORMANCE ESTIMATES.

PubMed

Waterman, Brian; Sutter, Robert; Burroughs, Thomas; Dunagan, W Claiborne

2014-01-01

When evaluating physician performance measures, physician leaders are faced with the quandary of determining whether departures from expected physician performance measurements represent a true signal or random error. This uncertainty impedes the physician leader's ability and confidence to take appropriate performance improvement actions based on physician performance measurements. Incorporating reliability adjustment into physician performance measurement is a valuable way of reducing the impact of random error in the measurements, such as those caused by small sample sizes. Consequently, the physician executive has more confidence that the results represent true performance and is positioned to make better physician performance improvement decisions. Applying reliability adjustment to physician-level performance data is relatively new. As others have noted previously, it's important to keep in mind that reliability adjustment adds significant complexity to the production, interpretation and utilization of results. Furthermore, the methods explored in this case study only scratch the surface of the range of available Bayesian methods that can be used for reliability adjustment; further study is needed to test and compare these methods in practice and to examine important extensions for handling specialty-specific concerns (e.g., average case volumes, which have been shown to be important in cardiac surgery outcomes). Moreover, it's important to note that the provider group average as a basis for shrinkage is one of several possible choices that could be employed in practice and deserves further exploration in future research. With these caveats, our results demonstrate that incorporating reliability adjustment into physician performance measurements is feasible and can notably reduce the incidence of "real" signals relative to what one would expect to see using more traditional approaches. 
A physician leader who is interested in catalyzing performance improvement through focused, effective physician performance improvement is well advised to consider the value of incorporating reliability adjustments into their performance measurement system.
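The reliability adjustment discussed in the record above shrinks each physician's observed rate toward the provider group average, with less shrinkage when the measurement is more reliable. A minimal sketch of one common empirical-Bayes form, assuming known between-physician and within-physician variances (the function names and all numbers are hypothetical; the study's actual Bayesian models are not detailed in the abstract):

```python
def measurement_reliability(n_cases, var_between, var_within):
    """Share of observed variance due to true differences (signal / (signal + noise))."""
    return var_between / (var_between + var_within / n_cases)


def reliability_adjusted(observed, n_cases, group_mean, var_between, var_within):
    """Shrink an observed performance rate toward the group mean."""
    r = measurement_reliability(n_cases, var_between, var_within)
    return r * observed + (1.0 - r) * group_mean


# Hypothetical physician: observed rate 0.90 on 10 cases, group average 0.70.
print(reliability_adjusted(0.90, 10, 0.70, var_between=0.01, var_within=0.2))
# Same observed rate on 1000 cases is shrunk far less.
print(reliability_adjusted(0.90, 1000, 0.70, var_between=0.01, var_within=0.2))
```

With only 10 cases the reliability is one third, so the adjusted estimate moves most of the way toward the group mean; this is how the adjustment reduces apparent signals that stem from small sample sizes.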
Automated Collection of Real-Time Alerts of Citizens as a Useful Tool to Continuously Monitor Malodorous Emissions.

PubMed

Brattoli, Magda; Mazzone, Antonio; Giua, Roberto; Assennato, Giorgio; de Gennaro, Gianluigi

2016-02-26

The evaluation of odor emissions and dispersion is a very arduous topic to face; the real-time monitoring of odor emissions, the identification of chemical components and, with proper certainty, the source of annoyance represent a challenge for stakeholders such as local authorities. The complaints of people, often not systematic and variously distributed, in general do not allow us to quantify the perceived annoyance. Experimental research has been performed to detect and evaluate olfactory annoyance, based on field testing of an innovative monitoring methodology grounded in automatic recording of citizen alerts. It has been applied in Taranto, in the south of Italy, where a relevant industrial area is located, by using Odortel® for automated collection of citizen alerts. To evaluate its reliability, the collection system has been integrated with automated samplers, able to sample odorous air in real time, according to the citizen alerts of annoyance and, moreover, with meteorological data (especially the wind direction) and trends in odor marker compounds, recorded by air quality monitoring stations. The results have allowed us, for the first time, to manage annoyance complaints, test their reliability, and obtain information about the distribution and extent of the odor phenomena, such that we were able to identify, with supporting evidence, the source as an oil refinery plant.
Student assessment by objective structured examination in a neurology clerkship

PubMed Central

Adesoye, Taiwo; Smith, Sandy; Blood, Angela; Brorson, James R.

2012-01-01

Objectives: We evaluated the reliability and predictive ability of an objective structured clinical examination (OSCE) in the assessment of medical students at the completion of a neurology clerkship. Methods: We analyzed data from 195 third-year medical students who took the OSCE. For each student, the OSCE consisted of 2 standardized patient encounters. The scores obtained from each encounter were compared. Faculty clinical evaluations of each student for 2 clinical inpatient rotations were also compared. Hierarchical regression analysis was applied to test the ability of the averaged OSCE scores to predict standardized written examination scores and composite clinical scores. Results: Students' OSCE scores from the 2 standardized patient encounters were significantly correlated with each other (r = 0.347, p < 0.001), and the scores for all students were normally distributed. In contrast, students' faculty clinical evaluation scores from 2 different clinical inpatient rotations were uncorrelated, and scores were skewed toward the highest ratings. After accounting for clerkship order, better OSCE scores were predictive of better National Board of Medical Examiners standardized examination scores (ΔR² = 0.131, p < 0.001) and of better faculty clinical scores (ΔR² = 0.078, p < 0.001). Conclusions: Student assessment by an OSCE provides a reliable and predictive objective assessment of clinical performance in a neurology clerkship. PMID:22855865
A Numerical Round Robin for the Reliability Prediction of Structural Ceramics

NASA Technical Reports Server (NTRS)

Powers, Lynn M.; Janosik, Lesley A.

1993-01-01

A round robin has been conducted on integrated fast fracture design programs for brittle materials. An informal working group (WELFEP: WEakest Link failure probability prediction by Finite Element Postprocessors) was formed to discuss and evaluate the implementation of the programs examined in the study. Results from the study have provided insight on the differences between the various programs examined. Conclusions from the study have shown that when brittle materials are used in design, analysts must understand how to apply the concepts presented herein to failure probability analysis.
Molybdenum protective coatings adhesion to steel substrate

NASA Astrophysics Data System (ADS)

Blesman, A. I.; Postnikov, D. V.; Polonyankin, D. A.; Teplouhov, A. A.; Tyukin, A. V.; Tkachenko, E. A.

2017-06-01

Protecting critical parts, components and assemblies from corrosion is an urgent problem in engineering and many other industries. Forming protective coatings on the surface of metal products is a promising way of preventing corrosion. The adhesion force is one of the main characteristics of a coating's durability. The paper presents a theoretical and experimental assessment of the adhesion force for coatings formed by magnetron sputtering of molybdenum onto a steel substrate. The validity and reliability of the results obtained by simulation and by the sclerometry method allow the developed model to be applied to adhesion force evaluation in binary «steel-coating» systems.
Analysis of polonium-210 in food products and bioassay samples by isotope-dilution alpha spectrometry.

PubMed

Lin, Zhichao; Wu, Zhongyu

2009-05-01

A rapid and reliable radiochemical method coupled with a simple and compact plating apparatus was developed, validated, and applied for the analysis of (210)Po in a variety of food products and bioassay samples. The method performance characteristics, including accuracy, precision, robustness, and specificity, were evaluated along with a detailed measurement uncertainty analysis. With high Po recovery, improved energy resolution, and effective removal of interfering elements by chromatographic extraction, the overall method accuracy was determined to be better than 5% with measurement precision of 10%, at 95% confidence level.
Working papers: applicability of Box Jenkins techniques to gasoline consumption forecasting

DOE Office of Scientific and Technical Information (OSTI.GOV)

Not Available

Reliable consumption forecasts are needed; however, traditional linear time-series techniques don't adequately account for an environment so subject to change. This report evaluates the use of Box Jenkins techniques for gasoline consumption forecasting. Box Jenkins methods were applied to data obtained from the Colorado Petroleum Association and the Colorado Highway Users Fund to "predict" 1978 and 1979 consumption. These results show the Box Jenkins techniques to be quite effective. Forecasts for 1980-81 are included along with suggestions for continuous use of the technique to monitor consumption.
Overcoming the Challenges of Unstructured Data in Multi-site, Electronic Medical Record-based Abstraction

PubMed Central

Polnaszek, Brock; Gilmore-Bykovskyi, Andrea; Hovanes, Melissa; Roiland, Rachel; Ferguson, Patrick; Brown, Roger; Kind, Amy JH

2014-01-01

Background Unstructured data encountered during retrospective electronic medical record (EMR) abstraction has routinely been identified as challenging to reliably abstract, as this data is often recorded as free text, without limitations to format or structure. There is increased interest in reliably abstracting this type of data given its prominent role in care coordination and communication, yet limited methodological guidance exists. Objective As standard abstraction approaches resulted in sub-standard data reliability for unstructured data elements collected as part of a multi-site, retrospective EMR study of hospital discharge communication quality, our goal was to develop, apply and examine the utility of a phase-based approach to reliably abstract unstructured data. This approach is examined using the specific example of discharge communication for warfarin management. Research Design We adopted a “fit-for-use” framework to guide the development and evaluation of abstraction methods using a four step, phase-based approach including (1) team building, (2) identification of challenges, (3) adaptation of abstraction methods, and (4) systematic data quality monitoring. Measures Unstructured data elements were the focus of this study, including elements communicating steps in warfarin management (e.g., warfarin initiation) and medical follow-up (e.g., timeframe for follow-up). Results After implementation of the phase-based approach, inter-rater reliability for all unstructured data elements demonstrated kappas of ≥ 0.89 -- an average increase of + 0.25 for each unstructured data element. Conclusions As compared to standard abstraction methodologies, this phase-based approach was more time intensive, but did markedly increase abstraction reliability for unstructured data elements within multi-site EMR documentation. PMID:27624585
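The inter-rater reliability figures reported above are kappa statistics. As a minimal sketch of how such a value is obtained for two abstractors coding the same records, the following computes Cohen's kappa from two lists of categorical codes. The function name and the example codes are hypothetical and are not taken from the study.

```python
from collections import Counter


def cohens_kappa(rater_a, rater_b):
    """Cohen's kappa for two raters coding the same items (nominal codes)."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("both raters must code the same non-empty set of items")
    n = len(rater_a)
    # Observed proportion of items on which the two raters agree.
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Agreement expected by chance, from each rater's marginal code frequencies.
    counts_a = Counter(rater_a)
    counts_b = Counter(rater_b)
    expected = sum(counts_a[c] * counts_b[c] for c in counts_a) / (n * n)
    if expected == 1.0:
        # Both raters used a single identical code; agreement is trivially perfect.
        return 1.0
    return (observed - expected) / (1.0 - expected)


if __name__ == "__main__":
    # Hypothetical codes: was warfarin follow-up documented ("yes"/"no")?
    a = ["yes", "yes", "no", "no"]
    b = ["yes", "yes", "no", "yes"]
    print(cohens_kappa(a, b))
```

Kappa corrects raw percent agreement for agreement expected by chance, which is why it is usually preferred over simple agreement rates when some codes are much more common than others.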
Reliability of smartphone-based gait measurements for quantification of physical activity/inactivity levels.

PubMed

Ebara, Takeshi; Azuma, Ryohei; Shoji, Naoto; Matsukawa, Tsuyoshi; Yamada, Yasuyuki; Akiyama, Tomohiro; Kurihara, Takahiro; Yamada, Shota

2017-11-25

Objective measurements using built-in smartphone sensors that can measure physical activity/inactivity in daily working life have the potential to provide a new approach to assessing workers' health effects. The aim of this study was to elucidate the characteristics and reliability of built-in step counting sensors on smartphones for development of an easy-to-use objective measurement tool that can be applied in ergonomics or epidemiological research. To evaluate the reliability of step counting sensors embedded in seven major smartphone models, the 6-minute walk test was conducted and the following analyses of sensor precision and accuracy were performed: 1) relationship between actual step count and step count detected by sensors, 2) reliability between smartphones of the same model, and 3) false detection rates when sitting during office work, while riding the subway, and driving. On five of the seven models, the intraclass correlation coefficient (ICC(3,1)) showed high reliability with a range of 0.956-0.993. The other two models, however, had ranges of 0.443-0.504 and the relative error ratios of the sensor-detected step count to the actual step count were ±48.7%-49.4%. The level of agreement between the same models was ICC(3,1): 0.992-0.998. The false detection rates differed between the sitting conditions. These results suggest the need for appropriate regulation of step counts measured by sensors, through means such as correction or calibration with a predictive model formula, in order to obtain the highly reliable measurement results that are sought in scientific investigation.
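The abstract above reports reliability as ICC(3,1), the two-way mixed-effects, consistency, single-measurement form of the intraclass correlation. The sketch below shows one way to compute it from a subjects-by-raters table using the standard ANOVA mean squares; the function name and data layout are assumptions for illustration, and this is not the authors' own analysis code.

```python
def icc_3_1(table):
    """ICC(3,1) for a table with one row per subject and one column per rater/device."""
    n = len(table)      # number of subjects (e.g. walk trials)
    k = len(table[0])   # number of raters (e.g. sensor count vs. actual count)
    if n < 2 or k < 2 or any(len(row) != k for row in table):
        raise ValueError("need a rectangular table with at least 2 rows and 2 columns")
    grand = sum(sum(row) for row in table) / (n * k)
    row_means = [sum(row) / k for row in table]
    col_means = [sum(row[j] for row in table) / n for j in range(k)]
    # Two-way ANOVA sums of squares.
    ss_total = sum((x - grand) ** 2 for row in table for x in row)
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_error = ss_total - ss_rows - ss_cols
    ms_rows = ss_rows / (n - 1)                 # between-subjects mean square
    ms_error = ss_error / ((n - 1) * (k - 1))   # residual mean square
    return (ms_rows - ms_error) / (ms_rows + (k - 1) * ms_error)


if __name__ == "__main__":
    # Hypothetical step counts: [actual, sensor] for five walks.
    walks = [[610, 605], [655, 650], [590, 592], [700, 689], [640, 641]]
    print(round(icc_3_1(walks), 3))
```

Because ICC(3,1) measures consistency rather than absolute agreement, a sensor with a constant offset from the true count can still score close to 1, so it is usually read together with an error measure such as the relative error ratio quoted in the abstract.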
Inspection Score and Grading System for Food Services in Brazil: The Results of a Food Safety Strategy to Reduce the Risk of Foodborne Diseases during the 2014 FIFA World Cup.

PubMed

da Cunha, Diogo T; Saccol, Ana L de Freitas; Tondo, Eduardo C; de Oliveira, Ana B A; Ginani, Veronica C; Araújo, Carolina V; Lima, Thalita A S; de Castro, Angela K F; Stedefeldt, Elke

2016-01-01

In 2014, Brazil hosted one of the most popular sport competitions in the world, the FIFA World Cup. Concerned about the intense migration of tourists, the Brazilian government decided to deploy a food safety strategy based on inspection scores and a grading system applied to food services. The present study aimed to evaluate the results of the food safety strategy deployed during the 2014 FIFA World Cup in Brazil. To assess food safety, an evaluation instrument was applied twice in 1927 food service establishments from 26 cities before the start of the competition. This instrument generated a food safety score for each establishment that ranged from 0.0 (no flaws observed) to 2565.95, with four possible grades: A (0.0-13.2); B (13.3-502.6); C (502.7-1152.2); and pending (more than 1152.3). Each food service received a stamp with the grade of the second evaluation. After the end of the World Cup, a study was conducted with different groups of the public to evaluate the acceptance of the strategy. To this end, 221 consumers, 998 food service owners or managers, 150 health surveillance auditors, and 27 health surveillance coordinators were enrolled. These participants completed a survey with positive and negative responses about the inspection score system through a 5-point Likert scale. A reduction in violation scores from 393.1 to 224.4 (p < 0.001) was observed between the first and second evaluation cycles. Of the food services evaluated, 38.7% received the A stamp, 41.4% the B stamp, and 13.9% the C stamp. All positive responses on "system reliability" presented a mean of 4.0 or more, indicating that the public believed this strategy is reliable for communicating risks and promoting food safety. The strategy showed positive results regarding food safety and public acceptance. The deployed strategy promoted improvements in the food safety of food services. The implementation of a permanent policy may be well accepted by the public and may greatly contribute to a reduction in foodborne diseases (FBDs).
Transculturalization and validation of a Spanish translation of the specific lower limb osteoarthritis and quality of life questionnaire AMICAL: Arthrose des Membres Inférieurs et Qualité de vie AMIQUAL.

PubMed

Espinosa-Cuervo, Gisela; Guillermin, Francis; Rat, Anne-Christine; Duarte-Salazar, Carolina; Alemán-Hernández, Sylvia-I; Vergara-Álvarez, Yuriria; Goycochea-Robles, María-Victoria

2014-01-01

Several generic questionnaires have been used to measure quality of life in patients with osteoarthritis (OA), since few instruments have been developed specifically for OA and none was developed for Spanish-speaking patients. The purpose of the study was to validate and adapt to Spanish the French questionnaire AMICAL to measure quality of life in patients with hip and knee OA. Transversal, analytical study. The validation process was performed in phases: translation from French to Spanish, translated version analysis by a multidisciplinary expert team, application of a pilot test to patients to evaluate grammatical and content equivalence, blind back translation, and analysis. The questionnaire was applied to hip and knee OA patients, together with the SF-36 questionnaire, as well as the WOMAC and the Lequesne indexes. The reproducibility was evaluated by applying the questionnaire again after 72 hours. The clinimetric analysis was calculated with SPSS 16.0. One hundred patients with hip OA and 100 patients with knee OA, radiological stages II-III, were included to evaluate homogeneity. Sixty-five patients with hip OA and 65 patients with knee OA were included to evaluate consistency. The final sample thus included 100 hip OA and 100 knee OA patients to estimate homogeneity, and 65 patients were evaluated to estimate consistency. Mean (SD) age of patients with hip and knee OA was 56.34 ± 13 and 60.1 ± 9.2, respectively. Sixty-seven percent and 79.8% were female, respectively. Cronbach's alpha for AMICAL was 0.946 and 0.999 for hip OA and knee OA, respectively; and test-retest reliability using the intraclass correlation coefficient was 0.979 and 0.998, respectively. There was also a significant correlation with all the instruments (P<.05), except with the Lequesne index (r-0.383). The Spanish version of the AMICAL questionnaire keeps the clinimetric properties, homogeneity, and consistency, and has a good correlation with other instruments. Consequently, it is reliable, self-applicable, and includes domains beyond the functional capacity that better evaluate the quality of life. Copyright © 2013 Elsevier España, S.L. All rights reserved.
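The homogeneity figures in this abstract are Cronbach's alpha values. As a minimal sketch, assuming complete item-level responses with one row per patient and one column per questionnaire item, alpha can be computed as follows. The function name and the toy data are hypothetical; the study itself reports using SPSS 16.0.

```python
from statistics import variance


def cronbach_alpha(responses):
    """Cronbach's alpha for a list of respondents, each a list of k item scores."""
    if len(responses) < 2:
        raise ValueError("need at least two respondents")
    k = len(responses[0])
    if k < 2 or any(len(row) != k for row in responses):
        raise ValueError("need at least two items and complete rows")
    # Sample variance of each item across respondents.
    item_variances = [variance([row[i] for row in responses]) for i in range(k)]
    # Sample variance of each respondent's total score.
    total_variance = variance([sum(row) for row in responses])
    if total_variance == 0:
        raise ValueError("total scores have zero variance; alpha is undefined")
    return k / (k - 1) * (1 - sum(item_variances) / total_variance)


if __name__ == "__main__":
    # Hypothetical 3-item scores for four patients.
    data = [[3, 4, 3], [2, 2, 3], [5, 4, 5], [1, 2, 1]]
    print(round(cronbach_alpha(data), 3))
```

Values very close to 1, such as the 0.999 quoted for knee OA, indicate that items move almost in lockstep, which some authors read as a sign of item redundancy rather than purely as a strength.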
Degradation of ticarcillin by subcritical water oxidation method: Application of response surface methodology and artificial neural network modeling.

PubMed

Yabalak, Erdal

2018-05-18

This study was performed to investigate the mineralization of ticarcillin in an artificially prepared aqueous solution representing ticarcillin-contaminated waters, which constitute a serious problem for human health. 81.99% of total organic carbon removal, 79.65% of chemical oxygen demand removal, and 94.35% of ticarcillin removal were achieved by using the eco-friendly, time-saving, powerful and easy-to-apply subcritical water oxidation method in the presence of a safe-to-use oxidizing agent, hydrogen peroxide. Central composite design, which belongs to the response surface methodology, was applied to design the degradation experiments, to optimize the methods, and to evaluate the effects of the system variables, namely temperature, hydrogen peroxide concentration, and treatment time, on the responses. In addition, theoretical equations were proposed for each removal process. ANOVA tests were utilized to evaluate the reliability of the performed models. F values of 245.79, 88.74, and 48.22 were found for total organic carbon removal, chemical oxygen demand removal, and ticarcillin removal, respectively. Moreover, artificial neural network modeling was applied to estimate the response in each case, and its prediction and optimizing performance was statistically examined and compared to the performance of central composite design.
Cross-evaluation of ground-based, multi-satellite and reanalysis precipitation products: Applicability of the Triple Collocation method across Mainland China

NASA Astrophysics Data System (ADS)

Li, Changming; Tang, Guoqiang; Hong, Yang

2018-07-01

Evaluating the reliability of satellite and reanalysis precipitation products is critical but challenging over ungauged or poorly gauged regions. The Triple Collocation (TC) method is a reliable approach to estimate the accuracy of any three independent inputs in the absence of truth values. This study assesses the uncertainty of three types of independent precipitation products, i.e., satellite-based, ground-based and model reanalysis, over Mainland China using the TC method. The ground-based data set is Gauge Based Daily Precipitation Analysis (CGDPA). The reanalysis data set is European Reanalysis Agency Reanalysis Product (ERA-interim). The satellite-based products include five mainstream satellite products. The comparison and evaluation are conducted at 0.25° and daily resolutions from 2013 to 2015. First, the effectiveness of the TC method is evaluated in South China, which has a dense gauge network. The results demonstrate that the TC method is reliable because the correlation coefficient (CC) and root mean square error (RMSE) derived from TC are close to those derived from ground observations, with only 9% and 7% mean relative differences, respectively. Then, the TC method is applied in Mainland China, with special attention paid to the Tibetan Plateau (TP), known as the Earth's third pole, which has few ground stations. Results indicate that (1) the overall performance of IMERG is better than the other satellite products over Mainland China, followed by 3B42V7, CMORPH-CRT and PERSIANN-CDR; (2) in the TP, CGDPA shows the best overall performance over gauged grid cells; however, over ungauged regions, IMERG and ERA-interim slightly outperform CGDPA with similar RMSE but higher mean CC (0.63, 0.61, and 0.58, respectively). This highlights the strengths and potential of remote sensing and reanalysis data over the TP and reconfirms the drawback of CGDPA's inherent uncertainty due to interpolation from sparsely gauged data. The study concludes that the TC method provides not only reliable cross-validation results over Mainland China but also a new perspective for comparatively assessing multi-source precipitation products, particularly over poorly gauged regions such as the TP.
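To make the Triple Collocation idea concrete, the sketch below implements the widely used covariance-notation estimator for three collocated series whose errors are assumed to be mutually independent and independent of the truth. It returns, for each product, an error standard deviation (the TC-based RMSE) and a correlation coefficient with the unknown truth. The abstract does not state which TC variant or preprocessing the authors used, so the function names and the choice of estimator here are assumptions.

```python
def _cov(a, b):
    """Sample covariance of two equal-length sequences."""
    n = len(a)
    mean_a = sum(a) / n
    mean_b = sum(b) / n
    return sum((p - mean_a) * (q - mean_b) for p, q in zip(a, b)) / (n - 1)


def triple_collocation(x, y, z):
    """Return ((rmse_x, rmse_y, rmse_z), (cc_x, cc_y, cc_z)) for three products."""
    if not (len(x) == len(y) == len(z)) or len(x) < 3:
        raise ValueError("need three collocated series of equal length")
    cxx, cyy, czz = _cov(x, x), _cov(y, y), _cov(z, z)
    cxy, cxz, cyz = _cov(x, y), _cov(x, z), _cov(y, z)
    # Error variance of each product, using the other two as references.
    err_var = (
        cxx - cxy * cxz / cyz,
        cyy - cxy * cyz / cxz,
        czz - cxz * cyz / cxy,
    )
    # Squared correlation of each product with the unknown truth.
    rho_sq = (
        cxy * cxz / (cxx * cyz),
        cxy * cyz / (cyy * cxz),
        cxz * cyz / (czz * cxy),
    )
    # Sampling noise can push estimates slightly below zero; clip before the root.
    rmse = tuple(max(v, 0.0) ** 0.5 for v in err_var)
    cc = tuple(min(max(r, 0.0), 1.0) ** 0.5 for r in rho_sq)
    return rmse, cc
```

In use, `x`, `y` and `z` would be the gauge, satellite and reanalysis values at the same grid cell and days. The estimator degrades when the products share error sources, which is one reason independent data types are chosen as the triplet.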
Computer-Aided Reliability Estimation

NASA Technical Reports Server (NTRS)

Bavuso, S. J.; Stiffler, J. J.; Bryant, L. A.; Petersen, P. L.

1986-01-01

CARE III (Computer-Aided Reliability Estimation, Third Generation) helps estimate reliability of complex, redundant, fault-tolerant systems. Program specifically designed for evaluation of fault-tolerant avionics systems. However, CARE III general enough for use in evaluation of other systems as well.
Trypan blue/giemsa staining to assess sperm membrane integrity in salernitano stallions and its relationship to pregnancy rates.

PubMed

Serafini, R; Longobardi, V; Spadetta, M; Neri, D; Ariota, B; Gasparrini, B; Di Palo, R

2014-02-01

The aim of this study was to test the reliability of Trypan blue/Giemsa staining for evaluating sperm membrane integrity, acrosomal intactness and morphology in stallions, and to verify whether it could be applied in vitro as a useful tool for assessing sperm fertilizing ability. Fertility data on inseminated mares were collected to evaluate the relationship of sperm quality to pregnancy rates. Forty-one ejaculates were collected from 3 stallions of the Salernitano horse breed and evaluated for gross appearance, volume, visual motility and membrane integrity with Trypan blue/Giemsa staining, and thirty-five mares were inseminated during the breeding season from April to July. Differences among stallions were found in volume, sperm concentration (p < 0.05) and visual motility (p < 0.01). A decrease in sperm motility, concentration (p < 0.05) and total sperm number was found in June-July (p < 0.01). Live sperm with intact acrosome (LSIA) and proximal droplets (PD) were lower (p < 0.01) in June-July, while acrosome reacted sperm (ARS) percentage increased (p < 0.05). No fertility differences were found among stallions, with an average fertility per cycle of 44.6% and a pregnancy rate of 68.6%. Higher percentages of LSIA were found in the ejaculates used to inseminate mares that became pregnant vs those used in mares that did not (p < 0.05). The significance of LSIA as a test variable to verify the reliability of Trypan blue/Giemsa staining was confirmed by receiver operating characteristic (ROC) analysis, and the sensitivity of the test was 85% at a cut-off value of 48% LSIA. Trypan blue/Giemsa staining proved to be an accurate method that can be applied in the field to evaluate sperm membrane integrity and to identify poor-quality ejaculates. © 2013 Blackwell Verlag GmbH.
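The abstract quotes a sensitivity at a single cut-off taken from a ROC analysis. A minimal sketch of that calculation, assuming one LSIA percentage per insemination and a true/false pregnancy outcome, is shown below. The function name, the rule that values at or above the cut-off count as test-positive, and the example numbers are all assumptions made for illustration.

```python
def sensitivity_specificity(values, outcomes, cutoff):
    """Sensitivity and specificity when values >= cutoff are called positive."""
    if len(values) != len(outcomes):
        raise ValueError("values and outcomes must have the same length")
    tp = sum(1 for v, o in zip(values, outcomes) if o and v >= cutoff)
    fn = sum(1 for v, o in zip(values, outcomes) if o and v < cutoff)
    tn = sum(1 for v, o in zip(values, outcomes) if not o and v < cutoff)
    fp = sum(1 for v, o in zip(values, outcomes) if not o and v >= cutoff)
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("need at least one positive and one negative outcome")
    return tp / (tp + fn), tn / (tn + fp)


if __name__ == "__main__":
    # Hypothetical LSIA percentages and pregnancy outcomes.
    lsia = [60, 50, 40, 55, 30, 45]
    pregnant = [True, True, True, False, False, False]
    print(sensitivity_specificity(lsia, pregnant, cutoff=48))
```

Repeating the calculation over a range of cut-offs and plotting sensitivity against 1 minus specificity gives the ROC curve from which a single operating point such as 48% is chosen.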

A Comprehensive Histological Assessment of Osteoarthritis Lesions in Mice

PubMed Central

McNulty, Margaret A.; Loeser, Richard F.; Davey, Cynthia; Callahan, Michael F.; Ferguson, Cristin M.; Carlson, Cathy S.

2011-01-01

Objective: Accurate histological assessment of osteoarthritis (OA) is critical in studies evaluating the effects of interventions on disease severity. The purpose of the present study was to develop a histological grading scheme that comprehensively and quantitatively assesses changes in multiple tissues that are associated with OA of the stifle joint in mice. Design: Two representative midcoronal sections from 158 stifle joints, including naturally occurring and surgically induced OA, were stained with H&E and Safranin-O stains. All slides were evaluated to characterize the changes present. A grading scheme that includes both measurements and semiquantitative scores was developed, and principal components analysis (PCA) was applied to the resulting data from the medial tibial plateaus. A subset of 30 tibial plateaus representing a wide range of severity was then evaluated by 4 observers. Reliability of the results was evaluated using intraclass correlation coefficients (ICCs) and area under the receiver operating characteristic (ROC) curve. Results: Five factors were retained by PCA, accounting for 74% of the total variance. Interobserver and intraobserver reproducibilities for evaluations of articular cartilage and subchondral bone were acceptable. The articular cartilage integrity and chondrocyte viability factor scores were able to distinguish severe OA from normal, minimal, mild, and moderate disease. Conclusion: This newly developed grading scheme and resulting factors characterize a range of joint changes in mouse stifle joints that are associated with OA. Overall, the newly developed scheme is reliable and reproducible, characterizes changes in multiple tissues, and provides comprehensive information regarding a specific site in the stifle joint. PMID:26069594
Reliable numerical computation in an optimal output-feedback design

NASA Technical Reports Server (NTRS)

Vansteenwyk, Brett; Ly, Uy-Loi

1991-01-01

A reliable algorithm is presented for the evaluation of a quadratic performance index and its gradients with respect to the controller design parameters. The algorithm is a part of a design algorithm for optimal linear dynamic output-feedback controller that minimizes a finite-time quadratic performance index. The numerical scheme is particularly robust when it is applied to the control-law synthesis for systems with densely packed modes and where there is a high likelihood of encountering degeneracies in the closed-loop eigensystem. This approach through the use of an accurate Pade series approximation does not require the closed-loop system matrix to be diagonalizable. The algorithm was included in a control design package for optimal robust low-order controllers. Usefulness of the proposed numerical algorithm was demonstrated using numerous practical design cases where degeneracies occur frequently in the closed-loop system under an arbitrary controller design initialization and during the numerical search.
Data Envelopment Analysis in the Presence of Measurement Error: Case Study from the National Database of Nursing Quality Indicators® (NDNQI®)

PubMed Central

Gajewski, Byron J.; Lee, Robert; Dunton, Nancy

2012-01-01

Data Envelopment Analysis (DEA) is the most commonly used approach for evaluating healthcare efficiency (Hollingsworth, 2008), but a long-standing concern is that DEA assumes that data are measured without error. This is quite unlikely, and DEA and other efficiency analysis techniques may yield biased efficiency estimates if it is not realized (Gajewski, Lee, Bott, Piamjariyakul and Taunton, 2009; Ruggiero, 2004). We propose to address measurement error systematically using a Bayesian method (Bayesian DEA). We will apply Bayesian DEA to data from the National Database of Nursing Quality Indicators® (NDNQI®) to estimate nursing units’ efficiency. Several external reliability studies inform the posterior distribution of the measurement error on the DEA variables. We will discuss the case of generalizing the approach to situations where an external reliability study is not feasible. PMID:23328796
How reliably can a material be classified as a nanomaterial? Available particle-sizing techniques at work

NASA Astrophysics Data System (ADS)

Babick, Frank; Mielke, Johannes; Wohlleben, Wendel; Weigel, Stefan; Hodoroaba, Vasile-Dan

2016-06-01

Currently established and projected regulatory frameworks require the classification of materials (whether nano or non-nano) as specified by respective definitions, most of which are based on the size of the constituent particles. This brings up the question if currently available techniques for particle size determination are capable of reliably classifying materials that potentially fall under these definitions. In this study, a wide variety of characterisation techniques, including counting, fractionating, and spectroscopic techniques, has been applied to the same set of materials under harmonised conditions. The selected materials comprised well-defined quality control materials (spherical, monodisperse) as well as industrial materials of complex shapes and considerable polydispersity. As a result, each technique could be evaluated with respect to the determination of the number-weighted median size. Recommendations on the most appropriate and efficient use of techniques for different types of material are given.
The procedure for determining the residual life of high-temperature aggregates

NASA Astrophysics Data System (ADS)

Nikiforov, A. S.; Prihodko, E. V.; Kinzhibekova, A. K.; Karmanov, A. E.

2018-01-01

One of the main reasons for withdrawing high-temperature aggregates for repair is the destruction of enclosing structures caused by temperature stresses. The wide range of refractory materials used, the large number of product names, and differences in the operation of even identical aggregates make it impossible to apply general principles for determining the residual life of high-temperature aggregates, which is based, as a rule, on the determination of temperature stresses. The article proposes a technique based on simulation modeling that allows the residual life and reliability of operating equipment to be estimated. Calculated values of these indicators are given for a 25-ton steel-casting ladle. The values obtained make it possible to evaluate whether further operation of the high-temperature unit is rational, given the reliability of the enclosing structures.
Advanced approach to the analysis of a series of in-situ nuclear forward scattering experiments

NASA Astrophysics Data System (ADS)

Vrba, Vlastimil; Procházka, Vít; Smrčka, David; Miglierini, Marcel

2017-03-01

This study introduces a sequential fitting procedure as a specific approach to nuclear forward scattering (NFS) data evaluation. Principles and usage of this advanced evaluation method are described in detail, and its utilization is demonstrated on NFS in-situ investigations of fast processes. Such experiments frequently consist of hundreds of time spectra which need to be evaluated. The introduced procedure allows the analysis of these experiments and significantly decreases the time needed for the data evaluation. The key contributions of the study are the sequential use of the output fitting parameters of a previous data set as the input parameters for the next data set and the model suitability crosscheck option of applying the procedure in ascending and descending directions of the data sets. The described fitting methodology is beneficial for checking model validity and the reliability of the obtained results.
Reliability models: the influence of model specification in generation expansion planning

DOE Office of Scientific and Technical Information (OSTI.GOV)

Stremel, J.P.

1982-10-01

This paper is a critical evaluation of reliability methods used for generation expansion planning. It is shown that the methods for treating uncertainty are critical for determining the relative reliability value of expansion alternatives. It is also shown that the specification of the reliability model will not favor all expansion options equally. Consequently, the model is biased. In addition, reliability models should be augmented with an economic value of reliability (such as the cost of emergency procedures or energy not served). Generation expansion evaluations which ignore the economic value of excess reliability can be shown to be inconsistent. The conclusions are that, in general, a reliability model simplifies generation expansion planning evaluations. However, for a thorough analysis, the expansion options should be reviewed for candidates which may be unduly rejected because of the bias of the reliability model. And this implies that for a consistent formulation in an optimization framework, the reliability model should be replaced with a full economic optimization which includes the costs of emergency procedures and interruptions in the objective function.
Improving the quality of discrete-choice experiments in health: how can we assess validity and reliability?

PubMed

Janssen, Ellen M; Marshall, Deborah A; Hauber, A Brett; Bridges, John F P

2017-12-01

The recent endorsement of discrete-choice experiments (DCEs) and other stated-preference methods by regulatory and health technology assessment (HTA) agencies has placed a greater focus on demonstrating the validity and reliability of preference results. Areas covered: We present a practical overview of tests of validity and reliability that have been applied in the health DCE literature and explore other study qualities of DCEs. From the published literature, we identify a variety of methods to assess the validity and reliability of DCEs. We conceptualize these methods to create a conceptual model with four domains: measurement validity, measurement reliability, choice validity, and choice reliability. Each domain consists of three categories that can be assessed using one to four procedures (for a total of 24 tests). We present how these tests have been applied in the literature and direct readers to applications of these tests in the health DCE literature. Based on a stakeholder engagement exercise, we consider the importance of study characteristics beyond traditional concepts of validity and reliability. Expert commentary: We discuss study design considerations to assess the validity and reliability of a DCE, consider limitations to the current application of tests, and discuss future work to consider the quality of DCEs in healthcare.
Construction and Evaluation of Reliability and Validity of Reasoning Ability Test

ERIC Educational Resources Information Center

Bhat, Mehraj A.

2014-01-01

This paper is based on the construction and evaluation of the reliability and validity of a reasoning ability test for secondary school students. In this paper an attempt was made to evaluate validity and reliability and to determine the appropriate standards for interpreting the results of the reasoning ability test. The test includes 45 items to measure six types…
Slow Crack Growth and Fatigue Life Prediction of Ceramic Components Subjected to Variable Load History

NASA Technical Reports Server (NTRS)

Jadaan, Osama

2001-01-01

Present capabilities of the NASA CARES/Life (Ceramic Analysis and Reliability Evaluation of Structures/Life) code include probabilistic life prediction of ceramic components subjected to fast fracture, slow crack growth (stress corrosion), and cyclic fatigue failure modes. Currently, this code has the capability to compute the time-dependent reliability of ceramic structures subjected to simple time-dependent loading. For example, in slow crack growth (SCG) type failure conditions CARES/Life can handle the cases of sustained and linearly increasing time-dependent loads, while for cyclic fatigue applications various types of repetitive constant amplitude loads can be accounted for. In real applications applied loads are rarely that simple, but rather vary with time in more complex ways such as, for example, engine start up, shut down, and dynamic and vibrational loads. In addition, when a given component is subjected to transient environmental and/or thermal conditions, the material properties also vary with time. The objective of this paper is to demonstrate a methodology capable of predicting the time-dependent reliability of components subjected to transient thermomechanical loads that takes into account the change in material response with time. In this paper, the dominant delayed failure mechanism is assumed to be SCG. This capability has been added to the NASA CARES/Life code, which has also been modified to have the ability of interfacing with commercially available FEA codes executed for transient load histories. An example involving a ceramic exhaust valve subjected to combustion cycle loads is presented to demonstrate the viability of this methodology and the CARES/Life program.
Content validation: clarity/relevance, reliability and internal consistency of enunciative signs of language acquisition.

PubMed

Crestani, Anelise Henrich; Moraes, Anaelena Bragança de; Souza, Ana Paula Ramos de

2017-08-10

To analyze the results of the validation of enunciative signs of language acquisition constructed for children aged 3 to 12 months. The signs were built based on mechanisms of language acquisition in an enunciative perspective and on clinical experience with language disorders. The signs were submitted to judgment of clarity and relevance by a sample of six experts, holders of doctorates in linguistics with knowledge of psycholinguistics and the language clinic. In the validation of reliability, two judges/evaluators helped to apply the instruments to videos of 20% of the total sample of mother-infant dyads using the inter-evaluator method. The internal consistency method was applied to the total sample, which consisted of 94 mother-infant dyads for the contents of Phase 1 (3-6 months) and 61 mother-infant dyads for the contents of Phase 2 (7 to 12 months). The data were collected through the analysis of mother-infant interaction based on filming of dyads and application of the parameters to be validated according to the child's age. Data were organized in a spreadsheet and then transferred to computer applications for statistical analysis. The judgments of clarity/relevance indicated no modifications to be made in the instruments. The reliability test showed an almost perfect agreement between judges (0.8 ≤ Kappa ≤ 1.0); only item 2 of Phase 1 showed substantial agreement (0.6 ≤ Kappa ≤ 0.79). The internal consistency for Phase 1 had alpha = 0.84, and Phase 2, alpha = 0.74. This demonstrates the reliability of the instruments. The results suggest adequacy as to content validity of the instruments created for both age groups, demonstrating the relevance of the content of enunciative signs of language acquisition.
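The inter-judge agreement reported above is a kappa statistic. As a minimal sketch, assuming Cohen's kappa for two judges (the abstract does not say which kappa variant was used) and using made-up ratings, the calculation could look like this:

```python
from collections import Counter

def cohen_kappa(rater_a, rater_b):
    """Cohen's kappa for two raters who labelled the same items."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("ratings must be non-empty and of equal length")
    n = len(rater_a)
    # Proportion of items on which the two raters agree.
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Agreement expected by chance, from each rater's label frequencies.
    count_a, count_b = Counter(rater_a), Counter(rater_b)
    expected = sum(count_a[c] * count_b[c] for c in count_a) / (n * n)
    if expected == 1.0:
        return 1.0  # both raters used one identical label throughout
    return (observed - expected) / (1.0 - expected)

# Hypothetical ratings (1 = sign present, 0 = absent) for ten videos.
judge_1 = [1, 1, 0, 1, 0, 1, 1, 0, 1, 1]
judge_2 = [1, 1, 0, 1, 0, 1, 0, 0, 1, 1]
kappa = cohen_kappa(judge_1, judge_2)
print(f"kappa = {kappa:.2f}")
```

With these illustrative ratings the judges agree on nine of ten videos, and kappa falls in the band the abstract labels substantial agreement.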
Measurement Properties of the Persian Translated Version of Graves Orbitopathy Quality of Life Questionnaire: A Validation Study.

PubMed

Kashkouli, Mohsen Bahmani; Karimi, Nasser; Aghamirsalim, Mohamadreza; Abtahi, Mohammad Bagher; Nojomi, Marzieh; Shahrad-Bejestani, Hadi; Salehi, Masoud

2017-02-01

To determine the measurement properties of the Persian language version of the Graves orbitopathy quality of life questionnaire (GO-QOL). Following a systematic translation and cultural adaptation process, 141 consecutive unselected thyroid eye disease (TED) patients answered the Persian GO-QOL and underwent complete ophthalmic examination. The questionnaire was again completed by 60 patients on the second visit, 2-4 weeks later. Construct validity (cross-cultural validity, structural validity and hypotheses testing), reliability (internal consistency and test-retest reliability), and floor and ceiling effects of the Persian version of the GO-QOL were evaluated. Furthermore, Rasch analysis was used to assess its psychometric properties. Cross-cultural validity was established by back-translation techniques, committee review and pretesting techniques. Bi-dimensionality of the questionnaire was confirmed by factor analysis. Construct validity was also supported through confirmation of 6 out of 8 predefined hypotheses. Cronbach's α and intraclass correlation coefficient (ICC) were 0.650 and 0.859 for visual functioning and 0.875 and 0.896 for appearance subscale, respectively. Mean quality of life (QOL) scores for visual functioning and appearance were 78.18 (standard deviation, SD, 21.57) and 56.25 (SD 26.87), respectively. Person reliabilities from the Rasch rating scale model for both visual functioning and appearance revealed an acceptable internal consistency for the Persian GO-QOL. The Persian GO-QOL questionnaire is a valid and reliable tool with good psychometric properties in evaluation of Persian-speaking patients with TED. Applying Rasch analysis to future versions of the GO-QOL is recommended in order to perform tests for linearity between the estimated item measures in different versions.
[Validity and reliability of Pediatric Quality of Life Inventory Version 4.0 Generic Core Scales in Chinese children and adolescents].

PubMed

Chen, Yu-Ming; He, Li-Ping; Mai, Jin-Cheng; Hao, Yuan-Tao; Xiong, Li-Hua; Chen, Wei-Qing; Wu, Jiang-Nan

2008-06-01

To evaluate the reliability and validity of parent proxy-report scales of Pediatric Quality of Life Inventory Version 4.0 (PedsQL 4.0) Generic Core Scales, the Chinese Version. 3493 school students aged 6-18 years were recruited using a multistage cluster sampling method. Health-related quality of life was assessed using the above-mentioned PedsQL 4.0 scales. The internal consistency was assessed using Cronbach's α coefficient, while validity was tested through correlation analysis, t-test and exploratory factor analysis. The internal consistency reliability for Total Scale Score (Cronbach's alpha = 0.90), Physical Health Summary Score (alpha = 0.81), and Psychosocial Health Summary Score (alpha = 0.89) was excellent. Six major factors were extracted by factor analysis, which basically matched the designed structure of the original version and accounted for nearly 66% of the variance. The Total Scale Score significantly decreased by 3.5 to 13.3 (P < 0.05) in children and adolescents who had diseases including cold, skin hypersensitiveness, food allergy, courbature or arthralgia, breathlessness with a frequency of 6 times or more per year or had asthma as compared to those with lower frequency (≤ 5 times/y) of the diseases or without asthma. We found moderate to high correlations between items and the subscales. Correlation coefficients ranged from 0.45 to 0.84 (P < 0.01). The reliability and validity of the parent proxy-report scales of PedsQL 4.0 Generic Core Scales of the Chinese Version were as good as those of the original version. Our findings suggest that the scales could be applied to evaluate the health-related quality of life of children in Chinese regions similar to Guangzhou.
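For readers unfamiliar with the internal-consistency figures quoted above, the following is a minimal sketch of the standard Cronbach's alpha formula, alpha = k/(k-1) * (1 - sum of item variances / variance of total scores). The response matrix is invented for illustration and is not PedsQL data.

```python
from statistics import variance

def cronbach_alpha(scores):
    """Cronbach's alpha.

    scores: list of respondents, each a list with one score per item.
    """
    k = len(scores[0])
    if k < 2 or len(scores) < 2:
        raise ValueError("need at least two items and two respondents")
    # Variance of each item (column) across respondents.
    item_variances = [variance(col) for col in zip(*scores)]
    # Variance of each respondent's total score.
    total_variance = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - sum(item_variances) / total_variance)

# Hypothetical responses: five respondents, four items on a 0-4 scale.
responses = [
    [4, 3, 4, 4],
    [2, 2, 3, 2],
    [3, 3, 3, 4],
    [1, 0, 1, 1],
    [4, 4, 3, 4],
]
alpha = cronbach_alpha(responses)
print(f"alpha = {alpha:.2f}")
```

Sample variances are used for both the items and the totals; population variances would give the same alpha because the scaling factor cancels in the ratio.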
A Case Study on Improving Intensive Care Unit (ICU) Services Reliability: By Using Process Failure Mode and Effects Analysis (PFMEA)

PubMed Central

Yousefinezhadi, Taraneh; Jannesar Nobari, Farnaz Attar; Goodari, Faranak Behzadi; Arab, Mohammad

2016-01-01

Introduction: In any complex human system, human error is inevitable and cannot be eliminated by blaming wrongdoers. With the aim of improving the reliability of Intensive Care Units (ICUs) in hospitals, this research therefore identifies and analyzes ICU process failure modes from the standpoint of a systematic approach to errors. Methods: In this descriptive research, data were gathered qualitatively through observations, document reviews, and Focus Group Discussions (FGDs) with the process owners in two selected ICUs in Tehran in 2014. Data analysis, however, was quantitative, based on each failure’s Risk Priority Number (RPN) as defined in the Failure Modes and Effects Analysis (FMEA) method. In addition, some causes of failures were analyzed with the qualitative Eindhoven Classification Model (ECM). Results: Through FMEA methodology, 378 potential failure modes from 180 ICU activities in hospital A and 184 potential failures from 99 ICU activities in hospital B were identified and evaluated. Then, with 90% reliability (RPN≥100), a total of 18 failures in hospital A and 42 in hospital B were identified as non-acceptable risks, and their causes were analyzed by ECM. Conclusions: Applying the modified PFMEA to improve the process reliability of two selected ICUs in two different kinds of hospitals shows that this method empowers staff to identify, evaluate, prioritize and analyze all potential failure modes, and also makes them eager to identify their causes, recommend corrective actions and even participate in improving processes without feeling blamed by top management. Moreover, by combining FMEA and ECM, team members can easily identify failure causes from a health care perspective. PMID:27157162
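The screening step described above can be sketched in a few lines. In conventional FMEA the RPN is the product of severity, occurrence and detection ratings; the RPN≥100 cut-off comes from the abstract, while the failure modes and their ratings below are hypothetical examples, not data from the study.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class FailureMode:
    description: str
    severity: int     # conventional FMEA rating, 1 (minor) to 10 (severe)
    occurrence: int   # 1 (rare) to 10 (frequent)
    detection: int    # 1 (easily detected) to 10 (hard to detect)

    @property
    def rpn(self) -> int:
        """Risk Priority Number = severity x occurrence x detection."""
        return self.severity * self.occurrence * self.detection

def non_acceptable(modes, threshold=100):
    """Return modes with RPN >= threshold, highest risk first."""
    flagged = [m for m in modes if m.rpn >= threshold]
    return sorted(flagged, key=lambda m: m.rpn, reverse=True)

# Hypothetical ICU failure modes, for illustration only.
modes = [
    FailureMode("Medication dose transcribed incorrectly", 8, 4, 5),
    FailureMode("Patient wristband missing at admission", 3, 2, 4),
    FailureMode("Ventilator alarm limits not reset", 5, 5, 4),
]
for m in non_acceptable(modes):
    print(m.rpn, m.description)
```

Sorting the flagged modes by RPN gives the prioritized list that a team would then pass on to cause analysis.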
SLOWLY REPEATED EVOKED PAIN (SREP) AS A MARKER OF CENTRAL SENSITIZATION IN FIBROMYALGIA: DIAGNOSTIC ACCURACY AND RELIABILITY IN COMPARISON WITH TEMPORAL SUMMATION OF PAIN.

PubMed

de la Coba, Pablo; Bruehl, Stephen; Gálvez-Sánchez, Carmen María; Reyes Del Paso, Gustavo A

2018-05-01

This study examined the diagnostic accuracy and test-retest reliability of a novel dynamic evoked pain protocol (slowly repeated evoked pain; SREP) compared to temporal summation of pain (TSP), a standard index of central sensitization. Thirty-five fibromyalgia (FM) and 30 rheumatoid arthritis (RA) patients completed, in pseudorandomized order, a standard mechanical TSP protocol (10 stimuli of 1s duration at the thenar eminence using a 300g monofilament with 1s interstimulus interval) and the SREP protocol (9 suprathreshold pressure stimuli of 5s duration applied to the fingernail with a 30s interstimulus interval). In order to evaluate reliability for both protocols, they were repeated in a second session 4-7 days later. Evidence for significant pain sensitization over trials (increasing pain intensity ratings) was observed for SREP in FM (p<.001) but not in RA (p=.35), whereas significant sensitization was observed in both diagnostic groups for the TSP protocol (p's<.008). Compared to TSP, SREP demonstrated higher overall diagnostic accuracy (87.7% vs. 64.6%), greater sensitivity (0.89 vs. 0.57), and greater specificity (0.87 vs. 0.73) in discriminating between FM and RA patients. Test-retest reliability of SREP sensitization was good in FM (ICCs: 0.80), and moderate in RA (ICC: 0.68). SREP seems to be a dynamic evoked pain index tapping into pain sensitization that allows for greater diagnostic accuracy in identifying FM patients compared to a standard TSP protocol. Further research is needed to study mechanisms underlying SREP and the potential utility of adding SREP to standard pain evaluation protocols.
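The accuracy figures above follow from a 2x2 classification table. The abstract does not report the cell counts, so the counts below are an assumption: they are back-calculated from the group sizes (35 FM, 30 RA) and the rounded sensitivity and specificity, treating FM as the positive class.

```python
def diagnostic_metrics(tp, fn, tn, fp):
    """Return (sensitivity, specificity, accuracy) from 2x2 counts."""
    sensitivity = tp / (tp + fn)          # true positive rate
    specificity = tn / (tn + fp)          # true negative rate
    accuracy = (tp + tn) / (tp + fn + tn + fp)
    return sensitivity, specificity, accuracy

# Assumed counts, reconstructed from the reported rates (not published data).
srep = diagnostic_metrics(tp=31, fn=4, tn=26, fp=4)
tsp = diagnostic_metrics(tp=20, fn=15, tn=22, fp=8)

for name, (sens, spec, acc) in (("SREP", srep), ("TSP", tsp)):
    print(f"{name}: sensitivity={sens:.2f} specificity={spec:.2f} "
          f"accuracy={acc * 100:.1f}%")
```

Under these assumed counts the computed values round to the figures quoted in the abstract, which is a quick consistency check on the reported rates.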
Field reliability of competency and sanity opinions: A systematic review and meta-analysis.

PubMed

Guarnera, Lucy A; Murrie, Daniel C

2017-06-01

We know surprisingly little about the interrater reliability of forensic psychological opinions, even though courts and other authorities have long called for known error rates for scientific procedures admitted as courtroom testimony. This is particularly true for opinions produced during routine practice in the field, even for some of the most common types of forensic evaluations-evaluations of adjudicative competency and legal sanity. To address this gap, we used meta-analytic procedures and study space methodology to systematically review studies that examined the interrater reliability-particularly the field reliability-of competency and sanity opinions. Of 59 identified studies, 9 addressed the field reliability of competency opinions and 8 addressed the field reliability of sanity opinions. These studies presented a wide range of reliability estimates; pairwise percentage agreements ranged from 57% to 100% and kappas ranged from .28 to 1.0. Meta-analytic combinations of reliability estimates obtained by independent evaluators returned estimates of κ = .49 (95% CI: .40-.58) for competency opinions and κ = .41 (95% CI: .29-.53) for sanity opinions. This wide range of reliability estimates underscores the extent to which different evaluation contexts tend to produce different reliability rates. Unfortunately, our study space analysis illustrates that available field reliability studies typically provide little information about contextual variables crucial to understanding their findings. Given these concerns, we offer suggestions for improving research on the field reliability of competency and sanity opinions, as well as suggestions for improving reliability rates themselves. (PsycINFO Database Record (c) 2017 APA, all rights reserved).
Adaptation to Spanish language and validation of the fecal incontinence quality of life scale.

PubMed

Minguez, Miguel; Garrigues, Vicente; Soria, Maria Jose; Andreu, Montserrat; Mearin, Fermin; Clave, Pere

2006-04-01

The aim of this study was to perform a psychometric evaluation of the Fecal Incontinence Quality of Life Scale in the Spanish language. Eleven hospitals in Spain participated in the study, which included 118 patients with active fecal incontinence. All the patients filled out a questionnaire on the severity of their incontinence, a general questionnaire of health (Medical Outcomes Survey Short Form), and a Spanish translation of the Fecal Incontinence Quality of Life Scale (Cuestionario de Calidad de Vida de Incontinencia Anal), which consists of 29 items in four domains: lifestyle, behavior, depression, and embarrassment. On a second visit, patients repeated the Fecal Incontinence Quality of Life Scale. For each domain, an evaluation was made of temporal reliability, internal reliability, the convergent validity with the generic questionnaire of health, and the discriminant validity correlating the domains of Cuestionario de Calidad de Vida de Incontinencia Anal with the severity of fecal incontinence. For cultural adaptation, the answer alternatives for 14 items were modified. A total of 111 patients (94 percent) completed the study adequately. Temporal reliability (test-retest) was good for all domains except for embarrassment, which showed significant differences (P < 0.02). Internal reliability was good/excellent for all domains (Cronbach alpha >0.80, between 0.84 and 0.96). The four domains of Cuestionario de Calidad de Vida de Incontinencia Anal significantly correlated with the domains of the generic questionnaire on health (P < 0.01) and with the scale of severity of fecal incontinence (P < 0.001). All domains of Cuestionario de Calidad de Vida de Incontinencia Anal correlated negatively with the need to wear pads (P < 0.01) and with the presence of complete fecal incontinence. The Cuestionario de Calidad de Vida de Incontinencia Anal incorporates sufficient requirements of reliability and validity to be applied to patients with fecal incontinence.
An evidence-based decision assistance model for predicting training outcome in juvenile guide dogs

PubMed Central

Craigon, Peter J.; Blythe, Simon A.; England, Gary C. W.; Asher, Lucy

2017-01-01

Working dog organisations, such as Guide Dogs, need to regularly assess the behaviour of the dogs they train. In this study we developed a questionnaire-style behaviour assessment completed by training supervisors of juvenile guide dogs aged 5, 8 and 12 months old (n = 1,401), and evaluated aspects of its reliability and validity. Specifically, internal reliability, temporal consistency, construct validity, predictive criterion validity (comparing against later training outcome) and concurrent criterion validity (comparing against a standardised behaviour test) were evaluated. Thirty-nine questions were sourced either from previously published literature or created to meet requirements identified via Guide Dogs staff surveys and staff feedback. Internal reliability analyses revealed seven reliable and interpretable trait scales named according to the questions within them as: Adaptability; Body Sensitivity; Distractibility; Excitability; General Anxiety; Trainability and Stair Anxiety. Intra-individual temporal consistency of the scale scores between 5–8, 8–12 and 5–12 months was high. All scales except Body Sensitivity showed some degree of concurrent criterion validity. Predictive criterion validity was supported for all seven scales, since associations were found with training outcome at at least one age. Thresholds of z-scores on the scales were identified that were able to distinguish later training outcome by identifying 8.4% of all dogs withdrawn for behaviour and 8.5% of all qualified dogs, with 84% and 85% specificity. The questionnaire assessment was reliable and could detect traits that are consistent within individuals over time, despite juvenile dogs undergoing development during the study period. By applying thresholds to scores produced from the questionnaire this assessment could prove to be a highly valuable decision-making tool for Guide Dogs. This is the first questionnaire-style assessment of juvenile dogs that has shown value in predicting the training outcome of individual working dogs. PMID:28614347
Evaluation of the Transit Reliability Information Program

DOT National Transportation Integrated Search

1982-06-01

This report presents an evaluation of the rail portion of the Transit Reliability Information Program (TRIP), which was designed to collect and analyze equipment reliability data on U.S. transit systems. This assessment was conducted at the end of it...
Reliability analysis of laminated CMC components through shell subelement techniques

NASA Technical Reports Server (NTRS)

Starlinger, Alois; Duffy, Stephen F.; Gyekenyesi, John P.

1992-01-01

An updated version of the integrated design program Composite Ceramics Analysis and Reliability Evaluation of Structures (C/CARES) was developed for the reliability evaluation of ceramic matrix composites (CMC) laminated shell components. The algorithm is now split into two modules: a finite-element data interface program and a reliability evaluation algorithm. More flexibility is achieved, allowing for easy implementation with various finite-element programs. The interface program creates a neutral data base which is then read by the reliability module. This neutral data base concept allows easy data transfer between different computer systems. The new interface program from the finite-element code Matrix Automated Reduction and Coupling (MARC) also includes the option of using hybrid laminates (a combination of plies of different materials or different layups) and allows for variations in temperature fields throughout the component. In the current version of C/CARES, a subelement technique was implemented, enabling stress gradients within an element to be taken into account. The noninteractive reliability function is now evaluated at each Gaussian integration point instead of using averaging techniques. As a result of the increased number of stress evaluation points, considerable improvements in the accuracy of reliability analyses were realized.

Dynamic MRI to quantify musculoskeletal motion: A systematic review of concurrent validity and reliability, and perspectives for evaluation of musculoskeletal disorders

PubMed Central

Lempereur, Mathieu; Lelievre, Mathieu; Burdin, Valérie; Ben Salem, Douraied; Brochard, Sylvain

2017-01-01

Purpose To report evidence for the concurrent validity and reliability of dynamic MRI techniques to evaluate in vivo joint and muscle mechanics, and to propose recommendations for their use in the assessment of normal and impaired musculoskeletal function. Materials and methods The search was conducted on articles published in Web of science, PubMed, Scopus, Academic search Premier, and Cochrane Library between 1990 and August 2017. Studies that reported the concurrent validity and/or reliability of dynamic MRI techniques for in vivo evaluation of joint or muscle mechanics were included after assessment by two independent reviewers. Selected articles were assessed using an adapted quality assessment tool and a data extraction process. Results for concurrent validity and reliability were categorized as poor, moderate, or excellent. Results Twenty articles fulfilled the inclusion criteria with a mean quality assessment score of 66% (±10.4%). Concurrent validity and/or reliability of eight dynamic MRI techniques were reported, with the knee being the most evaluated joint (seven studies). Moderate to excellent concurrent validity and reliability were reported for seven out of eight dynamic MRI techniques. Cine phase contrast and real-time MRI appeared to be the most valid and reliable techniques to evaluate joint motion, and spin tag for muscle motion. Conclusion Dynamic MRI techniques are promising for the in vivo evaluation of musculoskeletal mechanics; however results should be evaluated with caution since validity and reliability have not been determined for all joints and muscles, nor for many pathological conditions. PMID:29232401
An Evaluation of Relative Damage to the Powertrain System in Tracked Vehicles

PubMed Central

Lee, Sang-Ho; Lee, Jeong-Hwan; Goo, Sang-Hwa; Cho, Yong-Cheol; Cho, Ho-Young

2009-01-01

The objective of this study was to improve the reliability of the endurance test for the powertrain system of military tracked vehicles. The measurement system that measures the driving duty applied to the powertrain system caused by mobility on roads consists of eight analog channels and two pulse channels, including the propeller shaft output torques for the left and right sides. The data obtained from this measurement system can be used to introduce a new technology that produces the output torque of a torque converter and that can be applied to analyze the revolution counting for the endurance and road mobility in the front unit and represent the relative fatigue damages analysis technique and its results according to the driven roads through a cumulative fatigue method. PMID:22573990
A Hybrid Neural Network-Genetic Algorithm Technique for Aircraft Engine Performance Diagnostics

NASA Technical Reports Server (NTRS)

Kobayashi, Takahisa; Simon, Donald L.

2001-01-01

In this paper, a model-based diagnostic method, which utilizes Neural Networks and Genetic Algorithms, is investigated. Neural networks are applied to estimate the engine internal health, and Genetic Algorithms are applied for sensor bias detection and estimation. This hybrid approach takes advantage of the nonlinear estimation capability provided by neural networks while improving the robustness to measurement uncertainty through the application of Genetic Algorithms. The hybrid diagnostic technique also has the ability to rank multiple potential solutions for a given set of anomalous sensor measurements in order to reduce false alarms and missed detections. The performance of the hybrid diagnostic technique is evaluated through some case studies derived from a turbofan engine simulation. The results show this approach is promising for reliable diagnostics of aircraft engines.
Concurrent validity and reliability of the Alberta Infant Motor Scale in premature infants.

PubMed

Almeida, Kênnea Martins; Dutra, Maria Virginia Peixoto; Mello, Rosane Reis de; Reis, Ana Beatriz Rodrigues; Martins, Priscila Silveira

2008-01-01

To verify the concurrent validity and interobserver reliability of the Alberta Infant Motor Scale (AIMS) in premature infants followed-up at the outpatient clinic of Instituto Fernandes Figueira, Fundação Oswaldo Cruz (IFF/Fiocruz), in Rio de Janeiro, Brazil. A total of 88 premature infants were enrolled at the follow-up clinic at IFF/Fiocruz, between February and December of 2006. For the concurrent validity study, 46 infants were assessed at either 6 (n = 26) or 12 (n = 20) months' corrected age using the AIMS and the second edition of the Bayley Scales of Infant Development, by two different observers, and applying Pearson's correlation coefficient to analyze the results. For the reliability study, 42 infants between 0 and 18 months were assessed using the Alberta Infant Motor Scale, by two different observers and the results analyzed using the intraclass correlation coefficient. The concurrent validity study found a high level of correlation between the two scales (r = 0.95) and one that was statistically significant (p < 0.01) for the entire population of infants, with higher values at 12 months (r = 0.89) than at 6 months (r = 0.74). The interobserver reliability study found satisfactory intraclass correlation coefficients at all ages tested, varying from 0.76 to 0.99. The AIMS is a valid and reliable instrument for the evaluation of motor development in high-risk infants within the Brazilian public health system.
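Interobserver reliability in this record is summarized with the intraclass correlation coefficient. The abstract does not state which ICC form was used; the sketch below assumes ICC(2,1) in the Shrout and Fleiss convention (two-way random effects, absolute agreement, single rater) and uses invented scores.

```python
def icc_2_1(ratings):
    """ICC(2,1): two-way random effects, absolute agreement, single rater.

    ratings: list of subjects, each a sequence with one score per rater.
    """
    n = len(ratings)        # number of subjects
    k = len(ratings[0])     # number of raters
    grand = sum(sum(r) for r in ratings) / (n * k)
    row_means = [sum(r) / k for r in ratings]
    col_means = [sum(r[j] for r in ratings) / n for j in range(k)]
    # Sums of squares from the two-way ANOVA decomposition.
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for r in ratings for x in r)
    ss_error = ss_total - ss_rows - ss_cols
    msr = ss_rows / (n - 1)                 # between-subjects mean square
    msc = ss_cols / (k - 1)                 # between-raters mean square
    mse = ss_error / ((n - 1) * (k - 1))    # residual mean square
    return (msr - mse) / (msr + (k - 1) * mse + k * (msc - mse) / n)

# Hypothetical total scores given by two observers to five infants.
observer_a = [12, 25, 31, 40, 52]
observer_b = [14, 24, 33, 41, 50]
icc = icc_2_1(list(zip(observer_a, observer_b)))
print(f"ICC(2,1) = {icc:.2f}")
```

A different ICC form (for example consistency rather than absolute agreement) would use the same mean squares with a different denominator, so the choice should match the study design.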
Time-saving design of experiment protocol for optimization of LC-MS data processing in metabolomic approaches.

PubMed

Zheng, Hong; Clausen, Morten Rahr; Dalsgaard, Trine Kastrup; Mortensen, Grith; Bertram, Hanne Christine

2013-08-06

We describe a time-saving protocol for the processing of LC-MS-based metabolomics data by optimizing parameter settings in XCMS and threshold settings for removing noisy and low-intensity peaks using design of experiment (DoE) approaches including Plackett-Burman design (PBD) for screening and central composite design (CCD) for optimization. A reliability index, which is based on evaluation of the linear response to a dilution series, was used as a parameter for the assessment of data quality. After identifying the significant parameters in the XCMS software by PBD, CCD was applied to determine their values by maximizing the reliability and group indexes. Optimal settings by DoE resulted in improvements of 19.4% and 54.7% in the reliability index for a standard mixture and human urine, respectively, as compared with the default setting, and a total of 38 h was required to complete the optimization. Moreover, threshold settings were optimized by using CCD for further improvement. The approach combining optimal parameter setting and the threshold method improved the reliability index about 9.5 times for a standards mixture and 14.5 times for human urine data, which required a total of 41 h. Validation results also showed improvements in the reliability index of about 5-7 times even for urine samples from different subjects. It is concluded that the proposed methodology can be used as a time-saving approach for improving the processing of LC-MS-based metabolomics data.
Reliability and risk assessment of structures

NASA Technical Reports Server (NTRS)

Chamis, C. C.

1991-01-01

Development of reliability and risk assessment of structural components and structures is a major activity at Lewis Research Center. It consists of five program elements: (1) probabilistic loads; (2) probabilistic finite element analysis; (3) probabilistic material behavior; (4) assessment of reliability and risk; and (5) probabilistic structural performance evaluation. Recent progress includes: (1) the evaluation of the various uncertainties in terms of cumulative distribution functions for various structural response variables based on known or assumed uncertainties in primitive structural variables; (2) evaluation of the failure probability; (3) reliability and risk-cost assessment; and (4) an outline of an emerging approach for eventual certification of man-rated structures by computational methods. Collectively, the results demonstrate that the structural durability/reliability of man-rated structural components and structures can be effectively evaluated by using formal probabilistic methods.
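To make the idea of evaluating a failure probability from uncertain primitive variables concrete, here is a textbook stress-strength sketch. It assumes independent, normally distributed strength and load, which is a simplification chosen for illustration and not the probabilistic method described in the report; all numbers are hypothetical.

```python
from statistics import NormalDist

def failure_probability(mean_strength, sd_strength, mean_load, sd_load):
    """P(load > strength) for independent normal strength and load."""
    # The safety margin (strength - load) is itself normally distributed.
    margin = NormalDist(
        mean_strength - mean_load,
        (sd_strength ** 2 + sd_load ** 2) ** 0.5,
    )
    # Failure occurs when the margin is negative.
    return margin.cdf(0.0)

# Hypothetical values in MPa.
p_fail = failure_probability(mean_strength=500.0, sd_strength=30.0,
                             mean_load=380.0, sd_load=40.0)
print(f"failure probability = {p_fail:.4f}, reliability = {1 - p_fail:.4f}")
```

For non-normal or correlated variables the closed form no longer applies, and sampling or numerical integration would be needed instead.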
Soft error evaluation and vulnerability analysis in Xilinx Zynq-7010 system-on chip

NASA Astrophysics Data System (ADS)

Du, Xuecheng; He, Chaohui; Liu, Shuhuan; Zhang, Yao; Li, Yonghong; Xiong, Ceng; Tan, Pengkang

2016-09-01

Radiation-induced soft errors are an increasingly important threat to the reliability of modern electronic systems. In order to evaluate the reliability and soft errors of a system-on-chip, the fault tree analysis method was used in this work. The system fault tree was constructed based on the Xilinx Zynq-7010 All Programmable SoC. Moreover, the soft error rates of different components in the Zynq-7010 SoC were tested with an americium-241 alpha radiation source. Furthermore, some parameters used to evaluate the system's reliability and safety, such as failure rate, unavailability and mean time to failure (MTTF), were calculated using Isograph Reliability Workbench 11.0. According to the fault tree analysis of the system-on-chip, the critical blocks and system reliability were evaluated through qualitative and quantitative analysis.
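The quantitative side of a fault tree can be illustrated with a small sketch. It assumes independent basic events with constant (exponential) failure rates; the block names and rates are hypothetical placeholders, not measurements from the paper, and the code does not reproduce the commercial tool mentioned above.

```python
import math

def failure_prob(rate, hours):
    """Probability that a block with constant failure rate fails by `hours`."""
    return 1.0 - math.exp(-rate * hours)

def or_gate(probs):
    """Output event occurs if any independent input event occurs."""
    return 1.0 - math.prod(1.0 - p for p in probs)

def and_gate(probs):
    """Output event occurs only if all independent input events occur."""
    return math.prod(probs)

def mttf_series(rates):
    """MTTF of a series system: constant failure rates add."""
    return 1.0 / sum(rates)

# Hypothetical soft-error rates per hour for three blocks.
rates = {"processor": 2e-4, "on_chip_memory": 5e-4, "dma": 1e-4}
mission_hours = 100.0

block_probs = [failure_prob(r, mission_hours) for r in rates.values()]
top_event = or_gate(block_probs)   # system fails if any block fails
print(f"P(top event by {mission_hours:.0f} h) = {top_event:.4f}")
print(f"MTTF = {mttf_series(rates.values()):.0f} h")
```

An AND gate would model redundancy, for instance two mirrored memories that must both be upset before the system fails.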
Reliability generalization study of the Yale-Brown Obsessive-Compulsive Scale for children and adolescents.

PubMed

López-Pina, José Antonio; Sánchez-Meca, Julio; López-López, José Antonio; Marín-Martínez, Fulgencio; Núñez-Núñez, Rosa Ma; Rosa-Alcázar, Ana I; Gómez-Conesa, Antonia; Ferrer-Requena, Josefa

2015-01-01

The Yale-Brown Obsessive-Compulsive Scale for children and adolescents (CY-BOCS) is a frequently applied test to assess obsessive-compulsive symptoms. We conducted a reliability generalization meta-analysis on the CY-BOCS to estimate the average reliability, search for reliability moderators, and propose a predictive model that researchers and clinicians can use to estimate the expected reliability of the CY-BOCS scores. A total of 47 studies reporting a reliability coefficient with the data at hand were included in the meta-analysis. The results showed good reliability and a large variability associated to the standard deviation of total scores and sample size.
NDE reliability and probability of detection (POD) evolution and paradigm shift

NASA Astrophysics Data System (ADS)

Singh, Surendra

2014-02-01

The subject of NDE Reliability and POD has gone through multiple phases since its humble beginning in the late 1960s. This was followed by several programs including the important one nicknamed "Have Cracks - Will Travel" or in short "Have Cracks" by Lockheed Georgia Company for US Air Force during 1974-1978. This and other studies ultimately led to a series of developments in the field of reliability and POD starting from the introduction of fracture mechanics and Damage Tolerant Design (DTD) to the statistical framework by Berens and Hovey in 1981 for POD estimation to MIL-HDBK-1823 (1999) and 1823A (2009). During the last decade, various groups and researchers have further studied the reliability and POD using Model Assisted POD (MAPOD), Simulation Assisted POD (SAPOD), and Bayesian statistics. Each of these developments had one objective, i.e., improving the accuracy of life prediction in components, which to a large extent depends on the reliability and capability of NDE methods. Therefore, it is essential to have a reliable detection and sizing of large flaws in components. Currently, POD is used for studying reliability and capability of NDE methods, though POD data offers no absolute truth regarding NDE reliability, i.e., system capability, effects of flaw morphology, and quantifying the human factors. Furthermore, reliability and POD have been reported alike in meaning but POD is not NDE reliability. POD is a subset of the reliability that consists of six phases: 1) samples selection using DOE, 2) NDE equipment setup and calibration, 3) System Measurement Evaluation (SME) including Gage Repeatability & Reproducibility (Gage R&R) and Analysis Of Variance (ANOVA), 4) NDE system capability and electronic and physical saturation, 5) acquiring and fitting data to a model, and data analysis, and 6) POD estimation. This paper provides an overview of all major POD milestones for the last several decades and discusses the rationale for using Integrated Computational Materials Engineering (ICME), MAPOD, SAPOD, and Bayesian statistics for studying controllable and non-controllable variables including human factors for estimating POD. Another objective is to list gaps between "hoped for" versus validated or fielded failed hardware.
Reliability and Availability Evaluation Program Manual.

DTIC Science & Technology

1982-11-01

research and development. The manual’s purpose was to provide a practical method for making reliability measurements, measurements directly related to... Research , Development, Test and Evaluation. RMA Reliability, Maintainability and Availability. R&R Repair and Refurbishment, Repair and Replacement, etc...length. phenomena such as mechanical wear and A number of researchers in the reliability chemical deterioration. Maintenance should field 14-pages 402
Assessing Assessment: Evaluating Outcomes and Reliabilities of Grammar, Math, and Writing Skill Measures in an Introductory Journalism Course

ERIC Educational Resources Information Center

Farwell, Tricia M.; Alligood, Leon; Fitzgerald, Sharon; Blake, Ken

2016-01-01

This article introduces an objective grammar and math assessment and evaluates the assessment's outcome and reliability when fielded among eighty-one students in media writing courses. In addition, the article proposes a rubric for grading straight news leads and compares the rubric's reliability with the reliability of rating straight news leads…
The image evaluation of iterative motion correction reconstruction algorithm PROPELLER T2-weighted imaging compared with MultiVane T2-weighted imaging

NASA Astrophysics Data System (ADS)

Lee, Suk-Jun; Yu, Seung-Man

2017-08-01

The purpose of this study was to evaluate the usefulness and clinical applications of MultiVaneXD, which applies an iterative motion correction reconstruction algorithm to T2-weighted images, compared with MultiVane images taken with a 3T MRI. A total of 20 patients with suspected pathologies of the liver and pancreatic-biliary system based on clinical and laboratory findings underwent upper abdominal MRI, acquired using the MultiVane and MultiVaneXD techniques. Two reviewers analyzed the MultiVane and MultiVaneXD T2-weighted images qualitatively and quantitatively. Each reviewer evaluated vessel conspicuity by observing motion artifacts and the sharpness of the portal vein, hepatic vein, and upper organs. The signal-to-noise ratio (SNR) and contrast-to-noise ratio (CNR) were calculated by one reviewer for quantitative analysis. The interclass correlation coefficient was evaluated to measure inter-observer reliability. There were significant differences between MultiVane and MultiVaneXD in motion artifact evaluation. Furthermore, MultiVane was given a better score than MultiVaneXD in abdominal organ sharpness and vessel conspicuity, but the difference was insignificant. The reliability coefficient values were over 0.8 in every evaluation. MultiVaneXD (2.12) showed a higher value than did MultiVane (1.98), but the difference was insignificant (p = 0.135). MultiVaneXD is a motion correction method that is more advanced than MultiVane, and it produced an increased SNR, resulting in a greater ability to detect focal abdominal lesions.
A Standardized Rubric for Evaluating Webquest Design: Reliability Analysis of ZUNAL Webquest Design Rubric

ERIC Educational Resources Information Center

Unal, Zafer; Bodur, Yasar; Unal, Aslihan

2012-01-01

Current literature provides many examples of rubrics that are used to evaluate the quality of web-quest designs. However, reliability of these rubrics has not yet been researched. This is the first study to fully characterize and assess the reliability of a webquest evaluation rubric. The ZUNAL rubric was created to utilize the strengths of the…
Preliminary evaluation of adhesion strength measurement devices for ceramic/titanium matrix composite bonds

NASA Technical Reports Server (NTRS)

Pohlchuck, Bobby; Zeller, Mary V.

1992-01-01

The adhesive bond between ceramic cement and a titanium matrix composite substrate to be used in the National Aerospace Plane program is evaluated. Two commercially available adhesion testers, the Sebastian Adherence Tester and the CSEM REVETEST Scratch Tester, are evaluated to determine their suitability for quantitatively measuring adhesion strength. Various thicknesses of cements are applied to several substrates, and bond strengths are determined with both testers. The Sebastian Adherence Tester has provided limited data due to an interference from the sample mounting procedure, and has been shown to be incapable of distinguishing adhesion strength from tensile and shear properties of the cement itself. The data from the scratch tester have been found to be difficult to interpret due to the porosity and hardness of the cement. Recommendations are proposed for a more reliable adhesion test method.
Development of the health literacy on social determinants of health questionnaire in Japanese adults.

PubMed

Matsumoto, Masayoshi; Nakayama, Kazuhiro

2017-01-06

Health inequities are increasing worldwide, with mounting evidence showing that their greatest cause is the social determinants of health. To reduce inequities, many citizens need to be able to access, understand, appraise, and apply information on the social determinants; that is, they need to improve health literacy on social determinants of health. However, only a limited number of scales focus on these considerations; hence, we developed the Health Literacy on Social Determinants of Health Questionnaire (HL-SDHQ) and examined its psychometric properties. We extracted domains of the social determinants of health from "the solid facts" and related articles, operationalizing the following ten domains: "the social gradient," "early life," "social exclusion," "work," "unemployment," "social support," "social capital," "addiction," "food," and "transport." Next, we developed the scale items in the ten extracted domains based on the literature and included four aspects of health literacy (ability to access, understand, appraise, and apply social determinants of health-related information) in the items. We also evaluated the ease of response and content validity. The self-administered questionnaire consisted of 33 items. The reliability and construct validity were verified among 831 Japanese adults in an internet survey. The scale items had high reliability with a Cronbach's alpha of 0.92, and adequate results were also obtained for the internal consistency of the information-processing dimensions (Cronbach's alpha values were 0.82, 0.91, 0.84, and 0.92 for accessing, understanding, appraising, and applying, respectively). The goodness of fit by confirmatory factor analysis based on the four dimensions was acceptable (comparative fit index = 0.901; root mean square error of approximation = 0.058). Furthermore, the bivariate relationship between HL-SDHQ and the frequency of participation in citizen's activities was similar to the theoretical results.
HL-SDHQ clarifies the relationship between the ten domains of the social determinants of health and health in each domain and is able to measure whether it is possible to access, understand, appraise, and apply related information. The reliability and validity of the scale were adequate.
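The internal-consistency figures quoted above are Cronbach's alpha values. A minimal sketch of the standard formula, alpha = k/(k-1) * (1 - sum of item variances / variance of total scores), is given below; the response matrix is made up for illustration and has nothing to do with the HL-SDHQ data.

```python
from statistics import variance

def cronbach_alpha(scores):
    """Cronbach's alpha for a respondents-by-items score matrix.

    scores: list of rows, one per respondent, each a list of k item scores.
    """
    k = len(scores[0])
    if k < 2 or len(scores) < 2:
        raise ValueError("need at least two items and two respondents")
    # Variance of each item (column) across respondents.
    item_variances = [variance(column) for column in zip(*scores)]
    # Variance of each respondent's total score.
    total_variance = variance([sum(row) for row in scores])
    return k / (k - 1) * (1.0 - sum(item_variances) / total_variance)

if __name__ == "__main__":
    toy_scores = [[2, 3], [4, 4], [5, 7]]  # hypothetical 3 respondents x 2 items
    print(round(cronbach_alpha(toy_scores), 4))
```

The function fails with a division by zero when every respondent has the same total score, which is the degenerate case where alpha is undefined.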
The Application of a Residual Risk Evaluation Technique Used for Expendable Launch Vehicles

NASA Technical Reports Server (NTRS)

Latimer, John A.

2009-01-01

This presentation provides a Residual Risk Evaluation Technique (RRET) developed by Kennedy Space Center (KSC) Safety and Mission Assurance (S&MA) Launch Services Division. This technique is one of many procedures used by S&MA at KSC to evaluate residual risks for each Expendable Launch Vehicle (ELV) mission. RRET is a straightforward technique that incorporates the proven methodology of risk management, fault tree analysis, and reliability prediction. RRET derives a system reliability impact indicator from the system baseline reliability and the system residual risk reliability values. The system reliability impact indicator provides a quantitative measure of the reduction in the system baseline reliability due to the identified residual risks associated with the designated ELV mission. An example is discussed to provide insight into the application of RRET.
Reliability in perceptual analysis of voice quality.

PubMed

Bele, Irene Velsvik

2005-12-01

This study focuses on speaking voice quality in male teachers (n = 35) and male actors (n = 36), who represent untrained and trained voice users, because we wanted to investigate normal and supranormal voices. In this study, both substantial and methodologic aspects were considered. It includes a method for perceptual voice evaluation, and a basic issue was rater reliability. A listening group of 10 listeners, 7 experienced speech-language therapists, and 3 speech-language therapist students evaluated the voices by 15 vocal characteristics using VA scales. Two sets of voice signals were investigated: text reading (2 loudness levels) and sustained vowel (3 levels). The results indicated a high interrater reliability for most perceptual characteristics. Connected speech was evaluated more reliably, especially at the normal level, but both types of voice signals were evaluated reliably, although the reliability for connected speech was somewhat higher than for vowels. Experienced listeners tended to be more consistent in their ratings than did the student raters. Some vocal characteristics achieved acceptable reliability even with a smaller panel of listeners. The perceptual characteristics grouped in 4 factors reflected perceptual dimensions.
Wind farm optimization using evolutionary algorithms

NASA Astrophysics Data System (ADS)

Ituarte-Villarreal, Carlos M.

In recent years, the wind power industry has focused its efforts on solving the Wind Farm Layout Optimization (WFLO) problem. Wind resource assessment is a pivotal step in optimizing the wind-farm design and siting and in determining whether a project is economically feasible or not. In the present work, three (3) different optimization methods are proposed for the solution of the WFLO: (i) a modified Viral System Algorithm applied to the optimization of the proper location of the components in a wind-farm to maximize the energy output given a stated wind environment of the site. The optimization problem is formulated as the minimization of energy cost per unit produced and applies a penalization for the lack of system reliability. The viral system algorithm utilized in this research solves three (3) well-known problems in the wind-energy literature; (ii) a new multiple objective evolutionary algorithm to obtain optimal placement of wind turbines while considering the power output, cost, and reliability of the system. The algorithm presented is based on evolutionary computation and the objective functions considered are the maximization of power output, the minimization of wind farm cost and the maximization of system reliability. The final solution to this multiple objective problem is presented as a set of Pareto solutions; and (iii) a hybrid viral-based optimization algorithm adapted to find the proper component configuration for a wind farm with the introduction of the universal generating function (UGF) analytical approach to discretize the different operating or mechanical levels of the wind turbines in addition to the various wind speed states. The proposed methodology considers the specific probability functions of the wind resource to account for the stochastic behavior of the renewable energy components, aiming to increase their power output and the reliability of these systems.
The developed heuristic considers a variable number of system components and wind turbines with different operating characteristics and sizes, to have a more heterogeneous model that can deal with changes in the layout and in the power generation requirements over the time. Moreover, the approach evaluates the impact of the wind-wake effect of the wind turbines upon one another to describe and evaluate the power production capacity reduction of the system depending on the layout distribution of the wind turbines.
Low-thrust mission risk analysis, with application to a 1980 rendezvous with the comet Encke

NASA Technical Reports Server (NTRS)

Yen, C. L.; Smith, D. B.

1973-01-01

A computerized failure process simulation procedure is used to evaluate the risk in a solar electric space mission. The procedure uses currently available thrust-subsystem reliability data and performs approximate simulations of the thrust subsystem burn operation, the system failure processes, and the retargeting operations. The method is applied to assess the risks in carrying out a 1980 rendezvous mission to the comet Encke. Analysis of the results and evaluation of the effects of various risk factors on the mission show that system component failure rates are the limiting factors in attaining a high mission reliability. It is also shown that a well-designed trajectory and system operation mode can be used effectively to partially compensate for unreliable thruster performance.
Evaluation of constricted affect in chronic pain: an attempt using the Toronto Alexythymia Scale.

PubMed

Millard, R W; Kinsler, B L

1992-09-01

The Toronto Alexythymia Scale (TAS) was applied as a potential measure of constricted affect among a sample of patients with chronic, non-malignant pain (n = 195). As previously demonstrated with non-clinical samples, the scale was found to possess moderate reliability with two principal internal factors. These factors seemed to reflect social introversion and a lack of proneness to fantasy. There was a moderate, negative association between them. The domain sampled by the TAS was apparently heterogeneous, with total scores showing no relationship to reported disability or pain intensity and a low relationship to reported distress. These results suggest potential limitations of the TAS and the alexythymia construct as means for evaluating constricted affect that accompanies chronic pain.

Tactical STOL moment balance through innovative configuration technology

NASA Technical Reports Server (NTRS)

Eckard, G. J.; Sutton, R. C.; Poth, G. E.

1981-01-01

Innovative and conventional thrust vectoring moment balance mechanisms, as applied to advanced tactical fighters, are examined. The innovative mechanisms include thrust line translation, life line translation, and auxiliary power control; the conventional mechanisms under investigation are horizontal tails, canards, and variable sweep wings. These mechanisms are tested for their ability to provide negative static margins for landing approach or relocation of the vectored thrust line nearer the aircraft's center of gravity. The net pitching moment due to wing, flaps, and vectored thrust lift would then be small, making possible beneficial trim forces from small trimming devices. These innovative mechanisms are, however, possibly heavy and must be evaluated on their complexity, reliability, maintainability, and STOL capabilities. Several candidate fighter configurations are compared and evaluated.
Sensor fusion for laparoscopic surgery skill acquisition.

PubMed

Anderson, Fraser; Birch, Daniel W; Boulanger, Pierre; Bischof, Walter F

2012-01-01

Surgical techniques are becoming more complex and require substantial training to master. The development of automated, objective methods to analyze and evaluate surgical skill is necessary to provide trainees with reliable and accurate feedback during their training programs. We present a system to capture, visualize, and analyze the movements of a laparoscopic surgeon for the purposes of skill evaluation. The system records the upper body movement of the surgeon, the position, and orientation of the instruments, and the force and torque applied to the instruments. An empirical study was conducted using the system to record the performances of a number of surgeons with a wide range of skill. The study validated the usefulness of the system, and demonstrated the accuracy of the measurements.
Development and evaluation of form three mathematics i-Think module (Mi-T3) on algebraic formulae topic

NASA Astrophysics Data System (ADS)

Sam, Sazilah; Abdullah, Mohd Faizal Nizam Lee

2017-05-01

This article introduces the Form Three Mathematics i-Think Module (Mi-T3). The main objective of Mi-T3 is to help form three students develop their higher order thinking skills (HOTS). The Sidek Module Development Model (SMDM) and eight innovative thinking maps (i-Think) were applied as a guideline in developing Mi-T3. A validation stage was carried out by eight experts, and a content validation score of more than 90% was obtained. The module was piloted with a group of form three students and teachers to check its reliability through one-to-one and small-group evaluation, and a Cronbach's alpha of more than 0.90 was obtained. Implications of the study are discussed in this article.
Determining decision thresholds and evaluating indicators when conservation status is measured as a continuum.

PubMed

Connors, B M; Cooper, A B

2014-12-01

Categorization of the status of populations, species, and ecosystems underpins most conservation activities. Status is often based on how a system's current indicator value (e.g., change in abundance) relates to some threshold of conservation concern. Receiver operating characteristic (ROC) curves can be used to quantify the statistical reliability of indicators of conservation status and evaluate trade-offs between correct (true positive) and incorrect (false positive) classifications across a range of decision thresholds. However, ROC curves assume a discrete, binary relationship between an indicator and the conservation status it is meant to track, which is a simplification of the more realistic continuum of conservation status, and may limit the applicability of ROC curves in conservation science. We describe a modified ROC curve that treats conservation status as a continuum rather than a discrete state. We explored the influence of this continuum and typical sources of variation in abundance that can lead to classification errors (i.e., random variation and measurement error) on the true and false positive rates corresponding to varying decision thresholds and the reliability of change in abundance as an indicator of conservation status, respectively. We applied our modified ROC approach to an indicator of endangerment in Pacific salmon (Oncorhynchus nerka) (i.e., percent decline in geometric mean abundance) and an indicator of marine ecosystem structure and function (i.e., detritivore biomass). Failure to treat conservation status as a continuum when choosing thresholds for indicators resulted in the misidentification of trade-offs between true and false positive rates and the overestimation of an indicator's reliability. We argue for treating conservation status as a continuum when ROC curves are used to evaluate decision thresholds in indicators for the assessment of conservation status. © 2014 Society for Conservation Biology.
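For reference, the conventional binary ROC calculation that the abstract takes as its starting point can be sketched as follows. This is the standard discrete version, not the authors' modified continuum approach, and the indicator values and status labels are invented for illustration.

```python
def roc_points(indicator, at_risk, thresholds):
    """Return (threshold, false_positive_rate, true_positive_rate) tuples.

    A population is classified as "of concern" when indicator >= threshold.
    at_risk holds the assumed true binary status of each population.
    """
    positives = sum(1 for y in at_risk if y)
    negatives = len(at_risk) - positives
    if positives == 0 or negatives == 0:
        raise ValueError("need at least one at-risk and one not-at-risk case")
    points = []
    for t in thresholds:
        tp = sum(1 for x, y in zip(indicator, at_risk) if x >= t and y)
        fp = sum(1 for x, y in zip(indicator, at_risk) if x >= t and not y)
        points.append((t, fp / negatives, tp / positives))
    return points

if __name__ == "__main__":
    percent_decline = [10, 30, 50, 70]      # hypothetical indicator values
    truly_at_risk = [False, False, True, True]
    for t, fpr, tpr in roc_points(percent_decline, truly_at_risk, [20, 40, 60]):
        print(f"threshold={t}: FPR={fpr:.2f}, TPR={tpr:.2f}")
```

Sweeping the threshold traces the trade-off between true and false positives; the abstract's point is that forcing status into the binary labels used here can misstate that trade-off.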
Weighted Fuzzy Risk Priority Number Evaluation of Turbine and Compressor Blades Considering Failure Mode Correlations

NASA Astrophysics Data System (ADS)

Gan, Luping; Li, Yan-Feng; Zhu, Shun-Peng; Yang, Yuan-Jian; Huang, Hong-Zhong

2014-06-01

Failure mode, effects and criticality analysis (FMECA) and Fault tree analysis (FTA) are powerful tools to evaluate reliability of systems. Although single failure mode issue can be efficiently addressed by traditional FMECA, multiple failure modes and component correlations in complex systems cannot be effectively evaluated. In addition, correlated variables and parameters are often assumed to be precisely known in quantitative analysis. In fact, due to the lack of information, epistemic uncertainty commonly exists in engineering design. To solve these problems, the advantages of FMECA, FTA, fuzzy theory, and Copula theory are integrated into a unified hybrid method called fuzzy probability weighted geometric mean (FPWGM) risk priority number (RPN) method. The epistemic uncertainty of risk variables and parameters are characterized by fuzzy number to obtain fuzzy weighted geometric mean (FWGM) RPN for single failure mode. Multiple failure modes are connected using minimum cut sets (MCS), and Boolean logic is used to combine fuzzy risk priority number (FRPN) of each MCS. Moreover, Copula theory is applied to analyze the correlation of multiple failure modes in order to derive the failure probabilities of each MCS. Compared to the case where dependency among multiple failure modes is not considered, the Copula modeling approach eliminates the error of reliability analysis. Furthermore, for purpose of quantitative analysis, probabilities importance weight from failure probabilities are assigned to FWGM RPN to reassess the risk priority, which generalize the definition of probability weight and FRPN, resulting in a more accurate estimation than that of the traditional models. Finally, a basic fatigue analysis case drawn from turbine and compressor blades in aeroengine is used to demonstrate the effectiveness and robustness of the presented method. 
The result provides some important insights on fatigue reliability analysis and risk priority assessment of structural system under failure correlations.
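To make the weighted geometric mean idea concrete, the sketch below computes a crisp (non-fuzzy) weighted geometric mean RPN from severity, occurrence, and detection ratings. It is a deliberate simplification: the paper works with fuzzy numbers, minimum cut sets, and copulas, none of which are modeled here, and the ratings and weights shown are hypothetical.

```python
from math import prod

def weighted_geometric_rpn(ratings, weights):
    """Crisp weighted geometric mean of risk ratings.

    ratings: e.g. (severity, occurrence, detection), each on a 1-10 scale.
    weights: non-negative importance weights that sum to 1.
    """
    if len(ratings) != len(weights):
        raise ValueError("ratings and weights must have the same length")
    if abs(sum(weights) - 1.0) > 1e-9:
        raise ValueError("weights must sum to 1")
    return prod(r ** w for r, w in zip(ratings, weights))

if __name__ == "__main__":
    # Hypothetical failure mode: severity 8, occurrence 4, detection 6.
    print(round(weighted_geometric_rpn((8, 4, 6), (0.5, 0.3, 0.2)), 3))
```

Unlike the traditional product S x O x D, this score stays on the same 1-10 scale as the ratings, which makes differently weighted failure modes easier to compare.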
Robust Online Monitoring for Calibration Assessment of Transmitters and Instrumentation

DOE Office of Scientific and Technical Information (OSTI.GOV)

Ramuhalli, Pradeep; Coble, Jamie B.; Shumaker, Brent

Robust online monitoring (OLM) technologies are expected to enable the extension or elimination of periodic sensor calibration intervals in operating and new reactors. These advances in OLM technologies will improve the safety and reliability of current and planned nuclear power systems through improved accuracy and increased reliability of sensors used to monitor key parameters. In this article, we discuss an overview of research being performed within the Nuclear Energy Enabling Technologies (NEET)/Advanced Sensors and Instrumentation (ASI) program, for the development of OLM algorithms to use sensor outputs and, in combination with other available information, 1) determine whether one or more sensors are out of calibration or failing and 2) replace a failing sensor with reliable, accurate sensor outputs. Algorithm development is focused on the following OLM functions: signal validation, virtual sensing, and sensor response-time assessment. These algorithms incorporate, at their base, a Gaussian Process-based uncertainty quantification (UQ) method. Various plant models (using kernel regression, GP, or hierarchical models) may be used to predict sensor responses under various plant conditions. These predicted responses can then be applied in fault detection (sensor output and response time) and in computing the correct value (virtual sensing) of a failing physical sensor. The methods being evaluated in this work can compute confidence levels along with the predicted sensor responses, and as a result, may have the potential for compensating for sensor drift in real-time (online recalibration). Evaluation was conducted using data from multiple sources (laboratory flow loops and plant data).
Ongoing research in this project is focused on further evaluation of the algorithms, optimization for accuracy and computational efficiency, and integration into a suite of tools for robust OLM that are applicable to monitoring sensor calibration state in nuclear power plants.
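The abstract lists kernel regression among the plant models used to predict sensor responses. The sketch below shows one generic form, a Nadaraya-Watson estimator with a Gaussian kernel, predicting one sensor from a correlated one and flagging a reading whose residual exceeds a tolerance. It is an assumption-laden illustration, not the project's algorithm: the function names, bandwidth, tolerance, and data are all hypothetical, and no uncertainty quantification is included.

```python
import math

def kernel_predict(x_train, y_train, x_query, bandwidth):
    """Nadaraya-Watson estimate of y at x_query using a Gaussian kernel."""
    weights = [math.exp(-0.5 * ((x_query - x) / bandwidth) ** 2) for x in x_train]
    total = sum(weights)
    if total == 0.0:
        raise ValueError("query is too far from the training data for this bandwidth")
    return sum(w * y for w, y in zip(weights, y_train)) / total

def is_suspect(reading, predicted, tolerance):
    """Flag a sensor reading whose residual from the model exceeds tolerance."""
    return abs(reading - predicted) > tolerance

if __name__ == "__main__":
    # Hypothetical history: reference sensor (x) versus monitored sensor (y).
    x_hist = [1.0, 2.0, 3.0, 4.0, 5.0]
    y_hist = [2.1, 4.0, 6.1, 7.9, 10.0]
    expected = kernel_predict(x_hist, y_hist, 3.0, bandwidth=0.5)
    print(f"expected={expected:.2f}, suspect={is_suspect(7.5, expected, 0.5)}")
```

The predicted value can also stand in for a failed sensor (virtual sensing), which is the second use the abstract describes.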
Measuring ability to assess claims about treatment effects: a latent trait analysis of items from the 'Claim Evaluation Tools' database using Rasch modelling.

PubMed

Austvoll-Dahlgren, Astrid; Guttersrud, Øystein; Nsangi, Allen; Semakula, Daniel; Oxman, Andrew D

2017-05-25

The Claim Evaluation Tools database contains multiple-choice items for measuring people's ability to apply the key concepts they need to know to be able to assess treatment claims. We assessed items from the database using Rasch analysis to develop an outcome measure to be used in two randomised trials in Uganda. Rasch analysis is a form of psychometric testing relying on Item Response Theory. It is a dynamic way of developing outcome measures that are valid and reliable. Our objective was to assess the validity, reliability and responsiveness of 88 items addressing 22 key concepts using Rasch analysis. We administered four sets of multiple-choice items in English to 1114 people in Uganda and Norway, of which 685 were children and 429 were adults (including 171 health professionals). We scored all items dichotomously. We explored summary and individual fit statistics using the RUMM2030 analysis package. We used SPSS to perform distractor analysis. Most items conformed well to the Rasch model, but some items needed revision. Overall, the four item sets had satisfactory reliability. We did not identify significant response dependence between any pairs of items and, overall, the magnitude of multidimensionality in the data was acceptable. The items had a high level of difficulty. Most of the items conformed well to the Rasch model's expectations. Following revision of some items, we concluded that most of the items were suitable for use in an outcome measure for evaluating the ability of children or adults to assess treatment claims. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2017. All rights reserved. No commercial use is permitted unless otherwise expressly granted.
Measuring immigration stress of first-generation female Korean immigrants in California: psychometric evaluation of Demand of Immigration Scale.

PubMed

Ding, Ding; Hofstetter, C Richard; Norman, Gregory J; Irvin, Veronica L; Chhay, Douglas; Hovell, Melbourne F

2011-02-01

Immigration involves challenges and distress, which affect health and well-being of immigrants. Koreans are a recent, fast-growing, but understudied group of immigrants in the USA, and no study has established or evaluated any immigration stress measure among this population. This study explores psychometric properties of Korean-translated Demands of Immigration (DI) Scale among first-generation female Korean immigrants in California. Analyses included evaluation of factor structure, reliability, validity, and descriptive statistics of subscales. A surname-driven sampling strategy was applied to randomly select a representative sample of adult female Korean immigrants in California. Telephone interviews were conducted by trained bilingual interviewers. Study sample included 555 first-generation female Korean immigrants who were interviewed in Korean language. The 22-item DI Scale was used to assess immigration stress in the study sample. Exploratory factor analysis suggested six correlated factors in the DI Scale: language barriers; sense of loss; not feeling at home; perceived discrimination; novelty; and occupation. Confirmatory factor analysis validated the factor structure. Language barriers accounted for the most variance of the DI Scale (29.11%). The DI Scale demonstrated good internal consistency reliability and construct validity. Evidence has been offered that the Korean-translated DI Scale is a reliable and valid measurement tool to examine immigration stress among Korean immigrants. The Korean-translated DI Scale has replicated factor structure obtained in other ethnicities, but addition of cultural-specific items is suggested for Korean immigrants. High levels of language and occupation-related stress warrant attention from researchers, social workers, and policy-makers. Findings from this study will inform future interventions to alleviate stress due to demands of immigration.
Reliability studies of Integrated Modular Engine system designs

NASA Technical Reports Server (NTRS)

Hardy, Terry L.; Rapp, Douglas C.

1993-01-01

A study was performed to evaluate the reliability of Integrated Modular Engine (IME) concepts. Comparisons were made between networked IME systems and non-networked discrete systems using expander cycle configurations. Both redundant and non-redundant systems were analyzed. Binomial approximation and Markov analysis techniques were employed to evaluate total system reliability. In addition, Failure Modes and Effects Analyses (FMEA), Preliminary Hazard Analyses (PHA), and Fault Tree Analysis (FTA) were performed to allow detailed evaluation of the IME concept. A discussion of these system reliability concepts is also presented.
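One common way a binomial calculation enters redundant-system reliability is the k-out-of-n model: the system works if at least k of n identical, independent units work. The sketch below illustrates that textbook model only; it is not the report's analysis, and the unit reliability and engine counts are hypothetical.

```python
from math import comb

def k_out_of_n_reliability(n, k, unit_reliability):
    """Probability that at least k of n independent, identical units survive."""
    if not 0 <= k <= n:
        raise ValueError("require 0 <= k <= n")
    r = unit_reliability
    return sum(comb(n, i) * r ** i * (1.0 - r) ** (n - i) for i in range(k, n + 1))

if __name__ == "__main__":
    r = 0.99  # hypothetical per-engine mission reliability
    print("no redundancy (4 of 4):", round(k_out_of_n_reliability(4, 4, r), 6))
    print("one spare (4 of 5):    ", round(k_out_of_n_reliability(5, 4, r), 6))
```

Dependent failures, such as a fault that propagates through shared components in a networked configuration, violate the independence assumption and call for state-based methods like the Markov analysis the abstract mentions.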
A new criterion needed to evaluate reliability of digital protective relays

NASA Astrophysics Data System (ADS)

Gurevich, Vladimir

2012-11-01

There is a wide range of criteria and features for evaluating reliability in engineering; but as many as there are, only one of them has been chosen to evaluate reliability of Digital Protective Relays (DPR) in the technical documentation: Mean (operating) Time Between Failures (MTBF), which has gained universal currency and has been specified in technical manuals, information sheets, tender documentation as the key indicator of DPR reliability. But is the choice of this criterion indeed wise? The answer to this question is being sought by the author of this article.
Reliability analysis of laminated CMC components through shell subelement techniques

NASA Technical Reports Server (NTRS)

Starlinger, A.; Duffy, S. F.; Gyekenyesi, J. P.

1992-01-01

An updated version of the integrated design program C/CARES (composite ceramic analysis and reliability evaluation of structures) was developed for the reliability evaluation of CMC laminated shell components. The algorithm is now split in two modules: a finite-element data interface program and a reliability evaluation algorithm. More flexibility is achieved, allowing for easy implementation with various finite-element programs. The new interface program from the finite-element code MARC also includes the option of using hybrid laminates and allows for variations in temperature fields throughout the component.
Reliability studies of integrated modular engine system designs

NASA Astrophysics Data System (ADS)

Hardy, Terry L.; Rapp, Douglas C.

1993-06-01

A study was performed to evaluate the reliability of Integrated Modular Engine (IME) concepts. Comparisons were made between networked IME systems and non-networked discrete systems using expander cycle configurations. Both redundant and non-redundant systems were analyzed. Binomial approximation and Markov analysis techniques were employed to evaluate total system reliability. In addition, Failure Modes and Effects Analyses (FMEA), Preliminary Hazard Analyses (PHA), and Fault Tree Analysis (FTA) were performed to allow detailed evaluation of the IME concept. A discussion of these system reliability concepts is also presented.
Risk and Reliability of Infrastructure Asset Management Workshop

DTIC Science & Technology

2006-08-01

of assets within the portfolio for use in Risk and Reliability analysis ... US Army Corps of Engineers assesses its Civil Works infrastructure and applies risk and reliability in the management of that infrastructure. The ... the Corps must complete assessments across its portfolio of major assets before risk management can be used in decision making. Effective risk
Estimation of motion fields by non-linear registration for local lung motion analysis in 4D CT image data.

PubMed

Werner, René; Ehrhardt, Jan; Schmidt-Richberg, Alexander; Heiss, Anabell; Handels, Heinz

2010-11-01

Motivated by radiotherapy of lung cancer, non-linear registration is applied to estimate 3D motion fields for local lung motion analysis in thoracic 4D CT images. Reliability of analysis results depends on the registration accuracy. Therefore, our study consists of two parts: optimization and evaluation of a non-linear registration scheme for motion field estimation, followed by a registration-based analysis of lung motion patterns. The study is based on 4D CT data of 17 patients. Different distance measures and force terms for thoracic CT registration are implemented and compared: sum of squared differences versus a force term related to Thirion's demons registration; masked versus unmasked force computation. The most accurate approach is applied to local lung motion analysis. Masked Thirion forces outperform the other force terms. The mean target registration error is 1.3 ± 0.2 mm, which is on the order of the voxel size. Based on resulting motion fields and inter-patient normalization of inner lung coordinates and breathing depths, a non-linear dependency between inner lung position and corresponding strength of motion is identified. The dependency is observed for all patients without or with only small tumors. Quantitative evaluation of the estimated motion fields indicates high spatial registration accuracy. It allows for reliable registration-based local lung motion analysis. The large amount of information encoded in the motion fields makes it possible to draw detailed conclusions, e.g., to identify the dependency of inner lung localization and motion. Our examinations illustrate the potential of registration-based motion analysis.
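The target registration error quoted above is commonly computed as the mean Euclidean distance between landmark positions predicted by the registration and their reference positions. The sketch below assumes that definition; the function name and the landmark coordinates are hypothetical, and the abstract does not state how the authors selected landmarks.

```python
from math import dist
from statistics import mean


def target_registration_error(predicted_mm, reference_mm):
    """Return (mean error, per-landmark errors) in millimetres.

    Both arguments are sequences of (x, y, z) landmark positions in mm.
    """
    errors = [dist(p, r) for p, r in zip(predicted_mm, reference_mm)]
    return mean(errors), errors


# Hypothetical landmarks: one is displaced by 5 mm, the other matches exactly.
predicted = [(0.0, 0.0, 0.0), (1.0, 1.0, 1.0)]
reference = [(3.0, 4.0, 0.0), (1.0, 1.0, 1.0)]
tre, per_landmark = target_registration_error(predicted, reference)
print(tre, per_landmark)
```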
An illustrative overview of semi-quantitative MRI scoring of knee osteoarthritis: lessons learned from longitudinal observational studies.

PubMed

Roemer, F W; Hunter, D J; Crema, M D; Kwoh, C K; Ochoa-Albiztegui, E; Guermazi, A

2016-02-01

To introduce the most popular magnetic resonance imaging (MRI) osteoarthritis (OA) semi-quantitative (SQ) scoring systems to a broader audience with a focus on the most commonly applied scores, i.e., the MOAKS and WORMS system and illustrate similarities and differences. While the main structure and methodology of each scoring system are publicly available, the core of this overview will be an illustrative imaging atlas section including image examples from multiple OA studies applying MRI in regard to different features assessed, show specific examples of different grades and point out pitfalls and specifics of SQ assessment including artifacts, blinding to time point of acquisition and within-grade evaluation. Similarities and differences between different scoring systems are presented. Technical considerations are followed by a brief description of the most commonly utilized SQ scoring systems including their responsiveness and reliability. The second part is comprised of the atlas section presenting illustrative image examples. Evidence suggests that SQ assessment of OA by expert MRI readers is valid, reliable and responsive, which helps investigators to understand the natural history of this complex disease and to evaluate potential new drugs in OA clinical trials. Researchers have to be aware of the differences and specifics of the different systems to be able to engage in imaging assessment and interpretation of imaging-based data. SQ scoring has enabled us to explain associations of structural tissue damage with clinical manifestations of the disease and with morphological alterations thought to represent disease progression. Copyright © 2015 Osteoarthritis Research Society International. Published by Elsevier Ltd. All rights reserved.
The Tool for Understanding Residents' Needs as Individual Persons (TURNIP): construction and initial testing.

PubMed

Edvardsson, David; Fetherstonhaugh, Deirdre; Nay, Rhonda

2011-10-01

To construct and evaluate an intervention tool for increasing the person-centredness of care in residential aged care services. Providing care that is person-centred and evidence-based is increasingly being regarded as synonymous with best quality aged care. However, consensus about how person-centred care should be defined, operationalised and implemented has not yet been reached. Literature reviews, expert consultation (n = 22) and stakeholder interviews (n = 67) were undertaken to develop the Tool for Understanding Residents' Needs as Individual Persons (TURNIP). Statistical estimates of validity and reliability were employed to evaluate the tool in an Australian convenience sample of aged care staff (n = 220). The 39-item TURNIP conceptualised person-centred care into five dimensions: (1) the care environment, (2) staff members' attitudes towards dementia, (3) staff members' knowledge about dementia, (4) the care organisation and (5) the content of care provided. Psychometric testing indicated satisfactory validity and reliability, as shown for example in a total Cronbach's alpha of 0.89. The TURNIP adds to current literature on person-centred care by presenting a rigorously developed intervention tool based on an explicit conceptual structure that can inform the design, employment and communication of clinical interventions aiming to promote person-centred care. The TURNIP contains clinically relevant items that are ready to be applied in clinical aged care. The tool can be used as a base for clinical interventions applying discussions in aged care organisations about the quality of current care and how to increase person-centredness of the care provided. © 2011 Blackwell Publishing Ltd.
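Cronbach's alpha, the internal-consistency statistic reported above, is conventionally computed as k/(k-1) times one minus the ratio of summed item variances to the variance of the total score. The following is a minimal sketch of that standard formula; the function name and the toy responses are hypothetical and unrelated to the TURNIP data.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondents, each a list of item scores."""
    k = len(scores[0])                      # number of items
    items = list(zip(*scores))              # one column of scores per item
    item_var_sum = sum(variance(col) for col in items)
    total_var = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - item_var_sum / total_var)


# Hypothetical responses from three respondents to two items.
responses = [[1, 2], [2, 1], [3, 3]]
print(round(cronbach_alpha(responses), 4))
```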
Telemedicine in emergency evaluation of acute stroke: interrater agreement in remote video examination with a novel multimedia system.

PubMed

Handschu, René; Littmann, Rebekka; Reulbach, Udo; Gaul, Charly; Heckmann, Josef G; Neundörfer, Bernhard; Scibor, Mateusz

2003-12-01

In acute stroke care, rapid but careful evaluation of patients is mandatory but requires an experienced stroke neurologist. Telemedicine offers the possibility of bringing such expertise quickly to more patients. This study tested for the first time whether remote video examination is feasible and reliable when applied in emergency stroke care using the National Institutes of Health Stroke Scale (NIHSS). We used a novel multimedia telesupport system for transfer of real-time video sequences and audio data. The remote examiner could direct the set-top camera and zoom from distant overviews to close-ups from the personal computer in his office. Acute stroke patients admitted to our stroke unit were examined on admission in the emergency room. Standardized examination was performed by use of the NIHSS (German version) via telemedicine and compared with bedside application. In this pilot study, 41 patients were examined. Total examination time was 11.4 minutes on average (range, 8 to 18 minutes). None of the examinations had to be stopped or interrupted for technical reasons, although minor problems (brightness, audio quality) with influence on the examination process occurred in 2 sessions. Unweighted kappa coefficients ranged from 0.44 to 0.89; weighted kappa coefficients, from 0.85 to 0.99. Remote examination of acute stroke patients with a computer-based telesupport system is feasible and reliable when applied in the emergency room; interrater agreement was good to excellent in all items. For more widespread use, some problems that emerge from details like brightness, optimal camera position, and audio quality should be solved.
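The unweighted and weighted kappa coefficients reported above can be illustrated with Cohen's kappa for two raters. This sketch assumes linear disagreement weights for the weighted variant, since the abstract does not state which weighting scheme was used; the function name and the ratings are hypothetical.

```python
from collections import Counter


def cohen_kappa(rater_a, rater_b, categories, weighted=False):
    """Cohen's kappa for two raters over ordered categories.

    With weighted=True, disagreements are penalised linearly by their
    distance on the ordinal scale (an assumed weighting scheme).
    """
    n = len(rater_a)
    k = len(categories)
    index = {c: i for i, c in enumerate(categories)}

    def penalty(i, j):
        if weighted:
            return abs(i - j) / (k - 1)
        return 0.0 if i == j else 1.0

    count_a, count_b = Counter(rater_a), Counter(rater_b)
    observed = sum(penalty(index[x], index[y]) for x, y in zip(rater_a, rater_b)) / n
    expected = sum(
        penalty(index[x], index[y]) * count_a[x] * count_b[y]
        for x in categories
        for y in categories
    ) / n**2
    return 1 - observed / expected


# Hypothetical item scores (0 to 2) from a bedside and a remote examiner.
bedside = [0, 1, 2]
remote = [0, 1, 1]
print(cohen_kappa(bedside, remote, [0, 1, 2]))
print(cohen_kappa(bedside, remote, [0, 1, 2], weighted=True))
```

Because the weighted form gives partial credit for near misses on an ordinal scale, it typically comes out higher than the unweighted form, which matches the pattern in the ranges reported in the abstract.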
An illustrative overview of semi-quantitative MRI scoring of knee osteoarthritis: Lessons learned from longitudinal observational studies

PubMed Central

Roemer, Frank W.; Hunter, David J.; Crema, Michel D.; Kwoh, C. Kent; Ochoa-Albiztegui, Elena; Guermazi, Ali

2015-01-01

Objective: To introduce the most popular magnetic resonance imaging (MRI) osteoarthritis (OA) semi-quantitative (SQ) scoring systems to a broader audience with a focus on the most commonly applied scores, i.e. the MOAKS and WORMS system and illustrate similarities and differences. Design: While the main structure and methodology of each scoring system are publicly available, the core of this overview will be an illustrative imaging atlas section including image examples from multiple osteoarthritis studies applying MRI in regard to different features assessed, show specific examples of different grades and point out pitfalls and specifics of SQ assessment including artifacts, blinding to time point of acquisition and within-grade evaluation. Results: Similarities and differences between different scoring systems are presented. Technical considerations are followed by a brief description of the most commonly utilized SQ scoring systems including their responsiveness and reliability. The second part is comprised of the atlas section presenting illustrative image examples. Conclusions: Evidence suggests that SQ assessment of OA by expert MRI readers is valid, reliable and responsive, which helps investigators to understand the natural history of this complex disease and to evaluate potential new drugs in OA clinical trials. Researchers have to be aware of the differences and specifics of the different systems to be able to engage in imaging assessment and interpretation of imaging-based data. SQ scoring has enabled us to explain associations of structural tissue damage with clinical manifestations of the disease and with morphological alterations thought to represent disease progression. PMID: 26318656
